Query lcl|NC_015263.1_cdsid_YP_004306306.1 [gene=LaPh949_gp146] [protein=putative phage structural protein] [protein_id=YP_004306306.1] [location=complement(105517..107058)] Match_columns 513 No_of_seqs 25 out of 28 Neff 5.1 Searched_HMMs 1612 Date Thu Nov 7 14:31:11 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_146 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_146_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:572 Length: 506 # 100.0 1.6E-79 9.8E-83 452.6 24.1 469 1-506 1-506 (506) 2 protein:vir:105154 Length: 525 100.0 4.4E-77 2.8E-80 439.2 27.6 478 1-501 1-525 (525) 3 protein:vir:1266 Length: 416 # 99.2 3E-10 1.9E-13 72.8 26.6 392 39-493 1-416 (416) 4 protein:vir:483 Length: 413 # 99.2 5.1E-10 3.2E-13 71.5 27.6 393 39-500 1-413 (413) 5 protein:vir:3153 Length: 467 # 99.1 1.2E-09 7.7E-13 69.4 27.6 407 70-512 1-467 (467) 6 protein:vir:102080 Length: 429 99.1 1.9E-09 1.2E-12 68.4 26.8 405 17-503 1-429 (429) 7 protein:vir:93610 Length: 454 99.0 4.7E-09 2.9E-12 66.2 33.2 423 11-513 1-444 (454) 8 protein:vir:63755 Length: 547 99.0 8E-09 5E-12 65.0 36.4 450 1-513 1-546 (547) 9 protein:vir:80644 Length: 551 99.0 8.8E-09 5.5E-12 64.7 35.2 444 1-513 23-550 (551) 10 protein:vir:100150 Length: 437 99.0 8E-09 5E-12 65.0 26.8 416 1-510 1-437 (437) 11 protein:vir:102727 Length: 945 98.9 1.3E-08 8E-12 63.9 33.5 444 1-513 52-537 (945) 12 protein:vir:105064 Length: 421 98.9 1.9E-08 1.2E-11 63.0 27.3 389 39-502 1-421 (421) 13 protein:vir:1431 Length: 419 # 98.9 2.3E-08 1.4E-11 62.4 26.2 387 44-513 1-416 (419) 14 protein:vir:4337 Length: 434 # 98.9 2.2E-08 1.3E-11 62.6 25.5 413 1-493 1-434 (434) 15 protein:vir:79772 Length: 648 98.8 3.4E-08 2.1E-11 61.5 31.3 443 1-513 17-516 (648) 16 protein:vir:10321 Length: 495 98.8 4.3E-08 2.6E-11 61.0 28.6 440 9-501 1-495 (495) 17 protein:vir:3843 Length: 397 # 98.8 4.7E-08 2.9E-11 60.8 26.3 381 23-506 1-397 (397) 18 protein:vir:80333 Length: 419 98.8 4.5E-08 2.8E-11 60.9 24.8 391 1-499 1-419 (419) 19 protein:vir:3868 Length: 417 # 98.8 5E-08 3.1E-11 60.6 27.1 388 42-512 1-417 (417) 20 protein:vir:5737 Length: 419 # 98.8 5E-08 3.1E-11 60.6 27.7 395 39-511 1-419 (419) 21 protein:vir:4454 Length: 414 # 98.8 6E-08 3.7E-11 60.2 30.1 391 28-500 1-414 (414) 22 protein:vir:7853 Length: 518 # 98.7 8E-08 5E-11 59.5 31.1 422 39-513 1-471 (518) 23 protein:vir:95599 Length: 563 98.7 8.2E-08 5.1E-11 59.4 35.5 451 1-513 13-545 (563) 24 protein:vir:99312 Length: 563 98.7 8.2E-08 5.1E-11 59.4 35.5 451 1-513 13-545 (563) 25 protein:vir:107605 Length: 432 98.7 1E-07 6.5E-11 58.9 29.3 400 28-503 1-432 (432) 26 protein:vir:102855 Length: 432 98.7 1E-07 6.5E-11 58.9 29.3 400 28-503 1-432 (432) 27 protein:vir:105002 Length: 432 98.7 1E-07 6.5E-11 58.9 29.3 400 28-503 1-432 (432) 28 protein:vir:79538 Length: 502 98.6 2.3E-07 1.4E-10 57.0 33.4 444 9-504 1-502 (502) 29 protein:vir:4828 Length: 382 # 98.6 2.8E-07 1.8E-10 56.5 23.3 364 28-503 1-382 (382) 30 protein:vir:389 Length: 530 # 98.5 3.4E-07 2.1E-10 56.0 30.3 445 4-501 1-530 (530) 31 protein:vir:98444 Length: 434 98.5 1.8E-07 1.1E-10 57.6 21.2 394 61-500 1-434 (434) 32 protein:vir:81095 Length: 416 98.5 3.8E-07 2.4E-10 55.8 26.8 388 1-503 3-416 (416) 33 protein:vir:4598 Length: 416 # 98.5 3.8E-07 2.4E-10 55.8 26.8 388 1-503 3-416 (416) 34 protein:vir:101648 Length: 518 98.5 5E-07 3.1E-10 55.1 30.5 423 29-513 1-471 (518) 35 protein:vir:8418 Length: 409 # 98.5 3E-07 1.9E-10 56.4 20.7 384 23-497 1-409 (409) 36 protein:vir:102118 Length: 409 98.4 6.6E-07 4.1E-10 54.5 26.3 383 1-497 1-409 (409) 37 protein:vir:94426 Length: 409 98.4 7.8E-07 4.9E-10 54.1 27.1 386 1-493 1-409 (409) 38 protein:vir:93943 Length: 409 98.4 9.1E-07 5.6E-10 53.7 27.2 386 1-493 1-409 (409) 39 protein:vir:1326 Length: 457 # 98.3 1.1E-06 7.1E-10 53.2 30.8 421 28-512 1-457 (457) 40 protein:vir:4156 Length: 542 # 98.3 1.4E-06 8.6E-10 52.7 26.4 433 11-513 1-482 (542) 41 protein:vir:81152 Length: 411 98.3 1.5E-06 9.2E-10 52.5 26.1 387 17-502 1-411 (411) 42 protein:vir:4194 Length: 540 # 98.3 1.5E-06 9.2E-10 52.5 30.4 425 11-513 1-480 (540) 43 protein:vir:96980 Length: 409 98.2 2.6E-06 1.6E-09 51.2 29.1 384 1-493 1-409 (409) 44 protein:vir:9702 Length: 406 # 98.2 2.6E-06 1.6E-09 51.2 25.6 384 1-509 1-406 (406) 45 protein:vir:107742 Length: 537 98.2 2.8E-06 1.7E-09 51.0 27.2 467 1-513 17-536 (537) 46 protein:vir:96738 Length: 505 98.2 3.3E-06 2.1E-09 50.6 33.1 447 1-492 1-505 (505) 47 protein:vir:100249 Length: 431 98.2 3.4E-06 2.1E-09 50.6 27.0 389 28-493 1-431 (431) 48 protein:vir:94049 Length: 532 98.1 9.4E-07 5.8E-10 53.6 16.9 458 1-509 22-532 (532) 49 protein:vir:104259 Length: 403 98.1 4E-06 2.5E-09 50.2 29.1 380 14-496 1-403 (403) 50 protein:vir:10362 Length: 432 98.1 4.6E-06 2.9E-09 49.8 29.6 399 36-511 1-432 (432) 51 protein:vir:99563 Length: 862 98.1 5.5E-06 3.4E-09 49.4 25.9 475 1-513 47-597 (862) 52 protein:vir:98396 Length: 441 98.1 5.5E-06 3.4E-09 49.4 27.1 406 1-503 13-441 (441) 53 protein:vir:81072 Length: 432 98.0 6.5E-06 4E-09 49.0 29.4 414 11-511 1-432 (432) 54 protein:vir:2683 Length: 412 # 98.0 6.6E-06 4.1E-09 49.0 30.7 391 9-508 1-412 (412) 55 protein:vir:95378 Length: 406 98.0 6.6E-06 4.1E-09 49.0 24.9 379 1-507 7-406 (406) 56 protein:vir:95542 Length: 548 98.0 6.7E-06 4.2E-09 48.9 32.9 461 9-513 1-547 (548) 57 protein:vir:8100 Length: 466 # 98.0 6.9E-06 4.3E-09 48.9 25.7 423 9-504 1-466 (466) 58 protein:vir:1380 Length: 422 # 98.0 7.2E-06 4.5E-09 48.8 28.1 386 28-488 1-422 (422) 59 protein:vir:3420 Length: 533 # 98.0 7.6E-06 4.7E-09 48.7 31.4 446 2-508 1-533 (533) 60 protein:vir:79984 Length: 441 98.0 8.8E-06 5.4E-09 48.3 27.6 401 1-503 12-441 (441) 61 protein:vir:9408 Length: 441 # 98.0 8.8E-06 5.4E-09 48.3 27.6 401 1-503 12-441 (441) 62 protein:vir:3989 Length: 392 # 98.0 8.9E-06 5.5E-09 48.3 26.0 371 26-491 1-392 (392) 63 protein:vir:1023 Length: 392 # 98.0 8.9E-06 5.5E-09 48.3 26.0 371 26-491 1-392 (392) 64 protein:vir:6240 Length: 457 # 98.0 9.1E-06 5.6E-09 48.2 30.3 423 23-508 1-457 (457) 65 protein:vir:100691 Length: 535 97.9 1.2E-05 7.5E-09 47.5 34.7 448 1-513 3-526 (535) 66 protein:vir:189 Length: 424 # 97.9 1.4E-05 8.7E-09 47.2 30.4 392 1-501 1-424 (424) 67 protein:vir:99916 Length: 504 97.8 1.6E-05 9.6E-09 46.9 29.1 430 36-493 1-504 (504) 68 protein:vir:97060 Length: 432 97.8 1.7E-05 1E-08 46.8 29.0 412 11-511 1-432 (432) 69 protein:vir:9359 Length: 348 # 97.8 1.7E-05 1.1E-08 46.7 29.5 327 91-493 1-348 (348) 70 protein:vir:96579 Length: 576 97.7 2.5E-05 1.5E-08 45.8 35.8 452 1-513 1-543 (576) 71 protein:vir:99452 Length: 651 97.7 2.9E-05 1.8E-08 45.4 20.2 452 1-513 1-556 (651) 72 protein:vir:1884 Length: 424 # 97.6 3.4E-05 2.1E-08 45.1 29.9 392 1-494 1-424 (424) 73 protein:vir:80796 Length: 574 97.6 4E-05 2.5E-08 44.7 35.4 455 1-513 1-544 (574) 74 protein:vir:100882 Length: 383 97.6 4.1E-05 2.5E-08 44.6 25.4 363 9-503 1-383 (383) 75 protein:vir:7987 Length: 456 # 97.6 4.4E-05 2.8E-08 44.4 27.1 396 36-488 1-456 (456) 76 protein:vir:81218 Length: 423 97.5 5E-05 3.1E-08 44.2 28.0 390 23-493 1-423 (423) 77 protein:vir:6382 Length: 553 # 97.5 5.4E-05 3.3E-08 44.0 27.3 458 1-501 1-553 (553) 78 protein:vir:7407 Length: 392 # 97.5 5.5E-05 3.4E-08 44.0 27.9 376 23-491 1-392 (392) 79 protein:vir:960 Length: 413 # 97.5 6.3E-05 3.9E-08 43.6 30.4 385 11-493 1-413 (413) 80 protein:vir:101647 Length: 460 97.4 8.2E-05 5.1E-08 43.0 30.9 401 23-513 1-460 (460) 81 protein:vir:98643 Length: 395 97.2 0.00012 7.6E-08 42.0 24.2 376 23-496 1-395 (395) 82 protein:vir:80134 Length: 403 97.2 0.00014 8.8E-08 41.7 24.7 378 9-498 1-403 (403) 83 protein:vir:9641 Length: 395 # 97.2 0.00014 9E-08 41.6 25.7 370 9-496 1-395 (395) 84 protein:vir:4995 Length: 384 # 97.2 0.00015 9.2E-08 41.6 21.1 365 28-489 1-384 (384) 85 protein:vir:4509 Length: 424 # 97.1 0.00015 9.6E-08 41.5 30.2 383 1-492 17-424 (424) 86 protein:vir:5249 Length: 437 # 97.1 0.00016 9.7E-08 41.4 28.5 409 14-510 1-437 (437) 87 protein:vir:78641 Length: 278 97.1 0.00019 1.2E-07 41.0 23.8 259 91-406 1-278 (278) 88 protein:vir:4854 Length: 386 # 97.0 0.00024 1.5E-07 40.4 26.2 368 1-493 3-386 (386) 89 protein:vir:101289 Length: 395 96.9 0.00024 1.5E-07 40.4 26.3 374 28-504 1-395 (395) 90 protein:vir:100650 Length: 395 96.9 0.00024 1.5E-07 40.4 26.3 374 28-504 1-395 (395) 91 protein:vir:9507 Length: 395 # 96.9 0.00024 1.5E-07 40.4 26.3 374 28-504 1-395 (395) 92 protein:vir:80680 Length: 441 96.8 0.00032 2E-07 39.7 25.0 383 12-496 1-441 (441) 93 protein:vir:78537 Length: 480 96.8 0.00034 2.1E-07 39.6 31.6 412 47-508 1-480 (480) 94 protein:vir:100187 Length: 385 96.8 0.00035 2.1E-07 39.6 26.5 363 9-504 1-385 (385) 95 protein:vir:4952 Length: 386 # 96.7 0.00037 2.3E-07 39.4 28.2 370 1-511 3-386 (386) 96 protein:vir:80040 Length: 461 96.7 0.00038 2.4E-07 39.3 27.8 432 1-507 1-461 (461) 97 protein:vir:107112 Length: 478 96.7 0.0004 2.5E-07 39.2 25.2 418 11-490 1-478 (478) 98 protein:vir:96179 Length: 468 96.7 0.0004 2.5E-07 39.2 25.2 411 11-490 1-468 (468) 99 protein:vir:79647 Length: 435 96.6 0.00044 2.7E-07 39.0 24.1 408 1-503 5-435 (435) 100 protein:vir:94666 Length: 723 96.6 0.00047 2.9E-07 38.8 29.8 396 36-513 1-446 (723) 101 protein:vir:96240 Length: 511 96.6 0.00049 3.1E-07 38.7 26.8 442 11-501 1-511 (511) 102 protein:vir:105819 Length: 456 96.5 0.00057 3.6E-07 38.4 27.5 404 36-488 1-456 (456) 103 protein:vir:102602 Length: 456 96.5 0.00057 3.6E-07 38.4 27.5 404 36-488 1-456 (456) 104 protein:vir:96366 Length: 511 96.2 0.0009 5.6E-07 37.3 25.8 446 11-501 1-511 (511) 105 protein:vir:78805 Length: 511 96.2 0.0009 5.6E-07 37.3 25.8 446 11-501 1-511 (511) 106 protein:vir:104338 Length: 422 96.2 0.00091 5.6E-07 37.3 25.8 388 11-498 1-422 (422) 107 protein:vir:94002 Length: 378 96.1 0.00099 6.2E-07 37.1 21.6 347 42-505 1-378 (378) 108 protein:vir:96068 Length: 765 96.1 0.0011 6.5E-07 36.9 28.0 464 1-513 50-573 (765) 109 protein:vir:96839 Length: 474 95.8 0.0014 8.5E-07 36.3 24.7 410 11-500 1-474 (474) 110 protein:vir:4223 Length: 486 # 95.6 0.0017 1.1E-06 35.8 27.1 416 40-510 1-486 (486) 111 protein:vir:2427 Length: 485 # 95.4 0.0021 1.3E-06 35.3 29.0 414 40-509 1-485 (485) 112 protein:vir:7768 Length: 484 # 95.3 0.0023 1.4E-06 35.1 25.7 416 40-512 1-484 (484) 113 protein:vir:95113 Length: 474 95.3 0.0023 1.5E-06 35.0 25.7 418 2-500 1-474 (474) 114 protein:vir:105292 Length: 478 95.2 0.0024 1.5E-06 34.9 25.1 409 11-498 1-478 (478) 115 protein:vir:107662 Length: 427 95.2 0.0025 1.5E-06 34.9 24.0 401 12-502 1-427 (427) 116 protein:vir:4089 Length: 395 # 95.1 0.0027 1.7E-06 34.6 25.0 373 23-508 1-395 (395) 117 protein:vir:99072 Length: 479 95.1 0.0029 1.8E-06 34.5 26.8 444 1-513 1-479 (479) 118 protein:vir:78310 Length: 376 94.9 0.0033 2E-06 34.2 25.6 355 28-487 1-376 (376) 119 protein:vir:97447 Length: 474 94.9 0.0033 2E-06 34.2 26.1 426 11-500 1-474 (474) 120 protein:vir:94498 Length: 474 94.9 0.0033 2E-06 34.2 26.1 426 11-500 1-474 (474) 121 protein:vir:6210 Length: 394 # 94.8 0.0034 2.1E-06 34.1 26.9 377 17-504 1-394 (394) 122 protein:vir:78083 Length: 537 94.8 0.0034 2.1E-06 34.1 32.9 440 36-513 1-535 (537) 123 protein:vir:267 Length: 348 # 94.8 0.0035 2.2E-06 34.0 25.5 313 1-418 1-348 (348) 124 protein:vir:5961 Length: 503 # 94.5 0.0042 2.6E-06 33.6 30.5 422 11-506 1-503 (503) 125 protein:vir:2500 Length: 501 # 94.0 0.0056 3.5E-06 32.9 31.1 436 5-508 1-501 (501) 126 protein:vir:9306 Length: 511 # 93.9 0.006 3.7E-06 32.8 26.5 445 11-501 1-511 (511) 127 protein:vir:99781 Length: 511 93.9 0.006 3.7E-06 32.8 28.8 447 11-501 1-511 (511) 128 protein:vir:1082 Length: 359 # 93.7 0.0066 4.1E-06 32.5 23.4 331 1-460 2-359 (359) 129 protein:vir:104082 Length: 485 93.6 0.007 4.3E-06 32.4 28.1 429 5-501 1-485 (485) 130 protein:vir:78227 Length: 480 93.4 0.0077 4.8E-06 32.2 31.8 413 47-508 1-480 (480) 131 protein:vir:93747 Length: 472 93.4 0.0078 4.8E-06 32.1 29.6 417 19-492 1-472 (472) 132 protein:vir:103951 Length: 511 93.3 0.0079 4.9E-06 32.1 27.7 441 11-501 1-511 (511) 133 protein:vir:99522 Length: 470 93.2 0.0084 5.2E-06 32.0 27.6 414 31-494 1-470 (470) 134 protein:vir:93867 Length: 378 93.2 0.0085 5.3E-06 31.9 21.5 349 42-505 1-378 (378) 135 protein:vir:97171 Length: 512 91.7 0.015 9.2E-06 30.6 22.1 432 1-501 1-512 (512) 136 protein:vir:95965 Length: 385 91.6 0.015 9.2E-06 30.6 25.1 361 23-492 1-385 (385) 137 protein:vir:1661 Length: 378 # 91.4 0.016 9.9E-06 30.4 24.3 348 42-505 1-378 (378) 138 protein:vir:100328 Length: 346 91.3 0.016 1E-05 30.4 24.4 310 34-411 1-346 (346) 139 protein:vir:94869 Length: 378 91.3 0.017 1E-05 30.4 24.8 350 42-505 1-378 (378) 140 protein:vir:1150 Length: 350 # 91.2 0.017 1.1E-05 30.3 22.4 318 1-406 1-350 (350) 141 protein:vir:2013 Length: 344 # 89.4 0.027 1.7E-05 29.2 22.7 316 1-411 1-344 (344) 142 protein:vir:78749 Length: 337 88.8 0.03 1.9E-05 28.9 24.2 305 1-409 1-337 (337) 143 protein:vir:1236 Length: 483 # 88.7 0.031 1.9E-05 28.9 28.3 418 23-500 1-483 (483) 144 protein:vir:4898 Length: 502 # 88.5 0.032 2E-05 28.8 29.7 443 1-498 1-502 (502) 145 protein:vir:3780 Length: 345 # 88.3 0.033 2.1E-05 28.7 22.9 316 1-408 1-345 (345) 146 protein:vir:79043 Length: 479 87.4 0.039 2.4E-05 28.3 26.5 406 23-493 1-479 (479) 147 protein:vir:38 Length: 496 # N 86.2 0.047 2.9E-05 27.9 27.6 417 11-494 1-496 (496) 148 protein:vir:96494 Length: 501 85.6 0.052 3.2E-05 27.6 27.8 445 1-505 1-501 (501) 149 protein:vir:79207 Length: 351 85.5 0.052 3.3E-05 27.6 21.9 318 1-421 1-351 (351) 150 protein:vir:98567 Length: 340 85.0 0.056 3.5E-05 27.4 22.3 308 1-413 1-340 (340) 151 protein:vir:106571 Length: 499 84.4 0.061 3.8E-05 27.3 30.4 421 26-512 1-499 (499) 152 protein:vir:3743 Length: 345 # 84.2 0.062 3.8E-05 27.2 24.4 316 1-408 1-345 (345) 153 protein:vir:106639 Length: 481 83.6 0.068 4.2E-05 27.0 32.4 419 9-502 1-481 (481) 154 protein:vir:78191 Length: 351 83.0 0.072 4.5E-05 26.8 22.5 318 1-421 1-351 (351) 155 protein:vir:95899 Length: 474 82.3 0.078 4.9E-05 26.6 25.6 416 11-497 1-474 (474) 156 protein:vir:96266 Length: 474 82.3 0.078 4.9E-05 26.6 25.6 416 11-497 1-474 (474) 157 protein:vir:95806 Length: 440 81.5 0.085 5.3E-05 26.5 28.3 389 57-492 1-440 (440) 158 protein:vir:105889 Length: 474 81.4 0.086 5.4E-05 26.4 32.2 398 24-494 1-474 (474) 159 protein:vir:94101 Length: 474 81.4 0.086 5.4E-05 26.4 32.2 398 24-494 1-474 (474) 160 protein:vir:94805 Length: 492 80.8 0.092 5.7E-05 26.3 28.8 420 11-500 1-492 (492) 161 protein:vir:2341 Length: 488 # 80.3 0.096 6E-05 26.2 27.1 435 11-512 1-488 (488) 162 protein:vir:6058 Length: 344 # 78.7 0.11 6.9E-05 25.8 24.0 314 1-409 1-344 (344) 163 protein:vir:858 Length: 378 # 78.7 0.11 6.9E-05 25.8 22.9 349 42-505 1-378 (378) 164 protein:vir:9871 Length: 429 # 78.6 0.11 7E-05 25.8 28.0 386 49-495 1-429 (429) 165 protein:vir:105461 Length: 470 76.9 0.13 8.1E-05 25.4 24.0 376 49-488 1-470 (470) 166 protein:vir:108215 Length: 469 76.7 0.13 8.2E-05 25.4 27.9 413 1-513 4-463 (469) 167 protein:vir:5691 Length: 344 # 75.4 0.15 9.1E-05 25.1 23.3 312 1-409 1-344 (344) 168 protein:vir:102330 Length: 451 73.0 0.18 0.00011 24.7 23.4 375 49-485 1-451 (451) 169 protein:vir:97336 Length: 492 72.8 0.18 0.00011 24.7 30.1 424 11-500 1-492 (492) 170 protein:vir:94742 Length: 409 72.4 0.18 0.00011 24.6 22.8 343 65-442 1-409 (409) 171 protein:vir:3964 Length: 453 # 72.3 0.18 0.00011 24.6 28.0 402 34-496 1-453 (453) 172 protein:vir:1587 Length: 508 # 70.7 0.21 0.00013 24.3 25.5 421 9-487 1-508 (508) 173 protein:vir:733 Length: 453 # 67.4 0.25 0.00016 23.9 26.4 407 1-489 1-453 (453) 174 protein:vir:103971 Length: 376 61.7 0.35 0.00022 23.1 23.3 319 1-421 26-376 (376) 175 protein:vir:8317 Length: 409 # 60.5 0.37 0.00023 22.9 20.6 361 28-488 1-409 (409) 176 protein:vir:79150 Length: 368 58.5 0.41 0.00026 22.7 22.6 337 1-430 1-368 (368) 177 protein:vir:98853 Length: 219 58.0 0.42 0.00026 22.6 14.8 199 171-410 1-219 (219) 178 protein:vir:107880 Length: 491 58.0 0.42 0.00026 22.6 33.2 409 1-513 1-433 (491) 179 protein:vir:94546 Length: 506 57.0 0.44 0.00027 22.5 24.7 425 12-495 1-506 (506) 180 protein:vir:103219 Length: 201 54.3 0.51 0.00031 22.2 13.1 193 276-492 1-201 (201) 181 protein:vir:2732 Length: 501 # 49.3 0.64 0.0004 21.6 30.6 420 1-503 28-501 (501) 182 protein:vir:102950 Length: 471 44.6 0.8 0.0005 21.1 22.6 387 12-494 1-471 (471) 183 protein:vir:8184 Length: 474 # 44.1 0.82 0.00051 21.1 26.5 393 36-492 1-474 (474) 184 protein:vir:98816 Length: 446 38.3 1.1 0.00067 20.4 22.0 390 21-467 1-446 (446) 185 protein:vir:95254 Length: 488 33.0 1.4 0.00086 19.8 24.4 427 11-513 1-481 (488) 186 protein:vir:79703 Length: 505 30.7 1.6 0.00097 19.5 26.4 416 9-473 1-505 (505) 187 protein:vir:80959 Length: 499 24.2 2.2 0.0014 18.7 30.0 411 11-492 1-499 (499) 188 protein:vir:5839 Length: 533 # 21.8 2.5 0.0016 18.4 25.3 436 14-513 1-528 (533) No 1 >protein:vir:572 Length: 506 # NCBI annotation: unknown # Family: family:all:6660 # MgeID: mge:13 # MgeName: SPBc2 # Cross-refs: genbank:acc:NP_046607;genbank:gi:9630180;genbank:GeneID:1261432 Probab=100.00 E-value=1.6e-79 Score=452.60 Aligned_cols=469 Identities=16% Similarity=0.223 Sum_probs=386.2 Q ss_pred CCCccchheeeeehhhhhhHHHHHHHHHHHHHhhccCc-------ccccccccccchHHHHHHHhhhcc--ChhHHHHHH Q lcl|NC_015263. 1 MVKNKKKRLSMIDVESISSYSNKRNNRISILRDDNRTP-------VFGAPVGSLTSSQSKVRKIVKEYR--NEGNQKTLR 71 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~-------~~~s~~~s~~~s~d~~k~~i~~~~--P~~n~~~ir 71 (513) ||--.|.- +|+ ..|--+. +| +++ -|+++||+-+....+..+.+++|+ |+.++++|. T Consensus 1 mvTl~K~~-----i~~-E~~~~~l-------N~--Y~TY~~~F~~GFi~~~~~NG~v~~i~~~~L~~~F~NPD~~~~~I~ 65 (506) T protein:vir:57 1 MVTLNKVD-----IES-EEYKQML-------ND--YSTYTSTFASGFISNMFSNGIVTEIEAEQLKNYFSNPDEFQEEIE 65 (506) T ss_pred Cceeechh-----ccH-HHHHHHH-------hh--hhHHHHHHHHHHHHHhhcCCceeeeeHHHHHhhhcChHHHHHHHH Confidence 55433322 111 1111111 11 222 356778886554444444455554 999999999 Q ss_pred HHHHHHHhhcchHHHHHHHHhhcccccceEeeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcceeEE Q lcl|NC_015263. 72 KVSEDLAVQSQQYQRLLNFYANMPLYAYSVVPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDIFYGY 151 (513) Q Consensus 72 ~~s~~lY~~sg~~~rlidy~~~mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~~~gy 151 (513) ++++|+|+..|.+++|.+++.+|||+||.| +...+.+++++..+.+..+|+| ++| +.+++++|++-++.|+ T Consensus 66 ~L~~Y~YI~~~~i~QL~~LI~aLP~L~Y~I-----~~~~k~K~~~~~iS~lN~~L~K--v~H--K~LTRDLL~Q~A~aGT 136 (506) T protein:vir:57 66 DLAQYFYISTAEIHQLFELIEALPTLNYKI-----DSFNKVKSSDKHISLLNKSLHK--VKH--KRLTRDLLKQVATAGT 136 (506) T ss_pred HHHHHhhhhcchHHHHHHHHHhcCCcceee-----hhhhhccchhhHHHHHHHHHHH--HHH--HHHHHHHHHHhhccCc Confidence 999999999999999999999999999999 5566677788899999999999 666 9999999999999999 Q ss_pred EEEc-----CcceeeeecCcceeEEEEEECCeeEEEEEeeeccCc------chhccccHHHHHHHHHHhhhhhccCcccc Q lcl|NC_015263. 152 VIDD-----KESVMIQQFPNDICKISSVSGGVYNYVIDLDALVSA------DIVDYYPKEIQEAVNKYTTMKKGNNKSAS 220 (513) Q Consensus 152 ~i~d-----~~~~~iq~lp~dyckIsg~~nG~y~~~fD~syFd~~------~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~ 220 (513) ++.. +.+|++.+-.++|++++|+++|.|++++||.+|++. ..++.++|-|+| ++|+.+++ ++.++ T Consensus 137 LvG~WLG~~k~PY~~iF~~iKYVFP~~R~~G~~V~VvD~~~F~~~~~~~R~~~~~~LSP~I~~--~~Y~~~~~--~~~~~ 212 (506) T protein:vir:57 137 LVGIWLGDAKSPYPFIFDEIKYVFPSFRRNGDWVCVVDMELFTKYKDDQRNELLKSLSPYIKQ--SDYENFMK--DREKY 212 (506) T ss_pred eeEeeecCCCCcchhhhhhhhhhccccccCCceEEEEehHHhhhhhHHHHHHHHHhhhhhhhh--hhhhhHhh--hHHhh Confidence 9953 889999999999999999999999999999999974 358889999999 59999986 56679 Q ss_pred cCeeecCCceEEEEecCccccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeec-cccCCCCCccc-cCHH--H Q lcl|NC_015263. 221 NWYEIQDKNSICIKINESSLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLE-TRSSNDNNDFT-LDMP--M 296 (513) Q Consensus 221 ~W~~L~~~kt~~ik~~~~~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip-~~~~n~~~~~~-vd~~--~ 296 (513) |+.+||.++|++.++++.+|+++.+.+++...++|++|.++|++++..|+|..|-.+.+. +++.+++|+++ +.++ . T Consensus 213 R~~~LP~~rT~~~R~~TL~RNQ~LG~~~~T~~L~Dv~HK~KLkD~E~SIA~KII~A~AVL~~~~~~~Ngeyt~~K~~~a~ 292 (506) T protein:vir:57 213 RFKELPQERTFPLRTGTLKRNQGLGTSWVTPGLYDVLHKKKLKDVERSIANKIINAVAVLTIGTDKGNGEYTNMKLPKAV 292 (506) T ss_pred hhhhcccccchhheeeeecccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhhheeeeeecccCCcccccccchHHH Confidence 999999999999999999999999999999999999999999999999999433344444 54456667776 4444 4 Q ss_pred HHHHHHHHHHhc----cccceEEEecccccccccccccccchhhhhhH-----HhhhhhhhhhhhhccCCCcchHHHHHH Q lcl|NC_015263. 297 MNYFHEALSMTV----PDNVGVVTSPMEIDTVSFDKDSSTDDSVEKAT-----KNFWDNAGVSQILFSSDNKTSQGIAMS 367 (513) Q Consensus 297 ~~~~~~~ik~~L----p~gv~~v~sP~~~d~i~ld~~~~~~dtv~~~~-----~~i~~~~GiS~~Lfn~d~~s~~~~~~S 367 (513) .+++|+.++.|| .+||++|..| ++.+|+|+.- .+|++..++ ++|.+|.|+|++|.||+++++++++++ T Consensus 293 K~Ki~~GVK~ALEK~~KDGv~~vs~P-DFA~~~FP~v--K~~~LD~~K~D~I~~DI~~A~GlS~~L~NG~~GNYAts~LN 369 (506) T protein:vir:57 293 KQKIHGGVKTALEKNQKDGVTVVSIP-DFADINFPDV--KADGLDGAKFDHINSDIQSAYGLSGSLLNGDGGNYATSSLN 369 (506) T ss_pred HHHHHHHHHHHHhcccccCeEEEecc-cccccccccc--cccCCCchhhcccchhhhhhhccchheecCCCcceeeeech Confidence 478999999999 7799999999 9999999843 336666665 889999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHH-HHHHHHHhh---cccceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHH Q lcl|NC_015263. 368 IATDEQFIFGVINQLE-RWLNRYLLL---NGMSKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAF 443 (513) Q Consensus 368 I~~d~~~~~~~~~~iE-~~~N~~i~~---~~~~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~ 443 (513) +.+..+.+..+++.|| +.+|..+.. ...++.|.|.|-..+|.+|+++++.++++.+.|+|.++++..++|+++++| T Consensus 370 LD~FYKrIGV~~E~IEqEvY~~L~~lvL~~~~~~NY~~~Y~KD~Pl~~~~K~D~LIKL~~~G~S~K~V~Dnl~GvS~E~Y 449 (506) T protein:vir:57 370 LDTFYKRIGVLMEDIEQEVYQKLFNLVLPAAQKDNYYMNYDKDKPLTLKEKMDILIKLNDKGWSIKHVVDNLAGVSWESY 449 (506) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCceeEeeCCCCccchhhhhchheeecccCccHHHHHHhhhccchHHH Confidence 9999999999999999 688887763 455778999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhhCcccccCcccccccccccccccCCccccCCCCcCCCCcccccccCCCCCCCC Q lcl|NC_015263. 444 TGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETTGNKDSDETQRAK 506 (513) Q Consensus 444 ~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~~n~~~~~~~~~~ 506 (513) ++++.+|.|.|.|.+++.|.++|||.||++- |.|+ ++...+..|+.+.++.++.+|- T Consensus 450 ~E~tlYE~E~LKL~EKI~P~~~s~~~tGN~v----G~P~--~~~~~~D~Tv~Satsngndnpi 506 (506) T protein:vir:57 450 LEQTLYETEELKLQEKIRPYQTSYTFTGNEV----GRPN--EGNKNNDNTVKSATSNGNDNPI 506 (506) T ss_pred HHHHHHHHHHhhHHhhcCcccccceeccccc----CCCC--CCCCcccchhhhcccCCCCCCC Confidence 9999999999999999999999999999752 2222 1111223345555555555554 No 2 >protein:vir:105154 Length: 525 # NCBI annotation: conserved phage-related protein # Family: family:all:6660 # MgeID: mge:1466 # MgeName: C-St # Cross-refs: genbank:acc:YP_398597;genbank:gi:80159853;genbank:GeneID:3772992 Probab=100.00 E-value=4.4e-77 Score=439.17 Aligned_cols=478 Identities=15% Similarity=0.173 Sum_probs=398.9 Q ss_pred CC-----CccchheeeeehhhhhhHHHHHHHHHHHHHhhc--cCcc----------cccccccccchHHHHHHHhhhcc- Q lcl|NC_015263. 1 MV-----KNKKKRLSMIDVESISSYSNKRNNRISILRDDN--RTPV----------FGAPVGSLTSSQSKVRKIVKEYR- 62 (513) Q Consensus 1 ~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~--~~~~----------~~s~~~s~~~s~d~~k~~i~~~~- 62 (513) |. |||..-+.-.|+++- .|--+ ++|+. ++++ |+++||+-+....+.-+.+++|+ T Consensus 1 ~~~~~~~~~~~~t~~k~~~~~e-~~~~~-------~n~~~~~y~ty~~~~~~f~~gfv~~~~~ng~i~~v~~~~l~~~f~ 72 (525) T protein:vir:10 1 MTRTKGSKNKSTTIEKQSLQIE-QLQEH-------INELERQYNTYDDVVDAFIDGFVMDLCNNGKIKTVNLDTLQLWFN 72 (525) T ss_pred CCCCcCCcccccchhhhhhhHH-HHHHH-------HhhhhhhcchhhhHHHHHHHHHHHHhhcCCceeeeeHHHHHhhhc Confidence 43 344333333343321 12221 22322 3444 67788885554444444444444 Q ss_pred -ChhHHHHHHHHHHHHHhhcchHHHHHHHHhhcccccceEeeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHH Q lcl|NC_015263. 63 -NEGNQKTLRKVSEDLAVQSQQYQRLLNFYANMPLYAYSVVPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKL 141 (513) Q Consensus 63 -P~~n~~~ir~~s~~lY~~sg~~~rlidy~~~mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~ 141 (513) |+.++++|.++++|+|+..|.+++|.+++.+|||+||.| +.....++++++.+.+..+|+| |-+++.++++ T Consensus 73 npd~~~~~i~~l~~y~yi~~~~v~ql~~li~~lp~l~y~i-----~~~~~~k~~~~~~s~~n~~l~k---~i~hk~ltrd 144 (525) T protein:vir:10 73 NPDKYINNIVNLLTYYYIIDGNVFQLYDLIFSLPPLDYQI-----KVLKRDKDYKEDLSTINLYLEK---KIQHKQLTRD 144 (525) T ss_pred ChHHHHHHHHHHHHHhhhhcchHHHHHHHHHhcCCcceee-----hhhhhccchhhHHHHHHHHHHH---hHHHHHHHHH Confidence 999999999999999999999999999999999999999 5666777888999999999999 8888999999 Q ss_pred HHHhcceeEEEEEc-----CcceeeeecCcceeEEEEEECCeeEEEEEeeeccCcc------hhccccHHHHHHHHHHhh Q lcl|NC_015263. 142 AMTVDIFYGYVIDD-----KESVMIQQFPNDICKISSVSGGVYNYVIDLDALVSAD------IVDYYPKEIQEAVNKYTT 210 (513) Q Consensus 142 ~l~~g~~~gy~i~d-----~~~~~iq~lp~dyckIsg~~nG~y~~~fD~syFd~~~------~L~~~p~Ei~~~y~~Y~~ 210 (513) +|++-++.|+++.. +.+|++.+-.++|++++|+++|.|++++||.+|++.. .++.++|-|+| ++|+. T Consensus 145 ll~q~a~~gtlig~wlg~~~~py~~vf~~~kyvfp~~r~~g~~v~vid~~~f~~~~~~~r~~~~~~lsp~i~~--~~y~~ 222 (525) T protein:vir:10 145 LLVQLAHSGTLIGTWLGSKREPYFNVFNNLKYVFPYGRAKGKMVAVIDLQWFDEMSELERKLTFENLSPLITE--NKYKK 222 (525) T ss_pred HHHHhhccCceeEeeecCCCCcchhhhhhhhhhccccccCCceEEEEehHHhhhhhHHHHHHHHHhhchhhhh--hhhhH Confidence 99999999999964 8899999999999999999999999999999999743 57889999999 59999 Q ss_pred hhhcc--CcccccCeeecCCceEEEEecCccccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCC Q lcl|NC_015263. 211 MKKGN--NKSASNWYEIQDKNSICIKINESSLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNN 288 (513) Q Consensus 211 ~k~~~--~~~~~~W~~L~~~kt~~ik~~~~~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~ 288 (513) +++++ ++.++||++||.++|+..+++..+++++.+.++++..++++++.++|++++..|++.-|-...+...++++++ T Consensus 223 ~~~~~~~~~~~~r~i~LP~e~t~~lr~~tl~rnqrlG~s~vtp~l~dI~hk~klrd~EqsIA~kii~a~avLk~gg~~gn 302 (525) T protein:vir:10 223 WKEYNGENEDALRYIMLPISKTLVARIHTLSRNQRLGIPYGTQTLFDIQHKQKLRDLEQSIADKIIKAMAVLKFRGKDDN 302 (525) T ss_pred HhhcccccchhheeeecccceeEEeeecccccCcccCcchhhhHHHHHHHHHHHHHHHHHHHHHhhhhheeeeeccccCc Confidence 99766 5667889999999999999999999999999999999999999999999999999954445555554667777 Q ss_pred ccccCHHHHHHHHHHHHHhc------cccceEEEecccccccccccccccchh-----hhhhHHhhhhhhhhhhhhccCC Q lcl|NC_015263. 289 DFTLDMPMMNYFHEALSMTV------PDNVGVVTSPMEIDTVSFDKDSSTDDS-----VEKATKNFWDNAGVSQILFSSD 357 (513) Q Consensus 289 ~~~vd~~~~~~~~~~ik~~L------p~gv~~v~sP~~~d~i~ld~~~~~~dt-----v~~~~~~i~~~~GiS~~Lfn~d 357 (513) +..|-....+.+|++|+.+| .+|+++|..| ++.+|+|+.-....++ +.+..++|..|+|+|++|.||+ T Consensus 303 ~mk~p~~~kqkil~gVk~aleK~~kdK~Gi~vi~~P-dfa~~efp~ik~~~~glDg~K~d~I~~DI~~A~GlS~sL~nGd 381 (525) T protein:vir:10 303 DSKVKESAKRKVLAGVKRALEKGVKDKNGIACIAMP-DFATFEFPEIKNGDKTLDPKKYDSIDNDITNATGISQVLTNGT 381 (525) T ss_pred cccCchHHHHHHHHHHHHHHhcccccccCeEEEecc-ceeecccccccCcccCCCchhhhhhhhhhhhhhccceeeecCC Confidence 77787777799999999999 4599999999 8999999765544343 6666799999999999999999 Q ss_pred CcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh---cccceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHH Q lcl|NC_015263. 358 NKTSQGIAMSIATDEQFIFGVINQLERWLNRYLLL---NGMSKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLAS 434 (513) Q Consensus 358 ~~s~~~~~~SI~~d~~~~~~~~~~iE~~~N~~i~~---~~~~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa 434 (513) +++++++++|+.+..+.++.+++.||..+|..|.- ...+++|.|.+-+.++.++|++++.|+++++.|++.++.+. T Consensus 382 ggNyAtaslnld~fykkigVm~e~Iee~y~kL~d~Vl~~~k~~nyifnydkd~pi~~kkk~d~LIkL~d~g~s~k~vld- 460 (525) T protein:vir:10 382 KGNYASAKLNLDVFYKKIGVMLEIIEEIYNQLIDIILGEEKGCNYIFQYNKDTPIEREKKLDTLIKLEAQGYSAKYVLD- 460 (525) T ss_pred CCceeeeeeeHHHHHHHHHHHHHHHHHHHHHHHhhhcCcccCcceEEecCCCchhhhhhhhhhhhhhhccchhhhhhhh- Confidence 99999999999999999999999999999998764 45577899999999999999999999999999999999999 Q ss_pred HhCCCHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccCCCCcCCCCc-ccccccCCC Q lcl|NC_015263. 435 LMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNE-TTGNKDSDE 501 (513) Q Consensus 435 ~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~e-t~~n~~~~~ 501 (513) +.|+++++|++..-+|.|.|.++++..|.++++|.||+++ +.-|.|+.++ .|+++ |..++++|- T Consensus 461 l~gis~e~y~E~s~yEtE~lkl~EKi~pp~~~~v~SGk~~-n~iG~P~~dd--~~~~dati~s~~~~~ 525 (525) T protein:vir:10 461 ILGISSEEYFEESIYEIEKLKLREKIMPPLNTNVLSGKDG-NDIGSPKLDD--SDSSDATIESKERGV 525 (525) T ss_pred hhccCcchHHHHHHHHHHHHHHhhhccccccceeeecccc-ccccCCccCC--CcchhhhhhhhhcCC Confidence 7899999999999999999999999999999999999653 4444444332 34444 567777766 No 3 >protein:vir:1266 Length: 416 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690758;genbank:gi:22854998;genbank:GeneID:955213 Probab=99.16 E-value=3e-10 Score=72.80 Aligned_cols=392 Identities=13% Similarity=0.065 Sum_probs=191.0 Q ss_pred ccccccccccchHHHHH----HHhhhccChhHHHHHHHHHHHHHhhcchHHHHHHHHhhc-ccccceEeeccchhhhhhc Q lcl|NC_015263. 39 VFGAPVGSLTSSQSKVR----KIVKEYRNEGNQKTLRKVSEDLAVQSQQYQRLLNFYANM-PLYAYSVVPFKDISTANEN 113 (513) Q Consensus 39 ~~~s~~~s~~~s~d~~k----~~i~~~~P~~n~~~ir~~s~~lY~~sg~~~rlidy~~~m-pt~dY~I~P~~~~~~~~~~ 113 (513) |+.+.+|.+.+...... ..+..++.......-+.+..--+..++.+++.|+.+++. ..+...++--. +... + T Consensus 1 m~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~Ia~~ia~l~~~~~~~~-~~~~-~- 77 (416) T protein:vir:12 1 MLLERMFEKRSGSSDHEDGFNNILLNMFGGRKTASGERVSESNSLVQPDIFACVNVLSDDIAKLPIHTYKRT-DGGI-E- 77 (416) T ss_pred CccchhcccccCccccCccchhHHHHhhcCcccccCceechhhhhccHHHHHHHHHHHHhhhhCceEEEEec-CCcc-c- Confidence 66666665443221111 111111100000000111111222456677888887432 23333332101 0000 0 Q ss_pred chhHHHHHHHHHH-hh----cChhHHHHHHHHHHHHhcceeEEEEEcCcc--eeeeecCcceeEEEEEECCe-eEEEEEe Q lcl|NC_015263. 114 KLKKELATVTEFL-SR----LNPKYNFSKIVKLAMTVDIFYGYVIDDKES--VMIQQFPNDICKISSVSGGV-YNYVIDL 185 (513) Q Consensus 114 ~~~~~y~~v~~~L-~k----~n~k~~~~~i~~~~l~~g~~~gy~i~d~~~--~~iq~lp~dyckIsg~~nG~-y~~~fD~ 185 (513) ...+ +.....| .+ +.-..+...++..++..|..|.|+.-+..+ ..+.++|+++|.|.--.++. +.|.+-. T Consensus 78 -~~~~-~~l~~~l~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~~~~~~~~~~ 155 (416) T protein:vir:12 78 -RKPE-HKSAHAVYARPNPYMTAFTWKKLMMTHVLTWGNAYSYIQFGSHGYPEALFPLRPDYTNAYVHPTTGMLWYQTVL 155 (416) T ss_pred -cccc-cHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCcceEEEEeCCCcEEEEEEec Confidence 0111 1222333 33 445677888899999999999999866544 67899999999998665543 3222110 Q ss_pred eeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEE-ecCccccchhhHHHHHHhHHHHHHHHHHH- Q lcl|NC_015263. 186 DALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIK-INESSLTPVPPFAGTFDSIYDIHSFKDLR- 263 (513) Q Consensus 186 syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik-~~~~~~~~ip~f~~v~~d~~di~~~kdL~- 263 (513) +.. =++++...-+-|+ ...+...|++|...+...+--.....+.. T Consensus 156 ----~g~-----------------------------~~~~~~~eiih~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~ 202 (416) T protein:vir:12 156 ----NGK-----------------------------AIELYDYEVLHFKGLSTDGIHGKSPIGVVREHIGAQAAATKYNA 202 (416) T ss_pred ----CCe-----------------------------EEEecCccEEEecCcCCCCcccccHHHHHHHHHHHHHHHHHHHH Confidence 110 0233333333343 23344578888777765554433333322 Q ss_pred hhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhc-cccceEEEeccccccccccc-ccccchhhhhhHH Q lcl|NC_015263. 264 NDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTV-PDNVGVVTSPMEIDTVSFDK-DSSTDDSVEKATK 341 (513) Q Consensus 264 ~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~L-p~gv~~v~sP~~~d~i~ld~-~~~~~dtv~~~~~ 341 (513) +.-..-.....+++ +| + .++.++++++.+..+.+- -.++..+-..++++.+.+.. +..--.+.+-..+ T Consensus 203 ~~~~ng~~p~~il~-~~-------~--~~~~e~~~~~~~~~~~~~~~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~ 272 (416) T protein:vir:12 203 KLYKNEATPRGILK-VP-------A--FLDEKPKENVRKEWKRVNKVENIAIIDYGLEYQSISMPLQEAQFVESMKFNKA 272 (416) T ss_pred HHHhcCCCCceEEe-cC-------C--CCCHHHHHHHHHHHHHHhcCCCeeecCCCceEEEccCChhhHHHHHHHHHHHH Confidence 11111111223332 22 1 366777777777666543 23444444555555555432 1122234555568 Q ss_pred hhhhhhhhhhhhccCC-CcchHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhh--c-ccceEEEEEecCCCCccHHHHHH Q lcl|NC_015263. 342 NFWDNAGVSQILFSSD-NKTSQGI-AMSIATDEQFIFGVINQLERWLNRYLLL--N-GMSKYFKATMLEVTHFSKKEAHD 416 (513) Q Consensus 342 ~i~~~~GiS~~Lfn~d-~~s~~~~-~~SI~~d~~~~~~~~~~iE~~~N~~i~~--~-~~~~~f~~~~l~~T~fn~ke~~~ 416 (513) +|-.+.||...++|.. +.+++.+ .....--..-+.-++.+||.++|+.|-. . ..+..|+|.+-+....+.++.++ T Consensus 273 ~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~f~~~~l~P~~~~ie~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~ 352 (416) T protein:vir:12 273 QISMIYKVPLHKLNELDKATFSNIEHQSIEYVRNTLQPWIVNFEQELNVKLFLDHDQKSGHYVKFNIDSELRGDSKTQAE 352 (416) T ss_pred HHHHHhCCCHHHhCCccCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCchhhcCCceEEeechhhhccCHHHHHH Confidence 8999999998888643 2333332 2222233333446999999999998742 2 12345776666667778888888 Q ss_pred HHHHHHhcCC-cHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCcccc-CCCCcCCCCcc Q lcl|NC_015263. 417 RYITDAQYGF-PVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEK-GKENGRPTNET 493 (513) Q Consensus 417 ~~~~~~~~G~-~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~-~~~~grPt~et 493 (513) .+.++..-|. .+=..-+ .+|+.|.+ |-+..++|+..... +..+.....++. ...||.+.++. T Consensus 353 ~~~~~~~~G~~T~NE~R~-~~gl~Pi~------------ggd~~~~~~n~~~~--~~~~~~~~~~~~~~~~gge~~~~g 416 (416) T protein:vir:12 353 YLKTLHETGVLNKDEIRE-LLERNPIE------------NGDKYISSLNYVFL--DFLEEYQRLKAGGAMKGGDNKNEG 416 (416) T ss_pred HHHHHHhCCCcCHHHHHH-HhCCCCCC------------Ccceeeeccccccc--cccchhhccccccccCCCCCcCCC Confidence 8888887774 3333333 45777642 23444555432211 000000000000 11223222221 No 4 >protein:vir:483 Length: 413 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543090;swissprot:trembl:q8w629;genbank:gi:18249902;uniprot:Q8W629;genbank:GeneID:929685 Probab=99.16 E-value=5.1e-10 Score=71.52 Aligned_cols=393 Identities=12% Similarity=0.111 Sum_probs=190.6 Q ss_pred ccccccccccchHHHH--HHHhhhccChhHHHHHHHHHHHHHhhcchHHHHHHHHhh-cccccceEeeccchhhhhhcch Q lcl|NC_015263. 39 VFGAPVGSLTSSQSKV--RKIVKEYRNEGNQKTLRKVSEDLAVQSQQYQRLLNFYAN-MPLYAYSVVPFKDISTANENKL 115 (513) Q Consensus 39 ~~~s~~~s~~~s~d~~--k~~i~~~~P~~n~~~ir~~s~~lY~~sg~~~rlidy~~~-mpt~dY~I~P~~~~~~~~~~~~ 115 (513) ++-+..|.+.+....+ .+++.-+-.......-+.++.-.|...+.+.+.|+.+++ +..+...++--.... +... T Consensus 1 ~~f~~~f~r~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~~l~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~~---~~~~ 77 (413) T protein:vir:48 1 MFFSGLFQRKSDAPVTTPAELAEAIGLSYDTYTGKRISSQRAMRLTAVYSCVRVLAESVGMLPCSLYKISGTL---KTRV 77 (413) T ss_pred CccchhhccCccCCccchHHHHHhhhcCcccccCceechhhhhccHHHHHHHHHHHHhhhhCceEEEEecCCc---ceee Confidence 3334444432211100 011110000000000111223334456677777777764 333444443211111 1111 Q ss_pred hHHHHHHHHHHhh-----cChhHHHHHHHHHHHHhcceeEEEEEc-CcceeeeecCcceeEEEEEECCeeEEEEEeeecc Q lcl|NC_015263. 116 KKELATVTEFLSR-----LNPKYNFSKIVKLAMTVDIFYGYVIDD-KESVMIQQFPNDICKISSVSGGVYNYVIDLDALV 189 (513) Q Consensus 116 ~~~y~~v~~~L~k-----~n~k~~~~~i~~~~l~~g~~~gy~i~d-~~~~~iq~lp~dyckIsg~~nG~y~~~fD~syFd 189 (513) . =+.+...|.. +.-..+...++..++..|..|.+.+.+ +.+.-+.++|+++|.+.--.+|.+.|.+-... T Consensus 78 ~--~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~~g~~~~L~~l~~~~v~~~~~~~~~~~y~~~~~~-- 153 (413) T protein:vir:48 78 V--DERLHKLVSAKPNGYMTPQEFWELVIVCLCLRGNFYAYKVKALGEVVELLPIDPGCVEPKLNSQWQPVYQVTFPD-- 153 (413) T ss_pred c--ccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCceEEEEEeCCCcEEEEEEEcCceEEEEEcCCceEEEEEEecC-- Confidence 1 1334445542 556788888999999999999998864 44567889999999988666666555332110 Q ss_pred CcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEEe-cCccccchhhHHHHHHhHHHHHHHHHHHhhHhh Q lcl|NC_015263. 190 SADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKI-NESSLTPVPPFAGTFDSIYDIHSFKDLRNDKAE 268 (513) Q Consensus 190 ~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~-~~~~~~~ip~f~~v~~d~~di~~~kdL~~~~~~ 268 (513) .....++...-+-|+. ..+..+|++|...+...+--.....+. ...- T Consensus 154 ------------------------------g~~~~~~~~evih~~~~~~d~~~G~s~i~~~~~~i~~~~~~~~~--~~~~ 201 (413) T protein:vir:48 154 ------------------------------GSVDVLTQDEIWHVRTLTLDGLVGLNPIAYAREAISLAAATEEH--GARL 201 (413) T ss_pred ------------------------------ceEEEEccccEEEecCcCCCCcccccHHHHHHHHHHHHHHHHHH--HHHH Confidence 0111233333333442 234457888877666443322222221 1122 Q ss_pred hhhceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhc--cc---cceEEEeccccccccccc-ccccchhhhhhHHh Q lcl|NC_015263. 269 LQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTV--PD---NVGVVTSPMEIDTVSFDK-DSSTDDSVEKATKN 342 (513) Q Consensus 269 i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~L--p~---gv~~v~sP~~~d~i~ld~-~~~~~dtv~~~~~~ 342 (513) ..|-...-.-|.+ + -.++.++++++.+.+++.. ++ ++..+-..+++..+.+.. +..--++.+-..++ T Consensus 202 ~~ng~~p~gil~~---~----~~~~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~ 274 (413) T protein:vir:48 202 FGNGAVTSGVLRT---E----QKLTPDAYERLKKDFEERHTGLGNAHRPMILEMGLDWKSMALNAEDSQFLETRKFQLEE 274 (413) T ss_pred HhccCCcceEEEe---C----CCCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEeccCChhHHHHHHHHHHHHHH Confidence 2221112222222 1 1356677777777777665 12 233333444555554421 11112344455688 Q ss_pred hhhhhhhhhhhccCCC-cchHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHh--hcccceEEEEEecCCCCccHHHHHHHH Q lcl|NC_015263. 343 FWDNAGVSQILFSSDN-KTSQG-IAMSIATDEQFIFGVINQLERWLNRYLL--LNGMSKYFKATMLEVTHFSKKEAHDRY 418 (513) Q Consensus 343 i~~~~GiS~~Lfn~d~-~s~~~-~~~SI~~d~~~~~~~~~~iE~~~N~~i~--~~~~~~~f~~~~l~~T~fn~ke~~~~~ 418 (513) |..+.||...++|... .+.+. -...+.--..-+.-++++||..+|+.|- ....+..|+|.+-+....+.++.++.+ T Consensus 275 Ia~~fgVPp~~lg~~~~~t~~n~e~~~~~f~~~~i~P~~~~ie~~l~~~L~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~ 354 (413) T protein:vir:48 275 ICRLFRVPLHMVQNTDRATFNNIEELGLGFINYSLVPYLTRIEQRINTGLVRESKQGKFYAKFNAGALLRGDMKSRFEAY 354 (413) T ss_pred HHHHhCCCHHHhCCCcCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHhhccCccccCCeEEEEechhhhccCHHHHHHHH Confidence 9999999988886432 22222 2222223333344688899999999874 333345577766666667888888888 Q ss_pred HHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccCCCCcCCCCccccccc Q lcl|NC_015263. 419 ITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETTGNKD 498 (513) Q Consensus 419 ~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~~n~~ 498 (513) .++.+-|.=..--.-+.+|+.|.+ +-|..+.|+- ...... .+++.|.|+.++..+.+ T Consensus 355 ~~~~~~g~~T~NE~R~~~g~~p~~------------ggD~~~~~~n---~~~~~~--------~~~~~~~~~~~~~~~~~ 411 (413) T protein:vir:48 355 ATGINWGIYSPNDCRDLEDMNPRP------------GGDVYLTPMN---MTTSPS--------AGDDNGKKKESGDADKT 411 (413) T ss_pred HHHHhCCCcCHHHHHHHhCCCCCC------------Ccceeecccc---cccccc--------ccccCCCCCCCCCcccc Confidence 887776643333333345666532 2344444432 211111 11233333322222111 Q ss_pred CC Q lcl|NC_015263. 499 SD 500 (513) Q Consensus 499 ~~ 500 (513) +. T Consensus 412 ~~ 413 (413) T protein:vir:48 412 AS 413 (413) T ss_pred CC Confidence 11 No 5 >protein:vir:3153 Length: 467 # NCBI annotation: capsid protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665924;genbank:gi:22091110;genbank:GeneID:951257 Probab=99.11 E-value=1.2e-09 Score=69.40 Aligned_cols=407 Identities=7% Similarity=0.029 Sum_probs=186.7 Q ss_pred HHHHHHHHHhhcchHHHHHHHHhh-cccccceEeeccchhhhhhcchhHHHHHHHHHHhhcC--------------hhHH Q lcl|NC_015263. 70 LRKVSEDLAVQSQQYQRLLNFYAN-MPLYAYSVVPFKDISTANENKLKKELATVTEFLSRLN--------------PKYN 134 (513) Q Consensus 70 ir~~s~~lY~~sg~~~rlidy~~~-mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n--------------~k~~ 134 (513) ||+|++ .|+.+++.|+.++. +..+-+.|.|-.... ....-...+.....+|.... ..+. T Consensus 1 l~~l~~----~n~~v~~ci~~ia~~ia~~p~~i~~~~~~~--~~~~~~~~~~~~~~~l~~~~pn~~~~~~~~~~~t~~~~ 74 (467) T protein:vir:31 1 MAELLE----HNETHAKCVHAKSRYVAGFGINIIPHPEAE--DPDRDGEQYERVWDFWFGDDSNWQVGPMESERATATNV 74 (467) T ss_pred Chhhhh----cCHHHHHHHHHHHHhhhcCCeEEEEccCcc--cccchhhhhhhHHHHhhccCCCccccchhhHhhHHHHH Confidence 666665 58888888887763 334445555522111 11112233333333443332 3456 Q ss_pred HHHHHHHHHHhcceeEEEEEcCc--ceeeeecCcceeEEEEEECCeeEEEEE--eeeccCc--chhccccHHHHHHHHHH Q lcl|NC_015263. 135 FSKIVKLAMTVDIFYGYVIDDKE--SVMIQQFPNDICKISSVSGGVYNYVID--LDALVSA--DIVDYYPKEIQEAVNKY 208 (513) Q Consensus 135 ~~~i~~~~l~~g~~~gy~i~d~~--~~~iq~lp~dyckIsg~~nG~y~~~fD--~syFd~~--~~L~~~p~Ei~~~y~~Y 208 (513) +..++.+++..|..|.+.+.+.. .+.+.++|+++|++..-.. .|+...+ -.||... ........+.-. .+ T Consensus 75 ~~~~~~~l~l~Gn~~i~~~r~~~G~~~~l~~l~~~~v~~~~d~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~ 150 (467) T protein:vir:31 75 LQTAWTDYEAIGWLTIEILTQTDGTPTGLAYVPGHTIRKRMDER-GFVQLLEEKEKYFGVAGDRYQTNGNGDLDP---VF 150 (467) T ss_pred HHHHHHHHHhcCCeEEEEEECCCCcEEEEEEeCCceeEeeeecc-eeEeecCCceeeEEeccccceeecccceee---ee Confidence 67788888889999999997654 4789999999999873322 2221111 1111100 000000000000 00 Q ss_pred hhhhhccCcccccCeeecCCceEEEEec--CccccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCC Q lcl|NC_015263. 209 TTMKKGNNKSASNWYEIQDKNSICIKIN--ESSLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSND 286 (513) Q Consensus 209 ~~~k~~~~~~~~~W~~L~~~kt~~ik~~--~~~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~ 286 (513) . ..........+.++...-+-|+.. ....+|+||..++...+.-....++... .-..|-...-.-|-+ . T Consensus 151 ~---~~~~~~~~~~~~~~~~diih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~--~~f~ng~~p~gil~~----~ 221 (467) T protein:vir:31 151 V---DADDGSTGTSVSNPANELIFKRNHSPLYPHYGAPDIIPAVKTIRGDSAAQDYNI--DFFENDGVPRIAIIV----K 221 (467) T ss_pred e---eeccccccceeEeccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHH--HHHhccCCCceEEEe----c Confidence 0 000011123455666545555543 3455799999988776644444333221 112332111111111 1 Q ss_pred CCccccCHHHHHHHHHHHHHhccc----------cce------EEEecccccc-----ccccccc-cc---chhhhhhHH Q lcl|NC_015263. 287 NNDFTLDMPMMNYFHEALSMTVPD----------NVG------VVTSPMEIDT-----VSFDKDS-ST---DDSVEKATK 341 (513) Q Consensus 287 ~~~~~vd~~~~~~~~~~ik~~Lp~----------gv~------~v~sP~~~d~-----i~ld~~~-~~---~dtv~~~~~ 341 (513) +..++.++++.+.+.+++...+ |+. .+...++... .+|.... .+ -.+.+-..+ T Consensus 222 --~~~l~~e~~~~~~~~~~~~~~~~~~~~~~~~~g~~n~~~~~~l~~g~~~~~~~~~~~~ls~~~~~d~qf~e~~~~~~~ 299 (467) T protein:vir:31 222 --GAELTEKGREEMRNLIEDNNEDNHRTAFIETEKIVQNEDYLNLADGADRSDVEIRLEPLTVGIDEEASFLEFRGRNEH 299 (467) T ss_pred --CcCCCHHHHHHHHHHHHhhhcchhhhhhhhhcccccccccccccCCCcccccceeEEeccccChhhHHHHHHHHHHHH Confidence 1236777776666666554422 111 1111111112 2221111 11 133445567 Q ss_pred hhhhhhhhhhhhccC-CCcch-HHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhhcc---cceEEEEEecCCCCccHHHHH Q lcl|NC_015263. 342 NFWDNAGVSQILFSS-DNKTS-QGI-AMSIATDEQFIFGVINQLERWLNRYLLLNG---MSKYFKATMLEVTHFSKKEAH 415 (513) Q Consensus 342 ~i~~~~GiS~~Lfn~-d~~s~-~~~-~~SI~~d~~~~~~~~~~iE~~~N~~i~~~~---~~~~f~~~~l~~T~fn~ke~~ 415 (513) +|-.+.||...++|- ++++. +.+ +....--...+.-++.+||..+|+.|-... ....++|.+-.....+.++.+ T Consensus 300 ~Ia~~fgVpp~~lG~~~~~~~~s~~e~~~~~f~~~~l~P~~~~ie~~ln~~l~~~~~~~~~~~i~f~~~~l~~~d~~~~~ 379 (467) T protein:vir:31 300 DILKVHDVPPVIAGVVESGAFSTDAEEQRKEFAEETIQPKQHDFGELLYELVHKQGLDAPDWTIEFELAKPDTKLQDVEI 379 (467) T ss_pred HHHHHhCCCHHHcccCCCCCcccCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhccCCceEEEecchhhccCHHHHH Confidence 899999999888852 22222 122 222222233344588999999999874322 244577777788788999999 Q ss_pred HHHHHHHhcCC-cHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCcccccc-cccccccc--cCCccccCCCCcCCCC Q lcl|NC_015263. 416 DRYITDAQYGF-PVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSF-NTSGSDIA--ENAIKEKGKENGRPTN 491 (513) Q Consensus 416 ~~~~~~~~~G~-~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~-T~Sg~~~~--~~~~~~~~~~~grPt~ 491 (513) +.+..+..-|. ..=... ..+|+.|.. ++.+.|..+.- +..|+... ...++.+....+.|.+ T Consensus 380 ~~~~~~~~~G~~T~NE~R-~~~Gl~pi~--------------d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 444 (467) T protein:vir:31 380 ASQRVQAMQGLLTVNELR-DEFGFEPFP--------------EEHVYGGETLVAEVTGGSGPGGGIGDQIEQLVEDRADE 444 (467) T ss_pred HHHHHHHhCCCcCHHHHH-HHhCCCCCC--------------cccccCCcccccccccccCCCCcccCcCCCCCCCcccc Confidence 99988888884 344433 346776631 11222222211 11222100 0000000000011111 Q ss_pred cccccccCCCCCCCC--CCccCC Q lcl|NC_015263. 492 ETTGNKDSDETQRAK--DKPANT 512 (513) Q Consensus 492 et~~n~~~~~~~~~~--d~~~~~ 512 (513) ........-+++.+- ++-+++ T Consensus 445 ~~~~~~~~~~~~~~~~~~~~~~~ 467 (467) T protein:vir:31 445 IIDSYQADLETEQLIEIGANADS 467 (467) T ss_pred hHhhhhhccccchhhhhccccCC Confidence 000000000111111 111111 No 6 >protein:vir:102080 Length: 429 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512313;genbank:gi:89152482;genbank:GeneID:3953073 Probab=99.06 E-value=1.9e-09 Score=68.40 Aligned_cols=405 Identities=13% Similarity=0.106 Sum_probs=194.8 Q ss_pred hhhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhcchHHHHHHHHhh-cc Q lcl|NC_015263. 17 ISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQQYQRLLNFYAN-MP 95 (513) Q Consensus 17 ~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~~~rlidy~~~-mp 95 (513) .+-.++|-+..+ | +.++. -...........++-.. +.. ..++.--...++.+++.|+.+++ +. T Consensus 1 M~~~~~~f~~~~---r--~~~~~-----~~~~~~~~~~~~~~g~~-~~~-----~~v~~~~al~~~~v~~~i~~ia~~ia 64 (429) T protein:vir:10 1 MDSVKKFFNFEK---R--QTSQV-----IELNKDDEKLLEWLGIS-PST-----ISVKGKNALKVATVFACIKILSESVS 64 (429) T ss_pred Cchhhhhhcccc---c--Ccccc-----cccCCChHHHHHHhcCC-CCc-----ceechhhhhccHHHHHHHHHHHHhhc Confidence 111111111100 0 00110 01111111222222110 100 01111122356788888888876 33 Q ss_pred cccceEeeccchhhhhhcchhHHHHHHHHHHhh-----cChhHHHHHHHHHHHHhcceeEEEEEcCc--ceeeeecCcce Q lcl|NC_015263. 96 LYAYSVVPFKDISTANENKLKKELATVTEFLSR-----LNPKYNFSKIVKLAMTVDIFYGYVIDDKE--SVMIQQFPNDI 168 (513) Q Consensus 96 t~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k-----~n~k~~~~~i~~~~l~~g~~~gy~i~d~~--~~~iq~lp~dy 168 (513) .+...++ ... ..+.-+..-+.+...|.. +.-..+...++..++..|..|.+++-+.. .+.+.++|++. T Consensus 65 ~l~~~~~----~~~-~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~ 139 (429) T protein:vir:10 65 KLPLKIY----QED-EYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASK 139 (429) T ss_pred cCceEEE----Eec-CCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCce Confidence 3444443 111 111111111233444543 44567888889999999999999987654 46889999999 Q ss_pred eEEEEEECCeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEEec--CccccchhhH Q lcl|NC_015263. 169 CKISSVSGGVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKIN--ESSLTPVPPF 246 (513) Q Consensus 169 ckIsg~~nG~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~~--~~~~~~ip~f 246 (513) |.+.--++|.+.+.....|+.... ..-..++++.-+-|+.+ .+...|++|. T Consensus 140 v~v~~~~~~~~~~~~~~~~~~~~~---------------------------g~~~~~~~~evih~~~~~~~~~~~G~s~i 192 (429) T protein:vir:10 140 VTVYIDDVGLLNSKTKMWYVVNTG---------------------------GQQRVLKPEEILHFKNGITLDGLVGVPTM 192 (429) T ss_pred eEEEEcCcccccccceEEEEEccC---------------------------CeEEEEccccEEEecCCCCCCCcccccHH Confidence 998754555554443333321100 01133555445556532 3345688888 Q ss_pred HHHHHhHHHHHHHHHHHhhHhhhhh---ceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhc---c--ccceEEEec Q lcl|NC_015263. 247 AGTFDSIYDIHSFKDLRNDKAELQN---YKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTV---P--DNVGVVTSP 318 (513) Q Consensus 247 ~~v~~d~~di~~~kdL~~~~~~i~n---~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~L---p--~gv~~v~sP 318 (513) ..+...+-......+.... -..| ...++ ++| + .++.++++++.+.+..+. . .++.++-.. T Consensus 193 ~~~~~~i~~~~~~~~~~~~--~~~ng~~~~~il-~~~-------~--~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g 260 (429) T protein:vir:10 193 EYLKSTLENSASADKFINN--FYKQGLQVKGLV-QYV-------G--DLNEDAKKVFRENFESMSSGLQNSHRIALMPVG 260 (429) T ss_pred HHHHHHHHHHHHHHHHHHH--HHhccCCccEEE-EcC-------C--CCCHHHHHHHHHHHHHHhccccccCceeecCCC Confidence 7777655444444442211 1222 12222 222 1 367777777777777655 1 234444344 Q ss_pred cccccccccc-ccccchhhhhhHHhhhhhhhhhhhhccC-CCcchHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhh-cc Q lcl|NC_015263. 319 MEIDTVSFDK-DSSTDDSVEKATKNFWDNAGVSQILFSS-DNKTSQGIA-MSIATDEQFIFGVINQLERWLNRYLLL-NG 394 (513) Q Consensus 319 ~~~d~i~ld~-~~~~~dtv~~~~~~i~~~~GiS~~Lfn~-d~~s~~~~~-~SI~~d~~~~~~~~~~iE~~~N~~i~~-~~ 394 (513) +++..+.+.. +..--++.+-..++|..+.||...++|. ++++++.+. ..+.-...-+.-++++||..+|+.|-. .. T Consensus 261 ~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~kl~~~~~ 340 (429) T protein:vir:10 261 YQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQFYTDTLQATLTMYEQEMTYKLFLDSE 340 (429) T ss_pred ceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcChhh Confidence 4555554431 1222233455568899999999988863 222333332 222233334446999999999997742 21 Q ss_pred c--ceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCccccccccccc Q lcl|NC_015263. 395 M--SKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGS 472 (513) Q Consensus 395 ~--~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~ 472 (513) . +..|+|.+-....-+.++.++.+.++..-|.=..--.-+.+|+.|. | +-|..++|+. ...- T Consensus 341 ~~~g~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~--------~----ggD~~~~~~n---~~~~- 404 (429) T protein:vir:10 341 LDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPE--------A----GGDRLLVNGN---MLPI- 404 (429) T ss_pred cCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--------C----CcCeeeeccc---ccch- Confidence 2 3456666656666788888888888877763222222233455552 1 3344444432 1100 Q ss_pred ccccCCccccCCCCcCCCCcccccccCCCCC Q lcl|NC_015263. 473 DIAENAIKEKGKENGRPTNETTGNKDSDETQ 503 (513) Q Consensus 473 ~~~~~~~~~~~~~~grPt~et~~n~~~~~~~ 503 (513) +......-.+++++|.+ .+++++++ T Consensus 405 d~~~~~~~k~g~~~~~~------~~~~~e~~ 429 (429) T protein:vir:10 405 DMAGQAYLKGGDTNGEV------SKEGNEGN 429 (429) T ss_pred hhccccccCCCCCCCCC------CCCCCCCC Confidence 00000000011222222 22233333 No 7 >protein:vir:93610 Length: 454 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449295;genbank:gi:157166043;interpro:IPR006427;interpro:IPR006944;uniprot:Q6H9U6;genbank:GeneID:5580432 Probab=99.03 E-value=4.7e-09 Score=66.22 Aligned_cols=423 Identities=9% Similarity=0.039 Sum_probs=196.2 Q ss_pred eeehhhhhhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhcc--ChhHHHHHHHHHHHHHhhcchHHHHH Q lcl|NC_015263. 11 MIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYR--NEGNQKTLRKVSEDLAVQSQQYQRLL 88 (513) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~--P~~n~~~ir~~s~~lY~~sg~~~rli 88 (513) |.|.=.-+- ..++- -+++ .+...+..| .++-.-+ +-..- ..+..--.-..+.+.+.| T Consensus 1 ~~~~~~~~~-----~~~~~-~~~~--~~~~~~~~~----------~~~~~~~~g~~~~g---~~v~~~~al~~~~V~~~v 59 (454) T protein:vir:93 1 MWNLLRRTR-----KNQKS-GRDV--REAGWTSLF----------QAVAEPFAGAWQQG---VKADPEAVLSFHAVFACI 59 (454) T ss_pred CCCccccCc-----ccccc-cccc--cchhhhhhh----------hhhhhhhcchhhcC---cccChHHhhccHHHHHHH Confidence 222110000 00000 0000 000000000 0000000 00000 000000111234566677 Q ss_pred HHHhh-cccccceEeeccchhhhhhcchhHHHHHHHHHHhhcC----hhHHHHHHHHHHHHhcceeEEEEEcCcc--eee Q lcl|NC_015263. 89 NFYAN-MPLYAYSVVPFKDISTANENKLKKELATVTEFLSRLN----PKYNFSKIVKLAMTVDIFYGYVIDDKES--VMI 161 (513) Q Consensus 89 dy~~~-mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n----~k~~~~~i~~~~l~~g~~~gy~i~d~~~--~~i 161 (513) +.++. +..+...++-- ..+...+ .. .+ .-...++.+=| --++...++..++..|..|.+++-+..+ ..+ T Consensus 60 ~~Ia~~iA~lp~~~~~~-~~~g~~~-~~-~~-~~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~G~~~~L 135 (454) T protein:vir:93 60 SLISQDIAKMRLRLMQT-DAQGIRR-ET-RR-GDIARLCRRPNAQQNRIQFFELWLNAKLRHGNTVVLKIRNARGQIKEL 135 (454) T ss_pred HHHHHhhccCceEEEEe-ccCCccc-hh-hh-HHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEE Confidence 76532 33333344310 0111111 11 11 11233444434 4567778888999999999999876544 579 Q ss_pred eecCcceeEEEEEECCeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEEe--cCcc Q lcl|NC_015263. 162 QQFPNDICKISSVSGGVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKI--NESS 239 (513) Q Consensus 162 q~lp~dyckIsg~~nG~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~--~~~~ 239 (513) .++|++.|.|.-..+|...|.+....- . .....+.++.+--+-|+. ..+. T Consensus 136 ~~i~~~~v~v~~~~~g~~~y~~~~~~~----~------------------------~~~~~~~~~~~eViH~k~~~~~~~ 187 (454) T protein:vir:93 136 RILDWNRVEPLVADDGEVFYRITPDRN----C------------------------GITEAVTVPAREVIHDRFNCFFHP 187 (454) T ss_pred EEEcCcceEEEEcCCCcEEEEEEeccc----c------------------------ccceeEEecCcceEEeccCCCCCC Confidence 999999999987778877666532220 0 001234455554555664 2345 Q ss_pred ccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhc--cc--cceEE Q lcl|NC_015263. 240 LTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTV--PD--NVGVV 315 (513) Q Consensus 240 ~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~L--p~--gv~~v 315 (513) ..|++|...+...+--....++... .-..|-...-.-|.+ ++ .++.++++++.+.+++.. .+ ++..+ T Consensus 188 ~~G~sp~~~~~~~i~~~~~~~~~~~--~~f~ng~~p~gil~~-----~~--~l~~e~~~~~~~~~~~~~~g~n~g~~~vl 258 (454) T protein:vir:93 188 LIGLPPVYAAGLAATQGHHIQENST--SFFRNGGRPSGVIEI-----PG--SITEENAKKLKSNWDSGYTGENAGKTAIL 258 (454) T ss_pred ceeccHHHHHHHHHHHHHHHHHHHH--HHHhccCCccEEEec-----CC--CCCHHHHHHHHHHHHHHhcccccCCceec Confidence 6788888777666554444444221 122331111111221 11 256667777766666655 11 24444 Q ss_pred Eecccccccccccccccch---hhhhhHHhhhhhhhhhhhhccCC-CcchHHHH-HHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015263. 316 TSPMEIDTVSFDKDSSTDD---SVEKATKNFWDNAGVSQILFSSD-NKTSQGIA-MSIATDEQFIFGVINQLERWLNRYL 390 (513) Q Consensus 316 ~sP~~~d~i~ld~~~~~~d---tv~~~~~~i~~~~GiS~~Lfn~d-~~s~~~~~-~SI~~d~~~~~~~~~~iE~~~N~~i 390 (513) -..+++..+.++ ..+.+ +..-..++|..+.||...++|.. +.+++.+. ..+.--..-+.-++.+||.++|+.| T Consensus 259 ~~g~~~~~l~~~--~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~L 336 (454) T protein:vir:93 259 SNGAKYNPTTFS--PVDSQTVEQLKMTAEIVCSVFRVPAYKIGVGQPPSSDNVEALEQQYYSQCLQTLIESIELLLDEAL 336 (454) T ss_pred cCCceEEEcccC--hhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 455555555553 22223 33445578999999999888632 23333332 2233333344469999999999988 Q ss_pred hhcccceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCccccccc-c Q lcl|NC_015263. 391 LLNGMSKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFN-T 469 (513) Q Consensus 391 ~~~~~~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T-~ 469 (513) -.. .+..|+|.+-+....+.++.++.+.++.+-|.=..--+-..+|+.|.+ |-|..++|...-.. . T Consensus 337 ~~~-~~~~~~f~~~~ll~~D~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~------------ggD~~~~~~~~~~~~~ 403 (454) T protein:vir:93 337 ETG-ENESTEFDVTTLLRMDSERRMKTLGDAVKNTLLTPNEARKRENLPPLA------------GGDALYLQQQNYSLEA 403 (454) T ss_pred cCC-CCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC------------CCCeeeeccCccchHh Confidence 543 234577777666667888888888888777743222233345666642 22344444321100 0 Q ss_pred cccccccCCccccCCCCcCCCCcccccccCCCCCCCCCCccCCC Q lcl|NC_015263. 470 SGSDIAENAIKEKGKENGRPTNETTGNKDSDETQRAKDKPANTQ 513 (513) Q Consensus 470 Sg~~~~~~~~~~~~~~~grPt~et~~n~~~~~~~~~~d~~~~~~ 513 (513) .++ ...........|.|..+.......+++..++...+++. T Consensus 404 ~~~---~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~e~~~d~~ 444 (454) T protein:vir:93 404 LSR---RDAREDPFASSGKTASVPQAVAASDGNKAITETEHDAV 444 (454) T ss_pred hhc---cCcccCCCCCCccCCCCCCCCCCCCCCCCccCCccchh Confidence 111 11111111112223222222222333333333333333 No 8 >protein:vir:63755 Length: 547 # NCBI annotation: gp14 # Family: family:all:2446 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547619;genbank:GeneID:3783506 Probab=98.98 E-value=8e-09 Score=64.97 Aligned_cols=450 Identities=10% Similarity=0.056 Sum_probs=196.3 Q ss_pred CCCccchheeee---------ehhhhhhHHHHHHH---HHHHHHhhccCccccccc-----ccccchHHHHHHHhhhccC Q lcl|NC_015263. 1 MVKNKKKRLSMI---------DVESISSYSNKRNN---RISILRDDNRTPVFGAPV-----GSLTSSQSKVRKIVKEYRN 63 (513) Q Consensus 1 ~~~~~~~~~~~~---------~~~~~~~~~~~~~~---~~~i~~~~~~~~~~~s~~-----~s~~~s~d~~k~~i~~~~P 63 (513) |---++-|+-++ +...--+.+ +.++ +..+-+|-..-.+ ++++ |+.+-.. + . . T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~k~~~~~~~~~-~~~~~~~~~~~~g~~~---~-----~-~ 69 (547) T protein:vir:63 1 MGLFESIRLAGVNKSDAVKHIEVDDNYSIA-IQQREQEQISKAMNNKEVAY-SQPVIGSMSANPGFKT---K-----P-S 69 (547) T ss_pred CchhhhhhhhcCCccccccccccccccchh-hhhhhHHHHHHhhcccchhh-hchhhheeeccccccc---C-----C-c Confidence 322222222222 111111111 1111 1222222211111 1211 1111100 0 1 1 Q ss_pred hhHHHHHHHHHHHHHhhcchHHHHHHHHhh-cccc-----------cceEeeccchhhhhhcchhHHHHHHHHHHhhcCh Q lcl|NC_015263. 64 EGNQKTLRKVSEDLAVQSQQYQRLLNFYAN-MPLY-----------AYSVVPFKDISTANENKLKKELATVTEFLSRLNP 131 (513) Q Consensus 64 ~~n~~~ir~~s~~lY~~sg~~~rlidy~~~-mpt~-----------dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~ 131 (513) ..+...|+++.+ .|..++++++.|+.+++ +.+| .+.|-+. +......+........+..+|+..|. T Consensus 70 ~~~~~~l~~l~~-~~~~npiv~~~I~~~a~~ia~~~~~~~~~~~~~~~~ir~k-~~~~~~~~~~~~~~~~l~~~l~~pn~ 147 (547) T protein:vir:63 70 IRNNQDLHGVLK-KFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLK-DLDKKPTSHDEATIKRIESFIEKTGV 147 (547) T ss_pred cCChhHHHHHHH-HhhcCHHHHHHHHHHHHHHhhhhhhhhhhccCCCceeEec-ccccccChhhHHHHHHHHHHHHhhCC Confidence 235667777765 56678999999998875 2222 1222211 11222223334445556677777764 Q ss_pred h---------HHHHHHHHHHHHhcceeEEEEEcCc--ceeeeecCcceeEEEEEECCeeEEE-EEeeeccCcchhccccH Q lcl|NC_015263. 132 K---------YNFSKIVKLAMTVDIFYGYVIDDKE--SVMIQQFPNDICKISSVSGGVYNYV-IDLDALVSADIVDYYPK 199 (513) Q Consensus 132 k---------~~~~~i~~~~l~~g~~~gy~i~d~~--~~~iq~lp~dyckIsg~~nG~y~~~-fD~syFd~~~~L~~~p~ 199 (513) . .++..++.+++..|..|.+.+-+.+ ...+.+||+..|++.--.+|..... +-..+...... T Consensus 148 ~~~p~~~s~~~f~~~lv~d~ll~Gn~~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~------ 221 (547) T protein:vir:63 148 DNDINRDSFSSFVKKIVRDTYMYDQVNFEKVFNRNQSMVRFVAKDPTTIFFATTADGKIPDNGNRFVQVIDQKI------ 221 (547) T ss_pred CCCCccchHHHHHHHHHHHHHhhCCEEEEEEECCCCcEEEEEEecCceeEEEECCccccccCceEEEEEcCCcE------ Confidence 3 4667788888999999998886655 4678999999999984444421000 00000000000 Q ss_pred HHHHHHHHHhhhhhccCcccccCeeecCCceEEEEecCc-----cccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhcee Q lcl|NC_015263. 200 EIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKINES-----SLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKL 274 (513) Q Consensus 200 Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~~~~-----~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~i 274 (513) =..++...-+.|+.+.. ..+|+||...+...+.-....++.. ..-..|-.. T Consensus 222 ----------------------~~~~~~~eiih~r~n~~~~~~~~~~G~Spi~~~~~~i~~~~~a~~~~--~~~f~Ng~~ 277 (547) T protein:vir:63 222 ----------------------VATFNAREMAFAVRNPRSDIYATGYGYPELEIALKQFIAHENTEAFN--DRFFSHGGT 277 (547) T ss_pred ----------------------EEEeccccEEEecccCCCCcccccccccHHHHHHHHHHHHHHHHHHH--HHHHHcCCC Confidence 01334433455554221 3458888887776665555554433 122233222 Q ss_pred eeeeeccccCCCCCccccCHHHHHHHHHHHHHhc--cccceE--EEec--ccccccccccccccchh---hhhhHHhhhh Q lcl|NC_015263. 275 LIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTV--PDNVGV--VTSP--MEIDTVSFDKDSSTDDS---VEKATKNFWD 345 (513) Q Consensus 275 i~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~L--p~gv~~--v~sP--~~~d~i~ld~~~~~~dt---v~~~~~~i~~ 345 (513) .-.-|-+ .+...++.++++.+.+.+.++. .++.+. |+.+ +++..+.+ +..+.+. .+-..+.|-. T Consensus 278 p~giL~~-----~~~~~ls~e~~~~lk~~~~~~~~G~~nagk~~vl~~~g~~~~~l~~--~~~d~qfle~~~~~~~~Ia~ 350 (547) T protein:vir:63 278 TRGILQI-----KAAQQQSQHALEIFKREWKNSLSGINGSWQIPVVSAEDVKFVNMTP--SARDMEFEKWLNYLINVISA 350 (547) T ss_pred cceEEEe-----cCCCCCCHHHHHHHHHHHHHHhcCcccccccccccCCCceEEEcCC--ChhHHHHHHHHHHHHHHHHH Confidence 2222222 1233478888878877777765 233332 3322 33333333 2233333 4445578999 Q ss_pred hhhhhhhhccCCCc-----------chHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccceEEEEEecCCCCccHHH Q lcl|NC_015263. 346 NAGVSQILFSSDNK-----------TSQGIAMSIA-TDEQFIFGVINQLERWLNRYLLLNGMSKYFKATMLEVTHFSKKE 413 (513) Q Consensus 346 ~~GiS~~Lfn~d~~-----------s~~~~~~SI~-~d~~~~~~~~~~iE~~~N~~i~~~~~~~~f~~~~l~~T~fn~ke 413 (513) +.||...++|-... +.+.+..-.. .-..-+.-++.+||..+|+.|.... +..++|.|-+...-++.+ T Consensus 351 afgVPP~~lG~~~~~~~~~~~~~s~t~sn~e~~~~~~~~~tL~P~~~~ie~~ln~~L~~~~-~~~~~~~f~~~~~~~~~~ 429 (547) T protein:vir:63 351 LYGIDPAEINIPNNGGATGSKGGSLNEGNSAEKNQASKNKGLQPLLGFIEDFINKHIVAEF-GDKYTFQFVGGDIKSELE 429 (547) T ss_pred HhCCCHHHcCcccccccccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc-CCceEEEeeccccccHHH Confidence 99999988863211 1222222222 2233344689999999999886432 234566665555444444 Q ss_pred HHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCcc------------c Q lcl|NC_015263. 414 AHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIK------------E 481 (513) Q Consensus 414 ~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~------------~ 481 (513) .... .++..-| ++++.|+-.++-++-.+=+-|..+.|+...........++..+. . T Consensus 430 ~~~~-~~~~~~g-----------~lT~NE~R~~~gl~P~~egGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 497 (547) T protein:vir:63 430 SVKI-LAEKAKV-----------AMTVNEVRKELNLPGDVIGGDIPLNGVIVQRIGQLMQQEQFEHEKQQSNLQMLQEQT 497 (547) T ss_pred HHHH-HHHHhCC-----------CcCHHHHHHHhCCCCCCCCCceeecccccccccccccccCCccccchhhcccccccc Confidence 3332 2222222 45555555444333101123444444322221100000000000 0 Q ss_pred cCCCCcCCCCcccccc-----------------cCCCCCCCCCCccCCC Q lcl|NC_015263. 482 KGKENGRPTNETTGNK-----------------DSDETQRAKDKPANTQ 513 (513) Q Consensus 482 ~~~~~grPt~et~~n~-----------------~~~~~~~~~d~~~~~~ 513 (513) +.++++.|..+..++. ..++.....+-|.+.| T Consensus 498 ~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~ 546 (547) T protein:vir:63 498 GNRVSTDVEDIPDGKDTTGDIGKDGQRKDKDNANAGKQGMKGDKPNDWQ 546 (547) T ss_pred CCCCCCCCCCCCCCcccCCCcCccccccCccccchhhhhcCCCCccccC Confidence 0011111110000000 0000000001111111 No 9 >protein:vir:80644 Length: 551 # NCBI annotation: gp23 # Family: family:all:2446 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468463;genbank:gi:157325038;genbank:GeneID:5601615 Probab=98.97 E-value=8.8e-09 Score=64.75 Aligned_cols=444 Identities=10% Similarity=0.064 Sum_probs=195.5 Q ss_pred CCCccchheeeeehhhhhhHHHHHHHHHHHHHhhccCccccccc-----ccccchHHHHHHHhhhccChhHHHHHHHHHH Q lcl|NC_015263. 1 MVKNKKKRLSMIDVESISSYSNKRNNRISILRDDNRTPVFGAPV-----GSLTSSQSKVRKIVKEYRNEGNQKTLRKVSE 75 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~-----~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~ 75 (513) +--++.++++ |+..-++--+ ++|.... .++-.++++ |+-+.. .+- .++|. ..|+++.+ T Consensus 23 ~~~~~~~~~~-~~~~~~~~~~----~~k~~~~---~~~a~~~~~~~~~~~~~~~~---~r~---~~~~~---~~l~~~~~ 85 (551) T protein:vir:80 23 KHIEVDDNYS-IAIQQREQEQ----ISKAMNN---KEVAYSQPVIGSMSANPGFK---TKP---SIRNN---QDLHGVLK 85 (551) T ss_pred ccccccccee-eecccccHHH----HHHhhcc---CcceeecccccceecCcccc---cCc---cccCh---hHHHHHHH Confidence 1112222332 2222222111 2222211 222111221 221100 010 11233 33455544 Q ss_pred HHHhhcchHHHHHHHHhhc-ccc-----------cceEeeccchhhhhhcchhHHHHHHHHHHhhcChh---------HH Q lcl|NC_015263. 76 DLAVQSQQYQRLLNFYANM-PLY-----------AYSVVPFKDISTANENKLKKELATVTEFLSRLNPK---------YN 134 (513) Q Consensus 76 ~lY~~sg~~~rlidy~~~m-pt~-----------dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k---------~~ 134 (513) .|..|+++++.|+.+++. .+| .+.|.+.. ............+..+..+|+..|.. .+ T Consensus 86 -~~~~npiv~~~I~~ia~~IA~~~~~~~~~~~g~~~~i~~kd-~~~~~~~~~~~~~~~i~~~l~~pn~~~~p~~~s~~~f 163 (551) T protein:vir:80 86 -KFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKD-LDKKPTSHDEATIKRIESFIEKTGVDNDINRDSFSSF 163 (551) T ss_pred -HhhcCHHHHHHHHHHHHHHhhhhhhhhhhcCCCCceEEecc-cCcccChhHHHHHHHHHHHHHhcCCCCCCccchHHHH Confidence 566789999999988752 221 22232221 11222334455666677788887753 46 Q ss_pred HHHHHHHHHHhcceeEEEEEcCc--ceeeeecCcceeEEEEEECCeeEE-EEEeeeccCcchhccccHHHHHHHHHHhhh Q lcl|NC_015263. 135 FSKIVKLAMTVDIFYGYVIDDKE--SVMIQQFPNDICKISSVSGGVYNY-VIDLDALVSADIVDYYPKEIQEAVNKYTTM 211 (513) Q Consensus 135 ~~~i~~~~l~~g~~~gy~i~d~~--~~~iq~lp~dyckIsg~~nG~y~~-~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~ 211 (513) +..++.+++..|..|.+.+-+.+ ..-+.+||+..|++.--.+|.... .+-..+...... T Consensus 164 ~~~lv~dlll~Gnay~~i~rd~~G~~~~L~~l~p~~V~v~~~~~g~~~~~~~~y~~~~~g~~------------------ 225 (551) T protein:vir:80 164 VKKIVRDTYMYDQVNFEKVFNRNQSMVRFVAKDPTTIFFATTADGKIPDNGNRFVQVIDQKI------------------ 225 (551) T ss_pred HHHHHHHHHhcCCEEEEEEECCCCcEEEEEEeCCceeEEEECCccccccCceEEEEEeCCcE------------------ Confidence 66777888899999988886554 467899999999998544442100 000000000000 Q ss_pred hhccCcccccCeeecCCceEEEEecC-----ccccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCC Q lcl|NC_015263. 212 KKGNNKSASNWYEIQDKNSICIKINE-----SSLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSND 286 (513) Q Consensus 212 k~~~~~~~~~W~~L~~~kt~~ik~~~-----~~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~ 286 (513) =..++.+.-+.|+.+. ...+|+||...+...+.-.....+.. ..-..|-...-.-|-+ T Consensus 226 ----------~~~~~~~eiiH~~~n~~~~~~~~~~G~spi~~a~~~i~~~~a~~~~~--~~~f~Ng~~p~giL~~----- 288 (551) T protein:vir:80 226 ----------VATFNAREMAFAVRNPRSDIYATGYGYPELEIALKQFIAHENTEAFN--DRFFSHGGTTRGILQI----- 288 (551) T ss_pred ----------EEEEcccceEEecccCCCCcccccccccHHHHHHHHHHHHHHHHHHH--HHHHHcCCCcceEEEE----- Confidence 0123333344455321 13458888877766665544444432 1223332222111222 Q ss_pred CCccccCHHHHHHHHHHHHHhc--cccce--EEEe--cccccccccccccccch---hhhhhHHhhhhhhhhhhhhccCC Q lcl|NC_015263. 287 NNDFTLDMPMMNYFHEALSMTV--PDNVG--VVTS--PMEIDTVSFDKDSSTDD---SVEKATKNFWDNAGVSQILFSSD 357 (513) Q Consensus 287 ~~~~~vd~~~~~~~~~~ik~~L--p~gv~--~v~s--P~~~d~i~ld~~~~~~d---tv~~~~~~i~~~~GiS~~Lfn~d 357 (513) ++...++.++++.+.+.+.++. +++-+ .|+. .+++..+.++ ..+.+ +.+-..+.|-.+.||...++|-. T Consensus 289 ~~~~~lt~e~~~~lk~~~~~~~~G~~nag~~~vl~~~g~~~~~l~~~--~~D~qfle~~~~~~~~Ia~aFgVPp~~lG~~ 366 (551) T protein:vir:80 289 KAAQQQSQHALEIFKREWKNSLSGINGSWQIPVVSAEDVKFVNMTPS--ARDMEFEKWLNYLINVISALYGIDPAEINIP 366 (551) T ss_pred cCCCCCCHHHHHHHHHHHHHHhcCccccCccccccCCCceEEEccCC--hhHHHHHHHHHHHHHHHHHHhcCCHHHcCcc Confidence 1234477888888888777766 23333 2332 2343343332 22323 34445678999999998888621 Q ss_pred Cc-----------chHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhhcccceEEEEEecCCCCccHHHHHHHHHHHHhcC Q lcl|NC_015263. 358 NK-----------TSQGIAM-SIATDEQFIFGVINQLERWLNRYLLLNGMSKYFKATMLEVTHFSKKEAHDRYITDAQYG 425 (513) Q Consensus 358 ~~-----------s~~~~~~-SI~~d~~~~~~~~~~iE~~~N~~i~~~~~~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G 425 (513) .. +.+.+.. ...--..-+.-++.+||..||+.|.... +..+.|.|.....-++.+....+ ++..-| T Consensus 367 ~~~~~~~~~~~s~t~sn~e~~~~~f~~~tL~P~~~~ie~~ln~~L~~~~-~~~~~f~f~~~~~~~~~~~~~~~-~~~~~g 444 (551) T protein:vir:80 367 NNGGATGSKGGSLNEGNSAEKNQASKNKGLQPLLGFIEDFINKHIVAEF-GDKYTFQFVGGDIKSELESVKIL-AEKAKV 444 (551) T ss_pred cccccccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc-CCceEEEeeccChhhHHHHHHHH-HHHhcC Confidence 11 1122222 2222333344689999999999886432 23466777666655655554422 222223 Q ss_pred CcHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccc--------cC----CCC----cCC Q lcl|NC_015263. 426 FPVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKE--------KG----KEN----GRP 489 (513) Q Consensus 426 ~~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~--------~~----~~~----grP 489 (513) +++|-|+-.++-++--+=+-|..+.|+.+...+-....++..... .. +++ ..| T Consensus 445 -----------~lT~NE~R~~~gl~P~~egGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p 513 (551) T protein:vir:80 445 -----------AMTVNEVRKELNLPGDVIGGDIPLNGVIVQRIGQLMQQEQFEHEKQQSNLQMLQEQTGNRVSTDVEDIP 513 (551) T ss_pred -----------CcCHHHHHHHhCCCCCCCCCceeecccccccccccccccCcchhhhhhccccccCcCCCCCCCCCCCCC Confidence 344444444333321011334455554333221111000000000 00 000 001 Q ss_pred CCc-ccccccCCCCCC------------CCCCccCCC Q lcl|NC_015263. 490 TNE-TTGNKDSDETQR------------AKDKPANTQ 513 (513) Q Consensus 490 t~e-t~~n~~~~~~~~------------~~d~~~~~~ 513 (513) +.. +.+..+.++... .-+-|.+.| T Consensus 514 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 550 (551) T protein:vir:80 514 DGKDTTGDIGKDGQRKDKDNANAGKQGMKGDKPNDWQ 550 (551) T ss_pred CccccCCCccccccccCccccchhhhhcCCCCccccC Confidence 000 000000000000 001111111 No 10 >protein:vir:100150 Length: 437 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945033;genbank:gi:38707893;genbank:GeneID:2744197 Probab=98.97 E-value=8e-09 Score=64.97 Aligned_cols=416 Identities=10% Similarity=0.076 Sum_probs=189.0 Q ss_pred CCCccchheeeeehhhhhhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhh Q lcl|NC_015263. 1 MVKNKKKRLSMIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQ 80 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~ 80 (513) |-|-|.||++-+.-. +.+... .|. |. +..... +.+--+ +. ..-..++.--+-. T Consensus 1 ~~~~~~~~~~~~~~~----~~~~~g-----------~~~------s~-~~~~~~-~~~~~~-~~---~~g~~v~~~~al~ 53 (437) T protein:vir:10 1 MKQGKQRALGRIKSS----FLKWLG-----------VPI------SL-TDGSFW-SAWGGM-GS---SSGETVTADSALQ 53 (437) T ss_pred CCcchhhhhhhhHHh----hhhhcC-----------Ccc------cC-CchhHH-Hhhccc-cc---CCCceechHhhhc Confidence 877777887654321 111110 011 00 000000 000000 00 0000111111224 Q ss_pred cchHHHHHHHHhh-cccccceEeeccchhhhhhcchhHHHHHHHHHHh-hc----ChhHHHHHHHHHHHHhcceeEEEEE Q lcl|NC_015263. 81 SQQYQRLLNFYAN-MPLYAYSVVPFKDISTANENKLKKELATVTEFLS-RL----NPKYNFSKIVKLAMTVDIFYGYVID 154 (513) Q Consensus 81 sg~~~rlidy~~~-mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~-k~----n~k~~~~~i~~~~l~~g~~~gy~i~ 154 (513) ++.+.+.|+.++. +..+...++ ....+-+.....=+.+...|. += .-..+...++..++..|..|.+++. T Consensus 54 ~~~v~~ci~~Ia~~ia~lp~~~~----~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r 129 (437) T protein:vir:10 54 LSAVWSCVRLIAETIATLPLNLY----QTKPDGTRVLAKQHRLYTVIHSQPNAENTAAEFWEVIVASMLLWGNGYARKLR 129 (437) T ss_pred cHHHHHHHHHHHHHHhhCceeEE----EEcCCCceeeccccHHHHHhhccCCcCCCHHHHHHHHHHHHhhcCCeEEEEEe Confidence 4556666666653 233333332 111111111111122333343 22 3456777888889999999999876 Q ss_pred c-CcceeeeecCcceeEEEEEECCeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEE Q lcl|NC_015263. 155 D-KESVMIQQFPNDICKISSVSGGVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICI 233 (513) Q Consensus 155 d-~~~~~iq~lp~dyckIsg~~nG~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~i 233 (513) + +...-+.++|++.|.|.-..+|.+.|.+-.. + .....++++--+.| T Consensus 130 ~~g~~~~L~~l~p~~v~i~~~~~g~~~y~~~~~---~-----------------------------g~~~~~~~~dIih~ 177 (437) T protein:vir:10 130 SAGVLIGLELMLPQRTTVKRLTSGALQYTYRNV---D-----------------------------GTVSTLAEDDVFHV 177 (437) T ss_pred cCCcEEEEEEEcCcceEEEECCCCeEEEEEEec---C-----------------------------ceEEEEccccEEEe Confidence 5 4446789999999999877778766643110 0 12234555444555 Q ss_pred Ee-cCccccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhcc--c Q lcl|NC_015263. 234 KI-NESSLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTVP--D 310 (513) Q Consensus 234 k~-~~~~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~Lp--~ 310 (513) +. ..+..+|++|...+...+--....++. ...-..|-...-.-|.+ . -.++.++++++.+.+.++.- . T Consensus 178 r~~~~d~~~G~spi~~~~~~i~~~~~~~~~--~~~~f~ng~~p~gil~~---~----~~l~~e~~~~~~~~~~~~~~g~~ 248 (437) T protein:vir:10 178 RGFSLDGLMGLTPIQYAREVLGNSTAANKT--SASVFRNGLRPSGVLST---D----QILQKEKRAEIRTDLAEQFGGAM 248 (437) T ss_pred cCcCCCCcccccHHHHHHHHHHHHHHHHHH--HHHHHhccCCccEEEEc---C----CCCCHHHHHHHHHHHHHHhcCcc Confidence 52 344567888876555443322222221 11122221111111111 1 13677777777777776651 2 Q ss_pred ---cceEEEeccccccccccccc-ccchhhhhhHHhhhhhhhhhhhhccCCCcc----hHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015263. 311 ---NVGVVTSPMEIDTVSFDKDS-STDDSVEKATKNFWDNAGVSQILFSSDNKT----SQGIAMSIATDEQFIFGVINQL 382 (513) Q Consensus 311 ---gv~~v~sP~~~d~i~ld~~~-~~~dtv~~~~~~i~~~~GiS~~Lfn~d~~s----~~~~~~SI~~d~~~~~~~~~~i 382 (513) ++.++-..+++..+.+.... .--++..-..++|..+.||...++|....+ ++.-...+.--..-+.-++.+| T Consensus 249 nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~e~~~~~f~~~tl~P~~~~i 328 (437) T protein:vir:10 249 QAGKTMVLEAGMKYQAITMNPGDVQLLETRAFNIEEICRWYRVPPFMVGHSEKSTSWGTGIEQQTLGFLTFTLRPWLTRI 328 (437) T ss_pred ccCcceeccCCceEEeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccchHHHHHHHHHHHHHHHHHHHH Confidence 23344344555555443221 112334455688999999999998643221 2222333333333344689999 Q ss_pred HHHHHHHHhh--cccceEEEEEecCCCCccHHHHHHHHHHHHhcCC-cHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccc Q lcl|NC_015263. 383 ERWLNRYLLL--NGMSKYFKATMLEVTHFSKKEAHDRYITDAQYGF-PVKVYLASLMGIDPVAFTGLLKVENEMLDLPEI 459 (513) Q Consensus 383 E~~~N~~i~~--~~~~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~-~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~ 459 (513) |..+|+.|-. ...+..|+|.+-....-+.++..+.+.++..-|. .+=..- +.+|+.|.+ |-++. T Consensus 329 e~~l~~kll~~~e~~~~~~~fd~~~ll~~d~~~r~~~~~~~~~~G~~T~NE~R-~~~gl~pi~------------gg~~~ 395 (437) T protein:vir:10 329 EQAARRSLLRPGERDQFYAEFSVEGLLRADSAGRAAFYSTMTQNGLMTRDECR-AKENLPPMG------------GNAAV 395 (437) T ss_pred HHHHHhhccCccccCceEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHH-HHhCCCCCC------------CCcce Confidence 9999997743 3334456666666656677888888777766663 322222 234555533 12222 Q ss_pred cCcccccccccccccccCCccccCCCCcCCCCcccccccCCCCCCCCCCcc Q lcl|NC_015263. 460 MTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETTGNKDSDETQRAKDKPA 510 (513) Q Consensus 460 ~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~~n~~~~~~~~~~d~~~ 510 (513) +.+...-..+ ...++..+.. +.+.+ ....+.+..+...+..+ T Consensus 396 ~~~~~~~~~~--~~~~~~~~~~-~~~~~------~~~~~~~~~~~~~~~e~ 437 (437) T protein:vir:10 396 LTVQSALLPI--DKLGEHTTAT-AAQDA------LKAWLYQEEKTRATQER 437 (437) T ss_pred EeecCcccch--hhccCcCCCc-chhcc------ccccCCCCCCCCccccC Confidence 2221111111 1111111100 00000 00011111111111111 No 11 >protein:vir:102727 Length: 945 # NCBI annotation: portal protein # Family: family:all:2446 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874016;genbank:gi:118197623;genbank:GeneID:4495919 Probab=98.93 E-value=1.3e-08 Score=63.85 Aligned_cols=444 Identities=11% Similarity=0.043 Sum_probs=196.8 Q ss_pred CCCccchheeeeehhhhhhHHHHHHHHHHHHHhhccCccc------ccccccc-cchHHHHHHHhhhccChhHHHHHHHH Q lcl|NC_015263. 1 MVKNKKKRLSMIDVESISSYSNKRNNRISILRDDNRTPVF------GAPVGSL-TSSQSKVRKIVKEYRNEGNQKTLRKV 73 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~------~s~~~s~-~~s~d~~k~~i~~~~P~~n~~~ir~~ 73 (513) +.-|..--.|.| =|+-++.+.-+-+ -+|-. -..+|-. ..+...+- .+++ |..+. ..++ T Consensus 52 ~~~~~~~~~~~~---------~~~~~~~~kk~~i-~~pfkkk~~~~~~d~f~~s~es~s~vt-sls~--pdaf~--~vnV 116 (945) T protein:vir:10 52 LAWNSTVVYSII---------IFRKNQVLKKEKI-IVPYNHQEPPFKFNLFEYSPESLMYLP-SISD--PDAFF--LINL 116 (945) T ss_pred hhccceeeeeee---------eehhhhHHHhhcc-cccccccccchhhhhhhccCccceecc-cccC--cccee--eehh Confidence 111110001111 0111111111111 11100 0000000 00000000 0121 33221 1223 Q ss_pred HHHHHhhcchHHHHHHHHhh-cccccceEeec---cchhhhhhcchhHHHHHHHHHHhhcChh--------HHHHHHHHH Q lcl|NC_015263. 74 SEDLAVQSQQYQRLLNFYAN-MPLYAYSVVPF---KDISTANENKLKKELATVTEFLSRLNPK--------YNFSKIVKL 141 (513) Q Consensus 74 s~~lY~~sg~~~rlidy~~~-mpt~dY~I~P~---~~~~~~~~~~~~~~y~~v~~~L~k~n~k--------~~~~~i~~~ 141 (513) .+-....+..+.+.|+.++. +..+.-.++-- +......+ ..+.-+-+..+|++-|.. .+...++.+ T Consensus 117 s~~~AlknsaV~scI~~IA~sIAsLPlklYrr~edG~~~~~~k--k~~~~hpL~~LL~rPNp~mT~~eFwqsFl~~Lv~d 194 (945) T protein:vir:10 117 FRKYRFNNDSKLIKVSEIPKKLTSKELEIYKHIEDKHVNYYLK--RIRDARNILEFLERPDPYFSEVNSWEYLLGMVLDD 194 (945) T ss_pred hhhhhhccHHHHHHHHHHHhhhccCceEEEEecccCccccccc--ccccchHHHHHHhCCCcccChhHHHHHHHHHHHHH Confidence 33344456677777777643 33333333210 01111101 112233445566655532 244556788 Q ss_pred HHHhcceeEEEEEcCcc--eeeeecCcceeEEEEEECCeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCccc Q lcl|NC_015263. 142 AMTVDIFYGYVIDDKES--VMIQQFPNDICKISSVSGGVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSA 219 (513) Q Consensus 142 ~l~~g~~~gy~i~d~~~--~~iq~lp~dyckIsg~~nG~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~ 219 (513) ++..|..|.+.+-+.+| +-+.++|++.|+|.--.+|...+.+-.. .+.... T Consensus 195 LLL~GNAYieIiRd~~G~ii~L~pLdPs~Vti~~ddDG~~~y~Yv~~--idG~~~------------------------- 247 (945) T protein:vir:10 195 ILTIDRGAIVKIRDEQGNLVAITPVDGTTIKPILSEDTGIVVGYVQE--VDGAIV------------------------- 247 (945) T ss_pred HhhcCCeEEEEEECCCCcEEEEEEECCcceEEEEcCCCcEEEEEEEe--cCCceE------------------------- Confidence 89999999998865444 6789999999999876777554432111 011110 Q ss_pred ccCeeecCCceEEEEe--cCc---cccchhhHHHHHHhHHHHHHHHHHH-hhHhhhhhc---eeeeeeeccccCCCCCcc Q lcl|NC_015263. 220 SNWYEIQDKNSICIKI--NES---SLTPVPPFAGTFDSIYDIHSFKDLR-NDKAELQNY---KLLIQKLETRSSNDNNDF 290 (513) Q Consensus 220 ~~W~~L~~~kt~~ik~--~~~---~~~~ip~f~~v~~d~~di~~~kdL~-~~~~~i~n~---~ii~~kip~~~~n~~~~~ 290 (513) ..++...-+-+.- ..+ ...|+||...+...+-......+.- +.-. +|- ..+.+.=.-...+.+..- T Consensus 248 ---~~v~a~DvIlhirn~s~DG~~~GyGlSPIeaa~~aI~~alAaek~aar~Fs--kNGa~PsGILsvkg~~~~d~k~~~ 322 (945) T protein:vir:10 248 ---AHFDKRDVVLFRQNLTPDVYMYGYSLPPIEILYKVILSDIFIDKGNLDYYR--KGGSIPEGILAIEPPSYKEGDIYP 322 (945) T ss_pred ---EEecCCceEEEeccCCCCcccccCCchHHHHHHHHHHHHHHHHHHHHHHHH--hCCCccceEEEecCcccccccccc Confidence 1122211122221 112 1237788877766554444443322 2111 121 112211111112344556 Q ss_pred ccCHHHHHHHHHHHHHhcc---ccceEEEecccccccccccccccch---hhhhhHHhhhhhhhhhhhhccC-CCcchHH Q lcl|NC_015263. 291 TLDMPMMNYFHEALSMTVP---DNVGVVTSPMEIDTVSFDKDSSTDD---SVEKATKNFWDNAGVSQILFSS-DNKTSQG 363 (513) Q Consensus 291 ~vd~~~~~~~~~~ik~~Lp---~gv~~v~sP~~~d~i~ld~~~~~~d---tv~~~~~~i~~~~GiS~~Lfn~-d~~s~~~ 363 (513) .++.++++++.+..+++.- +|. .++.+-.++-.++..+..+.+ +.+-..++|..+.||...++|- ++.+++. T Consensus 323 ~LseEq~erlKe~wee~~sG~NnG~-piVLdeGmef~pLs~s~~DaQfLEsrkfs~eeIArAFGVPP~lLG~~e~st~SN 401 (945) T protein:vir:10 323 QLSREQLESIQRQLQAIMMGDYTQV-PILSGGKFTWIDFKGKRRDMQFKELAEFVARKICAVYQVSPQDVGILEGSNKAT 401 (945) T ss_pred ccCHHHHHHHHHHHHHHhCCccccc-ceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcccCCCCCcch Confidence 7888888888888887761 222 222332333333332222223 3444557899999999888863 2233333 Q ss_pred HHH-HHHHHHHHHHHHHHHHHHHHHHHHhhcccceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHH Q lcl|NC_015263. 364 IAM-SIATDEQFIFGVINQLERWLNRYLLLNGMSKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVA 442 (513) Q Consensus 364 ~~~-SI~~d~~~~~~~~~~iE~~~N~~i~~~~~~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~ 442 (513) +.. .+.--..-+..++++||..+|+.|.....+..++|.|-.....+.++.++.+.++.+-|.=..--+-+.+|+.|.+ T Consensus 402 iEqq~~~Fv~~tL~Pil~~IEqeLNrkLl~~~eg~~i~fdFd~ldl~D~ksraEal~kli~sGiLTiNEvRe~lGLpPIe 481 (945) T protein:vir:10 402 AEVMASLTKAKGLEPLMATISKGFDEVVSEFRNEKDIKLWFKEDDLEKERDWWNIIQGQLNTGFRSINEARMEKGLEPVP 481 (945) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhccccccCceeEEEecchhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Confidence 333 3333334455799999999999986554566688888777777888888888887777743333333345666653 Q ss_pred HHHHHHHHHHhhCcccccCcccc---cccc-cccccccCCccccCCCCcCCCCcccccccCCCCCCCCCCccCCC Q lcl|NC_015263. 443 FTGLLKVENEMLDLPEIMTPLSS---SFNT-SGSDIAENAIKEKGKENGRPTNETTGNKDSDETQRAKDKPANTQ 513 (513) Q Consensus 443 ~~~~~~~E~e~L~l~~~~~Pl~T---S~T~-Sg~~~~~~~~~~~~~~~grPt~et~~n~~~~~~~~~~d~~~~~~ 513 (513) |-|..+.|..- ..+. .+..+ ..++......+++|+.+..+ .....+.|..++ T Consensus 482 ------------GGD~lli~~nn~~P~d~~~ka~~g-a~p~q~aq~~~dqp~~kGGe------~dEns~~psE~k 537 (945) T protein:vir:10 482 ------------WGDVPFSGLRNWKPEDEQAKAQQG-AMPPQLAQAMADQPSQQGGG------VDENSSVPSEQK 537 (945) T ss_pred ------------CcceeeeccccccccccccccccC-CCCcccccCCCCCCCCCCCC------CCCCCCCCCccc Confidence 22333333210 0000 01110 11111111222333322111 111112222222 No 12 >protein:vir:105064 Length: 421 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006584;genbank:gi:46402090;genbank:GeneID:2777930 Probab=98.89 E-value=1.9e-08 Score=62.97 Aligned_cols=389 Identities=12% Similarity=0.109 Sum_probs=187.7 Q ss_pred ccccccccccc----hHHHHHHHhhhccChhHHHHHHHHHHHHHhhcchHHHHHHHHhh-cccccceEeeccchhhhhhc Q lcl|NC_015263. 39 VFGAPVGSLTS----SQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQQYQRLLNFYAN-MPLYAYSVVPFKDISTANEN 113 (513) Q Consensus 39 ~~~s~~~s~~~----s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~~~rlidy~~~-mpt~dY~I~P~~~~~~~~~~ 113 (513) +|.+..|.... ..+....++.-. +......=..+..-..-..+.+.+.|++++. +..+.-.++ ...++.+ T Consensus 1 m~~~~~~~~~~~~~s~~~~w~~~~~~~-~~~~~~~g~~vt~~~al~~~~v~~~i~~Ia~~iA~lp~~~~----~~~~~g~ 75 (421) T protein:vir:10 1 MFIPQMFEGKKRSVSGGGFWEAMLGGV-RSSHSKAGVMITPETALALSAVRACVTLLAESVAQLPVELY----RRDKNGG 75 (421) T ss_pred CCCcchhcccccccCcchhhHHHhhhh-ccCcccCCceechHHhhccHHHHHHHHHHHHhhccCceEEE----EEcCCCc Confidence 55555553111 111111111100 0000000001111223355667777777753 333333332 1111111 Q ss_pred c-hhHHHHHHHHHHh-h----cChhHHHHHHHHHHHHhcceeEEEEEcCcc--eeeeecCcceeEEEEEECCeeEEEEEe Q lcl|NC_015263. 114 K-LKKELATVTEFLS-R----LNPKYNFSKIVKLAMTVDIFYGYVIDDKES--VMIQQFPNDICKISSVSGGVYNYVIDL 185 (513) Q Consensus 114 ~-~~~~y~~v~~~L~-k----~n~k~~~~~i~~~~l~~g~~~gy~i~d~~~--~~iq~lp~dyckIsg~~nG~y~~~fD~ 185 (513) . ...+. .+...|. + +.-..+...++..++..|..|.+++.+.++ .-+.++|++.|.|.-..+|...|.+. T Consensus 76 ~~~~~~~-~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~l~~~~v~v~~~~~g~~~y~~~- 153 (421) T protein:vir:10 76 RQRATDH-PIYDLIHSQPNKKDTSFEYFEQQQGLLGLEGNCYSIIDRDGKGYPKELIPINPKKVIVLKGPDGMPYYEIP- 153 (421) T ss_pred eeecccc-hHHHHHhhcccCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEecCceEEEEECCCceEEEEEc- Confidence 0 11111 2333343 2 335666778888999999999999977554 67889999999998667787666542 Q ss_pred eeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEEe-cCccccchhhHHHHHHhHHHHHHHHHHHh Q lcl|NC_015263. 186 DALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKI-NESSLTPVPPFAGTFDSIYDIHSFKDLRN 264 (513) Q Consensus 186 syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~-~~~~~~~ip~f~~v~~d~~di~~~kdL~~ 264 (513) .... .+| .+--+.++. ..+.++|++|...+...+--....++.. T Consensus 154 ---~~g~---~~~----------------------------~~eiih~~~~~~d~~~G~spi~~~~~~i~~~~~~~~~~- 198 (421) T protein:vir:10 154 ---EIGE---TLP----------------------------MRMMHHVKVFSLDGYIGSSPIQTNADVLGLNLAVEEHA- 198 (421) T ss_pred ---CCCc---EEc----------------------------hhhEEEecCcCCCCcccccHHHHHHHHHHHHHHHHHHH- Confidence 1111 122 222233332 2344568888775554443333333222 Q ss_pred hHhhhhh---ceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhcc--c---cceEEEecccccccccccccccch-- Q lcl|NC_015263. 265 DKAELQN---YKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTVP--D---NVGVVTSPMEIDTVSFDKDSSTDD-- 334 (513) Q Consensus 265 ~~~~i~n---~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~Lp--~---gv~~v~sP~~~d~i~ld~~~~~~d-- 334 (513) ..-..| -..++. +| .+..-..+.++++++.+.+++..- + ++..+-..+++..+.+. ..+.+ T Consensus 199 -~~~f~ng~~~~gil~-~~-----~~~~~~~~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~--~~d~q~~ 269 (421) T protein:vir:10 199 -SAVFRRGATMSGVIE-RP-----KEAPAIKSQEKIDQLLAKWTDRYSGINNMFSVALLQEGMSYKQMSQD--NEKAQLL 269 (421) T ss_pred -HHHHhcCCCccEEEE-ec-----CccCccCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEecCCC--hhHHHHH Confidence 111222 112222 33 222234677777777777777651 1 33444444555555443 22223 Q ss_pred -hhhhhHHhhhhhhhhhhhhccC-CCcchHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhh--cccceEEEEEecCCCCc Q lcl|NC_015263. 335 -SVEKATKNFWDNAGVSQILFSS-DNKTSQGI-AMSIATDEQFIFGVINQLERWLNRYLLL--NGMSKYFKATMLEVTHF 409 (513) Q Consensus 335 -tv~~~~~~i~~~~GiS~~Lfn~-d~~s~~~~-~~SI~~d~~~~~~~~~~iE~~~N~~i~~--~~~~~~f~~~~l~~T~f 409 (513) +.+-..++|..+.||...++|. ++.+++.+ ...+.--..-+.-++.+||..+|+.|-. ...+..|+|..-..... T Consensus 270 e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~f~~~tl~P~~~~ie~~ln~kL~~~~~~~~~~v~fd~~~l~~~ 349 (421) T protein:vir:10 270 QSRQWGVEEVCRLYKIPPHMVQMLAKATNNNIEHQGLQFVMYTLLAWLKRHEGALQRDLLLPSERRDLYIEFNVSGLLRG 349 (421) T ss_pred HHHHHhHHHHHHHhCCCHHHcCCCcCCccccHHHHHHHHHHHHHHHHHHHHHHHHhhhccCccccCCeEEEEechhhhcc Confidence 3344567899999999888863 22333332 2222233333346999999999997743 32344577777777677 Q ss_pred cHHHHHHHHHHHHhcCC-cHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccCCCCcC Q lcl|NC_015263. 410 SKKEAHDRYITDAQYGF-PVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGR 488 (513) Q Consensus 410 n~ke~~~~~~~~~~~G~-~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~gr 488 (513) +.++.++.+.++..-|. ..=..-+ .+|+.|.+ +-|..++|+-. + ...+..... +. T Consensus 350 d~~~~~~~~~~~~~~G~~T~NE~R~-~~gl~p~~------------ggD~~~~~~n~--~-~~~~~~~~~--------~~ 405 (421) T protein:vir:10 350 DQKSRYESYALGRQWGWLSVNDIRR-MENLPPIA------------GGDKYLTPLNM--V-DSAQIIPGD--------KK 405 (421) T ss_pred CHHHHHHHHHHHHhCCCcCHHHHHH-HhCCCCCC------------Ccceeeecccc--c-cccccccCC--------CC Confidence 88899988888776662 3333222 34555431 33455555432 1 111110111 11 Q ss_pred CCCc--ccccccCCCC Q lcl|NC_015263. 489 PTNE--TTGNKDSDET 502 (513) Q Consensus 489 Pt~e--t~~n~~~~~~ 502 (513) |+.+ ...+++...| T Consensus 406 ~~~~~~~e~d~~~~~~ 421 (421) T protein:vir:10 406 PTAQQMAEIDTILSRT 421 (421) T ss_pred cccccCcccccccccC Confidence 1111 1112222222 No 13 >protein:vir:1431 Length: 419 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536360;genbank:gi:17975165;genbank:GeneID:929165 Probab=98.87 E-value=2.3e-08 Score=62.45 Aligned_cols=387 Identities=10% Similarity=0.073 Sum_probs=185.8 Q ss_pred cccccc-hHHH------HHHHhhhccChhHHHHHHHHHHHHHhhcchHHHHHHHHhh-cccccceEeeccchhhhhhcch Q lcl|NC_015263. 44 VGSLTS-SQSK------VRKIVKEYRNEGNQKTLRKVSEDLAVQSQQYQRLLNFYAN-MPLYAYSVVPFKDISTANENKL 115 (513) Q Consensus 44 ~~s~~~-s~d~------~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~~~rlidy~~~-mpt~dY~I~P~~~~~~~~~~~~ 115 (513) ||-... .... ...|+...+-......-..++.--.-..+.+.+.|+.++. +..+...++--.. ..+ ... T Consensus 1 ~~~~r~~~~~~~~~~~~~~~~~~~~~g~~~s~~~~~vt~~~al~~~~v~~~v~~ia~~iA~lp~~~~~~~~-~~~--~~~ 77 (419) T protein:vir:14 1 MFFSRQLLSNLGQTQMSAGGWVSALLGSSRSDSGQVVTPASALALTVLQNCVTLLAESIAQLPIELYERSG-EDR--KPA 77 (419) T ss_pred CcccccccccccccccCcchhhHHhhcCCCccCCcccchHHhhccHHHHHHHHHHHHhhccCceEEEEecC-Ccc--ccc Confidence 222111 0000 0112222210000011111222223455667777877764 2333333431110 110 111 Q ss_pred hHHHHHHHHHHh-h----cChhHHHHHHHHHHHHhcceeEEEEEcCcc--eeeeecCcceeEEEEEECCeeEEEEEeeec Q lcl|NC_015263. 116 KKELATVTEFLS-R----LNPKYNFSKIVKLAMTVDIFYGYVIDDKES--VMIQQFPNDICKISSVSGGVYNYVIDLDAL 188 (513) Q Consensus 116 ~~~y~~v~~~L~-k----~n~k~~~~~i~~~~l~~g~~~gy~i~d~~~--~~iq~lp~dyckIsg~~nG~y~~~fD~syF 188 (513) .+ +.....|. + +.--.+...++..++..|..|.++.-+.++ .-+.++|+++|.|.--.+|.+.|.+.... T Consensus 78 -~~-~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~l~pl~~~~v~v~~~~~~~~~y~~~~~~- 154 (419) T protein:vir:14 78 -TD-HPLYSILKYEPNSWQTPFEYQEQSQVAVGLRGNSYSFIDRDSDGVIQGLYPLDNEAVTVMRGSDLKPVYRVRGSD- 154 (419) T ss_pred -cc-cHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCceEEEEECCCceEEEEEccCc- Confidence 11 12233333 2 234556677788889999999999876544 68999999999998667788777654221 Q ss_pred cCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEEe-cCccccchhhHHHHHHhHHHHHHHHHHHhhHh Q lcl|NC_015263. 189 VSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKI-NESSLTPVPPFAGTFDSIYDIHSFKDLRNDKA 267 (513) Q Consensus 189 d~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~-~~~~~~~ip~f~~v~~d~~di~~~kdL~~~~~ 267 (513) .+|.+ --+-++. ..+..+|++|...+...+--.....+.. .. T Consensus 155 -------~~~~~----------------------------~i~h~~~~~~dg~~G~s~i~~~~~~i~~~~~~~~~~--~~ 197 (419) T protein:vir:14 155 -------PMPQR----------------------------LVHHVRWMSINGYTGLSPVLLHANAIGHAQAIQQYA--GK 197 (419) T ss_pred -------ccchh----------------------------heeEecCcCCCCcccccHHHHHHHHHHHHHHHHHHH--HH Confidence 12211 0111221 2234678888877765543333333321 12 Q ss_pred hhhh---ceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhcc---c--cceEEEeccccccccccc-ccccchhhhh Q lcl|NC_015263. 268 ELQN---YKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTVP---D--NVGVVTSPMEIDTVSFDK-DSSTDDSVEK 338 (513) Q Consensus 268 ~i~n---~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~Lp---~--gv~~v~sP~~~d~i~ld~-~~~~~dtv~~ 338 (513) -..| ...++. +| ++.....+.++++.+.+.+++..- + ++..+-..+++..+.+.. +..--++..- T Consensus 198 ~f~ng~~p~gil~-~~-----~~~~~~~~~~~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~ 271 (419) T protein:vir:14 198 SFMNGTALSGVIE-RP-----KDAPALKDQASVDRITDGWNAKFGGSGNAKKVALLQEGMTFRPLSMTNVDAALIDALRL 271 (419) T ss_pred HHhccCCccEEEE-ec-----CCCCcccCHHHHHHHHHHHHHHhcCccccCCceecCCCceEEEccCChhhHHHHHHHHH Confidence 2222 122222 23 344445677777777777776651 1 244444445555555421 1112233445 Q ss_pred hHHhhhhhhhhhhhhccCC-CcchHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHh--hcccceEEEEEecCCCCccHHHH Q lcl|NC_015263. 339 ATKNFWDNAGVSQILFSSD-NKTSQGIAMSIATDEQF-IFGVINQLERWLNRYLL--LNGMSKYFKATMLEVTHFSKKEA 414 (513) Q Consensus 339 ~~~~i~~~~GiS~~Lfn~d-~~s~~~~~~SI~~d~~~-~~~~~~~iE~~~N~~i~--~~~~~~~f~~~~l~~T~fn~ke~ 414 (513) ..++|....||...++|.. +.+++++-..-+..... +.-++++||..+|+.|- ....+..++|..-+...-+.++. T Consensus 272 ~~~~Ia~~fgVpp~~lg~~~~~t~s~~E~~~~~f~~~~L~P~~~~ie~~l~~kll~~~~~~~~~i~fd~~~l~r~d~~~~ 351 (419) T protein:vir:14 272 SALDIARIYKIPAHMVNELERATFSNIEHQSLQFVIYTLLPWVKRHEQAKTRDLLLPSERKQYFIEYNLAGLLRGDQSSR 351 (419) T ss_pred HHHHHHHHhCCCHHHhcCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHhhhccCccccCCeEEEEechhhhccCHHHH Confidence 5688999999998888643 33333332222222222 33588999999999763 23334456666656666677888 Q ss_pred HHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccCCCCcCCCCccc Q lcl|NC_015263. 415 HDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETT 494 (513) Q Consensus 415 ~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~ 494 (513) ++.+.++.+-|.=..--.-+.+|+.|.+ +-|..+.|+. ...... +...+.|.+. T Consensus 352 ~~~~~~~~~~G~~T~NE~R~~~gl~p~~------------gGD~~~~~~n---~~~~~~-------~~~~~~~~~~---- 405 (419) T protein:vir:14 352 YAAYAVGRQWGWLSINDIRRLENMPPVK------------GGDIYLSPMN---MVDASK-------PQQLPVGKSE---- 405 (419) T ss_pred HHHHHHHHhCCCcCHHHHHHHhCCCCCC------------CcCeeeeccc---cccccc-------cccccCCCCC---- Confidence 8888887766632222222334555532 2344555532 211111 0011111110 Q ss_pred ccccCCCCCCCCCCccCCC Q lcl|NC_015263. 495 GNKDSDETQRAKDKPANTQ 513 (513) Q Consensus 495 ~n~~~~~~~~~~d~~~~~~ 513 (513) ..+.+.+.++| T Consensus 406 --------~~~~~~~e~~~ 416 (419) T protein:vir:14 406 --------PTKAAIDEIGR 416 (419) T ss_pred --------Cccccccchhc Confidence 01112222222 No 14 >protein:vir:4337 Length: 434 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061500;genbank:gi:9635589;genbank:GeneID:1262858 Probab=98.86 E-value=2.2e-08 Score=62.60 Aligned_cols=413 Identities=14% Similarity=0.124 Sum_probs=188.3 Q ss_pred CCCccchheeeeehhhhhhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhh Q lcl|NC_015263. 1 MVKNKKKRLSMIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQ 80 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~ 80 (513) |++--.+.++---.-.-++++.. ..+.++.. .......++-. +...- ..++.--.-+ T Consensus 1 ~~~~l~~~~~~~~~~~~~~~~~~-----------------~~~~~~~~-~~~~~~~~~g~--~~~~g---~~v~~~~al~ 57 (434) T protein:vir:43 1 MSKSLGKVLSSATSAPRSSLFGW-----------------GGKTIRLT-DGAFWSQFLGR--ESSSG---KKVTVDKAMK 57 (434) T ss_pred Cccchhhhhhhcccccchhhhcc-----------------cccccccC-chHHHHHHhcC--CccCC---ceechhhhhc Confidence 33221111111000000000000 00111100 11111111110 11100 0111111223 Q ss_pred cchHHHHHHHHh-hcccccceEeeccchhhhhhcchhHHHHHHHHHHhh-----cChhHHHHHHHHHHHHhcceeEEEEE Q lcl|NC_015263. 81 SQQYQRLLNFYA-NMPLYAYSVVPFKDISTANENKLKKELATVTEFLSR-----LNPKYNFSKIVKLAMTVDIFYGYVID 154 (513) Q Consensus 81 sg~~~rlidy~~-~mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k-----~n~k~~~~~i~~~~l~~g~~~gy~i~ 154 (513) ++.+.+.|+.++ ++..+...++ +...+.+..+..-+.+...|.. +.-..+...++..++..|..|.+++. T Consensus 58 ~~~V~~~i~~ia~~ia~lp~~~~----~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~~ 133 (434) T protein:vir:43 58 LSAVWACVRLISTSVAGLPLGVY----ERKADGSRVDARSFPLYDVVHNSPNDDMTAFQFWQAMVASMLLWGNAYAEIRR 133 (434) T ss_pred cHHHHHHHHHHHHhhhhCceEEE----EEcCCCccccccccHHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEe Confidence 445666777665 3444444443 1111111111122334444532 34567777888899999999999775 Q ss_pred c-CcceeeeecCcceeEEEEEECCeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEE Q lcl|NC_015263. 155 D-KESVMIQQFPNDICKISSVSGGVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICI 233 (513) Q Consensus 155 d-~~~~~iq~lp~dyckIsg~~nG~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~i 233 (513) + +..+-+.+||+++|.|.--.+|...|.+-.. +...++++...-+.| T Consensus 134 ~~G~~~~L~~l~p~~v~~~~~~~g~~~y~~~~~--------------------------------~g~~~~~~~~eVih~ 181 (434) T protein:vir:43 134 AAGRPAALDFLLPSRVDLECDENGRLKYFYTTK--------------------------------KGARREIERTNMLHI 181 (434) T ss_pred CCCcEEEEEEEcCcceEEEEcCCCeEEEEEEec--------------------------------CceEEEEccccEEEe Confidence 4 3346789999999999866778766543211 012445666555556 Q ss_pred Ee-cCccccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhc-c-- Q lcl|NC_015263. 234 KI-NESSLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTV-P-- 309 (513) Q Consensus 234 k~-~~~~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~L-p-- 309 (513) +. ..+..+|++|...+...+--....++... .-..|-...-.-|-+ ++ .++.++++++.+.+++.. . T Consensus 182 ~~~~~dg~~G~spi~~~~~~i~~~~~~~~~~~--~~f~ng~~~~gil~~-----~~--~l~~e~~~~~r~~~~~~~g~~n 252 (434) T protein:vir:43 182 PAFTLDGRIGLSAIRYGVDVFGSVMSAEDAAN--GTFKNGLLPTVAFKV-----DR--ILQPAQREEFREYVKSVSGAMN 252 (434) T ss_pred cCcCCCCccccCHHHHHHHHHHHHHHHHHHHH--HHHhccCCcceEEec-----CC--CCCHHHHHHHHHHHHHhcCccc Confidence 64 23456788888777665544444444221 112221111111111 11 245556666655555432 1 Q ss_pred -ccceEEEeccccccccccc-ccccchhhhhhHHhhhhhhhhhhhhccCCCc-c--hHHHHHHHHHHHHH-HHHHHHHHH Q lcl|NC_015263. 310 -DNVGVVTSPMEIDTVSFDK-DSSTDDSVEKATKNFWDNAGVSQILFSSDNK-T--SQGIAMSIATDEQF-IFGVINQLE 383 (513) Q Consensus 310 -~gv~~v~sP~~~d~i~ld~-~~~~~dtv~~~~~~i~~~~GiS~~Lfn~d~~-s--~~~~~~SI~~d~~~-~~~~~~~iE 383 (513) .++..+-..+++..+.+.. +..--++.+-..++|..+.||...++|.... + ++.+.......-.. +.-++.+|| T Consensus 253 ag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~s~~e~~~~~f~~~~L~P~~~~ie 332 (434) T protein:vir:43 253 SGRSPVLEQGITPETIGINPVDAQLLETREHGVIEICRWFGVPPWMIGQTDKGSNWGTGLEQQMLAFLTFSISSITNQIQ 332 (434) T ss_pred cCCccccCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCCccccchHHHHHHHHHHHHHHHHHHHHH Confidence 1233443444544444421 1122234555568899999999888864332 2 22222222222222 335899999 Q ss_pred HHHHHHHhh--cccceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccC Q lcl|NC_015263. 384 RWLNRYLLL--NGMSKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMT 461 (513) Q Consensus 384 ~~~N~~i~~--~~~~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~ 461 (513) ..+|+.|-. +..+..|+|.+-+...-+.++..+.+.++.+-|+=..--+-..+|+.|. | |-|..++ T Consensus 333 ~~ln~kL~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~p~--------~----ggD~~~~ 400 (434) T protein:vir:43 333 QCVNKRLLTAPERIRYYAEFSLEGFLKADSAGRAAWYSTMAQNGFMTRNEGRRKENLPEL--------P----GGDILTV 400 (434) T ss_pred HHHHhhcCChhhhcCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--------C----CCCeEee Confidence 999998743 2223446666556656688888888888877774333223334566663 2 2334444 Q ss_pred cccccc-cccccccccCCc-cccCCCCcCCCCcc Q lcl|NC_015263. 462 PLSSSF-NTSGSDIAENAI-KEKGKENGRPTNET 493 (513) Q Consensus 462 Pl~TS~-T~Sg~~~~~~~~-~~~~~~~grPt~et 493 (513) |+.--- ..-+....+.+. .+....+|.|+++- T Consensus 401 ~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 434 (434) T protein:vir:43 401 QSNLVPIDQLGQSNKSQAVRAALMNWFSQPEPQE 434 (434) T ss_pred ccCccchhhhhccCCCcchhhhhhccCCCCCCCC Confidence 431100 001111001111 11112233333331 No 15 >protein:vir:79772 Length: 648 # NCBI annotation: portal protein # Family: family:all:3222 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429612;genbank:gi:156564103;genbank:GeneID:5525537 Probab=98.83 E-value=3.4e-08 Score=61.50 Aligned_cols=443 Identities=10% Similarity=0.063 Sum_probs=180.9 Q ss_pred CCC-----------ccchhe----eeeehhhhhhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhccChh Q lcl|NC_015263. 1 MVK-----------NKKKRL----SMIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEG 65 (513) Q Consensus 1 ~~~-----------~~~~~~----~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~ 65 (513) |.+ +---|| ++.---.-.-.+.++........-+-.--. + +.+.-++ +-+ |-- T Consensus 17 ~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~d~~~~~~~r~g~~~~--~---~~~g~~~-----~~e--pp~ 84 (648) T protein:vir:79 17 MWRDEDDDKEPLVLEESMQLGEAPGAMPKGGGGGGSAKRDPKMSLVKRIGLAIM--D---GGGGGRD-----FEE--PEF 84 (648) T ss_pred hccCccccccccccccccccCCCccccCCCCcccccccccchhHHHHHhHHHHH--h---hcCCccc-----ccc--CCc Confidence 111 100010 000000000000111111000000000000 0 0000000 111 444 Q ss_pred HHHHHHHHHHHHHhhcchHHHHHHHHhh-cccccceEeeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHH Q lcl|NC_015263. 66 NQKTLRKVSEDLAVQSQQYQRLLNFYAN-MPLYAYSVVPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMT 144 (513) Q Consensus 66 n~~~ir~~s~~lY~~sg~~~rlidy~~~-mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~ 144 (513) +.+.|.+ +|.+++++++.|+.++. +..+...|.+-.+ .........+. ...-...++...++..++.+++. T Consensus 85 d~~~l~~----l~~~np~V~~aI~iia~~ia~l~~~i~~~~~---~~~~~~~~~~l-l~rPn~~~t~~~f~~~l~~~lll 156 (648) T protein:vir:79 85 DFNEITS----AYNTEGYVRQAVDKYIEMMFKADWDFVSKNP---NAVEYIRMRFT-LMAEATQIPTNQLFIEIAEDLVK 156 (648) T ss_pred CHHHHHH----HHhcChHHHHHHHHHHHHHhhCcceEEecCC---ccchhhHHHHH-hhccCCCCCHHHHHHHHHHHHHh Confidence 4444433 57889999999988753 3445555654221 11111111110 11111235566788889999999 Q ss_pred hcceeEEEEEcCcce-----------------eeeecCcceeEEEEEECCeeEEEEEeeeccCcchhccccHHHHHHHHH Q lcl|NC_015263. 145 VDIFYGYVIDDKESV-----------------MIQQFPNDICKISSVSGGVYNYVIDLDALVSADIVDYYPKEIQEAVNK 207 (513) Q Consensus 145 ~g~~~gy~i~d~~~~-----------------~iq~lp~dyckIsg~~nG~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~ 207 (513) .|..|.+++-++++. .+.|++++.+++..-.+|.... | . T Consensus 157 ~GNAYveiiRd~~G~~~~~l~~~~~~~~~~v~~l~pl~p~~v~v~~d~~g~~~~-----Y-------------------~ 212 (648) T protein:vir:79 157 YCNVVIAKSRAKDALPFQGMNVMGVGDSMPVAGYFPLNLASMKVKRDKFGMIKG-----W-------------------Q 212 (648) T ss_pred cCCeEEEEEecCCCccchhhhhhhhccccceeeeEeecCceeEEEEcCCCceee-----e-------------------E Confidence 999999999887763 3556777777765433332110 0 0 Q ss_pred HhhhhhccCcccccC-eeecCCceEEEEec--CccccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhce--eeeeeeccc Q lcl|NC_015263. 208 YTTMKKGNNKSASNW-YEIQDKNSICIKIN--ESSLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYK--LLIQKLETR 282 (513) Q Consensus 208 Y~~~k~~~~~~~~~W-~~L~~~kt~~ik~~--~~~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~--ii~~kip~~ 282 (513) |.. .+ ..+ +.++++.-+.|+.. .+..+|+||...+...+--.....+... .-..|-. -.+-++|. T Consensus 213 y~~----~g---~~~~~~~~~~dIIHik~~~~~d~~~GlSpi~~a~~aI~l~~aa~~~~~--~fF~NGa~P~gil~~~~- 282 (648) T protein:vir:79 213 QEQ----EG---QDKPQKFKPEDIVHIYYKREKGRAFGTPWLLPALDDIRALRQVEENVL--RLVYRNLHPLWHVKVGL- 282 (648) T ss_pred EEe----cC---CceeEEecCccEEEEccCCCCCCceeccHHHHHHHHHHHHHHHHHHHH--HHHhccCCccEEEEeCC- Confidence 100 00 112 45666667777743 4566899999888776644444444221 1122211 11222331 Q ss_pred cCCCCCccccCHHHHHHHHHHHHHhccccceEEEecccccccccccccc-----cchhhhhhHHhhhhhhhhhhhhccCC Q lcl|NC_015263. 283 SSNDNNDFTLDMPMMNYFHEALSMTVPDNVGVVTSPMEIDTVSFDKDSS-----TDDSVEKATKNFWDNAGVSQILFSSD 357 (513) Q Consensus 283 ~~n~~~~~~vd~~~~~~~~~~ik~~Lp~gv~~v~sP~~~d~i~ld~~~~-----~~dtv~~~~~~i~~~~GiS~~Lfn~d 357 (513) ....+.-....++++++..+..+-.|.+. ..+.+.++-..+ -..+.+-..++|-.+.||...++|-. T Consensus 283 ---~~~~~e~~k~~~e~~~~~~~~~~i~gg~v-----~~~~~~i~~~~s~~dlqfle~rk~~~~eIa~aFgVPP~lLG~~ 354 (648) T protein:vir:79 283 ---EQEGFGAEEGEVDLVRGEVENMDVEGGMV-----TTERVNISSIASNQIIDAKEYLKHFEQRAFTVLGVSELMMGRG 354 (648) T ss_pred ---CccchHHHHHHHHHHHHhccccccccccc-----ccceeeccccCCHHHHHHHHHHHHHHHHHHHHhCCCHhHcccC Confidence 11111112222344554444432111111 112222221111 11234555688999999999888632 Q ss_pred -CcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc-----------cceEEEEEecCCCCccHHHHHHHHHHHHhcC Q lcl|NC_015263. 358 -NKTSQGIAMSIATDEQFIFGVINQLERWLNRYLLLNG-----------MSKYFKATMLEVTHFSKKEAHDRYITDAQYG 425 (513) Q Consensus 358 -~~s~~~~~~SI~~d~~~~~~~~~~iE~~~N~~i~~~~-----------~~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G 425 (513) +++.+.+...-..-...+..+++.++++++..+.+.. ..+.++|.|-+...-..+...+.+.++.+-| T Consensus 355 ~~ss~stae~~~~~~~~~i~~l~~~i~~~le~~~~~~ll~e~~l~~~l~~d~~ieF~~~~Llr~D~~~~a~~~~~l~~~G 434 (648) T protein:vir:79 355 GTASRSTGDNLSSDFKDRIKALQKVMATFINEFMVKEILMEGGFDPVLNPDDKVEFRFNEIDMDSKIKLENQAVFLYEHN 434 (648) T ss_pred CCccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccceEEEeecccchhhHHHHHHHHHHHHhCC Confidence 2333332222222233344466666666665443211 1234677777776667777777788777778 Q ss_pred CcHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccC-CCCcCCCCcccccccCCCCCC Q lcl|NC_015263. 426 FPVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKG-KENGRPTNETTGNKDSDETQR 504 (513) Q Consensus 426 ~~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~-~~~grPt~et~~n~~~~~~~~ 504 (513) .-...-.-+.+|+.|.+--. ...-+...+.|....-...+..+.....+... ..+|++... .+++.++++. T Consensus 435 ilT~NEaR~~lGlpPi~~g~------~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~eg~~~e~--~~~~~~~~~~ 506 (648) T protein:vir:79 435 AISEDEMRELIGRDPVDDGE------GRAKMHLQMVTIAQATALAALAPTPAGGSSASASGDKKKKAT--DNKTKPTNQH 506 (648) T ss_pred CcCHHHHHHHhCCCCCCCCC------CccccccccccchhccccccCCCCCCCCCCCCcccccccccc--CCCCCCCCCC Confidence 54444455566887753000 00001111111110000000000000000000 111212111 1112222222 Q ss_pred CC-CCccCCC Q lcl|NC_015263. 505 AK-DKPANTQ 513 (513) Q Consensus 505 ~~-d~~~~~~ 513 (513) .+ .+|-+-+ T Consensus 507 g~~~~~~~~~ 516 (648) T protein:vir:79 507 GTKTSPKKQT 516 (648) T ss_pred CcCCCCcccc Confidence 11 1121111 No 16 >protein:vir:10321 Length: 495 # NCBI annotation: ORF23 # Family: family:all:47 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758916;genbank:gi:27311190;genbank:GeneID:956137 Probab=98.80 E-value=4.3e-08 Score=60.99 Aligned_cols=440 Identities=10% Similarity=0.058 Sum_probs=222.1 Q ss_pred eeeeehhhhhhHHHHHHHHHHHHH-hhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhcchHHHH Q lcl|NC_015263. 9 LSMIDVESISSYSNKRNNRISILR-DDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQQYQRL 87 (513) Q Consensus 9 ~~~~~~~~~~~~~~~~~~~~~i~~-~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~~~rl 87 (513) ++|+|+- +++-.-=+..+...-. +-.. +.....-+..+ +.|. + + ..+-..||.-+++|+.++|+.++. T Consensus 1 m~~~~~~-~~a~~~~~~~~~~~~~y~aa~-~~~~~~~~~~~-s~d~--~-~-----~~~~~~lr~RaRdl~rNn~~a~~a 69 (495) T protein:vir:10 1 MNMTPSG-YQSLASGLLVPVGASAYEGAS-GGHRWQDIGDY-GPDT--A-V-----ASGIQTLRARSHHNVRNNPWATNA 69 (495) T ss_pred CCccccc-ccccchhhhhHHHhhhhhccc-cCcccCCCCCC-ChhH--H-H-----HHHHHHHHHHHHHHHhcChHHHHH Confidence 6777773 2222211111111000 0000 10001111111 2221 1 1 135678999999999999999999 Q ss_pred HHHHhhcccccceEee-ccchhhhhhcchhHHHHHHHH---HHhhcChhHHHHHHHHHHHHhcceeEEEEEcC--c---- Q lcl|NC_015263. 88 LNFYANMPLYAYSVVP-FKDISTANENKLKKELATVTE---FLSRLNPKYNFSKIVKLAMTVDIFYGYVIDDK--E---- 157 (513) Q Consensus 88 idy~~~mpt~dY~I~P-~~~~~~~~~~~~~~~y~~v~~---~L~k~n~k~~~~~i~~~~l~~g~~~gy~i~d~--~---- 157 (513) |+.+.+.--=. =|.| ...........+...+..-+. .=-+++.-....-+.+..++.|..|.-.+... + T Consensus 70 v~~~~~~vVG~-Gi~p~~~~~~~~~~~~ie~~w~~wa~~~D~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~g~~~ 148 (495) T protein:vir:10 70 VATWVAAAVGN-GLTPRWRMKEQELRQELQELWGDWVNEADFDEVQSFYGLQALVVRTVINSGEAFVIKKPRPLSEGLSV 148 (495) T ss_pred HHHHHHhhcCC-CcccccCCchHHHHHHHHHHHHHhhcCcccccccCHHHHHHHHHHHHHhCCceEEEEeecccCCCCcc Confidence 99887754222 2344 221121222223333322211 11123344445667888899999998665432 2 Q ss_pred ceeeeecCcceeEEE-EE---ECCee-EEEEEeeeccCcc-hh-----ccccHHHHHHHHHHhhhhhccCcccccCeeec Q lcl|NC_015263. 158 SVMIQQFPNDICKIS-SV---SGGVY-NYVIDLDALVSAD-IV-----DYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQ 226 (513) Q Consensus 158 ~~~iq~lp~dyckIs-g~---~nG~y-~~~fD~syFd~~~-~L-----~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~ 226 (513) +.-+|-+++|+|.-. +. .+|.+ +-.+= ||... -+ ..-|.+... ......|..+| T Consensus 149 ~~~lqliepd~l~~~~~~~~~~~g~~i~~GIe---~d~~Gr~vaY~i~~~hpgd~~~------------~~~~~~~~rvp 213 (495) T protein:vir:10 149 PLQLQIIEPDMLASDIPDETLPSGGYVKGGIR---FSNGGKRKAYCFYRNHPAESSL------------IGDPVDTVWIK 213 (495) T ss_pred ceEEEEechhhcCCCCCCCCCCCCCEEEeceE---ECCCCceEEEEEeecCCCcccc------------cccccceeeec Confidence 357899999999632 11 22322 23332 23211 12 112322110 01123577788 Q ss_pred CCceEEEEecCcccc---chhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHH Q lcl|NC_015263. 227 DKNSICIKINESSLT---PVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEA 303 (513) Q Consensus 227 ~~kt~~ik~~~~~~~---~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ 303 (513) -+. |++-+. ..+. |+|.|++++ .+-++++|.+-....+.++. .+...|--..+.+.+...+.......- .. T Consensus 214 A~~-vlH~f~-~r~gQ~RGis~la~i~-~l~~l~~y~dael~~a~i~A--~~~~fi~~~~~~~~~~~~~~~~~~~~~-~~ 287 (495) T protein:vir:10 214 AEH-VLHVTV-LTVRSDAGAPWFQLLL-RLNELDQYEDAELVRKKTAA--LFAAFIQEATADSTGGPTIGQPKRSKG-GK 287 (495) T ss_pred hhh-eEeccc-cCCCcccCcchhHHHH-HHHHhhHHHHHHHHHHHHhh--hheeeeecCCCccccccccCccccccC-cc Confidence 653 444343 3343 889888765 59999999998888888866 333333311112221111111111100 01 Q ss_pred HHHhccccceEEEecccccccccccc----cccchhhhhhHHhhhhhhhhhhhhccCC--CcchHHHHHHHHHHHHHH-- Q lcl|NC_015263. 304 LSMTVPDNVGVVTSPMEIDTVSFDKD----SSTDDSVEKATKNFWDNAGVSQILFSSD--NKTSQGIAMSIATDEQFI-- 375 (513) Q Consensus 304 ik~~Lp~gv~~v~sP~~~d~i~ld~~----~~~~dtv~~~~~~i~~~~GiS~~Lfn~d--~~s~~~~~~SI~~d~~~~-- 375 (513) ....|..|....+-|= +.|+|-.. ..-.+.+......|-..+||+--.+.+| +.|++++..++...-..+ T Consensus 288 ~~~~l~pG~i~~L~pG--e~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~Ye~ltgD~s~~nYSS~R~~~~e~~r~~~~ 365 (495) T protein:vir:10 288 RITGLNPGTLQYLQPG--QEVKFSNPADVGTTYEPWLRYQLLSIAKGYGITYEMLTGDLRGVNYSSIRAGLLEFRRLCQQ 365 (495) T ss_pred cceecCCceeeecCCC--CeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccccHHHHHHHHHHHHHHHHH Confidence 1112322322222332 35555332 2333557777788999999997777666 568888888775332222 Q ss_pred -------HHHHHH-HHHHHHHHHhhcccc------e---EEEEEe--cCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHh Q lcl|NC_015263. 376 -------FGVINQ-LERWLNRYLLLNGMS------K---YFKATM--LEVTHFSKKEAHDRYITDAQYGFPVKVYLASLM 436 (513) Q Consensus 376 -------~~~~~~-iE~~~N~~i~~~~~~------~---~f~~~~--l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~ 436 (513) ..|.+- -++|+-..+..+.+. . ..+..+ .+-...+..+.++.-+....-|+.++...++.+ T Consensus 366 ~q~~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~~~w~~p~~~~vDP~Ke~~A~~~~i~~G~~s~~~~~a~~ 445 (495) T protein:vir:10 366 VQHHMIIHQFCRPVGRWFMDFAVASGAVVIPDYLQRRRYYNRVSWRTPRWEEVDPLKKHLADLGDVRAGFAPISDKQAER 445 (495) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCchhhhHhhhccccccCCccccChHHHHHHHHHHHHcCCCCHHHHHHHc Confidence 133332 344677665554332 0 123333 444467777888888999999999999999999 Q ss_pred CCCHHHHHHHHHHHHHh---hCcccccCcccccccccccccccCCccccCCCCcCCCCcccccccCCC Q lcl|NC_015263. 437 GIDPVAFTGLLKVENEM---LDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETTGNKDSDE 501 (513) Q Consensus 437 G~~p~~~~~~~~~E~e~---L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~~n~~~~~ 501 (513) |.+|++++.+...|++. +||..--.|..+. -||. ..++..+.++ .++ T Consensus 446 G~D~~~v~~q~a~e~~~~~~~Gl~~~~~p~~~~--~~~~----------~~~~~~~~~~------~~e 495 (495) T protein:vir:10 446 GYDMEELFDMISDANQLIDEYDLRLDSDPRYVN--GSGA----------EQKSVMEAAL------NNE 495 (495) T ss_pred CCCHHHHHHHHHHHHHHHHHcCCCCCCCCCcCC--CccC----------CCCCCCCCCC------CCC Confidence 99999999999999854 4442211122110 0110 0011101001 111 No 17 >protein:vir:3843 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050149;swissprot:trembl:q9t1f8;genbank:gi:9633041;uniprot:Q9T1F8;genbank:GeneID:1262206 Probab=98.79 E-value=4.7e-08 Score=60.76 Aligned_cols=381 Identities=11% Similarity=0.111 Sum_probs=169.2 Q ss_pred HHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhcchHHHHHHHHhh-cccccceE Q lcl|NC_015263. 23 KRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQQYQRLLNFYAN-MPLYAYSV 101 (513) Q Consensus 23 ~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~~~rlidy~~~-mpt~dY~I 101 (513) |.=.+++ ...+ +..+..|. .|+ .++..+ .....++.--+-+++.+.+.|++++. +..+...+ T Consensus 1 M~~f~~~-----~~~~-------~~~~~~~~--~~~-~~~~~~--~~~~~v~~~~al~~~~V~~~v~~ia~~ia~~p~~~ 63 (397) T protein:vir:38 1 MPLLKLN-----KSHS-------QGFSLNDP--DWV-NFLTGG--EAQKYVSADTALKNSDIFSLIMQLSGDLAMVRYTS 63 (397) T ss_pred Ccchhhh-----hccc-------CcccCCch--hhh-hhhcCC--cCCceechHHhhccHHHHHHHHHHHHHHhhCcccc Confidence 1001110 0000 00111110 111 111000 00011111122346677777887754 44433221 Q ss_pred eeccchhhhhhcchhHHHHHHHHHHhhcC----hhHHHHHHHHHHHHhcceeEEEEEcCcc--eeeeecCcceeEEEEEE Q lcl|NC_015263. 102 VPFKDISTANENKLKKELATVTEFLSRLN----PKYNFSKIVKLAMTVDIFYGYVIDDKES--VMIQQFPNDICKISSVS 175 (513) Q Consensus 102 ~P~~~~~~~~~~~~~~~y~~v~~~L~k~n----~k~~~~~i~~~~l~~g~~~gy~i~d~~~--~~iq~lp~dyckIsg~~ 175 (513) . ......++..-| ...+...+...++..|..|.+++-+.++ +-+.++|+..|+|.--+ T Consensus 64 ~----------------~~~~~~l~~~PN~~~s~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~l~~l~~~~v~i~~~~ 127 (397) T protein:vir:38 64 E----------------SDRSQSIISNPSVTANGYSFWQGMFAQLLLDGNCYAYRHKNTNGVDLSWEYLRPSQVQPMLLQ 127 (397) T ss_pred c----------------ccHHHHHHhcCCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcC Confidence 1 011223444433 4466778888889999999998876554 68899999999998666 Q ss_pred CCe-eEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEEecC-c-cccchhhHHHHHHh Q lcl|NC_015263. 176 GGV-YNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKINE-S-SLTPVPPFAGTFDS 252 (513) Q Consensus 176 nG~-y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~~~-~-~~~~ip~f~~v~~d 252 (513) +|. ..|.|-+.. . .....+.++..--+-|+... . ..+|+||...+... T Consensus 128 ~~~~~~y~~~~~~---~--------------------------~~~~~~~~~~~eiih~~~~~~~~~~~G~s~i~~~~~~ 178 (397) T protein:vir:38 128 DGSGLIYNINFDE---P--------------------------AIGYMENVPAADVIHIRLLSKNGGKTGISPLSALINE 178 (397) T ss_pred CCceEEEEEEecc---c--------------------------cccceeEecCccEEEecCCCCCCccccccHHHHHHHH Confidence 652 233222111 0 00112344554344455422 2 24688888887776 Q ss_pred HHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHh--ccc--cceEEEeccccccccccc Q lcl|NC_015263. 253 IYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMT--VPD--NVGVVTSPMEIDTVSFDK 328 (513) Q Consensus 253 ~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~--Lp~--gv~~v~sP~~~d~i~ld~ 328 (513) +.-.....+.... -..|-...-..|-+ +.. ++.++.+++.+..+.. ..+ ++..+-..+++..+.+.. T Consensus 179 i~~~~~~~~~~~~--~f~ng~~~~~il~~---~~~----~~~e~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~ 249 (397) T protein:vir:38 179 QQIKDASNELTLK--ALKQSVTASAVLTI---QKG----GLLDAETRIARSKEISKQIHNSDGPVVIDALEDYKPLEVKG 249 (397) T ss_pred HHHHHHHHHHHHH--HHhccCCccEEEEe---CCC----CCHHHHHHHHHHHHHHhcccccCCceecCCCceEEecCCCh Confidence 6555555543321 22221111111211 111 2333333333333222 122 233333344444444431 Q ss_pred c-cccchhhhhhHHhhhhhhhhhhhhccCCCcchHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhcccceEEEEEecCC Q lcl|NC_015263. 329 D-SSTDDSVEKATKNFWDNAGVSQILFSSDNKTSQGIAMSIATDEQFIF-GVINQLERWLNRYLLLNGMSKYFKATMLEV 406 (513) Q Consensus 329 ~-~~~~dtv~~~~~~i~~~~GiS~~Lfn~d~~s~~~~~~SI~~d~~~~~-~~~~~iE~~~N~~i~~~~~~~~f~~~~l~~ 406 (513) . ..--.+.+-..++|..+.||+..++|+...+++.+... ...-..+. -++++||..+|+.|-.. .. |.+.++- T Consensus 250 ~d~~~~e~~~~~~~~Ia~afgVp~~~lg~~~~~~~~~e~~-~~~~~~~l~P~~~~ie~~ln~~l~~~-~~--~~~~~~~- 324 (397) T protein:vir:38 250 NIASLLNQVDWTRDQIAKVYGVPDSYLNGQGDQQSSITQI-SGQYAKSLNRYVQAIVGELNDKLHAN-IS--ANIRFAI- 324 (397) T ss_pred hHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHH-HHHHHHHHHHHHHHHHHHHHHhccCh-hc--ccccccc- Confidence 1 12223456667889999999999998865544333221 22222233 58899999999988543 22 3333332 Q ss_pred CCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccCCCC Q lcl|NC_015263. 407 THFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKEN 486 (513) Q Consensus 407 T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~ 486 (513) ..+.++..+.+.++.+-|. ++|.|+-.++-.+- ..+ .+...|.....+. .+ ......+++. T Consensus 325 -~~d~~~~~~~~~~~~~~G~-----------~t~nE~R~~lg~~p-~~~-~d~~~~~~~~~~~-~~----~~~~~~g~~~ 385 (397) T protein:vir:38 325 -DAMGDQYASTISSSVKGGT-----------IAGNQARFILQNSG-YLA-KDLPDPEKEPQQA-IQ----LIQQEGGEND 385 (397) T ss_pred -cCCHHHHHHHHHHHHhCCC-----------cCHHHHHHHhCCCC-CCC-Ccccccccccccc-cc----ccccccCCCC Confidence 2367777887777665552 35555444333221 111 1222222111111 00 0010011122 Q ss_pred cCCCCcccccccCCCCCCCC Q lcl|NC_015263. 487 GRPTNETTGNKDSDETQRAK 506 (513) Q Consensus 487 grPt~et~~n~~~~~~~~~~ 506 (513) |.++++..++ |. T Consensus 386 ~~~~~e~~~~--------~~ 397 (397) T protein:vir:38 386 GNNSDERGSD--------PE 397 (397) T ss_pred CCCCCCCCCC--------CC Confidence 2222221111 11 No 18 >protein:vir:80333 Length: 419 # NCBI annotation: gp4, phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111083;genbank:gi:134288632;genbank:GeneID:4960580 Probab=98.78 E-value=4.5e-08 Score=60.88 Aligned_cols=391 Identities=10% Similarity=0.079 Sum_probs=183.1 Q ss_pred CCCccc-hheeeeehhhhhhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHh Q lcl|NC_015263. 1 MVKNKK-KRLSMIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAV 79 (513) Q Consensus 1 ~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~ 79 (513) |.-+|. +|..--.......| .+.+|....+ ...+.++..-.- T Consensus 1 m~~~~~~~~~~~~~~~~~~~~--------------------~~~~~g~~~s-----------------~~~~~v~~~~al 43 (419) T protein:vir:80 1 MFFSRQLLSNLGQTQPGSGGW--------------------VSALLGSARS-----------------EAGQVVTPASAL 43 (419) T ss_pred CCcccccccccCcCCCCcchh--------------------hHHhhccccc-----------------ccCcccChHHhh Confidence 554432 22111100000001 0011110000 000001000111 Q ss_pred hcchHHHHHHHHhh-cccccceEeeccchhhhhhcchhHHHHHHHHHHh-h----cChhHHHHHHHHHHHHhcceeEEEE Q lcl|NC_015263. 80 QSQQYQRLLNFYAN-MPLYAYSVVPFKDISTANENKLKKELATVTEFLS-R----LNPKYNFSKIVKLAMTVDIFYGYVI 153 (513) Q Consensus 80 ~sg~~~rlidy~~~-mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~-k----~n~k~~~~~i~~~~l~~g~~~gy~i 153 (513) ..+.+.+.|+.++. +..+...++ ......+...++ +.+...|. + +.--.+...++..++..|..|.+++ T Consensus 44 ~~~~v~~cv~~ia~~ia~lp~~~~----~~~~~~~~~~~~-~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~ 118 (419) T protein:vir:80 44 SLTVLQNCVTLLAESIAQLPVELY----ERSGDDRKPATD-HPLYSILKYEPNPWQTPFEYQEQSQVAVGLRGNSYSFID 118 (419) T ss_pred ccHHHHHHHHHHHHhhccCceEEE----EecCCCcccccc-cHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEE Confidence 23455566666543 222222332 111111111111 22334444 2 3355667788888899999999998 Q ss_pred EcCcc--eeeeecCcceeEEEEEECCeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceE Q lcl|NC_015263. 154 DDKES--VMIQQFPNDICKISSVSGGVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSI 231 (513) Q Consensus 154 ~d~~~--~~iq~lp~dyckIsg~~nG~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~ 231 (513) -+..+ .-+.++|++.|.|.--.+|.+.|.+.... .+ +.+--+ T Consensus 119 r~~~G~~~~L~~i~~~~v~i~~~~~~~~~y~~~~~~--------~~----------------------------~~~~i~ 162 (419) T protein:vir:80 119 RDQDGVIQGLYPLDNEAVTVMKGPDLKPMYRVAGAD--------PL----------------------------PQRLVH 162 (419) T ss_pred ECCCCcEEEEEEecCceEEEEECCCceEEEEEcCcc--------cc----------------------------chhheE Confidence 76555 67999999999998666777776553210 11 222223 Q ss_pred EEEe-cCccccchhhHHHHHHhHHHHHHHHHHHhhHhhhhh---ceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHh Q lcl|NC_015263. 232 CIKI-NESSLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQN---YKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMT 307 (513) Q Consensus 232 ~ik~-~~~~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n---~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~ 307 (513) .++. ..+.++|+||...+...+--.....+.. ..-..| -..++ ++| .+..-..+.++++++.+.+.++ T Consensus 163 h~~~~~~d~~~G~s~i~~~~~~i~~~~~~~~~~--~~~f~ng~~~~gil-~~~-----~~~~~~~~~~~~~~~~~~~~~~ 234 (419) T protein:vir:80 163 HVRWMSINGYTGLSPVLLHANAIGHAQAIQQYA--GKSFMNGTALSGVI-ERP-----TDAPALKDQASVDRITDGWNAK 234 (419) T ss_pred EecCCCCCCcccccHHHHHHHHHHHHHHHHHHH--HHHHhcCCCccEEE-Eec-----CCCCcccCHHHHHHHHHHHHHH Confidence 3442 2345678888877776554433333322 111222 12222 233 2333445677777777776665 Q ss_pred cc--c---cceEEEeccccccccccc-ccccchhhhhhHHhhhhhhhhhhhhccCC-CcchHHHHH-HHHHHHHHHHHHH Q lcl|NC_015263. 308 VP--D---NVGVVTSPMEIDTVSFDK-DSSTDDSVEKATKNFWDNAGVSQILFSSD-NKTSQGIAM-SIATDEQFIFGVI 379 (513) Q Consensus 308 Lp--~---gv~~v~sP~~~d~i~ld~-~~~~~dtv~~~~~~i~~~~GiS~~Lfn~d-~~s~~~~~~-SI~~d~~~~~~~~ 379 (513) .- + ++..+-..+++..+.+.. +..--++..-..++|..+.||...++|.. +++++++.. .+.--..-+.-++ T Consensus 235 ~~g~~n~g~~~vl~~g~~~~~l~~s~~d~q~~e~~~~~~~~Ia~~fgVPp~llg~~~~~t~~n~e~~~~~f~~~~l~P~~ 314 (419) T protein:vir:80 235 FGGSGNAKKVALLQEGMKFKPLSMTNVDAALIDALRLSALDIARIYKIPAHMVNELERATFSNIEHQSLQFVIYTLLPWV 314 (419) T ss_pred hcCccccCCceecCCCceEEeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCcccHHHHHHHHHHHHHHHHH Confidence 51 1 233444444554444421 11222334455688999999999888643 233333322 2222222244588 Q ss_pred HHHHHHHHHHHh--hcccceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhhCcc Q lcl|NC_015263. 380 NQLERWLNRYLL--LNGMSKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENEMLDLP 457 (513) Q Consensus 380 ~~iE~~~N~~i~--~~~~~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~ 457 (513) ++||..+|+.|- ....+..|+|..-....-+.++.++.+.++..-|. ++|.|+-.++-++- +=|-| T Consensus 315 ~~ie~~l~~kll~~~~~~~~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~-----------~T~NE~R~~~g~~p-~~gGD 382 (419) T protein:vir:80 315 KRHEQAKTRDLLLPSERKQYFIEYNLAGLLRGDQSSRYAAYAVGRQWGW-----------LSINDIRRLENMPP-VKGGD 382 (419) T ss_pred HHHHHHHhhhccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCC-----------cCHHHHHHHhCCCC-CCCcc Confidence 999999999763 23334556776666666678888888877665552 45555543332221 00234 Q ss_pred cccCcccccccccccccccCCccccCCCCcCCCCcc-----cccccC Q lcl|NC_015263. 458 EIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNET-----TGNKDS 499 (513) Q Consensus 458 ~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et-----~~n~~~ 499 (513) ..++|+- ....+ .++..+.|.|.+.. .+...+ T Consensus 383 ~~~~~~n---~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~l~ 419 (419) T protein:vir:80 383 IYLSPMN---MVDAS-------KPQPIPMGKTEPTKAALDEIGRILS 419 (419) T ss_pred eeeeccc---ccccc-------ccccccCCCCCchhhhHHHHHhhcC Confidence 4555532 21111 11111122221110 011111 No 19 >protein:vir:3868 Length: 417 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680485;swissprot:trembl:q8ltc2;genbank:gi:22296525;interpro:IPR006427;interpro:IPR006944;uniprot:Q8LTC2;genbank:GeneID:951699 Probab=98.78 E-value=5e-08 Score=60.62 Aligned_cols=388 Identities=9% Similarity=0.016 Sum_probs=174.7 Q ss_pred cccccccchHHHHHHHhhhc-----cC---hhHHHHHHHHHHHHHhhcchHHHHHHHHhh-cccccceEeeccchhhhhh Q lcl|NC_015263. 42 APVGSLTSSQSKVRKIVKEY-----RN---EGNQKTLRKVSEDLAVQSQQYQRLLNFYAN-MPLYAYSVVPFKDISTANE 112 (513) Q Consensus 42 s~~~s~~~s~d~~k~~i~~~-----~P---~~n~~~ir~~s~~lY~~sg~~~rlidy~~~-mpt~dY~I~P~~~~~~~~~ 112 (513) =.+|+.-. .+..-.|..-+ +| ..+.. +.. -.++.+.+.|+.+++ +..+...++ . ...+ T Consensus 1 m~~~~~~~-~~~~~~~~~~~~~~~~~~~~~g~~~~-~~A------l~~~~V~~cv~~ia~~iA~lp~~~~----~-~~~~ 67 (417) T protein:vir:38 1 MKLFRGLA-TEVDPHWADHLLDSGVIPSFRGGYLG-ISA------LRNSDVLTAVSIVSGDVSRFPLVIT----D-SSTD 67 (417) T ss_pred Cccccccc-cCCCccchhhhcccccccccCCceec-hhh------cccHHHHHHHHHHHHhhccCeeEEE----E-cCCc Confidence 11222111 00001111110 01 11110 000 123445556666543 223333332 1 1111 Q ss_pred cchhHHHHHHHHHHh-hcC----hhHHHHHHHHHHHHhcceeEEEEEcCc---ceeeeecCcceeEEEEEECCeeEEEEE Q lcl|NC_015263. 113 NKLKKELATVTEFLS-RLN----PKYNFSKIVKLAMTVDIFYGYVIDDKE---SVMIQQFPNDICKISSVSGGVYNYVID 184 (513) Q Consensus 113 ~~~~~~y~~v~~~L~-k~n----~k~~~~~i~~~~l~~g~~~gy~i~d~~---~~~iq~lp~dyckIsg~~nG~y~~~fD 184 (513) +.... +.+...|. +=| -..+...++..++..|..|.+++-+.. +..+.++|+++|.|....+|.+.|.|. T Consensus 68 ~~~~~--~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~y~~i~r~~~g~~~~~l~~l~p~~v~v~~~~~~~~~y~~~ 145 (417) T protein:vir:38 68 EVIDL--ANIEYLMNTKVNKRLSAYQWKFPMMVNAILTGNAYSRIVRDPITNEPAMFEFYAPSQTQVDTSDPDNIIYRFT 145 (417) T ss_pred ceecc--chHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCCEEEEEEEeCCceEEEEEcCCCeEEEEEE Confidence 11111 12223343 233 345666778888999999999886533 456789999999998777788777654 Q ss_pred eeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEEec-CccccchhhHHHHHHhHHHHHHHHHHH Q lcl|NC_015263. 185 LDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKIN-ESSLTPVPPFAGTFDSIYDIHSFKDLR 263 (513) Q Consensus 185 ~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~~-~~~~~~ip~f~~v~~d~~di~~~kdL~ 263 (513) ... ......++..--+-|+.. .+...|++|...+...+--....++.. T Consensus 146 ~~~-------------------------------~~~~~~~~~~dviH~r~~~~d~~~G~s~l~~~~~~i~~~~~~~~~~ 194 (417) T protein:vir:38 146 PYN-------------------------------SSMQKVCGFEDVIHWKFFSYDTIMGRSPLLSLGDEIGLQESGVSTL 194 (417) T ss_pred EcC-------------------------------CcEEEEecCcceEEecCCCCCCccccCHHHHHHHHHHHHHHHHHHH Confidence 321 012233444444555542 234568888776654442222222211 Q ss_pred hhHhhhhh---ceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhcc--c--cceEEEecccccccccccccccchh- Q lcl|NC_015263. 264 NDKAELQN---YKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTVP--D--NVGVVTSPMEIDTVSFDKDSSTDDS- 335 (513) Q Consensus 264 ~~~~~i~n---~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~Lp--~--gv~~v~sP~~~d~i~ld~~~~~~dt- 335 (513) . .-..| -..|+ +.| -.++.++.+++.+.+++... + ++.++-..+++..+.+.. .+.+. T Consensus 195 ~--~~f~ng~~p~~il-~~~---------~~l~~e~~~~~~~~~~~~~~g~n~g~~~vl~~g~~~~~l~~~~--~d~q~l 260 (417) T protein:vir:38 195 Q--KFFKSGLKGSIIK-AKE---------SRLSAEARQKIREDFERAQAGADAGSPIIVDATMDYQPLEVDT--NVLNLI 260 (417) T ss_pred H--HHHhccCCCcEEE-EeC---------CCCCHHHHHHHHHHHHHHhcccccCCceeccCCceEEEccCCH--HHHHHH Confidence 1 11122 11221 111 12666777777777766552 1 233334455555555532 22232 Q ss_pred --hhhhHHhhhhhhhhhhhhccCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccceEEEEEecCCCCccHHH Q lcl|NC_015263. 336 --VEKATKNFWDNAGVSQILFSSDNKTSQGIAMSIATDEQFIFGVINQLERWLNRYLLLNGMSKYFKATMLEVTHFSKKE 413 (513) Q Consensus 336 --v~~~~~~i~~~~GiS~~Lfn~d~~s~~~~~~SI~~d~~~~~~~~~~iE~~~N~~i~~~~~~~~f~~~~l~~T~fn~ke 413 (513) ..-..+.|..+.||...++|+..+.++.-.....-...-+.-++++||..+|+.|-...-...+.|.| ++..+.+.. T Consensus 261 e~~~~~~~~Ia~~fgVPp~~lg~~~~~s~~e~~~~~~~~~tl~P~~~~ie~~l~~~Ll~~~~~~~~~~~f-d~~~l~~~~ 339 (417) T protein:vir:38 261 NSNNYSTAQIAKALRVPAYRLAQNSPNQSVKQLADDYIRNDLPFYFEPITSEFELKLLDDAQRHQYCIGF-DTKSVNGLP 339 (417) T ss_pred HHHHhhHHHHHHHhCCCHHHhCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhcChhhcccceEEe-chhhhhHHH Confidence 33334779999999999997655544444444444444455799999999999885322222234444 333333322 Q ss_pred HHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHH-HhhCcccccCcccccccccccccccCCccccCCCCcCCCCc Q lcl|NC_015263. 414 AHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVEN-EMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNE 492 (513) Q Consensus 414 ~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~-e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~e 492 (513) . ..++++..-| -++|.|+-.++-+|- +--+.|..+.|+.. +.=...++....+.....||.+..+ T Consensus 340 ~-~~~~~~~~~G-----------~~T~NE~R~~~gl~pi~~g~~d~~~~~~n~--~~~d~~~~~~~~~~~~~kgg~~~~~ 405 (417) T protein:vir:38 340 I-ADVNTAVNGG-----------LWTGNEGRAELGKKPLKDPNMDRIQSTLNT--VFLDQKEAYQAEHAAELKGGDTNAK 405 (417) T ss_pred H-HHHHHHHhCC-----------CcCHHHHHHHhCCCCCCCCCCCeeeecccc--cccccccccccccccccCCCCCCCC Confidence 2 2233333222 237777765554432 11122333444331 1101111111111111233333222 Q ss_pred ccccccCCCCCCCCCCccCC Q lcl|NC_015263. 493 TTGNKDSDETQRAKDKPANT 512 (513) Q Consensus 493 t~~n~~~~~~~~~~d~~~~~ 512 (513) +.. +..+.++|. T Consensus 406 ~~~--------~~~~~~~~~ 417 (417) T protein:vir:38 406 GNQ--------NGSGTNANS 417 (417) T ss_pred CCC--------cCCCCcCCC Confidence 111 111222222 No 20 >protein:vir:5737 Length: 419 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892048;genbank:gi:33770511;goa:Q7Y412;interpro:IPR006427;interpro:IPR006944;uniprot:Q7Y412;genbank:GeneID:1732929;interpro:IPR010994 Probab=98.78 E-value=5e-08 Score=60.61 Aligned_cols=395 Identities=11% Similarity=0.052 Sum_probs=184.7 Q ss_pred ccccccccccchHHHHHHH-hhhccChhHHHHHHHHHHHHHhhcchHHHHHHHHhh-cccccceEeeccchhhhhh-cch Q lcl|NC_015263. 39 VFGAPVGSLTSSQSKVRKI-VKEYRNEGNQKTLRKVSEDLAVQSQQYQRLLNFYAN-MPLYAYSVVPFKDISTANE-NKL 115 (513) Q Consensus 39 ~~~s~~~s~~~s~d~~k~~-i~~~~P~~n~~~ir~~s~~lY~~sg~~~rlidy~~~-mpt~dY~I~P~~~~~~~~~-~~~ 115 (513) |+....|....+....... +...++......-..++..-.-..+.+++.|+.+++ +..+...++ ...... ... T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~----~~~~~g~~~~ 76 (419) T protein:vir:57 1 MFIPQFWKGRPSENRVNWQVVPGGMRSSSSQAGVIITPETALALSAVRACVTLLAESVAQLPCVLY----RRTENGGREI 76 (419) T ss_pred CcchhhhccCCccccccccccccccccccccCCceechHHhhccHHHHHHHHHHHHhhccCceEEE----EEcCCCceec Confidence 3333333322211110000 000000000000000111111234457777777754 333444443 111111 111 Q ss_pred hHHHHHHHHHHh-h----cChhHHHHHHHHHHHHhcceeEEEEEcCc--ceeeeecCcceeEEEEEECCeeEEEEEeeec Q lcl|NC_015263. 116 KKELATVTEFLS-R----LNPKYNFSKIVKLAMTVDIFYGYVIDDKE--SVMIQQFPNDICKISSVSGGVYNYVIDLDAL 188 (513) Q Consensus 116 ~~~y~~v~~~L~-k----~n~k~~~~~i~~~~l~~g~~~gy~i~d~~--~~~iq~lp~dyckIsg~~nG~y~~~fD~syF 188 (513) ..+ +.+...|. + +.-..+...++..++..|..|.++..+.. .+-+.++|+++|.+.--.+|...|.++- T Consensus 77 ~~~-~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~pl~~~~v~v~~~~~g~~~y~~~~--- 152 (419) T protein:vir:57 77 AFD-HPLHDLIRYQPNRKDTAFEYHEQTQGVLGLEGNSYSLIDRNGRGDITELIPINPHKVIVLKGPDGMPYYDIPS--- 152 (419) T ss_pred ccc-chHHHHHhhccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCcceEEEECCCceEEEEEcC--- Confidence 111 12333343 2 23566677888899999999999987654 4689999999999986666765443320 Q ss_pred cCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEEe-cCccccchhhHHHHHHhHHHHHHHHHHHhhHh Q lcl|NC_015263. 189 VSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKI-NESSLTPVPPFAGTFDSIYDIHSFKDLRNDKA 267 (513) Q Consensus 189 d~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~-~~~~~~~ip~f~~v~~d~~di~~~kdL~~~~~ 267 (513) .. ..++.+.-+-++. ..+.++|+||...+...+--....++... . T Consensus 153 -~~-------------------------------~~~~~~~vih~r~~~~d~~~G~s~i~~~~~~i~~~~~~~~~~~--~ 198 (419) T protein:vir:57 153 -IG-------------------------------EILPMRMVHHIKSFSLDGYIGTSPIQTNPDVLGLGIAVEQHAA--Q 198 (419) T ss_pred -Cc-------------------------------eEEchhhEEEecCcCCCCcccccHHHHHHHHHHHHHHHHHHHH--H Confidence 10 1122222233332 23456788887766654443333333221 1 Q ss_pred hhhh---ceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhcc-----ccceEEEeccccccccccc-ccccchhhhh Q lcl|NC_015263. 268 ELQN---YKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTVP-----DNVGVVTSPMEIDTVSFDK-DSSTDDSVEK 338 (513) Q Consensus 268 ~i~n---~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~Lp-----~gv~~v~sP~~~d~i~ld~-~~~~~dtv~~ 338 (513) -..| -..++ +.| .+.+..++.++++.+.+...++.- -++..+...+++..+.+.. +..--.+.+- T Consensus 199 ~f~ng~~p~gil-~~~-----~~~~~~~~~e~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~ 272 (419) T protein:vir:57 199 VFARGTTMSGVI-ERP-----FEAKAIASQAAVDAILAKWTERYGGVRNAFSVGMLQEGMTYKQLSQDNEKAQLLQSRQY 272 (419) T ss_pred HHHccCCccEEE-Eec-----CcCCcccCHHHHHHHHHHHHHHhccccccccceecCCCceEEEcCCChhhHHHHHHHHH Confidence 1222 12222 233 222335777777777777776651 1233444444554444421 1111234455 Q ss_pred hHHhhhhhhhhhhhhccCCC-cchHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhh--cccceEEEEEecCCCCccHHHH Q lcl|NC_015263. 339 ATKNFWDNAGVSQILFSSDN-KTSQGIAMSIATD-EQFIFGVINQLERWLNRYLLL--NGMSKYFKATMLEVTHFSKKEA 414 (513) Q Consensus 339 ~~~~i~~~~GiS~~Lfn~d~-~s~~~~~~SI~~d-~~~~~~~~~~iE~~~N~~i~~--~~~~~~f~~~~l~~T~fn~ke~ 414 (513) ..++|..+.||...++|... ++++++...-+.. ..-+.-++++||..+|+.|-. ...+..|+|.+-+....+.++. T Consensus 273 ~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~f~~~~l~P~~~~ie~~l~~~ll~~~~~~~~~i~fd~~~ll~~d~~~~ 352 (419) T protein:vir:57 273 TVNEVCRLYKVPPHMIQDLQKSTNNNIEHQGLQYVIYTMLAILKRHESAMMRDLLLPSERRDFYIEFNVSSLLRGDQKSR 352 (419) T ss_pred HHHHHHHHhCCCHHHhCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHHhhccCccccCCeEEEEechhhhccCHHHH Confidence 55889999999988886432 2333332222222 333345888999999997642 2234457776666666788888 Q ss_pred HHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccCCCCcCCCCccc Q lcl|NC_015263. 415 HDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETT 494 (513) Q Consensus 415 ~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~ 494 (513) ++.+.++.+-|. ++|.|+-.++-+|--. |-|..++|+-- +..+ ...++|.|+++.. T Consensus 353 ~~~~~~~~~~G~-----------~T~NE~R~~~gl~p~~-ggD~~~~~~n~--~~~~----------~~~~~~~~~~~~~ 408 (419) T protein:vir:57 353 YESYALGRQWGW-----------LSVNDIRRMENLTPIP-GGDKYLTPLNM--VDSK----------ALTGIGKATPQQL 408 (419) T ss_pred HHHHHHHHhCCC-----------cCHHHHHHHhCCCCCC-CcCeeeecccc--cccc----------ccccccCCCcccC Confidence 888888776662 3444443332222101 33455555331 1101 0112222322211 Q ss_pred ccccCCCCCCCCCCccC Q lcl|NC_015263. 495 GNKDSDETQRAKDKPAN 511 (513) Q Consensus 495 ~n~~~~~~~~~~d~~~~ 511 (513) . +..+.+.++ | T Consensus 409 ~--~~~~~~~~~----~ 419 (419) T protein:vir:57 409 K--DIEAILCTR----N 419 (419) T ss_pred c--chhhhhhcc----C Confidence 1 111111111 1 No 21 >protein:vir:4454 Length: 414 # NCBI annotation: Portal Protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700377;genbank:gi:23505449;genbank:GeneID:955656 Probab=98.76 E-value=6e-08 Score=60.16 Aligned_cols=391 Identities=11% Similarity=0.109 Sum_probs=187.4 Q ss_pred HHHHHhhccCcccccccccccchHHH-H-HHHhhhc-cChhHHHHHHHHHHHHHhhcchHHHHHHHHhh-cccccceEee Q lcl|NC_015263. 28 ISILRDDNRTPVFGAPVGSLTSSQSK-V-RKIVKEY-RNEGNQKTLRKVSEDLAVQSQQYQRLLNFYAN-MPLYAYSVVP 103 (513) Q Consensus 28 ~~i~~~~~~~~~~~s~~~s~~~s~d~-~-k~~i~~~-~P~~n~~~ir~~s~~lY~~sg~~~rlidy~~~-mpt~dY~I~P 103 (513) -.++.. +|...+.... . .+++... .+.. ...-+.++.-.+-.++.+.+.|+++++ +..+...++- T Consensus 1 Mg~f~~----------lf~r~~~~~~~~~~~~~~~~~~~~~-~~~g~~v~~~~al~~~~v~~~i~~Ia~~ia~~p~~~~~ 69 (414) T protein:vir:44 1 MVFFSG----------LFQRKSDAPVTTPAELADAIGLSYD-TYTGKQISSQRAMRLTAVFSCVRVLAESVGMLPCNLYH 69 (414) T ss_pred Cchhhh----------hhccCccCcccchhhHhHhhccCcc-ccCCceechhhhhccHHHHHHHHHHHHHhccCceEEEE Confidence 222222 2222111100 0 0111110 0000 000111222234467778888888764 4455555542 Q ss_pred ccchhhhhhcchhHHHHHHHHHHh-h----cChhHHHHHHHHHHHHhcceeEEEEEc-CcceeeeecCcceeEEEEEECC Q lcl|NC_015263. 104 FKDISTANENKLKKELATVTEFLS-R----LNPKYNFSKIVKLAMTVDIFYGYVIDD-KESVMIQQFPNDICKISSVSGG 177 (513) Q Consensus 104 ~~~~~~~~~~~~~~~y~~v~~~L~-k----~n~k~~~~~i~~~~l~~g~~~gy~i~d-~~~~~iq~lp~dyckIsg~~nG 177 (513) ..... .+ ...-+.+...|. + +.-..+...++..++..|..|.+++.+ +....+.++|+.+|.|....+| T Consensus 70 ~~~~~--~~---~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gna~~~i~~~~g~~~~L~~l~~~~v~~~~~~~~ 144 (414) T protein:vir:44 70 LNGSL--KQ---RATGERLHKLISTHPNGYMTPQEFWELVVTCLCLRGNFYAYKVKAFGEVAELLPVDPGCVVPKLNSSW 144 (414) T ss_pred ecCCc--ee---ecccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEeCCCcEEEEEEEcCceEEEEECCCC Confidence 21111 01 111122334443 2 234566777888889999999998865 4445789999999999877777 Q ss_pred eeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEEe-cCccccchhhHHHHHHhHHHH Q lcl|NC_015263. 178 VYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKI-NESSLTPVPPFAGTFDSIYDI 256 (513) Q Consensus 178 ~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~-~~~~~~~ip~f~~v~~d~~di 256 (513) .+.|.+.+. +.. . ..++...-+-|+. ..+.+.|++|...+-..+--. T Consensus 145 ~~~y~~~~~---~g~-~----------------------------~~~~~~evih~~~~~~d~~~G~s~i~~~~~~i~~~ 192 (414) T protein:vir:44 145 EPVYQVTFP---DGS-T----------------------------DVLSQEDIWHVRTLTLDGLVGLNPIAYAREAISLA 192 (414) T ss_pred cEEEEEEec---Cce-E----------------------------EEEccccEEEecCCCCCCcccccHHHHHHHHHHHH Confidence 766544321 100 0 1233333333442 234567888877665443322 Q ss_pred HHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhcc--c---cceEEEecccccccccccccc Q lcl|NC_015263. 257 HSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTVP--D---NVGVVTSPMEIDTVSFDKDSS 331 (513) Q Consensus 257 ~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~Lp--~---gv~~v~sP~~~d~i~ld~~~~ 331 (513) ...++.. ..-..|-...-..|.+ + -.++.++++++.+.+.++.- + ++..+-..+++..+.+. .. T Consensus 193 ~~~~~~~--~~~f~ng~~p~gil~~---~----~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~--~~ 261 (414) T protein:vir:44 193 AATEEHG--ARLFSNGAVTSGVLRT---E----QTLSDQAYERLKKDFEERHTGLGNAHRPMILEMGLDWKSMALN--AE 261 (414) T ss_pred HHHHHHH--HHHHhccCCCceEEEe---C----CCCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCC--hH Confidence 2222211 1112221111111222 1 13677788888777776651 1 12233333444444432 22 Q ss_pred cchh---hhhhHHhhhhhhhhhhhhccCC-CcchHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhh--cccceEEEEEec Q lcl|NC_015263. 332 TDDS---VEKATKNFWDNAGVSQILFSSD-NKTSQGIA-MSIATDEQFIFGVINQLERWLNRYLLL--NGMSKYFKATML 404 (513) Q Consensus 332 ~~dt---v~~~~~~i~~~~GiS~~Lfn~d-~~s~~~~~-~SI~~d~~~~~~~~~~iE~~~N~~i~~--~~~~~~f~~~~l 404 (513) +.+. .+-..++|..+.||...++|.. +.+++.+. ..+.--..-+.-++++||..+|+.|-. ...+..|+|..- T Consensus 262 d~~~~e~~~~~~~~Ia~~fgVpp~~l~~~~~~t~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~i~fd~~ 341 (414) T protein:vir:44 262 DSQFLETRKFQLEEICRLFRVPLHMVQNTDRATFNNIEELGLGFINYSLVPYLTRIEQRINTGLVRKSKQGVFYAKFNAG 341 (414) T ss_pred HHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCccccCceEEEEech Confidence 2233 3344477999999998888643 22333322 222233333446889999999998743 223445677666 Q ss_pred CCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccCC Q lcl|NC_015263. 405 EVTHFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGK 484 (513) Q Consensus 405 ~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~ 484 (513) +...-+.++..+.+.++.+-|.=..--.-+.+|+.|.+ +-|..++|...+-. +.+ . . T Consensus 342 ~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~------------ggD~~~~~~n~~~~--~~~------~---~ 398 (414) T protein:vir:44 342 ALLRGDMKSRFEAYATGINWGIYSPNDCRDLEDMNPRP------------GGDVYLTPMNMTTK--PSD------G---S 398 (414) T ss_pred hhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC------------Ccceeccccccccc--CCc------c---c Confidence 66667888999988888777743222233345666631 33445555432111 100 0 0 Q ss_pred CCcCCCCcccccccCC Q lcl|NC_015263. 485 ENGRPTNETTGNKDSD 500 (513) Q Consensus 485 ~~grPt~et~~n~~~~ 500 (513) ..|.++.++.+...+. T Consensus 399 ~~~~~~~~~~~d~~~~ 414 (414) T protein:vir:44 399 KAGKQKDNANADETTS 414 (414) T ss_pred cCCCCCCCCCCCCCCC Confidence 1111111111111111 No 22 >protein:vir:7853 Length: 518 # NCBI annotation: gp10 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817460;genbank:gi:29565889;genbank:GeneID:1259085 Probab=98.73 E-value=8e-08 Score=59.49 Aligned_cols=422 Identities=12% Similarity=0.072 Sum_probs=196.5 Q ss_pred ccccccccccchH-HHHHHHhhhcc---ChhHH--HHHHHHHHHHHhhcchHHHHHHHHhh-cccccceEeeccchhhhh Q lcl|NC_015263. 39 VFGAPVGSLTSSQ-SKVRKIVKEYR---NEGNQ--KTLRKVSEDLAVQSQQYQRLLNFYAN-MPLYAYSVVPFKDISTAN 111 (513) Q Consensus 39 ~~~s~~~s~~~s~-d~~k~~i~~~~---P~~n~--~~ir~~s~~lY~~sg~~~rlidy~~~-mpt~dY~I~P~~~~~~~~ 111 (513) +.-++=++.++.. .....++.+.+ |.... .....+..-.|.+++.+.+.|+.++. +..+...++-... T Consensus 1 ~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~V~acV~~IA~~iA~lp~~l~~~~~----- 75 (518) T protein:vir:78 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSG----- 75 (518) T ss_pred CcccCceeeccchhhhhhhhhhhcccccceeceecccccchhhHHhhhhHHHHHHHHHHHHhhccCceEEEEEcC----- Confidence 2223333322221 11123344322 22211 11223333356778888888888764 2333334431111 Q ss_pred hcchhHHHHHHHHHHhhcC----hhHHHHHHHHHHHHhcceeEEEEEcCcc--eeeeecCcceeEEEEEE-CCeeEEEEE Q lcl|NC_015263. 112 ENKLKKELATVTEFLSRLN----PKYNFSKIVKLAMTVDIFYGYVIDDKES--VMIQQFPNDICKISSVS-GGVYNYVID 184 (513) Q Consensus 112 ~~~~~~~y~~v~~~L~k~n----~k~~~~~i~~~~l~~g~~~gy~i~d~~~--~~iq~lp~dyckIsg~~-nG~y~~~fD 184 (513) ++.......-...++.+=| .-.+...++..++..|..|.+++-+..+ +-+.++|+++|.|.--. +|.+.|.+. T Consensus 76 ~~~~~~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~G~~~~L~~l~p~~Vtv~~~~~~~~~~y~~~ 155 (518) T protein:vir:78 76 DTETEEHDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQ 155 (518) T ss_pred CccccccchHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEECCCceEEEEcCCCCEEEEEEE Confidence 1111122222334455544 3455677777888899999999977654 67899999999988553 355555553 Q ss_pred eeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEEe-cCccc-cchhhHHHHHHhHHHHHHHHHH Q lcl|NC_015263. 185 LDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKI-NESSL-TPVPPFAGTFDSIYDIHSFKDL 262 (513) Q Consensus 185 ~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~-~~~~~-~~ip~f~~v~~d~~di~~~kdL 262 (513) ..- . ...+-+.++...-+.|+. +.+.. .|++|...+...+--.....+. T Consensus 156 ~~~--~---------------------------~~~~~~~~~~~eIiHir~~~~dg~~~G~Spi~~~~~~i~~~~aa~~~ 206 (518) T protein:vir:78 156 AGA--G---------------------------VGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNA 206 (518) T ss_pred ecC--C---------------------------ccceeEEecCCcEEEecCCCCCcccccccHHHHHHHHHHHHHHHHHH Confidence 221 0 001123344444555553 22333 4778877666544433333332 Q ss_pred HhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhcc--c---cceEEEecccccccccccccccchh-- Q lcl|NC_015263. 263 RNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTVP--D---NVGVVTSPMEIDTVSFDKDSSTDDS-- 335 (513) Q Consensus 263 ~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~Lp--~---gv~~v~sP~~~d~i~ld~~~~~~dt-- 335 (513) ...-..|-...-..|-+ + -.++.++++++.+.+.+..- + ++..+-..+++..+.++ ..+.+. T Consensus 207 --~~~~f~Ng~~p~gvl~~---~----~~ls~e~~~~~k~~~~~~~~G~~nag~~~vL~~G~~~~~l~~~--~~d~q~le 275 (518) T protein:vir:78 207 --TAAMWKNAGRPNLVLRH---E----KRLSPEAQQRLREQFDRAHAGSSNTGKTMVVEEGMEPIPLQLT--AVEMQFIE 275 (518) T ss_pred --HHHHHhcCCCccEEEec---C----CCCCHHHHHHHHHHHHHHhcCcccCCceeEcCCCceEEeccCC--hhHHHHHH Confidence 22223332222112211 1 13667777777777777662 2 23333344455555443 222233 Q ss_pred -hhhhHHhhhhhhhhhhhhccCC-CcchHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhhc-ccceEEEEEecCCCCccH Q lcl|NC_015263. 336 -VEKATKNFWDNAGVSQILFSSD-NKTSQG-IAMSIATDEQFIFGVINQLERWLNRYLLLN-GMSKYFKATMLEVTHFSK 411 (513) Q Consensus 336 -v~~~~~~i~~~~GiS~~Lfn~d-~~s~~~-~~~SI~~d~~~~~~~~~~iE~~~N~~i~~~-~~~~~f~~~~l~~T~fn~ 411 (513) ..-..++|..+.||...++|-. +.+++. -...+.--..-+.-++.+||..+|+.|... .....|+|..-+....+. T Consensus 276 ~r~~~~~eIa~afgVPp~~lg~~~~st~sn~e~~~~~f~~~tL~P~~~~ie~eln~~L~~~~~~~~~~~fd~~~Llr~D~ 355 (518) T protein:vir:78 276 ARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDW 355 (518) T ss_pred HHHHHHHHHHHHhCCCHHHhccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcceEEeechhhhccCH Confidence 3345588999999998888632 222222 222222333334468999999999988543 223456665556666788 Q ss_pred HHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccCCCCcCCCC Q lcl|NC_015263. 412 KEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTN 491 (513) Q Consensus 412 ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~ 491 (513) ++.++.+.++.+-|+-..--.-..+|+.|.+--. -+.+=+...+.|+...- .|...++.++++.. .+..|.. T Consensus 356 ~~r~~~~~~~~~~G~lT~NE~R~~~gl~pie~~~-----gD~~~v~~n~~pl~~~~--~~~~~g~~~~~~~~-~~~~~~~ 427 (518) T protein:vir:78 356 EAKSESTQKMVNSGVATPNEGREIMGLPRSDDPK-----ADELYANSALQPLGATP--DGAVEGEEAPAPKR-PASTPVA 427 (518) T ss_pred HHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC-----Cceeeecccceeccccc--ccccCCCCCCCCCC-CCccccc Confidence 8888888888777743333333345666543100 01111222334432210 00000011111100 0111111 Q ss_pred cccccc----------------cCCCC------CCCCCCccCCC Q lcl|NC_015263. 492 ETTGNK----------------DSDET------QRAKDKPANTQ 513 (513) Q Consensus 492 et~~n~----------------~~~~~------~~~~d~~~~~~ 513 (513) +..+.. ++++. ..|.+++.-.+ T Consensus 428 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 471 (518) T protein:vir:78 428 SLDQSPPASVPGLSPTNSDRSTDSGKTEPRRLMQKPPPKESSPK 471 (518) T ss_pred ccccCccccCCCCCcccccccccccccchhcccCCCCcccccch Confidence 110000 00000 01111111111 No 23 >protein:vir:95599 Length: 563 # NCBI annotation: ORF014 # Family: family:all:2446 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240900;genbank:gi:66394963;genbank:GeneID:5132540 Probab=98.72 E-value=8.2e-08 Score=59.42 Aligned_cols=451 Identities=10% Similarity=0.035 Sum_probs=186.5 Q ss_pred CCCccchheeeeeh-hhhhhHHHHHHHH----HHHHHhhccCcccccccccccchHHHHHH-----Hhhhcc-ChhHHHH Q lcl|NC_015263. 1 MVKNKKKRLSMIDV-ESISSYSNKRNNR----ISILRDDNRTPVFGAPVGSLTSSQSKVRK-----IVKEYR-NEGNQKT 69 (513) Q Consensus 1 ~~~~~~~~~~~~~~-~~~~~~~~~~~~~----~~i~~~~~~~~~~~s~~~s~~~s~d~~k~-----~i~~~~-P~~n~~~ 69 (513) -+.++|- +.-+-| +.+..-.+.+... -.|-+-++....-. ++.+++. .+..-. +-.+-.. T Consensus 13 ~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~--------~~~~~~~~~~~~~~~~~~~~~~~~~~ 83 (563) T protein:vir:95 13 DYGNNST-IAQVPIDEGLQANIKKIEQDNKEYQDLTKSLYGQQQAY--------AEPFIEMMDTNPEFRDKRSYMKNEHN 83 (563) T ss_pred ccccccc-cceeeccCChhhhHhhhhccchhHHHHHhhhccCCCcc--------hhhhHhhhcccccccccccCCCCccc Confidence 0111111 112222 1122222222221 13333332222100 1111110 000000 1122222 Q ss_pred HHHHHHHHHhhcchHHHHHHHHhhc-ccccceE--------eeccchhhhhh--cchhHHHHHHHHHHhhcCh------- Q lcl|NC_015263. 70 LRKVSEDLAVQSQQYQRLLNFYANM-PLYAYSV--------VPFKDISTANE--NKLKKELATVTEFLSRLNP------- 131 (513) Q Consensus 70 ir~~s~~lY~~sg~~~rlidy~~~m-pt~dY~I--------~P~~~~~~~~~--~~~~~~y~~v~~~L~k~n~------- 131 (513) |+++++- +..|.+++++|+.+++- ..|-..+ .|+...+...+ ..-...-..+..+|..++. T Consensus 84 l~~~l~~-~~~n~i~~~~I~t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~ 162 (563) T protein:vir:95 84 LHDVLKK-FGNNPILNAIILTRSNQVAMYCQPARYSEKGLGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTGKDKDVDRD 162 (563) T ss_pred HHHHHHH-hhcchHHHHHHHHHHHHHHHHhhhhhhhcccccceeEEeecCCCcchhhhhhhHHHHHHhhhcCCCCCCCcc Confidence 3333332 23478888888876642 2221111 01111111111 1111222222234443322 Q ss_pred --hHHHHHHHHHHHHhcceeEEEEE--c--CcceeeeecCcceeEEEEEECCee-EEEEEeeeccCcchhccccHHHHHH Q lcl|NC_015263. 132 --KYNFSKIVKLAMTVDIFYGYVID--D--KESVMIQQFPNDICKISSVSGGVY-NYVIDLDALVSADIVDYYPKEIQEA 204 (513) Q Consensus 132 --k~~~~~i~~~~l~~g~~~gy~i~--d--~~~~~iq~lp~dyckIsg~~nG~y-~~~fD~syFd~~~~L~~~p~Ei~~~ 204 (513) ..++..++.+++..|..|.|.+- + +..+-+.++|+.+|+|.--.+|.. .......++...... T Consensus 163 t~~~f~~~lv~~lll~Gn~~~~~~~~rd~~G~~~~L~pl~p~~V~v~~~~~g~~~~~~~~y~~~~~g~~~---------- 232 (563) T protein:vir:95 163 SFQTFCKKIVRDTYIYDQVNFEKVFNKNNKTKLEKFIAVDPSTIFYATDKKGKIIKGGKRFVQVVDKRVV---------- 232 (563) T ss_pred hHHHHHHHHHHHHHhcCCeEEEEEEEecCCCceEEEEEeCCceeEEEECCCCceeccceeEEEEeCCcee---------- Confidence 35667788888999999998763 3 445689999999999986555532 111111111111111 Q ss_pred HHHHhhhhhccCcccccCeeecCCceEEEEecC--c---cccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeee Q lcl|NC_015263. 205 VNKYTTMKKGNNKSASNWYEIQDKNSICIKINE--S---SLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKL 279 (513) Q Consensus 205 y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~~~--~---~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~ki 279 (513) ..++..--+.+.-+. + ..+|+||...+...+--....++.-.. -..|-...-.-| T Consensus 233 ------------------~~~~~~evI~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~--~f~ng~~p~giL 292 (563) T protein:vir:95 233 ------------------ASFTSRELAMGIRNPRTELSSSGYGLSEVEIAMKEFIAYNNTESFNDR--FFSHGGTTRGIL 292 (563) T ss_pred ------------------EEecCcceEEEeccCCCCcccCcccchHHHHHHHHHHHHHHHHHHHHH--HHHccCCCceEE Confidence 122222223333221 1 346888888777766555555553311 112211111111 Q ss_pred ccccCCCCCccccCHHHHHHHHHHHHHhc--cccce--EEEec--ccccccccccccccc---hhhhhhHHhhhhhhhhh Q lcl|NC_015263. 280 ETRSSNDNNDFTLDMPMMNYFHEALSMTV--PDNVG--VVTSP--MEIDTVSFDKDSSTD---DSVEKATKNFWDNAGVS 350 (513) Q Consensus 280 p~~~~n~~~~~~vd~~~~~~~~~~ik~~L--p~gv~--~v~sP--~~~d~i~ld~~~~~~---dtv~~~~~~i~~~~GiS 350 (513) -+ .++-.++.++++.+.+.+.++. .++-+ .++.+ +++..+.++ ..+. .+..-..+.|..+.||. T Consensus 293 ~~-----~~~~~ls~e~~~~~~~~~~~~~~G~~nagk~~~vl~~G~~~~~l~~~--~~d~qfle~~~~~~~~Ia~afgVP 365 (563) T protein:vir:95 293 QI-----RSDQQQSQHALENFKREWKSSLSGINGSWQIPVVMADDIKFVNMTPT--ANDMQFEKWLNYLINIISALYGID 365 (563) T ss_pred Ee-----CCCCCCCHHHHHHHHHHHHHHhccccccccceEEcCCCceEEeccCC--hhHHHHHHHHHHHHHHHHHHhCCC Confidence 11 1233477888888888888776 22333 23333 344444432 2222 33444568899999999 Q ss_pred hhhccCCCc------------c-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccceEEEEEecCCCCccHHHHHHH Q lcl|NC_015263. 351 QILFSSDNK------------T-SQGIAMSIATDEQFIFGVINQLERWLNRYLLLNGMSKYFKATMLEVTHFSKKEAHDR 417 (513) Q Consensus 351 ~~Lfn~d~~------------s-~~~~~~SI~~d~~~~~~~~~~iE~~~N~~i~~~~~~~~f~~~~l~~T~fn~ke~~~~ 417 (513) ..++|-... + ++.-...+.--..-+.-++.+||..+|+.|-.. ....|++.|++...-+|.+..+. T Consensus 366 p~~lG~~~~~~~~~~~~~ss~~~sn~e~~~~~f~~~tL~P~l~~ie~~ln~~L~~~-~~~~~~~~f~r~D~~~~~e~~~~ 444 (563) T protein:vir:95 366 PAEIGFPNRGGATGSKGGSTLNEADPGKKQQQSQNKGLQPLLRFIEDLVNRHIISE-YGDKYTFQFVGGDTKSATDKLNI 444 (563) T ss_pred HHHccccccccccccccccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHhhhchh-cccccEEEeccCCHHHHHHHHHH Confidence 988862211 1 111112222233334468899999999977543 23457788887765555555433 Q ss_pred HHHHHhcCC-cHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccc-----------cCCC Q lcl|NC_015263. 418 YITDAQYGF-PVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKE-----------KGKE 485 (513) Q Consensus 418 ~~~~~~~G~-~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~-----------~~~~ 485 (513) .++..-|+ .+=..-+ .+|+.|.+ |-|..+.|+-+.-+..+...+.....+ ...+ T Consensus 445 -~~~~~~G~lT~NE~R~-~~gl~Pi~------------gGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (563) T protein:vir:95 445 -LKLETQIFKTVNEARE-EQGKKPIE------------GGDIILDASFLQGTAQLQQDKQYNDGKQKERLQMMMSLLEGD 510 (563) T ss_pred -HHHhcCCccCHHHHHH-HhCCCCCC------------CcceeecccccccccccccccCCCccccchhhhhcccccCCC Confidence 22223332 2222222 34555432 224445554332221111100000000 0001 Q ss_pred CcCCCCccc-----ccccCCCCCCCCCCc--cCCC Q lcl|NC_015263. 486 NGRPTNETT-----GNKDSDETQRAKDKP--ANTQ 513 (513) Q Consensus 486 ~grPt~et~-----~n~~~~~~~~~~d~~--~~~~ 513 (513) .+.|..+.. ...+.|+.+.+++-+ ...| T Consensus 511 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 545 (563) T protein:vir:95 511 NDDSEEGQSTDSSNDDKEIGTDAQIKGDDNVYRTQ 545 (563) T ss_pred CCCCCCCCCCCCCCCcccccccccccccccccccc Confidence 111211111 111222222222111 1111 No 24 >protein:vir:99312 Length: 563 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024471;genbank:gi:48696430;genbank:GeneID:2948040 Probab=98.72 E-value=8.2e-08 Score=59.42 Aligned_cols=451 Identities=10% Similarity=0.035 Sum_probs=186.5 Q ss_pred CCCccchheeeeeh-hhhhhHHHHHHHH----HHHHHhhccCcccccccccccchHHHHHH-----Hhhhcc-ChhHHHH Q lcl|NC_015263. 1 MVKNKKKRLSMIDV-ESISSYSNKRNNR----ISILRDDNRTPVFGAPVGSLTSSQSKVRK-----IVKEYR-NEGNQKT 69 (513) Q Consensus 1 ~~~~~~~~~~~~~~-~~~~~~~~~~~~~----~~i~~~~~~~~~~~s~~~s~~~s~d~~k~-----~i~~~~-P~~n~~~ 69 (513) -+.++|- +.-+-| +.+..-.+.+... -.|-+-++....-. ++.+++. .+..-. +-.+-.. T Consensus 13 ~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~--------~~~~~~~~~~~~~~~~~~~~~~~~~~ 83 (563) T protein:vir:99 13 DYGNNST-IAQVPIDEGLQANIKKIEQDNKEYQDLTKSLYGQQQAY--------AEPFIEMMDTNPEFRDKRSYMKNEHN 83 (563) T ss_pred ccccccc-cceeeccCChhhhHhhhhccchhHHHHHhhhccCCCcc--------hhhhHhhhcccccccccccCCCCccc Confidence 0111111 112222 1122222222221 13333332222100 1111110 000000 1122222 Q ss_pred HHHHHHHHHhhcchHHHHHHHHhhc-ccccceE--------eeccchhhhhh--cchhHHHHHHHHHHhhcCh------- Q lcl|NC_015263. 70 LRKVSEDLAVQSQQYQRLLNFYANM-PLYAYSV--------VPFKDISTANE--NKLKKELATVTEFLSRLNP------- 131 (513) Q Consensus 70 ir~~s~~lY~~sg~~~rlidy~~~m-pt~dY~I--------~P~~~~~~~~~--~~~~~~y~~v~~~L~k~n~------- 131 (513) |+++++- +..|.+++++|+.+++- ..|-..+ .|+...+...+ ..-...-..+..+|..++. T Consensus 84 l~~~l~~-~~~n~i~~~~I~t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~ 162 (563) T protein:vir:99 84 LHDVLKK-FGNNPILNAIILTRSNQVAMYCQPARYSEKGLGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTGKDKDVDRD 162 (563) T ss_pred HHHHHHH-hhcchHHHHHHHHHHHHHHHHhhhhhhhcccccceeEEeecCCCcchhhhhhhHHHHHHhhhcCCCCCCCcc Confidence 3333332 23478888888876642 2221111 01111111111 1111222222234443322 Q ss_pred --hHHHHHHHHHHHHhcceeEEEEE--c--CcceeeeecCcceeEEEEEECCee-EEEEEeeeccCcchhccccHHHHHH Q lcl|NC_015263. 132 --KYNFSKIVKLAMTVDIFYGYVID--D--KESVMIQQFPNDICKISSVSGGVY-NYVIDLDALVSADIVDYYPKEIQEA 204 (513) Q Consensus 132 --k~~~~~i~~~~l~~g~~~gy~i~--d--~~~~~iq~lp~dyckIsg~~nG~y-~~~fD~syFd~~~~L~~~p~Ei~~~ 204 (513) ..++..++.+++..|..|.|.+- + +..+-+.++|+.+|+|.--.+|.. .......++...... T Consensus 163 t~~~f~~~lv~~lll~Gn~~~~~~~~rd~~G~~~~L~pl~p~~V~v~~~~~g~~~~~~~~y~~~~~g~~~---------- 232 (563) T protein:vir:99 163 SFQTFCKKIVRDTYIYDQVNFEKVFNKNNKTKLEKFIAVDPSTIFYATDKKGKIIKGGKRFVQVVDKRVV---------- 232 (563) T ss_pred hHHHHHHHHHHHHHhcCCeEEEEEEEecCCCceEEEEEeCCceeEEEECCCCceeccceeEEEEeCCcee---------- Confidence 35667788888999999998763 3 445689999999999986555532 111111111111111 Q ss_pred HHHHhhhhhccCcccccCeeecCCceEEEEecC--c---cccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeee Q lcl|NC_015263. 205 VNKYTTMKKGNNKSASNWYEIQDKNSICIKINE--S---SLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKL 279 (513) Q Consensus 205 y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~~~--~---~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~ki 279 (513) ..++..--+.+.-+. + ..+|+||...+...+--....++.-.. -..|-...-.-| T Consensus 233 ------------------~~~~~~evI~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~--~f~ng~~p~giL 292 (563) T protein:vir:99 233 ------------------ASFTSRELAMGIRNPRTELSSSGYGLSEVEIAMKEFIAYNNTESFNDR--FFSHGGTTRGIL 292 (563) T ss_pred ------------------EEecCcceEEEeccCCCCcccCcccchHHHHHHHHHHHHHHHHHHHHH--HHHccCCCceEE Confidence 122222223333221 1 346888888777766555555553311 112211111111 Q ss_pred ccccCCCCCccccCHHHHHHHHHHHHHhc--cccce--EEEec--ccccccccccccccc---hhhhhhHHhhhhhhhhh Q lcl|NC_015263. 280 ETRSSNDNNDFTLDMPMMNYFHEALSMTV--PDNVG--VVTSP--MEIDTVSFDKDSSTD---DSVEKATKNFWDNAGVS 350 (513) Q Consensus 280 p~~~~n~~~~~~vd~~~~~~~~~~ik~~L--p~gv~--~v~sP--~~~d~i~ld~~~~~~---dtv~~~~~~i~~~~GiS 350 (513) -+ .++-.++.++++.+.+.+.++. .++-+ .++.+ +++..+.++ ..+. .+..-..+.|..+.||. T Consensus 293 ~~-----~~~~~ls~e~~~~~~~~~~~~~~G~~nagk~~~vl~~G~~~~~l~~~--~~d~qfle~~~~~~~~Ia~afgVP 365 (563) T protein:vir:99 293 QI-----RSDQQQSQHALENFKREWKSSLSGINGSWQIPVVMADDIKFVNMTPT--ANDMQFEKWLNYLINIISALYGID 365 (563) T ss_pred Ee-----CCCCCCCHHHHHHHHHHHHHHhccccccccceEEcCCCceEEeccCC--hhHHHHHHHHHHHHHHHHHHhCCC Confidence 11 1233477888888888888776 22333 23333 344444432 2222 33444568899999999 Q ss_pred hhhccCCCc------------c-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccceEEEEEecCCCCccHHHHHHH Q lcl|NC_015263. 351 QILFSSDNK------------T-SQGIAMSIATDEQFIFGVINQLERWLNRYLLLNGMSKYFKATMLEVTHFSKKEAHDR 417 (513) Q Consensus 351 ~~Lfn~d~~------------s-~~~~~~SI~~d~~~~~~~~~~iE~~~N~~i~~~~~~~~f~~~~l~~T~fn~ke~~~~ 417 (513) ..++|-... + ++.-...+.--..-+.-++.+||..+|+.|-.. ....|++.|++...-+|.+..+. T Consensus 366 p~~lG~~~~~~~~~~~~~ss~~~sn~e~~~~~f~~~tL~P~l~~ie~~ln~~L~~~-~~~~~~~~f~r~D~~~~~e~~~~ 444 (563) T protein:vir:99 366 PAEIGFPNRGGATGSKGGSTLNEADPGKKQQQSQNKGLQPLLRFIEDLVNRHIISE-YGDKYTFQFVGGDTKSATDKLNI 444 (563) T ss_pred HHHccccccccccccccccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHhhhchh-cccccEEEeccCCHHHHHHHHHH Confidence 988862211 1 111112222233334468899999999977543 23457788887765555555433 Q ss_pred HHHHHhcCC-cHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccc-----------cCCC Q lcl|NC_015263. 418 YITDAQYGF-PVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKE-----------KGKE 485 (513) Q Consensus 418 ~~~~~~~G~-~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~-----------~~~~ 485 (513) .++..-|+ .+=..-+ .+|+.|.+ |-|..+.|+-+.-+..+...+.....+ ...+ T Consensus 445 -~~~~~~G~lT~NE~R~-~~gl~Pi~------------gGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (563) T protein:vir:99 445 -LKLETQIFKTVNEARE-EQGKKPIE------------GGDIILDASFLQGTAQLQQDKQYNDGKQKERLQMMMSLLEGD 510 (563) T ss_pred -HHHhcCCccCHHHHHH-HhCCCCCC------------CcceeecccccccccccccccCCCccccchhhhhcccccCCC Confidence 22223332 2222222 34555432 224445554332221111100000000 0001 Q ss_pred CcCCCCccc-----ccccCCCCCCCCCCc--cCCC Q lcl|NC_015263. 486 NGRPTNETT-----GNKDSDETQRAKDKP--ANTQ 513 (513) Q Consensus 486 ~grPt~et~-----~n~~~~~~~~~~d~~--~~~~ 513 (513) .+.|..+.. ...+.|+.+.+++-+ ...| T Consensus 511 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 545 (563) T protein:vir:99 511 NDDSEEGQSTDSSNDDKEIGTDAQIKGDDNVYRTQ 545 (563) T ss_pred CCCCCCCCCCCCCCCcccccccccccccccccccc Confidence 111211111 111222222222111 1111 No 25 >protein:vir:107605 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338186;genbank:gi:77020175;genbank:GeneID:3703736 Probab=98.69 E-value=1e-07 Score=58.86 Aligned_cols=400 Identities=11% Similarity=0.057 Sum_probs=192.7 Q ss_pred HHHHHhhccCccccccccc--ccc---------hHHHHHHHhhhccChhHHHHHHHHHHHHHhhcchHHHHHHHHhh-cc Q lcl|NC_015263. 28 ISILRDDNRTPVFGAPVGS--LTS---------SQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQQYQRLLNFYAN-MP 95 (513) Q Consensus 28 ~~i~~~~~~~~~~~s~~~s--~~~---------s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~~~rlidy~~~-mp 95 (513) -.|+.-+. ++|+ +.. .....-+|+-.. |.. ..++.--...++.+.+.|+.+++ +. T Consensus 1 M~~~~r~~-------~~~~~~~r~~~~~~~~~~~~~~~~~~~g~~-~~~-----~~v~~~~al~~~~v~~~i~~ia~~ia 67 (432) T protein:vir:10 1 MKIVDSVK-------KFFNFEKRQTSQVIELNKDDEKLLEWLGIS-PST-----ISVKGKNALKVATVFACIKILSESVS 67 (432) T ss_pred CChHHHHH-------HhcCccccCcccccccCCchHHHHHHhCCC-cCc-----cccchhhhhccHHHHHHHHHHHHhhc Confidence 22222221 1111 010 011111111110 110 11222223446778888888876 22 Q ss_pred cccceEeeccchhhhhhcchhHHHHHHHHHHhh-----cChhHHHHHHHHHHHHhcceeEEEEEcCcc--eeeeecCcce Q lcl|NC_015263. 96 LYAYSVVPFKDISTANENKLKKELATVTEFLSR-----LNPKYNFSKIVKLAMTVDIFYGYVIDDKES--VMIQQFPNDI 168 (513) Q Consensus 96 t~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k-----~n~k~~~~~i~~~~l~~g~~~gy~i~d~~~--~~iq~lp~dy 168 (513) .+...++ .. ...+..+..-+.+...|.. +.-..+...++..++..|..|.++.-+..+ +-+.++|++. T Consensus 68 ~lp~~~~----~~-~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~ 142 (432) T protein:vir:10 68 KLPLKIY----QE-DEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASK 142 (432) T ss_pred cCceEEE----Ee-cCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCce Confidence 3444443 11 0111111111223344432 335677788888899999999999876544 6788999999 Q ss_pred eEEEEEECCeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEEec--CccccchhhH Q lcl|NC_015263. 169 CKISSVSGGVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKIN--ESSLTPVPPF 246 (513) Q Consensus 169 ckIsg~~nG~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~~--~~~~~~ip~f 246 (513) |.+.--++|...+.....|+... ......+++..-+-|+.+ .+...|+||. T Consensus 143 v~v~~d~~~~~~~~~~~~y~~~~---------------------------~g~~~~~~~~eiih~r~~~~~~~~~G~s~~ 195 (432) T protein:vir:10 143 VTVYIDDVGLLNSKTKMWYVVNT---------------------------GGQQRVLKPEEILHFKNGITLDGLVGVPTM 195 (432) T ss_pred eEEEEcCcccccccceEEEEEec---------------------------CCeEEEEccccEEEecCCCCCCCcccccHH Confidence 99874343433322222222100 012234556555556632 3445688888 Q ss_pred HHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhcc-----ccceEEEecccc Q lcl|NC_015263. 247 AGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTVP-----DNVGVVTSPMEI 321 (513) Q Consensus 247 ~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~Lp-----~gv~~v~sP~~~ 321 (513) ..+...+--.....+.... -..|-...-..|-+ + + .++.++++++.+.++...- .++..+-..+++ T Consensus 196 ~~~~~~i~~~~~~~~~~~~--~~~ng~~p~gil~~---~--~--~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~ 266 (432) T protein:vir:10 196 EYLKSTLENSASADKFINN--FYKQGLQVKGLVQY---V--G--DLNEDAKKVFRENFESMSSGLQNSHRIALMPVGYQF 266 (432) T ss_pred HHHHHHHHHHHHHHHHHHH--HHhccCCccEEEEc---C--C--CCCHHHHHHHHHHHHHHhcccccCCcceecCCCceE Confidence 7776554444444432211 12221111111111 1 1 3667777777777776551 233444444455 Q ss_pred ccccccc-ccccchhhhhhHHhhhhhhhhhhhhccCC-CcchHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhh--cc-c Q lcl|NC_015263. 322 DTVSFDK-DSSTDDSVEKATKNFWDNAGVSQILFSSD-NKTSQGI-AMSIATDEQFIFGVINQLERWLNRYLLL--NG-M 395 (513) Q Consensus 322 d~i~ld~-~~~~~dtv~~~~~~i~~~~GiS~~Lfn~d-~~s~~~~-~~SI~~d~~~~~~~~~~iE~~~N~~i~~--~~-~ 395 (513) ..+.+.. +..--++.+-..++|..+.||...++|.. +++++.+ ...+.--..-+.-++.+||..+|++|-. .. . T Consensus 267 ~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~~~~~~~~~~l~P~~~~ie~~ln~kLl~~~~~~~ 346 (432) T protein:vir:10 267 QPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQFYTDTLQATLTMYEQEMTYKLFLDSELDK 346 (432) T ss_pred EEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcChhhcCC Confidence 5555432 11222344455688999999999888632 2223322 2232233334446999999999997742 21 2 Q ss_pred ceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCcccccccccccccc Q lcl|NC_015263. 396 SKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIA 475 (513) Q Consensus 396 ~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~ 475 (513) +..|+|.+-....-+.++.++.+.++..-|+=..--+-+.+|+.|. | |-|..++|+. ...-.. T Consensus 347 g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi--------~----ggD~~~~~~n---~~~~~~-- 409 (432) T protein:vir:10 347 GFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPE--------A----GGDRLLVNGN---MLPIDM-- 409 (432) T ss_pred CcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--------C----CCCeEeeccc---ccchhh-- Confidence 3356666666667788899999888887774333333334566652 1 2333444422 111110 Q ss_pred cCCccccCCCCcCCCCcccccccCCCCC Q lcl|NC_015263. 476 ENAIKEKGKENGRPTNETTGNKDSDETQ 503 (513) Q Consensus 476 ~~~~~~~~~~~grPt~et~~n~~~~~~~ 503 (513) .++.. ..+|. + ....+++.++++ T Consensus 410 --~~~~~-~k~~~-~-~~~~~~~~~~~~ 432 (432) T protein:vir:10 410 --AGQAY-LKGGD-T-NGEVSKEGNEGN 432 (432) T ss_pred --ccccc-cCCCC-C-CCCCCCCCCCCC Confidence 11100 01111 1 112222333333 No 26 >protein:vir:102855 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338135;genbank:gi:77020228;genbank:GeneID:3703764 Probab=98.69 E-value=1e-07 Score=58.86 Aligned_cols=400 Identities=11% Similarity=0.057 Sum_probs=192.7 Q ss_pred HHHHHhhccCccccccccc--ccc---------hHHHHHHHhhhccChhHHHHHHHHHHHHHhhcchHHHHHHHHhh-cc Q lcl|NC_015263. 28 ISILRDDNRTPVFGAPVGS--LTS---------SQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQQYQRLLNFYAN-MP 95 (513) Q Consensus 28 ~~i~~~~~~~~~~~s~~~s--~~~---------s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~~~rlidy~~~-mp 95 (513) -.|+.-+. ++|+ +.. .....-+|+-.. |.. ..++.--...++.+.+.|+.+++ +. T Consensus 1 M~~~~r~~-------~~~~~~~r~~~~~~~~~~~~~~~~~~~g~~-~~~-----~~v~~~~al~~~~v~~~i~~ia~~ia 67 (432) T protein:vir:10 1 MKIVDSVK-------KFFNFEKRQTSQVIELNKDDEKLLEWLGIS-PST-----ISVKGKNALKVATVFACIKILSESVS 67 (432) T ss_pred CChHHHHH-------HhcCccccCcccccccCCchHHHHHHhCCC-cCc-----cccchhhhhccHHHHHHHHHHHHhhc Confidence 22222221 1111 010 011111111110 110 11222223446778888888876 22 Q ss_pred cccceEeeccchhhhhhcchhHHHHHHHHHHhh-----cChhHHHHHHHHHHHHhcceeEEEEEcCcc--eeeeecCcce Q lcl|NC_015263. 96 LYAYSVVPFKDISTANENKLKKELATVTEFLSR-----LNPKYNFSKIVKLAMTVDIFYGYVIDDKES--VMIQQFPNDI 168 (513) Q Consensus 96 t~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k-----~n~k~~~~~i~~~~l~~g~~~gy~i~d~~~--~~iq~lp~dy 168 (513) .+...++ .. ...+..+..-+.+...|.. +.-..+...++..++..|..|.++.-+..+ +-+.++|++. T Consensus 68 ~lp~~~~----~~-~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~ 142 (432) T protein:vir:10 68 KLPLKIY----QE-DEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASK 142 (432) T ss_pred cCceEEE----Ee-cCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCce Confidence 3444443 11 0111111111223344432 335677788888899999999999876544 6788999999 Q ss_pred eEEEEEECCeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEEec--CccccchhhH Q lcl|NC_015263. 169 CKISSVSGGVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKIN--ESSLTPVPPF 246 (513) Q Consensus 169 ckIsg~~nG~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~~--~~~~~~ip~f 246 (513) |.+.--++|...+.....|+... ......+++..-+-|+.+ .+...|+||. T Consensus 143 v~v~~d~~~~~~~~~~~~y~~~~---------------------------~g~~~~~~~~eiih~r~~~~~~~~~G~s~~ 195 (432) T protein:vir:10 143 VTVYIDDVGLLNSKTKMWYVVNT---------------------------GGQQRVLKPEEILHFKNGITLDGLVGVPTM 195 (432) T ss_pred eEEEEcCcccccccceEEEEEec---------------------------CCeEEEEccccEEEecCCCCCCCcccccHH Confidence 99874343433322222222100 012234556555556632 3445688888 Q ss_pred HHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhcc-----ccceEEEecccc Q lcl|NC_015263. 247 AGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTVP-----DNVGVVTSPMEI 321 (513) Q Consensus 247 ~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~Lp-----~gv~~v~sP~~~ 321 (513) ..+...+--.....+.... -..|-...-..|-+ + + .++.++++++.+.++...- .++..+-..+++ T Consensus 196 ~~~~~~i~~~~~~~~~~~~--~~~ng~~p~gil~~---~--~--~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~ 266 (432) T protein:vir:10 196 EYLKSTLENSASADKFINN--FYKQGLQVKGLVQY---V--G--DLNEDAKKVFRENFESMSSGLQNSHRIALMPVGYQF 266 (432) T ss_pred HHHHHHHHHHHHHHHHHHH--HHhccCCccEEEEc---C--C--CCCHHHHHHHHHHHHHHhcccccCCcceecCCCceE Confidence 7776554444444432211 12221111111111 1 1 3667777777777776551 233444444455 Q ss_pred ccccccc-ccccchhhhhhHHhhhhhhhhhhhhccCC-CcchHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhh--cc-c Q lcl|NC_015263. 322 DTVSFDK-DSSTDDSVEKATKNFWDNAGVSQILFSSD-NKTSQGI-AMSIATDEQFIFGVINQLERWLNRYLLL--NG-M 395 (513) Q Consensus 322 d~i~ld~-~~~~~dtv~~~~~~i~~~~GiS~~Lfn~d-~~s~~~~-~~SI~~d~~~~~~~~~~iE~~~N~~i~~--~~-~ 395 (513) ..+.+.. +..--++.+-..++|..+.||...++|.. +++++.+ ...+.--..-+.-++.+||..+|++|-. .. . T Consensus 267 ~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~~~~~~~~~~l~P~~~~ie~~ln~kLl~~~~~~~ 346 (432) T protein:vir:10 267 QPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQFYTDTLQATLTMYEQEMTYKLFLDSELDK 346 (432) T ss_pred EEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcChhhcCC Confidence 5555432 11222344455688999999999888632 2223322 2232233334446999999999997742 21 2 Q ss_pred ceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCcccccccccccccc Q lcl|NC_015263. 396 SKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIA 475 (513) Q Consensus 396 ~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~ 475 (513) +..|+|.+-....-+.++.++.+.++..-|+=..--+-+.+|+.|. | |-|..++|+. ...-.. T Consensus 347 g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi--------~----ggD~~~~~~n---~~~~~~-- 409 (432) T protein:vir:10 347 GFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPE--------A----GGDRLLVNGN---MLPIDM-- 409 (432) T ss_pred CcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--------C----CCCeEeeccc---ccchhh-- Confidence 3356666666667788899999888887774333333334566652 1 2333444422 111110 Q ss_pred cCCccccCCCCcCCCCcccccccCCCCC Q lcl|NC_015263. 476 ENAIKEKGKENGRPTNETTGNKDSDETQ 503 (513) Q Consensus 476 ~~~~~~~~~~~grPt~et~~n~~~~~~~ 503 (513) .++.. ..+|. + ....+++.++++ T Consensus 410 --~~~~~-~k~~~-~-~~~~~~~~~~~~ 432 (432) T protein:vir:10 410 --AGQAY-LKGGD-T-NGEVSKEGNEGN 432 (432) T ss_pred --ccccc-cCCCC-C-CCCCCCCCCCCC Confidence 11100 01111 1 112222333333 No 27 >protein:vir:105002 Length: 432 # NCBI annotation: putative phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459967;genbank:gi:85701382;genbank:GeneID:3882143 Probab=98.69 E-value=1e-07 Score=58.86 Aligned_cols=400 Identities=11% Similarity=0.057 Sum_probs=192.7 Q ss_pred HHHHHhhccCccccccccc--ccc---------hHHHHHHHhhhccChhHHHHHHHHHHHHHhhcchHHHHHHHHhh-cc Q lcl|NC_015263. 28 ISILRDDNRTPVFGAPVGS--LTS---------SQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQQYQRLLNFYAN-MP 95 (513) Q Consensus 28 ~~i~~~~~~~~~~~s~~~s--~~~---------s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~~~rlidy~~~-mp 95 (513) -.|+.-+. ++|+ +.. .....-+|+-.. |.. ..++.--...++.+.+.|+.+++ +. T Consensus 1 M~~~~r~~-------~~~~~~~r~~~~~~~~~~~~~~~~~~~g~~-~~~-----~~v~~~~al~~~~v~~~i~~ia~~ia 67 (432) T protein:vir:10 1 MKIVDSVK-------KFFNFEKRQTSQVIELNKDDEKLLEWLGIS-PST-----ISVKGKNALKVATVFACIKILSESVS 67 (432) T ss_pred CChHHHHH-------HhcCccccCcccccccCCchHHHHHHhCCC-cCc-----cccchhhhhccHHHHHHHHHHHHhhc Confidence 22222221 1111 010 011111111110 110 11222223446778888888876 22 Q ss_pred cccceEeeccchhhhhhcchhHHHHHHHHHHhh-----cChhHHHHHHHHHHHHhcceeEEEEEcCcc--eeeeecCcce Q lcl|NC_015263. 96 LYAYSVVPFKDISTANENKLKKELATVTEFLSR-----LNPKYNFSKIVKLAMTVDIFYGYVIDDKES--VMIQQFPNDI 168 (513) Q Consensus 96 t~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k-----~n~k~~~~~i~~~~l~~g~~~gy~i~d~~~--~~iq~lp~dy 168 (513) .+...++ .. ...+..+..-+.+...|.. +.-..+...++..++..|..|.++.-+..+ +-+.++|++. T Consensus 68 ~lp~~~~----~~-~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~ 142 (432) T protein:vir:10 68 KLPLKIY----QE-DEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASK 142 (432) T ss_pred cCceEEE----Ee-cCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCce Confidence 3444443 11 0111111111223344432 335677788888899999999999876544 6788999999 Q ss_pred eEEEEEECCeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEEec--CccccchhhH Q lcl|NC_015263. 169 CKISSVSGGVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKIN--ESSLTPVPPF 246 (513) Q Consensus 169 ckIsg~~nG~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~~--~~~~~~ip~f 246 (513) |.+.--++|...+.....|+... ......+++..-+-|+.+ .+...|+||. T Consensus 143 v~v~~d~~~~~~~~~~~~y~~~~---------------------------~g~~~~~~~~eiih~r~~~~~~~~~G~s~~ 195 (432) T protein:vir:10 143 VTVYIDDVGLLNSKTKMWYVVNT---------------------------GGQQRVLKPEEILHFKNGITLDGLVGVPTM 195 (432) T ss_pred eEEEEcCcccccccceEEEEEec---------------------------CCeEEEEccccEEEecCCCCCCCcccccHH Confidence 99874343433322222222100 012234556555556632 3445688888 Q ss_pred HHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhcc-----ccceEEEecccc Q lcl|NC_015263. 247 AGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTVP-----DNVGVVTSPMEI 321 (513) Q Consensus 247 ~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~Lp-----~gv~~v~sP~~~ 321 (513) ..+...+--.....+.... -..|-...-..|-+ + + .++.++++++.+.++...- .++..+-..+++ T Consensus 196 ~~~~~~i~~~~~~~~~~~~--~~~ng~~p~gil~~---~--~--~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~ 266 (432) T protein:vir:10 196 EYLKSTLENSASADKFINN--FYKQGLQVKGLVQY---V--G--DLNEDAKKVFRENFESMSSGLQNSHRIALMPVGYQF 266 (432) T ss_pred HHHHHHHHHHHHHHHHHHH--HHhccCCccEEEEc---C--C--CCCHHHHHHHHHHHHHHhcccccCCcceecCCCceE Confidence 7776554444444432211 12221111111111 1 1 3667777777777776551 233444444455 Q ss_pred ccccccc-ccccchhhhhhHHhhhhhhhhhhhhccCC-CcchHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhh--cc-c Q lcl|NC_015263. 322 DTVSFDK-DSSTDDSVEKATKNFWDNAGVSQILFSSD-NKTSQGI-AMSIATDEQFIFGVINQLERWLNRYLLL--NG-M 395 (513) Q Consensus 322 d~i~ld~-~~~~~dtv~~~~~~i~~~~GiS~~Lfn~d-~~s~~~~-~~SI~~d~~~~~~~~~~iE~~~N~~i~~--~~-~ 395 (513) ..+.+.. +..--++.+-..++|..+.||...++|.. +++++.+ ...+.--..-+.-++.+||..+|++|-. .. . T Consensus 267 ~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~~~~~~~~~~l~P~~~~ie~~ln~kLl~~~~~~~ 346 (432) T protein:vir:10 267 QPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQFYTDTLQATLTMYEQEMTYKLFLDSELDK 346 (432) T ss_pred EEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcChhhcCC Confidence 5555432 11222344455688999999999888632 2223322 2232233334446999999999997742 21 2 Q ss_pred ceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCcccccccccccccc Q lcl|NC_015263. 396 SKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIA 475 (513) Q Consensus 396 ~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~ 475 (513) +..|+|.+-....-+.++.++.+.++..-|+=..--+-+.+|+.|. | |-|..++|+. ...-.. T Consensus 347 g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi--------~----ggD~~~~~~n---~~~~~~-- 409 (432) T protein:vir:10 347 GFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPE--------A----GGDRLLVNGN---MLPIDM-- 409 (432) T ss_pred CcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--------C----CCCeEeeccc---ccchhh-- Confidence 3356666666667788899999888887774333333334566652 1 2333444422 111110 Q ss_pred cCCccccCCCCcCCCCcccccccCCCCC Q lcl|NC_015263. 476 ENAIKEKGKENGRPTNETTGNKDSDETQ 503 (513) Q Consensus 476 ~~~~~~~~~~~grPt~et~~n~~~~~~~ 503 (513) .++.. ..+|. + ....+++.++++ T Consensus 410 --~~~~~-~k~~~-~-~~~~~~~~~~~~ 432 (432) T protein:vir:10 410 --AGQAY-LKGGD-T-NGEVSKEGNEGN 432 (432) T ss_pred --ccccc-cCCCC-C-CCCCCCCCCCCC Confidence 11100 01111 1 112222333333 No 28 >protein:vir:79538 Length: 502 # NCBI annotation: putative portal protein # Family: family:all:47 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272517;genbank:gi:148609386;genbank:GeneID:5204374 Probab=98.59 E-value=2.3e-07 Score=57.01 Aligned_cols=444 Identities=10% Similarity=0.062 Sum_probs=220.2 Q ss_pred eeeeehhhhhhHHHHHHHHHHHHHhhccCccccccccc----ccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhcchH Q lcl|NC_015263. 9 LSMIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGS----LTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQQY 84 (513) Q Consensus 9 ~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s----~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~~ 84 (513) +++||- -|+..+=-+.+++..-|-.. ..+.++...- -..+.+. ...+ ..+-..||.-|++||.++|+. T Consensus 1 mn~~dr-~i~~~sP~~~~~R~~ar~~~-~~y~aa~~~r~~~~~~~~~s~-~~~~-----~~~~~~lr~RaRdl~rNn~~a 72 (502) T protein:vir:79 1 MAILDD-VIGVFSPGWKAARLRSRAVI-QAYEAVKTTRTHKARRENRTA-DQLS-----QYGAVSLREQARYLDNNHDLV 72 (502) T ss_pred CchHhh-HHhhcChHHHHHHHhhHHHH-hhccccCcccccCCCCCCCCh-HHHH-----HHHHHHHHHHHHHHHhcChHH Confidence 455551 11112223333333333211 1111111100 0111111 1111 125789999999999999999 Q ss_pred HHHHHHHhhcccccce--Eeecc-chhhhhhcchhHHHHHH-HHHHhhc------ChhHHHHHHHHHHHHhcceeEEEEE Q lcl|NC_015263. 85 QRLLNFYANMPLYAYS--VVPFK-DISTANENKLKKELATV-TEFLSRL------NPKYNFSKIVKLAMTVDIFYGYVID 154 (513) Q Consensus 85 ~rlidy~~~mpt~dY~--I~P~~-~~~~~~~~~~~~~y~~v-~~~L~k~------n~k~~~~~i~~~~l~~g~~~gy~i~ 154 (513) ++.|+.+.+.-.=.-- +.|-- .........+.+..... ..|.+.+ +.-....-+++..++.|..|.-++. T Consensus 73 ~~av~~~~~nvVG~ggi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~~ 152 (502) T protein:vir:79 73 IGVFDKLEERVVGKNGIIVEPHPVLRNGAIARDLAAEIRTRWSEWSVSPEVTGQFTRPMLERLMLRTWLRDGEVFAQMVS 152 (502) T ss_pred HHHHHHHHHhhccCCceeeeeccCCCChhHHHHHHHHHHHHHHHhhcCcCccccCCHHHHHHHHHHHHHhCCceEEEEee Confidence 9999988775543321 22211 11111111222211111 1222322 2333445577888899999998865 Q ss_pred cCc---------ceeeeecCcceeEEEEEECCee-EEEEEeeeccCcc-hhcc-----ccHHHHHHHHHHhhhhhccCcc Q lcl|NC_015263. 155 DKE---------SVMIQQFPNDICKISSVSGGVY-NYVIDLDALVSAD-IVDY-----YPKEIQEAVNKYTTMKKGNNKS 218 (513) Q Consensus 155 d~~---------~~~iq~lp~dyckIsg~~nG~y-~~~fD~syFd~~~-~L~~-----~p~Ei~~~y~~Y~~~k~~~~~~ 218 (513) +.+ +.-+|-+++|.|. ....+|.+ +-.+ .||+.. .+.| -|.+ .. T Consensus 153 ~~~~~~~~g~~~~l~lq~iepd~l~-~~~~~~~~i~~GV---e~d~~Gr~~aY~i~~~hPgd----------------~~ 212 (502) T protein:vir:79 153 GRINSLTPSAGVHFWLEALEPDFIP-MTSDESNRLNQGV---FVDDWGRPEKYLVYKSRPVS----------------GR 212 (502) T ss_pred cccCccCCCcccceEEEEecchhcC-CCCCCCCeeEeee---EECCCCceEEEEEeecCCCC----------------Cc Confidence 432 3578999999995 22333332 1222 234322 2222 1322 11 Q ss_pred cccCeeecCCceEEEEecCcccc---chhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHH Q lcl|NC_015263. 219 ASNWYEIQDKNSICIKINESSLT---PVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMP 295 (513) Q Consensus 219 ~~~W~~L~~~kt~~ik~~~~~~~---~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~ 295 (513) ...|..+|-+ .|.+-++.+.+. |+|.|++++..+-++++|.+-....+.++...-..=+-+.. ....-...+.. T Consensus 213 ~~~~~rvpA~-~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~--~~~~~~~~~~~ 289 (502) T protein:vir:79 213 QMETKEVDAE-RMLHLKFVRRLHQMRGTSLLSGVLIRLSALKEYEDSELTAARIAAALGMYIRKGDG--QSYEPDGNGSK 289 (502) T ss_pred ccceeEechh-heEEeecccCCccccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCC--cccccccCCCC Confidence 2357777764 566666555553 99999999999999999999888888876642222222211 10000000000 Q ss_pred HHHHHHHHHHHhcccc-ceEEEecccccccccccc---c-ccchhhhhhHHhhhhhhhhhhhhccCC-CcchHHHHHHHH Q lcl|NC_015263. 296 MMNYFHEALSMTVPDN-VGVVTSPMEIDTVSFDKD---S-STDDSVEKATKNFWDNAGVSQILFSSD-NKTSQGIAMSIA 369 (513) Q Consensus 296 ~~~~~~~~ik~~Lp~g-v~~v~sP~~~d~i~ld~~---~-~~~dtv~~~~~~i~~~~GiS~~Lfn~d-~~s~~~~~~SI~ 369 (513) ++.-.-.|-.| +....-| =+.|+|-.. . +-...+......|-..+||+--++.+| +.|++++..++. T Consensus 290 -----~~~~~~~l~pG~i~~~L~p--Ge~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~ye~lt~D~s~nySs~R~~~~ 362 (502) T protein:vir:79 290 -----ENERELTIQPGIIYDDLKP--GEEIGMVKSDRPNPNLETFRNGQLRAVAAGSRLSFSSTARNYNGTYSAQRQELV 362 (502) T ss_pred -----CccccccccCCccccccCC--CceeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhccccchHHHHHHHHH Confidence 00000011112 1112223 134555222 2 223557777788999999995555444 557777777765 Q ss_pred HHHHHHH--------HHHHHH-HHHHHHHHhhcccc-------e-EEEEEe--cCCCCccHHHHHHHHHHHHhcCCcHHH Q lcl|NC_015263. 370 TDEQFIF--------GVINQL-ERWLNRYLLLNGMS-------K-YFKATM--LEVTHFSKKEAHDRYITDAQYGFPVKV 430 (513) Q Consensus 370 ~d~~~~~--------~~~~~i-E~~~N~~i~~~~~~-------~-~f~~~~--l~~T~fn~ke~~~~~~~~~~~G~~~~~ 430 (513) ..-..+- .|.+.| +.|+--.+..+.+. . ..+..+ .+-...+..+.+..-+.+..-|+.... T Consensus 363 e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~~W~~p~~~~iDP~Ke~~a~~~~i~~Gl~t~~ 442 (502) T protein:vir:79 363 ESTDGYLILQDWFIGAVTRPMYRAWLKQAVASGVIRLPRDLDRSSLYTAVYSGPVMPWIDPVKEAEAWKIQIRGGAATES 442 (502) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCCCCchhhcceeeecCCccccChHHHHHHHHHHHHcCCCCHH Confidence 3322221 333332 33666555444332 1 123333 334456777777777999999999999 Q ss_pred HHHHHhCCCHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccCCCCcCCCCcccccccCCCCCC Q lcl|NC_015263. 431 YLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETTGNKDSDETQR 504 (513) Q Consensus 431 ~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~~n~~~~~~~~ 504 (513) ..++..|.+|++++.+...|++.++=.. +|+.+.= +.. ..++.+ ..+...| +++ .++++. T Consensus 443 ~~~a~~G~D~~~v~~q~a~e~~~~~~~G--l~~~~~~---~~~--~~~~~~-~~~~~e~-~~~-----~~~~e~ 502 (502) T protein:vir:79 443 DWVRAGGRNPDDVKRRRKAEIDENRKLD--LVFDTDP---ASD--KGGSSA-ATKRQEP-QHT-----DDQSEE 502 (502) T ss_pred HHHHHcCCCHHHHHHHHHHHHHHHHHcC--CCCCCCC---CCC--CCCCCC-CCCCCCC-CCC-----CCCCCC Confidence 9999999999999999999986433222 2222210 000 000000 0000001 011 111111 No 29 >protein:vir:4828 Length: 382 # NCBI annotation: ORF24 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038325;genbank:gi:9634651;genbank:GeneID:1262630 Probab=98.56 E-value=2.8e-07 Score=56.47 Aligned_cols=364 Identities=10% Similarity=0.095 Sum_probs=157.4 Q ss_pred HHHHHhhccCccc-ccccccccchHHHHHHH-hhhccChhHHHHHHHHHHHHHhhcchHHHHHHHHhh-cccccceEeec Q lcl|NC_015263. 28 ISILRDDNRTPVF-GAPVGSLTSSQSKVRKI-VKEYRNEGNQKTLRKVSEDLAVQSQQYQRLLNFYAN-MPLYAYSVVPF 104 (513) Q Consensus 28 ~~i~~~~~~~~~~-~s~~~s~~~s~d~~k~~-i~~~~P~~n~~~ir~~s~~lY~~sg~~~rlidy~~~-mpt~dY~I~P~ 104 (513) -.+++-+...+.. .+..+. .+... +.-+.... .++.--+-+++.+.+.|+++++ +..+...++ T Consensus 1 Mg~f~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~------~v~~~~~l~~~~v~~~i~~ia~~ia~~~~~~~-- 66 (382) T protein:vir:48 1 MPIFNLATESPPDNQGGFFD------VVDSDFLASLKGNE------WVSAETALRNSDLFSIINQLSNDLATVKLITS-- 66 (382) T ss_pred CccccccccCCccccccccc------chhhhccccccCCc------ccchHhhhccHHHHHHHHHHHHhhccCceeee-- Confidence 0111111111100 000000 00000 00000000 0111112345677788888776 444555553 Q ss_pred cchhhhhhcchhHHHHHHHHHHhhcC----hhHHHHHHHHHHHHhcceeEEEEEcCcc--eeeeecCcceeEEEEEECC- Q lcl|NC_015263. 105 KDISTANENKLKKELATVTEFLSRLN----PKYNFSKIVKLAMTVDIFYGYVIDDKES--VMIQQFPNDICKISSVSGG- 177 (513) Q Consensus 105 ~~~~~~~~~~~~~~y~~v~~~L~k~n----~k~~~~~i~~~~l~~g~~~gy~i~d~~~--~~iq~lp~dyckIsg~~nG- 177 (513) ... +. ..+++-| ...+...++..++..|..|.+++-|..+ +.+.++|+++|.|.-..+| T Consensus 67 --~~~---------~~---~L~~~PN~~~t~~~f~~~l~~~l~l~Gna~~~i~rd~~G~~~~l~~i~~~~v~v~~~~~~~ 132 (382) T protein:vir:48 67 --RKK---------LQ---GIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLDNKD 132 (382) T ss_pred --cch---------hh---hhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCC Confidence 110 00 1233333 4677777888889999999999876544 6889999999999855554 Q ss_pred eeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEEecC-c-cccchhhHHHHHHhHHH Q lcl|NC_015263. 178 VYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKINE-S-SLTPVPPFAGTFDSIYD 255 (513) Q Consensus 178 ~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~~~-~-~~~~ip~f~~v~~d~~d 255 (513) .+.|.+-..- . ....=+.++...-+.|+... + ...|+||...+...+-- T Consensus 133 ~~~y~~~~~~---~--------------------------~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~ 183 (382) T protein:vir:48 133 GIYYNITFDD---P--------------------------RIPPKQHVPQNDVLHFRLLSVDGGMTSVSPLMALSRELDI 183 (382) T ss_pred eEEEEEEecC---c--------------------------cccceeEEcCccEEEecCCCCCCccccccHHHHHHHHHHH Confidence 2333221110 0 00011344454455566432 2 35788888877665543 Q ss_pred HHHHHHHHhhHhhhhhc---eeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhc--cccceEEEecccccccccccc- Q lcl|NC_015263. 256 IHSFKDLRNDKAELQNY---KLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTV--PDNVGVVTSPMEIDTVSFDKD- 329 (513) Q Consensus 256 i~~~kdL~~~~~~i~n~---~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~L--p~gv~~v~sP~~~d~i~ld~~- 329 (513) ....++.. ..-..|- ..++ ++| . .++.+++.++.+...+-. +.++..+-..+++..+.+... T Consensus 184 ~~~~~~~~--~~~~~ng~~p~~il-~~~-----~----~~~~e~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~~~~d 251 (382) T protein:vir:48 184 QKASGNLT--INSLKNALNANGIL-KIK-----G----GGLLDFKTKLSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNV 251 (382) T ss_pred HHHHHHHH--HHHHhccCCCceEE-EeC-----C----CCChHHHHHHHHHHHhhccCCCCeeEcCCCceEEEccCChhH Confidence 33333322 1122221 1222 223 1 123333333332222222 234444444555555554322 Q ss_pred cccchhhhhhHHhhhhhhhhhhhhccCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccceEEEEEecCCCCc Q lcl|NC_015263. 330 SSTDDSVEKATKNFWDNAGVSQILFSSDNKTSQGIAMSIATDEQFIFGVINQLERWLNRYLLLNGMSKYFKATMLEVTHF 409 (513) Q Consensus 330 ~~~~dtv~~~~~~i~~~~GiS~~Lfn~d~~s~~~~~~SI~~d~~~~~~~~~~iE~~~N~~i~~~~~~~~f~~~~l~~T~f 409 (513) ..--.+.+-..++|..+.||+..++|....+++.......--...+.-++++||..+|+.|....-.. .+..++. T Consensus 252 ~q~~e~~~~~~~~Ia~afgVp~~~lg~~~~~~~~~~~~~~~~~~~l~p~~~~i~~~l~~~l~~~~~~~--~~~~~~~--- 326 (382) T protein:vir:48 252 SQLLKQADWTTGQFAKVYGIPDNVVGGQGDQQSSLEMSSDLYSKAVSRYLRPFLSELSQKLSCDVDAD--IFPAVDP--- 326 (382) T ss_pred HHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHhcChhhhh--hhhhhcc--- Confidence 12223455566889999999999997654444443333333333444689999999999885431111 1112222 Q ss_pred cHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccCCCCcCC Q lcl|NC_015263. 410 SKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRP 489 (513) Q Consensus 410 n~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grP 489 (513) ........+.+++.-|. +++.|.-..+. + . + +.|.. .... ++..| T Consensus 327 ~~~~~~~~~~~l~~~g~-----------~t~~e~r~~l~-~--~-g----~~~~~----~~~~------------~~~~~ 371 (382) T protein:vir:48 327 TGSNYISRINSLVKTGT-----------LAQNQGLYILQ-Q--A-E----ILPKE----LPNG------------ENPNS 371 (382) T ss_pred chhHHHHHHHHHhhcCc-----------cCHHHHHHHHh-h--C-C----CCCcc----hhhh------------hcCCC Confidence 22333334334333331 23333322221 0 0 1 00100 0000 00111 Q ss_pred CCcccccccCCCCC Q lcl|NC_015263. 490 TNETTGNKDSDETQ 503 (513) Q Consensus 490 t~et~~n~~~~~~~ 503 (513) +.++ .+.++++ T Consensus 372 ~~~G---Gd~~~~~ 382 (382) T protein:vir:48 372 TLKG---GEEDGQD 382 (382) T ss_pred CCCC---CCCCCCC Confidence 1010 0000000 No 30 >protein:vir:389 Length: 530 # NCBI annotation: gp4 # Family: family:all:47 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046899;genbank:gi:9630468;genbank:GeneID:1261643 Probab=98.53 E-value=3.4e-07 Score=56.03 Aligned_cols=445 Identities=10% Similarity=0.010 Sum_probs=219.0 Q ss_pred ccchheeeeeh----hhhhhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHh Q lcl|NC_015263. 4 NKKKRLSMIDV----ESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAV 79 (513) Q Consensus 4 ~~~~~~~~~~~----~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~ 79 (513) -|..+++-.|- ...+.|. .-.-+..-...-+.. .+.+...+ + ..+-..||.-|++|+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~-----------~~a~~~~~~~~~w~~-~~~s~~~~-i-----~~~~~~lr~RaRdl~r 62 (530) T protein:vir:38 1 MKIPSLVGPDGKTSLREYAGYH-----------GGGGGFGGQLRGWNP-PSESADAA-L-----LPNYSRGNARADDLVR 62 (530) T ss_pred CccceeecCccccchHHHhhhh-----------cccCCCCCccccccc-CCCCHHHH-H-----HHHHHHHHHHHHHHHh Confidence 12223332221 1111110 000000000000000 01111111 1 1256789999999999 Q ss_pred hcchHHHHHHHHhhcccccceEeeccchh---hhhhcchhHHHHHHH-----HHHh----------hcChhHHHHHHHHH Q lcl|NC_015263. 80 QSQQYQRLLNFYANMPLYAYSVVPFKDIS---TANENKLKKELATVT-----EFLS----------RLNPKYNFSKIVKL 141 (513) Q Consensus 80 ~sg~~~rlidy~~~mpt~dY~I~P~~~~~---~~~~~~~~~~y~~v~-----~~L~----------k~n~k~~~~~i~~~ 141 (513) +|++.++.|+.+.+.--=. =|.|-...+ .-.+++.-+++.+.. .|-+ +++.-....-+++. T Consensus 63 Nn~~a~~av~~~~~nvVG~-Gi~~~~~p~~~~l~~~~~~~~~~~~~ie~~w~~W~~~~~~~~D~~g~~~f~~~q~l~~r~ 141 (530) T protein:vir:38 63 NNGYAANAVQLHQDHIVGS-FFRLSYRPSWRYLGINEEDSRAFSRDVEAAWNEYAEDDFCGIDAERKRTFTMMIREGVAM 141 (530) T ss_pred cChHHHHHHHHHHHHhhCC-CceeeeccchhhcCCCHhHHHHHHHHHHHHHHHhhcCCCcEEeeeccCCHHHHHHHHHHH Confidence 9999999999887754333 222211110 001111112221111 1222 22344555667788 Q ss_pred HHHhcceeEEEEEcCc-----ceeeeecCcceeEEE-EEECCeeEEEEEeeeccCcc-hhccc-----cHHHHHHHHHHh Q lcl|NC_015263. 142 AMTVDIFYGYVIDDKE-----SVMIQQFPNDICKIS-SVSGGVYNYVIDLDALVSAD-IVDYY-----PKEIQEAVNKYT 209 (513) Q Consensus 142 ~l~~g~~~gy~i~d~~-----~~~iq~lp~dyckIs-g~~nG~y~~~fD~syFd~~~-~L~~~-----p~Ei~~~y~~Y~ 209 (513) .++.|..|.=.....+ +.-+|-+++|+|.=. ...+|.+. ++==.||+.. -+.|+ |+. T Consensus 142 ~~~dGE~~~~~~~~~~~g~~~~~~lq~ie~d~l~~~~~~~~~~~i--~~GIe~d~~Gr~~aY~i~~~~~~~--------- 210 (530) T protein:vir:38 142 HAFNGELCVQATWDSDSTRLFRTQFKMVSPKRVSNPNNIGDTRNC--RAGVKINDSGAALGYYVSDDGYPG--------- 210 (530) T ss_pred HhhCCceEEEeeeccCCCCccceEEEEechhhcCCCCCCCCCCee--EeeeEECCCCceEEEEEeeccCCC--------- Confidence 8999999997775443 367899999998622 22333321 1211344322 22221 221 Q ss_pred hhhhccCcccccCeeecCC-----ceEEEEecCcccc---chhhHHHHHHhHHHHHHHHHHHhhHhhhhhce--eeeeee Q lcl|NC_015263. 210 TMKKGNNKSASNWYEIQDK-----NSICIKINESSLT---PVPPFAGTFDSIYDIHSFKDLRNDKAELQNYK--LLIQKL 279 (513) Q Consensus 210 ~~k~~~~~~~~~W~~L~~~-----kt~~ik~~~~~~~---~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~--ii~~ki 279 (513) .....|..++-. .-|++-++...+. |+|.|++++..+-++++|.+-....+.++... +|.+.. T Consensus 211 -------~~~~~~~~~~~~~~v~a~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~ 283 (530) T protein:vir:38 211 -------WMAQNWTYIPRELPGGRPSFIHVFEPMEDGQTRGANAFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESEL 283 (530) T ss_pred -------ccccccceeeeeeccChhHeEeeccccCCCcccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeeccC Confidence 001234333321 1355555555443 99999999999999999999887777776632 234444 Q ss_pred cccc-------CCCCC--ccccCH-HHHHHHHHHHHHhccccceEEEecccccccccccc----cccchhhhhhHHhhhh Q lcl|NC_015263. 280 ETRS-------SNDNN--DFTLDM-PMMNYFHEALSMTVPDNVGVVTSPMEIDTVSFDKD----SSTDDSVEKATKNFWD 345 (513) Q Consensus 280 p~~~-------~n~~~--~~~vd~-~~~~~~~~~ik~~Lp~gv~~v~sP~~~d~i~ld~~----~~~~dtv~~~~~~i~~ 345 (513) |-.+ ...++ ...... ......++.-.-.|-.|-....-|= +.|+|-.. .+-.+.+......|-. T Consensus 284 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pG--e~i~~~~p~~p~~~~~~f~~~~lr~iaa 361 (530) T protein:vir:38 284 DTQSAMDFILGADNKEQQSKLTGWLGEMAAYYSAAPVRLGGARVPHLLPG--DSLNLQSAQDTDNGYSTFEQSLLRYIAA 361 (530) T ss_pred CccccccccccCCcccccccccccchhhhhcccccceeccCceeeecCCC--CeeeeeCCCCCCCCHHHHHHHHHHHHHh Confidence 4110 00010 111111 1111111111112322222222221 34555332 2333567777788999 Q ss_pred hhhhhhhhccCC--CcchHHHHHHHHHHHHHHHHHHH---------HHHHHHHHHHhhcccc--------e------EEE Q lcl|NC_015263. 346 NAGVSQILFSSD--NKTSQGIAMSIATDEQFIFGVIN---------QLERWLNRYLLLNGMS--------K------YFK 400 (513) Q Consensus 346 ~~GiS~~Lfn~d--~~s~~~~~~SI~~d~~~~~~~~~---------~iE~~~N~~i~~~~~~--------~------~f~ 400 (513) .+||+--++.+| +.|++++..++...-..+-..+. ..++|+--.+....+. + ..+ T Consensus 362 glGi~ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~av~~G~i~~p~~~~~~~~~~~~a~~~ 441 (530) T protein:vir:38 362 GLGVSYEQLSRNYSQMSYSTARASANESWAYFMGRRKFVASRQACQMFLCWLEEAIVRRVVTLPSKARFSFQEARTAWGN 441 (530) T ss_pred hcCCCHHHHhcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHcCCccCCCCCCCCchhhHHhhhc Confidence 999996666666 56888888776644333333222 2344665545443331 0 123 Q ss_pred EEe--cCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCcccccccccccccccCC Q lcl|NC_015263. 401 ATM--LEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENA 478 (513) Q Consensus 401 ~~~--l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~ 478 (513) ..+ .+-...+-.+.+...+....-|+.++...++..|.+|++++.+...|++.++=-.+..|-.++.+...+. T Consensus 442 ~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~s~~~~~a~~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~~~~----- 516 (530) T protein:vir:38 442 ANWIGSGRMAIDGLKEVQEAVMLIEAGLSTYEKECAKRGDDYQEIFAQQVRESMERRAAGLNPPAWAAAAFEAGV----- 516 (530) T ss_pred eeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCCCHHHHHHHHHHHHHHHHHcCCCCCCCcccccCCCC----- Confidence 333 4444677778888889999999999999999999999999999999986433111222222221111000 Q ss_pred ccccCCCCcCCCCcccccccCCC Q lcl|NC_015263. 479 IKEKGKENGRPTNETTGNKDSDE 501 (513) Q Consensus 479 ~~~~~~~~grPt~et~~n~~~~~ 501 (513) ......|.+.+.+ . T Consensus 517 ----~~~~~~~~d~~~~-----a 530 (530) T protein:vir:38 517 ----KKSNEEEQDGARA-----A 530 (530) T ss_pred ----CCCCCCCCCCCCC-----C Confidence 0010111100000 0 No 31 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=98.53 E-value=1.8e-07 Score=57.58 Aligned_cols=394 Identities=10% Similarity=0.044 Sum_probs=179.6 Q ss_pred ccChhHHHHHHHHHHHHHhhcchHHHHHHHHhhcccccceEeeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHH Q lcl|NC_015263. 61 YRNEGNQKTLRKVSEDLAVQSQQYQRLLNFYANMPLYAYSVVPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVK 140 (513) Q Consensus 61 ~~P~~n~~~ir~~s~~lY~~sg~~~rlidy~~~mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~ 140 (513) |||.+-..+.+.+.+-+ ..+.-+.++|-++....++.+..| + .+.....++ .++.-++......+.+ T Consensus 1 ~l~~~~~~~~~~~~~~~--v~n~~~~ivd~~~~~l~~~gf~~~----d----~~~~~~~~~---i~~~N~~d~~~~~~~~ 67 (434) T protein:vir:98 1 MLPKNAEQAFLDFQRKA--RTNFCGLIANASVHRLLALGVTGP----D----GEPDTRASR---WWQANRLDSRQKLVWR 67 (434) T ss_pred CCCCCccHHHHHhhhhh--hccchHHHHHHHHhhhccCceecC----C----CchHHHHHH---HHHhcChhHHHHHHHH Confidence 88777777777654432 346778888988887666665432 1 122233333 3555567888899999 Q ss_pred HHHHhcceeEEEEEcCcc--------eeeeecCcceeEEEEE-ECCeeEEEEEeeeccCc--ch-hccccHHHHHHHHHH Q lcl|NC_015263. 141 LAMTVDIFYGYVIDDKES--------VMIQQFPNDICKISSV-SGGVYNYVIDLDALVSA--DI-VDYYPKEIQEAVNKY 208 (513) Q Consensus 141 ~~l~~g~~~gy~i~d~~~--------~~iq~lp~dyckIsg~-~nG~y~~~fD~syFd~~--~~-L~~~p~Ei~~~y~~Y 208 (513) ++++.|..|.+...+.++ ..|..+||++|.++-= ..+...+++-...-+.. .. .-+++.....-|.. T Consensus 68 ~a~i~G~ay~~v~~~~~~~~~~~~~~~~I~~~~p~~~~~i~D~~~~~~~~ai~~~~~~~~~~~~~~~~~~~~~~~~~~~- 146 (434) T protein:vir:98 68 MAMAQSAGYMLVGAHPTRTEDNGRPSPLITMEHPSECIVEYDPETGEPLVGLKVWHNDIDGFGYARVFFDDTSFPYRTR- 146 (434) T ss_pred HHhhcCceEEEEecCCCcccccCCceeEEEEeccceeEEEEeCCCCceEEEEEEEEeccCCceEEEEEEeCcEEEEEEe- Confidence 999999999887754332 3588899999987753 22556666644331111 11 00111110000000 Q ss_pred hhhhhccCcccccCee---ec------CCceEEEEe-cCcc--ccchhhHHHHHHhHHHHHHHHHHHhhH--hhhhhcee Q lcl|NC_015263. 209 TTMKKGNNKSASNWYE---IQ------DKNSICIKI-NESS--LTPVPPFAGTFDSIYDIHSFKDLRNDK--AELQNYKL 274 (513) Q Consensus 209 ~~~k~~~~~~~~~W~~---L~------~~kt~~ik~-~~~~--~~~ip~f~~v~~d~~di~~~kdL~~~~--~~i~n~~i 274 (513) .............|.. ++ -..-.++.+ |... .++. |-|.++.++.+.=+...+. ...+- T Consensus 147 ~~~~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~N~~~~~~~g~----sd~e~vi~liDa~~~~~s~~~~~~~~--- 219 (434) T protein:vir:98 147 ERTGARLPWGPDSWVYTGTADSGDVHDLGGMQLVEFARMPDLGEDPE----PEFAGVLDIQDRVNLGILNRMAASRF--- 219 (434) T ss_pred eccccccccccccceecccccccccCCCCccceEEeccCCCcCcCCc----chhhhHHHHHHHHHHHHHHHHHHHHH--- Confidence 0000000011112211 00 011112222 2211 1233 3334444443333322111 11111 Q ss_pred eeeeeccc--cCCCCCccccCHHHHHHHHHHHHHhccccceEEEecc-ccccccccccc-ccc-hhhhhhHHhhhhhhhh Q lcl|NC_015263. 275 LIQKLETR--SSNDNNDFTLDMPMMNYFHEALSMTVPDNVGVVTSPM-EIDTVSFDKDS-STD-DSVEKATKNFWDNAGV 349 (513) Q Consensus 275 i~~kip~~--~~n~~~~~~vd~~~~~~~~~~ik~~Lp~gv~~v~sP~-~~d~i~ld~~~-~~~-dtv~~~~~~i~~~~Gi 349 (513) ...|.+ -|-+..++..+......+.+.... -+..+ ...|- +.+-..|+... ... +.+..-..++....++ T Consensus 220 --~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~-~~~~i--~~~~~~~~~~~q~~~~~~~~~~~~l~~~i~~~~~~~~~ 294 (434) T protein:vir:98 220 --SGFRQKWIKGHKFAKRTDPATGMTVVDQPFVP-SPSAV--WASEGENTQFGQLDATDLSGFLKEHASDVRDMLTISQT 294 (434) T ss_pred --hcchhhhhcCCCcccccccccccchhhhhhhc-ccccc--ccCCCCCceEEEecCcchHHHHHHHHHHHHHHhcccCC Confidence 111200 011111111111111111111111 01111 11110 11112233211 111 3355555777777888 Q ss_pred hhhhccCCCcc--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----hhccc---ceEEEEEecCCCCccHHHHHHHHHH Q lcl|NC_015263. 350 SQILFSSDNKT--SQGIAMSIATDEQFIFGVINQLERWLNRYL----LLNGM---SKYFKATMLEVTHFSKKEAHDRYIT 420 (513) Q Consensus 350 S~~Lfn~d~~s--~~~~~~SI~~d~~~~~~~~~~iE~~~N~~i----~~~~~---~~~f~~~~l~~T~fn~ke~~~~~~~ 420 (513) ..-.|+++.++ +.+++.....-...+.+..+.++.-+.+.+ ..... ....++.|-+..+-|..+.++.+.+ T Consensus 295 p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~rl~~~~~g~~~~~~~~~v~w~~~~~~s~~~~ada~~k 374 (434) T protein:vir:98 295 PTYLYATDLVNISADTIGALDILHVAKVREHIASFSEGLESVLALAAAQAGVPEDYTEAEVRWANPAHVTMAVKADAATK 374 (434) T ss_pred CHHHhccccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCChhheeeeEEecCCCCCCHHHHHHHHHH Confidence 88888876444 444555544444444455555544333322 22222 2358899999999999999999999 Q ss_pred HHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccCCCCcCCCCcccccccCC Q lcl|NC_015263. 421 DAQYGFPVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETTGNKDSD 500 (513) Q Consensus 421 ~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~~n~~~~ 500 (513) +.+-|.|. ..+...+|+++.++-.+.+.+.+. .+..-..+.++.-+..| ..|. .+..+.| T Consensus 375 l~~~g~~~-e~~~~~lg~~~~e~~r~~~e~~~~-~~~~~~~~~~~~~~~~g---------------~~~~---~~~~~dg 434 (434) T protein:vir:98 375 LKSIGYPL-DVIAEELDESPARVRRIVAGAASQ-ALLAASLLPAPGAPSAG---------------NVPD---SGGAVDG 434 (434) T ss_pred HHhcCCcH-HHHHHhCCCCHHHHHHHHHHHHHH-HHHHHhhhccCCCCCCC---------------CCCc---ccCCCCC Confidence 99988765 556678999998765544432211 11111111121111111 1111 1111111 No 32 >protein:vir:81095 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429872;genbank:gi:156603925;genbank:GeneID:5525315 Probab=98.52 E-value=3.8e-07 Score=55.75 Aligned_cols=388 Identities=13% Similarity=0.058 Sum_probs=174.2 Q ss_pred CCCccchheeeeehhhhhhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhc---c-----ChhHHHHHHH Q lcl|NC_015263. 1 MVKNKKKRLSMIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEY---R-----NEGNQKTLRK 72 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~---~-----P~~n~~~ir~ 72 (513) +.+..+||-+ ..++.. ...++... . +......|+ T Consensus 3 ~f~~~~~r~~------------------------~~~~~~-------------~~~~~~~~~~~~~~~~~~~~~~~al~- 44 (416) T protein:vir:81 3 IFYKNEKRDL------------------------QYNEDD-------------LQMMVQTLPGFQGTKLRQYKDIEAIR- 44 (416) T ss_pred cccccccccc------------------------cCCCcc-------------hhHHHHHhccccccCccccchhhhhc- Confidence 3333333321 111110 01111100 0 111111222 Q ss_pred HHHHHHhhcchHHHHHHHHhh-cccccceEeeccchhhhhhcchhHHHHHHHHHHh-hcC----hhHHHHHHHHHHHHhc Q lcl|NC_015263. 73 VSEDLAVQSQQYQRLLNFYAN-MPLYAYSVVPFKDISTANENKLKKELATVTEFLS-RLN----PKYNFSKIVKLAMTVD 146 (513) Q Consensus 73 ~s~~lY~~sg~~~rlidy~~~-mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~-k~n----~k~~~~~i~~~~l~~g 146 (513) ...+.+.|+.+++ +..+...++ ++. ....+ +.+...|. +=| ...+...+...++..| T Consensus 45 --------~~~v~~cv~~Ia~~iA~~p~~~~----~~~----~~~~~-~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~G 107 (416) T protein:vir:81 45 --------HSDIFTAVMMIASDLARMPIRVT----VNG----QINYS-DRIVNLLNTRPNPMYNGYIFKLVVFVSALLTS 107 (416) T ss_pred --------chHHHHHHHHHHHhhccCceEEe----cCc----ccccc-chHHHHHhcccccCCCHHHHHHHHHHHHhhcC Confidence 1223334444422 222223333 111 11111 12333343 222 3466677888889999 Q ss_pred ceeEEEEEcCcc--eeeeecCcceeEEEEEECCeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCee Q lcl|NC_015263. 147 IFYGYVIDDKES--VMIQQFPNDICKISSVSGGVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYE 224 (513) Q Consensus 147 ~~~gy~i~d~~~--~~iq~lp~dyckIsg~~nG~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~ 224 (513) ..|.++.-+.++ +-+.++|++.|.|.--.+|.+.|.+- .++.... ..-.. T Consensus 108 na~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~g~~~~~~~--~~~~~~~--------------------------~~~~~ 159 (416) T protein:vir:81 108 HGYIEITRDKTGEPMNLTFRKTSEIELKSDARGRLYYFHQ--RIDSNGN--------------------------NIERN 159 (416) T ss_pred CeEEEEEECCCCcEEEEEEEcCceeEEEECCCccEEEEEE--EecCCCc--------------------------eeEEE Confidence 999998876554 67899999999998656677554331 1110000 01123 Q ss_pred ecCCceEEEEe-cCccccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceee--eeeeccccCCCCCccccCHHHHHHHH Q lcl|NC_015263. 225 IQDKNSICIKI-NESSLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLL--IQKLETRSSNDNNDFTLDMPMMNYFH 301 (513) Q Consensus 225 L~~~kt~~ik~-~~~~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii--~~kip~~~~n~~~~~~vd~~~~~~~~ 301 (513) ++...-+-|+. ..+...|++|...+...+--.....+... .-..|-... +=++| + ...|.++.+++. T Consensus 160 ~~~~evihir~~~~d~~~G~s~i~~~~~~i~~~~~~~~~~~--~~f~ng~~~~gil~~~----~----~~~~~~~~~~~~ 229 (416) T protein:vir:81 160 VKFEDMLDIKFYSLDGINGLSLLDTLSRTIESDNNGKDFLN--NFLRNGTHAGGILKMK----G----VLDNKKARDRAR 229 (416) T ss_pred EccccEEEeccCCCCCccccCHHHHHHHHHHHHHHHHHHHH--HHHhccCCCcEEEEeC----C----CCCCHHHHHHHH Confidence 44433444553 22345677776666543332222222221 112221111 11233 1 112445555555 Q ss_pred HHHHHhcc--c---cceEEEeccccccccccccc-ccchhhhhhHHhhhhhhhhhhhhccCCCcchHHHHHHHHHHHHHH Q lcl|NC_015263. 302 EALSMTVP--D---NVGVVTSPMEIDTVSFDKDS-STDDSVEKATKNFWDNAGVSQILFSSDNKTSQGIAMSIATDEQFI 375 (513) Q Consensus 302 ~~ik~~Lp--~---gv~~v~sP~~~d~i~ld~~~-~~~dtv~~~~~~i~~~~GiS~~Lfn~d~~s~~~~~~SI~~d~~~~ 375 (513) +.+.+++- . ++..+-..+++..++++... .--.+..-+.++|..+.||...++|.++.+++.....+.- ..-+ T Consensus 230 ~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~-~~~l 308 (416) T protein:vir:81 230 EEFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANMSITDANLDY-LSTL 308 (416) T ss_pred HHHHHHhcCccccCceeecCCCceeEeccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCccHHHHHHHH-HHHH Confidence 66665551 1 23333333444444443211 1112234455789999999999998776655544443332 1234 Q ss_pred HHHHHHHHHHHHHHHhhcccceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhhC Q lcl|NC_015263. 376 FGVINQLERWLNRYLLLNGMSKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENEMLD 455 (513) Q Consensus 376 ~~~~~~iE~~~N~~i~~~~~~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L~ 455 (513) .-++++||..+|+.|-....+..|+|.+-+....+.++.++.+.++..-|.=..--.-..+|+.|.+ -.+ T Consensus 309 ~P~~~~ie~~ln~~l~~~~~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~----------~gd 378 (416) T protein:vir:81 309 KPYITCVCAELNFKFNDEYVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIP----------GGN 378 (416) T ss_pred HHHHHHHHHHHhhhccccccCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC----------CCC Confidence 4689999999999986555566788887777777888888888887776632222222233444422 001 Q ss_pred cccccCcccccccccccccccCCccccCCCCcCCCCcccc-cccCCCCC Q lcl|NC_015263. 456 LPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETTG-NKDSDETQ 503 (513) Q Consensus 456 l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~~-n~~~~~~~ 503 (513) -+..+.|+........ ++ -+..++ .++.+ .+..++++ T Consensus 379 ~~~~~~~~n~~~~~~~-------~~---~~~~~~-~~~~~~~kgGe~n~ 416 (416) T protein:vir:81 379 GSIHRVDLNHVNIELV-------DE---YQMNKS-RATDKKLKGGEENE 416 (416) T ss_pred cceEeecccccccccc-------cc---cCcccc-cccccccCCCCCCC Confidence 1111111111110000 00 000000 00111 11122222 No 33 >protein:vir:4598 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058443;genbank:gi:9635169;genbank:GeneID:1262702 Probab=98.52 E-value=3.8e-07 Score=55.75 Aligned_cols=388 Identities=13% Similarity=0.058 Sum_probs=174.2 Q ss_pred CCCccchheeeeehhhhhhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhc---c-----ChhHHHHHHH Q lcl|NC_015263. 1 MVKNKKKRLSMIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEY---R-----NEGNQKTLRK 72 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~---~-----P~~n~~~ir~ 72 (513) +.+..+||-+ ..++.. ...++... . +......|+ T Consensus 3 ~f~~~~~r~~------------------------~~~~~~-------------~~~~~~~~~~~~~~~~~~~~~~~al~- 44 (416) T protein:vir:45 3 IFYKNEKRDL------------------------QYNEDD-------------LQMMVQTLPGFQGTKLRQYKDIEAIR- 44 (416) T ss_pred cccccccccc------------------------cCCCcc-------------hhHHHHHhccccccCccccchhhhhc- Confidence 3333333321 111110 01111100 0 111111222 Q ss_pred HHHHHHhhcchHHHHHHHHhh-cccccceEeeccchhhhhhcchhHHHHHHHHHHh-hcC----hhHHHHHHHHHHHHhc Q lcl|NC_015263. 73 VSEDLAVQSQQYQRLLNFYAN-MPLYAYSVVPFKDISTANENKLKKELATVTEFLS-RLN----PKYNFSKIVKLAMTVD 146 (513) Q Consensus 73 ~s~~lY~~sg~~~rlidy~~~-mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~-k~n----~k~~~~~i~~~~l~~g 146 (513) ...+.+.|+.+++ +..+...++ ++. ....+ +.+...|. +=| ...+...+...++..| T Consensus 45 --------~~~v~~cv~~Ia~~iA~~p~~~~----~~~----~~~~~-~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~G 107 (416) T protein:vir:45 45 --------HSDIFTAVMMIASDLARMPIRVT----VNG----QINYS-DRIVNLLNTRPNPMYNGYIFKLVVFVSALLTS 107 (416) T ss_pred --------chHHHHHHHHHHHhhccCceEEe----cCc----ccccc-chHHHHHhcccccCCCHHHHHHHHHHHHhhcC Confidence 1223334444422 222223333 111 11111 12333343 222 3466677888889999 Q ss_pred ceeEEEEEcCcc--eeeeecCcceeEEEEEECCeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCee Q lcl|NC_015263. 147 IFYGYVIDDKES--VMIQQFPNDICKISSVSGGVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYE 224 (513) Q Consensus 147 ~~~gy~i~d~~~--~~iq~lp~dyckIsg~~nG~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~ 224 (513) ..|.++.-+.++ +-+.++|++.|.|.--.+|.+.|.+- .++.... ..-.. T Consensus 108 na~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~g~~~~~~~--~~~~~~~--------------------------~~~~~ 159 (416) T protein:vir:45 108 HGYIEITRDKTGEPMNLTFRKTSEIELKSDARGRLYYFHQ--RIDSNGN--------------------------NIERN 159 (416) T ss_pred CeEEEEEECCCCcEEEEEEEcCceeEEEECCCccEEEEEE--EecCCCc--------------------------eeEEE Confidence 999998876554 67899999999998656677554331 1110000 01123 Q ss_pred ecCCceEEEEe-cCccccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceee--eeeeccccCCCCCccccCHHHHHHHH Q lcl|NC_015263. 225 IQDKNSICIKI-NESSLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLL--IQKLETRSSNDNNDFTLDMPMMNYFH 301 (513) Q Consensus 225 L~~~kt~~ik~-~~~~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii--~~kip~~~~n~~~~~~vd~~~~~~~~ 301 (513) ++...-+-|+. ..+...|++|...+...+--.....+... .-..|-... +=++| + ...|.++.+++. T Consensus 160 ~~~~evihir~~~~d~~~G~s~i~~~~~~i~~~~~~~~~~~--~~f~ng~~~~gil~~~----~----~~~~~~~~~~~~ 229 (416) T protein:vir:45 160 VKFEDMLDIKFYSLDGINGLSLLDTLSRTIESDNNGKDFLN--NFLRNGTHAGGILKMK----G----VLDNKKARDRAR 229 (416) T ss_pred EccccEEEeccCCCCCccccCHHHHHHHHHHHHHHHHHHHH--HHHhccCCCcEEEEeC----C----CCCCHHHHHHHH Confidence 44433444553 22345677776666543332222222221 112221111 11233 1 112445555555 Q ss_pred HHHHHhcc--c---cceEEEeccccccccccccc-ccchhhhhhHHhhhhhhhhhhhhccCCCcchHHHHHHHHHHHHHH Q lcl|NC_015263. 302 EALSMTVP--D---NVGVVTSPMEIDTVSFDKDS-STDDSVEKATKNFWDNAGVSQILFSSDNKTSQGIAMSIATDEQFI 375 (513) Q Consensus 302 ~~ik~~Lp--~---gv~~v~sP~~~d~i~ld~~~-~~~dtv~~~~~~i~~~~GiS~~Lfn~d~~s~~~~~~SI~~d~~~~ 375 (513) +.+.+++- . ++..+-..+++..++++... .--.+..-+.++|..+.||...++|.++.+++.....+.- ..-+ T Consensus 230 ~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~-~~~l 308 (416) T protein:vir:45 230 EEFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANMSITDANLDY-LSTL 308 (416) T ss_pred HHHHHHhcCccccCceeecCCCceeEeccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCccHHHHHHHH-HHHH Confidence 66665551 1 23333333444444443211 1112234455789999999999998776655544443332 1234 Q ss_pred HHHHHHHHHHHHHHHhhcccceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhhC Q lcl|NC_015263. 376 FGVINQLERWLNRYLLLNGMSKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENEMLD 455 (513) Q Consensus 376 ~~~~~~iE~~~N~~i~~~~~~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L~ 455 (513) .-++++||..+|+.|-....+..|+|.+-+....+.++.++.+.++..-|.=..--.-..+|+.|.+ -.+ T Consensus 309 ~P~~~~ie~~ln~~l~~~~~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~----------~gd 378 (416) T protein:vir:45 309 KPYITCVCAELNFKFNDEYVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIP----------GGN 378 (416) T ss_pred HHHHHHHHHHHhhhccccccCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC----------CCC Confidence 4689999999999986555566788887777777888888888887776632222222233444422 001 Q ss_pred cccccCcccccccccccccccCCccccCCCCcCCCCcccc-cccCCCCC Q lcl|NC_015263. 456 LPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETTG-NKDSDETQ 503 (513) Q Consensus 456 l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~~-n~~~~~~~ 503 (513) -+..+.|+........ ++ -+..++ .++.+ .+..++++ T Consensus 379 ~~~~~~~~n~~~~~~~-------~~---~~~~~~-~~~~~~~kgGe~n~ 416 (416) T protein:vir:45 379 GSIHRVDLNHVNIELV-------DE---YQMNKS-RATDKKLKGGEENE 416 (416) T ss_pred cceEeecccccccccc-------cc---cCcccc-cccccccCCCCCCC Confidence 1111111111110000 00 000000 00111 11122222 No 34 >protein:vir:101648 Length: 518 # NCBI annotation: gp11 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654766;genbank:gi:109302764;genbank:GeneID:4156082 Probab=98.48 E-value=5e-07 Score=55.14 Aligned_cols=423 Identities=13% Similarity=0.079 Sum_probs=189.2 Q ss_pred HHHHhhccCcccccccccccchHHHHHHHhhhcc---ChhHHHHHH--HHHHHHHhhcchHHHHHHHHhh-cccccceEe Q lcl|NC_015263. 29 SILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYR---NEGNQKTLR--KVSEDLAVQSQQYQRLLNFYAN-MPLYAYSVV 102 (513) Q Consensus 29 ~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~---P~~n~~~ir--~~s~~lY~~sg~~~rlidy~~~-mpt~dY~I~ 102 (513) .+++. --..+||+ ..+. .-++.+.+ |......=+ .+.-=+|..++.+.+.|+.++. +..+...++ T Consensus 1 ~~~~~---~~~~~~p~-----~~e~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~V~acV~~IA~~iA~lpl~l~ 71 (518) T protein:vir:10 1 MLLAN---GQTLSAPA-----MAEL-SPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCM 71 (518) T ss_pred CcccC---ceeecCch-----hhhh-hhhhhcccccccccceecccccchhhHHHhhhHHHHHHHHHHHHhhccCceEEE Confidence 11110 00001110 0001 11222211 211111101 1111235677788888887764 223333333 Q ss_pred eccchhhhhhcchhHHHHHHHHHHhhcC----hhHHHHHHHHHHHHhcceeEEEEEcCcc--eeeeecCcceeEEEEEE- Q lcl|NC_015263. 103 PFKDISTANENKLKKELATVTEFLSRLN----PKYNFSKIVKLAMTVDIFYGYVIDDKES--VMIQQFPNDICKISSVS- 175 (513) Q Consensus 103 P~~~~~~~~~~~~~~~y~~v~~~L~k~n----~k~~~~~i~~~~l~~g~~~gy~i~d~~~--~~iq~lp~dyckIsg~~- 175 (513) -... +. .......-...+|.+=| ...+...++..++..|..|.+++-+.++ +.+.++|+++|.|.--. T Consensus 72 ~~~~-~~----~~~~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~G~~~~L~~l~p~~v~v~~~~~ 146 (518) T protein:vir:10 72 FTSG-DT----ETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSR 146 (518) T ss_pred EEcC-CC----ceeccchHHHHHHcCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCCceEEEEcCC Confidence 1111 11 11112222334455444 3456667777888999999999877655 67899999999988553 Q ss_pred CCeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEEec-Cccc-cchhhHHHHHHhH Q lcl|NC_015263. 176 GGVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKIN-ESSL-TPVPPFAGTFDSI 253 (513) Q Consensus 176 nG~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~~-~~~~-~~ip~f~~v~~d~ 253 (513) +|.+.|.|...-- ....-++++...-+-|+.. .+.. .|++|...+...+ T Consensus 147 ~~~~~y~~~~~~~-----------------------------~~~~~~~~~~~eViHir~~s~dg~~~G~spi~~a~~~i 197 (518) T protein:vir:10 147 TGRYEYYFQAGAG-----------------------------VGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTI 197 (518) T ss_pred CCEEEEEEEecCC-----------------------------ccceEEEecCCcEEEecCCCCCcccccccHHHHHHHHH Confidence 4566665543210 0011233344333444432 2333 5788877665544 Q ss_pred HHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhcc--c---cceEEEeccccccccccc Q lcl|NC_015263. 254 YDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTVP--D---NVGVVTSPMEIDTVSFDK 328 (513) Q Consensus 254 ~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~Lp--~---gv~~v~sP~~~d~i~ld~ 328 (513) --.....+.. ..-..|-...-..|-+ + + .++.++++++.+.+++..- + ++..+-..+++..+.+. T Consensus 198 ~~~~a~~~~~--~~~f~ng~~p~gil~~---~--~--~ls~e~~~~~k~~~~~~~~G~~nag~v~vL~~G~~~~~l~~s- 267 (518) T protein:vir:10 198 FSEDSSRNAT--AAMWKNAGRPNLVLRH---E--K--RLSEAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLT- 267 (518) T ss_pred HHHHHHHHHH--HHHHhcCCCccEEEec---C--C--CCCHHHHHHHHHHHHHHhcCccccCcceEcCCCceEEEccCC- Confidence 4443333321 1222332111111111 1 1 3667777777777776652 1 23444444455555443 Q ss_pred ccccchh---hhhhHHhhhhhhhhhhhhccCC-CcchHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhhc-ccceEEEEE Q lcl|NC_015263. 329 DSSTDDS---VEKATKNFWDNAGVSQILFSSD-NKTSQG-IAMSIATDEQFIFGVINQLERWLNRYLLLN-GMSKYFKAT 402 (513) Q Consensus 329 ~~~~~dt---v~~~~~~i~~~~GiS~~Lfn~d-~~s~~~-~~~SI~~d~~~~~~~~~~iE~~~N~~i~~~-~~~~~f~~~ 402 (513) ..+.+. .+-..++|..+.||...++|-. +.+++. -.....-...-+.-++.+||..+|+.|... ..+..|+|. T Consensus 268 -~~D~q~le~r~~~~~eIa~afgVPp~~lg~~~~~t~sn~eq~~~~f~~~tL~P~l~~ie~~ln~~L~~~~~~~~~~~fd 346 (518) T protein:vir:10 268 -AVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFD 346 (518) T ss_pred -hhHHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCceEEEe Confidence 222233 3344478999999998888622 222222 222222233334468999999999988643 223456666 Q ss_pred ecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCcccc Q lcl|NC_015263. 403 MLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEK 482 (513) Q Consensus 403 ~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~ 482 (513) .-+....+.++.++.+.++..-|.-..--.-..+|+.|.+-- .-+.+=+...+.|+...-- |...++.+++++ T Consensus 347 ~~~llr~D~~~r~~~~~~~~~~G~lT~NE~R~~~Gl~pie~~-----~gD~~~~~~n~~pl~~~~~--~~~~g~~~~~~~ 419 (518) T protein:vir:10 347 IDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDP-----KADELYANSALQPLGATPD--GAVEGEEAPAPK 419 (518) T ss_pred chhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCC-----CCCeeeecccceecccccc--cccCCCCCCCCC Confidence 556556678888888888877774333323334566665310 0011112222344321110 110011111111 Q ss_pred CCCCcCCCCccccc----------------ccCCCC------CCCCCCccCCC Q lcl|NC_015263. 483 GKENGRPTNETTGN----------------KDSDET------QRAKDKPANTQ 513 (513) Q Consensus 483 ~~~~grPt~et~~n----------------~~~~~~------~~~~d~~~~~~ 513 (513) .+ ...|..+..+. .+++++ ..|++++.-.+ T Consensus 420 ~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 471 (518) T protein:vir:10 420 RP-ASTPVASLDQSPPTSVPGLSPTNSDRSTDSGKTEPRRLMQKPPPKESSPK 471 (518) T ss_pred CC-CccccccccccccccCCCCCcccccccccccccchhccccCCCcccccch Confidence 10 01111110000 000000 11111111111 No 35 >protein:vir:8418 Length: 409 # NCBI annotation: gp13 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818314;genbank:gi:29566750;genbank:GeneID:1260067 Probab=98.45 E-value=3e-07 Score=56.35 Aligned_cols=384 Identities=11% Similarity=0.089 Sum_probs=178.7 Q ss_pred HHHHHHHHHHhh-ccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhcchHHHHHHHHhh-cccccce Q lcl|NC_015263. 23 KRNNRISILRDD-NRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQQYQRLLNFYAN-MPLYAYS 100 (513) Q Consensus 23 ~~~~~~~i~~~~-~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~~~rlidy~~~-mpt~dY~ 100 (513) |-=+.++.-+-- .+... ...+..... ..+ . .+... +..--+-..+.+.+.|+.++. +..+... T Consensus 1 Mgl~~~~f~~~~~~~~~~---~~~~~~~~~---~~~-~--~~g~~------v~~~~al~~~~v~~~v~~ia~~iA~lp~~ 65 (409) T protein:vir:84 1 MSLFTRIFSGPSEERTLT---KISGIPSPA---EDW-A--MHGDR------PGANSAMTLGAFYACVTLLADTVASLSID 65 (409) T ss_pred CchhhhhhcCCCcccccc---ccccccccc---chh-h--ccCcc------cchhhhhccHHHHHHHHHHHHhhhhCceE Confidence 211222111100 00000 000000000 000 0 00000 111111234566777776642 3344444 Q ss_pred EeeccchhhhhhcchhHHHHHHHHHHh-hcC----hhHHHHHHHHHHHHhcceeEEEE-Ec--CcceeeeecCcceeEEE Q lcl|NC_015263. 101 VVPFKDISTANENKLKKELATVTEFLS-RLN----PKYNFSKIVKLAMTVDIFYGYVI-DD--KESVMIQQFPNDICKIS 172 (513) Q Consensus 101 I~P~~~~~~~~~~~~~~~y~~v~~~L~-k~n----~k~~~~~i~~~~l~~g~~~gy~i-~d--~~~~~iq~lp~dyckIs 172 (513) ++-..+. +. ..-+.+...|. +-| -..+...++..++..|..|.|+. .+ +....+.++|+++|.|. T Consensus 66 ~~~~~~~-----~~--~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~l~l~Gn~~~~i~~~~~~g~~~~L~~l~p~~v~v~ 138 (409) T protein:vir:84 66 AYRKKDN-----VR--IPVSPAPKLLESTPYPGLTWFDWLWMLMESLAVTGNAFGYISARDEANRPTAIMPIHPDCIHVT 138 (409) T ss_pred EEEecCC-----cc--cccchHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEECCCCceEEEEEEcCceeEEE Confidence 4421111 11 11133444453 333 45677778888999999999975 33 44578999999999998 Q ss_pred EEEC--CeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEEe-cCc-cccchhhHHH Q lcl|NC_015263. 173 SVSG--GVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKI-NES-SLTPVPPFAG 248 (513) Q Consensus 173 g~~n--G~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~-~~~-~~~~ip~f~~ 248 (513) -..+ |.+++.+ |..+.. +++.+--+-|+- ..+ ...|+||... T Consensus 139 ~~~~~~~~~~~~~---~~~~g~-------------------------------~~~~~dvih~~~~~~~~~~~G~s~i~~ 184 (409) T protein:vir:84 139 DAKDEDGDWIEPV---YRIDGK-------------------------------VVPNHRIMHIKRYPVAGCALGMSPIEK 184 (409) T ss_pred EcCCCcceEEEEE---ecCCce-------------------------------EEchhhEEEecCCCCCcccccccHHHH Confidence 6544 3333221 211111 112211222331 112 2357888776 Q ss_pred HHHhHHHHHHHHHHHhhHhhhhh---ceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhccc--cceEEEecccccc Q lcl|NC_015263. 249 TFDSIYDIHSFKDLRNDKAELQN---YKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTVPD--NVGVVTSPMEIDT 323 (513) Q Consensus 249 v~~d~~di~~~kdL~~~~~~i~n---~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~Lp~--gv~~v~sP~~~d~ 323 (513) +...+--....++... .-..| ...++ ++| + .++.++++++.+...+...+ ++.++-..+++.. T Consensus 185 ~~~~i~~~~~~~~~~~--~~f~ng~~p~gil-~~~-------~--~l~~e~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~ 252 (409) T protein:vir:84 185 AASAIGLGLAAERYGL--RWFRDSANPSGIL-SSD-------A--DLTPDQVKQTQKQWIQSHHNRRLPAVMSAGIKWQS 252 (409) T ss_pred HHHHHHHHHHHHHHHH--HHHhcCCCccEEE-ecC-------C--CCCHHHHHHHHHHHHHHhccCCCeeecCCCceEEE Confidence 6554443333333221 11222 12222 122 1 36667777766666665533 2333333345555 Q ss_pred ccccc-ccccchhhhhhHHhhhhhhhhhhhhccCCCcch---HHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhhcccceE Q lcl|NC_015263. 324 VSFDK-DSSTDDSVEKATKNFWDNAGVSQILFSSDNKTS---QGI-AMSIATDEQFIFGVINQLERWLNRYLLLNGMSKY 398 (513) Q Consensus 324 i~ld~-~~~~~dtv~~~~~~i~~~~GiS~~Lfn~d~~s~---~~~-~~SI~~d~~~~~~~~~~iE~~~N~~i~~~~~~~~ 398 (513) +.+.. +..--.+.+-..++|..+.||...++|....++ +.+ ...+.--..-+.-++++||..+|++|.. +.. T Consensus 253 ~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~l~~~L~~---g~~ 329 (409) T protein:vir:84 253 VSITPNESQFLETRSFQRSEIAMWFRIPPHMIGDVEKSTSWGTGIEEQGINFVRHTLLPWLRCIEQALDTFLPR---GQF 329 (409) T ss_pred ccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHHhccC---CCe Confidence 54432 111123345666889999999998886432221 222 2222222222445899999999998843 445 Q ss_pred EEEEecCCCCccHHHHHHHHHHHHhcCC-cHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCcccccccccccccccC Q lcl|NC_015263. 399 FKATMLEVTHFSKKEAHDRYITDAQYGF-PVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAEN 477 (513) Q Consensus 399 f~~~~l~~T~fn~ke~~~~~~~~~~~G~-~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~ 477 (513) ++|.+-+...-+.++.++.+.++..-|. .+=..-+ .+|+.|.+ +-|..++|+ +...... T Consensus 330 i~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~-~~g~~p~~------------ggD~~~~~~---n~~~~~~---- 389 (409) T protein:vir:84 330 VKFNVDGLMRGDVTARFTAYQMGLQNGIWSVNEVRA-WEDAPPIP------------EGDIHLQPM---NFVPLGY---- 389 (409) T ss_pred EEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHH-HhCCCCCC------------Ccceeeecc---ccccccc---- Confidence 7777767777788999998888887773 3333222 35665532 234444443 2211111 Q ss_pred CccccCCCCcCCCCcccccc Q lcl|NC_015263. 478 AIKEKGKENGRPTNETTGNK 497 (513) Q Consensus 478 ~~~~~~~~~grPt~et~~n~ 497 (513) .+.....+++.|..++.+|+ T Consensus 390 ~~~~~~~~~~~~~~~~~gn~ 409 (409) T protein:vir:84 390 VPPEEPAQEPQPNSATEGNK 409 (409) T ss_pred CCccccCcCCCCCCccCCCC Confidence 11111123333444444444 No 36 >protein:vir:102118 Length: 409 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699943;genbank:gi:110804051;genbank:GeneID:4206661 Probab=98.44 E-value=6.6e-07 Score=54.46 Aligned_cols=383 Identities=14% Similarity=0.077 Sum_probs=180.1 Q ss_pred CCCcc-chheeeeehhhhhhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHh Q lcl|NC_015263. 1 MVKNK-KKRLSMIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAV 79 (513) Q Consensus 1 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~ 79 (513) |.=.| .+|-+ ++++ ..+...-.|+-.- |.+- .++.--+. T Consensus 1 m~f~~~~~~~~----~~~~------------------------------~~~~~~~~~~g~~-~~~~-----~v~~~~al 40 (409) T protein:vir:10 1 MLFRKGFKNQS----QEIS------------------------------IDDKKILEWLGIN-PSET-----YVNGKSCL 40 (409) T ss_pred CcccccccCcC----CCCC------------------------------CChHHHHHHhcCC-cCcc-----eechhhhh Confidence 32211 12211 0110 0000001111100 1110 00011122 Q ss_pred hcchHHHHHHHHhh-cccccceEeeccchhhhhhcchhHHHHHHHHHHh-h----cChhHHHHHHHHHHHHhcceeEEEE Q lcl|NC_015263. 80 QSQQYQRLLNFYAN-MPLYAYSVVPFKDISTANENKLKKELATVTEFLS-R----LNPKYNFSKIVKLAMTVDIFYGYVI 153 (513) Q Consensus 80 ~sg~~~rlidy~~~-mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~-k----~n~k~~~~~i~~~~l~~g~~~gy~i 153 (513) ++..+++.|+++++ +..+...++ +. .......+ -+.+...|. + +.-..+...++..++..|..|.+++ T Consensus 41 ~~~~v~~~i~~ia~~ia~lp~~~~----~~-~~~~~~~~-~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~ 114 (409) T protein:vir:10 41 KQATVFGCIRILSDNISKLPIKIY----QK-KDGIKRVP-DHYLEYLLKLRPNPYMSSSDFWKCIEVQRNIYGNAYVALD 114 (409) T ss_pred ccHHHHHHHHHHHHhhhhCceEEE----Ee-cCCeeecc-CchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEE Confidence 45667778887754 223444443 11 11111111 122333443 2 2355667788888999999999998 Q ss_pred EcCcc--eeeeecCcceeEEEEEECCeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceE Q lcl|NC_015263. 154 DDKES--VMIQQFPNDICKISSVSGGVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSI 231 (513) Q Consensus 154 ~d~~~--~~iq~lp~dyckIsg~~nG~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~ 231 (513) -+..+ .-+.++|+++|+|.--.+|.....-.+.|. |. .. ......++...-+ T Consensus 115 r~~~G~~~~L~~i~~~~V~v~~~~~~~~~~~~~~~y~-------------------~~------~~-~g~~~~~~~~evi 168 (409) T protein:vir:10 115 FKKNGEIKGLYPLKSDGMKIFVDDTGLLNSENNVWYL-------------------YT------DD-LGQRHKFMSDEIL 168 (409) T ss_pred EcCCCcEEEEEEEcCCceEEEEcCCccccccceEEEE-------------------EE------eC-CceeEEeccccEE Confidence 76554 688999999999875444543322222221 00 00 0012334444444 Q ss_pred EEEe-cCccccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhc---eeeeeeeccccCCCCCccccCHHHHHHHHHHHHHh Q lcl|NC_015263. 232 CIKI-NESSLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNY---KLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMT 307 (513) Q Consensus 232 ~ik~-~~~~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~---~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~ 307 (513) -|+- ..+..+|++|...+...+-......+.. ..-+.|- ..++ ++| + .++.++++++.+.+++. T Consensus 169 h~r~~~~d~~~G~s~i~~~~~~i~~~~~~~~~~--~~~f~ng~~~~gil-~~~-------~--~l~~e~~~~~~~~~~~~ 236 (409) T protein:vir:10 169 HFKGLTADGLAGLSVIELLNHLIENGKSSETYL--NNFFKNGLQVKGLV-QYA-------G--DLNPEAEEVFKENFERM 236 (409) T ss_pred EecCcCCCCcccccHHHHHHHHHHHHHHHHHHH--HHHHhccCCCcEEE-EcC-------C--CCCHHHHHHHHHHHHHH Confidence 4542 3345678888766655443333333322 1123331 2222 122 1 25666776666666655 Q ss_pred c---c--ccceEEEecccccccccccccccch---hhhhhHHhhhhhhhhhhhhccCC-CcchHHHHHH-HHHHHHHHHH Q lcl|NC_015263. 308 V---P--DNVGVVTSPMEIDTVSFDKDSSTDD---SVEKATKNFWDNAGVSQILFSSD-NKTSQGIAMS-IATDEQFIFG 377 (513) Q Consensus 308 L---p--~gv~~v~sP~~~d~i~ld~~~~~~d---tv~~~~~~i~~~~GiS~~Lfn~d-~~s~~~~~~S-I~~d~~~~~~ 377 (513) . - .++..+-..+++..+.++ ..+.+ +.+-..++|..+.||...+++.. +.+++.+... +.--+.-+.- T Consensus 237 ~~g~~n~~~~~vl~~g~~~~~l~~~--~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~e~~~~~f~~~~l~P 314 (409) T protein:vir:10 237 SSGLKNAHRIAMLPIGYKFEPISQK--LVDAQFLENSQLTIRQIASVFGVKMHQLNDLDRATHSNITEQNREFYIDTLQS 314 (409) T ss_pred hccccccCCceecCCCceEEEccCC--hhhHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCccccHHHHHHHHHHHHHHH Confidence 4 1 123444444444444443 23333 34455688999999998888643 2233333222 2222222335 Q ss_pred HHHHHHHHHHHHHhh--c-ccceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhh Q lcl|NC_015263. 378 VINQLERWLNRYLLL--N-GMSKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENEML 454 (513) Q Consensus 378 ~~~~iE~~~N~~i~~--~-~~~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L 454 (513) ++++||..+|+.|-. . ..+..|+|.+-+....+.++.++.+.++..-|.=..--.-+.+|+.|.+ T Consensus 315 ~~~~ie~~ln~kL~~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~lgl~p~~------------ 382 (409) T protein:vir:10 315 ILNMYELEINYKLFLISEIKNGFYSKFNVDTILRADIKTRYESYKEAIQNGFKTPNEIRELEEDEPLE------------ 382 (409) T ss_pred HHHHHHHHHHHhhcCchhccCCcEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC------------ Confidence 899999999997742 2 2344577766666677888888888888777733222223345665532 Q ss_pred CcccccCcccccccccccccccCCccccCCCCcCCCCcccccc Q lcl|NC_015263. 455 DLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETTGNK 497 (513) Q Consensus 455 ~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~~n~ 497 (513) |-|..++|....-. ++.|.+..+. |.+ T Consensus 383 ggD~~~~~~n~~~~---------------~~~~~~~~kg-Ge~ 409 (409) T protein:vir:10 383 GGDVLLINGNMIPV---------------KMAGEQYSKG-GEK 409 (409) T ss_pred CcCeeeeccCccch---------------hhcccccccc-CCC Confidence 23444444321111 0111111000 000 No 37 >protein:vir:94426 Length: 409 # NCBI annotation: ORF009 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240003;genbank:gi:66395665;genbank:GeneID:5133086 Probab=98.41 E-value=7.8e-07 Score=54.06 Aligned_cols=386 Identities=11% Similarity=0.078 Sum_probs=179.4 Q ss_pred CCCccchheeeeehhhhhhHHHHHHHHHHHHHhhccCccccccc--ccccchHHHHHHHhhhccChhHHHHHHHHHHHHH Q lcl|NC_015263. 1 MVKNKKKRLSMIDVESISSYSNKRNNRISILRDDNRTPVFGAPV--GSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLA 78 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~--~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY 78 (513) |.|.+ .++...+ . ++......+. +.+ |+..+.+ ....++.-.| T Consensus 1 ~~~~~----------~~~~~k~-----~-~~~~~~~~~~--~~~~~~~~~~~~-----------------~~~~v~~~~a 45 (409) T protein:vir:94 1 MAKEN----------IVTRIKK-----K-LIDNWIDQSA--SKLYDFSPWKNK-----------------SFWGVINNTL 45 (409) T ss_pred Ccccc----------cchhhhh-----H-HhhhhhcCCc--ccccccccccCc-----------------cccccchhhh Confidence 33321 1222222 1 1111111111 111 1110000 1112233335 Q ss_pred hhcchHHHHHHHHhh-cccccceEeeccchhhhhhcchhHHHHHHHHHHh-hcC----hhHHHHHHHHHHHHhcceeEEE Q lcl|NC_015263. 79 VQSQQYQRLLNFYAN-MPLYAYSVVPFKDISTANENKLKKELATVTEFLS-RLN----PKYNFSKIVKLAMTVDIFYGYV 152 (513) Q Consensus 79 ~~sg~~~rlidy~~~-mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~-k~n----~k~~~~~i~~~~l~~g~~~gy~ 152 (513) .+++.+.+.|+++++ +..+...++ .... ... +.+...|. +=| -..+...++..++..|..|.++ T Consensus 46 ~~~~~v~~~i~~Ia~~ia~lp~~~~----~~~~---~~~---~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i 115 (409) T protein:vir:94 46 ETNETIFSAITKLSNSMASLPLKMY----EDYK---VVN---TEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLI 115 (409) T ss_pred hccHHHHHHHHHHHHhhhhCceeEe----eccc---ccc---hhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEE Confidence 556667777777653 333444443 1111 111 22333343 223 4555677888889999999999 Q ss_pred EEcC--cceeeeecCcceeEEEEEECC-eeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCc Q lcl|NC_015263. 153 IDDK--ESVMIQQFPNDICKISSVSGG-VYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKN 229 (513) Q Consensus 153 i~d~--~~~~iq~lp~dyckIsg~~nG-~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~k 229 (513) +-+. .++-+.+||+++|.+.-..+| .+.|.|... +... +.++..- T Consensus 116 ~r~~~G~~~~L~~l~~~~v~v~~~~~~~~~~y~~~~~---~g~~-----------------------------~~~~~~d 163 (409) T protein:vir:94 116 ERDIYHQPSKLFLLNPDVVEMLIENQSRELYYSIHAA---TGNK-----------------------------LIVHNMD 163 (409) T ss_pred EECCCCcEEEEEEEcCceeEEEEeCCCcEEEEEEEcC---CceE-----------------------------EEEcccc Confidence 8654 457899999999999866553 444444321 1111 1223322 Q ss_pred eEEEEe--cCccccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHh Q lcl|NC_015263. 230 SICIKI--NESSLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMT 307 (513) Q Consensus 230 t~~ik~--~~~~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~ 307 (513) -+-|+- -.+...|++|...+ .+.+++...-+--. +.+..---.-| + .....++.++++.+.+.+++. T Consensus 164 vih~r~~~~~~~~~G~s~l~~~-~~~i~~~~~~~~~~----~~~~~~~~~~i-~-----~~~~~l~~e~~~~~~~~~~~~ 232 (409) T protein:vir:94 164 MLHFKHIVASNMVQGISPIDVL-KNTTDFDNAVRTFN----LTEMQKPDSFM-L-----KYGSNVGKEKRQQVLEDFKQY 232 (409) T ss_pred EEEecCCCCCCccccccHHHHH-HHHHHHHHHHHHHH----HHhcCCCCeeE-E-----ecCCCCCHHHHHHHHHHHHHH Confidence 333431 12345677776543 44555443322111 11111000111 1 012236677777766666665 Q ss_pred ccc--cceEEEecccccccccccccccchh---hhhhHHhhhhhhhhhhhhccCCCc-chHHHHH-HHHHHHHHHHHHHH Q lcl|NC_015263. 308 VPD--NVGVVTSPMEIDTVSFDKDSSTDDS---VEKATKNFWDNAGVSQILFSSDNK-TSQGIAM-SIATDEQFIFGVIN 380 (513) Q Consensus 308 Lp~--gv~~v~sP~~~d~i~ld~~~~~~dt---v~~~~~~i~~~~GiS~~Lfn~d~~-s~~~~~~-SI~~d~~~~~~~~~ 380 (513) .-+ ++..+-..+++..+.+. ..+.+. .+-..++|..+.||...++|+... +.+.+.. ...--..-+.-+++ T Consensus 233 ~~~~g~~~vl~~g~~~~~l~~~--~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~ 310 (409) T protein:vir:94 233 YEENGGILFQEPGVEIEPLPKK--YVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNRFYLQHTLLPIVK 310 (409) T ss_pred hhcCCCeeecCCCceEEEcCCC--hhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHH Confidence 532 33333333444444432 222233 334558899999999998876432 3233322 22222323446899 Q ss_pred HHHHHHHHHHhhc--c-cceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhhCcc Q lcl|NC_015263. 381 QLERWLNRYLLLN--G-MSKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENEMLDLP 457 (513) Q Consensus 381 ~iE~~~N~~i~~~--~-~~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~ 457 (513) +||..+|+.|-.. . .+..|+|..-+....+.++.++.+.++..-|.=..--.-+.+|+.|.+ +-| T Consensus 311 ~ie~~ln~~Ll~~~~~~~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~------------ggD 378 (409) T protein:vir:94 311 QYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVE------------GGD 378 (409) T ss_pred HHHHHHHHhhCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC------------CcC Confidence 9999999987432 1 234577766666667788888888888777743222233345666642 223 Q ss_pred cccCcccccccccccccccCCccccCCCCcCCCCcc Q lcl|NC_015263. 458 EIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNET 493 (513) Q Consensus 458 ~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et 493 (513) ..+.|. +...-......... ..-|+..++|+ T Consensus 379 ~~~~~~---n~~~~~~~~~~~~~--~kGG~~n~~e~ 409 (409) T protein:vir:94 379 KPLISG---DLYPIDTPLELRKS--LKGGDKNVNES 409 (409) T ss_pred eEeecc---cccccccchhhccc--ccCCCCCcCCC Confidence 333332 21111000000000 00111111111 No 38 >protein:vir:93943 Length: 409 # NCBI annotation: ORF010 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239936;genbank:gi:66395598;genbank:GeneID:5131009 Probab=98.39 E-value=9.1e-07 Score=53.71 Aligned_cols=386 Identities=11% Similarity=0.094 Sum_probs=180.1 Q ss_pred CCCccc-hheeeeehhhhhhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHh Q lcl|NC_015263. 1 MVKNKK-KRLSMIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAV 79 (513) Q Consensus 1 ~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~ 79 (513) |-|.+- .|+ ...+....-..+. +.++. +.|... .....++.--+. T Consensus 1 ~~~~~~~~~~-----------------~~~~~~~~~~~~~--~~~~~--------------~~~~~~-~~~~~v~~~~~~ 46 (409) T protein:vir:93 1 MAKENIVTRI-----------------KKKLIDNWIDQST--SKLYD--------------FSPWKN-RSFWGVINNTLE 46 (409) T ss_pred CCccchhhhh-----------------hhhhhhhhhcccc--ccccc--------------cccccC-ccccccchhhhh Confidence 333221 110 0001111111111 11111 001000 011112233345 Q ss_pred hcchHHHHHHHHhh-cccccceEeeccchhhhhhcchhHHHHHHHHHHh-hcC----hhHHHHHHHHHHHHhcceeEEEE Q lcl|NC_015263. 80 QSQQYQRLLNFYAN-MPLYAYSVVPFKDISTANENKLKKELATVTEFLS-RLN----PKYNFSKIVKLAMTVDIFYGYVI 153 (513) Q Consensus 80 ~sg~~~rlidy~~~-mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~-k~n----~k~~~~~i~~~~l~~g~~~gy~i 153 (513) .++.+.+.|++++. +..+.-.++ .+ ++... +.+...|. +=| -..+...++..++..|..|.++. T Consensus 47 ~~~~V~~ci~~Ia~~ia~lp~~~~----~~---~~~~~---~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~ 116 (409) T protein:vir:93 47 TNETIFSAITKLSNSMASLPLKMY----ED---YKVVN---TEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIE 116 (409) T ss_pred ccHHHHHHHHHHHHhhhhCceeEe----ec---ccccc---chHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEE Confidence 66667777777653 222333332 11 11111 22333343 233 45566788888899999999998 Q ss_pred EcCc--ceeeeecCcceeEEEEEECC-eeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCce Q lcl|NC_015263. 154 DDKE--SVMIQQFPNDICKISSVSGG-VYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNS 230 (513) Q Consensus 154 ~d~~--~~~iq~lp~dyckIsg~~nG-~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt 230 (513) -+.. ..-+.+||+++|.+.--.+| .+.|.|...- .. =+.++...- T Consensus 117 r~~~G~~~~L~~l~~~~v~~~~~~~~~~~~y~~~~~~---g~-----------------------------~~~~~~~eV 164 (409) T protein:vir:93 117 RDIYHQPSKLFLLNPDVVEMLIENQSRELYYSIHAAT---GN-----------------------------KLIVHNMDM 164 (409) T ss_pred ECCCCcEEEEEEEcCceeEEEEeCCCcEEEEEEEcCC---ce-----------------------------EEEEccccE Confidence 7654 46899999999999865553 3444443211 00 012333333 Q ss_pred EEEEe--cCccccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhc Q lcl|NC_015263. 231 ICIKI--NESSLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTV 308 (513) Q Consensus 231 ~~ik~--~~~~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~L 308 (513) +-|+- ..+..+|++|...+ .+.+++...-+--. +.+..---+-|-. ....++.++++.+.+.+++.. T Consensus 165 ih~r~~~~~~~~~G~s~i~~~-~~~i~~~~~~~~~~----~~~~~~~~~~i~~------~~~~l~~e~~~~~~~~~~~~~ 233 (409) T protein:vir:93 165 LHFKHIVASNMVQGISPIDVL-KNTTDFDNAVRTFN----LTEMQKPDSFMLK------YGSNVGKEKRQQVLEDFKQYY 233 (409) T ss_pred EEeCCCCCCCccccccHHHHH-HHHHHHHHHHHHHH----HHhcCCCCceEEe------cCCCCCHHHHHHHHHHHHHHh Confidence 33432 12344677776554 44444443222111 1111100011110 112367777777777777666 Q ss_pred cc--cceEEEecccccccccccccccchh---hhhhHHhhhhhhhhhhhhccCCCc-chHHHHH-HHHHHHHHHHHHHHH Q lcl|NC_015263. 309 PD--NVGVVTSPMEIDTVSFDKDSSTDDS---VEKATKNFWDNAGVSQILFSSDNK-TSQGIAM-SIATDEQFIFGVINQ 381 (513) Q Consensus 309 p~--gv~~v~sP~~~d~i~ld~~~~~~dt---v~~~~~~i~~~~GiS~~Lfn~d~~-s~~~~~~-SI~~d~~~~~~~~~~ 381 (513) -+ ++..+-..+++..+.+. ..+.+. .+-..++|..+.||...++|+... +.+.+.. ...--..-+.-++++ T Consensus 234 ~~~g~~~vl~~g~~~~~l~~~--~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ 311 (409) T protein:vir:93 234 EENGGILFQEPGVEIEPLPKK--YVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNRFYLQHTLLPIVKQ 311 (409) T ss_pred hcCCCeeecCCCceEEEcCCC--hhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHH Confidence 33 23333333444444432 223233 334568899999999888876433 2223322 222223334468999 Q ss_pred HHHHHHHHHhhc-cc--ceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhhCccc Q lcl|NC_015263. 382 LERWLNRYLLLN-GM--SKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENEMLDLPE 458 (513) Q Consensus 382 iE~~~N~~i~~~-~~--~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~ 458 (513) ||..+|+.|-.. .. +..|+|..-+....+.++.++.+.++..-|.=..--.-+.+|+.|.+ +-|. T Consensus 312 ie~~l~~~Ll~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~------------ggD~ 379 (409) T protein:vir:93 312 YEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVE------------GGDK 379 (409) T ss_pred HHHHHHhhcCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC------------CcCe Confidence 999999977432 22 34577766666667888999998888877743332233346776642 2233 Q ss_pred ccCcccccccccccccccCCccccCCCCc-CCCCcc Q lcl|NC_015263. 459 IMTPLSSSFNTSGSDIAENAIKEKGKENG-RPTNET 493 (513) Q Consensus 459 ~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~g-rPt~et 493 (513) .++|.. ...-........ ...|| ..++|+ T Consensus 380 ~~~~~n---~~~~~~~~~~~~---~~~gG~~n~~e~ 409 (409) T protein:vir:93 380 PLISGD---LYPIDTPLELRK---SLKGGDKNVNES 409 (409) T ss_pred eeeccc---ccccccchhhcc---cccCCCCCcCCC Confidence 344322 111110000000 01111 111111 No 39 >protein:vir:1326 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047925;swissprot:trembl:q9zxb2;genbank:gi:9631143;uniprot:Q9ZXB2;genbank:GeneID:2715872 Probab=98.35 E-value=1.1e-06 Score=53.16 Aligned_cols=421 Identities=13% Similarity=0.066 Sum_probs=181.9 Q ss_pred HHHHHhhccCccc-cccccccc--chHHHHHHHhhhcc-ChhHHHHHHHHHHHHHhhcchHHHHHHHHh-hcccccceEe Q lcl|NC_015263. 28 ISILRDDNRTPVF-GAPVGSLT--SSQSKVRKIVKEYR-NEGNQKTLRKVSEDLAVQSQQYQRLLNFYA-NMPLYAYSVV 102 (513) Q Consensus 28 ~~i~~~~~~~~~~-~s~~~s~~--~s~d~~k~~i~~~~-P~~n~~~ir~~s~~lY~~sg~~~rlidy~~-~mpt~dY~I~ 102 (513) -++|..+-.-... ....+... +..|.. +-.+. +-.. -..++.--.-+.+.+.+.|++++ ++..+...++ T Consensus 1 Mg~~~~l~~r~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~---g~~V~~~~al~~~~V~~~v~~Ia~~iA~lp~~~~ 74 (457) T protein:vir:13 1 MGFWSALFGRGHSPALDGIEARAWEPYDPS---IYNLGAVAAS---GETVTPHDALQVSAVFASVRLLSETIATLPLSTY 74 (457) T ss_pred CchhhhhhcccccccccccccccccccchH---HHhhcccccC---CceechHHhhccHHHHHHHHHHHHhhccCceEEE Confidence 2222222111000 00000000 000000 00000 0000 00111111223445666666664 3455555554 Q ss_pred eccchhhhhhcchhHHHHHHHHHHhh----cChhHHHHHHHHHHHHhcceeEEEEEcC-cceeeeecCcceeEEEEEECC Q lcl|NC_015263. 103 PFKDISTANENKLKKELATVTEFLSR----LNPKYNFSKIVKLAMTVDIFYGYVIDDK-ESVMIQQFPNDICKISSVSGG 177 (513) Q Consensus 103 P~~~~~~~~~~~~~~~y~~v~~~L~k----~n~k~~~~~i~~~~l~~g~~~gy~i~d~-~~~~iq~lp~dyckIsg~~nG 177 (513) -- .... .+.. ........|+. +....++..++..++..|..|.+++.++ ..+.+.+||++.|.|.-..++ T Consensus 75 ~~--~~~~-~~~~--~~~~l~~~ln~~~n~~t~~~f~~~~~~~lll~Gna~~~i~~~~g~~~~l~~l~p~~v~v~~~~~~ 149 (457) T protein:vir:13 75 SK--RGGS-RKEI--VTPEWLDYPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWQGPNIVGLDVLDPTKIHVHMVMVD 149 (457) T ss_pred Ee--cCCc-cccc--ccchHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCcEEEEEEEccCceEEEEecCC Confidence 21 1111 1111 11223333443 4455677788888889999999987653 446889999999999755332 Q ss_pred e-e-----EEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEEec-Cc-cccchhhHHHH Q lcl|NC_015263. 178 V-Y-----NYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKIN-ES-SLTPVPPFAGT 249 (513) Q Consensus 178 ~-y-----~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~~-~~-~~~~ip~f~~v 249 (513) . . .|.++... +...+. ..++..-+.|+.. .+ ...|+||...+ T Consensus 150 ~~~~~~~~~y~~~~~~--~~~~~~----------------------------~~~~~diih~~~~~~~~~~~G~s~i~~~ 199 (457) T protein:vir:13 150 GLRRKVFEAYDIDADG--NEVLLG----------------------------WFTPRDVLHIPGMMLPGDFVGCSPISYA 199 (457) T ss_pred CccceeEEEEEEecCC--ceeeEE----------------------------eeCccceEEecCCCCCCccccccHHHHH Confidence 2 1 11111110 000111 1222233444432 22 35788887766 Q ss_pred HHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhcc--c---cceEEEeccccccc Q lcl|NC_015263. 250 FDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTVP--D---NVGVVTSPMEIDTV 324 (513) Q Consensus 250 ~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~Lp--~---gv~~v~sP~~~d~i 324 (513) ...+--....++... .-..|-...-.-|-+ ++ .++.++++++.+.+.++.- + ++..+-..+++..+ T Consensus 200 ~~~i~~~~~~~~~~~--~~f~ng~~p~gil~~-----~~--~ls~e~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l 270 (457) T protein:vir:13 200 RESIGLALAAQKYGS--KFFANGAMPGAVVEV-----PG--TMSEEGLARAREAWRAANSGVDNAHRVALLTEGAKFSKV 270 (457) T ss_pred HHHHHHHHHHHHHHH--HHHhcCCCcceEEEc-----CC--CCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEEc Confidence 655544444443321 111221111111111 11 4677788888888877762 2 23333334455555 Q ss_pred cccc-ccccchhhhhhHHhhhhhhhhhhhhccCCCcc----hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc--cce Q lcl|NC_015263. 325 SFDK-DSSTDDSVEKATKNFWDNAGVSQILFSSDNKT----SQGIAMSIATDEQFIFGVINQLERWLNRYLLLNG--MSK 397 (513) Q Consensus 325 ~ld~-~~~~~dtv~~~~~~i~~~~GiS~~Lfn~d~~s----~~~~~~SI~~d~~~~~~~~~~iE~~~N~~i~~~~--~~~ 397 (513) .++. +..--.+.+-..++|..+.||...++|....+ ++.-...+.-...-+.-++++||..+|+.|-... .+. T Consensus 271 ~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~eq~~~~f~~~tl~P~~~~ie~~ln~~L~~~~~~~~~ 350 (457) T protein:vir:13 271 AMSPDEAQFLQTRQFQVPEIARIFGVPPHLISDATNSTSWGSGLAEQNIAFTMFSLRPWLERIEAGFNRLLFAETADRFR 350 (457) T ss_pred cCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccccCce Confidence 4431 11112334455578999999998888533222 2222233222222344688999999999885432 223 Q ss_pred EEEEEecCCCCccHHHHHHHHHHHHhcCC-cHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCccccccccccccccc Q lcl|NC_015263. 398 YFKATMLEVTHFSKKEAHDRYITDAQYGF-PVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAE 476 (513) Q Consensus 398 ~f~~~~l~~T~fn~ke~~~~~~~~~~~G~-~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~ 476 (513) .|+|..-+...-+.++.++.+.++..-|. .+=..-+ .+|+.|.+ -. .-+..++|+-. +.-|.. .+ T Consensus 351 ~i~fd~~~l~~~D~~~r~~~~~~~~~~G~~T~NE~R~-~~gl~Pi~--------~g--~~d~~~~~~n~--~~~~~~-~~ 416 (457) T protein:vir:13 351 FVKFNLDEIKRGAPKERMELWSLGLQNGIYSIDEVRA-AEDMTPLP--------DG--LGEKYRVPLNL--GEVGEE-PE 416 (457) T ss_pred eEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHH-HhCCCCCC--------CC--cccceeecccc--cccccc-cc Confidence 45665555556688899999888887773 3333333 35666632 10 11344444321 111111 01 Q ss_pred CCccccCCCCcCCCCccc-----ccccCCCCCCCCCCccCC Q lcl|NC_015263. 477 NAIKEKGKENGRPTNETT-----GNKDSDETQRAKDKPANT 512 (513) Q Consensus 477 ~~~~~~~~~~grPt~et~-----~n~~~~~~~~~~d~~~~~ 512 (513) ..+.+...+.+.|+.+.. ..+..+++..+.+...++ T Consensus 417 ~~~~~~~~~~~~~~~~~~~~~~~~g~~d~~~~~~~~~~~~~ 457 (457) T protein:vir:13 417 PEPAPAPPAIEPPAEEPDEEPEPEGKPDDEGATEEDDEDDA 457 (457) T ss_pred ccccCCCCCCCCCccccCCCCCCCCCCccccCCCCcccccC Confidence 111111111111211111 111111111111111111 No 40 >protein:vir:4156 Length: 542 # NCBI annotation: portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046965;genbank:gi:9630535;genbank:GeneID:1261709 Probab=98.32 E-value=1.4e-06 Score=52.70 Aligned_cols=433 Identities=11% Similarity=0.067 Sum_probs=186.3 Q ss_pred eeeh-hhhhhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhcc-ChhHHHHHHHHHHHHHhhcchHHHHH Q lcl|NC_015263. 11 MIDV-ESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYR-NEGNQKTLRKVSEDLAVQSQQYQRLL 88 (513) Q Consensus 11 ~~~~-~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~-P~~n~~~ir~~s~~lY~~sg~~~rli 88 (513) |.+- =||.+-++-... .-.++..- .+. ......|+ |-=+-+.|+ -++.+|+.+++.| T Consensus 1 ~~~~~~~i~s~~~~~~i---~~~~~~s~----------~~~----~~~~~~~~~pp~~~~~la----~l~~~n~~v~scI 59 (542) T protein:vir:41 1 MFNYHLSIRSLEKYKAI---KREEVESQ----------ALG----ETRFEEYVEPKVNPLVLL----SLLQVNPYHASAC 59 (542) T ss_pred Cccccccccccccchhh---hhcccccc----------ccc----cccCCccccCCCCHHHHH----HHHhhcHHHHHHH Confidence 3221 122222211100 00000000 000 00011222 222333333 3777888999999 Q ss_pred HHHh-hcccccceEeeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcceeEEEEEcC--cceeeeecC Q lcl|NC_015263. 89 NFYA-NMPLYAYSVVPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDIFYGYVIDDK--ESVMIQQFP 165 (513) Q Consensus 89 dy~~-~mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~~~gy~i~d~--~~~~iq~lp 165 (513) +.++ ++..+.+.+.+-... .....+.+ ..++...++..+..+++..|..|.+.+-+. ....+.++| T Consensus 60 ~~ia~~IA~l~~~~~~~~~~--~l~~~lpN---------~~~s~~~f~~~~v~~lll~Gnayi~i~rd~~G~~~~L~~l~ 128 (542) T protein:vir:41 60 SIKANDIIRTGYILEGDDEG--VVDEFIRA---------CKPSFEYVLLRALEDLQVFNYCTLEVVRDDRGDPIRFEYIP 128 (542) T ss_pred HHHHHHHhhCceeeecccch--hhhhhcCC---------CCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEc Confidence 9987 566666666432111 00000000 113345677888889999999999988665 446789999 Q ss_pred cceeEEEEEECCeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccC-eeecCCceEEEEec--Cccccc Q lcl|NC_015263. 166 NDICKISSVSGGVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNW-YEIQDKNSICIKIN--ESSLTP 242 (513) Q Consensus 166 ~dyckIsg~~nG~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W-~~L~~~kt~~ik~~--~~~~~~ 242 (513) +.+|++.--. +.|....+-.- ......|.- .+. . +.....| ..++..--+-|+.. .+..+| T Consensus 129 ~~~v~v~~d~-~~~~~~~~~~~---~~~~~~y~~-------~~~----~-~~~~g~~~~~~~~~eIiHir~~~~~~~~~G 192 (542) T protein:vir:41 129 SHTIRVHKDG-SRYRQTWDGVN---ITHFKDYRY-------EGE----I-NPETGEDQDSVGANELVFIHIPSPVCSYYG 192 (542) T ss_pred CcceEEEEcC-CeeEeeecCCc---ceeEEeecc-------ccc----c-cccccccccccCcccEEEecCCCCCCCccc Confidence 9999987322 22222111110 000000000 000 0 0000111 12333333445543 345689 Q ss_pred hhhHHHHHHhHHHHHHHHHHHhhHhhhhhceee--eeeeccc-cCCCCCccccCHHHHHHHHHHHHHhccc-----cceE Q lcl|NC_015263. 243 VPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLL--IQKLETR-SSNDNNDFTLDMPMMNYFHEALSMTVPD-----NVGV 314 (513) Q Consensus 243 ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii--~~kip~~-~~n~~~~~~vd~~~~~~~~~~ik~~Lp~-----gv~~ 314 (513) +||..++...+.-....++.... -..|-... +-++|=. ......+-.++.++++.+.+.+++++-. |... T Consensus 193 lspi~~~~~~i~~~~~~~~~~~~--~f~Ng~~p~gIL~~~~~l~de~~~~~~~~~e~~~~lk~~~~~~~~g~~~n~gk~~ 270 (542) T protein:vir:41 193 VPRYVSAAPAILAMQKIDEYNYA--FFDNYTIPSYVITVTGEFEDELEEDPDGNPTGRTVIQALIEDNFKHLKEAPHTPL 270 (542) T ss_pred ccHHHHHHHHHHHHHHHHHHHHH--HHhccCCccEEEEeCCccccccccccccCHHHHHHHHHHHHHHHhhhhcccCcee Confidence 99999988776555555543321 23332211 2233310 0112234457777777777777665521 2223 Q ss_pred EE-e------cccccccccccccccchh---hhhhHHhhhhhhhhhhhhccCCCcc---hHHH-HHHHHHHHHHHHHHHH Q lcl|NC_015263. 315 VT-S------PMEIDTVSFDKDSSTDDS---VEKATKNFWDNAGVSQILFSSDNKT---SQGI-AMSIATDEQFIFGVIN 380 (513) Q Consensus 315 v~-s------P~~~d~i~ld~~~~~~dt---v~~~~~~i~~~~GiS~~Lfn~d~~s---~~~~-~~SI~~d~~~~~~~~~ 380 (513) |+ . .+++..+.+. ..+.+. .+-..++|-.+.||...++|....+ ++.+ .....-....+.-+++ T Consensus 271 vL~~~~~~~~g~~~~pl~~~--~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~t~n~sn~Eq~~~~f~~~tL~P~~~ 348 (542) T protein:vir:41 271 VFSIPGGDTVKVTFTPLNTS--QKELSFREYAAEKKYDIAAAHMIDPYRLGIADTGPLGGNFAEVTRRTYYESVVRPQQN 348 (542) T ss_pred EeeccCCcccceeEEEcCCC--hhHHHHHHHHHHHHHHHHHHhCCCHHHhCcCCCcccccccHHHHHHHHHHHHHHHHHH Confidence 32 1 2233333222 222222 3445578999999999888654222 1222 2233333444456999 Q ss_pred HHHHHHHHHHhhcccceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhC-CCHHHHHHHHHHHHHhhCcccc Q lcl|NC_015263. 381 QLERWLNRYLLLNGMSKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMG-IDPVAFTGLLKVENEMLDLPEI 459 (513) Q Consensus 381 ~iE~~~N~~i~~~~~~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G-~~p~~~~~~~~~E~e~L~l~~~ 459 (513) +||..+|+.|....- ..+.|.|-...-. +.+...++.. +.. .| ++|.|+-..+ + ..=+.++. T Consensus 349 ~ie~~ln~~L~~~~~-~~~~~~f~~~~ll-~~d~~~~~~~-----------~v~-~GilT~NE~Re~L--~-g~~pgdd~ 411 (542) T protein:vir:41 349 IISSILTDFFQVKFN-PKTRFKFNDETLL-ESDSVRNCAL-----------LVQ-SGVLTPAEARERL--F-GLDGGPDI 411 (542) T ss_pred HHHHHHHhhcccccC-CceEEEecchhhc-chHHHHHHHH-----------HHh-CCCCCHHHHHHhh--C-CCCCCCcc Confidence 999999998864322 2234444333222 2222332222 222 36 5888875322 1 11133343 Q ss_pred c-CcccccccccccccccCCcc------ccCCCCcCC-CCccc----------ccccCCCCCCCCCCccCCC Q lcl|NC_015263. 460 M-TPLSSSFNTSGSDIAENAIK------EKGKENGRP-TNETT----------GNKDSDETQRAKDKPANTQ 513 (513) Q Consensus 460 ~-~Pl~TS~T~Sg~~~~~~~~~------~~~~~~grP-t~et~----------~n~~~~~~~~~~d~~~~~~ 513 (513) + .|...+-.. ......+... .|.++.+.| .+++. +.++..+++.++++-.-++ T Consensus 412 ~l~p~~~~~~~-~~~~~~n~~~~~~~~~~k~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 482 (542) T protein:vir:41 412 FMVPSKGAAKS-VKRQERNYEKNQIREIRKIYAKYRPRFNEIISSKLSAEEKKKKIDESLAEFRAEAYEAGK 482 (542) T ss_pred ccccccccccc-cccCCcCCCCCchhhhhhcccccCccccccccccccchhhcccccchhhhhHHhHHhcCc Confidence 3 332221110 0001111110 111222333 11111 1112222222222222222 No 41 >protein:vir:81152 Length: 411 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285809;genbank:gi:148747730;genbank:GeneID:5247195 Probab=98.31 E-value=1.5e-06 Score=52.54 Aligned_cols=387 Identities=12% Similarity=0.090 Sum_probs=179.9 Q ss_pred hhhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhcc---ChhHHHHHHHHHHHHHhhcchHHHHHHHHhh Q lcl|NC_015263. 17 ISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYR---NEGNQKTLRKVSEDLAVQSQQYQRLLNFYAN 93 (513) Q Consensus 17 ~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~---P~~n~~~ir~~s~~lY~~sg~~~rlidy~~~ 93 (513) ..-+.++.. +-... ++..+ .+. ..+-.|+ +......+ ..+.+.+.|+.+++ T Consensus 1 MG~~~~~~~--------~~~~~---~~~~~--~~~----~~~~~~~g~~~~~~~~al---------~~~~V~~~v~~Ia~ 54 (411) T protein:vir:81 1 MGWWSRLTR--------FFRPR---NETVD--MTN----PLLLQWLGVDPDTPRNQL---------SEATYFACLKILSE 54 (411) T ss_pred CchHHHHHh--------hccCc---ccccc--cch----HHHHHHhcCcccChhhhh---------ccHHHHHHHHHHHH Confidence 011111100 00000 00000 011 1122222 11111111 24456666776654 Q ss_pred -cccccceEeeccchhhhhhcchhHHHHHHHHHHh-h----cChhHHHHHHHHHHHHhcceeEEEEEcC-cceeeeecCc Q lcl|NC_015263. 94 -MPLYAYSVVPFKDISTANENKLKKELATVTEFLS-R----LNPKYNFSKIVKLAMTVDIFYGYVIDDK-ESVMIQQFPN 166 (513) Q Consensus 94 -mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~-k----~n~k~~~~~i~~~~l~~g~~~gy~i~d~-~~~~iq~lp~ 166 (513) +..+...++--.. ++..+..-+.+...|. + +.-..+...++..++..|..|.+++.++ ....+.++|+ T Consensus 55 ~iA~lp~~~~~~~~-----~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~g~~~~l~~l~~ 129 (411) T protein:vir:81 55 SLGKLPLKMYQKTE-----RGIVKSDREELYNLLKLRPNPYMTSSVFWSTVEMNRNHYGNAYVWCQYSGPQLQALWILPS 129 (411) T ss_pred hHhhCceeEEEecC-----CceeeecccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCceEEEEEECC Confidence 2244444442111 1111111123344453 2 3355777888888899999999988654 4457899999 Q ss_pred ceeEEEEEECCeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEEec--Cccccchh Q lcl|NC_015263. 167 DICKISSVSGGVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKIN--ESSLTPVP 244 (513) Q Consensus 167 dyckIsg~~nG~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~~--~~~~~~ip 244 (513) ++|.+.--.+|.+.......| .| +......=++++.+.-+-|+.+ .+...|++ T Consensus 130 ~~v~~~~~~~~~~~~~~~~~~-------------------~~------~~~~~g~~~~~~~~eiih~k~~~~~~~~~G~s 184 (411) T protein:vir:81 130 QYVTIVVDDRGLLGEKNAIWY-------------------RY------NDPYDGKMYVFRNDEILHFKTSVTFDGITGLS 184 (411) T ss_pred ceEEEEEcCcccccccceEEE-------------------EE------EecCCceEEEEccccEEEEcCCCCCCCccccc Confidence 999998545554321111111 00 0000011234566556666643 34567888 Q ss_pred hHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhc--ccc---ceEEEecc Q lcl|NC_015263. 245 PFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTV--PDN---VGVVTSPM 319 (513) Q Consensus 245 ~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~L--p~g---v~~v~sP~ 319 (513) |...+...+--....++.... -..|-...-..|-+ + -.++.++++++.+.+++.. +++ +..+-..+ T Consensus 185 ~~~~~~~~i~~~~~~~~~~~~--~f~ng~~p~gil~~---~----~~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~ 255 (411) T protein:vir:81 185 VRDVLKHTVDGALESQKFMNN--LYKTGLTGKAVLEY---T----GDLNQEARDRLVKGFEQFANGSKNAGKIIPVPLGM 255 (411) T ss_pred HHHHHHHHHHHHHHHHHHHHH--HHhccCCCceEEEe---C----CCCCHHHHHHHHHHHHHHhcCccccCCceecCCCc Confidence 887777655555544443321 12232112221211 1 1256677777777777655 222 22333334 Q ss_pred ccccccccc-ccccchhhhhhHHhhhhhhhhhhhhccCCC-cchHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhhc--- Q lcl|NC_015263. 320 EIDTVSFDK-DSSTDDSVEKATKNFWDNAGVSQILFSSDN-KTSQGI-AMSIATDEQFIFGVINQLERWLNRYLLLN--- 393 (513) Q Consensus 320 ~~d~i~ld~-~~~~~dtv~~~~~~i~~~~GiS~~Lfn~d~-~s~~~~-~~SI~~d~~~~~~~~~~iE~~~N~~i~~~--- 393 (513) ++..+.++. +..--...+-..++|..+.||+..++|... ++.+.+ ...+.--..-+.-++++||.++|+.|-.. T Consensus 256 ~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n~e~~~~~f~~~~l~P~~~~ie~~l~~~ll~~~~~ 335 (411) T protein:vir:81 256 KLVPLDIKLTDSQFFELKKYTALQIAAAFGIKPNQINDYEKSSYASAEAQNLAFYVDTLLYVLKQYEEEITYKILSNDLI 335 (411) T ss_pred eEEEccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcCChhhc Confidence 444444321 111112344556889999999998885432 222222 22222233334468999999999987432 Q ss_pred ccceEEEEEecCCCCccHHHHHHHHHHHHhcCC-cHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCccccccccccc Q lcl|NC_015263. 394 GMSKYFKATMLEVTHFSKKEAHDRYITDAQYGF-PVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGS 472 (513) Q Consensus 394 ~~~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~-~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~ 472 (513) ..+..|+|..-+....+.++.++.+.++..-|. ..=..- +.+|+.|.+ +-|..++|+. ++ .=. T Consensus 336 ~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~g~~t~NE~R-~~~gl~p~~------------ggD~~~~~~n--~~-pl~ 399 (411) T protein:vir:81 336 SQGHYFKFNVNVILRADIKTQMDSLSTAVQNGIMTPNEAR-DYLDMPADD------------YGNNLMANGN--YI-PLS 399 (411) T ss_pred CCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHH-HHhCCCCCC------------CCCeeeeccC--cc-chh Confidence 223456766666666788888888888877773 333322 235665532 1122222211 01 000 Q ss_pred ccccCCccccCCCCcCCCCcccccccCCCC Q lcl|NC_015263. 473 DIAENAIKEKGKENGRPTNETTGNKDSDET 502 (513) Q Consensus 473 ~~~~~~~~~~~~~~grPt~et~~n~~~~~~ 502 (513) ..+++. ..|| ++ T Consensus 400 ~~~~~~-----~kgG-------------d~ 411 (411) T protein:vir:81 400 MLGANY-----GKGG-------------DS 411 (411) T ss_pred hhhhhh-----ccCC-------------CC Confidence 000000 0111 00 No 42 >protein:vir:4194 Length: 540 # NCBI annotation: putative portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071819;genbank:gi:11863102;genbank:GeneID:1257604 Probab=98.31 E-value=1.5e-06 Score=52.53 Aligned_cols=425 Identities=12% Similarity=0.082 Sum_probs=177.9 Q ss_pred eeehhhhhhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhcc-ChhHHHHHHHHHHHHHhhcchHHHHHH Q lcl|NC_015263. 11 MIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYR-NEGNQKTLRKVSEDLAVQSQQYQRLLN 89 (513) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~-P~~n~~~ir~~s~~lY~~sg~~~rlid 89 (513) |... ..-++++++-. ..++..-++.......-.|+ |-=|.+.|++ ++..++++++.|+ T Consensus 1 ~~~~-------------~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~pp~~~~~La~----~~~~n~~v~scI~ 59 (540) T protein:vir:41 1 MFNY-------------HLSIKSLEKYR----AIKGDTDSQALKEDRFEEYVEPKVHPLVLLS----LLQVNPYHASACS 59 (540) T ss_pred CCCc-------------ccChhhccchh----hhhccccccccccCCCCccccCCCCHHHHHH----HHHhcHHHHHHHH Confidence 1110 00011111110 01111111111122222333 3334555543 4666777778888 Q ss_pred HHhhc-ccccceEeeccchhhhhhcchhHHHHHHHHHH--hhcChhHHHHHHHHHHHHhcceeEEEEEcCc--ceeeeec Q lcl|NC_015263. 90 FYANM-PLYAYSVVPFKDISTANENKLKKELATVTEFL--SRLNPKYNFSKIVKLAMTVDIFYGYVIDDKE--SVMIQQF 164 (513) Q Consensus 90 y~~~m-pt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L--~k~n~k~~~~~i~~~~l~~g~~~gy~i~d~~--~~~iq~l 164 (513) .++.. ..+.+.|..-.. . ...+| ..++...++..++.+++..|..|.+++.+.. ..-+.++ T Consensus 60 ~ia~~ia~~~~~i~~~~~-------~-------~~~~lpN~~~t~~~f~~~~v~dlll~Gnayv~i~r~~~G~~~~L~~i 125 (540) T protein:vir:41 60 IKANDILRTGYLIDGDDG-------G-------VEELLRACRPSFEFILLQALEDLQVFNYCTLEVVRDDQGEPVRLDYI 125 (540) T ss_pred HHHHHHhcCCceEecCcc-------c-------hhhhccCCCCCHHHHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEe Confidence 76543 445555542110 0 11111 1123456677888889999999999987654 4688999 Q ss_pred CcceeEEEEEECCeeEEEEEe---eeccCcchhccccHHHHHHHHHHhhhhhccCcccccC-eeecCCceEEEEec--Cc Q lcl|NC_015263. 165 PNDICKISSVSGGVYNYVIDL---DALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNW-YEIQDKNSICIKIN--ES 238 (513) Q Consensus 165 p~dyckIsg~~nG~y~~~fD~---syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W-~~L~~~kt~~ik~~--~~ 238 (513) |+++|++.--.++ |....|- .||..... +... +.....+ ..++.+--+-|+.. .+ T Consensus 126 ~~~~V~v~~~~~~-~~~~~d~~~~~~~~~~~~----~~~~--------------~~~~g~~~~~~~~~eViHir~~~~~~ 186 (540) T protein:vir:41 126 PAHTVRVHRDGSR-YMQTWDGIHVTYFKDYRY----EGEV--------------NPDNGEDQDGVGANEIIFIHLPSPIC 186 (540) T ss_pred CCcceEEeEcCce-eEeeecCceeeeeecccc----ccee--------------eccccccceeecccceEEecCCCCCC Confidence 9999998733222 2222221 12211110 0000 0000112 34555445556643 34 Q ss_pred cccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceee--eeeeccccCCCCCccccCH--HHHHHHHHHHHHhcc---c- Q lcl|NC_015263. 239 SLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLL--IQKLETRSSNDNNDFTLDM--PMMNYFHEALSMTVP---D- 310 (513) Q Consensus 239 ~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii--~~kip~~~~n~~~~~~vd~--~~~~~~~~~ik~~Lp---~- 310 (513) ..+|+||...+...+.-....++.. ..-..|-... +-++|- .-.+....+-+. ...+.+.+....++- + T Consensus 187 ~~~G~Spi~~~~~~i~~~~~~~~~~--~~~f~Ng~~p~giL~~~g-~l~~e~~~~~~~~~~~~~~~~~~~~~~~~g~~~n 263 (540) T protein:vir:41 187 SYYGVPRYLSAAPSILAMQKIDEYN--YAFFDNYTIPSYVITVTG-EFEDEMELGSDGEPTGRTVLQGLIEDNFKYLKEA 263 (540) T ss_pred CcccccHHHHHHHHHHHHHHHHHHH--HHHHhccCCCceEEEeCc-ccCchhccchHHHHHHHHHHHHHHHHHhcccccc Confidence 5689999998877666655555433 1223332222 122220 002222222222 222334444444331 1 Q ss_pred -cceEEE-------ecccccccccccccccch---hhhhhHHhhhhhhhhhhhhccCC---Ccc-hHHHHHHHHHHHHHH Q lcl|NC_015263. 311 -NVGVVT-------SPMEIDTVSFDKDSSTDD---SVEKATKNFWDNAGVSQILFSSD---NKT-SQGIAMSIATDEQFI 375 (513) Q Consensus 311 -gv~~v~-------sP~~~d~i~ld~~~~~~d---tv~~~~~~i~~~~GiS~~Lfn~d---~~s-~~~~~~SI~~d~~~~ 375 (513) |...|+ ..+++..+.+. ..+.+ +.+-+.+.|-.+.||...++|-. +.+ ++.-.....--...+ T Consensus 264 ag~~~vLe~~~~~~~g~~~~pl~~~--~~d~qfle~~~~~~~eIa~afgVPp~~lG~~~~~~~n~sn~eq~~~~f~~~tL 341 (540) T protein:vir:41 264 PHTPLVFSIPGGDTVEVTFTPLNTS--QKELSFREYAAEKKHDIAAAHMIDPYRLGITDVGPLGGNFAEVARRTYYESVV 341 (540) T ss_pred ccceEEEecCCCcccceeEEecccc--hhHHHHHHHHHHHHHHHHHHhCCCHHHcCcccCCCCCcccHHHHHHHHHHHHH Confidence 222332 13444444442 22223 34556688999999999998622 122 222223333334444 Q ss_pred HHHHHHHHHHHHHHHhhc-ccceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHH-HHHHHh Q lcl|NC_015263. 376 FGVINQLERWLNRYLLLN-GMSKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLL-KVENEM 453 (513) Q Consensus 376 ~~~~~~iE~~~N~~i~~~-~~~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~-~~E~e~ 453 (513) .-++++||..+|+.|... ..+..|+|...++. +.+....+..+ ...--++|.|+-..+ -+| T Consensus 342 ~P~~~~ie~~ln~~L~~~~~~~~~i~f~~~~ll---~~D~~~~~~~l-----------v~~G~lT~NE~Re~L~g~e--- 404 (540) T protein:vir:41 342 RPQQEIVSSVLTDFIQLKLDPGARFVFNEEILM---ESEFVHNYALL-----------VQCGVLTPSEVREKLFGLD--- 404 (540) T ss_pred HHHHHHHHHHHHHhhhhccCCceEEEecchhhc---chHHHHHHHHH-----------HhCCCCCHHHHHHHhCcCc--- Confidence 579999999999988653 22333444433332 22333333222 222125777775322 112 Q ss_pred hCcccc-cCcccccccccccccccCCccccC------CCCcCCCCccccc-ccCCCCCCC----------CCCccCCC Q lcl|NC_015263. 454 LDLPEI-MTPLSSSFNTSGSDIAENAIKEKG------KENGRPTNETTGN-KDSDETQRA----------KDKPANTQ 513 (513) Q Consensus 454 L~l~~~-~~Pl~TS~T~Sg~~~~~~~~~~~~------~~~grPt~et~~n-~~~~~~~~~----------~d~~~~~~ 513 (513) +.++. +.|.......-.+ ...+..+++. .+...|..+...+ ....+..++ +++-..++ T Consensus 405 -~gdd~~l~p~n~~~~~~~~-~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 480 (540) T protein:vir:41 405 -GGPDMFMVPSSIGKSAMKR-QKRNYEKNQINEIKRTYAKYKPRIQEIISSESPLEDKKKKIDEVLSDFRAEAYENGK 480 (540) T ss_pred -CCCcccccccccccccccc-cccccCCCCccccccccchhcccccCccccccccccccccccccccccCCccccchh Confidence 23333 4443322211111 1111111111 1111221111110 000011011 11111111 No 43 >protein:vir:96980 Length: 409 # NCBI annotation: ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239857;genbank:gi:66395516;genbank:GeneID:5133013 Probab=98.20 E-value=2.6e-06 Score=51.17 Aligned_cols=384 Identities=11% Similarity=0.112 Sum_probs=179.4 Q ss_pred CCCccchheeeeehhhhhhHHHHHHHHHHHHHhhccCccccccc--ccccchHHHHHHHhhhccChhHHHHHHHHHHHHH Q lcl|NC_015263. 1 MVKNKKKRLSMIDVESISSYSNKRNNRISILRDDNRTPVFGAPV--GSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLA 78 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~--~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY 78 (513) |.|.+ .++.. ...++.....-+. +.+ |+..+.+ .+ ..++.-.+ T Consensus 1 ~~~~~----------~~~~~------k~~~~~~~~~~~~--~~~~~~~~~~~~--------~~---------~~v~~~~a 45 (409) T protein:vir:96 1 MAKEN----------IVTRI------KKKLIDNWIDQSA--SKLYDFSPWKNK--------SF---------WGVINNTL 45 (409) T ss_pred Ccccc----------chhhh------hhHHhhhhhcccc--ccccccccccCc--------cc---------cccchhhH Confidence 33321 12211 1122222222221 111 1111100 00 11222234 Q ss_pred hhcchHHHHHHHHhh-cccccceEeeccchhhhhhcchhHHHHHHHHHHh-h----cChhHHHHHHHHHHHHhcceeEEE Q lcl|NC_015263. 79 VQSQQYQRLLNFYAN-MPLYAYSVVPFKDISTANENKLKKELATVTEFLS-R----LNPKYNFSKIVKLAMTVDIFYGYV 152 (513) Q Consensus 79 ~~sg~~~rlidy~~~-mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~-k----~n~k~~~~~i~~~~l~~g~~~gy~ 152 (513) ..++.+.+.|+.+++ +..+.-.++ .. .+... +.+.+.|. + +.-..+...++..++..|..|.++ T Consensus 46 ~~~~~V~~ci~~ia~~ia~lp~~~~----~~---~~~~~---~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i 115 (409) T protein:vir:96 46 ETNETIFSAITKLSNSMASLPLKMY----ED---YKVVN---TEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLI 115 (409) T ss_pred hhhHHHHHHHHHHHHhhhhCceEEe----ec---ccccc---hhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEE Confidence 455566666666543 222333332 11 11111 22333443 3 335566678888999999999998 Q ss_pred EEcC--cceeeeecCcceeEEEEEECC-eeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCc Q lcl|NC_015263. 153 IDDK--ESVMIQQFPNDICKISSVSGG-VYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKN 229 (513) Q Consensus 153 i~d~--~~~~iq~lp~dyckIsg~~nG-~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~k 229 (513) .-+. ...-+.+||+++|.+.--.++ ...|.+... +.. =+.+++.. T Consensus 116 ~r~~~G~~~~L~~l~~~~v~v~~~~~~~~~~y~~~~~---~g~-----------------------------~~~~~~~e 163 (409) T protein:vir:96 116 ERDIYHQPSKLFLLNPDVVEMLIENQSRELYYSIHAA---TGN-----------------------------KLIVHNMD 163 (409) T ss_pred EECCCCcEEEEEEEcCceeEEEEeCCCcEEEEEEEcC---Cce-----------------------------EEEEcccc Confidence 8654 447888999999999865543 333332210 110 01233333 Q ss_pred eEEEEe--cCccccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHh Q lcl|NC_015263. 230 SICIKI--NESSLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMT 307 (513) Q Consensus 230 t~~ik~--~~~~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~ 307 (513) -+-|+- ..+..+|++|...+ .+.++++..-+-.. +.+..-. ..+-+. . ...++.++++.+.+..++. T Consensus 164 vih~r~~~~~~~~~G~s~l~~~-~~~i~~~~~~~~~~----~~~~~~~-~~~i~~---~--~~~l~~e~~~~~~~~~~~~ 232 (409) T protein:vir:96 164 MLHFKHIVASNMVQGISPIDVL-KNTTDFDNAVRTFN----LTEMQKP-DSFMLK---Y--GSNVSTEKRQQVLEDFKQY 232 (409) T ss_pred EEEeCCCCCCCccccccHHHHH-HHHHHHHHHHHHHH----HHhcCCC-ceeEEe---c--CCCCCHHHHHHHHHHHHHH Confidence 334442 23445677776554 45555443322111 1111111 111110 1 1236666776666666555 Q ss_pred ccc--cceEEEecccccccccccccccchh---hhhhHHhhhhhhhhhhhhccCCCc-chHHHHHHHH-HHHHHHHHHHH Q lcl|NC_015263. 308 VPD--NVGVVTSPMEIDTVSFDKDSSTDDS---VEKATKNFWDNAGVSQILFSSDNK-TSQGIAMSIA-TDEQFIFGVIN 380 (513) Q Consensus 308 Lp~--gv~~v~sP~~~d~i~ld~~~~~~dt---v~~~~~~i~~~~GiS~~Lfn~d~~-s~~~~~~SI~-~d~~~~~~~~~ 380 (513) .-+ ++..+-..+++..+.+. ..+.+. .+-..++|..+.||...++|+... +.+.+...-+ --..-+.-+++ T Consensus 233 ~~n~g~~~vl~~g~~~~~l~~~--~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~s~~e~~~~~f~~~~l~P~~~ 310 (409) T protein:vir:96 233 YEENGGILFQEPGVEIEPLPKK--YVSEDIVASENLTRERVANVFQLPSIFLNARSNTNFAKNEELNRFYLQHTLLPIVK 310 (409) T ss_pred hhcCCCeeecCCCceEEEcCCC--hhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHH Confidence 522 34444455555555543 222232 334557899999999999975432 2222222222 22222345889 Q ss_pred HHHHHHHHHHhh--cc-cceEEEEEecCCCCccHHHHHHHHHHHHhcCC-cHHHHHHHHhCCCHHHHHHHHHHHHHhhCc Q lcl|NC_015263. 381 QLERWLNRYLLL--NG-MSKYFKATMLEVTHFSKKEAHDRYITDAQYGF-PVKVYLASLMGIDPVAFTGLLKVENEMLDL 456 (513) Q Consensus 381 ~iE~~~N~~i~~--~~-~~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~-~~~~~laa~~G~~p~~~~~~~~~E~e~L~l 456 (513) +||..+|+.|-. +. .+..|+|..-+...-+.++.++.+.++.+-|. .+=...+ .+|+.|.+ +- T Consensus 311 ~ie~~l~~~Ll~~~~~~~g~~i~fd~~~ll~~d~~~~~e~~~~~~~~G~~T~NE~R~-~~g~~pi~------------gg 377 (409) T protein:vir:96 311 QYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIRE-WEDLPPVE------------GG 377 (409) T ss_pred HHHHHHHhhcCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHH-HhCCCCCC------------Cc Confidence 999999997743 22 23457776666666788899999888888774 3333333 45666642 22 Q ss_pred ccccCcccccccccccccccCCccccCCCCc-CCCCcc Q lcl|NC_015263. 457 PEIMTPLSSSFNTSGSDIAENAIKEKGKENG-RPTNET 493 (513) Q Consensus 457 ~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~g-rPt~et 493 (513) |..+.|... ..-....... +...|| ..++|+ T Consensus 378 D~~~~~~n~---~~~~~~~~~~---~~~~gG~~n~~e~ 409 (409) T protein:vir:96 378 DKPLISGDL---YPIDTPLELR---KSLKGGDKNVNES 409 (409) T ss_pred ceeeecccc---cccccchhhc---ccccCCCCCcCCC Confidence 344443321 1110000000 001111 111111 No 44 >protein:vir:9702 Length: 406 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795464;genbank:gi:28876227;genbank:GeneID:1257772 Probab=98.20 E-value=2.6e-06 Score=51.16 Aligned_cols=384 Identities=10% Similarity=0.029 Sum_probs=166.7 Q ss_pred CCCccchheeeeehhhhhhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhcc--ChhHHHHHHHHHHHHH Q lcl|NC_015263. 1 MVKNKKKRLSMIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYR--NEGNQKTLRKVSEDLA 78 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~--P~~n~~~ir~~s~~lY 78 (513) |- -+ ++.-.+.++.++ ....++..-. +.-...+++ T Consensus 1 m~----------------~f------~~~~~~~~~~~~--------------~~~~~~~~~~~~~~~~~~Al~------- 37 (406) T protein:vir:97 1 MS----------------FF------QPLGTSKVSYDD--------------YISSVLAGDVSQKYLGVSALK------- 37 (406) T ss_pred Cc----------------cc------cccCCCCCCcch--------------HHHHHhcCCCCcccccchhhc------- Confidence 10 00 000000011111 1111111000 000111111 Q ss_pred hhcchHHHHHHHH----hhcccccceEeeccchhhhhhcchhHHHHHHHHHHhh-----cChhHHHHHHHHHHHHhccee Q lcl|NC_015263. 79 VQSQQYQRLLNFY----ANMPLYAYSVVPFKDISTANENKLKKELATVTEFLSR-----LNPKYNFSKIVKLAMTVDIFY 149 (513) Q Consensus 79 ~~sg~~~rlidy~----~~mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k-----~n~k~~~~~i~~~~l~~g~~~ 149 (513) .+.+.+.|+.+ ++||. .+. .+ .+...++ +.+...|.. +.-..+...++..++..|..| T Consensus 38 --~~~V~~~i~~Ia~~iA~lp~---~~~---~~----~g~~~~~-~~~~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gnay 104 (406) T protein:vir:97 38 --NSDILTATSIIAGDIARFPL---VKK---DV----NGDIIHD-EDINYLLNVKSTSNASARTWKFAMAVNAILTGNSF 104 (406) T ss_pred --cHHHHHHHHHHHHhhhhCee---EEE---ec----Ccccccc-chHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeE Confidence 22333444444 44442 221 11 1111122 234445542 335567777888999999999 Q ss_pred EEEEEc---CcceeeeecCcceeEEEEEECCeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeec Q lcl|NC_015263. 150 GYVIDD---KESVMIQQFPNDICKISSVSGGVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQ 226 (513) Q Consensus 150 gy~i~d---~~~~~iq~lp~dyckIsg~~nG~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~ 226 (513) .++.-+ +...-+.++|++.|.+.-..+|...|.|.... .. .=+.++ T Consensus 105 ~~i~r~~~~g~~~~L~~i~p~~v~v~~~~~~~~~y~~~~~~--~~-----------------------------~~~~~~ 153 (406) T protein:vir:97 105 SRILRDPKTNQALQFQFYRPSETTVEETDNHEIVYTFTDML--TA-----------------------------KQVKCF 153 (406) T ss_pred EEEEecCCCCeEEEEEEECCCeeEEEEcCCceEEEEEEecC--Cc-----------------------------eEEEEc Confidence 998754 34568999999999998777787777654221 00 012233 Q ss_pred CCceEEEEec-CccccchhhHHHHHHhHHHH-HHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHH Q lcl|NC_015263. 227 DKNSICIKIN-ESSLTPVPPFAGTFDSIYDI-HSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEAL 304 (513) Q Consensus 227 ~~kt~~ik~~-~~~~~~ip~f~~v~~d~~di-~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~i 304 (513) ...-+-|+.. .+...|+||...+.. .+++ ...++.... -..| -.-.+-|-+ . .-.++.++.+++.+.+ T Consensus 154 ~~evih~r~~~~dg~~G~spi~~~~~-~i~~~~a~~~~~~~--~f~n-g~~~~~i~~----~--~~~l~~e~~~~~~~~~ 223 (406) T protein:vir:97 154 AHDVIHWKFFSHDTILGRSPLLSLGD-EIDLQTGGINTLIK--FFKD-GFSSGILTM----K--GAQLSGDARQRARQEF 223 (406) T ss_pred cccEEEecCCCCCCcccccHHHHHHH-HHHHHHHHHHHHHH--HHhc-cCCCceEEe----c--CCCCCHHHHHHHHHHH Confidence 3334444432 233458888765543 3433 333332211 1122 111111111 1 1136677777777777 Q ss_pred HHhccc----cceEEEeccccccccccccc-ccchhhhhhHHhhhhhhhhhhhhccCCCcchHHHHHHHHHHHHHHHHHH Q lcl|NC_015263. 305 SMTVPD----NVGVVTSPMEIDTVSFDKDS-STDDSVEKATKNFWDNAGVSQILFSSDNKTSQGIAMSIATDEQFIFGVI 379 (513) Q Consensus 305 k~~Lp~----gv~~v~sP~~~d~i~ld~~~-~~~dtv~~~~~~i~~~~GiS~~Lfn~d~~s~~~~~~SI~~d~~~~~~~~ 379 (513) ++..-. ++..+-..+++..+.+.... .--++..-..++|..+.||...++|+....++.......--..-+.-++ T Consensus 224 ~~~~~g~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~~~~~~~~e~~~~~f~~~~l~P~~ 303 (406) T protein:vir:97 224 EKMREGSVGGSPLVFDSTMEYTPLEIDTNVLQLITSNNFSTAQIAKALRVPSYKLGVNSPNQSVAQLMEDYVTNDLPFYF 303 (406) T ss_pred HHHhcccccCceeecCCCceEEEccCCHHHHHHHHHHHhhHHHHHHHhCCCHHHcCCCCCcchHHHHHHHHHHHHHHHHH Confidence 666521 22233344454455443211 1112233345779999999999998655444444444333333344689 Q ss_pred HHHHHHHHHHHhhcccceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHH-hhCccc Q lcl|NC_015263. 380 NQLERWLNRYLLLNGMSKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENE-MLDLPE 458 (513) Q Consensus 380 ~~iE~~~N~~i~~~~~~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e-~L~l~~ 458 (513) ++||..+|+.|-...-...+.++|. +.. ..+..++.+.++.+-| .++|.|+-..+-++-- .-+-|. T Consensus 304 ~~ie~~l~~kll~~~~~~~~~i~fd-~~~-~~~~~~~~~~~~~~~g-----------~~T~NE~R~~~g~~p~~~~~gD~ 370 (406) T protein:vir:97 304 DAITSELGLKTLNDKDRRLYHIEFD-TRS-VTGRNVDEIVKLVNNQ-----------ILTPNQGLVELGKQKSTDPNMDR 370 (406) T ss_pred HHHHHHHhhhhcChhhccceeEEEe-cCc-cchhhHHHHHHHHhCC-----------CcCHHHHHHHhCCCCCCCCCCCe Confidence 9999999997743211112334432 111 2334445544443322 3777777665544420 011334 Q ss_pred ccCcccccccccccccccCCccccCCCCcCCCCcccccccCCCCCCCCCCc Q lcl|NC_015263. 459 IMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETTGNKDSDETQRAKDKP 509 (513) Q Consensus 459 ~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~~n~~~~~~~~~~d~~ 509 (513) .+.|+--.-. .. .++ |.+....+..+|+++...|.- T Consensus 371 ~~~~~n~~~~---~~---------~~~---~~~~~~~~~~gg~~~~~~~~~ 406 (406) T protein:vir:97 371 YQSSLNYVFL---DK---------KEE---YQDKVGIKGKGGEVNAEEDKS 406 (406) T ss_pred EeeccCccch---hc---------ccc---cccccccccCCCCCCCCCCCC Confidence 4443221111 00 000 111000000111111111100 No 45 >protein:vir:107742 Length: 537 # NCBI annotation: gp28 # Family: family:all:297 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024875;genbank:gi:48697517;genbank:GeneID:2948359 Probab=98.20 E-value=2.8e-06 Score=51.05 Aligned_cols=467 Identities=11% Similarity=0.047 Sum_probs=196.1 Q ss_pred CCCccchhe---eeeehhhhhhHHHHHHHHH-------------------HHHHhhccCcccccccccccchHHHHHHHh Q lcl|NC_015263. 1 MVKNKKKRL---SMIDVESISSYSNKRNNRI-------------------SILRDDNRTPVFGAPVGSLTSSQSKVRKIV 58 (513) Q Consensus 1 ~~~~~~~~~---~~~~~~~~~~~~~~~~~~~-------------------~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i 58 (513) .++..+-|. +--|++....-+--...+- .+..| .++ .....|+.+..... -..+ T Consensus 17 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~d--~~~-~~~~~~~~~~~~~~-~~~~ 92 (537) T protein:vir:10 17 IAERIEPRVGIFGAGDDEKPFTRAQLVHQTMMAIRDHAIAMMPKVDGSHPDMAMD--GLD-VEGGTFSAYANPNL-SEGL 92 (537) T ss_pred cccccccccCCCcccchhhHHHHHHhhhhccCCCCCccCcccccccccccchhcc--ccc-cchhhhhhhccccc-cchh Confidence 111111111 1112222111110000000 00111 011 11122332222111 2223 Q ss_pred hhcc-ChhHHHHHHHHHHHHHhhcchHHHHHHHHhhcccccceEeeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHH Q lcl|NC_015263. 59 KEYR-NEGNQKTLRKVSEDLAVQSQQYQRLLNFYANMPLYAYSVVPFKDISTANENKLKKELATVTEFLSRLNPKYNFSK 137 (513) Q Consensus 59 ~~~~-P~~n~~~ir~~s~~lY~~sg~~~rlidy~~~mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~ 137 (513) -.|+ +....- ..++ -+|..++.+|++|+-+....|=..+..-..+ ....+.+..+.+ -..+++++++..+.+ T Consensus 93 ~~~~~~~~~~~--~~l~-a~Y~~~~l~r~iVd~~A~d~~r~~~~i~~~~-~~~~~~~~~~~l---~~~~~~l~~~~~l~~ 165 (537) T protein:vir:10 93 VLWYAQQAFIG--HQMC-ALIATHWLVNKACSQMPRDAMRKGYKIISDD-GNELDPKDAKFI---DRYDRAFNIKKHAIQ 165 (537) T ss_pred hhhccccCCcc--HHHH-HHHHhCchhhhhhhhhhHHhhcCCceeecCC-cccccHHHHHHH---HHHHHHhhHHHHHHH Confidence 3332 222221 2233 3689999999999999876665544331111 111122222333 334666676666665 Q ss_pred HHHHHHHhcceeEEEE-EcCcce-eeeecCcceeEEEEEECCe--eEEEEEeeeccCc--chhcc--ccHHHHHHHHHHh Q lcl|NC_015263. 138 IVKLAMTVDIFYGYVI-DDKESV-MIQQFPNDICKISSVSGGV--YNYVIDLDALVSA--DIVDY--YPKEIQEAVNKYT 209 (513) Q Consensus 138 i~~~~l~~g~~~gy~i-~d~~~~-~iq~lp~dyckIsg~~nG~--y~~~fD~syFd~~--~~L~~--~p~Ei~~~y~~Y~ 209 (513) .++-.-..|.-+.++. +..++. .-+||.++- +..|. +..+||-.+-... .++.. ..|.+-+- ..|. T Consensus 166 a~~~~rlyG~~~i~i~v~~~D~~~~~~Pl~~~~-----i~kg~~k~l~vidp~~~~~~~~~~~~~dp~sp~fg~P-~~y~ 239 (537) T protein:vir:10 166 FVRKGRIFGIRIALFKVDSPDPYYYEKPFNIDG-----VMPGAYKGIVQIDPYWCAPLLDAQASSNPVSMHFYEP-TYWL 239 (537) T ss_pred HHHhcccccceEEEEeecCcCCccccccccccc-----ccccceeEEEEechhhcccccchhhhccCCccccCCc-eeee Confidence 5555555555444544 334443 457776653 33333 3344443321100 00000 01111000 0110 Q ss_pred hhhhccCcccccCeeecCCceEEEEecCcc---------ccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeec Q lcl|NC_015263. 210 TMKKGNNKSASNWYEIQDKNSICIKINESS---------LTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLE 280 (513) Q Consensus 210 ~~k~~~~~~~~~W~~L~~~kt~~ik~~~~~---------~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip 280 (513) .++ ..+.+.+-+.|. +... -||+|.+-.++..+.+.+.-... +-.-+....+.+-++. T Consensus 240 ----v~g------~~iH~SRli~f~-g~~~p~~~~~~~~~~G~Svlq~~~~~l~~~~~t~~~--~~~l~~~~~~~v~k~~ 306 (537) T protein:vir:10 240 ----ING------KKYHRSHLAIYI-NDEVVDFLKPSYIYGGVPLPQQIMERVYAAERTANE--GPMLAMTKRQTVLKVD 306 (537) T ss_pred ----ecC------eEecceeEEEec-CCCCchhhhcccCcccccHHHHHHHHHHHHHHHHHH--HHHHHHhcCCceeeec Confidence 000 123333333222 2211 24666666666655554433332 1122222223333333 Q ss_pred cccCCCCCccccCHHHHHHHHHHHHHhccccceEEEeccc---ccccccccccccchhhhhhHHhhhhhhhhh-hhhccC Q lcl|NC_015263. 281 TRSSNDNNDFTLDMPMMNYFHEALSMTVPDNVGVVTSPME---IDTVSFDKDSSTDDSVEKATKNFWDNAGVS-QILFSS 356 (513) Q Consensus 281 ~~~~n~~~~~~vd~~~~~~~~~~ik~~Lp~gv~~v~sP~~---~d~i~ld~~~~~~dtv~~~~~~i~~~~GiS-~~Lfn~ 356 (513) .- . -.-+.+...+--+.+.+.- +..+.++..-+ ++.++.+-+ ...+.+....+.|-.++||- ..|||- T Consensus 307 ~~--~----~l~~~~~~~~r~~~~~~~r-~n~g~~~id~e~e~~e~~~~~ls-gl~~~l~~~~~~iAa~~~IP~t~L~G~ 378 (537) T protein:vir:10 307 AA--Q----VLANKQQFDETMSWWTATR-DNYQVRVVDKDNEDVVQIDTTLN-DLDKVIMNQYQLVCAIARTPAPKMLGT 378 (537) T ss_pred hH--H----hhcCHHHHHHHHHHHHhhc-CCcceeEecCCCceeEEEeccCC-CHHHHHHHHHHHHHhhhCCCceeeccC Confidence 10 0 0111222222222222222 33333443333 233332222 33467888888898889996 557763 Q ss_pred CC-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----hccc-ceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHH Q lcl|NC_015263. 357 DN-KTSQGIAMSIATDEQFIFGVINQLERWLNRYLL----LNGM-SKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKV 430 (513) Q Consensus 357 d~-~s~~~~~~SI~~d~~~~~~~~~~iE~~~N~~i~----~~~~-~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~ 430 (513) .. +-+++-..-+..-...|-+.++++...+++++. .... -..|.|.|.++--.+.||+++..++.++-. .. T Consensus 379 sp~GlnatGe~D~~~yyd~I~~~Qe~l~p~l~~l~~ll~~~~~~~~~~~~i~f~pL~~~s~kEkAei~~~~a~a~---~~ 455 (537) T protein:vir:10 379 VPTGFNSTGDYEEASYHEECESTQDDMRPLIDRHHQLVCRSHLRKRIRVKVEFPPMDAPKESERADTFLKKMQAA---KL 455 (537) T ss_pred CccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcceEEEeCCCCCCCHHHHHHHHHHHHHHH---HH Confidence 32 111222223333444444555555556665442 1211 225999999999999999999877765432 12 Q ss_pred HHHHHhC-CCHHHHHHHHHHHHHhhCcccccCcccccccccccc--cccCCccccCCCCcCCCCcccccccCCCCCCCCC Q lcl|NC_015263. 431 YLASLMG-IDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSD--IAENAIKEKGKENGRPTNETTGNKDSDETQRAKD 507 (513) Q Consensus 431 ~laa~~G-~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~--~~~~~~~~~~~~~grPt~et~~n~~~~~~~~~~d 507 (513) ++. .| ++|.|+-..+..+- ..+++++. |-.+. .+.... .++..+....++.|.|+...-.+....+...+.+ T Consensus 456 ~~~--~G~i~~~Evr~~L~~~~-~~g~~~l~-~~~~~-ed~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (537) T protein:vir:10 456 AFE--MGAVDGVDVNEYLRMDP-TLGFTSIT-PAMRP-TDAEDIDVDDEGKPVRIIEDQPAPSEMFGATSSGESANDPRD 530 (537) T ss_pred HHH--cCCCCHHHHHHHHhccC-cccccccc-CCCCh-hhhhcccCCccCCcCCCCCCCCCccccCCCCccccccCCCcc Confidence 222 35 89999999888764 33555443 22111 111100 0111111222334444433334445556666666 Q ss_pred CccCCC Q lcl|NC_015263. 508 KPANTQ 513 (513) Q Consensus 508 ~~~~~~ 513 (513) +.+.++ T Consensus 531 ~~a~~~ 536 (537) T protein:vir:10 531 SGAAFE 536 (537) T ss_pred CccccC Confidence 666666 No 46 >protein:vir:96738 Length: 505 # NCBI annotation: putative phage-related protein # Family: family:all:47 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039817;genbank:gi:126010916;genbank:GeneID:5076248 Probab=98.16 E-value=3.3e-06 Score=50.61 Aligned_cols=447 Identities=9% Similarity=0.037 Sum_probs=224.1 Q ss_pred CCCccchheeeeehh-hhhhHHHHHHHHHHHHH-hhccCccccccccc-ccchHHHHHHHhhhccChhHHHHHHHHHHHH Q lcl|NC_015263. 1 MVKNKKKRLSMIDVE-SISSYSNKRNNRISILR-DDNRTPVFGAPVGS-LTSSQSKVRKIVKEYRNEGNQKTLRKVSEDL 77 (513) Q Consensus 1 ~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~i~~-~~~~~~~~~s~~~s-~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~l 77 (513) |-+ -.+|.+.+|.- +.......+..+...-. +-. .+......+. ..+..+.-.+ + ..+-+.||.-|++| T Consensus 1 ~~r-~~~~~~~~dr~i~~~~~~~~~~~~~~~~~y~aa-~~~r~~~~w~~~~~~~s~~~~-i-----~~~~~~lr~RaRdL 72 (505) T protein:vir:96 1 MKR-AEKKPSLAQRMVNWAWYRYVEPQKNAARAFEAA-RRDRLGKAWLRRASRLSADEE-I-----YADLASLVQRAREQ 72 (505) T ss_pred CCC-CccccchhhcccchhhhhhHHHHHHhhhhcccc-cCCCccccccCCCCCCChHHH-H-----HHHHHHHHHHHHHH Confidence 544 44555666643 11222233222211000 000 0000001110 0011111111 1 23567899999999 Q ss_pred HhhcchHHHHHHHHhhcccccceEeeccch---hhhhhcchhHHHHHH-HHHHhh--cC------hhHHHHHHHHHHHHh Q lcl|NC_015263. 78 AVQSQQYQRLLNFYANMPLYAYSVVPFKDI---STANENKLKKELATV-TEFLSR--LN------PKYNFSKIVKLAMTV 145 (513) Q Consensus 78 Y~~sg~~~rlidy~~~mpt~dY~I~P~~~~---~~~~~~~~~~~y~~v-~~~L~k--~n------~k~~~~~i~~~~l~~ 145 (513) |.++++.++.|+.+.+.-.=.--|.|-... .......+.+..... ..+.++ |. .-....-+++..++. T Consensus 73 ~rNn~~a~~av~~~~~nvVG~~Gi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~~~D~~g~~~f~~lq~l~~r~~~~d 152 (505) T protein:vir:96 73 SINNPYAKRFYQLLKNNVIGPKGMTFQSRVKRRNGKPDDRANTLIEGNWQQWIKKGNCDVTGRYHFVTLLHLWMETLARD 152 (505) T ss_pred HhcChHHHHHHHHHHHHhcCCCcceeeecCCcccccccHHHHHHHHHHHHHhcCCcCcceeccCCHHHHHHHHHHHHhhC Confidence 999999999999777643311112221100 001111122221111 123332 22 333345677888999 Q ss_pred cceeEEEEEcC---cceeeeecCcceeEEE--E-EECCee-EEEEEeeeccCcc-hhcc-----ccHHHHHHHHHHhhhh Q lcl|NC_015263. 146 DIFYGYVIDDK---ESVMIQQFPNDICKIS--S-VSGGVY-NYVIDLDALVSAD-IVDY-----YPKEIQEAVNKYTTMK 212 (513) Q Consensus 146 g~~~gy~i~d~---~~~~iq~lp~dyckIs--g-~~nG~y-~~~fD~syFd~~~-~L~~-----~p~Ei~~~y~~Y~~~k 212 (513) |..|.=.+... .+.-+|-+++|+|.-- + ..+|.+ +-.+ .||... .+.| -|.+.... + T Consensus 153 GE~f~~~~~~~~~~~~~~lqliepd~l~~~~n~~~~~~~~i~~GI---e~d~~Gr~~aY~i~~~hPgd~~~~---~---- 222 (505) T protein:vir:96 153 GEVLVREHRGYPNKWGYALQILECDRLDLNYNADLQNGNRIRMSI---ELDAWERPVAYHLLVNHPGDNSYC---Y---- 222 (505) T ss_pred CceEEEEeecCCCCcceEEEEechhhcCCCCCcccCCcCeEEece---EECCCCceEEEEEeecCCCccccc---c---- Confidence 99988665532 2367899999998533 1 123322 2233 344322 2222 23332111 1 Q ss_pred hccCcccccCeeecCCceEEEEecCccc---cchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCc Q lcl|NC_015263. 213 KGNNKSASNWYEIQDKNSICIKINESSL---TPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNND 289 (513) Q Consensus 213 ~~~~~~~~~W~~L~~~kt~~ik~~~~~~---~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~ 289 (513) ......|..+|-+ -|.+-++.+.+ =|+|.|++++..+-++++|.+-....+.++.. +...|--. ....+. T Consensus 223 ---~~~~~~~~rvpa~-~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~--~a~fi~~~-~~~~~~ 295 (505) T protein:vir:96 223 ---HYAGQTYERVPAD-EIIHTFVPWRPHQNRGIPWTHASMVELHHIGEYRKSEMIAAELGAK--KVGFYEQD-PEAYDQ 295 (505) T ss_pred ---ccccccccccCHh-HhhhhhcccCCccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhh--heeeeecC-CccCCC Confidence 1112346667653 45555555544 39999999999999999999988888888663 33333211 011111 Q ss_pred cccCHHHHHHHHHHHHHhccccceEEEeccccccccccccc----ccchhhhhhHHhhhhhhhhhhhhccCC--CcchHH Q lcl|NC_015263. 290 FTLDMPMMNYFHEALSMTVPDNVGVVTSPMEIDTVSFDKDS----STDDSVEKATKNFWDNAGVSQILFSSD--NKTSQG 363 (513) Q Consensus 290 ~~vd~~~~~~~~~~ik~~Lp~gv~~v~sP~~~d~i~ld~~~----~~~dtv~~~~~~i~~~~GiS~~Lfn~d--~~s~~~ 363 (513) ..-+. ++.....|..|....+-|= +.|+|-... +-.+.+......|-..+||+--++.+| +.|+++ T Consensus 296 ~~~~~------~~~~~~~l~pG~i~~L~pG--e~i~~~~~~~p~~~~~~f~~~~lr~iaaglgi~ye~lt~D~s~~nYSS 367 (505) T protein:vir:96 296 PPEDD------QGEIVEEVEAGTYQLLPYG--IRFKEHKIDHPHTNFGAFVKSSLRGVAAGMGPAYNRLAHDLEGVNFSS 367 (505) T ss_pred ccccc------cCccccccCCceeeecCCC--CeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccccHHH Confidence 11110 0111122433443333332 356553322 223557777788999999996666666 468888 Q ss_pred HHHHHHHHHHHH--------HHHHHH-HHHHHHHHHhhcccc-------eEEEEEe--cCCCCccHHHHHHHHHHHHhcC Q lcl|NC_015263. 364 IAMSIATDEQFI--------FGVINQ-LERWLNRYLLLNGMS-------KYFKATM--LEVTHFSKKEAHDRYITDAQYG 425 (513) Q Consensus 364 ~~~SI~~d~~~~--------~~~~~~-iE~~~N~~i~~~~~~-------~~f~~~~--l~~T~fn~ke~~~~~~~~~~~G 425 (513) +..+....-..+ -.|.+- .+.|+-..+..+.+. ...+..+ .+-...+-.+.+...+....-| T Consensus 368 ~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G 447 (505) T protein:vir:96 368 LRSGELDERDLYKLLQFFVVTELLERVAGNLISMSLLTQALPLNMVDIDRLSQYAFQPRGWDWVDPAKDSKAHSESIKNR 447 (505) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCccchhhceeeeccCCccccChHHHHHHHHHHHHcC Confidence 877765222221 133333 334666655544331 1123333 3344667777788889999999 Q ss_pred CcHHHHHHHHhCCCHHHHHHHHHHHHH---hhCcccccCcccccccccccccccCCccccCCCCcCCCCc Q lcl|NC_015263. 426 FPVKVYLASLMGIDPVAFTGLLKVENE---MLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNE 492 (513) Q Consensus 426 ~~~~~~laa~~G~~p~~~~~~~~~E~e---~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~e 492 (513) +......++..|.+|++++.+...|++ .+||.. ...+.+..+ .. ...+...|.++ T Consensus 448 ~~t~~~~~a~~G~D~~~v~~q~a~e~~~~~~~Gl~~----~~~~~~~~~------~~--~~~~~~~~~d~ 505 (505) T protein:vir:96 448 TRSRSSIIRAAGDDPEDVFDEIAWEEQLMRDKGVNP----TPPEQESKD------AT--TDEEDDSASDD 505 (505) T ss_pred CCCHHHHHHHcCCCHHHHHHHHHHHHHHHHHcCCCC----CCCCCCCCC------CC--CCCCCCCCCCC Confidence 999999999999999999999999986 445421 111111000 00 00000001111 No 47 >protein:vir:100249 Length: 431 # NCBI annotation: gp78 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355414;genbank:gi:77864704;genbank:GeneID:3725971 Probab=98.16 E-value=3.4e-06 Score=50.57 Aligned_cols=389 Identities=10% Similarity=0.062 Sum_probs=187.2 Q ss_pred HHHHHhhccCcc--c------------cccccccc------chHHHHHHHhhhccChhHHHHHHHHHHHHHhhcchHHHH Q lcl|NC_015263. 28 ISILRDDNRTPV--F------------GAPVGSLT------SSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQQYQRL 87 (513) Q Consensus 28 ~~i~~~~~~~~~--~------------~s~~~s~~------~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~~~rl 87 (513) -+++.-+...+. . .++..+.. ..+-....++.. ....- ..++..-.-.++.+.+. T Consensus 1 Mgl~d~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~--~~~~g---~~v~~~~al~~~~V~~c 75 (431) T protein:vir:10 1 MGLFDFIRREKQPEAQARPHVEPSFQASTPTTSIPGETFEGLDDPRLKEYIRR--GELNG---GTGRETRALRNMAVLRC 75 (431) T ss_pred CcchhhhhcCcccccccccccccccccccccccccccccccccchHHHHhhcc--CccCc---ceechhhhhccHHHHHH Confidence 333333332110 0 00000000 000001111111 00000 00011111235667777 Q ss_pred HHHHhh-cccccceEeeccchhhhhhcchhHHHHHHHHHHhh-----cChhHHHHHHHHHHHHhcceeEEEEEcC-ccee Q lcl|NC_015263. 88 LNFYAN-MPLYAYSVVPFKDISTANENKLKKELATVTEFLSR-----LNPKYNFSKIVKLAMTVDIFYGYVIDDK-ESVM 160 (513) Q Consensus 88 idy~~~-mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k-----~n~k~~~~~i~~~~l~~g~~~gy~i~d~-~~~~ 160 (513) |+.++. +..+...|+- .. ..+.... -+.+...|.. +.-..+...++..++..|..|.+++-+. ..+- T Consensus 76 i~~Ia~~iA~lp~~v~~----~~-~~~~~~~-~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~g~~~~ 149 (431) T protein:vir:10 76 VTLISGTIGMLPMNLIS----SD-DSKQVLT-DDPAHRLLKYKPNDWQTPMEFKSLMQLRALLDGESMARIVWSGNRPIR 149 (431) T ss_pred HHHHHHhhccCceEEEE----ec-Cceeeec-cchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCceEE Confidence 777753 2244444431 11 1111111 1234444542 2244566777888999999999988764 4568 Q ss_pred eeecCcceeEEEEEECCeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEEe-cCcc Q lcl|NC_015263. 161 IQQFPNDICKISSVSGGVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKI-NESS 239 (513) Q Consensus 161 iq~lp~dyckIsg~~nG~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~-~~~~ 239 (513) +.++|+.+|.+.-..+|...|.+-.- ....++++...-+.|+. ..+. T Consensus 150 L~pl~~~~v~~~~~~~~~~~y~~~~~--------------------------------~g~~~~~~~~dViHir~~~~dg 197 (431) T protein:vir:10 150 LIPMDRGSAKGRLTSTWQIVYDYTTP--------------------------------TGDKIELPAREVFHLRDLSIDG 197 (431) T ss_pred EEEEcCceeEEEEcCCCeEEEEEEeC--------------------------------CceEEEEchhhEEEecCcCCCC Confidence 89999999999766677665543210 01223455544455552 3345 Q ss_pred ccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhcc--cc---ceE Q lcl|NC_015263. 240 LTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTVP--DN---VGV 314 (513) Q Consensus 240 ~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~Lp--~g---v~~ 314 (513) ++|++|...+-..+--....++... .-..|-...-..|-+ ++ .++.++++++.+.++++.- ++ +.. T Consensus 198 ~~G~spi~~~~~~i~~~~~~~~~~~--~~f~ng~~p~gil~~-----~~--~ls~e~~~~~~~~~~~~~~g~~n~g~~~v 268 (431) T protein:vir:10 198 VSGVSRVKLSGNALELAEQAERAAS--RTFRTGVMAGGAIEV-----PK--ELSDNAYGRMKASVQENHTGSENAGSWML 268 (431) T ss_pred cccccHHHHHHHHHHHHHHHHHHHH--HHHhccCCccEEEec-----CC--CCCHHHHHHHHHHHHHHhcCccccCCcee Confidence 6788877655543322222222211 111221111111111 11 3677888888888877762 22 223 Q ss_pred EEecccccccccccccccchh---hhhhHHhhhhhhhhhhhhccCCCcc-hHHH-HHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015263. 315 VTSPMEIDTVSFDKDSSTDDS---VEKATKNFWDNAGVSQILFSSDNKT-SQGI-AMSIATDEQFIFGVINQLERWLNRY 389 (513) Q Consensus 315 v~sP~~~d~i~ld~~~~~~dt---v~~~~~~i~~~~GiS~~Lfn~d~~s-~~~~-~~SI~~d~~~~~~~~~~iE~~~N~~ 389 (513) +-..+++..+.+. ..+.+. .+-..++|..+.||...++|....+ ++.+ ...+.-...-+.-++.+||..+|+. T Consensus 269 l~~g~~~~~l~~~--~~d~q~le~r~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~eq~~~~f~~~tL~P~~~~ie~~ln~~ 346 (431) T protein:vir:10 269 LEEGATAKQFSNT--AASAQQIENRNHQIEEVARMYGVPRPLLMMDDTSWGSGIEQLAIFFIQYGLSHWFVSWEQAAARA 346 (431) T ss_pred cCCCceEEEccCC--hhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 3333454444442 222233 3344578999999999999865433 2222 2222223333446899999999998 Q ss_pred Hhh--cccceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhC-CCHHHHHHHHHHHH-HhhCcccccCcccc Q lcl|NC_015263. 390 LLL--NGMSKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMG-IDPVAFTGLLKVEN-EMLDLPEIMTPLSS 465 (513) Q Consensus 390 i~~--~~~~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G-~~p~~~~~~~~~E~-e~L~l~~~~~Pl~T 465 (513) |-. ...+..|+|.+-.....+.++..+.+.++..-|.- .| ++|.|+-.++-++- +--+-|..+.|+-+ T Consensus 347 Ll~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~--------~g~lT~NE~R~~~gl~p~~~~~gD~~~~p~n~ 418 (431) T protein:vir:10 347 FLPEKMLGQRQFKFNEGALLRGTLNDQAAFFSKALGAGGQ--------SPWMKQNEVREMLDLPRADDPVADQLRNPMTQ 418 (431) T ss_pred ccChhhcCCceEEEechhhhccCHHHHHHHHHHHHhcccc--------cCccCHHHHHHHhCCCCCCCccccceeccccc Confidence 842 22344577766666666888888888887766642 12 46666655544431 11123445555432 Q ss_pred cccccccccccCCccccCCCCcCCCCcc Q lcl|NC_015263. 466 SFNTSGSDIAENAIKEKGKENGRPTNET 493 (513) Q Consensus 466 S~T~Sg~~~~~~~~~~~~~~~grPt~et 493 (513) .+. .++..|...| T Consensus 419 ----~~~-----------~~~~~~p~~~ 431 (431) T protein:vir:10 419 ----KQK-----------GSGDEPPATT 431 (431) T ss_pred ----ccC-----------CCCCCCCCCC Confidence 121 1112221111 No 48 >protein:vir:94049 Length: 532 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453629;genbank:gi:84662665;genbank:GeneID:5142559 Probab=98.14 E-value=9.4e-07 Score=53.62 Aligned_cols=458 Identities=12% Similarity=0.074 Sum_probs=175.6 Q ss_pred CCCccchh---eeeeehhhh--hhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhcc-ChhHHHHHHHHH Q lcl|NC_015263. 1 MVKNKKKR---LSMIDVESI--SSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYR-NEGNQKTLRKVS 74 (513) Q Consensus 1 ~~~~~~~~---~~~~~~~~~--~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~-P~~n~~~ir~~s 74 (513) -+.+||++ +.|+--..+ ..|.- -.+|-..+ .+..... ++..++. .+--.|. |... ++--. T Consensus 22 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~-~~a~~~g-~~~~~~~--~~~~~~~~~~~~---~~~~l 87 (532) T protein:vir:94 22 RVDAKRATHTSLGLATAHEIDPTAYSP-------YERNAAQN-AMAMDYG-LQTGRNG--RNALSFVEATSW---PGFPT 87 (532) T ss_pred hhhhhhhhhhhhhhhhhhhhccccccc-------cccccccc-ccccccc-cCccccc--cccccccccccc---chHHH Confidence 22233222 112211100 00000 00010000 0000001 1111111 0111121 2111 22223 Q ss_pred HHHHhhcchHHHHHHHHhhcccccceEeeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcceeEEEEE Q lcl|NC_015263. 75 EDLAVQSQQYQRLLNFYANMPLYAYSVVPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDIFYGYVID 154 (513) Q Consensus 75 ~~lY~~sg~~~rlidy~~~mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~~~gy~i~ 154 (513) ..+|..++.++++|+-.....|-...-.--..... .+. .....+-..+++++++..+.+.++.....|..+.+... T Consensus 88 ~a~Y~~~~l~r~~Vd~~aed~~r~~~~i~~~~~~~-~~~---~~~~~i~~~~~~l~v~~~l~~a~~~~rlyG~a~i~i~v 163 (532) T protein:vir:94 88 LALLAQLPEYRTMHETPADECVRAWGKITCSSKDE-LAA---DKATRITQKLEQYNVRTLVRTVVIHDQAYGGAHVFPHL 163 (532) T ss_pred HHHHHcCchhhhhhccchHHHhhCCceEeeCCccc-cch---HHHHHHHHHHHhhhHHHHHHHHHHhhhcccceEEEEEe Confidence 35889999999999999998887776652111111 111 22233344577777777777666666666666666654 Q ss_pred cCcce---eeeecCcceeEEEEEECCeeE--EEEEeeeccCcc--hhccccHHHHHHHHHHhhhhhccCcccccCeeecC Q lcl|NC_015263. 155 DKESV---MIQQFPNDICKISSVSGGVYN--YVIDLDALVSAD--IVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQD 227 (513) Q Consensus 155 d~~~~---~iq~lp~dyckIsg~~nG~y~--~~fD~syFd~~~--~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~ 227 (513) +.++. .=.+++. .+-++..|.++ .+||-.+..-.. ..+-+.|-+-+ -+.+. .. .=..+.+ T Consensus 164 ~~~~~~~~~~~p~~l---~~~~I~~g~~~~l~vld~~~v~p~~~~~~dp~sp~fg~----P~~y~-v~-----~g~~iH~ 230 (532) T protein:vir:94 164 KMDGDSVPADAPLLL---SPSFVQRGCLIGFATIEPMWLSPNAYNATDPTLPSFYK----PDSWI-AT-----SGKKIHS 230 (532) T ss_pred ccCCccccccccccc---cccccccceeeEEEeechheecccccccccccccccCC----ceeEE-Ec-----cCeeecc Confidence 43332 1122222 22233444442 233332211000 00001111100 00000 00 0013444 Q ss_pred CceEEEEecCc--------cccchhhHHHHHHhHHHHHHHHHHHhhHhhhhh-ceeeeeeeccccCCCCCccccCHHHHH Q lcl|NC_015263. 228 KNSICIKINES--------SLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQN-YKLLIQKLETRSSNDNNDFTLDMPMMN 298 (513) Q Consensus 228 ~kt~~ik~~~~--------~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n-~~ii~~kip~~~~n~~~~~~vd~~~~~ 298 (513) .+-+.|.-+.. .-||+|.+-.++..+.+.+.....- ..|-. ..+.+-++-+. -.++..... T Consensus 231 SRli~f~g~~~p~~~~~~~~~~G~Svlq~~~~~l~~~~~t~~~~---~~l~~~~~~~v~k~~~a-------~~ls~~~~~ 300 (532) T protein:vir:94 231 SRIHTVVGRPVGDMLKAAYSFRGVSISQLAMPYVDNWLRTRQSV---SDTVKQFSMTNLATDMA-------QLLAPGGAQ 300 (532) T ss_pred ceEEEecCCCchhhhccccccccccHHHHHHHHHHHHHHHHHHH---HHHHHhcCCceeeechH-------HhhcchhHH Confidence 44444322111 1145665555665555554444322 11111 11111122211 111111122 Q ss_pred HHHHHHHH--hccc--cceEEEecc-cccccccccccccchhhhhhHHhhhhhhhhh-hhhccCCCcchHHHHHHHHHHH Q lcl|NC_015263. 299 YFHEALSM--TVPD--NVGVVTSPM-EIDTVSFDKDSSTDDSVEKATKNFWDNAGVS-QILFSSDNKTSQGIAMSIATDE 372 (513) Q Consensus 299 ~~~~~ik~--~Lp~--gv~~v~sP~-~~d~i~ld~~~~~~dtv~~~~~~i~~~~GiS-~~Lfn~d~~s~~~~~~SI~~d~ 372 (513) .+.+.+.. ..-+ |+..+.... +++.++.+-+ .-.+.+....+.|-.++||- ..|||-. .+|++.+=+.|. T Consensus 301 ~~~~r~~~~~~~~~n~g~~~id~~~e~~e~~~~~ls-gl~~~l~~~~~~iAaa~~IP~t~LfG~s---p~GlnstGe~D~ 376 (532) T protein:vir:94 301 SLDARLQLFNLYRDNRNIGALDKGTEEIQQTNTPLS-GLDSLQAQSQEQMAAVSHIPLVKLLGIT---PNGLNASSDGEI 376 (532) T ss_pred HHHHHHHHHHhhcCCccceEEcCCCceeEEEecccC-CHHHHHHHHHHHHHhHhCCCeeeeecCC---cccccccchHHH Confidence 22222221 1112 333333332 3445544433 34578899999999999996 5677632 223332222222 Q ss_pred ----HHHHHHHHH-HHHHHHHHHhhc---c---cceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhC-CCH Q lcl|NC_015263. 373 ----QFIFGVINQ-LERWLNRYLLLN---G---MSKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMG-IDP 440 (513) Q Consensus 373 ----~~~~~~~~~-iE~~~N~~i~~~---~---~~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G-~~p 440 (513) ..|-+.++. +...++++++.. . .-..|.|.|.++--.+.||+++..++.++-- ..++. .| +++ T Consensus 377 ~~yyd~I~s~Qe~~l~p~le~l~~~l~~s~~g~~~~d~~~~f~pL~~~s~kEkAei~~~~a~a~---~~~~~--~Gvi~~ 451 (532) T protein:vir:94 377 RVWYDFIAGYQATNLTPLMEWIIDLIQLSEYGQIDPGLAWEWSPLMELDDKELAEVRQLNASTD---STLME--LGVIDA 451 (532) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCceEEeCCCCCCCHHHHHHHHHHHHHHH---HHHHh--cCCCCH Confidence 222222222 222344433221 1 1225999999999999999999876654321 11122 35 777 Q ss_pred HHHHHHHHHHHHhhCcccccCccccc--------cccccc-c---cccCCccccCCCCcCCCCc-ccccccCCCCCCCCC Q lcl|NC_015263. 441 VAFTGLLKVENEMLDLPEIMTPLSSS--------FNTSGS-D---IAENAIKEKGKENGRPTNE-TTGNKDSDETQRAKD 507 (513) Q Consensus 441 ~~~~~~~~~E~e~L~l~~~~~Pl~TS--------~T~Sg~-~---~~~~~~~~~~~~~grPt~e-t~~n~~~~~~~~~~d 507 (513) .++-..+..+- ..++.+..+.-.+. ..++.. + ++....++.++..++|++. ..+....+++++|-+ T Consensus 452 ~Evr~~l~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~ 530 (532) T protein:vir:94 452 KMVQQRLAADP-TSGYAGALGERDELDDVEEIAKQLMAAALNPPATAPQTPNPQPDSEDDQTDNQPDAQADPAQNDQPVG 530 (532) T ss_pred HHHHHHHhcCC-ccccccccccccccccccchhhhhcccccCCCCCCCCCCCCCCCCCCCCCCCccCCCccccccCCCcC Confidence 77777665542 22333332221100 000000 0 0000001111111111100 112222222222222 Q ss_pred Cc Q lcl|NC_015263. 508 KP 509 (513) Q Consensus 508 ~~ 509 (513) .- T Consensus 531 ~~ 532 (532) T protein:vir:94 531 NR 532 (532) T ss_pred CC Confidence 21 No 49 >protein:vir:104259 Length: 403 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006980;genbank:gi:46401881;genbank:GeneID:2777676 Probab=98.13 E-value=4e-06 Score=50.20 Aligned_cols=380 Identities=17% Similarity=0.169 Sum_probs=171.3 Q ss_pred hhhhhh-HHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhcchHHHHHHHHh Q lcl|NC_015263. 14 VESISS-YSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQQYQRLLNFYA 92 (513) Q Consensus 14 ~~~~~~-~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~~~rlidy~~ 92 (513) .- +.+ -+...+.-+-|.+++- |.....-+.... +.-.|...+.+.+.|++++ T Consensus 1 mg-~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~------------------------t~~~~~~~~~v~~cv~~Ia 53 (403) T protein:vir:10 1 MG-FKSWITEKLNPGQRIIRDME--PVSHRTNRKPFT------------------------TGQAYSKIEILNRTANMVI 53 (403) T ss_pred Cc-chhhhhhccchhhhhhhccc--ccccccCCcccc------------------------cHHHHHHHHHHHHHHHHHH Confidence 11 111 2222232333444431 110000011110 1111223344444444443 Q ss_pred h-cccccceEeeccchhhhhhcchhHHHHHHHHHHhh-----cChhHHHHHHHHHHHHhcceeEEEEEcCcceeeeecCc Q lcl|NC_015263. 93 N-MPLYAYSVVPFKDISTANENKLKKELATVTEFLSR-----LNPKYNFSKIVKLAMTVDIFYGYVIDDKESVMIQQFPN 166 (513) Q Consensus 93 ~-mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k-----~n~k~~~~~i~~~~l~~g~~~gy~i~d~~~~~iq~lp~ 166 (513) . +..+-..|............ . ..+.....|+. +.-..+...++..++..|..|.+.. +..+.++|+ T Consensus 54 ~~ia~~p~~v~~~~~~~~~~~~-~--~~~~l~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gnayi~~~----~~~l~~l~~ 126 (403) T protein:vir:10 54 DSAAECSYTVGDKYNIVTYANG-V--KTKTLDTLLNVRPNPFMDISTFRRLVVTDLLFEGCAYIYWD----GTSLYHVPA 126 (403) T ss_pred HHHhhCceeEeecccccccccc-c--ccchHHHHHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEe----CceeEeecC Confidence 2 22222344321110000000 0 11122233442 2245677778888889998776542 345789999 Q ss_pred ceeEEEEEECCeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEEe------cCccc Q lcl|NC_015263. 167 DICKISSVSGGVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKI------NESSL 240 (513) Q Consensus 167 dyckIsg~~nG~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~------~~~~~ 240 (513) +.|.|..-.++.+ +.+... +. . .+ ++ +.-+-|+. ..+.. T Consensus 127 ~~~~v~~~~~~~~-~~~~~~---~~--~-~~-----------------------~~-----~eiih~~~~~~~~~~~~~~ 171 (403) T protein:vir:10 127 ALMQVEADANKFI-KKFIFN---NQ--I-NY-----------------------RV-----DEIIFIKDNSYVCGTNSQI 171 (403) T ss_pred cceEEEEcCCceE-EEEEec---Cc--e-ee-----------------------cc-----cceEEecccccccCCCCCc Confidence 9999875444433 222110 00 0 01 11 11222221 12345 Q ss_pred cchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhc--ccc---ceEE Q lcl|NC_015263. 241 TPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTV--PDN---VGVV 315 (513) Q Consensus 241 ~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~L--p~g---v~~v 315 (513) +|++|...+...+--.....+.. ..-..|-...-.-|.+ + -.++.++++++.+.+.++. .++ +..+ T Consensus 172 ~G~s~i~~~~~~i~~~~~~~~~~--~~~f~ng~~~~gil~~---~----~~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl 242 (403) T protein:vir:10 172 SGQSRVATVIDSLEKRSKMLNFK--EKFLDNGTVIGLILET---D----EILNKKLRERKQEELQLDYNPSTGQSSVLIL 242 (403) T ss_pred ccccHHHHHHHHHHHHHHHHHHH--HHHHhccCCcceEEEe---C----CCCCHHHHHHHHHHHHHHhCCcccCcceeec Confidence 67777655544443333333222 1122231111111111 1 1367777777777777765 222 2222 Q ss_pred Eecccccccccccccccch---hhhhhHHhhhhhhhhhhhhccCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_015263. 316 TSPMEIDTVSFDKDSSTDD---SVEKATKNFWDNAGVSQILFSSDNKTSQGIAMSIATDEQFIFGVINQLERWLNRYLLL 392 (513) Q Consensus 316 ~sP~~~d~i~ld~~~~~~d---tv~~~~~~i~~~~GiS~~Lfn~d~~s~~~~~~SI~~d~~~~~~~~~~iE~~~N~~i~~ 392 (513) -..+++..+.+..+..+-+ +.+-..+.|-.+.||...++|..+.+ +.-.....--..-+.-++.+||..+|+.| T Consensus 243 ~~g~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~s-n~e~~~~~f~~~tl~P~~~~ie~~l~~~L-- 319 (403) T protein:vir:10 243 DGGMKAKPYSQISSFKDLDFKEDIEGFNKSICLAFGVPQVLLDGGNNA-NIRPNIELFYYMTIIPMLNKLTSSLTFFF-- 319 (403) T ss_pred CCCceeEEecccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCc-CHHHHHHHHHHHHHHHHHHHHHHHHHHhc-- Confidence 2333444444432222222 23344577999999999999744332 22222222222333468999999999987 Q ss_pred cccceEEEEEecCCCCc--cHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCccccccccc Q lcl|NC_015263. 393 NGMSKYFKATMLEVTHF--SKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTS 470 (513) Q Consensus 393 ~~~~~~f~~~~l~~T~f--n~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~S 470 (513) +..|+|.+-..... ..+..++.+.++..-|.-..--.=+.+|+.|.+ .-+.+..++|+...-.. T Consensus 320 ---~~~~~~d~~~~~~l~~D~~~~~~~~~~~~~~G~lT~NE~R~~~gl~pi~----------~~~~d~~~~p~n~~~~~- 385 (403) T protein:vir:10 320 ---GYKITPNTKEVAALTPDKEAEAKHLTSLVNNGIITGNEARSELNLEPLD----------DEQMNKIRIPANVAGSA- 385 (403) T ss_pred ---CceeeeccchhhhcccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC----------ccccccccccccccccc- Confidence 23455555444333 557777777777777754444444456777742 12456777776532111 Q ss_pred ccccccCCccccCCCCcCCCCccccc Q lcl|NC_015263. 471 GSDIAENAIKEKGKENGRPTNETTGN 496 (513) Q Consensus 471 g~~~~~~~~~~~~~~~grPt~et~~n 496 (513) ...++.++|.|...|.++ T Consensus 386 --------~~~~~~e~~~~~~~~~g~ 403 (403) T protein:vir:10 386 --------TGVSGQEGGRPKGSTEGD 403 (403) T ss_pred --------ccCCCCcCCCCCCCcCCC Confidence 111234555554333333 No 50 >protein:vir:10362 Length: 432 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858954;genbank:gi:32128419;genbank:GeneID:2648396 Probab=98.10 E-value=4.6e-06 Score=49.84 Aligned_cols=399 Identities=10% Similarity=0.036 Sum_probs=189.0 Q ss_pred cCccccccccc-------ccchHHHHHHHhhhccChh---------HHHHHHHHHHHHHhhcchHHHHHHHHhh-ccccc Q lcl|NC_015263. 36 RTPVFGAPVGS-------LTSSQSKVRKIVKEYRNEG---------NQKTLRKVSEDLAVQSQQYQRLLNFYAN-MPLYA 98 (513) Q Consensus 36 ~~~~~~s~~~s-------~~~s~d~~k~~i~~~~P~~---------n~~~ir~~s~~lY~~sg~~~rlidy~~~-mpt~d 98 (513) +-|--+-.+|+ +....+......-. |.. ....-..++.--+-..+.+.+.|+.++. +..+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~s~~g~~v~~~~al~~~~V~~~i~~Ia~~ia~lp 78 (432) T protein:vir:10 1 MPDEKKLGLLGQLKAMFVPPDPVDIGGGQTFT--PVNATARDLGIIISDTGAAVNADAIMRLDAVAACVKLVSQAIAAMP 78 (432) T ss_pred CCCCcccchhhhhHhhcCCccccccccccccc--cCcchhhhhcccccccCcccchhhhhcchHHHHHHHHHHHhhhhCc Confidence 55554444443 11100000000000 100 0011112233334456778888888753 33333 Q ss_pred ceEeeccchhhhhhcchhHHHHHHHHHHhh-----cChhHHHHHHHHHHHHhcceeEEEEEc-CcceeeeecCcceeEEE Q lcl|NC_015263. 99 YSVVPFKDISTANENKLKKELATVTEFLSR-----LNPKYNFSKIVKLAMTVDIFYGYVIDD-KESVMIQQFPNDICKIS 172 (513) Q Consensus 99 Y~I~P~~~~~~~~~~~~~~~y~~v~~~L~k-----~n~k~~~~~i~~~~l~~g~~~gy~i~d-~~~~~iq~lp~dyckIs 172 (513) ..++ .... ++..+..-+.....|.. +.--.+...++..++..|..|.+.+.+ +....+.++|++.|.|. T Consensus 79 ~~~y----~~~~-~g~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~~g~~~~L~~l~~~~v~v~ 153 (432) T protein:vir:10 79 LTMY----MRTP-DGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVTDGRIESLQYLANDRLTIT 153 (432) T ss_pred eeEE----EecC-CCcccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEecCCcEEEEEEEcCCceEEE Confidence 3332 1111 11111111333344433 445666778888999999999888764 45578899999999998 Q ss_pred EEECCeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEEe-cCccccchhhHHHHHH Q lcl|NC_015263. 173 SVSGGVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKI-NESSLTPVPPFAGTFD 251 (513) Q Consensus 173 g~~nG~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~-~~~~~~~ip~f~~v~~ 251 (513) --.+|...|.+-. .+ ...++++.+.-+.|+. ..+...|++|...+-. T Consensus 154 ~~~~g~~~y~~~~---~~-----------------------------g~~~~~~~~~iih~~~~~~dg~~G~spi~~~~~ 201 (432) T protein:vir:10 154 TDTKGNTAYRYRR---TD-----------------------------GQMIDIPKQQIWKIMGYSLDGENGLSAIRYGAQ 201 (432) T ss_pred EcCCCcEEEEEEe---cC-----------------------------ceEEEEcCccEEEecCCCCCCcccccHHHHHHH Confidence 6677776553211 01 1233455555555553 2334568888776664 Q ss_pred hHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhc-cccceEEEeccccccccccc-c Q lcl|NC_015263. 252 SIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTV-PDNVGVVTSPMEIDTVSFDK-D 329 (513) Q Consensus 252 d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~L-p~gv~~v~sP~~~d~i~ld~-~ 329 (513) .+--....++.. ..-..|-...-.-+-+ ++ .++.++++++.+.+..+- ..++..+-..+++..+.+.. + T Consensus 202 ~i~~~~~~~~~~--~~~f~ng~~~~gil~~-----~~--~l~~e~~~~~~~~~~~~~nag~~~vl~~g~~~~~l~~~~~d 272 (432) T protein:vir:10 202 IFGTAIAAEAQA--ARAFRNGQLQSVYYQI-----DR--FLTDDQYDSFAKKVSGSVEAGRAPLLEGGMDVKSLGLNPVD 272 (432) T ss_pred HHHHHHHHHHHH--HHHHhcCCCcceEEec-----CC--CCCHHHHHHHHHHHhhhhhCCCceecCCCceEEEccCChHH Confidence 433323333321 1122331111111111 11 356667776666665543 23445555566666666532 1 Q ss_pred cccchhhhhhHHhhhhhhhhhhhhccCCCcc----hHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhh--cccceEEEEE Q lcl|NC_015263. 330 SSTDDSVEKATKNFWDNAGVSQILFSSDNKT----SQGI-AMSIATDEQFIFGVINQLERWLNRYLLL--NGMSKYFKAT 402 (513) Q Consensus 330 ~~~~dtv~~~~~~i~~~~GiS~~Lfn~d~~s----~~~~-~~SI~~d~~~~~~~~~~iE~~~N~~i~~--~~~~~~f~~~ 402 (513) ..--.+.+-..++|..+.||...++|....+ ++.+ ...+.-...-+.-++++||..+|+.|-. ...+..|+|. T Consensus 273 ~q~le~~~~~~~~Ia~afgVPp~~lg~~~~~t~~~~sn~e~~~~~f~~~tl~P~~~~ie~~ln~kL~~~~~~~~~~~~fd 352 (432) T protein:vir:10 273 AQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQLGFLSMTLSPWLRRIEQSIALNLLSPAERRRYFADFD 352 (432) T ss_pred HHHHHHHHHHHHHHHHHhCCCHHHcCCccCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCccccCceEEEee Confidence 1222334555688999999999999743221 2222 2333333333446889999999997743 2223346666 Q ss_pred ecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCcccc Q lcl|NC_015263. 403 MLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEK 482 (513) Q Consensus 403 ~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~ 482 (513) .-....-+.++..+.+.++.+-|. ++|.|+-.++-+|--. | .+.+.+.++.++ .-...+. ...+ T Consensus 353 ~~~ll~~d~~~r~~~~~~~~~~G~-----------~T~NE~R~~~glppi~-g-~~~~~~~~~~~~-pl~~~~~-~~~~- 416 (432) T protein:vir:10 353 TSALLRADSAARSSYYSQLVNNGL-----------MTRDEAREIEGLPKLG-G-NAAVLTVQSAMV-PLDSIGL-QASP- 416 (432) T ss_pred chhhhccCHHHHHHHHHHHHhCCC-----------CCHHHHHHHhCCCCCC-C-CcceEeecCccc-chhhhcc-cCCC- Confidence 656666678888888777765552 3444444433332101 1 112222222221 1010000 0000 Q ss_pred CCCCcCCCCcccccccCCCCCCCCCCccC Q lcl|NC_015263. 483 GKENGRPTNETTGNKDSDETQRAKDKPAN 511 (513) Q Consensus 483 ~~~~grPt~et~~n~~~~~~~~~~d~~~~ 511 (513) .+.+|.|. ...+.+.+ T Consensus 417 ~~~~~~~~-------------~~~~~~~~ 432 (432) T protein:vir:10 417 EPASGLGN-------------QQQDKVSK 432 (432) T ss_pred CCCCCCCC-------------cccccccC Confidence 01111111 11111111 No 51 >protein:vir:99563 Length: 862 # NCBI annotation: minor head protein-like protein # Family: family:all:297 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039808;genbank:gi:126011058;genbank:GeneID:4818258 Probab=98.07 E-value=5.5e-06 Score=49.42 Aligned_cols=475 Identities=12% Similarity=0.071 Sum_probs=185.8 Q ss_pred CCCccchheeeeehhhhhhHHH---------HHHHHHHHHHhhccCccc--------------ccccccccc-hHHHHHH Q lcl|NC_015263. 1 MVKNKKKRLSMIDVESISSYSN---------KRNNRISILRDDNRTPVF--------------GAPVGSLTS-SQSKVRK 56 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~i~~~~~~~~~~--------------~s~~~s~~~-s~d~~k~ 56 (513) -||.++-+... ++.-+|-. .++++.++..|-...|.- ...+..... +...+-+ T Consensus 47 ~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~a~~~a~~~~~~~~~~~~~Dgl~n~~~~lG~~~~~s~y~~~~ 123 (862) T protein:vir:99 47 PVQKEKPNPII---RSVKDFPFVEISDSVNAKSVSGKNFAMDSAVRSAIKAITGFAMDDGGGAPVPIGAEGKQSSYAVPE 123 (862) T ss_pred CcccccCCCCC---Ccccccccccccccccchhhhhhhhcchhhcchhhhhhhhhhhhcchhhhhhccccccccccccch Confidence 24433333221 22222222 223444444444443321 011111000 1111112 Q ss_pred Hhhhcc-ChhHHHHHHHHHHHHHhhcchHHHHHHHHhhcccccceEeeccchhhhhhcchhHHHHHHHHHHhhcChhHHH Q lcl|NC_015263. 57 IVKEYR-NEGNQKTLRKVSEDLAVQSQQYQRLLNFYANMPLYAYSVVPFKDISTANENKLKKELATVTEFLSRLNPKYNF 135 (513) Q Consensus 57 ~i~~~~-P~~n~~~ir~~s~~lY~~sg~~~rlidy~~~mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~ 135 (513) ...+|+ |...... .++. +|..++.++++|+-.....+-...-..-.......+.+..+.+ -..+++++++.-+ T Consensus 124 ~~~~~~~~~~f~gy--ql~a-lY~~~~larkiVd~pAeDatR~g~~I~~~~d~~e~~~e~~~~i---e~~~~rL~v~~~l 197 (862) T protein:vir:99 124 ALQDWYLSQGFIGH--QACA-LIAQHWLVDKACSLAGEDAIRNGWHLKSLGEGEEIDEESLEKF---KAIDVEFKVKENL 197 (862) T ss_pred hccccccccCcccH--HHHH-HHHhCchhhhhhhhhhHHHhhCCceEeecCcccccCHHHHHHH---HHHHHHhhHHHHH Confidence 223332 2222211 2333 7999999999999999998888877753222222222223333 3345666666555 Q ss_pred HHHHHHHHHhcceeEEEE-EcCcce-eeeecCcceeEEEEEECCee--EEEEEeeeccCcchhccccHHHHHHHHHHhhh Q lcl|NC_015263. 136 SKIVKLAMTVDIFYGYVI-DDKESV-MIQQFPNDICKISSVSGGVY--NYVIDLDALVSADIVDYYPKEIQEAVNKYTTM 211 (513) Q Consensus 136 ~~i~~~~l~~g~~~gy~i-~d~~~~-~iq~lp~dyckIsg~~nG~y--~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~ 211 (513) .+.++-.-..|..+.+.. +..++. .=+||+++. +..|.+ ..+||-.+-.....-.....-....|-+-+.+ T Consensus 198 ~eair~~RLyGga~ililv~~~D~~~LsqPLn~e~-----I~kG~lkgl~vlDp~w~~p~~v~~~~~Dp~sp~yGkP~~y 272 (862) T protein:vir:99 198 IEFNRFKNVFGIRVAIFVVDSEDPDYYEKPFNPDG-----ITPGSYRGISQIDPYWMMPMLTAESTADPSSQFFYEPEFW 272 (862) T ss_pred HHHHHhcccccceEEEEEecCcCchhhhcCcCccc-----ccccceeEEEEechhhhcccccccccccccccccCCceee Confidence 544443333444444433 333443 346776552 344433 33344322111000000000000000000000 Q ss_pred hhccCcccccCeeecCCceEEEEecCc----------cccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeecc Q lcl|NC_015263. 212 KKGNNKSASNWYEIQDKNSICIKINES----------SLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLET 281 (513) Q Consensus 212 k~~~~~~~~~W~~L~~~kt~~ik~~~~----------~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~ 281 (513) . .++ ..+-+.+ +|.+... ..||+|.+-.++.-+.+.+.-... +-.-+....+.+-++.. T Consensus 273 ~-I~g------~~IH~SR--liif~g~~vpd~lk~ay~f~G~SvLe~iyd~L~~~d~t~~s--aa~Ll~ka~l~v~ktd~ 341 (862) T protein:vir:99 273 I-ISG------QKYHRSH--LIIARGPQPADILKPTYIFGGIPLVQRIYERVYAAERTANE--APLLAMNKRTTAIHTDT 341 (862) T ss_pred e-ecC------eeeccce--eEEecCCCchhhhhccCCccCccHHHHHHHHHHHHHHHHHH--HHHHHHHhccceeechh Confidence 0 000 0122222 2333222 125677666666655554433332 22222222333334431 Q ss_pred ccCCCCCccccCHHHHHHHHHHHHHhc-cccceEEEecccccccccccccccchhhhhhHHhhhhhhhhhhh-hccCC-C Q lcl|NC_015263. 282 RSSNDNNDFTLDMPMMNYFHEALSMTV-PDNVGVVTSPMEIDTVSFDKDSSTDDSVEKATKNFWDNAGVSQI-LFSSD-N 358 (513) Q Consensus 282 ~~~n~~~~~~vd~~~~~~~~~~ik~~L-p~gv~~v~sP~~~d~i~ld~~~~~~dtv~~~~~~i~~~~GiS~~-Lfn~d-~ 358 (513) - . -..+.+.+.+--+.+.+.- -.|+..+-+.=+++.++.+-+ ...+.+....+.|-.++||-.. |||-. + T Consensus 342 l--~----~l~~ed~l~~r~~~~~~~rdN~Gi~liD~eEe~e~ls~slS-GL~dll~~~~q~IAaas~IP~tiLfGqspa 414 (862) T protein:vir:99 342 A--K----AIANEDKFIQRLMFWVRYRDNHAVKVLGTDETMEQFDTSLA-DFDAVIMGQYQLVASIAKTPATKLLGTAPK 414 (862) T ss_pred H--h----hhccHHHHHHHHHHHHhccCcceeEEecCCCceeEEecccC-ChHHHHHHHHHHHHhhhCCCceeecccCcc Confidence 0 0 0112222222222222222 134544444334454444433 4457888888999999999855 77643 2 Q ss_pred cchHHHHHHHHHHHHHHHHHHH-HHHHHHHHHH----hhcccceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHH Q lcl|NC_015263. 359 KTSQGIAMSIATDEQFIFGVIN-QLERWLNRYL----LLNGMSKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLA 433 (513) Q Consensus 359 ~s~~~~~~SI~~d~~~~~~~~~-~iE~~~N~~i----~~~~~~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~la 433 (513) +-.++-..-+..-...|-++++ .++..+.+++ ........|.|.|.++.-.+.+|.++..++.++-= .. +. T Consensus 415 GlnATGE~D~~nYyD~I~s~QE~~L~P~LerL~~li~~~lg~~~d~~ieFnpL~~~sekEkAEi~kk~Aea~---~~-lv 490 (862) T protein:vir:99 415 GFNSTGEFETISYHEELESIQEHVYMPFLQRHYLISRLSLGIQHEIDVVMEPVASMTAQQQADLNKTKAEGG---KV-LI 490 (862) T ss_pred cccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcceEEeCCCCCCCHHHHHHHHHHHHHHH---HH-HH Confidence 2222222222222222222222 1333333322 22223346999999999999999999977664211 12 22 Q ss_pred HHhC-CCHHHHHHHHHHHHHhhCcccc---cCc---ccc----cccccccccccCC-------ccccC-CCCcCC---CC Q lcl|NC_015263. 434 SLMG-IDPVAFTGLLKVENEMLDLPEI---MTP---LSS----SFNTSGSDIAENA-------IKEKG-KENGRP---TN 491 (513) Q Consensus 434 a~~G-~~p~~~~~~~~~E~e~L~l~~~---~~P---l~T----S~T~Sg~~~~~~~-------~~~~~-~~~grP---t~ 491 (513) . .| ++|.++...+..+. ..++... -++ ... +..+..+.....+ ++... .+++.| -. T Consensus 491 ~-sGvispdEvR~~L~~~~-~~g~~~l~ded~E~d~~~~~e~~~~~e~~g~a~~~ap~de~~aga~~~~~e~d~~~~p~~ 568 (862) T protein:vir:99 491 D-GGVISPDEERNRIRDDK-RSGYNRLTKEDAEETPGASPENLAAYQKAGAAQETASAKETQAGAAVTTAEGDQPNVQMV 568 (862) T ss_pred h-cCCCCHHHHHHHHHhcC-CcCCCCCCcccccccCCCCcccccccccCCcccccccccccccccCCccccCCccccccc Confidence 2 46 79999999876542 3233221 111 000 0000000000000 00000 000000 00 Q ss_pred c------ccccccCCCCCC-CCCCccCCC Q lcl|NC_015263. 492 E------TTGNKDSDETQR-AKDKPANTQ 513 (513) Q Consensus 492 e------t~~n~~~~~~~~-~~d~~~~~~ 513 (513) + +.+......... +.+.|.-++ T Consensus 569 ~~~~~g~~~~~t~~~~a~~p~~~~~~~~~ 597 (862) T protein:vir:99 569 PSMKPGQMVGPEVGITAPMPEDDAPVAGV 597 (862) T ss_pred CCCCCCCccccccccccCCCccccccCcc Confidence 0 001001111111 111111111 No 52 >protein:vir:98396 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918929;genbank:gi:119443691;genbank:GeneID:4594558 Probab=98.06 E-value=5.5e-06 Score=49.42 Aligned_cols=406 Identities=12% Similarity=0.023 Sum_probs=174.8 Q ss_pred CCCccchheeeeehhhhhhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhcc-ChhHHHHHHHHHHHHHh Q lcl|NC_015263. 1 MVKNKKKRLSMIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYR-NEGNQKTLRKVSEDLAV 79 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~-P~~n~~~ir~~s~~lY~ 79 (513) ..-.|.-|-+|+-.-- |. +.-.|++..+....+..+. ....+..... +...+.+|+. T Consensus 13 ~~~~~~~~~~~~~~~~------f~---~~e~r~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~al~~------- 70 (441) T protein:vir:98 13 FKSRKQSRKELVVVGI------FY---KNEKRDLQYNEDDLQMMVQ------TLPGFQGTKLRQYKDIEAIRH------- 70 (441) T ss_pred cccccchhhhhhcccc------cc---ccccccccCCCcchHHHHH------HhhcccccCccccchhhhhcc------- Confidence 1111122222221100 00 0001221111111000000 0000000000 1222222222 Q ss_pred hcchHHHHHHH----HhhcccccceEeeccchhhhhhcchhHHHHHHHHHHh-hcC----hhHHHHHHHHHHHHhcceeE Q lcl|NC_015263. 80 QSQQYQRLLNF----YANMPLYAYSVVPFKDISTANENKLKKELATVTEFLS-RLN----PKYNFSKIVKLAMTVDIFYG 150 (513) Q Consensus 80 ~sg~~~rlidy----~~~mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~-k~n----~k~~~~~i~~~~l~~g~~~g 150 (513) +.+.+.|+. +++|| ..++ .+ +....+ +.+...|. +=| --.+...++..++..|..|. T Consensus 71 --~~V~acv~~Ia~~iA~lp---l~~~--~~------~~~~~~-~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~ 136 (441) T protein:vir:98 71 --SDIFTAVMMIASDLARMP---IRVT--VN------GQINYS-DRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYI 136 (441) T ss_pred --HHHHHHHHHHHHhhccCc---eEEe--cC------Cccccc-chHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEE Confidence 222333343 44444 3343 11 111111 11233333 222 34667788888899999999 Q ss_pred EEEEcCcc--eeeeecCcceeEEEEEECCeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCC Q lcl|NC_015263. 151 YVIDDKES--VMIQQFPNDICKISSVSGGVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDK 228 (513) Q Consensus 151 y~i~d~~~--~~iq~lp~dyckIsg~~nG~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~ 228 (513) +++-+.++ .-+.++|++.|.+.--.+|...|.+-. ++-.. . ..-..+++. T Consensus 137 ~i~r~~~G~~~~L~~i~~~~v~v~~~~~g~~~~~~~~--~~~~~---------------~-----------~~~~~~~~~ 188 (441) T protein:vir:98 137 EITRDKTGEPMNLTFRKTSEIELKLDARGRLYYFHQR--IDSNG---------------N-----------NIERNVKFE 188 (441) T ss_pred EEEEcCCCcEEEEEEEcCceeEEEECCCCcEEEEEEE--eccCc---------------c-----------eeeEEEccc Confidence 98876554 678999999999976667766554321 11000 0 001234443 Q ss_pred ceEEEEe-cCccccchhhHHHHHHhHHHHH-HHHHHHhhHhhhhhceeeee--eeccccCCCCCccccCHHHHHHHHHHH Q lcl|NC_015263. 229 NSICIKI-NESSLTPVPPFAGTFDSIYDIH-SFKDLRNDKAELQNYKLLIQ--KLETRSSNDNNDFTLDMPMMNYFHEAL 304 (513) Q Consensus 229 kt~~ik~-~~~~~~~ip~f~~v~~d~~di~-~~kdL~~~~~~i~n~~ii~~--kip~~~~n~~~~~~vd~~~~~~~~~~i 304 (513) .-+-|+. ..+...|+||...+.. .+++. ..++... .-..|-...-. ++| + ...+.++.+.+.+.+ T Consensus 189 dviHir~~~~dg~~G~spi~~~~~-~i~~~~a~~~~~~--~~f~ng~~~~gil~~~----~----~~~~~e~~~~~~~~~ 257 (441) T protein:vir:98 189 DMLDIKFYSLDGINGLSLLDTLSR-TIESDNNGKDFLN--NFLRNGTHAGGILKMK----G----VLDNKKARDRAREEF 257 (441) T ss_pred cEEEeccCCCCCccccCHHHHHHH-HHHHHHHHHHHHH--HHHhccCCCcEEEEeC----C----CCCCHHHHHHHHHHH Confidence 3455554 2344578888666554 33332 2333221 11233111111 233 1 111345555555666 Q ss_pred HHhcc--c---cceEEEecccccccccccc-cccchhhhhhHHhhhhhhhhhhhhccCCCcchHHHHHHHHHHHHHHHHH Q lcl|NC_015263. 305 SMTVP--D---NVGVVTSPMEIDTVSFDKD-SSTDDSVEKATKNFWDNAGVSQILFSSDNKTSQGIAMSIATDEQFIFGV 378 (513) Q Consensus 305 k~~Lp--~---gv~~v~sP~~~d~i~ld~~-~~~~dtv~~~~~~i~~~~GiS~~Lfn~d~~s~~~~~~SI~~d~~~~~~~ 378 (513) +++.- + ++..+-..+++..+.+... ..--++..-..++|..+.||...++|.++.+++.....+.-. .-+.-+ T Consensus 258 ~~~~~G~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~~~~s~~q~~~~y~-~tl~P~ 336 (441) T protein:vir:98 258 HKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANMSITDANLDYL-STLKPY 336 (441) T ss_pred HHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCCCccHHHHHHHHH-HHHHHH Confidence 55551 2 2344444455555554321 111123344457899999999999987766554433333211 223358 Q ss_pred HHHHHHHHHHHHhhcccceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhhCccc Q lcl|NC_015263. 379 INQLERWLNRYLLLNGMSKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENEMLDLPE 458 (513) Q Consensus 379 ~~~iE~~~N~~i~~~~~~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~ 458 (513) +.+||..+|+.|-....+..|+|..-.+...+.++.++.+.++..-|. ++|.|+-.++-++- +=|=+ T Consensus 337 ~~~ie~~ln~~L~~~~~~~~~~fd~~~llr~d~~~~~~~~~~~~~~G~-----------~T~NE~R~~~gl~p-i~gGd- 403 (441) T protein:vir:98 337 ITCVCAELNFKFNDEYVNREFKFDTTEIRVVDEKTQAEIDKINIDSGK-----------MNIDEIRQRDGLAP-IPGGN- 403 (441) T ss_pred HHHHHHHHHhhccccccCceEEEechhhhccCHHHHHHHHHHHHhCCC-----------cCHHHHHHHhCCCC-CCCCC- Confidence 889999999988654445668877767777788888888877776663 34444432222210 00111 Q ss_pred ccCcccccccccccccccCCccccCCCCcCCCCccccc-ccCCCCC Q lcl|NC_015263. 459 IMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETTGN-KDSDETQ 503 (513) Q Consensus 459 ~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~~n-~~~~~~~ 503 (513) ......+.++..-. ..+. -+..++. ++..+ +.+++++ T Consensus 404 ~~~~~~~~n~~~~~----~~~~---~q~~~~~-~~~~~~kgGe~ne 441 (441) T protein:vir:98 404 GSIHRVDLNHVNIE----LVDE---YQMNKSR-ATDKKLKGGEENE 441 (441) T ss_pred cceEeecccccccc----cccc---ccccccc-ccccccCCCCCCC Confidence 11111111111100 0000 0000010 00011 1111111 No 53 >protein:vir:81072 Length: 432 # NCBI annotation: p07 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285677;genbank:gi:148727185;genbank:GeneID:5247117 Probab=98.03 E-value=6.5e-06 Score=49.02 Aligned_cols=414 Identities=11% Similarity=0.076 Sum_probs=185.4 Q ss_pred eeehhhhhhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhcchHHHHHHH Q lcl|NC_015263. 11 MIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQQYQRLLNF 90 (513) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~~~rlidy 90 (513) |-|+.-..-++..++. ++ ..+|.....-++....... ...+--+ |.. .-..++.--+-..+.+.+.|+. T Consensus 1 ~~~~~~mg~f~r~~~~----~~--~~~~~~~~~~~~~~~~~~~-~~~~~~~-~~~---~g~~v~~~~al~~~~V~~~i~~ 69 (432) T protein:vir:81 1 MPDEKKLGLFGQLKAM----FV--PPDPVDIGGGQTFTPVNAT-ARDLGII-ISD---TGAAVNADAIMRLDAVAACVKL 69 (432) T ss_pred CCchhhcchhhhhhhh----cc--cccccccccccccccCccc-hhhhccc-ccc---cCcccchHhhhccHHHHHHHHH Confidence 5565555555443221 10 0122211111110000000 0001000 100 0001111223344666677776 Q ss_pred Hh-hcccccceEeeccchhhhhhcchhHHHHHHHHHHh-h----cChhHHHHHHHHHHHHhcceeEEEEE-cCcceeeee Q lcl|NC_015263. 91 YA-NMPLYAYSVVPFKDISTANENKLKKELATVTEFLS-R----LNPKYNFSKIVKLAMTVDIFYGYVID-DKESVMIQQ 163 (513) Q Consensus 91 ~~-~mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~-k----~n~k~~~~~i~~~~l~~g~~~gy~i~-d~~~~~iq~ 163 (513) ++ ++..+...++-- ..+.. ... .+ +.+...|. + +.-..+...++..++..|..|.+++. ++....+.+ T Consensus 70 Ia~~ia~lp~~~y~~-~~~g~--~~~-~~-~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnayv~i~~~~g~~~~L~~ 144 (432) T protein:vir:81 70 VSQAIAAMPLTMYMR-TPDGR--KEA-VN-HPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVTDGRIESLQY 144 (432) T ss_pred HHHhhhhCceeeEEe-cCCcc--eec-cc-chHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEecCCcEEEEEE Confidence 64 233333333310 11111 111 11 22333443 2 33456777888889999999988765 445568889 Q ss_pred cCcceeEEEEEECCeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEEe-cCccccc Q lcl|NC_015263. 164 FPNDICKISSVSGGVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKI-NESSLTP 242 (513) Q Consensus 164 lp~dyckIsg~~nG~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~-~~~~~~~ 242 (513) +|++.|.+.--.+|...|.+-. .+ ...++++.+.-+.|+. ..+..+| T Consensus 145 l~~~~v~v~~~~~g~~~y~~~~---~~-----------------------------g~~~~~~~~~iih~r~~~~dg~~G 192 (432) T protein:vir:81 145 LANDRLTITTDPKGNTAYRYRR---TD-----------------------------GQMIDIPKQQIWKIMGYSLDGENG 192 (432) T ss_pred EcCCceEEEECCCCcEEEEEEe---cC-----------------------------ceEEEEccccEEEecCCCCCCccc Confidence 9999999986667765554311 01 1223344444444543 2234568 Q ss_pred hhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhccc-cceEEEecccc Q lcl|NC_015263. 243 VPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTVPD-NVGVVTSPMEI 321 (513) Q Consensus 243 ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~Lp~-gv~~v~sP~~~ 321 (513) ++|...+-..+--....++.. ..-..|-...-.-|-+ + -.++.++++.+.+.++.+.-. ++..+-..+++ T Consensus 193 ~spi~~~~~~i~~~~~~~~~~--~~~f~ng~~~~gil~~---~----~~l~~e~~~~~~~~~~~~~nag~~~vl~~g~~~ 263 (432) T protein:vir:81 193 LSAIRYGAQIFGTAIAAEAQA--ARAFRNGQLQSVYYQI---D----RFLTDDQYDSFAKKVSGSVEAGRAPLLEGGMDV 263 (432) T ss_pred ccHHHHHHHHHHHHHHHHHHH--HHHHhcCCCcceEEec---C----CCCCHHHHHHHHHHHhhhhcCCCceecCCCceE Confidence 888766554333333333221 1122331111111111 1 135667777777777654422 34455455566 Q ss_pred ccccccc-ccccchhhhhhHHhhhhhhhhhhhhccCCCcc----hHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhh--c Q lcl|NC_015263. 322 DTVSFDK-DSSTDDSVEKATKNFWDNAGVSQILFSSDNKT----SQGI-AMSIATDEQFIFGVINQLERWLNRYLLL--N 393 (513) Q Consensus 322 d~i~ld~-~~~~~dtv~~~~~~i~~~~GiS~~Lfn~d~~s----~~~~-~~SI~~d~~~~~~~~~~iE~~~N~~i~~--~ 393 (513) ..+.+.. +..--.+.+-..++|..+.||...++|....+ ++.+ ...+.-...-+.-++++||..+|+.|-. + T Consensus 264 ~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~sn~eq~~~~f~~~tl~P~~~~ie~~l~~kLl~~~~ 343 (432) T protein:vir:81 264 KSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQLGFLTMTLSPWLRRIEQSIALNLLSPAE 343 (432) T ss_pred EEccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCccccccchHHHHHHHHHHHHHHHHHHHHHHHHHhhccCccc Confidence 6665532 11222334455588999999999999644322 1222 2222222233445889999999997743 3 Q ss_pred ccceEEEEEecCCCCccHHHHHHHHHHHHhcC-CcHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCccccccccccc Q lcl|NC_015263. 394 GMSKYFKATMLEVTHFSKKEAHDRYITDAQYG-FPVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGS 472 (513) Q Consensus 394 ~~~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G-~~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~ 472 (513) ..+..|+|..-....-+.++..+.+.++.+-| +.+=...+ .+|+.|.+ |=++.+ ..++-++ .-. T Consensus 344 ~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~t~NE~R~-~~glpp~~------------g~~~~~-~~~~~~~-pl~ 408 (432) T protein:vir:81 344 RRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEARE-IEGLPKLG------------GNAAVL-TVQSAMV-PLD 408 (432) T ss_pred cCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHH-HhCCCCCC------------CCcceE-eecCccc-chh Confidence 22345677666666677888888887776665 23333222 24554421 111121 1111111 000 Q ss_pred ccccCCccccCCCCcCCCCcccccccCCCCCCCCCCccC Q lcl|NC_015263. 473 DIAENAIKEKGKENGRPTNETTGNKDSDETQRAKDKPAN 511 (513) Q Consensus 473 ~~~~~~~~~~~~~~grPt~et~~n~~~~~~~~~~d~~~~ 511 (513) ..+... .++ | ..++.+...+.+.+ T Consensus 409 ~~~~~~----~~~---~--------~~~~~n~~~~~~~~ 432 (432) T protein:vir:81 409 SIGLQA----SPE---P--------ASGLGNQQQDKVSK 432 (432) T ss_pred hhccCC----CCC---C--------CCCCCCcccccccC Confidence 000000 000 0 01111111111111 No 54 >protein:vir:2683 Length: 412 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075502;genbank:gi:12719431;genbank:GeneID:920150 Probab=98.03 E-value=6.6e-06 Score=49.00 Aligned_cols=391 Identities=10% Similarity=0.074 Sum_probs=180.0 Q ss_pred eeeeehhhhhhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhcchHHHHH Q lcl|NC_015263. 9 LSMIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQQYQRLL 88 (513) Q Consensus 9 ~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~~~rli 88 (513) .+.|-.+.+.. .+.+.+....-..+.....-|+..... ....++.--+.+++.+.+.| T Consensus 1 m~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------------~~~~v~~~~a~~~~~v~~~i 58 (412) T protein:vir:26 1 MNVIAKENIVT-----RIKKKLIDNWIDQSTSKLYDFSPWKNR-----------------SFWGVINNTLETNETIFSAI 58 (412) T ss_pred Cccchhhhhhh-----hhhhhHhhhhhcccccccccccccCCc-----------------cccccchhhhhccHHHHHHH Confidence 22222222211 122222222222221000111111111 01112223344566677777 Q ss_pred HHHhh-cccccceEeeccchhhhhhcchhHHHHHHHHHHh-hcC----hhHHHHHHHHHHHHhcceeEEEEEcCcc--ee Q lcl|NC_015263. 89 NFYAN-MPLYAYSVVPFKDISTANENKLKKELATVTEFLS-RLN----PKYNFSKIVKLAMTVDIFYGYVIDDKES--VM 160 (513) Q Consensus 89 dy~~~-mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~-k~n----~k~~~~~i~~~~l~~g~~~gy~i~d~~~--~~ 160 (513) ++++. +..+.-.++ . .++... +.+...|. +-| -..+...++..++..|..|.++.-+..+ +- T Consensus 59 ~~ia~~iA~lp~~~~----~---~~~~~~---~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~ 128 (412) T protein:vir:26 59 TKLSNSMASLPLKMY----E---DYKVVN---TEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSK 128 (412) T ss_pred HHHHHhHhhCceeEe----e---cccccc---chHHHHHHhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEE Confidence 77652 222333332 1 111111 22333443 333 4556678888999999999999876544 67 Q ss_pred eeecCcceeEEEEEECC-eeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEEe--cC Q lcl|NC_015263. 161 IQQFPNDICKISSVSGG-VYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKI--NE 237 (513) Q Consensus 161 iq~lp~dyckIsg~~nG-~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~--~~ 237 (513) +.+||+++|.+.--.++ ...|.+...- ... +.+++.--+-|+- .. T Consensus 129 L~~l~~~~v~v~~~~~~~~~~y~~~~~~---g~~-----------------------------~~~~~~evih~~~~~~~ 176 (412) T protein:vir:26 129 LFLLNPDVVEMLIENQSRELYYSIHAAT---GNK-----------------------------LIVHNMDMLHFKHIVAS 176 (412) T ss_pred EEEEcCceeEEEEeCCCcEEEEEEEcCC---ceE-----------------------------EEEccccEEEeCCCCCC Confidence 88999999999865553 3444433211 000 1233322333432 12 Q ss_pred ccccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhccc--cceEE Q lcl|NC_015263. 238 SSLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTVPD--NVGVV 315 (513) Q Consensus 238 ~~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~Lp~--gv~~v 315 (513) +..+|++|...+. ..+++...-+--. +.+..-.-.-+-. ....++.++++++.+.+++..-+ ++..+ T Consensus 177 ~~~~G~s~i~~~~-~~i~~~~a~~~~~----~~~~~~~~~~i~~------~~~~l~~e~~~~~~~~~~~~~~~~g~~~vl 245 (412) T protein:vir:26 177 NMVQGISPIDVLK-NTTDFDNAVRTFN----LTEMQKPDSFMLK------YGSNVGKEKRQQVLEDFKQYYEENGGILFQ 245 (412) T ss_pred CCcccccHHHHHH-HHHHHHHHHHHHH----HHhcCCCCceEEe------cCCCCCHHHHHHHHHHHHHHhhcCCCeeec Confidence 3456888876654 4444443222111 1111111111111 11235667777766666665532 23233 Q ss_pred Eecccccccccccccccchhh---hhhHHhhhhhhhhhhhhccCCCc-chHHHHHHHHHHHH-HHHHHHHHHHHHHHHHH Q lcl|NC_015263. 316 TSPMEIDTVSFDKDSSTDDSV---EKATKNFWDNAGVSQILFSSDNK-TSQGIAMSIATDEQ-FIFGVINQLERWLNRYL 390 (513) Q Consensus 316 ~sP~~~d~i~ld~~~~~~dtv---~~~~~~i~~~~GiS~~Lfn~d~~-s~~~~~~SI~~d~~-~~~~~~~~iE~~~N~~i 390 (513) -..+++..+.+. ..+.+.+ +-..++|..+.||...++|+... +.+.+...-..... -+.-++++||..+|+.| T Consensus 246 ~~g~~~~~l~~~--~~d~q~~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~kL 323 (412) T protein:vir:26 246 EPGVEIEPLPKK--YVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNRFYLQHTLLPIVKQYEEEFNRKL 323 (412) T ss_pred CCCceEEEcCCC--hhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 233344433332 2232332 33457899999999999876543 22223222222222 23458999999999977 Q ss_pred hhc--c-cceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCcccccc Q lcl|NC_015263. 391 LLN--G-MSKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSF 467 (513) Q Consensus 391 ~~~--~-~~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~ 467 (513) -.. . .+..|+|..-+....+.++.++.+.++..-|.=..--.-+.+|+.|.+ +-|..++|.. + T Consensus 324 l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~------------ggD~~~~~~n--~ 389 (412) T protein:vir:26 324 LTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVE------------GGDKPLISGD--L 389 (412) T ss_pred CCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC------------CcCeeeeccc--c Confidence 432 1 234577776677777889999999888887743332233345666642 2233333221 1 Q ss_pred cccccccccCCccccCCCCcCCCCcccccccCCCCCCCCCC Q lcl|NC_015263. 468 NTSGSDIAENAIKEKGKENGRPTNETTGNKDSDETQRAKDK 508 (513) Q Consensus 468 T~Sg~~~~~~~~~~~~~~~grPt~et~~n~~~~~~~~~~d~ 508 (513) + .-.. .. + .+....|+.++...+ T Consensus 390 ~-~~~~----~~-----~--------~~~~~~gG~~n~~e~ 412 (412) T protein:vir:26 390 Y-PIDT----PL-----E--------LRKSLKGGDKNVNES 412 (412) T ss_pred c-cccc----ch-----h--------hcccccCCCCCcCCC Confidence 1 0000 00 0 000001111111111 No 55 >protein:vir:95378 Length: 406 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764474;genbank:gi:115334628;genbank:GeneID:5179265 Probab=98.03 E-value=6.6e-06 Score=48.97 Aligned_cols=379 Identities=11% Similarity=0.028 Sum_probs=171.9 Q ss_pred CCCccchheeeeehhhhhhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhh Q lcl|NC_015263. 1 MVKNKKKRLSMIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQ 80 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~ 80 (513) .-+.++++..+-+ ......|....+.. . +.-+. -.+.+ T Consensus 7 ~~~~~~~~~~~~~-------------------------~~~~~~~~~~~~~~-----~----~~~~~--------~~~~~ 44 (406) T protein:vir:95 7 WRRTKRKSKIRAD-------------------------TGYVGLFMSGEDVS-----F----LVPGY--------VRLSD 44 (406) T ss_pred hcccccccccccc-------------------------chhhhhhccCcccC-----c----cccCH--------HHHhh Confidence 0111111111000 00001111000000 0 00011 11235 Q ss_pred cchHHHHHHHHhh-cccccceEeeccchhhhhhcchhHHHHHHHH-HHhhc----ChhHHHHHHHHHHHHhcceeEEEE- Q lcl|NC_015263. 81 SQQYQRLLNFYAN-MPLYAYSVVPFKDISTANENKLKKELATVTE-FLSRL----NPKYNFSKIVKLAMTVDIFYGYVI- 153 (513) Q Consensus 81 sg~~~rlidy~~~-mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~-~L~k~----n~k~~~~~i~~~~l~~g~~~gy~i- 153 (513) ++.+++.|+.+++ +..+++.++--..... .. .+ ..... ++.+= .-..++..++..++..|..|.|++ T Consensus 45 ~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~---~~--~~-~~~~~~l~~~PN~~~t~~~f~~~~~~~~ll~g~g~a~~~~ 118 (406) T protein:vir:95 45 NPEVRMAVHKIADLISSMTIYLMQNTEDGD---IR--IR-NELSRKIDITPYSLMTRKSWMYNIVYTMLLDGEGNSVVFP 118 (406) T ss_pred cHHHHHHHHHHHHhhccCceEEEEecCCcc---ee--ec-chHHHHHhhccCCCCCHHHHHHHHHHHHHhcCCceEEEEE Confidence 7788888888764 4556666653221111 01 11 12222 23333 345666777778888887777754 Q ss_pred -Ec--CcceeeeecCcceeEEEEEECCeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCce Q lcl|NC_015263. 154 -DD--KESVMIQQFPNDICKISSVSGGVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNS 230 (513) Q Consensus 154 -~d--~~~~~iq~lp~dyckIsg~~nG~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt 230 (513) .+ +...-+.++|+++|++.--. +.|+|.++-. +++..-- T Consensus 119 ~~~~~g~~~~l~~i~~~~v~~~~~~-~~~~~~~~~~-------------------------------------~~~~~ev 160 (406) T protein:vir:95 119 KYTADGLIDELVPLTPSKVNFLDTP-DGYQVLYGGQ-------------------------------------TFNYDEV 160 (406) T ss_pred EECCCCcEEEEEEEcCceeEEEEcC-CeEEEEeccE-------------------------------------EEchhHE Confidence 23 34467899999999997444 4477655422 1222223 Q ss_pred EEEEecC---ccccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHh Q lcl|NC_015263. 231 ICIKINE---SSLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMT 307 (513) Q Consensus 231 ~~ik~~~---~~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~ 307 (513) +.|+.+. +...|++|...+...+.-.....+... .-..|-...-.-|-+ +. .++.++++++.+.+++. T Consensus 161 ih~~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~--~~~~ng~~~~~il~~---~~----~l~~e~~~~~~~~~~~~ 231 (406) T protein:vir:95 161 LHFIYNPDPERPYIGRGYRVVLKDIADNLKQATATKK--SFMSGKYMPSLIVKV---DA----ATAELSSEEGRNAVFKK 231 (406) T ss_pred EEeeccCCCCCCccccCHHHHHHHHHHHHHHHHHHHH--HHHhccCCcceEEEe---CC----CCCHHHHHHHHHHHHHH Confidence 4455422 233577877766665544444444332 122221111111111 11 25666766666666665 Q ss_pred c--cc--c-ceEEEecc-ccccc-cccc-ccccchhhhhhHHhhhhhhhhhhhhccCCCcchHHHHHHHHHHHHHHHHHH Q lcl|NC_015263. 308 V--PD--N-VGVVTSPM-EIDTV-SFDK-DSSTDDSVEKATKNFWDNAGVSQILFSSDNKTSQGIAMSIATDEQFIFGVI 379 (513) Q Consensus 308 L--p~--g-v~~v~sP~-~~d~i-~ld~-~~~~~dtv~~~~~~i~~~~GiS~~Lfn~d~~s~~~~~~SI~~d~~~~~~~~ 379 (513) + ++ | +..+..+. +...+ ++.. +..--+..+-..++|..+.||+..++|.... ......+.-.+ -+.-++ T Consensus 232 ~~g~~n~~~~~v~~~~~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVp~~~lg~~~~-~~~~~~~~~~~--~l~P~~ 308 (406) T protein:vir:95 232 YLQATEAGQPWIIPAELLEVEQVKPLSLKDIAINEAVELDKRTVAGMFGVPAFLLGIGEF-NRDEYNNFINS--TILPIA 308 (406) T ss_pred hccccccCCceeecCCCccccccccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCc-hHHHHHHHHHH--HHHHHH Confidence 5 12 2 22222221 11111 1211 1111123444558899999999988863322 22222222222 234588 Q ss_pred HHHHHHHHHHHhhcccceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccc Q lcl|NC_015263. 380 NQLERWLNRYLLLNGMSKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENEMLDLPEI 459 (513) Q Consensus 380 ~~iE~~~N~~i~~~~~~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~ 459 (513) ++||..+|+.|-.. .+..|+|.+-+....+.++.++.+.++..-|.=..--.-+.+|+.|.+ +-+.. T Consensus 309 ~~ie~~l~~~l~~~-~~~~~~fd~~~l~~~d~~~~~~~~~~l~~~G~~t~NE~R~~~gl~p~~------------~gd~~ 375 (406) T protein:vir:95 309 KGIEQELTRKLLIS-PDLYFKFNPRSLYAYDLKELAEVGSNMYVRGIMEGNEVRDWLGLSPKE------------GLSEL 375 (406) T ss_pred HHHHHHHHHhcCCC-CCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC------------Cccee Confidence 99999999977532 344688877777788899999998888776632222222234444421 12233 Q ss_pred cCcccccccccccccccCCccccCCCCcCCCCcccccccCCCCCCCCC Q lcl|NC_015263. 460 MTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETTGNKDSDETQRAKD 507 (513) Q Consensus 460 ~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~~n~~~~~~~~~~d 507 (513) ++|+. +. .-.. .++.....+| ++++.+++++ T Consensus 376 ~~~~n--~~-~~~~----~~~~~~~k~g----------~~~~~~~~~~ 406 (406) T protein:vir:95 376 VILEN--YI-PLDK----IGDQSKLKGG----------DNSGADGQTD 406 (406) T ss_pred eeccC--cc-chhh----cccccccCCC----------CCCCCCCCCC Confidence 33321 11 0000 0000001111 1111111122 No 56 >protein:vir:95542 Length: 548 # NCBI annotation: Putative portal protein # Family: family:all:47 # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293348;genbank:gi:148912769;genbank:GeneID:5228194 Probab=98.02 E-value=6.7e-06 Score=48.93 Aligned_cols=461 Identities=11% Similarity=0.062 Sum_probs=229.2 Q ss_pred eeeeehhhhhhHHHHHHHHHHHHHhhccCcccccccc-------cccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhc Q lcl|NC_015263. 9 LSMIDVESISSYSNKRNNRISILRDDNRTPVFGAPVG-------SLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQS 81 (513) Q Consensus 9 ~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~-------s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~s 81 (513) ..+|| .-|...+=-+.+++..-|-.- ..+.++... ...+.+ .+ + ..+-..||+-|++|+.++ T Consensus 1 Mn~iD-r~i~~~sP~~a~~R~~ar~~~-~~y~aa~~~r~~~~~~~~~s~~---~~-i-----~~~~~~lr~RaRdL~rNn 69 (548) T protein:vir:95 1 MNLID-RLLEPLAPELVARRLAAREAI-QAYEAARPGRTHKAKRQPLGAD---TS-L-----QKSAVSMREQCRKLDEDH 69 (548) T ss_pred CchHH-hHhhhcchHHHHHHHHhHHHh-ccccccCccccccccCCCCChH---HH-H-----HHHHHHHHHHHHHHHhcC Confidence 55566 233333323344443333211 111111110 011111 11 1 134678999999999999 Q ss_pred chHHHHHHHHhhccccc--ceEee--ccchhhhhhcchhHHHHHH-HHHHhhcC------hhHHHHHHHHHHHHhcceeE Q lcl|NC_015263. 82 QQYQRLLNFYANMPLYA--YSVVP--FKDISTANENKLKKELATV-TEFLSRLN------PKYNFSKIVKLAMTVDIFYG 150 (513) Q Consensus 82 g~~~rlidy~~~mpt~d--Y~I~P--~~~~~~~~~~~~~~~y~~v-~~~L~k~n------~k~~~~~i~~~~l~~g~~~g 150 (513) |+.++.|+.+.+.--=. --|.| .+...... ..+.+..... ..|-+.+. .-....-+++..++.|..|. T Consensus 70 ~~a~~av~~~~~nvVG~~G~~i~p~~l~~d~~~a-~~l~~~ie~~w~~Wa~~~D~~g~~~f~~lq~l~~R~~~~dGE~f~ 148 (548) T protein:vir:95 70 DLVTGLLDRLEERVVGGSGIGVEPLPLRLDGSVH-AELAMEIRSAWAEWSLSPETSGELTRPQVERLMCRTWLRDGEGLA 148 (548) T ss_pred hHHHHHHHHHHHhccCccccceeeeecCCCHHHH-HHHHHHHHHHHHHhhcCccccccCCHHHHHHHHHHHHHhCCceEE Confidence 99999999876643321 12333 22111111 1121111111 12333333 33445567788899999998 Q ss_pred EEEEcCc---------ceeeeecCcceeEEEEEECCeeEEEEEeeeccCc-chhcc-----ccHHHHHHHHHHhhhhhcc Q lcl|NC_015263. 151 YVIDDKE---------SVMIQQFPNDICKISSVSGGVYNYVIDLDALVSA-DIVDY-----YPKEIQEAVNKYTTMKKGN 215 (513) Q Consensus 151 y~i~d~~---------~~~iq~lp~dyckIsg~~nG~y~~~fD~syFd~~-~~L~~-----~p~Ei~~~y~~Y~~~k~~~ 215 (513) -.+.... +.-+|-+++|+|.-.--..|.+... =+ .||.. .-+.| -|.+.... T Consensus 149 ~~~~~~~~~~~~g~~~~~~lqliepd~l~~~~~~~~~~i~~-GI-E~D~~Grp~aY~i~~~hPgd~~~~----------- 215 (548) T protein:vir:95 149 QKLMGRVPNYTFATSVPFALELLEPDYLPFSYNNLSKGIVQ-GI-ERDTWRRKRAYHLLKDHPGNLQTL----------- 215 (548) T ss_pred EeeecccccccCCcccceEEEEechhhcCCCCCCCCCceee-ee-EECCCCceEEEEEeecCCCccccc----------- Confidence 7775432 2478999999996332222222110 12 23432 12222 23332110 Q ss_pred CcccccCeeecCCceEEEEecCcccc---chhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCcccc Q lcl|NC_015263. 216 NKSASNWYEIQDKNSICIKINESSLT---PVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTL 292 (513) Q Consensus 216 ~~~~~~W~~L~~~kt~~ik~~~~~~~---~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~v 292 (513) .....|..+|-+ .|++-++.+.+. |+|.|++++..+-++++|.+-....+.++.. +...|-...+... ..- T Consensus 216 -~~~~~~~rvpA~-~VlHif~~~r~gQ~RGvs~lapvl~~l~~l~~y~dael~~aki~A~--~a~fi~~~~~~~~--~~~ 289 (548) T protein:vir:95 216 -GGSLAVKRVEAE-RIIHIAYRKRIGQNRGVPMLHAVLIRLADLKDYEESERVAARISAA--LAMYIKKGNPDSY--TVE 289 (548) T ss_pred -ccccceeeechh-HheecccccCCccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhh--heeeeecCCCccc--cCC Confidence 111357778764 566666555443 9999999999999999999988888888763 3333321100000 000 Q ss_pred CHHHHHHHHHHHHHhccccce-EEEeccccccccccccc----ccchhhhhhHHhhhhhhhhhhhhccCC-CcchHHHHH Q lcl|NC_015263. 293 DMPMMNYFHEALSMTVPDNVG-VVTSPMEIDTVSFDKDS----STDDSVEKATKNFWDNAGVSQILFSSD-NKTSQGIAM 366 (513) Q Consensus 293 d~~~~~~~~~~ik~~Lp~gv~-~v~sP~~~d~i~ld~~~----~~~dtv~~~~~~i~~~~GiS~~Lfn~d-~~s~~~~~~ 366 (513) ... .+....-.+-.|.. ...-|= +.|+|-... .-...+......|-..+||+--++.+| +.|++++-. T Consensus 290 ~~~----~~~~~~~~~~pG~iv~~L~pG--e~i~~~~p~~p~~~~~~f~~~~lr~IAaglGipYe~ltgD~s~nYSS~R~ 363 (548) T protein:vir:95 290 PGK----DRKNRTIPIAPGMVFDDLEPG--EDVGMIESNRPNPFLEGFRNGQLRMIGAGTRSTYSSVSRAYDGTYSAQRQ 363 (548) T ss_pred CCc----ccccccccccCCccccccCCC--ceeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccchhHHHHHH Confidence 000 00000011212322 122332 356663322 223457777788888899995555444 557777776 Q ss_pred HHHHHHHH--------HHHHHHH-HHHHHHHHHhhcccc--------eEEEEEe--cCCCCccHHHHHHHHHHHHhcCCc Q lcl|NC_015263. 367 SIATDEQF--------IFGVINQ-LERWLNRYLLLNGMS--------KYFKATM--LEVTHFSKKEAHDRYITDAQYGFP 427 (513) Q Consensus 367 SI~~d~~~--------~~~~~~~-iE~~~N~~i~~~~~~--------~~f~~~~--l~~T~fn~ke~~~~~~~~~~~G~~ 427 (513) ++...-.. +-.|.+. -+.|+--.+....+. ...+..+ .+-...+-.+.++.-+....-|+. T Consensus 364 ~l~e~~r~~~~~q~~~i~~~~~Pi~~~wle~a~l~G~i~lP~~~~~~~~~~~~W~~P~~~~iDP~Kea~A~~~~i~~Gl~ 443 (548) T protein:vir:95 364 ELVEGWLGYDLLQHEFIDYWCRPVYRSWLQMYLLARKERLPADVDHRTLYAAVYQGPVMPWINPMHEANAWELLVKAGFA 443 (548) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCCCCchhheeeeeecCCccccChHHHHHHHHHHHHcCCC Confidence 65522111 2234442 344666555543331 1244444 334567777888888999999999 Q ss_pred HHHHHHHHhCCCHHHHHHHHHHHHH---hhCcccccCc-ccccccccccccccCCccccCCCCcCCC------------- Q lcl|NC_015263. 428 VKVYLASLMGIDPVAFTGLLKVENE---MLDLPEIMTP-LSSSFNTSGSDIAENAIKEKGKENGRPT------------- 490 (513) Q Consensus 428 ~~~~laa~~G~~p~~~~~~~~~E~e---~L~l~~~~~P-l~TS~T~Sg~~~~~~~~~~~~~~~grPt------------- 490 (513) +....++.+|.+|++++.+...|.+ .+||.----| +++ +-++.++.....+ +...+|.|- T Consensus 444 T~~~~~a~~G~D~~ev~~q~a~E~~~~~~~GL~~~~~~~~~~--~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 520 (548) T protein:vir:95 444 DEAEVARARGRDPRELKKSRETEIKANRAAGLVFSSDAYHQL--VKSGMDPVEAVQK-VYLGVGKMLTADEARELVNRYG 520 (548) T ss_pred CHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCCcccccc--cccccCCCCchhh-hccccccccccchhHHhhccCC Confidence 9999999999999999999999984 4444311111 111 1122222111111 111112221 Q ss_pred -------Cc-ccccccCCCCCCCCCCccCCC Q lcl|NC_015263. 491 -------NE-TTGNKDSDETQRAKDKPANTQ 513 (513) Q Consensus 491 -------~e-t~~n~~~~~~~~~~d~~~~~~ 513 (513) ++ +...+..|..++|+ |.. T Consensus 521 ~~~~~~~~~~~~~~~~~~~~~~~~----~~~ 547 (548) T protein:vir:95 521 AGLPVPGPDFPNESNNGGADGQPS----NPD 547 (548) T ss_pred CCCcCCCCCCCcccccCCCCCCCC----CCC Confidence 11 11112222222222 111 No 57 >protein:vir:8100 Length: 466 # NCBI annotation: gp4 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817681;genbank:gi:29566112;genbank:GeneID:1259306 Probab=98.02 E-value=6.9e-06 Score=48.86 Aligned_cols=423 Identities=10% Similarity=0.035 Sum_probs=185.7 Q ss_pred eeeeehh----------hhhhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHH Q lcl|NC_015263. 9 LSMIDVE----------SISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLA 78 (513) Q Consensus 9 ~~~~~~~----------~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY 78 (513) .+|+|-- ++.-+.+.++.. ++.-.- ++...+.-.+..++.-..+.---+.-..++.-.| T Consensus 1 M~~~~~l~~~~~~~~~~~~~~~~~~~~~~--~~~~~~---------~~~~~~~~~~~~~~~g~~~~~~~~~g~~v~~~~a 69 (466) T protein:vir:81 1 MRLIDRLLSTRGAAPRMSIDDYAQMLNEF--AFNGIG---------YGFGGGVPRIQQTLAGPSTELAPDTFVGLATQAY 69 (466) T ss_pred CchhHHHhhccCcccccchhhhhhhhhhh--hccccc---------cccccccHHHHHhhccccccccCccccccchhhh Confidence 1222110 000011111000 000000 0011112122233321101000001112334445 Q ss_pred hhcchHHHHHHHHhh-cccccceEeeccchhhhhhcchhHHHHHHHHHHhhcC----hhHHHHHHHHHHHHhcceeEEEE Q lcl|NC_015263. 79 VQSQQYQRLLNFYAN-MPLYAYSVVPFKDISTANENKLKKELATVTEFLSRLN----PKYNFSKIVKLAMTVDIFYGYVI 153 (513) Q Consensus 79 ~~sg~~~rlidy~~~-mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n----~k~~~~~i~~~~l~~g~~~gy~i 153 (513) ..++.+.+.|+.++. +-.+...++-..+ .. ........ ...+|.+=| -..+...++..++..|..|.+++ T Consensus 70 ~~~~~v~~~i~~Ia~~ia~lp~~~~~~~~--~~-~~~~~~~~--~~~L~~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~ 144 (466) T protein:vir:81 70 QANGPVFACMLVRQLVFSSVRFRWQRLRD--GK-PSDTFGSR--DLQILETPWKGGTTQDMLSRMIQDADLAGNSYWTIV 144 (466) T ss_pred hccHHHHHHHHHHHHhhccCceEEEEecC--Cc-eeeccccH--HHHHhhCCCCCCCHHHHHHHHHHHHHhcCCeEEEEE Confidence 567778888888753 4444445542211 11 11111111 223444433 45666778888899999999988 Q ss_pred EcCc----------ceeeeecCcceeEEEEEECCeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCe Q lcl|NC_015263. 154 DDKE----------SVMIQQFPNDICKISSVSGGVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWY 223 (513) Q Consensus 154 ~d~~----------~~~iq~lp~dyckIsg~~nG~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~ 223 (513) .+.. .+.+.++|++.|.+..-.+|...+.+.... ..... ..... T Consensus 145 r~~~g~l~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~y~~~~--~~~~~------------------------~~~~~ 198 (466) T protein:vir:81 145 DGEFVRMRPDWVDVVVEERMVRGGRGELGGGQLGWRKVGYLYTE--GGRQS------------------------GNESV 198 (466) T ss_pred ecCccccccccCcceeEEEEecCcceEEEEcCCCceEEEEEEEe--cCccc------------------------cccee Confidence 6533 478999999999999877765554443222 10000 01234 Q ss_pred eecCCceEEEEec---CccccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHH Q lcl|NC_015263. 224 EIQDKNSICIKIN---ESSLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYF 300 (513) Q Consensus 224 ~L~~~kt~~ik~~---~~~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~ 300 (513) .++.+.-+-|+.. .+...|++|...+...+--....++. ...-..|-...-.-|-+ + ..++.++++++ T Consensus 199 ~~~~~dviHir~~~~~~d~~~G~s~i~~~~~~i~~~~a~~~~--~~~~f~ng~~p~gil~~---~----~~l~~e~~~~~ 269 (466) T protein:vir:81 199 GFLAEDVVHFAPIPDPLASYRGMSWLTPILREIRADQAMSKH--QAKFFDNGATVNLVIKH---N----PMADPAAVKKW 269 (466) T ss_pred eeccccEEEEcCCCCcccccccccHHHHHHHHHHHHHHHHHH--HHHHHhcCCCcceEEec---C----CCCCHHHHHHH Confidence 4555555666642 34456888887766544333333322 22223332211111111 1 13677777777 Q ss_pred HHHHHHhcc--cc---ceEEEecccccccccccccccch---hhhhhHHhhhhhhhhhhhhccCC----CcchHHHH-HH Q lcl|NC_015263. 301 HEALSMTVP--DN---VGVVTSPMEIDTVSFDKDSSTDD---SVEKATKNFWDNAGVSQILFSSD----NKTSQGIA-MS 367 (513) Q Consensus 301 ~~~ik~~Lp--~g---v~~v~sP~~~d~i~ld~~~~~~d---tv~~~~~~i~~~~GiS~~Lfn~d----~~s~~~~~-~S 367 (513) .+.+.++.- .+ +..+-..++++.+.+. ..+.+ +.+-..++|..+.||...++|-. +.+++.+. .. T Consensus 270 ~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~--~~d~q~le~~~~~~~~Ia~~fgVPp~~lG~~~~~~~st~sn~eq~~ 347 (466) T protein:vir:81 270 ADEVNSKHAGVDNAWKNLNLYPGADADVVGSN--LQEIDFKNVRGGGETRIAAAAGVPPVIVGLSEGLAAATYSNYGQAR 347 (466) T ss_pred HHHHHHHhcCccccccceEcCCCceEEEccCC--hhHHHHHHHHHHHHHHHHHHhCCCHHHcccccCCCccccccHHHHH Confidence 777777651 11 2233333454555442 22223 34455688999999999999632 22333333 33 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcccce--EEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHH Q lcl|NC_015263. 368 IATDEQFIFGVINQLERWLNRYLLLNGMSK--YFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTG 445 (513) Q Consensus 368 I~~d~~~~~~~~~~iE~~~N~~i~~~~~~~--~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~ 445 (513) +.--..-+.-++++||..+|+.|-...-.. .|+|...+..--+.++..+...... .....+.. -|++|.|.-. T Consensus 348 ~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~llr~d~~~r~~~~~~~~----~~~~~~~~-~g~t~nE~r~ 422 (466) T protein:vir:81 348 RRLADGTAHPLWQNLSGCIGHVMPDMGPDVRLWYDADDVPFLREDEKDAADIQKVRA----ETINTLIT-AGYEPESVVA 422 (466) T ss_pred HHHHHHHHHHHHHHHHHHHHhhcCCcccCcceEEEecchhhhccCHHHHHHHHHHHH----HHHHHHHH-cCCChhhccc Confidence 222333344699999999999986533322 3444333333334444444322111 11112222 4888877763 Q ss_pred HHHHHHHhhCcccccCcccccccccccccccCCccccCCCCcCCCCcccccccCCCCCC Q lcl|NC_015263. 446 LLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETTGNKDSDETQR 504 (513) Q Consensus 446 ~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~~n~~~~~~~~ 504 (513) . | +..|+ .+.+ ++.-+ ..++....+......|+ ..++..+++++ T Consensus 423 ~---~-~~gd~--~~~~-~~~~~-----~~~~~~~~~~~~~~~~~---~~~~Gg~~ngn 466 (466) T protein:vir:81 423 A---V-NSGDL--RLLK-HTGLT-----SVQLLPPGVSASASSDT---PTSGGADDNGN 466 (466) T ss_pred c---c-cCCcc--cccc-CCCcc-----hhhhcccccccccCCCC---cccCCCCcCCC Confidence 2 2 12121 1111 11111 00111111111111111 11111122222 No 58 >protein:vir:1380 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612832;genbank:gi:20065966;genbank:GeneID:935782 Probab=98.01 E-value=7.2e-06 Score=48.78 Aligned_cols=386 Identities=11% Similarity=0.072 Sum_probs=183.9 Q ss_pred HHHHHhhccCcccccccccccchHHHHHHHhhh--ccCh---hHHHHH-----HHHHHHHHhhcchHHHHHHHHhh-ccc Q lcl|NC_015263. 28 ISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKE--YRNE---GNQKTL-----RKVSEDLAVQSQQYQRLLNFYAN-MPL 96 (513) Q Consensus 28 ~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~--~~P~---~n~~~i-----r~~s~~lY~~sg~~~rlidy~~~-mpt 96 (513) -+++.. +|+...+....+..-.. ..+- .....+ ..+..--.-..+.+.+.|+.++. +.. T Consensus 1 MG~f~~----------lf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~v~~~~al~~~~v~~ci~~ia~~iA~ 70 (422) T protein:vir:13 1 MGFLRG----------LFNKKNNNDEKRSNYDEDIGIDISDSNFWEKFGIKLNFSVRGKRALKENTVYVCTKIRAESIGK 70 (422) T ss_pred Cchhhh----------hhhccCCccchhhhhhhccccccCcchhhhhccccCCcccchhhhhccHHHHHHHHHHHHhhhh Confidence 333333 33332222111110000 0000 000000 00111111234556666666543 233 Q ss_pred ccceEeeccchhhhhhcchhHHHHHHHHHHh-hcC----hhHHHHHHHHHHHHhcceeEEEEEcCc--ceeeeecCccee Q lcl|NC_015263. 97 YAYSVVPFKDISTANENKLKKELATVTEFLS-RLN----PKYNFSKIVKLAMTVDIFYGYVIDDKE--SVMIQQFPNDIC 169 (513) Q Consensus 97 ~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~-k~n----~k~~~~~i~~~~l~~g~~~gy~i~d~~--~~~iq~lp~dyc 169 (513) +...++- .. +.... +.+...|. +=| -..+...+..+++..|..|.++.-+.. ..-+.++|++.| T Consensus 71 lp~~~~~--~~-----~~~~~--~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v 141 (422) T protein:vir:13 71 LSLKIYK--DK-----EEYKE--HELYYLLRYKPNPLMSSINFWKCLETQRTLKGNAYAYIERDRKGKIIGLYPINSDNV 141 (422) T ss_pred CceEEEe--cC-----ccccc--chHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCcce Confidence 3334431 11 11111 12333343 222 446778888889999999999987654 468899999999 Q ss_pred EEEEEECCeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEEe--cCccccchhhHH Q lcl|NC_015263. 170 KISSVSGGVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKI--NESSLTPVPPFA 247 (513) Q Consensus 170 kIsg~~nG~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~--~~~~~~~ip~f~ 247 (513) .+.--.+|.+...=...|.-. ... .+=+.+++..-+-|+. ..+...|++|.. T Consensus 142 ~~~~~~~~~~~~~~~~~y~~~-------------------------~~~-g~~~~~~~~eiih~~~~~~~~~~~G~s~~~ 195 (422) T protein:vir:13 142 TKIIDDDNFLSSLSKVWYVVT-------------------------DKN-GKEHKLLPDEMLHFIGDITLDGLIGIKPLD 195 (422) T ss_pred EEEEcCCcceeccceEEEEEE-------------------------eCC-CeEEEEcccceEEEcCCCCCCCcccccHHH Confidence 998766665432111111000 000 0112344444555553 234456888887 Q ss_pred HHHHhHHHHHHHHHHHhhHhhhhh---ceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhc---cc--cceEEEecc Q lcl|NC_015263. 248 GTFDSIYDIHSFKDLRNDKAELQN---YKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTV---PD--NVGVVTSPM 319 (513) Q Consensus 248 ~v~~d~~di~~~kdL~~~~~~i~n---~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~L---p~--gv~~v~sP~ 319 (513) .+...+--.....+.. ..-..| ...++ ++| . .++.++++++.+.++... .+ ++.++-..+ T Consensus 196 ~~~~~i~~~~~~~~~~--~~~f~ng~~p~gil-~~~-----~----~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~ 263 (422) T protein:vir:13 196 YLRCTIENGRATQEFI--NKFFKNGLSIKGIV-QYV-----G----DLDEKAKKIFKKEFESMSNGLENAHSISLLPFGY 263 (422) T ss_pred HHHHHHHHHHHHHHHH--HHHHhccCCccEEE-EeC-----C----CCCHHHHHHHHHHHHHHhcCccccCCceecCCCc Confidence 7776544333333322 111222 22222 122 1 266777777777777665 11 333443444 Q ss_pred cccccccccccccchh---hhhhHHhhhhhhhhhhhhccCCC-cchHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|NC_015263. 320 EIDTVSFDKDSSTDDS---VEKATKNFWDNAGVSQILFSSDN-KTSQGIA-MSIATDEQFIFGVINQLERWLNRYLLLNG 394 (513) Q Consensus 320 ~~d~i~ld~~~~~~dt---v~~~~~~i~~~~GiS~~Lfn~d~-~s~~~~~-~SI~~d~~~~~~~~~~iE~~~N~~i~~~~ 394 (513) ++..+.+. ..+.+. .+-..++|..+.||+..++|... .+++.+. ..+.--..-+.-++++||.++|+.|-... T Consensus 264 ~~~~l~~~--~~d~q~le~~~~~~~~Ia~~fgVpp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~l~~~Ll~~~ 341 (422) T protein:vir:13 264 QFQPISLS--MADAQFLENSKLTKRELAATFGMKSYHLNDLERATFNNLTEQQKDFYVTTLQSSLTVYEQEIQDKLFSQY 341 (422) T ss_pred eeeeccCC--hhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhCChh Confidence 44444432 233333 33445789999999998887533 2223222 22222333344689999999999874321 Q ss_pred ---cceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCcccccccccc Q lcl|NC_015263. 395 ---MSKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSG 471 (513) Q Consensus 395 ---~~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg 471 (513) .+..|+|..-+...-+.++.++.+.++..-|.=..--.-+.+|+.|.+ |-|..++|.- ...= T Consensus 342 ~~~~g~~i~fd~~~l~r~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~------------ggD~~~~~~n---~~~l 406 (422) T protein:vir:13 342 ETLQDVKAEFNVDTILRSDIKTRYEAYRIGIQGGFIEANEARRRENLPPVE------------GGDRLLVNGN---MIPI 406 (422) T ss_pred hhcCCceEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC------------CcCeeeeccC---ccch Confidence 244566666666666888899998888888843333344456776642 2233443321 1100 Q ss_pred cccccCCccccCCCCcC Q lcl|NC_015263. 472 SDIAENAIKEKGKENGR 488 (513) Q Consensus 472 ~~~~~~~~~~~~~~~gr 488 (513) ...++ .....+..||+ T Consensus 407 ~~~~~-~~~~~g~~~g~ 422 (422) T protein:vir:13 407 EMAGE-QYKKGGEKGGK 422 (422) T ss_pred hhccc-ccccCCCcCCC Confidence 00000 00000111121 No 59 >protein:vir:3420 Length: 533 # NCBI annotation: capsid component # Family: family:all:47 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040583;genbank:gi:9626247;genbank:GeneID:2703526 Probab=98.00 E-value=7.6e-06 Score=48.66 Aligned_cols=446 Identities=9% Similarity=0.026 Sum_probs=214.1 Q ss_pred CCccc-hhee----eeehhhhhhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHH Q lcl|NC_015263. 2 VKNKK-KRLS----MIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSED 76 (513) Q Consensus 2 ~~~~~-~~~~----~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~ 76 (513) .|+-- .++. +--+..+..|.- +.. ...+....-.-...+.+ .+ +. .+-+.||.-|++ T Consensus 1 ~~~p~~~~~~~~~~~~~~~~~~~y~~-~a~--------~~~~~~~~w~p~~~s~~---~~-~~-----~~~~~lr~RaRd 62 (533) T protein:vir:34 1 MKTPTIPTLLGPDGMTSLREYAGYHG-GGS--------GFGGQLRSWNPPSESVD---AA-LL-----PNFTRGNARADD 62 (533) T ss_pred CCCchhhhhhcccccchHHHHHhhhh-ccC--------CCCCcccccccCCCCHH---HH-HH-----HHHHHHHHHHHH Confidence 11110 0000 001111112200 000 00010000000111111 11 11 246789999999 Q ss_pred HHhhcchHHHHHHHHhhcccccceEeeccchhh---hhhcchhHHHHHHH-----HHHh----------hcChhHHHHHH Q lcl|NC_015263. 77 LAVQSQQYQRLLNFYANMPLYAYSVVPFKDIST---ANENKLKKELATVT-----EFLS----------RLNPKYNFSKI 138 (513) Q Consensus 77 lY~~sg~~~rlidy~~~mpt~dY~I~P~~~~~~---~~~~~~~~~y~~v~-----~~L~----------k~n~k~~~~~i 138 (513) |+.++|+.++.|+.+.+.--=. =|.|-...+. -.+.+.-+.+.+.. .+.+ +++.-....-+ T Consensus 63 l~rNn~~a~~av~~~~~nvVG~-Gi~~~~~p~~~~lg~~~~~~~~~~~~ie~~w~~w~~~~~~~~D~~g~~~f~~~q~l~ 141 (533) T protein:vir:34 63 LVRNNGYAANAIQLHQDHIVGS-FFRLSHRPSWRYLGIGEEEARAFSREVEAAWKEFAEDDCCCIDVERKRTFTMMIREG 141 (533) T ss_pred HHhcChHHHHHHHHHHHHhhCC-CceeeeccchhhcCCChhHHHHHHHHHHHHHHHhhcCccceeccccccCHHHHHHHH Confidence 9999999999999887754222 1222111100 00111111221111 1222 22334445667 Q ss_pred HHHHHHhcceeEEEEEcCc-----ceeeeecCcceeEEEE-EECCee-EEEEEeeeccCcc-hhccc-----cHHHHHHH Q lcl|NC_015263. 139 VKLAMTVDIFYGYVIDDKE-----SVMIQQFPNDICKISS-VSGGVY-NYVIDLDALVSAD-IVDYY-----PKEIQEAV 205 (513) Q Consensus 139 ~~~~l~~g~~~gy~i~d~~-----~~~iq~lp~dyckIsg-~~nG~y-~~~fD~syFd~~~-~L~~~-----p~Ei~~~y 205 (513) ++..++.|..|.-...... +.-+|-+++|+|.=.- ..+|.+ +-.+ .||+.. .+.|+ |+... T Consensus 142 ~r~~~~dGE~f~~~~~~~~~g~~~~~~lq~ie~d~l~~~~~~~~~~~i~~GI---e~d~~Gr~~aY~i~~~~~~~~~--- 215 (533) T protein:vir:34 142 VAMHAFNGELFVQATWDTSSSRLFRTQFRMVSPKRISNPNNTGDSRNCRAGV---QINDSGAALGYYVSEDGYPGWM--- 215 (533) T ss_pred HHHHHhCCceEEEeeeccCCCCccceEEEEechhhcCCCCCCCCCCceEeee---EECCCCCeEEEEEeecCCCCcc--- Confidence 8888999999997765433 4568999999986321 223332 1122 334322 12211 22110 Q ss_pred HHHhhhhhccCcccccCeeec-----CCceEEEEecCccc---cchhhHHHHHHhHHHHHHHHHHHhhHhhhhhcee--e Q lcl|NC_015263. 206 NKYTTMKKGNNKSASNWYEIQ-----DKNSICIKINESSL---TPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKL--L 275 (513) Q Consensus 206 ~~Y~~~k~~~~~~~~~W~~L~-----~~kt~~ik~~~~~~---~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~i--i 275 (513) ...|-.++ +..-|++-++...+ =|+|.|++++..+-++++|.+-....+.++..-- | T Consensus 216 -------------~~~~~~~~~~~~v~a~~VlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi 282 (533) T protein:vir:34 216 -------------PQKWTWIPRELPGGRASFIHVFEPVEDGQTRGANVFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATI 282 (533) T ss_pred -------------ccccceeeeeeccChhHeeeeccccCCCcccCCchHHHHHHHHHHHHHHHHHHHHHHHHhhhheeee Confidence 01121111 11235555555544 3999999999999999999998877777766322 2 Q ss_pred eeeecc-------cc---CCCCCccccCHHHHHHHHHHHHHhccccceEEEeccccccccccccc----ccchhhhhhHH Q lcl|NC_015263. 276 IQKLET-------RS---SNDNNDFTLDMPMMNYFHEALSMTVPDNVGVVTSPMEIDTVSFDKDS----STDDSVEKATK 341 (513) Q Consensus 276 ~~kip~-------~~---~n~~~~~~vd~~~~~~~~~~ik~~Lp~gv~~v~sP~~~d~i~ld~~~----~~~dtv~~~~~ 341 (513) .+..|- .+ ++....++-.......+++...-.|-.|....+.|= +.|+|-... +-.+.+..... T Consensus 283 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pG--e~i~~~~~~~p~~~~~~f~~~~lr 360 (533) T protein:vir:34 283 ESELDTQSAMDFILGANSQEQRERLTGWIGEIAAYYAAAPVRLGGAKVPHLMPG--DSLNLQTAQDTDNGYSVFEQSLLR 360 (533) T ss_pred ecCCCcccccccccCCCcccccccccccchhhhhccCcceeeccCceeeecCCC--CeeeecCCCCCCCCHHHHHHHHHH Confidence 333331 00 011111222222222222222222322322223332 356653322 22356777778 Q ss_pred hhhhhhhhhhhhccCC--CcchHHHHHHHHHHHHHHHH--------HH-HHHHHHHHHHHhhcccc--------e----- Q lcl|NC_015263. 342 NFWDNAGVSQILFSSD--NKTSQGIAMSIATDEQFIFG--------VI-NQLERWLNRYLLLNGMS--------K----- 397 (513) Q Consensus 342 ~i~~~~GiS~~Lfn~d--~~s~~~~~~SI~~d~~~~~~--------~~-~~iE~~~N~~i~~~~~~--------~----- 397 (513) .|-..+||+--.+.+| +.|++++-.++...-..+-. +. +..+.|+--.+....+. + T Consensus 361 ~iAaglGi~ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~ail~G~i~~p~~~~~~~~~~~~ 440 (533) T protein:vir:34 361 YIAAGLGVSYEQLSRNYAQMSYSTARASANESWAYFMGRRKFVASRQASQMFLCWLEEAIVRRVVTLPSKARFSFQEARS 440 (533) T ss_pred HHHhhcCCCHHHHhhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCCCccCCCchhhHH Confidence 8999999996666666 56888887775533322222 22 22344665544433221 0 Q ss_pred -EEEEEe--cCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCccccccccccccc Q lcl|NC_015263. 398 -YFKATM--LEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDI 474 (513) Q Consensus 398 -~f~~~~--l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~ 474 (513) .++..+ .+-...+..+.+...+....-|+..+...++..|.+|++++.+...|.+.++=.. .|+.+.-... T Consensus 441 ~~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~s~~~~~a~~G~D~~ev~~q~a~e~~~~~~~g--l~~~~~~~~~---- 514 (533) T protein:vir:34 441 AWGNCDWIGSGRMAIDGLKEVQEAVMLIEAGLSTYEKECAKRGDDYQEIFAQQVRETMERRAAG--LKPPAWAAAA---- 514 (533) T ss_pred hhhceeeccCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCCCHHHHHHHHHHHHHHHHhcC--CCCCCCCCcC---- Confidence 123333 4445677778888889999999999999999999999999999999985543111 2222211100 Q ss_pred ccCCccccCCCCcCCCCcccccccCCCCCCCCCC Q lcl|NC_015263. 475 AENAIKEKGKENGRPTNETTGNKDSDETQRAKDK 508 (513) Q Consensus 475 ~~~~~~~~~~~~grPt~et~~n~~~~~~~~~~d~ 508 (513) ..++..+..+ +..++...+ T Consensus 515 --------~~s~~~~~~~-------~~~~~~~~~ 533 (533) T protein:vir:34 515 --------FESGLRQSTE-------EEKSDSRAA 533 (533) T ss_pred --------ccCCCCCCCC-------CCcccCCCC Confidence 0011111100 001111111 No 60 >protein:vir:79984 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430000;genbank:gi:156604055;genbank:GeneID:5525444 Probab=97.97 E-value=8.8e-06 Score=48.31 Aligned_cols=401 Identities=13% Similarity=0.046 Sum_probs=174.2 Q ss_pred CCCccc-hheeeeehhhhhhHHHHHHHHHHHHHhhccCccccccc------ccccchHHHHHHHhhhccChhHHHHHHHH Q lcl|NC_015263. 1 MVKNKK-KRLSMIDVESISSYSNKRNNRISILRDDNRTPVFGAPV------GSLTSSQSKVRKIVKEYRNEGNQKTLRKV 73 (513) Q Consensus 1 ~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~------~s~~~s~d~~k~~i~~~~P~~n~~~ir~~ 73 (513) =.|+|| -|-.|+-.-- |. +.-.|++..++...+.. |+..+ +. +.....+|+ T Consensus 12 ~~~~~~~~~~~~~~~~l------f~---~~e~R~~~~~~~~~~~~~~~~~~~~~~~--------~~---~~~~~~al~-- 69 (441) T protein:vir:79 12 DFKSRKQSRKELVVVGI------FY---KNEKRDLQYNEDDLQMMVQTLPGFQGTK--------LR---QYKDIEAIR-- 69 (441) T ss_pred cccccccchhhhhcccc------cc---ccccccccCCCcchHHHHHHhcccCccc--------cc---ccchhhhhc-- Confidence 112211 1222221100 00 00112222222110000 00000 00 111122222 Q ss_pred HHHHHhhcchHHHHHHHH----hhcccccceEeeccchhhhhhcchhHHHHHHHHHHh-hcC----hhHHHHHHHHHHHH Q lcl|NC_015263. 74 SEDLAVQSQQYQRLLNFY----ANMPLYAYSVVPFKDISTANENKLKKELATVTEFLS-RLN----PKYNFSKIVKLAMT 144 (513) Q Consensus 74 s~~lY~~sg~~~rlidy~----~~mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~-k~n----~k~~~~~i~~~~l~ 144 (513) .+.+.+.|+.+ ++|| ..++ .+ +..... +.+...|. +=| --.+...+...++. T Consensus 70 -------~~~V~~cv~~Ia~~iA~lp---~~~~--~~------~~~~~~-~~~~~lL~~~PN~~~t~~~f~~~~~~~lll 130 (441) T protein:vir:79 70 -------HSDIFTAVMMIASDLARMP---IRVT--VN------GQINYS-DRIVNLLNTRPNPMYNGYIFKLVVFVSALL 130 (441) T ss_pred -------cHHHHHHHHHHHHhhccCc---eeee--cC------cccccc-chHHHHHhcccCcCCCHHHHHHHHHHHHhh Confidence 22233344443 4444 3332 11 111111 12333343 222 34566788888899 Q ss_pred hcceeEEEEEcCcc--eeeeecCcceeEEEEEECCeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccC Q lcl|NC_015263. 145 VDIFYGYVIDDKES--VMIQQFPNDICKISSVSGGVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNW 222 (513) Q Consensus 145 ~g~~~gy~i~d~~~--~~iq~lp~dyckIsg~~nG~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W 222 (513) .|..|.+++-+.++ +-+.++|++.|.|.--.+|.+.|.+-. ++-. .. ..- T Consensus 131 ~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~d~~g~~~~~~~~--~~~~---------------~~-----------~~~ 182 (441) T protein:vir:79 131 TSHGYIEITRDKTGEPMNLTFRKTSEIELKSDARGRLYYFHQR--IDSN---------------GN-----------NIE 182 (441) T ss_pred cCCeEEEEEECCCCcEEEEEEEcCceeEEEECCCccEEEEEEE--eccC---------------Cc-----------eeE Confidence 99999998876554 678999999999875455655443211 0000 00 011 Q ss_pred eeecCCceEEEEec-CccccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhc---eeeeeeeccccCCCCCccccCHHHHH Q lcl|NC_015263. 223 YEIQDKNSICIKIN-ESSLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNY---KLLIQKLETRSSNDNNDFTLDMPMMN 298 (513) Q Consensus 223 ~~L~~~kt~~ik~~-~~~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~---~ii~~kip~~~~n~~~~~~vd~~~~~ 298 (513) ..++...-+-|+.. .+...|++|...+...+--....++... .-..|- ..++ ++| +. ..+.++.+ T Consensus 183 ~~~~~~dvih~k~~~~dg~~G~spl~~~~~~i~~~~~~~~~~~--~~f~ng~~p~gil-~~~----~~----~~~~e~~e 251 (441) T protein:vir:79 183 RNVKFEDMLDIKFYSLDGINGLSLLDTLSRTIESDNNGKDFLN--NFLRNGTHAGGIL-KMK----GV----LDNKKARD 251 (441) T ss_pred EEEccccEEEeccCCCCCccccCHHHHHHHHHHHHHHHHHHHH--HHHhccCCCcEEE-EcC----CC----CCCHHHHH Confidence 23444444555542 3345688887666654433333333221 112221 2222 233 11 11344444 Q ss_pred HHHHHHHHhc--cc---cceEEEecccccccccccc-cccchhhhhhHHhhhhhhhhhhhhccCCCcchHHHHHHHHHHH Q lcl|NC_015263. 299 YFHEALSMTV--PD---NVGVVTSPMEIDTVSFDKD-SSTDDSVEKATKNFWDNAGVSQILFSSDNKTSQGIAMSIATDE 372 (513) Q Consensus 299 ~~~~~ik~~L--p~---gv~~v~sP~~~d~i~ld~~-~~~~dtv~~~~~~i~~~~GiS~~Lfn~d~~s~~~~~~SI~~d~ 372 (513) ++.+.+.++. ++ ++..+-..+++..+.+... ..--.+.+-..++|..+.||...++|.++.+++.....+.- . T Consensus 252 ~~r~~~~~~~~G~~nag~~~vl~~G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~s~~q~~~~~-~ 330 (441) T protein:vir:79 252 RAREEFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANMSITDANLDY-L 330 (441) T ss_pred HHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCCCccHHHHHHHH-H Confidence 5555555554 11 2333333445555544321 11112334455789999999999998766555433333321 1 Q ss_pred HHHHHHHHHHHHHHHHHHhhcccceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHH Q lcl|NC_015263. 373 QFIFGVINQLERWLNRYLLLNGMSKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENE 452 (513) Q Consensus 373 ~~~~~~~~~iE~~~N~~i~~~~~~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e 452 (513) .-+.-++.+||..+|+.|-....+..|+|..-.+...+.++.++.+.++..-|.=..--.-+.+|+.|.+ T Consensus 331 ~tl~P~~~~ie~eln~kl~~~~~~~~~~fd~~~llr~D~~~~~~~~~~~i~~G~~T~NE~R~~~gl~Pi~---------- 400 (441) T protein:vir:79 331 STLKPYITCVCAELNFKFNDEYVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIP---------- 400 (441) T ss_pred HHHHHHHHHHHHHHhhhccccccCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC---------- Confidence 2344689999999999886554456677776666667788888888877776632222222234444432 Q ss_pred hhCcccccCcccccccccccccccCCccccCCCCcCCCCccccc-ccCCCCC Q lcl|NC_015263. 453 MLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETTGN-KDSDETQ 503 (513) Q Consensus 453 ~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~~n-~~~~~~~ 503 (513) |=+ ......+.+...-. ..+. .+..++. ++..+ +..++++ T Consensus 401 --ggd-~~~~~~~~n~~~~~----~~~~---~~~~~~~-~~~~~~kgGe~~e 441 (441) T protein:vir:79 401 --GGN-GSIHRVDLNHVNIE----LVDE---YQMNKSR-ATDKKLKGGEENE 441 (441) T ss_pred --CCC-cceEeecccccccc----cccc---ccccccc-ccccccCCCCCCC Confidence 111 11111111111100 0000 0000010 00111 1111111 No 61 >protein:vir:9408 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803386;genbank:gi:29028698;genbank:GeneID:1258164 Probab=97.97 E-value=8.8e-06 Score=48.31 Aligned_cols=401 Identities=13% Similarity=0.046 Sum_probs=174.2 Q ss_pred CCCccc-hheeeeehhhhhhHHHHHHHHHHHHHhhccCccccccc------ccccchHHHHHHHhhhccChhHHHHHHHH Q lcl|NC_015263. 1 MVKNKK-KRLSMIDVESISSYSNKRNNRISILRDDNRTPVFGAPV------GSLTSSQSKVRKIVKEYRNEGNQKTLRKV 73 (513) Q Consensus 1 ~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~------~s~~~s~d~~k~~i~~~~P~~n~~~ir~~ 73 (513) =.|+|| -|-.|+-.-- |. +.-.|++..++...+.. |+..+ +. +.....+|+ T Consensus 12 ~~~~~~~~~~~~~~~~l------f~---~~e~R~~~~~~~~~~~~~~~~~~~~~~~--------~~---~~~~~~al~-- 69 (441) T protein:vir:94 12 DFKSRKQSRKELVVVGI------FY---KNEKRDLQYNEDDLQMMVQTLPGFQGTK--------LR---QYKDIEAIR-- 69 (441) T ss_pred cccccccchhhhhcccc------cc---ccccccccCCCcchHHHHHHhcccCccc--------cc---ccchhhhhc-- Confidence 112211 1222221100 00 00112222222110000 00000 00 111122222 Q ss_pred HHHHHhhcchHHHHHHHH----hhcccccceEeeccchhhhhhcchhHHHHHHHHHHh-hcC----hhHHHHHHHHHHHH Q lcl|NC_015263. 74 SEDLAVQSQQYQRLLNFY----ANMPLYAYSVVPFKDISTANENKLKKELATVTEFLS-RLN----PKYNFSKIVKLAMT 144 (513) Q Consensus 74 s~~lY~~sg~~~rlidy~----~~mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~-k~n----~k~~~~~i~~~~l~ 144 (513) .+.+.+.|+.+ ++|| ..++ .+ +..... +.+...|. +=| --.+...+...++. T Consensus 70 -------~~~V~~cv~~Ia~~iA~lp---~~~~--~~------~~~~~~-~~~~~lL~~~PN~~~t~~~f~~~~~~~lll 130 (441) T protein:vir:94 70 -------HSDIFTAVMMIASDLARMP---IRVT--VN------GQINYS-DRIVNLLNTRPNPMYNGYIFKLVVFVSALL 130 (441) T ss_pred -------cHHHHHHHHHHHHhhccCc---eeee--cC------cccccc-chHHHHHhcccCcCCCHHHHHHHHHHHHhh Confidence 22233344443 4444 3332 11 111111 12333343 222 34566788888899 Q ss_pred hcceeEEEEEcCcc--eeeeecCcceeEEEEEECCeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccC Q lcl|NC_015263. 145 VDIFYGYVIDDKES--VMIQQFPNDICKISSVSGGVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNW 222 (513) Q Consensus 145 ~g~~~gy~i~d~~~--~~iq~lp~dyckIsg~~nG~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W 222 (513) .|..|.+++-+.++ +-+.++|++.|.|.--.+|.+.|.+-. ++-. .. ..- T Consensus 131 ~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~d~~g~~~~~~~~--~~~~---------------~~-----------~~~ 182 (441) T protein:vir:94 131 TSHGYIEITRDKTGEPMNLTFRKTSEIELKSDARGRLYYFHQR--IDSN---------------GN-----------NIE 182 (441) T ss_pred cCCeEEEEEECCCCcEEEEEEEcCceeEEEECCCccEEEEEEE--eccC---------------Cc-----------eeE Confidence 99999998876554 678999999999875455655443211 0000 00 011 Q ss_pred eeecCCceEEEEec-CccccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhc---eeeeeeeccccCCCCCccccCHHHHH Q lcl|NC_015263. 223 YEIQDKNSICIKIN-ESSLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNY---KLLIQKLETRSSNDNNDFTLDMPMMN 298 (513) Q Consensus 223 ~~L~~~kt~~ik~~-~~~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~---~ii~~kip~~~~n~~~~~~vd~~~~~ 298 (513) ..++...-+-|+.. .+...|++|...+...+--....++... .-..|- ..++ ++| +. ..+.++.+ T Consensus 183 ~~~~~~dvih~k~~~~dg~~G~spl~~~~~~i~~~~~~~~~~~--~~f~ng~~p~gil-~~~----~~----~~~~e~~e 251 (441) T protein:vir:94 183 RNVKFEDMLDIKFYSLDGINGLSLLDTLSRTIESDNNGKDFLN--NFLRNGTHAGGIL-KMK----GV----LDNKKARD 251 (441) T ss_pred EEEccccEEEeccCCCCCccccCHHHHHHHHHHHHHHHHHHHH--HHHhccCCCcEEE-EcC----CC----CCCHHHHH Confidence 23444444555542 3345688887666654433333333221 112221 2222 233 11 11344444 Q ss_pred HHHHHHHHhc--cc---cceEEEecccccccccccc-cccchhhhhhHHhhhhhhhhhhhhccCCCcchHHHHHHHHHHH Q lcl|NC_015263. 299 YFHEALSMTV--PD---NVGVVTSPMEIDTVSFDKD-SSTDDSVEKATKNFWDNAGVSQILFSSDNKTSQGIAMSIATDE 372 (513) Q Consensus 299 ~~~~~ik~~L--p~---gv~~v~sP~~~d~i~ld~~-~~~~dtv~~~~~~i~~~~GiS~~Lfn~d~~s~~~~~~SI~~d~ 372 (513) ++.+.+.++. ++ ++..+-..+++..+.+... ..--.+.+-..++|..+.||...++|.++.+++.....+.- . T Consensus 252 ~~r~~~~~~~~G~~nag~~~vl~~G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~s~~q~~~~~-~ 330 (441) T protein:vir:94 252 RAREEFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANMSITDANLDY-L 330 (441) T ss_pred HHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCCCccHHHHHHHH-H Confidence 5555555554 11 2333333445555544321 11112334455789999999999998766555433333321 1 Q ss_pred HHHHHHHHHHHHHHHHHHhhcccceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHH Q lcl|NC_015263. 373 QFIFGVINQLERWLNRYLLLNGMSKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENE 452 (513) Q Consensus 373 ~~~~~~~~~iE~~~N~~i~~~~~~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e 452 (513) .-+.-++.+||..+|+.|-....+..|+|..-.+...+.++.++.+.++..-|.=..--.-+.+|+.|.+ T Consensus 331 ~tl~P~~~~ie~eln~kl~~~~~~~~~~fd~~~llr~D~~~~~~~~~~~i~~G~~T~NE~R~~~gl~Pi~---------- 400 (441) T protein:vir:94 331 STLKPYITCVCAELNFKFNDEYVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIP---------- 400 (441) T ss_pred HHHHHHHHHHHHHHhhhccccccCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC---------- Confidence 2344689999999999886554456677776666667788888888877776632222222234444432 Q ss_pred hhCcccccCcccccccccccccccCCccccCCCCcCCCCccccc-ccCCCCC Q lcl|NC_015263. 453 MLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETTGN-KDSDETQ 503 (513) Q Consensus 453 ~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~~n-~~~~~~~ 503 (513) |=+ ......+.+...-. ..+. .+..++. ++..+ +..++++ T Consensus 401 --ggd-~~~~~~~~n~~~~~----~~~~---~~~~~~~-~~~~~~kgGe~~e 441 (441) T protein:vir:94 401 --GGN-GSIHRVDLNHVNIE----LVDE---YQMNKSR-ATDKKLKGGEENE 441 (441) T ss_pred --CCC-cceEeecccccccc----cccc---ccccccc-ccccccCCCCCCC Confidence 111 11111111111100 0000 0000010 00111 1111111 No 62 >protein:vir:3989 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116497;genbank:gi:14251130;genbank:GeneID:921299 Probab=97.96 E-value=8.9e-06 Score=48.26 Aligned_cols=371 Identities=10% Similarity=0.032 Sum_probs=172.1 Q ss_pred HHHHHHHhhc---cCccccc--ccccccchHHHHHHHhhhcc-ChhHHHHHHHHHHHHHhhcchHHHHHHHHhh-ccccc Q lcl|NC_015263. 26 NRISILRDDN---RTPVFGA--PVGSLTSSQSKVRKIVKEYR-NEGNQKTLRKVSEDLAVQSQQYQRLLNFYAN-MPLYA 98 (513) Q Consensus 26 ~~~~i~~~~~---~~~~~~s--~~~s~~~s~d~~k~~i~~~~-P~~n~~~ir~~s~~lY~~sg~~~rlidy~~~-mpt~d 98 (513) .=-.+.+=+. ..|...+ ..|... .. ..+..++ ...+ ..++..-.-..+.+.+.|+.++. +..+. T Consensus 1 m~m~~f~~~~~~~~~~~~~~~~~~~~~~-~~----~~~~~~~~~~~~----~~v~~~~al~~~~v~~~i~~ia~~ia~lp 71 (392) T protein:vir:39 1 MILPILNFINQTNDPPEVGSVQSYFPDG-ND----AQIMESLLGDNN----EWVSARAALRNSDLFSIILQLSSDLAIVK 71 (392) T ss_pred CcchhhhhhhcccccccccccccccccC-ch----hhhhhhhcCCCC----ceechHHhhccHHHHHHHHHHHHhhccCc Confidence 1111111111 1111000 000000 00 0011111 0000 01122222345777788887765 33443 Q ss_pred ceEeeccchhhhhhcchhHHHHHHHHHHhhcC----hhHHHHHHHHHHHHhcceeEEEEEcCcc--eeeeecCcceeEEE Q lcl|NC_015263. 99 YSVVPFKDISTANENKLKKELATVTEFLSRLN----PKYNFSKIVKLAMTVDIFYGYVIDDKES--VMIQQFPNDICKIS 172 (513) Q Consensus 99 Y~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n----~k~~~~~i~~~~l~~g~~~gy~i~d~~~--~~iq~lp~dyckIs 172 (513) ..+. .+. ....+++=| .-.+...++..++..|..|.++.-+.++ +.+.++|+++|.+. T Consensus 72 ~~~~---~~~-------------~~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~~~ 135 (392) T protein:vir:39 72 INAE---KKK-------------NQGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTY 135 (392) T ss_pred eeec---cch-------------hhhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEEcCceeEEE Confidence 3332 110 012343433 3566677788999999999999876554 68999999999988 Q ss_pred EEEC-CeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEEecC-c-cccchhhHHHH Q lcl|NC_015263. 173 SVSG-GVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKINE-S-SLTPVPPFAGT 249 (513) Q Consensus 173 g~~n-G~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~~~-~-~~~~ip~f~~v 249 (513) --.+ |.+.|.+...-- .. ..-+.++.+.-+-|+... + ...|+||...+ T Consensus 136 ~~~~~~~~~y~~~~~~~-~~----------------------------~~~~~~~~~eiih~~~~~~~~~~~G~s~i~~~ 186 (392) T protein:vir:39 136 YFEYENGMYYNITFDDP-KI----------------------------EPILQAPQSDLIHMKLLSIDGGKTGISPLYSL 186 (392) T ss_pred EcCCCceEEEEEEecCc-cc----------------------------ceeEEEccccEEEecCCCCCCccccccHHHHH Confidence 5454 343333322110 00 011233443344455422 2 24689998887 Q ss_pred HHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhc-cccceEEEeccccccccccc Q lcl|NC_015263. 250 FDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTV-PDNVGVVTSPMEIDTVSFDK 328 (513) Q Consensus 250 ~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~L-p~gv~~v~sP~~~d~i~ld~ 328 (513) ...+--....++... .-..|-...-.-|-+ +++...+.+++.++.+....+- ..++..+-..+++..+.+.. T Consensus 187 ~~~i~~~~~~~~~~~--~~f~ng~~p~gil~~-----~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~ 259 (392) T protein:vir:39 187 RRESKIQRASDRLTI--SSLNSSLNVPGVLTV-----KGGGLLSDKDKASRSRSFMKRSRSGGPVVLDDLEEFTALEIKS 259 (392) T ss_pred HHHHHHHHHHHHHHH--HHHhccCCCceEEEe-----CCCCCchHHHHHHHHHHHhccccCCCeeecCCCceEEEccCCh Confidence 776644444444322 223332222122222 1122333444444444444332 12333333444555554432 Q ss_pred c-cccchhhhhhHHhhhhhhhhhhhhccCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccceEEEEEecCCC Q lcl|NC_015263. 329 D-SSTDDSVEKATKNFWDNAGVSQILFSSDNKTSQGIAMSIATDEQFIFGVINQLERWLNRYLLLNGMSKYFKATMLEVT 407 (513) Q Consensus 329 ~-~~~~dtv~~~~~~i~~~~GiS~~Lfn~d~~s~~~~~~SI~~d~~~~~~~~~~iE~~~N~~i~~~~~~~~f~~~~l~~T 407 (513) . ..--++.+-..++|..+.||+..++|+...+++.......--..-+.-++++||..+|++|... +++..-... T Consensus 260 ~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~~~~~~~~f~~~~l~P~~~~ie~~l~~~L~~~-----~~~d~~~~~ 334 (392) T protein:vir:39 260 NVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQSSIQQISGMYASALNRYLRPAISELEYKLSDH-----ISVNMRPAI 334 (392) T ss_pred hHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhcccc-----ccccchhhh Confidence 1 1212445555688999999999999765444433333322223333468899999999987532 222222223 Q ss_pred CccHHHHHHHHHHHHhcC-CcHHHHHH--HHhCCCHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccCC Q lcl|NC_015263. 408 HFSKKEAHDRYITDAQYG-FPVKVYLA--SLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGK 484 (513) Q Consensus 408 ~fn~ke~~~~~~~~~~~G-~~~~~~la--a~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~ 484 (513) ....+++.+.+.++..-| +.+-...+ .-.|+.|.++- ..| .++|++ |+ T Consensus 335 ~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~g~~p~e~r---~~e--------~l~~~~------~G------------ 385 (392) T protein:vir:39 335 DPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPKDLP---APE--------NTNKKT------TG------------ 385 (392) T ss_pred ccCHHHHHHHHHHHHhCCCcCHHHHHHHHHhcCCCccccc---hhc--------CCCCCC------CC------------ Confidence 345566666777777666 33322222 12366665432 111 244433 22 Q ss_pred CCcCCCC Q lcl|NC_015263. 485 ENGRPTN 491 (513) Q Consensus 485 ~~grPt~ 491 (513) ++..|-+ T Consensus 386 d~~~p~p 392 (392) T protein:vir:39 386 QSNEPVP 392 (392) T ss_pred CCCCCCC Confidence 2222333 No 63 >protein:vir:1023 Length: 392 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076677;genbank:gi:13095786;genbank:GeneID:920364 Probab=97.96 E-value=8.9e-06 Score=48.26 Aligned_cols=371 Identities=10% Similarity=0.032 Sum_probs=172.1 Q ss_pred HHHHHHHhhc---cCccccc--ccccccchHHHHHHHhhhcc-ChhHHHHHHHHHHHHHhhcchHHHHHHHHhh-ccccc Q lcl|NC_015263. 26 NRISILRDDN---RTPVFGA--PVGSLTSSQSKVRKIVKEYR-NEGNQKTLRKVSEDLAVQSQQYQRLLNFYAN-MPLYA 98 (513) Q Consensus 26 ~~~~i~~~~~---~~~~~~s--~~~s~~~s~d~~k~~i~~~~-P~~n~~~ir~~s~~lY~~sg~~~rlidy~~~-mpt~d 98 (513) .=-.+.+=+. ..|...+ ..|... .. ..+..++ ...+ ..++..-.-..+.+.+.|+.++. +..+. T Consensus 1 m~m~~f~~~~~~~~~~~~~~~~~~~~~~-~~----~~~~~~~~~~~~----~~v~~~~al~~~~v~~~i~~ia~~ia~lp 71 (392) T protein:vir:10 1 MILPILNFINQTNDPPEVGSVQSYFPDG-ND----AQIMESLLGDNN----EWVSARAALRNSDLFSIILQLSSDLAIVK 71 (392) T ss_pred CcchhhhhhhcccccccccccccccccC-ch----hhhhhhhcCCCC----ceechHHhhccHHHHHHHHHHHHhhccCc Confidence 1111111111 1111000 000000 00 0011111 0000 01122222345777788887765 33443 Q ss_pred ceEeeccchhhhhhcchhHHHHHHHHHHhhcC----hhHHHHHHHHHHHHhcceeEEEEEcCcc--eeeeecCcceeEEE Q lcl|NC_015263. 99 YSVVPFKDISTANENKLKKELATVTEFLSRLN----PKYNFSKIVKLAMTVDIFYGYVIDDKES--VMIQQFPNDICKIS 172 (513) Q Consensus 99 Y~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n----~k~~~~~i~~~~l~~g~~~gy~i~d~~~--~~iq~lp~dyckIs 172 (513) ..+. .+. ....+++=| .-.+...++..++..|..|.++.-+.++ +.+.++|+++|.+. T Consensus 72 ~~~~---~~~-------------~~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~~~ 135 (392) T protein:vir:10 72 INAE---KKK-------------NQGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTY 135 (392) T ss_pred eeec---cch-------------hhhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEEcCceeEEE Confidence 3332 110 012343433 3566677788999999999999876554 68999999999988 Q ss_pred EEEC-CeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEEecC-c-cccchhhHHHH Q lcl|NC_015263. 173 SVSG-GVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKINE-S-SLTPVPPFAGT 249 (513) Q Consensus 173 g~~n-G~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~~~-~-~~~~ip~f~~v 249 (513) --.+ |.+.|.+...-- .. ..-+.++.+.-+-|+... + ...|+||...+ T Consensus 136 ~~~~~~~~~y~~~~~~~-~~----------------------------~~~~~~~~~eiih~~~~~~~~~~~G~s~i~~~ 186 (392) T protein:vir:10 136 YFEYENGMYYNITFDDP-KI----------------------------EPILQAPQSDLIHMKLLSIDGGKTGISPLYSL 186 (392) T ss_pred EcCCCceEEEEEEecCc-cc----------------------------ceeEEEccccEEEecCCCCCCccccccHHHHH Confidence 5454 343333322110 00 011233443344455422 2 24689998887 Q ss_pred HHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhc-cccceEEEeccccccccccc Q lcl|NC_015263. 250 FDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTV-PDNVGVVTSPMEIDTVSFDK 328 (513) Q Consensus 250 ~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~L-p~gv~~v~sP~~~d~i~ld~ 328 (513) ...+--....++... .-..|-...-.-|-+ +++...+.+++.++.+....+- ..++..+-..+++..+.+.. T Consensus 187 ~~~i~~~~~~~~~~~--~~f~ng~~p~gil~~-----~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~ 259 (392) T protein:vir:10 187 RRESKIQRASDRLTI--SSLNSSLNVPGVLTV-----KGGGLLSDKDKASRSRSFMKRSRSGGPVVLDDLEEFTALEIKS 259 (392) T ss_pred HHHHHHHHHHHHHHH--HHHhccCCCceEEEe-----CCCCCchHHHHHHHHHHHhccccCCCeeecCCCceEEEccCCh Confidence 776644444444322 223332222122222 1122333444444444444332 12333333444555554432 Q ss_pred c-cccchhhhhhHHhhhhhhhhhhhhccCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccceEEEEEecCCC Q lcl|NC_015263. 329 D-SSTDDSVEKATKNFWDNAGVSQILFSSDNKTSQGIAMSIATDEQFIFGVINQLERWLNRYLLLNGMSKYFKATMLEVT 407 (513) Q Consensus 329 ~-~~~~dtv~~~~~~i~~~~GiS~~Lfn~d~~s~~~~~~SI~~d~~~~~~~~~~iE~~~N~~i~~~~~~~~f~~~~l~~T 407 (513) . ..--++.+-..++|..+.||+..++|+...+++.......--..-+.-++++||..+|++|... +++..-... T Consensus 260 ~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~~~~~~~~f~~~~l~P~~~~ie~~l~~~L~~~-----~~~d~~~~~ 334 (392) T protein:vir:10 260 NVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQSSIQQISGMYASALNRYLRPAISELEYKLSDH-----ISVNMRPAI 334 (392) T ss_pred hHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhcccc-----ccccchhhh Confidence 1 1212445555688999999999999765444433333322223333468899999999987532 222222223 Q ss_pred CccHHHHHHHHHHHHhcC-CcHHHHHH--HHhCCCHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccCC Q lcl|NC_015263. 408 HFSKKEAHDRYITDAQYG-FPVKVYLA--SLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGK 484 (513) Q Consensus 408 ~fn~ke~~~~~~~~~~~G-~~~~~~la--a~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~ 484 (513) ....+++.+.+.++..-| +.+-...+ .-.|+.|.++- ..| .++|++ |+ T Consensus 335 ~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~g~~p~e~r---~~e--------~l~~~~------~G------------ 385 (392) T protein:vir:10 335 DPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPKDLP---APE--------NTNKKT------TG------------ 385 (392) T ss_pred ccCHHHHHHHHHHHHhCCCcCHHHHHHHHHhcCCCccccc---hhc--------CCCCCC------CC------------ Confidence 345566666777777666 33322222 12366665432 111 244433 22 Q ss_pred CCcCCCC Q lcl|NC_015263. 485 ENGRPTN 491 (513) Q Consensus 485 ~~grPt~ 491 (513) ++..|-+ T Consensus 386 d~~~p~p 392 (392) T protein:vir:10 386 QSNEPVP 392 (392) T ss_pred CCCCCCC Confidence 2222333 No 64 >protein:vir:6240 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813694;swissprot:trembl:q859c3;genbank:gi:29366754;interpro:IPR006427;interpro:IPR006944;uniprot:Q859C3;genbank:GeneID:1258894 Probab=97.96 E-value=9.1e-06 Score=48.22 Aligned_cols=423 Identities=14% Similarity=0.074 Sum_probs=182.2 Q ss_pred HHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhcc-ChhHHHHHHHHHHHHHhhcchHHHHHHHHh-hcccccce Q lcl|NC_015263. 23 KRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYR-NEGNQKTLRKVSEDLAVQSQQYQRLLNFYA-NMPLYAYS 100 (513) Q Consensus 23 ~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~-P~~n~~~ir~~s~~lY~~sg~~~rlidy~~-~mpt~dY~ 100 (513) |.=+..|..| ...|...+......... ...+-... +- ..-..++.--+-+.+.+.+.|++++ ++..+... T Consensus 1 Mg~~~~l~~~--~~~~~~~~~~~~~~~~~---~~~~~~~~~~~---~~g~~v~~~~al~~~~v~~~i~~ia~~iA~lp~~ 72 (457) T protein:vir:62 1 MGFWSALFGR--GHSPALDAAEGRAWEPY---DPSIYNLGATA---SSGERVTPHDALQVSAVFASVRLLSETIATLPLS 72 (457) T ss_pred Cchhhhhhcc--ccccccccccccccccc---hhhhhhccccc---cCCceechHHhhccHHHHHHHHHHHHhHhhCceE Confidence 1111111110 00000000000000000 00000000 00 0000011111122345555566553 23333334 Q ss_pred EeeccchhhhhhcchhHHHHHHHHHHhh----cChhHHHHHHHHHHHHhcceeEEEEEcC-cceeeeecCcceeEEEEEE Q lcl|NC_015263. 101 VVPFKDISTANENKLKKELATVTEFLSR----LNPKYNFSKIVKLAMTVDIFYGYVIDDK-ESVMIQQFPNDICKISSVS 175 (513) Q Consensus 101 I~P~~~~~~~~~~~~~~~y~~v~~~L~k----~n~k~~~~~i~~~~l~~g~~~gy~i~d~-~~~~iq~lp~dyckIsg~~ 175 (513) ++--. .. ..+ ..+..-...+|+. +....++..++..++..|..|.+++.+. ....+.+|+++.|.|.-.. T Consensus 73 ~~~~~--~~-~~~--~~~~~~~~~ll~~pn~~~t~~~f~~~~~~~l~l~Gna~~~i~~~~g~~~~l~~l~p~~v~v~~~~ 147 (457) T protein:vir:62 73 TYSKR--GG-TRK--EIDTPEWLDFPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWAGPNIAGLDVLDPTKIHVHMVM 147 (457) T ss_pred EEEec--CC-ccc--cccchHHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEeCCCcEEEEEEEcCcceEEEEec Confidence 43111 10 001 1122223333444 4467788888888999999999987754 3457889999999987553 Q ss_pred CCeeE-E---EEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEEec-Cc-cccchhhHHHH Q lcl|NC_015263. 176 GGVYN-Y---VIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKIN-ES-SLTPVPPFAGT 249 (513) Q Consensus 176 nG~y~-~---~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~~-~~-~~~~ip~f~~v 249 (513) ++... . .+.+.. ..... .....+++.-+.|+.. .+ ...|+||...+ T Consensus 148 ~~~~~~~~~~~y~~~~-~g~~~---------------------------~~~~~~~~eiih~r~~~~~~~~~G~sp~~~~ 199 (457) T protein:vir:62 148 VDGLRRKVFEAYDIDA-DGNEV---------------------------LLGWFTPRDVLHIPGMMLPGDFVGCSPISYA 199 (457) T ss_pred cCCccceeEEEEEEcc-CCcee---------------------------EEEeeCccceEEecCCCCCCceecccHHHHH Confidence 33211 1 111111 00000 1112233334445432 22 24688888776 Q ss_pred HHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhcc--c---cceEEEeccccccc Q lcl|NC_015263. 250 FDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTVP--D---NVGVVTSPMEIDTV 324 (513) Q Consensus 250 ~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~Lp--~---gv~~v~sP~~~d~i 324 (513) ...+--....++.... -..|-...-.-|-+ ++ .++.++++++.+.+++..- + ++..+-..+++..+ T Consensus 200 ~~~i~~~~~~~~~~~~--~f~ng~~p~gil~~-----~~--~ls~e~~~~~~~~~~~~~~G~~nag~~~vl~~g~~~~~l 270 (457) T protein:vir:62 200 RESIGLALAAQKYGAH--FFRNGAMPGAVVEV-----PG--TMSEEGLARAREAWRAANSGVDNAHRVALLTEGAKFSKV 270 (457) T ss_pred HHHHHHHHHHHHHHHH--HHhccCCcceEEEc-----CC--CCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEEc Confidence 6555444444433211 12221111111111 11 3677788888888877761 1 23344444555555 Q ss_pred ccccc-cccchhhhhhHHhhhhhhhhhhhhccCCCcc----hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc--cce Q lcl|NC_015263. 325 SFDKD-SSTDDSVEKATKNFWDNAGVSQILFSSDNKT----SQGIAMSIATDEQFIFGVINQLERWLNRYLLLNG--MSK 397 (513) Q Consensus 325 ~ld~~-~~~~dtv~~~~~~i~~~~GiS~~Lfn~d~~s----~~~~~~SI~~d~~~~~~~~~~iE~~~N~~i~~~~--~~~ 397 (513) .+... ..--.+.+-..++|..+.||...++|-...+ ++.-...+.--..-+.-++++||..+|+.|-... ... T Consensus 271 ~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~eq~~~~f~~~~l~P~~~~ie~~ln~~L~~~~~~~~~ 350 (457) T protein:vir:62 271 AMSPDEAQFLQTRQFQVPEIARIFGVPPHLISDATNSTSWGSGLAEQNIAFTMFSLRPWLERIEAGFNRLLFAETADRFR 350 (457) T ss_pred cCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCccccCce Confidence 44321 1111233345578999999998888643322 2223333332233344688999999999885432 223 Q ss_pred EEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCcccccccccccccccC Q lcl|NC_015263. 398 YFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAEN 477 (513) Q Consensus 398 ~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~ 477 (513) .++|.+-....-+.++.++.+.++.+-|.=..--.-+.+|+.|.+ -..-|..+.|+-- +.-+... +. T Consensus 351 ~i~fd~~~l~~~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~----------~g~~D~~~~~~n~--~~~~~~~-~~ 417 (457) T protein:vir:62 351 FVKFNLDEIKRGAPKERMELWSLGLQNGIYSIDEVRAAEDMTPLP----------DGLGEKYRVPLNL--GEIGEEP-EP 417 (457) T ss_pred EEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC----------CCCcceeeecccc--ccccccc-cc Confidence 456655566666888889998888877732222233345666631 1112445555321 1111111 11 Q ss_pred CccccCCCCcCCCCc---------ccccccCCCCCCCCCC Q lcl|NC_015263. 478 AIKEKGKENGRPTNE---------TTGNKDSDETQRAKDK 508 (513) Q Consensus 478 ~~~~~~~~~grPt~e---------t~~n~~~~~~~~~~d~ 508 (513) .+.+++.+.+.|..+ +.+..+.++++...++ T Consensus 418 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~ 457 (457) T protein:vir:62 418 EPAPAPPAIDPPAEEPADDEEPDNAEGDPDEGETEDDDDA 457 (457) T ss_pred cccCCCccCCCCccCCCCCCCCCCCCCCCccccccccccC Confidence 111111111111111 1122222222222222 No 65 >protein:vir:100691 Length: 535 # NCBI annotation: hypothetical protein # Family: family:all:2446 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164747;genbank:gi:56693160;genbank:GeneID:3197324 Probab=97.90 E-value=1.2e-05 Score=47.53 Aligned_cols=448 Identities=10% Similarity=0.032 Sum_probs=178.5 Q ss_pred CCCccchheeeeehhhhhhHHHHHHHHHHHHHhhccCcccccccccccchHHHH-HHHh--hh---cc---ChhHHHHHH Q lcl|NC_015263. 1 MVKNKKKRLSMIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKV-RKIV--KE---YR---NEGNQKTLR 71 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~-k~~i--~~---~~---P~~n~~~ir 71 (513) .+|.-+.-.||-. .-.++|--.-|-.+=+++ ....|-- .+++++. .--+ .+ |. ...|...+. T Consensus 3 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~-~~~~~~~-------~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~ 73 (535) T protein:vir:10 3 ILKDLRNAFSLSN-KKSTSYIELGDYDKDIVN-KAIRPGR-------ASARDTVDGIDIADGNVAGQYSVASISDVLSTK 73 (535) T ss_pred hhHHHHHHHHhhh-hhhhhhHHHhhhhHHHHH-hhhhhhh-------hhhhccccccccccCCcccccccCccccccCHH Confidence 1111111112111 000111000000010000 0000100 0000000 0000 00 00 122222333 Q ss_pred HHHHHHHhhcchHHHHHHHHhhcccc-------cceEee--cc--chhhhhhcchhHHHHHHHHHHhhc-C----h---- Q lcl|NC_015263. 72 KVSEDLAVQSQQYQRLLNFYANMPLY-------AYSVVP--FK--DISTANENKLKKELATVTEFLSRL-N----P---- 131 (513) Q Consensus 72 ~~s~~lY~~sg~~~rlidy~~~mpt~-------dY~I~P--~~--~~~~~~~~~~~~~y~~v~~~L~k~-n----~---- 131 (513) ++.+. +..+.++++.|+.+.+..+. .-.+.+ +. ......++.....-+.+..+|... | - T Consensus 74 ~l~~~-~~~~~~~~~~i~t~~~~va~~~~i~~~s~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~~~~~~~ 152 (535) T protein:vir:10 74 KLLKA-YADNDIVQAIIRTRTNQVLTYSNPSRYNRNGVGFKVELKDATKVMSKAQIKRAHEIEDFIYNTGSEYYEWRDTF 152 (535) T ss_pred HHHHH-hccChhHHHHHHHHHHHHHHHHHHHHHhcccCcceeEEEeccCCCcchhhhhhhHHHHHHHhCCCCCCChhHHH Confidence 33333 33466777777776655441 011111 11 111111222333444455556532 2 1 Q ss_pred hHHHHHHHHHHHHhc-ceeEEEEEcCc--ceeeeecCcceeEEEEEECC----eeEEEEEeeeccCcchhccccHHHHHH Q lcl|NC_015263. 132 KYNFSKIVKLAMTVD-IFYGYVIDDKE--SVMIQQFPNDICKISSVSGG----VYNYVIDLDALVSADIVDYYPKEIQEA 204 (513) Q Consensus 132 k~~~~~i~~~~l~~g-~~~gy~i~d~~--~~~iq~lp~dyckIsg~~nG----~y~~~fD~syFd~~~~L~~~p~Ei~~~ 204 (513) ..++.+++.+++..| ..|.+++.+.. ..-+.+||++.|+|.--.+| .+.|.+.- . T Consensus 153 ~~~~~~lv~d~l~~~g~ay~~i~r~~~G~~~~L~~l~p~~V~v~~d~~~~~~~~~~~~~~~----~-------------- 214 (535) T protein:vir:10 153 PRLLTKIINDMYVQDQINIERIFKNDSNELDHFNAVDASKVVISYSPRSKDQPRKFEQFVS----E-------------- 214 (535) T ss_pred HHHHHHHHHHHHhhCCceEEEEEECCCCcEEEEEEeCCceeEEEEcCccccCceEEEEEec----C-------------- Confidence 234555666666554 56777776544 46789999999998743222 22221100 0 Q ss_pred HHHHhhhhhccCcccccCeeecCCceEEEEecCc-----cccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeee Q lcl|NC_015263. 205 VNKYTTMKKGNNKSASNWYEIQDKNSICIKINES-----SLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKL 279 (513) Q Consensus 205 y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~~~~-----~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~ki 279 (513) .....++...-+-|+.+.. ..+|+||...+...+--....++... .-..|-...-.-| T Consensus 215 ---------------~~~~~~~~~eiih~~~~~~~~~~~~~~G~Spi~~~~~~i~~~~aa~~~~~--~~f~ng~~p~giL 277 (535) T protein:vir:10 215 ---------------TKSVKFSERNLTFINYWNLSDTDRRGYGYSPVEASIPLIRAIYDTEQFNA--RFFSQGGTTRGIL 277 (535) T ss_pred ---------------ceeEEECcccEEEEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHH--HHHhccCCccEEE Confidence 0122344444455554221 33588888777765555554444321 1122211111112 Q ss_pred ccccCCCCCccccCHHHHHHHHHHHHHhc--cccce--EEEe--cccccccccccccccch---hhhhhHHhhhhhhhhh Q lcl|NC_015263. 280 ETRSSNDNNDFTLDMPMMNYFHEALSMTV--PDNVG--VVTS--PMEIDTVSFDKDSSTDD---SVEKATKNFWDNAGVS 350 (513) Q Consensus 280 p~~~~n~~~~~~vd~~~~~~~~~~ik~~L--p~gv~--~v~s--P~~~d~i~ld~~~~~~d---tv~~~~~~i~~~~GiS 350 (513) -+ ...++..++.++++.+.+.+++.. .++.+ .|+. .+++..+.+. ..+.+ +..-..+.|-.+.||. T Consensus 278 ~~---~~~~~~~ls~e~~e~lk~~~~~~~~G~~nag~~~vl~~~g~~~~~l~~~--~~D~qfle~~~~~~~eIa~afgVP 352 (535) T protein:vir:10 278 VI---DQDGDAQANQMMLAGIRRQWTSQGSGLGGAWKIPILAAKDAKFVNMTQN--SRDMEFDKFLNFMIYDTAAIFQMQ 352 (535) T ss_pred Ee---cCCCCcccCHHHHHHHHHHHHHHhcCcccccccccccCCCceEEecCCC--hhHHHHHHHHHHHHHHHHHHhCCC Confidence 22 345667888888888888887766 22222 2222 3444444442 22223 3444567899999999 Q ss_pred hhhccCCC-cc------------hHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhhcccceEEEEEecCCCCccHHHHHH Q lcl|NC_015263. 351 QILFSSDN-KT------------SQGIAMSIATDEQ-FIFGVINQLERWLNRYLLLNGMSKYFKATMLEVTHFSKKEAHD 416 (513) Q Consensus 351 ~~Lfn~d~-~s------------~~~~~~SI~~d~~-~~~~~~~~iE~~~N~~i~~~~~~~~f~~~~l~~T~fn~ke~~~ 416 (513) ..++|-.. .+ ++++......... -+.-++.+||..+|+.|-... +..++|.|-....-+.++..+ T Consensus 353 p~~lG~~~~at~sn~~~~~~~~~~s~~E~~~~~~~~~~L~P~l~~ie~~ln~~Ll~~~-~~~~~f~f~~l~~~d~~~r~~ 431 (535) T protein:vir:10 353 PEEINFPNNGGSTGKSGTKSVNEGSTAKAKLESSKDKGLTPLLSFIEQVINDKIMRYV-DTDYRFSFTLGDAQDKLQEEQ 431 (535) T ss_pred HHHhccccCcccccchhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhccccc-CCeEEEEeccccccCHHHHHH Confidence 88885321 11 1122222222222 233588999999999875432 234666666555555555444 Q ss_pred HHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHH-HhhCcccccCcccccccccc-cccccCCc-----cccCCCCcCC Q lcl|NC_015263. 417 RYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVEN-EMLDLPEIMTPLSSSFNTSG-SDIAENAI-----KEKGKENGRP 489 (513) Q Consensus 417 ~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~-e~L~l~~~~~Pl~TS~T~Sg-~~~~~~~~-----~~~~~~~grP 489 (513) .+. ...- -|+++.|+-.++-++- +-.|..-.+.+.+ +...+ ..+..+.+ .....+..++ T Consensus 432 ~~~-~~~~-----------g~lT~NE~R~~~gl~piegGD~~~~~~~~~--~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 497 (535) T protein:vir:10 432 VWK-LKLA-----------NGYFINEYRKDHGLKTVDGLDVPGFIGSAE--NFINATGFGQPNVPDSSDDSGSTLGERER 497 (535) T ss_pred HHH-HHHc-----------CCCCHHHHHHHhCCCCCCCccccccccchh--hcccccccccccCCCCCCCccccCCcccc Confidence 332 2211 2556666555443331 1112111111111 11101 00001111 1111000111 Q ss_pred CC----cccccccCCCCCCCCCCccCC-C Q lcl|NC_015263. 490 TN----ETTGNKDSDETQRAKDKPANT-Q 513 (513) Q Consensus 490 t~----et~~n~~~~~~~~~~d~~~~~-~ 513 (513) .. +....+.+++.+.+...|... | T Consensus 498 q~~~~~~~~~~~g~~~~~~~~~~~~~~~~ 526 (535) T protein:vir:10 498 QERIQHSKDYEKGKDDPKSPLPKPSESDD 526 (535) T ss_pred CcccccccccccCCCCCCCCCCcCCCCCc Confidence 10 001111112222221111111 1 No 66 >protein:vir:189 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037699;genbank:gi:9634156;genbank:GeneID:1262529 Probab=97.86 E-value=1.4e-05 Score=47.18 Aligned_cols=392 Identities=13% Similarity=0.122 Sum_probs=179.9 Q ss_pred CCCccchheeeeehhhhhhHHHHHHHHHHHHHhhcc--------CcccccccccccchHHHHHHHhhhccChhHHHHHHH Q lcl|NC_015263. 1 MVKNKKKRLSMIDVESISSYSNKRNNRISILRDDNR--------TPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRK 72 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~--------~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~ 72 (513) |-. -.||==...++++|.-+.. +|...+.+ ...+- ..++... . T Consensus 1 ~~~--------------~~~~~~~~~~~g~~~~~~~~f~~~~~~~~~~~~~~-~~~~~--------~~~~~~~------~ 51 (424) T protein:vir:18 1 MEE--------------PKYTIDLRTNNGWWARLKSWFVGGRLVTPNQGSQT-GPVSA--------HGYLGDS------S 51 (424) T ss_pred CCC--------------CccccccCCCCchHHHHHhhccccccccccchhhc-ccccc--------ccccccc------c Confidence 111 1122222223333333321 12111111 00000 0000000 0 Q ss_pred HHHHHHhhcchHHHHHHHH----hhcccccceEeeccchhhhhhcchhHHHHHHHHHHh-h----cChhHHHHHHHHHHH Q lcl|NC_015263. 73 VSEDLAVQSQQYQRLLNFY----ANMPLYAYSVVPFKDISTANENKLKKELATVTEFLS-R----LNPKYNFSKIVKLAM 143 (513) Q Consensus 73 ~s~~lY~~sg~~~rlidy~----~~mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~-k----~n~k~~~~~i~~~~l 143 (513) ++.--+-..+.+.+.|+.+ ++||.-=|...+ ... +..+..+ +.+...|. + +.--.+...++..++ T Consensus 52 v~~~~al~~~~v~~cv~~Ia~~iA~lp~~vy~~~~----~~~-~~~~~~~-~~l~~lL~~~PN~~~t~~~f~~~~~~~ll 125 (424) T protein:vir:18 52 INDERILQISTVWRCVSLISTLTACLPLDVFETDQ----NDN-RKKVDLS-NPLARLLRYSPNQYMTAQEFREAMTMQLC 125 (424) T ss_pred ccHHHhhccHHHHHHHHHHHHhhccCceEEEEecc----CCc-eeeeccc-cHHHHHHhhccCCCCCHHHHHHHHHHHHh Confidence 1111112223344445544 445432222211 110 1111111 12333443 3 334566778888999 Q ss_pred HhcceeEEEEEcC--cceeeeecCcceeEEEEEECCeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCccccc Q lcl|NC_015263. 144 TVDIFYGYVIDDK--ESVMIQQFPNDICKISSVSGGVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASN 221 (513) Q Consensus 144 ~~g~~~gy~i~d~--~~~~iq~lp~dyckIsg~~nG~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~ 221 (513) ..|..|.++.-+. ..+-+.++|+.+|.|. ..+|...|.+.. + .. T Consensus 126 l~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~-~~~~~~~y~~~~----~-----------------------------g~ 171 (424) T protein:vir:18 126 FYGNAYALVDRNSAGDVISLLPLQSANMDVK-LVGKKVVYRYQR----D-----------------------------SE 171 (424) T ss_pred hcCCeEEEEEECCCCcEEEEEEecCcceEEE-EcCCeEEEEEEe----C-----------------------------Ce Confidence 9999999988654 4478999999999986 344555444321 0 11 Q ss_pred CeeecCCceEEEE-ecCccccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHH Q lcl|NC_015263. 222 WYEIQDKNSICIK-INESSLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYF 300 (513) Q Consensus 222 W~~L~~~kt~~ik-~~~~~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~ 300 (513) .++++++.-+.|+ ++.+...|++|...+...+--....++.. ..-..|-.-.-..|-+ ++-.++.++++++ T Consensus 172 ~~~~~~~eVihir~~~~dg~~G~spi~~~~~~i~~~~~~~~~~--~~~f~ng~~~~gil~~------~~~~l~~e~~~~~ 243 (424) T protein:vir:18 172 YADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQ--RDFFANGAKSPQILST------GEKVLTEQQRSQV 243 (424) T ss_pred EEEeccccEEEecCcCCCCcccccHHHHHHHHHHHHHHHHHHH--HHHHhccCCcceEEEe------CCcCCCHHHHHHH Confidence 2344554455565 34455678888776554432222232221 1112221111111111 0113566677666 Q ss_pred HHHHHHhc--cc--cceEEEeccccccccccc-ccccchhhhhhHHhhhhhhhhhhhhccCCCcch---HHH-HHHHHHH Q lcl|NC_015263. 301 HEALSMTV--PD--NVGVVTSPMEIDTVSFDK-DSSTDDSVEKATKNFWDNAGVSQILFSSDNKTS---QGI-AMSIATD 371 (513) Q Consensus 301 ~~~ik~~L--p~--gv~~v~sP~~~d~i~ld~-~~~~~dtv~~~~~~i~~~~GiS~~Lfn~d~~s~---~~~-~~SI~~d 371 (513) .+.+++.. ++ ++..+-..+++..+.+.. +..--.+..-..++|..+.||...++|....++ +.+ ...+.-- T Consensus 244 ~~~~~~~~~~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~eq~~~~f~ 323 (424) T protein:vir:18 244 EENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFL 323 (424) T ss_pred HHHHHHHhCCcccCCceeccCCceEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCCcccccccHHHHHHHHH Confidence 66666544 21 344554555555555431 111123344455789999999999986432221 222 2333333 Q ss_pred HHHHHHHHHHHHHHHHHHHhh--cccceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHH Q lcl|NC_015263. 372 EQFIFGVINQLERWLNRYLLL--NGMSKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKV 449 (513) Q Consensus 372 ~~~~~~~~~~iE~~~N~~i~~--~~~~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~ 449 (513) ..-+.-++++||..+|++|-. +..+..|+|.+-+...-+.++..+.+.++.+-|.=..--.=+.+|+.|. T Consensus 324 ~~tl~P~~~~ie~~ln~~L~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi-------- 395 (424) T protein:vir:18 324 QYTLQPYISRWENSIQRWLIPSKDVGRLHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNMPPL-------- 395 (424) T ss_pred HHHHHHHHHHHHHHHHhhcCCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC-------- Confidence 333346999999999998842 3234457777667777788999999888876663222222223455442 Q ss_pred HHHhhCcccccCccccccc-ccccccccCCccccCCCCcCCCCcccccccCCC Q lcl|NC_015263. 450 ENEMLDLPEIMTPLSSSFN-TSGSDIAENAIKEKGKENGRPTNETTGNKDSDE 501 (513) Q Consensus 450 E~e~L~l~~~~~Pl~TS~T-~Sg~~~~~~~~~~~~~~~grPt~et~~n~~~~~ 501 (513) | |-|..++|+...-. .-| +.+.|++. |. T Consensus 396 ~----ggD~~~~~~n~~~l~~~~-------------~~~~~~~n-------~a 424 (424) T protein:vir:18 396 P----GGDVAMRQAQYVPITDLG-------------TNKEPRNN-------GA 424 (424) T ss_pred C----CcCeeeeccCccchhhhh-------------ccCCcccc-------CC Confidence 1 22344444332111 001 01112111 00 No 67 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=97.84 E-value=1.6e-05 Score=46.95 Aligned_cols=430 Identities=10% Similarity=0.013 Sum_probs=180.4 Q ss_pred cCcccccc---ccccc-chH---HHHHHHhhhccChhHHHHHHHHHHHHHh--------------------hcchHHHHH Q lcl|NC_015263. 36 RTPVFGAP---VGSLT-SSQ---SKVRKIVKEYRNEGNQKTLRKVSEDLAV--------------------QSQQYQRLL 88 (513) Q Consensus 36 ~~~~~~s~---~~s~~-~s~---d~~k~~i~~~~P~~n~~~ir~~s~~lY~--------------------~sg~~~rli 88 (513) +||-+.+. ++-.. ... +.+++.+..| ......++.+.+|+-. ..+.-+.++ T Consensus 1 ~~~~~~~~~~~~~~~~~l~~~e~~~i~~L~~~~--~~~~~r~~~l~~YY~G~~~i~~~~~~~p~~~~~~~~v~n~~~~iV 78 (504) T protein:vir:99 1 MTEETTSASKFTFRIPELNDDVVDKVNGLYQQL--VDRTPRNLLRASFYDGKYAIRQIGNLIPPEYLRTATVLGWSAKAV 78 (504) T ss_pred CCccCCcccccccccCCCCHHHHHHHHHHHHHH--HHHhHHHHHHHHHHhccccchhccccccHHHHHHhhccCcHHHHH Confidence 33321110 00000 011 1122222222 1112222222222222 122223344 Q ss_pred HHHhhcccccceEeeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcceeEEEEEcCcc---eeeeecC Q lcl|NC_015263. 89 NFYANMPLYAYSVVPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDIFYGYVIDDKES---VMIQQFP 165 (513) Q Consensus 89 dy~~~mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~~~gy~i~d~~~---~~iq~lp 165 (513) +.++.-..+|.+-.| + ........++ ....-++....+.+.+++++.|.-|.+...+.++ ..|..++ T Consensus 79 d~~a~rl~~~Gf~~~----d---~~~~~~~l~~---i~~~N~ld~~~~~~~~~a~iyG~af~~v~~~~d~~~~~~I~~~s 148 (504) T protein:vir:99 79 DTLARRCNLESFVWP----D---GDYGSIGGPD---VWDENFFATKANNAMVSSLIHGPAFLINTEGGAGEPDSLIHVKS 148 (504) T ss_pred HHHHhhhccceeeCC----C---CChhhHHHHH---HHHhcChhhHHHHHHHHHHhhCceeEEEecCCCCCceeEEEEec Confidence 444443344433322 1 0111122222 3455567788899999999999999988865433 4688999 Q ss_pred cceeEEEEE-ECCeeEEEEEeeeccCcch---hc-cccHHHHHHHHHHhhhhhccCcccccCeeecCCceE---EEE-ec Q lcl|NC_015263. 166 NDICKISSV-SGGVYNYVIDLDALVSADI---VD-YYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSI---CIK-IN 236 (513) Q Consensus 166 ~dyckIsg~-~nG~y~~~fD~syFd~~~~---L~-~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~---~ik-~~ 236 (513) |.+|.++-= ..+...+++=..+-+.... .. ++|..+.. + ... ....|..=..+|.+ ++. +| T Consensus 149 P~~~~~iyD~~~~~~~~a~~~~~~d~~g~~~~~~~y~~~~~~~----~---~~~---~~~~~~~~~~~~~~gvPvV~~~n 218 (504) T protein:vir:99 149 AMQATGEWNSRRNAMDSLLSITSRDAEGHPTGIALYEDGVTVT----A---DMD---DDGDWHADVRTHKLGVPVEVLPY 218 (504) T ss_pred cceeEEEEeCCCCceeEEEEEEEecCCCeEEEEEEEcCCcEEE----E---EEc---CCceeeeccccCCCCcceEEecc Confidence 999986642 3456566654443332211 11 12222211 0 000 00122211111221 111 22 Q ss_pred C---ccccchhhHHHHHHhHHHHHHHHHHH-hhHhhhhhce--eeeeeeccccCCCCCccccCHHHHHHHHHHHHHh--c Q lcl|NC_015263. 237 E---SSLTPVPPFAGTFDSIYDIHSFKDLR-NDKAELQNYK--LLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMT--V 308 (513) Q Consensus 237 ~---~~~~~ip~f~~v~~d~~di~~~kdL~-~~~~~i~n~~--ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~--L 308 (513) . +.++|.+-++.-+.++.|.....=.. .+..+..... .|...-|-...+++|+ ....+...+... + T Consensus 219 ~~~~~~~~G~sei~~~v~~l~Da~~~~~~~~~~~~e~~a~p~r~i~G~~~~~~~~~d~~------~~~~~~~~~~~i~~~ 292 (504) T protein:vir:99 219 KPREDRPLGSSRITRPVMSLQQRALKGCIRMDGHADVYSFPQLILLGADAKNFRNKDGS------MKPAWQIALARVFAL 292 (504) T ss_pred cccCccccCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhccCCcccccccccc------ccchhhhhhhhhhcC Confidence 2 33455554443333443333222211 1112221111 1111111000122332 222333333322 2 Q ss_pred ccc-ceEEEeccccccccccccccc--chhhhhhHHhhhhhhhhhhhhcc--CC--CcchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015263. 309 PDN-VGVVTSPMEIDTVSFDKDSST--DDSVEKATKNFWDNAGVSQILFS--SD--NKTSQGIAMSIATDEQFIFGVINQ 381 (513) Q Consensus 309 p~g-v~~v~sP~~~d~i~ld~~~~~--~dtv~~~~~~i~~~~GiS~~Lfn--~d--~~s~~~~~~SI~~d~~~~~~~~~~ 381 (513) |+. -+.+...-..+--.|++.... -+.+...-.+|...+||..-.|| ++ ..|+.+++.....-...+.+-.+. T Consensus 293 ~~~~~~~~~~~~~~~~~q~~~~~l~~~~~~l~~~i~~~a~~t~~P~~~lG~~~~~n~sSa~Ai~~~~~~L~~ka~~k~~~ 372 (504) T protein:vir:99 293 PDDEDEPDAARARADVKQFPASSPQPHIEMLEQIAMMFSGETSIPVESLGFSNRANPTSADAYIASREDLIAEAEGATDD 372 (504) T ss_pred CCccccccccCccceeeecCCCChHHHHHHHHHHHHHHHhhhCCCHHHhcccccccccHHHHHHHHHHHHHHHHHHHHHH Confidence 321 111111101111233332211 13344445566666777765664 33 235556766666555555566666 Q ss_pred HHHHHHHHH------hhccc--c---eEEEEEecCCCCccHHHHHHHHHHHHhcCC---cHHHHHHHHhCCCHHHHHHHH Q lcl|NC_015263. 382 LERWLNRYL------LLNGM--S---KYFKATMLEVTHFSKKEAHDRYITDAQYGF---PVKVYLASLMGIDPVAFTGLL 447 (513) Q Consensus 382 iE~~~N~~i------~~~~~--~---~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~---~~~~~laa~~G~~p~~~~~~~ 447 (513) ++.=+.+.+ ..+.. . ...++.|-+..+-|..+.++.+.|+.+-|. +....+...+|++|.++..+. T Consensus 373 f~~~l~~~~rla~~~~~~~~~~~~~~~~~~v~w~d~~~~s~a~~aDa~~Kl~~ag~~l~~~~~~l~~~lg~~~~ei~r~~ 452 (504) T protein:vir:99 373 WSPAFRRSMIRALAIKNGLDRIPPEWKTIDSKFRSPLYLSKAAQADAGAKMLGAGPEWLKETEVGLELLGLTPQQAKRAL 452 (504) T ss_pred HHHHHHHHHHHHHHHhcCCCccccccccceeEecCCCccCHHHHHHHHHHHHhhccccccchHHHHhhcCCCHHHHHHHH Confidence 555333321 11111 1 246778888899999999999999999873 234566777899999987665 Q ss_pred HHHH--HhhCcccccCcccccccccccccccCCccccC----CCCcCCCCcc Q lcl|NC_015263. 448 KVEN--EMLDLPEIMTPLSSSFNTSGSDIAENAIKEKG----KENGRPTNET 493 (513) Q Consensus 448 ~~E~--e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~----~~~grPt~et 493 (513) ..+. +..++-+.+...+..=...+..++...+.++. ..+|||+-+. T Consensus 453 ~e~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~e~a~~~~~~~~~~p~~~~ 504 (504) T protein:vir:99 453 AERRRASSVSIIEALNRRQQEAATAGEDQDQGAGEPPANEPPAALGRPTLVG 504 (504) T ss_pred HHHHHHhhHHHHHHHhcccCCCCCCCCCCCcCCCCCCCCCCCccCCCcccCC Confidence 5433 23333334433332111111111111111111 1122222221 No 68 >protein:vir:97060 Length: 432 # NCBI annotation: putative head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453563;genbank:gi:84662598;genbank:GeneID:5142475 Probab=97.82 E-value=1.7e-05 Score=46.78 Aligned_cols=412 Identities=10% Similarity=0.057 Sum_probs=181.6 Q ss_pred eeehhhhhhHHHHHHHHHHHHHhhccCccccc--ccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhcchHHHHH Q lcl|NC_015263. 11 MIDVESISSYSNKRNNRISILRDDNRTPVFGA--PVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQQYQRLL 88 (513) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s--~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~~~rli 88 (513) |.+-.-..-+.-.+. .+ ....|..+. ..++. .+.....+--. +- ..-..++.--+-..+.+.+.| T Consensus 1 ~~~~~~~g~~~~~~~----~~--~~~~~~~~~~~~~~~~---~~~~~~~~~~~-~~---~~g~~v~~~~a~~~~aV~~~v 67 (432) T protein:vir:97 1 MPDEKKLGLLGQLKA----MF--VPPDPVDIGGGQTFTP---VNATARDLGII-IS---DTGAAVNADAIMRLDAVAACV 67 (432) T ss_pred CCCcccCchhhhhHh----hc--CCcccccccccccccc---Cchhhhhhccc-cc---ccCcccchHhhhcchHHHHHH Confidence 443332222221111 11 011111100 00110 00001111000 00 011112222233456677777 Q ss_pred HHHhh-cccccceEeeccchhhhhhcchhHHHHHHHHHHhh-----cChhHHHHHHHHHHHHhcceeEEEEEc-Ccceee Q lcl|NC_015263. 89 NFYAN-MPLYAYSVVPFKDISTANENKLKKELATVTEFLSR-----LNPKYNFSKIVKLAMTVDIFYGYVIDD-KESVMI 161 (513) Q Consensus 89 dy~~~-mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k-----~n~k~~~~~i~~~~l~~g~~~gy~i~d-~~~~~i 161 (513) +.+++ +..+...++ .... ++..+..-+.....|.. +.--.+...++..++..|..|.+++.+ +....+ T Consensus 68 ~~Ia~~ia~lp~~~y----~~~~-~g~~~~~~~pl~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~~g~~~~L 142 (432) T protein:vir:97 68 KLVSQAVAAMPLMMY----MRTP-DGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVTDGRIESL 142 (432) T ss_pred HHHHHhhccCceEEE----EecC-CCcccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEecCCcEEEE Confidence 76642 333333332 1110 11111111223334432 345566777888999999999988764 445688 Q ss_pred eecCcceeEEEEEECCeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEEe-cCccc Q lcl|NC_015263. 162 QQFPNDICKISSVSGGVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKI-NESSL 240 (513) Q Consensus 162 q~lp~dyckIsg~~nG~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~-~~~~~ 240 (513) .++|+++|.|.--.+|.+.|.+.. .+. .-++++.+.-+.|+. ..+.. T Consensus 143 ~~l~p~~v~v~~~~~g~~~y~~~~---~~g-----------------------------~~~~~~~~~iih~r~~~~dg~ 190 (432) T protein:vir:97 143 QYLANDRLTITTDTKGNTAYRYRR---TDG-----------------------------QMIDIPRQQIWKIMGYSLDGE 190 (432) T ss_pred EEEcCcceEEEEcCCCcEEEEEEe---cCc-----------------------------eEEEEccccEEEecCcCCCCc Confidence 999999999986677876665321 111 112334433444442 23445 Q ss_pred cchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhcc-ccceEEEecc Q lcl|NC_015263. 241 TPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTVP-DNVGVVTSPM 319 (513) Q Consensus 241 ~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~Lp-~gv~~v~sP~ 319 (513) .|++|...+-..+--....++.. ..-..|-...-.-|-+ ++ .++.++++.+.+.+..+-- .++..+-..+ T Consensus 191 ~G~spi~~~~~~i~~~~a~~~~~--~~~f~ng~~~~gil~~-----~~--~l~~e~~~~~~~~~~~~~nag~~~vl~~g~ 261 (432) T protein:vir:97 191 NGLSAIRYGAQIFGTAIAAEAQA--ARAFRNGQLQSVYYQI-----DR--FLTDDQYDSFSKKVSGSVEAGRAPLLEGGM 261 (432) T ss_pred ccccHHHHHHHHHHHHHHHHHHH--HHHHhccCCcceeEec-----CC--CCCHHHHHHHHHHHhhhhcCCCceecCCCc Confidence 68888776644332222222211 1112221111111111 11 2567777777776665432 2344544555 Q ss_pred ccccccccc-ccccchhhhhhHHhhhhhhhhhhhhccCCCcc----hHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhh- Q lcl|NC_015263. 320 EIDTVSFDK-DSSTDDSVEKATKNFWDNAGVSQILFSSDNKT----SQGIA-MSIATDEQFIFGVINQLERWLNRYLLL- 392 (513) Q Consensus 320 ~~d~i~ld~-~~~~~dtv~~~~~~i~~~~GiS~~Lfn~d~~s----~~~~~-~SI~~d~~~~~~~~~~iE~~~N~~i~~- 392 (513) ++..+.+.- +..--.+.+-..++|..+.||...++|....+ ++++. ..+.--..-+.-++++||..+|+.|-. T Consensus 262 ~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~s~~e~~~~~f~~~tl~P~~~~ie~~ln~kLl~~ 341 (432) T protein:vir:97 262 DVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQLGFLTMTLSPWLRRIEQSIALNLLTP 341 (432) T ss_pred eEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCcccccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhccCc Confidence 655665531 11212334455588999999999999643221 12332 222222223345889999999997743 Q ss_pred -cccceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCcccccccccc Q lcl|NC_015263. 393 -NGMSKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSG 471 (513) Q Consensus 393 -~~~~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg 471 (513) +..+..|+|..-+...-+.++.++.+.++.+-|. ++|.|+-.++-++- ++=.+.+.+.++-++ .- T Consensus 342 ~e~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~-----------~T~NE~R~~~glpp--~~g~~~~~~~~~~~~-pl 407 (432) T protein:vir:97 342 AERRRYFADFDTSALLRADSAARSSYYSQLVNNGL-----------MTRDEAREIEGLPK--LGGNAAVLTVQSAMV-PL 407 (432) T ss_pred cccCceEEEeechhhhccCHHHHHHHHHHHHhCCC-----------CCHHHHHHHhCCCC--CCCCcceEeeccccc-ch Confidence 2223456776666666678888888777766652 34444433322221 110112222222221 10 Q ss_pred cc-cccCCccccCCCCcCCCCcccccccCCCCCCCCCCccC Q lcl|NC_015263. 472 SD-IAENAIKEKGKENGRPTNETTGNKDSDETQRAKDKPAN 511 (513) Q Consensus 472 ~~-~~~~~~~~~~~~~grPt~et~~n~~~~~~~~~~d~~~~ 511 (513) .. +.+..+. +.+|.+..+ .+.+.+ T Consensus 408 ~~~~~~~~~~---~~~~~~~~~-------------~~~~~~ 432 (432) T protein:vir:97 408 DSIGLQASPE---PASGLGNQQ-------------QDKVSK 432 (432) T ss_pred hhhcccCCCC---CCCCCCCcc-------------cccccC Confidence 10 0011110 011111111 111111 No 69 >protein:vir:9359 Length: 348 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803337;genbank:gi:29028648;genbank:GeneID:1258089 Probab=97.81 E-value=1.7e-05 Score=46.68 Aligned_cols=327 Identities=13% Similarity=0.142 Sum_probs=161.9 Q ss_pred HhhcccccceEeeccchhhhhhcchhHHHHHHHHHHh-h----cChhHHHHHHHHHHHHhcceeEEEEEcCc--ceeeee Q lcl|NC_015263. 91 YANMPLYAYSVVPFKDISTANENKLKKELATVTEFLS-R----LNPKYNFSKIVKLAMTVDIFYGYVIDDKE--SVMIQQ 163 (513) Q Consensus 91 ~~~mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~-k----~n~k~~~~~i~~~~l~~g~~~gy~i~d~~--~~~iq~ 163 (513) +++||- .++ . +++... +.+...|. + +.-..+...++..++..|..|.+..-+.. +.-+.+ T Consensus 1 ia~lp~---~~~----~---~~~~~~---~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~~ 67 (348) T protein:vir:93 1 MASLPL---KMY----E---DYKVVN---TEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFL 67 (348) T ss_pred Ccccce---EeE----e---cCcCcc---cHHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEE Confidence 566654 232 1 111111 23344454 3 23556667788888999999999886544 478899 Q ss_pred cCcceeEEEEEECC-eeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEEe--cCccc Q lcl|NC_015263. 164 FPNDICKISSVSGG-VYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKI--NESSL 240 (513) Q Consensus 164 lp~dyckIsg~~nG-~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~--~~~~~ 240 (513) +|++.|.|.-..+| ...|.+... .... +.+++.--+-|+- ..+.. T Consensus 68 l~~~~v~~~~~~~~~~~~y~~~~~---~g~~-----------------------------~~~~~~eiih~r~~~~~~~~ 115 (348) T protein:vir:93 68 LNPDVVEMLIENQSRELYYSIHAA---TGNK-----------------------------LIVHNMDMLHFKHIVASNMV 115 (348) T ss_pred EcCCceEEEEeCCCcEEEEEEEcC---CCeE-----------------------------EEEccccEEEecCCCCCCce Confidence 99999999855553 333333211 1110 1223322333432 12344 Q ss_pred cchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhccccceEEE--ec Q lcl|NC_015263. 241 TPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTVPDNVGVVT--SP 318 (513) Q Consensus 241 ~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~Lp~gv~~v~--sP 318 (513) .|++|...+. +.++++..-+--.. ....+-..++-+.+ ..++.++++.+.+.+++...+.-+.++ .. T Consensus 116 ~G~s~~~~~~-~~i~~~~~~~~~~~-~~~~~~~~~i~~~~---------~~l~~e~~~~~~~~~~~~~~n~~~~~vl~~g 184 (348) T protein:vir:93 116 QGISPIDVLK-NTTDFDNAVRTFNL-TEMQKPDSFMLKYG---------SNVSTEKRQQVLEDFKQYYEENGGILFQEPG 184 (348) T ss_pred eeccHHHHHH-HHHHHHHHHHHHHH-HhcCCCceeEEecC---------CCCCHHHHHHHHHHHHHHhhcCCCeeecCCC Confidence 6777766554 44444332221111 11111111111111 136777777777777666643223333 33 Q ss_pred ccccccccccccccchh---hhhhHHhhhhhhhhhhhhccCCC-cchHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_015263. 319 MEIDTVSFDKDSSTDDS---VEKATKNFWDNAGVSQILFSSDN-KTSQGIAM-SIATDEQFIFGVINQLERWLNRYLLLN 393 (513) Q Consensus 319 ~~~d~i~ld~~~~~~dt---v~~~~~~i~~~~GiS~~Lfn~d~-~s~~~~~~-SI~~d~~~~~~~~~~iE~~~N~~i~~~ 393 (513) +++..+.+. ..+.+. .+-..++|..+.||...++++.. .+.+.+.. +..--..-+.-++++||..+|+.|-.. T Consensus 185 ~~~~~l~~~--~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~~~~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~ 262 (348) T protein:vir:93 185 VEIEPLPKK--YVSEDIVASENLTRERVANVFQLPSIFLNARSNTNFAKNEELNRFYLQHTLLPIVKQYEEEFNRKLLTK 262 (348) T ss_pred ceEEEcCCC--hhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCc Confidence 344434332 222232 33355789999999999887543 33333322 222333334468999999999987422 Q ss_pred ---ccceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCccccccccc Q lcl|NC_015263. 394 ---GMSKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTS 470 (513) Q Consensus 394 ---~~~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~S 470 (513) ..+..|+|..-+...-+.++.++.+.++..-|.=..--.-+.+|+.|.+ |-|..++|.. ++.- T Consensus 263 ~~~~~g~~i~fd~~~l~~~d~~~~a~~~~~~~~~G~~T~NE~R~~~g~~p~~------------ggD~~~~~~n--~~~~ 328 (348) T protein:vir:93 263 TDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVE------------GGDKPLISGD--LYPI 328 (348) T ss_pred ccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC------------CcCeEeeccc--cccc Confidence 2244577766676667888889988888777732222222335555542 2233343322 1110 Q ss_pred ccccccCCccccCCCCcC-CCCcc Q lcl|NC_015263. 471 GSDIAENAIKEKGKENGR-PTNET 493 (513) Q Consensus 471 g~~~~~~~~~~~~~~~gr-Pt~et 493 (513) +.. ....+...||. .++++ T Consensus 329 ~~~----~~~~~~~~gg~~n~~~~ 348 (348) T protein:vir:93 329 DTP----LELRKSLKGGDKNVNES 348 (348) T ss_pred ccc----hhhcccccCCCCCcCCC Confidence 100 00000111111 11111 No 70 >protein:vir:96579 Length: 576 # NCBI annotation: ORF012 # Family: family:all:2446 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238542;genbank:gi:66391267;genbank:GeneID:5130361 Probab=97.73 E-value=2.5e-05 Score=45.85 Aligned_cols=452 Identities=11% Similarity=0.037 Sum_probs=189.7 Q ss_pred CCCc---cchhee--e--eehhhhh-----hHHHHHHHHHHHHHhhccCccccccc-------ccccchHHHHHHHhhhc Q lcl|NC_015263. 1 MVKN---KKKRLS--M--IDVESIS-----SYSNKRNNRISILRDDNRTPVFGAPV-------GSLTSSQSKVRKIVKEY 61 (513) Q Consensus 1 ~~~~---~~~~~~--~--~~~~~~~-----~~~~~~~~~~~i~~~~~~~~~~~s~~-------~s~~~s~d~~k~~i~~~ 61 (513) ||.. -++|+. . -|..++- -=.+|+++.. ..+.-..+..+.++ ++-...+-. .... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~a~~~p~~~~~~~~~~~----~~~p 74 (576) T protein:vir:96 1 MVTRLADIFKRLRLGRDYEDIIDTVPIDDGLQANIRNIEE--KSKELNKSLYGKQQAYAEPFLEVMDTNPEF----RTKR 74 (576) T ss_pred ChhhHHHHHHHHhccCccccchhhhhcccChhHHHHHhhh--hhhhhccccCCccchhhcceeeeeecCCCc----cccC Confidence 5542 222222 1 0111111 1134554432 12222222211211 111111111 0111 Q ss_pred cChhHHHHHHHHHHHHHhhcchHHHHHHHHhh----ccccc--------ceEeeccchhhhhhcchhHHHHHHHHHHhhc Q lcl|NC_015263. 62 RNEGNQKTLRKVSEDLAVQSQQYQRLLNFYAN----MPLYA--------YSVVPFKDISTANENKLKKELATVTEFLSRL 129 (513) Q Consensus 62 ~P~~n~~~ir~~s~~lY~~sg~~~rlidy~~~----mpt~d--------Y~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~ 129 (513) .|..+...++++.+- |..++++++.|+.++. +++.. ..|.+.. ......+.....-..+..+|..+ T Consensus 75 ~~~~~~~~~~~~l~~-~~~npiv~~~I~~ia~~vA~~~~~~~~~~~~~~~~i~lk~-~~~~~~~~~~~~~~~l~~~l~~~ 152 (576) T protein:vir:96 75 SYMKNSDNLHDVLKQ-FGNNPILNAIILTRSNQVAMYCQPSRYNERGLGFEVRMRD-LDAEPGKKEKEEIKRIENFILNT 152 (576) T ss_pred cchhhhhhhHHHHHH-hhcCHHHHHHHHHHHHHHHhhhhhhhhccccccceeEEec-CcCccchhhhHhhhhHHhhHhhc Confidence 133455566665543 4467899999998764 11111 1121111 11111111112222222333332 Q ss_pred ---------ChhHHHHHHHHHHHHhcceeEEEEEcCc----ceeeeecCcceeEEEEEECCeeEEE-EEeeeccCcchhc Q lcl|NC_015263. 130 ---------NPKYNFSKIVKLAMTVDIFYGYVIDDKE----SVMIQQFPNDICKISSVSGGVYNYV-IDLDALVSADIVD 195 (513) Q Consensus 130 ---------n~k~~~~~i~~~~l~~g~~~gy~i~d~~----~~~iq~lp~dyckIsg~~nG~y~~~-fD~syFd~~~~L~ 195 (513) ....++..++.+++..|..|.|.+-+.+ .+.+.++|+..|++.--.+|...+. .-........ T Consensus 153 ~~~~~p~~~t~~~f~~~lv~dlll~Gna~~~i~~~rd~~g~~~~L~pl~p~~V~v~~~~dg~~~~~~~~~~~~~~~~--- 229 (576) T protein:vir:96 153 GRDKDIDRDSFQSFCRKIVRDTYTYDQVNFEKVFNKKNATTMDKFIAVDPSTIFYATDKNGKIIKGGKRFVQVINKK--- 229 (576) T ss_pred cCCCCCccccHHHHHHHHHHHHHhcCCeEEEEEEecCCCCceEEEEEeCCceeEEEECCCCceeeeeeEEEEecCCc--- Confidence 2345777888889999999999885433 4589999999999986666543211 1111100000 Q ss_pred cccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEEecCc-----cccchhhHHHHHHhHHHHHHHHHHHhhHhhhh Q lcl|NC_015263. 196 YYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKINES-----SLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQ 270 (513) Q Consensus 196 ~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~~~~-----~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~ 270 (513) .-..++...-+.|..+.. ..+|+||...+...+--.....+.. ..-.. T Consensus 230 -------------------------~~~~~~~~dii~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~--~~~f~ 282 (576) T protein:vir:96 230 -------------------------VVASFTSREMAMGIRNPRTELSSSGYGLSEVEIAMKQFIAYNNTETFN--DRFFS 282 (576) T ss_pred -------------------------eEEEecccceEEEeecCCCCcccCcccccHHHHHHHHHHHHHHHHHHH--HHHHh Confidence 112233322233332211 3468888877665554444443322 11122 Q ss_pred hceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhcc--ccce--EEEec--cccccccccc-ccccchhhhhhHHhh Q lcl|NC_015263. 271 NYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTVP--DNVG--VVTSP--MEIDTVSFDK-DSSTDDSVEKATKNF 343 (513) Q Consensus 271 n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~Lp--~gv~--~v~sP--~~~d~i~ld~-~~~~~dtv~~~~~~i 343 (513) |-...-.-|-+ .++-.++.++++.+.+.+.++.. ++.+ .++.+ +++..+.+.. +..--.+..-+.++| T Consensus 283 Ng~~p~giL~~-----~~~~~ls~e~~~~lr~~~~~~~~G~~nag~~p~vl~~G~~~~~ls~~~~d~qfle~~~~~~~~I 357 (576) T protein:vir:96 283 HGGTTRGILQI-----KSEQQQSQRALENFKREWKSSFSGINGSWQVPVVMADDIKFVNMTPTANDMQFEKWLTYLINII 357 (576) T ss_pred ccCCCceEEEe-----CCCCCCCHHHHHHHHHHHHHHhccccccccceeecCCCceEEeccCChhhHHHHHHHHHhHHHH Confidence 22111111211 12334677888888888887762 2222 23333 3333333321 112223455566889 Q ss_pred hhhhhhhhhhccCCCc-------------chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccceEEEEEecCCCCcc Q lcl|NC_015263. 344 WDNAGVSQILFSSDNK-------------TSQGIAMSIATDEQFIFGVINQLERWLNRYLLLNGMSKYFKATMLEVTHFS 410 (513) Q Consensus 344 ~~~~GiS~~Lfn~d~~-------------s~~~~~~SI~~d~~~~~~~~~~iE~~~N~~i~~~~~~~~f~~~~l~~T~fn 410 (513) -.+.||...++|-... .++.-.....--..-+.-++.+||..||+.|-... ...|.+.|++...-+ T Consensus 358 a~afgVPp~~lG~~~~~~~~g~~~~~s~t~sn~e~~~~~f~~~tL~P~~~~ie~~ln~~Ll~~~-~~~~~~~f~r~d~~~ 436 (576) T protein:vir:96 358 SALYGIDPAEIGFPNRGGATGGKGGNTLNEADPGKKQQQSQNKGLQPLLRFIEDLINTHIISEY-SDKYVFQFVGGDTKS 436 (576) T ss_pred HHHhCCCHHHccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhc-cCceEEEeccCCHHH Confidence 9999999988863221 12222222222333344688999999999876442 345788888776555 Q ss_pred HHHHHHHHHHHHhcCC-cHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCccccccccccccc----ccCCcc----- Q lcl|NC_015263. 411 KKEAHDRYITDAQYGF-PVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDI----AENAIK----- 480 (513) Q Consensus 411 ~ke~~~~~~~~~~~G~-~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~----~~~~~~----- 480 (513) +.+..+... +..-|+ .+=..- +.+|+.|.+ |-|..+.|+. ..+.... ...... T Consensus 437 ~~e~~~~~~-~~~~G~lT~NE~R-~~~gl~pie------------gGD~~~~~~~---~~~~~~~~~~~~~e~~~~~~~~ 499 (576) T protein:vir:96 437 ELDKIKILQ-EEVKTYKTVNEAR-KEKGLKPIE------------GGDVLLDGSF---IQSMSLNTQKEQYEDTKQKERF 499 (576) T ss_pred HHHHHHHHH-HHhcCccCHHHHH-HHhCCCCCC------------Ccceeccccc---cccccccccCCCCCCccccccc Confidence 555443321 121232 222211 224444432 2223333321 1111100 000000 Q ss_pred ------ccCCCCcCCCC---cccccccCCCCCCCCCCc--cCCC Q lcl|NC_015263. 481 ------EKGKENGRPTN---ETTGNKDSDETQRAKDKP--ANTQ 513 (513) Q Consensus 481 ------~~~~~~grPt~---et~~n~~~~~~~~~~d~~--~~~~ 513 (513) .++++...|+. ++..+...+...+.++.+ .++| T Consensus 500 ~~~~~~~~~~~~~~~~~~s~~~~~~g~~~~~~~~~~~~~~~~~~ 543 (576) T protein:vir:96 500 DMIQQFLNSPDDEEPQQESTEDKVDGRESNDPTKIDSPVGTDGQ 543 (576) T ss_pred cccccccCCCCCCCCCCCCCCCcccccccccCCCCCCccccccc Confidence 01111111111 111111111111111111 2222 No 71 >protein:vir:99452 Length: 651 # NCBI annotation: hypothetical protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919077;genbank:gi:119757035;genbank:GeneID:4606105 Probab=97.68 E-value=2.9e-05 Score=45.45 Aligned_cols=452 Identities=12% Similarity=0.095 Sum_probs=195.9 Q ss_pred CCCcc-chheeeeehhhhhhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHh Q lcl|NC_015263. 1 MVKNK-KKRLSMIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAV 79 (513) Q Consensus 1 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~ 79 (513) |-..| +-+-..|-++ ....-++...+|. +.++.+-....+ +-+|. |-=+.++| +.+++. T Consensus 1 ~~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~-~~~~~~~~~~~~--~~~~~---p~~~~~~L---~~~~e~ 60 (651) T protein:vir:99 1 MTDTTGETQETKVHVE-----------GLGGEADLAKSPN-STQIPDHRIQSH--NVGVN---PPYNPDRL---AAFLEL 60 (651) T ss_pred CCCccceeeeeEEEee-----------ccccccccccccc-ccccchhhhccc--CCCCC---CCCCHHHH---HHHHhc Confidence 44333 1112222222 2334566677776 233322111111 11122 43355554 455556 Q ss_pred hcchHHHHHHHHhhcccccceEeeccchhhhhhcchhHHHHHHHHHHh-----------hcChhHHHHHHHHHHHHhcce Q lcl|NC_015263. 80 QSQQYQRLLNFYANMPLYAYSVVPFKDISTANENKLKKELATVTEFLS-----------RLNPKYNFSKIVKLAMTVDIF 148 (513) Q Consensus 80 ~sg~~~rlidy~~~mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~-----------k~n~k~~~~~i~~~~l~~g~~ 148 (513) +.-+++-+--|=.++-.|-+-|.|...-....... +...++-..++ .+|....+..++..++.+-.. T Consensus 61 ~~~~~~~i~~~~~~iag~g~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~l~~~~~~~n~~~t~~~i~~~~~~Dle~ 138 (651) T protein:vir:99 61 NETLATGIRKKSRYEVGFGFDLVPAQGVDGDDASD--AQREVARNFWRGRSSRWQTGPNQAKTPATPERVKELARQDYHG 138 (651) T ss_pred ChHHHHHHHHHhhhhhccCceeeecccCCCCccch--HHHHHHHHHhhccchhhcccccccCCCCCHHHHHHHHHHHHHH Confidence 66666666666677899999999844332222222 22223333333 345455555666555443333 Q ss_pred eE--EE--E-E-cCcceeeeecCcceeEEEEEECCeeEEEEEeeeccCcchhccccHHHHHHH----------------- Q lcl|NC_015263. 149 YG--YV--I-D-DKESVMIQQFPNDICKISSVSGGVYNYVIDLDALVSADIVDYYPKEIQEAV----------------- 205 (513) Q Consensus 149 ~g--y~--i-~-d~~~~~iq~lp~dyckIsg~~nG~y~~~fD~syFd~~~~L~~~p~Ei~~~y----------------- 205 (513) +| |. + + .+.++.+..+|+.+.+.......+-.. +. ..|...|...+..+ T Consensus 139 tGna~ieiIrn~~g~pv~L~~lp~~~~Rv~~~~~~~~~~---~~-----~ll~~~pn~~~~~~~~~~~~q~~~~~~~~~~ 210 (651) T protein:vir:99 139 VGWLALEMLTDIEGRPVGLAYVPARTVRVRRPQNRFDQP---RH-----PEEGRYVDGDVADIASRGYVQIRNGNRRYFG 210 (651) T ss_pred HhhHhhhhhhcCccchhhhhhcChhheeeecccccccch---hh-----hhhhcccccccchhHHHHHHHHHhcCcceEE Confidence 33 22 1 2 234456667888877765332111000 00 01111111100000 Q ss_pred ---HHHh------------------hhh------hccCcccccCe--------eecCCceEEEEec--CccccchhhHHH Q lcl|NC_015263. 206 ---NKYT------------------TMK------KGNNKSASNWY--------EIQDKNSICIKIN--ESSLTPVPPFAG 248 (513) Q Consensus 206 ---~~Y~------------------~~k------~~~~~~~~~W~--------~L~~~kt~~ik~~--~~~~~~ip~f~~ 248 (513) +.|. .+. ...+...+.|. .++...-+-|+.. .+.+.|+||... T Consensus 211 ~~g~~~~~~~~~~~~~~~~v~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~eViHir~~~~~~g~~G~spl~~ 290 (651) T protein:vir:99 211 EAGDRYRGQEVVIDESGDEPTIRYREDEESEREPIFVDRETGDVTTGDANGLENRPANELIFIPNPSILEDDYGVPDWVS 290 (651) T ss_pred EeeccccceeeeeccCCcceeEEeccCcceeeeeecccceeeeEEEcCCCceeEecccceEEecCCCCCCCcccccHHHH Confidence 0000 000 00000011121 2333333445532 345579999999 Q ss_pred HHHhHHHHHHHHHHHhhHhhhhhceeeee--eeccccCCCCCccccCHHHHHHHHHHHHHhccc--cceEEEe------- Q lcl|NC_015263. 249 TFDSIYDIHSFKDLRNDKAELQNYKLLIQ--KLETRSSNDNNDFTLDMPMMNYFHEALSMTVPD--NVGVVTS------- 317 (513) Q Consensus 249 v~~d~~di~~~kdL~~~~~~i~n~~ii~~--kip~~~~n~~~~~~vd~~~~~~~~~~ik~~Lp~--gv~~v~s------- 317 (513) +...+.-....++.... -..|-...-. ++| + ..++.++++.+.+.+++..-+ .+..+-. T Consensus 291 a~~~i~~a~~a~~~~~~--~f~NG~~p~gil~~~----~----~~ls~e~~~~lr~~~~~~~~nagk~~vL~~~~~~~~~ 360 (651) T protein:vir:99 291 AIRTISADEAAKDYNRD--FFDNDTIPRMVIKVT----G----GELSEESKRDLRQMLNGLREESHRAVVLEVEKFQSQL 360 (651) T ss_pred HHHHHHHHHHHHHHHHH--HHhccCCCceEEEec----C----CCCCHHHHHHHHHHHHHHhccCCceEEeecccccccc Confidence 88887666666665422 2222211111 223 1 237777777776666654411 1222222 Q ss_pred ----cccccccccccccc-cchh---hhhhHHhhhhhhhhhhhhccCC-CcchHHHHHHHHH-HHHHHHHHHHHHHHHHH Q lcl|NC_015263. 318 ----PMEIDTVSFDKDSS-TDDS---VEKATKNFWDNAGVSQILFSSD-NKTSQGIAMSIAT-DEQFIFGVINQLERWLN 387 (513) Q Consensus 318 ----P~~~d~i~ld~~~~-~~dt---v~~~~~~i~~~~GiS~~Lfn~d-~~s~~~~~~SI~~-d~~~~~~~~~~iE~~~N 387 (513) .+++..+++ +.. +.+. .+-..++|-.+.||...++|-. +.+++.+-..... ...-+.-++.+||.++| T Consensus 361 ~~~~g~~~~pls~--~~~~D~qfle~r~~~~~eIa~afgVPp~~lG~~~~~~~sn~E~~~~~f~~~tL~P~~~~ie~eln 438 (651) T protein:vir:99 361 DEDVEIELEPMGQ--GISEEMDFRQFREKNEHEIAKVLEVPPVKIGVTDSANRSNSDQQDKDFALEVIQPEQHTFAEWLY 438 (651) T ss_pred cccCCceEEEcCc--CchhhHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 222222222 221 2222 3344577999999998888532 3333333333333 33334469999999999 Q ss_pred HHHhhcc---cceEEEEEecC--CCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCc Q lcl|NC_015263. 388 RYLLLNG---MSKYFKATMLE--VTHFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTP 462 (513) Q Consensus 388 ~~i~~~~---~~~~f~~~~l~--~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~P 462 (513) +.|-... .+..++|.|.. +..-+.+...+.+..+.+-|.-...-.=+.+|+.|..- + ..+..+.| T Consensus 439 ~kLl~~~e~~~~~~i~~ef~~~~llr~D~~~~~e~~~~~i~~G~~T~NE~R~~lglppi~~--------~--~gd~~l~~ 508 (651) T protein:vir:99 439 QIIHQQALGVTDWTIEYELRGADQPKQEAQLAEQRVRAMRLAGVGLVDEAREELGLDPLGE--------P--YGEMTLSE 508 (651) T ss_pred HhhcCccccccCceEEEEeccchhhhccHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--------c--cccccccc Confidence 9885432 23345555543 44456677777777777777433333334467777421 1 12445666 Q ss_pred ccccccccccccccCCccccCCCCcC--CCCcccccccCCCCCCCC---CC-ccCCC Q lcl|NC_015263. 463 LSSSFNTSGSDIAENAIKEKGKENGR--PTNETTGNKDSDETQRAK---DK-PANTQ 513 (513) Q Consensus 463 l~TS~T~Sg~~~~~~~~~~~~~~~gr--Pt~et~~n~~~~~~~~~~---d~-~~~~~ 513 (513) .++... | + . ...++..+. |.+++.. .++....-++ .. +-+-+ T Consensus 509 ~~~~~~--g-~--~---~~gge~~~~~~~~~~~~~-~~~e~~~~~~~~~~~e~~~~~ 556 (651) T protein:vir:99 509 FEAEVA--G-D--V---AGGGETEAVHEPPEENKI-GEREWDTVKSELTTKDPIEQM 556 (651) T ss_pred cccccc--c-c--c---ccCCCCcccccCcccccc-ccchhhhhhhhhcccchhhhh Confidence 654322 1 1 0 000111111 1111111 1110000000 00 00011 No 72 >protein:vir:1884 Length: 424 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037664;genbank:gi:9634122;genbank:GeneID:1262519 Probab=97.64 E-value=3.4e-05 Score=45.09 Aligned_cols=392 Identities=13% Similarity=0.096 Sum_probs=180.3 Q ss_pred CCCccchheeeeehhhhhhHHHHHHHHHHHHHhhc--------cCcccccccccccchHHHHHHHhhhccChhHHHHHHH Q lcl|NC_015263. 1 MVKNKKKRLSMIDVESISSYSNKRNNRISILRDDN--------RTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRK 72 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~--------~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~ 72 (513) |-.- .||==.+.+..+|.-+. .+|...+.... .+.. .++..- +-.....+ T Consensus 1 ~~~~--------------~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~----~~~~~~-~v~~~~al-- 58 (424) T protein:vir:18 1 MEEP--------------KYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGP-VSAH----GHLGDS-SINDERIL-- 58 (424) T ss_pred CCCC--------------cceEeecCCCchHHHHHhhhcccccccccccccccc-cccc----cccccc-cccHHHhh-- Confidence 2211 12211222233332222 12221111110 0000 000000 01111112 Q ss_pred HHHHHHhhcchHHHHHHH----HhhcccccceEeeccchhhhhhcchhHHHHHHHHHHh-h----cChhHHHHHHHHHHH Q lcl|NC_015263. 73 VSEDLAVQSQQYQRLLNF----YANMPLYAYSVVPFKDISTANENKLKKELATVTEFLS-R----LNPKYNFSKIVKLAM 143 (513) Q Consensus 73 ~s~~lY~~sg~~~rlidy----~~~mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~-k----~n~k~~~~~i~~~~l 143 (513) ..+.+.+.|+. +++||.-=|.... +.. +..+..+ +.+...|. + +..-.+...++..++ T Consensus 59 -------~~~~v~~cv~~Ia~~iA~lp~~~~~~~~----~~~-~~~~~~~-~~l~~lL~~~PN~~~t~~~f~~~~~~~ll 125 (424) T protein:vir:18 59 -------QISTVWRCVSLISTLTACLPLDVFETDQ----NDN-RKKVDLS-NPLARLLRYSPNQYMTAQEFREAMTMQLC 125 (424) T ss_pred -------ccHHHHHHHHHHHHhhccCceEEEEeec----CCc-eeeeccc-cHHHHHHhhccCCCCCHHHHHHHHHHHHh Confidence 22334444444 4455543222221 111 1111111 12333343 2 335666788888999 Q ss_pred HhcceeEEEEEcCc--ceeeeecCcceeEEEEEECCeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCccccc Q lcl|NC_015263. 144 TVDIFYGYVIDDKE--SVMIQQFPNDICKISSVSGGVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASN 221 (513) Q Consensus 144 ~~g~~~gy~i~d~~--~~~iq~lp~dyckIsg~~nG~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~ 221 (513) ..|..|.++.-+.. .+.+.++|+..|.|. ..+|.+.|.+.. . .. T Consensus 126 l~Gnay~~i~r~~~G~~~~L~pl~~~~V~v~-~~~~~~~y~~~~----~-----------------------------g~ 171 (424) T protein:vir:18 126 FYGNAYALVDRNSAGDVISLLPLQSANMDVK-LVGKKVVYRYQR----D-----------------------------SE 171 (424) T ss_pred hcCCeEEEEEECCCCcEEEEEEecCcceEEE-EcCCeEEEEEEe----C-----------------------------Ce Confidence 99999999886654 468999999999985 345555554321 0 11 Q ss_pred CeeecCCceEEEE-ecCccccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHH Q lcl|NC_015263. 222 WYEIQDKNSICIK-INESSLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYF 300 (513) Q Consensus 222 W~~L~~~kt~~ik-~~~~~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~ 300 (513) .++++++.-+.|+ .+.+.+.|++|...+...+--....++.. ..-..|-.-.-..|-+. + -.++.++++++ T Consensus 172 ~~~~~~~eIih~r~~~~dg~~G~spi~~~~~~i~~~~a~~~~~--~~~f~ng~~p~gil~~~--~----~~l~~e~~~~~ 243 (424) T protein:vir:18 172 YADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQ--RDFFANGAKSPQILSTG--E----KVLTEQQRSQV 243 (424) T ss_pred EEEeccccEEEecCcCCCCcccccHHHHHHHHHHHHHHHHHHH--HHHHHccCCcceEEEeC--C----cCCCHHHHHHH Confidence 2344554455565 33455678888876654433223333322 11122211111111110 1 13566666666 Q ss_pred HHHHHHhc--c--ccceEEEeccccccccccc-ccccchhhhhhHHhhhhhhhhhhhhccCCCc-c---hHHHHHHHHHH Q lcl|NC_015263. 301 HEALSMTV--P--DNVGVVTSPMEIDTVSFDK-DSSTDDSVEKATKNFWDNAGVSQILFSSDNK-T---SQGIAMSIATD 371 (513) Q Consensus 301 ~~~ik~~L--p--~gv~~v~sP~~~d~i~ld~-~~~~~dtv~~~~~~i~~~~GiS~~Lfn~d~~-s---~~~~~~SI~~d 371 (513) .+.+++.. + .++..+-..+++..+.+.. +..--++.+-..++|..+.||...++|.... + ++.-...+.-- T Consensus 244 ~~~~~~~~~g~nag~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~eq~~~~f~ 323 (424) T protein:vir:18 244 EENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFL 323 (424) T ss_pred HHHHHHHhCCcccCCceeccCCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccccHHHHHHHHH Confidence 66666544 1 2355555555665555531 1111233445558899999999999864322 2 22223333333 Q ss_pred HHHHHHHHHHHHHHHHHHHhh--cccceEEEEEecCCCCccHHHHHHHHHHHHhcCC-cHHHHHHHHhCCCHHHHHHHHH Q lcl|NC_015263. 372 EQFIFGVINQLERWLNRYLLL--NGMSKYFKATMLEVTHFSKKEAHDRYITDAQYGF-PVKVYLASLMGIDPVAFTGLLK 448 (513) Q Consensus 372 ~~~~~~~~~~iE~~~N~~i~~--~~~~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~-~~~~~laa~~G~~p~~~~~~~~ 448 (513) ..-+.-++++||..+|+.|-. ...+..|+|.+-....-+.++..+.+.++.+-|. ..=..-+ .+|+.|.+ T Consensus 324 ~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~-~~gl~pi~------ 396 (424) T protein:vir:18 324 QYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRR-TDNLPPLP------ 396 (424) T ss_pred HHHHHHHHHHHHHHHHhhcCCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHH-HhCCCCCC------ Confidence 333446999999999998843 3234456776666666788888888888776663 2222222 34444421 Q ss_pred HHHHhhCcccccCcccccccccccccccCCccccCCCCcCCCCccc Q lcl|NC_015263. 449 VENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETT 494 (513) Q Consensus 449 ~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~ 494 (513) |=|..+.|+...-. .. ...+..|+++.- T Consensus 397 ------gGD~~~~~~n~~~l--~~----------~~~~~~p~~~ga 424 (424) T protein:vir:18 397 ------GGDVAMRQSQYVPI--TD----------LGTNKEPRNNGA 424 (424) T ss_pred ------CcCeeeeccCccch--Hh----------hhccCCCccCCC Confidence 12333333321110 00 001111211100 No 73 >protein:vir:80796 Length: 574 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504121;genbank:gi:158079308;genbank:GeneID:5666445 Probab=97.60 E-value=4e-05 Score=44.72 Aligned_cols=455 Identities=9% Similarity=0.000 Sum_probs=188.4 Q ss_pred CCCccchheeee--ehhhhhhHHHHH-HHHHHHHHhhccCccccccccccc-chHHHHHH--Hh-------hhc------ Q lcl|NC_015263. 1 MVKNKKKRLSMI--DVESISSYSNKR-NNRISILRDDNRTPVFGAPVGSLT-SSQSKVRK--IV-------KEY------ 61 (513) Q Consensus 1 ~~~~~~~~~~~~--~~~~~~~~~~~~-~~~~~i~~~~~~~~~~~s~~~s~~-~s~d~~k~--~i-------~~~------ 61 (513) |-|--.|-|..- .||..---+|+. -++..--+-+|. ++|+.- .++..-.+ .. -++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (574) T protein:vir:80 1 MPKWLDKALGIEKSSIEETRNMENYKMHLREIDTNVVNN------EPYSMESIEKGMNGKTTAYMQPIIGEMSVNPGYKT 74 (574) T ss_pred CcchhhhhhccchhhHHHHHhhhhhccccchhhhhhhhc------cCCCHHHHHHhHhhhcccccchhhhhccccccccC Confidence 554444444321 122222223332 111111111111 122211 01100000 00 000 Q ss_pred c-ChhHHHHHHHHHHHHHhhcchHHHHHHHHhh------------cccccceEeeccchhhhhhcchhHHHHHHHHHHhh Q lcl|NC_015263. 62 R-NEGNQKTLRKVSEDLAVQSQQYQRLLNFYAN------------MPLYAYSVVPFKDISTANENKLKKELATVTEFLSR 128 (513) Q Consensus 62 ~-P~~n~~~ir~~s~~lY~~sg~~~rlidy~~~------------mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k 128 (513) + -..|-..+.++..- |..+.++++.|+.+.+ +..+.+.|+......... +.....-+.+..+|.. T Consensus 75 ~~~~~~~~~~~~~l~~-~~~~~iv~~~i~~~~~~V~~~~~~i~~~ia~lp~~i~~kd~~~~~~-~~~~~~~~~l~~ll~~ 152 (574) T protein:vir:80 75 KPSIRNSQDLHKTLKK-FGNNIILNAIINTRSNQVSMYCKPARNSETGVGYEIRLKDIEAEPT-SHDIANIKRIESFLEN 152 (574) T ss_pred cCccCCcccHHHHHHh-hccChhHHHHHHHHHHHHHHHHHHHHhhhccCceEEEEeccCCCcc-chhhhhhhHHHHHHhc Confidence 0 00122234444332 2446777777776652 334556665433221111 1112222234445543 Q ss_pred cC---------hhHHHHHHHHHHHHhcceeEEEEEcCc--ceeeeecCcceeEEEEEECCeeEEEEEeeeccCcchhccc Q lcl|NC_015263. 129 LN---------PKYNFSKIVKLAMTVDIFYGYVIDDKE--SVMIQQFPNDICKISSVSGGVYNYVIDLDALVSADIVDYY 197 (513) Q Consensus 129 ~n---------~k~~~~~i~~~~l~~g~~~gy~i~d~~--~~~iq~lp~dyckIsg~~nG~y~~~fD~syFd~~~~L~~~ 197 (513) .+ ...++..++.+++..|..|.+.+-+.+ .+-+.++|+.+|+|.--.+|.+...- ..| T Consensus 153 ~~~~~nP~~~s~~ef~~~lv~~lll~Gnayi~i~r~~~G~~~~L~pl~p~~V~v~~d~~~~~~~~~-~~y---------- 221 (574) T protein:vir:80 153 TAQFRDPNRDNFTTFCKKLVRATYMYDQVNFEKVFDKDGNFIKFDTVDPTTIFLATNGEGKLIKNG-ERF---------- 221 (574) T ss_pred cCCCCCCccccHHHHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCccccccCc-eEE---------- Confidence 22 234677888888999999998887654 46789999999999743333110000 000 Q ss_pred cHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEEecCc-----cccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhc Q lcl|NC_015263. 198 PKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKINES-----SLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNY 272 (513) Q Consensus 198 p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~~~~-----~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~ 272 (513) |. ...+.....++...-+-|+.+.. ..+|+||...+...+--....++... .-..|- T Consensus 222 ----------~~------~~~g~~~~~~~~~eiih~~~~~~~~~~~~~~G~spi~~a~~~i~~~~~a~~~~~--~~f~ng 283 (574) T protein:vir:80 222 ----------VQ------VIDNRIVAKFNERELAFAVRNPRADIEVGQYGYPELEIALKQFIAHENTEVFND--RFFSHG 283 (574) T ss_pred ----------EE------EeCCceEEEEccccEEEEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHH--HHHhcc Confidence 00 00001123445544444554322 23588888776655544444444221 122221 Q ss_pred eeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhc--cccce--EEEec--ccccccccccccccc---hhhhhhHHhh Q lcl|NC_015263. 273 KLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTV--PDNVG--VVTSP--MEIDTVSFDKDSSTD---DSVEKATKNF 343 (513) Q Consensus 273 ~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~L--p~gv~--~v~sP--~~~d~i~ld~~~~~~---dtv~~~~~~i 343 (513) ...-.-|-+ .+.-.++.++++.+.+.+.+.. .++-+ .|+.+ +++..+.+. ..+. .+..-..+.| T Consensus 284 ~~p~gil~~-----~~~~~ls~e~~~~lk~~~~~~~~G~~n~g~~~vl~~~G~~~~~l~~s--~~D~qfle~~~~~~~~I 356 (574) T protein:vir:80 284 GTTRGILHV-----KTGQQQSQQALDIFRREWRSSLAGINGSWQIPVVSAEDVKFVNMTPS--ANDMQFEKWLNYLINVI 356 (574) T ss_pred CCCceEEEe-----CCCCCCCHHHHHHHHHHHHHHhccccccccceeecCCCceEEEccCC--hhHHHHHHHHHHHHHHH Confidence 111222222 1122477788877777777665 12222 23332 343344332 2222 3344466889 Q ss_pred hhhhhhhhhhccCCCcc-----------hHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccceEEEEEecCCCCccH Q lcl|NC_015263. 344 WDNAGVSQILFSSDNKT-----------SQGIAMSIA-TDEQFIFGVINQLERWLNRYLLLNGMSKYFKATMLEVTHFSK 411 (513) Q Consensus 344 ~~~~GiS~~Lfn~d~~s-----------~~~~~~SI~-~d~~~~~~~~~~iE~~~N~~i~~~~~~~~f~~~~l~~T~fn~ 411 (513) -.+.||...++|-...+ .+.+...-. --..-+.-++.+||..+|+.|-... ...+.|.|.+...-.+ T Consensus 357 a~afgVPp~~lG~~~~~t~~gs~~~~~n~sn~E~~~~~f~~~tL~P~~~~ie~~ln~~Ll~~~-~~~~~~~f~~~d~~~~ 435 (574) T protein:vir:80 357 SALYGIDPAEINFPNNGGATGSKGGSLNEGNSKEKMQASQNKGLQPLLRFIEDTVNTYIVAEF-GEKYQFQFRGGDLSAQ 435 (574) T ss_pred HHHhCCCHHHhcccccccccccccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhc-CCceEEEecccchhhH Confidence 99999999888633211 122222222 2222333589999999999875432 2335555555443333 Q ss_pred HHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCcccccccccccccccCC--------cc--- Q lcl|NC_015263. 412 KEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENA--------IK--- 480 (513) Q Consensus 412 ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~--------~~--- 480 (513) .+..... +. +.+ -.++|.|+-.++-+|- +=|-|..+.|+.....-....+.... +. T Consensus 436 ~~~~~~~-~~----------~~~-G~lT~NE~R~~lgl~P-i~gGD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~ 502 (574) T protein:vir:80 436 LDKLKII-EQ----------EGK-VFRTVNEIRHDKGLEP-IKGGDVILNGVHIQAIGQALQEEQLEYQRSQDRLNRLLE 502 (574) T ss_pred HHHHHHH-HH----------HhC-CccCHHHHHHHhCCCC-CCCCCEeeeccceeecccccccccCCccchhcccccccc Confidence 3333221 11 111 1346666665544432 11334455543221110000000000 00 Q ss_pred -ccCCCCcCC-CCcccccc-------cCCCCCCCCCCccCCC Q lcl|NC_015263. 481 -EKGKENGRP-TNETTGNK-------DSDETQRAKDKPANTQ 513 (513) Q Consensus 481 -~~~~~~grP-t~et~~n~-------~~~~~~~~~d~~~~~~ 513 (513) ..+++++.| +.+..... +.-+..+++++-..+| T Consensus 503 ~~~~~~~~~~~~~p~~~~~d~~~~~~~~~~~~~~~~~~~~~~ 544 (574) T protein:vir:80 503 LSGGDVEQPEPEEPKDSQNDTDVSFQDEQQGLNGKSKKVNGK 544 (574) T ss_pred ccCCCCCCCCCCCCCCccccccchhhhhhhhhccchhhhcCC Confidence 000111111 11101100 1111122233333333 No 74 >protein:vir:100882 Length: 383 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358762;genbank:gi:78000027;genbank:GeneID:3726153 Probab=97.59 E-value=4.1e-05 Score=44.64 Aligned_cols=363 Identities=11% Similarity=0.069 Sum_probs=162.8 Q ss_pred eeeeehhhhhhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhcchHHHHH Q lcl|NC_015263. 9 LSMIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQQYQRLL 88 (513) Q Consensus 9 ~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~~~rli 88 (513) ..+++- +..+ +..+. +.... +. ..++.-... -....-++.--+-.++.+.+.| T Consensus 1 Mg~~~~--------------~~~~--k~~~~---~~~~~--~~---~~~~~~~~~---~~~~~~v~~~~~l~~~~v~~~i 53 (383) T protein:vir:10 1 MGLLTP--------------KNFS--KRNAK---NMVYP--SN---PAFFTTTVG---GMQLSYVSALSALQNTNVYSVI 53 (383) T ss_pred CCcccc--------------cccc--ccccc---ccccc--cc---hhhhhhhcc---CccccccchhHhhcchHHHHHH Confidence 111110 0000 00000 00000 00 011110000 0011112222233456777888 Q ss_pred HHHhh-cccccceEeeccchhhhhhcchhHHHHHHHHHHhhcC----hhHHHHHHHHHHHHhcceeEEEEEcCcceeeee Q lcl|NC_015263. 89 NFYAN-MPLYAYSVVPFKDISTANENKLKKELATVTEFLSRLN----PKYNFSKIVKLAMTVDIFYGYVIDDKESVMIQQ 163 (513) Q Consensus 89 dy~~~-mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n----~k~~~~~i~~~~l~~g~~~gy~i~d~~~~~iq~ 163 (513) +.+++ +..+...+. +. .....|+.-| ..++...++..++..|..|.+++.+ . +.. T Consensus 54 ~~ia~~ia~~~~~~~-----~~-----------~~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~~~--~--~~~ 113 (383) T protein:vir:10 54 NRIASDVSSAHFKTE-----NT-----------ATLNRLESPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQ--N--LEH 113 (383) T ss_pred HHHHHhhccCceeec-----cc-----------chhhhhhCCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcC--c--eeE Confidence 88766 444444442 11 1123454443 5666788889999999999998754 3 334 Q ss_pred cCcceeEEEEEECC-eeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEEecC----c Q lcl|NC_015263. 164 FPNDICKISSVSGG-VYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKINE----S 238 (513) Q Consensus 164 lp~dyckIsg~~nG-~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~~~----~ 238 (513) +|++.++|.-..++ .+.|.+.... .. .=++++...-+.|+... + T Consensus 114 ~p~~~~~v~~~~~~~~~~~~~~~~~---~~----------------------------~~~~~~~~evih~r~~~~~~~~ 162 (383) T protein:vir:10 114 IPNSDVQINYLPGNMGIVYTVLESN---DR----------------------------PKMVLRQDQMLHFRLMPDPQYR 162 (383) T ss_pred eecCcceEEEEEcCCceEEEEEEcC---Cc----------------------------eEEEEcccceEEeccCCCCccc Confidence 56666665543332 2222211110 00 01223343445555322 1 Q ss_pred cccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhcc----ccceE Q lcl|NC_015263. 239 SLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTVP----DNVGV 314 (513) Q Consensus 239 ~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~Lp----~gv~~ 314 (513) ...|+||...+...+--....++... .-..|-...-..+-+ ++ ...+.++++.+.+.+++..- .++.. T Consensus 163 ~~~G~s~l~~~~~~i~~~~~~~~~~~--~~f~ng~~~~~il~~-----~~-~~~~~e~~~~~~~~~~~~~~~~n~~~~~v 234 (383) T protein:vir:10 163 YLIGRSPLESLQNALNLDDKASKSNM--SAMENQINPAGKLTI-----SN-YLSDGKDLESAREEFEKANTGDNSGRLMV 234 (383) T ss_pred ccccccHHHHHHHHHHHHHHHHHHHH--HHHhccCCcceEEEe-----CC-CCCCHHHHHHHHHHHHHHhCccccCCccc Confidence 23588888876666554444444321 222222111122222 11 12355666666666665541 12334 Q ss_pred EEeccccccccccccccc--chhhhhhHHhhhhhhhhhhhhccCCCcc-hHHH---HHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015263. 315 VTSPMEIDTVSFDKDSST--DDSVEKATKNFWDNAGVSQILFSSDNKT-SQGI---AMSIATDEQFIFGVINQLERWLNR 388 (513) Q Consensus 315 v~sP~~~d~i~ld~~~~~--~dtv~~~~~~i~~~~GiS~~Lfn~d~~s-~~~~---~~SI~~d~~~~~~~~~~iE~~~N~ 388 (513) +-..++++.+.++..... .++.+-..++|..+.||...++|+..++ +.+. +.... -..-+--++++||..+|+ T Consensus 235 l~~g~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~~~sn~eq~~~~-~~~~l~P~~~~ie~~l~~ 313 (383) T protein:vir:10 235 LPDGFDYTQLEMKTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQHSNIDQIKAT-YLANLNSYVNPIVDELRL 313 (383) T ss_pred cCCCceEEecCCChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCCccccHHHHHHH-HHHHHHHHHHHHHHHHHH Confidence 444455555544322111 1234445688999999999998764322 1111 11111 111233588889999998 Q ss_pred HHhhcccceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCccccccc Q lcl|NC_015263. 389 YLLLNGMSKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFN 468 (513) Q Consensus 389 ~i~~~~~~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T 468 (513) .|-. ..|+|.+-+....+.++.++.+.++.+-|.=..--+=+.+|+.|. | -.|.+.-..| ++.+ T Consensus 314 ~l~~----~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~nE~R~~lg~~p~--------~--~~d~~~~~~~--~~~~ 377 (383) T protein:vir:10 314 KMNA----PDLELDIKDMLDVDDSILINQVSNLAKSGVLGAEQAQFILTRSGF--------L--PDNLPEFKPL--TNET 377 (383) T ss_pred hhCC----ceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCcc--------c--CCcccccCCC--cccC Confidence 8743 357777777777888999888888777663222212223444442 1 1111110000 0000 Q ss_pred ccccccccCCccccCCCCcCCCCcccccccCCCCC Q lcl|NC_015263. 469 TSGSDIAENAIKEKGKENGRPTNETTGNKDSDETQ 503 (513) Q Consensus 469 ~Sg~~~~~~~~~~~~~~~grPt~et~~n~~~~~~~ 503 (513) =| |+.+ T Consensus 378 -~g----------------------------Gd~e 383 (383) T protein:vir:10 378 -KG----------------------------GDDK 383 (383) T ss_pred -CC----------------------------CCCC Confidence 01 1111 No 75 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=97.56 E-value=4.4e-05 Score=44.45 Aligned_cols=396 Identities=10% Similarity=0.034 Sum_probs=172.6 Q ss_pred cCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhcc----------------------hHHHHHHHHhh Q lcl|NC_015263. 36 RTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQ----------------------QYQRLLNFYAN 93 (513) Q Consensus 36 ~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg----------------------~~~rlidy~~~ 93 (513) ++| -.-.+.+++.+..| ......++.+..|+....+ -.+.++|-.+. T Consensus 1 ~~~---------~t~~~~~~~l~~~~--~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~ 69 (456) T protein:vir:79 1 MTA---------STPAEWLPVLTKRI--DDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVAD 69 (456) T ss_pred CCC---------CCHHHHHHHHHHHH--HHHHHHHHHHHHHHhccCChhhcCcccChhhchhhhhhhcchHHHHHHHHHh Confidence 222 12233445555544 3333445555555554433 23333443333 Q ss_pred cccccceEeeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcceeEEEEEcCcc-eeeeecCcceeEEE Q lcl|NC_015263. 94 MPLYAYSVVPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDIFYGYVIDDKES-VMIQQFPNDICKIS 172 (513) Q Consensus 94 mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~~~gy~i~d~~~-~~iq~lp~dyckIs 172 (513) -..++-+-++-. .++......+ ..+...++......+.+.+++.|..|.+...+.++ ..+..++|.-|.++ T Consensus 70 ~l~~~g~~~~~~-----~d~~~~~~~~---~~~~~n~~d~~~~~~~~~a~~~G~a~~~~~~~edg~~~i~~~~p~~~~~i 141 (456) T protein:vir:79 70 RIIPNGITVGGS-----ADSDLALRAR---RIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVS 141 (456) T ss_pred hhccCCeecCCC-----CCccHHHHHH---HHHHhcChhHHHHHHHHHHhhcCeeEEEEeeCCCCceEEEEeccceeEEE Confidence 333333322111 1122223333 34556678888899999999999998877755443 46778888888665 Q ss_pred EE--ECCeeEEEEEeeeccCcc----h-hccccHHHHHH------HHHHhhhhhccCcccccCe---eecCCceEE--EE Q lcl|NC_015263. 173 SV--SGGVYNYVIDLDALVSAD----I-VDYYPKEIQEA------VNKYTTMKKGNNKSASNWY---EIQDKNSIC--IK 234 (513) Q Consensus 173 g~--~nG~y~~~fD~syFd~~~----~-L~~~p~Ei~~~------y~~Y~~~k~~~~~~~~~W~---~L~~~kt~~--ik 234 (513) -- ....+.++ +.+....+ . .-+++++.... +..+.... .......|. +.+.....| +. T Consensus 142 ~d~~~~~~~~~~--~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~pvv~ 217 (456) T protein:vir:79 142 VDPLQPWRIRSA--MRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRL--VTRISDSWVPVGDAVVTGSPPPVVV 217 (456) T ss_pred EcCCCCCceEEE--EEEEEecCCceeEEEEEcCCceEEEEEEEEeecccccee--eeccCCceeecccccCCCCceeEEE Confidence 32 11223233 22222111 0 11122221110 00100000 000111121 111111111 11 Q ss_pred ecCccccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHH-----HHHHHHHHHH--Hh Q lcl|NC_015263. 235 INESSLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMP-----MMNYFHEALS--MT 307 (513) Q Consensus 235 ~~~~~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~-----~~~~~~~~ik--~~ 307 (513) ++ .+.+++-|.++. ++.|..+.--. +.....+....-.-.+ . |-..+.+.+|.. ....|..... -. T Consensus 218 ~~--N~~~~gd~e~v~-~liD~~~~~~s-~~~~~~~~~a~~~~~~-~--G~~~~~~~~d~~g~~i~~~~~~~~~~~~~~~ 290 (456) T protein:vir:79 218 YQ--NPDGMGEVEPHI-DIINRINRAEL-QLLSTMAIQAFRQRAL-K--SSEHRLPKVDENGNAIDYASIFEAAPGALWE 290 (456) T ss_pred ec--CCCCCchhhhhH-HHHHHHHHHHH-HHHHHHHHHhhHHHHH-h--cCCcccccccccccccchhhhhhhhcccccc Confidence 22 122333333321 22222111101 1111111111100001 0 111122222211 1111111111 11 Q ss_pred ccccceEEEeccccccccccccc--ccchhhhhhHHhhhhhhhhhhhhccCC--CcchHHHHHHHHHHHHHHHHHHH--- Q lcl|NC_015263. 308 VPDNVGVVTSPMEIDTVSFDKDS--STDDSVEKATKNFWDNAGVSQILFSSD--NKTSQGIAMSIATDEQFIFGVIN--- 380 (513) Q Consensus 308 Lp~gv~~v~sP~~~d~i~ld~~~--~~~dtv~~~~~~i~~~~GiS~~Lfn~d--~~s~~~~~~SI~~d~~~~~~~~~--- 380 (513) +|.++-. -.|+... .-.+.+.....+|....|+..-.|+++ +.|+.+++.....-...+....+ T Consensus 291 ~~~~~~~---------~q~~~~~~~~~~~~l~~~i~~i~~~t~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~ 361 (456) T protein:vir:79 291 LPPGVDI---------WESQTNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAK 361 (456) T ss_pred CCCCcce---------eeecccChHHHHHHHHHHHHHHHhhcCCChhHhcccccCcHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2333211 1222211 112446666678888889988888775 55666667666655555544333 Q ss_pred -HHHHHHHHHHhhccc--ceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHH--HhhC Q lcl|NC_015263. 381 -QLERWLNRYLLLNGM--SKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVEN--EMLD 455 (513) Q Consensus 381 -~iE~~~N~~i~~~~~--~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~--e~L~ 455 (513) .|++-+...+..... ...+++.|-+..+-|..+.++.+.++.+-|.+....+...+|+++.++- +.+.|. +..+ T Consensus 362 ~~l~~~~~l~~~~~g~~~~~~i~v~w~~~~~~s~~~~ada~~kl~~~G~~~~~~~~~~lg~~~~~i~-~~e~~r~~~e~~ 440 (456) T protein:vir:79 362 IGLEAILVKALQIEGESVEDTVDVSFESPDRVTLGEKYSAASLAKAAGESWASIRRNILNYNADQIK-QDDLDRAREQIT 440 (456) T ss_pred HHHHHHHHHHHHhcCCCccccceEEeCCCCCcCHHHHHHHHHHHHhcCCChHHHHHhcCCCCHHHHH-HHHHHHHHHHHH Confidence 333333333333222 2258899999999999999999999999999999988889999998763 333322 1111 Q ss_pred cccccCcccccccccccccccCCccccCCCCcC Q lcl|NC_015263. 456 LPEIMTPLSSSFNTSGSDIAENAIKEKGKENGR 488 (513) Q Consensus 456 l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~gr 488 (513) . +.|.- .+. ...++-| T Consensus 441 ~------------~~~~~-~~~----~~~~~~~ 456 (456) T protein:vir:79 441 L------------FAGNP-VQR----PQEDGSR 456 (456) T ss_pred H------------HhhhH-hhc----CCCCCCC Confidence 0 11110 000 0001111 No 76 >protein:vir:81218 Length: 423 # NCBI annotation: gp3, phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456733;genbank:gi:157168376;interpro:IPR006427;interpro:IPR006944;uniprot:Q9MBK2;genbank:GeneID:5580341 Probab=97.53 E-value=5e-05 Score=44.17 Aligned_cols=390 Identities=12% Similarity=0.106 Sum_probs=179.5 Q ss_pred HHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhcchHHHHHHHHhh-cccccceE Q lcl|NC_015263. 23 KRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQQYQRLLNFYAN-MPLYAYSV 101 (513) Q Consensus 23 ~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~~~rlidy~~~-mpt~dY~I 101 (513) |-=++++-.|--..+......++... ... +..-..+ -+....+...+.+++.|+.++. +..+.-.+ T Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~---------~~~--~~~~~~~--~~~~~~~~~~~~v~~~i~~ia~~ia~lp~~~ 67 (423) T protein:vir:81 1 MGFLQKLGLAPSVVATPEPIELVGPI---------FES--LKLSTKN--MTVEQIWEDQPHLRTVTTFIARNVASLQLQA 67 (423) T ss_pred CchhHhhccccccccCcccccccccc---------ccc--cccccch--hhHHHHHHhhhHHHHHHHHHHHhHhhCceEE Confidence 22222221111111111011111100 000 0000000 0112234456666677766543 22222223 Q ss_pred eeccchhhhhhcchhHHHHHHHHHHhhcC----hhHHHHHHHHHHHHhcceeEEEEEcC----cceeeeecCcceeEEEE Q lcl|NC_015263. 102 VPFKDISTANENKLKKELATVTEFLSRLN----PKYNFSKIVKLAMTVDIFYGYVIDDK----ESVMIQQFPNDICKISS 173 (513) Q Consensus 102 ~P~~~~~~~~~~~~~~~y~~v~~~L~k~n----~k~~~~~i~~~~l~~g~~~gy~i~d~----~~~~iq~lp~dyckIsg 173 (513) +-- ..+... ... ++ +.+...|..=| ...+...++..++..|..|.++.-+. .-+.+.|+|+.+.++.. T Consensus 68 ~~~-~~dg~~-~~~-~~-~~~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~rd~~~~~~~~~l~p~~~~~v~~~~ 143 (423) T protein:vir:81 68 FER-VEDGGR-ERV-RE-GHLARVCKLANSDMTMYDLLERTMFDLCLYDEFFWLLPGDLGVDTPTLDIRPIPVSWVQRRA 143 (423) T ss_pred EEE-ecCCce-eee-cc-chHHHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCcCcceEEEeecccceeeeee Confidence 210 011111 111 12 12334455433 56666777888899999999887653 33567778888777766 Q ss_pred EEC--CeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEE-ecCccc-cchhhHHHH Q lcl|NC_015263. 174 VSG--GVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIK-INESSL-TPVPPFAGT 249 (513) Q Consensus 174 ~~n--G~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik-~~~~~~-~~ip~f~~v 249 (513) ..+ |.+.|.|.... .....++.++...-+-|+ .+.+.. .|++|...+ T Consensus 144 ~~~~~~~~~Y~~~~~~-----------------------------~~~g~~~~~~~~evih~r~~~~~~~~~G~spi~~~ 194 (423) T protein:vir:81 144 YKDGWGSLDYIIIESG-----------------------------DNDGRSVKVPGERVIHRHGYNPKTMKRGKSPVQSL 194 (423) T ss_pred ccCCCcceEEEEEEec-----------------------------CCCceEEEEcccceEEecCCCCCCccccccHHHHH Confidence 655 44455542111 001256677776555566 334444 488887776 Q ss_pred HHhHHHHHHHHHHHhhHhhhhhc---eeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhcccc------ceEEEeccc Q lcl|NC_015263. 250 FDSIYDIHSFKDLRNDKAELQNY---KLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTVPDN------VGVVTSPME 320 (513) Q Consensus 250 ~~d~~di~~~kdL~~~~~~i~n~---~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~Lp~g------v~~v~sP~~ 320 (513) ...+--....++.. ..-..|- ..++ +.+ .....-.++.++++.+.+.++.+.-.| +..+-..++ T Consensus 195 ~~~i~~~~~~~~~~--~~~f~ng~~p~gvi-~~~----~~~~~~~l~~e~~~~~~~~~~~~~~~~~~n~g~~~vl~~g~~ 267 (423) T protein:vir:81 195 RDILGEQIEAAIFR--AQMWRNGPRPGMVI-MRD----PESKAGKWDAESRTRFMANLRASFSPKSSDVGGTLLLEDGMK 267 (423) T ss_pred HHHHHHHHHHHHHH--HHHHhccCCCceEE-Eec----CcccCccCCHHHHHHHHHHHHHHhccccccCCcceecCCCce Confidence 65443333333221 1112331 1222 122 122233578888888888888776322 222223344 Q ss_pred ccccccccccccchh---hhhhHHhhhhhhhhhhhhccCC-CcchHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhhc-c Q lcl|NC_015263. 321 IDTVSFDKDSSTDDS---VEKATKNFWDNAGVSQILFSSD-NKTSQGI-AMSIATDEQFIFGVINQLERWLNRYLLLN-G 394 (513) Q Consensus 321 ~d~i~ld~~~~~~dt---v~~~~~~i~~~~GiS~~Lfn~d-~~s~~~~-~~SI~~d~~~~~~~~~~iE~~~N~~i~~~-~ 394 (513) +..+.+. ..+.+. ..-..++|..+.||...++|.. +.+++.+ +....--..-+.-++.+||..+|+.|-.. . T Consensus 268 ~~~l~~s--~~d~q~~e~~~~~~~eIa~~fgVPp~~lg~~~~~t~sn~e~~~~~f~~~~L~P~~~~ie~~l~~~L~~~~~ 345 (423) T protein:vir:81 268 AENFHTT--SKDEQTVETTKLSLQTVAQVYGINPTMVGQLDNANYSNVREFRKALYGDNLGSWIRIIQDVMNLFLLPRVG 345 (423) T ss_pred EEeccCC--hhhHHHHHHHHhhHHHHHHHhCCCHHHhcCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCccc Confidence 4444442 222233 3344577999999998888632 2222222 22222223334468899999999988542 1 Q ss_pred ---cceEEEEEecCCCCccHHHHHHHHHHHHh-cCC-cHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCcccccccc Q lcl|NC_015263. 395 ---MSKYFKATMLEVTHFSKKEAHDRYITDAQ-YGF-PVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNT 469 (513) Q Consensus 395 ---~~~~f~~~~l~~T~fn~ke~~~~~~~~~~-~G~-~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~ 469 (513) -+..|+|..-....-+.++..+.+.++.+ -|. .+=. +=+.+|+.|.+ |=|..+.|+- +. T Consensus 346 ~~~~~~~~~fd~~~llr~d~~~r~~~~~~~l~~~G~~T~NE-~R~~~gl~p~~------------gGD~~~~p~n---~~ 409 (423) T protein:vir:81 346 IDNEKFYFEFNLEEKLRASFEEAAEIKRAAVGNVAWMTINE-VRAMDNLPSID------------GGDDLARPLN---TE 409 (423) T ss_pred cccCccEEEecchhhhccCHHHHHHHHHHHHhCCCCcCHHH-HHHHhCCCCCC------------Ccceeecccc---cc Confidence 23456665555555677777777766553 242 2222 22234444432 3344555542 22 Q ss_pred cccccccCCccccCCCCcCCCCcc Q lcl|NC_015263. 470 SGSDIAENAIKEKGKENGRPTNET 493 (513) Q Consensus 470 Sg~~~~~~~~~~~~~~~grPt~et 493 (513) .+.. . +.+|.++ +| T Consensus 410 ~~~~---~------~~~~~~~-~t 423 (423) T protein:vir:81 410 FGDS---E------DAPGEEV-ET 423 (423) T ss_pred cCcc---C------CCCCCCC-CC Confidence 2210 0 0111110 11 No 77 >protein:vir:6382 Length: 553 # NCBI annotation: portal protein Lambda B # Family: family:all:47 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918995;genbank:gi:34610170;genbank:GeneID:2559575 Probab=97.51 E-value=5.4e-05 Score=43.99 Aligned_cols=458 Identities=10% Similarity=0.039 Sum_probs=220.4 Q ss_pred CCCccchheeeeehhhhhhHHHHHHHHHHHHHhhccCccccccc----cc--ccchHHHHHHHhhhccChhHHHHHHHHH Q lcl|NC_015263. 1 MVKNKKKRLSMIDVESISSYSNKRNNRISILRDDNRTPVFGAPV----GS--LTSSQSKVRKIVKEYRNEGNQKTLRKVS 74 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~----~s--~~~s~d~~k~~i~~~~P~~n~~~ir~~s 74 (513) |.+...+.++.+.... ..++...+ ...+.++.. ++ -..+.+...+ + ..+-+.||.-| T Consensus 1 m~~~~~r~~~~~a~~~--------~~~~~~~~---~~~y~gA~~~~r~~~~w~~~~~s~~~~-~-----~~~~~~lr~Ra 63 (553) T protein:vir:63 1 MTKVTVRKLSEVTSGR--------PEQSASLG---GGGLEGASRLSRETVSWNPSLRSPDAL-I-----NPLKRIADARG 63 (553) T ss_pred Ccchhhhhhccccccc--------chhhhhhh---cccccccccCCCcccccccCCCChHHH-H-----HHHHHHHHHHH Confidence 6666655554443221 11111111 112221111 01 0011111112 1 23578899999 Q ss_pred HHHHhhcchHHHHHHHHhhcccccceEeeccchhhh----hhcchhHHHHHH-----HHHH----------hhcChhHHH Q lcl|NC_015263. 75 EDLAVQSQQYQRLLNFYANMPLYAYSVVPFKDISTA----NENKLKKELATV-----TEFL----------SRLNPKYNF 135 (513) Q Consensus 75 ~~lY~~sg~~~rlidy~~~mpt~dY~I~P~~~~~~~----~~~~~~~~y~~v-----~~~L----------~k~n~k~~~ 135 (513) ++|+.++++.++.|+.+.+.--=. =|.|-...+.. ..++.-+.+.+. ..|- -+++.-... T Consensus 64 RdL~rNn~~a~~av~~~~~nvVG~-Gi~~~~~~~~~~l~g~~~~~~~~~~~~ie~~w~~wa~~~~~~~D~~g~~~f~~~q 142 (553) T protein:vir:63 64 RDMADNDGFTNGAVGYQRDSIVGA-QYRLNSMPDINVIPGATEEWAEEYQTIVEAKFELYAESLACYIDNAAISTFTGLI 142 (553) T ss_pred HHHHhcChHHHHHHHHHHHhhccC-CceeeeccchhhhcCCCHHHHHHHHHHHHHHHHHhcCCccceeeccccCCHHHHH Confidence 999999999999999888753333 22221110000 011111111111 1111 122334445 Q ss_pred HHHHHHHHHhcceeEEEEEcCc-----ceeeeecCcceeEEEE-EECCe-eEEEEEeeeccCcc-hhcc-----ccHHHH Q lcl|NC_015263. 136 SKIVKLAMTVDIFYGYVIDDKE-----SVMIQQFPNDICKISS-VSGGV-YNYVIDLDALVSAD-IVDY-----YPKEIQ 202 (513) Q Consensus 136 ~~i~~~~l~~g~~~gy~i~d~~-----~~~iq~lp~dyckIsg-~~nG~-y~~~fD~syFd~~~-~L~~-----~p~Ei~ 202 (513) .-+++..++.|..|.-.+...+ +.-+|-+++|.|.-.. ..+|. .+-.+ .||... .+.| -|.+.- T Consensus 143 ~l~~r~~~~dGE~~~~~~~~~~~~~~~~~~lq~ie~drl~~~~~~~~~~~i~~GV---E~d~~Gr~vaY~i~~~hPgd~~ 219 (553) T protein:vir:63 143 RLGVVGYVKTGEVLATAEWDRAANRPYATCFQMVSTDRLSNPYQQLDTPTLRRGV---QYDKRGRPQGYWIQVAHPGDLY 219 (553) T ss_pred HHHHHHHHhCCceEEEeeeccCCCCcccceEEEechhhcCCCCCCCCCCeeEeee---EECCCCceEEEEeeccCCCccc Confidence 6678889999999987665322 3568999999986442 12222 22222 334222 2222 132211 Q ss_pred HHHHHHhhhhhccCcccccCe------eecCCceEEEEecCccc---cchhhHHHHHHhHHHHHHHHHHHhhHhhhhhce Q lcl|NC_015263. 203 EAVNKYTTMKKGNNKSASNWY------EIQDKNSICIKINESSL---TPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYK 273 (513) Q Consensus 203 ~~y~~Y~~~k~~~~~~~~~W~------~L~~~kt~~ik~~~~~~---~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ 273 (513) .. .. .. ..|. .++- .-|.+-++.+.+ =|+|.|++++..+-++++|.+-....+.++.-. T Consensus 220 ~~------~~--~~---~~~~r~~~~~~v~a-~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~daeL~~a~i~A~~ 287 (553) T protein:vir:63 220 QM------AP--DM---YKWKFVQQSKPWGR-RQVIHILEPREPDQSRGIADIVSGLKDMRMAKRFKEMSLQNAVINASY 287 (553) T ss_pred cc------cc--cc---cceeeeccccccCh-hHheecccccCCCcccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhh Confidence 00 00 00 1122 2222 223333334433 389999999999999999999888888886632 Q ss_pred --eeeeeecc-------ccCCCCCccc----cCHHHHHHHHHH-HHHhccccceEEEeccccccccccccc----ccchh Q lcl|NC_015263. 274 --LLIQKLET-------RSSNDNNDFT----LDMPMMNYFHEA-LSMTVPDNVGVVTSPMEIDTVSFDKDS----STDDS 335 (513) Q Consensus 274 --ii~~kip~-------~~~n~~~~~~----vd~~~~~~~~~~-ik~~Lp~gv~~v~sP~~~d~i~ld~~~----~~~dt 335 (513) +|.+..|- .++..++... -..+.....++. ....|-.|....+.|= +.|+|-... +-... T Consensus 288 a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pG--e~i~~~~p~~p~~~~~~F 365 (553) T protein:vir:63 288 AAAIESELPPEFIHSQMSGGSPNADMVGIFGKYMDALKAYVGGANNIQIDGAKIPHLFPG--TKLNLKPMGTPGGVGSEF 365 (553) T ss_pred eeeeecCCChhhhhhhcccccccccccccccccccccccccccccceeecCceeeecCCC--CeeeecCCCCCCCCHHHH Confidence 22222221 0111111111 001111111111 1112212222223332 356663322 22355 Q ss_pred hhhhHHhhhhhhhhhhhhccCC--CcchHHHHHHHHHHH-------HH-HHHHHH-HHHHHHHHHHhhcccce------- Q lcl|NC_015263. 336 VEKATKNFWDNAGVSQILFSSD--NKTSQGIAMSIATDE-------QF-IFGVIN-QLERWLNRYLLLNGMSK------- 397 (513) Q Consensus 336 v~~~~~~i~~~~GiS~~Lfn~d--~~s~~~~~~SI~~d~-------~~-~~~~~~-~iE~~~N~~i~~~~~~~------- 397 (513) +......|-..+||+--++.+| +.|++++..++...- .+ +-.|.+ ..+.|+--.+..+.+.- T Consensus 366 ~~~~lr~iaaglGi~Ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~a~l~G~i~~p~~~~~~ 445 (553) T protein:vir:63 366 EASLNRHLASAFGMSYEEFTRDFSKANYSSIQAGIAMTRRFLEGRKKMCADRLATEFFTLWLEEAIAAGEVPMPPGQTRD 445 (553) T ss_pred HHHHHHHHHhhcCCCHHHHhhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCcccch Confidence 7777788889999995555555 568888877755322 21 123333 23446655554433310 Q ss_pred ------------EEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCcccc Q lcl|NC_015263. 398 ------------YFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSS 465 (513) Q Consensus 398 ------------~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~T 465 (513) ..+..-.+-...+..+-+..-+....-|+......++.+|.+|++++.+...|++.++=-.+..+... T Consensus 446 ~~~~p~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~t~~~~~a~~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~ 525 (553) T protein:vir:63 446 LFYQPLMKEALSKCEWIGASQGQIDQLKETQAAVMRIDAGLSTYEREIARLGGDFRKSFAQRAREDALLKKYGLTFNLSA 525 (553) T ss_pred hhcchhhhhhhhceeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCCCC Confidence 11233444456777777888899999999999999999999999999999999854432222222222 Q ss_pred cccccccccccCCccccCCCCcCCCCcccccccCCC Q lcl|NC_015263. 466 SFNTSGSDIAENAIKEKGKENGRPTNETTGNKDSDE 501 (513) Q Consensus 466 S~T~Sg~~~~~~~~~~~~~~~grPt~et~~n~~~~~ 501 (513) +.+...... ....+. ..|. ...+.+.++ T Consensus 526 ~~~~~~~~~--~~~~~~----~~~~--~~~~~~~~e 553 (553) T protein:vir:63 526 KRSLGDGRD--AATGIA----EDPA--AAQTSQQGE 553 (553) T ss_pred ccccCCCcc--cCCCCC----CCCC--CCCcccccC Confidence 222111110 000000 0010 011111111 No 78 >protein:vir:7407 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839924;genbank:gi:30089894;genbank:GeneID:1260681 Probab=97.50 E-value=5.5e-05 Score=43.95 Aligned_cols=376 Identities=9% Similarity=0.026 Sum_probs=172.4 Q ss_pred HHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhcc-ChhHHHHHHHHHHHHHhhcchHHHHHHHHhh-cccccce Q lcl|NC_015263. 23 KRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYR-NEGNQKTLRKVSEDLAVQSQQYQRLLNFYAN-MPLYAYS 100 (513) Q Consensus 23 ~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~-P~~n~~~ir~~s~~lY~~sg~~~rlidy~~~-mpt~dY~ 100 (513) |..=-.-.++-.+..|. +...+.....-. ..++-.++ ..... .++.--+-+.+.+.+.|+++++ +..+... T Consensus 1 m~m~~~~~~~~~~~~~~--~~~~~~~~~~~~-~~~~~~~~~~~~g~----~v~~~~al~~~~v~~~v~~ia~~ia~lp~~ 73 (392) T protein:vir:74 1 MILPILNFINQTNDPPE--AGSVQSYFPDGN-DAQIMESLLGDNNE----WVSARAALRNSDLFSIILQLSSDLAIVKIN 73 (392) T ss_pred CcchhhhhhhcccCccc--ccccccccccCc-hhhhhhhccCCCCc----ccchhhhhcchHHHHHHHHHHHhhccCcee Confidence 21111111111111111 000010000000 01111111 01000 1122223356788888888865 4555444 Q ss_pred EeeccchhhhhhcchhHHHHHHHHHHhhcC----hhHHHHHHHHHHHHhcceeEEEEEcCcc--eeeeecCcceeEEEEE Q lcl|NC_015263. 101 VVPFKDISTANENKLKKELATVTEFLSRLN----PKYNFSKIVKLAMTVDIFYGYVIDDKES--VMIQQFPNDICKISSV 174 (513) Q Consensus 101 I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n----~k~~~~~i~~~~l~~g~~~gy~i~d~~~--~~iq~lp~dyckIsg~ 174 (513) ++ +. . ....+++=| .-.+...++..++..|..|.+.+-+.++ ..+.++|+++|.|.-- T Consensus 74 ~~----~~-~-----------~~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~ 137 (392) T protein:vir:74 74 AE----KK-K-----------NQGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYF 137 (392) T ss_pred ec----cc-h-----------hhhhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEc Confidence 43 10 0 011344433 3556677788899999999999877554 6899999999998855 Q ss_pred EC-CeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEEec-Cc-cccchhhHHHHHH Q lcl|NC_015263. 175 SG-GVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKIN-ES-SLTPVPPFAGTFD 251 (513) Q Consensus 175 ~n-G~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~~-~~-~~~~ip~f~~v~~ 251 (513) .+ |.+.|.+...- ... ..-+.++.+.-+-|+.. .+ ...|+||...+.. T Consensus 138 ~~~~~~~y~~~~~~----~~~-------------------------~~~~~~~~~evih~~~~~~~~~~~G~s~i~~~~~ 188 (392) T protein:vir:74 138 EYENGMYYNITFDD----PKI-------------------------EPILQAPQSDLIHMKLLSIDGGKTGISPLYSLRR 188 (392) T ss_pred CCCceEEEEEEecC----Ccc-------------------------ceeEEEcCccEEEecCCCCCCccccccHHHHHHH Confidence 54 44444332111 000 01123444444445542 22 2468888877766 Q ss_pred hHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhc-cccceEEEecccccccccccc- Q lcl|NC_015263. 252 SIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTV-PDNVGVVTSPMEIDTVSFDKD- 329 (513) Q Consensus 252 d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~L-p~gv~~v~sP~~~d~i~ld~~- 329 (513) .+--....++.. ..-..|-...-.-|-+ +++...+.+.+.++-+....+- ..++..+-..+++..+.++.. T Consensus 189 ~i~~~~~~~~~~--~~~f~ng~~p~~il~~-----~~~~~~~~~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~~~~d 261 (392) T protein:vir:74 189 ESKIQRASDRLT--ISSLNSSLNVPGVLTV-----KGGGLLSDKDKASRSRSFMKRSRSGGPVVLDDLEEFTALEIKSNV 261 (392) T ss_pred HHHHHHHHHHHH--HHHHhccCCCceEEEe-----CCCCCchHHHHHHHHHHHhccccCCCeeecCCCceEEEccCChhH Confidence 654444444322 1222332211111112 1112233333444444333322 123333334555555555321 Q ss_pred cccchhhhhhHHhhhhhhhhhhhhccCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccceEEEEEecCCCCc Q lcl|NC_015263. 330 SSTDDSVEKATKNFWDNAGVSQILFSSDNKTSQGIAMSIATDEQFIFGVINQLERWLNRYLLLNGMSKYFKATMLEVTHF 409 (513) Q Consensus 330 ~~~~dtv~~~~~~i~~~~GiS~~Lfn~d~~s~~~~~~SI~~d~~~~~~~~~~iE~~~N~~i~~~~~~~~f~~~~l~~T~f 409 (513) ..--++.+-..++|..+.||...++|....+++........-..-+.-++++||..+|++|-.. +++.+-..... T Consensus 262 ~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~e~~~~~~~~~l~p~~~~ie~~l~~~l~~~-----~~~~~~~~~~~ 336 (392) T protein:vir:74 262 AQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQSSIQQISGMYASALNRYLRPAISELEYKLSDH-----ISVNMRPAIDP 336 (392) T ss_pred HHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhccch-----hcccchhhhcC Confidence 1212345556688999999999999765444443333333333334468889999999987432 33333333334 Q ss_pred cHHHHHHHHHHHHhcC-CcHHHHHH--HHhCCCHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccCCCC Q lcl|NC_015263. 410 SKKEAHDRYITDAQYG-FPVKVYLA--SLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKEN 486 (513) Q Consensus 410 n~ke~~~~~~~~~~~G-~~~~~~la--a~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~ 486 (513) +.++..+.+.++..-| +.+-...+ --.|+.|.++- ..| .++|.+ |++ + T Consensus 337 d~~~~~~~~~~l~~~g~~t~near~~~~~~g~~pne~r---~~e--------nl~~~~------~Gd------------~ 387 (392) T protein:vir:74 337 LGDNYLSTISTATRWGALAENQATFVLQEAGYIPKDLP---APE--------NTNKKT------TGQ------------S 387 (392) T ss_pred CHHHHHHHHHHHHhCCCcCHHHHHHHHHhCCCCccccc---hhc--------CCCCCC------CCC------------C Confidence 5666667766766666 23322221 11355553331 111 233432 222 1 Q ss_pred cCCCC Q lcl|NC_015263. 487 GRPTN 491 (513) Q Consensus 487 grPt~ 491 (513) ..|-+ T Consensus 388 ~~p~p 392 (392) T protein:vir:74 388 NEPVP 392 (392) T ss_pred CCCCC Confidence 22322 No 79 >protein:vir:960 Length: 413 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076614;genbank:gi:13095722;genbank:GeneID:920279 Probab=97.46 E-value=6.3e-05 Score=43.63 Aligned_cols=385 Identities=12% Similarity=0.101 Sum_probs=181.3 Q ss_pred eeehhhhhhHHHHHHH--HHHHHH---hhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhcchHH Q lcl|NC_015263. 11 MIDVESISSYSNKRNN--RISILR---DDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQQYQ 85 (513) Q Consensus 11 ~~~~~~~~~~~~~~~~--~~~i~~---~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~~~ 85 (513) |--++.+.--.||.=. .+-... -....+...+..+. ...++.......+. ....++.+. T Consensus 1 ~~~~~~~~~~~~m~~F~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~---------~~~~~~~v~ 64 (413) T protein:vir:96 1 MPGVSEIRKDKNLKFFNNKRSPTEESKAKDEIPKAPQVVMT-------LPNFFKELISDGYT---------KLSDSPEVR 64 (413) T ss_pred CCccchhhhhhcCCccccCCCcchhhhhhcccccccccccc-------chhhHhhhccchhH---------HHhhchHHH Confidence 1111111111110000 000000 00001110000000 01111111111111 123467888 Q ss_pred HHHHHHh-hcccccceEeeccchhhhhhcchhHHHHHHHHHHh-h----cChhHHHHHHHHHHHHhcceeEEEEEcCcc- Q lcl|NC_015263. 86 RLLNFYA-NMPLYAYSVVPFKDISTANENKLKKELATVTEFLS-R----LNPKYNFSKIVKLAMTVDIFYGYVIDDKES- 158 (513) Q Consensus 86 rlidy~~-~mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~-k----~n~k~~~~~i~~~~l~~g~~~gy~i~d~~~- 158 (513) +.|+.++ ++..+...++.-..... +.. + +.....|. + ++-..+...++..++..|..|.+.+-+.++ T Consensus 65 ~cI~~ia~~ia~~~~~~~~~~~~~~---~~~--~-~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~r~~~g~ 138 (413) T protein:vir:96 65 MAVDCIADLVSNMTIQLMQNGETGD---KRI--K-NDLSRVVDIEPNKYLSRKTFIQWLVRSMLLEGNGNAVVKPQVSGD 138 (413) T ss_pred HHHHHHHHhhccCceEEEEecCCCc---ccc--c-cHHHHHHHhccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCC Confidence 8888877 44555555553221111 111 1 22333343 2 335677788899999999999999987554 Q ss_pred --eeeeecCcceeEEEEEECCeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEEec Q lcl|NC_015263. 159 --VMIQQFPNDICKISSVSGGVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKIN 236 (513) Q Consensus 159 --~~iq~lp~dyckIsg~~nG~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~~ 236 (513) .-+.++|++.|++.-. +|.+.|.+... + .+.+++--+-|+.+ T Consensus 139 ~~~~L~~l~~~~v~~~~~-~~~~~y~~~~~---~--------------------------------~~~~~~evih~k~~ 182 (413) T protein:vir:96 139 KIIGLTPISPYKVTFNVS-DDDLDYSITFD---N--------------------------------KEYDPSTLLHFVLN 182 (413) T ss_pred ceEEEEEecCceeEEEEc-CCeEEEEEeec---C--------------------------------cEEchhhEEEEecc Confidence 4788999999998743 45555555311 1 01222234445643 Q ss_pred --Cc-cccchhhHHHHHHhHHHHHHHHHHHhhHhhhhh---ceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhcc- Q lcl|NC_015263. 237 --ES-SLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQN---YKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTVP- 309 (513) Q Consensus 237 --~~-~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n---~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~Lp- 309 (513) .. ...|+||...+...+--.....+... .-..| -..+++ +| . .++.++++++.+.++++.- T Consensus 183 ~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~--~~~~ng~~p~gil~-~~-----~----~l~~e~~~~~~~~~~~~~~g 250 (413) T protein:vir:96 183 PSIERPFIGTGYKVALKDIVGNLKQASVTKK--GFMASEYMPNLIVS-VD-----S----DSDELSDEEGRENFEEMYLK 250 (413) T ss_pred CCCCCccccccHHHHHHHHHHHHHHHHHHHH--HHHhccCCccEEEE-eC-----C----CCCHHHHHHHHHHHHHHhcC Confidence 22 33588888777666544444444221 12222 122221 23 1 2567777777777777662 Q ss_pred -ccce-EEEeccccccc-cccc-ccccc---hhhhhhHHhhhhhhhhhhhhccCCCcchHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015263. 310 -DNVG-VVTSPMEIDTV-SFDK-DSSTD---DSVEKATKNFWDNAGVSQILFSSDNKTSQGIAMSIATDEQFIFGVINQL 382 (513) Q Consensus 310 -~gv~-~v~sP~~~d~i-~ld~-~~~~~---dtv~~~~~~i~~~~GiS~~Lfn~d~~s~~~~~~SI~~d~~~~~~~~~~i 382 (513) ++.+ .++.|-....+ .+.. +..+. +..+-..++|..+.||...++|....+ .....+ -...-+.-++++| T Consensus 251 ~~n~g~~~vl~~~~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~-~~~~~~--~~~~~l~P~~~~i 327 (413) T protein:vir:96 251 RKEAGKPWIIPEGMVNVQQIKPLTLNDLAINDAVTLDKKTVAGIFGVPAFLLGVGTYN-KDEFNN--FINTKIMSIAQVI 327 (413) T ss_pred ccccCceeeecCCcccccccccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCcch-HHHHHH--HHHHHHHHHHHHH Confidence 2222 22222111111 1110 11122 233344578999999999999754322 212222 2222344689999 Q ss_pred HHHHHHHHhhcccceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCc Q lcl|NC_015263. 383 ERWLNRYLLLNGMSKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTP 462 (513) Q Consensus 383 E~~~N~~i~~~~~~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~P 462 (513) |..+|+.|-.. +..|+|.+-+....+.++.++.+.++..-|.=..--+-+.+|+.|.+ +-|..+.| T Consensus 328 e~~ln~~ll~~--~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~p~~------------~gd~~~~~ 393 (413) T protein:vir:96 328 QQTYNKLIVEE--DMYFSLNPRSLYNYSLTEMVSAGAQMTQLNALRRNEFRNWVGMPPDA------------EMDDLLVL 393 (413) T ss_pred HHHHHHhhCCC--CcEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC------------Ccceeeec Confidence 99999987543 34577777777778889999998888887743222233345666632 22334434 Q ss_pred ccccccccccccccCCccccCCCCcCCCCcc Q lcl|NC_015263. 463 LSSSFNTSGSDIAENAIKEKGKENGRPTNET 493 (513) Q Consensus 463 l~TS~T~Sg~~~~~~~~~~~~~~~grPt~et 493 (513) +. |+.-+.. + ...+..| .+| T Consensus 394 ~n--~~~~~~~-----~-~~~~~~~---~dt 413 (413) T protein:vir:96 394 EN--YLQQKDL-----V-NQKKLIQ---DET 413 (413) T ss_pred cc--ccchhhc-----c-cccCCCC---CCC Confidence 32 2210100 0 0000111 111 No 80 >protein:vir:101647 Length: 460 # NCBI annotation: phage portal protein # Family: family:all:26542 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112492;genbank:gi:53793592;uniprot:Q5ZGG1;genbank:GeneID:3101755 Probab=97.37 E-value=8.2e-05 Score=42.99 Aligned_cols=401 Identities=12% Similarity=0.078 Sum_probs=181.0 Q ss_pred HHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhcc-C--hhHHHHHHHHHHHHHhhcchHHHHHHHHh-hccccc Q lcl|NC_015263. 23 KRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYR-N--EGNQKTLRKVSEDLAVQSQQYQRLLNFYA-NMPLYA 98 (513) Q Consensus 23 ~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~-P--~~n~~~ir~~s~~lY~~sg~~~rlidy~~-~mpt~d 98 (513) |+|.=.-+++.....- .+.. ..|++ |+ | ...-.+...++.-.+..++.+.+.|+.++ ++..+- T Consensus 1 ~~~~~~~~~~~~~~~~---------~~~~---~~~~~-~~g~~~~~~~~~~~~~~~~~a~~~~~v~~~v~~ia~~iA~lp 67 (460) T protein:vir:10 1 MANRIIRALRELTGLD---------NKFN---DAFIK-YIGQTFTKYDNNGKTYLEQGYNINPDVYSCISQMAAKTVAVP 67 (460) T ss_pred CchhHHHHHhhhhccC---------CCch---HHHHH-hhccccCCCccchhhhhHHHHhcchHHHHHHHHHHHhhhhCc Confidence 4443222233322111 0101 23442 21 1 01112445556666777788888888877 345555 Q ss_pred ceEeecc-chhhhh-------------------hcc---hhHHHHHHHHHHhhcC----hhHHHHHHHHHHHHhcceeEE Q lcl|NC_015263. 99 YSVVPFK-DISTAN-------------------ENK---LKKELATVTEFLSRLN----PKYNFSKIVKLAMTVDIFYGY 151 (513) Q Consensus 99 Y~I~P~~-~~~~~~-------------------~~~---~~~~y~~v~~~L~k~n----~k~~~~~i~~~~l~~g~~~gy 151 (513) ..++-.. +..... +.. ....-.-...++++=| ...+...++..++..|..|.+ T Consensus 68 ~~v~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~ 147 (460) T protein:vir:10 68 YTIKVVKDTKAYQQLNNLNISTKGLYSFTQSLQKNRLDTKAFSETEKAFPLESPNPTQTWADIYSLYKTYMRLNGNCYFY 147 (460) T ss_pred eEEEeccCCccchhhhhhhhhhhhhHHHHHHhhcchhhhcccchhHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEE Confidence 5554311 110000 000 0000011122343333 456667778888999999999 Q ss_pred EEEcC------cceeeeecCcceeEEEEEECCeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeee Q lcl|NC_015263. 152 VIDDK------ESVMIQQFPNDICKISSVSGGVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEI 225 (513) Q Consensus 152 ~i~d~------~~~~iq~lp~dyckIsg~~nG~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L 225 (513) ++-+. ....+.++|+++|.+.--.+|...+...-.+ .|. ......=..+ T Consensus 148 i~r~~~~~~~G~~~~L~~l~~~~v~v~~~~~~~~~~~~~~~~-------------------~~~------~~~~g~~~~~ 202 (460) T protein:vir:10 148 LMSPDDGINAGVPSQMYVLPAHLIKIVLKDDINLLSTDSPIK-------------------SYM------LIQGDQFIEF 202 (460) T ss_pred EEecCCCccCceeEEEEEEcCceEEEEEcCCCceeeeeeeee-------------------EEE------EecCceeEEe Confidence 98643 3356899999999998666665443321111 000 0000112345 Q ss_pred cCCceEEEEecCc-------cccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHH Q lcl|NC_015263. 226 QDKNSICIKINES-------SLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMN 298 (513) Q Consensus 226 ~~~kt~~ik~~~~-------~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~ 298 (513) +++.-+.|+.... ...|+||...+...+--....++... .-..|-. ...-| + .. .-.++.++++ T Consensus 203 ~~~evih~r~~~~~~~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~--~~f~ng~-~~~~i-~---~~--~~~l~~e~~~ 273 (460) T protein:vir:10 203 NEDEVIHTKYANPNFDLQGSHLYGMSPIRAILRNINSQNSTIDNNV--KTMQNGG-VFGFI-H---GG--STGLTQPQAD 273 (460) T ss_pred cccceEEEecCCCCcccccCccccccHHHHHHHHHHHHHHHHHHHH--HHHhcCC-Cccee-e---ec--CCCCCHHHHH Confidence 6655666775332 24688888776544444333333221 1112211 00001 1 01 1136677777 Q ss_pred HHHHHHHHhc--cc---cceEEEeccccccccccc-ccccchhhhhhHHhhhhhhhhhhhhccCCCc---chHHHHHHHH Q lcl|NC_015263. 299 YFHEALSMTV--PD---NVGVVTSPMEIDTVSFDK-DSSTDDSVEKATKNFWDNAGVSQILFSSDNK---TSQGIAMSIA 369 (513) Q Consensus 299 ~~~~~ik~~L--p~---gv~~v~sP~~~d~i~ld~-~~~~~dtv~~~~~~i~~~~GiS~~Lfn~d~~---s~~~~~~SI~ 369 (513) ++.+.+.+.. ++ ++..+-..+++..+.+.. +..--++.+-..++|..+.||...++|...+ +++.+...-. T Consensus 274 ~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~e~~~~ 353 (460) T protein:vir:10 274 SLKQRLTEMDKSPDRLSQIAGASGEIAFTKISLNTDELKPFDYLKYDQKAICNALGWSDKLLNNNEGGGLNTGNLEEERK 353 (460) T ss_pred HHHHHHHHHhcCccccCCceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCCccccHHHHHH Confidence 7777777765 22 233333444444444431 1111234455568899999999999875422 2223322222 Q ss_pred -HHHHHHHHHHHHHHHHHHHHHhhcc---cceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHH Q lcl|NC_015263. 370 -TDEQFIFGVINQLERWLNRYLLLNG---MSKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTG 445 (513) Q Consensus 370 -~d~~~~~~~~~~iE~~~N~~i~~~~---~~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~ 445 (513) -...-+.-++.+||.++|+.|-... .+..++|.+-++.. ..+....+.+++.- --++|.|+-. T Consensus 354 ~f~~~~l~P~~~~ie~~ln~kl~~~~~~~~~~~i~~d~~~l~~--l~~d~~~~~~~~~~-----------g~~T~NE~R~ 420 (460) T protein:vir:10 354 RVVTDNIQPDLVILKQAFDKKFIKRFKGYENAVIEWDISELPE--MQTDMVAMASWLNT-----------IPVTPNEIRI 420 (460) T ss_pred HHHHHHHHHHHHHHHHHHHHhhcCcccccCCceEEeecchhhh--HHHHHHHHHHHHhC-----------CCCCHHHHHH Confidence 2233344689999999999875422 22334554433321 11222222222222 2366766654 Q ss_pred HHHHHH--HhhCcccccCcccccccccccccccCCccccCCCCcCCCCcccccccCCCCCCCCCCccCCC Q lcl|NC_015263. 446 LLKVEN--EMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETTGNKDSDETQRAKDKPANTQ 513 (513) Q Consensus 446 ~~~~E~--e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~~n~~~~~~~~~~d~~~~~~ 513 (513) ++-+|- +. +-|..++|+ +.+.-. +.+....+ +..+..| T Consensus 421 ~~g~~pi~~~-~gD~~~~~~---n~~~~~------------~~~~~~~~--------------~~~nq~~ 460 (460) T protein:vir:10 421 AMKYETLNQD-GMDIVFMPS---NKVRID------------DVSNNLID--------------SAFNQNQ 460 (460) T ss_pred HhCCCCCCCC-CCCeeeecc---cccchh------------hcccccCC--------------CcccCCC Confidence 443331 00 112223332 111100 00000000 0000001 No 81 >protein:vir:98643 Length: 395 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039921;genbank:gi:126011096;genbank:GeneID:4818479 Probab=97.23 E-value=0.00012 Score=42.03 Aligned_cols=376 Identities=10% Similarity=0.013 Sum_probs=162.8 Q ss_pred HHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhcchHHHHHHHHhh-cccccceE Q lcl|NC_015263. 23 KRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQQYQRLLNFYAN-MPLYAYSV 101 (513) Q Consensus 23 ~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~~~rlidy~~~-mpt~dY~I 101 (513) |-=+.++. ...+...+.+++ .+.+. ..+.-.|..++.+.+.|+.++. +..+...+ T Consensus 1 MGlf~~~~----~~~~~~~~~~~~--------~~~~~------------~~~~~~~~~~~~v~~~I~~ia~~iA~lp~~~ 56 (395) T protein:vir:98 1 MGILDFFS----FKKSGTLSDDDS--------GSTTS------------EKLTNVVLKEDALYKCVNYLARIISKSTFRL 56 (395) T ss_pred Ccchhhhc----CCCccccccccc--------chhhh------------hhcchhhhhhHHHHHHHHHHHHHHhhCceeE Confidence 11122211 111221122221 11111 1222233455567777777654 44444455 Q ss_pred eeccchhhhhhcchhHHHHHHHHHHhh-----cChhHHHHHHHHHHHHhcceeEEEEEcCcceeeeecCcceeEEEEEEC Q lcl|NC_015263. 102 VPFKDISTANENKLKKELATVTEFLSR-----LNPKYNFSKIVKLAMTVDIFYGYVIDDKESVMIQQFPNDICKISSVSG 176 (513) Q Consensus 102 ~P~~~~~~~~~~~~~~~y~~v~~~L~k-----~n~k~~~~~i~~~~l~~g~~~gy~i~d~~~~~iq~lp~dyckIsg~~n 176 (513) +--++ + ... -+.+...|.. +.--.+...+...++..|..|-|...+.. ...|..||.+.+... T Consensus 57 ~~~~~-----~-~~~--~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnayi~~~~~~~----~~~~~~~~~~~~~~~ 124 (395) T protein:vir:98 57 KTPEK-----L-TEN--QKDWLYWINTKANPNQSASQFWVEVIQKLLVDGETLIFVIPGKG----IYVADSFTQDKKISG 124 (395) T ss_pred EecCC-----c-ccc--cchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeCCc----eecCCcccccccccC Confidence 42111 1 111 1233444542 33456677788888999999988877654 245777777764443 Q ss_pred CeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEEecCccc-cchhhHHHHHHhHHH Q lcl|NC_015263. 177 GVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKINESSL-TPVPPFAGTFDSIYD 255 (513) Q Consensus 177 G~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~~~~~~-~~ip~f~~v~~d~~d 255 (513) ..|..+. . +.|.- =.+++++.-+.||.+.... ..+.+.......++. T Consensus 125 ~~~~~~~----------~-----------~~~~~-----------~~~~~~~evih~k~~~~~~~~~~~~~~~~~~~~~~ 172 (395) T protein:vir:98 125 SQFKVSR----------V-----------QGQTY-----------EKTFTFDQVIYLKNDNSDLMSKVESLWEEYGELLG 172 (395) T ss_pred cccceee----------e-----------cCcee-----------eeEecCccEEEecCCCCCccccccchhhhHHHHHH Confidence 3322110 0 01100 0234554556676533222 122222222222221 Q ss_pred HHHHHHHHhhH-hhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhc-cccceEEEec--cccccccccccc- Q lcl|NC_015263. 256 IHSFKDLRNDK-AELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTV-PDNVGVVTSP--MEIDTVSFDKDS- 330 (513) Q Consensus 256 i~~~kdL~~~~-~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~L-p~gv~~v~sP--~~~d~i~ld~~~- 330 (513) ..--....... .-..+ .-...+......+... ....+.++.+.+....+. -++.+.++.+ +++..+++.... T Consensus 173 ~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~ 249 (395) T protein:vir:98 173 HVINNQKIANQIRFTMI--PPKDKVRERAQENSDG-GRQSKSDKDFFKRTVEKIRTESVVGIPVTANTNYEEYGSKNTGA 249 (395) T ss_pred HHHHHHHHHHHHHHhhc--cccccccccccccCCc-HHHHHHHHHHHHHHHhhhhcCCcceeecCCCceeEecccccccc Confidence 11111111100 01111 1111111100011111 111223333444433333 3455555444 344444432211 Q ss_pred --ccc-h---hhhhhHHhhhhhhhhhhhhccCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh-cccceEEEEEe Q lcl|NC_015263. 331 --STD-D---SVEKATKNFWDNAGVSQILFSSDNKTSQGIAMSIATDEQFIFGVINQLERWLNRYLLL-NGMSKYFKATM 403 (513) Q Consensus 331 --~~~-d---tv~~~~~~i~~~~GiS~~Lfn~d~~s~~~~~~SI~~d~~~~~~~~~~iE~~~N~~i~~-~~~~~~f~~~~ 403 (513) ..+ + +..-..++|..+.||...++|++..+.+-.... -...-+.-++.+||..+|+.|-. ......+.|.+ T Consensus 250 ~~~~~~q~~e~~~~~~~~Ia~~fgVP~~~l~~~~sn~e~~~~~--f~~~tl~P~~~~ie~~l~~kll~~~~~~~g~~f~~ 327 (395) T protein:vir:98 250 VKSYVDDIKKLKDQYMAEFAEMLGIPISLLHGDIADNQKNYEL--LLEGPIESLITNIVDGLEYAIFDKSETLQGSFIKV 327 (395) T ss_pred cChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCcccHHHHHHH--HHHHHHHHHHHHHHHHHHHhcCChhhhcCcceeee Confidence 111 1 122334679999999999998775444443333 33344556999999999997643 22222345666 Q ss_pred cCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccC Q lcl|NC_015263. 404 LEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKG 483 (513) Q Consensus 404 l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~ 483 (513) -+....+.++.++.+.++.+-|.=..--+-+.+|+.|.+- + .-|..++|+ +...- . T Consensus 328 ~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~Pi~~--------~--~gD~~~~~~---n~~~~-----------~ 383 (395) T protein:vir:98 328 TGLKNYDLFSISNQADKLISSGFVFIDEVREEIGLPELPD--------G--LGKVLYMTK---NYESV-----------L 383 (395) T ss_pred hhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--------C--CCceeeecc---cceec-----------c Confidence 6777778999999998888877433222333455555421 0 122233321 11000 0 Q ss_pred CCCcCCCCccccc Q lcl|NC_015263. 484 KENGRPTNETTGN 496 (513) Q Consensus 484 ~~~grPt~et~~n 496 (513) ..||.+.++ .+| T Consensus 384 ~~gge~~~~-~~~ 395 (395) T protein:vir:98 384 ERGGEVDEE-VET 395 (395) T ss_pred cccCCCCCC-CCC Confidence 123432211 121 No 82 >protein:vir:80134 Length: 403 # NCBI annotation: Phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425602;genbank:gi:155042935;genbank:GeneID:5469563 Probab=97.17 E-value=0.00014 Score=41.68 Aligned_cols=378 Identities=11% Similarity=0.083 Sum_probs=148.4 Q ss_pred eeeeehhhhhhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhcchHHHHH Q lcl|NC_015263. 9 LSMIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQQYQRLL 88 (513) Q Consensus 9 ~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~~~rli 88 (513) ..+.+. |+. +. ...|......|.... ......+..+.. + ..++.++..| T Consensus 1 Mg~~~~--------f~~------k~-~~~~~~~~~~~~~~~-------~~~~~~~~~~~~----~-----~~~~~V~~~I 49 (403) T protein:vir:80 1 MGLFNF--------FRR------KT-RSEPTNAISWFLTQE-------AYDTLAIPGYTR----L-----SDNPEVRMAV 49 (403) T ss_pred Cccccc--------ccc------cc-cccccchhhhhcccc-------cccccccchhhh----h-----hhhHHHHHHH Confidence 111110 110 00 001110000010000 000100111111 1 1234566666 Q ss_pred HHHh-hcccccceEeeccchhhhhhcchhHHHHHHHHHHh-hcChh----HHHHHHHHHHHHh--cceeEEEEEcCcc-- Q lcl|NC_015263. 89 NFYA-NMPLYAYSVVPFKDISTANENKLKKELATVTEFLS-RLNPK----YNFSKIVKLAMTV--DIFYGYVIDDKES-- 158 (513) Q Consensus 89 dy~~-~mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~-k~n~k----~~~~~i~~~~l~~--g~~~gy~i~d~~~-- 158 (513) +.++ ++..+...++ +.. .++.... -+.....|. .=|.- .+...++..++.. |..|-+...+..+ T Consensus 50 ~~ia~~iA~~p~~~~----~~~-~~g~~~~-~~~~~~lL~~~PN~~~t~~~f~~~~v~~~ll~~~Gna~i~~~~~~~g~~ 123 (403) T protein:vir:80 50 HKIAELISSMTIHLM----QNT-DNGDIRI-KNELSRKIDINPYSLMTRKAWMYNIVYTMLLDGEGNSVVFPKYTTSGLI 123 (403) T ss_pred HHHHHhhhhCceEEE----Eec-CCceeec-CChHHHHHhccCCcCCCHHHHHHHHHHHHhhcCCccEEEEEEEcCCCcE Confidence 6653 2222222332 110 1111111 122233333 33322 3445556666665 5455555555544 Q ss_pred eeeeecCcceeEEEEEECCeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEEecCc Q lcl|NC_015263. 159 VMIQQFPNDICKISSVSGGVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKINES 238 (513) Q Consensus 159 ~~iq~lp~dyckIsg~~nG~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~~~~ 238 (513) .-+.++|++.|.+.--. +.|++.++- . .|| .+--+.|+.+.. T Consensus 124 ~~L~~l~p~~v~~~~~~-~g~~~~y~~-----~----~~~----------------------------~~eiih~~~~~~ 165 (403) T protein:vir:80 124 DELIPLAPSKVSFVDTD-TGYQIWYQG-----K----AYN----------------------------YDEVLHFIVNPD 165 (403) T ss_pred EEEEEEcCCeeEEEEcC-CceEEEEee-----c----ccc----------------------------hhhEEEEeccCC Confidence 67889999999986433 445543321 0 111 112344554322 Q ss_pred ---cccchhhHHHHHHhHHHHHH-HHHHHhhHhhhhhceee--eeeeccccCCCCCccccCHHHHHHHHHHHHHhc---- Q lcl|NC_015263. 239 ---SLTPVPPFAGTFDSIYDIHS-FKDLRNDKAELQNYKLL--IQKLETRSSNDNNDFTLDMPMMNYFHEALSMTV---- 308 (513) Q Consensus 239 ---~~~~ip~f~~v~~d~~di~~-~kdL~~~~~~i~n~~ii--~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~L---- 308 (513) ...|++|... +.+.+.+.. ..+.. ..-..|-... +-++| .. ++.+++++..+.+.+.. T Consensus 166 ~~~~~~G~s~~~~-~~~~i~~~~~~~~~~--~~~~~ng~~p~~il~~~----~~-----~~~~~~~~~~~~~~~~~~~~~ 233 (403) T protein:vir:80 166 PEKPYMGRGYRVV-LKDIVNNLKQATTTK--KSFMSGKYMPSLIVKVD----AA-----TAELSSEEGRNAVFKKYLEAS 233 (403) T ss_pred CcCccccccHHHH-HHHHHHHHHHHHHHH--HHHHhccCCcceEEEeC----CC-----CChHHHHHHHHHHHHHHhhhh Confidence 2346676543 444444333 22221 1222221111 11223 11 22233333333322221 Q ss_pred cc-cceEEEecc-ccccc-cccc-ccccchhhhhhHHhhhhhhhhhhhhcc-CCCcchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015263. 309 PD-NVGVVTSPM-EIDTV-SFDK-DSSTDDSVEKATKNFWDNAGVSQILFS-SDNKTSQGIAMSIATDEQFIFGVINQLE 383 (513) Q Consensus 309 p~-gv~~v~sP~-~~d~i-~ld~-~~~~~dtv~~~~~~i~~~~GiS~~Lfn-~d~~s~~~~~~SI~~d~~~~~~~~~~iE 383 (513) -. +++.+..+. +...+ +++- +..--++.+-..++|..+.||...++| ++..+.... + --..-+.-++++|| T Consensus 234 ~~g~~~~~~~~~~~~~~~~~l~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~--~--f~~~~l~P~~~~ie 309 (403) T protein:vir:80 234 EAGQPWIIPAELLDVEQVKPLSLKDLAIHETVELDKRTVAGIFGVPAFLLGVGKYDKDEYN--N--FINSTILPIAKGIE 309 (403) T ss_pred hcCCeeeecccccccceeccCCHHHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCccHHHHH--H--HHHHHHHHHHHHHH Confidence 11 222222221 11111 1211 111123344455789999999999987 333322221 2 22223446899999 Q ss_pred HHHHHHHhhcccceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCcc Q lcl|NC_015263. 384 RWLNRYLLLNGMSKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPL 463 (513) Q Consensus 384 ~~~N~~i~~~~~~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl 463 (513) ..+|+.|-.. ....|+|..-+...-+.++.++.+.++..-|.=..--+-+.+|+.|.+ +=+..++|+ T Consensus 310 ~~l~~kll~~-~~~~~~f~~~~ll~~d~~~~~~~~~~~~~~Gi~t~NE~R~~~gl~p~~------------ggd~~~~~~ 376 (403) T protein:vir:80 310 QELTRKLLIS-PDLYFKFNPRSLYAYDLKELAEVGSNMYVRGLMEGNEVRDWLGLSPKE------------GLSELVILE 376 (403) T ss_pred HHHHHhccCC-CCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC------------CCCeEeecc Confidence 9999877432 223466655566666888888888888777733222222345655532 112233222 Q ss_pred cccccccccccccCCccccCCCCcCCCCccccccc Q lcl|NC_015263. 464 SSSFNTSGSDIAENAIKEKGKENGRPTNETTGNKD 498 (513) Q Consensus 464 ~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~~n~~ 498 (513) . ++--.. .++.....+| ..+.+.+.++ T Consensus 377 n--~~pl~~-----~~~~~~~k~g-e~~~~~~~~~ 403 (403) T protein:vir:80 377 N--YIPLDK-----IGDQNKLKGG-EKGGADGQTD 403 (403) T ss_pred c--ccchhh-----ccchhhccCC-CCCCCCCCCC Confidence 1 110000 0000001111 1111111111 No 83 >protein:vir:9641 Length: 395 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795403;genbank:gi:28876176;genbank:GeneID:1257709 Probab=97.17 E-value=0.00014 Score=41.63 Aligned_cols=370 Identities=10% Similarity=0.054 Sum_probs=160.7 Q ss_pred eeeeehhhhhhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhcchHHHHH Q lcl|NC_015263. 9 LSMIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQQYQRLL 88 (513) Q Consensus 9 ~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~~~rli 88 (513) .++.| +-.. ..+...+.... . +.+...+.-.|..+..+.+.| T Consensus 1 Mgl~d---------~~~~---------~~~~~~~~~~~--------~------------~~~~~~~~~~~l~~~~v~~~i 42 (395) T protein:vir:96 1 MGILD---------FFSF---------KKSGTLSDDDS--------G------------STTSEKLTNVVLKEDALYKCV 42 (395) T ss_pred Ccchh---------hhcC---------CCCcccccccc--------c------------cchhhhcchhhhhhHHHHHHH Confidence 11111 0000 00111111110 1 111222233344556677777 Q ss_pred HHHhhc-ccccceEeeccchhhhhhcchhHHHHHHHHHHhh-----cChhHHHHHHHHHHHHhcceeEEEEEcCcceeee Q lcl|NC_015263. 89 NFYANM-PLYAYSVVPFKDISTANENKLKKELATVTEFLSR-----LNPKYNFSKIVKLAMTVDIFYGYVIDDKESVMIQ 162 (513) Q Consensus 89 dy~~~m-pt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k-----~n~k~~~~~i~~~~l~~g~~~gy~i~d~~~~~iq 162 (513) +.++.. ..+...++-- ..+.... +.+...|.. +..-.+...++..++..|..|.+.+.+... T Consensus 43 ~~Ia~~ia~lp~~v~~~-----~~~~~~~---~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~~~~~~~~---- 110 (395) T protein:vir:96 43 NYLARIISKSTFRIKAP-----EKLTENQ---KDWLYWINTKANPNQSASQFWVEVVQKLLVDGETLIFVIPGKGI---- 110 (395) T ss_pred HHHHHhhccceeEEEeC-----Ccccccc---chHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEcCCce---- Confidence 777643 4555555411 1111111 223344542 345666778888899999999998876543 Q ss_pred ecCcceeEEEEEECCeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEEecCccc-- Q lcl|NC_015263. 163 QFPNDICKISSVSGGVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKINESSL-- 240 (513) Q Consensus 163 ~lp~dyckIsg~~nG~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~~~~~~-- 240 (513) ..+..|+.+..+....|..+. +..|.-. ..++...-+.||.+.... T Consensus 111 ~~~~~~~~~~~~~~~~~~~v~---------------------~~~~~~~-----------~~~~~~dvih~k~~~~~~~~ 158 (395) T protein:vir:96 111 YVADAFTQDKKLSGNKFKVSR---------------------VQGQTYE-----------KIFTFDQVIYLKNDNSDLML 158 (395) T ss_pred ecCCccccccccccceeeeee---------------------eccceee-----------eEeccCceEEecccCCcccc Confidence 234455554433222221100 0011000 123444445555433211 Q ss_pred cc---hhhHHHHHHhHHHHHHHHH-HHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhc-cccceEE Q lcl|NC_015263. 241 TP---VPPFAGTFDSIYDIHSFKD-LRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTV-PDNVGVV 315 (513) Q Consensus 241 ~~---ip~f~~v~~d~~di~~~kd-L~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~L-p~gv~~v 315 (513) .+ +.....++...+++..... ++-...-..+ ......++ .-++ ....+.++++++...... .++.+.+ T Consensus 159 ~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~-~~~~~~~~----~~~~--~~~~~~~~~~~~~~~~~~~~~~~~v~ 231 (395) T protein:vir:96 159 KVESLWEEYGELLGHVINNQKIANQIRFTMTPPKD-KVRERAQE----NSDG--GRQPKSDKDFFKRTIEKIRTESVVGI 231 (395) T ss_pred ccccccchHHHHHHHHHHHHHHHHHHHHHhhhccc-ccccceee----ccCc--hhhHHHHHHHHHHHHHHhhcCCcceE Confidence 11 1112222222223221111 1111111111 11111111 0011 122245555555555444 3444444 Q ss_pred Eecc--cccccccccccccchh---------hhhhHHhhhhhhhhhhhhccCCCcchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015263. 316 TSPM--EIDTVSFDKDSSTDDS---------VEKATKNFWDNAGVSQILFSSDNKTSQGIAMSIATDEQFIFGVINQLER 384 (513) Q Consensus 316 ~sP~--~~d~i~ld~~~~~~dt---------v~~~~~~i~~~~GiS~~Lfn~d~~s~~~~~~SI~~d~~~~~~~~~~iE~ 384 (513) +.+- ++..+++. ..+.+. .....+.|-.+.||...+++++..+......+.-.+. +.-++.+||. T Consensus 232 ~l~~g~~~~~l~~~--~~d~q~~e~~~~~~~~~~~~~eIa~~fgVPp~~l~~~~sn~e~~~~~f~~~~--L~P~~~~ie~ 307 (395) T protein:vir:96 232 PVTANTNYEEYGSK--NTGSVKSYVDDIKKLKDQYMAEFAEMLGIPISLLHGDIADNQKNYELLLEGP--IESLITNIVD 307 (395) T ss_pred EccCCceeEecccC--hhhhhhhhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCccHHHHHHHHHHHH--HHHHHHHHHH Confidence 4443 33333332 222222 2223467889999999999877555554444433333 4468999999 Q ss_pred HHHHHHhh-cccceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCcc Q lcl|NC_015263. 385 WLNRYLLL-NGMSKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPL 463 (513) Q Consensus 385 ~~N~~i~~-~~~~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl 463 (513) .+|+.|-. ......++|.+-....-+.++.++.+.++..-|.=..--+-+.+|+.|.+- + .-|..++|+ T Consensus 308 ~l~~~Ll~~~e~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~pi~~--------~--~gD~~~~~~ 377 (395) T protein:vir:96 308 GLEYAIFDKSETLEGSFIKVTGLKNYDLFSISSQADKLISSGFVFIDEVREEIGLPELPD--------G--LGKVLYMTK 377 (395) T ss_pred HHHhhcCChhhhcCceeEeecchhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--------C--CCceeeecc Confidence 99997743 222223556666777788999999998888777322222223456555431 0 112233322 Q ss_pred cccccccccccccCCccccCCCCcCCCCccccc Q lcl|NC_015263. 464 SSSFNTSGSDIAENAIKEKGKENGRPTNETTGN 496 (513) Q Consensus 464 ~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~~n 496 (513) . .... .+.||.+. +..+| T Consensus 378 N---~~~~-----------~~~gge~~-~~~~~ 395 (395) T protein:vir:96 378 N---YESV-----------LERGGEVD-EEVET 395 (395) T ss_pred c---ceec-----------hhccCCCC-CCCCC Confidence 1 1000 01233221 11221 No 84 >protein:vir:4995 Length: 384 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049969;genbank:gi:9632941;genbank:GeneID:1262104 Probab=97.16 E-value=0.00015 Score=41.57 Aligned_cols=365 Identities=10% Similarity=0.067 Sum_probs=148.4 Q ss_pred HHHHHhhccCcccccccccccchHHHHHHH-hhhccChhHHHHHHHHHHHHHhhcchHHHHHHHHhh-cccccceEeecc Q lcl|NC_015263. 28 ISILRDDNRTPVFGAPVGSLTSSQSKVRKI-VKEYRNEGNQKTLRKVSEDLAVQSQQYQRLLNFYAN-MPLYAYSVVPFK 105 (513) Q Consensus 28 ~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~-i~~~~P~~n~~~ir~~s~~lY~~sg~~~rlidy~~~-mpt~dY~I~P~~ 105 (513) -.++.-++..+.. +.-...+.-+..... +..+....++ +.--+-.++.+.+.|+.++. +..+...++ T Consensus 1 Mglf~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~v------~~~~al~~~~V~~~i~~Ia~~ia~l~~~~~--- 69 (384) T protein:vir:49 1 MPIFNITNLATES--PPSNQDSFFDITDPEFLDALNGSEWV------SAETALKNSDLFSIISQLSNDLATAKITTS--- 69 (384) T ss_pred CccccccccCccc--ccccchhhccccchhhcccccCCcee------chhhhhccHHHHHHHHHHHHHHhhCceeee--- Confidence 1111111111110 000000000000000 0000001111 01112346778888888876 556655553 Q ss_pred chhhhhhcchhHHHHHHHHHHhhcC----hhHHHHHHHHHHHHhcceeEEEEEcCc--ceeeeecCcceeEEEEEEC-Ce Q lcl|NC_015263. 106 DISTANENKLKKELATVTEFLSRLN----PKYNFSKIVKLAMTVDIFYGYVIDDKE--SVMIQQFPNDICKISSVSG-GV 178 (513) Q Consensus 106 ~~~~~~~~~~~~~y~~v~~~L~k~n----~k~~~~~i~~~~l~~g~~~gy~i~d~~--~~~iq~lp~dyckIsg~~n-G~ 178 (513) .+ .+. ..+++=| ...+...++..++..|..|.++.-+.. .+.+.++|++.|++.-..+ |. T Consensus 70 ~~----------~~~---~l~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~v~~~~~~~~ 136 (384) T protein:vir:49 70 RK----------QLQ---GIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLDNQNG 136 (384) T ss_pred cc----------hhh---hhhhccCCCCCHHHHHHHHHHHhhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCCCce Confidence 11 111 1232222 456667788889999999999997654 4688999999999975544 33 Q ss_pred eEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEEec-Ccc-ccchhhHHHHHHhHHHH Q lcl|NC_015263. 179 YNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKIN-ESS-LTPVPPFAGTFDSIYDI 256 (513) Q Consensus 179 y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~~-~~~-~~~ip~f~~v~~d~~di 256 (513) ..|.|...- ......+.++.+.-|.|+.. .+. ..|+||...+...+--. T Consensus 137 ~~y~~~~~~-----------------------------~~~~~~~~~~~~eVih~~~~~~~~~~~G~s~i~~~~~~i~~~ 187 (384) T protein:vir:49 137 LYYNITFDD-----------------------------PRIPPKQHVPQGDILHFRLLSVDGGLTSVSPLMALGRELNIQ 187 (384) T ss_pred EEEEEEecC-----------------------------ccccceeEecCccEEEecCCCCCCceeeccHHHHHHHHHHHH Confidence 333332211 00012345566556667742 333 56889988766655433 Q ss_pred HHHHHHHhhHhhhhh---ceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHh-ccccceEEEecccccccccccc-cc Q lcl|NC_015263. 257 HSFKDLRNDKAELQN---YKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMT-VPDNVGVVTSPMEIDTVSFDKD-SS 331 (513) Q Consensus 257 ~~~kdL~~~~~~i~n---~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~-Lp~gv~~v~sP~~~d~i~ld~~-~~ 331 (513) ....+... .-..| ...++ ++| + . ..+.+..+++.+..... -..++..+-..+++..+.+... .. T Consensus 188 ~~~~~~~~--~~~~ng~~~~~il-~~~----~---~-~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~d~q 256 (384) T protein:vir:49 188 KASDKLTL--NALKNALNANGIL-KIK----G---G-GLLDFKTKQSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQ 256 (384) T ss_pred HHHHHHHH--HHHhccCCCceEE-EeC----C---C-CChHHHHHHHHHHHhcccCCccceecCCCceEEEccCChhhHH Confidence 33333221 11222 11222 223 1 1 11112222222211111 1223334433444444444221 11 Q ss_pred cchhhhhhHHhhhhhhhhhhhhccCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccceEEEEEecCCCCccH Q lcl|NC_015263. 332 TDDSVEKATKNFWDNAGVSQILFSSDNKTSQGIAMSIATDEQFIFGVINQLERWLNRYLLLNGMSKYFKATMLEVTHFSK 411 (513) Q Consensus 332 ~~dtv~~~~~~i~~~~GiS~~Lfn~d~~s~~~~~~SI~~d~~~~~~~~~~iE~~~N~~i~~~~~~~~f~~~~l~~T~fn~ 411 (513) --++.+-..++|..+.||+..++|+...+.++....-+.-..++-..++.|+..+++.|....... ....++.. . T Consensus 257 ~~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~~~~~~~~~~~~i~~~l~pi~~~i~~~l~~~l~~~--~~~~~~~~---~ 331 (384) T protein:vir:49 257 LLSQADWTTGQFAKVYGIPESVVGGEGDKQSSLEMIYNIYFKAVSRFLRPFVSELSKKLSCEVDAD--ILPAVDPT---G 331 (384) T ss_pred HHHHHHHHHHHHHHHhCCCHHHhCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHHHhchhhhhh--hhhhhhcc---c Confidence 123455666899999999999998754333322211112222333344444444444443221000 00000011 1 Q ss_pred HHHHHHHHHHHhcCCcHHHHHHH---HhCCCHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccCCCCcC Q lcl|NC_015263. 412 KEAHDRYITDAQYGFPVKVYLAS---LMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGR 488 (513) Q Consensus 412 ke~~~~~~~~~~~G~~~~~~laa---~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~gr 488 (513) ..+...+.++..-|.=...-..+ ..|+.|.|+-. .| .++|.. |++. +.+ T Consensus 332 ~~~~~~~~~l~~~~~~t~~e~~~~l~~~g~~~ne~r~---~~--------~~~p~~------gGd~-----------~~~ 383 (384) T protein:vir:49 332 SNYIGLINSMVKTGTLAQNQGLYVLQQAEILPKDLPE---GE--------TDSTLK------GGET-----------NEQ 383 (384) T ss_pred hHHHHHHHHHhhcCcccHHHHHHHHhhCCCCChhHHH---Hc--------CCCCCC------CCCC-----------CCC Confidence 11222222223333211111111 12555543221 11 233432 3321 111 Q ss_pred C Q lcl|NC_015263. 489 P 489 (513) Q Consensus 489 P 489 (513) = T Consensus 384 ~ 384 (384) T protein:vir:49 384 Y 384 (384) T ss_pred C Confidence 1 No 85 >protein:vir:4509 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599035;genbank:gi:19548993;genbank:GeneID:935206 Probab=97.14 E-value=0.00015 Score=41.48 Aligned_cols=383 Identities=13% Similarity=0.132 Sum_probs=178.5 Q ss_pred CC-CccchheeeeehhhhhhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHh Q lcl|NC_015263. 1 MV-KNKKKRLSMIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAV 79 (513) Q Consensus 1 ~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~ 79 (513) |. ++=-+|.+.-+... .++.-. -.++....+ ... ++.--+- T Consensus 17 ~~~~~lf~~~~~~~~~~--~~~~~~-----------------~~~~~~~~~-------------~~~------vs~~~al 58 (424) T protein:vir:45 17 VLLDALFRSKSLENPST--PITGDA-----------------VDTDGLFRA-------------DVY------VSPETAM 58 (424) T ss_pred HHHHhhccccCCCCCcc--ccchhh-----------------hhhhccccC-------------Cce------echHHhh Confidence 11 11011111111000 000000 000000000 000 0111111 Q ss_pred hcchHHHHHHHH----hhcccccceEeeccchhhhhhcchhHHHHHHHHHHhh-----cChhHHHHHHHHHHHHhcceeE Q lcl|NC_015263. 80 QSQQYQRLLNFY----ANMPLYAYSVVPFKDISTANENKLKKELATVTEFLSR-----LNPKYNFSKIVKLAMTVDIFYG 150 (513) Q Consensus 80 ~sg~~~rlidy~----~~mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k-----~n~k~~~~~i~~~~l~~g~~~g 150 (513) +.+.+.+.|+.+ ++|| ..++= ..+... ..... +.+...|.. +..-.+...++..++..|..|. T Consensus 59 ~~~~v~~cv~~Ia~~iA~lp---~~v~~--~~~~~~-~~~~~--~~l~~lL~~~PN~~~t~~~f~~~~v~~lll~Gna~~ 130 (424) T protein:vir:45 59 KLAAVYSCIYVLSSSLAQMP---LHVMR--RHKGKV-EPARD--HPAFYLVHDEPNTWQTSYKWRELKQRHILGWGNGYT 130 (424) T ss_pred ccHHHHHHHHHHHHHHhhCc---eEEEE--ecCCce-eeccc--chHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEE Confidence 223344444444 4454 33321 111111 11111 123334432 3345566778889999999999 Q ss_pred EEEEcC--cceeeeecCcceeEEEEEECCeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCC Q lcl|NC_015263. 151 YVIDDK--ESVMIQQFPNDICKISSVSGGVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDK 228 (513) Q Consensus 151 y~i~d~--~~~~iq~lp~dyckIsg~~nG~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~ 228 (513) ++.-+. ..+.+.++|++.|.|. ..+|.+.|.+.-..- . ..++++ T Consensus 131 ~i~r~~~G~~~~L~~l~~~~v~i~-~~~~~~~y~~~~~~~--------------------------------~-~~~~~~ 176 (424) T protein:vir:45 131 WVKRNRRGEVISLDCCMPWETTLM-NTGGRYTYGLYNEYG--------------------------------A-FAISPD 176 (424) T ss_pred EEEEcCCCcEEEEEEecCceEEEE-EcCCeEEEEEEecCc--------------------------------e-EEECcc Confidence 988654 4478999999999986 456777775532110 0 124444 Q ss_pred ceEEEEe-cCccccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHh Q lcl|NC_015263. 229 NSICIKI-NESSLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMT 307 (513) Q Consensus 229 kt~~ik~-~~~~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~ 307 (513) .-+-|+. ..+..+|++|...+...+--....++... .-..|-...-..|-+ .. .++.++++.+.+.+.++ T Consensus 177 eVih~r~~~~d~~~G~spi~~~~~~i~~~~~~~~~~~--~~f~ng~~p~gil~~---~~----~l~~e~~~~~~~~~~~~ 247 (424) T protein:vir:45 177 DMIHIRALGNNQKMGLSPIMQHAETIGMGMSGQKYTE--SFFSGNARPAGIVSV---KS----GLNKESWGWLKDQWQKA 247 (424) T ss_pred cEEEecCcCCCCcccccHHHHHHHHHHHHHHHHHHHH--HHHhccCCccEEEEe---CC----CCCHHHHHHHHHHHHHH Confidence 4444553 33456788888766555443333333221 122231111111222 11 25666776666666554 Q ss_pred c---cc---cceEEEeccccccccccc-ccccchhhhhhHHhhhhhhhhhhhhccCCC-cchHH-HHHHHHHHHHHHHHH Q lcl|NC_015263. 308 V---PD---NVGVVTSPMEIDTVSFDK-DSSTDDSVEKATKNFWDNAGVSQILFSSDN-KTSQG-IAMSIATDEQFIFGV 378 (513) Q Consensus 308 L---p~---gv~~v~sP~~~d~i~ld~-~~~~~dtv~~~~~~i~~~~GiS~~Lfn~d~-~s~~~-~~~SI~~d~~~~~~~ 378 (513) . .+ ++..+-..+++..+.+.. +..--++..-..++|..+.||...++|... .+++. -+..+.-...-+.-+ T Consensus 248 ~~g~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~eq~~~~f~~~tL~P~ 327 (424) T protein:vir:45 248 SQALRRQENKTMLLPADLDYKALTVSPVDAQIIDMMKLNRSMIAGIFNIPAHMINDLEKATFSNISAQAIQFVRYTMMPW 327 (424) T ss_pred hccccccCCceeEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHH Confidence 4 22 233444555555555532 112224455566889999999998886432 23333 233333334444579 Q ss_pred HHHHHHHHHHHHhhc---ccceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhhC Q lcl|NC_015263. 379 INQLERWLNRYLLLN---GMSKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENEMLD 455 (513) Q Consensus 379 ~~~iE~~~N~~i~~~---~~~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L~ 455 (513) +++||..+|+.|-.. ..+..|+|..-.....+.++.++.+.++.+-|.=..--.=+.+|+.|. | | T Consensus 328 ~~~ie~~ln~kLl~~~e~~~g~~i~fd~~~llr~d~~~r~~~~~~~~~~g~~T~NE~R~~~gl~pi--------~----g 395 (424) T protein:vir:45 328 VTNWEQELNRRLFTRAELAAGYYVRFNLTGLLRGTPQERAQFYHFAITDGWMSRNEARAFEDMNPV--------E----G 395 (424) T ss_pred HHHHHHHHHHhcCChhhhcCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--------C----C Confidence 999999999988532 123456776666666788899999888887773222222223455552 2 3 Q ss_pred cccccCcccccccccccccccCCccccCCCCcCCCCc Q lcl|NC_015263. 456 LPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNE 492 (513) Q Consensus 456 l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~e 492 (513) -|..++|+-.. +..| ...++..+.| .+++ T Consensus 396 gD~~~~~~n~~-~~~~------~~~~~~~~~~-~~~~ 424 (424) T protein:vir:45 396 LDEMLVSVNAA-NPAG------DFKPPKNDEG-KTNE 424 (424) T ss_pred cceeeeccccc-cccc------ccCCCCCCCC-CCCC Confidence 34455553211 1011 1111111111 1111 No 86 >protein:vir:5249 Length: 437 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852754;genbank:gi:31544029;interpro:IPR006445;uniprot:Q7Y5U6;genbank:GeneID:2753529 Probab=97.14 E-value=0.00016 Score=41.45 Aligned_cols=409 Identities=11% Similarity=0.027 Sum_probs=181.2 Q ss_pred hhhhhhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhcchHHHHHHHHhh Q lcl|NC_015263. 14 VESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQQYQRLLNFYAN 93 (513) Q Consensus 14 ~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~~~rlidy~~~ 93 (513) |.-.-+|.|+.. . .++.+++.-. .+ +... ...+.....+|..++.++++|+-... T Consensus 1 ~~~~D~~~~~~~-~-------------------~g~~~~~~~~---~~-~~~~-~~~~~~l~a~Y~~~~l~~~~vd~~a~ 55 (437) T protein:vir:52 1 MKFFDGIKSLAL-K-------------------LGSKQEQTYY---SP-SLSL-TDDLVQLEALWRDNWIANKVCIKRPE 55 (437) T ss_pred CchhhhhHhHHh-c-------------------CCCcccccee---ec-Cccc-cccHHHHHHHHHhCchhhHHhhcchH Confidence 333445555421 0 1111111000 01 1111 11223345679999999999999877 Q ss_pred cccccceEeeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcceeEEEEEcCcceeeeecCcceeEEEE Q lcl|NC_015263. 94 MPLYAYSVVPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDIFYGYVIDDKESVMIQQFPNDICKISS 173 (513) Q Consensus 94 mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~~~gy~i~d~~~~~iq~lp~dyckIsg 173 (513) ..|=......-.+ .-.+...+.-..+++++++..+.+.++..=..|.-+-+...|+.. .-+||.+. T Consensus 56 d~~r~~~~i~~~d-------~~~~~~~~~~~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~~d~~~-~~~pl~~~------ 121 (437) T protein:vir:52 56 DMVRNWREIYSND-------LNSKQLDLFTKFERSLKLRETLTKALQWSSLYGSVGLLVVTDSQN-TSAPLKPT------ 121 (437) T ss_pred HhhcCCceEecCC-------CCHHHHHHHHHHHHhhcHHHHHHHHHHhcccccceEEEEEecCCC-cccccccC------ Confidence 7666655542111 111333345566888888777777666666677766676666554 44555431 Q ss_pred EECCeeEEEEEeeeccCcch-------hccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEEe-----cCcccc Q lcl|NC_015263. 174 VSGGVYNYVIDLDALVSADI-------VDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKI-----NESSLT 241 (513) Q Consensus 174 ~~nG~y~~~fD~syFd~~~~-------L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~-----~~~~~~ 241 (513) |.+.. |..|+...+ .+-..|- |-+.+.+. .+++ ..=+.+.+.+-+.|.- .....| T Consensus 122 ---~~~~~---~~v~~~~~v~~~~~~~~dp~s~~----fg~p~~y~-v~~~--~~~~~iH~SRii~~~~~~~~~~~~~~~ 188 (437) T protein:vir:52 122 ---ERLKR---LIILPKWKISPTGTKDDDVLSPN----FGRYSEYS-ILGG--SQSITVHHSRLIILNANDAPLSDNDIW 188 (437) T ss_pred ---CceeE---EEEechhhccccccccccccccc----cCcceEEE-EecC--CcceeEccceeEEecCccCCCcccccc Confidence 22211 222221100 0000000 00000000 0111 0112344444444431 123446 Q ss_pred chhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeec-cccCCCCCccccCHHHHHHHHHHHHHhccccceEEEec-- Q lcl|NC_015263. 242 PVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLE-TRSSNDNNDFTLDMPMMNYFHEALSMTVPDNVGVVTSP-- 318 (513) Q Consensus 242 ~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip-~~~~n~~~~~~vd~~~~~~~~~~ik~~Lp~gv~~v~sP-- 318 (513) |+|.+-.++..+...+.-...-. .-+....+.+-+++ +...-.+| ..+.+.+..+.+... -++-+.++.. T Consensus 189 G~s~le~~~~~i~~~~~~~~~~~--~l~~~~~~~v~k~~~l~~~l~~~----~~~~~~~~~~~~~~~-~~~~~~~~~d~~ 261 (437) T protein:vir:52 189 GVSDLEKIIDVLKRFDSASVNVG--DLIFESKIDIFKIAGLSDKIAAG----MENEVASVISAVQEI-KSATNSLLLDAE 261 (437) T ss_pred CCchHHHHHHHHHHHHHHHHHHH--HHHHHcCCCceecchHHHHhcCC----cHHHHHHHHHHHHHh-cCCCceEEEcCC Confidence 88888888877777665555331 11222223333443 11000111 123344444444332 2333333333 Q ss_pred ccccccccccccccchhhhhhHHhhhhhhhhhhhhccCCCcchHHHHHHHHHHH----HHHHHHHHH-HHHHHHHHH--- Q lcl|NC_015263. 319 MEIDTVSFDKDSSTDDSVEKATKNFWDNAGVSQILFSSDNKTSQGIAMSIATDE----QFIFGVINQ-LERWLNRYL--- 390 (513) Q Consensus 319 ~~~d~i~ld~~~~~~dtv~~~~~~i~~~~GiS~~Lfn~d~~s~~~~~~SI~~d~----~~~~~~~~~-iE~~~N~~i--- 390 (513) -+++.++.+- +.-.+.+....++|-.++||-...+.|.. .+|+ .|-+.|. ..|-+++++ +..++++.+ T Consensus 262 ~~~e~~~~~~-sgl~~~l~~~~~~iaaa~~iP~t~L~G~s--~~Gl-asge~D~~~yyd~i~~~Qe~~l~p~le~l~~~i 337 (437) T protein:vir:52 262 NEYDRKELTF-TGLKDLLTEFRNAVAGAADMPVTILFGQS--VSGL-ASGDEDIQNYHEAIRRLQETRLRPIFEIIDPLI 337 (437) T ss_pred cceEEEecCc-CCHHHHHHHHHHHHHHHhcCchhhhcCcC--cccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2344443332 24447888999999999999964443442 2233 2333343 333344432 222344332 Q ss_pred -hhcc--cceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhC-CCHHHHHHHHHHHHHhhCcccccCccccc Q lcl|NC_015263. 391 -LLNG--MSKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMG-IDPVAFTGLLKVENEMLDLPEIMTPLSSS 466 (513) Q Consensus 391 -~~~~--~~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G-~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS 466 (513) .... ....|.+.|.++--.+.||+++..++.++- ... ++. .| ++|.+..+.+.- . + .+..+.-+ T Consensus 338 ~~~~~g~~~~~~~~~f~pL~~~s~kekae~~~~~a~a---~~~-~~~-~g~i~~~e~r~~L~~-~---g---~~~~i~~~ 405 (437) T protein:vir:52 338 CNELFGGLPADWWFEFVPLTTVKQEQQINMLNTFATA---ANT-LIQ-NGVLNEYQIANELRE-S---G---LFANISAE 405 (437) T ss_pred HHHhcCCCCCcceEEeCCcCCcCHHHHHHHHHHHHHH---HHH-HHh-cCCCCHHHHHHHHHh-c---C---CCCCCCcc Confidence 2221 122599999999999999999987665532 222 222 46 899998776632 1 1 11111111 Q ss_pred ccccccccccCCccccCCCCcCCCCcccccccCCCCCCC-CCCcc Q lcl|NC_015263. 467 FNTSGSDIAENAIKEKGKENGRPTNETTGNKDSDETQRA-KDKPA 510 (513) Q Consensus 467 ~T~Sg~~~~~~~~~~~~~~~grPt~et~~n~~~~~~~~~-~d~~~ 510 (513) -.....+ ...|. +...+.+.+....+ ..+|. T Consensus 406 ~~~~~~~------------~~~~~-~~~~~~~~~~~~~~~~~~~~ 437 (437) T protein:vir:52 406 HIEELKN------------ADEFA-GNFEEPEKMEGAQVQNSEDQ 437 (437) T ss_pred ccccccC------------CCCCC-CccCCCCCCCCCCCCCCCCC Confidence 1000000 00000 00111111111111 11111 No 87 >protein:vir:78641 Length: 278 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429941;genbank:gi:156603995;genbank:GeneID:5525387 Probab=97.06 E-value=0.00019 Score=41.03 Aligned_cols=259 Identities=11% Similarity=0.093 Sum_probs=138.4 Q ss_pred HhhcccccceEeeccchhhhhhcchhHHHHHHHHHHh-h----cChhHHHHHHHHHHHHhcceeEEEEEcCc--ceeeee Q lcl|NC_015263. 91 YANMPLYAYSVVPFKDISTANENKLKKELATVTEFLS-R----LNPKYNFSKIVKLAMTVDIFYGYVIDDKE--SVMIQQ 163 (513) Q Consensus 91 ~~~mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~-k----~n~k~~~~~i~~~~l~~g~~~gy~i~d~~--~~~iq~ 163 (513) +++|| ..++ .. ++... +.+...|. + +.-..+...++..++..|..|.+..-+.+ ...+.+ T Consensus 1 ia~l~---~~~~----~~---~~~~~---~~l~~lL~~~PN~~~t~~~f~~~~~~~ll~~Gna~~~i~r~~~G~~~~l~~ 67 (278) T protein:vir:78 1 MASLP---LKMY----ED---YKVVN---TEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFL 67 (278) T ss_pred Cccce---eEEE----ec---Ccccc---cHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCCEEEEEEECCCCcEEEEEE Confidence 44444 3332 11 11111 12233333 2 23567788888899999999999887644 478899 Q ss_pred cCcceeEEEEEECCeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEEec--Ccccc Q lcl|NC_015263. 164 FPNDICKISSVSGGVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKIN--ESSLT 241 (513) Q Consensus 164 lp~dyckIsg~~nG~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~~--~~~~~ 241 (513) +|+++|.+.--.+|. .+.+.++.- +...+.++..--+-|+.. .+... T Consensus 68 l~~~~v~v~~~~~~~-~~~y~~~~~------------------------------~g~~~~~~~~evih~~~~~~~~~~~ 116 (278) T protein:vir:78 68 LNPDVVEMLIENQSR-ELYYSIHAA------------------------------TGNKLIVHNMDMLHFKHIVASNMVQ 116 (278) T ss_pred ECCceeEEEEcCCCc-eEEEEEEcC------------------------------CceEEEEccccEEEECCCCCCCCee Confidence 999999987544442 222111110 012234444434445522 23456 Q ss_pred chhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhcccc--ceEEEecc Q lcl|NC_015263. 242 PVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTVPDN--VGVVTSPM 319 (513) Q Consensus 242 ~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~Lp~g--v~~v~sP~ 319 (513) |+|+..++...+--.....+.. . ....+-.-.+-+.| ..++.++++++.+.+++.+.+. +..+-..+ T Consensus 117 G~s~~~~~~~~i~~~~~~~~~~-~-~~~~~~~~~i~~~~---------~~l~~e~~~~~~~~~~~~~~~~g~~~vl~~g~ 185 (278) T protein:vir:78 117 GISPIDVLKNTTDFDNAVRTFN-L-TEMQKPDSFMLKYG---------SNVGKEKRQQVLEDFKQYYEENGGILFQEPGV 185 (278) T ss_pred eccHHHHHHHHHHHHHHHHHHH-H-HHhcCCCcEEEEeC---------CCCCHHHHHHHHHHHHHHhccCCCceecCCCc Confidence 8888887766554444433321 1 11111111111111 2367788888877777777442 22333333 Q ss_pred cccccccccccccchh---hhhhHHhhhhhhhhhhhhccCCC-cchHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhh-- Q lcl|NC_015263. 320 EIDTVSFDKDSSTDDS---VEKATKNFWDNAGVSQILFSSDN-KTSQGIAMSIA-TDEQFIFGVINQLERWLNRYLLL-- 392 (513) Q Consensus 320 ~~d~i~ld~~~~~~dt---v~~~~~~i~~~~GiS~~Lfn~d~-~s~~~~~~SI~-~d~~~~~~~~~~iE~~~N~~i~~-- 392 (513) ++..+.+. ..+.+. .+-..++|..+.||...++|... .+.+.+...-+ .....+.-+.++||..+|++|-. T Consensus 186 ~~~~l~~~--~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~sn~~~~~~~~~~~~l~P~~~~i~~~ln~~L~~~~ 263 (278) T protein:vir:78 186 EIEPLPKK--YVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNRFYLQHTLLPIVKQYEEEFNRKLLTKT 263 (278) T ss_pred eEEEccCC--hhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCChh Confidence 44444432 223333 44566889999999999987543 23333322222 22333446899999999998742 Q ss_pred c-ccceEEEEEecCC Q lcl|NC_015263. 393 N-GMSKYFKATMLEV 406 (513) Q Consensus 393 ~-~~~~~f~~~~l~~ 406 (513) + ..+..|+|-+-++ T Consensus 264 e~~~g~~~~f~~~~l 278 (278) T protein:vir:78 264 DREKIGILNLTLNLI 278 (278) T ss_pred HhcCCceEEEecccC Confidence 2 1245677776666 No 88 >protein:vir:4854 Length: 386 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049394;genbank:gi:9632422;genbank:GeneID:1258515 Probab=96.95 E-value=0.00024 Score=40.42 Aligned_cols=368 Identities=13% Similarity=0.111 Sum_probs=156.3 Q ss_pred CCCcc-chheeeeehhhhhhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHh Q lcl|NC_015263. 1 MVKNK-KKRLSMIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAV 79 (513) Q Consensus 1 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~ 79 (513) +.... +.|-+.- .+.+.++++.. +. ++..+.... .++.-.+- T Consensus 3 ~f~~~~~~~~~~~--~~~~~~~~~~~-----------------~~------------~~~~~~~~~------~v~~~~~~ 45 (386) T protein:vir:48 3 IFNITNLATESPP--ISQGGFFDITD-----------------PD------------FLSTLNGSE------WVSAESAL 45 (386) T ss_pred ccccccccccccc--ccccccccccc-----------------ch------------hcccccCCc------eechhhhh Confidence 12111 1111100 00000000000 00 000000000 01111223 Q ss_pred hcchHHHHHHHHhh-cccccceEeeccchhhhhhcchhHHHHHHHHHHhhcC----hhHHHHHHHHHHHHhcceeEEEEE Q lcl|NC_015263. 80 QSQQYQRLLNFYAN-MPLYAYSVVPFKDISTANENKLKKELATVTEFLSRLN----PKYNFSKIVKLAMTVDIFYGYVID 154 (513) Q Consensus 80 ~sg~~~rlidy~~~-mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n----~k~~~~~i~~~~l~~g~~~gy~i~ 154 (513) +.+.+.+.|+.+++ +..+...++ . +. ....+++-| ...+...++..++..|..|.++.- T Consensus 46 ~~~~v~~~i~~ia~~ia~~p~~~~----~---------~~---~~~l~~~pN~~~t~~~f~~~~~~~lll~Gna~~~i~r 109 (386) T protein:vir:48 46 RNSDLFSIINQLSNDLATVKLTAS----R---------KQ---LQGIIDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWR 109 (386) T ss_pred cchHHHHHHHHHHHhhccCceeec----c---------ch---hHHHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEE Confidence 55667777776654 233434442 0 11 122455555 445666778888999999999887 Q ss_pred cCcc--eeeeecCcceeEEEEEECCe-eEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceE Q lcl|NC_015263. 155 DKES--VMIQQFPNDICKISSVSGGV-YNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSI 231 (513) Q Consensus 155 d~~~--~~iq~lp~dyckIsg~~nG~-y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~ 231 (513) +.++ +.+.++|+++|+|.-..+|. ..|.+...- . . ....++++...-+ T Consensus 110 ~~~g~~~~L~~l~~~~v~v~~~~~~~~~~y~~~~~~---~-~-------------------------~~~~~~~~~~evi 160 (386) T protein:vir:48 110 NENGRDMKWEYLRPSQVSFNRLDNKDGIYYNITFDD---P-R-------------------------IPPKQHVPQGDVL 160 (386) T ss_pred CCCCcEEEEEEecCceeEEEEcCCCceEEEEEEecC---c-c-------------------------ccceeEecCccEE Confidence 6655 67899999999998555543 222221110 0 0 0123445554444 Q ss_pred EEEec-Ccc-ccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhc- Q lcl|NC_015263. 232 CIKIN-ESS-LTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTV- 308 (513) Q Consensus 232 ~ik~~-~~~-~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~L- 308 (513) -|+.. .+. ..|+||...+...+--.....+... .-..|-...-..|.+ .. .++.+++.++.+...... T Consensus 161 h~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~--~~~~ng~~~~~ii~~---~~----~~~~e~~~~~~~~~~~~~~ 231 (386) T protein:vir:48 161 HFKLLSVDGGLTSVSPLMALSRELNIQKASDKLTL--NSLKNALNANGILKI---KG----GGLLDFKTKLSRSRQAMKQ 231 (386) T ss_pred EecCCCCCCceeeccHHHHHHHHHHHHHHHHHHHH--HHHhccCCcceEEEe---CC----CCCHHHHHHHHHHHHHhhc Confidence 45532 222 4688888877665544444444321 122231111112221 11 133333333333222222 Q ss_pred -cccceEEEeccccccccccc-ccccchhhhhhHHhhhhhhhhhhhhccCCCcchHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015263. 309 -PDNVGVVTSPMEIDTVSFDK-DSSTDDSVEKATKNFWDNAGVSQILFSSDNKTSQGIAMSIATDEQFIFGVINQLERWL 386 (513) Q Consensus 309 -p~gv~~v~sP~~~d~i~ld~-~~~~~dtv~~~~~~i~~~~GiS~~Lfn~d~~s~~~~~~SI~~d~~~~~~~~~~iE~~~ 386 (513) ..++..+-..+++..+.++. +..--.+.+-..++|..+.||...++|...+.++.....+.--...+.-++++||..+ T Consensus 232 n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~e~~~~~~~~~~l~P~~~~ie~~l 311 (386) T protein:vir:48 232 MQGGPLVLDDLEEFTPLEIKSNVSQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSLDLYNKAVSRYLRPFLSEL 311 (386) T ss_pred CCCCceecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22333444444555554421 1111233455668899999999999976544444444444433333446899999999 Q ss_pred HHHHhhcccceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhh--CcccccCccc Q lcl|NC_015263. 387 NRYLLLNGMSKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENEML--DLPEIMTPLS 464 (513) Q Consensus 387 N~~i~~~~~~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L--~l~~~~~Pl~ 464 (513) |+.|-.. +.+.+.....-........+.++..-|. +++.|.-..+-.+ .++ +++..-.+ T Consensus 312 ~~~l~~~-----~~~~~~~~~~~d~~~~~~~~~~l~~~g~-----------~t~nE~r~~lg~~-~~~~~~~~~~~~~-- 372 (386) T protein:vir:48 312 SQKLSCD-----VDADILPAVDPTGSNSVSRINSMVKSGT-----------LAQNQGLYILQQA-EILPKELPEGENP-- 372 (386) T ss_pred HHhhcch-----hhcchhhhhccChHHHHHHHHHHHhCCC-----------cCHHHHHHHhhcC-CCCCccchhhcCC-- Confidence 9988542 2222111111222233333333333331 3333333222111 000 00000000 Q ss_pred ccccccccccccCCccccCCCCcCCCCcc Q lcl|NC_015263. 465 SSFNTSGSDIAENAIKEKGKENGRPTNET 493 (513) Q Consensus 465 TS~T~Sg~~~~~~~~~~~~~~~grPt~et 493 (513) ......||.+..+. T Consensus 373 ---------------~~~~~~gGd~~~~~ 386 (386) T protein:vir:48 373 ---------------NKTTLKGGEINGED 386 (386) T ss_pred ---------------CCCccCCCCCCCCC Confidence 00011222221111 No 89 >protein:vir:101289 Length: 395 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908829;genbank:gi:118725093;genbank:GeneID:4555860 Probab=96.95 E-value=0.00024 Score=40.39 Aligned_cols=374 Identities=13% Similarity=0.091 Sum_probs=169.9 Q ss_pred HHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhcchHHHHHHHHh-hcccccceEeeccc Q lcl|NC_015263. 28 ISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQQYQRLLNFYA-NMPLYAYSVVPFKD 106 (513) Q Consensus 28 ~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~~~rlidy~~-~mpt~dY~I~P~~~ 106 (513) -.++..+-...... . +....+.+..++.-.|.++..+++.|+.++ ++..+...++ . T Consensus 1 Mg~f~~lf~~~~~~------------------~--~~~~~~~~~~v~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~---~ 57 (395) T protein:vir:10 1 MSILEKIFKTRKDI------------------T--YMLDLDMIEDLSQQAYVKRLAIDSCIEFVARAVAQSHFKVL---E 57 (395) T ss_pred CchhhhhhccCccc------------------c--ccccchhccccchhhhhhhHHHHHHHHHHHHhhccceeEec---c Confidence 11111111111000 0 011123344455556677888999999988 4555555543 1 Q ss_pred hhhhhhcchhHHHHHHHHHHh-hcC----hhHHHHHHHHHHHHhcceeEEEEEcCcceeeeecCcceeEEEEEECCeeEE Q lcl|NC_015263. 107 ISTANENKLKKELATVTEFLS-RLN----PKYNFSKIVKLAMTVDIFYGYVIDDKESVMIQQFPNDICKISSVSGGVYNY 181 (513) Q Consensus 107 ~~~~~~~~~~~~y~~v~~~L~-k~n----~k~~~~~i~~~~l~~g~~~gy~i~d~~~~~iq~lp~dyckIsg~~nG~y~~ 181 (513) . +.... +.....|. +=| -..+...++..++..|..|.+.+.+... .++++-.|.+....+..+.. T Consensus 58 ~----~~~~~---~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~ 127 (395) T protein:vir:10 58 G----NRIQK---NDVYYKLNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSDSKEL---LIADSFYREEYALYDDIFKD 127 (395) T ss_pred C----Ccccc---chHHHHHHhccCcCCCHHHHHHHHHHHHhhCCceEEEEecCCCe---EecCCccceeEeecCcceeE Confidence 1 11111 22333333 222 4556677888888999988877665543 34455555555443332221 Q ss_pred EEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEEecCc--cccchhhHHHHHHhHHHHHHH Q lcl|NC_015263. 182 VIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKINES--SLTPVPPFAGTFDSIYDIHSF 259 (513) Q Consensus 182 ~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~~~~--~~~~ip~f~~v~~d~~di~~~ 259 (513) .. . . .| .....++...-+-|+.+.. ...+++|...+- ..++. T Consensus 128 ~~---~-~-----------------~~-----------~~~~~~~~~evih~~~~~~~~~~~G~spi~~~~-~~~~~--- 171 (395) T protein:vir:10 128 VT---V-K-----------------DY-----------TYQRTFTMQEVIYLKYNNNKVTHFVESLFEDYG-KIFGR--- 171 (395) T ss_pred EE---E-c-----------------Cc-----------eeeeeeccccEEEEccCCCCcccccchHHHHHH-HHHHH--- Confidence 11 0 0 00 0012344433444554221 223555543321 22111 Q ss_pred HHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhc----cccceEEEec--cccccccccccccc- Q lcl|NC_015263. 260 KDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTV----PDNVGVVTSP--MEIDTVSFDKDSST- 332 (513) Q Consensus 260 kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~L----p~gv~~v~sP--~~~d~i~ld~~~~~- 332 (513) ..+...+...+-.-|-+ . .-.++.++++++.+.+++.. -.+.+.++.+ +++..+++...... T Consensus 172 -----~~~~~~~~~~~~gii~~----~--~~~~~~e~~~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~ 240 (395) T protein:vir:10 172 -----MIGAQLKNYQIRGILKS----A--SSAYDEKNIEKLQAFTNKLFNTFNKNQLAIAPLIEGFDYEELSNGGKNSNM 240 (395) T ss_pred -----HHHHHHhcCCCceEEEe----C--CCCCCHHHHHHHHHHHHHHhccccccCcceEEcCCCceeeeccccccccch Confidence 11111121111111211 1 11356777777777666654 1244444333 45555544322211 Q ss_pred --ch---hhhhhHHhhhhhhhhhhhhccCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc-ccceEEEEEecCC Q lcl|NC_015263. 333 --DD---SVEKATKNFWDNAGVSQILFSSDNKTSQGIAMSIATDEQFIFGVINQLERWLNRYLLLN-GMSKYFKATMLEV 406 (513) Q Consensus 333 --~d---tv~~~~~~i~~~~GiS~~Lfn~d~~s~~~~~~SI~~d~~~~~~~~~~iE~~~N~~i~~~-~~~~~f~~~~l~~ 406 (513) .+ +..-..++|..+.||...++|++..+.+-...+.-++. +.-++.+||..+|+.|-.. .....++|.+-.. T Consensus 241 ~~~q~~e~~~~~~~~Ia~~f~VPp~~l~~~~sn~e~~~~~~~~~~--l~P~~~~ie~~l~~kL~~~~~~~~~~~f~~~~l 318 (395) T protein:vir:10 241 PFSELSELMRDAIKNVALMIGIPPGLIYGETADLEKNTLVFEKFC--LTPLLKKIQNELNAKLITQSMYLKDTRIEIVGV 318 (395) T ss_pred hHHHHHHHHHHHHHHHHHHhCCCHHHhcCcccCHHHHHHHHHHHH--HHHHHHHHHHHHHHhhcChhhhcccceecchhh Confidence 11 23344577999999999999887655555555544433 4568999999999987432 2222355555555 Q ss_pred CCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccCCCC Q lcl|NC_015263. 407 THFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKEN 486 (513) Q Consensus 407 T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~ 486 (513) ..-+.++.++.+.++..-|.-..--.-+.+|+.|.+ . -.-|..++|+- +.....+......++. T Consensus 319 ~~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~--------~--g~~d~~~~~~n---~~~~~~~~~~~~~~~~--- 382 (395) T protein:vir:10 319 NKKDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSD--------N--PELDEYLITKN---YEKANSGENDEKEKDE--- 382 (395) T ss_pred hccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC--------C--CCCceeeeccc---cccccccccccCcccc--- Confidence 555777888888877776633222222335655542 1 11233344332 2222211111100000 Q ss_pred cCCCCcccccccCCCCCC Q lcl|NC_015263. 487 GRPTNETTGNKDSDETQR 504 (513) Q Consensus 487 grPt~et~~n~~~~~~~~ 504 (513) ...++. +.+++++ T Consensus 383 ----~~~kgg-~~~~~g~ 395 (395) T protein:vir:10 383 ----NTLKGG-DEDESGD 395 (395) T ss_pred ----cccCCC-CCCCCCC Confidence 000111 1111111 No 90 >protein:vir:100650 Length: 395 # NCBI annotation: 77ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958604;genbank:gi:41189523;genbank:GeneID:2743796 Probab=96.95 E-value=0.00024 Score=40.39 Aligned_cols=374 Identities=13% Similarity=0.091 Sum_probs=169.9 Q ss_pred HHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhcchHHHHHHHHh-hcccccceEeeccc Q lcl|NC_015263. 28 ISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQQYQRLLNFYA-NMPLYAYSVVPFKD 106 (513) Q Consensus 28 ~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~~~rlidy~~-~mpt~dY~I~P~~~ 106 (513) -.++..+-...... . +....+.+..++.-.|.++..+++.|+.++ ++..+...++ . T Consensus 1 Mg~f~~lf~~~~~~------------------~--~~~~~~~~~~v~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~---~ 57 (395) T protein:vir:10 1 MSILEKIFKTRKDI------------------T--YMLDLDMIEDLSQQAYVKRLAIDSCIEFVARAVAQSHFKVL---E 57 (395) T ss_pred CchhhhhhccCccc------------------c--ccccchhccccchhhhhhhHHHHHHHHHHHHhhccceeEec---c Confidence 11111111111000 0 011123344455556677888999999988 4555555543 1 Q ss_pred hhhhhhcchhHHHHHHHHHHh-hcC----hhHHHHHHHHHHHHhcceeEEEEEcCcceeeeecCcceeEEEEEECCeeEE Q lcl|NC_015263. 107 ISTANENKLKKELATVTEFLS-RLN----PKYNFSKIVKLAMTVDIFYGYVIDDKESVMIQQFPNDICKISSVSGGVYNY 181 (513) Q Consensus 107 ~~~~~~~~~~~~y~~v~~~L~-k~n----~k~~~~~i~~~~l~~g~~~gy~i~d~~~~~iq~lp~dyckIsg~~nG~y~~ 181 (513) . +.... +.....|. +=| -..+...++..++..|..|.+.+.+... .++++-.|.+....+..+.. T Consensus 58 ~----~~~~~---~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~ 127 (395) T protein:vir:10 58 G----NRIQK---NDVYYKLNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSDSKEL---LIADSFYREEYALYDDIFKD 127 (395) T ss_pred C----Ccccc---chHHHHHHhccCcCCCHHHHHHHHHHHHhhCCceEEEEecCCCe---EecCCccceeEeecCcceeE Confidence 1 11111 22333333 222 4556677888888999988877665543 34455555555443332221 Q ss_pred EEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEEecCc--cccchhhHHHHHHhHHHHHHH Q lcl|NC_015263. 182 VIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKINES--SLTPVPPFAGTFDSIYDIHSF 259 (513) Q Consensus 182 ~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~~~~--~~~~ip~f~~v~~d~~di~~~ 259 (513) .. . . .| .....++...-+-|+.+.. ...+++|...+- ..++. T Consensus 128 ~~---~-~-----------------~~-----------~~~~~~~~~evih~~~~~~~~~~~G~spi~~~~-~~~~~--- 171 (395) T protein:vir:10 128 VT---V-K-----------------DY-----------TYQRTFTMQEVIYLKYNNNKVTHFVESLFEDYG-KIFGR--- 171 (395) T ss_pred EE---E-c-----------------Cc-----------eeeeeeccccEEEEccCCCCcccccchHHHHHH-HHHHH--- Confidence 11 0 0 00 0012344433444554221 223555543321 22111 Q ss_pred HHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhc----cccceEEEec--cccccccccccccc- Q lcl|NC_015263. 260 KDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTV----PDNVGVVTSP--MEIDTVSFDKDSST- 332 (513) Q Consensus 260 kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~L----p~gv~~v~sP--~~~d~i~ld~~~~~- 332 (513) ..+...+...+-.-|-+ . .-.++.++++++.+.+++.. -.+.+.++.+ +++..+++...... T Consensus 172 -----~~~~~~~~~~~~gii~~----~--~~~~~~e~~~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~ 240 (395) T protein:vir:10 172 -----MIGAQLKNYQIRGILKS----A--SSAYDEKNIEKLQAFTNKLFNTFNKNQLAIAPLIEGFDYEELSNGGKNSNM 240 (395) T ss_pred -----HHHHHHhcCCCceEEEe----C--CCCCCHHHHHHHHHHHHHHhccccccCcceEEcCCCceeeeccccccccch Confidence 11111121111111211 1 11356777777777666654 1244444333 45555544322211 Q ss_pred --ch---hhhhhHHhhhhhhhhhhhhccCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc-ccceEEEEEecCC Q lcl|NC_015263. 333 --DD---SVEKATKNFWDNAGVSQILFSSDNKTSQGIAMSIATDEQFIFGVINQLERWLNRYLLLN-GMSKYFKATMLEV 406 (513) Q Consensus 333 --~d---tv~~~~~~i~~~~GiS~~Lfn~d~~s~~~~~~SI~~d~~~~~~~~~~iE~~~N~~i~~~-~~~~~f~~~~l~~ 406 (513) .+ +..-..++|..+.||...++|++..+.+-...+.-++. +.-++.+||..+|+.|-.. .....++|.+-.. T Consensus 241 ~~~q~~e~~~~~~~~Ia~~f~VPp~~l~~~~sn~e~~~~~~~~~~--l~P~~~~ie~~l~~kL~~~~~~~~~~~f~~~~l 318 (395) T protein:vir:10 241 PFSELSELMRDAIKNVALMIGIPPGLIYGETADLEKNTLVFEKFC--LTPLLKKIQNELNAKLITQSMYLKDTRIEIVGV 318 (395) T ss_pred hHHHHHHHHHHHHHHHHHHhCCCHHHhcCcccCHHHHHHHHHHHH--HHHHHHHHHHHHHHhhcChhhhcccceecchhh Confidence 11 23344577999999999999887655555555544433 4568999999999987432 2222355555555 Q ss_pred CCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccCCCC Q lcl|NC_015263. 407 THFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKEN 486 (513) Q Consensus 407 T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~ 486 (513) ..-+.++.++.+.++..-|.-..--.-+.+|+.|.+ . -.-|..++|+- +.....+......++. T Consensus 319 ~~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~--------~--g~~d~~~~~~n---~~~~~~~~~~~~~~~~--- 382 (395) T protein:vir:10 319 NKKDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSD--------N--PELDEYLITKN---YEKANSGENDEKEKDE--- 382 (395) T ss_pred hccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC--------C--CCCceeeeccc---cccccccccccCcccc--- Confidence 555777888888877776633222222335655542 1 11233344332 2222211111100000 Q ss_pred cCCCCcccccccCCCCCC Q lcl|NC_015263. 487 GRPTNETTGNKDSDETQR 504 (513) Q Consensus 487 grPt~et~~n~~~~~~~~ 504 (513) ...++. +.+++++ T Consensus 383 ----~~~kgg-~~~~~g~ 395 (395) T protein:vir:10 383 ----NTLKGG-DEDESGD 395 (395) T ss_pred ----cccCCC-CCCCCCC Confidence 000111 1111111 No 91 >protein:vir:9507 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835554;genbank:gi:30043953;genbank:GeneID:1260535 Probab=96.95 E-value=0.00024 Score=40.39 Aligned_cols=374 Identities=13% Similarity=0.091 Sum_probs=169.9 Q ss_pred HHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhcchHHHHHHHHh-hcccccceEeeccc Q lcl|NC_015263. 28 ISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQQYQRLLNFYA-NMPLYAYSVVPFKD 106 (513) Q Consensus 28 ~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~~~rlidy~~-~mpt~dY~I~P~~~ 106 (513) -.++..+-...... . +....+.+..++.-.|.++..+++.|+.++ ++..+...++ . T Consensus 1 Mg~f~~lf~~~~~~------------------~--~~~~~~~~~~v~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~---~ 57 (395) T protein:vir:95 1 MSILEKIFKTRKDI------------------T--YMLDLDMIEDLSQQAYVKRLAIDSCIEFVARAVAQSHFKVL---E 57 (395) T ss_pred CchhhhhhccCccc------------------c--ccccchhccccchhhhhhhHHHHHHHHHHHHhhccceeEec---c Confidence 11111111111000 0 011123344455556677888999999988 4555555543 1 Q ss_pred hhhhhhcchhHHHHHHHHHHh-hcC----hhHHHHHHHHHHHHhcceeEEEEEcCcceeeeecCcceeEEEEEECCeeEE Q lcl|NC_015263. 107 ISTANENKLKKELATVTEFLS-RLN----PKYNFSKIVKLAMTVDIFYGYVIDDKESVMIQQFPNDICKISSVSGGVYNY 181 (513) Q Consensus 107 ~~~~~~~~~~~~y~~v~~~L~-k~n----~k~~~~~i~~~~l~~g~~~gy~i~d~~~~~iq~lp~dyckIsg~~nG~y~~ 181 (513) . +.... +.....|. +=| -..+...++..++..|..|.+.+.+... .++++-.|.+....+..+.. T Consensus 58 ~----~~~~~---~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~ 127 (395) T protein:vir:95 58 G----NRIQK---NDVYYKLNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSDSKEL---LIADSFYREEYALYDDIFKD 127 (395) T ss_pred C----Ccccc---chHHHHHHhccCcCCCHHHHHHHHHHHHhhCCceEEEEecCCCe---EecCCccceeEeecCcceeE Confidence 1 11111 22333333 222 4556677888888999988877665543 34455555555443332221 Q ss_pred EEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEEecCc--cccchhhHHHHHHhHHHHHHH Q lcl|NC_015263. 182 VIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKINES--SLTPVPPFAGTFDSIYDIHSF 259 (513) Q Consensus 182 ~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~~~~--~~~~ip~f~~v~~d~~di~~~ 259 (513) .. . . .| .....++...-+-|+.+.. ...+++|...+- ..++. T Consensus 128 ~~---~-~-----------------~~-----------~~~~~~~~~evih~~~~~~~~~~~G~spi~~~~-~~~~~--- 171 (395) T protein:vir:95 128 VT---V-K-----------------DY-----------TYQRTFTMQEVIYLKYNNNKVTHFVESLFEDYG-KIFGR--- 171 (395) T ss_pred EE---E-c-----------------Cc-----------eeeeeeccccEEEEccCCCCcccccchHHHHHH-HHHHH--- Confidence 11 0 0 00 0012344433444554221 223555543321 22111 Q ss_pred HHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhc----cccceEEEec--cccccccccccccc- Q lcl|NC_015263. 260 KDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTV----PDNVGVVTSP--MEIDTVSFDKDSST- 332 (513) Q Consensus 260 kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~L----p~gv~~v~sP--~~~d~i~ld~~~~~- 332 (513) ..+...+...+-.-|-+ . .-.++.++++++.+.+++.. -.+.+.++.+ +++..+++...... T Consensus 172 -----~~~~~~~~~~~~gii~~----~--~~~~~~e~~~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~ 240 (395) T protein:vir:95 172 -----MIGAQLKNYQIRGILKS----A--SSAYDEKNIEKLQAFTNKLFNTFNKNQLAIAPLIEGFDYEELSNGGKNSNM 240 (395) T ss_pred -----HHHHHHhcCCCceEEEe----C--CCCCCHHHHHHHHHHHHHHhccccccCcceEEcCCCceeeeccccccccch Confidence 11111121111111211 1 11356777777777666654 1244444333 45555544322211 Q ss_pred --ch---hhhhhHHhhhhhhhhhhhhccCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc-ccceEEEEEecCC Q lcl|NC_015263. 333 --DD---SVEKATKNFWDNAGVSQILFSSDNKTSQGIAMSIATDEQFIFGVINQLERWLNRYLLLN-GMSKYFKATMLEV 406 (513) Q Consensus 333 --~d---tv~~~~~~i~~~~GiS~~Lfn~d~~s~~~~~~SI~~d~~~~~~~~~~iE~~~N~~i~~~-~~~~~f~~~~l~~ 406 (513) .+ +..-..++|..+.||...++|++..+.+-...+.-++. +.-++.+||..+|+.|-.. .....++|.+-.. T Consensus 241 ~~~q~~e~~~~~~~~Ia~~f~VPp~~l~~~~sn~e~~~~~~~~~~--l~P~~~~ie~~l~~kL~~~~~~~~~~~f~~~~l 318 (395) T protein:vir:95 241 PFSELSELMRDAIKNVALMIGIPPGLIYGETADLEKNTLVFEKFC--LTPLLKKIQNELNAKLITQSMYLKDTRIEIVGV 318 (395) T ss_pred hHHHHHHHHHHHHHHHHHHhCCCHHHhcCcccCHHHHHHHHHHHH--HHHHHHHHHHHHHHhhcChhhhcccceecchhh Confidence 11 23344577999999999999887655555555544433 4568999999999987432 2222355555555 Q ss_pred CCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccCCCC Q lcl|NC_015263. 407 THFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKEN 486 (513) Q Consensus 407 T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~ 486 (513) ..-+.++.++.+.++..-|.-..--.-+.+|+.|.+ . -.-|..++|+- +.....+......++. T Consensus 319 ~~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~--------~--g~~d~~~~~~n---~~~~~~~~~~~~~~~~--- 382 (395) T protein:vir:95 319 NKKDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSD--------N--PELDEYLITKN---YEKANSGENDEKEKDE--- 382 (395) T ss_pred hccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC--------C--CCCceeeeccc---cccccccccccCcccc--- Confidence 555777888888877776633222222335655542 1 11233344332 2222211111100000 Q ss_pred cCCCCcccccccCCCCCC Q lcl|NC_015263. 487 GRPTNETTGNKDSDETQR 504 (513) Q Consensus 487 grPt~et~~n~~~~~~~~ 504 (513) ...++. +.+++++ T Consensus 383 ----~~~kgg-~~~~~g~ 395 (395) T protein:vir:95 383 ----NTLKGG-DEDESGD 395 (395) T ss_pred ----cccCCC-CCCCCCC Confidence 000111 1111111 No 92 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=96.81 E-value=0.00032 Score=39.72 Aligned_cols=383 Identities=9% Similarity=0.027 Sum_probs=167.6 Q ss_pred eehhhhhhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHH-------------- Q lcl|NC_015263. 12 IDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDL-------------- 77 (513) Q Consensus 12 ~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~l-------------- 77 (513) ++-+. .+.++..+..| .....-++.+-+|+ T Consensus 1 ~~~~~----------------------------------~~~i~~l~~~~--~~~~~r~~~l~~Yy~G~~~i~~~~~~~~ 44 (441) T protein:vir:80 1 MNSDE----------------------------------LALIEGMYDRI--QRLSSWHCCIEGYYEGSNRVRDLGVAIP 44 (441) T ss_pred CCccH----------------------------------HHHHHHHHHHH--HHHHHHHHHHHHHHhcCCcchhcCcccc Confidence 22111 11222222222 12222233333332 Q ss_pred ------HhhcchHHHHHHHHhhcccccceEeeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcceeEE Q lcl|NC_015263. 78 ------AVQSQQYQRLLNFYANMPLYAYSVVPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDIFYGY 151 (513) Q Consensus 78 ------Y~~sg~~~rlidy~~~mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~~~gy 151 (513) ....+.-+.++|-++.-..++. |...+. .+ +-...+.-++......+.+.+++.|..|.+ T Consensus 45 ~~~~~~k~~~n~~~~ivd~~~~~l~~~g----~~~~d~-------~~---l~~i~~~n~~~~~~~~~~~~~~~~G~a~~~ 110 (441) T protein:vir:80 45 PELQRVQTVVSWPGIAVDALEERLDWLG----WTNGDG-------YG---LDGVYAANRLATASCDVHLDALIFGLSFVA 110 (441) T ss_pred hhhhhhhhhcchHHHHHHHHHhhhcccc----ccCCCh-------HH---HHHHHHhcCHHHHHHHHHHHHhhcCeeEEE Confidence 2333444444554443322221 322111 11 223345567888899999999999999999 Q ss_pred EEEcCcc-eeeeecCcceeEEEEE-ECCeeEEEEEeeeccCcch--hc-cccHHHHHHHHHHhhhhhccCcccccCeeec Q lcl|NC_015263. 152 VIDDKES-VMIQQFPNDICKISSV-SGGVYNYVIDLDALVSADI--VD-YYPKEIQEAVNKYTTMKKGNNKSASNWYEIQ 226 (513) Q Consensus 152 ~i~d~~~-~~iq~lp~dyckIsg~-~nG~y~~~fD~syFd~~~~--L~-~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~ 226 (513) ...|.++ ..+..++|+.|.++-= ..+....++=..+-+.... .. ++|.++.. |.. .. ...|.... T Consensus 111 v~~d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~~~~~~~~~~~~~~~vy~~~~~~~----~~~-----~~-~~~~~~~~ 180 (441) T protein:vir:80 111 IIPHGDGTVSVRPQSPKNCTGKFSADGSRLDAGLVVQQTCDPEVVEAELLLPDVIVQ----VER-----RG-SREWVEVD 180 (441) T ss_pred EEeCCCCceEEEEEccceEEEEEeCCCCceeEEEEEEEEecCceEEEEEEecCeEEE----EEE-----cC-Ccceeecc Confidence 8887776 4689999999987543 3455666654444222221 11 23433322 100 00 01122211 Q ss_pred C-----CceEEEEe-cC---ccccchhhHHHHHHhHHHHHHHHHH--HhhHhhhhhceeeeeeeccccCCCCCccccCHH Q lcl|NC_015263. 227 D-----KNSICIKI-NE---SSLTPVPPFAGTFDSIYDIHSFKDL--RNDKAELQNYKLLIQKLETRSSNDNNDFTLDMP 295 (513) Q Consensus 227 ~-----~kt~~ik~-~~---~~~~~ip~f~~v~~d~~di~~~kdL--~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~ 295 (513) . ..-.++.+ +. ..++|.+-++..+.++.|.-+.--. -.+.+-..+ .+++- . |-+.+++..+.. T Consensus 181 ~~~~~~g~vPvv~~~n~~~~~~~~G~s~l~~~v~~liDa~~~~~s~~~~~~~~~~~-~~~~i---~--G~~~~~~~~~~~ 254 (441) T protein:vir:80 181 RIPNVLGAVPLVPIVNRRRTSRIDGRSEITRSIRAYTDEAVRTLLGQSVNRDFYAY-PQRWV---T--GVSADEFSQPGW 254 (441) T ss_pred ccccCCCceeEEEeeccccCCccCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcC-ceeee---e--cCCccccccchh Confidence 1 11112222 22 2334555555444444432221111 112221211 22211 1 101111111110 Q ss_pred HHHHHHHHHHHhcccc-ceEEEeccccccccccccc-cc-chhhhhhHHhhhhhhhhhhhhccCCCc---chHHHHHHHH Q lcl|NC_015263. 296 MMNYFHEALSMTVPDN-VGVVTSPMEIDTVSFDKDS-ST-DDSVEKATKNFWDNAGVSQILFSSDNK---TSQGIAMSIA 369 (513) Q Consensus 296 ~~~~~~~~ik~~Lp~g-v~~v~sP~~~d~i~ld~~~-~~-~dtv~~~~~~i~~~~GiS~~Lfn~d~~---s~~~~~~SI~ 369 (513) .+ .... + -++|.+ -+..+. --.++... .. -+.+..-.+.+....+++.--|++... |+.+++.... T Consensus 255 ~~-~~~~-i-~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~ 326 (441) T protein:vir:80 255 VL-SMAS-V-WAVDKDDDGDTPN-----VGSFPVNSPTPYSDQMRLLAQLTAGEAAVPERYFGFITSNPPSGEALAAEES 326 (441) T ss_pred hh-cccc-c-ccCCCCCCCCcce-----eEecCccchHHHHHHHHHHHHHHhcccCCCHHHhccCCCcchHHHHHHHHHH Confidence 00 0000 0 012211 111110 01122111 11 123455556677777887667766542 5555666655 Q ss_pred HHHHHHHHHHHHHHHHHHH----HHh---hcc-c---ceEEEEEecCCCCccHHHHHHHHHHHHhcCCc--HHHHHHHHh Q lcl|NC_015263. 370 TDEQFIFGVINQLERWLNR----YLL---LNG-M---SKYFKATMLEVTHFSKKEAHDRYITDAQYGFP--VKVYLASLM 436 (513) Q Consensus 370 ~d~~~~~~~~~~iE~~~N~----~i~---~~~-~---~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~--~~~~laa~~ 436 (513) .-...+....+.++..+.+ .+. ... . ....++.|-+..+-|..+.++.+.++.+-|.+ +...+...+ T Consensus 327 ~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~~~~~~i~~~f~~~~~~~~~e~ad~~~kl~~~g~~~~s~~~~~~~l 406 (441) T protein:vir:80 327 RLVKRAERRQTSFGQGWLSVGFLAAKALDSRVDEADFFGDVGLRWRDASTPTRAATADAVTKLVGAGILPADSRTVLEML 406 (441) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccceeeeEEeCCCCCcCHHHHHHHHHHHHhcCcccccHHHHHHhC Confidence 5555544433333332222 111 111 1 13578899999999999999999999998854 344566778 Q ss_pred CCCHHHHHHHHHHHHHhhC---cccccCcccccccccccccccCCccccCCCCcCCCCccccc Q lcl|NC_015263. 437 GIDPVAFTGLLKVENEMLD---LPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETTGN 496 (513) Q Consensus 437 G~~p~~~~~~~~~E~e~L~---l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~~n 496 (513) |+++.+.-.+.+.+.+.-+ --.-..+.++ .+- T Consensus 407 ~~~~~e~~~~~~e~~e~~~~~~~~~~~~~~~~----------------------------~~~ 441 (441) T protein:vir:80 407 GLDDVQVEAVMRHRAESSDPLAVLAGAISRQT----------------------------NEV 441 (441) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHhhhhhccc----------------------------ccC Confidence 9998877654443222211 0000111111 010 No 93 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=96.79 E-value=0.00034 Score=39.60 Aligned_cols=412 Identities=11% Similarity=0.036 Sum_probs=179.1 Q ss_pred ccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhc--------------------chHHHHHHHHhhcccccceEeeccc Q lcl|NC_015263. 47 LTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQS--------------------QQYQRLLNFYANMPLYAYSVVPFKD 106 (513) Q Consensus 47 ~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~s--------------------g~~~rlidy~~~mpt~dY~I~P~~~ 106 (513) .++-++.+++.++.| ......+..+-.|+.... +.-+.++|..+.-..++.+..| T Consensus 1 ~~t~~d~i~~L~~~~--~~~~~r~~~~~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~--- 75 (480) T protein:vir:78 1 MTTYHEHVERLQGLL--ARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRIS--- 75 (480) T ss_pred CCCHHHHHHHHHHHH--HHHHHHHHHHHHHHhccccchhcccccchhhhhhhhhcchHHHHHHHHHhhhccCceecC--- Confidence 344556666666654 444445555555554443 3334444544444444444332 Q ss_pred hhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcceeEEEEE------cCcc-eeeeecCcceeEEEEE--ECC Q lcl|NC_015263. 107 ISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDIFYGYVID------DKES-VMIQQFPNDICKISSV--SGG 177 (513) Q Consensus 107 ~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~~~gy~i~------d~~~-~~iq~lp~dyckIsg~--~nG 177 (513) . .....+... ..++.-++...+..+.+.+++.|..|.+... |.++ .-+..++|..|.++-= ..+ T Consensus 76 -~---d~~~~~~l~---~i~~~N~~~~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~~~~~i~~~~p~~~~~i~D~~~~~ 148 (480) T protein:vir:78 76 -E---DSEGLEELW---NWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTR 148 (480) T ss_pred -C---CchhHHHHH---HHHHhcCHHHHHHHHHHHHhhcCceEEEeecCccccCCCCCeeEEEEEcccceEEEEcCCCcc Confidence 1 111222233 3355667888899999999999999887652 2333 4678889998887652 234 Q ss_pred eeEEEEEeeeccC----cc--hhccc-cHHHHHHHHHHhhhhhccCcccccCeeecC--Cce----EEEEe-cC---ccc Q lcl|NC_015263. 178 VYNYVIDLDALVS----AD--IVDYY-PKEIQEAVNKYTTMKKGNNKSASNWYEIQD--KNS----ICIKI-NE---SSL 240 (513) Q Consensus 178 ~y~~~fD~syFd~----~~--~L~~~-p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~--~kt----~~ik~-~~---~~~ 240 (513) ...+++ .++.. .. ....| |.++.. |. ..+.....|..... +|. .++.+ +. +.+ T Consensus 149 ~~~~~i--~~~~~~d~~~~~~~~~~y~~~~~~~----~~----~~~~~~~~~~~~~~~~~~~~g~vPvv~f~n~~~~~~~ 218 (480) T protein:vir:78 149 RVTRAV--RLYTTRDDVAVPDRATLYLPDETVP----LR----RNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNR 218 (480) T ss_pred ceEEEE--EEEEeecCCcceEEEEEEeCCeEEE----EE----ecCCCcccccccccccccCCCCcceEEeecccccCCc Confidence 455443 22211 10 11111 211111 10 00111112321110 111 12222 22 223 Q ss_pred cchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccc--cCCCCCccccCH--HHHHHHHHHHHHhccccceEEE Q lcl|NC_015263. 241 TPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETR--SSNDNNDFTLDM--PMMNYFHEALSMTVPDNVGVVT 316 (513) Q Consensus 241 ~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~--~~n~~~~~~vd~--~~~~~~~~~ik~~Lp~gv~~v~ 316 (513) +|++-+..-+.++.|.-+.--. +....++- ...|.. -|-+.+++..+. .........+ -+++.+-+- T Consensus 219 ~G~sdi~~~i~~l~Da~~~~~s-~~~~~~~~-----~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~-- 289 (480) T protein:vir:78 219 YGRSEISPELRKVTDAASRTLM-NLQSASQI-----LGTPLRVISGVTTDELTNDGENTTLDIYYGRI-LTLASEAAK-- 289 (480) T ss_pred cCccchhHHHHHHHHHHHHHHH-HHHHHHHh-----hcchhhhhhCCCccccccccccchhhhhhhhh-ccCCCCCce-- Confidence 4444433322333222221111 11111110 111210 011111111110 0111111111 112222111 Q ss_pred eccccccccccccc--ccchhhhhhHHhhhhhhhhhhhhccCCCc---chHHHHHHHHHHHHHHHHHHHHHHHHHHHHH- Q lcl|NC_015263. 317 SPMEIDTVSFDKDS--STDDSVEKATKNFWDNAGVSQILFSSDNK---TSQGIAMSIATDEQFIFGVINQLERWLNRYL- 390 (513) Q Consensus 317 sP~~~d~i~ld~~~--~~~dtv~~~~~~i~~~~GiS~~Lfn~d~~---s~~~~~~SI~~d~~~~~~~~~~iE~~~N~~i- 390 (513) -..+++.. .--+.+.....+++...|+....|++.+. |+.+++.....-...+.+..+.+..-+.+.+ T Consensus 290 ------~~~~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~r 363 (480) T protein:vir:78 290 ------ISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMR 363 (480) T ss_pred ------EEecCccCHHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 12222211 11134555656677778888888987653 4445555555555555444443333222211 Q ss_pred ---h--hccc---ceEEEEEecCCCCccHHHHHHHHHHHHhcC--CcHHHHHHHHhCCCHHHHHHHHHHHHHh-hC-ccc Q lcl|NC_015263. 391 ---L--LNGM---SKYFKATMLEVTHFSKKEAHDRYITDAQYG--FPVKVYLASLMGIDPVAFTGLLKVENEM-LD-LPE 458 (513) Q Consensus 391 ---~--~~~~---~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G--~~~~~~laa~~G~~p~~~~~~~~~E~e~-L~-l~~ 458 (513) . .... .+.+++.|-+..+-|..+.++.+.++.+-| ..+...+...+|+++.+.-.+.+.+.+. -+ ++. T Consensus 364 l~~~~~~~~~~~~~~~i~v~w~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~lg~~~d~~~e~~~~~~~~~~~~~~~ 443 (480) T protein:vir:78 364 IAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDT 443 (480) T ss_pred HHHHHcCCCccccceeeeEEecCCCCCCHHHHHHHHHHHHHhcccCCCHHHHHhcCCCCHhHHHHHHHHHHHHHHHHHHH Confidence 1 1111 125788898888899999999999998876 3456666778999999877765543222 11 111 Q ss_pred ccCcccccccccccccccCCccccCCCCcCCCCcccccccCCCCCCCCCC Q lcl|NC_015263. 459 IMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETTGNKDSDETQRAKDK 508 (513) Q Consensus 459 ~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~~n~~~~~~~~~~d~ 508 (513) ..-|. ..+ ....+++..+..| +......++.+|++.+ T Consensus 444 ~~~~~-------~~~---~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~ 480 (480) T protein:vir:78 444 LYSTT-------KAQ---ADATPKPTVTETK---TETQTSPSGFNRTKTR 480 (480) T ss_pred hhccc-------cCC---CccccCCCCCCCC---CccCCCcccCCCcCCC Confidence 11111 110 0011111111111 1122233344444444 No 94 >protein:vir:100187 Length: 385 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025029;genbank:gi:48697262;genbank:GeneID:2948285 Probab=96.78 E-value=0.00035 Score=39.56 Aligned_cols=363 Identities=12% Similarity=0.081 Sum_probs=159.6 Q ss_pred eeeeehhhhhhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhcchHHHHH Q lcl|NC_015263. 9 LSMIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQQYQRLL 88 (513) Q Consensus 9 ~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~~~rli 88 (513) .++.+ +.. .-.+..+. ...+ .....-+.+-...+ ...++.--+-..+.+++.| T Consensus 1 Mg~~~--------------~~~-~~~~~~~~---~~~~---~~~~~~~~~~~~~~------~~~v~~~~al~~~~v~~~i 53 (385) T protein:vir:10 1 MGLLT--------------PRN-FNKRKAKN---MVYP---SNPAFFTTTVGGMQ------LSYVSALSALQNTNVYSVI 53 (385) T ss_pred Ccccc--------------chh-cccccccc---cccc---cchhhhhhhccccC------ccccCHHHhhccHHHHHHH Confidence 00000 000 00000000 0000 00000000000000 0011111233456677788 Q ss_pred HHHhh-cccccceEeeccchhhhhhcchhHHHHHHHHHHhhcC----hhHHHHHHHHHHHHhcceeEEEEEcCcceeeee Q lcl|NC_015263. 89 NFYAN-MPLYAYSVVPFKDISTANENKLKKELATVTEFLSRLN----PKYNFSKIVKLAMTVDIFYGYVIDDKESVMIQQ 163 (513) Q Consensus 89 dy~~~-mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n----~k~~~~~i~~~~l~~g~~~gy~i~d~~~~~iq~ 163 (513) +.+++ +..+...++ +. .....|++-| ...+...++..++..|..|.++..+ . +.- T Consensus 54 ~~ia~~ia~~p~~v~-----~~-----------~~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~r~--~--~~~ 113 (385) T protein:vir:10 54 NRIASDVASAHFKTE-----NT-----------ATLNRLESPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQ--N--LEH 113 (385) T ss_pred HHHHHHHhhCceeee-----cc-----------chhhhhhcCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcC--c--eeE Confidence 87766 344433442 10 1122344433 3455666777888899999998754 2 345 Q ss_pred cCcceeEEEEEECC-eeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEEecC----c Q lcl|NC_015263. 164 FPNDICKISSVSGG-VYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKINE----S 238 (513) Q Consensus 164 lp~dyckIsg~~nG-~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~~~----~ 238 (513) +|++.++|.-..++ .+.|.+... .. ..-+.++..--+.|+... + T Consensus 114 ~p~~~~~v~~~~~~~~~~~~~~~~---~~----------------------------~~~~~~~~~eiihik~~~~~~~~ 162 (385) T protein:vir:10 114 IPNSDVQINYLPGNMGIVYTVLES---ND----------------------------RPQMVLRQDQMLHFRLMPDPQYR 162 (385) T ss_pred eecCCceEEEEEcCCceEEEEEEc---CC----------------------------ceEEEEccccEEEeccCCCCccc Confidence 66666666544442 222211100 00 012345554456666421 2 Q ss_pred cccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhcc---c-cceE Q lcl|NC_015263. 239 SLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTVP---D-NVGV 314 (513) Q Consensus 239 ~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~Lp---~-gv~~ 314 (513) ...|+||...+...+--....++... .-..|-...-..|-+ ++ ...+.++++.+.+.+++..- . ++.. T Consensus 163 ~~~G~s~i~~~~~~i~~~~~~~~~~~--~~~~ng~~~~gil~~-----~~-~~~~~e~~~~~~~~~~~~~~~~n~~~~~v 234 (385) T protein:vir:10 163 YLIGRSPLESLQNALNLDDKASKSNM--SAMENQINPAGKLTI-----SN-YLSDGKDLESAREEFEKANTGDNSGRLMV 234 (385) T ss_pred ccccccHHHHHHHHHHHHHHHHHHHH--HHHhccCCcceEEEe-----CC-CCCCHHHHHHHHHHHHHHhCccccCCccc Confidence 34588887776665544444443221 122331111111221 11 12344566666666666541 1 2333 Q ss_pred EEecccccccccccccccchh----hhhhHHhhhhhhhhhhhhccCCC-cchHHHHHH-HHHH-HHHHHHHHHHHHHHHH Q lcl|NC_015263. 315 VTSPMEIDTVSFDKDSSTDDS----VEKATKNFWDNAGVSQILFSSDN-KTSQGIAMS-IATD-EQFIFGVINQLERWLN 387 (513) Q Consensus 315 v~sP~~~d~i~ld~~~~~~dt----v~~~~~~i~~~~GiS~~Lfn~d~-~s~~~~~~S-I~~d-~~~~~~~~~~iE~~~N 387 (513) +-..+++..+.+. ..+.+. .+-..++|..+.||...++|+.. .++...+.+ .... ..-+.-++++||..+| T Consensus 235 l~~g~~~~~l~~~--~~d~~~l~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~~sn~eq~~~~~~~~l~P~~~~ie~~l~ 312 (385) T protein:vir:10 235 LPDGFDYTQLEMK--TDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQHSNIDQIKATYLANLNSYVNPIVDELR 312 (385) T ss_pred cCCCceEEecCCC--hhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCcccccHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3333444444432 222232 33445779999999999997643 222221111 1111 1123358888889899 Q ss_pred HHHhhcccceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHH-HhhCcccccCccccc Q lcl|NC_015263. 388 RYLLLNGMSKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVEN-EMLDLPEIMTPLSSS 466 (513) Q Consensus 388 ~~i~~~~~~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~-e~L~l~~~~~Pl~TS 466 (513) +.|-. ..|+|.+-+....+.++.++.+.++.+-|. ++|.|+-.++-.|- .-.+.+...+|.. T Consensus 313 ~~l~~----~~~~f~~~~ll~~d~~~~~~~~~~~~~~G~-----------~T~NE~R~~~g~~p~p~~~~~~~~~~~~-- 375 (385) T protein:vir:10 313 LKMNA----PDLELDIKDMLDVDDSALINQVSNLAKSGV-----------LGAEQAQFILTRSGFLPDNLPEFKPLTT-- 375 (385) T ss_pred HhhCC----ceEEeechhhhccCHHHHHHHHHHHHhCCC-----------cCHHHHHHHhCCCccCCCCCccccCccc-- Confidence 87743 357887777777889999999888887772 24444333221110 0001111111111 Q ss_pred ccccccccccCCccccCCCCcCCCCcccccccCCCCCC Q lcl|NC_015263. 467 FNTSGSDIAENAIKEKGKENGRPTNETTGNKDSDETQR 504 (513) Q Consensus 467 ~T~Sg~~~~~~~~~~~~~~~grPt~et~~n~~~~~~~~ 504 (513) ..=|+ +++++ T Consensus 376 -~~~~g---------------------------~~~dn 385 (385) T protein:vir:10 376 -QVKGG---------------------------DEGDN 385 (385) T ss_pred -ccCCC---------------------------CCCCC Confidence 00011 11111 No 95 >protein:vir:4952 Length: 386 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049928;genbank:gi:9632899;genbank:GeneID:1262075 Probab=96.75 E-value=0.00037 Score=39.42 Aligned_cols=370 Identities=10% Similarity=0.058 Sum_probs=154.5 Q ss_pred CCCc-cchheeeeehhhhhhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHh Q lcl|NC_015263. 1 MVKN-KKKRLSMIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAV 79 (513) Q Consensus 1 ~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~ 79 (513) +.+. +++|-+.-. .-.++ -++..... ++... +. +.++.--+- T Consensus 3 ~f~~~~~~~~~~~~--~~~~~-----------~~~~~~~~-----~~~~~-------------~~------~~v~~~~al 45 (386) T protein:vir:49 3 IFNITNLATESPPI--NQESF-----------FDIADSDF-----LASLN-------------SS------EWVSAENAL 45 (386) T ss_pred hhhhhccCCCCccc--chhhh-----------hhhhhccc-----ccccc-------------CC------ceechhhhh Confidence 1111 111111000 00000 00000000 00000 00 011111122 Q ss_pred hcchHHHHHHHHhh-cccccceEeeccchhhhhhcchhHHHHHHHHHHhhcC----hhHHHHHHHHHHHHhcceeEEEEE Q lcl|NC_015263. 80 QSQQYQRLLNFYAN-MPLYAYSVVPFKDISTANENKLKKELATVTEFLSRLN----PKYNFSKIVKLAMTVDIFYGYVID 154 (513) Q Consensus 80 ~sg~~~rlidy~~~-mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n----~k~~~~~i~~~~l~~g~~~gy~i~ 154 (513) +++.+.+.|+.+++ +..+...++ . +.+. ..+++-| ...+...++..++..|..|.++.- T Consensus 46 ~~~~v~~~i~~ia~~ia~~p~~~~----~---------~~~~---~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r 109 (386) T protein:vir:49 46 KNSDLFSIISQLSNDLATAKITTS----R---------KQLQ---GIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWR 109 (386) T ss_pred ccHHHHHHHHHHHHHhhhCceeec----c---------chhh---hhhhccCCCCCHHHHHHHHHHHhhhcCCEEEEEEE Confidence 35566677776665 444444543 1 1111 1233333 456667788888999999999875 Q ss_pred cC--cceeeeecCcceeEEEEEECC-eeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceE Q lcl|NC_015263. 155 DK--ESVMIQQFPNDICKISSVSGG-VYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSI 231 (513) Q Consensus 155 d~--~~~~iq~lp~dyckIsg~~nG-~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~ 231 (513) +. ..+.+.++|+++|+|.-..+| .+.|.+-+. .. ....=+.++...-+ T Consensus 110 ~~~g~~~~l~~i~~~~v~v~~~~~~~~~~y~~~~~---~~--------------------------~~~~~~~~~~~evi 160 (386) T protein:vir:49 110 NDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNITFD---DP--------------------------HIAPKQHVPQNDIL 160 (386) T ss_pred CCCCcEEEEEEecCceeEEEEcCCCceEEEEEEEc---Cc--------------------------cccceeEEccccEE Confidence 54 446899999999999865543 232222110 00 00011334444444 Q ss_pred EEEec-Cc-cccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhc- Q lcl|NC_015263. 232 CIKIN-ES-SLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTV- 308 (513) Q Consensus 232 ~ik~~-~~-~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~L- 308 (513) -|+.. .+ ...|+||...+...+--.....+... .-..|-...-..|-+ +.. ++.++++++.+...+.- T Consensus 161 h~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~--~~~~ng~~~~~il~~---~~~----~~~~~~~~~~~~~~~~~~ 231 (386) T protein:vir:49 161 HFRLLSVDGGLTSVSPLMALGREFNIQKASDKLTI--SALKNALNANGILKI---KGG----GLLDFKTKVSRSRQAMKQ 231 (386) T ss_pred EecCCCCCCccccccHHHHHHHHHHHHHHHHHHHH--HHHHccCCccEEEEe---CCC----CChHHHHHHHHHHHHhcc Confidence 45542 22 34688888877665544444444322 122221111111111 111 12222222222222111 Q ss_pred -cccceEEEeccccccccccc-ccccchhhhhhHHhhhhhhhhhhhhccCCCcchHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015263. 309 -PDNVGVVTSPMEIDTVSFDK-DSSTDDSVEKATKNFWDNAGVSQILFSSDNKTSQGIAMSIATDEQFIFGVINQLERWL 386 (513) Q Consensus 309 -p~gv~~v~sP~~~d~i~ld~-~~~~~dtv~~~~~~i~~~~GiS~~Lfn~d~~s~~~~~~SI~~d~~~~~~~~~~iE~~~ 386 (513) ..++..+-..+++..+.++. +..--++.+-..++|..+.||...++|++..+.+.....-.--...+-.+++.|+..+ T Consensus 232 n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~~~i~~~l~~i~~~~ 311 (386) T protein:vir:49 232 MQGGPLVLDDLEDFTPLEIKSNVAQLLSQADWTTGQFAKVYGIPESIVGGDGDQQSSLEMIYNIYFKSVSRYLRPFVSEM 311 (386) T ss_pred CCCCceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCccchHHHHHHHHHHHHHHHHHHHHHHH Confidence 12333433444554444321 1122234566668899999999999987655444332221222233345777888888 Q ss_pred HHHHhhcccceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCccccc Q lcl|NC_015263. 387 NRYLLLNGMSKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSS 466 (513) Q Consensus 387 N~~i~~~~~~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS 466 (513) |+.|... +++..-.....+.+++...+.++..-|. ++|.|+-.++... .. .-.+ ... T Consensus 312 ~~~l~~~-----~~~~~~~~~~~d~~~~~~~~~~l~~~g~-----------~t~nE~r~~l~~~-~~-~~~~----~~~- 368 (386) T protein:vir:49 312 SKKLSCE-----VDVDISPAVDPTGSNYISLINSMVKSGT-----------LAQNQGLYILQQA-EI-LPKE----LPD- 368 (386) T ss_pred HHHhcch-----hcccchhhhccCHHHHHHHHHHHHhCCC-----------cCHHHHHHHHhhC-CC-CCCc----Ccc- Confidence 8877532 3333222222333444444444332221 5677766554321 11 1000 000 Q ss_pred ccccccccccCCccccCCCCcCCCCcccccccCCCCCCCCCCccC Q lcl|NC_015263. 467 FNTSGSDIAENAIKEKGKENGRPTNETTGNKDSDETQRAKDKPAN 511 (513) Q Consensus 467 ~T~Sg~~~~~~~~~~~~~~~grPt~et~~n~~~~~~~~~~d~~~~ 511 (513) +.+ .......||... + +| T Consensus 369 ----~~~-----~~~~~~~gGd~~----------~--------~~ 386 (386) T protein:vir:49 369 ----GKN-----PNRTSLKGGEIN----------E--------QD 386 (386) T ss_pred ----hhc-----cCCCCCCCCCCC----------C--------CC Confidence 000 000001111110 0 00 No 96 >protein:vir:80040 Length: 461 # NCBI annotation: gp3 # Family: family:all:297 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468707;genbank:gi:157325287;genbank:GeneID:5601731 Probab=96.72 E-value=0.00038 Score=39.31 Aligned_cols=432 Identities=13% Similarity=0.086 Sum_probs=188.8 Q ss_pred CCCccchheeeeehhhhhhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhcc-ChhHHHHHHHHHHHHHh Q lcl|NC_015263. 1 MVKNKKKRLSMIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYR-NEGNQKTLRKVSEDLAV 79 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~-P~~n~~~ir~~s~~lY~ 79 (513) |-+....+.+-++-.+ .+|.||..- + +. ...+|..- -..|. |.. .=..++..+|. T Consensus 1 ~~~~~~a~~~~~~~~a-~~~~~~~~~---------~--------g~-~~~~d~~~--~~~~~~~~~---~~~~~l~~lY~ 56 (461) T protein:vir:80 1 MYSIDKAKQAKIDSKI-VNRNDFMVG---------H--------GK-ANSRDKLT--RQTPGNGQK---LDLKACENLYA 56 (461) T ss_pred Cccchhhhhhhhhhhh-hhhhHHHhh---------c--------CC-cchhhhhh--ccccCcccc---cCHHHHHHHHH Confidence 8888777777766444 346666521 0 00 00011100 01111 110 01356778999 Q ss_pred hcchHHHHHHHHhhcccccceEeeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcceeEEEEE-cCcc Q lcl|NC_015263. 80 QSQQYQRLLNFYANMPLYAYSVVPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDIFYGYVID-DKES 158 (513) Q Consensus 80 ~sg~~~rlidy~~~mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~~~gy~i~-d~~~ 158 (513) .++..+++|+-.....+=+.+..- +. +.+.. ......+++++++..+.+.++...+.|.-+.++.. +.++ T Consensus 57 ~~~l~r~iVd~~a~d~~r~g~~i~-~~-----~~~~~---~~~~~~~~~l~~~~~l~~~~~~~rl~G~a~i~i~v~d~~~ 127 (461) T protein:vir:80 57 SNSIAMNIVDIISEDMVRAGWSLK-TD-----NKEMK---KNIESKWRKLKTKDRFQKLYADKRLYGDGFLSIGVVSSNR 127 (461) T ss_pred hCCccchhhccchHHhhcCCeeee-cC-----CHHHH---HHHHHHHHHhhHHHHHHHHHHhhcccccEEEEEEeecCCc Confidence 999999999999888877765431 11 11111 22334577777777777777777777766666543 3322 Q ss_pred eeeeecCcceeEEEEEECCeeEEEEEeeeccCcchhcc--ccHHHHHHHHHHhhhh-------hccCcccccCeeecCCc Q lcl|NC_015263. 159 VMIQQFPNDICKISSVSGGVYNYVIDLDALVSADIVDY--YPKEIQEAVNKYTTMK-------KGNNKSASNWYEIQDKN 229 (513) Q Consensus 159 ~~iq~lp~dyckIsg~~nG~y~~~fD~syFd~~~~L~~--~p~Ei~~~y~~Y~~~k-------~~~~~~~~~W~~L~~~k 229 (513) . .++.++=....++..-+|.=.++...... ..... ..|.+-+- ..|.... ...+..+..=+.+.+.+ T Consensus 128 ~--~~~~~~pl~~~~~~~~~~l~~~~~~~i~~-~~~~~dp~sp~fg~P-~~y~i~~~~~~~~~~~~~~~~~~~~~iH~SR 203 (461) T protein:vir:80 128 E--QADLSTAIDPKTIKSIPYINTFNTQKVTQ-LYLNQDMFSEHFGEV-EFFEVNRVSQLGEEILSGTTASTSEQIHRSR 203 (461) T ss_pred c--ccCccCCcccccccceeEEEeccccccch-hhhcccCcCcccccc-eEEEEeccccccccccccccCccceEEcccc Confidence 1 11211111111111112211222222110 00000 11111000 0010000 00001011112333333 Q ss_pred eEEEEec--CccccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHh Q lcl|NC_015263. 230 SICIKIN--ESSLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMT 307 (513) Q Consensus 230 t~~ik~~--~~~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~ 307 (513) -+.|.-. .+..||+|.+-.++..+...+.-...-. .-+....+-+-+++ +-.++..+...+.-+.++.. T Consensus 204 ii~~~~~~~~~~~~G~S~le~~~~~l~~~~~~~~~~~--~l~~~~~~~v~k~~-------~l~~~~~~~~~~~~~~~~~~ 274 (461) T protein:vir:80 204 IIHEQGLRFEGETKGRSIFESLYDIITVMDTSLWSVG--QILYDFAFKVYKTD-------DIDALNKDDKANLTAMLDFM 274 (461) T ss_pred EEEecCCCCCccccCcchHHHHHHHHHHHHHHHHHHH--HHHHHhCCCceecc-------hHHhhhchHHHHHHHHHHHh Confidence 3333211 1123677777777776666655444221 11111111122222 11111111111222222322 Q ss_pred c-cccceEEEecccccccccccccccchhhhhhHHhhhhhhhhhh-hhccCCCcchHHHHHHHHHHHHHHHHHHHH-HHH Q lcl|NC_015263. 308 V-PDNVGVVTSPMEIDTVSFDKDSSTDDSVEKATKNFWDNAGVSQ-ILFSSDNKTSQGIAMSIATDEQFIFGVINQ-LER 384 (513) Q Consensus 308 L-p~gv~~v~sP~~~d~i~ld~~~~~~dtv~~~~~~i~~~~GiS~-~Lfn~d~~s~~~~~~SI~~d~~~~~~~~~~-iE~ 384 (513) . -.|+..+-.-=+++.++.+- +...+.+....+.|-.++||-. .|||-..+..++...-+..-...|-+.+++ ++. T Consensus 275 ~~~~g~~~~d~~e~~e~~~~~l-sgl~~~l~~~~~~iaa~s~iP~t~L~G~s~g~~asge~D~~~yyd~i~~~qe~~l~p 353 (461) T protein:vir:80 275 FRTEALAIIKGDEQLTKESTNV-SGMKDLLDYGWDYLAGAVRMPKTVLKGQEAGTLTGAQYDVMNYYARVSSIQENRLRP 353 (461) T ss_pred cCCceEEEEcCCcceEEEecCc-CCHHHHHHHHHHHHhhhhcCCeeeeecccCCccccchHHHHHHHHHHHHHHHHHHHH Confidence 2 12333322221223333222 2345778888899999999996 466644322222333344444555566643 555 Q ss_pred HHHHHHhhc----c--------cceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhC-CCHHHHHHHHHHHH Q lcl|NC_015263. 385 WLNRYLLLN----G--------MSKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMG-IDPVAFTGLLKVEN 451 (513) Q Consensus 385 ~~N~~i~~~----~--------~~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G-~~p~~~~~~~~~E~ 451 (513) ++++.+... . ....|.|.|.++...+.||+++..++.++- ...++. .| ++|+++-..+.-+ T Consensus 354 ~le~l~~~i~~s~~~~~~~~~p~~~~~~i~f~~L~~~s~kekAe~~~~~a~a---~~~~~~--~g~is~~e~r~~l~~~- 427 (461) T protein:vir:80 354 QLEYLTRLLMWASDDCGPSIDPDSFEWAIEFNPLWNLDSKTDAEVRKLTAEA---DQIYIV--NGVLDPDEVKETRFGR- 427 (461) T ss_pred HHHHHHHHHHHHhcccccccCccccceEEEeCCCCCCCHHHHHHHHHHHHHH---HHHHHh--cCCCCHHHHHHHHHHh- Confidence 766655321 1 123699999999999999999998877652 222233 46 8999987765432 Q ss_pred HhhCcccccCcccccccccccccccCCccccCCCCcCCCCcccccccCCCCCCCCC Q lcl|NC_015263. 452 EMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETTGNKDSDETQRAKD 507 (513) Q Consensus 452 e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~~n~~~~~~~~~~d 507 (513) +++++ .+.++ |-+. ...+.+- ...+.++.+++.+ T Consensus 428 --~~~~~-----~~~~~--~~~~--~~~~~~~-----------~~~~~~~~e~~~g 461 (461) T protein:vir:80 428 --FGLEN-----SSKFS--GDSA--EIDKLAK-----------LVYDAYAKKNADG 461 (461) T ss_pred --cCCCC-----CccCC--CCCc--hhhhhhh-----------hccccccccCCCC Confidence 23222 12222 2110 0000000 0011111111111 No 97 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=96.70 E-value=0.0004 Score=39.23 Aligned_cols=418 Identities=11% Similarity=0.086 Sum_probs=165.8 Q ss_pred eeehhh-h-hhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhcchHHHHH Q lcl|NC_015263. 11 MIDVES-I-SSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQQYQRLL 88 (513) Q Consensus 11 ~~~~~~-~-~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~~~rli 88 (513) |+|+.- | -.|++ .+-+.| ++- .....+.+++.++.| ....+.++.+.+|+.....+..|-. T Consensus 1 ~~~~~~~~~~~~~~--~~~~~~-~~~------------~~~~~~~i~~~i~~~--~~~~~r~~~~~~Yy~g~~~i~~~~~ 63 (478) T protein:vir:10 1 MISINWPWDKPYHE--QVVEQI-KPK------------YETQEEMILRLVREH--KENIDNITMGERYYNHHPDILDAPF 63 (478) T ss_pred CccccccCCchhhh--HHHHHh-hhc------------cCChHHHHHHHHHHH--HHHHHHHHHHHHHhcccccccccch Confidence 555411 0 01111 000000 111 122345666767765 4455667777777665543332222 Q ss_pred HHHhhcccccc-----eEeec-------------cch-hh-hhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcce Q lcl|NC_015263. 89 NFYANMPLYAY-----SVVPF-------------KDI-ST-ANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDIF 148 (513) Q Consensus 89 dy~~~mpt~dY-----~I~P~-------------~~~-~~-~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~~ 148 (513) .+-.......+ .+.|| ++. .. .......+... .++. =++......+.+.+.+.|.. T Consensus 64 ~~~~~~~~~~~~~~~ki~~n~~k~ivd~~~~yl~g~p~~~~~~~~~~~~~l~---~~~~-n~~~~~~~~~~~~~~~~G~~ 139 (478) T protein:vir:10 64 KRDVNGDYDETKPDWRMYTNYHQNLVDQKVAYAVANPVTFGVDNDKALKQIQ---HTLN-HKWDDKLVDILTAASNKGIE 139 (478) T ss_pred hhhcccccccccccceeccchHHHHHHHHhhhhcccCceeecCChHHHHHHH---HHHh-ccHHHHHHHHHHHHhhCCeE Confidence 21111111000 01111 100 00 01111222222 2232 35778888999999999999 Q ss_pred eEEEEEcCc-ceeeeecCcceeEEEEE--ECCeeEEEEEeeeccCcc--hhccc-cHHHHHHHHHHhhhh-h-----ccC Q lcl|NC_015263. 149 YGYVIDDKE-SVMIQQFPNDICKISSV--SGGVYNYVIDLDALVSAD--IVDYY-PKEIQEAVNKYTTMK-K-----GNN 216 (513) Q Consensus 149 ~gy~i~d~~-~~~iq~lp~dyckIsg~--~nG~y~~~fD~syFd~~~--~L~~~-p~Ei~~~y~~Y~~~k-~-----~~~ 216 (513) |.+...|.+ .+-+.-++|+-|.++-- ..|.+.+++ ++....+ ..+.| +.++.+ |.... . ... T Consensus 140 ~~~v~~d~~~~~~~~~~~p~~~~~v~d~~~~~~~~~~i--r~~~~~~~~~~~~y~~~~i~~----~~~~~~~~~~~~~~~ 213 (478) T protein:vir:10 140 WVQPYVDEEGEFKTFRVPAEQAVPIWTNKERDELQAFI--RVYELDGAERVEYWTKDDVTF----YELKEGQLIPDFYRS 213 (478) T ss_pred EEEEEecCCCceEEEEEcccceEEEEcCCCCCceEEEE--EEEeeeCceEEEEEeCCcEEE----EEecCCeeecccccc Confidence 998876644 45677778887776643 125555543 3333211 12222 222211 11000 0 000 Q ss_pred ccc-ccCe-ee----cCCceEEEEecCccccchhhHHHHHHhHHHHHHHHH-HH-hhHhhhhhc--eeeeeeeccc-cCC Q lcl|NC_015263. 217 KSA-SNWY-EI----QDKNSICIKINESSLTPVPPFAGTFDSIYDIHSFKD-LR-NDKAELQNY--KLLIQKLETR-SSN 285 (513) Q Consensus 217 ~~~-~~W~-~L----~~~kt~~ik~~~~~~~~ip~f~~v~~d~~di~~~kd-L~-~~~~~i~n~--~ii~~kip~~-~~n 285 (513) ... ..|. .. +..+-.++.+-. ...+.+-| .++.++.+.=+ +. +..+.++-. .+++ ..+ ++. T Consensus 214 ~~~~~~~~~~~~~~~~~g~vPvv~~~n-~~~g~sd~----e~v~~liDa~~~~~S~~~~~~~~~~~~~~~---~~g~~~~ 285 (478) T protein:vir:10 214 EDHIQPHYYQGNKLMSWGRVPFIPFKN-NPQEVSDL----FMYKTIIDALDKRLSDTQNTFDESVELIYI---LKGYEGE 285 (478) T ss_pred ccccccceecccccccCCcceEEEecc-CCCCCCcH----HHHHHHHHHHHHHHHHHHHHHHHhhCccee---eecCCcc Confidence 000 0010 00 011111222211 11233333 23222222111 11 111222211 1111 111 011 Q ss_pred CCCccccCHHHHHHHHHHHHHhccccceEEEecc---ccccccccccc-ccchhhhhhHHhhhhhhhhhhhhccCCCcch Q lcl|NC_015263. 286 DNNDFTLDMPMMNYFHEALSMTVPDNVGVVTSPM---EIDTVSFDKDS-STDDSVEKATKNFWDNAGVSQILFSSDNKTS 361 (513) Q Consensus 286 ~~~~~~vd~~~~~~~~~~ik~~Lp~gv~~v~sP~---~~d~i~ld~~~-~~~dtv~~~~~~i~~~~GiS~~Lfn~d~~s~ 361 (513) +.+++.-+++. .++ ..+.+- +++-+..+... .....++-..++|+.-+++-..-+.+.+++. T Consensus 286 ~~~~~~~~~~~-------------~~~-~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~ 351 (478) T protein:vir:10 286 DMKDFMHNLKY-------------YKA-ISVAGESGSGVDTIKVEVPIDSVKEYTKMLRDYIIEFGQGVDFQQDKFGNSP 351 (478) T ss_pred cccchhhhhhh-------------Cce-eEecCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCcCcCccccccch Confidence 22222222111 111 111110 11111111111 1113355566778888887655554433344 Q ss_pred HHHHH--HHHHHHHHHHH----HHHHHHHHHHHHHhhcccce---EEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHH Q lcl|NC_015263. 362 QGIAM--SIATDEQFIFG----VINQLERWLNRYLLLNGMSK---YFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYL 432 (513) Q Consensus 362 ~~~~~--SI~~d~~~~~~----~~~~iE~~~N~~i~~~~~~~---~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~l 432 (513) +|+.. -...-.+.+.. |-+.|++.++.+++...... .+.+.|-+..+.|..+.++.+.++.+ ..|....+ T Consensus 352 Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~d~~~i~i~f~~~~p~~~~e~~~~~~~~~g-~iS~et~i 430 (478) T protein:vir:10 352 SGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYRLDVRVQDIEITFNFNVMVNELENSQIAMNSTG-LLSKETIL 430 (478) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccccceEEeCCCCCCCHHHHHHHHHHHhC-CCChHHHH Confidence 44332 22222222222 22333333333333322222 47899999999999999999988742 25555544 Q ss_pred HHHhCC--CHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccCCCCcCCC Q lcl|NC_015263. 433 ASLMGI--DPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPT 490 (513) Q Consensus 433 aa~~G~--~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt 490 (513) ..+|+ +|.+.+.+...|++... ...+ ....+......+..++ |.+. T Consensus 431 -~~~~~v~d~~~E~~ri~~E~~~~~--~~~~--------~~~~~~~d~~~~~~~d-~~~e 478 (478) T protein:vir:10 431 -GNHSWVQDPVAEMERIEQENIELN--QQLP--------DIEEGLNDEQQRQSED-NQSE 478 (478) T ss_pred -HhCCCCCCHHHHHHHHHHHHHHHH--Hhcc--------ccCCCCcccccccCcC-CCCC Confidence 56777 78999999999986521 1111 1111001111111111 1111 No 98 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=96.69 E-value=0.0004 Score=39.19 Aligned_cols=411 Identities=15% Similarity=0.141 Sum_probs=164.8 Q ss_pred eeehhhhhhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhh---------- Q lcl|NC_015263. 11 MIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQ---------- 80 (513) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~---------- 80 (513) |||. +....+.....+=+.+. +......+.+++.+..| ....+.++.+-+|+... T Consensus 1 ~~~~--------~~~~~~~~~~~~~~~~~-----~~~~~~~~~i~~~i~~~--~~~~~~~~~~~~yY~g~~~i~~~~~~~ 65 (468) T protein:vir:96 1 MIDI--------FWPNEKPYHERVVEQIK-----PQYETQEEMILRLITKH--KENVEDITVGERYYNHQPDVLFNAPKR 65 (468) T ss_pred Cccc--------cCCcCceeehheeeccc-----ccccCcHHHHHHHHHHH--HHHHHHHHHHHHHhcCCCccccccccc Confidence 4442 11222222222222221 23334444555555543 33444455555555443 Q ss_pred ----------------cchHHHHHHHHhhcccccceE-eeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHH Q lcl|NC_015263. 81 ----------------SQQYQRLLNFYANMPLYAYSV-VPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAM 143 (513) Q Consensus 81 ----------------sg~~~rlidy~~~mpt~dY~I-~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l 143 (513) .+..+.+++..++ |.+ .|.. -........+....+ +. =++......+.+.++ T Consensus 66 ~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~-----~l~g~p~~--~~~~d~~~~~~l~~~---~~-n~~~~~~~~~~~~~~ 134 (468) T protein:vir:96 66 NVKGEIDPFKPDWRMYTNYHQNLVDQKVA-----YAVANPVT--YGTEDEKSLKTIQEV---LN-HKWDDKLVDILTAAS 134 (468) T ss_pred cccccccccccccccccchHHHHHHHHHh-----hhccCCce--eccCChHHHHHHHHH---Hh-cCHHHHHHHHHHHHh Confidence 2233333332222 111 0100 000111222333333 22 267778888999999 Q ss_pred HhcceeEEEEEcCcc-eeeeecCcceeEEEEE--ECCeeEEEEEeeeccCcchhccccH-HHHHHHHHHhhhhh-----c Q lcl|NC_015263. 144 TVDIFYGYVIDDKES-VMIQQFPNDICKISSV--SGGVYNYVIDLDALVSADIVDYYPK-EIQEAVNKYTTMKK-----G 214 (513) Q Consensus 144 ~~g~~~gy~i~d~~~-~~iq~lp~dyckIsg~--~nG~y~~~fD~syFd~~~~L~~~p~-Ei~~~y~~Y~~~k~-----~ 214 (513) +.|..|.+..-|.++ +-+..++|+-|.++-. ..|.+.+++=....+.....+.|.+ .+.. |... .... . T Consensus 135 ~~G~~~~~v~~d~~~~~~i~~~~p~~~~~v~~~~~~~~~~~~ir~~~~~~~~~~~~~~~~~~~~-~~~~-~~~~~~~~~~ 212 (468) T protein:vir:96 135 NKGVEWIQPYVDEQGEFKTFRVPAEQAIPIWTNKERDELKAFIRLYELDGGERVEYWTANDVTF-YELK-DGQLIPDYYQ 212 (468) T ss_pred hcCeEEEEEEEcCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEEecCceEEEEEeCCeEEE-EEEc-CCceeecccc Confidence 999999877776554 5577788888877743 2355555542221111222222211 1110 0000 0000 0 Q ss_pred cCcccccCeee-----cCCceEEEEecCccccchhhHHHHHHhHHHHHHHHHHH--hhHhhhhhceeeeeeeccccCCCC Q lcl|NC_015263. 215 NNKSASNWYEI-----QDKNSICIKINESSLTPVPPFAGTFDSIYDIHSFKDLR--NDKAELQNYKLLIQKLETRSSNDN 287 (513) Q Consensus 215 ~~~~~~~W~~L-----~~~kt~~ik~~~~~~~~ip~f~~v~~d~~di~~~kdL~--~~~~~i~n~~ii~~kip~~~~n~~ 287 (513) ......-|... +..+-.++.+- ..+.+.+-|. ++.++.+.=+.. +..+.++...--.-.++=.++.+. T Consensus 213 ~~~~~~~~~~~~~~~~~~~~iPvv~~~-n~~~g~sd~e----~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~ 287 (468) T protein:vir:96 213 GEEHVQAHYYVGNKSMSWNRVPFIPFK-NNPQEVSDLF----MYKTIIDAMDKRLSDTQNTFDEATELIYVLKGYEGEDL 287 (468) T ss_pred cccccccceeeccccccCCcccEEEec-CCCCCCCchH----HHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccc Confidence 00000001100 00001112221 1122333333 333333322221 112222221111111110011222 Q ss_pred CccccCHHHHHHHHHHHHHhcc-ccceEEEeccccccccccccc-ccchhhhhhHHhhhhhhhhhhhhccCCCcchHHHH Q lcl|NC_015263. 288 NDFTLDMPMMNYFHEALSMTVP-DNVGVVTSPMEIDTVSFDKDS-STDDSVEKATKNFWDNAGVSQILFSSDNKTSQGIA 365 (513) Q Consensus 288 ~~~~vd~~~~~~~~~~ik~~Lp-~gv~~v~sP~~~d~i~ld~~~-~~~dtv~~~~~~i~~~~GiS~~Lfn~d~~s~~~~~ 365 (513) +.+..+++....+ .++ ++-| +++-+..+.+. .....++-..++|+..+|+-..-+.+.+++.+|+. T Consensus 288 ~~~~~~~~~~~~i------~~~~d~~~------~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~A 355 (468) T protein:vir:96 288 EEFMYNLKYYKAI------NVDGDGSG------GVDTIQIDVPVQSAKEYLDMLRDYVIEFGQGVDFQQDKFGNSPSGIA 355 (468) T ss_pred chhhhhhhcCceE------EecCCCCC------cceEEeecCChHHHHHHHHHHHHHHHHHhCcccccccccccchHHHH Confidence 2222222111000 011 1000 11111111111 11233555667888888876666655444444443 Q ss_pred HHHH--HHHHHHHH----HHHHHHHHHHHHHhhccc---ceEEEEEecCCCCccHHHHHHHHHHHHhcC-CcHHHHHHHH Q lcl|NC_015263. 366 MSIA--TDEQFIFG----VINQLERWLNRYLLLNGM---SKYFKATMLEVTHFSKKEAHDRYITDAQYG-FPVKVYLASL 435 (513) Q Consensus 366 ~SI~--~d~~~~~~----~~~~iE~~~N~~i~~~~~---~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G-~~~~~~laa~ 435 (513) +-.. .-.+.+-. |-+-|++.++.+++.... -....+.|-+..+.+..+.++.+.++ | .|....+ .. T Consensus 356 lk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~g~~~d~~~i~i~f~~~~p~d~~e~a~~~~~~---g~iS~et~i-~~ 431 (468) T protein:vir:96 356 LKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYKLSIKVQDVEITFNFNVMVNELEQSQIGVNS---QYLSKETVV-TN 431 (468) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccceeeEEecCCCCcCHHHHHHHHHhc---CCCchHHHH-Hh Confidence 3211 11111111 222222223222232222 22578889999999999999987653 7 5655555 45 Q ss_pred hCC--CHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccCCCCcCCC Q lcl|NC_015263. 436 MGI--DPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPT 490 (513) Q Consensus 436 ~G~--~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt 490 (513) +++ +|.+.+.+...|++.. ...+.. +-| .+...|| T Consensus 432 l~~v~D~~~E~~ri~~E~~~~------~~~~~~--~~~------------~~~~~~~ 468 (468) T protein:vir:96 432 HPWVDDPVAEMERIDQEELAL------PSIEEG--LNG------------KENNEPT 468 (468) T ss_pred CCCCCCHHHHHHHHHHHHHHH------HHHhhc--cCC------------CCCCCCC Confidence 666 7899999999998542 112211 111 2223354 No 99 >protein:vir:79647 Length: 435 # NCBI annotation: PorT # Family: family:all:297 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285520;genbank:gi:148734503;genbank:GeneID:5220005 Probab=96.65 E-value=0.00044 Score=38.98 Aligned_cols=408 Identities=10% Similarity=0.029 Sum_probs=173.3 Q ss_pred CCCccchheeeeehhhhhhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhh Q lcl|NC_015263. 1 MVKNKKKRLSMIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQ 80 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~ 80 (513) |-|+| ||...-| +|.|..+. .........|. . +. ........+|.. T Consensus 5 m~~~~-~~~~~~D-----~~~~~~~~---------~~g~~~~~~~~-------------~--~~----~~~~~l~~~Y~~ 50 (435) T protein:vir:79 5 MSDKV-KAITKED-----GYNEIFGS---------KDGTFRPNAFY-------------M--QR----AAFKALSQFYEE 50 (435) T ss_pred ccccc-ccchhhc-----chhhhhcc---------cccccccCccc-------------C--Cc----CCHHHHHHHHhc Confidence 77764 4444333 45452111 11110000011 1 11 112345667999 Q ss_pred cchHHHHHHHHhhcccccceEeeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcceeEEEEEcCccee Q lcl|NC_015263. 81 SQQYQRLLNFYANMPLYAYSVVPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDIFYGYVIDDKESVM 160 (513) Q Consensus 81 sg~~~rlidy~~~mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~~~gy~i~d~~~~~ 160 (513) ++.++++|+-.....|=..+-.. +.+ .+ +. .-..+++++++..+.+.++-....|.-+-+.....+.-. T Consensus 51 ~~l~~~~Vd~~aed~~r~g~~i~-g~~---~~----~~---~~~~~~~l~~~~~l~~a~~~~rl~G~~~i~i~~~d~~~~ 119 (435) T protein:vir:79 51 DGMARRIVDVIPEEMVTPGFKVD-GVK---NE----KS---FKSRWDELRLNAKIIDALSWSRLFGGSAILAVVADNKML 119 (435) T ss_pred CchhhhhhccchHHhhcCCceec-CCC---hH----HH---HHHHHHHhhHHHHHHHHHHhhhccccEEEEEEecCCCCc Confidence 99999999988876665544321 111 11 22 234566777666555555555555544444444222335 Q ss_pred eeecCcceeEEEEEECCeeE--EEEEeeeccCcch-hccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEEe-- Q lcl|NC_015263. 161 IQQFPNDICKISSVSGGVYN--YVIDLDALVSADI-VDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKI-- 235 (513) Q Consensus 161 iq~lp~dyckIsg~~nG~y~--~~fD~syFd~~~~-L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~-- 235 (513) -+||.++ |.+. .+||-.+..-... .+-+.|-+-+- ..|. .++.....=+.+-+.+-+.|.- T Consensus 120 ~~Pl~~~---------g~i~~i~v~d~~~i~~~~~~~dp~sp~fg~P-~~y~----v~~~~~~~~~~iH~SRli~~~g~~ 185 (435) T protein:vir:79 120 KSPVKPG---------AQLEDIRVYDRYQITIHERETNARSVRYGEP-KLYK----ISPGGDIPEFFVHYSRICIIDGER 185 (435) T ss_pred ccccccC---------CceeeEEeechhhccchhhccCCcccccCcc-eEEE----EecCCCCCceEEcceeEEEecCCc Confidence 6777664 3222 1222222110000 00000000000 0010 0000000011233333232210 Q ss_pred ------cCccccchhhHH-HHHHhHHHHHHHHHHHhhHhhh-hhceeeeeeec-ccc--CCCCCccccCHHHHHHHHHHH Q lcl|NC_015263. 236 ------NESSLTPVPPFA-GTFDSIYDIHSFKDLRNDKAEL-QNYKLLIQKLE-TRS--SNDNNDFTLDMPMMNYFHEAL 304 (513) Q Consensus 236 ------~~~~~~~ip~f~-~v~~d~~di~~~kdL~~~~~~i-~n~~ii~~kip-~~~--~n~~~~~~vd~~~~~~~~~~i 304 (513) -....|+.+++. .+++.+.+. ......-..+ ....+-+-+++ +.. ++..++.. ........+.. T Consensus 186 ~p~~~~~~~~~~G~S~l~e~~~~~l~~~---~~~~~~~~~l~~~~~~~v~~~~~l~~~~~~~~~~~~--~~~r~~~~~~~ 260 (435) T protein:vir:79 186 VSNEKRRQNDGWGASILNKRLIEAIVDY---NYCQELATQLLRRKQQAVWKARDLALMCDDEEGRYA--ARLRLAQVDDE 260 (435) T ss_pred chhhhccccCcccchHHHHHHHHHHHHH---HHHHHHHHHHHHHhcCccccchhHHHhhcCccchHH--HHHHHHHHHHh Confidence 112334555543 344444433 3333222222 12223333443 111 11222211 11111112222 Q ss_pred HHhccccceEEEecc-cccccccccccccchhhhhhHHhhhhhhhhhh-hhccCCCcchHHHHHHHHHHH----HHHHHH Q lcl|NC_015263. 305 SMTVPDNVGVVTSPM-EIDTVSFDKDSSTDDSVEKATKNFWDNAGVSQ-ILFSSDNKTSQGIAMSIATDE----QFIFGV 378 (513) Q Consensus 305 k~~Lp~gv~~v~sP~-~~d~i~ld~~~~~~dtv~~~~~~i~~~~GiS~-~Lfn~d~~s~~~~~~SI~~d~----~~~~~~ 378 (513) +++ +|...+.... +++.++.+-+ .-.+.+....++|-.++||-. .|||-. .+|++.+-+.|. ..|-+. T Consensus 261 ~~~--~~~~~i~~~~e~~e~~~~~ls-gl~~~~~~~~~~iaaa~~IP~t~L~G~s---~~glnstgd~d~~~yyd~i~~~ 334 (435) T protein:vir:79 261 SGV--GKAIGIDATDEEYEVLNSDVS-GVPEFLQEKIDRIVALTGIHEIIIKNKN---TGGVSASQNTALETFYKLIDRK 334 (435) T ss_pred cCC--CCceeEecCCcceEEEecccC-CHHHHHHHHHHHHHhhhCCCeeeeccCC---ccccccchhHHHHHHHHHHHHH Confidence 221 2323333333 3555544333 345889999999999999995 566444 333333222333 333333 Q ss_pred HHH-HHHHHHHHHhhcccceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhhCcc Q lcl|NC_015263. 379 INQ-LERWLNRYLLLNGMSKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENEMLDLP 457 (513) Q Consensus 379 ~~~-iE~~~N~~i~~~~~~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~ 457 (513) ++. +...+++.+.....+..|.++|.+.--.+.||+++..++.++ .... ++..--++|.++-..+...-..+++. T Consensus 335 Qe~~l~p~l~~l~~li~~s~d~~~~f~pL~~~sekEkAei~~~~a~---a~~~-~~~~g~i~~~e~r~~L~~~~~~~~~~ 410 (435) T protein:vir:79 335 RVEDYKPILEFLLPFMISETEWSIEFEPLSVPSDKDKAEIMAKNVE---SVVK-LKAEQAINLKETRDTLRSICPDLKIM 410 (435) T ss_pred HHHHHHHHHHHHHHHhhcCCCCeEEeCCCCCCCHHHHHHHHHHHHH---HHHH-HHhcCCCCHHHHHHHHHHhccccCCC Confidence 333 344566666554444569999999999999999998777654 1122 23222488888887774322222322 Q ss_pred cccCcccccccccccccccCCccccCCCCcCCCCcccccccCCCCC Q lcl|NC_015263. 458 EIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETTGNKDSDETQ 503 (513) Q Consensus 458 ~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~~n~~~~~~~ 503 (513) +-..+. -+.. +...|. ...++|++. T Consensus 411 ~~~~~~----------------~~~~-~d~~~~----~~~e~g~~~ 435 (435) T protein:vir:79 411 DNDNIE----------------LPEP-EDLDPE----PGQEGGLNK 435 (435) T ss_pred Cccccc----------------CCcc-ccCCCC----CCCCCCCCC Confidence 211110 0000 111121 112333333 No 100 >protein:vir:94666 Length: 723 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579205;genbank:gi:93007441;genbank:GeneID:5076785 Probab=96.61 E-value=0.00047 Score=38.84 Aligned_cols=396 Identities=10% Similarity=0.021 Sum_probs=178.5 Q ss_pred cCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhcchHHHHHHHHhh-cccccceEeeccchhhhhhcc Q lcl|NC_015263. 36 RTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQQYQRLLNFYAN-MPLYAYSVVPFKDISTANENK 114 (513) Q Consensus 36 ~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~~~rlidy~~~-mpt~dY~I~P~~~~~~~~~~~ 114 (513) +|+- .|...... .|-|.. ...+..-.|-.++.+.+.|+.++. +-.+...++- .++. T Consensus 1 ~~~~-----~~~~g~~~-------~~~~~~----~~~~~~~~~~~~~~V~acV~~Ia~~iA~lpl~l~~-------~~~~ 57 (723) T protein:vir:94 1 MTTF-----PSGAGGWN-------AWSADS----VFGNGAKGWSNSAVAYRCISMLANNAASVDLVVRG-------PDGE 57 (723) T ss_pred Cccc-----ccCCCccc-------cccccc----cccccHHHHhhhHHHHHHHHHHHHhhccceeEEEc-------CCCc Confidence 2221 11111100 010000 000111123456677777777764 3333344431 1122 Q ss_pred hhHHHHHHHHHHhh-cC----hhHHHHHHHHHHHHhcceeEEEEEc-----CcceeeeecCcceeEEEEEECCeeEEE-- Q lcl|NC_015263. 115 LKKELATVTEFLSR-LN----PKYNFSKIVKLAMTVDIFYGYVIDD-----KESVMIQQFPNDICKISSVSGGVYNYV-- 182 (513) Q Consensus 115 ~~~~y~~v~~~L~k-~n----~k~~~~~i~~~~l~~g~~~gy~i~d-----~~~~~iq~lp~dyckIsg~~nG~y~~~-- 182 (513) ..+. +.+...|.. =| ...+...++..++..|..|.++.-+ +.+.-+.++|++-+.+....++...+. T Consensus 58 ~~~~-~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~r~~~g~p~~l~~l~~~~~~v~~~~~~~~~~~~~ 136 (723) T protein:vir:94 58 LDEL-HPLSQLWNVMPNRAMPAQVLKALSMTRLQLDGQCHLWLNYNGRTPAGVPDEIWYVYDRVTTIVATRAADAVPQAQ 136 (723) T ss_pred cchh-hHHHHHHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCccccceeEEEEecCcceEEeecCCCccceeee Confidence 2222 345555542 23 4556666777888999999998754 234567888888777765544332211 Q ss_pred ---EEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEEec--CccccchhhHHHHHHhHHHHH Q lcl|NC_015263. 183 ---IDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKIN--ESSLTPVPPFAGTFDSIYDIH 257 (513) Q Consensus 183 ---fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~~--~~~~~~ip~f~~v~~d~~di~ 257 (513) +.+... +. .=+.++.+.-+.|+.. .+..+|+||...+...+--.. T Consensus 137 ~~~y~~~~~-~G-----------------------------~~~~~~~~dIiHir~~~~~dg~~G~Spi~~a~~~i~~~~ 186 (723) T protein:vir:94 137 IIGYVIERT-DG-----------------------------VRVPVLADEMLWLRFSDPYDPLAVMAPWKAARAAVDADF 186 (723) T ss_pred eeEEEEEec-Cc-----------------------------eeEEecccceEEecCCCCCCCcccccHHHHHHHHHHHHH Confidence 111110 00 1123444455666643 355678888877665554444 Q ss_pred HHHHHHhhHhhhhhc---eeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhcc--c--cceEEEe-----------cc Q lcl|NC_015263. 258 SFKDLRNDKAELQNY---KLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTVP--D--NVGVVTS-----------PM 319 (513) Q Consensus 258 ~~kdL~~~~~~i~n~---~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~Lp--~--gv~~v~s-----------P~ 319 (513) ..++... .-..|- ..+.+ .| .++.++++.+.+.++++.- . |-..|+. ++ T Consensus 187 aa~~~~~--~~f~NG~~p~giL~-~~----------~l~~e~~~~~~~~~~~~~~G~~Nagk~~vL~g~~~~~~vl~~G~ 253 (723) T protein:vir:94 187 YAATWQR--QSFKNGARPGGVVN-LG----------DMDEQTFTKTVAAFRSQVEGVQNAGRHLLIAGQGSDGGAAGKGA 253 (723) T ss_pred HHHHHHH--HHHhcCCCcceEEE-cC----------CCCHHHHHHHHHHHHHHhhchhhcCcceeecccccccccccCCc Confidence 4443221 112221 12221 11 2567777777666666551 1 2122221 23 Q ss_pred cccccccccccccch---hhhhhHHhhhhhhhhhhhhccCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc Q lcl|NC_015263. 320 EIDTVSFDKDSSTDD---SVEKATKNFWDNAGVSQILFSSDNKTSQGIAMSIATDEQFIFGVINQLERWLNRYLLLNGMS 396 (513) Q Consensus 320 ~~d~i~ld~~~~~~d---tv~~~~~~i~~~~GiS~~Lfn~d~~s~~~~~~SI~~d~~~~~~~~~~iE~~~N~~i~~~~~~ 396 (513) ++..+.++ ..+.+ +..-..++|-.+.||...+++++.+.++.......--..-+.-++++||..+|+.|-.. .+ T Consensus 254 ~~~~l~~s--~~D~q~le~r~~~~~eIa~afgVPp~~i~~~st~sN~e~~~~~f~~~tL~P~~~~ie~~ln~~Ll~~-~g 330 (723) T protein:vir:94 254 TFTSLSMS--PAEMDYINSRMHSAEEVMLAFGIRKDALLGGSTYENQAEAKAAVWTETLIPQMEVMASITDLQLLPD-IG 330 (723) T ss_pred eEEEccCC--HHHHHHHHHHHHhHHHHHHHhCCChhHcCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHhHhhccc-cc Confidence 33333332 22222 23334477999999998777766555554444443333444569999999999987543 23 Q ss_pred eEEEEEecC--CCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCcccccccccccc- Q lcl|NC_015263. 397 KYFKATMLE--VTHFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSD- 473 (513) Q Consensus 397 ~~f~~~~l~--~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~- 473 (513) ..++|.|-. .---+.++..+.+..+..-|+-..--.=+.+|+.|.+ - -+-+-.+.|..+... +.+ T Consensus 331 ~~~~~~f~~~~lLr~D~~~r~~~~~~~v~~G~~T~NE~R~~lglpPi~--------g--Gd~~~~~~p~~~~~a--~~~~ 398 (723) T protein:vir:94 331 WTVEWDFNSVPALQEDLEAQAGRNQGYLVNDVLMVDEVRATIGLDPLP--------G--GIGQMTLTPYRAQFA--PAPA 398 (723) T ss_pred CceEEeecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC--------C--Ccccceecccccccc--CCCC Confidence 334444432 2234677777777777776633333333344555541 0 011112344432211 100 Q ss_pred cc--------cCCccccCCCCcCCCCcccccccCCCCCCCCCCccCCC Q lcl|NC_015263. 474 IA--------ENAIKEKGKENGRPTNETTGNKDSDETQRAKDKPANTQ 513 (513) Q Consensus 474 ~~--------~~~~~~~~~~~grPt~et~~n~~~~~~~~~~d~~~~~~ 513 (513) ++ ......+...+.+|..+.......--...+.+.|-.+- T Consensus 399 ~~p~~~e~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~ 446 (723) T protein:vir:94 399 PAPAVEEGAARMLALLERVAADRPLPELPVRATTVLHHDPGPDPQQTL 446 (723) T ss_pred CCccchhhhHhhhhhccccccccCcCCCCCCCCCCCCCCcccCCchhH Confidence 00 00011122233344333221111111111111111111 No 101 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=96.58 E-value=0.00049 Score=38.72 Aligned_cols=442 Identities=12% Similarity=0.124 Sum_probs=178.7 Q ss_pred eeehhhhhhHHHHHHHHHHHHHhh-ccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhcchHHH--- Q lcl|NC_015263. 11 MIDVESISSYSNKRNNRISILRDD-NRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQQYQR--- 86 (513) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~i~~~~-~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~~~r--- 86 (513) |+-|..+-.++-.++..-..++.- |.++...+.-.-.....+.+++.|..|. ......++++-+|+.......++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~-~~~~~r~~~l~~Yy~g~~~i~~~~~~ 79 (511) T protein:vir:96 1 MLKVNEFETDTDLRGNINYLFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHM-DYQRPRLKVLSDYYEGKTKNLVELTR 79 (511) T ss_pred CccccchhhhhhhhhhhhhhhhhhhCCccccchhhhhhhccHHHHHHHHHHHH-HhhHHHHHHHHHHhcccCccccccCc Confidence 666655555555554433344433 3333221111112223344555555442 11223455666666554443211 Q ss_pred --------------HHHHHhhcccccceE-eeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcceeEE Q lcl|NC_015263. 87 --------------LLNFYANMPLYAYSV-VPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDIFYGY 151 (513) Q Consensus 87 --------------lidy~~~mpt~dY~I-~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~~~gy 151 (513) ...+++...+ .|.+ .|..- .....+..+. +..++..-++...+..+.+.+++.|..|.+ T Consensus 80 ~~~~~~~~~ki~~n~~k~Iv~~~~-~yl~g~p~~~--~~~~~~~~~~---l~~~~~~n~~~~~~~~~~~~~~i~G~a~~~ 153 (511) T protein:vir:96 80 RKEEYMADNRVAHDYASYISDFIN-GYFLGNPIQY--QDDDKDVLEA---IEAFNDLNDVESHNRSLGLDLSIYGKAYEL 153 (511) T ss_pred CcccccCcceeecchHHHHHHHHH-hhhccCCcee--ecCchHHHHH---HHHHHhhcCHHHHHHHHHHHHHhcCeeEEE Confidence 1112221111 1211 11000 0011122222 334455666889999999999999999988 Q ss_pred EEEcCcc-eeeeecCcceeEEEEEEC--CeeEEEEEeeeccCc--------c--hhccc-cHHHHHHHHHHhhhhhccCc Q lcl|NC_015263. 152 VIDDKES-VMIQQFPNDICKISSVSG--GVYNYVIDLDALVSA--------D--IVDYY-PKEIQEAVNKYTTMKKGNNK 217 (513) Q Consensus 152 ~i~d~~~-~~iq~lp~dyckIsg~~n--G~y~~~fD~syFd~~--------~--~L~~~-p~Ei~~~y~~Y~~~k~~~~~ 217 (513) ...|.++ +-+..++|.-|.++--.. +.+.+++ +|.... . ..+-| +..+.+ |.. .. T Consensus 154 vy~ded~~~~i~~~~p~~~~~vydd~~~~~~~~~v--r~~~~~~~d~~~~~~~~~~~iyt~~~i~~----~~~----~~- 222 (511) T protein:vir:96 154 MIRNQDDETRLYKSDAMSTFVIYDNTIERNSIAGV--RYLRTKPIDKTDEDEVFTVDLFTSHGVYR----YLT----SR- 222 (511) T ss_pred EEeCCCCceEEEEEccceeEEEEcCCCCCceEEEE--EEEEeeeccccccceEEEEEEEeCCcEEE----EEe----cC- Confidence 8876544 677788999888875422 3344443 332110 0 01111 111110 100 00 Q ss_pred ccccCeeecC----------CceEEEEecCccccchhhHHHHHHhHHHHHHHHHHH-----hhHhhhhhceeee--eeec Q lcl|NC_015263. 218 SASNWYEIQD----------KNSICIKINESSLTPVPPFAGTFDSIYDIHSFKDLR-----NDKAELQNYKLLI--QKLE 280 (513) Q Consensus 218 ~~~~W~~L~~----------~kt~~ik~~~~~~~~ip~f~~v~~d~~di~~~kdL~-----~~~~~i~n~~ii~--~kip 280 (513) ..|..+.. ..-.++.+- ....+.|- |.++.++.+.=+.. +......+ .+++ +..+ T Consensus 223 --~~~~~~~~~~~~~~~~~~~~vPvv~~~-nn~~g~gd----~e~v~~liDa~d~~~S~~~~~~~~~~~-~~lv~~g~~~ 294 (511) T protein:vir:96 223 --TNGLKLTPRENGFESHSFERMPITEFS-NNERRKGD----YEKVITLIDLYDNAESDTANYMSDLND-AMLLIKGNLN 294 (511) T ss_pred --CCcccccccccccccccCCceeeEEec-CCCCCCCc----hhhhHHHHHHHHHHHHHHHHHHHHhhC-ceeeeecCcc Confidence 11211111 111122221 12234333 33444443322211 11222222 2222 1111 Q ss_pred cccCCCCCccccCHHHHHHHHHHHHHhccccceEEEecccccccccccc-cccchhhhhhHHhhhhhhhhhhhhccCCCc Q lcl|NC_015263. 281 TRSSNDNNDFTLDMPMMNYFHEALSMTVPDNVGVVTSPMEIDTVSFDKD-SSTDDSVEKATKNFWDNAGVSQILFSSDNK 359 (513) Q Consensus 281 ~~~~n~~~~~~vd~~~~~~~~~~ik~~Lp~gv~~v~sP~~~d~i~ld~~-~~~~dtv~~~~~~i~~~~GiS~~Lfn~d~~ 359 (513) . +.++..-+.+...........+.+.++...-.+ ++.-+.-+.+ ......++-..++|+.-+++..+-+.+-++ T Consensus 295 ~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~~ 369 (511) T protein:vir:96 295 L----DPVEVRKQKEANVLFLEPTVYADSEGRETEGSV-DGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSG 369 (511) T ss_pred C----CchhhcccccccceecccccccccccccCCCCc-ceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccc Confidence 1 111111000000000000011111111110000 1111111111 111123455557788888887766644333 Q ss_pred --chHHHHHHHHHHHHHHHHHHH----HHHHHHH---HHHhhcc-c--ce---EEEEEecCCCCccHHHHHHHHHHHHhc Q lcl|NC_015263. 360 --TSQGIAMSIATDEQFIFGVIN----QLERWLN---RYLLLNG-M--SK---YFKATMLEVTHFSKKEAHDRYITDAQY 424 (513) Q Consensus 360 --s~~~~~~SI~~d~~~~~~~~~----~iE~~~N---~~i~~~~-~--~~---~f~~~~l~~T~fn~ke~~~~~~~~~~~ 424 (513) |+..++.....-.+.+..-.. .|++.++ .++.... . .. ..++.|-+..+-|..+.++.+.++. T Consensus 370 n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~d~~~i~~~f~~~~p~n~~e~~~~~~kl~-- 447 (511) T protein:vir:96 370 TQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTWSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSG-- 447 (511) T ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccccccccceEEeCCCCCCCHHHHHHHHHHHh-- Confidence 333333333322222222222 2222222 2222211 1 11 4688899999999999999999984 Q ss_pred C-CcHHHHHHHHhCC--CHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccCCCCcCCCCcccccccCCC Q lcl|NC_015263. 425 G-FPVKVYLASLMGI--DPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETTGNKDSDE 501 (513) Q Consensus 425 G-~~~~~~laa~~G~--~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~~n~~~~~ 501 (513) | .|....+ ..+++ +|.+.+.++..|++. .+.....+.. ++ +.+.+.|.+..++....+.++ T Consensus 448 G~iS~et~l-~~l~~v~D~~~E~~ri~~E~~~-~~~~~~~~~~-------~~-------~~~~~~~~~~~~~~~~~~~~~ 511 (511) T protein:vir:96 448 GKISQTTLM-SLFSFFQDPELEVKKIEEDEKE-SIKKAQKGIY-------KD-------PRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred ccCChHHHH-HhCCCCCCHHHHHHHHHHHHHH-HHHHHhhccc-------cC-------CCCCCCCCCCCcccccccccC Confidence 6 6665555 45776 678999999998754 2222221111 11 011122222222222222222 No 102 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=96.49 E-value=0.00057 Score=38.36 Aligned_cols=404 Identities=8% Similarity=0.024 Sum_probs=178.3 Q ss_pred cCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhcch----------------------HHHHHHHHhh Q lcl|NC_015263. 36 RTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQQ----------------------YQRLLNFYAN 93 (513) Q Consensus 36 ~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~----------------------~~rlidy~~~ 93 (513) ++| -+-.+.+++.+..| ..-...++.+-+|+...... .+.++|-.+. T Consensus 1 ~~~---------~t~~~~~~~l~~~~--~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~ 69 (456) T protein:vir:10 1 MTA---------STPAEWLPVLTKRI--DDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVAD 69 (456) T ss_pred CCC---------CCHHHHHHHHHHHH--HHHHHHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHh Confidence 333 23445556655554 44455666666666655432 1111222222 Q ss_pred cccccceEeeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcceeEEEEEcCcce-eeeecCcceeEEE Q lcl|NC_015263. 94 MPLYAYSVVPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDIFYGYVIDDKESV-MIQQFPNDICKIS 172 (513) Q Consensus 94 mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~~~gy~i~d~~~~-~iq~lp~dyckIs 172 (513) -...+ ||..... .+.+.....+++ ++.-++......+.+.+++.|..|.+...+.++. .+..++|.-|..+ T Consensus 70 ~l~~~----~~~~~~~-~d~~~~~~~~~i---~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d~~g~~~i~~~~p~~~~~i 141 (456) T protein:vir:10 70 RIIPN----GITVGGS-ADSDLALRARRI---WRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVS 141 (456) T ss_pred hhccC----CeecCCC-CCcchHHHHHHH---HHhcChhhHHHHHHHHHhhcCeeEEEEeeCCCCceEEEEEccceeEEE Confidence 11112 2321111 112222333333 4556788889999999999999999888766653 5677888887665 Q ss_pred EE-EC-CeeEEEEEeeec-cCcc-h-hccccHHHHHHHHH---Hh-hhhhccCcccccCeee---cCCceEEEEecCccc Q lcl|NC_015263. 173 SV-SG-GVYNYVIDLDAL-VSAD-I-VDYYPKEIQEAVNK---YT-TMKKGNNKSASNWYEI---QDKNSICIKINESSL 240 (513) Q Consensus 173 g~-~n-G~y~~~fD~syF-d~~~-~-L~~~p~Ei~~~y~~---Y~-~~k~~~~~~~~~W~~L---~~~kt~~ik~~~~~~ 240 (513) -- .. ..+..++=...- +... + ...++.++...|.. |. ..+.........|+.. +..-.+|.-+--..+ T Consensus 142 ~d~~~~~~~~~~i~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~N~ 221 (456) T protein:vir:10 142 VDPLQPWRIRAAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNP 221 (456) T ss_pred EcCCCCcceEEEEEEEEecCCceeEEEEEeccceeEEEEEEEEeecccceeeeecCCceeeccccCCCCCceeEEEecCC Confidence 32 11 222222211110 1111 1 11233333322110 00 0000001111223221 111122221111223 Q ss_pred cchhhHHHHHHhHHHHHHHHHHHhh-Hhhhhhc--eeeeeeec-cccCCCCCccccCHHHHHHHHHHHHHhccccceEE- Q lcl|NC_015263. 241 TPVPPFAGTFDSIYDIHSFKDLRND-KAELQNY--KLLIQKLE-TRSSNDNNDFTLDMPMMNYFHEALSMTVPDNVGVV- 315 (513) Q Consensus 241 ~~ip~f~~v~~d~~di~~~kdL~~~-~~~i~n~--~ii~~kip-~~~~n~~~~~~vd~~~~~~~~~~ik~~Lp~gv~~v- 315 (513) .+++-|.+++ ++.|.-+.--.... -.+.... ..+.+.-+ ....++.|+.. +. ...| +.+.+.+ T Consensus 222 ~g~gd~e~vi-~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~~~-~~--~~~~--------~~~~~~~~ 289 (456) T protein:vir:10 222 DGMGEVEPHI-DIINRINRAELQLLSTMAIQAFRQRALKSTEHGLPNVDENGNAI-DY--ASIF--------EAAPGALW 289 (456) T ss_pred CCCchhhhhH-HHHHHHHHHHHHHHHHHHHhhhHhHhhhccCccccccccccccc-ch--hhhh--------hhhccccc Confidence 4554444432 23332222212111 1111111 11121111 00012222221 11 1111 1111111 Q ss_pred Eeccccccccccccc--ccchhhhhhHHhhhhhhhhhhhhccCC--CcchHHHHHHHHHHHHHHHHHHHHHHHHHHH--- Q lcl|NC_015263. 316 TSPMEIDTVSFDKDS--STDDSVEKATKNFWDNAGVSQILFSSD--NKTSQGIAMSIATDEQFIFGVINQLERWLNR--- 388 (513) Q Consensus 316 ~sP~~~d~i~ld~~~--~~~dtv~~~~~~i~~~~GiS~~Lfn~d--~~s~~~~~~SI~~d~~~~~~~~~~iE~~~N~--- 388 (513) ..|=+.+--.|+... .--+.++...++|...+|+....|+++ +.|+.+++.....-...+....+.++.-+.+ T Consensus 290 ~~~~~~~~~q~~~~~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~N~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~r 369 (456) T protein:vir:10 290 ELPPGVDIWESQANDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILV 369 (456) T ss_pred cCCCCcceEEecccChhHHHHHHHHHHHHHHhccCCChHHhcccccChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 112111111222211 112446666677888889998888875 4455556666555555555444444432222 Q ss_pred -HHhhcc-c-ceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHH--HhhCcccccCcc Q lcl|NC_015263. 389 -YLLLNG-M-SKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVEN--EMLDLPEIMTPL 463 (513) Q Consensus 389 -~i~~~~-~-~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~--e~L~l~~~~~Pl 463 (513) .+.... . ...+++.|-+..+-|..+.++.+.++.+-|.++...+...+|+++.++ ...+.|. +... T Consensus 370 l~~~~~g~~~~~~~~v~w~~~~~~~~~~~ada~~kl~~~gi~~~~~~~~~lg~~~~~i-~~~e~er~~~e~~-------- 440 (456) T protein:vir:10 370 KALQIEGESVEDTVDVSFESPDRVTLGEKYSAASLAKAAGESWASIRRNILNYNADQI-KQDDLDRAREQIT-------- 440 (456) T ss_pred HHHHhcCCCcccceeEEecCCCCcCHHHHHHHHHHHHHcCCChHHHHHhhCCCCHHHH-HHHHHHHHHHHHH-------- Confidence 222211 1 236899999999999999999999999999999999988899999876 3333322 1101 Q ss_pred cccccccccccccCCccccCCCCcC Q lcl|NC_015263. 464 SSSFNTSGSDIAENAIKEKGKENGR 488 (513) Q Consensus 464 ~TS~T~Sg~~~~~~~~~~~~~~~gr 488 (513) +.+.+ +.+...+++.| T Consensus 441 -----~~~~~----~~~~~~~~~~~ 456 (456) T protein:vir:10 441 -----LFAGN----PVQRPQEDGSR 456 (456) T ss_pred -----HHhhh----hhhcCCCCCCC Confidence 01111 00000111111 No 103 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=96.49 E-value=0.00057 Score=38.36 Aligned_cols=404 Identities=8% Similarity=0.024 Sum_probs=178.3 Q ss_pred cCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhcch----------------------HHHHHHHHhh Q lcl|NC_015263. 36 RTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQQ----------------------YQRLLNFYAN 93 (513) Q Consensus 36 ~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~----------------------~~rlidy~~~ 93 (513) ++| -+-.+.+++.+..| ..-...++.+-+|+...... .+.++|-.+. T Consensus 1 ~~~---------~t~~~~~~~l~~~~--~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~ 69 (456) T protein:vir:10 1 MTA---------STPAEWLPVLTKRI--DDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVAD 69 (456) T ss_pred CCC---------CCHHHHHHHHHHHH--HHHHHHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHh Confidence 333 23445556655554 44455666666666655432 1111222222 Q ss_pred cccccceEeeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcceeEEEEEcCcce-eeeecCcceeEEE Q lcl|NC_015263. 94 MPLYAYSVVPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDIFYGYVIDDKESV-MIQQFPNDICKIS 172 (513) Q Consensus 94 mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~~~gy~i~d~~~~-~iq~lp~dyckIs 172 (513) -...+ ||..... .+.+.....+++ ++.-++......+.+.+++.|..|.+...+.++. .+..++|.-|..+ T Consensus 70 ~l~~~----~~~~~~~-~d~~~~~~~~~i---~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d~~g~~~i~~~~p~~~~~i 141 (456) T protein:vir:10 70 RIIPN----GITVGGS-ADSDLALRARRI---WRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVS 141 (456) T ss_pred hhccC----CeecCCC-CCcchHHHHHHH---HHhcChhhHHHHHHHHHhhcCeeEEEEeeCCCCceEEEEEccceeEEE Confidence 11112 2321111 112222333333 4556788889999999999999999888766653 5677888887665 Q ss_pred EE-EC-CeeEEEEEeeec-cCcc-h-hccccHHHHHHHHH---Hh-hhhhccCcccccCeee---cCCceEEEEecCccc Q lcl|NC_015263. 173 SV-SG-GVYNYVIDLDAL-VSAD-I-VDYYPKEIQEAVNK---YT-TMKKGNNKSASNWYEI---QDKNSICIKINESSL 240 (513) Q Consensus 173 g~-~n-G~y~~~fD~syF-d~~~-~-L~~~p~Ei~~~y~~---Y~-~~k~~~~~~~~~W~~L---~~~kt~~ik~~~~~~ 240 (513) -- .. ..+..++=...- +... + ...++.++...|.. |. ..+.........|+.. +..-.+|.-+--..+ T Consensus 142 ~d~~~~~~~~~~i~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~N~ 221 (456) T protein:vir:10 142 VDPLQPWRIRAAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNP 221 (456) T ss_pred EcCCCCcceEEEEEEEEecCCceeEEEEEeccceeEEEEEEEEeecccceeeeecCCceeeccccCCCCCceeEEEecCC Confidence 32 11 222222211110 1111 1 11233333322110 00 0000001111223221 111122221111223 Q ss_pred cchhhHHHHHHhHHHHHHHHHHHhh-Hhhhhhc--eeeeeeec-cccCCCCCccccCHHHHHHHHHHHHHhccccceEE- Q lcl|NC_015263. 241 TPVPPFAGTFDSIYDIHSFKDLRND-KAELQNY--KLLIQKLE-TRSSNDNNDFTLDMPMMNYFHEALSMTVPDNVGVV- 315 (513) Q Consensus 241 ~~ip~f~~v~~d~~di~~~kdL~~~-~~~i~n~--~ii~~kip-~~~~n~~~~~~vd~~~~~~~~~~ik~~Lp~gv~~v- 315 (513) .+++-|.+++ ++.|.-+.--.... -.+.... ..+.+.-+ ....++.|+.. +. ...| +.+.+.+ T Consensus 222 ~g~gd~e~vi-~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~~~-~~--~~~~--------~~~~~~~~ 289 (456) T protein:vir:10 222 DGMGEVEPHI-DIINRINRAELQLLSTMAIQAFRQRALKSTEHGLPNVDENGNAI-DY--ASIF--------EAAPGALW 289 (456) T ss_pred CCCchhhhhH-HHHHHHHHHHHHHHHHHHHhhhHhHhhhccCccccccccccccc-ch--hhhh--------hhhccccc Confidence 4554444432 23332222212111 1111111 11121111 00012222221 11 1111 1111111 Q ss_pred Eeccccccccccccc--ccchhhhhhHHhhhhhhhhhhhhccCC--CcchHHHHHHHHHHHHHHHHHHHHHHHHHHH--- Q lcl|NC_015263. 316 TSPMEIDTVSFDKDS--STDDSVEKATKNFWDNAGVSQILFSSD--NKTSQGIAMSIATDEQFIFGVINQLERWLNR--- 388 (513) Q Consensus 316 ~sP~~~d~i~ld~~~--~~~dtv~~~~~~i~~~~GiS~~Lfn~d--~~s~~~~~~SI~~d~~~~~~~~~~iE~~~N~--- 388 (513) ..|=+.+--.|+... .--+.++...++|...+|+....|+++ +.|+.+++.....-...+....+.++.-+.+ T Consensus 290 ~~~~~~~~~q~~~~~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~N~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~r 369 (456) T protein:vir:10 290 ELPPGVDIWESQANDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILV 369 (456) T ss_pred cCCCCcceEEecccChhHHHHHHHHHHHHHHhccCCChHHhcccccChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 112111111222211 112446666677888889998888875 4455556666555555555444444432222 Q ss_pred -HHhhcc-c-ceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHH--HhhCcccccCcc Q lcl|NC_015263. 389 -YLLLNG-M-SKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVEN--EMLDLPEIMTPL 463 (513) Q Consensus 389 -~i~~~~-~-~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~--e~L~l~~~~~Pl 463 (513) .+.... . ...+++.|-+..+-|..+.++.+.++.+-|.++...+...+|+++.++ ...+.|. +... T Consensus 370 l~~~~~g~~~~~~~~v~w~~~~~~~~~~~ada~~kl~~~gi~~~~~~~~~lg~~~~~i-~~~e~er~~~e~~-------- 440 (456) T protein:vir:10 370 KALQIEGESVEDTVDVSFESPDRVTLGEKYSAASLAKAAGESWASIRRNILNYNADQI-KQDDLDRAREQIT-------- 440 (456) T ss_pred HHHHhcCCCcccceeEEecCCCCcCHHHHHHHHHHHHHcCCChHHHHHhhCCCCHHHH-HHHHHHHHHHHHH-------- Confidence 222211 1 236899999999999999999999999999999999988899999876 3333322 1101 Q ss_pred cccccccccccccCCccccCCCCcC Q lcl|NC_015263. 464 SSSFNTSGSDIAENAIKEKGKENGR 488 (513) Q Consensus 464 ~TS~T~Sg~~~~~~~~~~~~~~~gr 488 (513) +.+.+ +.+...+++.| T Consensus 441 -----~~~~~----~~~~~~~~~~~ 456 (456) T protein:vir:10 441 -----LFAGN----PVQRPQEDGSR 456 (456) T ss_pred -----HHhhh----hhhcCCCCCCC Confidence 01111 00000111111 No 104 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=96.19 E-value=0.0009 Score=37.29 Aligned_cols=446 Identities=11% Similarity=0.112 Sum_probs=174.8 Q ss_pred eeehhhhhhHHHHHHHHHHHHHhhccCcc-cccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhcchHHH--- Q lcl|NC_015263. 11 MIDVESISSYSNKRNNRISILRDDNRTPV-FGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQQYQR--- 86 (513) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~i~~~~~~~~~-~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~~~r--- 86 (513) |+-|..+...+-.++...-..+.-.+... ..+.-.-.....+.+.+.|..|+ ......++++.+|+........+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~-~~~~~r~~~l~~Yy~g~~~il~~~~~ 79 (511) T protein:vir:96 1 MLKVNEFETDTDLRGNINYLFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHM-DYQRPRLKVLSDYYEGKTKNLVELTR 79 (511) T ss_pred CccccchhhhhhhhhhhhhhhhhhhCCcccccchhhhhhcCHHHHHHHHHHHH-HhhhHHHHHHHHHhhccCccccccCc Confidence 55565555555555444334443333322 11111111222334455555542 11123455566666555443211 Q ss_pred --------------HHHHHhhcccccceE-eeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcceeEE Q lcl|NC_015263. 87 --------------LLNFYANMPLYAYSV-VPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDIFYGY 151 (513) Q Consensus 87 --------------lidy~~~mpt~dY~I-~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~~~gy 151 (513) ...+++...+ .|.+ .|..- ....+...+. +..++..-++...+..+.+.+++.|..|.+ T Consensus 80 ~~~~~~~~~ki~~n~~k~Iv~~~~-~yl~g~p~~~--~~~d~~~~~~---l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~ 153 (511) T protein:vir:96 80 RKEEYMADNRVAHDYASYISDFIN-GYFLGNPIQY--QDDDKDVLEA---IEAFNDLNDVESHNRSLGLDLSIYGKAYEL 153 (511) T ss_pred ccccccCcceeecchHHHHHHHHh-hhhcccCcee--ecCchHHHHH---HHHHHhhcChhHHHHHHHHHHHhcCeeEEE Confidence 1111111111 1111 11000 0011112222 334455566888899999999999999998 Q ss_pred EEEcCcc-eeeeecCcceeEEEEEEC--CeeEEEEEeeeccCcc----------hhcccc-HHHHHHHHHHhhhhhccCc Q lcl|NC_015263. 152 VIDDKES-VMIQQFPNDICKISSVSG--GVYNYVIDLDALVSAD----------IVDYYP-KEIQEAVNKYTTMKKGNNK 217 (513) Q Consensus 152 ~i~d~~~-~~iq~lp~dyckIsg~~n--G~y~~~fD~syFd~~~----------~L~~~p-~Ei~~~y~~Y~~~k~~~~~ 217 (513) ...|.++ +-+..++|.-|.++--.. +.+.+++ +|..... ..+.|. ..+. .|.... .... T Consensus 154 vy~d~dg~~~i~~~~p~~~~~v~dd~~~~~~~~~v--r~~~~~~~~~~~~~~~~~~~vyt~~~i~----~~~~~~-~~~~ 226 (511) T protein:vir:96 154 MIRNQDDETRLYKSDAMSTFIIYDNTVERNSIAGV--RYLRTKPIDKTDEDEVFTVDLFTSHGVY----RYLTNR-TNGL 226 (511) T ss_pred EEeCCCCceEEEEEcccceEEEEcCCCCCceEEEE--EEEEeeeccccccceEEEEEEEeCCcEE----EEEecC-CCcc Confidence 8877654 677788999888775432 3444444 2221100 011111 1110 010000 0000 Q ss_pred ccccCeeecCCce----EEEEecCccccchhhHHHHHHhHHHHHHHHH-HH-hhHhhhhhceeeeeeeccccCCCCCccc Q lcl|NC_015263. 218 SASNWYEIQDKNS----ICIKINESSLTPVPPFAGTFDSIYDIHSFKD-LR-NDKAELQNYKLLIQKLETRSSNDNNDFT 291 (513) Q Consensus 218 ~~~~W~~L~~~kt----~~ik~~~~~~~~ip~f~~v~~d~~di~~~kd-L~-~~~~~i~n~~ii~~kip~~~~n~~~~~~ 291 (513) ...-|..=..+|. .++.+- ....+.| .|.++..+.+.=+ +. +....++... ..+.+- .|... T Consensus 227 ~~~~~~~~~~~~~~g~vPvv~~~-n~~~g~g----d~e~v~~liDa~~~~~S~~~~~~~~~~---~~~lv~----~G~~~ 294 (511) T protein:vir:96 227 KLTPRENSFESHSFERMPITEFS-NNERRKG----DYEKVITLIDLYDNAESDTANYMSDLN---DAMLLI----KGNLN 294 (511) T ss_pred cccccccccccCcCcccceEEec-CCCCCCC----chhhhHHHHHHHHHHHHHHHHHHHHhh---cchhhe----ecCcc Confidence 0000111000011 112221 1122333 3333333333222 11 1111222110 111110 01111 Q ss_pred cCHHHHHHHHHHHHHhc-------cccceEEEecccccccccccc-cccchhhhhhHHhhhhhhhhhhhhccCCCc--ch Q lcl|NC_015263. 292 LDMPMMNYFHEALSMTV-------PDNVGVVTSPMEIDTVSFDKD-SSTDDSVEKATKNFWDNAGVSQILFSSDNK--TS 361 (513) Q Consensus 292 vd~~~~~~~~~~ik~~L-------p~gv~~v~sP~~~d~i~ld~~-~~~~dtv~~~~~~i~~~~GiS~~Lfn~d~~--s~ 361 (513) .+.+........---.+ +++......+ ++.-+.-+.+ ......++-..++|+.-+++..+-+.+-++ |+ T Consensus 295 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg 373 (511) T protein:vir:96 295 LDPVEVRKQKEANVLFLEPTVYVDAEGRETEGSV-DGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSG 373 (511) T ss_pred CCchhhcccccccceeccccceeccccccCCCCc-ceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccccccHH Confidence 11111111100000000 1111000000 1100100100 111122444557788888877766654333 44 Q ss_pred HHHHHHHHHHHHHHHH----HHHHHHHHHHH---HHhhc-cc--ce---EEEEEecCCCCccHHHHHHHHHHHHhcC-Cc Q lcl|NC_015263. 362 QGIAMSIATDEQFIFG----VINQLERWLNR---YLLLN-GM--SK---YFKATMLEVTHFSKKEAHDRYITDAQYG-FP 427 (513) Q Consensus 362 ~~~~~SI~~d~~~~~~----~~~~iE~~~N~---~i~~~-~~--~~---~f~~~~l~~T~fn~ke~~~~~~~~~~~G-~~ 427 (513) ..++.....-...+.. |-+.|++-++. ++... .. .. ..++.|-+..+-|..+.++.+.++. | .| T Consensus 374 ~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~~~~~~~i~~~f~~~~p~n~~e~~d~~~kl~--G~iS 451 (511) T protein:vir:96 374 EAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSG--GKIS 451 (511) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccceEEeCCCCCcCHHHHHHHHHHHh--ccCC Confidence 4444333322222222 22223332322 22221 11 11 3689999999999999999999985 6 56 Q ss_pred HHHHHHHHhCC--CHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccCCCCcCCCCcccccccCCC Q lcl|NC_015263. 428 VKVYLASLMGI--DPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETTGNKDSDE 501 (513) Q Consensus 428 ~~~~laa~~G~--~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~~n~~~~~ 501 (513) ....+. .+++ +|.+.+.++..|++. .+.....+. ++++ .+.+.+.|..++....+-++ T Consensus 452 ~et~l~-~l~~v~d~~~El~ri~~E~~~-~~~~~~~~~-------~~~~-------~~~~~~~~~~~~~~~~~e~~ 511 (511) T protein:vir:96 452 QTTLMS-LFSFFQDPELEVKKIEEDEKE-SIKKAQKGI-------YKDP-------RDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred hHHHHH-hCCCCCCHHHHHHHHHHHHHH-HHHHHhhcc-------ccCC-------CCCCCCCCCCCccCcccccC Confidence 555554 5777 578999999988743 211111111 1110 11122222222222111111 No 105 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=96.19 E-value=0.0009 Score=37.29 Aligned_cols=446 Identities=11% Similarity=0.112 Sum_probs=174.8 Q ss_pred eeehhhhhhHHHHHHHHHHHHHhhccCcc-cccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhcchHHH--- Q lcl|NC_015263. 11 MIDVESISSYSNKRNNRISILRDDNRTPV-FGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQQYQR--- 86 (513) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~i~~~~~~~~~-~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~~~r--- 86 (513) |+-|..+...+-.++...-..+.-.+... ..+.-.-.....+.+.+.|..|+ ......++++.+|+........+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~-~~~~~r~~~l~~Yy~g~~~il~~~~~ 79 (511) T protein:vir:78 1 MLKVNEFETDTDLRGNINYLFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHM-DYQRPRLKVLSDYYEGKTKNLVELTR 79 (511) T ss_pred CccccchhhhhhhhhhhhhhhhhhhCCcccccchhhhhhcCHHHHHHHHHHHH-HhhhHHHHHHHHHhhccCccccccCc Confidence 55565555555555444334443333322 11111111222334455555542 11123455566666555443211 Q ss_pred --------------HHHHHhhcccccceE-eeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcceeEE Q lcl|NC_015263. 87 --------------LLNFYANMPLYAYSV-VPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDIFYGY 151 (513) Q Consensus 87 --------------lidy~~~mpt~dY~I-~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~~~gy 151 (513) ...+++...+ .|.+ .|..- ....+...+. +..++..-++...+..+.+.+++.|..|.+ T Consensus 80 ~~~~~~~~~ki~~n~~k~Iv~~~~-~yl~g~p~~~--~~~d~~~~~~---l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~ 153 (511) T protein:vir:78 80 RKEEYMADNRVAHDYASYISDFIN-GYFLGNPIQY--QDDDKDVLEA---IEAFNDLNDVESHNRSLGLDLSIYGKAYEL 153 (511) T ss_pred ccccccCcceeecchHHHHHHHHh-hhhcccCcee--ecCchHHHHH---HHHHHhhcChhHHHHHHHHHHHhcCeeEEE Confidence 1111111111 1111 11000 0011112222 334455566888899999999999999998 Q ss_pred EEEcCcc-eeeeecCcceeEEEEEEC--CeeEEEEEeeeccCcc----------hhcccc-HHHHHHHHHHhhhhhccCc Q lcl|NC_015263. 152 VIDDKES-VMIQQFPNDICKISSVSG--GVYNYVIDLDALVSAD----------IVDYYP-KEIQEAVNKYTTMKKGNNK 217 (513) Q Consensus 152 ~i~d~~~-~~iq~lp~dyckIsg~~n--G~y~~~fD~syFd~~~----------~L~~~p-~Ei~~~y~~Y~~~k~~~~~ 217 (513) ...|.++ +-+..++|.-|.++--.. +.+.+++ +|..... ..+.|. ..+. .|.... .... T Consensus 154 vy~d~dg~~~i~~~~p~~~~~v~dd~~~~~~~~~v--r~~~~~~~~~~~~~~~~~~~vyt~~~i~----~~~~~~-~~~~ 226 (511) T protein:vir:78 154 MIRNQDDETRLYKSDAMSTFIIYDNTVERNSIAGV--RYLRTKPIDKTDEDEVFTVDLFTSHGVY----RYLTNR-TNGL 226 (511) T ss_pred EEeCCCCceEEEEEcccceEEEEcCCCCCceEEEE--EEEEeeeccccccceEEEEEEEeCCcEE----EEEecC-CCcc Confidence 8877654 677788999888775432 3444444 2221100 011111 1110 010000 0000 Q ss_pred ccccCeeecCCce----EEEEecCccccchhhHHHHHHhHHHHHHHHH-HH-hhHhhhhhceeeeeeeccccCCCCCccc Q lcl|NC_015263. 218 SASNWYEIQDKNS----ICIKINESSLTPVPPFAGTFDSIYDIHSFKD-LR-NDKAELQNYKLLIQKLETRSSNDNNDFT 291 (513) Q Consensus 218 ~~~~W~~L~~~kt----~~ik~~~~~~~~ip~f~~v~~d~~di~~~kd-L~-~~~~~i~n~~ii~~kip~~~~n~~~~~~ 291 (513) ...-|..=..+|. .++.+- ....+.| .|.++..+.+.=+ +. +....++... ..+.+- .|... T Consensus 227 ~~~~~~~~~~~~~~g~vPvv~~~-n~~~g~g----d~e~v~~liDa~~~~~S~~~~~~~~~~---~~~lv~----~G~~~ 294 (511) T protein:vir:78 227 KLTPRENSFESHSFERMPITEFS-NNERRKG----DYEKVITLIDLYDNAESDTANYMSDLN---DAMLLI----KGNLN 294 (511) T ss_pred cccccccccccCcCcccceEEec-CCCCCCC----chhhhHHHHHHHHHHHHHHHHHHHHhh---cchhhe----ecCcc Confidence 0000111000011 112221 1122333 3333333333222 11 1111222110 111110 01111 Q ss_pred cCHHHHHHHHHHHHHhc-------cccceEEEecccccccccccc-cccchhhhhhHHhhhhhhhhhhhhccCCCc--ch Q lcl|NC_015263. 292 LDMPMMNYFHEALSMTV-------PDNVGVVTSPMEIDTVSFDKD-SSTDDSVEKATKNFWDNAGVSQILFSSDNK--TS 361 (513) Q Consensus 292 vd~~~~~~~~~~ik~~L-------p~gv~~v~sP~~~d~i~ld~~-~~~~dtv~~~~~~i~~~~GiS~~Lfn~d~~--s~ 361 (513) .+.+........---.+ +++......+ ++.-+.-+.+ ......++-..++|+.-+++..+-+.+-++ |+ T Consensus 295 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg 373 (511) T protein:vir:78 295 LDPVEVRKQKEANVLFLEPTVYVDAEGRETEGSV-DGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSG 373 (511) T ss_pred CCchhhcccccccceeccccceeccccccCCCCc-ceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccccccHH Confidence 11111111100000000 1111000000 1100100100 111122444557788888877766654333 44 Q ss_pred HHHHHHHHHHHHHHHH----HHHHHHHHHHH---HHhhc-cc--ce---EEEEEecCCCCccHHHHHHHHHHHHhcC-Cc Q lcl|NC_015263. 362 QGIAMSIATDEQFIFG----VINQLERWLNR---YLLLN-GM--SK---YFKATMLEVTHFSKKEAHDRYITDAQYG-FP 427 (513) Q Consensus 362 ~~~~~SI~~d~~~~~~----~~~~iE~~~N~---~i~~~-~~--~~---~f~~~~l~~T~fn~ke~~~~~~~~~~~G-~~ 427 (513) ..++.....-...+.. |-+.|++-++. ++... .. .. ..++.|-+..+-|..+.++.+.++. | .| T Consensus 374 ~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~~~~~~~i~~~f~~~~p~n~~e~~d~~~kl~--G~iS 451 (511) T protein:vir:78 374 EAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSG--GKIS 451 (511) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccceEEeCCCCCcCHHHHHHHHHHHh--ccCC Confidence 4444333322222222 22223332322 22221 11 11 3689999999999999999999985 6 56 Q ss_pred HHHHHHHHhCC--CHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccCCCCcCCCCcccccccCCC Q lcl|NC_015263. 428 VKVYLASLMGI--DPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETTGNKDSDE 501 (513) Q Consensus 428 ~~~~laa~~G~--~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~~n~~~~~ 501 (513) ....+. .+++ +|.+.+.++..|++. .+.....+. ++++ .+.+.+.|..++....+-++ T Consensus 452 ~et~l~-~l~~v~d~~~El~ri~~E~~~-~~~~~~~~~-------~~~~-------~~~~~~~~~~~~~~~~~e~~ 511 (511) T protein:vir:78 452 QTTLMS-LFSFFQDPELEVKKIEEDEKE-SIKKAQKGI-------YKDP-------RDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred hHHHHH-hCCCCCCHHHHHHHHHHHHHH-HHHHHhhcc-------ccCC-------CCCCCCCCCCCccCcccccC Confidence 555554 5777 578999999988743 211111111 1110 11122222222222111111 No 106 >protein:vir:104338 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398967;genbank:gi:81343951;genbank:GeneID:3778870 Probab=96.18 E-value=0.00091 Score=37.26 Aligned_cols=388 Identities=11% Similarity=0.050 Sum_probs=167.6 Q ss_pred eeehhhhhhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhcchHHHHHHH Q lcl|NC_015263. 11 MIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQQYQRLLNF 90 (513) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~~~rlidy 90 (513) |.-+. +|.|+.. .. .++ +..|....+ .-+.....+|..++.++++|+- T Consensus 1 ~~~~D---~~~n~~~-------gg-~~~---~~~~~~~~~------------------~~~~~l~a~Y~~~~l~~~~Vd~ 48 (422) T protein:vir:10 1 MVKTD---SYANIFL-------GG-SDG---SEIYGSLQN------------------QAPTILASLYADNALVRRIIDT 48 (422) T ss_pred Cccch---hhHHHHc-------CC-CCC---ccccCcccc------------------cCHHHHHHHHHhChhhHHHHhh Confidence 44443 4666532 00 001 000110000 1123455789999999999998 Q ss_pred HhhcccccceEeeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcceeEEEEEcCcceeeeecCcceeE Q lcl|NC_015263. 91 YANMPLYAYSVVPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDIFYGYVIDDKESVMIQQFPNDICK 170 (513) Q Consensus 91 ~~~mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~~~gy~i~d~~~~~iq~lp~dyck 170 (513) .....|=...-.. +++ . +.. .-.-+++++++..+.+.++-.-..|.-+-+.....+...-+||..+ T Consensus 49 ~aed~~r~g~~i~-~~~---~----~~~---~~~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~~~~Pl~~~--- 114 (422) T protein:vir:10 49 IPETALAAGFHID-GID---D----EPA---FWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREG--- 114 (422) T ss_pred hhHHHhcCCcccc-CCC---H----HHH---HHHHHHHhhHHHHHHHHHHhhccccceEEEEEecCCCCcccccccc--- Confidence 8876653333221 110 1 111 2234677777665555555555555444454442233345666643 Q ss_pred EEEEECCeeE--EEEEeeeccCcchhccccHHHHHHHHHHhhhh-hccCcccccCe----------eecCCceEEEEe-- Q lcl|NC_015263. 171 ISSVSGGVYN--YVIDLDALVSADIVDYYPKEIQEAVNKYTTMK-KGNNKSASNWY----------EIQDKNSICIKI-- 235 (513) Q Consensus 171 Isg~~nG~y~--~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k-~~~~~~~~~W~----------~L~~~kt~~ik~-- 235 (513) |.+. .+||-... -|.+ |..+- ..+..--..|. .+-+.+-+.|.= T Consensus 115 ------g~~~~l~v~d~~~i--------~~~~-------~~~dp~s~~fg~P~~y~v~~~~~~~~~~iH~SRli~~~g~~ 173 (422) T protein:vir:10 115 ------AELETVRVYDRTQV--------KVQT-------REENPRNARFGEPLTYRITTNESDMFYDVHYSRIHIIDGER 173 (422) T ss_pred ------CceeeEEeeccccc--------cchh-------cccCccccccCcceEEEEecCCCCcceeeccceeEEeCCCC Confidence 2222 22222221 0111 10000 00000000122 222323222210 Q ss_pred ------cCccccchhhHHH-HHHhHHHHHHHHHHHhhHhhh-hhceeeeeeec-cccCCCCCccccCHHHHHHHHHHHHH Q lcl|NC_015263. 236 ------NESSLTPVPPFAG-TFDSIYDIHSFKDLRNDKAEL-QNYKLLIQKLE-TRSSNDNNDFTLDMPMMNYFHEALSM 306 (513) Q Consensus 236 ------~~~~~~~ip~f~~-v~~d~~di~~~kdL~~~~~~i-~n~~ii~~kip-~~~~n~~~~~~vd~~~~~~~~~~ik~ 306 (513) .....||++++.. +++.+.+.+ .....-..| ....+.+-++. +..--.++ .....+..-++.+.. T Consensus 174 ~p~~~~~~~~~~G~S~l~~~~~~~i~~~~---~~~~~~~~l~~~~~~~v~~~~~l~~~~~~~---~~~~~~~~r~~~~~~ 247 (422) T protein:vir:10 174 IPNVMRRQNDGWGRSVLSSDILDSIKDYT---NCERLATQLLKRKQQAVWKAKGLAELCDDS---EGFGAARLRLAQVDN 247 (422) T ss_pred chhhhcccCCcccchhHHHHHHHHHHHHH---HHHHHHHHHHHHhccccccchhHHHhcCCc---cchHHHHHHHHHHHH Confidence 0112245555543 344444433 333222222 12223333333 11100111 111122222222222 Q ss_pred hcc-ccceEEEec-ccccccccccccccchhhhhhHHhhhhhhhhhhh-hccCCCcchHHHHHHHHHHH----HHHHHHH Q lcl|NC_015263. 307 TVP-DNVGVVTSP-MEIDTVSFDKDSSTDDSVEKATKNFWDNAGVSQI-LFSSDNKTSQGIAMSIATDE----QFIFGVI 379 (513) Q Consensus 307 ~Lp-~gv~~v~sP-~~~d~i~ld~~~~~~dtv~~~~~~i~~~~GiS~~-Lfn~d~~s~~~~~~SI~~d~----~~~~~~~ 379 (513) .-- .+...+... -+++.++.+-+ .-.+.+....++|-.++||-.. |||-. .+|++.+=+.|. ..|-+++ T Consensus 248 ~~~~~~~~~l~~~~e~~e~~~~~ls-gl~~~~~~~~~~iaaa~~IP~t~L~G~s---~~Glnatgd~d~~~yyd~i~~~Q 323 (422) T protein:vir:10 248 NSGVGQAIGIDAESEEYSVLNSDIG-GIDAFLDKKFDRIVALSGIHEIILKNKN---VGGVSSSQNTALETFHKLVDRKR 323 (422) T ss_pred hcCCccceeEecCCcceEEEecccC-ChHHHHHHHHHHHHhhhCCCeeeeccCC---cccccccchHHHHHHHHHHHHHH Confidence 211 222233333 24555544433 3458899999999999999955 55443 344443333333 3333444 Q ss_pred H-HHHHHHHHHHhhcccceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhC-CCHHHHHHHHHHHHHhhCcc Q lcl|NC_015263. 380 N-QLERWLNRYLLLNGMSKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMG-IDPVAFTGLLKVENEMLDLP 457 (513) Q Consensus 380 ~-~iE~~~N~~i~~~~~~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G-~~p~~~~~~~~~E~e~L~l~ 457 (513) + .+...+++.+.....+..|.++|.+.--.+.||+++..++.++- ... +.. .| ++|.++-..+.-.-...++. T Consensus 324 e~~l~p~l~~l~~~i~~s~~~~~~f~pL~~~sekekaei~~~~a~a---~~~-~~~-~g~i~~~e~r~~L~~~~~~~~~~ 398 (422) T protein:vir:10 324 NAELLPILEFLIPFIVNAEEWSVEFNPLAQESSKDKAEILEKNVNS---IAA-LIA-AGAMDIDEARDTLRTIAPEVKIN 398 (422) T ss_pred HHHHHHHHHHHHHHhcccCCcEEEeCCCCCCCHHHHHHHHHHHHHH---HHH-HHh-cCCCCHHHHHHHhhhhcccccCC Confidence 3 34556666666543445699999999999999999997776641 122 333 46 79988887775432222332 Q ss_pred cccCcccccccccccccccCCccccC-CCCcCCCCccccccc Q lcl|NC_015263. 458 EIMTPLSSSFNTSGSDIAENAIKEKG-KENGRPTNETTGNKD 498 (513) Q Consensus 458 ~~~~Pl~TS~T~Sg~~~~~~~~~~~~-~~~grPt~et~~n~~ 498 (513) +-+.|.-- .. ..+..|..+ .+.+ T Consensus 399 ~~~~~~~~----------------~~~~~~~~~~~~--~~~d 422 (422) T protein:vir:10 399 DGSVETEV----------------TISETSNDPLEV--PTDD 422 (422) T ss_pred CCCCcccc----------------chhhcCCCCCCC--CCCC Confidence 22222110 00 001111111 1000 No 107 >protein:vir:94002 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764318;genbank:gi:115315632;genbank:GeneID:5176589 Probab=96.11 E-value=0.00099 Score=37.05 Aligned_cols=347 Identities=15% Similarity=0.112 Sum_probs=149.4 Q ss_pred cccccccchHHHH-----HHHhhhccChhHHHHHHHHHHHHHhhcchHHHHHHHHhh-cccccceEeeccchhhhhhcch Q lcl|NC_015263. 42 APVGSLTSSQSKV-----RKIVKEYRNEGNQKTLRKVSEDLAVQSQQYQRLLNFYAN-MPLYAYSVVPFKDISTANENKL 115 (513) Q Consensus 42 s~~~s~~~s~d~~-----k~~i~~~~P~~n~~~ir~~s~~lY~~sg~~~rlidy~~~-mpt~dY~I~P~~~~~~~~~~~~ 115 (513) =.+|++..+-... .+-+..| +. +-.+..+..+++.|++++. +..+...++--.......++.. T Consensus 1 Mg~f~~~~~~~~~~~~~~~~~~~~~-~~----------~~~~~~~~~v~~~v~~IA~~iA~lp~~~~~~~~~~~~~~~~~ 69 (378) T protein:vir:94 1 MNLFGKVVSFSRGKLNNDTQRVTAW-QN----------EAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLI 69 (378) T ss_pred CCccccchhcccccccCCcceeeee-cc----------chhHHHHHHHHHHHHHHHhhhhhCceeeEEEcccCccccccc Confidence 1223322110000 0001111 11 1122345567788877765 2222222221011111111111 Q ss_pred hHHHHHHHHHHhh-----cChhHHHHHHHHHHHHhcceeEEEEEc-CcceeeeecCcceeEEEEEECCeeEEEEEeeecc Q lcl|NC_015263. 116 KKELATVTEFLSR-----LNPKYNFSKIVKLAMTVDIFYGYVIDD-KESVMIQQFPNDICKISSVSGGVYNYVIDLDALV 189 (513) Q Consensus 116 ~~~y~~v~~~L~k-----~n~k~~~~~i~~~~l~~g~~~gy~i~d-~~~~~iq~lp~dyckIsg~~nG~y~~~fD~syFd 189 (513) ...=+.+...|.. +....+...++..++..|..|.|.+.+ .++..+.-.|.. +. T Consensus 70 ~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~g~~~~l~p~~---------~~----------- 129 (378) T protein:vir:94 70 SMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLSAPYVDLYAVFDDNTGELLDLLFAD---------DK----------- 129 (378) T ss_pred ccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeeCCCceEEEEEecC---------Ce----------- Confidence 1111233444542 335677788889999999999997643 334332222210 00 Q ss_pred CcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEEecCccccchhhHHHHHHhHHHHHHHHHHHhhHhhh Q lcl|NC_015263. 190 SADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKINESSLTPVPPFAGTFDSIYDIHSFKDLRNDKAEL 269 (513) Q Consensus 190 ~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~~~~~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i 269 (513) .+++++.-|-|+.......+++|...+...+ ...+ T Consensus 130 ---------------------------------~~~~~~diiH~~~~~~~~~g~s~l~~~~~~i------------~~~~ 164 (378) T protein:vir:94 130 ---------------------------------KEYKPEELVRLTSPFYINEDTSILDNALASI------------QTKL 164 (378) T ss_pred ---------------------------------eEeeeeeeEEecCcCCccchhHHHHHHHHHH------------HHHH Confidence 0112222333331111112333332222211 1111 Q ss_pred hhc--eeeeeeeccccCCCCCccccCHHHH----HHHHHHHHHhc----cccceEEEecccccccccccccccchhhhhh Q lcl|NC_015263. 270 QNY--KLLIQKLETRSSNDNNDFTLDMPMM----NYFHEALSMTV----PDNVGVVTSPMEIDTVSFDKDSSTDDSVEKA 339 (513) Q Consensus 270 ~n~--~ii~~kip~~~~n~~~~~~vd~~~~----~~~~~~ik~~L----p~gv~~v~sP~~~d~i~ld~~~~~~dtv~~~ 339 (513) .+- ..+ -++| -.++.+.. ++|.+.++... ..|+..+-..+++..++++.....-....-. T Consensus 165 ~~~~~~gi-l~~~---------~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~~~~~~~~~~~ 234 (378) T protein:vir:94 165 EQGKLRGL-LKIN---------AFLDIDNTQEYREKALTTIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNKDEIDLI 234 (378) T ss_pred hcccccce-eeeC---------CcCCHHHHHHHHHHHHHHHHHhhcccccccceecCCCceEEEccCChhhhhHHHHHHH Confidence 121 111 1122 12333333 44444444432 2244455455555555553322222334455 Q ss_pred HHhhhhhhhhhhhhccCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc-----cc----ceEEEEEecCCCCcc Q lcl|NC_015263. 340 TKNFWDNAGVSQILFSSDNKTSQGIAMSIATDEQFIFGVINQLERWLNRYLLLN-----GM----SKYFKATMLEVTHFS 410 (513) Q Consensus 340 ~~~i~~~~GiS~~Lfn~d~~s~~~~~~SI~~d~~~~~~~~~~iE~~~N~~i~~~-----~~----~~~f~~~~l~~T~fn 410 (513) .+.|..+.||...+++++. ++....+. -..-+.-++.+||..+|+.|-.. .. ...++|..-....-+ T Consensus 235 ~~~Ia~~fgVP~~~l~~~~--se~~~~~f--~~~tL~P~~~~ie~~l~~~Ll~~~er~~g~~~~~~~~~~f~~~~l~~~d 310 (378) T protein:vir:94 235 KSELLTGYFMNENILLGTA--SQEQQIYF--YNSTIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFAT 310 (378) T ss_pred HHHHHHHhCCCHHHhcCCh--HHHHHHHH--HHHHHHHHHHHHHHHHHhhcCChhHhhhhhhcccccceeecchhhhhcC Confidence 6789999999988887653 22222222 22233468999999999988532 11 113566666677778 Q ss_pred HHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccCCCCcCCC Q lcl|NC_015263. 411 KKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPT 490 (513) Q Consensus 411 ~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt 490 (513) .++.++.+.++..-|+=..--+-+.+|+.|.+ |=|..++|+- ...-.+.+...+ ...++.|+ T Consensus 311 ~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~------------gGD~~~~~~n---~~~~~~~~~~~~---~~~~~~~~ 372 (378) T protein:vir:94 311 LKELIDLYHENINGPIFTQNQLLVKMGEQPIE------------GGDVYIANLN---AVAVKNLSDLQG---SRKDVTST 372 (378) T ss_pred HHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC------------CCCeeeeccc---ccccccchhhcC---CcCCCCCC Confidence 89999999998888843333333346776642 2334444443 211111111110 01111122 Q ss_pred CcccccccCCCCCCC Q lcl|NC_015263. 491 NETTGNKDSDETQRA 505 (513) Q Consensus 491 ~et~~n~~~~~~~~~ 505 (513) +| +++. T Consensus 373 ~e---------~~n~ 378 (378) T protein:vir:94 373 DE---------TNNQ 378 (378) T ss_pred CC---------CCCC Confidence 22 1111 No 108 >protein:vir:96068 Length: 765 # NCBI annotation: conserved hypothetical protein ORF017 # Family: family:all:297 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294434;genbank:gi:149408331;genbank:GeneID:5237187 Probab=96.06 E-value=0.0011 Score=36.91 Aligned_cols=464 Identities=10% Similarity=0.052 Sum_probs=180.1 Q ss_pred CCCccchheeeeehhhhhhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhcc-ChhHHHHHHHHHHHHHh Q lcl|NC_015263. 1 MVKNKKKRLSMIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYR-NEGNQKTLRKVSEDLAV 79 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~-P~~n~~~ir~~s~~lY~ 79 (513) +.+- ..+...++.+--.. .||. -.+. .-+.++.+.+-.++.+.+ ....++..|+ |....- ..++ -+|. T Consensus 50 ~~~~-~~~~~~~~~~~~~~--~~a~--ds~~-~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~f~g--yql~-alY~ 118 (765) T protein:vir:96 50 PEKA-PVIRSVKDFLEPGL--SVAM--DSAY-GDGPTPAAKAAAGGQNPY--VVPTMLQDWYNSQGFIG--YQAC-AIIS 118 (765) T ss_pred cccC-CCCCCCCcccCccc--ceec--cccc-cccccchHHHhhhccCcc--chhhHHHhhhcccCCcc--HHHH-HHHH Confidence 0000 00111122110000 0000 0000 001112222222222211 2233344443 322221 1344 3799 Q ss_pred hcchHHHHHHHHhhcccccceEeeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcceeEEEEEc-Ccc Q lcl|NC_015263. 80 QSQQYQRLLNFYANMPLYAYSVVPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDIFYGYVIDD-KES 158 (513) Q Consensus 80 ~sg~~~rlidy~~~mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~~~gy~i~d-~~~ 158 (513) .++.++++|+-.....|=..+-.- +. ....+....+.+. ..+++++++.-+...++..-..|..|-+.+.+ .++ T Consensus 119 ~~~l~rkiVd~pAeDa~R~g~~I~-~~-~~e~~~~~~~~l~---~~~~rl~v~~~l~ea~~~~RlyGga~i~i~i~~~D~ 193 (765) T protein:vir:96 119 QHWLVDKACSMSGEDAARNGWELK-SD-GRKLSDEQSALIA---RRDMEFRVKDNLVELNRFKNVFGVRIALFVVESDDP 193 (765) T ss_pred hCchhhhhhhcchHHhhcCCceee-cC-ccccCHHHHHHHH---HHHHHhhHHHHHHHHHHHhhhceeeEEEEEecccCc Confidence 999999999999777666644321 11 1111222223233 34566666665555555555555555454433 333 Q ss_pred e-eeeecCcceeEEEEEECCeeEE--EEEeeeccCcchh----ccccHHHHHHHHHHhhhhhccCcccccCeeecCCceE Q lcl|NC_015263. 159 V-MIQQFPNDICKISSVSGGVYNY--VIDLDALVSADIV----DYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSI 231 (513) Q Consensus 159 ~-~iq~lp~dyckIsg~~nG~y~~--~fD~syFd~~~~L----~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~ 231 (513) . .=+||+++. +..|.+.+ +||--+-....+. +-..+-+-+- ..|. .++ ..+-+.+ T Consensus 194 ~~l~~PL~~~~-----I~kg~~kgl~vldp~~~~~~~v~e~~~Dp~sp~fg~P-~~y~----i~g------~~IH~SR-- 255 (765) T protein:vir:96 194 DYYEKPFNPDG-----IAPGSYKGISQIDPYWAMPQLTAESTADPSAEHFYEP-DFWI----ISG------KKYHRSH-- 255 (765) T ss_pred chhhccccccc-----cccceeeEEEEechhhcccccchhccccccccccCcc-eeee----ecC------ceeccce-- Confidence 2 336776653 33344432 3332111000000 0011111000 0110 011 1233333 Q ss_pred EEEecCcc----------ccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHH Q lcl|NC_015263. 232 CIKINESS----------LTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFH 301 (513) Q Consensus 232 ~ik~~~~~----------~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~ 301 (513) +|.+.... -||+|-+-.++..+.+.+.-...- -.-+....+.+-++.. ...- .+.+.+.+-- T Consensus 256 li~~~g~~lpd~lk~~~~~~G~Svlq~~yd~I~~~~~t~~~~--a~Ll~k~~~~v~k~~~--~~~l----~~~~~l~~r~ 327 (765) T protein:vir:96 256 LVVVRGPQPPDILKPTYIFGGIPLTQRIYERVYAAERTANEA--PLLAMSKRTSTIHVDV--EKAI----ANEDAFNARL 327 (765) T ss_pred EEEecCCCchhhhccccCccCccHHHHHHHHHHHHHHHHHHH--HHHHHHhccceeeech--Hhhh----ccHHHHHHHH Confidence 33332221 257776666666666555443322 1111122222223321 1111 1222222222 Q ss_pred HHHHHhc-cccceEEEecccccccccccccccchhhhhhHHhhhhhhhhh-hhhccCC-CcchHHHHHHHHHHHHHHHHH Q lcl|NC_015263. 302 EALSMTV-PDNVGVVTSPMEIDTVSFDKDSSTDDSVEKATKNFWDNAGVS-QILFSSD-NKTSQGIAMSIATDEQFIFGV 378 (513) Q Consensus 302 ~~ik~~L-p~gv~~v~sP~~~d~i~ld~~~~~~dtv~~~~~~i~~~~GiS-~~Lfn~d-~~s~~~~~~SI~~d~~~~~~~ 378 (513) +.+.+.. -.|+..+-.-=+++.++.+-+ .-.+.+....++|-.++||- ..|||-. ++-+++-..-+..-...|-++ T Consensus 328 ~~~~~~r~n~g~~~id~ee~~e~~s~~ls-gl~d~l~~~~~~iAaas~IP~t~LfGqsp~GlnATGe~D~~nYyD~I~s~ 406 (765) T protein:vir:96 328 AFWIANRDNHGVKVIGIDETMEQFDTNLS-DFDSVIMNQYQLVAAIAKTPATKLLGTSPKGFNATGEHETISYHEELESI 406 (765) T ss_pred HHHHHhcCCceeEEecCCcceeEEecccC-CHHHHHHHHHHHHHhhhCCCeeeeccCCcccccCcchHHHHHHHHHHHHH Confidence 2233322 234444333334444444433 34578888999999999995 7888743 222222222233333333333 Q ss_pred HHH-HHHHHHHHH----hhcccceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhC-CCHHHHHHHHHHHHH Q lcl|NC_015263. 379 INQ-LERWLNRYL----LLNGMSKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMG-IDPVAFTGLLKVENE 452 (513) Q Consensus 379 ~~~-iE~~~N~~i----~~~~~~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G-~~p~~~~~~~~~E~e 452 (513) ++. +...+++.+ ....+...|.+.|.++--.+.||+++..++.++-= .. +. ..| ++|.++-..+..+- T Consensus 407 Qe~~l~p~le~L~~li~~s~~i~~d~~i~FnpL~~~sekEkAei~~k~Aea~---~~-~~-~~Gvis~dEvR~~L~~~~- 480 (765) T protein:vir:96 407 QEHIFDPLLERHYLLLAKSESIDVQLEIVWNPVDSTTSQQQAELNNKKAATD---EI-YI-NSGVVSPDEVRERLRDDP- 480 (765) T ss_pred HHHHHHHHHHHHHHHHHHhcCCCCcceEEeCCCCCCCHHHHHHHHHHHHHHH---HH-HH-hcCCCCHHHHHHHHhccc- Confidence 322 222344433 33344446999999999999999999977664311 12 22 246 88999999876542 Q ss_pred hhCccccc---C---ccccccc------------ccccccccCCccccCCCCcCCCCc----ccccc------cCCCCCC Q lcl|NC_015263. 453 MLDLPEIM---T---PLSSSFN------------TSGSDIAENAIKEKGKENGRPTNE----TTGNK------DSDETQR 504 (513) Q Consensus 453 ~L~l~~~~---~---Pl~TS~T------------~Sg~~~~~~~~~~~~~~~grPt~e----t~~n~------~~~~~~~ 504 (513) ..+++++- + |...--+ .....+.++...+..-+++.|..+ +.... ..|.+.+ T Consensus 481 ~~g~~~l~d~~~e~~~~~~pe~~~~~~~~~~~~~~~~~e~~~~~a~p~~~eg~~~~~~~~p~~~~p~~~~~~~~~g~~~~ 560 (765) T protein:vir:96 481 RSGYNRLTDDQAETEPGMSPENLAELEKAGAQSAKAKGEAERAEAQAGAVEGAGDPVPAAPRGTKPLAKAAEEGAGEAAT 560 (765) T ss_pred cCCCCCCCccccccccCCCccccccccCCCcccccccCccccccCCCCccCCCCcccccCCcccCCccccccccCccccC Confidence 22332110 0 1110000 000000000000000111111100 00000 0011111 Q ss_pred CC--CCccC--CC Q lcl|NC_015263. 505 AK--DKPAN--TQ 513 (513) Q Consensus 505 ~~--d~~~~--~~ 513 (513) ++ ++|+. .| T Consensus 561 ~p~~~~p~~~~~~ 573 (765) T protein:vir:96 561 PPSRPNPRAELRN 573 (765) T ss_pred ccccccccccchh Confidence 10 11100 11 No 109 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=95.85 E-value=0.0014 Score=36.29 Aligned_cols=410 Identities=13% Similarity=0.147 Sum_probs=171.5 Q ss_pred eeehhhhhhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhcchHHHHHHH Q lcl|NC_015263. 11 MIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQQYQRLLNF 90 (513) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~~~rlidy 90 (513) |+|+ +-+-+....++++.+=.... ....+.+++.|+.| ......++.+-+|+.....+..|-..+ T Consensus 1 ~~~~---~~~~~~~~~~~~~~~~~~~~----------~~~~~~i~~~i~~~--~~~~~~~~~~~~Yy~g~~~i~~~~~~~ 65 (474) T protein:vir:96 1 MIVI---FWPNEKPYHERVVEQIKPKY----------ETQEEMIIRLINDH--KPKIDDITVGERYYNHDPDVLRLAPKL 65 (474) T ss_pred Ceee---ccCCCchhhhhHHHHhhhcc----------CChHHHHHHHHHHH--HHHHHHHHHHHHHhccCCcchhccchh Confidence 7664 33444455555553222111 23455667777775 455677787777776654332222111 Q ss_pred ----------------------HhhcccccceE-eeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcc Q lcl|NC_015263. 91 ----------------------YANMPLYAYSV-VPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDI 147 (513) Q Consensus 91 ----------------------~~~mpt~dY~I-~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~ 147 (513) +++..+ +|.+ .|..- ........+... .+++ =+.......+.+.+++.|. T Consensus 66 ~~~~~~~~~~~~~ki~~n~~~~Ivd~~~-~~l~g~p~~~--~~~d~~~~~~l~---~~~~-n~~~~~~~~~~~~~~~~G~ 138 (474) T protein:vir:96 66 DNKGEIDPLKPDWRMFTNYHQNLVDQKV-AYAVANPVTF--SSDDDKSLKTIQ---EVLN-HKWDDKLVDILTAASNKGI 138 (474) T ss_pred cccccccccccchhcccchHHHHHHhhh-hhhcccCcee--ecCchHHHHHHH---HHHh-cCHHHHHHHHHHHHHhcCe Confidence 111111 1111 11000 001111222222 2233 2566777888899999999 Q ss_pred eeEEEEEcCcc-eeeeecCcceeEEEEEE--CCeeEEEEEeeeccC--cchhccc-cHHHHHHHHHHhhhhh-------c Q lcl|NC_015263. 148 FYGYVIDDKES-VMIQQFPNDICKISSVS--GGVYNYVIDLDALVS--ADIVDYY-PKEIQEAVNKYTTMKK-------G 214 (513) Q Consensus 148 ~~gy~i~d~~~-~~iq~lp~dyckIsg~~--nG~y~~~fD~syFd~--~~~L~~~-p~Ei~~~y~~Y~~~k~-------~ 214 (513) .|.+...|.++ +-+..++|+-|.++--. .+.+.+++ +++.. ....+.| +.++.. | .+..... . T Consensus 139 ~~~~~y~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~v--r~~~~~~~~~~~~yt~~~v~~-~-~~~~~~~~~~~~~~~ 214 (474) T protein:vir:96 139 EWLQPYIDENGEFKTFRVPAEQAIPIWTNKERDTLKAFI--RYYRLDGAERVEYWTDSDVTY-Y-EYQDGILIPDYYHGE 214 (474) T ss_pred eEEEEEecCCCceEEEEEcccceEEEEcCCCCCceEEEE--EEEeecCceEEEEEeCCeEEE-E-EecCCceeecccccc Confidence 88877765544 56778888888777442 35665554 33321 1111111 111100 0 0000000 0 Q ss_pred c--------CcccccCeeecCCceEEEEecCccccchhhHHHHHHhHHHHHHHHHHH-----hhHhhhhhceeeeeeecc Q lcl|NC_015263. 215 N--------NKSASNWYEIQDKNSICIKINESSLTPVPPFAGTFDSIYDIHSFKDLR-----NDKAELQNYKLLIQKLET 281 (513) Q Consensus 215 ~--------~~~~~~W~~L~~~kt~~ik~~~~~~~~ip~f~~v~~d~~di~~~kdL~-----~~~~~i~n~~ii~~kip~ 281 (513) . ......|=.+| ++.+- ....+. |.|.++.++.+.=+.. +..+...+ .+++ +. T Consensus 215 ~~~~~~~~~~~~~~~~g~iP-----vv~~~-nn~~g~----sd~e~v~~liDa~d~~~S~~~~~~~~~~~-~~lv--~~- 280 (474) T protein:vir:96 215 EHIQSHYYVGNKRVSWGRVP-----FIPFK-NNPQEM----SDLFMYKTIIDAMDKRLSDTQNTFDESTE-LIYI--LK- 280 (474) T ss_pred ccccccccccccccCCCcee-----EEEec-cCCCCC----CcHHHHHHHHHHHHHHHHHHHHHHHHhcc-ceee--ee- Confidence 0 00000111111 11110 011222 3333333333322211 11222222 1221 11 Q ss_pred c-cCCCCCccccCHHHHHHHHHHHHHhcc-cc--ceEEEecccccccccccccccchhhhhhHHhhhhhhhhhhhhccCC Q lcl|NC_015263. 282 R-SSNDNNDFTLDMPMMNYFHEALSMTVP-DN--VGVVTSPMEIDTVSFDKDSSTDDSVEKATKNFWDNAGVSQILFSSD 357 (513) Q Consensus 282 ~-~~n~~~~~~vd~~~~~~~~~~ik~~Lp-~g--v~~v~sP~~~d~i~ld~~~~~~dtv~~~~~~i~~~~GiS~~Lfn~d 357 (513) + ++.+.+++..++.....+ .+| +| +..++.+.+.+ .....++-..++|+.-+++-..-+.+. T Consensus 281 g~~~~~~~~~~~~~~~~~~i------~~~~~~~~~~~l~~~~~~~--------~~~~~~~~l~~~i~~~s~~p~~~~~~~ 346 (474) T protein:vir:96 281 GYEGQDLDEFMRNLKYYKAI------NVDGDGSGVDTIQIEVPVQ--------SSKEYLDMLRDYVIEFGQGVDFQQDKF 346 (474) T ss_pred cCCcccccchhhhhhcCceE------EecCCCCceeEEeecCChH--------HHHHHHHHHHHHHHHHhCCcccccccc Confidence 1 011112221111110000 011 12 11222221111 111235555678888888766666544 Q ss_pred CcchHHHH--HHHHHHHHHHHH----HHHHHHHHHHHHHhhccc---ceEEEEEecCCCCccHHHHHHHHHHHHhcCCcH Q lcl|NC_015263. 358 NKTSQGIA--MSIATDEQFIFG----VINQLERWLNRYLLLNGM---SKYFKATMLEVTHFSKKEAHDRYITDAQYGFPV 428 (513) Q Consensus 358 ~~s~~~~~--~SI~~d~~~~~~----~~~~iE~~~N~~i~~~~~---~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~ 428 (513) +++.+|++ .....-.+.+-. |-+.|++.++.++..... .....+.|-+..+.|..+.++.+.+ .|.-+ T Consensus 347 ~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~i~i~f~~~~p~~~~e~~~~~~~---ag~iS 423 (474) T protein:vir:96 347 GNSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYKLNIKVQDVEITFNFNVMVNELEQSQIGVQ---SQYLS 423 (474) T ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccceeeEEeccCCCcCHHHHHHHHHh---cCCCc Confidence 33444433 332222222222 223333333333332222 2247889999999999999998654 37444 Q ss_pred HHHHHHHhCC--CHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccCCCCcCCCCcccccccCC Q lcl|NC_015263. 429 KVYLASLMGI--DPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETTGNKDSD 500 (513) Q Consensus 429 ~~~laa~~G~--~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~~n~~~~ 500 (513) ...+...+++ +|.+.+.+..-|++. ..+.+.|.. + +..|..++ +.++++ T Consensus 424 ~et~~~~~~~v~d~~~E~~ri~~E~~e--~~~~~~~~~------~------------~~~~~~~d---~~~e~~ 474 (474) T protein:vir:96 424 KETVVTNHPWVDDPVAELERIEQDNID--FNKQLPPLE------G------------DANGRAQD---NESETN 474 (474) T ss_pred hHHHHHhCCCCCCHHHHHHHHHHHHHH--HHhcccccc------c------------ccccccCC---CcccCC Confidence 5555556776 688888888888643 112232221 1 11121211 111111 No 110 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=95.64 E-value=0.0017 Score=35.76 Aligned_cols=416 Identities=11% Similarity=0.038 Sum_probs=168.0 Q ss_pred cccccccccc---hHHHHHHHhhhccChhHHHHHHHHHHHHHhhcc--------------------hHHHHHHHHhhccc Q lcl|NC_015263. 40 FGAPVGSLTS---SQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQ--------------------QYQRLLNFYANMPL 96 (513) Q Consensus 40 ~~s~~~s~~~---s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg--------------------~~~rlidy~~~mpt 96 (513) ..++.=+..- -...+++.++.| ..-...++.+-+|+.+... --+.++|..+.-.. T Consensus 1 ~~~~~~~~~e~~~~~~~~~~l~~~~--~~~~~r~~~l~~YY~G~~~i~~~~~~~~~~~~~~~~v~n~~~~iVd~~~~~l~ 78 (486) T protein:vir:42 1 MTAPLPGMEEIEDPAVVREEMISAF--EDASKDLASNTSYYDAERRPEAIGVTVPREMQQLLAHVGYPRLYVDSVAERQA 78 (486) T ss_pred CCCCCCCCCCcccHHHHHHHHHHHH--HHHHHHHHHHHHHhcccCcchhcccccchhHhhhhhccchHHHHHHHHHhhhc Confidence 1111111111 111234444444 2233444455555444432 11222222222111 Q ss_pred ccceEeeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcceeEEEEEcC--------cc-eeeeecCcc Q lcl|NC_015263. 97 YAYSVVPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDIFYGYVIDDK--------ES-VMIQQFPND 167 (513) Q Consensus 97 ~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~~~gy~i~d~--------~~-~~iq~lp~d 167 (513) ++- |.... .....+...+ .+..-++......+.+.+++.|..|.+...+. ++ ..+..+|++ T Consensus 79 ~~g----~~~~~---~~~~~~~~~~---i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~e~~~~~~~~~~~~~i~~~~p~ 148 (486) T protein:vir:42 79 VEG----FRLGD---ADEADEELWQ---WWQANNLDIEAPLGYTDAYVHGRSFITISKPDPQLDLGWDQNVPIIRVEPPT 148 (486) T ss_pred ccc----eecCC---CchhHHHHHH---HHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCeeEEEEeccc Confidence 111 22111 1111122333 34555678888999999999999998776432 22 367889999 Q ss_pred eeEEEEE-ECCeeEEEEEeeeccCcc---hhccccH-HHHHHHHHHhhhhhccCcccccCeeec-CCce----EEEEe-c Q lcl|NC_015263. 168 ICKISSV-SGGVYNYVIDLDALVSAD---IVDYYPK-EIQEAVNKYTTMKKGNNKSASNWYEIQ-DKNS----ICIKI-N 236 (513) Q Consensus 168 yckIsg~-~nG~y~~~fD~syFd~~~---~L~~~p~-Ei~~~y~~Y~~~k~~~~~~~~~W~~L~-~~kt----~~ik~-~ 236 (513) .|..+-= ..+...+++=+.+-+... ....|.+ ++.. | ... +..|.... .+|. .++.+ | T Consensus 149 ~~~~i~d~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~----~----~~~---~~~~~~~~~~~h~~g~vPvv~~~n 217 (486) T protein:vir:42 149 RMHAEIDPRINRVSKAIRVAYDKEGNEIQAATLYTPMETIG----W----FRA---DGEWAEWFNVPHGLGVVPVVPLPN 217 (486) T ss_pred ceEEEEeCCCCCeEEEEEEEEecCCCeEEEEEEEcCCcEEE----E----Eec---CCcEEeecceecCCCCceEEEecc Confidence 8886643 345555555333322221 1222222 1111 0 000 12332221 1122 12222 2 Q ss_pred Cc---cccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccc--cCCCCCccc-cCHHHHHHHHHHHHH--hc Q lcl|NC_015263. 237 ES---SLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETR--SSNDNNDFT-LDMPMMNYFHEALSM--TV 308 (513) Q Consensus 237 ~~---~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~--~~n~~~~~~-vd~~~~~~~~~~ik~--~L 308 (513) .. .++|.+=+..-+.++.|..+.--. +.....+- ...|.+ -|-+...+. .+......+...... ++ T Consensus 218 ~~~~~~~~G~s~i~~~v~~liDa~~~~~s-~~~~~~e~-----~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~~ 291 (486) T protein:vir:42 218 RTRLSDLYGTSEITPELRSMTDAAARILM-LMQATAEL-----MGVPQRLIFGIKPEEIGVDSETGQTLFDAYLARILAF 291 (486) T ss_pred ccccCCCCCcccchhhHHHHHHHHHHHHH-HHHHHHHh-----hcchHHHhhcCCccccccccccccchhhhhhchhccc Confidence 21 224444444333333322222111 11111111 111210 011111111 111111111111111 22 Q ss_pred cccceEEEeccccccccccccccc--chhhhhhHHhhhhhhhhhhhhccCCC---cchHHHHHHHHHHHHHHHHHH---- Q lcl|NC_015263. 309 PDNVGVVTSPMEIDTVSFDKDSST--DDSVEKATKNFWDNAGVSQILFSSDN---KTSQGIAMSIATDEQFIFGVI---- 379 (513) Q Consensus 309 p~gv~~v~sP~~~d~i~ld~~~~~--~dtv~~~~~~i~~~~GiS~~Lfn~d~---~s~~~~~~SI~~d~~~~~~~~---- 379 (513) |++-+. -..|++.... -+.+..-..++....++..--|++.. .|+.+++.....-...+-+.. T Consensus 292 ~~~~~~--------~~q~~~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~~~~~f~ 363 (486) T protein:vir:42 292 EDAEGK--------IQQFSAAELANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNLMFG 363 (486) T ss_pred CCCCce--------EEeecccCHHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 332211 1123221111 13344444566666777777787665 344455555554444443333 Q ss_pred HHHHHHHHHHHhhc-ccc-----eEEEEEecCCCCccHHHHHHHHHHHHhc--CCcHHHHHHHHhCCCHHHHHHHHH--H Q lcl|NC_015263. 380 NQLERWLNRYLLLN-GMS-----KYFKATMLEVTHFSKKEAHDRYITDAQY--GFPVKVYLASLMGIDPVAFTGLLK--V 449 (513) Q Consensus 380 ~~iE~~~N~~i~~~-~~~-----~~f~~~~l~~T~fn~ke~~~~~~~~~~~--G~~~~~~laa~~G~~p~~~~~~~~--~ 449 (513) +.+++.+...+... ..+ ...++.|-+..+-|..+.++.+.++.+- |..+...+...+|+++.+.-.+-. . T Consensus 364 ~~l~~~~~l~~~~~~~~~~~~d~~~i~v~w~~~~~~s~~~~ad~~~kl~~~~~g~~s~et~~~~lg~~~d~~~e~~~~~~ 443 (486) T protein:vir:42 364 GAWEEAMRIAYRIMKGGDVPPDMLRMETVWRDPSTPTYAAKADAATKLYGNGQGVIPRERARIDMGYSVKEREEMRRWDE 443 (486) T ss_pred HHHHHHHHHHHHHhcCCCccccceeeeEEecCCCCCCHHHHHHHHHHHHhcccCCCCHHHHHhcCCCChhHHHHHHHHHH Confidence 33333333332221 111 2578899999999999999999998764 677888888899998887654443 3 Q ss_pred HHHhhCcccccCcccccccccccccccCCccccCCCCcCCCCcccccccCCCCCCCCCCcc Q lcl|NC_015263. 450 ENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETTGNKDSDETQRAKDKPA 510 (513) Q Consensus 450 E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~~n~~~~~~~~~~d~~~ 510 (513) |++... .+....+.+... ...+ ..+.++.|..+.- ...++.|. T Consensus 444 e~~~~~-------~~~~~~~~~~~~-~~~~--~~~~~~~~~~~~~--------~~~~~~~~ 486 (486) T protein:vir:42 444 EEAAMG-------LGLLGTMVDADP-TVPG--SPSPTAPPKPQPA--------IESSGGDA 486 (486) T ss_pred HHHHHH-------HHHHHHhhcCCC-CCCC--CCCCCCCCCCCcc--------cCCCCCCC Confidence 443322 111111111110 0000 0011111111100 01111111 No 111 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=95.42 E-value=0.0021 Score=35.27 Aligned_cols=414 Identities=12% Similarity=0.045 Sum_probs=166.2 Q ss_pred cccccccccc---hHHHHHHHhhhccChhHHHHHHHHHHHHHhh--------------------cchHHHHHHHHhhccc Q lcl|NC_015263. 40 FGAPVGSLTS---SQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQ--------------------SQQYQRLLNFYANMPL 96 (513) Q Consensus 40 ~~s~~~s~~~---s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~--------------------sg~~~rlidy~~~mpt 96 (513) ..++.=+..- -.....+.+..| ......++.+-+|+.+. .+.-+.++|-.+.... T Consensus 1 ~~~~i~~~~~~~~~~~~~~~L~~~~--~~~~~r~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~ 78 (485) T protein:vir:24 1 MTAPLPGQEEIADPAIARDEMVSAF--EDQNQNLRSNTSYYEAERRPEAIGVTVPVQMQSLLAHVGYPRLYVDSIAERQA 78 (485) T ss_pred CCCCCCCCCcccchHHHHHHHHHHH--HHHHHHHHHHHHHHhccCchhhcCcccchhhhhhhhccchHHHHHHHHhhhhc Confidence 0000000000 000111122222 11222233333333322 2334455554444333 Q ss_pred ccceEeeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcceeEEEEEcCcc---------eeeeecCcc Q lcl|NC_015263. 97 YAYSVVPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDIFYGYVIDDKES---------VMIQQFPND 167 (513) Q Consensus 97 ~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~~~gy~i~d~~~---------~~iq~lp~d 167 (513) ++- |.... .....+... ..+..-++....+.+.+.+++.|..|.+...+.+. ..+..++|+ T Consensus 79 ~~g----~~~~~---~~~~~~~l~---~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~~~~~~~~~~i~~~~p~ 148 (485) T protein:vir:24 79 VEG----FRLGD---ADEADEELW---QWWQANNLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNVPLIRVEPPT 148 (485) T ss_pred cCc----eecCC---CchhHHHHH---HHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCcceEEEeccc Confidence 332 22111 111112222 33455568888999999999999999988766443 257788888 Q ss_pred eeEEEEE-ECCeeEEEEEeeeccCcc---hhccccH-HHHHHHHHHhhhhhccCcccccCeeec-CCce----EEEEe-c Q lcl|NC_015263. 168 ICKISSV-SGGVYNYVIDLDALVSAD---IVDYYPK-EIQEAVNKYTTMKKGNNKSASNWYEIQ-DKNS----ICIKI-N 236 (513) Q Consensus 168 yckIsg~-~nG~y~~~fD~syFd~~~---~L~~~p~-Ei~~~y~~Y~~~k~~~~~~~~~W~~L~-~~kt----~~ik~-~ 236 (513) .|..+-- ..+...+++=..+=+... ...-|.+ ++.+ | . ..+ ..|.... .+|. .++.+ | T Consensus 149 ~~~~i~D~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~----~---~-~~~---~~~~~~~~~~h~~g~vPvv~f~n 217 (485) T protein:vir:24 149 RMYAEIDPRIGRPAKAIRVAYDAEGNEIQAATLYTPNETFG----W---F-RAE---GEWVEWFSDPHGLGAVPVVPLPN 217 (485) T ss_pred eeEEEeeCCcCceeEEEEEEEeecCCeEEEEEEEcCCcEEE----E---E-ecC---CceEeecccccCCCcccEEEecc Confidence 8866532 224444444333311111 1111211 1110 0 0 001 1232211 1122 11222 2 Q ss_pred C---ccccchhhHHHHHHhHHHHHHHHHHH-hhHhhhhhceeeeeeeccc---cCCCCCccccCHHHHHHHHHHHHH--h Q lcl|NC_015263. 237 E---SSLTPVPPFAGTFDSIYDIHSFKDLR-NDKAELQNYKLLIQKLETR---SSNDNNDFTLDMPMMNYFHEALSM--T 307 (513) Q Consensus 237 ~---~~~~~ip~f~~v~~d~~di~~~kdL~-~~~~~i~n~~ii~~kip~~---~~n~~~~~~vd~~~~~~~~~~ik~--~ 307 (513) . ..++|.+-+...+.++.|. +.... +....++- ...|.. +...++....+.+....+...... + T Consensus 218 ~~~~~~~~G~s~i~~~v~~liDa--~~~~~s~~~~~~~~-----~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~i~~ 290 (485) T protein:vir:24 218 RTRLSDLYGTSEITPELRSMTDA--AARILMLMQATAEL-----MGVPQRLIFGIKPEEIGVDPETGQTLFDAYLARILA 290 (485) T ss_pred CcccCCcCCcccchhhHHHHHHH--HHHHHHHHHHHHHh-----hcchhhhhccCCccccccccccccchhhhcccceec Confidence 1 1234444444433444332 22221 11111111 111210 001111111111111112111111 2 Q ss_pred ccccceEEEeccccccccccccc--ccchhhhhhHHhhhhhhhhhhhhccCCC---cchHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015263. 308 VPDNVGVVTSPMEIDTVSFDKDS--STDDSVEKATKNFWDNAGVSQILFSSDN---KTSQGIAMSIATDEQFIFGVINQL 382 (513) Q Consensus 308 Lp~gv~~v~sP~~~d~i~ld~~~--~~~dtv~~~~~~i~~~~GiS~~Lfn~d~---~s~~~~~~SI~~d~~~~~~~~~~i 382 (513) +|++-+.+ ..++... .--+.+.....++....+++...|++.. .|+.+++.-...-...+-+..+.+ T Consensus 291 ~~~~~~~~--------~q~~~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~~~~~f 362 (485) T protein:vir:24 291 FEDAEGKI--------QQFSAAELANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNAIF 362 (485) T ss_pred cCCCCceE--------EeecccchHHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHH Confidence 23221111 1222211 1113355555666677788888887765 345556655555544444444433 Q ss_pred HHHHHH----HHh---hcccc---eEEEEEecCCCCccHHHHHHHHHHHHhcC--CcHHHHHHHHhCCCHHHHHHHHHH- Q lcl|NC_015263. 383 ERWLNR----YLL---LNGMS---KYFKATMLEVTHFSKKEAHDRYITDAQYG--FPVKVYLASLMGIDPVAFTGLLKV- 449 (513) Q Consensus 383 E~~~N~----~i~---~~~~~---~~f~~~~l~~T~fn~ke~~~~~~~~~~~G--~~~~~~laa~~G~~p~~~~~~~~~- 449 (513) ..-+.+ .+. ..... ...++.|-+..+-|..+.++.+.++.+-| .-+...+...+|+++.+.-.+-.. T Consensus 363 ~~~l~~~~~l~~~~~~~~~~~~d~~~i~v~f~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~l~~~~d~~~e~~~~~ 442 (485) T protein:vir:24 363 GGAWEEAMRLAYRLMKGGDVPPDMLRMETVWRDPSTPTYAAKADAATKLYGNGQGVIPRERARKDMGYSIAEREEMRRWD 442 (485) T ss_pred HHHHHHHHHHHHHHhcCCCCccccceeeEEecCCCCCCHHHHHHHHHHHHhcccccCCHHHHHhhCCCCHhHHHHHHHHH Confidence 332222 222 11111 25788998888999999999999998866 445556667899999876554433 Q ss_pred -HHHhhCcccccCcccccccccccccccCCccccCCCCcCCCCcccccccCCCCCCCCCCc Q lcl|NC_015263. 450 -ENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETTGNKDSDETQRAKDKP 509 (513) Q Consensus 450 -E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~~n~~~~~~~~~~d~~ 509 (513) |+... +.+..-.+.+... . .++.+.++.+ .+...+..+.|+. T Consensus 443 ee~~~~-------~~~~~~~~~~~~~-~------~~~~~~~~e~----~~~~~~~~~~~~a 485 (485) T protein:vir:24 443 EEEAAM-------GLGLLGTMVDADP-T------VPGSPNPTPA----PKPQPAIEGGDSA 485 (485) T ss_pred HHHhhh-------hhhHHHhhcccCC-C------CCCCCCCCCC----CCCccCCCCCCCC Confidence 22221 1111111111110 0 0011111111 0111111111111 No 112 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=95.34 E-value=0.0023 Score=35.08 Aligned_cols=416 Identities=11% Similarity=0.054 Sum_probs=172.1 Q ss_pred cccccccccchH-HHHHHHhhhccChhHHHHHHHHHHHHHhhcch--------------------HHHHHHHHhhccccc Q lcl|NC_015263. 40 FGAPVGSLTSSQ-SKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQQ--------------------YQRLLNFYANMPLYA 98 (513) Q Consensus 40 ~~s~~~s~~~s~-d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~--------------------~~rlidy~~~mpt~d 98 (513) .++++=.....+ +...+.|.+++ ......++.+.+|+...... -+.++|-.+....++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~l~~~~-~~~~~rl~~l~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~ 79 (484) T protein:vir:77 1 MTSPLQKQENVDPEKAREEMLNLF-TERTQDLGDNTAYYESERRPDAVGVTVPQQMQKLLAHVGYPRLYIDAIAARQELE 79 (484) T ss_pred CCCcccccCCCCHHHHHHHHHHHH-HHHHHHHHHHHHHHhccccchhcccccchhHHhhhhhcCcHHHHHHHHHhhhccC Confidence 233333333321 12222222221 23334556666665554331 122334333322333 Q ss_pred ceEeeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcceeEEEEEcCcce---------eeeecCccee Q lcl|NC_015263. 99 YSVVPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDIFYGYVIDDKESV---------MIQQFPNDIC 169 (513) Q Consensus 99 Y~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~~~gy~i~d~~~~---------~iq~lp~dyc 169 (513) .+-.| . ..+..+..+++ ...-++......+.+.+++.|..|.+...+.++. .+..+++..| T Consensus 80 g~~~~----~---~~~~~~~l~~i---~~~N~~d~~~~~~~~~a~~~G~a~~~v~~~~~~~~~~~~~~~~~i~~~~p~~~ 149 (484) T protein:vir:77 80 GFRLG----G---ADKADEQLWDW---WQANDLDIESTLGHTDSLVHGRSYITISKPDPNIDPGVDPEVPIIRVEPPTNL 149 (484) T ss_pred ceecC----C---cchhHHHHHHH---HHhcCHhHHHHHHHHHHhhcCceEEEEecCCCCcccccccccceEEEecccee Confidence 32221 1 11222334433 4556788899999999999999999888766653 4778888888 Q ss_pred EEEEE-ECCeeEEEEEeeeccCcc---hhc-cccHHHHHHHHHHhhhhhccCcccccCeeecC-CceE----EEEe-cCc Q lcl|NC_015263. 170 KISSV-SGGVYNYVIDLDALVSAD---IVD-YYPKEIQEAVNKYTTMKKGNNKSASNWYEIQD-KNSI----CIKI-NES 238 (513) Q Consensus 170 kIsg~-~nG~y~~~fD~syFd~~~---~L~-~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~-~kt~----~ik~-~~~ 238 (513) ..+-= ..+...+++=..+=+... ... ++|.++.. |. . ....|..... +|.+ ++.+ |.. T Consensus 150 ~~~~D~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~----~~-----~--~~~~~~~~~~~~~~~g~vPvv~f~N~~ 218 (484) T protein:vir:77 150 YAQIDPRTRQVMRAIRAIEDEEGNEVIGATLYLPNNTVI----WN-----R--EDGQWVQVANVAHNLEMVPVIPIPNRT 218 (484) T ss_pred EEEecCCCCceEEEEEEEEeecCCcEEEEEEEecCeEEE----EE-----e--cCCceEeeccccCCCCCcceEEecccc Confidence 65432 224444443222211111 011 11221111 00 0 0112322211 1111 1222 222 Q ss_pred ---cccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccc--cCCCCCccc-cCHHHHHHHHHHHHH--hccc Q lcl|NC_015263. 239 ---SLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETR--SSNDNNDFT-LDMPMMNYFHEALSM--TVPD 310 (513) Q Consensus 239 ---~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~--~~n~~~~~~-vd~~~~~~~~~~ik~--~Lp~ 310 (513) .++|.+-|..-+.++.|..+.--. +.....+-. ..|.+ -|-..+++. .+......+...... ++|+ T Consensus 219 ~~~~~~G~s~i~~~v~~L~Da~~~~~s-~~~~~~~~~-----a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 292 (484) T protein:vir:77 219 RLSDLYGTTEITPELRSVTDAAARTLM-LMQATAELM-----GVPQRLLFGVKGEELGVDPETGQTLFDAYLARILAFED 292 (484) T ss_pred ccCccCCcccchHHHHHHHHHHHHHHH-HHHHHHHhh-----hhhHHHHhCCCcchhcccccccchhhhhhhhhhcccCC Confidence 224444444333333322221111 111111111 11210 011111111 111111111111111 1232 Q ss_pred cceEEEecccccccccccccc--cchhhhhhHHhhhhhhhhhhhhccCCC---cchHHHHHHHHHHHHHHHH----HHHH Q lcl|NC_015263. 311 NVGVVTSPMEIDTVSFDKDSS--TDDSVEKATKNFWDNAGVSQILFSSDN---KTSQGIAMSIATDEQFIFG----VINQ 381 (513) Q Consensus 311 gv~~v~sP~~~d~i~ld~~~~--~~dtv~~~~~~i~~~~GiS~~Lfn~d~---~s~~~~~~SI~~d~~~~~~----~~~~ 381 (513) +=+. ...|+.... --+.+.....++....++...-|++.. .|+..++.....-...+.. |-+. T Consensus 293 ~~~~--------~~q~~~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~k~~~f~~~ 364 (484) T protein:vir:77 293 HESK--------AQQFSAAELRNFVDALDALDRKAAAYTGLPPYYLSFSSENPASAEAIRSSESRLVKTVERKNKIFGGA 364 (484) T ss_pred CCce--------eEeecCCChHHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2111 112222111 113455555667777788877887654 3444556555544444433 3333 Q ss_pred HHHHHHHHHhhc-ccc-----eEEEEEecCCCCccHHHHHHHHHHHHhcC--CcHHHHHHHHhCCCHHHHHHHHH--HHH Q lcl|NC_015263. 382 LERWLNRYLLLN-GMS-----KYFKATMLEVTHFSKKEAHDRYITDAQYG--FPVKVYLASLMGIDPVAFTGLLK--VEN 451 (513) Q Consensus 382 iE~~~N~~i~~~-~~~-----~~f~~~~l~~T~fn~ke~~~~~~~~~~~G--~~~~~~laa~~G~~p~~~~~~~~--~E~ 451 (513) +++.+...+... ... ...++.|-+..+-|..+.++.+.|+.+-| ..+...+...+|+++.+.-.+-. .|. T Consensus 365 l~~~~~l~~~~~~~~~~~~~~~~i~v~w~~~~~~s~~~~ad~~~kl~~~g~gi~s~et~~~~l~~~~~~~~e~~~~~~ee 444 (484) T protein:vir:77 365 WEQAMRVAYKVMNGGDIPPEYYRMESIWRDPSTPTYAAKADAATKLYNNGQGVIPKERARIDMGYSITEREEMRKWDEEE 444 (484) T ss_pred HHHHHHHHHHHhCCCCcccccccceEEecCCCCCCHHHHHHHHHHHHhccCCCCCHHHHHhcCCCChhHHHHHHHHHHHH Confidence 333333333321 111 24788899999999999999999998865 66778888899998887655433 333 Q ss_pred HhhCcccccCcccccccccccccccCCccccCCCCcCCCCcccccccCCCCCCCCCCccCC Q lcl|NC_015263. 452 EMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETTGNKDSDETQRAKDKPANT 512 (513) Q Consensus 452 e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~~n~~~~~~~~~~d~~~~~ 512 (513) ++.. ...+-++. +.. .+..+.+..++.+.+.++ .+..+.+ T Consensus 445 ~~~~-~~~~~~~~------~~~-~~~~~~~~~~~~~~~~~~-------------~~~~~~~ 484 (484) T protein:vir:77 445 QAQG-LGLMGTMF------GTD-PSGGGNPDNPETPEPQPN-------------PAEEAAA 484 (484) T ss_pred HHHH-HHHHhhhc------ccc-ccCCCCCCCCCcccccCC-------------CccccCC Confidence 2211 11111111 100 000010111111111111 1111111 No 113 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=95.30 E-value=0.0023 Score=35.00 Aligned_cols=418 Identities=10% Similarity=0.095 Sum_probs=159.4 Q ss_pred CCccchheeeeehhhhhhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhc Q lcl|NC_015263. 2 VKNKKKRLSMIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQS 81 (513) Q Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~s 81 (513) .-|+-+|. +|.+-..-+ +-++++. .....+.+++.|..| ......++.+-+|+-... T Consensus 1 ~~~~~~~~--~~~~~~~~~-------~~~~~~~------------~~~~~~~i~~~i~~~--~~~~~~~~~~~~Yy~g~~ 57 (474) T protein:vir:95 1 MFNIIRMP--WDKPYGEEV-------VEQLKPQ------------FETQEEMIIRLIDDH--RKQLDKITVGQRYYDKDN 57 (474) T ss_pred CcceeecC--CCCchhhHH-------HHhhhhc------------cCChHHHHHHHHHHH--HHHHHHHHHHHHHhcccC Confidence 01111110 000000000 0000110 112233445555543 444555666555554433 Q ss_pred chHHH--------------------------HHHHHhhcccccceE-eeccchhhhhhcchhHHHHHHHHHHhhcChhHH Q lcl|NC_015263. 82 QQYQR--------------------------LLNFYANMPLYAYSV-VPFKDISTANENKLKKELATVTEFLSRLNPKYN 134 (513) Q Consensus 82 g~~~r--------------------------lidy~~~mpt~dY~I-~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~ 134 (513) .+..| +++..++ |.+ .|..- ...++...+ .+..+-.=++... T Consensus 58 ~i~~r~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~-----~l~g~p~~~--~~~d~~~~~----~l~~~~~n~~~~~ 126 (474) T protein:vir:95 58 DIVKQMKKVDVYGNIDYDKPDWRITTNFHQNLVDQKVS-----YVASKPVTY--SCEDESVLK----IIHDVLDTRWDNK 126 (474) T ss_pred chhccccccccccccccccccceeccchHHHHHHHHHh-----hhccCCcee--ccCchHHHH----HHHHHHhccHHHH Confidence 22222 2221111 110 01000 001111112 2222222357778 Q ss_pred HHHHHHHHHHhcceeEEEEEcCcc-eeeeecCcceeEEEEE--ECCeeEEEEEeeeccCcchhccc-cHHHHHHHHHHhh Q lcl|NC_015263. 135 FSKIVKLAMTVDIFYGYVIDDKES-VMIQQFPNDICKISSV--SGGVYNYVIDLDALVSADIVDYY-PKEIQEAVNKYTT 210 (513) Q Consensus 135 ~~~i~~~~l~~g~~~gy~i~d~~~-~~iq~lp~dyckIsg~--~nG~y~~~fD~syFd~~~~L~~~-p~Ei~~~y~~Y~~ 210 (513) +..+.+.+++.|..|.+..-|.++ +-+..++|+-|-++-- ..|.+.+++-.-..+.....+.| +.++.+ |.. T Consensus 127 ~~e~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~i~~~~~~~~~~~~~y~~~~~~~----~~~ 202 (474) T protein:vir:95 127 LIDILTATSNKGIDWLQVYINENGEMKLFRVPAEQAIPIWVDKEREELKSFIRYYKFNNEEKVEFWTDTTVTY----YVL 202 (474) T ss_pred HHHHHHHHhhcCcEEEEEEecCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEEEcCeeEEEEEeCCeEEE----EEE Confidence 888999999999988887765554 5677788887776643 23565555433222222222222 111110 110 Q ss_pred hhhc---cCcccccCe-----eecCCceEEEEecCccccchhhHHHHHHhHHHHHHHHHHH--hhHhhhhhceeeeeeec Q lcl|NC_015263. 211 MKKG---NNKSASNWY-----EIQDKNSICIKINESSLTPVPPFAGTFDSIYDIHSFKDLR--NDKAELQNYKLLIQKLE 280 (513) Q Consensus 211 ~k~~---~~~~~~~W~-----~L~~~kt~~ik~~~~~~~~ip~f~~v~~d~~di~~~kdL~--~~~~~i~n~~ii~~kip 280 (513) .... .......|. .-+...-.++.+- ..+.++|-|. ++..+.+.=+.. +....++..+--+-.++ T Consensus 203 ~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~-nn~~g~sd~e----~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~ 277 (474) T protein:vir:95 203 ENGGLIPDYYYGANHIQSHFSNGNWGRVPFIAFK-NNPEEVSDIW----MYKSLIDAIDKRLSDAQNMFDESVELIYILK 277 (474) T ss_pred cCCccccccccCcccccccccccCCCccceEeec-CCCCCCCcHH----HHHHHHHHHHHHHHHHHHHHHHhcCceeeee Confidence 0000 000000000 0011011112221 1222333333 333333222211 22222222111111111 Q ss_pred -cccCCCCCccccCHHHHHHHHHHHHHhccccce--EEEecccccccccccccccchhhhhhHHhhhhhhhhhhhhccCC Q lcl|NC_015263. 281 -TRSSNDNNDFTLDMPMMNYFHEALSMTVPDNVG--VVTSPMEIDTVSFDKDSSTDDSVEKATKNFWDNAGVSQILFSSD 357 (513) Q Consensus 281 -~~~~n~~~~~~vd~~~~~~~~~~ik~~Lp~gv~--~v~sP~~~d~i~ld~~~~~~dtv~~~~~~i~~~~GiS~~Lfn~d 357 (513) + .+.+.+++.-++.... .+ .++.+-+ .++.+.+. ......++...++|+.-+++-.+.+.+. T Consensus 278 g~-~~~~~~~~~~~~~~~~----~i--~~~~~~~~~~l~~~~~~--------~~~~~~~~~l~~~i~~~s~~p~~~~~~~ 342 (474) T protein:vir:95 278 GY-EGQDLEEFMRGLKYYK----AI--NVDGDGGVETIQVEVPV--------SSTKEYIDLMRAYIMEFGQGVDFQTDKF 342 (474) T ss_pred cC-Ccccchhhhhhhhccc----ee--eccCCCceeEEeecCCH--------HHHHHHHHHHHHHHHHHhCCcccccccc Confidence 1 0111122211111100 01 0122221 12221111 1111335566677888888766555333 Q ss_pred C--cchHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHhhccc---ceEEEEEecCCCCccHHHHHHHHHHHHhcC-Cc Q lcl|NC_015263. 358 N--KTSQGIAMSIATDEQFIFG----VINQLERWLNRYLLLNGM---SKYFKATMLEVTHFSKKEAHDRYITDAQYG-FP 427 (513) Q Consensus 358 ~--~s~~~~~~SI~~d~~~~~~----~~~~iE~~~N~~i~~~~~---~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G-~~ 427 (513) + .|+..++.....-.+.+.. |-+.|++.++.++..... ...+.+.|-+..+.+..+.++.+.++ | .| T Consensus 343 ~~n~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~g~~~d~~~i~v~f~~~~p~d~~e~a~~~~~~---g~iS 419 (474) T protein:vir:95 343 GSAPSGIALKFLYGNLDLKANKLKNKATVAIQELIGFIIDFNNLKMDVKDIEISFNFNRMMNDAEQSQIIAQS---QYLS 419 (474) T ss_pred cccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccceeeEEeccCCCcCHHHHHHHHHhc---CCCc Confidence 2 3444344333322222222 222222223222222222 23578889999999999999998764 6 66 Q ss_pred HHHHHHHHhCC--CHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccCCCCcCCCCcccccccCC Q lcl|NC_015263. 428 VKVYLASLMGI--DPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETTGNKDSD 500 (513) Q Consensus 428 ~~~~laa~~G~--~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~~n~~~~ 500 (513) ....+. .+|+ +|.+.+.++..|++.-. . ....+..++.+ +..+. +....++++ T Consensus 420 ~et~i~-~l~~v~d~~~E~~ri~~E~~~~~-----~-~~~~~~~~~~d------------~~~~~-~~~~~~~~~ 474 (474) T protein:vir:95 420 RETLVK-SSPLVDDYKAELERIEQEQMEYN-----K-QLPNLDDGGAD------------GAQQQ-ERSNDKESE 474 (474) T ss_pred hHHHHH-hCCCCCCHHHHHHHHHHHHHHHH-----h-cccccccccCC------------CCcCC-CCCccCCCC Confidence 666555 5776 68888999888874411 1 11112222221 00010 001111111 No 114 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=95.25 E-value=0.0024 Score=34.90 Aligned_cols=409 Identities=13% Similarity=0.128 Sum_probs=163.3 Q ss_pred eeehhhhhhHHHHHHHHHHHHHhhcc--Cccccccccc-----ccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhc-- Q lcl|NC_015263. 11 MIDVESISSYSNKRNNRISILRDDNR--TPVFGAPVGS-----LTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQS-- 81 (513) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~i~~~~~~--~~~~~s~~~s-----~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~s-- 81 (513) |+|+ |+ ++.++.+++. .....+.+.+.++.| ......++.+-+|+.... T Consensus 1 ~~~~--------------------~~~~~~~~~~e~~~~~~~~~~~~~~~i~~~i~~~--~~~~~~~~~~~~yY~g~~~i 58 (478) T protein:vir:10 1 MISI--------------------NWPWDKPYHEQVVEQIKPKYETQEEMILRLVREH--KENIDNITMGERYYNHHPDI 58 (478) T ss_pred Cccc--------------------cCCCCchhHHHHHHHHhhccCCcHHHHHHHHHHH--HHHHHHHHHHHHHhcCCCch Confidence 5554 22 1111111111 012334445555555 334455666666654432 Q ss_pred ------------------------chHHHHHHHHhhcccccceEeeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHH Q lcl|NC_015263. 82 ------------------------QQYQRLLNFYANMPLYAYSVVPFKDISTANENKLKKELATVTEFLSRLNPKYNFSK 137 (513) Q Consensus 82 ------------------------g~~~rlidy~~~mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~ 137 (513) +..+.+++..++...=+=+. +. ....+..+... .++. -++...+.. T Consensus 59 ~~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~--~~----~~~d~~~~~l~---~~~~-n~~~~~~~~ 128 (478) T protein:vir:10 59 LDAPPKRDVNGDYDETKPDWRMYTNYHQNLVDQKVAYAVANPVT--FG----VDNDKALKQIQ---HTLN-HKWDDKLVD 128 (478) T ss_pred hccccccccccccccccccceeccchHHHHHHHHHhhhccCCee--ee----cCChHHHHHHH---HHHh-cCHHHHHHH Confidence 22233333222211000000 00 01111222222 2233 367888899 Q ss_pred HHHHHHHhcceeEEEEEcCcc-eeeeecCcceeEEEEE--ECCeeEEEEEeeeccCcchhccc-cHHHHHHHHHHhhhh- Q lcl|NC_015263. 138 IVKLAMTVDIFYGYVIDDKES-VMIQQFPNDICKISSV--SGGVYNYVIDLDALVSADIVDYY-PKEIQEAVNKYTTMK- 212 (513) Q Consensus 138 i~~~~l~~g~~~gy~i~d~~~-~~iq~lp~dyckIsg~--~nG~y~~~fD~syFd~~~~L~~~-p~Ei~~~y~~Y~~~k- 212 (513) +.+.+++.|..|.+...|.++ +-+..++|+-|.++-- ..+.+.+++=.-..+..+....| +.++.. |.... T Consensus 129 ~~~~~~~~G~~~~~~~~d~~g~~~~~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~y~~~~i~~----~~~~~~ 204 (478) T protein:vir:10 129 ILTAASNKGIEWVQPYVDEEGEFKTFRVPAEQAVPIWTNKERDELQAFIRVYELDGAERVEYWTKDDVTY----YELKEG 204 (478) T ss_pred HHHHHHhcCeEEEEEEecCCCeeEEEEEcccceEEEEcCCCCCceEEEEEEEEecCceEEEEEeCCeEEE----EEEcCC Confidence 999999999999888776544 4566788888877643 23555555322111111222211 212111 10000 Q ss_pred -------hccCcccccCeeecCCce----EEEEecCccccchhhHHHHHHhHHHHHHHHHHH--hhHhhhhhc--eeeee Q lcl|NC_015263. 213 -------KGNNKSASNWYEIQDKNS----ICIKINESSLTPVPPFAGTFDSIYDIHSFKDLR--NDKAELQNY--KLLIQ 277 (513) Q Consensus 213 -------~~~~~~~~~W~~L~~~kt----~~ik~~~~~~~~ip~f~~v~~d~~di~~~kdL~--~~~~~i~n~--~ii~~ 277 (513) .........|......+. .++.+- ..+.+.+-|. ++.++.+.-+.. +....++-. .+++ T Consensus 205 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~-n~~~g~sd~~----~v~~liDa~~~~~S~~~~~~~~~~~p~~~- 278 (478) T protein:vir:10 205 QLIPDFYRSDDHIQPHYYQGNKLMSWGRVPFIPFK-NNPQEVSDLF----MYKTIIDALDKRLSDTQNTFDESVELIYI- 278 (478) T ss_pred eeeccccccccccccceecccccccCCccceEEec-cCCCCCCcHH----HHHHHHHHHHHHHHHHHHHHHHhhCceee- Confidence 000000000111111111 112221 1233444443 333333322211 222222211 1111 Q ss_pred eecccc-CCCCCccccCHHHHHHHHHHHHHhccccceEEEeccccccccc---ccc-cccchhhhhhHHhhhhhhhhhhh Q lcl|NC_015263. 278 KLETRS-SNDNNDFTLDMPMMNYFHEALSMTVPDNVGVVTSPMEIDTVSF---DKD-SSTDDSVEKATKNFWDNAGVSQI 352 (513) Q Consensus 278 kip~~~-~n~~~~~~vd~~~~~~~~~~ik~~Lp~gv~~v~sP~~~d~i~l---d~~-~~~~dtv~~~~~~i~~~~GiS~~ 352 (513) ..+. +.+.+++.-+++. +-...+.+-+=..+.+ +.. ......++-..+.|+.-+++-.. T Consensus 279 --~~g~~~~~~~~~~~~~~~--------------~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~ 342 (478) T protein:vir:10 279 --LKGYEGEDMKDFMHNLKY--------------YKAISVAGESGSGVDTIKVEVPIDSVKEYTKMLRDYIIEFGQGVDF 342 (478) T ss_pred --eecCCccccchhhhhhhh--------------cceEEecCCCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCcccc Confidence 1110 1111222211111 0111111100011111 111 11113355556778888887655 Q ss_pred hccCCCcchHH--HHHHHHHHHHHHHH----HHHHHHHHHHHHHhhccc---ceEEEEEecCCCCccHHHHHHHHHHHHh Q lcl|NC_015263. 353 LFSSDNKTSQG--IAMSIATDEQFIFG----VINQLERWLNRYLLLNGM---SKYFKATMLEVTHFSKKEAHDRYITDAQ 423 (513) Q Consensus 353 Lfn~d~~s~~~--~~~SI~~d~~~~~~----~~~~iE~~~N~~i~~~~~---~~~f~~~~l~~T~fn~ke~~~~~~~~~~ 423 (513) -+.+.+++.+| ++.....-.+.+-. |-+.+++.++.++..... .....+.|-+..+.|..+.++.+.++.. T Consensus 343 ~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~g~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~g 422 (478) T protein:vir:10 343 QQDKFGNSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYRLDVKVQDIEITFNFNVMVNELENSQIAMNSTG 422 (478) T ss_pred CccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccccceEEecCCCCCCHHHHHHHHHHHhC Confidence 55443333333 33332222222222 223333333333333222 2247889999999999999999999843 Q ss_pred cCCcHHHHHHHHhCC--CHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccCCCCcCCCCccccccc Q lcl|NC_015263. 424 YGFPVKVYLASLMGI--DPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETTGNKD 498 (513) Q Consensus 424 ~G~~~~~~laa~~G~--~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~~n~~ 498 (513) ..|... +...+|+ +|.+.+.++..|++... ..+.+.. ++-.++++ ..+++. +++ T Consensus 423 -~iS~et-~~~~l~~v~D~~~E~~ri~~E~~~~~--~~~~~~~-----~~~~~~~~----~~~~~~--------~~~ 478 (478) T protein:vir:10 423 -LLSKET-ILSNHAWVEDPVAEMERIEQENIELN--QQLPDIE-----EGLNGEQQ----RQSENN--------QPE 478 (478) T ss_pred -CCChHH-HHHhCCCCCCHHHHHHHHHHHHHHHH--hhccccc-----cccCCCCC----CCCCCC--------CCC Confidence 355555 4456787 57888888888875421 1111111 11110000 001111 111 No 115 >protein:vir:107662 Length: 427 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003893;genbank:gi:45686310;genbank:GeneID:2773002 Probab=95.23 E-value=0.0025 Score=34.86 Aligned_cols=401 Identities=12% Similarity=0.074 Sum_probs=158.5 Q ss_pred eehhhhhhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhcchHHHHHHHH Q lcl|NC_015263. 12 IDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQQYQRLLNFY 91 (513) Q Consensus 12 ~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~~~rlidy~ 91 (513) +-+-.--+|.||.+... ..++ |..+.++ .+ -.++ .+|..++.++++|+-. T Consensus 1 ~~~~~~d~~~~~~~~~~------~~~~------~~~~~~~------------~~-----~~l~-a~Y~~~~l~~~~Vd~~ 50 (427) T protein:vir:10 1 MKIVKHDGYNDIFNGGA------DGSP------KPFFMSD------------AS-----YHVG-SFYNDNATAKRIVDVI 50 (427) T ss_pred CCccccchHHHHhhcCC------CCcc------cCccccC------------ch-----HHHH-HHHHcCchhhhhhccc Confidence 11222234555532210 0000 0000000 01 1333 6799999999999988 Q ss_pred hhcccccceEeeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcceeEEEEE-cCcceeeeecCcceeE Q lcl|NC_015263. 92 ANMPLYAYSVVPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDIFYGYVID-DKESVMIQQFPNDICK 170 (513) Q Consensus 92 ~~mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~~~gy~i~-d~~~~~iq~lp~dyck 170 (513) ....|=...-.. ++. .+ +. .-..+++++++..+.+.++-.-..|.-+.++.. +.++ .-+|+- T Consensus 51 aed~~r~g~~i~-g~~---~~----~~---~~~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~-l~~p~~----- 113 (427) T protein:vir:10 51 PEEMVTAGFKMS-GVK---DE----KE---FKSLWDSYKLDSSLVDLLCWARLYGGAAMVAIIKDNRM-LTSQAK----- 113 (427) T ss_pred hHHhhcCCcccc-Ccc---HH----HH---HHHHHHHhhHHHHHHHHHHhccccceeEEEEEecCCCc-cccccC----- Confidence 765554433321 111 11 12 333466777666555444444444444434333 3332 112221 Q ss_pred EEEEECCeeEEEEEeeeccCcchhccccHHH-----HHHHHHHhhhhhccCcccccCeeecCCceEEEEecC-------- Q lcl|NC_015263. 171 ISSVSGGVYNYVIDLDALVSADIVDYYPKEI-----QEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKINE-------- 237 (513) Q Consensus 171 Isg~~nG~y~~~fD~syFd~~~~L~~~p~Ei-----~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~~~-------- 237 (513) ..|.+++ +..|+...+ -|.++ ..-|-+.+.+. .++.....=+.+-+.+-+.|. +. T Consensus 114 ----~~g~l~~---l~v~d~~~~---~~~~~~~dp~s~~fg~P~~y~-v~~~~~~~~~~iH~SRli~~~-g~~~p~~~~~ 181 (427) T protein:vir:10 114 ----PGAKLEG---VRVYDRFAI---TVEKRVTNARSPRYGEPEIYK-VSPGDNMQPYLIHHSRVFIAD-GERVAQQARK 181 (427) T ss_pred ----CCcceeE---EEEechhcc---cccccccCccccccCcceEEE-EecCCCCcceEEccccEEEec-CCCchhhhcc Confidence 1232221 122221110 01000 00000000000 001111111233333333321 11 Q ss_pred -ccccchhhHHHHHHhHHH-HHHHHHHHhhHhh-hhhceeeeeeec-cccCCCCCccccCHHHHHHHHHHHHHhccccce Q lcl|NC_015263. 238 -SSLTPVPPFAGTFDSIYD-IHSFKDLRNDKAE-LQNYKLLIQKLE-TRSSNDNNDFTLDMPMMNYFHEALSMTVPDNVG 313 (513) Q Consensus 238 -~~~~~ip~f~~v~~d~~d-i~~~kdL~~~~~~-i~n~~ii~~kip-~~~~n~~~~~~vd~~~~~~~~~~ik~~Lp~gv~ 313 (513) ...|+.+++. +-+++ +..+......-.. +....+-+-+++ +..--.+++-. ...+..+....+..=-.|.. T Consensus 182 ~~~~~G~S~l~---~~~~~~i~~~~~~~~~~~~l~~k~~~~v~k~~~l~~~~~~~~~~--~~~~~r~~~~~~~~~~~~~~ 256 (427) T protein:vir:10 182 QNQGWGASVLN---KSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQ--YAARLRLAQVDDNSGVGRAI 256 (427) T ss_pred cCCcccchhhh---HHHHHHHHHHHHHHHHHHHHHHHhccccccchhHHHHhcCccch--HHHHHHHHHHHHhcCcccce Confidence 1223444443 22333 3334443322222 222233333443 11100111111 11122222222211012222 Q ss_pred EEEec-ccccccccccccccchhhhhhHHhhhhhhhhh-hhhccCCCcchHHHHHHHHHHH----HHHHHHHHH-HHHHH Q lcl|NC_015263. 314 VVTSP-MEIDTVSFDKDSSTDDSVEKATKNFWDNAGVS-QILFSSDNKTSQGIAMSIATDE----QFIFGVINQ-LERWL 386 (513) Q Consensus 314 ~v~sP-~~~d~i~ld~~~~~~dtv~~~~~~i~~~~GiS-~~Lfn~d~~s~~~~~~SI~~d~----~~~~~~~~~-iE~~~ 386 (513) .+... =+++.++.+-+ .-.+.+....++|-.++||- ..|||- +.+|++.+-+.|- ..|-++++. +...+ T Consensus 257 ~l~~~~e~~e~~~~~ls-gl~~~~~~~~~~iaaa~~IP~t~L~G~---sp~Glnstgd~D~~nyyd~i~~~Qe~~l~p~l 332 (427) T protein:vir:10 257 GIDAETEEYDVLNSDIS-GVPEFLSSKMDRIVSLSGIHEIIIKNK---NVGGVSASQNTALETFYKLVDRKREEDYRPLL 332 (427) T ss_pred eeecCCCceeEEecccC-ChHHHHHHHHHHHHhhhCCCeeeeccC---CccccccchhHHHHHHHHHHHHHHHHHHHHHH Confidence 33332 23444443332 34578899999999999999 456653 3334433333333 333333322 44456 Q ss_pred HHHHhhcccceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhC-CCHHHHHHHHHHHHHhhCcccccCcccc Q lcl|NC_015263. 387 NRYLLLNGMSKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMG-IDPVAFTGLLKVENEMLDLPEIMTPLSS 465 (513) Q Consensus 387 N~~i~~~~~~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G-~~p~~~~~~~~~E~e~L~l~~~~~Pl~T 465 (513) ++.+.....+..|.++|.+.--.+.+|+++..++.++ .... ++. .| ++|+++-..+..+-+.-+ + .|-.. T Consensus 333 ~~l~~~i~~s~~~~~~f~pL~~~s~kEkaei~~~~a~---a~~~-~~~-~gvi~~~e~r~~L~~~~~~~~---~-~~~~~ 403 (427) T protein:vir:10 333 EFLLPFIVDEEEWSIEFEPLSVPSKKEESEITKNNVE---SVTK-AIT-EQIIDLEEARDTLRSIAPEFK---L-KDGNN 403 (427) T ss_pred HHHHHHhhcCCCcEEEeCCCCCCCHHHHHHHHHHHHH---HHHH-HHh-cCCCCHHHHHHHHHhhhcccc---C-CCCcc Confidence 6666554334569999999999999999998777655 2222 333 35 899998876654321111 1 11110 Q ss_pred cccccccccccCCccccCCCCcCCCCcccccccCCCC Q lcl|NC_015263. 466 SFNTSGSDIAENAIKEKGKENGRPTNETTGNKDSDET 502 (513) Q Consensus 466 S~T~Sg~~~~~~~~~~~~~~~grPt~et~~n~~~~~~ 502 (513) . . ....+++.. ..|. .+....+++ T Consensus 404 ---~--~--~e~~~~~~e---~~p~---~~e~~~d~~ 427 (427) T protein:vir:10 404 ---I--N--IREPEETTE---PEPG---LGEKLEDEN 427 (427) T ss_pred ---c--c--ccccchhcC---CCCC---CCCCCCCCC Confidence 0 0 011111100 0111 111111111 No 116 >protein:vir:4089 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510984;swissprot:trembl:q8w606;genbank:gi:17488506;uniprot:Q8W606;genbank:GeneID:1260314 Probab=95.12 E-value=0.0027 Score=34.65 Aligned_cols=373 Identities=12% Similarity=0.029 Sum_probs=155.0 Q ss_pred HHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhcchHHHHHHHHhhc-ccccceE Q lcl|NC_015263. 23 KRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQQYQRLLNFYANM-PLYAYSV 101 (513) Q Consensus 23 ~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~~~rlidy~~~m-pt~dY~I 101 (513) |--..++ ++.-.... .+.. .-++ .........+.-.|..+..+.+.|+++++. ..+...+ T Consensus 1 Mg~~~~~--~~~~~~~~---~~~~-------~~~~-------~~~~~~~~~~~~~~l~~~~v~~~v~~Ia~~ia~~p~~~ 61 (395) T protein:vir:40 1 MGFKSWV--SGFFNEEQ---RTLN-------LTDT-------VWCSIPSEKLKELSIKKWAIDSCANKIANTLSCAEVLT 61 (395) T ss_pred CchHHHH--Hhhhcccc---cccc-------cccc-------hhhccccccchhhhhhhHHHHHHHHHHHHHHhhCceee Confidence 1000000 00000000 0000 0000 001112223333455677788888888653 2233333 Q ss_pred eeccchhhhhhcchhHHHHHHHHHHhh-cC----hhHHHHHHHHHHHHhcceeEEEEEcCcceeeeecCcceeEEEEEEC Q lcl|NC_015263. 102 VPFKDISTANENKLKKELATVTEFLSR-LN----PKYNFSKIVKLAMTVDIFYGYVIDDKESVMIQQFPNDICKISSVSG 176 (513) Q Consensus 102 ~P~~~~~~~~~~~~~~~y~~v~~~L~k-~n----~k~~~~~i~~~~l~~g~~~gy~i~d~~~~~iq~lp~dyckIsg~~n 176 (513) + .+.. ... +.....|.. =| ...+...++..++..|..|-|...+.- -.|..|++.. . T Consensus 62 ~---~~~~----~~~---~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~~~~~~~-----~~~~~~~~~~---~ 123 (395) T protein:vir:40 62 Y---EKGE----EVR---KKNWYMFNVEANQNQNATEFWKKAIYKLVYDNEALIFMQDEYI-----YVADSFTKND---K 123 (395) T ss_pred c---cCCc----ccc---chHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEecCce-----eecCCccccc---c Confidence 2 1111 111 223333433 22 345556778888899998877764431 1233343321 1 Q ss_pred CeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEEecCccccchhhHHHHHHhHHHH Q lcl|NC_015263. 177 GVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKINESSLTPVPPFAGTFDSIYDI 256 (513) Q Consensus 177 G~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~~~~~~~~ip~f~~v~~d~~di 256 (513) +.+.+.+.--.+ ..|.- =..++...-+-|+.+.. .+.++..++.....++ T Consensus 124 ~~~~~~~~~v~~-----------------~~~~~-----------~~~~~~~evih~r~~~~--~~~~~~~~l~~~~~~~ 173 (395) T protein:vir:40 124 SLYENTYTEVTL-----------------KDLTL-----------KKEFKESEVLHLTLNNE--SIKSIIDGFYLLYGDL 173 (395) T ss_pred ccccceeeeeee-----------------cCcee-----------eeeeccccEEEeecCCC--CccccchhHHHHHHHH Confidence 111111110000 01100 01344444555664332 2233444443333322 Q ss_pred HHHHHHHhhHhhhhhcee-eeeeeccccCCCCCccccCHHHHHHHHHHHHHh----ccccceEEE--ecccccccccccc Q lcl|NC_015263. 257 HSFKDLRNDKAELQNYKL-LIQKLETRSSNDNNDFTLDMPMMNYFHEALSMT----VPDNVGVVT--SPMEIDTVSFDKD 329 (513) Q Consensus 257 ~~~kdL~~~~~~i~n~~i-i~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~----Lp~gv~~v~--sP~~~d~i~ld~~ 329 (513) . ....+.. ..+..+ -..++. ..+ .++.++.++..+.++++ .-++-+.++ ..+++..+.++ T Consensus 174 ~--~~~~~~~--~~~~~~~~~l~~~-----~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vl~~g~~~~~l~~~-- 240 (395) T protein:vir:40 174 L--TAAVNKY--KKLNSRKIIVKLK-----AMF--GQTPEAEEKLRLMLSERMKKFLAEGDSALPVEDGMEIDELAGD-- 240 (395) T ss_pred H--HHHHHHH--HhcCCCCceEEEe-----ccc--CCCHHHHHHHHHHHHHHHHHhhccCCceeecCCCceEEeccCC-- Confidence 1 1111111 111111 011111 111 13444444444444433 333333333 33344444432 Q ss_pred cccchhhhh------hHHhhhhhhhhhhhhccCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc--c-cceEEE Q lcl|NC_015263. 330 SSTDDSVEK------ATKNFWDNAGVSQILFSSDNKTSQGIAMSIATDEQFIFGVINQLERWLNRYLLLN--G-MSKYFK 400 (513) Q Consensus 330 ~~~~dtv~~------~~~~i~~~~GiS~~Lfn~d~~s~~~~~~SI~~d~~~~~~~~~~iE~~~N~~i~~~--~-~~~~f~ 400 (513) ..+.+.++- .-++|-.+.||...++|++..+..-+....-.+ -+.-++++||..+|+.|-.. . .+..|+ T Consensus 241 ~~d~q~~e~~~~~~~~~~~Ia~~fgVPp~~l~~~~sn~e~~~~~f~~~--~L~P~~~~ie~~l~~kLl~~~~~~~g~~i~ 318 (395) T protein:vir:40 241 SKIAESRDIKKMIDDVFEMVANSFNIPLGLAKGDTVGLSEQVNSFLMF--SINPIAEMFTDEGNRKFYGRDSVLERTYMK 318 (395) T ss_pred hhhhhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCcCHHHHHHHHHHH--HHHHHHHHHHHHHHHhcCChhhhcCCceEE Confidence 222233221 125688899999999998766555554443333 34468999999999987432 2 245677 Q ss_pred EEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCcc Q lcl|NC_015263. 401 ATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIK 480 (513) Q Consensus 401 ~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~ 480 (513) |.+-+...-+.++.++.+.++..-|.=..--.-+.+|+.|.+ .. +-|..+.|+- ...... .+ T Consensus 319 fd~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~--------~~--~gD~~~~~~n---~~~~~~-----~~ 380 (395) T protein:vir:40 319 LDTTRIKVQDIQEIASSMDVLFHIGVNTIDDNLRMIGREPVM--------SP--ETQERFVTKN---YAPLGE-----NE 380 (395) T ss_pred EechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC--------CC--CCceeeeccc---cccccc-----cc Confidence 777777777888999988888777732222222335655542 00 1122233322 111110 00 Q ss_pred ccCCCCcCCCCcccccccCCCCCCCCCC Q lcl|NC_015263. 481 EKGKENGRPTNETTGNKDSDETQRAKDK 508 (513) Q Consensus 481 ~~~~~~grPt~et~~n~~~~~~~~~~d~ 508 (513) .. ..|| +++++..++ T Consensus 381 ~~-~kgg------------e~~~~~~~~ 395 (395) T protein:vir:40 381 ED-LKGG------------DINENKGDS 395 (395) T ss_pred cc-cCCC------------CCCCCcCCC Confidence 00 0111 111111111 No 117 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=95.05 E-value=0.0029 Score=34.52 Aligned_cols=444 Identities=10% Similarity=0.019 Sum_probs=164.2 Q ss_pred CCCccchheeeeehhhhhhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhh Q lcl|NC_015263. 1 MVKNKKKRLSMIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQ 80 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~ 80 (513) |++=-. .-++.|.|..+.-- .|-..+.+.+.+.-.. .-|..+ +..|... |..+.....+...-+ .. T Consensus 1 ~~~~p~---~~l~~~~~~~~~~~-~l~~~~~~~~~r~~~~--~~YY~g------~~~i~~~-~~~~~~~~~~~~~~~-~~ 66 (479) T protein:vir:99 1 MIDLPD---EDLSSEGLAKYLET-KVFPKMNTECERLDDF--EAWTKN------GQEVPDL-ATRHKNKEREVLQQL-SR 66 (479) T ss_pred CccCCc---ccCChhHHHHHHHH-HHHHHHHHHhHHHHHH--HHHHhc------CCccccc-ccccCChhHHHHHHH-hh Confidence 332221 11222222222100 0000111111110000 000000 0001111 112222222211111 12 Q ss_pred cchHHHHHHHHhhcccccceEeeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcceeEEEE-----Ec Q lcl|NC_015263. 81 SQQYQRLLNFYANMPLYAYSVVPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDIFYGYVI-----DD 155 (513) Q Consensus 81 sg~~~rlidy~~~mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~~~gy~i-----~d 155 (513) .+.-+.++|.++.-..++-+..| +.. .....+ ..++.-++......+.+.+++.|..|.+.. +| T Consensus 67 ~n~~~~iVd~~~~~l~~~gf~~~----d~~----~~~~~~---~i~~~N~~d~~~~~~~~~a~~~G~af~~v~~~~~~~d 135 (479) T protein:vir:99 67 KPWMGLMVNSFAQQLIVDGYRKT----GTN----ENAKGW---DTWRLNQMDKQQFWLNRAVLTFGYAFIKVTSGISPLD 135 (479) T ss_pred cCcHHHHHHHHHhhcccccccCC----Cch----hhHHHH---HHHHhcChhHHHHHHHHHHhhcCceEEEEecCCCCcC Confidence 35566777776665444443321 111 112223 334555678888999999999999888776 23 Q ss_pred Ccc-eeeeecCcceeEEEEEEC-CeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeec-CCce-- Q lcl|NC_015263. 156 KES-VMIQQFPNDICKISSVSG-GVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQ-DKNS-- 230 (513) Q Consensus 156 ~~~-~~iq~lp~dyckIsg~~n-G~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~-~~kt-- 230 (513) .++ ..+..++|..|..+--.. ......+...+ +.......|... .+..+.. ....|...+ .+|. T Consensus 136 ~~g~~~i~~~~p~~~~~iydd~~~~~~~~~~~~~-~~~~~~~~~~~~------~~~~~~~----~~~~~~~~~~~~h~~g 204 (479) T protein:vir:99 136 GTTVARIKCIDPRDAFAIWEDPYWDEWPKYLLER-QPNGQYWWWTEE------DYSIFEF----KQGKFIYRETVSHDYG 204 (479) T ss_pred CCCceEEEEechhheEEEecCCcccceeeEEEee-cCceeEEEEecc------eEEEEEe----cCCceeeccccccCCC Confidence 444 356777899888764211 11122222222 111111122211 0100000 011232211 1111 Q ss_pred ----EEEEecCc-cccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeec----cccCCCCCccccCHHHHHHHH Q lcl|NC_015263. 231 ----ICIKINES-SLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLE----TRSSNDNNDFTLDMPMMNYFH 301 (513) Q Consensus 231 ----~~ik~~~~-~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip----~~~~n~~~~~~vd~~~~~~~~ 301 (513) +.|.-+.. ..++.+-|.. +.++.|..+.--. +.....+ ....| ++....++.-. +... +. T Consensus 205 ~vPvv~f~n~~~~~~~g~sd~e~-v~~liDa~~~~~s-~~~~~~~-----~~a~p~~~i~G~~~~~~~~~-~~~~---~~ 273 (479) T protein:vir:99 205 HIPFVRYVNVMDLRGVCYGDVEP-LVTVAKAIDKTGL-DILLVQH-----HQSFQIRWATGLMLPEGANA-DQEK---MR 273 (479) T ss_pred CcceEEeecCCCcCcCCcchhHH-HHHHHHHHHHHHH-HHHHHHH-----HhhchhhhhcCCCccccccc-chhc---cc Confidence 11222211 1245444443 2233332221111 1111111 11112 11001111000 0000 00 Q ss_pred HHHHHhccccceEEEeccccccccccccc--ccchhhhhhHHhhhhhhhhhhhhccC-CCcchHHHHHHHHHHHHHHHHH Q lcl|NC_015263. 302 EALSMTVPDNVGVVTSPMEIDTVSFDKDS--STDDSVEKATKNFWDNAGVSQILFSS-DNKTSQGIAMSIATDEQFIFGV 378 (513) Q Consensus 302 ~~ik~~Lp~gv~~v~sP~~~d~i~ld~~~--~~~dtv~~~~~~i~~~~GiS~~Lfn~-d~~s~~~~~~SI~~d~~~~~~~ 378 (513) ... +++- +.-+-+.+-..++... .-.+.+.....+|....|+....||. .+.|+.+++.....-.+.+-.. T Consensus 274 ~~~-----~~i~-~~~~~~~~~~q~~~~~~~~~~~~l~~~i~~i~~~t~~p~~~~g~~~n~Sg~Al~~~~~~l~~ka~~~ 347 (479) T protein:vir:99 274 FAQ-----ESML-ISQNEKASFGAIPAAPLDGLLNAYKESLLEFLALAQLPPHIAGQIVNVAADALAAGTRQTMQKLFEK 347 (479) T ss_pred ccc-----ccce-eecCCCceEEEecccchHHHHHHHHHHHHHHhccCCCCHHHcccccchHHHHHHHHHHHHHHHHHHH Confidence 000 1110 1111122222333211 11134555556677777888788863 3345555666655444444433 Q ss_pred HHH----HHHHHHHHHhhccc-----ceEEEEEecCCCCccHHHHHHHHHHHHhcC-CcHHHHHHHHhCCCHHHHHHHHH Q lcl|NC_015263. 379 INQ----LERWLNRYLLLNGM-----SKYFKATMLEVTHFSKKEAHDRYITDAQYG-FPVKVYLASLMGIDPVAFTGLLK 448 (513) Q Consensus 379 ~~~----iE~~~N~~i~~~~~-----~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G-~~~~~~laa~~G~~p~~~~~~~~ 448 (513) .+. |++.+-..+.-... ...+++.|-+..+-|..+.++.+.|+.+-| .|....+.-+.|+++.++-.+.. T Consensus 348 ~~~f~~al~~~~~l~~~~~~~~~~~~~~~i~~~w~~~~~~s~~~~ad~~~kl~~ag~is~et~l~~l~gv~~~~~e~~~~ 427 (479) T protein:vir:99 348 QATWKASHNQTMRLVNKIEGRTEEATDLDFTITWQDVTIQSLAQFADAWAKMVESLKIPAEGVWDMIPNLDQSTVNGWKE 427 (479) T ss_pred HHHHHHHHHHHHHHHHHHcCCCccccceeeeEEecCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHHHHHHHH Confidence 333 33322222221111 124777787777789999999999998776 66655555444999987654433 Q ss_pred HHH--HhhC-cccccCcccccccccccccccCCccccCCCCcCCCCcccccccCCCCCCCCCCccCCC Q lcl|NC_015263. 449 VEN--EMLD-LPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETTGNKDSDETQRAKDKPANTQ 513 (513) Q Consensus 449 ~E~--e~L~-l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~~n~~~~~~~~~~d~~~~~~ 513 (513) .+. +.++ +-+.+.. |.+. ++ +.|.|...+.........+.|+.=++.+- T Consensus 428 ~~~~~~~~~~~~~~~~~--------~~~~------~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 479 (479) T protein:vir:99 428 IYDREGDFGKYMRKLQN--------GPDP------AE--QRGGPNGATNMQQANNKTGEPASLNKSGA 479 (479) T ss_pred HHHHHHHHHHHHHHHhc--------ccCc------cc--ccCCCCCCCCCCCCCCCCcchhccCCCCC Confidence 321 1111 1111111 1000 00 00111111111112222222333333333 No 118 >protein:vir:78310 Length: 376 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468642;genbank:gi:157325220;genbank:GeneID:5601655 Probab=94.88 E-value=0.0033 Score=34.21 Aligned_cols=355 Identities=11% Similarity=0.109 Sum_probs=158.7 Q ss_pred HHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhcchHHHHHHHHh-hcccccceEeeccc Q lcl|NC_015263. 28 ISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQQYQRLLNFYA-NMPLYAYSVVPFKD 106 (513) Q Consensus 28 ~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~~~rlidy~~-~mpt~dY~I~P~~~ 106 (513) -+++.. .|+.... ...+-.+ .....+..-.|.+++.+.+.|+.++ ++..+...++ . T Consensus 1 Mg~f~~----------l~~~~~~----~~~~~~~------~~~~~~~~~~~l~~~~v~~~i~~Ia~~ia~~p~~~~---~ 57 (376) T protein:vir:78 1 MGFFSE----------LFKRNKE----IEWMWDL------DFLEDKTTKVYLKKMALNTCVKHIARTIAKSDFRLK---N 57 (376) T ss_pred Cchhhh----------hhccCCc----cccccch------hhccccchhhhhhhHHHHHHHHHHHHhhcccceeec---c Confidence 222222 2221110 0001110 1112222233446677889999888 5666666654 1 Q ss_pred hhhhhhcchhHHHHHHHHHHh-h----cChhHHHHHHHHHHHHhcceeEEEEEcCcceeeeecCcceeEEEEEECCeeEE Q lcl|NC_015263. 107 ISTANENKLKKELATVTEFLS-R----LNPKYNFSKIVKLAMTVDIFYGYVIDDKESVMIQQFPNDICKISSVSGGVYNY 181 (513) Q Consensus 107 ~~~~~~~~~~~~y~~v~~~L~-k----~n~k~~~~~i~~~~l~~g~~~gy~i~d~~~~~iq~lp~dyckIsg~~nG~y~~ 181 (513) +... .. +.+...|. . +....+...++..++..|..|.|...+..+.....+|..--.+.... .+.+ T Consensus 58 ~~~~----~~---~~l~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~~~r~~~~~~~~~~~~~~~~~~~~~--~~~~ 128 (376) T protein:vir:78 58 GETS----VR---DKLYYKLNIRPNTDMSSSSFWEKVIYKLIYDNECLIVLSDTDDFLIADSYVRKEFAFFPDV--FEGV 128 (376) T ss_pred cccc----cc---chHHHHHhhccccCCCHHHHHHHHHHHHhHcCcEEEEEEeCCCeeeccceeecccceeeee--eeee Confidence 1111 11 22333343 2 22456667788888999999999988877654443443222111100 0000 Q ss_pred EEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEEecCccccchhhHHHHHHhHHHHHHHHH Q lcl|NC_015263. 182 VIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKINESSLTPVPPFAGTFDSIYDIHSFKD 261 (513) Q Consensus 182 ~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~~~~~~~~ip~f~~v~~d~~di~~~kd 261 (513) .++ .|.- . ..++...-+-|+.+..+.. ++..++.... .. T Consensus 129 ~~~----------------------~~~~---------~--~~~~~~evih~~~~~~~~~--~~~~~~~~~~------~~ 167 (376) T protein:vir:78 129 TVK----------------------DYRY---------N--RNFSMDDVIFLEYGNERLS--AFTDGMFEDY------GE 167 (376) T ss_pred eee----------------------ccee---------e--eeeccccEEEeccCCCCch--hhhhHHHHHH------HH Confidence 000 1100 0 1244444555665443322 2222222111 11 Q ss_pred HHh--hHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhcc----ccceEEEe--ccccccccccccc-cc Q lcl|NC_015263. 262 LRN--DKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTVP----DNVGVVTS--PMEIDTVSFDKDS-ST 332 (513) Q Consensus 262 L~~--~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~Lp----~gv~~v~s--P~~~d~i~ld~~~-~~ 332 (513) +.. ...-..+. -+...+-+ ...-.++.++++.+.+.++++.. ++.+.++. .+++..+.+.... .. T Consensus 168 ~~~~~~~~~~~~~-~~~~~~~~-----~~~~~~~~e~~~~~~~~~~~~~~g~~~~~~~v~~l~~g~~~~~l~~~~~~~~~ 241 (376) T protein:vir:78 168 LFGKMIRAQMRNF-QIRGAVNF-----KMAGVADKDKQTKLQEYIDKVYASFNNNEIAIVPQLEGFNYEEFGTTSVNNSQ 241 (376) T ss_pred HHHHHHHHHHhcC-CCceeEEE-----ccCCCCCHHHHHHHHHHHHHHhccccccCcceEEcCCCceEEeeccCccccch Confidence 111 11111121 11111111 11224566777777777766652 33333333 3455555443221 11 Q ss_pred --ch---hhhhhHHhhhhhhhhhhhhccCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccceEEEEEecCCC Q lcl|NC_015263. 333 --DD---SVEKATKNFWDNAGVSQILFSSDNKTSQGIAMSIATDEQFIFGVINQLERWLNRYLLLNGMSKYFKATMLEVT 407 (513) Q Consensus 333 --~d---tv~~~~~~i~~~~GiS~~Lfn~d~~s~~~~~~SI~~d~~~~~~~~~~iE~~~N~~i~~~~~~~~f~~~~l~~T 407 (513) .| +.+-..+.|..+.||...++|++..+.+......-. .-+.-++++||..+|+.+-.. .+..+++.+-... T Consensus 242 ~~~q~~e~~~~~~~~Ia~~fgVPp~~l~~~~s~~e~~~~~f~~--~~l~P~~~~ie~~l~~kll~~-~~~~~~~~~~~ll 318 (376) T protein:vir:78 242 SFDEVKKLRKEMIDYVASILGIPSSLLHGDMADLSNNMKAYME--YCIDPLTKKLEDELNAKLFTF-SEFLAGEHIKIIH 318 (376) T ss_pred hHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCHHHHHHHHHH--HHHHHHHHHHHHHHHhhhCCc-ccceecccchhhc Confidence 12 233445779999999999999876555544444333 334468999999999988432 2233444433344 Q ss_pred CccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHH-HhhCcccccCcccccccccccccccCCccccCCCC Q lcl|NC_015263. 408 HFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVEN-EMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKEN 486 (513) Q Consensus 408 ~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~-e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~ 486 (513) .-+.++..+.+.++..-|. ++|.|+-..+-++- +--.-|..+.|+ +..... ++.++ T Consensus 319 ~~d~~~~~~~~~~~~~~G~-----------~t~NE~R~~lg~~p~~~g~~d~~~~~~---n~~~~~---------~~~e~ 375 (376) T protein:vir:78 319 KKDIIENAEAVDKLVASGS-----------FNRNEVRELLGAERVDNPELDKYLITK---NYQSAD---------EGGED 375 (376) T ss_pred ccCHHHHHHHHHHHHhCCC-----------cCHHHHHHHhCCCCCCCCCCceeeecc---Cceehh---------ccccC Confidence 4577888888887777662 34444332221110 000112233232 111111 11222 Q ss_pred c Q lcl|NC_015263. 487 G 487 (513) Q Consensus 487 g 487 (513) | T Consensus 376 g 376 (376) T protein:vir:78 376 G 376 (376) T ss_pred C Confidence 2 No 119 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=94.88 E-value=0.0033 Score=34.20 Aligned_cols=426 Identities=10% Similarity=0.084 Sum_probs=162.9 Q ss_pred eeehhhhhhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhcchHHHHHHH Q lcl|NC_015263. 11 MIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQQYQRLLNF 90 (513) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~~~rlidy 90 (513) ||.+=++-..-=++.==..++.+. .....+.+++.+..| ....+.++.+-+|+.....+..|...+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~------------~~~~~~~i~~~i~~~--~~~~~~~~~~~~YY~g~~~i~~~~~~~ 66 (474) T protein:vir:97 1 MFNIIRMPWDKPYGEEVVEQLKPQ------------FETQEEMIVRLIDDH--RKQLDKITVGQRYYDKDNDIVKQMKKV 66 (474) T ss_pred CcccccccCCCchhhHHHHhhhhc------------ccCHHHHHHHHHHHH--HHHHHHHHHHHHHhccccchhcccchh Confidence 332222211111100000011111 111224455555553 334455666666655443332221111 Q ss_pred Hhhccccc-----ceEeec-------------cch-hhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcceeEE Q lcl|NC_015263. 91 YANMPLYA-----YSVVPF-------------KDI-STANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDIFYGY 151 (513) Q Consensus 91 ~~~mpt~d-----Y~I~P~-------------~~~-~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~~~gy 151 (513) ....+.-. ....|| +.. ....++ ++..+.+..+..-++...+..+.+.+++.|..|.+ T Consensus 67 ~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~~l~g~p~~~~~~d---~~~~~~l~~~~~n~~~~~~~e~~~~~~~~G~~~~~ 143 (474) T protein:vir:97 67 DVHGNIDYDKPDWRITTNFHQNLVDQKVSYVASKPVTYSCED---ENVLKVIHDVLDTRWDNKLIDILTATSNKGIDWLQ 143 (474) T ss_pred ccccccccccCcceeecchHHHHHHHHHhhhhcCCceeccCc---HHHHHHHHHHHhccHHHHHHHHHHHHhhcCceEEE Confidence 00000000 001111 000 000011 22222222233347888889999999999999988 Q ss_pred EEEcCcc-eeeeecCcceeEEEEE--ECCeeEEEEEeeeccCcchhccccH-HHHHHHHHHhhhhh---ccCcccccCe- Q lcl|NC_015263. 152 VIDDKES-VMIQQFPNDICKISSV--SGGVYNYVIDLDALVSADIVDYYPK-EIQEAVNKYTTMKK---GNNKSASNWY- 223 (513) Q Consensus 152 ~i~d~~~-~~iq~lp~dyckIsg~--~nG~y~~~fD~syFd~~~~L~~~p~-Ei~~~y~~Y~~~k~---~~~~~~~~W~- 223 (513) ..-|.++ +-+.-++|+-|.++-- ..+.+.+++-....+.....+.|.+ ++.+ |..... ........|. T Consensus 144 ~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~yt~~~~~~----y~~~~~~~~~~~~~~~~~~~ 219 (474) T protein:vir:97 144 VYINENGEMKLFRVPAEQAIPIWVDKEREELKSFIRYYKFNNEEKVEFWTDTTVTY----YVLENGGLIPDYYYGANHVQ 219 (474) T ss_pred EEecCCCeeEEEEEcccceEEEEcCCCCCceEEEEEEEEecCeEEEEEEeCCeEEE----EEEcCCccccccccCcCccc Confidence 8776665 5677788887777643 1356666543322222222222211 1100 100000 0000000010 Q ss_pred ----eecCCceEEEEecCccccchhhHHHHHHhHHHHHHHHHH-H-hhHhhhhhceeeeeeec-cccCCCCCccccCHHH Q lcl|NC_015263. 224 ----EIQDKNSICIKINESSLTPVPPFAGTFDSIYDIHSFKDL-R-NDKAELQNYKLLIQKLE-TRSSNDNNDFTLDMPM 296 (513) Q Consensus 224 ----~L~~~kt~~ik~~~~~~~~ip~f~~v~~d~~di~~~kdL-~-~~~~~i~n~~ii~~kip-~~~~n~~~~~~vd~~~ 296 (513) .-+..+-.++.+- ..+.+.+-|.. +.++.+.=+. . +....++...--.-.+. + .+...+++.-++.. T Consensus 220 ~~~~~~~~g~vPvv~~~-nn~~g~sd~e~----v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~-~~~~~~~~~~~~~~ 293 (474) T protein:vir:97 220 SHFSNGNWGRVPFIAFK-NNPEEVSDIWM----YKSIIDAIDKRLSDAQNMFDESVELIYILKGY-EGEDLEEFMRGLKY 293 (474) T ss_pred ccccccCCCccceEEec-CCcCCCCcHHH----HHHHHHHHHHHHHHHHHHHHHhcCceeeeecC-Ccccchhhhhhhhc Confidence 0011111123221 12234444433 3333322221 1 11112221111111111 0 01111222111110 Q ss_pred HHHHHHHHHHhccccce--EEEecccccccccccccccchhhhhhHHhhhhhhhhhhhhccCCCcchHH--HHHHHHHHH Q lcl|NC_015263. 297 MNYFHEALSMTVPDNVG--VVTSPMEIDTVSFDKDSSTDDSVEKATKNFWDNAGVSQILFSSDNKTSQG--IAMSIATDE 372 (513) Q Consensus 297 ~~~~~~~ik~~Lp~gv~--~v~sP~~~d~i~ld~~~~~~dtv~~~~~~i~~~~GiS~~Lfn~d~~s~~~--~~~SI~~d~ 372 (513) . ..+ .++.|-+ .++.+.+. ......++-..++|+.-+++..+-+.+.+++.+| ++.....-. T Consensus 294 ~----~~i--~~~~~~~~~~l~~~~~~--------~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~ 359 (474) T protein:vir:97 294 Y----KAI--NVDGDGGVETIQVEVPV--------SSTKEYIDLMRVYIMEFGQGVDFQTDKFGSAPSGIALKFLYGNLD 359 (474) T ss_pred c----cee--eccCCCceeEEeecCCH--------HHHHHHHHHHHHHHHHHhCccccCccccccccHHHHHHHHHHHHH Confidence 0 000 0122211 12211110 1111224444567888888766555333333333 332222222 Q ss_pred HHHH----HHHHHHHHHHHHHHhhccc---ceEEEEEecCCCCccHHHHHHHHHHHHhcC-CcHHHHHHHHhCC--CHHH Q lcl|NC_015263. 373 QFIF----GVINQLERWLNRYLLLNGM---SKYFKATMLEVTHFSKKEAHDRYITDAQYG-FPVKVYLASLMGI--DPVA 442 (513) Q Consensus 373 ~~~~----~~~~~iE~~~N~~i~~~~~---~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G-~~~~~~laa~~G~--~p~~ 442 (513) +.+- .|-+.|++.++.++..... ...+.+.|-+..+.|..+.++.+.++ | .|....+ ..+|+ +|.+ T Consensus 360 ~k~~~k~~~~~~~l~~~~~li~~~~~~~~d~~~i~v~f~~~~p~~~~e~a~~~~~~---g~iS~et~l-~~l~~v~D~~~ 435 (474) T protein:vir:97 360 LKANKLKNKATVAIQELISFIIDFNNLKTDVKDIEISFNFNRMMNDAEQSQIIAQS---QYLSRETLV-KSSPLVDDYKA 435 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhCCCcccceeeEEeccCcccCHHHHHHHHHHc---CCCCHHHHH-HhCCCCCCHHH Confidence 2222 1333333334433332222 22478899999999999999998775 5 6665555 45787 7889 Q ss_pred HHHHHHHHHHhhCcccccCcccccccccccccccCCccccCCCCcCCCCcccccccCC Q lcl|NC_015263. 443 FTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETTGNKDSD 500 (513) Q Consensus 443 ~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~~n~~~~ 500 (513) .+.+...|++.-. ....+ .+-.+. + +.+..+...++++. T Consensus 436 E~eri~~E~~~~~--~~~~~----~~~~~~------------~-~~~~~~~~~~~~~e 474 (474) T protein:vir:97 436 ELERIEQEQMEYN--KQLPN----LDDGGA------------D-GAQQQEGSNNKESE 474 (474) T ss_pred HHHHHHHHHHHHH--hhccc----cCCCCC------------C-CcccCCCCcccccC Confidence 9999999885411 11111 111111 1 11111111111111 No 120 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=94.88 E-value=0.0033 Score=34.20 Aligned_cols=426 Identities=10% Similarity=0.084 Sum_probs=162.9 Q ss_pred eeehhhhhhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhcchHHHHHHH Q lcl|NC_015263. 11 MIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQQYQRLLNF 90 (513) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~~~rlidy 90 (513) ||.+=++-..-=++.==..++.+. .....+.+++.+..| ....+.++.+-+|+.....+..|...+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~------------~~~~~~~i~~~i~~~--~~~~~~~~~~~~YY~g~~~i~~~~~~~ 66 (474) T protein:vir:94 1 MFNIIRMPWDKPYGEEVVEQLKPQ------------FETQEEMIVRLIDDH--RKQLDKITVGQRYYDKDNDIVKQMKKV 66 (474) T ss_pred CcccccccCCCchhhHHHHhhhhc------------ccCHHHHHHHHHHHH--HHHHHHHHHHHHHhccccchhcccchh Confidence 332222211111100000011111 111224455555553 334455666666655443332221111 Q ss_pred Hhhccccc-----ceEeec-------------cch-hhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcceeEE Q lcl|NC_015263. 91 YANMPLYA-----YSVVPF-------------KDI-STANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDIFYGY 151 (513) Q Consensus 91 ~~~mpt~d-----Y~I~P~-------------~~~-~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~~~gy 151 (513) ....+.-. ....|| +.. ....++ ++..+.+..+..-++...+..+.+.+++.|..|.+ T Consensus 67 ~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~~l~g~p~~~~~~d---~~~~~~l~~~~~n~~~~~~~e~~~~~~~~G~~~~~ 143 (474) T protein:vir:94 67 DVHGNIDYDKPDWRITTNFHQNLVDQKVSYVASKPVTYSCED---ENVLKVIHDVLDTRWDNKLIDILTATSNKGIDWLQ 143 (474) T ss_pred ccccccccccCcceeecchHHHHHHHHHhhhhcCCceeccCc---HHHHHHHHHHHhccHHHHHHHHHHHHhhcCceEEE Confidence 00000000 001111 000 000011 22222222233347888889999999999999988 Q ss_pred EEEcCcc-eeeeecCcceeEEEEE--ECCeeEEEEEeeeccCcchhccccH-HHHHHHHHHhhhhh---ccCcccccCe- Q lcl|NC_015263. 152 VIDDKES-VMIQQFPNDICKISSV--SGGVYNYVIDLDALVSADIVDYYPK-EIQEAVNKYTTMKK---GNNKSASNWY- 223 (513) Q Consensus 152 ~i~d~~~-~~iq~lp~dyckIsg~--~nG~y~~~fD~syFd~~~~L~~~p~-Ei~~~y~~Y~~~k~---~~~~~~~~W~- 223 (513) ..-|.++ +-+.-++|+-|.++-- ..+.+.+++-....+.....+.|.+ ++.+ |..... ........|. T Consensus 144 ~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~yt~~~~~~----y~~~~~~~~~~~~~~~~~~~ 219 (474) T protein:vir:94 144 VYINENGEMKLFRVPAEQAIPIWVDKEREELKSFIRYYKFNNEEKVEFWTDTTVTY----YVLENGGLIPDYYYGANHVQ 219 (474) T ss_pred EEecCCCeeEEEEEcccceEEEEcCCCCCceEEEEEEEEecCeEEEEEEeCCeEEE----EEEcCCccccccccCcCccc Confidence 8776665 5677788887777643 1356666543322222222222211 1100 100000 0000000010 Q ss_pred ----eecCCceEEEEecCccccchhhHHHHHHhHHHHHHHHHH-H-hhHhhhhhceeeeeeec-cccCCCCCccccCHHH Q lcl|NC_015263. 224 ----EIQDKNSICIKINESSLTPVPPFAGTFDSIYDIHSFKDL-R-NDKAELQNYKLLIQKLE-TRSSNDNNDFTLDMPM 296 (513) Q Consensus 224 ----~L~~~kt~~ik~~~~~~~~ip~f~~v~~d~~di~~~kdL-~-~~~~~i~n~~ii~~kip-~~~~n~~~~~~vd~~~ 296 (513) .-+..+-.++.+- ..+.+.+-|.. +.++.+.=+. . +....++...--.-.+. + .+...+++.-++.. T Consensus 220 ~~~~~~~~g~vPvv~~~-nn~~g~sd~e~----v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~-~~~~~~~~~~~~~~ 293 (474) T protein:vir:94 220 SHFSNGNWGRVPFIAFK-NNPEEVSDIWM----YKSIIDAIDKRLSDAQNMFDESVELIYILKGY-EGEDLEEFMRGLKY 293 (474) T ss_pred ccccccCCCccceEEec-CCcCCCCcHHH----HHHHHHHHHHHHHHHHHHHHHhcCceeeeecC-Ccccchhhhhhhhc Confidence 0011111123221 12234444433 3333322221 1 11112221111111111 0 01111222111110 Q ss_pred HHHHHHHHHHhccccce--EEEecccccccccccccccchhhhhhHHhhhhhhhhhhhhccCCCcchHH--HHHHHHHHH Q lcl|NC_015263. 297 MNYFHEALSMTVPDNVG--VVTSPMEIDTVSFDKDSSTDDSVEKATKNFWDNAGVSQILFSSDNKTSQG--IAMSIATDE 372 (513) Q Consensus 297 ~~~~~~~ik~~Lp~gv~--~v~sP~~~d~i~ld~~~~~~dtv~~~~~~i~~~~GiS~~Lfn~d~~s~~~--~~~SI~~d~ 372 (513) . ..+ .++.|-+ .++.+.+. ......++-..++|+.-+++..+-+.+.+++.+| ++.....-. T Consensus 294 ~----~~i--~~~~~~~~~~l~~~~~~--------~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~ 359 (474) T protein:vir:94 294 Y----KAI--NVDGDGGVETIQVEVPV--------SSTKEYIDLMRVYIMEFGQGVDFQTDKFGSAPSGIALKFLYGNLD 359 (474) T ss_pred c----cee--eccCCCceeEEeecCCH--------HHHHHHHHHHHHHHHHHhCccccCccccccccHHHHHHHHHHHHH Confidence 0 000 0122211 12211110 1111224444567888888766555333333333 332222222 Q ss_pred HHHH----HHHHHHHHHHHHHHhhccc---ceEEEEEecCCCCccHHHHHHHHHHHHhcC-CcHHHHHHHHhCC--CHHH Q lcl|NC_015263. 373 QFIF----GVINQLERWLNRYLLLNGM---SKYFKATMLEVTHFSKKEAHDRYITDAQYG-FPVKVYLASLMGI--DPVA 442 (513) Q Consensus 373 ~~~~----~~~~~iE~~~N~~i~~~~~---~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G-~~~~~~laa~~G~--~p~~ 442 (513) +.+- .|-+.|++.++.++..... ...+.+.|-+..+.|..+.++.+.++ | .|....+ ..+|+ +|.+ T Consensus 360 ~k~~~k~~~~~~~l~~~~~li~~~~~~~~d~~~i~v~f~~~~p~~~~e~a~~~~~~---g~iS~et~l-~~l~~v~D~~~ 435 (474) T protein:vir:94 360 LKANKLKNKATVAIQELISFIIDFNNLKTDVKDIEISFNFNRMMNDAEQSQIIAQS---QYLSRETLV-KSSPLVDDYKA 435 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhCCCcccceeeEEeccCcccCHHHHHHHHHHc---CCCCHHHHH-HhCCCCCCHHH Confidence 2222 1333333334433332222 22478899999999999999998775 5 6665555 45787 7889 Q ss_pred HHHHHHHHHHhhCcccccCcccccccccccccccCCccccCCCCcCCCCcccccccCC Q lcl|NC_015263. 443 FTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETTGNKDSD 500 (513) Q Consensus 443 ~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~~n~~~~ 500 (513) .+.+...|++.-. ....+ .+-.+. + +.+..+...++++. T Consensus 436 E~eri~~E~~~~~--~~~~~----~~~~~~------------~-~~~~~~~~~~~~~e 474 (474) T protein:vir:94 436 ELERIEQEQMEYN--KQLPN----LDDGGA------------D-GAQQQEGSNNKESE 474 (474) T ss_pred HHHHHHHHHHHHH--hhccc----cCCCCC------------C-CcccCCCCcccccC Confidence 9999999885411 11111 111111 1 11111111111111 No 121 >protein:vir:6210 Length: 394 # NCBI annotation: Portal protein # Family: family:all:10882 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852590;genbank:gi:31415850;genbank:GeneID:1489208 Probab=94.85 E-value=0.0034 Score=34.15 Aligned_cols=377 Identities=11% Similarity=0.042 Sum_probs=154.8 Q ss_pred hhhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhcchHHHHHHHHhh-cc Q lcl|NC_015263. 17 ISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQQYQRLLNFYAN-MP 95 (513) Q Consensus 17 ~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~~~rlidy~~~-mp 95 (513) ..-.+-|++. +.+... .+......+....+. .. ..++.--....+.+++.|+++++ +. T Consensus 1 MGl~~~~~~~---~~~~~~-~~~~~~~~~~~~~~~-----------~~------~~vt~~~al~~~~v~~~i~~Ia~~iA 59 (394) T protein:vir:62 1 MGLRDRFSNY---LFKKAE-KRGYLDNVLGKSIRY-----------SG------VYVTDSNILQSSDVYELLQDISNQMV 59 (394) T ss_pred Cchhhhhhhh---ccCCCC-chhhhhhhhhccccc-----------Cc------cccChhhhhccHHHHHHHHHHHHhhc Confidence 1111111110 000000 000011111111000 00 00111112245677888888764 34 Q ss_pred cccceEeeccchhhhhhcchhHHHHHHHHHHhhcC----hhHHHHHHHHHHHHhcceeEEEEEcCcceeeeecCcceeEE Q lcl|NC_015263. 96 LYAYSVVPFKDISTANENKLKKELATVTEFLSRLN----PKYNFSKIVKLAMTVDIFYGYVIDDKESVMIQQFPNDICKI 171 (513) Q Consensus 96 t~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n----~k~~~~~i~~~~l~~g~~~gy~i~d~~~~~iq~lp~dyckI 171 (513) .+...++- ++. +.. ++. -+..+|.+=| ...+...++.+++..|..|.|+..+.-+ ++ +.+.+ T Consensus 60 ~lp~~v~~---~~g---~~~-~~~-~~~~Ll~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~~~~-----~~-~~~~~ 125 (394) T protein:vir:62 60 LADIVVED---EFG---NEI-KDD-IALQILRNPNNYLTQSEFIKLMTNTYLLEGETFPILNGAQIH-----LA-SNVFT 125 (394) T ss_pred ccceEEEc---CCC---ccc-chh-hHHHHhccCCCCCCHHHHHHHHHHHHHhcCCeEEEEecceee-----cc-ccceE Confidence 44444531 111 111 221 1233454433 5567777888999999999987533221 11 22333 Q ss_pred EEEECCeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEEe-cCccccchhhHHHHH Q lcl|NC_015263. 172 SSVSGGVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKI-NESSLTPVPPFAGTF 250 (513) Q Consensus 172 sg~~nG~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~-~~~~~~~ip~f~~v~ 250 (513) .--.+|.+.|..+- .+++..--+-|+. ..+...|++|...+. T Consensus 126 ~~~~~~~~~~~~~~-------------------------------------~~~~~~eiih~r~~~~d~~~G~s~~~~~~ 168 (394) T protein:vir:62 126 ELDDNLVEHFNIGG-------------------------------------HEIPPCMIRHVKNIGADHLRGKGILDLGR 168 (394) T ss_pred EECCceEEEEeeCC-------------------------------------EEechhheEEecCcCCCCccccChHHHHH Confidence 21133443332110 1233322233332 223346777766555 Q ss_pred HhHHHHHHHHHHHhhHhhhhhceee--eeeeccccCCCCCccccCHHHHHHHHHHHHHhcc--c---cceEEEecccccc Q lcl|NC_015263. 251 DSIYDIHSFKDLRNDKAELQNYKLL--IQKLETRSSNDNNDFTLDMPMMNYFHEALSMTVP--D---NVGVVTSPMEIDT 323 (513) Q Consensus 251 ~d~~di~~~kdL~~~~~~i~n~~ii--~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~Lp--~---gv~~v~sP~~~d~ 323 (513) ..+--.....+. ...-..|-... +-++| +...-+.++.+.+.+.+.+..- + ++.++..+.+++- T Consensus 169 ~~i~~~~~~~~~--~~~~~~ng~~~~~il~~~-------~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~ 239 (394) T protein:vir:62 169 DTLEGVMSAEKT--LTDKYKKGGLLTFLLNLD-------AHINPQNGAQSKLINAILDQLESIDEARSVKMIPLGKGYSI 239 (394) T ss_pred HHHHHHHHHHHH--HHHHHHccCCcceEEEeC-------CCCCcCHHHHHHHHHHHHHHhccccccCceeEeeCCCceeE Confidence 433222222221 11222331111 11233 1111233334444444444441 1 2335555665554 Q ss_pred cccccccccchh---hhhhHHhhhhhhhhhhhhccCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccceEEE Q lcl|NC_015263. 324 VSFDKDSSTDDS---VEKATKNFWDNAGVSQILFSSDNKTSQGIAMSIATDEQFIFGVINQLERWLNRYLLLNGMSKYFK 400 (513) Q Consensus 324 i~ld~~~~~~dt---v~~~~~~i~~~~GiS~~Lfn~d~~s~~~~~~SI~~d~~~~~~~~~~iE~~~N~~i~~~~~~~~f~ 400 (513) +.+.....+.+. .+-..++|..+.||...++|+...+ +.-.....-...-+.-++.+||..+|+.|-...-+..+. T Consensus 240 ~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~s-n~e~~~~~~~~~~l~P~~~~ie~~l~~kll~~~~~~~~~ 318 (394) T protein:vir:62 240 DTLKSPLDDEKTLAYLNVYKKDLGKFLGINVDTYTELIKE-DIEKAMMYIHNKAVRPIMKNFEDHLSLLFYAQNSGKRIK 318 (394) T ss_pred EecCCCcchHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCc-CHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCccccCceE Confidence 455432222233 3445588999999999999764432 222333333334455689999999999775432233466 Q ss_pred EEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHH-HhhCcccccCcccccccccccccccCCc Q lcl|NC_015263. 401 ATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVEN-EMLDLPEIMTPLSSSFNTSGSDIAENAI 479 (513) Q Consensus 401 ~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~-e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~ 479 (513) |.|-......-++..+.+.++..-| -++|.|+-.++-++- +--+-+..++|.. +|.=+. .+. T Consensus 319 ~~fd~~~~~~~~~~~~~~~~~~~~g-----------~~T~NE~R~~~gl~p~~~~~gd~~~~~~n--~~~~~~---~~~- 381 (394) T protein:vir:62 319 FKINILDFVTYSNKTNIGYNLVRTA-----------ITSPDNVADMLGFPKQNTKESQAIYISND--VTEIGK---KEA- 381 (394) T ss_pred EEechhhhcCHHHHHHHHHHHHhCC-----------CcCHHHHHHHhCCCCCCCCCCCeeecccc--cccccc---ccc- Confidence 6665555555555554443333222 257766655444431 0012233333332 121010 000 Q ss_pred cccCCCCcCCCCcccccccCCCCCC Q lcl|NC_015263. 480 KEKGKENGRPTNETTGNKDSDETQR 504 (513) Q Consensus 480 ~~~~~~~grPt~et~~n~~~~~~~~ 504 (513) +. ...+..+++++ T Consensus 382 ~~------------~~~kgge~~en 394 (394) T protein:vir:62 382 TD------------GSLGGGEENEN 394 (394) T ss_pred cc------------ccCCCCCCCCC Confidence 00 01111111222 No 122 >protein:vir:78083 Length: 537 # NCBI annotation: gp3 # Family: family:all:125 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468787;genbank:gi:157325368;genbank:GeneID:5601845 Probab=94.83 E-value=0.0034 Score=34.12 Aligned_cols=440 Identities=11% Similarity=0.067 Sum_probs=167.6 Q ss_pred cCccccccccccc-chHHHHHHHhhhccChhHHHHHHHHHHHHHhhcchHHHHH----------------------HHHh Q lcl|NC_015263. 36 RTPVFGAPVGSLT-SSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQQYQRLL----------------------NFYA 92 (513) Q Consensus 36 ~~~~~~s~~~s~~-~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~~~rli----------------------dy~~ 92 (513) +||.. ++... .....+++.+.+|+-......++.+-.|+.....+..|=. +|.. T Consensus 1 ~~~~~---~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~YY~g~h~Il~r~~~~~~~~~~~~~d~~~~nnki~~nf~k 77 (537) T protein:vir:78 1 MTSPL---LNKPIDQLGGLLNTEITTYMASNHIKWAHIGENYYNQENDIEKSRIFYMNDKGQLREDNYASNVKISHGFFT 77 (537) T ss_pred CCccc---ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhcccccccccccccccccccccccccchHH Confidence 44431 11100 0122233444444322334556666666666543332111 1111 Q ss_pred hccc--ccceEeeccch-hhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcceeEEEEEcCcc-eeeeecCcce Q lcl|NC_015263. 93 NMPL--YAYSVVPFKDI-STANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDIFYGYVIDDKES-VMIQQFPNDI 168 (513) Q Consensus 93 ~mpt--~dY~I~P~~~~-~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~~~gy~i~d~~~-~~iq~lp~dy 168 (513) -|.. -.|.+ ++. ....++.-.+++.+.+..+..-+....+..+.+.+.+.|..|.+...|.++ +-+..++++. T Consensus 78 ~Ivd~~~~yl~---G~Pv~~~~~d~~~~e~~~~l~~~~~~~~~~~~~el~~~~s~~G~ay~~~y~de~~~~~~~~i~p~~ 154 (537) T protein:vir:78 78 ELVDQLAQYLL---SNGVEVKVKDEDNTQLDEILQEYFDEDFQATIDTLVTNASKKGFEGIFARTTSEGKLKFQTVDGLT 154 (537) T ss_pred HHHHHHhhhhc---ccCceeecCcchhHHHHHHHHHHhhccHHHHHHHHHHHHhhcCeeEEEeeecCCCceEEEEEccce Confidence 0000 01111 111 111111112333333332223344556678888999999999888876554 5677788888 Q ss_pred eEEEEEECCeeEEEEEee----eccC----cc--hhccc-cHHHHHHHHHHhhhhhccCcccccCeee-------cCCce Q lcl|NC_015263. 169 CKISSVSGGVYNYVIDLD----ALVS----AD--IVDYY-PKEIQEAVNKYTTMKKGNNKSASNWYEI-------QDKNS 230 (513) Q Consensus 169 ckIsg~~nG~y~~~fD~s----yFd~----~~--~L~~~-p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L-------~~~kt 230 (513) |.++--..|.+...+=+- +..+ .. ..+.| +.++.. |.. .......|+.+ +...- T Consensus 155 ~~pv~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~i~~----y~~----~~~~~~~~~~~~~~~~~~~i~~~ 226 (537) T protein:vir:78 155 LIPVFDDYGVLKMIIRWYSEIRYSTKQQSTETIWHADVWNEEAVCY----YIQ----DDEGVSTTYKLDEAYNPNPAPHV 226 (537) T ss_pred eEEEEcCCCCceeEEEEEeeeeccccccCcceEEEEEEEcCCcEEE----EEe----cCCccccccccccccccccccee Confidence 877654445544433221 1110 00 12222 112111 100 00000000000 00000 Q ss_pred EEEE------e---c----CccccchhhH---------HHHHHhHHHHHHHHHHH-----hhHhhhhhceeeeeeec-cc Q lcl|NC_015263. 231 ICIK------I---N----ESSLTPVPPF---------AGTFDSIYDIHSFKDLR-----NDKAELQNYKLLIQKLE-TR 282 (513) Q Consensus 231 ~~ik------~---~----~~~~~~ip~f---------~~v~~d~~di~~~kdL~-----~~~~~i~n~~ii~~kip-~~ 282 (513) .++. . + ..-.|+.=|+ .+.|.++..+.+.=++. +..+...+ .+++ |. + T Consensus 227 ~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~f~nn~~~~sd~e~v~~LiDayd~~~S~~an~~~~~~~-~ilv--i~g~- 302 (537) T protein:vir:78 227 LAIEESTDADFEDTDGYQVLGRSYSKFPFQLLYNNKDGMSDVKRVKSIIDDYDVMNCFLSNNLQDFSE-AIYV--VKGF- 302 (537) T ss_pred eeccccccccccccccccccccCCcceeEEEeccCccCCCchhhhHHHHHHHHHHHHhhhhHHHHhcC-ceee--eecC- Confidence 0000 0 0 0001111111 12334444433332222 22222222 2221 11 0 Q ss_pred cCCCCCccccCHHHHHHHHHHHHHh----cc-cc--ceEEEecccccccccccccccchhhhhhHHhhhhhh---hhhhh Q lcl|NC_015263. 283 SSNDNNDFTLDMPMMNYFHEALSMT----VP-DN--VGVVTSPMEIDTVSFDKDSSTDDSVEKATKNFWDNA---GVSQI 352 (513) Q Consensus 283 ~~n~~~~~~vd~~~~~~~~~~ik~~----Lp-~g--v~~v~sP~~~d~i~ld~~~~~~dtv~~~~~~i~~~~---GiS~~ 352 (513) ++.+.++ +...++.. ++ +| +..++.+.+.+.+. -.++-..++|+.-+ .++.. T Consensus 303 ~~~~~~~----------~~~~l~~~~~i~v~~d~~~v~~l~~~~~~~~~e--------~~ld~L~~~I~~~s~~~~~~~~ 364 (537) T protein:vir:78 303 SGDSTDK----------LRQNIKAKKMIGVNGDNAGMEIQTVSIPYEARK--------AKMDIDVENIYRSGMGFNSTAV 364 (537) T ss_pred CCccchh----------HHHHHhhcCceeecCCCCceeEEEecCCHHHHH--------HHHHHHHHHHHHhcCCCCCccc Confidence 0111222 22222221 11 11 11222222211111 11333345555543 22222 Q ss_pred hccCCCcchHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHhhccc----ceEEEEEecCCCCccHHHHHHHHHHH Q lcl|NC_015263. 353 LFSSDNKTSQGIAMSIATDEQFIF-------GVINQLERWLNRYLLLNGM----SKYFKATMLEVTHFSKKEAHDRYITD 421 (513) Q Consensus 353 Lfn~d~~s~~~~~~SI~~d~~~~~-------~~~~~iE~~~N~~i~~~~~----~~~f~~~~l~~T~fn~ke~~~~~~~~ 421 (513) ++.+.|+..++.-...-.+.+- ..++++=+++-.+++.... .....+.|-+..+.|..+.++.+.++ T Consensus 365 --~~gn~SGvAlk~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~~~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~l 442 (537) T protein:vir:78 365 --GDGNVTNVVIKSRYTLLAMKARKMETSLRKVLRWCADMVVSDIALRGLGEYDSNDICFEIEPHVLANELDIATTRKTE 442 (537) T ss_pred --cccCCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccceeeEEeccCCCCCHHHHHHHHHHH Confidence 2223344444333222222221 1222222223333332221 12478999999999999999999998 Q ss_pred HhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCcccccccccccc-cccCCccccCCCCcCCCCcccccccCC Q lcl|NC_015263. 422 AQYGFPVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSD-IAENAIKEKGKENGRPTNETTGNKDSD 500 (513) Q Consensus 422 ~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~-~~~~~~~~~~~~~grPt~et~~n~~~~ 500 (513) .+-|.-+...+.+.+++-.+.-...+.-|....+..+......=---+.+.. .+.........+++.|++.+..+..++ T Consensus 443 ~~~giiS~eT~l~~~p~vdd~e~ek~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~ 522 (537) T protein:vir:78 443 AETEALKIGNIMTVAPRIGDDETLKLIAEELDLDYNELKDALAEQDAQSLDVSPDVQAMLDGLPVNANQPPVDPNQPVAD 522 (537) T ss_pred HhcCcchHHHHHHhCCCCCCHHHHHHHHHHHHhhhhhhhhhhhhhcccccCcCcchhhhcCCCCCCCCCCCCCccCCCCC Confidence 8899555555555677644322222222322222222222111000000000 000111112234445666666777777 Q ss_pred CCCCCCCCccCCC Q lcl|NC_015263. 501 ETQRAKDKPANTQ 513 (513) Q Consensus 501 ~~~~~~d~~~~~~ 513 (513) -+..|.+.|..+- T Consensus 523 ~~~~~~~~~~~~~ 535 (537) T protein:vir:78 523 PNVVPPTDPNAVP 535 (537) T ss_pred CCCCCCCCCccCC Confidence 7777776666554 No 123 >protein:vir:267 Length: 348 # NCBI annotation: putative capsid portal protein # Family: family:all:196 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536647;genbank:gi:17975125;genbank:GeneID:929081 Probab=94.79 E-value=0.0035 Score=34.05 Aligned_cols=313 Identities=12% Similarity=0.065 Sum_probs=152.7 Q ss_pred CCCccchheeeeehhhhhhHHHHHHHHHHHHHhhccCcccccccccccc----------hHHHHHHHhh---hcc--Chh Q lcl|NC_015263. 1 MVKNKKKRLSMIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTS----------SQSKVRKIVK---EYR--NEG 65 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~----------s~d~~k~~i~---~~~--P~~ 65 (513) |-+..... ...+.......|+-++ -.|-+.=+.+ +|+ |.. T Consensus 1 ~~~~~~~~-------------------------~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~epp~~ 55 (348) T protein:vir:26 1 MTEQLIHS-------------------------HTTDGTESKSVYSFDPNPEPVDTNSWMTRYCELFYNDFDDYWEPPIS 55 (348) T ss_pred CCccccch-------------------------hhccccCCceEEEecCCCeeecCcchHHHHHHHHhcCCCccccCCCC Confidence 22111110 0111111122333221 1122233322 344 433 Q ss_pred HHHHHHHHHHHHHhhcchHHHHHHHHhhcccccceEeeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHh Q lcl|NC_015263. 66 NQKTLRKVSEDLAVQSQQYQRLLNFYANMPLYAYSVVPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTV 145 (513) Q Consensus 66 n~~~ir~~s~~lY~~sg~~~rlidy~~~mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~ 145 (513) -..|.++ +..+++..+.|....+|..-+|.=-|. |. ..+|..+..+++.- T Consensus 56 -~~~La~l----~~~n~~h~~~i~~k~N~l~~~~~Pn~~---------------------~t----~~~f~~~~~d~ll~ 105 (348) T protein:vir:26 56 -LKGLAEI----ANANGYHGSLLKARANYVAGRFMNGGG---------------------LP----MYKMNSACWDYFGL 105 (348) T ss_pred -HHHHHHH----HhhhhhhhhhHhhhhhHHhhcccCCCC---------------------CC----HHHHHHHHHHHHhc Confidence 2333332 456667677776666665444321111 00 34455667788888 Q ss_pred cceeEEEEEc--CcceeeeecCcceeEEEEEECCeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCe Q lcl|NC_015263. 146 DIFYGYVIDD--KESVMIQQFPNDICKISSVSGGVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWY 223 (513) Q Consensus 146 g~~~gy~i~d--~~~~~iq~lp~dyckIsg~~nG~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~ 223 (513) |..|.+.+.+ +..+.+.++|+.||++. .+|.|.+... ... . + T Consensus 106 Gnay~~~~rn~~G~~~~L~~l~~~~v~~~--~d~~~~~~~~-----~g~---------------------------~--~ 149 (348) T protein:vir:26 106 GMSAFVKIRSYLKNVIALEPLPMVHMRKR--KNGDFVQLLR-----NNE---------------------------Q--K 149 (348) T ss_pred CCeEEEEEEcCCCcEEEEEEecCceeEee--ecCcEEEEEe-----cCe---------------------------E--E Confidence 9999999865 45578999999999986 6776533110 000 0 1 Q ss_pred eecCCceEEEEe-c-CccccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHH Q lcl|NC_015263. 224 EIQDKNSICIKI-N-ESSLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFH 301 (513) Q Consensus 224 ~L~~~kt~~ik~-~-~~~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~ 301 (513) +++++.-+-|+. + ....+|+|+..+....+.--+...+...-. ..|- ..-+-|-.. .++.++.++++.+. T Consensus 150 ~f~~~dIiHir~~~~~~~~~Gls~~~~a~~si~l~~~a~~~~~~~--f~NG-a~pg~Il~~-----~~~~ls~e~~~~lk 221 (348) T protein:vir:26 150 VFKAKDVIFIPQYDPQQQIYGLPDYLGSIQSSLLNRDATLFRRRY--YLNG-AHMGFIFYA-----TDPNLSEADEKALK 221 (348) T ss_pred EEcCccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHH--Hhcc-CCCceEEEe-----cCCCCCHHHHHHHH Confidence 223333344552 3 245579999999998876655555433211 2221 111111110 12457888888888 Q ss_pred HHHHHhccccce-----EEEeccc-cccccc---ccccccchhhh---hhHHhhhhhhhhhhhhccC---CCcchHH-HH Q lcl|NC_015263. 302 EALSMTVPDNVG-----VVTSPME-IDTVSF---DKDSSTDDSVE---KATKNFWDNAGVSQILFSS---DNKTSQG-IA 365 (513) Q Consensus 302 ~~ik~~Lp~gv~-----~v~sP~~-~d~i~l---d~~~~~~dtv~---~~~~~i~~~~GiS~~Lfn~---d~~s~~~-~~ 365 (513) +.++++ .|.+ .|.+|-- =+.+++ .....+.+.++ -..++|..+.||...|.|- .+.+.+. -+ T Consensus 222 ~~~~~~--~G~~n~~~~~vl~~~g~~~Gi~~~pis~~~~d~qf~e~k~~t~~dIa~af~VPp~llGi~~~~~~~~sn~e~ 299 (348) T protein:vir:26 222 EKIASS--KGIGNFRSMFVNIPNGKEKGIQLIPVGDIATKDEFERIKNITAQDIFVGHRFPAGMGGMLPQQGANVPDPLK 299 (348) T ss_pred HHHHHh--cCcccccceeEEcCCCCccceeEEEccCChhHHHHHHHHHhhHHHHHHHhCCCHHHccccCCCCCccccHHH Confidence 888775 3443 3444311 123333 22233333333 2336799999999888762 2222222 23 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhcccceEEEEEecCCCCccHHHHHHHH Q lcl|NC_015263. 366 MSIATDEQFIFGVINQLERWLNRYLLLNGMSKYFKATMLEVTHFSKKEAHDRY 418 (513) Q Consensus 366 ~SI~~d~~~~~~~~~~iE~~~N~~i~~~~~~~~f~~~~l~~T~fn~ke~~~~~ 418 (513) ....--..-+.-++++||..+|+.+... -...|+|.|-... .+ .....+ T Consensus 300 ~~~~f~~~~l~P~~~~ie~~ln~~l~~~-~~~~~~fdl~~~~--e~-~~~~a~ 348 (348) T protein:vir:26 300 VSQVYDFYEVIPVCKRFMDAVNNDPEIP-DNLKLKFNLNPGV--ES-ANGSAV 348 (348) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhCCC-CccEEEEecCccc--cc-chhhcC Confidence 3333233334468888888899887532 1223555543322 11 111111 No 124 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=94.51 E-value=0.0042 Score=33.60 Aligned_cols=422 Identities=14% Similarity=0.125 Sum_probs=162.5 Q ss_pred eeeh-hhhhhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhc-------- Q lcl|NC_015263. 11 MIDV-ESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQS-------- 81 (513) Q Consensus 11 ~~~~-~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~s-------- 81 (513) |-|. .+-..|++.++ ..|......... ...+.+.+.|..| +...++.+-+|+.... T Consensus 1 ~~~~~~~~~~~~~~~~--~~~~~~~~~~~~---------~~~~~i~~~i~~~----~~~~~~~~~~YY~g~~~i~~~~~~ 65 (503) T protein:vir:59 1 MADIYPLGKTHTEELN--EIIVESAKEIAE---------PDTTMIQKLIDEH----NPEPLLKGVRYYMCENDIEKKRRT 65 (503) T ss_pred CcccccCChhhHHhHH--Hhhhhhhhhccc---------hhHHHHHHHHHhh----cHHHHHHHHHHhccccchhhccch Confidence 3332 11122222221 112221111110 0111223333332 2234455555544332 Q ss_pred ---------------------chHHHHHHHHhhcccccceEeeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHH Q lcl|NC_015263. 82 ---------------------QQYQRLLNFYANMPLYAYSVVPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVK 140 (513) Q Consensus 82 ---------------------g~~~rlidy~~~mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~ 140 (513) +..+.+++-.++...-+=+. +. .+++--.++++ .+..=++...+..+.+ T Consensus 66 ~~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~yl~g~~~~--~~-----~~d~~~~~~l~---~~~~n~~~~~~~~~~~ 135 (503) T protein:vir:59 66 YYDAAGQQLVDDTKTNNRTSHAWHKLFVDQKTQYLVGEPVT--FT-----SDNKTLLEYVN---ELADDDFDDILNETVK 135 (503) T ss_pred hcccccccccccccccceeecchHHHHHHHHHhhhhcCCee--ec-----cCcHHHHHHHH---HHHhcCHHHHHHHHHH Confidence 23334444333322111111 00 01111112222 2223367888899999 Q ss_pred HHHHhcceeEEEEEcCcc-eeeeecCcceeEEEEEE--CCeeEEEEEeeeccC----c---chhccccH-HHHHHHHHHh Q lcl|NC_015263. 141 LAMTVDIFYGYVIDDKES-VMIQQFPNDICKISSVS--GGVYNYVIDLDALVS----A---DIVDYYPK-EIQEAVNKYT 209 (513) Q Consensus 141 ~~l~~g~~~gy~i~d~~~-~~iq~lp~dyckIsg~~--nG~y~~~fD~syFd~----~---~~L~~~p~-Ei~~~y~~Y~ 209 (513) .+++.|..|.+...|.++ +-+..++|.-|..+--. ++...+++ +++.. . ..++.|.+ .+.. |. T Consensus 136 ~~~~~G~~~~~v~~d~dg~~~i~~~~p~~~~~i~d~~~~~~~~~~i--r~~~~~~~~~~~~~~~evy~~~~i~~----~~ 209 (503) T protein:vir:59 136 NMSNKGIEYWHPFVDEEGEFDYVIFPAEEMIVVYKDNTRRDILFAL--RYYSYKGIMGEETQKAELYTDTHVYY----YE 209 (503) T ss_pred HHhhCCeEEEEEeecCCCceEEEEEccceeEEEEeCCCCCceEEEE--EEEEEecCCCceEEEEEEEeCCcEEE----EE Confidence 999999999988876654 56778888877766331 24454443 22211 0 11222221 1110 11 Q ss_pred hhhh--------cc---------CcccccCeeecCCceEEEEecCccccchhhHHHHHHhHHHHHHHHH-----HHhhHh Q lcl|NC_015263. 210 TMKK--------GN---------NKSASNWYEIQDKNSICIKINESSLTPVPPFAGTFDSIYDIHSFKD-----LRNDKA 267 (513) Q Consensus 210 ~~k~--------~~---------~~~~~~W~~L~~~kt~~ik~~~~~~~~ip~f~~v~~d~~di~~~kd-----L~~~~~ 267 (513) .... .. ......|=.+| .+.+ -....+.+-|. ++.++.+.=+ +-+..+ T Consensus 210 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP-----iv~~-~nn~~~~sd~~----~~~~liDa~d~~~s~~~~~~~ 279 (503) T protein:vir:59 210 KIDGVYQMDYSYGENNPRPHMTKGGQAIGWGRVP-----IIPF-KNNEEMVSDLK----FYKDLIDNYDSITSSTMDSFS 279 (503) T ss_pred EcCCcccccccccccccccceeecceeccCCccc-----eEEe-cCCCCCCcchh----hhHHHHHHHHHHHHHHHHHHH Confidence 0000 00 00000011111 1111 01122333332 2222222211 111111 Q ss_pred hhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhccccce-EEEecccccccccccccccchhhhhhHHhhhhh Q lcl|NC_015263. 268 ELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTVPDNVG-VVTSPMEIDTVSFDKDSSTDDSVEKATKNFWDN 346 (513) Q Consensus 268 ~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~Lp~gv~-~v~sP~~~d~i~ld~~~~~~dtv~~~~~~i~~~ 346 (513) ...+ .+++ +.=..+.+.+++.-++.....+ .+|.+.. ...++ .++. ......++...++|+.. T Consensus 280 ~~~~-~~~v--~~g~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~l~~------~~~~-~~~~~~~~~l~~~i~~~ 343 (503) T protein:vir:59 280 DFQQ-IVYV--LKNYDGENPKEFTANLRYHSVI------KVSGDGGVDTLRA------EIPV-DSAAKELERIQDELYKS 343 (503) T ss_pred HhcC-CeeE--eecCCccccchhhhhhhcccce------eccCCCcceeEec------cCCH-HHHHHHHHHHHHHHHHH Confidence 1222 2222 2100112223333222211110 1233221 11111 1110 11112345555677777 Q ss_pred hhhhhhhcc--CCCcchHHHHHHHHHHHHHHHHHHHHHHHH----HHH---HHhhcc-c----ceEEEEEecCCCCccHH Q lcl|NC_015263. 347 AGVSQILFS--SDNKTSQGIAMSIATDEQFIFGVINQLERW----LNR---YLLLNG-M----SKYFKATMLEVTHFSKK 412 (513) Q Consensus 347 ~GiS~~Lfn--~d~~s~~~~~~SI~~d~~~~~~~~~~iE~~----~N~---~i~~~~-~----~~~f~~~~l~~T~fn~k 412 (513) +++..+-+. +.+.|+..+......-.+.+....+.++.- ++. ++.... . ...+.+.|-+..+-|+. T Consensus 344 s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~ 423 (503) T protein:vir:59 344 AQAVDNSPETIGGGATGPALENLYALLDLKANMAERKIRAGLRLFFWFFAEYLRNTGKGDFNPDKELTMTFTRTRIQNDS 423 (503) T ss_pred hcccCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccccceeEEeCCCCCCCHH Confidence 776654442 223344444443332222233222222222 222 222211 1 12489999999999999 Q ss_pred HHHHHHHHHHhcCCcHHHHHHHHhCC--CHHHHHHHHHHHHHhh-CcccccCcccccccccccccccCCccccCCCCcCC Q lcl|NC_015263. 413 EAHDRYITDAQYGFPVKVYLASLMGI--DPVAFTGLLKVENEML-DLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRP 489 (513) Q Consensus 413 e~~~~~~~~~~~G~~~~~~laa~~G~--~p~~~~~~~~~E~e~L-~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grP 489 (513) +.++.+.++.+-|.-+...+...+|+ +|.+.+.+.+.|++.. .....++ +.+ .+.+...+ ..| T Consensus 424 ~~~~~~~kl~~~GiiS~et~l~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~---------~~~----~~~~~~~~-~~~ 489 (503) T protein:vir:59 424 EIVQSLVQGVTGGIMSKETAVARNPFVQDPEEELARIEEEMNQYAEMQGNLL---------DDE----GGDDDLEE-DDP 489 (503) T ss_pred HHHHHHHHHHhCCCCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhhcccc---------Ccc----CCCCCCCc-CCC Confidence 99999999999995334445556777 6788899988887431 1111100 110 00000111 011 Q ss_pred CCcccccccCCCCCCCC Q lcl|NC_015263. 490 TNETTGNKDSDETQRAK 506 (513) Q Consensus 490 t~et~~n~~~~~~~~~~ 506 (513) + .+..++++.++++ T Consensus 490 ~---~~~~~~~~~g~~~ 503 (503) T protein:vir:59 490 N---AGAAESGGAGQVS 503 (503) T ss_pred C---CCcccCCCCCCcC Confidence 1 1111222222222 No 125 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=94.04 E-value=0.0056 Score=32.93 Aligned_cols=436 Identities=9% Similarity=0.006 Sum_probs=179.2 Q ss_pred cchheeeeehhhhhhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhc--- Q lcl|NC_015263. 5 KKKRLSMIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQS--- 81 (513) Q Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~s--- 81 (513) ----+..|++|+-.-+. +-|..+++ ..-.+.+.+++..| ..-...++.+-+|+-... T Consensus 1 ~~~~~~~~~~~~~~~~~---------~p~~~~~~---------~~~~~l~~~l~~~~--~~~~~rl~~l~~YY~G~~~~~ 60 (501) T protein:vir:25 1 MTVPVDVIADAPAADVE---------FPEDSMSR---------EQLGALVADMWRLH--ISERQWLDRIYEYTKGLRGRP 60 (501) T ss_pred CcccchhhhccCccccc---------CCcccCCh---------HHHHHHHHHHHHHH--HHHHHHHHHHHHHHhcCCCch Confidence 11123334444321110 11111111 11122223333322 122233344444433221 Q ss_pred -------------------chHHHHHHHHhhcccccceEeeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHH Q lcl|NC_015263. 82 -------------------QQYQRLLNFYANMPLYAYSVVPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLA 142 (513) Q Consensus 82 -------------------g~~~rlidy~~~mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~ 142 (513) +.-+.++|-++....++.+..| +. +.....++ ..+.-++....+.+.+++ T Consensus 61 ~~~~~~~~~~~~~~~~~v~n~~~~ivd~~a~~l~~~gf~~~----d~----~~~~~l~~---i~~~N~~d~~~~~~~~~a 129 (501) T protein:vir:25 61 EVPEGASDEVKELAKLSVKNVLSLVRDSFAQNLSVVGYRNA----LA----KENDPAWE---MWQRNRMDARQAEVHRPA 129 (501) T ss_pred hccccCChhhhhhHhhhhcChHHHHHHHHHhhhcccceecC----Cc----cchHHHHH---HHHhcChhHHHHHHHHHH Confidence 2223333333332233433322 11 11222333 345666888889999999 Q ss_pred HHhcceeEEEEEcCcceeeeecCcceeEEEEE---ECCeeEEEEEeeeccC--c--chhccccHHHHHHHHHHhhhhhcc Q lcl|NC_015263. 143 MTVDIFYGYVIDDKESVMIQQFPNDICKISSV---SGGVYNYVIDLDALVS--A--DIVDYYPKEIQEAVNKYTTMKKGN 215 (513) Q Consensus 143 l~~g~~~gy~i~d~~~~~iq~lp~dyckIsg~---~nG~y~~~fD~syFd~--~--~~L~~~p~Ei~~~y~~Y~~~k~~~ 215 (513) ++.|.-|.+...+.++-.+..++|.-|..+-. .+....+++-...-.. . ....-|.+....-|..+....... T Consensus 130 ~i~G~ay~~v~~de~~~~i~~~sp~~~~~iy~D~~~~~~~~~ai~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~ 209 (501) T protein:vir:25 130 LTYGASYVTVTPTDEGPVFRTRSPRQILAVYADPSVDAWPQYALETWVAQKDAKPHRRGVLYDDTYMYELDLGEVVLGDA 209 (501) T ss_pred hhcCceEEEEecCCCCCeEEEeccccEEEEEecCCCCcceeEEEEEEeeccccCcceeEEEecCeeEEEEecCceeeeec Confidence 99999999988877776788899999986532 2233455544322111 0 111112111110000000000000 Q ss_pred -Cccc-----ccC--------eeecCCceE--EEEe-cCccccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeee Q lcl|NC_015263. 216 -NKSA-----SNW--------YEIQDKNSI--CIKI-NESSLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQK 278 (513) Q Consensus 216 -~~~~-----~~W--------~~L~~~kt~--~ik~-~~~~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~k 278 (513) ...| .-| ..-+..-.+ ++.+ |...+. +...+-|.++.++.+.=+...+ .+.+. .-... T Consensus 210 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPiv~f~N~~~~~--~~g~sdie~v~~l~Da~~~~~s--~~~~~-~e~~a 284 (501) T protein:vir:25 210 GGGQATQQPVNVREVTDVIEHGATFEGKPVCPVVRFVNGRDAD--DMIVGEVAPLILLQQAINSVNF--DRLIV-SRFGA 284 (501) T ss_pred cccccccccccccccccccccccccCCccceeeEeccCccccC--ccccchhhhhHHHHHHHHHHHH--HHHHH-HHhhc Confidence 0000 000 001111111 2333 222111 2223334444333332221111 11111 11122 Q ss_pred ecccc--CCCCCccccCHHHHHHHHHHHHH--hccccceEEEeccccccccccccc--ccchhhhhhHHhhhhhhhhhhh Q lcl|NC_015263. 279 LETRS--SNDNNDFTLDMPMMNYFHEALSM--TVPDNVGVVTSPMEIDTVSFDKDS--STDDSVEKATKNFWDNAGVSQI 352 (513) Q Consensus 279 ip~~~--~n~~~~~~vd~~~~~~~~~~ik~--~Lp~gv~~v~sP~~~d~i~ld~~~--~~~dtv~~~~~~i~~~~GiS~~ 352 (513) .|.+. |-+..++. .+...... ++|++=+.+ ..|++.. .--+.++....+|...+++.-. T Consensus 285 ~p~~~i~G~~~~~~~-------~~~~~~~~i~~~~~~~~~~--------~q~~~~~~~~~~~~l~~~i~~i~~~s~~P~~ 349 (501) T protein:vir:25 285 NPQRVISGWTGSKAE-------VLKASALRVWTFEDPEVKA--------QAFPPASVEPYNLILEEMLQHVAMVAQISPA 349 (501) T ss_pred cHHHHHhCCCCCccc-------hhhhcccceeccCCCCceE--------EEecccChHHHHHHHHHHHHHHHhhcCCChh Confidence 33211 21111111 11111111 123221111 1222211 1114455555777777888866 Q ss_pred hccCC--CcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----h-ccc----ceEEEEEecCCCCccHHHHHHHHHHH Q lcl|NC_015263. 353 LFSSD--NKTSQGIAMSIATDEQFIFGVINQLERWLNRYLL----L-NGM----SKYFKATMLEVTHFSKKEAHDRYITD 421 (513) Q Consensus 353 Lfn~d--~~s~~~~~~SI~~d~~~~~~~~~~iE~~~N~~i~----~-~~~----~~~f~~~~l~~T~fn~ke~~~~~~~~ 421 (513) -|++. +.|+.+++.....-...+.+..+.+..=+.+.+. - +.. ...+++.|-+..+-|..+.++.+.|+ T Consensus 350 ~~~~~~~N~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~rl~~~~~~~~~~~~~~~i~v~w~~~~~~s~~~~ada~~kl 429 (501) T protein:vir:25 350 QVTGKMINVSAEALAAAEANQQRKLAAKRESFGESWEQLLRLAAEMDDDPDTAADSGAEVLWRDTEARSFGAVVDGITKL 429 (501) T ss_pred hhccccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccceeeeEEecCCCCCCHHHHHHHHHHH Confidence 67664 3455555555555555554444444443333221 1 111 13688899999999999999999999 Q ss_pred HhcCCcHHHHHHHHhCCCHHHHHHHHHHHHH--hhCcccccCcccccccccccccccCCccccCCCCcCCCCcccccccC Q lcl|NC_015263. 422 AQYGFPVKVYLASLMGIDPVAFTGLLKVENE--MLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETTGNKDS 499 (513) Q Consensus 422 ~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e--~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~~n~~~ 499 (513) .+-|.|....+.-+.|+++.++-.+.+.+.+ ..++-+.+...+. .+ .. ..+.+..+..++.+ T Consensus 430 ~~~gis~et~~~~~~g~~~~~ie~~~~~~~e~~~~~~~~~~~~~~~-------~~----~~-~~~~~~~~~~~~~~---- 493 (501) T protein:vir:25 430 ASAGIPIEHLLSMVPGMTQQTIQAIKDSLRGGEVKSLVDKLLSNEP-------AP----VP-PPPPQAAAQALNEG---- 493 (501) T ss_pred HhcCCCHHHHHHHcCCCCHHHHHHHHHHHHHHhHHHHHHHhhccCc-------CC----CC-CCCCCCCccccccc---- Confidence 9999998887777789999887655555432 2222222222111 00 00 00011111111111 Q ss_pred CCCCCCCCC Q lcl|NC_015263. 500 DETQRAKDK 508 (513) Q Consensus 500 ~~~~~~~d~ 508 (513) +++...++ T Consensus 494 -~~~~~~g~ 501 (501) T protein:vir:25 494 -GVNGNGGA 501 (501) T ss_pred -cCCCCCCC Confidence 11111111 No 126 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=93.90 E-value=0.006 Score=32.76 Aligned_cols=445 Identities=11% Similarity=0.105 Sum_probs=176.0 Q ss_pred eeehhhhhhHHHHHHHHHHHHHhhc-cCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhcchHHH--- Q lcl|NC_015263. 11 MIDVESISSYSNKRNNRISILRDDN-RTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQQYQR--- 86 (513) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~i~~~~~-~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~~~r--- 86 (513) |+-|..+-.++-.++..-..++.-. .++...+.-.-.....+.+.+.|+.|. ......++++.+|+........+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~-~~~~~r~~~l~~Yy~g~~~il~~~~~ 79 (511) T protein:vir:93 1 MLKVNEFETDTDLRGNINYLFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHM-DYQRPRLKVLSDYYEGKTKNLVELTR 79 (511) T ss_pred CccccchhhhhhhhhhhhhhhhhhhCCcccccchhhhhhccHHHHHHHHHHHH-HhhHHHHHHHHHHhcccCccccccCc Confidence 6666655555555544433444333 333321111112223344566666542 22234566666666555443211 Q ss_pred --------------HHHHHhhcccccceE-eeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcceeEE Q lcl|NC_015263. 87 --------------LLNFYANMPLYAYSV-VPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDIFYGY 151 (513) Q Consensus 87 --------------lidy~~~mpt~dY~I-~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~~~gy 151 (513) ...+++...+ .|.+ .|..- ...+....+ .+..++..-++...+..+.+.+++.|..|.+ T Consensus 80 ~~~~~~~~~ki~~n~~k~Iv~~~~-~yl~g~p~~~--~~~d~~~~~---~l~~~~~~n~~~~~~~~~~~~~~~~G~ay~~ 153 (511) T protein:vir:93 80 RKEEYMADNRVAHDYASYISDFIN-GYFLGNPIQY--QDDDKDVLE---VIEAFNDLNDVESHNRSLGLDLSIYGKAYEL 153 (511) T ss_pred CcccccCcceeecchHHHHHHHHh-hhhcccCeee--ccCChHHHH---HHHHHHhhcCHhHHHHHHHHHHHhcCeeEEE Confidence 1122222111 1211 11100 001111122 1233455567888999999999999999998 Q ss_pred EEEcCcc-eeeeecCcceeEEEEEEC--CeeEEEEEeeecc--C----cc--hhccc-cHHHHHHHHHHhhhhhccCccc Q lcl|NC_015263. 152 VIDDKES-VMIQQFPNDICKISSVSG--GVYNYVIDLDALV--S----AD--IVDYY-PKEIQEAVNKYTTMKKGNNKSA 219 (513) Q Consensus 152 ~i~d~~~-~~iq~lp~dyckIsg~~n--G~y~~~fD~syFd--~----~~--~L~~~-p~Ei~~~y~~Y~~~k~~~~~~~ 219 (513) ...+.++ +-+..++|.-|.++--.. +.+.+++=.-.-. + .. ..+-| +..+.+ |.... ...... T Consensus 154 vy~de~~~~~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~iyt~~~i~~----~~~~~-~~~~~~ 228 (511) T protein:vir:93 154 MIRNQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYR----YLTSR-TNGLKL 228 (511) T ss_pred EEeCCCCceEEEEEccceeEEEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEE----EEecC-CCcccc Confidence 8876554 566788999888775422 4454443221100 0 00 01111 111110 10000 000000 Q ss_pred ccCeeecCCc----eEEEEecCccccchhhHHHHHHhHHHHHHHHHHH--hhHhhhhhc--eee--eeeeccccCCCCCc Q lcl|NC_015263. 220 SNWYEIQDKN----SICIKINESSLTPVPPFAGTFDSIYDIHSFKDLR--NDKAELQNY--KLL--IQKLETRSSNDNND 289 (513) Q Consensus 220 ~~W~~L~~~k----t~~ik~~~~~~~~ip~f~~v~~d~~di~~~kdL~--~~~~~i~n~--~ii--~~kip~~~~n~~~~ 289 (513) .-|..-+.++ -.++.+- ....+.+-| .++.++.+.=+.. ..-+.++.. .++ .+..+ T Consensus 229 ~~~~~~~~~~~~g~vPvv~~~-nn~~g~gd~----e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~--------- 294 (511) T protein:vir:93 229 TPRENGFESHSFERMPITEFS-NNERRKGDY----EKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLN--------- 294 (511) T ss_pred ccccccccccCCCccceEEec-CCCCCCCch----hhHHHHHHHHHHHHHHHHHHHHHhhCcceeeecCcc--------- Confidence 0111111111 1122221 122343333 3333333322211 111122211 111 11111 Q ss_pred cccCHHHHHHHHHHHHHhccc-----cceEEEec-cccccccccccc-ccchhhhhhHHhhhhhhhhhhhhccCCCcchH Q lcl|NC_015263. 290 FTLDMPMMNYFHEALSMTVPD-----NVGVVTSP-MEIDTVSFDKDS-STDDSVEKATKNFWDNAGVSQILFSSDNKTSQ 362 (513) Q Consensus 290 ~~vd~~~~~~~~~~ik~~Lp~-----gv~~v~sP-~~~d~i~ld~~~-~~~dtv~~~~~~i~~~~GiS~~Lfn~d~~s~~ 362 (513) .+..........-.-.++. +.+.-..+ .++.-+.-+.+. ...-.++-..++|+.-+++..+-+.+.+++.+ T Consensus 295 --~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~P~~~~~~~~~n~S 372 (511) T protein:vir:93 295 --LDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQS 372 (511) T ss_pred --cCchhhcccccccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccch Confidence 1111111110000000100 00000000 000000001001 11122445557788888877665544333334 Q ss_pred H--HHHHHHHHHHHHHH----HHHHHHHHHHH---HHhhcc-c--ce---EEEEEecCCCCccHHHHHHHHHHHHhcC-C Q lcl|NC_015263. 363 G--IAMSIATDEQFIFG----VINQLERWLNR---YLLLNG-M--SK---YFKATMLEVTHFSKKEAHDRYITDAQYG-F 426 (513) Q Consensus 363 ~--~~~SI~~d~~~~~~----~~~~iE~~~N~---~i~~~~-~--~~---~f~~~~l~~T~fn~ke~~~~~~~~~~~G-~ 426 (513) | ++.....-...+.. |=+.|++-++. ++.... . .. ..++.|-+..+-|..+.++.+.++. | . T Consensus 373 g~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~~~f~~~~p~n~~e~~~~~~kl~--g~i 450 (511) T protein:vir:93 373 GEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTWSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSG--GKI 450 (511) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccccccceEEeCCCCCCCHHHHHHHHHHHh--ccC Confidence 4 33333222222222 22222222222 222211 1 11 3688899999999999999999984 6 6 Q ss_pred cHHHHHHHHhCC--CHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccCCCCcCCCCcccccccCCC Q lcl|NC_015263. 427 PVKVYLASLMGI--DPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETTGNKDSDE 501 (513) Q Consensus 427 ~~~~~laa~~G~--~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~~n~~~~~ 501 (513) |....|. .+++ +|.+.+.++..|++. .++-...+.. ++.+ +.+.+.+.+++..+.+..+ T Consensus 451 S~et~~~-~l~~v~d~~~E~~ri~~E~~~-~~~~~~~~~~-------~~~~-------~~~~~~~~~~~~~~~~~~~ 511 (511) T protein:vir:93 451 SQTTLMS-LFSFFQDPELEVKKIEEDEKE-SIKKAQKGIY-------KDPR-------DINDDEQDDDTKDTVDKKE 511 (511) T ss_pred chHHHHH-hCCCCCCHHHHHHHHHHHHHH-HHHHHhhhcc-------cCCC-------CCCCCCCCCcccccccccC Confidence 6655554 5676 578889999988753 2221111111 1100 0011111111111111111 No 127 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=93.90 E-value=0.006 Score=32.76 Aligned_cols=447 Identities=11% Similarity=0.108 Sum_probs=175.3 Q ss_pred eeehhhhhhHHHHHHHHHHHHHhhcc-CcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhcchHH---- Q lcl|NC_015263. 11 MIDVESISSYSNKRNNRISILRDDNR-TPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQQYQ---- 85 (513) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~i~~~~~~-~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~~~---- 85 (513) |+-|..+..+|-.++.....++.-.+ .....+.-.-.....+.+++.|..|. ......++++.+|+-....... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~-~~~~~r~~~l~~Yy~g~~~i~~~~~~ 79 (511) T protein:vir:99 1 MLKVNEFETDTDLRGNINYLFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHM-DYQRPRLKVLSDYYEGKTKNLVELTR 79 (511) T ss_pred CccccchhhhhhhhhhhhhhhhhhhCCccccchhhhhhhccHHHHHHHHHHHH-HhhHHHHHHHHHHhcccCccccccCc Confidence 66666666666555443333333322 22211111112222333455555442 1112345555555544433211 Q ss_pred -------------HHHHHHhhcccccceE-eeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcceeEE Q lcl|NC_015263. 86 -------------RLLNFYANMPLYAYSV-VPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDIFYGY 151 (513) Q Consensus 86 -------------rlidy~~~mpt~dY~I-~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~~~gy 151 (513) +...+++...+ .|.+ .|..-. .......+ .+..+++.-++...+..+.+.+++.|..|.+ T Consensus 80 ~~~~~~~~~ki~~n~~k~Iv~~~~-~yl~g~p~~~~--~~d~~~~~---~l~~~~~~n~~~~~~~~~~~~~~i~G~a~~~ 153 (511) T protein:vir:99 80 RKEEYMADNRVAHDYASYISDFIN-GYFLGNPIQYQ--DDDKDVLE---AIEAFNDLNDVESHNRSLGLDLSIYGKAYEL 153 (511) T ss_pred ccccccCcceeecchHHHHHHHHH-hhhcccCceee--cCchHHHH---HHHHHHhhcCHhHHHHHHHHHHHhcCeeEEE Confidence 11111111111 1111 110000 01111222 2233455557889999999999999999998 Q ss_pred EEEcCc-ceeeeecCcceeEEEEEEC--CeeEEEEEeeeccC--------cc--hhccc-cHHHHHHHHHHhhhhhccCc Q lcl|NC_015263. 152 VIDDKE-SVMIQQFPNDICKISSVSG--GVYNYVIDLDALVS--------AD--IVDYY-PKEIQEAVNKYTTMKKGNNK 217 (513) Q Consensus 152 ~i~d~~-~~~iq~lp~dyckIsg~~n--G~y~~~fD~syFd~--------~~--~L~~~-p~Ei~~~y~~Y~~~k~~~~~ 217 (513) ...+.+ .+-+.-++|.-|.++--.. +.+.+++ ++... .. ..+.| +..+.. |.... .... T Consensus 154 vy~ded~~~~i~~~~p~~~~~vyd~~~~~~~~~~v--r~~~~~~~~~~~~~~~~~~~vyt~~~i~~----~~~~~-~~~~ 226 (511) T protein:vir:99 154 MIRNQDDETRLYKSDAMSTFVIYDNTIERNSIAGV--RYLRTKPIDKTDEDEVFTVDLFTSHGVYR----YLTSR-TNGL 226 (511) T ss_pred EEeCCCCceEEEEEccceeEEEEcCCCCCceEEEE--EEEEeeecccCccceEEEEEEEeCCcEEE----EEecC-Cccc Confidence 887654 4677778888888775422 4455444 22211 00 01111 111111 10000 0000 Q ss_pred ccccCeeecCCc----eEEEEecCccccchhhHHHHHHhHHHHHHHHHHH--hhHhhhhhceeeeeeeccccCCCCCccc Q lcl|NC_015263. 218 SASNWYEIQDKN----SICIKINESSLTPVPPFAGTFDSIYDIHSFKDLR--NDKAELQNYKLLIQKLETRSSNDNNDFT 291 (513) Q Consensus 218 ~~~~W~~L~~~k----t~~ik~~~~~~~~ip~f~~v~~d~~di~~~kdL~--~~~~~i~n~~ii~~kip~~~~n~~~~~~ 291 (513) ...-|..-...| -.++.+- ....+.|-|. ++.++.+.=+.. ..-..++-. ...+.+..| ... T Consensus 227 ~~~~~~~~~~~~~~g~vPvv~~~-nn~~g~sd~e----~v~~liDa~d~~~S~~~~~~~~~---~~~~lv~~G----~~~ 294 (511) T protein:vir:99 227 KLTPRENGFESHSFERMPITEFS-NNERRKGDYE----KVITLIDLYDNAESDTANYMSDL---NDAMLLIKG----NLN 294 (511) T ss_pred cccccccccccCCCCccceEEec-CCCCCCCchh----hhHHHHHHHHHHHHHHHHHHHHh---hchhhhhcc----Ccc Confidence 000011111111 1122321 1223444443 333333322211 111112111 011111000 001 Q ss_pred cCHHHHHHHHHH-----HHHhccccceEEEe-cccccccccccc-cccchhhhhhHHhhhhhhhhhhhhccCCCcchHH- Q lcl|NC_015263. 292 LDMPMMNYFHEA-----LSMTVPDNVGVVTS-PMEIDTVSFDKD-SSTDDSVEKATKNFWDNAGVSQILFSSDNKTSQG- 363 (513) Q Consensus 292 vd~~~~~~~~~~-----ik~~Lp~gv~~v~s-P~~~d~i~ld~~-~~~~dtv~~~~~~i~~~~GiS~~Lfn~d~~s~~~- 363 (513) .+.......... -......+.+.... +.++.-+.-+.+ ......++-..++|+.-+++...-+.+-+++.+| T Consensus 295 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~gn~Sg~ 374 (511) T protein:vir:99 295 LDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGE 374 (511) T ss_pred cCchhhcccccccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHH Confidence 111111111000 00000011110000 001111111111 1111235555677888888877666443333444 Q ss_pred -HHHHHHHHHHHHHHHHH----HHHHHHHH---HHhhcc-cc--e---EEEEEecCCCCccHHHHHHHHHHHHhcC-CcH Q lcl|NC_015263. 364 -IAMSIATDEQFIFGVIN----QLERWLNR---YLLLNG-MS--K---YFKATMLEVTHFSKKEAHDRYITDAQYG-FPV 428 (513) Q Consensus 364 -~~~SI~~d~~~~~~~~~----~iE~~~N~---~i~~~~-~~--~---~f~~~~l~~T~fn~ke~~~~~~~~~~~G-~~~ 428 (513) ++.....-.+.+..... .|++-++. ++.... .. . ..++.|-+..+-|..+.++.+.++. | .|. T Consensus 375 Alk~~~~~l~~ka~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~~i~i~f~~~~p~n~~e~~~~~~kl~--GiiS~ 452 (511) T protein:vir:99 375 AMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDVSKDFNTVRYVYNRNLPKSLIEELKAYIDSG--GKISQ 452 (511) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccccccccceEEeCCCCCcCHHHHHHHHHHHh--ccCCH Confidence 43333322222222222 22222222 222211 11 1 3688899999999999999999985 6 665 Q ss_pred HHHHHHHhCC--CHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccCCCCcCCCCcccccccCCC Q lcl|NC_015263. 429 KVYLASLMGI--DPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETTGNKDSDE 501 (513) Q Consensus 429 ~~~laa~~G~--~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~~n~~~~~ 501 (513) ...+. .+++ +|.+.+.++..|++. .+.....+.. .+ +.+.+.+.+..++..+.+.++ T Consensus 453 et~l~-~l~~v~D~~~E~~ri~~E~~~-~~~~~~~~~~-------~~-------~~~~~~~~~~~~~~~~~d~~e 511 (511) T protein:vir:99 453 TTLMS-LFSFFQDPELEVKKIEEDEKE-SIKKAQKNMY-------QD-------PRNINDDEQDDSTKDSIDKKE 511 (511) T ss_pred HHHHH-hCCCCCCHHHHHHHHHHHHHH-HHHHHhhccc-------cc-------CCCCCCCCCCCCCcCcccccC Confidence 55555 4665 578999999999754 2222222111 11 001111222222222222222 No 128 >protein:vir:1082 Length: 359 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076736;genbank:gi:13095846;genbank:GeneID:920394 Probab=93.72 E-value=0.0066 Score=32.54 Aligned_cols=331 Identities=14% Similarity=0.124 Sum_probs=137.7 Q ss_pred CCCccchheeeeehhhhhhHHHHHHHHHHHHHhhccCcccc---cccccccchHHHHHHHhhhccChhHHHHHHHHHHHH Q lcl|NC_015263. 1 MVKNKKKRLSMIDVESISSYSNKRNNRISILRDDNRTPVFG---APVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDL 77 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~---s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~l 77 (513) -.-|+-+|-+... ..+..+... +.+.....+. +.++++ T Consensus 2 ~~~~~f~~r~~~~-------------------~~~~~~~~~~~~~~~~~~~v~~---~~al~~----------------- 42 (359) T protein:vir:10 2 SILNPFERRSSIT-------------------PNNYYPFMVQNGSIVPNSLVDA---TEALKN----------------- 42 (359) T ss_pred cccchhhccccCC-------------------CCcchhhhhccccccCCcccCH---HHhhcc----------------- Confidence 0011111111000 000000000 0000001111 112221 Q ss_pred HhhcchHHHHHHHH----hhcccccceEeeccchhhhhhcchhHHHHHHHHHHhhcC----hhHHHHHHHHHHHHhccee Q lcl|NC_015263. 78 AVQSQQYQRLLNFY----ANMPLYAYSVVPFKDISTANENKLKKELATVTEFLSRLN----PKYNFSKIVKLAMTVDIFY 149 (513) Q Consensus 78 Y~~sg~~~rlidy~----~~mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n----~k~~~~~i~~~~l~~g~~~ 149 (513) +.+...|+.+ +++|--+ ......++.+=| --.+...++..++..|..| T Consensus 43 ----~av~~cv~~ia~~ia~~p~~~--------------------~~~~~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay 98 (359) T protein:vir:10 43 ----SDLYAVTSLISSDIAGTRFIG--------------------NQVFTSVLNNPSHLTNAFSFWQTAILNLLLNGNVF 98 (359) T ss_pred ----hHHHHHHHHHHHhhhcCcccc--------------------chHHHHHhhcccccCCHHHHHHHHHHhccccCceE Confidence 2222333333 3444211 111122233333 3345567777888889999 Q ss_pred EEEEEcCcc--eeeeecCcceeEEEEEECCeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecC Q lcl|NC_015263. 150 GYVIDDKES--VMIQQFPNDICKISSVSGGVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQD 227 (513) Q Consensus 150 gy~i~d~~~--~~iq~lp~dyckIsg~~nG~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~ 227 (513) .+++-+..+ ..+.++|+++|.|. ..+|.+.|.+. .+.. ....+++. T Consensus 99 ~~i~r~~~g~~~~l~~l~~~~v~i~-~~~~~~~y~~~--~~~~-----------------------------~~~~~~~~ 146 (359) T protein:vir:10 99 LAILKGDNSLMKELRLIPSNAITID-LTDDTLTYEVN--QFDD-----------------------------YPSAKYNA 146 (359) T ss_pred EEEEECCCCeEEEEEEeCCceEEEE-EcCCeEEEEEE--ecCC-----------------------------ceEEEEcc Confidence 998866655 56779999999985 34555544432 1110 11233444 Q ss_pred CceEEEEecC------ccccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHH Q lcl|NC_015263. 228 KNSICIKINE------SSLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFH 301 (513) Q Consensus 228 ~kt~~ik~~~------~~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~ 301 (513) ..-+-|+... +...|++|...+...+--.....+.. ..-..|-.....-|-+ . .-.++.++++++. T Consensus 147 ~evih~~~~~~~~~~~dg~~G~spi~~~~~~i~~~~~~~~~~--~~~f~ng~~~~gil~~---~---~~~l~~e~~~~~~ 218 (359) T protein:vir:10 147 SEMIHVKIMAYGVDTLHNLVGHSPLESLTSEIGQQKEANRLS--LSTLKGALNPTSVVKV---P---QGTLSSEAKDSIR 218 (359) T ss_pred cceEEeccCCCCCCccCccccccHHHHHHHHHHHHHHHHHHH--HHHHhccCCcceEEEe---C---CCCCCHHHHHHHH Confidence 4455555422 23457777765444332222222211 1112221111111111 0 1136777777766 Q ss_pred HHHHHhcc-c---cceEEEecccccccccccccccch---hhhhhHHhhhhhhhhhhhhccCCCcchHHHHHHHH-HHHH Q lcl|NC_015263. 302 EALSMTVP-D---NVGVVTSPMEIDTVSFDKDSSTDD---SVEKATKNFWDNAGVSQILFSSDNKTSQGIAMSIA-TDEQ 373 (513) Q Consensus 302 ~~ik~~Lp-~---gv~~v~sP~~~d~i~ld~~~~~~d---tv~~~~~~i~~~~GiS~~Lfn~d~~s~~~~~~SI~-~d~~ 373 (513) +.+++..- + ++..+-..+++..+.+. ..+.+ +.+-..++|-.+.||...++|+...+.+.. .+++ .-.. T Consensus 219 ~~~~~~~~~~n~g~~~vl~~g~~~~~l~~~--~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~-~~~e~~~~~ 295 (359) T protein:vir:10 219 KEFEKANGGNNSGRVMVLDQSADFSTVSIN--ADVANYLNSMNWGRTQIAKAFGVSDSYLNGTGDQQSSL-DQIKDLYVN 295 (359) T ss_pred HHHHHHhCccccCCceecCCCcceeeecCC--HHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCcccccH-HHHHHHHHH Confidence 66655431 1 13333333444444432 22222 344455789999999999997632222211 1121 1112 Q ss_pred HHHHHHHHHHHHHHHHHhhcccceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHh Q lcl|NC_015263. 374 FIFGVINQLERWLNRYLLLNGMSKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENEM 453 (513) Q Consensus 374 ~~~~~~~~iE~~~N~~i~~~~~~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~ 453 (513) ++-..++.++..+|+.|.... .+..-....|+.......+.++..-| =++|.|.-.++.+| T Consensus 296 ~l~~~l~p~~~~l~~~l~~~~-----~~~~~~~~~~d~~~~~~~~~~~~~~G-----------~~t~NE~R~~l~~~--- 356 (359) T protein:vir:10 296 ALNRFIEPLISELRIKCDSSI-----GVDMSPITDYSNSVFKADILNWVKEG-----------IIEPTEAKTLLESK--- 356 (359) T ss_pred HHHHHHHHHHHHHHHHhhhhh-----cccchhhhhcCHHHHHHHHHHHHhCC-----------CcCHHHHHHHhCCC--- Confidence 223445556666666554321 11111112233334444444333222 25566555443333 Q ss_pred hCccccc Q lcl|NC_015263. 454 LDLPEIM 460 (513) Q Consensus 454 L~l~~~~ 460 (513) -.| T Consensus 357 ----pv~ 359 (359) T protein:vir:10 357 ----GII 359 (359) T ss_pred ----CCC Confidence 333 No 129 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=93.60 E-value=0.007 Score=32.40 Aligned_cols=429 Identities=10% Similarity=-0.005 Sum_probs=164.3 Q ss_pred cchhee-eeehhhhhhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhcc---------ChhHHHHHHHHH Q lcl|NC_015263. 5 KKKRLS-MIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYR---------NEGNQKTLRKVS 74 (513) Q Consensus 5 ~~~~~~-~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~---------P~~n~~~ir~~s 74 (513) ---|+. |.+.+...-. ++.|+-.-....|. .++ +..|+ |..--..++++ T Consensus 1 ~~~~i~~~~~~~~~~~~-----~~~l~~~~~~~~~r--------------~~~-~~~Yy~G~~~i~~~~~~~~~~~~~~- 59 (485) T protein:vir:10 1 MTAPLPGQEEIEDPAIA-----RDEMVSAFEDSTQN--------------LKT-NTSYYEAERRPEAIGVTVPIQMQSL- 59 (485) T ss_pred CCCCCCCCCCCCCHHHH-----HHHHHHHHHHHHHH--------------HHH-HHHHHhcCCcchhcCCCCChhhhhh- Confidence 011111 1112111100 11111111011110 000 11111 11111111111 Q ss_pred HHHHhhcchHHHHHHHHhhcccccceEeeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcceeEEEEE Q lcl|NC_015263. 75 EDLAVQSQQYQRLLNFYANMPLYAYSVVPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDIFYGYVID 154 (513) Q Consensus 75 ~~lY~~sg~~~rlidy~~~mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~~~gy~i~ 154 (513) ....+--+.+++..+.-..++-+..| . .....+..++ .+..-++......+.+.+++.|..|.+... T Consensus 60 ---~~~~n~~~~ivd~~~~~l~~~g~~~~----~---~~~~~~~~~~---i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~ 126 (485) T protein:vir:10 60 ---LAHVGYPRLYVDSIAERQAVEGFRFG----D---ADEADEELWQ---WWQANNLDIEAPLGYTDAYVHGRSYITISR 126 (485) T ss_pred ---hhhcCcHHHHHHHHHhhhcccceecC----C---CchhHHHHHH---HHHhcCHhHHHHHHHHHHhhcCceEEEEee Confidence 12223335555555543333332211 1 1111222333 345556888889999999999999988776 Q ss_pred cCcc---------eeeeecCcceeEEEEE-ECCeeEEEEEeeeccCcc---hhccccHHHHHHHHHHhhhhhccCccccc Q lcl|NC_015263. 155 DKES---------VMIQQFPNDICKISSV-SGGVYNYVIDLDALVSAD---IVDYYPKEIQEAVNKYTTMKKGNNKSASN 221 (513) Q Consensus 155 d~~~---------~~iq~lp~dyckIsg~-~nG~y~~~fD~syFd~~~---~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~ 221 (513) +... ..+..++|..|..+-= ..+...+++=..+=+... .+..|.++... .|. . .... T Consensus 127 ~e~~~~~~~~~~~~~i~~~~p~~~~~~~D~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~---~~~---~----~~~~ 196 (485) T protein:vir:10 127 PDPQIDLGWDPNTPIIRVEPPTRMYAEIDPRIGRVSKAIRVAYDAEGNEIQAATLYTPNDIF---GWY---R----VENE 196 (485) T ss_pred CCcccccccCCCeeEEEEEccceeEEEEcCCCCceeEEEEEEEeeCCCeEEEEEEEeCCeEE---EEE---E----cCCc Confidence 5332 2577788888875532 235444444222211111 12222221100 010 0 0123 Q ss_pred CeeecC-Cce----EEEEe-cCc---cccchhhHHHHHHhHHHHHHHHHHH-hhHhhhhhceeeeeeeccc---cCCCCC Q lcl|NC_015263. 222 WYEIQD-KNS----ICIKI-NES---SLTPVPPFAGTFDSIYDIHSFKDLR-NDKAELQNYKLLIQKLETR---SSNDNN 288 (513) Q Consensus 222 W~~L~~-~kt----~~ik~-~~~---~~~~ip~f~~v~~d~~di~~~kdL~-~~~~~i~n~~ii~~kip~~---~~n~~~ 288 (513) |..... +|. .++.+ +.. .++|.+=+..-+.++.|.. .... +.....+- ...|.. +...++ T Consensus 197 ~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~i~~~v~~liDa~--~~~~s~~~~~~~~-----~a~p~~~i~G~~~~~ 269 (485) T protein:vir:10 197 WQEWFNNPHGLGVVPVVPIPNRTRLSDLYGTSEITPELRSMTDAA--ARILMLMQATAEL-----MGVPQRLIFGIKPEE 269 (485) T ss_pred eEEeccccCCCCcccEEEeccccccCCCCCccchhHHHHHHHHHH--HHHHHHHHHHHHh-----hcchHHHHhcCCccc Confidence 422211 111 22222 222 2234443433333333322 2211 11111111 112211 001111 Q ss_pred ccccCHHHHHHHHHHHHH--hccccceEEEecccccccccccccc--cchhhhhhHHhhhhhhhhhhhhccCCCc---ch Q lcl|NC_015263. 289 DFTLDMPMMNYFHEALSM--TVPDNVGVVTSPMEIDTVSFDKDSS--TDDSVEKATKNFWDNAGVSQILFSSDNK---TS 361 (513) Q Consensus 289 ~~~vd~~~~~~~~~~ik~--~Lp~gv~~v~sP~~~d~i~ld~~~~--~~dtv~~~~~~i~~~~GiS~~Lfn~d~~---s~ 361 (513) ....+......+...... ++|++ +.....|+.... --+.+.....++....++..-.|++... |+ T Consensus 270 ~~~~~~~~~~~~~~~~~~i~~~~~~--------d~k~~q~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~Sg 341 (485) T protein:vir:10 270 IGVDPETGQTLFDAYLARILAFEDA--------EGKIQQFSAAELANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASA 341 (485) T ss_pred ccccccccchhhhhcccceeccCCC--------CceEEeecccchHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHH Confidence 111122211111111111 11221 111112222111 1123455556666777777778876643 44 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH----HHHHHHhhc-ccc-----eEEEEEecCCCCccHHHHHHHHHHHHhcC--CcHH Q lcl|NC_015263. 362 QGIAMSIATDEQFIFGVINQLER----WLNRYLLLN-GMS-----KYFKATMLEVTHFSKKEAHDRYITDAQYG--FPVK 429 (513) Q Consensus 362 ~~~~~SI~~d~~~~~~~~~~iE~----~~N~~i~~~-~~~-----~~f~~~~l~~T~fn~ke~~~~~~~~~~~G--~~~~ 429 (513) ..++.....-...+-...+.+.. .+-..+... ..+ ...++.|-+..+-|..+.++.+.++.+-| ..+. T Consensus 342 ~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~l~~~~~~~~~~~~~~~~i~v~w~~~~~~~~~~~ada~~kl~~ag~~~~s~ 421 (485) T protein:vir:10 342 EAIRAAESRLIKKVERKNSIFGGAWEEAMRLAYRMMKGGDVPPDMLRMETVWRDPSTPTYAAKADAASKLYNGGTGVIPR 421 (485) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCH Confidence 45666555444444433333333 222222211 111 25788999999999999999999998755 6667 Q ss_pred HHHHHHhCCCHHHHHHHHHHHH-HhhCcccccCcccccccccccccccCCccccCCCCcCCCCcccccccCCC Q lcl|NC_015263. 430 VYLASLMGIDPVAFTGLLKVEN-EMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETTGNKDSDE 501 (513) Q Consensus 430 ~~laa~~G~~p~~~~~~~~~E~-e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~~n~~~~~ 501 (513) ..+...+|+++.++-.+...+. +... ...+.-.....+. +.+ .+ .+.++.|..++..++..|. T Consensus 422 et~~~~lg~~~~~~~~~~~~~ee~~~~-~~~~~~~~~~~~~-~~~---~~----~~~~~~~~~~~~~~~~~~~ 485 (485) T protein:vir:10 422 ERARKDMGYSIAEREEMRRWDEEEAAM-GLGLIGTMVDPNP-TVP---GS----PSPAPAPKPAALESGGDAA 485 (485) T ss_pred HHHHHhCCCCHhHHHHHHHHHHHHHHH-HHHHHHHhhccCC-CCC---CC----CCccccccCcCCCCCCCCC Confidence 7777789999998765543322 2111 0000001111110 110 00 0111111111111111111 No 130 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=93.39 E-value=0.0077 Score=32.16 Aligned_cols=413 Identities=11% Similarity=0.039 Sum_probs=169.9 Q ss_pred ccchHHHHHHHhhhccChhHHHHHHHHHHHHHhh--------------------cchHHHHHHHHhhcccccceEeeccc Q lcl|NC_015263. 47 LTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQ--------------------SQQYQRLLNFYANMPLYAYSVVPFKD 106 (513) Q Consensus 47 ~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~--------------------sg~~~rlidy~~~mpt~dY~I~P~~~ 106 (513) .++-++.++..++.| ......++.+-.|+... .+.-+.++|..+.-..++.+..| T Consensus 1 ~~t~~~~i~~L~~~~--~~~~~r~~~l~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~--- 75 (480) T protein:vir:78 1 MTTYHEHVERLQGLL--ARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRIS--- 75 (480) T ss_pred CCCHHHHHHHHHHHH--HHHHHHHHHHHHHHhccccccccccccchhHhhhhhhcchHHHHHHHHHhhhccCceecC--- Confidence 233444444444443 33333444444443332 23445555665555555544332 Q ss_pred hhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcceeEEEEE------cCcc-eeeeecCcceeEEEEE--ECC Q lcl|NC_015263. 107 ISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDIFYGYVID------DKES-VMIQQFPNDICKISSV--SGG 177 (513) Q Consensus 107 ~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~~~gy~i~------d~~~-~~iq~lp~dyckIsg~--~nG 177 (513) . .....+... ..++.-++...+..+.+.+++.|..|.+... |.++ ..+..+++..|.++-= ..+ T Consensus 76 -~---d~~~~~~l~---~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~g~~~i~~~~p~~~~~~~D~~~~~ 148 (480) T protein:vir:78 76 -E---DSEGLEELW---NWWQANDLDEESVLGHDDSLTFGRSYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTR 148 (480) T ss_pred -C---CchhHHHHH---HHHHhcCHHHHHHHHHHHHhhcCceEEEEecCccccCCCCCeeEEEEEcccceEEEEcCCCcc Confidence 1 112223333 3356667888999999999999998877652 2233 4678888988876542 234 Q ss_pred eeEEEEEeeeccC----cc--hhccc-cHHHHHHHHHHhhhhhccCcccccCeeecC--Cc----eEEEEe-cC---ccc Q lcl|NC_015263. 178 VYNYVIDLDALVS----AD--IVDYY-PKEIQEAVNKYTTMKKGNNKSASNWYEIQD--KN----SICIKI-NE---SSL 240 (513) Q Consensus 178 ~y~~~fD~syFd~----~~--~L~~~-p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~--~k----t~~ik~-~~---~~~ 240 (513) ...+++ .+... .. ....| |.++.. |. ........|..... +| -.++.+ +. +.+ T Consensus 149 ~~~~~i--~~~~~~~~~~~~~~~~~y~~~~~~~----~~----~~~~~~~~~~~~~~~~~~~~g~vPvv~f~n~~~~~~~ 218 (480) T protein:vir:78 149 RVTRAV--RLYTTRDDVAVPDRATLYLPDETVP----LR----RNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNR 218 (480) T ss_pred ceEEEE--EEEEeecCCCceEEEEEEeCCeEEE----EE----ecCCCccccccccccccCCCCCcceEEeecccccCCc Confidence 555543 22211 10 11111 211111 10 00111112322111 01 111222 22 223 Q ss_pred cchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccc--cCCCCCccccCH--HHHHHHHHHHHHhccccceEEE Q lcl|NC_015263. 241 TPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETR--SSNDNNDFTLDM--PMMNYFHEALSMTVPDNVGVVT 316 (513) Q Consensus 241 ~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~--~~n~~~~~~vd~--~~~~~~~~~ik~~Lp~gv~~v~ 316 (513) +|++-+..-+.++.|..+.--. +....++- ...|.. .|-+..++..+. .........+ -+++.+-+ T Consensus 219 ~G~s~i~~~v~~l~Da~~~~~s-~~~~~~~~-----~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~--- 288 (480) T protein:vir:78 219 YGRSEISPELRKVTDAASRTLM-NLQSASQI-----LGTPLRVISGVTTDELTNDGENTTLDIYYGRI-LTLASEAA--- 288 (480) T ss_pred cCcccchhhHHHHHHHHHHHHH-HHHHHHHh-----hcchhhhhhcCCccccccccccchhhhhhhhh-ccCCCCCc--- Confidence 4544444333333332221111 11111110 011100 011111111110 0011111111 01121111 Q ss_pred eccccccccccccc--ccchhhhhhHHhhhhhhhhhhhhccCCCc---chHHHHHHHHHHHHHHHHHHHHHHHHHH---- Q lcl|NC_015263. 317 SPMEIDTVSFDKDS--STDDSVEKATKNFWDNAGVSQILFSSDNK---TSQGIAMSIATDEQFIFGVINQLERWLN---- 387 (513) Q Consensus 317 sP~~~d~i~ld~~~--~~~dtv~~~~~~i~~~~GiS~~Lfn~d~~---s~~~~~~SI~~d~~~~~~~~~~iE~~~N---- 387 (513) +-..++... .--+.+.....+|+...|+...-|++.+. |+.+++.....-...+....+.+..-+. T Consensus 289 -----~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~n~~Sg~Alk~~~~~l~~ka~~~~~~f~~~l~~~~~ 363 (480) T protein:vir:78 289 -----KISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMR 363 (480) T ss_pred -----eEEecCccCHHHHHHHHHHHHHHHhcccCCChHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 112222211 11244666667777778888888877653 4444555444444444433333222222 Q ss_pred HHHh--hcccc---eEEEEEecCCCCccHHHHHHHHHHHHhcC--CcHHHHHHHHhCCCHHHHHHHHHHHHHh-hCcccc Q lcl|NC_015263. 388 RYLL--LNGMS---KYFKATMLEVTHFSKKEAHDRYITDAQYG--FPVKVYLASLMGIDPVAFTGLLKVENEM-LDLPEI 459 (513) Q Consensus 388 ~~i~--~~~~~---~~f~~~~l~~T~fn~ke~~~~~~~~~~~G--~~~~~~laa~~G~~p~~~~~~~~~E~e~-L~l~~~ 459 (513) .++. ..... ...++.|-+..+-|..+.++.+.++.+-| .-+...+...+|+++.+.-.+.+.+.+. -+.-+. T Consensus 364 l~~~~~g~~~~~~~~~i~v~f~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~lg~~~d~~~~~~~~~~e~~~~~~~~ 443 (480) T protein:vir:78 364 IAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDT 443 (480) T ss_pred HHHHHcCCCccccceeeeEEecCCCCCCHHHHHHHHHHHHHhccccCCHHHHHhcCCCCHhHHHHHHHHHHHHHHHHHHH Confidence 2221 11111 24777888888889999999999998877 2235556667999998876655442221 011111 Q ss_pred cCcccccccccccccccCCccccCCCCcCCCCcccccccCCCCCCCCCC Q lcl|NC_015263. 460 MTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETTGNKDSDETQRAKDK 508 (513) Q Consensus 460 ~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~~n~~~~~~~~~~d~ 508 (513) +. ..+..+ ....+....++.|+ + .....++.+|++-+ T Consensus 444 ~~------~~~~~~---~~~~~~~~~~~~~~-~--~~~~~~~~~~~~~~ 480 (480) T protein:vir:78 444 LY------STTKAQ---ADATPKPTVTETKT-E--TQTSPSGFNRTKTR 480 (480) T ss_pred hh------cccccc---CCCCCCCCCCCCCC-c--cccccCCCCcccCC Confidence 11 000110 00000011111111 1 11111122222111 No 131 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=93.38 E-value=0.0078 Score=32.15 Aligned_cols=417 Identities=11% Similarity=0.055 Sum_probs=162.0 Q ss_pred hHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhc----------------- Q lcl|NC_015263. 19 SYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQS----------------- 81 (513) Q Consensus 19 ~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~s----------------- 81 (513) -|--+ +-++=+++|.-..+. ..-...+.++++|..| ......++.+-+|+.... T Consensus 1 ~~~~~-~~~~~~~~~~~~~~~------~~~~~~~~i~~~i~~~--~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~ 71 (472) T protein:vir:93 1 MYPSQ-PTQTEIFDAIVRTNN------KPETLEEMIVRYIKQH--LEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDP 71 (472) T ss_pred CCCCC-CcchhhhhceeeecC------chhhHHHHHHHHHHHH--HHHHHHHHHHHHHhccccccccccchhhccccccc Confidence 00000 001112222222222 1111222233333332 334445555555544432 Q ss_pred ---------chHHHHHHHHhhcccccceE-eeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcceeEE Q lcl|NC_015263. 82 ---------QQYQRLLNFYANMPLYAYSV-VPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDIFYGY 151 (513) Q Consensus 82 ---------g~~~rlidy~~~mpt~dY~I-~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~~~gy 151 (513) +..+.+++-.++ |.+ .|+. ...+++-..++++ .++. -++...+..+.+.+++.|..|.+ T Consensus 72 ~~~~~ri~~n~~~~ivd~~~~-----~l~g~~~~---~~~~d~~~~~~l~--~~~~-n~~~~~~~~~~~~~~~~G~~~~~ 140 (472) T protein:vir:93 72 LKPDDRMITNFHANLVDQKVS-----YIVGKPIA---FKHTDDEVVKRID--EVLG-NRFDDKLHSVLTGASNKGIEWLH 140 (472) T ss_pred cccccccccchHHHHHHHHhh-----hhcccCee---eccCChHHHHHHH--HHHh-ccHHHHHHHHHHHHhhcCeEEEE Confidence 222222222222 111 1100 0001111112222 2232 36777888899999999999999 Q ss_pred EEEcCcc-eeeeecCcceeEEEEE--ECCeeEEEEEeeeccCcchhccccH-HHHHHHHHHhhhhhc--cCcccccCeee Q lcl|NC_015263. 152 VIDDKES-VMIQQFPNDICKISSV--SGGVYNYVIDLDALVSADIVDYYPK-EIQEAVNKYTTMKKG--NNKSASNWYEI 225 (513) Q Consensus 152 ~i~d~~~-~~iq~lp~dyckIsg~--~nG~y~~~fD~syFd~~~~L~~~p~-Ei~~~y~~Y~~~k~~--~~~~~~~W~~L 225 (513) ...|.++ +.+..++|+.|.++-- ..+.+.+++=...-+.....+.|.+ ++.. | .+...... -......|..- T Consensus 141 v~~d~d~~~~i~~~~p~~~~~i~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~~~~-~-~~~~~~~~~~~~~~~~~~~~~ 218 (472) T protein:vir:93 141 PYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKVTVNY-Y-VYENGSLIPDYSNNLENSKTH 218 (472) T ss_pred EEECCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEEeecceeEEEEecCeEEE-E-EEecCeeeecccccccccccc Confidence 8887664 5677789988888753 2355666543222112222222211 1100 0 00000000 00000112111 Q ss_pred cCCce----EEEEecCccccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeec-cccCCCCCccccCHHHHHHH Q lcl|NC_015263. 226 QDKNS----ICIKINESSLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLE-TRSSNDNNDFTLDMPMMNYF 300 (513) Q Consensus 226 ~~~kt----~~ik~~~~~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip-~~~~n~~~~~~vd~~~~~~~ 300 (513) ...+. .++.+- ...++++-|.. ..++.|..+.- +-.....++....-+-.+. + ...+.+++ T Consensus 219 ~~~~~~~~vPvv~~~-nn~~g~s~~e~-v~~liDa~~~~-~s~~~~~~~~~~~~~~~~~g~-~~~~~~~~---------- 284 (472) T protein:vir:93 219 FSTGSWGKIPFIPFK-NNDLEISDIFM-YKTLIDAYNRR-LSDLSNTFKDSNELTYVLTNY-DDQELPEF---------- 284 (472) T ss_pred cccCCCCCcceEEec-CCCCCCCchhh-hHHHHHHHHHH-HHHHHHHHHHhcCceeEeecC-Ccccchhh---------- Confidence 11111 112221 12344444443 23333222211 1122222322211111111 0 00111222 Q ss_pred HHHHHH--h--ccccce--EEEecccccccccccccccchhhhhhHHhhhhhhhhhhhhccCC--CcchHHHHHHHHHHH Q lcl|NC_015263. 301 HEALSM--T--VPDNVG--VVTSPMEIDTVSFDKDSSTDDSVEKATKNFWDNAGVSQILFSSD--NKTSQGIAMSIATDE 372 (513) Q Consensus 301 ~~~ik~--~--Lp~gv~--~v~sP~~~d~i~ld~~~~~~dtv~~~~~~i~~~~GiS~~Lfn~d--~~s~~~~~~SI~~d~ 372 (513) ...++. + ++++-+ .++.+.+. ..-...++-..++|+.-+|+-..-+.+. +.|+..++.....-. T Consensus 285 ~~~~~~~~~~~~~~~~~~~~l~~~~~~--------~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~ 356 (472) T protein:vir:93 285 KRLLRYYGAIKVSDNGGVDTIQVEVPV--------ENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLN 356 (472) T ss_pred HHHHhhccccccCCCCcceeEeecCCH--------HHHHHHHHHHHHHHHHHhCCCCCCccccccCchHHHHHHHHHHHH Confidence 111111 1 233322 22222111 1111234445567777777765554332 334444443333222 Q ss_pred HHHHHHHHH----HHHHHHHHHhhccc---ceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCC--CHHHH Q lcl|NC_015263. 373 QFIFGVINQ----LERWLNRYLLLNGM---SKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMGI--DPVAF 443 (513) Q Consensus 373 ~~~~~~~~~----iE~~~N~~i~~~~~---~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~--~p~~~ 443 (513) ..+-..... |++.+..++..... ...+.+.|-+..+-|..+.++.+.++++. .|....|. .+|+ +|.+. T Consensus 357 ~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~i~v~f~~~~p~~~~~~~~~~~k~~gi-is~et~l~-~l~~~~d~~~E 434 (472) T protein:vir:93 357 LKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQTAQQSMGI-VSHETVLE-NHPFVEDLQAE 434 (472) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhCCCcccceeeEEeCCCCCCCHHHHHHHHHHHhcc-CchHHHHH-hCCCCCCHHHH Confidence 222222222 22222222222111 22588999999999999999999998532 66555554 5676 58888 Q ss_pred HHHHHHHHHhhCcccccCcccccccccccccccCCccccCCCCcCCCCc Q lcl|NC_015263. 444 TGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNE 492 (513) Q Consensus 444 ~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~e 492 (513) +.+...|++. .....++ ...++.+ +.+...+.+..+.| T Consensus 435 ~~ri~~E~~~-----~~~~~~~-~~~~~~d-----~~~~~~~~~~~~~e 472 (472) T protein:vir:93 435 LERIEQEQME-----YNKQLPN-LDDGGAD-----GAQQQERSNNKESE 472 (472) T ss_pred HHHHHHHHHH-----HHHhccC-cCcccCC-----CCCCCCCCCcccCC Confidence 9998888743 1222221 1111111 00001111111111 No 132 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=93.35 E-value=0.0079 Score=32.11 Aligned_cols=441 Identities=12% Similarity=0.108 Sum_probs=173.5 Q ss_pred eeehhhhhhHHHHHHH-HHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhcchHHH--- Q lcl|NC_015263. 11 MIDVESISSYSNKRNN-RISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQQYQR--- 86 (513) Q Consensus 11 ~~~~~~~~~~~~~~~~-~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~~~r--- 86 (513) |+-|..+..++-.++. +..+=+.=|.++...+.-.-....-+.+++.|..|+ ......++++-+|+........+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~~i~~~i~~~~-~~~~~r~~~l~~Yy~g~~~i~~~~~~ 79 (511) T protein:vir:10 1 MLKVNEFETDTDLRGNINYLFNDEANVVYTYDGTESDLLQNVNEVSKCIEHHM-DYQRPRLKVLSDYYEGKTKNLVELTR 79 (511) T ss_pred CccccchhhhhhhhhhhhhhhhhhhcCCccCchhhhhcccCHHHHHHHHHHHH-HhhHHHHHHHHHHhcccCccccccCc Confidence 6656555555554443 333333333333311111111112234455555441 22234555666666554443211 Q ss_pred --------------HHHHHhhcccccceE-eeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcceeEE Q lcl|NC_015263. 87 --------------LLNFYANMPLYAYSV-VPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDIFYGY 151 (513) Q Consensus 87 --------------lidy~~~mpt~dY~I-~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~~~gy 151 (513) ...++++..+ .|.+ .|..- ....+...+ .+..++..-++...+..+.+.+++.|..|.+ T Consensus 80 ~~~~~~~~~ki~~n~~k~Iv~~~~-~yl~g~p~~~--~~~d~~~~~---~l~~~~~~n~~~~~~~~~~~~~~i~G~ay~~ 153 (511) T protein:vir:10 80 RKEEYMADNRVAHDYASYISDFIN-GYFLGNPIQY--QDDDKDVLE---AIEAFNDLNDVESHNRSLGLDLSIYGKAYEI 153 (511) T ss_pred ccccccCcceeecchHHHHHHHHh-hhhcccCcee--ecCchHHHH---HHHHHHhhcCHHHHHHHHHHHHHhcCeeEEE Confidence 1112222111 1111 11000 001111222 2233345556888889999999999999998 Q ss_pred EEEcCcc-eeeeecCcceeEEEEEEC--CeeEEEEEeeeccCc--------c--hhccc-cHHHHHHHHHHhhhhhccCc Q lcl|NC_015263. 152 VIDDKES-VMIQQFPNDICKISSVSG--GVYNYVIDLDALVSA--------D--IVDYY-PKEIQEAVNKYTTMKKGNNK 217 (513) Q Consensus 152 ~i~d~~~-~~iq~lp~dyckIsg~~n--G~y~~~fD~syFd~~--------~--~L~~~-p~Ei~~~y~~Y~~~k~~~~~ 217 (513) ...|.++ +-+..++|.-|.++.-.. +.+.+++ +|.... . ..+-| +..+. .|.... .... T Consensus 154 vy~dedg~~~i~~~~p~~~~~vydd~~~~~~~~~v--r~~~~~~~d~~~~~~~~~~~iyt~~~i~----~~~~~~-~~~~ 226 (511) T protein:vir:10 154 MIRNQDDETRLYKSDAMSTFVIYDNTIERNSIAGV--RYLRTKPIDKTDEDEVFTVDLFTSHGVY----RYLTSR-TNGL 226 (511) T ss_pred EEeCCCCceEEEEEccceeEEEEcCCCCCceEEEE--EEEEeeecccCccceEEEEEEEeCCcEE----EEEecC-CCcc Confidence 8877654 677778888888775422 3344443 221110 0 01111 11110 010000 0000 Q ss_pred ccccCe----eecCCceEEEEecCccccchhhHHHHHHhHHHHHHHHH-----HHhhHhhhhhceeee--eeeccccCCC Q lcl|NC_015263. 218 SASNWY----EIQDKNSICIKINESSLTPVPPFAGTFDSIYDIHSFKD-----LRNDKAELQNYKLLI--QKLETRSSND 286 (513) Q Consensus 218 ~~~~W~----~L~~~kt~~ik~~~~~~~~ip~f~~v~~d~~di~~~kd-----L~~~~~~i~n~~ii~--~kip~~~~n~ 286 (513) ...-|. ..+...-.++.+- ....+.+-| .++..+.+.=+ +-+......+ .+++ ...+. T Consensus 227 ~~~~~~~~~~~~~~~~vPvv~f~-nn~~g~gd~----e~v~~liDa~d~~~S~~~~~~~~~~~-~~lv~~g~~~~----- 295 (511) T protein:vir:10 227 KLTPRENGFESHSFERMPITEFS-NNERRKGDY----EKVITLIDLYDNAESDTANYMSDLND-AMLLIKGNLNL----- 295 (511) T ss_pred cccccccccccccCcceeEEEec-CCCCCCCch----hhhHHHHHHHHHHHHHHHHHHHHhhC-ceeeeeccccC----- Confidence 000000 0111111123331 122343333 33333333222 1122222222 2222 11111 Q ss_pred CCccccCHHHHHHHHHHHHHhc-------cccceEEEeccccccccccccc-ccchhhhhhHHhhhhhhhhhhhhccCCC Q lcl|NC_015263. 287 NNDFTLDMPMMNYFHEALSMTV-------PDNVGVVTSPMEIDTVSFDKDS-STDDSVEKATKNFWDNAGVSQILFSSDN 358 (513) Q Consensus 287 ~~~~~vd~~~~~~~~~~ik~~L-------p~gv~~v~sP~~~d~i~ld~~~-~~~dtv~~~~~~i~~~~GiS~~Lfn~d~ 358 (513) +.....+....-.-.+ +.+++..-. .++.-+.-+.+. .....++...++|+.-+++..+-+.+-+ T Consensus 296 ------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~ 368 (511) T protein:vir:10 296 ------DPVEVRKQKEANVLFLEPTVYADSEGRETEGS-VDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS 368 (511) T ss_pred ------CchhhccchhccceecccccccccccccCCCC-cceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccc Confidence 1111111100000000 111100000 011001001001 1112344455778888887766554433 Q ss_pred c--chHHHHHHHHHHHHHHHHH----HHHHHHHHHH---HHhh-cccc--e---EEEEEecCCCCccHHHHHHHHHHHHh Q lcl|NC_015263. 359 K--TSQGIAMSIATDEQFIFGV----INQLERWLNR---YLLL-NGMS--K---YFKATMLEVTHFSKKEAHDRYITDAQ 423 (513) Q Consensus 359 ~--s~~~~~~SI~~d~~~~~~~----~~~iE~~~N~---~i~~-~~~~--~---~f~~~~l~~T~fn~ke~~~~~~~~~~ 423 (513) + |+.+++.....-.+.+..- =+.|++-++. ++.. .... . ..++.|-+..+-|..+.++.+.++. T Consensus 369 ~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~- 447 (511) T protein:vir:10 369 GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSG- 447 (511) T ss_pred ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcccccccceeeEEeCCCCCcCHHHHHHHHHHHh- Confidence 3 4444443333222222222 2222222222 2222 1111 1 4789999999999999999999985 Q ss_pred cC-CcHHHHHHHHhCC--CHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccCCCCcCCCCcccccccCC Q lcl|NC_015263. 424 YG-FPVKVYLASLMGI--DPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETTGNKDSD 500 (513) Q Consensus 424 ~G-~~~~~~laa~~G~--~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~~n~~~~ 500 (513) | .|....+ ..+++ +|.+.+.++..|++. .+.....+ + ++++ .+.++|.+..++..+.+.. T Consensus 448 -G~iS~et~~-~~l~~v~d~~~E~~ri~~E~~~-~~~~~~~~-----~--~~~~-------~~~~~~~~~~~~~~~~~~~ 510 (511) T protein:vir:10 448 -GKISQTTLM-SLFSFFQDPELEVKKIEEDEKE-SIKKAQKG-----I--YKDP-------RDINDDEQDDDTKDTVDKK 510 (511) T ss_pred -ccCcHHHHH-HhCCCCCCHHHHHHHHHHHHHH-HHHHHhhh-----c--ccCC-------CCCCCCCCCCcccCccccc Confidence 6 5555544 45776 678899999998754 11111111 1 1110 0111111211111111111 Q ss_pred C Q lcl|NC_015263. 501 E 501 (513) Q Consensus 501 ~ 501 (513) + T Consensus 511 ~ 511 (511) T protein:vir:10 511 E 511 (511) T ss_pred C Confidence 1 No 133 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=93.20 E-value=0.0084 Score=31.96 Aligned_cols=414 Identities=9% Similarity=0.006 Sum_probs=168.0 Q ss_pred HHhhccCcccc-c-cc--cc--ccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhc------------------chHHH Q lcl|NC_015263. 31 LRDDNRTPVFG-A-PV--GS--LTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQS------------------QQYQR 86 (513) Q Consensus 31 ~~~~~~~~~~~-s-~~--~s--~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~s------------------g~~~r 86 (513) .+|||.+--.. . .. +. .....+.+++.|..| -......++.+-+|+.... +..+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~-~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~ki~~n~~~~ 79 (470) T protein:vir:99 1 MKDINYGRDKVTGNSSFIFPKGEKLTSNELLGFIAYN-ETVLKPRYRENMKLYLGKHKILTAPEKETGADNRIVVNSAKY 79 (470) T ss_pred CccccCCcccccCCceEEeCCCCCcCHHHHHHHHHHH-HHhhHHHHHHHHHHhccccccccCcccccCCcceeecchHHH Confidence 44444433000 0 00 00 011122334444433 1112234555666655432 22333 Q ss_pred HHHHHhhcccccceE-eeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcceeEEEEEcCcc-eeeeec Q lcl|NC_015263. 87 LLNFYANMPLYAYSV-VPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDIFYGYVIDDKES-VMIQQF 164 (513) Q Consensus 87 lidy~~~mpt~dY~I-~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~~~gy~i~d~~~-~~iq~l 164 (513) +++-.++ |.+ .|..-. ...++.. ...+..++..-++...+..+.+.+++.|..|.+..-+.++ +-+..+ T Consensus 80 Ivd~~~~-----~l~g~p~~~~-~~~d~~~---~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~dg~~~i~~~ 150 (470) T protein:vir:99 80 VVDVYNG-----YFCGIEPKLA-LLNDSSK---IDEIARWNRQENFFDTINEISKQCDIFGRSIASIYQGEDARPHLMYS 150 (470) T ss_pred HHHHHhh-----hhccCCeeEe-eCCchhH---HHHHHHHHHhcCHhHHHHHHHHHHHhcCeeEEEEEeCCCCeEEEEEE Confidence 3332222 221 121100 0111122 2234455667788999999999999999999988776655 567889 Q ss_pred CcceeEEEEEECC--eeEEEEEeeeccCcch----hccc-cHHHHHHHHHHhhhh-----hccCcccccCeeecCCceEE Q lcl|NC_015263. 165 PNDICKISSVSGG--VYNYVIDLDALVSADI----VDYY-PKEIQEAVNKYTTMK-----KGNNKSASNWYEIQDKNSIC 232 (513) Q Consensus 165 p~dyckIsg~~nG--~y~~~fD~syFd~~~~----L~~~-p~Ei~~~y~~Y~~~k-----~~~~~~~~~W~~L~~~kt~~ 232 (513) +|+-|.++--..+ .+.+++=.-..+.... ..-| +..+- .|.... .........|=.+| + T Consensus 151 ~p~~~~~i~d~~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~g~vP-----v 221 (470) T protein:vir:99 151 SPNHAFIIYDDTVQRQPLAFVHYQIDNSNNWTDAYGVIQYADKFY----KFKGYDIEEDTNAAGYAINPYGLVP-----A 221 (470) T ss_pred ccceeEEEEcCCCCcceEEEEEEEEEecCCeeEEEEEEEecCeEE----EEEecccccccccccccccCCCccc-----e Confidence 9998887743222 2444432222111111 1101 11100 000000 00000000011111 1 Q ss_pred EEecCccccchhhHHHHHHhHHHHHHHHHHH-hhHhhhhhceeeeeeec-c-ccCCCCCccccCHHHHHHHHHHHHHhcc Q lcl|NC_015263. 233 IKINESSLTPVPPFAGTFDSIYDIHSFKDLR-NDKAELQNYKLLIQKLE-T-RSSNDNNDFTLDMPMMNYFHEALSMTVP 309 (513) Q Consensus 233 ik~~~~~~~~ip~f~~v~~d~~di~~~kdL~-~~~~~i~n~~ii~~kip-~-~~~n~~~~~~vd~~~~~~~~~~ik~~Lp 309 (513) +.+ -....+++-|.. +.+++|..+ .+. +....++..+--.-.++ + .+.+++|++..++..... -.+| T Consensus 222 v~~-~n~~~g~sd~e~-v~~liDa~~--~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~g~~~~~~~~~~~------~~~~ 291 (470) T protein:vir:99 222 VEF-FENEERQGIFDS-IKTLINALD--KVISQKANQVEYFDNAYMYMIGFKLPEDDEGNPKFDFKNNRV------LYVS 291 (470) T ss_pred Eee-cCCCCCCcchHh-HHHHHHHHH--HHHHHHHHHHHHhcCceeeeecCCcccccccchhhhhhhcce------eeec Confidence 222 112234444433 233322222 111 22222222211111122 0 012333443322111000 0011 Q ss_pred ccceEEEeccccccccccccc-ccchhhhhhHHhhhhhhhhhhhhccCCCc--chHHHHHHHHHHHHHHHH----HHHHH Q lcl|NC_015263. 310 DNVGVVTSPMEIDTVSFDKDS-STDDSVEKATKNFWDNAGVSQILFSSDNK--TSQGIAMSIATDEQFIFG----VINQL 382 (513) Q Consensus 310 ~gv~~v~sP~~~d~i~ld~~~-~~~dtv~~~~~~i~~~~GiS~~Lfn~d~~--s~~~~~~SI~~d~~~~~~----~~~~i 382 (513) . .+.-..| ++.-+.-+... .....++-..++|+.-+|+-...+.+..+ |+..++.....-.+.+.. |-+.| T Consensus 292 ~-~~~~~~~-~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l 369 (470) T protein:vir:99 292 Q-LDPDTNP-QIGFIAKPDADQMQENLIQHLTDFIFMMAMVPNIQDKNFAGNSSGVALQYKLFAMKNKADSKERKFDKSL 369 (470) T ss_pred C-CCCCCCC-cceEEeecCChHHHHHHHHHHHHHHHHHhCCccccccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0 0000000 01111101001 11123555667788888887666655433 444444433222222222 22222 Q ss_pred HH---HHHHHHhhcc-cc---eEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHh-h Q lcl|NC_015263. 383 ER---WLNRYLLLNG-MS---KYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENEM-L 454 (513) Q Consensus 383 E~---~~N~~i~~~~-~~---~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~-L 454 (513) ++ .+...+.... .. ..+.+.|-+..+.|..+.++.+.++.+. .|....|.-+-|++|.+.+.+++.|++. + T Consensus 370 ~~~~~li~~~~~~~~~~~~~~~~i~v~f~~~~p~~~~e~a~~~~kl~gi-is~et~l~~l~~vd~~~E~eri~~E~~~~~ 448 (470) T protein:vir:99 370 MQLYRIVLATLFNNKQDQELWSELDFKFTRNLPEDMASAIDNAKNAEGI-VSKKTQLGMIPDIEPDAEMKQIAKEKADAI 448 (470) T ss_pred HHHHHHHHHHHhccCCcccccccceEEeCCCCCcCHHHHHHHHHHHhcc-CCHHHHHHhCCCCCHHHHHHHHHHHHHHHH Confidence 22 2222232221 11 1578999999999999999999998632 7887777776677899999999988742 2 Q ss_pred CcccccCcccccccccccccccCCccccCCCCcCCCCccc Q lcl|NC_015263. 455 DLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETT 494 (513) Q Consensus 455 ~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~ 494 (513) +..... ++.. + ..++.|..+.. T Consensus 449 ~~~~~~---------~~~~-------d--~~~~d~~~ee~ 470 (470) T protein:vir:99 449 KQTQQL---------SMPI-------D--ILKRDNNAEEE 470 (470) T ss_pred HHHHhh---------cCCC-------C--cCCCCCCccCC Confidence 211111 1100 0 00111111100 No 134 >protein:vir:93867 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764264;genbank:gi:115315577;genbank:GeneID:5141561 Probab=93.17 E-value=0.0085 Score=31.93 Aligned_cols=349 Identities=15% Similarity=0.113 Sum_probs=143.8 Q ss_pred cccccccchHHH-----HHHHhhhccChhHHHHHHHHHHHHHhhcchHHHHHHHHhh-cccccceEeeccchhhhhhcch Q lcl|NC_015263. 42 APVGSLTSSQSK-----VRKIVKEYRNEGNQKTLRKVSEDLAVQSQQYQRLLNFYAN-MPLYAYSVVPFKDISTANENKL 115 (513) Q Consensus 42 s~~~s~~~s~d~-----~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~~~rlidy~~~-mpt~dY~I~P~~~~~~~~~~~~ 115 (513) =..|++..+-.. ..+.+..| .+.. .+..+..+.+.|+.+++ +..+...++--.......+... T Consensus 1 Mg~f~~~~~f~~~~~~~~~~~~~~~---~~~~--------~~~~~~~v~~~i~~Ia~~iA~lp~~~~~~~~~~~~~~~~~ 69 (378) T protein:vir:93 1 MNLFGKVVSFSRGKLNNDTQRVTAW---QNEA--------VEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLI 69 (378) T ss_pred CccchhhhhhhccccCCCcceeeec---ccch--------hHHHHHHHHHHHHHHHhhhhhCceeeEEEccccccccccc Confidence 111221111000 00000111 0000 12234456666666543 2233333321111111111111 Q ss_pred hHHHHHHHHHHh-h----cChhHHHHHHHHHHHHhcceeEEEEEc-CcceeeeecCcceeEEEEEECCeeEEEEEeeecc Q lcl|NC_015263. 116 KKELATVTEFLS-R----LNPKYNFSKIVKLAMTVDIFYGYVIDD-KESVMIQQFPNDICKISSVSGGVYNYVIDLDALV 189 (513) Q Consensus 116 ~~~y~~v~~~L~-k----~n~k~~~~~i~~~~l~~g~~~gy~i~d-~~~~~iq~lp~dyckIsg~~nG~y~~~fD~syFd 189 (513) ...-+.+...|+ + +.-..+...++..++..|..|.|.+.+ ..+..+.-+|.+ ++ T Consensus 70 ~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~i~~~~~~~~g~~~~l~~~~--------~~------------ 129 (378) T protein:vir:93 70 SMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDDNTGELLDLLFAD--------DK------------ 129 (378) T ss_pred ccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeecCCceEEEEEecC--------Ce------------ Confidence 111233444454 2 224566677888899999999887643 333222221110 00 Q ss_pred CcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEEecCccccchhhHHHHHHhHHHHHHHHHHHhhHhhh Q lcl|NC_015263. 190 SADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKINESSLTPVPPFAGTFDSIYDIHSFKDLRNDKAEL 269 (513) Q Consensus 190 ~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~~~~~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i 269 (513) .+++++.-+-|+-......++++...+...+ ...+ T Consensus 130 ---------------------------------~~~~~~diih~r~~~~~~~~~s~l~~~~~~i------------~~~~ 164 (378) T protein:vir:93 130 ---------------------------------KEYKTEELVRLTSPFYINEDTSILDNALASI------------QTKL 164 (378) T ss_pred ---------------------------------eEeccceeEEecCccccchhhHHHHHHHHHH------------HHHH Confidence 0112222222221111111233322222111 1112 Q ss_pred hhceeeeeeeccccCCCCCccccCHH----HHHHHHHHHHHhc----cccceEEEecccccccccccccccchhhhhhHH Q lcl|NC_015263. 270 QNYKLLIQKLETRSSNDNNDFTLDMP----MMNYFHEALSMTV----PDNVGVVTSPMEIDTVSFDKDSSTDDSVEKATK 341 (513) Q Consensus 270 ~n~~ii~~kip~~~~n~~~~~~vd~~----~~~~~~~~ik~~L----p~gv~~v~sP~~~d~i~ld~~~~~~dtv~~~~~ 341 (513) .+-. +...|-+ ++ .++.+ ..++|.+.++... ..|+..+-..+++..+.++.....-....-..+ T Consensus 165 ~~~~-~~g~l~~-----~~--~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~g~~~~~l~~~~~~~~~~~~~~~~~ 236 (378) T protein:vir:93 165 EQGK-LRGLLKI-----NA--FLDIDNTQEYREKALTTIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNKDEIDLIKS 236 (378) T ss_pred hcCc-ccceeee-----CC--cCCHHHHHHHHHHHHHHHHHhhcccccccceEcCCCceEEEccCChhhhhHHHHHHHHH Confidence 2211 1111111 11 13333 3344555554433 234445555556665555432222233445567 Q ss_pred hhhhhhhhhhhhccCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc-----cc----ceEEEEEecCCCCccHH Q lcl|NC_015263. 342 NFWDNAGVSQILFSSDNKTSQGIAMSIATDEQFIFGVINQLERWLNRYLLLN-----GM----SKYFKATMLEVTHFSKK 412 (513) Q Consensus 342 ~i~~~~GiS~~Lfn~d~~s~~~~~~SI~~d~~~~~~~~~~iE~~~N~~i~~~-----~~----~~~f~~~~l~~T~fn~k 412 (513) +|..+.||...+++|+.. +... ..--..-+.-++++||..+|+.|-.. .. ...|+|.+-....-+.+ T Consensus 237 ~Ia~~fgVPp~~l~g~~~--e~~~--~~f~~~tl~P~~~~ie~~l~~kLl~~~er~~~~~~~~~~~~~fd~~~l~~~d~~ 312 (378) T protein:vir:93 237 ELLTGYFMNENILLGTAT--QEQQ--IYFYNSTIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLK 312 (378) T ss_pred HHHHHhCCCHHHhcCCcH--HHHH--HHHHHHHHHHHHHHHHHHHHhhcCChhHhhhhhhcccccceeeccchhhhcCHH Confidence 899999999888876532 2222 22223334469999999999988432 11 11366666666777888 Q ss_pred HHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccCCCCcCCCCc Q lcl|NC_015263. 413 EAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNE 492 (513) Q Consensus 413 e~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~e 492 (513) +.++.+.++..-|.=..--+-+.+|+.|.+ |=|..++|+. ...-.+.+.+.+ +..++.|++| T Consensus 313 ~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~------------ggD~~~~~~n---~~~~~~~~~~~~---~~~~~~~~~e 374 (378) T protein:vir:93 313 ELIDLYHENINGPIFTQNQLLVKMGEQPIE------------GGDVYIANLN---AVAVKNLSDLQG---SRKDVTSTDE 374 (378) T ss_pred HHHHHHHHHHhCCCcCHHHHHHHhCCCCCC------------CCCeeeeccc---cccccchhhhcC---ccCCCCCCCC Confidence 999998888888743333333345666642 2233344332 211111111100 1111112222 Q ss_pred ccccccCCCCCCC Q lcl|NC_015263. 493 TTGNKDSDETQRA 505 (513) Q Consensus 493 t~~n~~~~~~~~~ 505 (513) +++. T Consensus 375 ---------~~n~ 378 (378) T protein:vir:93 375 ---------TNNQ 378 (378) T ss_pred ---------CCCC Confidence 1111 No 135 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=91.65 E-value=0.015 Score=30.61 Aligned_cols=432 Identities=12% Similarity=0.103 Sum_probs=157.8 Q ss_pred CCCccc--hheeeeehhhhhhHHHHHHHHHHHHHhhccCcccccccccccc---------------hHHHHHHHhhhccC Q lcl|NC_015263. 1 MVKNKK--KRLSMIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTS---------------SQSKVRKIVKEYRN 63 (513) Q Consensus 1 ~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~---------------s~d~~k~~i~~~~P 63 (513) |.|-.. .|. |+-.|-++ .|.+.. ..+.+.+.|+.|. T Consensus 1 ~~~~~~~~~~~-----------------------~~~~~~~~---~~~~~~~~~~~~~~~e~~~~~~~~~i~~~i~~~~- 53 (512) T protein:vir:97 1 MLKANEFETDT-----------------------DLRENRNY---LFNDEANVVYTYDGTESDLLQNINEVSKYIEHHM- 53 (512) T ss_pred CccceeccCce-----------------------eeeeCcee---eeccccccccccCchhhhhhhhHHHHHHHHHHHH- Confidence 322110 000 11111111 111111 1122333344331 Q ss_pred hhHHHHHHHHHHHHHhhcchHHHH-----------------HHHHhhcccccceE-eeccchhhhhhcchhHHHHHHHHH Q lcl|NC_015263. 64 EGNQKTLRKVSEDLAVQSQQYQRL-----------------LNFYANMPLYAYSV-VPFKDISTANENKLKKELATVTEF 125 (513) Q Consensus 64 ~~n~~~ir~~s~~lY~~sg~~~rl-----------------idy~~~mpt~dY~I-~P~~~~~~~~~~~~~~~y~~v~~~ 125 (513) ......++++.+|+........+. ..+++...+ .|.+ .|..- ...+....+ .+..+ T Consensus 54 ~~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~-~yl~g~p~~~--~~~d~~~~~---~l~~~ 127 (512) T protein:vir:97 54 DYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFIN-GYFLGNPIQC--QDDDKDVLE---AIEAF 127 (512) T ss_pred HhhHHHHHHHHHHhcccCccccccCcccccccCcceeecchHHHHHHHHh-hhhcccCcee--ccCChHHHH---HHHHH Confidence 111233555555555444332111 111111110 1111 11000 001111222 23334 Q ss_pred HhhcChhHHHHHHHHHHHHhcceeEEEEEcCc-ceeeeecCcceeEEEEEEC--CeeEEEEEeeeccCc--------c-- Q lcl|NC_015263. 126 LSRLNPKYNFSKIVKLAMTVDIFYGYVIDDKE-SVMIQQFPNDICKISSVSG--GVYNYVIDLDALVSA--------D-- 192 (513) Q Consensus 126 L~k~n~k~~~~~i~~~~l~~g~~~gy~i~d~~-~~~iq~lp~dyckIsg~~n--G~y~~~fD~syFd~~--------~-- 192 (513) +..-++...+..+.+.+++.|..|.+...+.+ .+-+..++|.-|.++--.. +.+.+++ +|.... . T Consensus 128 ~~~n~~~~~~~~~~~~~~i~G~ay~~vy~ded~~~~i~~~~p~~~~~iyd~~~~~~~~~~v--r~~~~~~~~~~~~~~~~ 205 (512) T protein:vir:97 128 NDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFVIYDNTIERNSIAGV--RYLRTKPIDKTDEDEVF 205 (512) T ss_pred HhhcCHHHHHHHHHHHHHhcCeEEEEEEeCCCCceEEEEEcccceEEEEcCCCCCceEEEE--EEEEeeeccccccceEE Confidence 55556888899999999999999998887654 4567778888888774322 3343332 221100 0 Q ss_pred hhcccc-HHHHHHHHHHhhhhhccCcccccCee----ecCCceEEEEecCccccchhhHHHHHHhHHHHHHHHHHH---- Q lcl|NC_015263. 193 IVDYYP-KEIQEAVNKYTTMKKGNNKSASNWYE----IQDKNSICIKINESSLTPVPPFAGTFDSIYDIHSFKDLR---- 263 (513) Q Consensus 193 ~L~~~p-~Ei~~~y~~Y~~~k~~~~~~~~~W~~----L~~~kt~~ik~~~~~~~~ip~f~~v~~d~~di~~~kdL~---- 263 (513) .++-|. ..+. .|.... .......-|.. .+...-.++.+- ....+. |.|.++.++.+.=+.. T Consensus 206 ~~~vyt~~~i~----~~~~~~-~~~~~~~~~~~~~~~~~~g~vPvv~~~-nn~~~~----gd~e~v~~liDa~d~~~S~~ 275 (512) T protein:vir:97 206 TVDLFTSHGVY----RYLTSR-TNGLKLTPRENGFESHSFERMPITEFS-NNERRK----GDYEKVITLIDLYDNAESDT 275 (512) T ss_pred EEEEEeCCcEE----EEEecC-CCcccccccccccccccCcccceEeec-CCCCCC----CchhhhHHHHHHHHHHHHHH Confidence 011111 1110 011000 00000000110 111111122221 122233 3344444444332211 Q ss_pred -hhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhccccceEEEeccccccccc---cccc-ccchhhhh Q lcl|NC_015263. 264 -NDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTVPDNVGVVTSPMEIDTVSF---DKDS-STDDSVEK 338 (513) Q Consensus 264 -~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~Lp~gv~~v~sP~~~d~i~l---d~~~-~~~dtv~~ 338 (513) +..+...+ .+++ +. +....+.+-.........+ .+..+.+.+.+..+.+-.=..+.+ +.+. .....++- T Consensus 276 ~~~~~~~~~-~~lv--~~-G~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~ 349 (512) T protein:vir:97 276 ANYMSDLND-AMLL--IK-GNLNLDPVEVRKQKEANVL--FLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDR 349 (512) T ss_pred HHHHHHhcC-ceee--ee-cCccCCchhhhhhhhcccc--cccccchhhcccccCCCCCcceEEEeecCCHHHHHHHHHH Confidence 11222222 1221 11 0000110000000000000 000011111110000000000111 0001 11122444 Q ss_pred hHHhhhhhhhhhhhhccCCCc--chHHHHHHHHHHHHHHHHHH----HHHHHHHHH---HHhhc-cc--ce---EEEEEe Q lcl|NC_015263. 339 ATKNFWDNAGVSQILFSSDNK--TSQGIAMSIATDEQFIFGVI----NQLERWLNR---YLLLN-GM--SK---YFKATM 403 (513) Q Consensus 339 ~~~~i~~~~GiS~~Lfn~d~~--s~~~~~~SI~~d~~~~~~~~----~~iE~~~N~---~i~~~-~~--~~---~f~~~~ 403 (513) ..++|+.-+++-.+-+.+.++ |+..++.....-.+.+-.-. +.|++.++. ++... .. .. ..++.| T Consensus 350 L~~~I~~~s~~p~~~~~~~~gn~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~~li~~~~~~~~~~~~~~d~~~i~~~f 429 (512) T protein:vir:97 350 LNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVY 429 (512) T ss_pred HHHHHHHHhCCcccCcccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccccccccceEEe Confidence 557788888877666644333 44434433332222222222 223332222 22211 11 11 368889 Q ss_pred cCCCCccHHHHHHHHHHHHhcC-CcHHHHHHHHhCC--CHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCcc Q lcl|NC_015263. 404 LEVTHFSKKEAHDRYITDAQYG-FPVKVYLASLMGI--DPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIK 480 (513) Q Consensus 404 l~~T~fn~ke~~~~~~~~~~~G-~~~~~~laa~~G~--~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~ 480 (513) -+..+-|..+.++.+.++. | .|....+ ..+|+ +|.+.+.++..|++. .+.....+. ++++ T Consensus 430 ~~~~p~~~~e~~~~~~kl~--giiS~et~~-~~l~~v~d~~~E~eri~~E~~~-~~~~~~~~~-------~~~~------ 492 (512) T protein:vir:97 430 NRNLPKSLIEELKAYIDSG--GKISQTTLM-SLFSFFQDPELEVKKIEEDEKE-SIKKAQKGI-------YKDP------ 492 (512) T ss_pred CCCCCcCHHHHHHHHHHHh--ccCchHHHH-HhCCCCCCHHHHHHHHHHHHHH-HHHHHhhcc-------cCCC------ Confidence 9999999999999999985 6 6655545 45676 688889999998754 111111111 1110 Q ss_pred ccCCCCcCCCCcccccccCCC Q lcl|NC_015263. 481 EKGKENGRPTNETTGNKDSDE 501 (513) Q Consensus 481 ~~~~~~grPt~et~~n~~~~~ 501 (513) .+.+.+.+.+++....+..+ T Consensus 493 -~~~~~~~~~~~~~~~~~~~~ 512 (512) T protein:vir:97 493 -RDINDDEQDDDTKDTVDKKE 512 (512) T ss_pred -CCCCCCCCCCCccccccccC Confidence 01111222222111111111 No 136 >protein:vir:95965 Length: 385 # NCBI annotation: ORF011 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239800;genbank:gi:66395461;genbank:GeneID:5132882 Probab=91.64 E-value=0.015 Score=30.60 Aligned_cols=361 Identities=12% Similarity=0.065 Sum_probs=158.9 Q ss_pred HHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhcchHHHHHHHHhh-cccccceE Q lcl|NC_015263. 23 KRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQQYQRLLNFYAN-MPLYAYSV 101 (513) Q Consensus 23 ~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~~~rlidy~~~-mpt~dY~I 101 (513) |.=+.++.=|. ...+.++. ...+..++.-.|-.+..+.+.|+.+++ +..+...+ T Consensus 1 Mg~f~~~f~~~-----~~~~~~~~--------------------~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~ 55 (385) T protein:vir:95 1 MGLFDSVFKRH-----SELSWMYD--------------------LEFLQDKSKKAYLKQIALNTVVEMVARTISQSEFRV 55 (385) T ss_pred CchhhhhhccC-----cccccccc--------------------hhhhhccchhhhhhhHHHHHHHHHHHHHHcccceee Confidence 11111111111 00011110 111222222334456667777877764 33444444 Q ss_pred eeccchhhhhhcchhHHHHHHHHHHhh-----cChhHHHHHHHHHHHHhcceeEEEEEcCcceeeeecCcceeEEEEEEC Q lcl|NC_015263. 102 VPFKDISTANENKLKKELATVTEFLSR-----LNPKYNFSKIVKLAMTVDIFYGYVIDDKESVMIQQFPNDICKISSVSG 176 (513) Q Consensus 102 ~P~~~~~~~~~~~~~~~y~~v~~~L~k-----~n~k~~~~~i~~~~l~~g~~~gy~i~d~~~~~iq~lp~dyckIsg~~n 176 (513) +- +. +... +.+...|.. +.-..+...++..++..|..|-+...++..+ .+..|+... .. T Consensus 56 ~~----~~---~~~~---~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~i~~~~~~~~~----~~~~~~~~~--~~ 119 (385) T protein:vir:95 56 MK----NN---TKEK---GTLYYLLNVRPNRNQNAVDFWQKFIFKLIMDNEVLVVKNDEGHFF----VADDFEKED--EL 119 (385) T ss_pred ee----cC---cccc---chHHHHHhcccCcCCCHHHHHHHHHHHHhhcCceEEEEecCCCee----ecccccccc--cc Confidence 31 11 1111 233444432 2245666778888899999887765443321 111111111 11 Q ss_pred CeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEEecCc--cccchhhHHHHHHhHH Q lcl|NC_015263. 177 GVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKINES--SLTPVPPFAGTFDSIY 254 (513) Q Consensus 177 G~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~~~~--~~~~ip~f~~v~~d~~ 254 (513) +.+.+.+.-.. +..|. .=..++...-+.|+.... ...+++|...+-.-+- T Consensus 120 ~~~~~~~~~~~-----------------~~~~~-----------~~~~~~~~eiih~~~~~~~~~~~G~s~~~~~~~~i~ 171 (385) T protein:vir:95 120 GLYSHRFTNVL-----------------VNDFE-----------FKRVFTMDDVIYLKYNNQKLDAFSLGLFEDYGEIFG 171 (385) T ss_pred ccccccceeee-----------------ecccc-----------eeeeeccccEEEecCCCCCcccccchHHHHHHHHHH Confidence 11111100000 00110 002344544555665322 2346666654433221 Q ss_pred HHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhcc----ccceEE--Eeccccccccccc Q lcl|NC_015263. 255 DIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTVP----DNVGVV--TSPMEIDTVSFDK 328 (513) Q Consensus 255 di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~Lp----~gv~~v--~sP~~~d~i~ld~ 328 (513) ........ -...+. +-++| +...++.++++.+.+.+++.+- ++-+.+ -..+++..+++.. T Consensus 172 ~~~~~~~~------~~~~~g-~l~~~-------~~~~~~~e~~~~~~~~~~~~~~g~~~~~~~i~~l~~g~~~~~l~~~~ 237 (385) T protein:vir:95 172 RMIDLQML------NNQIRG-ILKVD-------ATKFYNKEKQKELQAYIDTLFDAFQNNTIAVVPLTEGLAYEEHSNRG 237 (385) T ss_pred HHHHHHHh------cCCCce-EEEeC-------CccCCCHHHHHHHHHHHHHHhhhhhhcCCceEEcCCCceeEeecccc Confidence 11111110 011111 11122 2234667777766666666541 233333 3333444444432 Q ss_pred cc--c--c---chhhhhhHHhhhhhhhhhhhhccCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh--cccceEE Q lcl|NC_015263. 329 DS--S--T---DDSVEKATKNFWDNAGVSQILFSSDNKTSQGIAMSIATDEQFIFGVINQLERWLNRYLLL--NGMSKYF 399 (513) Q Consensus 329 ~~--~--~---~dtv~~~~~~i~~~~GiS~~Lfn~d~~s~~~~~~SI~~d~~~~~~~~~~iE~~~N~~i~~--~~~~~~f 399 (513) .. . + -.+..-..++|..+.||...+++|+..+......+ -...-+.-++.+||..+|+.|-. +..+..| T Consensus 238 ~~~~s~~d~~~~e~~~~~~~~Ia~~fgVpp~~l~~~~sn~e~~~~~--~~~~~l~P~~~~ie~~l~~~L~~~~~~~~~~~ 315 (385) T protein:vir:95 238 AAQSAQQFSELNELKKTVLTDVARMIGVPPSLVLGEMADLEKTIES--YLQFCINPLLRKIEAELNSKFFYQDEYLNDDM 315 (385) T ss_pred cccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCcCHHHHHHH--HHHHHHHHHHHHHHHHHHhhcCChhhcccceE Confidence 21 1 1 12344455789999999999998765554444444 34444557999999999997743 3234457 Q ss_pred EEEecCCCCccHHHHHHHHHHHHhcCC-cHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCcccccccccccccccCC Q lcl|NC_015263. 400 KATMLEVTHFSKKEAHDRYITDAQYGF-PVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENA 478 (513) Q Consensus 400 ~~~~l~~T~fn~ke~~~~~~~~~~~G~-~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~ 478 (513) +|..-+...-+.++.++.+.++..-|. .+=..-+ .+|+.|.+. | +=+..++|+. ...- T Consensus 316 ~fd~~~l~~~D~~~~~~~~~~~~~~g~lt~NE~R~-~~g~~p~~~------~----~gd~~~~~~n---~~~~------- 374 (385) T protein:vir:95 316 HIKVVGIDKRDPLKLSEAIDKLVASGTFTRNQVRI-MTGEEPADD------P----ELDKFIITKN---LQSA------- 374 (385) T ss_pred EEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHH-HhCCCCCCC------C----CCceeeeccc---ceec------- Confidence 776667767778888888887776663 2222222 345544321 1 2223333321 1100 Q ss_pred ccccCCCCcCCCCc Q lcl|NC_015263. 479 IKEKGKENGRPTNE 492 (513) Q Consensus 479 ~~~~~~~~grPt~e 492 (513) + ...||.=.+| T Consensus 375 ~---~~kgge~~~e 385 (385) T protein:vir:95 375 D---AFKGGESNEE 385 (385) T ss_pred c---cccCCCCCCC Confidence 0 0011100001 No 137 >protein:vir:1661 Length: 378 # NCBI annotation: unknown # Family: family:all:2379 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044950;genbank:gi:9629657;genbank:GeneID:1261302 Probab=91.41 E-value=0.016 Score=30.44 Aligned_cols=348 Identities=14% Similarity=0.093 Sum_probs=146.4 Q ss_pred cccccccchHHH-----HHHHhhhccChhHHHHHHHHHHHHHhhcchHHHHHHHHhh-cccccceEeeccchhhhhhcch Q lcl|NC_015263. 42 APVGSLTSSQSK-----VRKIVKEYRNEGNQKTLRKVSEDLAVQSQQYQRLLNFYAN-MPLYAYSVVPFKDISTANENKL 115 (513) Q Consensus 42 s~~~s~~~s~d~-----~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~~~rlidy~~~-mpt~dY~I~P~~~~~~~~~~~~ 115 (513) =..|++..+-.. ..+-+..| .+.+ +...+..+.+.|+.++. +..+...++--.......+... T Consensus 1 Mg~f~~~~~~~~~~~~~~~~~~~~~---~~~~--------~~~~~~~v~~~i~~Ia~~iA~l~~~~~~~~~~~~~~~~~~ 69 (378) T protein:vir:16 1 MNLFGKVVSFSRGKLNNDTQRVTAW---QNEA--------VEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLI 69 (378) T ss_pred CccchhhhhhhcccccCCcceeeec---ccch--------hhHHHHHHHHHHHHHHhhhhhCceeEEEEccccccccccc Confidence 123332221100 00101111 1111 11244566677776653 3333333321111111111000 Q ss_pred hHHHHHHHHHHhh-----cChhHHHHHHHHHHHHhcceeEEEEEcCcceeeeecCcceeEEEEEECCeeEEEEEeeeccC Q lcl|NC_015263. 116 KKELATVTEFLSR-----LNPKYNFSKIVKLAMTVDIFYGYVIDDKESVMIQQFPNDICKISSVSGGVYNYVIDLDALVS 190 (513) Q Consensus 116 ~~~y~~v~~~L~k-----~n~k~~~~~i~~~~l~~g~~~gy~i~d~~~~~iq~lp~dyckIsg~~nG~y~~~fD~syFd~ 190 (513) ...-+.+...|+. +....+...++..++..|..|.|.+-|....-+..+-+..++ T Consensus 70 ~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~d~~~g~~~~l~~~~~~-------------------- 129 (378) T protein:vir:16 70 SMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDDNTGELLDLLFADDK-------------------- 129 (378) T ss_pred ccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeecCCceEEEEEecCCe-------------------- Confidence 0111333444442 335577777888999999999997655432112211111110 Q ss_pred cchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEEecCccccchhhHHHHHHhHHHHHHHHHHHhhHhhhh Q lcl|NC_015263. 191 ADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKINESSLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQ 270 (513) Q Consensus 191 ~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~~~~~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~ 270 (513) .+.+.+.-+-|+......-++++...+... +...+. T Consensus 130 --------------------------------~~~~~~diih~r~~~~~~~~~s~l~~~~~~------------i~~~~~ 165 (378) T protein:vir:16 130 --------------------------------KEYKPEELVRLTSPFYINEDTSILDNALAS------------IQTKLE 165 (378) T ss_pred --------------------------------eEecccceEEecCccCccchhHHHHHHHHH------------HHHHHh Confidence 011111122233111111122222222211 111122 Q ss_pred hc--eeeeeeeccccCCCCCccccCHHHH----HHHHHHHHHhc----cccceEEEecccccccccccccccchhhhhhH Q lcl|NC_015263. 271 NY--KLLIQKLETRSSNDNNDFTLDMPMM----NYFHEALSMTV----PDNVGVVTSPMEIDTVSFDKDSSTDDSVEKAT 340 (513) Q Consensus 271 n~--~ii~~kip~~~~n~~~~~~vd~~~~----~~~~~~ik~~L----p~gv~~v~sP~~~d~i~ld~~~~~~dtv~~~~ 340 (513) +- +.+ -++| ..++.+.. ++|.+.++... ..++..+-..+++..+.++.....-....-.. T Consensus 166 ~~~~~g~-l~~~---------~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~~~~~~~~~~~~ 235 (378) T protein:vir:16 166 QGKLRGL-LKIN---------AFLDIDNTQEYREKALTTIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNKDEIDLIK 235 (378) T ss_pred cCcccee-eEeC---------CcCCHHHHHHHHHHHHHHHHHhhcccccccceEcCCCceEEEccCChhhhhHHHHHHHH Confidence 21 111 1122 12334433 34444444332 12344454555665555543222223345566 Q ss_pred HhhhhhhhhhhhhccCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc-----c----ceEEEEEecCCCCccH Q lcl|NC_015263. 341 KNFWDNAGVSQILFSSDNKTSQGIAMSIATDEQFIFGVINQLERWLNRYLLLNG-----M----SKYFKATMLEVTHFSK 411 (513) Q Consensus 341 ~~i~~~~GiS~~Lfn~d~~s~~~~~~SI~~d~~~~~~~~~~iE~~~N~~i~~~~-----~----~~~f~~~~l~~T~fn~ 411 (513) ++|..+.||...+++|+.. +... ..--..-+.-++++||..+|+.|-... . ...++|..-.....+. T Consensus 236 ~~Ia~~fgVPp~~l~g~~~--e~~~--~~f~~~tl~P~~~~ie~~l~~kLl~~~e~~~~~~~~~~~~~~f~~~~l~~~d~ 311 (378) T protein:vir:16 236 SELLTGYFMNENILLGTAS--QEQQ--IYFYNSTIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATL 311 (378) T ss_pred HHHHHHhCCCHHHhcCCch--HHHH--HHHHHHHHHHHHHHHHHHHHhhcCChhhhhhhhhcccccceeeccchhhhcCH Confidence 8899999999988877532 2222 222223344699999999999874321 1 1135566666667788 Q ss_pred HHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccCCCCcCCCC Q lcl|NC_015263. 412 KEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTN 491 (513) Q Consensus 412 ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~ 491 (513) ++.++.+.++..-|.=..--.-+.+|+.|.+ |-|..++|+- ...-.+.++.. .+..++.|++ T Consensus 312 ~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~------------ggD~~~~~~n---~~~~~~~~~~~---~~~~~~~~~~ 373 (378) T protein:vir:16 312 KELIDLYHENINGPIFTQNQLLVKMGEQPIE------------GGDVYIANLN---AVAVKNLSDLQ---GSRKDVTSTD 373 (378) T ss_pred HHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC------------CCCeEeeccc---cccccchhhhc---CccCCCCCCC Confidence 8888888888887743222233345666632 2244444433 21111100000 0111122222 Q ss_pred cccccccCCCCCCC Q lcl|NC_015263. 492 ETTGNKDSDETQRA 505 (513) Q Consensus 492 et~~n~~~~~~~~~ 505 (513) |+ ++. T Consensus 374 e~---------~ne 378 (378) T protein:vir:16 374 ET---------NNQ 378 (378) T ss_pred CC---------CCC Confidence 21 111 No 138 >protein:vir:100328 Length: 346 # NCBI annotation: capsid portal protein Q # Family: family:all:196 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655469;genbank:gi:109289937;genbank:GeneID:4157371 Probab=91.35 E-value=0.016 Score=30.39 Aligned_cols=310 Identities=12% Similarity=0.073 Sum_probs=141.7 Q ss_pred hccCcccccccccccchHH-HHHHH--hhhc-c--ChhHHHHHHHHHHHHHhhcchHHHHHHHHhhcccccceEee-ccc Q lcl|NC_015263. 34 DNRTPVFGAPVGSLTSSQS-KVRKI--VKEY-R--NEGNQKTLRKVSEDLAVQSQQYQRLLNFYANMPLYAYSVVP-FKD 106 (513) Q Consensus 34 ~~~~~~~~s~~~s~~~s~d-~~k~~--i~~~-~--P~~n~~~ir~~s~~lY~~sg~~~rlidy~~~mpt~dY~I~P-~~~ 106 (513) |.+--. ...-+. ..+.. ..-+ + |+. .|. =+.+.||+......+.+..| ++- T Consensus 1 m~~~~~-------~~~~~~~~~~~~~~~~~~~~~~p~~---~~~------------~~~~~~~~~~~~~~~~~~~pp~~~ 58 (346) T protein:vir:10 1 MKKQLR-------KNLTQNDRLQPQAQTEIFSFGDPIP---VLD------------RADILNYLECSAMYEKWYNPPMSF 58 (346) T ss_pred CCcccC-------CCCCcccccccccCeEEEecCCcce---ecC------------chhHHHHHHHhhcCCceEecCCCH Confidence 222211 011110 00000 0000 0 222 000 01244444433333333333 221 Q ss_pred h---hhhhhcch----hHHHHHHHHHHhhc-C---hhHHHHHHHHHHHHhcceeEEEEEcC--cceeeeecCcceeEEEE Q lcl|NC_015263. 107 I---STANENKL----KKELATVTEFLSRL-N---PKYNFSKIVKLAMTVDIFYGYVIDDK--ESVMIQQFPNDICKISS 173 (513) Q Consensus 107 ~---~~~~~~~~----~~~y~~v~~~L~k~-n---~k~~~~~i~~~~l~~g~~~gy~i~d~--~~~~iq~lp~dyckIsg 173 (513) . +....+.+ -...+.++..|-+. | =.++|.++..+.+..|..|.+++-+. ..+.+.++|+.||++ . T Consensus 59 ~~la~l~~~~~~h~~~i~~k~n~l~~l~~~Pn~~~t~~~f~~~~~d~ll~Gnay~~i~r~~~G~~~~L~pl~~~~v~~-~ 137 (346) T protein:vir:10 59 DGLAKSLRSSTHHESAIITKANILLSTCEVDSRYLSRRDLSSFVKDYLVFGNAYFEVVRNRLGQVQRIESPLAKYVRK-G 137 (346) T ss_pred HHHHHHHHhhhhcchhhhhhhhhHHHHHhCCCCCCCHHHHHHHHHHHHhcCCeEEEEEEcCCCcEEEEEEecCCceEE-E Confidence 1 11111111 01111222222221 1 25677888889999999999988654 457899999999996 4 Q ss_pred EECCeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEEe-c-CccccchhhHHHHHH Q lcl|NC_015263. 174 VSGGVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKI-N-ESSLTPVPPFAGTFD 251 (513) Q Consensus 174 ~~nG~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~-~-~~~~~~ip~f~~v~~ 251 (513) +.++.|.|.+... +. .=+++++.--+-|+. + ....+|+|+..+... T Consensus 138 ~~~~~~~~~~~~~---~g-----------------------------~~~~~~~~dIih~r~~~~~~~~~G~~~~~~a~~ 185 (346) T protein:vir:10 138 LEAGQFYYVPQRF---DH-----------------------------QEHEFAKGSIYHLLEPDINQDIYGLPQYLSALQ 185 (346) T ss_pred EcCCeEEEEEEcc---CC-----------------------------eEEEEecccEEEecCCCCCCCeeeccHHHHHHH Confidence 5556665543210 00 112334433444553 2 234579999998888 Q ss_pred hHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhc-cc--cceEEEec-cccccccc- Q lcl|NC_015263. 252 SIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTV-PD--NVGVVTSP-MEIDTVSF- 326 (513) Q Consensus 252 d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~L-p~--gv~~v~sP-~~~d~i~l- 326 (513) .+.-.+...+...- -..|- ....-|-+. .+..++.++++.+.+.++++- +. |-..|..| .+-+.+++ T Consensus 186 si~l~~~a~~~~~~--~~~NG-~~~~~il~~-----~d~~l~~e~~~~i~~~~~~~~g~~n~~~~~vl~~~~~~~gi~~~ 257 (346) T protein:vir:10 186 SAWLNESATLFRRK--YFLNG-AHAGFVFYM-----SDASQKQEDVENIRQQLKQSKGVGNFKNLFVHAPNGKKDGIQII 257 (346) T ss_pred HHHHHHHHHHHHHH--HHhcc-CCCceEEEe-----CCCCCCHHHHHHHHHHHHHhcCccccCceeEecCCCCccceeEE Confidence 77766666654421 12221 111111110 124478888888888887764 11 11234444 12234433 Q ss_pred --ccccccchhh---hhhHHhhhhhhhhhhhhccC---CCcchHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhhcccce Q lcl|NC_015263. 327 --DKDSSTDDSV---EKATKNFWDNAGVSQILFSS---DNKTSQGIA-MSIATDEQFIFGVINQLERWLNRYLLLNGMSK 397 (513) Q Consensus 327 --d~~~~~~dtv---~~~~~~i~~~~GiS~~Lfn~---d~~s~~~~~-~SI~~d~~~~~~~~~~iE~~~N~~i~~~~~~~ 397 (513) .....+.+.+ +...++|..+.||...+.|- .+++.+.+. ....-...-+.-++++||. +|..|-. . T Consensus 258 pis~~~~d~qf~e~k~~~~~~I~~af~VPp~llG~~~~~~~~~s~~e~~~~~f~~~~l~P~~~~iee-~n~~L~~----e 332 (346) T protein:vir:10 258 PIADVSAKDEFFNIKNVSRDDVLAAHRVPPQLMGIIPNNTGGFGNVADAAEVFFITEIEPLQERLKE-FNQWLGQ----E 332 (346) T ss_pred ecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHHHHHHHHHHHHHHHHHHHHH-HHhhccc----c Confidence 2223333333 33447899999999988863 222222222 2222222233357777775 4443321 2 Q ss_pred EEEEEecCCCCccH Q lcl|NC_015263. 398 YFKATMLEVTHFSK 411 (513) Q Consensus 398 ~f~~~~l~~T~fn~ 411 (513) .++|.--.+.-+++ T Consensus 333 ~i~F~~~~ll~~~~ 346 (346) T protein:vir:10 333 VIKFKPSKLLQRTQ 346 (346) T ss_pred eeeechhhhcccCC Confidence 34444333333333 No 139 >protein:vir:94869 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762515;genbank:gi:115304214;genbank:GeneID:5141182 Probab=91.30 E-value=0.017 Score=30.35 Aligned_cols=350 Identities=16% Similarity=0.113 Sum_probs=144.8 Q ss_pred cccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhcchHHHHHHHHhh----cccccceEeeccchhhhhhcch-h Q lcl|NC_015263. 42 APVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQQYQRLLNFYAN----MPLYAYSVVPFKDISTANENKL-K 116 (513) Q Consensus 42 s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~~~rlidy~~~----mpt~dY~I~P~~~~~~~~~~~~-~ 116 (513) =.+|.+--+.- .++.. .+..-.-..+..-....+..+.+.|+.++. ||--=|... ......+.. . T Consensus 1 M~if~~~~~~~----~~~~~--~~~~~~~~~~~~~~~~~~~~v~~~v~~Ia~~iA~lp~~~~~~~----~~~~~~~~~~~ 70 (378) T protein:vir:94 1 MNLFGKVVSFS----RGKLN--NDTQRVTAWQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYK----KSDVGSDTLIS 70 (378) T ss_pred CchhHHhHhhh----hcccc--cCcceeeeeecchhhhhhHHHHHHHHHHHHhHhhCceeeeeec----ccccccccccc Confidence 44444222210 11111 111000011122223344567778877754 442222221 111111111 0 Q ss_pred HHHHHHHHHHhh-----cChhHHHHHHHHHHHHhcceeEEEE-EcCcceeeeecCcceeEEEEEECCeeEEEEEeeeccC Q lcl|NC_015263. 117 KELATVTEFLSR-----LNPKYNFSKIVKLAMTVDIFYGYVI-DDKESVMIQQFPNDICKISSVSGGVYNYVIDLDALVS 190 (513) Q Consensus 117 ~~y~~v~~~L~k-----~n~k~~~~~i~~~~l~~g~~~gy~i-~d~~~~~iq~lp~dyckIsg~~nG~y~~~fD~syFd~ 190 (513) ..-+.....|+. +.-..+...++..++..|..|.|.+ .+.++..+.-.+ ..+|+ T Consensus 71 ~~~~~l~~lLn~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~~~~g~~~~~~~--------~~~~~------------ 130 (378) T protein:vir:94 71 MAGSDLDEVLNWSSKGERNSMEFWQKVIKKLLTTRYIDLYPIFDSETGELLDLLF--------ANDKK------------ 130 (378) T ss_pred cccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeeCCCCcEEEEEE--------ecCcE------------ Confidence 111333444542 2345677778899999999998854 344443221111 01110 Q ss_pred cchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEEecCccccchhhHHHHHHhHHHHHHHHHHHhhHhhhh Q lcl|NC_015263. 191 ADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKINESSLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQ 270 (513) Q Consensus 191 ~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~~~~~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~ 270 (513) .||.+ .-+-|+ .+.......+.+..+....+ +.++. . T Consensus 131 -----~~~~~----------------------------dvih~~----~~~~~~~~~~~~~~~~~~~~-~~~~~-----~ 167 (378) T protein:vir:94 131 -----EYKPE----------------------------ELVRLT----SPFYINEDTSILDNALASIQ-TKLEQ-----G 167 (378) T ss_pred -----Eechh----------------------------ceeeec----CcCCcccchhHHHHHHHHHH-HHHhh-----C Confidence 01100 011111 01111111222222221110 11111 0 Q ss_pred hceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhc--------cccceEEEecccccccccccccccchhhhhhHHh Q lcl|NC_015263. 271 NYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTV--------PDNVGVVTSPMEIDTVSFDKDSSTDDSVEKATKN 342 (513) Q Consensus 271 n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~L--------p~gv~~v~sP~~~d~i~ld~~~~~~dtv~~~~~~ 342 (513) ..+.+. +.| -.++.++++++.+..++.+ ..|+..+-..+++..+.++.....-..+.-..++ T Consensus 168 ~~~g~l-~~~---------~~l~~~~~~~~~e~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~~~~~~~~~~~~ 237 (378) T protein:vir:94 168 KLRGLL-KIN---------AFLDIDNTQEYREKALATIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNKDEIDLIKSE 237 (378) T ss_pred Ccccce-eeC---------CcCCHHHHHHHHHHHHHHHHHhhcccccccceeccCCceEEEccCChHHhhHHHHHHHHHH Confidence 111111 222 1244444444444444433 1134444444566555553322212334455578 Q ss_pred hhhhhhhhhhhccCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc-----ccc----eEEEEEecCCCCccHHH Q lcl|NC_015263. 343 FWDNAGVSQILFSSDNKTSQGIAMSIATDEQFIFGVINQLERWLNRYLLLN-----GMS----KYFKATMLEVTHFSKKE 413 (513) Q Consensus 343 i~~~~GiS~~Lfn~d~~s~~~~~~SI~~d~~~~~~~~~~iE~~~N~~i~~~-----~~~----~~f~~~~l~~T~fn~ke 413 (513) |..+.||...+++|+.. + ..++.--..-+.-++.+||..+|+.|-.. ... ..+.|..-+....+.++ T Consensus 238 Ia~~fgvPp~~l~g~~~--e--~~~~~f~~~tl~P~~~~ie~~l~~~Ll~~~e~~~g~~~~~~~~~~f~~~~l~~~d~~~ 313 (378) T protein:vir:94 238 LLTGYFMNENILLGTAT--Q--EQQIYFYNSTIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKE 313 (378) T ss_pred HHHHhCCCHHHhcCCch--H--HHHHHHHHHHHHHHHHHHHHHHHhhcCChhHhhhhhhhcccceeEeecchhhhcCHHH Confidence 99999999888876533 2 22222223334468999999999977421 111 13555556666778889 Q ss_pred HHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccCCCCcCCCCcc Q lcl|NC_015263. 414 AHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNET 493 (513) Q Consensus 414 ~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et 493 (513) .++.+.++..-|.=..--.-+.+|+.|. | |-+..++|+. .+.-...+...+ ..+++.|+.| T Consensus 314 ~~e~~~~~~~~G~~t~NE~R~~~g~~p~--------~----ggd~~~~~~n---~~~~~~~~~~~~---~~~~~~~~~e- 374 (378) T protein:vir:94 314 LIDLYHENINGPIFTQNQLLVKMGEQPI--------E----GGDVYIANLN---AVAVKNLSDLQG---NRKDVTSTDE- 374 (378) T ss_pred HHHHHHHHHhCCCcCHHHHHHHhCCCCC--------C----CCCeeeeccc---ccchhcchhccc---ccCCCCCCCC- Confidence 9999888888773222222234566663 2 2234444433 221111000000 0111112211 Q ss_pred cccccCCCCCCC Q lcl|NC_015263. 494 TGNKDSDETQRA 505 (513) Q Consensus 494 ~~n~~~~~~~~~ 505 (513) +++. T Consensus 375 --------~~n~ 378 (378) T protein:vir:94 375 --------TNNQ 378 (378) T ss_pred --------CCCC Confidence 1111 No 140 >protein:vir:1150 Length: 350 # NCBI annotation: predicted capsid packaging protein # Family: family:all:196 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490599;genbank:gi:17313219;genbank:GeneID:927315 Probab=91.18 E-value=0.017 Score=30.27 Aligned_cols=318 Identities=13% Similarity=0.028 Sum_probs=146.4 Q ss_pred CCCccchheeeeehhhhhhHHHHHHHHHHHHHhhccCcccccccccccc---------hHHHHHHHhh-hcc--ChhHHH Q lcl|NC_015263. 1 MVKNKKKRLSMIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTS---------SQSKVRKIVK-EYR--NEGNQK 68 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~---------s~d~~k~~i~-~~~--P~~n~~ 68 (513) |-|.||+|..-- +.....-..+-.-.+......|+-+. ..|-++-|-. +|+ |..-.. T Consensus 1 m~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~y~~~~~~~~~~~pp~~~~~ 69 (350) T protein:vir:11 1 MSKRRSHRRQQP-----------VTVQSAQEGEFIPRQGGRAEAFTFGDPMPVLDGRGILDYLECWPNGRWYEPPLSMEG 69 (350) T ss_pred CCccccCCCcCc-----------cccCCcchhhhccccccceEEEEeCCceeecCcchhhHHHHHhhcCccccCCCCHHH Confidence 888877664311 00000000010111111122233110 1111111111 232 333333 Q ss_pred HHHHHHHHHHhhcchHHHHHHHHhhcccccceEeeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcce Q lcl|NC_015263. 69 TLRKVSEDLAVQSQQYQRLLNFYANMPLYAYSVVPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDIF 148 (513) Q Consensus 69 ~ir~~s~~lY~~sg~~~rlidy~~~mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~~ 148 (513) |.++ +..+++-.+.|.+..+|..-.|. | +.+ |. ..+|..++.+.+..|.. T Consensus 70 -la~~----~~~~~~h~~~l~~k~n~l~~~~~--P---------n~~----------~t----~~~f~~~v~d~ll~Gna 119 (350) T protein:vir:11 70 -LAKS----VGSSVYLQSGLKFKRNMLAKTFI--P---------HRL----------LS----RATFEQFSLDWLTFGSA 119 (350) T ss_pred -HHHH----Hhhhhhhccchhhhhhhhhhccc--C---------CCC----------CC----HHHHHHHHHHHHhcCCe Confidence 3332 23455555555544444332221 1 000 11 34456677788889999 Q ss_pred eEEEEEcCc--ceeeeecCcceeEEEEEECCeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeec Q lcl|NC_015263. 149 YGYVIDDKE--SVMIQQFPNDICKISSVSGGVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQ 226 (513) Q Consensus 149 ~gy~i~d~~--~~~iq~lp~dyckIsg~~nG~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~ 226 (513) |.+.+-+.. ...+.++|++||++. .+|.+.|.+. . .. .=++++ T Consensus 120 y~~~~rn~~G~~~~L~~l~~~~vr~~--~~~~~~~~~~--~--~~-----------------------------~~~~~~ 164 (350) T protein:vir:11 120 YLEQPRSRLGTRMPLQAPLAKYMRRG--TDLETFYQVR--S--WK-----------------------------DEHEFE 164 (350) T ss_pred EEEEEEcCCCCEEEEEEeCCceeEee--ecCCeEEEEe--e--CC-----------------------------eEEEEC Confidence 999987654 467889999999976 4444322211 0 00 012344 Q ss_pred CCceEEEEe-c-CccccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHH Q lcl|NC_015263. 227 DKNSICIKI-N-ESSLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEAL 304 (513) Q Consensus 227 ~~kt~~ik~-~-~~~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~i 304 (513) .+--+-|+- + ....+|+|+..+....+.-.+...+...- -..|-. ..+-|-+. .+..++.++++.+.+.+ T Consensus 165 ~~eVihir~~~~~~~~yGls~~~~a~~si~l~~~a~~~~~~--~f~NGa-~~~gil~~-----~~~~ls~e~~~~l~~~~ 236 (350) T protein:vir:11 165 KGSVIQLREADINQEIYGVPEWFCALQSALLNESATLFRRK--YYNNGS-HAGFILYM-----TDAAQNEEDIDALRTAL 236 (350) T ss_pred cccEEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHH--HHhccC-CCceEEEe-----cCCCCCHHHHHHHHHHH Confidence 433344442 2 23457999999988887765555543321 233321 11111111 02457888888887777 Q ss_pred HHhccccce-----EEEecc-cccccc---cccccccchh---hhhhHHhhhhhhhhhhhhccC---CCcchH-HHHHHH Q lcl|NC_015263. 305 SMTVPDNVG-----VVTSPM-EIDTVS---FDKDSSTDDS---VEKATKNFWDNAGVSQILFSS---DNKTSQ-GIAMSI 368 (513) Q Consensus 305 k~~Lp~gv~-----~v~sP~-~~d~i~---ld~~~~~~dt---v~~~~~~i~~~~GiS~~Lfn~---d~~s~~-~~~~SI 368 (513) ++. .|.+ .|..|- +-+.++ +.....+.+. .+...++|..+.||...+.|- .+++.+ .-+... T Consensus 237 ~~~--~G~~N~~~~~v~~~~g~~~g~~~~pl~~~~~d~qf~e~k~~~~~eIa~a~~VPp~llGi~~~~t~~~sn~e~~~~ 314 (350) T protein:vir:11 237 KTA--KGPGNFRNLFVYAPNGKKEGIQLIPVSEVAAKDEFGSIKNISRDDQLAGLRVYPQLMGVVPQNAGGFGSISDAAA 314 (350) T ss_pred HHh--cCccccCceeeecCCCCccceEEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcCCHHHHHH Confidence 764 3432 444442 112233 3323333333 334447899999999888862 122222 223333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcccceEEEEEecCC Q lcl|NC_015263. 369 ATDEQFIFGVINQLERWLNRYLLLNGMSKYFKATMLEV 406 (513) Q Consensus 369 ~~d~~~~~~~~~~iE~~~N~~i~~~~~~~~f~~~~l~~ 406 (513) .--..-+.-++++||. +|..|-.+...+. .+..-++ T Consensus 315 ~f~~~~L~P~~~~ie~-ln~~l~~~~~~F~-~~~~~~l 350 (350) T protein:vir:11 315 VWASLELAPMQTRLQQ-VNEMIGEEVVRFA-QFDAPGL 350 (350) T ss_pred HHHHHHHHHHHHHHHH-HHhhcCccccccC-cccccCC Confidence 3333334457778874 6666533322210 1111111 No 141 >protein:vir:2013 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046757;genbank:gi:9630328;genbank:GeneID:1261529 Probab=89.36 E-value=0.027 Score=29.20 Aligned_cols=316 Identities=13% Similarity=0.090 Sum_probs=144.0 Q ss_pred CCCccchheeeeehhhhhhHHHHHHHHHHHHHhhccCccccccccc-------ccchHHHHHHHhh-hcc--ChhHHHHH Q lcl|NC_015263. 1 MVKNKKKRLSMIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGS-------LTSSQSKVRKIVK-EYR--NEGNQKTL 70 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s-------~~~s~d~~k~~i~-~~~--P~~n~~~i 70 (513) |-|.|++|..--.... ....+.-.+=+|+ .....|-++-+-+ +|+ |..-.. | T Consensus 1 ~~~~~~~~~~~~~~~~-----------------~~~~~~~~~~~f~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~-l 62 (344) T protein:vir:20 1 MSKKKGKTPQPAAKTM-----------------TASGPKMEAFTFGEPVPVLDRRDILDYVECISNGRWYEPPVSFTG-L 62 (344) T ss_pred CCcccCCCCcchhhhh-----------------hccCCceEEEEcCCceEecCcchhhhhhhhhhcCceecCCCCHHH-H Confidence 9888888753111100 0111211111222 1111122222211 233 443332 3 Q ss_pred HHHHHHHHhhcchHHHHHHHHhhcccccceEeeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcceeE Q lcl|NC_015263. 71 RKVSEDLAVQSQQYQRLLNFYANMPLYAYSVVPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDIFYG 150 (513) Q Consensus 71 r~~s~~lY~~sg~~~rlidy~~~mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~~~g 150 (513) .++ +.++++-.+.|.+..+|..-.| .|- .+ |. .++|..+..+.+..|..|. T Consensus 63 a~~----~~a~~~h~~~i~~k~n~l~~~~--~Pn---------~~----------lt----~~~f~~~~~d~ll~Gnay~ 113 (344) T protein:vir:20 63 AKS----LRAAVHHSSPIYVKRNILASTF--IPH---------PW----------LS----QQDFSRFVLDFLVFGNAFL 113 (344) T ss_pred HHH----HhhhhhhCccceehhhhHHHhc--cCC---------CC----------CC----HHHHHHHHHHHHhcCCeEE Confidence 332 3445544555544444433332 120 00 11 2445667778888999999 Q ss_pred EEEEc--CcceeeeecCcceeEEEEEECCeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCC Q lcl|NC_015263. 151 YVIDD--KESVMIQQFPNDICKISSVSGGVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDK 228 (513) Q Consensus 151 y~i~d--~~~~~iq~lp~dyckIsg~~nG~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~ 228 (513) +++.+ +..+.+.++|+.||++. ...+.|.+... .. .=++++.+ T Consensus 114 ~i~rn~~G~~~~L~pl~~~~vr~~-~~~~~~~~~~~-----~~-----------------------------~~~~~~~~ 158 (344) T protein:vir:20 114 EKRYSTTGKVIRLETSPAKYTRRG-VEEDVYWWVPS-----FN-----------------------------EPTAFAPG 158 (344) T ss_pred EEEECCCCcEEEEEEcCCceeEee-ecCCEEEEEcc-----CC-----------------------------eEEEEcCc Confidence 99875 45578999999999985 22333321110 00 00233333 Q ss_pred ceEEEEe-c-CccccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHHHH Q lcl|NC_015263. 229 NSICIKI-N-ESSLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSM 306 (513) Q Consensus 229 kt~~ik~-~-~~~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~ 306 (513) --+-|+- + ....+|+|+..+....+.--+...+...- -..|- ....-|-+. .+..++.++++.+.+.+++ T Consensus 159 eIiHir~~~~~~~~yGls~~~~a~~si~l~~~a~~~~~~--~f~NG-a~p~~Il~~-----~d~~l~~e~~~~ik~~~~~ 230 (344) T protein:vir:20 159 SVFHLLEPDINQELYGLPEYLSALNSAWLNESATLFRRK--YYENG-AHAGYIMYV-----TDAVQDRNDIEMLRENMVK 230 (344) T ss_pred cEEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHH--HHhcc-CCCceEEEe-----cCcCCCHHHHHHHHHHHHH Confidence 2333442 2 24557999999988877655554443211 11221 111111110 0234788888888777776 Q ss_pred hccccc---eEEEeccc-cccccc---ccccccchh---hhhhHHhhhhhhhhhhhhccC--C-CcchHH-HHHHHHHHH Q lcl|NC_015263. 307 TVPDNV---GVVTSPME-IDTVSF---DKDSSTDDS---VEKATKNFWDNAGVSQILFSS--D-NKTSQG-IAMSIATDE 372 (513) Q Consensus 307 ~Lp~gv---~~v~sP~~-~d~i~l---d~~~~~~dt---v~~~~~~i~~~~GiS~~Lfn~--d-~~s~~~-~~~SI~~d~ 372 (513) +--.|- ..+.+|-- -+.+++ .....+.+. -+-..++|..+.||...|.|- + +++.+. -+....-.. T Consensus 231 ~~g~~n~r~l~l~~p~g~~~gi~~~pis~~~~d~qf~e~k~~s~~eIa~af~VPp~llGi~~~~t~~~~n~e~~~~~f~~ 310 (344) T protein:vir:20 231 SKGRNNFKNLFLYAPQGKADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQLMGGKPENVGSLGDIEKVAKVFVR 310 (344) T ss_pred hcCCCCccceEEecCCCCccceeEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHHHhccCCCCCCccccHHHHHHHHHH Confidence 431111 24454421 123333 222333333 333447899999999888862 1 222221 222222122 Q ss_pred HHHHHHHHHHHHHHHHHHhhcccceEEEEEecCCCCccH Q lcl|NC_015263. 373 QFIFGVINQLERWLNRYLLLNGMSKYFKATMLEVTHFSK 411 (513) Q Consensus 373 ~~~~~~~~~iE~~~N~~i~~~~~~~~f~~~~l~~T~fn~ 411 (513) .-+.-++++||. +|..|-. +-|+|.+..+.-=++ T Consensus 311 ~~l~P~~~~~e~-in~~lg~----~~i~F~~~~l~~~d~ 344 (344) T protein:vir:20 311 NELIPLQDRIRE-INGWLGQ----EVIRFKNYSLDTDND 344 (344) T ss_pred HHHHHHHHHHHH-HHHhcCC----cccccCccccccCCC Confidence 222235566663 5555432 224444433332233 No 142 >protein:vir:78749 Length: 337 # NCBI annotation: putative portal protein # Family: family:all:196 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285643;genbank:gi:148727149;genbank:GeneID:5220095 Probab=88.82 E-value=0.03 Score=28.93 Aligned_cols=305 Identities=10% Similarity=0.049 Sum_probs=150.3 Q ss_pred CCCccchheeeeehhhhhhHHHHHHHHHHHHHhhccCcccccccccccc---h------HHHHHHHh---hhcc-ChhHH Q lcl|NC_015263. 1 MVKNKKKRLSMIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTS---S------QSKVRKIV---KEYR-NEGNQ 67 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~---s------~d~~k~~i---~~~~-P~~n~ 67 (513) |-|.|+++.+.-.- +| ...|+-+. . .|-++=+. -+|+ |-=+. T Consensus 1 m~~~~~~~~~~~~~----------------------~~---~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~pP~~~ 55 (337) T protein:vir:78 1 MTKRQQQPAQAAAS----------------------SP---RPSVVFSMPEAIDPTAWMTDYTGVFYNPYGEYYQPPIDR 55 (337) T ss_pred CCCcccCccccccc----------------------Cc---eeEEEecCcccccCcchhHhhhhhhhccCcceecCCCCH Confidence 88777776544211 11 11222111 0 11111111 1232 22122 Q ss_pred HHHHHHHHHHHhhcchHHHHHHHHhhcccccceEeeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcc Q lcl|NC_015263. 68 KTLRKVSEDLAVQSQQYQRLLNFYANMPLYAYSVVPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDI 147 (513) Q Consensus 68 ~~ir~~s~~lY~~sg~~~rlidy~~~mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~ 147 (513) . .|++. +..+++-.++|.+-.++-.-.|.-. ..++..+..+.+..|. T Consensus 56 ~---~La~l-~~~~~~h~~~L~~k~N~~~~~f~~~-----------------------------~~~~~~~~~d~ll~GN 102 (337) T protein:vir:78 56 K---GLAKV-ARANAHHGAILMARRNMVAGRFTNQ-----------------------------RATITAFVHNYLQFGD 102 (337) T ss_pred H---HHHHH-hhcchhhhhHHHhhhccccccCcCc-----------------------------HHHHHHHHHHHHhhCC Confidence 2 22332 2445555666665444322222110 1356677788889999 Q ss_pred eeEEEEEc--CcceeeeecCcceeEEEEEECCeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeee Q lcl|NC_015263. 148 FYGYVIDD--KESVMIQQFPNDICKISSVSGGVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEI 225 (513) Q Consensus 148 ~~gy~i~d--~~~~~iq~lp~dyckIsg~~nG~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L 225 (513) .|.+++-+ +..+-+.++|+.||++. .||.+.|... .. . =+.+ T Consensus 103 ay~~~~rn~~G~~~~L~pl~~~~v~~~--~d~~~~~~~~----~~-~-----------------------------~~~~ 146 (337) T protein:vir:78 103 GGLLKLRNSFGQVVGLHPLSSVYLRRR--EDGCFVYLQQ----GK-P-----------------------------NLIY 146 (337) T ss_pred eEEEEEECCCCcEEEEEEeCCceeEee--eCCeEEEEEc----CC-c-----------------------------eEEE Confidence 99999876 45678889999999987 7887654211 00 0 0123 Q ss_pred cCCceEEEEe-c-CccccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHH Q lcl|NC_015263. 226 QDKNSICIKI-N-ESSLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEA 303 (513) Q Consensus 226 ~~~kt~~ik~-~-~~~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ 303 (513) +.+--+-++- + ....+|+|+..+....+.--+...+...- -..|=.. .+-|-.. -+..++.++++.+.+. T Consensus 147 ~~~eIiHik~~~~~~~~~Gls~~~~a~~si~l~~aa~~~~~~--~f~NGa~-p~~il~~-----~~~~l~~e~~~~lk~~ 218 (337) T protein:vir:78 147 RPDDVIWLAQYDPEQQVYGMPDYLGGLQSALLNQDATLFRRR--YFLNGAH-MGFIFYA-----TDPNMDDDTEEEMKEM 218 (337) T ss_pred CCccEEEECCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHH--HHhccCC-CceeEEc-----CCCCCCHHHHHHHHHH Confidence 3332333442 3 24557999999888877655555553321 1222111 1111110 0234777888777777 Q ss_pred HHHhc-cc--cceEEEecc----cccccccccccccchh---hhhhHHhhhhhhhhhhhhcc----CCCcch-HHHHHHH Q lcl|NC_015263. 304 LSMTV-PD--NVGVVTSPM----EIDTVSFDKDSSTDDS---VEKATKNFWDNAGVSQILFS----SDNKTS-QGIAMSI 368 (513) Q Consensus 304 ik~~L-p~--gv~~v~sP~----~~d~i~ld~~~~~~dt---v~~~~~~i~~~~GiS~~Lfn----~d~~s~-~~~~~SI 368 (513) +++.- +. +-..+.+|- .++-+++.....+.+. .+-..++|..+.||...+.| +.+.+. +.-+.++ T Consensus 219 ~~~~~G~~n~~~~~v~~~~g~~~Gi~~~pis~~~~d~qfle~k~~s~~eIa~a~~VPp~llGi~~~~~~~~~~n~e~~~~ 298 (337) T protein:vir:78 219 IANSKGVGNFRSMFVNIPDGKPDGIKLIPVGDIATKDEFAAIKGITAQDVLTAHRYPPALAGIIPTNGGGGLGDPEKYDA 298 (337) T ss_pred HHHhcCcccccceEEEcCCCCccceeEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHHcccccCCCcCccccHHHHHH Confidence 76542 11 112444431 1122233333334343 33344789999999988775 222222 2333333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcccceEEEEEecCCCCc Q lcl|NC_015263. 369 ATDEQFIFGVINQLERWLNRYLLLNGMSKYFKATMLEVTHF 409 (513) Q Consensus 369 ~~d~~~~~~~~~~iE~~~N~~i~~~~~~~~f~~~~l~~T~f 409 (513) .-...-+.-++++||..+|+.+...... ++|.+.--.-+ T Consensus 299 ~f~~~~L~P~~~~ie~~~n~~ll~~~~~--~~f~~~~~~~~ 337 (337) T protein:vir:78 299 TYARNEVLPLCELVQDAINSAGLPRALW--VTFRETIGAAV 337 (337) T ss_pred HHHHHHHHHHHHHHHHHHhhhcCChhhc--eeccccccccC Confidence 3333334468889999999765432211 33333333222 No 143 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=88.72 E-value=0.031 Score=28.89 Aligned_cols=418 Identities=11% Similarity=0.043 Sum_probs=157.7 Q ss_pred HHHHHHHHHHhhccCcccccccccccch----------------HHHHHHHhhhccChhHHHHHHHHHHHHHhhcchHH- Q lcl|NC_015263. 23 KRNNRISILRDDNRTPVFGAPVGSLTSS----------------QSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQQYQ- 85 (513) Q Consensus 23 ~~~~~~~i~~~~~~~~~~~s~~~s~~~s----------------~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~~~- 85 (513) |+++ ||-- -++-..|+.+.. .+.+++.|..| ......+..+.+|+.....+.. T Consensus 1 ~~~~--~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~i~~~i~~~--~~~~~r~~~l~~YY~g~~~i~~~ 70 (483) T protein:vir:12 1 MAQA--LIKG------GNILYPSQPTQTEIFDAIVRTNNKPETLEEMIVRYIKQH--LEKLPEISIGQEYYEQRPDIVKE 70 (483) T ss_pred Cccc--hhcC------CceeecCcchhhhhhhcccccCCchhhHHHHHHHHHHHH--HHHHHHHHHHHHHhccccccccc Confidence 2222 1110 001111222222 11223333332 2333445555555444321111 Q ss_pred ---------------------HHHHHHhhcccccceE-eeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHH Q lcl|NC_015263. 86 ---------------------RLLNFYANMPLYAYSV-VPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAM 143 (513) Q Consensus 86 ---------------------rlidy~~~mpt~dY~I-~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l 143 (513) ....+++...+ .|.+ .|..- ...++...+ +++ .++ .=++...+..+.+.++ T Consensus 71 ~~~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~-~~l~G~p~~~--~~~d~~~~~-~l~--~~~-~n~~~~~~~~~~~~~~ 143 (483) T protein:vir:12 71 PKPVDATGAVDPLKPDDRMITNFHANLVDQKV-SYIVGKPIAF--KHTDDEVVK-RID--EVL-GNRFDDKLHSVLTGAS 143 (483) T ss_pred cccccccccccccccccccccchHHHHHHHHh-hhhcccCcee--ccCChHHHH-HHH--HHH-hccHHHHHHHHHHHHh Confidence 11111111110 1110 01000 001111111 111 122 2356777888999999 Q ss_pred HhcceeEEEEEcCcc-eeeeecCcceeEEEEE--ECCeeEEEEEeeeccCcchhccccH-HHHHHHHHHhhhhhc--cCc Q lcl|NC_015263. 144 TVDIFYGYVIDDKES-VMIQQFPNDICKISSV--SGGVYNYVIDLDALVSADIVDYYPK-EIQEAVNKYTTMKKG--NNK 217 (513) Q Consensus 144 ~~g~~~gy~i~d~~~-~~iq~lp~dyckIsg~--~nG~y~~~fD~syFd~~~~L~~~p~-Ei~~~y~~Y~~~k~~--~~~ 217 (513) +.|..|.+...|.++ +.+.-++|+-|.++-- ..+.+.+++=....+.....+.|.+ ++.. | .+...... -.. T Consensus 144 ~~G~~y~~v~~d~d~~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~y~~~~v~~-~-~~~~~~~~~~~~~ 221 (483) T protein:vir:12 144 NKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKVTVNY-Y-VYENGSLIPDYSN 221 (483) T ss_pred hCCeEEEEEEEcCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEEeecceEEEEEecCeEEE-E-EEeCCeeeecccc Confidence 999999988877655 5777899999988754 3366666543222112222222211 1100 0 00000000 000 Q ss_pred ccccCeeecCCce----EEEEecCccccchhhHHHHHHhHHHHHHHHHHH--hhHhhhhhceeeeeeeccccCCCCCccc Q lcl|NC_015263. 218 SASNWYEIQDKNS----ICIKINESSLTPVPPFAGTFDSIYDIHSFKDLR--NDKAELQNYKLLIQKLETRSSNDNNDFT 291 (513) Q Consensus 218 ~~~~W~~L~~~kt----~~ik~~~~~~~~ip~f~~v~~d~~di~~~kdL~--~~~~~i~n~~ii~~kip~~~~n~~~~~~ 291 (513) ....|..-...+. .++.+-. ...+.|-|. ++.++.+.=+.. +....++..+--+-.+.=.+..+.+++. T Consensus 222 ~~~~~~~~~~~~~~g~vPvv~~~n-n~~g~sd~e----~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~ 296 (483) T protein:vir:12 222 NLENSKTHFSTGSWGKIPFIPFKN-NDLEISDIF----MYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFK 296 (483) T ss_pred cccccccccccCCCCccceEEecC-CCCCCCchh----hHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhHH Confidence 0001111111111 1122211 223333333 333333322211 2222222211111111100011112221 Q ss_pred cCHHHHHHHHHHHHHhccccce--EEEecccccccccccccccchhhhhhHHhhhhhhhhhhhhccCCC--cchHHHHHH Q lcl|NC_015263. 292 LDMPMMNYFHEALSMTVPDNVG--VVTSPMEIDTVSFDKDSSTDDSVEKATKNFWDNAGVSQILFSSDN--KTSQGIAMS 367 (513) Q Consensus 292 vd~~~~~~~~~~ik~~Lp~gv~--~v~sP~~~d~i~ld~~~~~~dtv~~~~~~i~~~~GiS~~Lfn~d~--~s~~~~~~S 367 (513) -++. .+..+. ++++.. .++.+.+. ......++...++|+.-+++-..-+.+.+ .|+..++.- T Consensus 297 ~~~~----~~~~~~--~~~~~~~~~l~~~~~~--------~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~ 362 (483) T protein:vir:12 297 RLLR----YYGAIK--VSDNGGVDTIQVEVPV--------ENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFL 362 (483) T ss_pred Hhhh----hccccc--cCCCCcceEEeecCCH--------HHHHHHHHHHHHHHHHHhCCCCCCccccccCcHHHHHHHH Confidence 1111 111111 233321 22222111 11112345555678888877665554333 333333322 Q ss_pred HHHHHHHHHH----HHHHHHHHHHH---HHhhcccceEEEEEecCCCCccHHHHHHHHHHHHhcC-CcHHHHHHHHhCC- Q lcl|NC_015263. 368 IATDEQFIFG----VINQLERWLNR---YLLLNGMSKYFKATMLEVTHFSKKEAHDRYITDAQYG-FPVKVYLASLMGI- 438 (513) Q Consensus 368 I~~d~~~~~~----~~~~iE~~~N~---~i~~~~~~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G-~~~~~~laa~~G~- 438 (513) ...-...+.. |=+.+++.+.. ++....--..+.+.|-+..+-|..+.++.+.++. | .|... +.+.+|+ T Consensus 363 ~~~l~~k~~~~~~~f~~~l~~~~~li~~~~~~~~~~~~i~v~f~~~~p~~~~~~a~~~~kl~--GiiS~et-~~~~~~~v 439 (483) T protein:vir:12 363 YTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQTAQQSM--GIVSHET-VLENHPFV 439 (483) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccceeeEEeCCCCCCCHHHHHHHHHHHh--ccCchHH-HHHhCCCC Confidence 2222222221 22222232222 2222211235889999999999999999999985 6 55555 4445776 Q ss_pred -CHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccCCCCcCCCCcccccccCC Q lcl|NC_015263. 439 -DPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETTGNKDSD 500 (513) Q Consensus 439 -~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~~n~~~~ 500 (513) +|.+.+.+...|++. . ....+. ...++. +++ +..+..++++.. T Consensus 440 ~d~~~E~~ri~~E~~~-~----~~~~~~-~~~~~~------------d~~-~~~~~~~~~e~e 483 (483) T protein:vir:12 440 EDLQAELERIEQEQME-Y----NKQLPN-LDDGGA------------DGA-QQQERSNNKESE 483 (483) T ss_pred CCHHHHHHHHHHHHHH-H----Hhhccc-cccccc------------CCc-ccCCCCCcccCC Confidence 688899999888743 1 111111 111111 111 111111111111 No 144 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=88.52 E-value=0.032 Score=28.79 Aligned_cols=443 Identities=10% Similarity=0.051 Sum_probs=164.9 Q ss_pred CCCccchheeeeehhhhhhHHHHHHHHHHHHHhhccCccccccccccc--chHHHHHHHhhhccChhHHHHHHHHHHHHH Q lcl|NC_015263. 1 MVKNKKKRLSMIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLT--SSQSKVRKIVKEYRNEGNQKTLRKVSEDLA 78 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~--~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY 78 (513) |.+--- .++-+-..-+.++|.-.+ +.+.... +.+... -..+.+++.|..|. ..-...|+.+-+|++ T Consensus 1 ~~~~~~----~~~~~~~~~~~~~~~~~~---~~~~~~~----~~~~~~~~~~~~~i~~~i~~h~-~~~~~rl~~l~~yY~ 68 (502) T protein:vir:48 1 MMEQTL----FTDSTGQDLVLNLRFHRE---SRIRYRA----DNLEELMVNNWELLKNFINHHK-LRQAPRIQELLDYAR 68 (502) T ss_pred CceeEE----EEecchhHHHhhcccChh---HHhhhcc----cchhhhccccHHHHHHHHHHHH-HHHHHHHHHHHHHhc Confidence 222111 111111111111111111 0011110 111111 11234555555541 111234566666666 Q ss_pred hhcc-hH--------------------HHHHHHHhhcccccceE-eeccchhhhhhcchhHHHHHH-HHHHhhcChhHHH Q lcl|NC_015263. 79 VQSQ-QY--------------------QRLLNFYANMPLYAYSV-VPFKDISTANENKLKKELATV-TEFLSRLNPKYNF 135 (513) Q Consensus 79 ~~sg-~~--------------------~rlidy~~~mpt~dY~I-~P~~~~~~~~~~~~~~~y~~v-~~~L~k~n~k~~~ 135 (513) ..+. ++ +.+++..++ |.+ .|+.-. .......+...+. ..++..-++...+ T Consensus 69 g~~~~i~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~-----yl~g~p~~~~--~~d~~~~~~~~~~l~~~~~~N~~~~~~ 141 (502) T protein:vir:48 69 GENHDVLKSGRRKDNEMADKRAVHNYGRMISKFKTG-----YLAGNPIRVE--YDDNEDNSQNDDAIKRIGRINDIDTHN 141 (502) T ss_pred CCCccccccccccccccccceeecchHHHHHHHHhh-----hhcccCeeEe--cCCccchhHHHHHHHHHHhhcCHhHHH Confidence 5432 11 112221111 111 111000 0001111222222 1234444688899 Q ss_pred HHHHHHHHHhcceeEEEEEcCcc-eeeeecCcceeEEEEEE--CCeeEEEEEeeecc--Ccc--hhccc-cHHHHHHHHH Q lcl|NC_015263. 136 SKIVKLAMTVDIFYGYVIDDKES-VMIQQFPNDICKISSVS--GGVYNYVIDLDALV--SAD--IVDYY-PKEIQEAVNK 207 (513) Q Consensus 136 ~~i~~~~l~~g~~~gy~i~d~~~-~~iq~lp~dyckIsg~~--nG~y~~~fD~syFd--~~~--~L~~~-p~Ei~~~y~~ 207 (513) ..+.+.+++.|..|.+...+.++ +-+..+||.-|.++--. ++.+.+++=.-.-+ +.. ..+-| +..+. . T Consensus 142 ~~~~~~~~~~G~a~~~v~~dedg~~~i~~~~p~~~~~vydd~~~~~~~~~ir~~~~~~~~~~~~~~~iyt~~~i~----~ 217 (502) T protein:vir:48 142 RNLIRDLSQTGRAYEVIYRSEYDETRIKRLSPLETFVIYDNSLEDNSIAAVRYYNRGTLQNAKDVVEIYTNQHIY----T 217 (502) T ss_pred HHHHHHHhhcCeEEEEEEeCCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEEEeecCCcEEEEEEEeCCeEE----E Confidence 99999999999999888876554 45888899999877432 35555553221111 111 11111 11110 0 Q ss_pred HhhhhhccCcccccCeeec-CCceE----EEEecCccccchhhHHHHHHhHHHHHHHHH--HHhhHhhhhhceeeeeeec Q lcl|NC_015263. 208 YTTMKKGNNKSASNWYEIQ-DKNSI----CIKINESSLTPVPPFAGTFDSIYDIHSFKD--LRNDKAELQNYKLLIQKLE 280 (513) Q Consensus 208 Y~~~k~~~~~~~~~W~~L~-~~kt~----~ik~~~~~~~~ip~f~~v~~d~~di~~~kd--L~~~~~~i~n~~ii~~kip 280 (513) | .....|..+. .+|.+ ++.+ -+...++|-|.. ..+++|..+.-- +-+......+ .+++ +. T Consensus 218 ~--------~~~~~~~~~~~~~~~~g~vPvv~~-~nn~~g~sd~e~-v~~liDa~d~~~S~~~~~~~~~~~-~~lv--~~ 284 (502) T protein:vir:48 218 L--------DASDSFNEISVTPHAFGTVPITEF-LNNADGIGDYET-ELYLIDLYDSAESDTANHMSDMAD-AILA--IY 284 (502) T ss_pred E--------EeCCceeeccceecCCCccceEEe-cCCCCCCCchhh-hHHHHHHHHHHHHHHHHHHHHhcC-ceee--ee Confidence 0 0001121111 11111 1222 112234443433 223322222111 1111222222 1221 11 Q ss_pred cccCCCCCccccCHHHHHHHHHHHHHhccccceEEEec-ccccccccccccc-cchhhhhhHHhhhhhhhhhhhhccCC- Q lcl|NC_015263. 281 TRSSNDNNDFTLDMPMMNYFHEALSMTVPDNVGVVTSP-MEIDTVSFDKDSS-TDDSVEKATKNFWDNAGVSQILFSSD- 357 (513) Q Consensus 281 ~~~~n~~~~~~vd~~~~~~~~~~ik~~Lp~gv~~v~sP-~~~d~i~ld~~~~-~~dtv~~~~~~i~~~~GiS~~Lfn~d- 357 (513) =......++...+++....++ ..+.+.+....+ .++.-+.-+.... ....++.-.++|+.-+++...-+.+. T Consensus 285 g~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~~ 359 (502) T protein:vir:48 285 GDLALPQGMQASDMKRTRLMQ-----LKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNKDIHVFTNTPDMSDNHFS 359 (502) T ss_pred cCcccccccchhhhhhcceee-----ccccccccccccCcceeEeeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccc Confidence 000111222222211111110 001111111000 0111111111111 11224445567887787776555443 Q ss_pred -CcchHHHHHHHHHHHHHHHHHHHHHHHH----HH---HHHhhc--ccce---EEEEEecCCCCccHHHHHHHHHHHHhc Q lcl|NC_015263. 358 -NKTSQGIAMSIATDEQFIFGVINQLERW----LN---RYLLLN--GMSK---YFKATMLEVTHFSKKEAHDRYITDAQY 424 (513) Q Consensus 358 -~~s~~~~~~SI~~d~~~~~~~~~~iE~~----~N---~~i~~~--~~~~---~f~~~~l~~T~fn~ke~~~~~~~~~~~ 424 (513) +.|+..++.....-...+......++.- +. .++... .... ..++.|-+..+-|..+.++.+.+++ T Consensus 360 ~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~e~a~~~~kl~-- 437 (502) T protein:vir:48 360 GNASGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEFKDFDESRLKITFTPNLPKSLYEQVSILNDLG-- 437 (502) T ss_pred cCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccceEEeCCCCCcCHHHHHHHHHHHh-- Confidence 3344445444333333333222222221 21 122211 1111 3688999999999999999999985 Q ss_pred C-CcHHHHHHHHhCC--CHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccCCCCcCCCCccccccc Q lcl|NC_015263. 425 G-FPVKVYLASLMGI--DPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETTGNKD 498 (513) Q Consensus 425 G-~~~~~~laa~~G~--~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~~n~~ 498 (513) | .|..- +...+|+ +|.+.+.++..|++.++.. +.+.....-++++.....++..+++.+-+ + T Consensus 438 g~iS~et-~l~~l~~v~D~~~E~~ri~~E~~~~~~~----~~~~~~~~~~~~~~d~~~e~~~~~~~~~~-------~ 502 (502) T protein:vir:48 438 GQVSQET-ALSLSGLVENPTEELDKINEESSKIDFK----GYPSYFYDNVGKYTDEVKETHTDDFERVY-------E 502 (502) T ss_pred ccCcHHH-HHHhCCCCCCHHHHHHHHHHHHHhhhhh----cccccccccccccCCCccCCCCcCcCCCC-------C Confidence 6 55544 4455787 5888899999998654422 11211111111110000000011111000 1 No 145 >protein:vir:3780 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536820;genbank:gi:17981829;genbank:GeneID:929208 Probab=88.25 E-value=0.033 Score=28.67 Aligned_cols=316 Identities=15% Similarity=0.103 Sum_probs=142.0 Q ss_pred CCCccchheeeeehhhhhhHHHHHHHHHHHHHhhccCcccccccccccc-----hHHHHHHHh-hh--cc-ChhHHHHHH Q lcl|NC_015263. 1 MVKNKKKRLSMIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTS-----SQSKVRKIV-KE--YR-NEGNQKTLR 71 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~-----s~d~~k~~i-~~--~~-P~~n~~~ir 71 (513) |-++++........ ..|. .+.+||-.- -.|-..=+. .+ |+ |-=+-+.|. T Consensus 1 ~~~~~~~~~~~~~~---------------------~~~~-~~~~f~~~~~~~~~~~~y~~~~~~~~~~~~epp~~~~~la 58 (345) T protein:vir:37 1 MKTNVKTDNKKGIV---------------------IAPI-NDRTFSLNEISASPALDYVGIGFDENYNCYLPPVNRHALA 58 (345) T ss_pred CCCCccccchhhcc---------------------cCcc-eeEEeecCCcccccchhhhhhhhcCCccccCCCCCHHHHH Confidence 54444332111100 0111 123344211 001111111 11 33 222222222 Q ss_pred HHHHHHHhhcchHHHHHHHHhhcccccceEeeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcceeEE Q lcl|NC_015263. 72 KVSEDLAVQSQQYQRLLNFYANMPLYAYSVVPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDIFYGY 151 (513) Q Consensus 72 ~~s~~lY~~sg~~~rlidy~~~mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~~~gy 151 (513) ++ + ..+++-.+.|..-.+|....+ .| +.+ |. ..+|..+..+.+..|..|.+ T Consensus 59 ~l---~-~~~~~h~~~i~~k~n~l~~~~--~P---------n~~----------lt----~~~f~~~~~d~ll~Gnay~~ 109 (345) T protein:vir:37 59 KL---P-HQNAQHGGILHSRANMVSSLY--EG---------GKA----------LS----RMDMRALCLNLIQFGDVGLL 109 (345) T ss_pred HH---h-hcccccccceeeechHHHhhc--cC---------CCC----------CC----HHHHHHHHHHHHhcCCeEEE Confidence 22 1 222222222221111111111 11 000 11 34456677788899999999 Q ss_pred EEEcC--cceeeeecCcceeEEEEEECCeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCc Q lcl|NC_015263. 152 VIDDK--ESVMIQQFPNDICKISSVSGGVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKN 229 (513) Q Consensus 152 ~i~d~--~~~~iq~lp~dyckIsg~~nG~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~k 229 (513) ++-+. ..+-+.++|+.||++. .+|...+.+--..+... ..=++++.+- T Consensus 110 ~~rn~~G~~~~L~pl~~~~vr~~--~d~~~~~~~~~~~~~~~----------------------------g~~~~~~~~d 159 (345) T protein:vir:37 110 KVRNGFGQVVRLVPLSSLYLRVR--KDGGYSYLMKKSLYDTA----------------------------QEIYRYDAKD 159 (345) T ss_pred EEEcCCCcEEEEEEEcCceeEEE--EeCCeeEEEEEeEecCC----------------------------ceEEEEcccc Confidence 88764 4567889999999986 34333222211111100 0112333332 Q ss_pred eEEEEe-c-CccccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHh Q lcl|NC_015263. 230 SICIKI-N-ESSLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMT 307 (513) Q Consensus 230 t~~ik~-~-~~~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~ 307 (513) -+-|+. + ....+|+|+..+.+..+.--+...+...- -..|-.. .+-|-.. .++.++.++++.+.+.+++. T Consensus 160 Vihir~~~~~~~~~Gls~~~~a~~si~l~~~a~~~~~~--~f~NG~~-p~~Il~~-----~d~~l~~e~~~~lk~~~~~~ 231 (345) T protein:vir:37 160 IIFIKLYDPMQQVYGSPDYVGGIQSALLNSDATVFRRR--YFSNGAH-MGFILYS-----TDPDLTEEMEEEIARKISES 231 (345) T ss_pred EEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHH--HHhccCC-cceEEEe-----cCCCCCHHHHHHHHHHHHHh Confidence 333442 3 23457999999998887665555543311 1233111 1111110 12456777777777777664 Q ss_pred ccccce-----EEEeccc-cccc---ccccccccchhhh---hhHHhhhhhhhhhhhhccC--C-CcchH-HHHHHHHHH Q lcl|NC_015263. 308 VPDNVG-----VVTSPME-IDTV---SFDKDSSTDDSVE---KATKNFWDNAGVSQILFSS--D-NKTSQ-GIAMSIATD 371 (513) Q Consensus 308 Lp~gv~-----~v~sP~~-~d~i---~ld~~~~~~dtv~---~~~~~i~~~~GiS~~Lfn~--d-~~s~~-~~~~SI~~d 371 (513) .|++ .+.+|-- -+.+ ++.....+.+.++ -..++|..+.||...|.|- + +.+++ .-+..+.-- T Consensus 232 --~g~~n~~~~~i~~p~g~~~G~~~~pls~~~~d~qf~e~k~~~~~dIa~a~~VPp~llGi~~~~~~~~~~~e~~~~~f~ 309 (345) T protein:vir:37 232 --KGVGNFRSMFVNIANGHPDGLKVIPIGDTGTKDEFANIKNISAQDVLTAHRFPAGLSGIIPTNTGGLGDPLKYREVYH 309 (345) T ss_pred --cCcccccceEEEcCCCcccceEEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCccCCCCCCcccHHHHHHHHH Confidence 2332 4455421 1223 3332333333333 4457899999999998852 1 11222 223333333 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcccceEEEEEecCCCC Q lcl|NC_015263. 372 EQFIFGVINQLERWLNRYLLLNGMSKYFKATMLEVTH 408 (513) Q Consensus 372 ~~~~~~~~~~iE~~~N~~i~~~~~~~~f~~~~l~~T~ 408 (513) ..-+.-++++||..+|+.+... ....++|.=.+++- T Consensus 310 ~~~l~P~~~~ie~~ln~~~~~~-~~~~i~F~~~~L~~ 345 (345) T protein:vir:37 310 YDEVMPLQEIIAETINQDPEIK-NLLKIKFREQNFAK 345 (345) T ss_pred HHHHHHHHHHHHHHhhhhccCC-CcceEEecchhhcC Confidence 3344468899999999865422 11224433333332 No 146 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=87.44 E-value=0.039 Score=28.32 Aligned_cols=406 Identities=11% Similarity=0.049 Sum_probs=157.0 Q ss_pred HHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhh---------------------- Q lcl|NC_015263. 23 KRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQ---------------------- 80 (513) Q Consensus 23 ~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~---------------------- 80 (513) |++.. ..|...-.+...........+.|+.+.-......++++-+|+... T Consensus 1 ~~~~~--------~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~yy~g~~~i~~~~~~~~~~~~~~~~~~~ 72 (479) T protein:vir:79 1 MLNIY--------ISETDLIKVQLKKESTINLVKVIEHYILKHRPEKYKQGEEYYYGNTDVNNKRRYYLLDGAKVDDFTK 72 (479) T ss_pred CCCce--------ecccceEeeccccCChhHHHHHHHHHHhhhhHHHHHHHHHHhccCCccccccccccccccccccccc Confidence 11111 111111122222222333344454432111223344444444322 Q ss_pred ------cchHHHHHHHHhhcccccceEeeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcceeEEEEE Q lcl|NC_015263. 81 ------SQQYQRLLNFYANMPLYAYSVVPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDIFYGYVID 154 (513) Q Consensus 81 ------sg~~~rlidy~~~mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~~~gy~i~ 154 (513) .+..+.+++-.++...=+=.. +. -+++-..+++ ..+..=++...+..+.+.+++.|..|.+... T Consensus 73 ~~~ki~~~~~~~Ivd~~~~~l~g~p~~--~~-----~~~~~~~~~~---~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~ 142 (479) T protein:vir:79 73 VNNKAINNYHKLLVDQKVGYSVGNPIV--FN-----ADDDNLTKLL---NDLLGEEFDDTITELYLNASNKGVEWLHPYI 142 (479) T ss_pred CcceeecchHHHHHHHHHhhhhcCCce--ec-----cCCHHHHHHH---HHHHhcCHHHHHHHHHHHHHhcCeEEEEEEe Confidence 222344444333322111000 00 0111112222 2233336888889999999999999988877 Q ss_pred cCcc-eeeeecCcceeEEEEEE--CCeeEEEEEeeecc---Ccc--hhccccH-HHHHHHHHHhhhh----------hcc Q lcl|NC_015263. 155 DKES-VMIQQFPNDICKISSVS--GGVYNYVIDLDALV---SAD--IVDYYPK-EIQEAVNKYTTMK----------KGN 215 (513) Q Consensus 155 d~~~-~~iq~lp~dyckIsg~~--nG~y~~~fD~syFd---~~~--~L~~~p~-Ei~~~y~~Y~~~k----------~~~ 215 (513) |.++ +-+..++|+-|.++.-. .+.+.+++=.-... ... ..+-|.+ .+.. | .+.... ... T Consensus 143 d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~y~~~~~~~~~~~~~e~y~~~~i~~-~-~~~~~~~~~~~~~~~~~~~ 220 (479) T protein:vir:79 143 NRKGEFKYVIIPAEEAIPIWDSKRQRELVAFIRFYYIEDIDGNKIKRVEYYTENDITY-F-IERGNSFIQEFLYDEYGKM 220 (479) T ss_pred CCCCceEEEEEccceeEEEEeCCCCCceEEEEEEEEEeecCCceEEEEEEEeCCcEEE-E-EecCCcccccccccccccc Confidence 6554 56777888888887542 23455443211111 011 1111111 1100 0 000000 000 Q ss_pred CcccccC-eeecCCce----EEEEecCccccchhhHHHHHHhHHHHHHHHHHH--hhHhhhhhc--eeeeeeeccccCCC Q lcl|NC_015263. 216 NKSASNW-YEIQDKNS----ICIKINESSLTPVPPFAGTFDSIYDIHSFKDLR--NDKAELQNY--KLLIQKLETRSSND 286 (513) Q Consensus 216 ~~~~~~W-~~L~~~kt----~~ik~~~~~~~~ip~f~~v~~d~~di~~~kdL~--~~~~~i~n~--~ii~~kip~~~~n~ 286 (513) .....-| ..-..++. .++.+ -...++.+-|. ++.++.+.=+.. +..+.++.. .+++ +.=..+.. T Consensus 221 ~~~~~~~~~~~~~~~~~~~vPvv~~-~nn~~g~sd~~----~v~~liDa~d~~~S~~~~~~~~~~~~~~v--~~g~~~~~ 293 (479) T protein:vir:79 221 TDIQEGHFRINNKEQGWGKVPFIPF-KNNEKCVSDLT----FYKSLIDIYDNNISTLADNLDEIQEVIYV--LKEYPGTS 293 (479) T ss_pred cccccccccccccccCCCcccEEEe-cCCCCCCcchh----hhHHHHHHHHHHHHHHHHHHHHhhCceee--eecCCccc Confidence 0000001 00111111 11222 11223444333 333333222211 112222211 1222 22001122 Q ss_pred CCccccCHHHHHHHHHHHHHhccccc--eEEEecccccccccccccccchhhhhhHHhhhhhhhhhhhhccCCC-cchHH Q lcl|NC_015263. 287 NNDFTLDMPMMNYFHEALSMTVPDNV--GVVTSPMEIDTVSFDKDSSTDDSVEKATKNFWDNAGVSQILFSSDN-KTSQG 363 (513) Q Consensus 287 ~~~~~vd~~~~~~~~~~ik~~Lp~gv--~~v~sP~~~d~i~ld~~~~~~dtv~~~~~~i~~~~GiS~~Lfn~d~-~s~~~ 363 (513) .+++.-++.. ...+. ++.+- ..++.+.+. ......++.-.++|+..+++-..-+.+.+ .|+.. T Consensus 294 ~~~~~~~~~~----~~~i~--~~~~~~~~~l~~~~~~--------~~~~~~~~~l~~~i~~~s~~p~~~~~~~gn~Sg~A 359 (479) T protein:vir:79 294 LQEFIDNIRY----YKSIK--VDGGGGVDKLEINIPV--------EAKKELLDRLEKNIIIFGQGVNPESQNTGDKSGVA 359 (479) T ss_pred cccchhhhhh----cccee--cCCCCcceEEeccCCH--------HHHHHHHHHHHHHHHHHhCccccccccccchhHHH Confidence 2222211111 11111 12221 122222111 11112244555677777766544443332 23333 Q ss_pred HHHHHHHHHHHHH-------HHHHHHHHHHHHHHhhcc-c---ceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHH Q lcl|NC_015263. 364 IAMSIATDEQFIF-------GVINQLERWLNRYLLLNG-M---SKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYL 432 (513) Q Consensus 364 ~~~SI~~d~~~~~-------~~~~~iE~~~N~~i~~~~-~---~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~l 432 (513) ++.....-...+. ..++++=+++-..+.... . ...+.+.|-+..+.|.++.++.+.++. |.-+...+ T Consensus 360 i~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~i~i~f~~~~p~~~~~~a~~~~kl~--g~iS~et~ 437 (479) T protein:vir:79 360 LKFLYSLLDLKCSKTEKKFKKAIRELLWFVCEYLKISGNKSYDYKTVQITFNHSMIINEAEKIDMAAKST--GIVSDETI 437 (479) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCccccccceEEeCCCCCcCHHHHHHHHHHHh--ccCcHHHH Confidence 3332222222222 222222222222322211 1 125789999999999999999999985 64444445 Q ss_pred HHHhCC--CHHHHHHHHHHHHHh-hCcccccCcccccccccccccccCCccccCCCCcCCCCcc Q lcl|NC_015263. 433 ASLMGI--DPVAFTGLLKVENEM-LDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNET 493 (513) Q Consensus 433 aa~~G~--~p~~~~~~~~~E~e~-L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et 493 (513) ...+|+ +|.+.+.++..|++. ......+ . + ++.| -.+|| T Consensus 438 l~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~-------~---~-----------~~~~-~~~e~ 479 (479) T protein:vir:79 438 VSNHPWVEDVNDELERLKKQEDTQKEYDDLI-------P---N-----------NQDG-VIDET 479 (479) T ss_pred HHhCCCCCCHHHHHHHHHHHHHHHHHHHhcc-------C---c-----------ccCC-CcCcC Confidence 556786 678889999998743 2211111 1 0 0111 11122 No 147 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=86.22 E-value=0.047 Score=27.86 Aligned_cols=417 Identities=13% Similarity=0.095 Sum_probs=161.6 Q ss_pred eeehhhhhhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhc---cChhHHHHHHHHHHHHHhhcchH--- Q lcl|NC_015263. 11 MIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEY---RNEGNQKTLRKVSEDLAVQSQQY--- 84 (513) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~---~P~~n~~~ir~~s~~lY~~sg~~--- 84 (513) |+ .++-..+-+-+++ ||-.. .+++.+... .|..-+..|+..-+|+.+....+ T Consensus 1 m~-----------~~~~~~~~~~~~~-------~~~~~----~~~~~~~~~~~~~~~~~~~~i~~~~~yy~g~~~~~~~~ 58 (496) T protein:vir:38 1 MI-----------NQIIAGVKGVMRR-------MGLLK----ALKDVKDHKKVNANDEDYKYIDMWKRLYQGHYAEWHNL 58 (496) T ss_pred Ch-----------hHHHHHHHHHHHH-------hccch----hhHHHHhcCCCcCCHHHHHHHHHHHHHhcCCCchhhcc Confidence 32 2222222222222 11111 112222211 15555666666666655433222 Q ss_pred -------------------HHHHHHHhhcccccceEeeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHh Q lcl|NC_015263. 85 -------------------QRLLNFYANMPLYAYSVVPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTV 145 (513) Q Consensus 85 -------------------~rlidy~~~mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~ 145 (513) +.+++..+++..= .|... ...++..++... .++..-+....+..++..|... T Consensus 59 ~~~~~~~~~~~~~~~~n~~k~i~~~~a~~l~~----~p~~i--~~~d~~~~e~l~---~~~~~n~f~~~~~~~~~~a~~~ 129 (496) T protein:vir:38 59 NYEHNGNPVNRRQLSMNLPKVTAKYMSKLLFN----EKVKI--NIDDKAAEEFVL---NVLKTNGFTKNMERYIEYGEAM 129 (496) T ss_pred hhccCCCccccceeecchHHHHHHHHhhhhhC----CcceE--eeCChHHHHHHH---HHHhccCHHHHHHHHHHHHhhh Confidence 2233333332110 01000 001112222222 2344445778889999999999 Q ss_pred cceeEEEEEcCc-ceeeeecCcceeEEEEEECCeeE-EEEEeeeccCcc----hhccccHHHHHHH---HHHhhhhhccC Q lcl|NC_015263. 146 DIFYGYVIDDKE-SVMIQQFPNDICKISSVSGGVYN-YVIDLDALVSAD----IVDYYPKEIQEAV---NKYTTMKKGNN 216 (513) Q Consensus 146 g~~~gy~i~d~~-~~~iq~lp~dyckIsg~~nG~y~-~~fD~syFd~~~----~L~~~p~Ei~~~y---~~Y~~~k~~~~ 216 (513) |..|.+...|.+ .+-+--.|++-+-++...+|... ++| .+...... .|+.+-.+=...+ .-|+......- T Consensus 130 G~~~~~~~~D~~~~~~i~~v~~~~~~P~~~~~~~~~~~~f-~~~~~~~~~~y~~le~h~~~~~~~~I~~~~y~~~~~~~~ 208 (496) T protein:vir:38 130 GGFVIKVYHDGNKNVKVSFATADCMYPLSNDSENVDECVI-ANSFHKNNKYYTLLEWNEWQGDVYTVTTELYQSDDPNEL 208 (496) T ss_pred CcEEEEEEEcCCCcEEEEEEcccceEEEEecCCcEEEEEE-EEEEEeCCeEEEEEEEEEEeCceEEEEEEEEecCCcccc Confidence 999999988754 45666677775555544334322 333 11111110 1221110000000 00100000000 Q ss_pred c----ccccCeeecCC-------c-eEE-EE------ecCccccchhhHHHHHHhHHHHHHHHHHH--hhHhhhhh--ce Q lcl|NC_015263. 217 K----SASNWYEIQDK-------N-SIC-IK------INESSLTPVPPFAGTFDSIYDIHSFKDLR--NDKAELQN--YK 273 (513) Q Consensus 217 ~----~~~~W~~L~~~-------k-t~~-ik------~~~~~~~~ip~f~~v~~d~~di~~~kdL~--~~~~~i~n--~~ 273 (513) . .+.-|=.+.+. + .|+ |+ ++.++++|+|-| .++.++.+.=+.. ....+++. .+ T Consensus 209 g~~v~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~N~~~~~~p~G~Sd~----~~~~~lid~ld~~~s~~~~~~~~~~~~ 284 (496) T protein:vir:38 209 GTKVSLTLLFDDIEPVVPLPDFTRPTFIYIKPNIANNKNLTSPLGISVY----ANALDTLKTLDLMFDSYYQEFKLGKKK 284 (496) T ss_pred CccccccccccccccceeecCCCcceEEEecCCcccccccCCcCCCchH----hhHHHHHHHHHHHHHHHHHHHhhcccc Confidence 0 00001111111 0 121 11 223344454444 4444443333322 11122221 12 Q ss_pred eeee--eeccccCCCCCccccCH-HHHHHHHHHHHHhccccceEEEecccccccccccc-cccchhhhhhHHhhhhhhhh Q lcl|NC_015263. 274 LLIQ--KLETRSSNDNNDFTLDM-PMMNYFHEALSMTVPDNVGVVTSPMEIDTVSFDKD-SSTDDSVEKATKNFWDNAGV 349 (513) Q Consensus 274 ii~~--kip~~~~n~~~~~~vd~-~~~~~~~~~ik~~Lp~gv~~v~sP~~~d~i~ld~~-~~~~dtv~~~~~~i~~~~Gi 349 (513) |.+. -+.. ..+.+|....-. ...+.|....... .++.. .+..+..+-. ..-...++...+.+...+|+ T Consensus 285 i~v~~~~l~~-~~~~~g~~~~~~~~~~~~~~~~~~~~-~~~~~------~i~~~~~~i~~e~~~~~l~~~l~~i~~~~g~ 356 (496) T protein:vir:38 285 VLVPSSFVKT-AVNLDGSTTQYFDSTDEAFFLYQGDQ-DDNGK------AIKDISVEIRSTEFIESINAMLRIYAMQVGL 356 (496) T ss_pred eecchHHhhc-cCCCCCccccCCCCccceEEEeecCC-Ccccc------cceeeccccCHHHHHHHHHHHHHHHHHhhCC Confidence 2221 0110 112222211100 0000000000000 00000 0111111100 11124466666778889999 Q ss_pred hhhhccCCCcchHH---HHHHHHHHHHHHHHHHHHHHH----HHHHHHh-------h---cccceEEEEEecCCCCccHH Q lcl|NC_015263. 350 SQILFSSDNKTSQG---IAMSIATDEQFIFGVINQLER----WLNRYLL-------L---NGMSKYFKATMLEVTHFSKK 412 (513) Q Consensus 350 S~~Lfn~d~~s~~~---~~~SI~~d~~~~~~~~~~iE~----~~N~~i~-------~---~~~~~~f~~~~l~~T~fn~k 412 (513) |...|+.+.++..+ +..+...-..-+-.....+|. .+..++. . ........+.|-+.-+-++. T Consensus 357 ~~~~f~~~~~g~~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~g~~~~~~~i~v~f~d~i~~d~~ 436 (496) T protein:vir:38 357 SAGTFTFDENGLKTATEVVSEKSETYQTKNSHSQLIEQGIKEMIVSILEVGKFIEAYSGEVVELDTITVDFDDSIAQDED 436 (496) T ss_pred ChhhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCccceEEEeCCCCCCCHH Confidence 99999866544322 222222111111122222223 2222221 0 11233588999999999999 Q ss_pred HHHHHHHHHHhcC-CcHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccCCCCcCCCC Q lcl|NC_015263. 413 EAHDRYITDAQYG-FPVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTN 491 (513) Q Consensus 413 e~~~~~~~~~~~G-~~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~ 491 (513) +.++++.++.+.| .|...++....|++-.+....++-.+++-. .-.|.. +.|.+.. T Consensus 437 ~~~~~~~~~~~~GiiS~et~l~~~~~~~d~ea~~el~ri~~E~~---~~~~~~--------------------d~~~~~~ 493 (496) T protein:vir:38 437 TTINRYTNAKNQGMIPLKIALQRAWNITEAEADEWAEMLAKEKQ---AEMPNN--------------------DMNGIFG 493 (496) T ss_pred HHHHHHHHHHhcCCCCHHHHHHhcCCCChHHHHHHHHHHHHhhh---ccCccc--------------------cccCCCC Confidence 9999999999999 577777777779987765444333222101 001111 1111111 Q ss_pred ccc Q lcl|NC_015263. 492 ETT 494 (513) Q Consensus 492 et~ 494 (513) +.. T Consensus 494 ~~e 496 (496) T protein:vir:38 494 EEE 496 (496) T ss_pred CCC Confidence 100 No 148 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=85.59 E-value=0.052 Score=27.64 Aligned_cols=445 Identities=9% Similarity=0.027 Sum_probs=168.5 Q ss_pred CCCccchheeeeehhhhhhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhh Q lcl|NC_015263. 1 MVKNKKKRLSMIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQ 80 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~ 80 (513) |-+.- -+-|..-+.-++.--+...+..-+......-+ ....+.+++.|..|.+. -...++++-+|++.. T Consensus 1 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~i~~~i~~~~~~-~~~r~~~~~~yY~g~ 69 (501) T protein:vir:96 1 MEQTL-FTDSTGQERVLNLRFHRESRIRYRADNLEELM---------VNNWELLKNFINHHKLR-QAPRIQELLDYARGE 69 (501) T ss_pred Cceee-eeecccceeccccccchhHHhhhccccccccc---------CChHHHHHHHHHHHHHH-HHHHHHHHHHHhcCC Confidence 21100 00000000000000011111111111111111 11234456666665211 123466667776665 Q ss_pred cc-hHHH--------------------HHHHHhhcccccceE-eeccchhhhhhcchhHHHHH-HHHHHhhcChhHHHHH Q lcl|NC_015263. 81 SQ-QYQR--------------------LLNFYANMPLYAYSV-VPFKDISTANENKLKKELAT-VTEFLSRLNPKYNFSK 137 (513) Q Consensus 81 sg-~~~r--------------------lidy~~~mpt~dY~I-~P~~~~~~~~~~~~~~~y~~-v~~~L~k~n~k~~~~~ 137 (513) .. ++++ +++-++ +|.+ .|..- . ...+...+...+ +..++..-++...+.. T Consensus 70 ~~~i~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~-----~yl~g~p~~~-~-~~~~~~~~~~~~~l~~~~~~n~~~~~~~~ 142 (501) T protein:vir:96 70 NHDVLKSGRRKDNEMADKRAVHNYGRMISKFKT-----GYLAGNPIRV-E-YDDNDDNSQNDDAIKRIGRINDLDSLNRT 142 (501) T ss_pred CCcccCccccCccccccceeecchHHHHHHHHh-----hhhcccCeeE-e-eCCccchhHHHHHHHHHHHhcCHHHHHHH Confidence 43 1111 111111 1111 01000 0 000111121111 2234555578889999 Q ss_pred HHHHHHHhcceeEEEEEcCcc-eeeeecCcceeEEEEEE--CCeeEEEEEeeeccCc----chhccc-cHHHHHHHHHHh Q lcl|NC_015263. 138 IVKLAMTVDIFYGYVIDDKES-VMIQQFPNDICKISSVS--GGVYNYVIDLDALVSA----DIVDYY-PKEIQEAVNKYT 209 (513) Q Consensus 138 i~~~~l~~g~~~gy~i~d~~~-~~iq~lp~dyckIsg~~--nG~y~~~fD~syFd~~----~~L~~~-p~Ei~~~y~~Y~ 209 (513) +.+.+++.|..|.+...+.++ +-+..++|.-|.++--. .+.+.+++=.-+-... .+.+-| +..+.. |. T Consensus 143 ~~~~~~~~G~a~~~v~~dedg~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~vyt~~~i~~----~~ 218 (501) T protein:vir:96 143 LIRDLSQTGRAYEVIYRSEYDETRIKRLSPLETFVIYDNSLEDNSIAAVRYYNRGTLQSAKDVVEIYTDEHIYT----LD 218 (501) T ss_pred HHHHHhhcCeEEEEEEEcCCCceEEEEEccceeEEEEcCCCCCceEEEEEEEEeecCCCcEEEEEEEcCCcEEE----Ee Confidence 999999999999988776554 56888999999888553 2566666432221111 112212 111110 10 Q ss_pred hhhhccCcccccCeee-cCCceE----EEEecCccccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccc-c Q lcl|NC_015263. 210 TMKKGNNKSASNWYEI-QDKNSI----CIKINESSLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETR-S 283 (513) Q Consensus 210 ~~k~~~~~~~~~W~~L-~~~kt~----~ik~~~~~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~-~ 283 (513) ....|.++ ..+|.+ ++.+ -+.+.+++-|.. +.++.|..+.--. .....++...--.-.+. + . T Consensus 219 --------~~~~~~~~~~~~~~~g~vPvv~~-~nn~~g~sd~e~-v~~liDa~d~~~s-~~~~~~~~~~~~~l~i~-G~~ 286 (501) T protein:vir:96 219 --------ASDDFNEISVTTHAFGTVPITEY-LNNIDGIGDYET-ELYLIDLYDSAES-DTANHMSDMADAILAIY-GDL 286 (501) T ss_pred --------eCCCceeccccccCCCccceEEe-cCCccCCCchhh-hHHHHHHHHHHHH-HHHHHHHHhcCceeeee-ccc Confidence 00011111 111111 1222 122344444443 2333332221111 22222222111011111 0 0 Q ss_pred CCCCCccccCHHHHHHHHHHHHHhccccceEEEec-cccccccccccc-ccchhhhhhHHhhhhhhhhhhhhccCC--Cc Q lcl|NC_015263. 284 SNDNNDFTLDMPMMNYFHEALSMTVPDNVGVVTSP-MEIDTVSFDKDS-STDDSVEKATKNFWDNAGVSQILFSSD--NK 359 (513) Q Consensus 284 ~n~~~~~~vd~~~~~~~~~~ik~~Lp~gv~~v~sP-~~~d~i~ld~~~-~~~dtv~~~~~~i~~~~GiS~~Lfn~d--~~ 359 (513) ....+++..++.....+.- -+.+.+....+ .++.-+.-+... .....++-..++|+.-+|+...-+++. +. T Consensus 287 ~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~ 361 (501) T protein:vir:96 287 ALPKGMQASDMKRTRLMQL-----KPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNTPDMSDTNFSGNT 361 (501) T ss_pred ccCcccchhhhhhcCeeee-----cccccccccccCcceeeEeccCCHHHHHHHHHHHHHHHHHHhCCcccCcccccccc Confidence 1222333222221111100 01111110000 011111001001 111224445577888888876666443 33 Q ss_pred chHHHHHHHHHHHHHHHH----HHHHHHHHHH---HHHhhc--ccce---EEEEEecCCCCccHHHHHHHHHHHHhcC-C Q lcl|NC_015263. 360 TSQGIAMSIATDEQFIFG----VINQLERWLN---RYLLLN--GMSK---YFKATMLEVTHFSKKEAHDRYITDAQYG-F 426 (513) Q Consensus 360 s~~~~~~SI~~d~~~~~~----~~~~iE~~~N---~~i~~~--~~~~---~f~~~~l~~T~fn~ke~~~~~~~~~~~G-~ 426 (513) |+.+++.-...-...+-. |-+.|++.+. .++... .... ..++.|-+..+-|..+.++.+.++. | . T Consensus 362 Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~n~~e~ad~~~kl~--g~i 439 (501) T protein:vir:96 362 SGEALKYKLFGLDQDRVDTQSQFTKGLKRRYRLAARIGSLVNEFKDFDESLLKITFTPNLPKSLNEQVSILTGLG--GQV 439 (501) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccceEEeCCCCCcCHHHHHHHHHHHh--ccC Confidence 444444332222222221 2222222222 222221 1111 3689999999999999999999996 6 5 Q ss_pred cHHHHHHHHhCC--CHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccCCCCcCCCCcccccccCCCCCC Q lcl|NC_015263. 427 PVKVYLASLMGI--DPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETTGNKDSDETQR 504 (513) Q Consensus 427 ~~~~~laa~~G~--~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~~n~~~~~~~~ 504 (513) |....| +.+++ +|.+.+.++..|++.++.. ..+-+- . +..+ ...++-..++.+...+.. T Consensus 440 S~et~~-~~l~~v~D~~~E~~ri~~E~~~~~~~--~~~~~~------~---~~~~-~~~~~~~e~~~d~~e~~~------ 500 (501) T protein:vir:96 440 SQETAL-SLSGLVESPNEELDKINKEMSEIDFK--GYSNDF------N---EHVG-KYTDEVKETHTDDFEREY------ 500 (501) T ss_pred chHHHH-HhCCCCCCHHHHHHHHHHHHHHhhcc--ccccch------h---hccc-ccCCcCCCCCCCcccccc------ Confidence 555544 45777 6889999999998664411 111110 0 0000 000110101001011111 Q ss_pred C Q lcl|NC_015263. 505 A 505 (513) Q Consensus 505 ~ 505 (513) . T Consensus 501 ~ 501 (501) T protein:vir:96 501 E 501 (501) T ss_pred C Confidence 1 No 149 >protein:vir:79207 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111036;genbank:gi:134288763;genbank:GeneID:4960726 Probab=85.49 E-value=0.052 Score=27.60 Aligned_cols=318 Identities=13% Similarity=0.069 Sum_probs=145.3 Q ss_pred CCCccchheee-eehhhhhhHHHHHHHHHHHHHhhccCccccccccc---------ccchHHHHHHHhh-hcc--ChhHH Q lcl|NC_015263. 1 MVKNKKKRLSM-IDVESISSYSNKRNNRISILRDDNRTPVFGAPVGS---------LTSSQSKVRKIVK-EYR--NEGNQ 67 (513) Q Consensus 1 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s---------~~~s~d~~k~~i~-~~~--P~~n~ 67 (513) |-|.|+.+... --. .+..+ -++ +|. .+..|+ .....|-++-+.. +|+ |..- T Consensus 1 ~~~~~~~~~~~~~~~------~~~~~----~~~----~~~-~~~~~~~~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~- 64 (351) T protein:vir:79 1 MSKRRSRAPRTFAAA------PNPSA----GSA----APA-RAEVFTFDDPTPVMNRAEILDYVECWSNGEWFEPPVSF- 64 (351) T ss_pred CCCCCCCCCCCCCCC------Cchhh----hhc----ccc-eeEEEEcCCceeecCcchhhhhhhhhhcCceecCCCCH- Confidence 87766654321 110 00000 000 010 111222 1111222222222 243 4442 Q ss_pred HHHHHHHHHHHhhcchHHHHHHHHhhcccccceEeeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcc Q lcl|NC_015263. 68 KTLRKVSEDLAVQSQQYQRLLNFYANMPLYAYSVVPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDI 147 (513) Q Consensus 68 ~~ir~~s~~lY~~sg~~~rlidy~~~mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~ 147 (513) ..|++. +..+++-.+.|....+|..-.|.=-|. |. ..+|..+..+.+..|. T Consensus 65 ---~~la~~-~~~~~~h~~~l~~k~n~l~~~~~Pnp~---------------------~t----~~~f~~~v~d~ll~Gn 115 (351) T protein:vir:79 65 ---AGLAKS-FRASTHHSSALFFKANVLASTFRPHRW---------------------LS----RHAFERWALDFLTFGN 115 (351) T ss_pred ---HHHHHH-HhhhHhhhhhhhhhhhHHhhcccCCCC---------------------CC----HHHHHHHHHHHHhcCC Confidence 223332 344444455555444444333221111 00 3345667778889999 Q ss_pred eeEEEEEc--CcceeeeecCcceeEEEEEECCeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeee Q lcl|NC_015263. 148 FYGYVIDD--KESVMIQQFPNDICKISSVSGGVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEI 225 (513) Q Consensus 148 ~~gy~i~d--~~~~~iq~lp~dyckIsg~~nG~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L 225 (513) .|.+.+-+ +..+-+.++|+.|+++.-..+ .|.+... ... =+++ T Consensus 116 ay~~~~r~~~G~~~~L~~l~~~~v~~~~~~~-~~~~~~~-----~g~-----------------------------~~~~ 160 (351) T protein:vir:79 116 GYLERRRNMVGGTLRLEPALAKYVRRKADFS-GFVYVNG-----WQE-----------------------------RHEF 160 (351) T ss_pred eEEEEEECCCCCEEEEEEeCCcceeeeecCC-eEEEEec-----Cce-----------------------------EEEE Confidence 99999875 455899999999999863333 3333221 000 0123 Q ss_pred cCCceEEEEe-c-CccccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHH Q lcl|NC_015263. 226 QDKNSICIKI-N-ESSLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEA 303 (513) Q Consensus 226 ~~~kt~~ik~-~-~~~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ 303 (513) +++.-+.|+- + ....+|+|+..+.+..+.--+...+... .=..|-. ..+-|-+. . +..++.++.+.+.+. T Consensus 161 ~~~eIihir~~~~~~~~yGl~~~~~a~~si~l~~~a~~~~~--~~f~NGa-~pg~il~~---~--~~~ls~e~~~~lk~~ 232 (351) T protein:vir:79 161 EPDSVFQLVRPDINQEVYGLPEYLSSLHSAWLNESSTLFRR--KYYENGS-HAGFILYM---T--DAAQKQDDVDNMRDA 232 (351) T ss_pred cCccEEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHHH--HHHhccC-CCceEEEe---c--CCCCCHHHHHHHHHH Confidence 3333333442 2 2345799999888887765555444321 1122311 11111110 0 135778888888777 Q ss_pred HHHhccccce-----EEEeccc-cccccc---ccccccchh---hhhhHHhhhhhhhhhhhhccCC---CcchHHH-HHH Q lcl|NC_015263. 304 LSMTVPDNVG-----VVTSPME-IDTVSF---DKDSSTDDS---VEKATKNFWDNAGVSQILFSSD---NKTSQGI-AMS 367 (513) Q Consensus 304 ik~~Lp~gv~-----~v~sP~~-~d~i~l---d~~~~~~dt---v~~~~~~i~~~~GiS~~Lfn~d---~~s~~~~-~~S 367 (513) +++. .|++ .|..|-- -+.+++ .....+.+. .+-+.++|..+.||...|.|-- +.+.+.+ +.. T Consensus 233 ~~~~--~G~~N~~~~~v~~~~g~~~gi~~~pl~~~~~d~ef~e~k~~s~~eI~~a~~VPp~llGi~~~~t~~~~n~e~~~ 310 (351) T protein:vir:79 233 LKNA--KGPGNFRNVFMYAPGGKKDGIQLIPVSEVAAKDEFFNIKNVTRDDLLAAHRVPPQLLGIVPSNSGGFGTPDTAA 310 (351) T ss_pred HHHh--cCccccCceeEecCCCCccceEEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHHHH Confidence 7764 3432 3444321 123333 222333333 3344578999999998888531 1112222 222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcccceEEEEEecCCCCccHHHHHHHHHHH Q lcl|NC_015263. 368 IATDEQFIFGVINQLERWLNRYLLLNGMSKYFKATMLEVTHFSKKEAHDRYITD 421 (513) Q Consensus 368 I~~d~~~~~~~~~~iE~~~N~~i~~~~~~~~f~~~~l~~T~fn~ke~~~~~~~~ 421 (513) ..--..-+.-++++||. +|..|-.+ .++ |+..+.+..-.++ T Consensus 311 ~~f~~~~l~Pl~~~ie~-ln~~lg~~----~~~--------F~~~~llr~d~~a 351 (351) T protein:vir:79 311 RVFGRNEIRPLQARFAE-LNDWLGDE----VVT--------FDDYEIPPAPVAA 351 (351) T ss_pred HHHHHHHHHHHHHHHHH-HHhhcCcc----eee--------eChhhhccccccC Confidence 22222223346667764 56554222 122 3444444443333 No 150 >protein:vir:98567 Length: 340 # NCBI annotation: gp1 # Family: family:all:196 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958056;genbank:gi:41057353;genbank:GeneID:2744238 Probab=84.96 E-value=0.056 Score=27.43 Aligned_cols=308 Identities=13% Similarity=0.113 Sum_probs=141.6 Q ss_pred CCCccchheeeeehhhhhhHHHHHHHHHHHHHhhccCcccccccccccc---------hHHHHHHHhh-hcc--ChhHHH Q lcl|NC_015263. 1 MVKNKKKRLSMIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTS---------SQSKVRKIVK-EYR--NEGNQK 68 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~---------s~d~~k~~i~-~~~--P~~n~~ 68 (513) |-|+|+ |-..-... .+| ..+..|+-+. ..|-.+-+-. +|+ |..- T Consensus 1 m~~~~~-~~~~~~~~--------------------~~~-~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~pp~~~-- 56 (340) T protein:vir:98 1 MSKRKP-RKAVAMTA--------------------SAP-QKMEAFTFGEPVPVLDKRDILDYVECISNGKWYEPPVSF-- 56 (340) T ss_pred CCCCCC-Cccccccc--------------------cCc-cceeEEEcCCceeecCcchhhhhhhhhhcCceecCCCCH-- Confidence 776654 42211100 000 0112222110 0111122111 143 3332 Q ss_pred HHHHHHHHHHhhcchHHHHHHHHhhcccccceEeeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcce Q lcl|NC_015263. 69 TLRKVSEDLAVQSQQYQRLLNFYANMPLYAYSVVPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDIF 148 (513) Q Consensus 69 ~ir~~s~~lY~~sg~~~rlidy~~~mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~~ 148 (513) +.|++. +.++++-.+.|.+..+|-.-.|.=-|. |. .++|..+..+.+.-|.. T Consensus 57 --~~la~l-~~a~~~h~s~i~~k~n~l~~~~~Pn~~---------------------lt----~~~f~~~~~d~ll~Gna 108 (340) T protein:vir:98 57 --SGLAKS-LRSAVHHSSPIYVKRNVLASTYIPHPL---------------------LS----RQDFSRFALDYLVFGNA 108 (340) T ss_pred --HHHHHH-HHhccccchhhhhhhhHHhhccCCCCC---------------------CC----HHHHHHHHHHHHhcCCe Confidence 223332 345555555555444443332211110 00 34456677788888999 Q ss_pred eEEEEEcC--cceeeeecCcceeEEEEEECCeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeec Q lcl|NC_015263. 149 YGYVIDDK--ESVMIQQFPNDICKISSVSGGVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQ 226 (513) Q Consensus 149 ~gy~i~d~--~~~~iq~lp~dyckIsg~~nG~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~ 226 (513) |.+.+.+. ..+-+.++|+.||++. .+|...|.+.. .. .=++++ T Consensus 109 y~~~~rn~~G~~~~L~pl~~~~vr~~--~~~~~~~~~~~----~~-----------------------------~~~~~~ 153 (340) T protein:vir:98 109 FLEQRHSVTGQLIKLLTSPAKYTRRG--VDDSVFWFVEN----FT-----------------------------QPHEFA 153 (340) T ss_pred EEEEEECCCCcEEEEEEeCCceEEEc--ccCcEEEEEec----CC-----------------------------eEEEEc Confidence 99998765 4567889999999985 45443221100 00 012344 Q ss_pred CCceEEEEe-c-CccccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHH Q lcl|NC_015263. 227 DKNSICIKI-N-ESSLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEAL 304 (513) Q Consensus 227 ~~kt~~ik~-~-~~~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~i 304 (513) .+.-+-|+- + ....+|+|+..+....+.--+.......- -..|-. .-+-|-+. .+..++.++++.+.+.+ T Consensus 154 ~~eViHir~~~~~~~~~Gls~~~~a~~si~l~~aa~~~~~~--~f~NGa-~pg~il~~-----~~~~ls~e~~~~lk~~~ 225 (340) T protein:vir:98 154 PDTVFHLLEPDINQEIYGLPEYLSALNSAWLNESATLFRRK--YYQNGA-HAGYIMYV-----TDPAQSATDVESLRDAM 225 (340) T ss_pred cccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHH--HHhccC-CCceEEEe-----cCCCCCHHHHHHHHHHH Confidence 433444542 3 24557999999988877655444443211 122311 11111110 12447778887777777 Q ss_pred HHhccccce-----EEEecc----cccccccccccccchh---hhhhHHhhhhhhhhhhhhccC--C-Ccc-hHHHHHHH Q lcl|NC_015263. 305 SMTVPDNVG-----VVTSPM----EIDTVSFDKDSSTDDS---VEKATKNFWDNAGVSQILFSS--D-NKT-SQGIAMSI 368 (513) Q Consensus 305 k~~Lp~gv~-----~v~sP~----~~d~i~ld~~~~~~dt---v~~~~~~i~~~~GiS~~Lfn~--d-~~s-~~~~~~SI 368 (513) ++. .|.+ .|.+|- .++-+++.....+.+. -+-..++|..+.||...|.|- + +.+ ++.-+... T Consensus 226 ~~~--~G~~n~~~~~vl~~~g~~~g~~~~pls~~~~d~qf~e~k~~~~~eIa~a~~VPp~llGi~~~~t~~~sn~e~~~~ 303 (340) T protein:vir:98 226 RNS--KGLGNFKNLFFYSPNGKPDGIKIVPLSEVATKDDFFNIKKASAADLMDAHRVPFQLMGGKPENIGSLGDVEKVAK 303 (340) T ss_pred HHh--cCccccCceeEecCCCCccceEEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHHHhcccCCCCCccccHHHHHH Confidence 763 4443 444442 1122233333333333 333347899999999888852 2 111 22233333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcccceEEEEEecCCCCccHHH Q lcl|NC_015263. 369 ATDEQFIFGVINQLERWLNRYLLLNGMSKYFKATMLEVTHFSKKE 413 (513) Q Consensus 369 ~~d~~~~~~~~~~iE~~~N~~i~~~~~~~~f~~~~l~~T~fn~ke 413 (513) .--..-+.-++++||. +|..|-.+. .+|+-+-+..+ | T Consensus 304 ~f~~~~l~Pl~~~iee-~n~~L~~e~--~rF~~~~l~~~-----d 340 (340) T protein:vir:98 304 VFVRNELSPLQDRFRE-VNDWLGMEV--IRFKEYTLDNP-----E 340 (340) T ss_pred HHHHHHHHHHHHHHHH-HHhcccccc--cccCccccccC-----C Confidence 2222223346777774 565553222 12333333322 1 No 151 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=84.41 E-value=0.061 Score=27.26 Aligned_cols=421 Identities=10% Similarity=0.112 Sum_probs=155.0 Q ss_pred HHHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhcch-------------------HHH Q lcl|NC_015263. 26 NRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQQ-------------------YQR 86 (513) Q Consensus 26 ~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~-------------------~~r 86 (513) .-+.+-+++-.+=. -.+.+.+++.|..| ......++.+-.|+.....+ .+. T Consensus 1 ~~~~~~~~~~~~~~--------~~~~~~i~~~i~~~--~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~ki~~n~~~~ 70 (499) T protein:vir:10 1 MAVVIDKDLLDDVN--------EPNIEAINYAIREL--QNRKKRLDKLSDYYNGKQEIEKHEFDNATVEAANVMVNHAKY 70 (499) T ss_pred CccchhhhHHhhhh--------cCCHHHHHHHHHHH--HHHHHHHHHHHHHhccccchhcCCcCcCCCCcceeecchHHH Confidence 12222222211100 00123344545544 33445556666665554322 222 Q ss_pred HHHHHhhcccccceE-eeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcceeEEEEEcCccee----- Q lcl|NC_015263. 87 LLNFYANMPLYAYSV-VPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDIFYGYVIDDKESVM----- 160 (513) Q Consensus 87 lidy~~~mpt~dY~I-~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~~~gy~i~d~~~~~----- 160 (513) +++-.++ |.+ .|..- ....+ +.......+++.-++...+..+.+.+++.|..|.+..-+.++.+ T Consensus 71 Iv~~~~~-----~l~g~p~~~--~~~~~---~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~g~~~~~~~ 140 (499) T protein:vir:10 71 ITDMNVG-----FMTGNPVKY--VAEKG---KNIDDILEVFNQIDIHKHDIELEKDLSVFGYGYELLYLKKTDPISVRDE 140 (499) T ss_pred HHHHHhh-----hhcccCcee--ecCCh---hHHHHHHHHHhhcCHhHHHHHHHHHHHhcCceEEEEEeccccccccccc Confidence 2222221 110 01000 00111 22233445566667888899999999999999988876655422 Q ss_pred -------------eeecCcceeEEEEEECCe--eEEEEEeeeccC------c--chhccc-cHHHHHHHHHHhhhhh--- Q lcl|NC_015263. 161 -------------IQQFPNDICKISSVSGGV--YNYVIDLDALVS------A--DIVDYY-PKEIQEAVNKYTTMKK--- 213 (513) Q Consensus 161 -------------iq~lp~dyckIsg~~nG~--y~~~fD~syFd~------~--~~L~~~-p~Ei~~~y~~Y~~~k~--- 213 (513) +.-++|.=|.++.-..+. ..++ +++... . ..++-| |..+.. |..... T Consensus 141 ~~~~~~~~~~~~~~~~v~p~~~~~v~~d~~~~~~~~~--i~~~~~~~~~~~~~~~~~~iyt~~~i~~----~~~~~~~~~ 214 (499) T protein:vir:10 141 LGNEKLTPNTELKIEVIDPRATVVVCDDTVEHDPLFA--VFTQEKKDLEGNTNGYSITVYMPQRIVE----YRTKTTMEV 214 (499) T ss_pred ccccccccccceEEEEEcccceEEEecCCCCcceEEE--EEEEEEeecCCCceEEEEEEEeCCeEEE----EEecCCccc Confidence 333444444333221111 1111 111100 0 011111 111111 100000 Q ss_pred -----ccCcccccCeeecCCceEEEEecCccccchhhHHHHHHhHHHHHHH-HHHH-hhHhhhhhceeeeeeeccccCCC Q lcl|NC_015263. 214 -----GNNKSASNWYEIQDKNSICIKINESSLTPVPPFAGTFDSIYDIHSF-KDLR-NDKAELQNYKLLIQKLETRSSND 286 (513) Q Consensus 214 -----~~~~~~~~W~~L~~~kt~~ik~~~~~~~~ip~f~~v~~d~~di~~~-kdL~-~~~~~i~n~~ii~~kip~~~~n~ 286 (513) ........|=.+| ++.+-. ...+.+ .|.++.++.+. ..+. +....++..+--.-.+. |.. T Consensus 215 ~~~~~~~~~~~~~~g~vP-----vv~~~n-~~~~~~----d~e~v~~liD~~~~~~S~~~~~~~~~~~~~lv~~---G~~ 281 (499) T protein:vir:10 215 SANDPIVYDGENLFGAVP-----IIEFRN-NEERQG----DFEQLISLIDAYNLLQTDRISDKEAFVDALLVTF---GFG 281 (499) T ss_pred cCcceecccccCCCCccc-----eEEecC-CCCCCC----chHhHHHHHHHHHHHHHHHHHHHHHhcCceeeee---cCc Confidence 0000000111111 122211 122323 33333333332 1111 12222222111111111 111 Q ss_pred CCccccCHHHHHHHHHHHHHhcc--ccc--eEEEecccccccccccccccchhhhhhHHhhhhhhhhhhhhccC--CCcc Q lcl|NC_015263. 287 NNDFTLDMPMMNYFHEALSMTVP--DNV--GVVTSPMEIDTVSFDKDSSTDDSVEKATKNFWDNAGVSQILFSS--DNKT 360 (513) Q Consensus 287 ~~~~~vd~~~~~~~~~~ik~~Lp--~gv--~~v~sP~~~d~i~ld~~~~~~dtv~~~~~~i~~~~GiS~~Lfn~--d~~s 360 (513) -++.. ..........-.+++ +|. ..++.+.+. ......++--.++|+.-+++...-+.. .+.| T Consensus 282 ~~~~~---~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~--------~~~~~~~~~l~~~I~~~s~~p~~~~~~~~gn~S 350 (499) T protein:vir:10 282 LGDDK---DDIQRLKRGAIEAPPREEGADIEWLTKSFDE--------TQVNLLSQSIENDIHKISYVPNMNDEKFMGNVS 350 (499) T ss_pred ccccc---chhhhhhhcceeccCCCCCCcceEEeccCCH--------HHHHHHHHHHHHHHHHHhCcccCCchhhcccch Confidence 01111 111111111111111 111 111111111 111122444456787777766544422 2334 Q ss_pred hHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHhhcccce---EEEEEecCCCCccHHHHHHHHHHHHhcC-CcHH Q lcl|NC_015263. 361 SQGIAMSIATDEQFIFG-------VINQLERWLNRYLLLNGMSK---YFKATMLEVTHFSKKEAHDRYITDAQYG-FPVK 429 (513) Q Consensus 361 ~~~~~~SI~~d~~~~~~-------~~~~iE~~~N~~i~~~~~~~---~f~~~~l~~T~fn~ke~~~~~~~~~~~G-~~~~ 429 (513) +..++.....-.+.+-. .++++=+.+-.++....... ...+.|-+..+-|..+.++.+.++. | .|.. T Consensus 351 g~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~d~~~i~i~f~~~~p~n~~e~~~~~~kl~--g~iS~e 428 (499) T protein:vir:10 351 GEAMKFKLFGLENLLSIKQRYFFDGLRRRLKLIQTIVNIKGANDDASGCKISLVANIPSNLSDVVNNVKNAD--GIIPRK 428 (499) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCccccccceEEeCCCCCCCHHHHHHHHHHHh--ccCChH Confidence 44444433322222222 22222222222333222222 3588899999999999999999984 5 6666 Q ss_pred HHHHHHhCC--CHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccCCCCcCCCCcccccccCCCCCCCCC Q lcl|NC_015263. 430 VYLASLMGI--DPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETTGNKDSDETQRAKD 507 (513) Q Consensus 430 ~~laa~~G~--~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~~n~~~~~~~~~~d 507 (513) ..+. .+++ +|.+.+.++..|++. .......| ..+.+++....... .+...| ..++.++.+++++ T Consensus 429 t~~~-~l~~v~d~~~E~~ri~~E~~~-~~~~~~~~------~~~~~~~~~~~~~~-~~~~~~-----~~~~~~~~~~~~~ 494 (499) T protein:vir:10 429 YTYS-WLPDVDNPQDVIDEMNQQDAE-TIKKNQEA------LRGQDPDRLELEDK-QDDSSE-----NDKEAGSNHNQSH 494 (499) T ss_pred HHHH-hCCCCCCHHHHHHHHHHHHHH-HHHHHHhh------hccCCCCCCCCCCC-CcccCC-----CCCCCccccccCC Confidence 6555 5676 588899999888754 11111111 11221111100000 011111 1111111122111 Q ss_pred CccCC Q lcl|NC_015263. 508 KPANT 512 (513) Q Consensus 508 ~~~~~ 512 (513) .-|-. T Consensus 495 ~~~~~ 499 (499) T protein:vir:10 495 RTRAV 499 (499) T ss_pred CCCCC Confidence 11111 No 152 >protein:vir:3743 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043484;genbank:gi:9628619;genbank:GeneID:1261113 Probab=84.25 E-value=0.062 Score=27.21 Aligned_cols=316 Identities=14% Similarity=0.077 Sum_probs=147.7 Q ss_pred CCCccchheeeeehhhhhhHHHHHHHHHHHHHhhccCcccccccccccc--hHHHHHH----Hhhh--cc--ChhHHHHH Q lcl|NC_015263. 1 MVKNKKKRLSMIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTS--SQSKVRK----IVKE--YR--NEGNQKTL 70 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~--s~d~~k~----~i~~--~~--P~~n~~~i 70 (513) |-|+++ |..+..-. .+|. .+..||..- -.+.+.. +..+ |+ |.. -..| T Consensus 1 ~~~~~~-~~~~~~~~--------------------~~~~-~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~epp~~-~~~l 57 (345) T protein:vir:37 1 MKTNVK-TDNKKGIV--------------------IAPI-NDRTFSLSEITASPALDYVGIGFDENYNCYLPPVN-RHAL 57 (345) T ss_pred CCcccc-ccchhhhc--------------------CCCc-eEEEeecCCcccchhhcccceeeecCCccccCCCC-HHHH Confidence 777654 54331100 0111 122333211 0001000 0112 33 433 2222 Q ss_pred HHHHHHHHhhcchHHHHHHHHhhcccccceEeeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcceeE Q lcl|NC_015263. 71 RKVSEDLAVQSQQYQRLLNFYANMPLYAYSVVPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDIFYG 150 (513) Q Consensus 71 r~~s~~lY~~sg~~~rlidy~~~mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~~~g 150 (513) .+ |+..|++-.+.|..-.+|-...| .| +... . .++|..+..+++..|..|. T Consensus 58 a~----~~~~~~~h~~~i~~k~n~l~~~~--~P---------n~~~----------t----~~~f~~~v~d~ll~Gnay~ 108 (345) T protein:vir:37 58 AK----LPHQNAQHGGILHSRANMVSATY--EG---------GKAL----------S----KMEMRALCLNLIQFGDVGL 108 (345) T ss_pred HH----HhhcchhhcchhhhhhhHHhhcc--CC---------CCCC----------C----HHHHHHHHHHHHhcCCeEE Confidence 22 23555555555554444433322 12 0000 0 3445667778888999999 Q ss_pred EEEEcCcc--eeeeecCcceeEEEEEECCeeEEE-EEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecC Q lcl|NC_015263. 151 YVIDDKES--VMIQQFPNDICKISSVSGGVYNYV-IDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQD 227 (513) Q Consensus 151 y~i~d~~~--~~iq~lp~dyckIsg~~nG~y~~~-fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~ 227 (513) +.+.+..+ +.+.++|+.||++. .+|...+. -+-.+...... ++++. T Consensus 109 ~i~rn~~G~~~~L~pl~~~~vr~~--~d~~~~~~~~~~~~~~~g~~-----------------------------~~~~~ 157 (345) T protein:vir:37 109 LKVRNGFGQVVRLVPLSSLYLRVH--KDGGYSYLMKKSLYDTAQEI-----------------------------YRYDA 157 (345) T ss_pred EEEECCCCCEEEEEEecCceeEEe--ecCCeeEEEeeeeeccCceE-----------------------------EEEcc Confidence 99987554 67889999999974 44433322 22222111111 12333 Q ss_pred CceEEEEe-c-CccccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHHH Q lcl|NC_015263. 228 KNSICIKI-N-ESSLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALS 305 (513) Q Consensus 228 ~kt~~ik~-~-~~~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik 305 (513) +--+-|+- + ....+|+|+..+.+..+.--+...+... .-..|-. ..+-|-.. .+..++.++.+.+.+.++ T Consensus 158 ~eViHir~~~~~~~~~Gl~~~~~a~~si~l~~~a~~~~~--~~f~NGa-~~~~Il~~-----t~~~l~~e~~~~lk~~~~ 229 (345) T protein:vir:37 158 KDIIFIKLYDPMQQVYGSPDYVGGIQSALLNSDATVFRR--RYFSNGA-HMGFILYS-----TDPDLTEEMEEEIARKIS 229 (345) T ss_pred ccEEEEcCCCCCCCcccchHHHHHHHHHHHHHHHHHHHH--HHHhccC-CcceEEEe-----CCCCCCHHHHHHHHHHHH Confidence 32333442 2 2345799999998887765554444331 1123321 11111110 123577777777777777 Q ss_pred Hhc-c-c-cceEEEeccc-cccc---ccccccccchh---hhhhHHhhhhhhhhhhhhccC---CCcchHHH-HHHHHHH Q lcl|NC_015263. 306 MTV-P-D-NVGVVTSPME-IDTV---SFDKDSSTDDS---VEKATKNFWDNAGVSQILFSS---DNKTSQGI-AMSIATD 371 (513) Q Consensus 306 ~~L-p-~-gv~~v~sP~~-~d~i---~ld~~~~~~dt---v~~~~~~i~~~~GiS~~Lfn~---d~~s~~~~-~~SI~~d 371 (513) +.- + + +...+.+|-- -+.+ ++.....+.+. -+...++|..+.||...|.|- .+++.+.+ +....-- T Consensus 230 ~~~g~~n~~~~~i~~~~g~~~G~~~~pl~~~~~d~qf~e~k~~~~~dI~~a~~VPp~liGi~~~~t~~~s~~e~~~~~f~ 309 (345) T protein:vir:37 230 ESKGVGNFRSMFVNIAGGHPDGLKVIPIGDTGTKDEFANIKNISAQDVLTAHRFPAGLSGIIPTNTGGLGDPLKYREVYH 309 (345) T ss_pred HhcCccccCceeEecCCCCccceeEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHhccccCCCCCcccHHHHHHHHH Confidence 653 1 1 1123444310 1223 22222223232 344457899999999888852 12222222 2332222 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcccceEEEEEecCCCC Q lcl|NC_015263. 372 EQFIFGVINQLERWLNRYLLLNGMSKYFKATMLEVTH 408 (513) Q Consensus 372 ~~~~~~~~~~iE~~~N~~i~~~~~~~~f~~~~l~~T~ 408 (513) ..-+.-++++||..+|+.+.-. ....|+|.--.++- T Consensus 310 ~~~l~P~~~~ie~~ln~~~e~~-~~~~i~F~~~~l~k 345 (345) T protein:vir:37 310 YDEVMPLQEIIAETINQDPEIK-NLLKIKFREQNFAK 345 (345) T ss_pred HHHHHHHHHHHHHHhhhhhccC-CcceEEECchhhcC Confidence 2233458889999999865422 22334444333332 No 153 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=83.55 E-value=0.068 Score=27.00 Aligned_cols=419 Identities=11% Similarity=0.077 Sum_probs=172.4 Q ss_pred eeeeehhhhhhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhc------- Q lcl|NC_015263. 9 LSMIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQS------- 81 (513) Q Consensus 9 ~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~s------- 81 (513) ...--.--|.--.|++++.+++..+++..- ..+.+++.|+.|. ......++.+-+|+.... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------~~~~i~~~i~~~~-~~~~~~~~~~~~yY~g~~~~i~~~~ 68 (481) T protein:vir:10 1 MTVYTINNINTKFSPLANDDFVVSDLAELL-----------KEENLRNFISRHQ-TEQVPRLEMLESYYLNRNTDILAGE 68 (481) T ss_pred CeeEeeehhchhcccccCceeeeecchhhc-----------CHHHHHHHHHHHH-HHHHHHHHHHHHHhcCCCcccccCc Confidence 111111123333455555555444433222 2233444555442 112233444555544432 Q ss_pred ----------------chHHHHHHHHhhcccccceE-ee--ccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHH Q lcl|NC_015263. 82 ----------------QQYQRLLNFYANMPLYAYSV-VP--FKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLA 142 (513) Q Consensus 82 ----------------g~~~rlidy~~~mpt~dY~I-~P--~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~ 142 (513) +..+.+++..++ |.+ .| +.. ..+...+ .+..++..-++...+..+.+.+ T Consensus 69 ~~~~~~~~~~~~ki~~n~~~~ivd~~~~-----~l~g~~~~~~~----~d~~~~~---~l~~~~~~n~~~~~~~~~~~~~ 136 (481) T protein:vir:10 69 RRLQKYGDKADHRAVHNYAKYVSRFIVG-----YLTGNPITITH----QDNQTND---KIIELNDLNDADEVNSDLALNL 136 (481) T ss_pred cccccccccccceeecchHHHHHHHHHh-----hhccCCceEec----CChhHHH---HHHHHHHhcChhHHHHHHHHHH Confidence 233333333332 211 11 111 1122222 2334566667889999999999 Q ss_pred HHhcceeEEEEEcCcc-eeeeecCcceeEEEEEEC--CeeEEEEEeeeccCc--c---hhccc-cHHHHHHHHHHhhhhh Q lcl|NC_015263. 143 MTVDIFYGYVIDDKES-VMIQQFPNDICKISSVSG--GVYNYVIDLDALVSA--D---IVDYY-PKEIQEAVNKYTTMKK 213 (513) Q Consensus 143 l~~g~~~gy~i~d~~~-~~iq~lp~dyckIsg~~n--G~y~~~fD~syFd~~--~---~L~~~-p~Ei~~~y~~Y~~~k~ 213 (513) ++.|..|.+..-+.++ +-+..+||+.|.++--.. +.+.+++-.-..+.. . ..+-| +..+.. |. T Consensus 137 ~~~G~~~~~~~~d~dg~~~i~~~~p~~~~~v~d~~~~~~~~~~i~~~~~~~~~~~~~~~~~~y~~~~i~~----~~---- 208 (481) T protein:vir:10 137 SIYGRAYEIVYRDFEDRDTFKVLDPKSTFVVYDQTLDKKVVAGVRYFEKQDKDKVPVQHVEVYTTDKIYY----IE---- 208 (481) T ss_pred HhcCeEEEEEEeCCCCeEEEEEEcccceEEEEcCCCCCceEEEEEEEEEeeCCCceEEEEEEEecCeEEE----EE---- Confidence 9999998887766555 567788999988774322 345554322111111 1 11112 222211 10 Q ss_pred ccCcccccCeeecC-Cce----EEEEecCccccchhhHHHHHHhHHHHHHH--HHHHhhHhhhhhceeeeeeeccccCCC Q lcl|NC_015263. 214 GNNKSASNWYEIQD-KNS----ICIKINESSLTPVPPFAGTFDSIYDIHSF--KDLRNDKAELQNYKLLIQKLETRSSND 286 (513) Q Consensus 214 ~~~~~~~~W~~L~~-~kt----~~ik~~~~~~~~ip~f~~v~~d~~di~~~--kdL~~~~~~i~n~~ii~~kip~~~~n~ 286 (513) . .+..|..+.. +|. .++.+- .+.++.+-+.. +.++.|..+. -++-+......+- ++ .++ +..+. T Consensus 209 ~---~~~~~~~~~~~~~~~g~vPvv~~~-n~~~g~~~~~~-v~~lida~~~~~s~~~~~~~~~~~~-~~--~~~-g~~~~ 279 (481) T protein:vir:10 209 I---KGGTYHRVEEVEHYYNDVPIIEYL-NDQFKQGDFEN-VIALIDLYDSAQSDTANYMTDLNDA-ML--AII-GNVDL 279 (481) T ss_pred e---cCCceeecccccccCCceeEEEee-cCCCCCCchhh-HHHHHHHHHHHHHHHHHHHHHhcCc-ee--Eee-cCcCC Confidence 0 0123433211 111 122221 12334443332 2333322211 1111222222221 11 111 00111 Q ss_pred CCccccCHHHHHHHHHHHHHhccccceEEEecccccccccccc----cccchhhhhhHHhhhhhhhhhhhhccCCCcchH Q lcl|NC_015263. 287 NNDFTLDMPMMNYFHEALSMTVPDNVGVVTSPMEIDTVSFDKD----SSTDDSVEKATKNFWDNAGVSQILFSSDNKTSQ 362 (513) Q Consensus 287 ~~~~~vd~~~~~~~~~~ik~~Lp~gv~~v~sP~~~d~i~ld~~----~~~~dtv~~~~~~i~~~~GiS~~Lfn~d~~s~~ 362 (513) +++ ....+...-...+|.++.. ..+-+=..+.+-.. ......++-..++|+.-+|+-..-+++.+++.+ T Consensus 280 ~~~------~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~S 352 (481) T protein:vir:10 280 DSE------DAKAFRDANMIHLEPGTNA-NGSEGKAEVKYVYKQYDVAGVEAYKKRLQNDIHKYTNTPDLNDEQFSGVQS 352 (481) T ss_pred Ccc------chhhhhhccceeccccccc-cCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccccccH Confidence 221 1111111111111211110 00000001111110 111123445556788888887776655433334 Q ss_pred H--HHHHHHHHHHHHHHHHHH----HHH---HHHHHHhhcccc----eEEEEEecCCCCccHHHHHHHHHHHHhcC-CcH Q lcl|NC_015263. 363 G--IAMSIATDEQFIFGVINQ----LER---WLNRYLLLNGMS----KYFKATMLEVTHFSKKEAHDRYITDAQYG-FPV 428 (513) Q Consensus 363 ~--~~~SI~~d~~~~~~~~~~----iE~---~~N~~i~~~~~~----~~f~~~~l~~T~fn~ke~~~~~~~~~~~G-~~~ 428 (513) | +......-.+.+-..... +++ .+-+.+...... ..+.+.|-+..+-|..+.++.+.++. | .|. T Consensus 353 g~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~~a~~~~kl~--g~is~ 430 (481) T protein:vir:10 353 GESMKYKLFGLEQVRAIKERLFKKGLMKRYKLLLNNVNLTGLKQHNYAELTITFTPNLPKSMMESINAFNALS--GGVSE 430 (481) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCccccceeeEEeCCCCCcCHHHHHHHHHHHh--ccCCh Confidence 3 333322222222222222 222 222222222221 25799999999999999999999985 5 666 Q ss_pred HHHHHHHhCC--CHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccCCCCcCCCCcccccccCCCC Q lcl|NC_015263. 429 KVYLASLMGI--DPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETTGNKDSDET 502 (513) Q Consensus 429 ~~~laa~~G~--~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~~n~~~~~~ 502 (513) .-.| ..+|+ +|.+.+.++..|++... +..- +.+- + .+.++++.| +.|++ T Consensus 431 et~~-~~l~~i~d~~~E~~ri~~E~~~~~------~~~~---~~~~-~------~~~~~~~~~--------dd~~g 481 (481) T protein:vir:10 431 STRL-SLLDFIDNPKEELEKMQEEEAQRE------KQAD---KRGY-G------EAFENHLNV--------DDSNG 481 (481) T ss_pred HHHH-HhCCCCCCHHHHHHHHHHHHHHHH------hhhh---hccC-C------ccCCCCCCC--------CCCCC Confidence 5544 46787 68899999988875421 1110 0000 0 001111111 11111 No 154 >protein:vir:78191 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111155;genbank:gi:134288732;genbank:GeneID:4960651 Probab=83.00 E-value=0.072 Score=26.84 Aligned_cols=318 Identities=13% Similarity=0.074 Sum_probs=141.8 Q ss_pred CCCccchheee-eehhhhhhHHHHHHHHHHHHHhhccCccccccccc---------ccchHHHHHHHhh-hcc--ChhHH Q lcl|NC_015263. 1 MVKNKKKRLSM-IDVESISSYSNKRNNRISILRDDNRTPVFGAPVGS---------LTSSQSKVRKIVK-EYR--NEGNQ 67 (513) Q Consensus 1 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s---------~~~s~d~~k~~i~-~~~--P~~n~ 67 (513) |-|.|+.+... .-.. +..+ -++ +|. .+..|+ .....|-++-+-. +|+ |..- T Consensus 1 ~~~~~~~~~~~~~~~~------~~~~----~~~----~~~-~~~~~~~~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~- 64 (351) T protein:vir:78 1 MSKRRSRAPRTFAAAP------NPSA----GSA----APA-RAEVFTFDDPTPVMNRAEILDYVECWSNGEWFEPPVSF- 64 (351) T ss_pred CCCCCCCCCCCCCCCC------chhh----hhc----ccc-eeEEEEcCCceeecCcchhhhhhhhhccCceecCCCCH- Confidence 87766644321 1100 0000 000 010 111222 1111222222222 233 4442 Q ss_pred HHHHHHHHHHHhhcchHHHHHHHHhhcccccceEeeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcc Q lcl|NC_015263. 68 KTLRKVSEDLAVQSQQYQRLLNFYANMPLYAYSVVPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDI 147 (513) Q Consensus 68 ~~ir~~s~~lY~~sg~~~rlidy~~~mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~ 147 (513) . .|++. +..+++-.+.|....+|..-.|.=-|. | -..+|..+..+.+..|. T Consensus 65 ~---~la~~-~~~~~~h~~~l~~k~n~l~~~~~Pn~~---------------------~----t~~~f~~~~~d~ll~Gn 115 (351) T protein:vir:78 65 A---GLAKS-FRASTHHSSALFFKANVLASTFRPHRW---------------------L----SRHAFERWALDFLTFGN 115 (351) T ss_pred H---HHHHH-HhhhHhhhhhhhhhhhHHhhcccCCCC---------------------C----CHHHHHHHHHHHHhcCC Confidence 2 23332 334444455554444444333221110 0 13445667778888999 Q ss_pred eeEEEEEcC--cceeeeecCcceeEEEEEECCeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeee Q lcl|NC_015263. 148 FYGYVIDDK--ESVMIQQFPNDICKISSVSGGVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEI 225 (513) Q Consensus 148 ~~gy~i~d~--~~~~iq~lp~dyckIsg~~nG~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L 225 (513) .|.+.+-+. ..+-+.++|+.||++.-..++ |.+.-. .... +++ T Consensus 116 ay~~~~rn~~G~~~~L~pl~~~~v~~~~~~~~-~~~~~~-----~~~~-----------------------------~~~ 160 (351) T protein:vir:78 116 GYLERRRNMVGGTLRLEPALAKYVRRKADFSG-FVYVNG-----WQER-----------------------------HEF 160 (351) T ss_pred eEEEEEECCCCCEEEEEEecCcceEEeeeCCe-EEEEec-----CCeE-----------------------------EEE Confidence 999988764 558999999999998643333 332210 0000 123 Q ss_pred cCCceEEEE-ec-CccccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHH Q lcl|NC_015263. 226 QDKNSICIK-IN-ESSLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEA 303 (513) Q Consensus 226 ~~~kt~~ik-~~-~~~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ 303 (513) +...-+-|+ .+ ....+|+|+..+....+.--+...+... .-..|-. ..+-|-+. .+..++.++++.+.+. T Consensus 161 ~~~eVihir~~~~~~~~yGl~~~~~a~~si~l~~~a~~~~~--~~f~NGa-~pggIl~~-----~~~~ls~e~~~~lr~~ 232 (351) T protein:vir:78 161 APDSVFQLVRPDINQEVYGLPEYLSSLHSAWLNESSTLFRR--KYYENGS-HAGFILYM-----TDAAQKQDDVDNMRDA 232 (351) T ss_pred ccccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHH--HHHhccC-CCceEEEe-----cCCCCCHHHHHHHHHH Confidence 332223233 12 3456799999998888765555554321 1123311 11111111 1245788888888888 Q ss_pred HHHhccccce-----EEEeccc-ccccc---cccccccchhh---hhhHHhhhhhhhhhhhhccCC---CcchH-HHHHH Q lcl|NC_015263. 304 LSMTVPDNVG-----VVTSPME-IDTVS---FDKDSSTDDSV---EKATKNFWDNAGVSQILFSSD---NKTSQ-GIAMS 367 (513) Q Consensus 304 ik~~Lp~gv~-----~v~sP~~-~d~i~---ld~~~~~~dtv---~~~~~~i~~~~GiS~~Lfn~d---~~s~~-~~~~S 367 (513) +++. .|++ .|..|-- -+.++ +.....+.+.+ +-..++|..+.||...|.|-- +.+.+ .-+.. T Consensus 233 ~~~~--~G~~N~~~~~v~~~~g~~~g~k~~pls~~~~d~qf~e~k~~~~~eIa~a~~VPp~llGi~~~~t~~~sn~e~~~ 310 (351) T protein:vir:78 233 LKNA--KGPGNFRNVFMYAPGGKKDGIQLIPVSEVAAKDEFFNIKNVTRDDLLAAHRVPPQLLGIVPSNSGGFGTPDTAA 310 (351) T ss_pred HHHh--cCcccccceeeecCCCCccceeEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHHHH Confidence 8764 3443 3444310 12333 33223333332 333477999999998888521 11112 22222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcccceEEEEEecCCCCccHHHHHHHHHHH Q lcl|NC_015263. 368 IATDEQFIFGVINQLERWLNRYLLLNGMSKYFKATMLEVTHFSKKEAHDRYITD 421 (513) Q Consensus 368 I~~d~~~~~~~~~~iE~~~N~~i~~~~~~~~f~~~~l~~T~fn~ke~~~~~~~~ 421 (513) ..--..-+.-++++||. +|..+-.. .|+ |+..+....=.++ T Consensus 311 ~~f~~~~l~P~~~~iee-~n~~l~~~----~~~--------F~~~~Llr~d~ka 351 (351) T protein:vir:78 311 RVFGRNEIRPLQARFAE-LNDWLGDE----VVR--------FDDYEIPPAPVAA 351 (351) T ss_pred HHHHHHHHHHHHHHHHH-HHhhcCcc----cee--------cChhhhccccccC Confidence 22222223346667764 44333211 122 2333322221111 No 155 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=82.26 E-value=0.078 Score=26.64 Aligned_cols=416 Identities=12% Similarity=0.080 Sum_probs=163.1 Q ss_pred eeehhhhhhHHHHHHHHHHHHHhhccCccccccccc-----ccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhcchHH Q lcl|NC_015263. 11 MIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGS-----LTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQQYQ 85 (513) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s-----~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~~~ 85 (513) ||..- +| -.++..+.+.+. .....+.+++.++.| ..-...+.++.+|+.....+.. T Consensus 1 ~~~~~---~~--------------~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~--~~~~~~~~~l~~Yy~g~~~i~~ 61 (474) T protein:vir:95 1 MINII---RM--------------PWDKPYGEEVVEQMKPKVETQEEMIIRLINNH--KQKLKDINVGQKYYDKDNDINY 61 (474) T ss_pred Ccccc---cC--------------CCCCCCCcchhhhccccccchHHHHHHHHHHH--HHHHHHHHHHHHHhcccCcccc Confidence 32211 11 011111122221 223444556666654 2334456666666554432111 Q ss_pred ----------------------HHHHHHhhcccccceE-eeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHH Q lcl|NC_015263. 86 ----------------------RLLNFYANMPLYAYSV-VPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLA 142 (513) Q Consensus 86 ----------------------rlidy~~~mpt~dY~I-~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~ 142 (513) +...+++...+ .|.+ .|..- ........+.... ++. =++...+..+.+.+ T Consensus 62 ~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~-~yl~g~p~~~--~~~~~~~~~~l~~---~~~-n~~~~~~~~l~~~~ 134 (474) T protein:vir:95 62 QAYKQDLHGNIDYTKPDWRITTNFHQNLVDQKV-SYVAGKPVTY--AHDDDKVLDVIHQ---VLD-TRWDNKLIDILTAA 134 (474) T ss_pred ccchhhhcccccccccccccccchHHHHHHhhh-hhhcccCcee--ccCChHHHHHHHH---HHh-ccHHHHHHHHHHHH Confidence 11112222111 1111 11110 0011111221211 222 25777888899999 Q ss_pred HHhcceeEEEEEcCcc-eeeeecCcceeEEEEE--ECCeeEEEEEeeeccCcc--hhccc-cHHHHHHHHHHhhhhhccC Q lcl|NC_015263. 143 MTVDIFYGYVIDDKES-VMIQQFPNDICKISSV--SGGVYNYVIDLDALVSAD--IVDYY-PKEIQEAVNKYTTMKKGNN 216 (513) Q Consensus 143 l~~g~~~gy~i~d~~~-~~iq~lp~dyckIsg~--~nG~y~~~fD~syFd~~~--~L~~~-p~Ei~~~y~~Y~~~k~~~~ 216 (513) ++.|..|.+..-|.++ +-+.-++|+-|.++-- ..+.+.+++ ++..... ..+.| +.++.. | .+..+..... T Consensus 135 ~~~G~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~a~i--r~~~~~~~~~~~vy~~~~i~~-~-~~~~~~~~~~ 210 (474) T protein:vir:95 135 SNKGIDWLQVYINEDGELKLFRVPAEQAIPIWTDKEREQLNAFI--RIFTFNGETKVEYWTAETVTY-Y-VYENGGLIPD 210 (474) T ss_pred hhCCeEEEEeeeCCCCceEEEEEcccceEEEEcCCCCCceEEEE--EEEeecCeeEEEEEeCCeEEE-E-EEcCCceeec Confidence 9999999988766554 5577788888887743 125554442 3333221 12222 222211 0 0000000000 Q ss_pred c--ccccCee----ecCCceEEEEecCccccchhhHHHHHHhHHHHHHHHHHH--hhHhhhhhce--eeeeeeccc-cCC Q lcl|NC_015263. 217 K--SASNWYE----IQDKNSICIKINESSLTPVPPFAGTFDSIYDIHSFKDLR--NDKAELQNYK--LLIQKLETR-SSN 285 (513) Q Consensus 217 ~--~~~~W~~----L~~~kt~~ik~~~~~~~~ip~f~~v~~d~~di~~~kdL~--~~~~~i~n~~--ii~~kip~~-~~n 285 (513) . .+.-|.. -+...-.++.+-. ...+. |.|.++.++.+.=+.. +.-+.++... +++ +. + ++. T Consensus 211 ~~~~~~~~~~~~~~~~~~~vPvv~~~n-n~~~~----~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv--~~-g~~~~ 282 (474) T protein:vir:95 211 FYYGDEHIQTHFSTGSWERVPFIAFKN-NPEEV----SDIWMYKSFVDAIDKRLSDVQNMFDESVELIYI--LR-GYEGE 282 (474) T ss_pred cccccccccCcccccCCCccceEEecC-CCCCC----CchHHHHHHHHHHHHHHHHHHHHHHHhhcchhh--hc-CCCcc Confidence 0 0000100 0000001122211 11122 3344444443332211 1222222211 111 11 0 111 Q ss_pred CCCccccCHHHHHHHHHHHHHhccccc--eEEEecccccccccccccccchhhhhhHHhhhhhhhhhhhhccCCC--cch Q lcl|NC_015263. 286 DNNDFTLDMPMMNYFHEALSMTVPDNV--GVVTSPMEIDTVSFDKDSSTDDSVEKATKNFWDNAGVSQILFSSDN--KTS 361 (513) Q Consensus 286 ~~~~~~vd~~~~~~~~~~ik~~Lp~gv--~~v~sP~~~d~i~ld~~~~~~dtv~~~~~~i~~~~GiS~~Lfn~d~--~s~ 361 (513) +.+++.-++.... .+ .++++- ..++.+.+. ......++.-.++|+.-+++-..-+.+.. .|+ T Consensus 283 ~~~~~~~~~~~~~----~i--~~~~~~~~~~l~~~~~~--------~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg 348 (474) T protein:vir:95 283 DLSEFMEGLKYYK----AI--NVSSDGGVETIQVEVPV--------ASTKEYLDMMRAYIVEFGQGVDFQTDKFGSATSG 348 (474) T ss_pred cccchhhhhhccc----ee--eccCCCceeEEeccCCH--------HHHHHHHHHHHHHHHHHhCCcCccccccccccHH Confidence 2222222221110 00 023322 122222111 11112355555778888887655553333 344 Q ss_pred HHHHHHHHHHHHHHHH----HHHHHHHHHHHHHhh---cccceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHH Q lcl|NC_015263. 362 QGIAMSIATDEQFIFG----VINQLERWLNRYLLL---NGMSKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLAS 434 (513) Q Consensus 362 ~~~~~SI~~d~~~~~~----~~~~iE~~~N~~i~~---~~~~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa 434 (513) ..++.....-.+.+.. |-+.|++.+..++.. ..-...+.+.|-+..+-|..+.++.+.++ |.-+...+.+ T Consensus 349 ~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~g~~~d~~~i~i~f~~~~p~~~~e~a~~~~~~---giiS~et~~~ 425 (474) T protein:vir:95 349 IALKFLYTNLNLKANKLKNKANVALQELMQFILDFNKIKLDAKEIEITFNFNVMVNDLEQSQIGAQS---QYLSKETLVR 425 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccceeeEEecCCCccCHHHHHHHHHHc---CCCChHHHHH Confidence 3344332222222221 222333333323322 22223578999999999999999987653 7666666777 Q ss_pred HhCC--CHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccCCCCcCCCCcccccc Q lcl|NC_015263. 435 LMGI--DPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETTGNK 497 (513) Q Consensus 435 ~~G~--~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~~n~ 497 (513) .+|+ +|.+.+.++..|++.-. ..+..... ++-++ ..+.+.|. ..+.+ T Consensus 426 ~lp~v~D~~~E~eri~~E~~~~~--~~~~~~~~----~~~~~--------~~~~~~~~--~~e~~ 474 (474) T protein:vir:95 426 HHPWVDDPKAELERLDEEQLELN--KQLPNLDD----GGADG--------AQQQQQSE--NNQSK 474 (474) T ss_pred hCCCCCCHHHHHHHHHHHHHHHH--hhcccccc----ccCCC--------CCCcCCCC--ccccC Confidence 7887 78899999988875311 11111110 11000 00101110 00000 No 156 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=82.26 E-value=0.078 Score=26.64 Aligned_cols=416 Identities=12% Similarity=0.080 Sum_probs=163.1 Q ss_pred eeehhhhhhHHHHHHHHHHHHHhhccCccccccccc-----ccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhcchHH Q lcl|NC_015263. 11 MIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGS-----LTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQQYQ 85 (513) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s-----~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~~~ 85 (513) ||..- +| -.++..+.+.+. .....+.+++.++.| ..-...+.++.+|+.....+.. T Consensus 1 ~~~~~---~~--------------~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~--~~~~~~~~~l~~Yy~g~~~i~~ 61 (474) T protein:vir:96 1 MINII---RM--------------PWDKPYGEEVVEQMKPKVETQEEMIIRLINNH--KQKLKDINVGQKYYDKDNDINY 61 (474) T ss_pred Ccccc---cC--------------CCCCCCCcchhhhccccccchHHHHHHHHHHH--HHHHHHHHHHHHHhcccCcccc Confidence 32211 11 011111122221 223444556666654 2334456666666554432111 Q ss_pred ----------------------HHHHHHhhcccccceE-eeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHH Q lcl|NC_015263. 86 ----------------------RLLNFYANMPLYAYSV-VPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLA 142 (513) Q Consensus 86 ----------------------rlidy~~~mpt~dY~I-~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~ 142 (513) +...+++...+ .|.+ .|..- ........+.... ++. =++...+..+.+.+ T Consensus 62 ~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~-~yl~g~p~~~--~~~~~~~~~~l~~---~~~-n~~~~~~~~l~~~~ 134 (474) T protein:vir:96 62 QAYKQDLHGNIDYTKPDWRITTNFHQNLVDQKV-SYVAGKPVTY--AHDDDKVLDVIHQ---VLD-TRWDNKLIDILTAA 134 (474) T ss_pred ccchhhhcccccccccccccccchHHHHHHhhh-hhhcccCcee--ccCChHHHHHHHH---HHh-ccHHHHHHHHHHHH Confidence 11112222111 1111 11110 0011111221211 222 25777888899999 Q ss_pred HHhcceeEEEEEcCcc-eeeeecCcceeEEEEE--ECCeeEEEEEeeeccCcc--hhccc-cHHHHHHHHHHhhhhhccC Q lcl|NC_015263. 143 MTVDIFYGYVIDDKES-VMIQQFPNDICKISSV--SGGVYNYVIDLDALVSAD--IVDYY-PKEIQEAVNKYTTMKKGNN 216 (513) Q Consensus 143 l~~g~~~gy~i~d~~~-~~iq~lp~dyckIsg~--~nG~y~~~fD~syFd~~~--~L~~~-p~Ei~~~y~~Y~~~k~~~~ 216 (513) ++.|..|.+..-|.++ +-+.-++|+-|.++-- ..+.+.+++ ++..... ..+.| +.++.. | .+..+..... T Consensus 135 ~~~G~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~a~i--r~~~~~~~~~~~vy~~~~i~~-~-~~~~~~~~~~ 210 (474) T protein:vir:96 135 SNKGIDWLQVYINEDGELKLFRVPAEQAIPIWTDKEREQLNAFI--RIFTFNGETKVEYWTAETVTY-Y-VYENGGLIPD 210 (474) T ss_pred hhCCeEEEEeeeCCCCceEEEEEcccceEEEEcCCCCCceEEEE--EEEeecCeeEEEEEeCCeEEE-E-EEcCCceeec Confidence 9999999988766554 5577788888887743 125554442 3333221 12222 222211 0 0000000000 Q ss_pred c--ccccCee----ecCCceEEEEecCccccchhhHHHHHHhHHHHHHHHHHH--hhHhhhhhce--eeeeeeccc-cCC Q lcl|NC_015263. 217 K--SASNWYE----IQDKNSICIKINESSLTPVPPFAGTFDSIYDIHSFKDLR--NDKAELQNYK--LLIQKLETR-SSN 285 (513) Q Consensus 217 ~--~~~~W~~----L~~~kt~~ik~~~~~~~~ip~f~~v~~d~~di~~~kdL~--~~~~~i~n~~--ii~~kip~~-~~n 285 (513) . .+.-|.. -+...-.++.+-. ...+. |.|.++.++.+.=+.. +.-+.++... +++ +. + ++. T Consensus 211 ~~~~~~~~~~~~~~~~~~~vPvv~~~n-n~~~~----~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv--~~-g~~~~ 282 (474) T protein:vir:96 211 FYYGDEHIQTHFSTGSWERVPFIAFKN-NPEEV----SDIWMYKSFVDAIDKRLSDVQNMFDESVELIYI--LR-GYEGE 282 (474) T ss_pred cccccccccCcccccCCCccceEEecC-CCCCC----CchHHHHHHHHHHHHHHHHHHHHHHHhhcchhh--hc-CCCcc Confidence 0 0000100 0000001122211 11122 3344444443332211 1222222211 111 11 0 111 Q ss_pred CCCccccCHHHHHHHHHHHHHhccccc--eEEEecccccccccccccccchhhhhhHHhhhhhhhhhhhhccCCC--cch Q lcl|NC_015263. 286 DNNDFTLDMPMMNYFHEALSMTVPDNV--GVVTSPMEIDTVSFDKDSSTDDSVEKATKNFWDNAGVSQILFSSDN--KTS 361 (513) Q Consensus 286 ~~~~~~vd~~~~~~~~~~ik~~Lp~gv--~~v~sP~~~d~i~ld~~~~~~dtv~~~~~~i~~~~GiS~~Lfn~d~--~s~ 361 (513) +.+++.-++.... .+ .++++- ..++.+.+. ......++.-.++|+.-+++-..-+.+.. .|+ T Consensus 283 ~~~~~~~~~~~~~----~i--~~~~~~~~~~l~~~~~~--------~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg 348 (474) T protein:vir:96 283 DLSEFMEGLKYYK----AI--NVSSDGGVETIQVEVPV--------ASTKEYLDMMRAYIVEFGQGVDFQTDKFGSATSG 348 (474) T ss_pred cccchhhhhhccc----ee--eccCCCceeEEeccCCH--------HHHHHHHHHHHHHHHHHhCCcCccccccccccHH Confidence 2222222221110 00 023322 122222111 11112355555778888887655553333 344 Q ss_pred HHHHHHHHHHHHHHHH----HHHHHHHHHHHHHhh---cccceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHH Q lcl|NC_015263. 362 QGIAMSIATDEQFIFG----VINQLERWLNRYLLL---NGMSKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLAS 434 (513) Q Consensus 362 ~~~~~SI~~d~~~~~~----~~~~iE~~~N~~i~~---~~~~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa 434 (513) ..++.....-.+.+.. |-+.|++.+..++.. ..-...+.+.|-+..+-|..+.++.+.++ |.-+...+.+ T Consensus 349 ~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~g~~~d~~~i~i~f~~~~p~~~~e~a~~~~~~---giiS~et~~~ 425 (474) T protein:vir:96 349 IALKFLYTNLNLKANKLKNKANVALQELMQFILDFNKIKLDAKEIEITFNFNVMVNDLEQSQIGAQS---QYLSKETLVR 425 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccceeeEEecCCCccCHHHHHHHHHHc---CCCChHHHHH Confidence 3344332222222221 222333333323322 22223578999999999999999987653 7666666777 Q ss_pred HhCC--CHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccCCCCcCCCCcccccc Q lcl|NC_015263. 435 LMGI--DPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETTGNK 497 (513) Q Consensus 435 ~~G~--~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~~n~ 497 (513) .+|+ +|.+.+.++..|++.-. ..+..... ++-++ ..+.+.|. ..+.+ T Consensus 426 ~lp~v~D~~~E~eri~~E~~~~~--~~~~~~~~----~~~~~--------~~~~~~~~--~~e~~ 474 (474) T protein:vir:96 426 HHPWVDDPKAELERLDEEQLELN--KQLPNLDD----GGADG--------AQQQQQSE--NNQSK 474 (474) T ss_pred hCCCCCCHHHHHHHHHHHHHHHH--hhcccccc----ccCCC--------CCCcCCCC--ccccC Confidence 7887 78899999988875311 11111110 11000 00101110 00000 No 157 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=81.51 E-value=0.085 Score=26.45 Aligned_cols=389 Identities=11% Similarity=0.044 Sum_probs=162.5 Q ss_pred HhhhccChhHHHHHHHHHHHHHhh---------------------cchHHHHHHHHhhcccccceEe-eccchhhhhhcc Q lcl|NC_015263. 57 IVKEYRNEGNQKTLRKVSEDLAVQ---------------------SQQYQRLLNFYANMPLYAYSVV-PFKDISTANENK 114 (513) Q Consensus 57 ~i~~~~P~~n~~~ir~~s~~lY~~---------------------sg~~~rlidy~~~mpt~dY~I~-P~~~~~~~~~~~ 114 (513) .|.++ .....+.++++-+|+... .+..+.+++-.++. .+- |..-. ..+.. T Consensus 1 ~~~~~-~~~~~~r~~~l~~yy~g~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~-----l~g~~~~~~--~~~~~ 72 (440) T protein:vir:95 1 MLAAF-LGSQKQRLAILASYAQGDNFSILSGHRRLDDEKADYRVRHKWGGYISSFATGY-----VIGNPVSIG--VMEGG 72 (440) T ss_pred ChhhH-HHHHHHHHHHHHHHhccCCcccccccccccccCCcceeecchHHHHHHhhhhh-----eeccCceEe--eCCCc Confidence 12221 222333344444443332 23333344433322 110 10000 00111 Q ss_pred hhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcceeEEEEEcCcc-eeeeecCcceeEEEEEE--CCeeEEEEEeeeccCc Q lcl|NC_015263. 115 LKKELATVTEFLSRLNPKYNFSKIVKLAMTVDIFYGYVIDDKES-VMIQQFPNDICKISSVS--GGVYNYVIDLDALVSA 191 (513) Q Consensus 115 ~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~~~gy~i~d~~~-~~iq~lp~dyckIsg~~--nG~y~~~fD~syFd~~ 191 (513) ..+....+...+..-++......+.+.+++.|..|.+...+.++ .-+..++|+-|.++--. ++.+.+++-....+.. T Consensus 73 ~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~~i~~~~p~~~~~~~d~~~~~~~~~~i~~~~~~~~ 152 (440) T protein:vir:95 73 SADQLSTIKDIEWQNDINALNSDLAFDASVYGRAYEYHFRDKDKVDRVVLISPLEMFVIRDLTVEQNIIAAVHLPIYADK 152 (440) T ss_pred cHHHHHHHHHHHHhcCHhHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEEecCc Confidence 22233334455667788999999999999999999988776665 56778899998887542 2557777644433332 Q ss_pred chhccc-cHHHHHHHHHHhhhhhccCcccccCeeecC-Cce----EEEEecCccccchhhHHHHHHhHHHHHHHH--HHH Q lcl|NC_015263. 192 DIVDYY-PKEIQEAVNKYTTMKKGNNKSASNWYEIQD-KNS----ICIKINESSLTPVPPFAGTFDSIYDIHSFK--DLR 263 (513) Q Consensus 192 ~~L~~~-p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~-~kt----~~ik~~~~~~~~ip~f~~v~~d~~di~~~k--dL~ 263 (513) ....-| +.++.+ |.. ..+. +..|..... +|. .++.+ .+..++.|-|..+. ++.|..+.- ++. T Consensus 153 ~~~~vyt~~~~~~----~~~---~~~~-~~~~~~~~~~~~~~g~vPvv~~-~n~~~g~sd~e~v~-~lida~~~~~s~~~ 222 (440) T protein:vir:95 153 VNMTVYTKDKVIT----YKP---YSNN-SVRLVVDDVKKHSYNDVPVVEW-WNNRFRMGDYESEI-SLIDAYDAGQSDTA 222 (440) T ss_pred eEEEEEeCCeEEE----EEE---ecCC-ccceeecceeeccCceeeEEEe-eCCCCCCCchhhhH-HHHHHHHHHHHHHH Confidence 222222 111111 100 0000 111211110 011 11222 11234444444322 222222211 111 Q ss_pred hhHhhhhh-ceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhccccceEE---Eecccccccccccc-cccchhhhh Q lcl|NC_015263. 264 NDKAELQN-YKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTVPDNVGVV---TSPMEIDTVSFDKD-SSTDDSVEK 338 (513) Q Consensus 264 ~~~~~i~n-~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~Lp~gv~~v---~sP~~~d~i~ld~~-~~~~dtv~~ 338 (513) +..+...+ +.++....+ ....+.++...+.+.-.-.++.+.... ..+ +++-+.-+.. ......++. T Consensus 223 ~~~~~~~~~~~v~~g~~~--------~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~lt~~~~~~~~~~~~~~ 293 (440) T protein:vir:95 223 NYMSDLNDAMLLVKGDLD--------GIKLSPEDAAKMKDANMLFLKTGISTTGQQTTA-DASYIYKQYDVNGTEAYKNR 293 (440) T ss_pred HHHHHhhcceeeeecccc--------cCCCCccchhhhhhccceecccccccccCCCCc-ceeEEeecCCHHHHHHHHHH Confidence 11111122 111121111 111122222222222111122111100 000 0000100110 111234666 Q ss_pred hHHhhhhhhhhhhhhccCC--CcchHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHhhc-cc---ceEEEEEecC Q lcl|NC_015263. 339 ATKNFWDNAGVSQILFSSD--NKTSQGIAMSIATDEQFIF-------GVINQLERWLNRYLLLN-GM---SKYFKATMLE 405 (513) Q Consensus 339 ~~~~i~~~~GiS~~Lfn~d--~~s~~~~~~SI~~d~~~~~-------~~~~~iE~~~N~~i~~~-~~---~~~f~~~~l~ 405 (513) ..++|+.-+++-..-+.+- +.|+.+++.....-...+- ..++++=+.+-..+... .. .....+.|-+ T Consensus 294 l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~v~i~f~~ 373 (440) T protein:vir:95 294 LANDIHRFSRIPNLDDDRFNSTSSGIALLYKMIGLEQVRKDKETYFTKALRRRYELISNIHKAINGPVIEANKLTFTFHP 373 (440) T ss_pred HHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccceEEeCC Confidence 6778888888776655442 3344433333222211222 12222222222233221 11 2247899999 Q ss_pred CCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccCCC Q lcl|NC_015263. 406 VTHFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKE 485 (513) Q Consensus 406 ~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~ 485 (513) ..+-|..+.++.+.++.+. .|....|..+-|+++.+-+.+...|++... .+.+. .+|. .+. T Consensus 374 ~~p~~~~~~ad~~~kl~g~-iS~et~~~~l~~~d~~~E~~ri~~E~~~~~-----~~~~~---~~~~----------~~~ 434 (440) T protein:vir:95 374 NIPQDVWTEIKAYIEAGGE-ISQETLMENASFTDYKTEHSRILKQGGSSD-----LEIGQ---IVGD----------ADV 434 (440) T ss_pred CCCCCHHHHHHHHHHHhcc-CcHHHHHHhCCCCCcHHHHHHHHHHHHHhh-----hhHHh---hccC----------CCC Confidence 9999999999999998643 777676666545676677777777775411 11110 1110 011 Q ss_pred CcCCCCc Q lcl|NC_015263. 486 NGRPTNE 492 (513) Q Consensus 486 ~grPt~e 492 (513) |....| T Consensus 435 -~~~~~e 440 (440) T protein:vir:95 435 -GQADTE 440 (440) T ss_pred -CCcCCC Confidence 101111 No 158 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=81.37 E-value=0.086 Score=26.42 Aligned_cols=398 Identities=10% Similarity=0.077 Sum_probs=161.0 Q ss_pred HHHHHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHh------------------------ Q lcl|NC_015263. 24 RNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAV------------------------ 79 (513) Q Consensus 24 ~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~------------------------ 79 (513) -+|.|+|.---.... ..+.+++.|..| .+..+.+.++-.|+.. T Consensus 1 ~~~~~~~~~~~~~~~-----------~~e~i~~~i~~~--~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~ 67 (474) T protein:vir:10 1 MTLYKLIDDIEAQGI-----------LPKHIEALIESH--KDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGN 67 (474) T ss_pred CchHHHHhhccccCC-----------CHHHHHHHHHHh--hhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhccc Confidence 344444422111111 112334444443 2333444444444332 Q ss_pred ------------hcchHHHHHHHHhhcccccceE-ee--cc-chhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHH Q lcl|NC_015263. 80 ------------QSQQYQRLLNFYANMPLYAYSV-VP--FK-DISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAM 143 (513) Q Consensus 80 ------------~sg~~~rlidy~~~mpt~dY~I-~P--~~-~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l 143 (513) ..+..+.+++..++ |.+ .| +. ..+......+++. .-.++..-++...+..+.+.+. T Consensus 68 ~~~~~~~~~~ki~~n~~~~ivd~~~~-----yl~g~pv~~~~~~~~~~~e~~~~~---l~~~~~~n~~~~~~~~~~~~~~ 139 (474) T protein:vir:10 68 VRRLDVSVNNKLNNSFDSEIVDTRVG-----YLHGVPVTYDLDENAEKNEKLKKF---ITNFAIRNSVDDEDSEIGKMAA 139 (474) T ss_pred ccccccCcccccccchHHHHHHhHhh-----heeccceeEeeCCCCcchHHHHHH---HHHHHhhcCHhHHHHHHHHHHh Confidence 23444444443332 211 01 11 1111111122222 2234556678888999999999 Q ss_pred HhcceeEEEEEcCcc-eeeeecCcceeEEEEEECCeeEEEEEeeeccC--cc----hhccccH-HHHHHHHHHhhhhhcc Q lcl|NC_015263. 144 TVDIFYGYVIDDKES-VMIQQFPNDICKISSVSGGVYNYVIDLDALVS--AD----IVDYYPK-EIQEAVNKYTTMKKGN 215 (513) Q Consensus 144 ~~g~~~gy~i~d~~~-~~iq~lp~dyckIsg~~nG~y~~~fD~syFd~--~~----~L~~~p~-Ei~~~y~~Y~~~k~~~ 215 (513) +.|..|.+...+.++ +-+..++|.-|.++.-..+.+.+++-.-+-.. .. .++.|.+ ++. .|. T Consensus 140 ~~G~a~~~~~~d~~~~~~~~~i~p~~~~~v~d~~~~~~~~i~~~~~~~~~~~~~~~~~~~y~~~~~~----~~~------ 209 (474) T protein:vir:10 140 ICGYGARLAYIDTNGDIRIKNIDPYNVIFVGDNILEPTYSLRYFYEKDDDNGTDYVYAEFYDNAYYY----VFR------ 209 (474) T ss_pred hcCeEEEEEEeCCCCeeEEEEEcccceEEEEcCCCceEEEEEEEEEeeCCCceEEEEEEEEcCceEE----EEe------ Confidence 999999888766554 67778888888777544455555443322111 00 1111111 100 010 Q ss_pred CcccccCeeecC-CceE----EEEecCccccchhhHHHHHHhHHHHHHHHH-HH-hhHhhhhhceeeeeeeccc-cCCCC Q lcl|NC_015263. 216 NKSASNWYEIQD-KNSI----CIKINESSLTPVPPFAGTFDSIYDIHSFKD-LR-NDKAELQNYKLLIQKLETR-SSNDN 287 (513) Q Consensus 216 ~~~~~~W~~L~~-~kt~----~ik~~~~~~~~ip~f~~v~~d~~di~~~kd-L~-~~~~~i~n~~ii~~kip~~-~~n~~ 287 (513) .....-|..... +|.+ ++.+ -....+.+-|. ++.++.+.=+ +. .....++... .-+.+- +.+.+ T Consensus 210 ~~~~~~~~~~~~~~~~~g~vPvv~~-~n~~~g~sd~e----~v~~liDa~d~~~S~~~~~~~~~~---~~~l~i~g~~~~ 281 (474) T protein:vir:10 210 GEGIDALQEVGRYEHLFDYNPLFGV-PNNKEMIGDAE----KVIHLIDAYDLTMSDASSEISQTR---LAYLVLRGMGMS 281 (474) T ss_pred ecCCCcccccccccCCCCccceEEe-cCCCCCCCchH----HHHHHHHHHHHHHHHHHHHHHHhh---cchhhhccCCCC Confidence 000011211111 1111 1222 11223333333 3333333211 11 1112222111 111110 10111 Q ss_pred CccccCHHHHHHHHHHHHHhccccceEEEecc-ccccccccccc-ccchhhhhhHHhhhhhhhhhhhhccCC--CcchHH Q lcl|NC_015263. 288 NDFTLDMPMMNYFHEALSMTVPDNVGVVTSPM-EIDTVSFDKDS-STDDSVEKATKNFWDNAGVSQILFSSD--NKTSQG 363 (513) Q Consensus 288 ~~~~vd~~~~~~~~~~ik~~Lp~gv~~v~sP~-~~d~i~ld~~~-~~~dtv~~~~~~i~~~~GiS~~Lfn~d--~~s~~~ 363 (513) .+. ... ++ ..|+..+...- +++-+.-+... .....++-..++|+.-+++-..-+.+. +.|+.+ T Consensus 282 ~~~------~~~----~~---~~~~i~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~A 348 (474) T protein:vir:10 282 EEM------IQE----TQ---KSGAFELFDKDMDVKYLTKDVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFNGNVPIIG 348 (474) T ss_pred chh------hhh----hh---hcceeEecCCCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHH Confidence 111 110 01 11222221110 11111111111 111234555577887777655444332 334444 Q ss_pred HHHHHHHHHHHHHH-------HHHHHHHHHHHHHhhccc---c---eEEEEEecCCCCccHHHHHHHHHHHHhcC-CcHH Q lcl|NC_015263. 364 IAMSIATDEQFIFG-------VINQLERWLNRYLLLNGM---S---KYFKATMLEVTHFSKKEAHDRYITDAQYG-FPVK 429 (513) Q Consensus 364 ~~~SI~~d~~~~~~-------~~~~iE~~~N~~i~~~~~---~---~~f~~~~l~~T~fn~ke~~~~~~~~~~~G-~~~~ 429 (513) ++.-...-.+.+.. .++++=+.+-.++..... . ....+.|-+..+-|..+.++.+.++. | .|.. T Consensus 349 l~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl~--g~iS~e 426 (474) T protein:vir:10 349 MKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRKGYNLDDDSYLNLIFKFTRNIPVNKLEESQVLINLK--GQVSER 426 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccccceEEeCCCCCCCHHHHHHHHHHHh--ccCchH Confidence 44332222222221 222222222223332211 1 14688999999999999999999985 6 5555 Q ss_pred HHHHHHhCC--CHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccCCCCcCCCCccc Q lcl|NC_015263. 430 VYLASLMGI--DPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETT 494 (513) Q Consensus 430 ~~laa~~G~--~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~ 494 (513) ..+. .+|+ +|.+.+.++..|++... ..++..+ .| ..+++.+++++. T Consensus 427 t~~~-~l~~v~d~~~E~eri~~E~~e~~--~~~~~~~-----~~-----------~~~~~~~~~~s~ 474 (474) T protein:vir:10 427 TRLG-QSQLVDDVDYELDEMEKESLEFN--DKLPDID-----EG-----------DANDKSQNNQSE 474 (474) T ss_pred HHHH-hCCCCCCHHHHHHHHHHHHHHHH--hhccccc-----CC-----------CcCCCCccccCC Confidence 5444 5675 78899999988875411 1221111 01 011121222211 No 159 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=81.37 E-value=0.086 Score=26.42 Aligned_cols=398 Identities=10% Similarity=0.077 Sum_probs=161.0 Q ss_pred HHHHHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHh------------------------ Q lcl|NC_015263. 24 RNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAV------------------------ 79 (513) Q Consensus 24 ~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~------------------------ 79 (513) -+|.|+|.---.... ..+.+++.|..| .+..+.+.++-.|+.. T Consensus 1 ~~~~~~~~~~~~~~~-----------~~e~i~~~i~~~--~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~ 67 (474) T protein:vir:94 1 MTLYKLIDDIEAQGI-----------LPKHIEALIESH--KDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGN 67 (474) T ss_pred CchHHHHhhccccCC-----------CHHHHHHHHHHh--hhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhccc Confidence 344444422111111 112334444443 2333444444444332 Q ss_pred ------------hcchHHHHHHHHhhcccccceE-ee--cc-chhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHH Q lcl|NC_015263. 80 ------------QSQQYQRLLNFYANMPLYAYSV-VP--FK-DISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAM 143 (513) Q Consensus 80 ------------~sg~~~rlidy~~~mpt~dY~I-~P--~~-~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l 143 (513) ..+..+.+++..++ |.+ .| +. ..+......+++. .-.++..-++...+..+.+.+. T Consensus 68 ~~~~~~~~~~ki~~n~~~~ivd~~~~-----yl~g~pv~~~~~~~~~~~e~~~~~---l~~~~~~n~~~~~~~~~~~~~~ 139 (474) T protein:vir:94 68 VRRLDVSVNNKLNNSFDSEIVDTRVG-----YLHGVPVTYDLDENAEKNEKLKKF---ITNFAIRNSVDDEDSEIGKMAA 139 (474) T ss_pred ccccccCcccccccchHHHHHHhHhh-----heeccceeEeeCCCCcchHHHHHH---HHHHHhhcCHhHHHHHHHHHHh Confidence 23444444443332 211 01 11 1111111122222 2234556678888999999999 Q ss_pred HhcceeEEEEEcCcc-eeeeecCcceeEEEEEECCeeEEEEEeeeccC--cc----hhccccH-HHHHHHHHHhhhhhcc Q lcl|NC_015263. 144 TVDIFYGYVIDDKES-VMIQQFPNDICKISSVSGGVYNYVIDLDALVS--AD----IVDYYPK-EIQEAVNKYTTMKKGN 215 (513) Q Consensus 144 ~~g~~~gy~i~d~~~-~~iq~lp~dyckIsg~~nG~y~~~fD~syFd~--~~----~L~~~p~-Ei~~~y~~Y~~~k~~~ 215 (513) +.|..|.+...+.++ +-+..++|.-|.++.-..+.+.+++-.-+-.. .. .++.|.+ ++. .|. T Consensus 140 ~~G~a~~~~~~d~~~~~~~~~i~p~~~~~v~d~~~~~~~~i~~~~~~~~~~~~~~~~~~~y~~~~~~----~~~------ 209 (474) T protein:vir:94 140 ICGYGARLAYIDTNGDIRIKNIDPYNVIFVGDNILEPTYSLRYFYEKDDDNGTDYVYAEFYDNAYYY----VFR------ 209 (474) T ss_pred hcCeEEEEEEeCCCCeeEEEEEcccceEEEEcCCCceEEEEEEEEEeeCCCceEEEEEEEEcCceEE----EEe------ Confidence 999999888766554 67778888888777544455555443322111 00 1111111 100 010 Q ss_pred CcccccCeeecC-CceE----EEEecCccccchhhHHHHHHhHHHHHHHHH-HH-hhHhhhhhceeeeeeeccc-cCCCC Q lcl|NC_015263. 216 NKSASNWYEIQD-KNSI----CIKINESSLTPVPPFAGTFDSIYDIHSFKD-LR-NDKAELQNYKLLIQKLETR-SSNDN 287 (513) Q Consensus 216 ~~~~~~W~~L~~-~kt~----~ik~~~~~~~~ip~f~~v~~d~~di~~~kd-L~-~~~~~i~n~~ii~~kip~~-~~n~~ 287 (513) .....-|..... +|.+ ++.+ -....+.+-|. ++.++.+.=+ +. .....++... .-+.+- +.+.+ T Consensus 210 ~~~~~~~~~~~~~~~~~g~vPvv~~-~n~~~g~sd~e----~v~~liDa~d~~~S~~~~~~~~~~---~~~l~i~g~~~~ 281 (474) T protein:vir:94 210 GEGIDALQEVGRYEHLFDYNPLFGV-PNNKEMIGDAE----KVIHLIDAYDLTMSDASSEISQTR---LAYLVLRGMGMS 281 (474) T ss_pred ecCCCcccccccccCCCCccceEEe-cCCCCCCCchH----HHHHHHHHHHHHHHHHHHHHHHhh---cchhhhccCCCC Confidence 000011211111 1111 1222 11223333333 3333333211 11 1112222111 111110 10111 Q ss_pred CccccCHHHHHHHHHHHHHhccccceEEEecc-ccccccccccc-ccchhhhhhHHhhhhhhhhhhhhccCC--CcchHH Q lcl|NC_015263. 288 NDFTLDMPMMNYFHEALSMTVPDNVGVVTSPM-EIDTVSFDKDS-STDDSVEKATKNFWDNAGVSQILFSSD--NKTSQG 363 (513) Q Consensus 288 ~~~~vd~~~~~~~~~~ik~~Lp~gv~~v~sP~-~~d~i~ld~~~-~~~dtv~~~~~~i~~~~GiS~~Lfn~d--~~s~~~ 363 (513) .+. ... ++ ..|+..+...- +++-+.-+... .....++-..++|+.-+++-..-+.+. +.|+.+ T Consensus 282 ~~~------~~~----~~---~~~~i~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~A 348 (474) T protein:vir:94 282 EEM------IQE----TQ---KSGAFELFDKDMDVKYLTKDVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFNGNVPIIG 348 (474) T ss_pred chh------hhh----hh---hcceeEecCCCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHH Confidence 111 110 01 11222221110 11111111111 111234555577887777655444332 334444 Q ss_pred HHHHHHHHHHHHHH-------HHHHHHHHHHHHHhhccc---c---eEEEEEecCCCCccHHHHHHHHHHHHhcC-CcHH Q lcl|NC_015263. 364 IAMSIATDEQFIFG-------VINQLERWLNRYLLLNGM---S---KYFKATMLEVTHFSKKEAHDRYITDAQYG-FPVK 429 (513) Q Consensus 364 ~~~SI~~d~~~~~~-------~~~~iE~~~N~~i~~~~~---~---~~f~~~~l~~T~fn~ke~~~~~~~~~~~G-~~~~ 429 (513) ++.-...-.+.+.. .++++=+.+-.++..... . ....+.|-+..+-|..+.++.+.++. | .|.. T Consensus 349 l~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl~--g~iS~e 426 (474) T protein:vir:94 349 MKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRKGYNLDDDSYLNLIFKFTRNIPVNKLEESQVLINLK--GQVSER 426 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccccceEEeCCCCCCCHHHHHHHHHHHh--ccCchH Confidence 44332222222221 222222222223332211 1 14688999999999999999999985 6 5555 Q ss_pred HHHHHHhCC--CHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccCCCCcCCCCccc Q lcl|NC_015263. 430 VYLASLMGI--DPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETT 494 (513) Q Consensus 430 ~~laa~~G~--~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~ 494 (513) ..+. .+|+ +|.+.+.++..|++... ..++..+ .| ..+++.+++++. T Consensus 427 t~~~-~l~~v~d~~~E~eri~~E~~e~~--~~~~~~~-----~~-----------~~~~~~~~~~s~ 474 (474) T protein:vir:94 427 TRLG-QSQLVDDVDYELDEMEKESLEFN--DKLPDID-----EG-----------DANDKSQNNQSE 474 (474) T ss_pred HHHH-hCCCCCCHHHHHHHHHHHHHHHH--hhccccc-----CC-----------CcCCCCccccCC Confidence 5444 5675 78899999988875411 1221111 01 011121222211 No 160 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=80.79 E-value=0.092 Score=26.28 Aligned_cols=420 Identities=11% Similarity=0.055 Sum_probs=157.0 Q ss_pred eeehhhhhhHHHHHHHHHHHHHhhccCcccccccccccchH------------HHHHHHhhhcc--ChhHHHHHHHHHHH Q lcl|NC_015263. 11 MIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQ------------SKVRKIVKEYR--NEGNQKTLRKVSED 76 (513) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~------------d~~k~~i~~~~--P~~n~~~ir~~s~~ 76 (513) |-=+.-|| -. ++-||-- -++-..|++|... +.+++.|+.+. -......++.+.+| T Consensus 1 ~~~~~~~~---~~--~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~l~~Y 69 (492) T protein:vir:94 1 MQFIQLIS---QV--AQALIKG------GNILYPSQPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHLEKLPEISIGQEY 69 (492) T ss_pred ChHHHHHH---HH--HHHHhcC------CceeecCccchhhhhhcccccCCchhhHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 21111222 12 2222211 0012223333222 22223222221 12233445555555 Q ss_pred HHhhc--------------------------chHHHHHHHHhhcccccceE-eeccchhhhhhcchhHHHHHHHHHHhhc Q lcl|NC_015263. 77 LAVQS--------------------------QQYQRLLNFYANMPLYAYSV-VPFKDISTANENKLKKELATVTEFLSRL 129 (513) Q Consensus 77 lY~~s--------------------------g~~~rlidy~~~mpt~dY~I-~P~~~~~~~~~~~~~~~y~~v~~~L~k~ 129 (513) +.... +..+.+++-.++ |.+ .|.. -....+...+.... ++ += T Consensus 70 Y~g~~~I~~~~~~~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~-----yl~G~p~~--~~~~d~~~~~~l~~---~~-~n 138 (492) T protein:vir:94 70 YEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVS-----YIVGKPIA--FKHTDDEVVKRIDE---VL-GN 138 (492) T ss_pred hccccccccccccccccccccccccccccccchHHHHHHHHHh-----hhcccCce--eccCchHHHHHHHH---HH-hc Confidence 44332 111122221111 111 1100 00011111222211 22 23 Q ss_pred ChhHHHHHHHHHHHHhcceeEEEEEcCcc-eeeeecCcceeEEEEE--ECCeeEEEEEeeeccCcchhccccH-HHHHHH Q lcl|NC_015263. 130 NPKYNFSKIVKLAMTVDIFYGYVIDDKES-VMIQQFPNDICKISSV--SGGVYNYVIDLDALVSADIVDYYPK-EIQEAV 205 (513) Q Consensus 130 n~k~~~~~i~~~~l~~g~~~gy~i~d~~~-~~iq~lp~dyckIsg~--~nG~y~~~fD~syFd~~~~L~~~p~-Ei~~~y 205 (513) ++...+..+.+.+++.|..|.+...|.++ .-+..++|.-|..+-- ..+.+.+++=.-.-+.....+.|.+ ++.. | T Consensus 139 ~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~~~~~~p~~~~~v~d~~~~~~~~a~ir~~~~~~~~~~~~y~~~~v~~-~ 217 (492) T protein:vir:94 139 RFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKVTVNY-Y 217 (492) T ss_pred cHHHHHHHHHHHHhhCCeEEEEEEecCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEeeccceeEEEEecCeEEE-E Confidence 57778889999999999999988876554 5677788888877643 2345554432211111111111111 1100 0 Q ss_pred HHHhhhhhc-c-CcccccCeeecCCceE----EEEecCccccchhhHHHHHHhHHHHHHHHHHH--hhHhhhhhceeeee Q lcl|NC_015263. 206 NKYTTMKKG-N-NKSASNWYEIQDKNSI----CIKINESSLTPVPPFAGTFDSIYDIHSFKDLR--NDKAELQNYKLLIQ 277 (513) Q Consensus 206 ~~Y~~~k~~-~-~~~~~~W~~L~~~kt~----~ik~~~~~~~~ip~f~~v~~d~~di~~~kdL~--~~~~~i~n~~ii~~ 277 (513) .+...... . ......|..-...+.+ ++.+- ...++.+-| .++.++.+.=+.. +....++...--+- T Consensus 218 -~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-nn~~~~sd~----e~v~~liDa~d~~~S~~~~~~~~~~~p~l 291 (492) T protein:vir:94 218 -VYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFK-NNDLEISDI----FMYKTLIDAYNRRLSDLSNTFKDSNELTY 291 (492) T ss_pred -EEecCeeeeccccccccccccccccCCCccceEEec-CCCCCCCch----HHHHHHHHHHHHHHHHHHHHHHHhcCcee Confidence 00000000 0 0000011110000111 12221 112333333 3333333322221 11122222111111 Q ss_pred eec-cccCCCCCccccCHHHHHHHHHHHHHh----ccccce--EEEecccccccccccccccchhhhhhHHhhhhhhhhh Q lcl|NC_015263. 278 KLE-TRSSNDNNDFTLDMPMMNYFHEALSMT----VPDNVG--VVTSPMEIDTVSFDKDSSTDDSVEKATKNFWDNAGVS 350 (513) Q Consensus 278 kip-~~~~n~~~~~~vd~~~~~~~~~~ik~~----Lp~gv~--~v~sP~~~d~i~ld~~~~~~dtv~~~~~~i~~~~GiS 350 (513) .+. + ++.+.+++ ...++.. ++++-. .++.+.+. ....-.++-..++|+.-+++- T Consensus 292 v~~g~-~~~~~~~~----------~~~~~~~~~~~~~~~~~~~~l~~~~~~--------~~~~~~~~~l~~~I~~~s~~p 352 (492) T protein:vir:94 292 VLKNY-DDQELPEF----------KRLLRYYGAIKVSDNGGVDTIQVEVPV--------ENSKKYLDELYQKIMLFGQAV 352 (492) T ss_pred eeecC-Ccccchhh----------HHHHhhccceecCCCCcceeEeccCCH--------HHHHHHHHHHHHHHHHHhCCc Confidence 111 0 01111221 1112111 222211 11111111 011122444556677777765 Q ss_pred hhhccCC--CcchHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHhhccc---ceEEEEEecCCCCccHHHHHHHHHHH Q lcl|NC_015263. 351 QILFSSD--NKTSQGIAMSIATDEQFIFGVINQLE----RWLNRYLLLNGM---SKYFKATMLEVTHFSKKEAHDRYITD 421 (513) Q Consensus 351 ~~Lfn~d--~~s~~~~~~SI~~d~~~~~~~~~~iE----~~~N~~i~~~~~---~~~f~~~~l~~T~fn~ke~~~~~~~~ 421 (513) .+-+..- +.|+..++.-...-...+....+.++ +.+..++..... ...+.+.|-+..+-|..+.++.+.++ T Consensus 353 ~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~~~~~~~i~v~f~~~~p~~~~e~~~~~~kl 432 (492) T protein:vir:94 353 DFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQTAQQS 432 (492) T ss_pred CCCccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccceeeEEecCCCCCCHHHHHHHHHHH Confidence 4444322 23444444333322222222222222 222222222111 23588999999999999999999998 Q ss_pred HhcC-CcHHHHHHHHhCC--CHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccCCCCcCCCCccccccc Q lcl|NC_015263. 422 AQYG-FPVKVYLASLMGI--DPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETTGNKD 498 (513) Q Consensus 422 ~~~G-~~~~~~laa~~G~--~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~~n~~ 498 (513) . | .|... +.+.+|+ +|.+.+.++..|++. .+...+. ..-.+ .+++ |..+...+++ T Consensus 433 ~--giiS~et-~~~~l~~v~d~~~E~eri~~E~~~-----~~~~~~~-~~~~~------------~~~~-~~~~~~~~~e 490 (492) T protein:vir:94 433 M--GIVSHET-VLENHPFVEDLQAELERIEQEQME-----YNKQLPN-LDDGG------------ADSA-QQQERSNNKE 490 (492) T ss_pred h--ccCchHH-HHHhCCCCCCHHHHHHHHHHHHHH-----HHhhccc-ccccc------------CCCC-ccccCCcccc Confidence 5 6 55544 5556777 688999999988743 1111111 00001 1111 2212122222 Q ss_pred CC Q lcl|NC_015263. 499 SD 500 (513) Q Consensus 499 ~~ 500 (513) +. T Consensus 491 ~e 492 (492) T protein:vir:94 491 SE 492 (492) T ss_pred CC Confidence 22 No 161 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=80.29 E-value=0.096 Score=26.16 Aligned_cols=435 Identities=11% Similarity=0.049 Sum_probs=170.5 Q ss_pred eeehhhhhhHHHHHHHHHHHHHhhccCccc-ccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhcchHHHHHH Q lcl|NC_015263. 11 MIDVESISSYSNKRNNRISILRDDNRTPVF-GAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQQYQRLLN 89 (513) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~-~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~~~rlid 89 (513) |-.-+.+..- .-++.|+-.-....|.. --.-|..+ +..|+. +|..--...++ .....+--+.++| T Consensus 1 ~~~~~~~d~~---~~i~~L~~~~~~~~~r~~~~~~Yy~g------~~~i~~-~~~~~~~~~~~----~~~~~n~~~~ivd 66 (488) T protein:vir:23 1 MAETESIDPE---KLRDQLLDAFENKQNELKSSKAYYDA------ERRPDA-IGLAVPLDMRK----YLAHVGYPRTYVD 66 (488) T ss_pred CCcccCCCHH---HHHHHHHHHHHHHHHHHHHHHHHHhc------ccchhh-cCcccchhhhh----hhhhcchHHHHHH Confidence 2222222211 11122211111111110 00000000 000111 01111111111 1233445566667 Q ss_pred HHhhcccccceEeeccchhh---hhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcceeEEEEEc--------Ccc Q lcl|NC_015263. 90 FYANMPLYAYSVVPFKDIST---ANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDIFYGYVIDD--------KES 158 (513) Q Consensus 90 y~~~mpt~dY~I~P~~~~~~---~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~~~gy~i~d--------~~~ 158 (513) -++.-..++-+..|...... ....+.....+ ..+..-++...+..+.+.+++.|..|.+.... .++ T Consensus 67 ~~a~~l~~~Gf~~~~~~~~~~~~~~d~~~~~~l~---~i~~~N~~~~~~~~~~~~a~i~G~a~~~v~~~~~~~~~~~~~~ 143 (488) T protein:vir:23 67 AIAERQELEGFRIPSANGEEPESGGENDPASELW---DWWQANNLDIEATLGHTDALIYGTAYITISMPDPEVDFDVDPE 143 (488) T ss_pred HHHHhhhccceeccCCcccccccccchhHHHHHH---HHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccCCCCC Confidence 77766667766665332211 12223333333 33566678888999999999999998876532 222 Q ss_pred -eeeeecCcceeEEEEE-ECCeeEEEEEeeeccCc-ch--hcc-ccHHHHHHHHHHhhhhhccCcccccCeeec-CCceE Q lcl|NC_015263. 159 -VMIQQFPNDICKISSV-SGGVYNYVIDLDALVSA-DI--VDY-YPKEIQEAVNKYTTMKKGNNKSASNWYEIQ-DKNSI 231 (513) Q Consensus 159 -~~iq~lp~dyckIsg~-~nG~y~~~fD~syFd~~-~~--L~~-~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~-~~kt~ 231 (513) ..|..++++-|..+-= ..+...+++=..+-+.. .+ ... .|.++.. | .. ....|.... .+|.+ T Consensus 144 ~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~----~---~~----~~~~~~~~~~~~h~~ 212 (488) T protein:vir:23 144 VPLIRVEPPTALYAEVDPRTRKVLYAIRAIYGADGNEIVSATLYLPDTTMT----W---LR----AEGEWEAPTSTPHGL 212 (488) T ss_pred cceEEEeccceeEEEEecCCCceEEEEEEEEecCCCcEEEEEEEecCcEEE----E---Ee----cCCceEeccccccCC Confidence 2466777777655532 23444343322221111 11 111 1222111 0 00 011232221 11221 Q ss_pred ----EEEe-cCc---cccchhhHHHHHHhHHHHHHHHHHHh-hHhhhh---h-ceeeeeeeccccCCCCCccccCHHHHH Q lcl|NC_015263. 232 ----CIKI-NES---SLTPVPPFAGTFDSIYDIHSFKDLRN-DKAELQ---N-YKLLIQKLETRSSNDNNDFTLDMPMMN 298 (513) Q Consensus 232 ----~ik~-~~~---~~~~ip~f~~v~~d~~di~~~kdL~~-~~~~i~---n-~~ii~~kip~~~~n~~~~~~vd~~~~~ 298 (513) ++-+ +.. .++|.+-+...+.++.|. +..... .....+ . +..|... ..++....+..... T Consensus 213 g~vPvv~f~n~~~~~~~~G~s~i~~~v~~l~Da--~~~~~s~~~~~~~~~a~p~~~i~G~------~~~~~~~~~~~~~~ 284 (488) T protein:vir:23 213 EMVPVIPISNRTRLSDLYGTSEISPELRSVTDA--AAQILMNMQGTANLMAIPQRLIFGA------KPEELGINAETGQR 284 (488) T ss_pred CCcceEEeccccccCCcCCccchhhhHHHHHHH--HHHHHHHHHHHHHHhhhHHHHHhCC------Ccccccccccccch Confidence 1222 111 224444444333333322 222111 111111 1 0111111 11111111111112 Q ss_pred HHHHHHHH--hccccceEEEeccccccccccccc--ccchhhhhhHHhhhhhhhhhhhhccCCCc---chHHHHHHHHHH Q lcl|NC_015263. 299 YFHEALSM--TVPDNVGVVTSPMEIDTVSFDKDS--STDDSVEKATKNFWDNAGVSQILFSSDNK---TSQGIAMSIATD 371 (513) Q Consensus 299 ~~~~~ik~--~Lp~gv~~v~sP~~~d~i~ld~~~--~~~dtv~~~~~~i~~~~GiS~~Lfn~d~~---s~~~~~~SI~~d 371 (513) .+...... ++|+|-. ..-..++... .--+.+....+++....|+...-|++... |+..++.....- T Consensus 285 ~~~~~~~~v~~~~~g~~-------~~~~q~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l 357 (488) T protein:vir:23 285 MFDAYMARILAFEGGEG-------AHAEQFSAAELRNFVDALDALDRKAASYSGLPPQYLSSSSDNPASAEAIKAAESRL 357 (488) T ss_pred hhhhhhhhhccCCCCCC-------ceeEecCCCChHHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHH Confidence 22222211 2243321 0011222211 11134555557777788888888876542 444556555544 Q ss_pred HHHHHHHHH----HHHHHHHHHHhh-cccc-----eEEEEEecCCCCccHHHHHHHHHHHHhcC--CcHHHHHHHHhCCC Q lcl|NC_015263. 372 EQFIFGVIN----QLERWLNRYLLL-NGMS-----KYFKATMLEVTHFSKKEAHDRYITDAQYG--FPVKVYLASLMGID 439 (513) Q Consensus 372 ~~~~~~~~~----~iE~~~N~~i~~-~~~~-----~~f~~~~l~~T~fn~ke~~~~~~~~~~~G--~~~~~~laa~~G~~ 439 (513) ...+-...+ .+++.+...+.- +... ...++.|-+..+-|..+.++.+.|+.+-| .-+...+...+|++ T Consensus 358 ~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~~~~~i~v~f~~~~~~s~~~~ada~~kl~~~g~~~~s~et~~~~l~~~ 437 (488) T protein:vir:23 358 VKKVERKNKIFGGAWEQAMRLAYKMVKGGDIPTEYYRMETVWRDPSTPTYAAKADAAAKLFANGAGLIPRERGWVDMGYT 437 (488) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcchhhccceEEecCCCCCCHHHHHHHHHHHHhcccccCCHHHHHHhCCCC Confidence 444443333 333333333321 2111 25788898999999999999999998866 44556677778998 Q ss_pred HHHHHHHHHH--HHHhhCcccccCcccccccccccccccCCccccCCCCcCCCCcccccccCCCCCCCCCCccCC Q lcl|NC_015263. 440 PVAFTGLLKV--ENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETTGNKDSDETQRAKDKPANT 512 (513) Q Consensus 440 p~~~~~~~~~--E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~~n~~~~~~~~~~d~~~~~ 512 (513) +.+.-.+-.. |.+.-. ...+-.+.. -+.+.++.. ....++.|.+++.. + T Consensus 438 ~d~~~~~~~~~~~~~~~~-~~~~~~~~~----~~~~~~~~~---~~~~~~~~~~e~~~----------------a 488 (488) T protein:vir:23 438 IVEREQMRQWLEQDQKQG-LGLIGSLYG----ASTPEGKPG---EAPVGEPPAPEPDA----------------A 488 (488) T ss_pred chHHHHHHHHHHHHHHHH-HHHHHHHhc----cCCCcccCC---CCCCCCCCCCCCCC----------------C Confidence 8765543332 211100 001111000 000000000 01122222222111 1 No 162 >protein:vir:6058 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878199;genbank:gi:33438898;genbank:GeneID:1457733 Probab=78.73 E-value=0.11 Score=25.81 Aligned_cols=314 Identities=12% Similarity=0.089 Sum_probs=145.0 Q ss_pred CCCccchheeeeehhhhhhHHHHHHHHHHHHHhhccCcccccccccc---------cchHHHHHHHhh-hcc--ChhHHH Q lcl|NC_015263. 1 MVKNKKKRLSMIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSL---------TSSQSKVRKIVK-EYR--NEGNQK 68 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~---------~~s~d~~k~~i~-~~~--P~~n~~ 68 (513) |-|.|++|+.--.. ++. .++. ....|+- ....|-+.-+-. +|+ |..-.. T Consensus 1 m~~~~~~~~~~~~~------------~~~------~~~~-~~~~~~f~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~ 61 (344) T protein:vir:60 1 MSKKKGKTLQPAAK------------KMT------ASAP-KMEAFTFGEPVPVLDRRDILDYVECISNGRWYEPPISFTG 61 (344) T ss_pred CCcccCCCCCchHH------------hhc------CCcC-cEEEEEcCCceeecCCcchhHHHHhhhcCccccCCCCHHH Confidence 88888877642110 000 0010 1122221 112223222222 133 343333 Q ss_pred HHHHHHHHHHhhcchHHHHHHHHhhcccccceEeeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcce Q lcl|NC_015263. 69 TLRKVSEDLAVQSQQYQRLLNFYANMPLYAYSVVPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDIF 148 (513) Q Consensus 69 ~ir~~s~~lY~~sg~~~rlidy~~~mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~~ 148 (513) |.++ +..+++-.+.|....+|-...| .| +.. |. .++|..+..+.+..|.. T Consensus 62 -la~~----~~a~~~h~~~i~~k~n~l~~~~--~P---------n~~----------~t----~~~f~~~~~d~ll~Gna 111 (344) T protein:vir:60 62 -LAKS----LRAAVHHSSPIYVKRNILASTF--IP---------HPW----------LS----QQDFSRFVLDFLVFGNA 111 (344) T ss_pred -HHHH----HHhhhhhccchhhhhhHHHhhc--cC---------CCC----------CC----HHHHHHHHHHHHhcCCe Confidence 3222 2344444444443333322211 11 000 01 34456777788889999 Q ss_pred eEEEEEc--CcceeeeecCcceeEEEEEECCeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeec Q lcl|NC_015263. 149 YGYVIDD--KESVMIQQFPNDICKISSVSGGVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQ 226 (513) Q Consensus 149 ~gy~i~d--~~~~~iq~lp~dyckIsg~~nG~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~ 226 (513) |.+++.+ +..+.+.++|+.||++. .+|...|.+ .. .. .=++++ T Consensus 112 y~~i~rn~~G~~~~L~~l~~~~vr~~--~~~~~~~~v--~~--~~-----------------------------~~~~~~ 156 (344) T protein:vir:60 112 FLEKRYSTTGKVIRLETSPAKYTRRG--VEEDVYWWV--PS--FN-----------------------------EPTAFA 156 (344) T ss_pred EEEEEECCCCcEEEEEEcCcceEEEe--ecCCeEEEE--cc--CC-----------------------------eEEEEc Confidence 9998876 45678999999999986 333221110 00 00 002233 Q ss_pred CCceEEEEe-c-CccccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHH Q lcl|NC_015263. 227 DKNSICIKI-N-ESSLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEAL 304 (513) Q Consensus 227 ~~kt~~ik~-~-~~~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~i 304 (513) .+--+-|+- + ....+|+||..++...+.--+...+...- -..|- ...+-|-+. . +..++.++++.+.+.+ T Consensus 157 ~~eIiHir~~~~~~~~yGlsp~~~a~~si~l~~~a~~~~~~--~f~NG-~~pg~il~~---~--~~~ls~e~~~~ik~~~ 228 (344) T protein:vir:60 157 PGSVFHLLEPDINQELYGLPEYLSALNSAWLNESATLFRRK--YYENG-AHAGYIMYV---T--DAVQDRNDIEMLRENM 228 (344) T ss_pred CccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHH--HHhcc-CCCceEEEe---c--CcCCCHHHHHHHHHHH Confidence 333344442 2 24557999999888877655555443311 11221 111111111 0 2347778888777777 Q ss_pred HHhcccc---ceEEEecc-ccccccc---ccccccchhhh---hhHHhhhhhhhhhhhhccC--C-CcchH-HHHHHHHH Q lcl|NC_015263. 305 SMTVPDN---VGVVTSPM-EIDTVSF---DKDSSTDDSVE---KATKNFWDNAGVSQILFSS--D-NKTSQ-GIAMSIAT 370 (513) Q Consensus 305 k~~Lp~g---v~~v~sP~-~~d~i~l---d~~~~~~dtv~---~~~~~i~~~~GiS~~Lfn~--d-~~s~~-~~~~SI~~ 370 (513) +++--.| ...+.+|- +-+.+++ .....+.+.++ -..++|..+.||...|.|- + +++.+ .-+..+.- T Consensus 229 ~~~~g~~~~r~~~l~~p~g~~~g~~~~pis~~~~d~qf~e~k~~~~~eIa~af~VPp~llGi~~~~t~~~~n~e~~~~~f 308 (344) T protein:vir:60 229 VKSKGRNNFKNLFLYAPQGKADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQLMGGKPENVGSLGDIEKVAKVF 308 (344) T ss_pred HHhcCCCCCcceEEecCCCCccceeEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHHHhcccCCCCCccccHHHHHHHH Confidence 7643111 12444442 1123333 22233333333 3447899999999888862 1 21222 22333322 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcccceEEEEEecCCCCc Q lcl|NC_015263. 371 DEQFIFGVINQLERWLNRYLLLNGMSKYFKATMLEVTHF 409 (513) Q Consensus 371 d~~~~~~~~~~iE~~~N~~i~~~~~~~~f~~~~l~~T~f 409 (513) ...-+.-++++|| .+|..|-.+.+ +|+.+=+++..- T Consensus 309 ~~~~L~Pl~~~~e-~ln~~lg~~~i--~F~~~~l~~~d~ 344 (344) T protein:vir:60 309 VRNELIPLQDRIR-EINGWLGQEVI--RFKNYSLDTDNG 344 (344) T ss_pred HHHHHHHHHHHHH-HHHHhcCCccc--ccCccccCCCCC Confidence 2222334566666 36666532222 355555555522 No 163 >protein:vir:858 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047117;genbank:gi:9630570;genbank:GeneID:1261758 Probab=78.71 E-value=0.11 Score=25.80 Aligned_cols=349 Identities=15% Similarity=0.113 Sum_probs=140.5 Q ss_pred cccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhcchHHHHHHHHhh-cccccceEeeccchhhhhhcchh-HHH Q lcl|NC_015263. 42 APVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQQYQRLLNFYAN-MPLYAYSVVPFKDISTANENKLK-KEL 119 (513) Q Consensus 42 s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~~~rlidy~~~-mpt~dY~I~P~~~~~~~~~~~~~-~~y 119 (513) =++|.+-.+.-. .+-+..+-|. .....+-....+..+++.|+.++. +..+.-.++-- .......+... ..- T Consensus 1 M~~f~k~~~~~~-~~~~~~~~~~-----~~~~~~~~~~~~~~v~~~v~~ia~~iA~lp~~~~~~-~~~~~~~~~~~~~~~ 73 (378) T protein:vir:85 1 MNLFGKVVSFSR-GKLNNDTQRV-----TAWQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKY-KKSDVGSDTLISMAG 73 (378) T ss_pred Cchhhhhhhhhh-cccccCCcce-----eeeeccchhhhhHHHHHHHHHHHHhHhhCceeEEEE-ecccccccccccccc Confidence 444553221111 0111111011 111112233344556666666543 12222222211 11111111111 112 Q ss_pred HHHHHHHhh-----cChhHHHHHHHHHHHHhcceeEEEEE-cCcceeeeecCcceeEEEEEECCeeEEEEEeeeccCcch Q lcl|NC_015263. 120 ATVTEFLSR-----LNPKYNFSKIVKLAMTVDIFYGYVID-DKESVMIQQFPNDICKISSVSGGVYNYVIDLDALVSADI 193 (513) Q Consensus 120 ~~v~~~L~k-----~n~k~~~~~i~~~~l~~g~~~gy~i~-d~~~~~iq~lp~dyckIsg~~nG~y~~~fD~syFd~~~~ 193 (513) +.....|+. +.--.+...++..++..|..|.|.+. +.++..+...+. +|.-+| T Consensus 74 ~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~~~~g~~~~~~~~---------~~~~~~------------ 132 (378) T protein:vir:85 74 SDLDEVLNWSYKGEHNSMEFWQKVIKKLLCTRYVDLYPIFDSETGELLDLLFA---------NDKKEY------------ 132 (378) T ss_pred chHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEeecCCCceEEEEEec---------CCCEEE------------ Confidence 233344432 23446667788888899999988653 444422221111 111110 Q ss_pred hccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEEecCccccchhhHHHHHHhHHHHHHHHHHHhhHhhhhh-- Q lcl|NC_015263. 194 VDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKINESSLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQN-- 271 (513) Q Consensus 194 L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~~~~~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n-- 271 (513) .|.|+ =|+.-+. +..+ ..+.+.-..+. +..-+.+ T Consensus 133 ---~~~dv-------------------ih~~~~~-----------~~~~---~~~~~~~a~~~--------~~~~~~~~~ 168 (378) T protein:vir:85 133 ---KPEEL-------------------VRLVSPF-----------YINE---DTSILDNALAS--------IQTKLEQGK 168 (378) T ss_pred ---cccce-------------------EEEecCc-----------Cccc---hhhHHHHHHHH--------HHHHHhcCC Confidence 01000 0111000 0000 01111111110 1111111 Q ss_pred ceeeeeeeccccCCCCCccccCHHHH----HHHHHHHHHhc----cccceEEEecccccccccccccccchhhhhhHHhh Q lcl|NC_015263. 272 YKLLIQKLETRSSNDNNDFTLDMPMM----NYFHEALSMTV----PDNVGVVTSPMEIDTVSFDKDSSTDDSVEKATKNF 343 (513) Q Consensus 272 ~~ii~~kip~~~~n~~~~~~vd~~~~----~~~~~~ik~~L----p~gv~~v~sP~~~d~i~ld~~~~~~dtv~~~~~~i 343 (513) ...+. ++| + .++.+.. ++|.+.++... ..++..+-..+++..++++.....-..+.-..+.| T Consensus 169 ~~g~l-~~~-------~--~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~~~~~~~~~~~~~~I 238 (378) T protein:vir:85 169 LRGLL-KIN-------A--FLDIDNTQEYREKALATIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNKDEIELIKSEL 238 (378) T ss_pred cceEE-EeC-------C--cCCHHHHHHHHHHHHHHHHHhhcccccccceecCCCceEEeccCChhhhhHHHHHHHHHHH Confidence 11111 122 1 2333333 33444443322 12344444555666555543322224455556789 Q ss_pred hhhhhhhhhhccCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc-----ccce----EEEEEecCCCCccHHHH Q lcl|NC_015263. 344 WDNAGVSQILFSSDNKTSQGIAMSIATDEQFIFGVINQLERWLNRYLLLN-----GMSK----YFKATMLEVTHFSKKEA 414 (513) Q Consensus 344 ~~~~GiS~~Lfn~d~~s~~~~~~SI~~d~~~~~~~~~~iE~~~N~~i~~~-----~~~~----~f~~~~l~~T~fn~ke~ 414 (513) ..+.||...+++|+.. +. ..+.--..-+.-++.+||..+|+.|-.. .... .+.|..-+...-+.++. T Consensus 239 a~~fgVPp~~l~~s~~--e~--~~~~f~~~tL~P~~~~ie~~l~~kLl~~~er~~~~~~~~~~~~~f~~~~l~~~d~~~~ 314 (378) T protein:vir:85 239 LTGYFMNENILLGTAT--QE--QQIYFYNSTIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKEL 314 (378) T ss_pred HHHhCCCHHHhcCCch--HH--HHHHHHHHHHHHHHHHHHHHHHhhcCChhhhhhhhhccccceeeecchhhhhcCHHHH Confidence 9999999888876532 22 2222222334468999999999988432 1111 24555556667788899 Q ss_pred HHHHHHHHhcCC-cHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccC-CCCcCCCCc Q lcl|NC_015263. 415 HDRYITDAQYGF-PVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKG-KENGRPTNE 492 (513) Q Consensus 415 ~~~~~~~~~~G~-~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~-~~~grPt~e 492 (513) ++.+.++..-|. .+=.. -+.+|+.|.+ |-|..++|+- ...-.+ .+..+. ..++.|+ T Consensus 315 ~~~~~~~~~~G~~T~NE~-R~~lgl~p~~------------gGD~~~~~~N---~~~~~~----~~~~~~~~~~~~~~-- 372 (378) T protein:vir:85 315 IDLYHENINGPIFTQNQL-LVKMGEQPIE------------GGDIYIANLN---AVAVKN----LSDLQGSRKDVAST-- 372 (378) T ss_pred HHHHHHHHhCCCcCHHHH-HHHhCCCCCC------------CCCeEeeccc---cccccc----chhhcCccCCCCCC-- Confidence 999888888773 33332 2245666532 2233344432 211111 010000 1111111 Q ss_pred ccccccCCCCCCC Q lcl|NC_015263. 493 TTGNKDSDETQRA 505 (513) Q Consensus 493 t~~n~~~~~~~~~ 505 (513) +++++. T Consensus 373 -------~e~~n~ 378 (378) T protein:vir:85 373 -------DETNNQ 378 (378) T ss_pred -------CCCCCC Confidence 111111 No 164 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=78.55 E-value=0.11 Score=25.77 Aligned_cols=386 Identities=8% Similarity=0.034 Sum_probs=157.8 Q ss_pred chHHHHHHHhhhccChhHHHHHHHHHHHHHhhc-------------------chHHHHHHHHhhcccccceEeeccchhh Q lcl|NC_015263. 49 SSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQS-------------------QQYQRLLNFYANMPLYAYSVVPFKDIST 109 (513) Q Consensus 49 ~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~s-------------------g~~~rlidy~~~mpt~dY~I~P~~~~~~ 109 (513) ...+.+.+.|..| ......++.+-+|+.... +..+.+++-.++...= .|..-. T Consensus 1 l~~~~l~~~i~~~--~~~~~r~~~l~~yy~g~~~il~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g----~~~~~~-- 72 (429) T protein:vir:98 1 MTKDLLSELIQKH--RSFNLSYSAYKQLYEGDHAILQQKQKEQYKPDNRLVVNFAKYIVDTFNGYFIG----VPVQTS-- 72 (429) T ss_pred CCHHHHHHHHHHH--HHHHHHHHHHHHHhccccccccccccccCCCcceeecchHHHHHHHHhhhhcc----cCceee-- Confidence 3344555555554 334455555555544432 2333444433332110 110000 Q ss_pred hhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcceeEEEEEcCcc-eeeeecCcceeEEEEEE--CCeeEEEEEee Q lcl|NC_015263. 110 ANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDIFYGYVIDDKES-VMIQQFPNDICKISSVS--GGVYNYVIDLD 186 (513) Q Consensus 110 ~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~~~gy~i~d~~~-~~iq~lp~dyckIsg~~--nG~y~~~fD~s 186 (513) ......++. +-.+++.-++...+..+.+.+++.|..|.+...+.++ .-+..++|.-|.+.--. ++.+.+++ + T Consensus 73 ~~~~~~~~~---l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~~~~~~p~~~~~v~dd~~~~~~~~~i--~ 147 (429) T protein:vir:98 73 HENKQVSNY---LELLDGYNDQDDNNAELSKICSIYGHGYELVFNDENAEAGITYLTPLEAFIVYDDSIRQKPLFAV--R 147 (429) T ss_pred cCChHHHHH---HHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEecCCCcEEEEEEcccceEEEEeCCCCCceEEEE--E Confidence 011111222 2223444468899999999999999999888776665 35778888888776432 24455554 4 Q ss_pred eccCcch---hccccHHHHHHHHHHhhhhhccCcccccCeeecCCc----eEEEEecCccccchhhHHHHHHhHHHHHHH Q lcl|NC_015263. 187 ALVSADI---VDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKN----SICIKINESSLTPVPPFAGTFDSIYDIHSF 259 (513) Q Consensus 187 yFd~~~~---L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~k----t~~ik~~~~~~~~ip~f~~v~~d~~di~~~ 259 (513) |...... ...|..+.-. -|. ...+.-|+.=+.+| -.++.+ -+...+.|-|.. +.++.|..+. T Consensus 148 ~~~~~~~~~~~~~~~~~~~~---~~~------~~~~~~~~~~~~~~~~g~vPvv~~-~n~~~g~sd~e~-v~~liD~~d~ 216 (429) T protein:vir:98 148 YFYNKGGVLEGSYSDASNIT---YFK------DGEKGIEIGESEPHPFDGVPMIEY-VENEERQSLLAS-VVTLINAFNK 216 (429) T ss_pred EEEecCceEEEEEEeCceEE---EEE------ecCCceEecccccccCCccceEEe-cCCCCCCCcHHH-HHHHHHHHHH Confidence 4332211 1111111000 000 00000011100011 111222 112234444443 2333332221 Q ss_pred HHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhccccceEEEecccccccccccccc-cchhhhh Q lcl|NC_015263. 260 KDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTVPDNVGVVTSPMEIDTVSFDKDSS-TDDSVEK 338 (513) Q Consensus 260 kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~Lp~gv~~v~sP~~~d~i~ld~~~~-~~dtv~~ 338 (513) -- -+....++-..--.-.+. +...++++.-++... ..+ .+|++-| .+.+++-+.-+.... ....++. T Consensus 217 ~~-s~~~~~~~~~~~p~~~i~--g~~~~~~~~~~~~~~----~~~--~~~~~~~---~~~~~~~l~~~~~~~~~~~~~~~ 284 (429) T protein:vir:98 217 AI-SEKANDVEYFADAYLKIL--GAELDDETLKSLRDT----RII--NLKDTDA---QQLTVEFLQKPDADATQEHLLDR 284 (429) T ss_pred HH-HHHHHHHHHhcCceeeee--cCCCCcchhhhHhhC----cee--eccCCCC---CCcceeEEeecCCHHHHHHHHHH Confidence 10 011111222111111111 111122211111000 000 1122110 000111111111111 1123455 Q ss_pred hHHhhhhhhhhhhhhccCC-CcchHHHHHHHHHHHHHHHH----HHHHHHHHHH---HHHhhcccce---EEEEEecCCC Q lcl|NC_015263. 339 ATKNFWDNAGVSQILFSSD-NKTSQGIAMSIATDEQFIFG----VINQLERWLN---RYLLLNGMSK---YFKATMLEVT 407 (513) Q Consensus 339 ~~~~i~~~~GiS~~Lfn~d-~~s~~~~~~SI~~d~~~~~~----~~~~iE~~~N---~~i~~~~~~~---~f~~~~l~~T 407 (513) ..++|+.-+++-.+-+.+. +.|+..+......-...+-. |-+.+++.++ .++....... ...+.|-+.. T Consensus 285 l~~~i~~~s~~p~~~~~~~gn~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~d~~~i~v~f~~~~ 364 (429) T protein:vir:98 285 LENLIFRTAMVANISDESFGTASGIALRYRLQAMDNLAKTKERKFMSGMNRRYKLIASYPTSKIGPKDWIGIKYKFTRNL 364 (429) T ss_pred HHHHHHHHhCccccCccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCccccccceEEeCCCC Confidence 5567777777655445333 23443344333322222221 2222222222 2333222221 3678999999 Q ss_pred CccHHHHHHHHHHHHhcCCcHHHHHHHHhCC--CHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccCCC Q lcl|NC_015263. 408 HFSKKEAHDRYITDAQYGFPVKVYLASLMGI--DPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKE 485 (513) Q Consensus 408 ~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~--~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~ 485 (513) +-+..+.++.+.++... .|... +...+|+ +|.+.+.+++.|++.. +.+ ++ +.-.+++. T Consensus 365 p~~~~~~a~~~~kl~g~-is~et-~~~~l~~v~d~~~E~~ri~~E~~~~-----~~~-~~-----~~~~~~~~------- 424 (429) T protein:vir:98 365 PANLLEESQIAGNLAGI-VSEET-QVGVLSIVENPQKEIERKNSDKSTL-----ISR-QA-----GGLNGQNT------- 424 (429) T ss_pred CcCHHHHHHHHHHHhcc-CchHH-HHHhCCCCCCHHHHHHHHHHHHHHH-----HHH-HH-----hhhcCCCC------- Confidence 99999999999998532 55544 5556787 6889999999998541 112 21 11000000 Q ss_pred CcCCCCcccc Q lcl|NC_015263. 486 NGRPTNETTG 495 (513) Q Consensus 486 ~grPt~et~~ 495 (513) ++.. . T Consensus 425 ---~~~~--~ 429 (429) T protein:vir:98 425 ---TTIL--E 429 (429) T ss_pred ---CCCC--C Confidence 1100 0 No 165 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=76.88 E-value=0.13 Score=25.43 Aligned_cols=376 Identities=12% Similarity=0.098 Sum_probs=152.9 Q ss_pred chHHHHHHHhhhcc--ChhHHHHHHHHHHHHHhhcchH------------------------------HHHHHHHhhccc Q lcl|NC_015263. 49 SSQSKVRKIVKEYR--NEGNQKTLRKVSEDLAVQSQQY------------------------------QRLLNFYANMPL 96 (513) Q Consensus 49 ~s~d~~k~~i~~~~--P~~n~~~ir~~s~~lY~~sg~~------------------------------~rlidy~~~mpt 96 (513) .-.+++++.|..+. -......++.+-+|+.....+. +.|++-.+ T Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~I~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~---- 76 (470) T protein:vir:10 1 MELDALKKLIQNTSTSRNDLINNYKQAVNYYENKTDITTRNNGKAKLNKEGKKDPLRSADNRIPSNFYQLLVDQEA---- 76 (470) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhccccchhcccccccccccccCCcccccchHHHHHHhhh---- Confidence 33334444433321 1223333444445544443322 22222222 Q ss_pred ccceEe-e--ccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcceeEEEEEcCcc-eeeeecCcceeEEE Q lcl|NC_015263. 97 YAYSVV-P--FKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDIFYGYVIDDKES-VMIQQFPNDICKIS 172 (513) Q Consensus 97 ~dY~I~-P--~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~~~gy~i~d~~~-~~iq~lp~dyckIs 172 (513) .|.+- | |... +....+... .++.. +....+..+.+.+.+.|..|.+...|.++ +-+.-+||..|.++ T Consensus 77 -~yl~G~p~~~~~~----d~~~~~~l~---~~~~~-~~~~~~~~l~~~~~~~G~a~~~~y~d~~~~~~~~~~~p~~~~~v 147 (470) T protein:vir:10 77 -GYVASVFPDIDVG----KDADNKKII---DVLGD-DRALTLNGLLVDSSNAGRAWLHYWIDEDGNFRYGIIQPDQITPI 147 (470) T ss_pred -hheeccceeeecC----chHHHHHHH---HHHhh-hHHHHHHHHHHHHhhcCeeEEEEEecCCCceEEEEEcccceEEE Confidence 12210 1 1111 111222222 23332 44555667889999999999998887655 66777999999888 Q ss_pred EEEC--CeeEEEEEeeeccCc-----c---hhccccHHHHHHHH-------------HHhh------hh-hccCcccccC Q lcl|NC_015263. 173 SVSG--GVYNYVIDLDALVSA-----D---IVDYYPKEIQEAVN-------------KYTT------MK-KGNNKSASNW 222 (513) Q Consensus 173 g~~n--G~y~~~fD~syFd~~-----~---~L~~~p~Ei~~~y~-------------~Y~~------~k-~~~~~~~~~W 222 (513) --.+ +.+.+++ +++... . ..+.|.++-..-|. .... .. ..-+.....| T Consensus 148 ~d~~~~~~~~a~i--r~y~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 225 (470) T protein:vir:10 148 YATTLDNKLLGIL--RSYKQLDPDSGKYFTVHEYWTDKEAQFFRTNATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNF 225 (470) T ss_pred EcCCCCCceEEEE--EEEEeeecCCceEEEEEEEEcCCcEEEEEeecCcceeccccccccccccccccccccccccccCC Confidence 5432 5555553 333211 0 11222111100000 0000 00 0000000112 Q ss_pred eeecCCceEEEEecCccccchhhHHHHHHhHHHHHHHHH-----HHhhHhhhhhceeeeeeeccccCCCCCccccCHHHH Q lcl|NC_015263. 223 YEIQDKNSICIKINESSLTPVPPFAGTFDSIYDIHSFKD-----LRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMM 297 (513) Q Consensus 223 ~~L~~~kt~~ik~~~~~~~~ip~f~~v~~d~~di~~~kd-----L~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~ 297 (513) =.+| ++.+-. ...+.+- |.++..+.+.=+ +-+..+...+ .+++ ++=.++.+.+++. T Consensus 226 g~vP-----vv~~~n-n~~g~sd----~e~v~~liDa~d~~~S~~~~~~~~~~~-~~lv--l~g~~~~~~~~~~------ 286 (470) T protein:vir:10 226 GRVP-----FIEFSK-NKYRLPE----LNKYKGLIDAYDDIYNGFINDLDDVQT-VILV--LTNYGGADLHQFM------ 286 (470) T ss_pred Ceee-----EEEeec-CCCCCCc----hhHHHHHHHHHHHHHHHHHHHHHHhcC-ccee--eecCCccccchhh------ Confidence 1111 122211 1123232 333333322222 1122222222 1211 1200011112221 Q ss_pred HHHHHHHHHh----cc---c----cceEEEecccccccccccccccchhhhhhHHhhhhhhhhhhhhccCC-CcchHHHH Q lcl|NC_015263. 298 NYFHEALSMT----VP---D----NVGVVTSPMEIDTVSFDKDSSTDDSVEKATKNFWDNAGVSQILFSSD-NKTSQGIA 365 (513) Q Consensus 298 ~~~~~~ik~~----Lp---~----gv~~v~sP~~~d~i~ld~~~~~~dtv~~~~~~i~~~~GiS~~Lfn~d-~~s~~~~~ 365 (513) ..++.. ++ . ++-.++.+.+. ......++...++|+.-+++-.+-+.+. +.|+..++ T Consensus 287 ----~~~~~~~~i~~~~~~~~~~~~~~~lt~~~~~--------~~~~~~~~~L~~~I~~~s~~p~~~~~~~gn~Sg~Alk 354 (470) T protein:vir:10 287 ----NDLRKYKSIKINNTGNGDNSGVDKLQIDIPV--------EARDDALKITRKNIFLFGQGIDPANFESSNASGVAIK 354 (470) T ss_pred ----hhhhhcCeEeccCCCCCcCceeEEEeecCCh--------HHHHHHHHHHHHHHHHHhCCCCCCccccccchHHHHH Confidence 122211 11 0 11122222211 1112335566677888777664444332 33444444 Q ss_pred HHHHHHHHHHHHH----HHHHHHHHHHH---Hhhccc-ceEEEEEecCCCCccHHHHHHHHHHHHhcC-CcHHHHHHHHh Q lcl|NC_015263. 366 MSIATDEQFIFGV----INQLERWLNRY---LLLNGM-SKYFKATMLEVTHFSKKEAHDRYITDAQYG-FPVKVYLASLM 436 (513) Q Consensus 366 ~SI~~d~~~~~~~----~~~iE~~~N~~---i~~~~~-~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G-~~~~~~laa~~ 436 (513) .-...-.+.+... -+-|.+.++.+ +..... .....+.|-+..+-|..+.++.+.++. | .|... +.+.+ T Consensus 355 ~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~l~~~~~d~~~i~i~f~~~~p~d~~e~~~~~~~~~--g~iS~et-~l~~~ 431 (470) T protein:vir:10 355 MLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFSDADKRHISQHWTRTKVEDSLTKAQIVSTVA--NYSSKEA-VAKAN 431 (470) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccceeeEEeccCCCCCHHHHHHHHHHHh--ccCcHHH-HHHhC Confidence 3333333333322 22222222222 222111 225889999999999999999998875 6 55554 55568 Q ss_pred CC--CHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccCCCCcC Q lcl|NC_015263. 437 GI--DPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGR 488 (513) Q Consensus 437 G~--~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~gr 488 (513) |+ +|.+.+.++..|++. ..+ ....... .. +++.+.+. T Consensus 432 p~v~D~~~E~eri~~E~~e-~~~-~~~~~~~-~~------------~~~~dde~ 470 (470) T protein:vir:10 432 PIVDDWQQELKDLAKDKEE-NDP-YSNQADE-LN------------GKGVNDEQ 470 (470) T ss_pred CCCCCHHHHHHHHHHHHHH-HHH-hhccccc-cC------------CCCCCCCC Confidence 87 789999999998754 111 1111100 00 00001000 No 166 >protein:vir:108215 Length: 469 # NCBI annotation: gp6 # Family: family:all:2372 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552335;genbank:gi:160700655;genbank:GeneID:5758935 Probab=76.69 E-value=0.13 Score=25.39 Aligned_cols=413 Identities=9% Similarity=0.028 Sum_probs=151.4 Q ss_pred CCCccchheeeeehhhhhhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHH-----HHHH Q lcl|NC_015263. 1 MVKNKKKRLSMIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLR-----KVSE 75 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir-----~~s~ 75 (513) |++-...|+.+=-|-+-.- +.+. -||.+ ..+...|| ++-+ T Consensus 4 ~~~~~~p~~~~g~~~~~~~-~~~~----~~~~~------------------------------~e~~~~lr~~~~~~ly~ 48 (469) T protein:vir:10 4 RVKTAAPVSEAGYVFGSGV-VDGW----TVWDP------------------------------FEQTPELQWPQSVAVYS 48 (469) T ss_pred cccCCCCccchhhhhhccc-ccch----hhccc------------------------------cccccccccccchHHHH Confidence 2222222222111100000 0000 01111 11112232 2445 Q ss_pred HHHhhcchHHHHHHHHhh-cccccceEeeccchhhhhhcchhHHHHHHHHHHhhc------ChhHHHHHHHHHHHHhcce Q lcl|NC_015263. 76 DLAVQSQQYQRLLNFYAN-MPLYAYSVVPFKDISTANENKLKKELATVTEFLSRL------NPKYNFSKIVKLAMTVDIF 148 (513) Q Consensus 76 ~lY~~sg~~~rlidy~~~-mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~------n~k~~~~~i~~~~l~~g~~ 148 (513) .+....+|++..++-... +-.+++.|.|-+.++. ....+.............+ .++..+...+...+..... T Consensus 49 ~m~e~D~~i~s~l~~rk~av~~~~w~v~p~~~~~e-~~~~~~~~L~~~~~~~~~~~~~~~~~~~~~w~~~l~~~l~~a~~ 127 (469) T protein:vir:10 49 RMDNEDSRVTSLLEAISLPIRSTPWRIRANGASDE-VTEFVSRNLMVPIDGEDDVRNPGRSRGRFSWAEHLEEVTSPTLQ 127 (469) T ss_pred HHHhhChHHHHHHHHHHHHHhcCCceEecCCCCHH-HHHHHHHHHHhhhhhhhhhhhhhhhhccccHHHHHHHHHHHhhh Confidence 555678999988876654 7889999999664321 1111222111111100000 1234455566666666666 Q ss_pred eEEEEE----cC-----cc-eeeeecCcceeEEEEEECCeeEEEEEeeeccCcch--hccccHHHHHHHHHHhhhhhccC Q lcl|NC_015263. 149 YGYVID----DK-----ES-VMIQQFPNDICKISSVSGGVYNYVIDLDALVSADI--VDYYPKEIQEAVNKYTTMKKGNN 216 (513) Q Consensus 149 ~gy~i~----d~-----~~-~~iq~lp~dyckIsg~~nG~y~~~fD~syFd~~~~--L~~~p~Ei~~~y~~Y~~~k~~~~ 216 (513) |||-.- .. ++ +.. ++|.-+...-. . -+.++++... +...+++.......| T Consensus 128 ~G~s~~Eivw~~~~~~~dG~~~~-------~~l~~rp~~~i-~--~~~~~~~~~l~~~~~~~~~~~~~~~~~-------- 189 (469) T protein:vir:10 128 FGHAVFEQVYRPRNQSPDGRFWL-------RKLAPRPQWTI-S--KFNVAPDGGLESIEQIAPPARTRGSLY-------- 189 (469) T ss_pred hCceeeeeeeecccccCCCceee-------eeeeecCcccc-e--eeeeccCCceeeeeecCcccccccccc-------- Confidence 776552 11 11 122 22221111100 0 0122222221 111222211111111 Q ss_pred cccccCeeecCCceEEEEecCccccchhhHHHHHHhHHHHHHHHH--HHhhHhhhhhc--eeeeeeeccccCCCCCcccc Q lcl|NC_015263. 217 KSASNWYEIQDKNSICIKINESSLTPVPPFAGTFDSIYDIHSFKD--LRNDKAELQNY--KLLIQKLETRSSNDNNDFTL 292 (513) Q Consensus 217 ~~~~~W~~L~~~kt~~ik~~~~~~~~ip~f~~v~~d~~di~~~kd--L~~~~~~i~n~--~ii~~kip~~~~n~~~~~~v 292 (513) ........||+.|-+++..+..+ +-|.-.+++..++-.--+|. ++.--.=++-+ -+.+.|.|-+ - T Consensus 190 ~~~~~~~~lp~~k~i~~~~~~~~--g~p~g~gLlr~~~~~~~fK~~~~~~w~~f~EryG~P~~vgky~~~---------a 258 (469) T protein:vir:10 190 VANIAPPEIPVNRLVVYTRNKRP--GQWQGKSILRSAYKHWLLKDKLLRIEAATAERNGMGIPVGTASSA---------T 258 (469) T ss_pred cCCCCccccccCcEEEEEecCCC--CCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCcceEEecCCC---------C Confidence 11124678999998888865432 22333444444444444444 22222223332 3455665521 2 Q ss_pred CHHHHHHHHHHHHHhccccc-eEEEecccccccccccc-cccchhhhhhHHhhhhhhhhhhhhccCCCc----chHHHHH Q lcl|NC_015263. 293 DMPMMNYFHEALSMTVPDNV-GVVTSPMEIDTVSFDKD-SSTDDSVEKATKNFWDNAGVSQILFSSDNK----TSQGIAM 366 (513) Q Consensus 293 d~~~~~~~~~~ik~~Lp~gv-~~v~sP~~~d~i~ld~~-~~~~dtv~~~~~~i~~~~GiS~~Lfn~d~~----s~~~~~~ 366 (513) +.+..+...+.+++.. .|. +.++.|-. ..|+|-.. ++..++.. . - =+-+.-||.+++|+.-+ +++.+.. T Consensus 259 ~~~ek~~l~~a~~~~~-~g~~a~~iip~~-~~ie~~ea~g~~~~~~~-l-i-~~~d~~Isk~iLG~tlTs~~~gGS~a~~ 333 (469) T protein:vir:10 259 DEDEVRKMAALARSVR-GGINAGVGLAQG-QILELLGVSGNLPDIRR-A-I-EGHDRSIALSGLAHFLNLDGKGGSYALA 333 (469) T ss_pred CHHHHHHHHHHHHHHh-cCCceEEEccCC-ceEEEeecCCCchHHHH-H-H-HHHHHHHHHHHhcccccccCccchhhHH Confidence 2334445555555432 221 22333422 33444222 22222211 1 1 12334455555433211 1122222 Q ss_pred HHH--HHHHHHHHHHHHHHHHHHHHH-h----hccc--ceEEEEEecCCCCccHHHHHHHHHHHHhcCC----cHHHHHH Q lcl|NC_015263. 367 SIA--TDEQFIFGVINQLERWLNRYL-L----LNGM--SKYFKATMLEVTHFSKKEAHDRYITDAQYGF----PVKVYLA 433 (513) Q Consensus 367 SI~--~d~~~~~~~~~~iE~~~N~~i-~----~~~~--~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~----~~~~~la 433 (513) ++. .-..++-.-.++|+.-+|+.| . .|.- ....+|.|...- ...+..++.+.++.+.|+ ++...+. T Consensus 334 ~vh~ev~~d~~~sDa~~i~~tln~~li~~l~~lN~g~~~~~P~~~~~~~e-~~~~~~a~~i~~l~~~G~~~~~~~~~~~~ 412 (469) T protein:vir:10 334 SVLEDPFTQAVHAYATSICRIANQHIIEDLVDINFGVDTPAPVLTFDPIG-SRQDLTAAAVKLLYDAGVFDDDPAVKRAI 412 (469) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccEEEecCCC-CcHHHHHHHHHHHHhcCCccCccccHHHH Confidence 222 223344467778888888733 2 2211 223567765433 344556666666666665 2221111 Q ss_pred HHhCCCHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccCCCCcCCCCcccccccCCCCCCCCCCccCCC Q lcl|NC_015263. 434 SLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETTGNKDSDETQRAKDKPANTQ 513 (513) Q Consensus 434 a~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~~n~~~~~~~~~~d~~~~~~ 513 (513) . +.+||+..-.... ..... ++.+.+..+..| ......+...-..++|+..+ T Consensus 413 ----------------~-e~~gip~~~~~~~---~~~~~-----~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~ 463 (469) T protein:vir:10 413 ----------------R-QRFNLPSELNDTP---SAEPE-----EPAAVPNQSAAP----ARTRSSGNADARARAPKADQ 463 (469) T ss_pred ----------------H-HHhCCCCCCCCcc---cccch-----hcccCCCCCccc----cccCCCCCcccccccCCChH Confidence 1 3445431100000 00000 000000000000 00001111111111111111 No 167 >protein:vir:5691 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839850;genbank:gi:30065705;genbank:GeneID:1260599 Probab=75.37 E-value=0.15 Score=25.14 Aligned_cols=312 Identities=13% Similarity=0.116 Sum_probs=144.1 Q ss_pred CCCccchheeeeehhhhhhHHHHHHHHHHHHHhhccCccccccccccc-------c--hHHHHHHHhh-hcc--ChhHHH Q lcl|NC_015263. 1 MVKNKKKRLSMIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLT-------S--SQSKVRKIVK-EYR--NEGNQK 68 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~-------~--s~d~~k~~i~-~~~--P~~n~~ 68 (513) |-|.|++|..--. .-. ....+. ...|+-+ . ..|-++-+.+ +|+ |..- . T Consensus 1 ~~~~~~~~~~~~~--------------~~~---~~~~~~--~~~~~~~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~-~ 60 (344) T protein:vir:56 1 MSKKKGKTPQPAA--------------KTM---TASAPK--MEAFTFGEPVPVLDRRDILDYVECISNGRWYEPPVSF-T 60 (344) T ss_pred CCCCCCCCCchhh--------------HHh---hcCCCc--eEEEEcCCceeecCcchhhhHHHhhhcCccccCCCCH-H Confidence 9888888753111 000 011121 2333311 1 1122222211 233 3333 2 Q ss_pred HHHHHHHHHHhhcchHHHHHHHHhhcccccceEeeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcce Q lcl|NC_015263. 69 TLRKVSEDLAVQSQQYQRLLNFYANMPLYAYSVVPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDIF 148 (513) Q Consensus 69 ~ir~~s~~lY~~sg~~~rlidy~~~mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~~ 148 (513) .|.++ +..+++-.+.|....+|....| .| +.+ |. ..+|..+..+.+..|.. T Consensus 61 ~la~~----~~a~~~h~s~i~~k~n~l~~~~--~P---------np~----------~t----~~~f~~~~~d~ll~Gna 111 (344) T protein:vir:56 61 GLAKS----LRAAVHHSSPIYVKRNILASTF--IP---------HPW----------LS----QQDFSRFVLDFLVFGNA 111 (344) T ss_pred HHHHH----HhhhhhhCccceehhhhHHhhc--CC---------CCC----------CC----HHHHHHHHHHHHhcCCe Confidence 23332 2344444444444444332222 12 000 01 23456677788889999 Q ss_pred eEEEEEc--CcceeeeecCcceeEEEEEECCeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeec Q lcl|NC_015263. 149 YGYVIDD--KESVMIQQFPNDICKISSVSGGVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQ 226 (513) Q Consensus 149 ~gy~i~d--~~~~~iq~lp~dyckIsg~~nG~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~ 226 (513) |..++.+ +..+-+.++|+.||++. ...+.|.+... .. .=++++ T Consensus 112 y~~~~rn~~G~~~~L~pl~~~~v~~~-~~~~~~~~~~~-----~g-----------------------------~~~~~~ 156 (344) T protein:vir:56 112 FLEKRYSTTGKVIRLETSPAKYTRRG-VEEDVYWWVPS-----FN-----------------------------EPTAFA 156 (344) T ss_pred EEEEEECCCCcEEEEEEeCCceeEEe-ecCCEEEEEec-----CC-----------------------------eEEEEc Confidence 9998876 45578999999999985 23333322110 00 002233 Q ss_pred CCceEEEEe-c-CccccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHH Q lcl|NC_015263. 227 DKNSICIKI-N-ESSLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEAL 304 (513) Q Consensus 227 ~~kt~~ik~-~-~~~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~i 304 (513) .+.-+.|+- + ....+|+|+..+....+.--+...+... .-..|- ...+-|-+. .+..++.++++.+.+.+ T Consensus 157 ~~dIiHir~~~~~~~~~Gls~~~~a~~si~l~~~a~~~~~--~~f~NG-a~pg~Il~~-----~d~~ls~e~~~~lk~~~ 228 (344) T protein:vir:56 157 PGSVFHLLEPDINQELYGLPEYLSALNSAWLNESATLFRR--KYYENG-AHAGYIMYV-----TDAVQDRNDIEMLRENM 228 (344) T ss_pred CccEEEECCCCCCCCcccccHHHHHHHHHHHHHHHHHHHH--HHHhcc-CCCceEEEe-----cCCCCCHHHHHHHHHHH Confidence 433344442 3 2456899999988887665444444321 111221 111111110 02347777887777777 Q ss_pred HHhccccc---eEEEeccc-cccccc---ccccccchh---hhhhHHhhhhhhhhhhhhccC--C-CcchH-HHHHHH-- Q lcl|NC_015263. 305 SMTVPDNV---GVVTSPME-IDTVSF---DKDSSTDDS---VEKATKNFWDNAGVSQILFSS--D-NKTSQ-GIAMSI-- 368 (513) Q Consensus 305 k~~Lp~gv---~~v~sP~~-~d~i~l---d~~~~~~dt---v~~~~~~i~~~~GiS~~Lfn~--d-~~s~~-~~~~SI-- 368 (513) +++--.|- ..+.+|-- -+.+++ .....+.+. -+-..++|..+.||...+.|- + +.+.+ .-+... T Consensus 229 ~~~~g~~~~r~l~l~~p~g~~~G~~~~pis~~~~d~qf~e~k~~s~~eIa~afrVPp~llGi~~~~t~~~~n~eq~~~~f 308 (344) T protein:vir:56 229 VKSKGRNNFKNLFLYAPQGKADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQLMGGKPENVGSLGDIEKVAKVF 308 (344) T ss_pred HHhcCCCCccceEEecCCCCccceeEEEcCCChHHHHHHHHHHhhHHHHHHHhCCCHHHhccCCCCCCccccHHHHHHHH Confidence 76431111 24444421 123333 222333333 333447899999999998863 1 11122 222222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcccceEEEEEecCCCCc Q lcl|NC_015263. 369 ATDEQFIFGVINQLERWLNRYLLLNGMSKYFKATMLEVTHF 409 (513) Q Consensus 369 ~~d~~~~~~~~~~iE~~~N~~i~~~~~~~~f~~~~l~~T~f 409 (513) -.++.. -++++||. +|..|-.+.+ +|+=+-++.+.- T Consensus 309 ~~~tL~--Pl~~~ie~-~n~~l~~~~~--~F~~y~l~~~~~ 344 (344) T protein:vir:56 309 VRNELI--PLQDRIRE-INGWIGQEVI--RFKNYSLDTDNG 344 (344) T ss_pred HHHHHH--HHHHHHHH-HHhhhccccc--cCCCccccccCC Confidence 233332 36666764 5555532222 244444555433 No 168 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=72.99 E-value=0.18 Score=24.72 Aligned_cols=375 Identities=14% Similarity=0.091 Sum_probs=150.7 Q ss_pred chHHHHHHHhhhccChhHHHHHHHHHHHHHhhc--------------------------chHHHHHHHHhhcccccceEe Q lcl|NC_015263. 49 SSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQS--------------------------QQYQRLLNFYANMPLYAYSVV 102 (513) Q Consensus 49 ~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~s--------------------------g~~~rlidy~~~mpt~dY~I~ 102 (513) ..-+.+++.|..| ......++.+-+|+.... +..+.+++-.++ |.+ T Consensus 1 l~~~~i~~~i~~~--~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~-----yl~- 72 (451) T protein:vir:10 1 MELEKIRAIISAD--AARRQEILQAKSYYYNKNDILKKGVVVQNRDENPLRNADNRISHNFHEILVDEKAS-----YMF- 72 (451) T ss_pred CCHHHHHHHHHHH--HHHHHHHHHHHHHhcccCccccccccccccccccccccccccccchHHHHHHhhhh-----hee- Confidence 3334445555544 233444555555555432 222223332222 221 Q ss_pred eccchh-hhh-hcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcceeEEEEEcCc---------ceeeeecCcceeEE Q lcl|NC_015263. 103 PFKDIS-TAN-ENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDIFYGYVIDDKE---------SVMIQQFPNDICKI 171 (513) Q Consensus 103 P~~~~~-~~~-~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~~~gy~i~d~~---------~~~iq~lp~dyckI 171 (513) ++.- ... ..+-..+..+ .++ .=++......+.+.+++.|..|.+...+.+ .+-+.-++|..|.+ T Consensus 73 --G~p~~~~~~~~~~~~~~~~--~~~-~n~~~~~~~~~~~~~~~~G~a~~~~y~de~~~~~~~~~~~~~~~~i~p~~~~~ 147 (451) T protein:vir:10 73 --TYPVLFDIDNNKELNEKVT--DVL-GNEFTRKAKNLAIEASNCGSAWLHYWIDEEYSGEQVTNQTFKYGVVNTEEIIP 147 (451) T ss_pred --cccceeecCCcHHHHHHHH--HHh-ccCHHHHHHHHHHHHhhcCeEEEEEeecCCcccccccccceeEEEEcccceEE Confidence 1111 100 1111111222 122 235788889999999999999988776543 35577788888877 Q ss_pred EEE--ECCeeEEEEEeeeccCc--c--------hhccccHHHHHHHHHHhhhhhccCcccccCeee-----cCCceEEEE Q lcl|NC_015263. 172 SSV--SGGVYNYVIDLDALVSA--D--------IVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEI-----QDKNSICIK 234 (513) Q Consensus 172 sg~--~nG~y~~~fD~syFd~~--~--------~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L-----~~~kt~~ik 234 (513) +-- ..+.+.+++=.-+-... . .++.|.+.... .|.. ........+... +...-.++. T Consensus 148 vydd~~~~~~~~~ir~~~~~~~~~~~~~~~~~~~~e~yt~~~~~---~~~~---~~~~~~~~~~~~~~~~~~~g~vPvv~ 221 (451) T protein:vir:10 148 IYRNGIERELEAVIRYYIQLEDVKGQIQKQAYTYVEFWTDKILD---KYKF---FGVSCCGSQIEHITVQHRFNSVPFVE 221 (451) T ss_pred EEcCCCCCceEEEEEEEEeeecccccccceEEEEEEEEeCCeEE---EEEe---cccCccccccccccccCCCCeeeEEE Confidence 643 23555555432211110 0 11112111000 0100 000000001110 011111233 Q ss_pred ecCccccchhhHHHHHHhHHHHHHHHH-----HHhhHhhhhhceeeeeeec-cccCCCCCccccCHHHHHHHHHHHHHhc Q lcl|NC_015263. 235 INESSLTPVPPFAGTFDSIYDIHSFKD-----LRNDKAELQNYKLLIQKLE-TRSSNDNNDFTLDMPMMNYFHEALSMTV 308 (513) Q Consensus 235 ~~~~~~~~ip~f~~v~~d~~di~~~kd-----L~~~~~~i~n~~ii~~kip-~~~~n~~~~~~vd~~~~~~~~~~ik~~L 308 (513) +-. ...+.|- |.++..+.+.=+ +-+..+...+ .+++ ++ + ++...++ +-..++. T Consensus 222 ~~n-n~~~~~d----~e~v~~liDa~~~~~S~~~~~~~~~~~-~~l~--~~g~-~~~~~~~----------~~~~~~~-- 280 (451) T protein:vir:10 222 FSN-NIKKQSD----LSKYKKILDLYDRVMSGFANDLEDIQQ-IIYI--LENF-GGEDTSE----------FLKELKR-- 280 (451) T ss_pred ecc-CCCCCCc----hhhHHHHHHHHHHHHHHHHHHHHHhcc-ceee--eecC-Ccccchh----------hHHHHhh-- Confidence 311 1122233 333333332222 1122222222 2221 22 1 0111111 1111222 Q ss_pred cccceEEEecc-----ccccccccccc-ccchhhhhhHHhhhhhhhhhhhhccCC-CcchHHHHHHHHHHHHHHHHHH-- Q lcl|NC_015263. 309 PDNVGVVTSPM-----EIDTVSFDKDS-STDDSVEKATKNFWDNAGVSQILFSSD-NKTSQGIAMSIATDEQFIFGVI-- 379 (513) Q Consensus 309 p~gv~~v~sP~-----~~d~i~ld~~~-~~~dtv~~~~~~i~~~~GiS~~Lfn~d-~~s~~~~~~SI~~d~~~~~~~~-- 379 (513) .++..+.... +++-+.-+... .....++-..++|+.-+++-.+-+.+. +.|+.+++.-...-.+.+...- T Consensus 281 -~~~i~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Alk~~~~~l~~k~~~k~~~ 359 (451) T protein:vir:10 281 -YKTIKTETDSEGDSGGLKTMQIEIPTEARKIILEILKKQIYESGQGLQQDTENFGNASGVALKFFYRKLELKSGLLETE 359 (451) T ss_pred -CCeEEecCcCCccCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCcccccccccccccHHHHHHHHHHHHHHHHHHHHH Confidence 1111111000 11111111111 112335555677888887654333222 2344444444333333333222 Q ss_pred --HHHHHHHHHHHhhcc-c-ceEEEEEecCCCCccHHHHHHHHHHHHhcC-CcHHHHHHHHhCC--CHHHHHHHHHHHHH Q lcl|NC_015263. 380 --NQLERWLNRYLLLNG-M-SKYFKATMLEVTHFSKKEAHDRYITDAQYG-FPVKVYLASLMGI--DPVAFTGLLKVENE 452 (513) Q Consensus 380 --~~iE~~~N~~i~~~~-~-~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G-~~~~~~laa~~G~--~p~~~~~~~~~E~e 452 (513) +-+++.++.++.-.. . -..+.+.|-+..+-|..+.++.+.++. | .|... +.+.+|+ +|.+.+.+...|.+ T Consensus 360 f~~~l~~~~~li~~~~~~~d~~~i~i~f~~~~p~n~~e~~~~~~kl~--g~iS~et-~~~~~p~v~d~~~e~~~~~ee~~ 436 (451) T protein:vir:10 360 FRTSFDKLIKAILYFLGVTDYKKIQQTYTRNMMSNDLEDADIATKSV--GIIPTKI-ILRHHPWVDDVEEAEKLYLEEKK 436 (451) T ss_pred HHHHHHHHHHHHHHHhCCCCccceeEEecCCCCCCHHHHHHHHHHHh--ccCchHH-HHHhCCCCCCHHHHHHHHHHHHH Confidence 222222322222111 1 224889999999999999999999985 5 55554 5556777 57777777765543 Q ss_pred hhCcccccCcccccccccccccccCCccccCCC Q lcl|NC_015263. 453 MLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKE 485 (513) Q Consensus 453 ~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~ 485 (513) . ..... +. ..+.- .+ T Consensus 437 ~-~~~~~----~~---~~~~~----------~~ 451 (451) T protein:vir:10 437 I-QASKV----SD---DYNNF----------TE 451 (451) T ss_pred H-HHHHH----Hh---hcCCC----------CC Confidence 2 11110 00 00100 00 No 169 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=72.81 E-value=0.18 Score=24.69 Aligned_cols=424 Identities=11% Similarity=0.034 Sum_probs=157.4 Q ss_pred eeehhhhhhHHHHHHHHHHHHHhhccCcccccccccccchH------------HHHHHHhhhcc--ChhHHHHHHHHHHH Q lcl|NC_015263. 11 MIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQ------------SKVRKIVKEYR--NEGNQKTLRKVSED 76 (513) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~------------d~~k~~i~~~~--P~~n~~~ir~~s~~ 76 (513) |-=+.-|| -.+ +-||-- -++-..|++|... +.+++.|+.+. -......+..+.+| T Consensus 1 ~~~~~~~~---~~~--~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~l~~Y 69 (492) T protein:vir:97 1 MQFIQLIS---QVA--QALIKG------GNILYPSQPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHLEKLPEISIGQEY 69 (492) T ss_pred ChHHHHHH---HHH--HHHhcC------CceeeccchhhhhHhhhcccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 21111122 112 222210 0011222222222 12222222221 12333444445555 Q ss_pred HHhhcchHH----------------------HHHHHHhhcccccceEeeccch-hhhhhcchhHHHHHHHHHHhhcChhH Q lcl|NC_015263. 77 LAVQSQQYQ----------------------RLLNFYANMPLYAYSVVPFKDI-STANENKLKKELATVTEFLSRLNPKY 133 (513) Q Consensus 77 lY~~sg~~~----------------------rlidy~~~mpt~dY~I~P~~~~-~~~~~~~~~~~y~~v~~~L~k~n~k~ 133 (513) +.....+.. +...+++...+ .|.+ ++. ....+++-..++++ .++. =++.. T Consensus 70 Y~g~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~-~yl~---g~p~~~~~~d~~~~~~l~--~~~~-n~~~~ 142 (492) T protein:vir:97 70 YEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKV-SYIV---GKPIAFKHTDDEVVKRID--EVLG-NRFDD 142 (492) T ss_pred hcccCccccccccccccccccccccccccccchHHHHHHHHh-hhhc---ccCceeccCchHHHHHHH--HHHh-ccHHH Confidence 444321111 11111111110 1111 110 00111111112222 2232 35677 Q ss_pred HHHHHHHHHHHhcceeEEEEEcCcc-eeeeecCcceeEEEEE--ECCeeEEEEEeeeccCcchhccccH-HHHHHHHHHh Q lcl|NC_015263. 134 NFSKIVKLAMTVDIFYGYVIDDKES-VMIQQFPNDICKISSV--SGGVYNYVIDLDALVSADIVDYYPK-EIQEAVNKYT 209 (513) Q Consensus 134 ~~~~i~~~~l~~g~~~gy~i~d~~~-~~iq~lp~dyckIsg~--~nG~y~~~fD~syFd~~~~L~~~p~-Ei~~~y~~Y~ 209 (513) .+..+.+.+++.|..|.+...+.++ +-+.-++|+.|.++-- ..+.+.+++=.-.-+.....+.|.+ .+.. | .+. T Consensus 143 ~~~~~~~~~~~~G~a~~~v~~d~dg~~~~~~~~p~~~~~i~d~~~~~~~~~~vr~~~~~~~~~~~~y~~~~v~~-~-~~~ 220 (492) T protein:vir:97 143 KLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKVTVNY-Y-VYE 220 (492) T ss_pred HHHHHHHHHhhcCeEEEEEEecCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEeeccceeEEEEecCeEEE-E-EEe Confidence 8888999999999999988876554 5677789988888754 2355655533222112222222211 1100 0 000 Q ss_pred hhhhc-c-CcccccCeeecCCceE----EEEecCccccchhhHHHHHHhHHHHHHHHHH-H-hhHhhhhhceeeeeeecc Q lcl|NC_015263. 210 TMKKG-N-NKSASNWYEIQDKNSI----CIKINESSLTPVPPFAGTFDSIYDIHSFKDL-R-NDKAELQNYKLLIQKLET 281 (513) Q Consensus 210 ~~k~~-~-~~~~~~W~~L~~~kt~----~ik~~~~~~~~ip~f~~v~~d~~di~~~kdL-~-~~~~~i~n~~ii~~kip~ 281 (513) ..... . ......|..-...+.+ ++.+-. ...+.|-|. ++.++.+.=+. . +....++...--+-.+- T Consensus 221 ~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-n~~g~sd~e----~v~~liDa~d~~~S~~~~~~~~~~~~~l~~~- 294 (492) T protein:vir:97 221 NGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN-NDLEISDIF----MYKTLIDAYNRRLSDLSNTFKDSNELTYVLK- 294 (492) T ss_pred cCeeeecccccccccccccccCCCCCcceEEecC-CCCCCCchH----hHHHHHHHHHHHHHHHHHHHHHhccceeeee- Confidence 00000 0 0000011111111111 222211 122334333 33333322221 1 22222222111100010 Q ss_pred c-cCCCCCccccCHHHHHHHHHHHHHh----ccccce--EEEecccccccccccccccchhhhhhHHhhhhhhhhhhhhc Q lcl|NC_015263. 282 R-SSNDNNDFTLDMPMMNYFHEALSMT----VPDNVG--VVTSPMEIDTVSFDKDSSTDDSVEKATKNFWDNAGVSQILF 354 (513) Q Consensus 282 ~-~~n~~~~~~vd~~~~~~~~~~ik~~----Lp~gv~--~v~sP~~~d~i~ld~~~~~~dtv~~~~~~i~~~~GiS~~Lf 354 (513) + ++.+.++ +-..++.. ++++.. .++.+.+. ......++...++|+.-+++-..-+ T Consensus 295 g~~~~~~~~----------~~~~~~~~~~~~~~~~~~~~~l~~~~~~--------~~~~~~~~~L~~~I~~~s~~p~~~~ 356 (492) T protein:vir:97 295 NYDDQELPE----------FKRLLRYYGAIKVSDNGGVDTIQVEVPV--------ENSKKYLDELYQKIMLFGQAVDFSS 356 (492) T ss_pred cCCcccchh----------HHHHHhhccceecCCCCcceeEeccCCH--------HHHHHHHHHHHHHHHHHhCCCCCCc Confidence 0 0111122 22222221 233221 11111110 1111224555567777777665544 Q ss_pred cCCC--cchHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHhh---cccceEEEEEecCCCCccHHHHHHHHHHHHhcC Q lcl|NC_015263. 355 SSDN--KTSQGIAMSIATDEQFIFG----VINQLERWLNRYLLL---NGMSKYFKATMLEVTHFSKKEAHDRYITDAQYG 425 (513) Q Consensus 355 n~d~--~s~~~~~~SI~~d~~~~~~----~~~~iE~~~N~~i~~---~~~~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G 425 (513) .+.+ .|+.+++.....-...+.. |-+.+++.+..++.. ..-....++.|-+..+-|..+.++.+.++. | T Consensus 357 ~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~i~v~f~~~~p~~~~e~a~~~~kl~--G 434 (492) T protein:vir:97 357 DKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQTAQQSM--G 434 (492) T ss_pred cccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccceeeEEecCCCCCCHHHHHHHHHHHh--c Confidence 3322 3444343332222222222 222233322222222 211235889999999999999999999985 6 Q ss_pred -CcHHHHHHHHhCC--CHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccCCCCcCCCCcccccccCC Q lcl|NC_015263. 426 -FPVKVYLASLMGI--DPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETTGNKDSD 500 (513) Q Consensus 426 -~~~~~~laa~~G~--~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~~n~~~~ 500 (513) .|... +...+|+ +|.+.+.++..|++.- ..-.+. ..+.. .++|.+ .+..++++.. T Consensus 435 ~iS~et-~l~~l~~v~d~~~Eleri~~E~~~~-----~~~~~~---~~~~~----------~~~~~~-~~~~~~~~~e 492 (492) T protein:vir:97 435 IVSHET-VLENHPFVEDLQAELERIEQEQTEY-----NKQLPN---LDDGG----------ADSAQQ-QERSNNKESE 492 (492) T ss_pred cCchHH-HHHhCCCCCCHHHHHHHHHHHHHHH-----HHhhhc---cccCC----------CCCCcc-cccccccccC Confidence 55555 4446776 5788888888887431 111111 01110 111111 0101111111 No 170 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=72.36 E-value=0.18 Score=24.62 Aligned_cols=343 Identities=13% Similarity=0.057 Sum_probs=158.9 Q ss_pred hHHHHHHHHHHHHHhhcchHHHHHHHHhhcccccceE--ee--ccchhhhhhcch--------------------hHHHH Q lcl|NC_015263. 65 GNQKTLRKVSEDLAVQSQQYQRLLNFYANMPLYAYSV--VP--FKDISTANENKL--------------------KKELA 120 (513) Q Consensus 65 ~n~~~ir~~s~~lY~~sg~~~rlidy~~~mpt~dY~I--~P--~~~~~~~~~~~~--------------------~~~y~ 120 (513) =..+.|..|..-+...-+.++++-+||..-+.+.|.- .| +.. ..+.--.. -...+ T Consensus 1 ~~~~~i~~L~~~~~~~~~r~~~~~~yY~g~~~~~~~~~~~p~~~~~-~~~~v~nw~~~iVds~a~rl~~~Gf~~~d~~l~ 79 (409) T protein:vir:94 1 MTEKGIGYLRFKLSVHKRRAEMRYDQYAMKYVDRFKGITIPQALSQ-QYRSILGWCAKGVDSLADRLVFREFENDDFTVN 79 (409) T ss_pred CCHHHHHHHHHHHHHHhHHHHHHHHHhcccCchhhcChhhhHHHHH-HHhhhcchhHHHHHHhHhhcccCcccCCchHHH Confidence 2233344444444444444444445554443332210 01 000 00000000 01122 Q ss_pred HHHHHHhhcChhHHHHHHHHHHHHhcceeEEEEEcCcc-eeeeecCcceeEEEEE-ECCeeEEEEEeeeccCcch---h- Q lcl|NC_015263. 121 TVTEFLSRLNPKYNFSKIVKLAMTVDIFYGYVIDDKES-VMIQQFPNDICKISSV-SGGVYNYVIDLDALVSADI---V- 194 (513) Q Consensus 121 ~v~~~L~k~n~k~~~~~i~~~~l~~g~~~gy~i~d~~~-~~iq~lp~dyckIsg~-~nG~y~~~fD~syFd~~~~---L- 194 (513) ...+.-++...+..+.+.+++.|.-|.+...+.++ ..|...+|..|.++-= ..+...+++-..+=+..+. . T Consensus 80 ---~i~~~N~ld~~~~~~~~~aliyG~sf~~v~~~~dg~~~i~~~sp~~~~~i~D~~~~~~~~a~~~~~~d~~~~~~~~~ 156 (409) T protein:vir:94 80 ---EIFEENNPDIFFDSAVLSSLIASCSFTYISKGENDAVRLQVIEAVNATGIIDPITGLLTEGYAVLERDENNNVVLEA 156 (409) T ss_pred ---HHHHhcChhHHHHHHHHHHHHhcceeEEEecCCCCceEEEEeccceEEEEEecCCCceeeeEEEEEecCCCceEEEE Confidence 23455567888999999999999999999976666 4788999999987642 3355556554443222221 1 Q ss_pred ccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEE--EEe----cCccccchhhHHHHHHhHHHHHHHHHHH-hhHh Q lcl|NC_015263. 195 DYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSIC--IKI----NESSLTPVPPFAGTFDSIYDIHSFKDLR-NDKA 267 (513) Q Consensus 195 ~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~--ik~----~~~~~~~ip~f~~v~~d~~di~~~kdL~-~~~~ 267 (513) -++|.++...+ +.+..|...+..-..| +.+ +...++|.+-++--+.++.+.....-+. .+.. T Consensus 157 ~~~~~~~~~~~-----------~~~~~~~~~~n~~g~vPvV~f~n~~~~~~~~G~s~I~e~v~~l~da~~r~~~~~~~~~ 225 (409) T protein:vir:94 157 HFLPDRTDYYY-----------RDSRNNISIANPTGHPLLVPIIHRPDAVRPFGRSRITRSGMYWQSNAKRTLERADVTA 225 (409) T ss_pred EEecCcEEEEE-----------ecCceeEeeeCCCCCcceEEeccccccccccCccccchhHHHHHHHHHHHHHHHHHHH Confidence 12344332210 0112344444333332 222 2223344443332222333322222111 2222 Q ss_pred hhhhceeeeeeeccc---cCCCCCccccCHHHHHHHHHHHHHhc--cccceEEEeccccccc---cccccccc--chhhh Q lcl|NC_015263. 268 ELQNYKLLIQKLETR---SSNDNNDFTLDMPMMNYFHEALSMTV--PDNVGVVTSPMEIDTV---SFDKDSST--DDSVE 337 (513) Q Consensus 268 ~i~n~~ii~~kip~~---~~n~~~~~~vd~~~~~~~~~~ik~~L--p~gv~~v~sP~~~d~i---~ld~~~~~--~dtv~ 337 (513) +. ...|.+ +-+++++. +..|...+...+ |+.- +=+.+ .|++..-. -+.+. T Consensus 226 e~-------~a~pqr~i~G~d~d~~~------~~~~~~~~~~i~~~~~d~-------dg~~~~v~q~~~~~l~~~~~~l~ 285 (409) T protein:vir:94 226 EF-------YSFPQKYVTGLSDDAEP------METWKATVSSMLQFTKDE-------DGDKPTLGQFTQPSMSPFTEQLR 285 (409) T ss_pred HH-------hcChhheeEecCCCCcc------cchhhhhHHHhhcCCCCC-------CCCCceEEecCCCChhHHHHHHH Confidence 22 223311 12333322 223444443332 3210 00112 22222211 14566 Q ss_pred hhHHhhhhhhhhhhhhccCCC---cchHHHHHHHHHHHHHHHHHHHHHHHHHH---H---HHhhcc--cc---eEEEEEe Q lcl|NC_015263. 338 KATKNFWDNAGVSQILFSSDN---KTSQGIAMSIATDEQFIFGVINQLERWLN---R---YLLLNG--MS---KYFKATM 403 (513) Q Consensus 338 ~~~~~i~~~~GiS~~Lfn~d~---~s~~~~~~SI~~d~~~~~~~~~~iE~~~N---~---~i~~~~--~~---~~f~~~~ 403 (513) ..-.++...+|+..-.|++.. .|+.+++.....-...+-+-.+.+..=+. | .+..+. .. +.-++.+ T Consensus 286 ~~~~~~a~~t~lP~~~lg~~~~NpsSa~Al~a~~~~L~~~a~~k~~~fg~~~~~~~rla~~i~~~~~~~~~~~~~~~v~W 365 (409) T protein:vir:94 286 TAAAGFAGETGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYLAACLRDDAPYLREQFRKTKPKW 365 (409) T ss_pred HHHHHHhhhcCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccccccceEEe Confidence 666778888899988888754 34555665544333333333333332111 1 121111 11 1345556 Q ss_pred cCCC---CccHHHHHHHHHHHHhcC--CcHHHHHHHHhCCCHHH Q lcl|NC_015263. 404 LEVT---HFSKKEAHDRYITDAQYG--FPVKVYLASLMGIDPVA 442 (513) Q Consensus 404 l~~T---~fn~ke~~~~~~~~~~~G--~~~~~~laa~~G~~p~~ 442 (513) -+.+ .-+.-..++.+.|+++-| ...+..+.-.+|++..+ T Consensus 366 ~p~~~~~~~~~a~~aDa~~Kl~~ag~~~~~~~~~~~~lG~~~~d 409 (409) T protein:vir:94 366 EPLFEADASMLSLIGDGAIKLNQAIPEFINKDTIRDLTGIEGGE 409 (409) T ss_pred ccCCCcchHHHHHHHHHHHHHHHhcccccchhHHHHHcCCCCCC Confidence 5443 334567789999999998 43456777789999998 No 171 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=72.27 E-value=0.18 Score=24.60 Aligned_cols=402 Identities=10% Similarity=0.060 Sum_probs=161.8 Q ss_pred hccCcccccccccccc----hHHHHHHHhhhccChhHHHHHHHHHHHHHhh-------------------cchHHHHHHH Q lcl|NC_015263. 34 DNRTPVFGAPVGSLTS----SQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQ-------------------SQQYQRLLNF 90 (513) Q Consensus 34 ~~~~~~~~s~~~s~~~----s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~-------------------sg~~~rlidy 90 (513) |+..++ .+|-.-. ..+.+.+.|..| .....-++++-+|+... .+..+.+++. T Consensus 1 ~~~~~~---~~~~~p~d~~~~~~~l~~~i~~~--~~~~~r~~~~~~yy~g~~~i~~~~~~~~~~~~~ki~~n~~~~ivd~ 75 (453) T protein:vir:39 1 MKYKPP---KLMTFPKDEPITNEVVTKFMEKH--RLEVARYEYLKNMYRGIMAIDAEPTKDLWKPDNRLTVNFTKYIVDT 75 (453) T ss_pred CeecCC---cceEcCCCCCCCHHHHHHHHHHH--HHHHHHHHHHHHHhhccCchhcCCCccccCccceeecchHHHHHHH Confidence 333332 1222111 222333333332 23333444444443322 2233334443 Q ss_pred HhhcccccceEeeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcceeEEEEEcCcc-eeeeecCccee Q lcl|NC_015263. 91 YANMPLYAYSVVPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDIFYGYVIDDKES-VMIQQFPNDIC 169 (513) Q Consensus 91 ~~~mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~~~gy~i~d~~~-~~iq~lp~dyc 169 (513) .++...- .|..-. ...+...+ .+...+..-++...+..+.+.+++.|..|.+...+.++ .-+..++++.| T Consensus 76 ~~~~l~g----~~~~~~--~~d~~~~~---~l~~i~~~N~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~ 146 (453) T protein:vir:39 76 FTGYFNG----IPVKKS--HSDKETLS---KLQEFDNLNDMEDEESELAKMACIYGRAFELLYQNEETQTNVIYNTPENM 146 (453) T ss_pred Hhhhhcc----cCceec--cCChHHHH---HHHHHHHhcChhHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEEcccce Confidence 3332111 110000 01111222 23344666678889999999999999999988877665 45778888888 Q ss_pred EEEEEE--CCeeEEEEEeeeccCc-chhccc-cHHHHHHHHHHhhhhhccCcccccCeeecC-Cce----EEEEecCccc Q lcl|NC_015263. 170 KISSVS--GGVYNYVIDLDALVSA-DIVDYY-PKEIQEAVNKYTTMKKGNNKSASNWYEIQD-KNS----ICIKINESSL 240 (513) Q Consensus 170 kIsg~~--nG~y~~~fD~syFd~~-~~L~~~-p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~-~kt----~~ik~~~~~~ 240 (513) -.+--. +....+++-...-++. .+++-| |..+.. |. .....|..... +|. .++-+- ... T Consensus 147 ~~v~d~~~~~~~~~~ir~~~~~~~~~~~~~yt~~~i~~----~~-------~~~~~~~~~~~~~~~~g~vPvv~~~-n~~ 214 (453) T protein:vir:39 147 FMVYDDTIKQEPLFAVRYGYDDDYKLYGEVYTKETTYA----LN-------GTMGFYNMTEQAPNPFDDLPVVEFY-FNE 214 (453) T ss_pred EEEecCCCCCeEEEEEEEEEeCCeEEEEEEEeCCeEEE----EE-------ecCCceeeecccccCCCceeEEEec-CCC Confidence 777542 2334455432221111 112222 111111 10 00112322211 111 112221 122 Q ss_pred cchhhHHHHHHhHHHHHHHHHHH--hhHhhhhhc--eeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhccccceEEE Q lcl|NC_015263. 241 TPVPPFAGTFDSIYDIHSFKDLR--NDKAELQNY--KLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTVPDNVGVVT 316 (513) Q Consensus 241 ~~ip~f~~v~~d~~di~~~kdL~--~~~~~i~n~--~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~Lp~gv~~v~ 316 (513) .+++-|. ++.++.+.=+.. +....++-. .+++ +. +.+.+++ ....+...-.-.++.+.+... T Consensus 215 ~g~sd~e----~v~~liDa~~~~~s~~~~~~~~~~~p~~~--~~--g~~~~~~------~~~~~~~~~~~~~~~~~~~~~ 280 (453) T protein:vir:39 215 ERMSIFE----SVISLVNAFNKAISEKANDVDYFSDQYLT--FL--GAAVEEE------DLKNIRSNRVINYYGESSEAK 280 (453) T ss_pred CCCcchh----hhHHHHHHHHHHHHHHHHHHHHhhCceee--ee--cCCCCch------hhhhhhhcceeeecCCCCCCC Confidence 3433333 333333222211 111122111 1111 11 1111221 111221111111121111110 Q ss_pred ecccccccccccc-cccchhhhhhHHhhhhhhhhhhhhccCC-CcchHHHHHHHHHHHHHHHH----HHHHHHHHHHH-- Q lcl|NC_015263. 317 SPMEIDTVSFDKD-SSTDDSVEKATKNFWDNAGVSQILFSSD-NKTSQGIAMSIATDEQFIFG----VINQLERWLNR-- 388 (513) Q Consensus 317 sP~~~d~i~ld~~-~~~~dtv~~~~~~i~~~~GiS~~Lfn~d-~~s~~~~~~SI~~d~~~~~~----~~~~iE~~~N~-- 388 (513) .| ++.-+.-+.. ......++-..++|+.-+++-..-+.+. +.|+.+++.....-...+-. |-+.|++.+.. T Consensus 281 ~~-~~~~lt~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~ 359 (453) T protein:vir:39 281 NV-DVKFLEKPDSDSQTENLLDRLTKLIFQTTMVANISDESFGSSSGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLYC 359 (453) T ss_pred CC-ceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 01 1111111111 1111224444566766666543333221 23444555544332233322 22233332222 Q ss_pred -HHhhcccce---EEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCC--CHHHHHHHHHHHHHhhCcccccCc Q lcl|NC_015263. 389 -YLLLNGMSK---YFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMGI--DPVAFTGLLKVENEMLDLPEIMTP 462 (513) Q Consensus 389 -~i~~~~~~~---~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~--~p~~~~~~~~~E~e~L~l~~~~~P 462 (513) ++....... ...+.|-+..+-+..+.++.+.++.+. .|....|. .+|+ +|.+.+.+...|++....... T Consensus 360 ~~~~~~~~~~~~~~i~v~f~~~~p~~~~~~a~~~~kl~g~-is~et~l~-~l~~v~D~~~E~~ri~~E~~~~~~~~~--- 434 (453) T protein:vir:39 360 ELSTNVSNKEAWKDIEYTFTRNEPKDIKEQAETANILMGI-TSQETALS-VISVIPDVQAEMEKIKKEEASTAIFDK--- 434 (453) T ss_pred HHHhccCCccccccceEEeCCCCCcCHHHHHHHHHHHhcc-CChHHHHH-hCCCCCCHHHHHHHHHHHHHHHHHHHH--- Confidence 222222222 357888999999999999999998643 77666665 5787 688999999999865331110 Q ss_pred ccccccccccccccCCccccCCCCcCCCCccccc Q lcl|NC_015263. 463 LSSSFNTSGSDIAENAIKEKGKENGRPTNETTGN 496 (513) Q Consensus 463 l~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~~n 496 (513) . + ....++++++.| ++.+. T Consensus 435 -~------~------~~~~~~~~~~~~--~~~~e 453 (453) T protein:vir:39 435 -D------K------QPSEKGTDTVVP--ETNEE 453 (453) T ss_pred -h------c------cCCCCCCCCCCC--CcCCC Confidence 0 0 000011122222 11111 No 172 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=70.67 E-value=0.21 Score=24.35 Aligned_cols=421 Identities=12% Similarity=0.073 Sum_probs=164.0 Q ss_pred eeeeehhhhhhHHHHHHH-HHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhh------- Q lcl|NC_015263. 9 LSMIDVESISSYSNKRNN-RISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQ------- 80 (513) Q Consensus 9 ~~~~~~~~~~~~~~~~~~-~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~------- 80 (513) .+||+ ...++ +|++. -|...++...-+ .++.|.- |....+.|+.--+|+... T Consensus 1 m~~~~--------~~k~~~~~~~~-~~~~~~~~~~~~---------~~~~i~~--~~~~~~ri~~~~~~y~g~~~~~~~~ 60 (508) T protein:vir:15 1 MGLIQ--------RIKDLFWKGAA-ATGVTGSLSKIT---------DDPRISI--DPDEYVRIQTDLDYYSDKLQYIHYQ 60 (508) T ss_pred CChHH--------HHHHHHHHHHH-HhccccchHHhh---------ccccccc--CHHHHHHHHHHHHHhcCCCcccccc Confidence 33322 11111 22221 222222110000 0111111 333344444333333221 Q ss_pred -------------cchHHHHHHHHhhcccccceEeeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcc Q lcl|NC_015263. 81 -------------SQQYQRLLNFYANMPLYAYSVVPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDI 147 (513) Q Consensus 81 -------------sg~~~rlidy~~~mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~ 147 (513) -+.=+.+.+-.+++.+-+-.-.-.+..+ ...+.+.+ .|..-+....+...+..|+..|. T Consensus 61 ~~~~~~~~~~~~sln~~~~i~~~~A~lv~~e~~~i~v~~~~-----~~~e~l~~---il~~n~f~~~~~~~~e~a~a~G~ 132 (508) T protein:vir:15 61 ASDGIKKKRLKNTINMAKTAARRIASVVFNEKAEIHVKDNN-----EADKFLND---VLEDNDFKNKFEEALEKGVALGG 132 (508) T ss_pred cCCCCccccceeecchHHHHHHHHHhhhhCCCceEEeCCch-----HHHHHHHH---HHHhccHHHHHHHHHHHHhhcCc Confidence 1222444444444443332110011110 11111222 34444456667778888888777 Q ss_pred eeEEEEEcCcceeeeecCcceeEEEEE-ECCeeEEEEEeee--ccC-c----chhccc------cHHHH-HHHHHHhh-- Q lcl|NC_015263. 148 FYGYVIDDKESVMIQQFPNDICKISSV-SGGVYNYVIDLDA--LVS-A----DIVDYY------PKEIQ-EAVNKYTT-- 210 (513) Q Consensus 148 ~~gy~i~d~~~~~iq~lp~dyckIsg~-~nG~y~~~fD~sy--Fd~-~----~~L~~~------p~Ei~-~~y~~Y~~-- 210 (513) .+.-..-|.+++-+-..+++-.-++.. .+++-.++|=..+ -+. + ..|+.+ +--|+ +.|.+... T Consensus 133 ~~~k~~~d~~~~~i~~v~ad~~~P~~~d~~~~~~~af~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~ 212 (508) T protein:vir:15 133 FAMRPYIDGNHIKIAWVRADQFYPLQSNTNDISEAAIASRTQRTESNQTKYYTLLEFHQWQDNGSYQITNELYKSDSPDI 212 (508) T ss_pred eEEEEEEeCCeeEEEEEcCCeeEEEEEcCCCeEEEEEEEEEEeecCCCceEEEEEEEEEEecCcceEEEEEEEecCCchh Confidence 766555455565555555654444333 4445445542211 100 0 112211 00111 01111100 Q ss_pred -hhhccCcccccCeeecCCce--------EE-EE------ecCccccchhhHHHHHHhHHHHHHHHHHHhhHhhhhh--c Q lcl|NC_015263. 211 -MKKGNNKSASNWYEIQDKNS--------IC-IK------INESSLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQN--Y 272 (513) Q Consensus 211 -~k~~~~~~~~~W~~L~~~kt--------~~-ik------~~~~~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n--~ 272 (513) .....-....-|-.|.++.+ |+ |+ ++..+++|+|-|..+..-+-+++.. +-....+++. . T Consensus 213 lG~~v~l~~~~e~~~l~~~~~~~g~~~p~f~y~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~--~s~~~~e~~~~~~ 290 (508) T protein:vir:15 213 VGNQVPLSTLPVYKELAPQVTISGLQRPLFAYFKTPGANNINIESPLGLGVVDNAKHVLDDINDT--HDQFIWEIRLGQK 290 (508) T ss_pred cCcccchhhcccccCCCcceEecCCCcceeEEecCCccccccCCCCcCCchHhhhHHHHHHHHHH--HHHHHHHHHhccc Confidence 00000000111433333211 11 11 2445666777666665333222211 1122222211 1 Q ss_pred eeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhccccce-EEEecccccccccccccccchhhhhhHHhhhhhhhhhh Q lcl|NC_015263. 273 KLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTVPDNVG-VVTSPMEIDTVSFDKDSSTDDSVEKATKNFWDNAGVSQ 351 (513) Q Consensus 273 ~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~Lp~gv~-~v~sP~~~d~i~ld~~~~~~dtv~~~~~~i~~~~GiS~ 351 (513) +|.+..=-+ ..+.+|.+..+.+. ..|.. ++.--++|.+ .+.+| + | +...-...++...+.+....|+|. T Consensus 291 ~i~v~~~~l-~~d~~~~~~~~~~~-~~~~~-~~~~~~~~~~i~~~~~-~---i---r~e~~~~~~~~~l~~~~~~~gls~ 360 (508) T protein:vir:15 291 HIAVQPGML-RFDDEHKPTFDTEQ-NVYVG-VLSDDNNGLGVKDMTT-P---I---RTVQYKDAIDHFIKEFEVQIGLST 360 (508) T ss_pred ceeechHHh-cCCCCCccccCCCC-eeEEe-ccCCCCCCCceeEeec-c---c---ChHHHHHHHHHHHHHHHHHhCCCc Confidence 233321111 12555555555331 11111 1111111111 12222 1 1 111223557777788888999999 Q ss_pred hhccCCCcchHHHHHHHHHHHHHHH--------HHHHHHHHHHHHHHh---hc---cc------------ceEEEEEecC Q lcl|NC_015263. 352 ILFSSDNKTSQGIAMSIATDEQFIF--------GVINQLERWLNRYLL---LN---GM------------SKYFKATMLE 405 (513) Q Consensus 352 ~Lfn~d~~s~~~~~~SI~~d~~~~~--------~~~~~iE~~~N~~i~---~~---~~------------~~~f~~~~l~ 405 (513) .-|+-+..+..++. .|+...+-.+ .+-..|++.+..++. .. .. .....|.|-+ T Consensus 361 ~~f~~~~~~~~TAt-ei~s~~~~~~~t~~~~~~~~~~al~~lv~~il~l~~~~~~~~~g~~~~~~~~~~~~~~v~v~f~D 439 (508) T protein:vir:15 361 GTFSYSNDGVKTAT-EVVSNNSMTYQTRSSYLTMVEKAIDELCQSIFELANAGALFDDGKPLFTLDSASQPLDIECHFDD 439 (508) T ss_pred hhcccccCccccHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccCCcceEEEeCC Confidence 99876554433222 2222222222 233333332222221 10 00 1235678888 Q ss_pred CCCccHHHHHHHHHHHHhcC-CcHHHHHHHHhCCCHHHHHHHH---HHHHHhhCcccccCcccccccccccccccCCccc Q lcl|NC_015263. 406 VTHFSKKEAHDRYITDAQYG-FPVKVYLASLMGIDPVAFTGLL---KVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKE 481 (513) Q Consensus 406 ~T~fn~ke~~~~~~~~~~~G-~~~~~~laa~~G~~p~~~~~~~---~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~ 481 (513) -=.-+++..++++.++.+.| +|...++....|++..+.-.++ ..|+...+ .+-..+--..| T Consensus 440 ~i~~d~~~~~~~~~~~v~aGi~s~e~~i~~~~g~~deea~~el~ri~~E~~~~~-----~~~~~~~~~~g---------- 504 (508) T protein:vir:15 440 GVFVNKDKQLEEDAKVLAIGALSKQTFLQRNYGMTDEQAAEELAKIQSEAPTDT-----FEGGRSAILNG---------- 504 (508) T ss_pred CCCCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCChHHHHHHHHHHHHhccccC-----ccccccccCCC---------- Confidence 77889999999999999999 5766666666699987754333 33432111 00000000001 Q ss_pred cCCCCc Q lcl|NC_015263. 482 KGKENG 487 (513) Q Consensus 482 ~~~~~g 487 (513) ++|. T Consensus 505 --~~ge 508 (508) T protein:vir:15 505 --GDGE 508 (508) T ss_pred --CCCC Confidence 0111 No 173 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=67.39 E-value=0.25 Score=23.86 Aligned_cols=407 Identities=9% Similarity=0.035 Sum_probs=160.5 Q ss_pred CCCccchheeeeehhhhhhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhh Q lcl|NC_015263. 1 MVKNKKKRLSMIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQ 80 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~ 80 (513) |---+-|.+-+..-|.+ ..+.+.+.|..| ......++.+.+|+... T Consensus 1 ~~~~~~~~~~~~~~~~~--------------------------------~~~~i~~~i~~~--~~~~~r~~~~~~yy~g~ 46 (453) T protein:vir:73 1 MNLKPIKLMTYSRDEEI--------------------------------TDKVVNDFMKKH--QEEVERYEYLGNMYKGI 46 (453) T ss_pred CccccceeeeccccccC--------------------------------CHHHHHHHHHHH--HHHHHHHHHHHHHhccc Confidence 11111112222212222 122233333332 23333444444444432 Q ss_pred -------------------cchHHHHHHHHhhcccccceEeeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHH Q lcl|NC_015263. 81 -------------------SQQYQRLLNFYANMPLYAYSVVPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKL 141 (513) Q Consensus 81 -------------------sg~~~rlidy~~~mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~ 141 (513) .+..+.+++..++...=+ |+.-. ..+....+ .+..+++.-++...+..+.+. T Consensus 47 ~~i~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~----~~~~~--~~d~~~~~---~l~~~~~~n~~~~~~~~~~~~ 117 (453) T protein:vir:73 47 MEISSQKAKDSWKPDNRLTNNFAKYIVDTFVGYFNGI----PIKKT--HDDKSVLE---AMQLFDNLNDMEDEESELAKI 117 (453) T ss_pred cchhcCCCCCccCccceeecchHHHHHHHhhhhhccc----Cceee--cCChHHHH---HHHHHHHhcChhHHHHHHHHH Confidence 233444444333221100 10000 01111122 233445666789999999999 Q ss_pred HHHhcceeEEEEEcCcc-eeeeecCcceeEEEEEEC--CeeEEEEEeeeccCcc--hhccc-cHHHHHHHHHHhhhhhcc Q lcl|NC_015263. 142 AMTVDIFYGYVIDDKES-VMIQQFPNDICKISSVSG--GVYNYVIDLDALVSAD--IVDYY-PKEIQEAVNKYTTMKKGN 215 (513) Q Consensus 142 ~l~~g~~~gy~i~d~~~-~~iq~lp~dyckIsg~~n--G~y~~~fD~syFd~~~--~L~~~-p~Ei~~~y~~Y~~~k~~~ 215 (513) +++.|..|.+...+.++ .-+..++|+-|-++.... ..+.+++-..+ +... ...-| +.++.+ |.. T Consensus 118 ~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~v~dd~~~~~~~~~i~~~~-~~~~~~~~~vyt~~~i~~----~~~----- 187 (453) T protein:vir:73 118 ACVYGRAYELMYQNESTESEVIYCSPLNVFMVYDDSIKQKPLFAVYYGF-DEEGNLSGTVYTLLETIS----ITG----- 187 (453) T ss_pred HHhcCeEEEEEEeCCCCceEEEEEcccceEEEEeCCCCceeEEEEEEEE-ecCceEEEEEEeCCeEEE----EEe----- Confidence 99999999888776665 457778998887776532 33444443222 2211 11112 111110 100 Q ss_pred CcccccCeeecC-Cc----eEEEEecCccccchhhHHHHHHhHHHHHHHHHHH--hhHhhhhhceeeeeeeccccCCCCC Q lcl|NC_015263. 216 NKSASNWYEIQD-KN----SICIKINESSLTPVPPFAGTFDSIYDIHSFKDLR--NDKAELQNYKLLIQKLETRSSNDNN 288 (513) Q Consensus 216 ~~~~~~W~~L~~-~k----t~~ik~~~~~~~~ip~f~~v~~d~~di~~~kdL~--~~~~~i~n~~ii~~kip~~~~n~~~ 288 (513) .+..|..... +| -.++.+ -..+.+.|-|. ++.++.+.=+.. .....++...--.-.+ . +....+ T Consensus 188 --~~~~~~~~~~~~~~~g~vPvv~~-~n~~~g~s~~~----~v~~liDa~~~~~S~~~~~~~~~~~~~l~~-~-g~~~~~ 258 (453) T protein:vir:73 188 --KAGEVKFGESTYNVYSDLPIVEY-NFNEERQSIFE----PVHSLINSYNKVTSEKANDVEYFSDQYLVF-L-GAEVDE 258 (453) T ss_pred --cCCceEEccceeccCCceeEEEe-cCCCCCCcchh----hHHHHHHHHHHHHHHHHHHHHHhccceeee-e-cCCCCc Confidence 0011211111 11 111222 11223433333 333333221111 1111222211000001 1 111111 Q ss_pred ccccCHHHHHHHHHHHHHhccccceEEEecccccccccccc-cccchhhhhhHHhhhhhhhhhhhhccC-CCcchHHHHH Q lcl|NC_015263. 289 DFTLDMPMMNYFHEALSMTVPDNVGVVTSPMEIDTVSFDKD-SSTDDSVEKATKNFWDNAGVSQILFSS-DNKTSQGIAM 366 (513) Q Consensus 289 ~~~vd~~~~~~~~~~ik~~Lp~gv~~v~sP~~~d~i~ld~~-~~~~dtv~~~~~~i~~~~GiS~~Lfn~-d~~s~~~~~~ 366 (513) +..-++....... +....+.+.+.-..+.++.-+.-+.. ......++-..++|+..+++-.+-+.+ .+.|+.+++. T Consensus 259 ~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Al~~ 336 (453) T protein:vir:73 259 EDAKNIKDNRLIN--FFDKNSNGQGTNAAKVDVKFLDKPDSDVQTENLLNRLERSIFQFTMAANISDENFGNSSGVALAY 336 (453) T ss_pred hhhhccccccccc--ccccccccccccccCceeEEeeecCCHHHHHHHHHHHHHHHHHHhCCcccCcccccCccHHHHHH Confidence 1111110000000 00001111111111111111111111 111123455556787777664444433 2234444544 Q ss_pred HHHHHHHHHHH----HHHHHHHHHHH---HHhhcccce---EEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHh Q lcl|NC_015263. 367 SIATDEQFIFG----VINQLERWLNR---YLLLNGMSK---YFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLM 436 (513) Q Consensus 367 SI~~d~~~~~~----~~~~iE~~~N~---~i~~~~~~~---~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~ 436 (513) ....-.+.+-. |-..+++.++. ++....... ...+.|-+..+-|..+.++.+.++. |.-+...+.+.+ T Consensus 337 ~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~i~v~f~~~~p~~~~~~a~~~~k~~--giis~et~~~~~ 414 (453) T protein:vir:73 337 KLQAMSNLALSFQRKFQSALNRRYSLWSSLSTNASNKDAWKDIEYTFTRNEPKDIKEQAETANILK--GITSEETALSVI 414 (453) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCccccccceEEeCCCCCCCHHHHHHHHHHHh--ccCcHHHHHHhC Confidence 43322222222 22333333332 222222222 4688899999999999999999986 644444455567 Q ss_pred CC--CHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccCCCCcCC Q lcl|NC_015263. 437 GI--DPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRP 489 (513) Q Consensus 437 G~--~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grP 489 (513) |+ +|.+.+.+++.|++. .+..-+.+ ++..+.+.. |.- T Consensus 415 ~~~~d~~~E~~ri~~E~~~-----~~~~~~~~---~~~~~~~~~--------~~~ 453 (453) T protein:vir:73 415 SVIPDVQAEMEKIKKKKLL-----QLSLTRTS---NLVRMKQMR--------GNL 453 (453) T ss_pred CCCCCHHHHHHHHHHHHHH-----HHHHHHhc---cCCcchhhh--------cCC Confidence 88 688999999998754 12222221 111110111 111 No 174 >protein:vir:103971 Length: 376 # NCBI annotation: pbsx family phage portal protein # Family: family:all:196 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293752;genbank:gi:72537722;genbank:GeneID:3608098 Probab=61.66 E-value=0.35 Score=23.09 Aligned_cols=319 Identities=13% Similarity=0.068 Sum_probs=143.8 Q ss_pred CCCccchheeeeehhhhhhHHHHHHHHHHHHHhhccCcccccccccc---------cchHHHHHHHhh-hcc--ChhHHH Q lcl|NC_015263. 1 MVKNKKKRLSMIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSL---------TSSQSKVRKIVK-EYR--NEGNQK 68 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~---------~~s~d~~k~~i~-~~~--P~~n~~ 68 (513) |-|.|+.|...-..- .......+ .|. ....|+- ....|-++-+-. +|+ |..-.. T Consensus 26 ~~~~~~~~~~~~~~~---------~~~~~~~~----~~~-~~~~f~fg~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~ 91 (376) T protein:vir:10 26 MSKRRSRAPRTFAAA---------PNPSAGSA----APA-RAEVFTFDDPTPVMNRAEILDYVECWSNGEWFEPPVSFAG 91 (376) T ss_pred chhccCCCcccchhh---------hhHhhhcc----Ccc-eeEEEEcCCceeccCcchhhhhhhhhhcCceecCCCCHHH Confidence 666655543221110 00000000 110 1122331 111112122211 233 555333 Q ss_pred HHHHHHHHHHhhcchHHHHHHHHhhcccccceEeeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcce Q lcl|NC_015263. 69 TLRKVSEDLAVQSQQYQRLLNFYANMPLYAYSVVPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDIF 148 (513) Q Consensus 69 ~ir~~s~~lY~~sg~~~rlidy~~~mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~~ 148 (513) |++ ++..+++-.+.|.+..+|-..+|.=-|+ |. ..+|..++.+.+..|.. T Consensus 92 ----La~-~~~~~~~h~s~l~~k~n~l~~~~~Pnp~---------------------lT----~~~f~~~v~d~ll~Gna 141 (376) T protein:vir:10 92 ----LAK-SFRASTHHSSALFFKANVLASTFRPHRW---------------------LS----RHAFERWALDFLTFGNG 141 (376) T ss_pred ----HHH-HHhhhHHhhhhHHHHhHHHHhccCCCCC---------------------CC----HHHHHHHHHHHHhcCCe Confidence 222 2444445555555544443333221111 11 34456677788889999 Q ss_pred eEEEEEcC--cceeeeecCcceeEEEEEECCeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeec Q lcl|NC_015263. 149 YGYVIDDK--ESVMIQQFPNDICKISSVSGGVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQ 226 (513) Q Consensus 149 ~gy~i~d~--~~~~iq~lp~dyckIsg~~nG~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~ 226 (513) |.+.+-+. ..+-+.++|+.||++.-- .+.|.+..+ .. .=++++ T Consensus 142 y~~~~rn~~G~~~~L~pl~~~~vr~~~d-~~~~~~~~~-----~~-----------------------------~~~~~~ 186 (376) T protein:vir:10 142 YLERRRNMVGGTLRLEPALAKYVRRKAD-FNGFVYVNG-----WQ-----------------------------ERHEFE 186 (376) T ss_pred EEEEEECCCCCEEEEEEeCCcceEEEee-CCeEEEEEc-----CC-----------------------------eEEEEc Confidence 99998764 457899999999998632 233333211 00 001233 Q ss_pred CCceEEEEe-c-CccccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHH Q lcl|NC_015263. 227 DKNSICIKI-N-ESSLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEAL 304 (513) Q Consensus 227 ~~kt~~ik~-~-~~~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~i 304 (513) .+.-+-|+- + ....+|+|+..+++..+.--+...+...-. ..|- ..-+-|-+. .+..++.++++.+.+.+ T Consensus 187 ~~eViHir~~~~~~~~yGls~~~~a~~si~l~~aa~~f~~~~--f~NG-a~pggIl~~-----~d~~l~~e~~~~lr~~~ 258 (376) T protein:vir:10 187 PDSVFQLVRPDINQEVYGLPEYLSSLHSAWLNESSTLFRRKY--YENG-SHAGFILYM-----TDAAQKQDDVDNMRDAL 258 (376) T ss_pred cccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHH--Hhcc-CCCceEEEe-----cCCCCCHHHHHHHHHHH Confidence 322333442 2 245689999999888876555555433211 2221 111111110 02347778887777777 Q ss_pred HHhccccce-----EEEeccc-cccc---ccccccccchh---hhhhHHhhhhhhhhhhhhccCC---CcchHHH-HHHH Q lcl|NC_015263. 305 SMTVPDNVG-----VVTSPME-IDTV---SFDKDSSTDDS---VEKATKNFWDNAGVSQILFSSD---NKTSQGI-AMSI 368 (513) Q Consensus 305 k~~Lp~gv~-----~v~sP~~-~d~i---~ld~~~~~~dt---v~~~~~~i~~~~GiS~~Lfn~d---~~s~~~~-~~SI 368 (513) ++. .|++ .|.+|-- =+.+ ++..+..+.+. .+-..++|..+.||...|.|-- +.+.+.+ +... T Consensus 259 ~~~--~G~~N~~~~~vl~~~g~~~Gi~~~pls~~~~d~qf~e~k~~~~~eIa~af~VPp~llGi~~~~t~~~sn~eq~~~ 336 (376) T protein:vir:10 259 KNA--KGPGNFRNVFMYAPGGKKDGIQLIPVSEVAAKDEFFNIKNVTRDDLLAAHRVPPQLLGIVPSNSGGFGTPDTAAR 336 (376) T ss_pred HHh--cCccccCceeEecCCCCccceEEEEccCCHHHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHHHHH Confidence 663 3443 3444310 0122 23222333333 3344478999999998888531 1122222 2222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcccceEEEEEecCCCCccHHHHHHHHHHH Q lcl|NC_015263. 369 ATDEQFIFGVINQLERWLNRYLLLNGMSKYFKATMLEVTHFSKKEAHDRYITD 421 (513) Q Consensus 369 ~~d~~~~~~~~~~iE~~~N~~i~~~~~~~~f~~~~l~~T~fn~ke~~~~~~~~ 421 (513) .-...-+.-++++||. +|..|-.. .++ |+..+....=.++ T Consensus 337 ~f~~~~L~Pl~~~iee-ln~~L~~~----~~~--------F~~~~Llr~d~ka 376 (376) T protein:vir:10 337 VFGRNEIRPLQARFAE-LNDWLGEE----VVR--------FDDYEIPPAPVAA 376 (376) T ss_pred HHHHHHHHHHHHHHHH-HHhhcccc----ccc--------cChhHhhcccccC Confidence 2222223346677764 55544221 122 3333332221111 No 175 >protein:vir:8317 Length: 409 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817885;genbank:gi:29566318;genbank:GeneID:1259513 Probab=60.51 E-value=0.37 Score=22.94 Aligned_cols=361 Identities=9% Similarity=0.001 Sum_probs=162.1 Q ss_pred HHHHHhhccCc------cc------------ccccccccc-hHH-HHHHHh--hhcc--ChhH-HHHHHHHHHHHHhhcc Q lcl|NC_015263. 28 ISILRDDNRTP------VF------------GAPVGSLTS-SQS-KVRKIV--KEYR--NEGN-QKTLRKVSEDLAVQSQ 82 (513) Q Consensus 28 ~~i~~~~~~~~------~~------------~s~~~s~~~-s~d-~~k~~i--~~~~--P~~n-~~~ir~~s~~lY~~sg 82 (513) -+.|..+-.-| .. ..+ |+.-+ ..+ ....++ ..+- |... -..-..++.-.+...+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~t~~~~~~~~ 79 (409) T protein:vir:83 1 MGFWSNLFGIPSIPDLPNDNGPVDYNPGDPDMVE-FRGPEEEPEARALPWIRPTAWSGYPESWATPSWGSAQDKLRTLID 79 (409) T ss_pred CchhhhhcccccCCCcccccccccccCCCCceee-ccCCCcchhhhhcccccccccccccccccccCccccchhhHhhhH Confidence 22222222110 00 000 11111 000 000000 0110 1100 0112223334445566 Q ss_pred hHHHHHHHHhh-cccccceEeeccchhhhhhcchhHHHHHHHHHHh-h----cChhHHHHHHHHHHHHhcceeEEEEE-c Q lcl|NC_015263. 83 QYQRLLNFYAN-MPLYAYSVVPFKDISTANENKLKKELATVTEFLS-R----LNPKYNFSKIVKLAMTVDIFYGYVID-D 155 (513) Q Consensus 83 ~~~rlidy~~~-mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~-k----~n~k~~~~~i~~~~l~~g~~~gy~i~-d 155 (513) .+.+.|+.++. +..+...++ ...... .... ..|. . +.-..+...++..++. |..|.+.+. + T Consensus 80 ~v~acV~~Ia~~iA~lpl~~~----~~~~~~----~~~~---~ll~~~PN~~~t~~~f~~~l~~~lll-Gnay~~~i~r~ 147 (409) T protein:vir:83 80 VAWACIDLNASVLSSMPIYRM----RNGRII----DSVA---WMSNPDPEVYTSWQEFAKQLFWDFQL-GEAFVLPMAHG 147 (409) T ss_pred HHHHHHHHHHHhhccCceEEe----eCCccc----cchh---hhcccCCCCCCCHHHHHHHHHHHHhh-CCcEEEEEEEC Confidence 77777776653 122212222 111111 1111 1121 1 2233445555555555 777777653 4 Q ss_pred --CcceeeeecCcceeEEEEEECCeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEE Q lcl|NC_015263. 156 --KESVMIQQFPNDICKISSVSGGVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICI 233 (513) Q Consensus 156 --~~~~~iq~lp~dyckIsg~~nG~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~i 233 (513) +...-+.+||++.|.|.--.+|.+.|.++-.+. + + --+-| T Consensus 148 ~~G~~~~L~pl~p~~v~v~~~~~g~~~y~~~~~~~---------~----------------------------~-eiiHi 189 (409) T protein:vir:83 148 SDGYPIRFRVVPPWLVNVELKKGARREYRIGGLNV---------T----------------------------D-EILHI 189 (409) T ss_pred CCCcEEEEEEECCcceEEEEcCCceEEEEEccccC---------c----------------------------c-ceEEe Confidence 445689999999999887788888776643211 0 1 12334 Q ss_pred Eec--CccccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhcccc Q lcl|NC_015263. 234 KIN--ESSLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTVPDN 311 (513) Q Consensus 234 k~~--~~~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~Lp~g 311 (513) +.. .+...|++|+..+-..+--....++. ...-..|-...-.-|.+ ++ .++.++++.+.+..+++.... T Consensus 190 r~~~~~~~~~G~spi~~~~~~i~~~~a~~~~--~~~~f~nga~p~gil~~-----~~--~ls~e~~~~~~~~~~~~~~~n 260 (409) T protein:vir:83 190 RYQGNTADAHGHGPLESAAPRQVVIGLLQKY--VQNLAETGGVPLYWLGV-----ER--RLSETEAVDLMDRWIESRSKY 260 (409) T ss_pred CCCCCCCCcccccHHHHHHHHHHHHHHHHHH--HHHHHhcCCCcceEeec-----CC--CCCHHHHHHHHHHHHHhhCCc Confidence 431 23456888876665443322222222 12223332222222222 11 477777777777776655222 Q ss_pred ce--EEE-eccccc-ccccccccccchh---hhhhHHhhhhhhhhhhhhccC--CCc--chH-HHHHHHHHHHHHHHHHH Q lcl|NC_015263. 312 VG--VVT-SPMEID-TVSFDKDSSTDDS---VEKATKNFWDNAGVSQILFSS--DNK--TSQ-GIAMSIATDEQFIFGVI 379 (513) Q Consensus 312 v~--~v~-sP~~~d-~i~ld~~~~~~dt---v~~~~~~i~~~~GiS~~Lfn~--d~~--s~~-~~~~SI~~d~~~~~~~~ 379 (513) .+ .++ ..++.. .+.+. ..+-+. .+-..++|-.+.||...++|. +.+ +++ .-...+.--..-+.-++ T Consensus 261 ag~~~il~~g~~~~~~~~~s--~~d~q~le~r~~~~~eIa~~fgVPp~llg~~~~~~~~tysn~eq~~~~f~~~tL~P~~ 338 (409) T protein:vir:83 261 AGHPALVTGGATLNQAKSMS--AQDLSLMELTQFNEARIAILLGVPPFLVGLPGATGSLTYSNIEQLFSFHDRSSLRPKA 338 (409) T ss_pred cCccceecCCcccccccCCC--HHHHHHHHHHHhhHHHHHHHhCCCHHHccCCCCccccccccHHHHHHHHHHHHHHHHH Confidence 22 222 222221 12222 222222 334457799999999999863 222 212 22333333333344689 Q ss_pred HHHHHHHHHHHhhcccceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccc Q lcl|NC_015263. 380 NQLERWLNRYLLLNGMSKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFTGLLKVENEMLDLPEI 459 (513) Q Consensus 380 ~~iE~~~N~~i~~~~~~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~ 459 (513) ++||..+|+.|-.. +..|+|.+-+...-+.++..+.+.++.+-| =+++.|+-.+ | + T Consensus 339 ~~ie~~l~~~Ll~~--~~~~~f~~~~llr~d~~~r~~~~~~~~~~G-----------~lT~NE~R~~---~----g---- 394 (409) T protein:vir:83 339 TAVMAALDRWALPS--PQHLELNRDDYTRPSLVERATAYKIMIEAG-----------VMEPNEARAM---E----R---- 394 (409) T ss_pred HHHHHHHHHhhCCC--CcEEEeehhhhhccCHHHHHHHHHHHHhCC-----------CcCHHHHHHH---h----C---- Confidence 99999999987543 334666555554556667777666665444 1355554322 2 2 Q ss_pred cCcccccccccccccccCCccccCCCCcC Q lcl|NC_015263. 460 MTPLSSSFNTSGSDIAENAIKEKGKENGR 488 (513) Q Consensus 460 ~~Pl~TS~T~Sg~~~~~~~~~~~~~~~gr 488 (513) |+|.. |++ ....||. T Consensus 395 lpp~~------ggd--------~l~~~gv 409 (409) T protein:vir:83 395 LHSEA------AAV--------RLSGGGV 409 (409) T ss_pred CCCCC------CCc--------ccCCCCC Confidence 34533 332 2234444 No 176 >protein:vir:79150 Length: 368 # NCBI annotation: bacteriophage gpQ # Family: family:all:196 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165254;genbank:gi:145708079;genbank:GeneID:5247161 Probab=58.51 E-value=0.41 Score=22.70 Aligned_cols=337 Identities=11% Similarity=0.072 Sum_probs=143.4 Q ss_pred CCCccchheeeeeh-hhhhhHHHHHHHHHHHHHhhccCcccccccccccc------h---HHHHHHHhhh-cc--ChhHH Q lcl|NC_015263. 1 MVKNKKKRLSMIDV-ESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTS------S---QSKVRKIVKE-YR--NEGNQ 67 (513) Q Consensus 1 ~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~------s---~d~~k~~i~~-~~--P~~n~ 67 (513) |-|.||+|+----. ..-+.-+...+.++ .++......|+-+. . .|-..-+-.. |. |.. T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~fg~p~~~~~~~~~~~~~~~~~~~~~~~~pi~-- 71 (368) T protein:vir:79 1 MSRNKTRRAARAASAHVRTANTDAPTEHH-------TDRAAQAEVFSFGDPVEVLDRRELLDYVECMRMGQWYEPPMP-- 71 (368) T ss_pred CCccccccchhccCcccccccccCcchhh-------ccccCceEEEEcCCceeecchhhHHHHHHHHhccchhccCcC-- Confidence 88888776521000 00000000000000 00000011122110 0 0111111111 22 332 Q ss_pred HHHHHHHHHHHhhcchHHHHHHHHhhcccccceEeeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcc Q lcl|NC_015263. 68 KTLRKVSEDLAVQSQQYQRLLNFYANMPLYAYSVVPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDI 147 (513) Q Consensus 68 ~~ir~~s~~lY~~sg~~~rlidy~~~mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~ 147 (513) ++.|++ ++..+.+-.+.+..-..|..+.+.--| +|. .++|..+..+++..|. T Consensus 72 --~~~la~-~~~~~~~h~~~~~~~~n~l~l~~~Pn~---------------------~~t----~~~f~~l~~d~ll~Gn 123 (368) T protein:vir:79 72 --WDGLAR-SFRAAAHHSSAVYVKRNILVSTFIPHP---------------------LLS----RATFERLVLDWQVFGN 123 (368) T ss_pred --HHHHHH-HHhhccccchhhhhhcchhhhhcCCCc---------------------CCC----HHHHHHHHHHHhhcCC Confidence 222332 233344333333333333322221100 011 3445677888999999 Q ss_pred eeEEEEEcC--cceeeeecCcceeEEEEEECCeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeee Q lcl|NC_015263. 148 FYGYVIDDK--ESVMIQQFPNDICKISSVSGGVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEI 225 (513) Q Consensus 148 ~~gy~i~d~--~~~~iq~lp~dyckIsg~~nG~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L 225 (513) .|.+++.+. ..+.+.++|+.||++.- ..+.|.+... ... =+++ T Consensus 124 ay~~~~r~~~G~~~~L~~l~~~~v~~~~-~~~~~~~~~~-----~~~-----------------------------~~~~ 168 (368) T protein:vir:79 124 AYLERRENVLGGTIRLDTPLAKYVRRGL-DLNTYFFVQN-----WQQ-----------------------------PYTF 168 (368) T ss_pred eEEEEEEcCCCCEEEEEEeCcccceeec-cCCEEEEEec-----CCe-----------------------------EEEE Confidence 999998764 45788899999999763 2233433221 000 0233 Q ss_pred cCCceEEEEe-c-CccccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHH Q lcl|NC_015263. 226 QDKNSICIKI-N-ESSLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEA 303 (513) Q Consensus 226 ~~~kt~~ik~-~-~~~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ 303 (513) +...-+-|+. + .+..+|+||..++...+.--....+.. ..-..|-. ..+-|-+. .+..++.++++.+.+. T Consensus 169 ~~~dIihir~~~~~~~~yGlsp~~~a~~si~l~~aa~~~~--~~~~~NGa-~~~gil~~-----~~~~l~~e~~~~lk~~ 240 (368) T protein:vir:79 169 AAGSVFHLQEPDINQEVYGLPEYLSALNATWLNESATLFR--RRYYKNGS-HAGFILYM-----TDAAQKQEDVDTLREA 240 (368) T ss_pred ccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHH--HHHHhccC-CCceEEEe-----CCCCCCHHHHHHHHHH Confidence 4433333442 3 335579999988887666544444432 12233311 11111110 0234777877777777 Q ss_pred HHHhc-cccc--eEEEecc----cccccccccccccchh---hhhhHHhhhhhhhhhhhhccCCCc---c-hHHHHHHHH Q lcl|NC_015263. 304 LSMTV-PDNV--GVVTSPM----EIDTVSFDKDSSTDDS---VEKATKNFWDNAGVSQILFSSDNK---T-SQGIAMSIA 369 (513) Q Consensus 304 ik~~L-p~gv--~~v~sP~----~~d~i~ld~~~~~~dt---v~~~~~~i~~~~GiS~~Lfn~d~~---s-~~~~~~SI~ 369 (513) +++.- +.+- ..|.+|- .++=+++..+..+.+. .+...++|..+.||...|.|-... + ++.-+.... T Consensus 241 ~~~~~G~~N~g~~~vl~~~g~~~g~~~~pls~~~~d~qf~e~k~~~~~eIa~af~VPp~llGi~~~~t~~~sn~e~~~~~ 320 (368) T protein:vir:79 241 MKSAKGPGNFRNLFMYAPNGKKDGIQLLPVSEVAAKDEFWNIKNVTRDDQLAAHRVPPQLMGIIPNNTGGFGDVEKAAMV 320 (368) T ss_pred HHHhcCCcccCceeEecCCCCccceeEEEcCCCHHHHHHHHHHHHhHHHHHHHhCCCHHHccccCCCCCccccHHHHHHH Confidence 77642 1111 2343331 1122223222333333 344458899999999998863211 1 122222222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhcccceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHH Q lcl|NC_015263. 370 TDEQFIFGVINQLERWLNRYLLLNGMSKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKV 430 (513) Q Consensus 370 ~d~~~~~~~~~~iE~~~N~~i~~~~~~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~ 430 (513) --..-+.-++++||. +|..|.. +.++ |+..++...-.++-+-|..+.- T Consensus 321 f~~~~l~Pl~~~ie~-ln~~l~~----e~~r--------F~~~~l~~~D~~a~a~~~~rsa 368 (368) T protein:vir:79 321 FARNEVKPLQDRLLA-INDWIGD----EVVR--------FAPYALGGHDQPAAAPGGQRSA 368 (368) T ss_pred HHHHHHHHHHHHHHH-HHhccCc----ceee--------echhHhhcccccccCCcccccC Confidence 222222346666663 5544422 2233 3333333332333333322221 No 177 >protein:vir:98853 Length: 219 # NCBI annotation: hypothetical protein # Family: family:all:196 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654729;genbank:gi:109302914;genbank:GeneID:4156058 Probab=58.00 E-value=0.42 Score=22.64 Aligned_cols=199 Identities=13% Similarity=0.137 Sum_probs=94.2 Q ss_pred EEEEECCeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEEEe-c-CccccchhhHHH Q lcl|NC_015263. 171 ISSVSGGVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICIKI-N-ESSLTPVPPFAG 248 (513) Q Consensus 171 Isg~~nG~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~-~-~~~~~~ip~f~~ 248 (513) +-..+||.|.|.+.....+.. ..=.+++.+--+-|+. + .+..+|+||..+ T Consensus 1 ~r~~~dg~~~y~~~~~~~~~~----------------------------g~~~~~~~~eilH~r~~~~~~~~~Glspi~~ 52 (219) T protein:vir:98 1 MRVCKDGNYKYLMKKSLYDTK----------------------------SEIYEYNKNDVIFIKLYDPMQQVYGSPDYVG 52 (219) T ss_pred CceeecCeEEEEEecceecCC----------------------------ceeEEeccccEEEecCCCCCCCcceecHHHH Confidence 333355665555432221100 0012344433455553 2 244569999888 Q ss_pred HHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhccccce-----EEEe------ Q lcl|NC_015263. 249 TFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTVPDNVG-----VVTS------ 317 (513) Q Consensus 249 v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~Lp~gv~-----~v~s------ 317 (513) ....+.-.....+.. ..=..|-...-.-|-+ .+..++.++++.+.+.+++. .|.+ .+.. T Consensus 53 a~~~i~~~~aa~~~~--~~~f~Ng~~p~gil~~------~~~~l~~e~~~~~~~~~~~~--~g~~n~~~~~l~~~gg~~~ 122 (219) T protein:vir:98 53 GITSALLNSDATIFR--RRYYSNGAHMGFILYS------TDPDMTEEMEDEIAERIRDS--KGVGNFRSMFVNIAGGHPD 122 (219) T ss_pred HHHHHHHHHHHHHHH--HHHHhcCCCCceEEEe------CCCCCCHHHHHHHHHHHHHh--cCcccccceeEecCCCCcc Confidence 766655433333321 1112232111111111 02347778888877777664 2332 2322 Q ss_pred cccccccccccccccchh---hhhhHHhhhhhhhhhhhhcc-C--CCcc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015263. 318 PMEIDTVSFDKDSSTDDS---VEKATKNFWDNAGVSQILFS-S--DNKT-SQGIAMSIATDEQFIFGVINQLERWLNRYL 390 (513) Q Consensus 318 P~~~d~i~ld~~~~~~dt---v~~~~~~i~~~~GiS~~Lfn-~--d~~s-~~~~~~SI~~d~~~~~~~~~~iE~~~N~~i 390 (513) .+++..+.+. ..+.+. .+-..++|-...||-..+.| . ++.+ ++.-..++.-...-+.-++++||.-+|+.+ T Consensus 123 G~~~~~~~~~--~~d~qfle~rk~~~~eIa~~fgVPp~~lG~~~~~~~~~sn~eq~~~~f~~~tL~P~~~~ie~~ln~~~ 200 (219) T protein:vir:98 123 GLKVIPIGDT--GQKDEFANIKNISAQDVLTSHRFPPGLSGIIPVNTAGLGDPLKIREAYQADEVLPLQEIIAESINSDY 200 (219) T ss_pred ceeEEEccCC--HHHHHHHHHHHhhHHHHHHHhCCCHHHcccccCCCCCccCHHHHHHHHHHHHHHHHHHHHHHHhhhhh Confidence 2333333332 223333 34445789999999988875 2 2222 222333333333333458888999899865 Q ss_pred hhcccceEEEEEecCCCCcc Q lcl|NC_015263. 391 LLNGMSKYFKATMLEVTHFS 410 (513) Q Consensus 391 ~~~~~~~~f~~~~l~~T~fn 410 (513) .... .-.|+|.=...+-.| T Consensus 201 ~~~~-~~~~~F~~~~~~d~~ 219 (219) T protein:vir:98 201 EIKS-ALKVNFKQPEKRDKN 219 (219) T ss_pred cCCC-ccEEeecCcccccCC Confidence 4321 223555444444333 No 178 >protein:vir:107880 Length: 491 # NCBI annotation: gp29 # Family: family:all:313 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024702;genbank:gi:48696939;genbank:GeneID:2845968 Probab=57.96 E-value=0.42 Score=22.63 Aligned_cols=409 Identities=13% Similarity=0.072 Sum_probs=157.0 Q ss_pred CCCccchheeeeehhhhhhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhh Q lcl|NC_015263. 1 MVKNKKKRLSMIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQ 80 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~ 80 (513) |.. ++=-.+-+-|. +.++++-+...+.-.-. ...-++.+..-......|.. =.+.....++ +. . T Consensus 1 m~~----~i~~~~g~p~~----~~~~~~~~~~~ia~~~~-~~~~~~~~~~~~~~~~iLr~--~~~~~~~y~~----m~-~ 64 (491) T protein:vir:10 1 MSK----GLWVSPTEFVT----FGEPDKSLSSQIATRAR-SIDFFALGMYLPNPDPVLKA--LGKDIRVYRE----LR-A 64 (491) T ss_pred CCC----ceeCCCCCccC----cccCChHHHHHHHhhhc-ccccccccCCccchHHHHHh--cCCCHHHHHH----Hh-h Confidence 332 11111111122 22222222222221100 01111111111111111211 1222333333 34 5 Q ss_pred cchHHHHHHHHhh-cccccceEeeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcceeEEEEEcCcce Q lcl|NC_015263. 81 SQQYQRLLNFYAN-MPLYAYSVVPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDIFYGYVIDDKESV 159 (513) Q Consensus 81 sg~~~rlidy~~~-mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~~~gy~i~d~~~~ 159 (513) .+|++..++-... +-..++.|.|-+.... .-..+...|..++ |..++..++ +..+|||-.- T Consensus 65 D~~i~s~l~~Rk~av~~~~w~i~~~~~~~~--------~~e~v~e~l~~~~----~~~~l~~~l-da~~~G~s~~----- 126 (491) T protein:vir:10 65 DAHVGGCVRRRKAAVKALEWGLDRGKAKSR--------VAKSIADVFADLD----LSRIVTEML-DAVLYGYQPM----- 126 (491) T ss_pred ChHHHHHHHHHHHHHhCCCcEEecCCCCHH--------HHHHHHHHHhcCC----HHHHHHHHH-HhhhhcceeE----- Confidence 8888888876654 4468999998554321 1223555666654 566666665 5777887651 Q ss_pred eeeecCcceeEEEE-EECCeeEEE-EEe---eec--cCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEE Q lcl|NC_015263. 160 MIQQFPNDICKISS-VSGGVYNYV-IDL---DAL--VSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSIC 232 (513) Q Consensus 160 ~iq~lp~dyckIsg-~~nG~y~~~-fD~---syF--d~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~ 232 (513) .|.- ..+|.++.. +.. ++| +....+. | + . .........|++.+-++ T Consensus 127 ----------Ei~w~~~~g~~~~~~l~~r~~~~f~~d~~~~l~------------~---~-~-~~~~~~g~~l~~~k~i~ 179 (491) T protein:vir:10 127 ----------EITWGKVGNYIVPIDVVGKPADWFVYDPENQLR------------F---R-S-KDHWMQGEELPARKFLV 179 (491) T ss_pred ----------EEEEeecCCeeEEEEeeeecccceeeccCCceE------------E---e-c-CCCCCCcceecCCCEEE Confidence 1110 112222210 110 122 1111111 1 0 0 01123456788888888 Q ss_pred EEecC--ccccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhc--eeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhc Q lcl|NC_015263. 233 IKINE--SSLTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNY--KLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTV 308 (513) Q Consensus 233 ik~~~--~~~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~--~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~L 308 (513) +.-.. ..+++.+.+..++--.+..... ++.--.=++.| -+.+.|.|-. .+.++.+...+.+++.. T Consensus 180 ~~~~~~~~~p~g~gLl~~~~w~~~fK~~~--~~~w~~f~E~yG~P~~igky~~~---------a~~~ek~~l~~al~~~~ 248 (491) T protein:vir:10 180 PRQEATYLNPYGFPDLSMCFWPTTFKKGG--LKFWVQFTEKYGSPMLVGKHPRS---------ASDGEKNLLLDCLEDMV 248 (491) T ss_pred EEecCCCCCcccchhHHHHHHHHHHHHHH--HHHHHHHHHHcCCCeEEEecCCC---------CCHHHHHHHHHHHHHHh Confidence 77533 3445555555544444433332 22222333333 3567777631 12334555666666666 Q ss_pred cccceEEEecccccccccccccccchhhhhhHHhhhhhhhhhhhhccCC----CcchHHHHHHHH--HHHHHHHHHHHHH Q lcl|NC_015263. 309 PDNVGVVTSPMEIDTVSFDKDSSTDDSVEKATKNFWDNAGVSQILFSSD----NKTSQGIAMSIA--TDEQFIFGVINQL 382 (513) Q Consensus 309 p~gv~~v~sP~~~d~i~ld~~~~~~dtv~~~~~~i~~~~GiS~~Lfn~d----~~s~~~~~~SI~--~d~~~~~~~~~~i 382 (513) -++++++-..++++-+...+.....+....-. =|-+..||.+++|+. .+++. +...+. .-..++-.-..+| T Consensus 249 ~~a~~viP~~~~ie~~ea~~~~g~~~~y~~li--~~~d~~Isk~iLGqtlTt~~~gs~-a~~~vh~~v~~di~~~D~~~i 325 (491) T protein:vir:10 249 QDAVAVVPDDSSIEIKEAAGKTGSADVYERLL--HFCRGEVSIALLGQNQTTEATSTR-ASAQAGLEVTDDIRDGDKAVV 325 (491) T ss_pred cCcEEEecCCceeEEEecCCCCCChhHHHHHH--HHHHHHHHHHHhhhhcccCcccch-hHHHHHHHHHHHHHHHHHHHH Confidence 56666666666655554432222111111110 023344554444332 22221 222222 2222223444556 Q ss_pred HHHHHHHHh----hccc-ceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHHHHHH-HHhCCCHHHHHHHHHHHHHhhCc Q lcl|NC_015263. 383 ERWLNRYLL----LNGM-SKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLA-SLMGIDPVAFTGLLKVENEMLDL 456 (513) Q Consensus 383 E~~~N~~i~----~~~~-~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~la-a~~G~~p~~~~~~~~~E~e~L~l 456 (513) +.-+|+.|. .|.. ...++|.|...--- .++.++.+.++.+.|+.+...++ -.+|+.+-+ . T Consensus 326 ~~tln~li~~l~~~N~~~~~~p~f~~~~~~e~-~~~~a~~~~~L~~~G~~i~~~~i~e~~Gip~~~-------------~ 391 (491) T protein:vir:10 326 SEAMNMLIRWICDLNFDGADRPVFDMWEQEQV-DEIQAGRDQKLTQAGARFTPAYFKRAYNLQDGD-------------L 391 (491) T ss_pred HHHHHHHHHHHHHhcCCCCCcceEEecCcCch-hHHHHHHHHHHHhCCCcCCHHHHHHHhCCCCCC-------------c Confidence 666665432 2222 22466777654422 25678888888887764332222 223332110 0 Q ss_pred ccccCcccccccccccccccCCccccCCCCcCCCCcccccccCCCCCCCCCCccCCC Q lcl|NC_015263. 457 PEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETTGNKDSDETQRAKDKPANTQ 513 (513) Q Consensus 457 ~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~~n~~~~~~~~~~d~~~~~~ 513 (513) .+...|.++-.+.+. ...+.....+++..+... . ...++..+ T Consensus 392 ~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~d~~~---~------~~~~~~~~ 433 (491) T protein:vir:10 392 DERPLPVSAVDTVGA------ASFAEFEAPDQDALDAAL---N------TLSARDLN 433 (491) T ss_pred CccccccCCCCCccc------ccccccCCCCCCchHHHH---H------HHHHHHHH Confidence 111111111001000 000000000000000000 0 00000000 No 179 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=57.05 E-value=0.44 Score=22.52 Aligned_cols=425 Identities=10% Similarity=0.096 Sum_probs=168.3 Q ss_pred eehhhhhhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhcch-------- Q lcl|NC_015263. 12 IDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQQ-------- 83 (513) Q Consensus 12 ~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~-------- 83 (513) .+. -.+|++|+.-.+-.|+..-+ .+.+.+.|..|.. .-...++++.+|+...+.. T Consensus 1 ~~~----~~~~~~~~~~~~~~~~~~l~------------~~~i~~li~~~~~-~~~~r~~~l~~YY~g~~~~i~~~~~~~ 63 (506) T protein:vir:94 1 MDY----DLTEHKQANLIYQESLENLT------------PNKIMKFITHHFN-YQRPRLEMLDDYYQGYNLKILDKQSRR 63 (506) T ss_pred CCc----chhhhhcceeecccchhcCC------------HHHHHHHHHHHHH-HHHHHHHHHHHHhcCCCcccccccccc Confidence 111 13455555443322222111 1233444444311 1122355666665554432 Q ss_pred --------------HHHHHHHHhhcccccceE-eeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcce Q lcl|NC_015263. 84 --------------YQRLLNFYANMPLYAYSV-VPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDIF 148 (513) Q Consensus 84 --------------~~rlidy~~~mpt~dY~I-~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~~ 148 (513) .+.+++-.++ |.+ .|..- ...+...++. +-.++..-++...+..+.+.+++.|.. T Consensus 64 ~~~~~~~~ki~~n~~~~Iv~~~~~-----~l~G~p~~~--~~~d~~~~~~---l~~~~~~N~~~~~~~~~~~~~~~~G~a 133 (506) T protein:vir:94 64 HEDGKADHRATHSFAKYIADFQTS-----YSVGNPINV--KLPDDGSNSG---FDTFNKANDVDAENYDLFLDMSRYGRA 133 (506) T ss_pred ccccCCcceeecchHHHHHHHhhh-----hhcccCcee--ecCcchHHHH---HHHHHhccCHhHHHHHHHHHHHhcCeE Confidence 2222222111 111 11000 0011112222 223455667889999999999999999 Q ss_pred eEEEEEcCcc-eeeeecCcceeEEEEEE--CCeeEEEEEeeec---cCcc--hh----ccccHHHHHHHHHHhhhhhccC Q lcl|NC_015263. 149 YGYVIDDKES-VMIQQFPNDICKISSVS--GGVYNYVIDLDAL---VSAD--IV----DYYPKEIQEAVNKYTTMKKGNN 216 (513) Q Consensus 149 ~gy~i~d~~~-~~iq~lp~dyckIsg~~--nG~y~~~fD~syF---d~~~--~L----~~~p~Ei~~~y~~Y~~~k~~~~ 216 (513) |.+..-|.++ +-+..++|.-|-++.-. ++.+.+++=.-.. +... .+ +.|.+. .+.... + T Consensus 134 ~~~v~~ded~~~~i~~~~p~~~~~v~dd~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~yt~~------~~~~~~---~ 204 (506) T protein:vir:94 134 YEYVYRGEDNEEHLAKLDPLDTFVIYSTDVDPKPIMAVRYHQIELVDDNQVSTINYVPETWTAD------TYTLYN---P 204 (506) T ss_pred EEEEEecCCCeeEEEEEcccceEEEecCCCCCceEEEEEEEeeeeccCCceeEEEEEEEEEeCc------eEEEec---c Confidence 9887766544 56677899888877542 2445554322111 1110 01 111110 000000 0 Q ss_pred cccccC-eeecCCceE----EEEe-cCccccchhhHHHHHHhHHHHHHHHH-----HHhhHhhhhhceeeeeeec----- Q lcl|NC_015263. 217 KSASNW-YEIQDKNSI----CIKI-NESSLTPVPPFAGTFDSIYDIHSFKD-----LRNDKAELQNYKLLIQKLE----- 280 (513) Q Consensus 217 ~~~~~W-~~L~~~kt~----~ik~-~~~~~~~ip~f~~v~~d~~di~~~kd-----L~~~~~~i~n~~ii~~kip----- 280 (513) . ...| +.....|.+ ++.+ +.+ .+.+- |.++.++.+.=+ +-+..+...+-.++..-.+ T Consensus 205 ~-~~~~~~~~~~~~~~g~vPvv~~~n~~--~~~sd----~e~~~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~ 277 (506) T protein:vir:94 205 T-PIMGKMQVDTTKPITTFPVVEFKNSN--FRLGD----FENVLPLIDLYDAAQSDTANYMTDLNEAMLIIQGDIDTLFE 277 (506) T ss_pred c-cCccceeccccccCCccceEEecCCC--CCCCc----hhhhHHHHHHHHHHHHHHHHHHHHhhhHHHHHhcCcccccc Confidence 0 0011 111111211 1222 111 12222 222222221111 1111111111111111111 Q ss_pred -------cccCCCCCccccCHHHHHHHHHHHHHhc---cccceEEEecc--ccccccccccc-ccchhhhhhHHhhhhhh Q lcl|NC_015263. 281 -------TRSSNDNNDFTLDMPMMNYFHEALSMTV---PDNVGVVTSPM--EIDTVSFDKDS-STDDSVEKATKNFWDNA 347 (513) Q Consensus 281 -------~~~~n~~~~~~vd~~~~~~~~~~ik~~L---p~gv~~v~sP~--~~d~i~ld~~~-~~~dtv~~~~~~i~~~~ 347 (513) ....+.++...+..........+-+..+ +++.+.-..+- ++.-+.-+... ...-.++...++|+.-+ T Consensus 278 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s 357 (506) T protein:vir:94 278 GSDMMNTIDPNDEDAMAKLAKDKLELIKEMKDANMLLLKSGMTVNGTQTSVDAKYINKTYDVVGSEAYKKRVAGDIHKFS 357 (506) T ss_pred chhccccccccccccccccccchhHHHhhhhhcCeeeecccccccCccccccceeeeecCCHHHHHHHHHHHHHHHHHHh Confidence 1112333444433333333322222221 22221111110 11111111111 11123555567788888 Q ss_pred hhhhhhccCC--CcchHHHHHHHHHHHHHHHHHH----HHHHHHHHH---HHhhcc--cce---EEEEEecCCCCccHHH Q lcl|NC_015263. 348 GVSQILFSSD--NKTSQGIAMSIATDEQFIFGVI----NQLERWLNR---YLLLNG--MSK---YFKATMLEVTHFSKKE 413 (513) Q Consensus 348 GiS~~Lfn~d--~~s~~~~~~SI~~d~~~~~~~~----~~iE~~~N~---~i~~~~--~~~---~f~~~~l~~T~fn~ke 413 (513) ++-..-+.+. +.|+..++.....-.+.+-... +.|++-++. ++.... ... ..++.|-+..+-|..+ T Consensus 358 ~~p~~~~~~~~~n~Sg~Aik~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~e 437 (506) T protein:vir:94 358 HTPDLTDENFASNSSGVAMQYKVLGTVELASTKRRMFERGLYARYQIISDIENSIHGDWTFDPQELTFTFRDNLPADNIS 437 (506) T ss_pred CccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceEEeCCCCCcCHHH Confidence 8776555332 3344434433222222222222 222222222 222111 111 3578999999999999 Q ss_pred HHHHHHHHHhcC-CcHHHHHHHHhCC--CHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccCCCCcCCC Q lcl|NC_015263. 414 AHDRYITDAQYG-FPVKVYLASLMGI--DPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPT 490 (513) Q Consensus 414 ~~~~~~~~~~~G-~~~~~~laa~~G~--~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt 490 (513) .++.+.++. | .|....|. .+|+ +|.+.+.++..|++..+ +. ..+++..+.........++ + T Consensus 438 ~a~~~~kl~--g~iS~et~~~-~lp~v~d~~~E~~ri~~E~~~~~------~~---~~~~~~~~~~~~~~~~~~~----~ 501 (506) T protein:vir:94 438 QIKALVQAG--ATLPQKYLYQ-QLPGVTNPQDIVDMMKEQSANGD------YS---FDQNGVISNDGQTNTTATQ----T 501 (506) T ss_pred HHHHHHHHh--ccCChHHHHH-hCCCCCCHHHHHHHHHHHHHHHh------hc---chhhcCCCcccCccccccc----c Confidence 999999985 5 66655555 4665 47888899998875422 11 1122322111111111111 1 Q ss_pred Ccccc Q lcl|NC_015263. 491 NETTG 495 (513) Q Consensus 491 ~et~~ 495 (513) ++-++ T Consensus 502 ~~e~~ 506 (506) T protein:vir:94 502 DEEVR 506 (506) T ss_pred ccCCC Confidence 11122 No 180 >protein:vir:103219 Length: 201 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277473;genbank:gi:71834115;genbank:GeneID:3562330 Probab=54.29 E-value=0.51 Score=22.20 Aligned_cols=193 Identities=15% Similarity=0.099 Sum_probs=85.3 Q ss_pred eeeec-cccCCCCCccccCHHHHHHHHHHHHHhccccceEEEecccccccccccccccchhhhhhHHhhhhhhhhh-hhh Q lcl|NC_015263. 276 IQKLE-TRSSNDNNDFTLDMPMMNYFHEALSMTVPDNVGVVTSPMEIDTVSFDKDSSTDDSVEKATKNFWDNAGVS-QIL 353 (513) Q Consensus 276 ~~kip-~~~~n~~~~~~vd~~~~~~~~~~ik~~Lp~gv~~v~sP~~~d~i~ld~~~~~~dtv~~~~~~i~~~~GiS-~~L 353 (513) +-|+. +...-++|...+. +.-.+....+... .+++..-.-=+++.++-+-+ .-.|.+....+.+--++||- ..| T Consensus 1 V~k~~~l~~~~~~~~~~~~--~r~~~~~~~~~~~-~~~~ld~~~e~~e~~~~~ls-Gl~d~l~~~~~~iaa~s~iP~t~L 76 (201) T protein:vir:10 1 MWKAKGLADLCDDSDGAAR--LRLAQVDNNSGVG-QAIGIDADSEEYNVLNSDIG-GIDTFLSQKFDRIVALSGIHEIIL 76 (201) T ss_pred CccchHHHHHhcCChHHHH--HHHHHHHHhhhhh-hhheeecCCcceeeeecCcC-ChHHHHHHHHHHHHhHhcCchhhh Confidence 22222 1111112221111 1111222333221 11211111112334433332 33467888888888889988 666 Q ss_pred ccCCCcchHHHHHHHHHHH----HHHHHHHHHH-HHHHHHHHhhcccceEEEEEecCCCCccHHHHHHHHHHHHhcCCcH Q lcl|NC_015263. 354 FSSDNKTSQGIAMSIATDE----QFIFGVINQL-ERWLNRYLLLNGMSKYFKATMLEVTHFSKKEAHDRYITDAQYGFPV 428 (513) Q Consensus 354 fn~d~~s~~~~~~SI~~d~----~~~~~~~~~i-E~~~N~~i~~~~~~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~ 428 (513) ||-. .+|++.+=+.|- ..|-++++.. -..+.+.++.....-.|.|+|.++...+.||.++..++.++- . T Consensus 77 fG~s---p~Glnatge~d~~nyyd~i~~~Qe~~l~p~le~l~~~~~~~~~~~~~f~pL~~~s~kekAei~~~~a~a---~ 150 (201) T protein:vir:10 77 KGKN---VGGVSASQNTALETFYGYVDRKRKAELLPLLEFLLPFIVTEQEWSVEFNPLSQVSDKDKSEILEKNVNS---V 150 (201) T ss_pred cCCC---CccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCceEeeCCCCCCCHHHHHHHHHHHHHH---H Confidence 6543 223332222222 2222333222 123445555444455799999999999999999998887642 1 Q ss_pred HHHHHHHhC-CCHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccCCCCcCCCCc Q lcl|NC_015263. 429 KVYLASLMG-IDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNE 492 (513) Q Consensus 429 ~~~laa~~G-~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~e 492 (513) .. ++. .| ++|.++-..+...-..-.+.+-.++.--.++ ...+..+.|.++ T Consensus 151 ~~-~~~-~g~i~~~e~r~~L~~~~~~~~~~~~~~~~~~~~~------------e~~dp~~~~~~~ 201 (201) T protein:vir:10 151 AA-LIA-AGIIDADEARDTLRAISTEVKIGEGSIQTEVVIN------------ESEDPLDVSANN 201 (201) T ss_pred HH-HHH-cCCCCHHHHHHHHHhcCCcCCCCCCCCCcccccc------------ccCCCCCCCCCC Confidence 22 333 46 8899988877665211111110000000000 000111112221 No 181 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=49.32 E-value=0.64 Score=21.64 Aligned_cols=420 Identities=9% Similarity=0.021 Sum_probs=161.8 Q ss_pred CCCccchheeeeehhhhhhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhh Q lcl|NC_015263. 1 MVKNKKKRLSMIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQ 80 (513) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~ 80 (513) ...+.-.++.+.+.+ .+++.|..|. ......++++-+|+... T Consensus 28 ~~~~~~~~~~~~~~~-------------------------------------~l~~~i~~~~-~~~~~r~~~l~~yY~g~ 69 (501) T protein:vir:27 28 YRADNLEELMVNNWE-------------------------------------LLKNFINHHK-LRQAPRIQELLDYARGE 69 (501) T ss_pred hccccccccccccHH-------------------------------------HHHHHHHHHH-HHHHHHHHHHHHHhcCC Confidence 111112222222222 2333333321 11112233333333322 Q ss_pred ---------------------cchHHHHHHHHhhcccccceEeeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHH Q lcl|NC_015263. 81 ---------------------SQQYQRLLNFYANMPLYAYSVVPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIV 139 (513) Q Consensus 81 ---------------------sg~~~rlidy~~~mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~ 139 (513) .+..+.+++-+++...=+=+- +.-.+....+.+..- +..++..-++...+..+. T Consensus 70 ~~~i~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl~g~p~~--~~~~d~~~~~~~~~~---l~~~~~~n~~~~~~~~~~ 144 (501) T protein:vir:27 70 NHDVLQFGRRKDREMADKRAVHNYGRMISKFKTGYLAGNPIR--VEYDDNDNNSQNDDT---IKRIGRINDIDSHNRTLI 144 (501) T ss_pred CccccccCccCccccccceeccchHHHHHHHHhhhhcccCee--EecCCccchHHHHHH---HHHHHHhcChhHHHHHHH Confidence 223333333322211100000 110111111111111 223455567889999999 Q ss_pred HHHHHhcceeEEEEEcCcc-eeeeecCcceeEEEEEE--CCeeEEEEEeeeccC--cc--hhccccHHHHHHHHHHhhhh Q lcl|NC_015263. 140 KLAMTVDIFYGYVIDDKES-VMIQQFPNDICKISSVS--GGVYNYVIDLDALVS--AD--IVDYYPKEIQEAVNKYTTMK 212 (513) Q Consensus 140 ~~~l~~g~~~gy~i~d~~~-~~iq~lp~dyckIsg~~--nG~y~~~fD~syFd~--~~--~L~~~p~Ei~~~y~~Y~~~k 212 (513) +.+++.|..|.+...+.++ +-+..++|.-|.++--. .+.+.+++-.-+-.. .. ..+-|-++-.. .|.. T Consensus 145 ~~~~~~G~a~~~vy~ded~~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~~~~vyt~~~v~---~~~~-- 219 (501) T protein:vir:27 145 RDLSQTGRAYEVIYRNEYDETRIKRLNPLETFVIYDNSLEDNSIAAVRYYNRGTLQNAKDVVEIYTNEHIY---TLDA-- 219 (501) T ss_pred HHHhhCCeEEEEEEeCCCCceEEEEEccceeEEEecCCCCCceEEEEEEEEeeecCCcEEEEEEEeCCeEE---EEEe-- Confidence 9999999999988877655 56888999999887532 355666654333211 11 11111111000 0100 Q ss_pred hccCcccccCeee-cCCceE----EEEecCccccchhhHHHHHHhHHHHHHHHH--HHhhHhhhhhceeeeeeeccccCC Q lcl|NC_015263. 213 KGNNKSASNWYEI-QDKNSI----CIKINESSLTPVPPFAGTFDSIYDIHSFKD--LRNDKAELQNYKLLIQKLETRSSN 285 (513) Q Consensus 213 ~~~~~~~~~W~~L-~~~kt~----~ik~~~~~~~~ip~f~~v~~d~~di~~~kd--L~~~~~~i~n~~ii~~kip~~~~n 285 (513) ...|.++ ..+|.+ ++.+ -+...+++-|.. ..++.|..+.-- +-+......+ .+++-+=. ... T Consensus 220 ------~~~~~~~~~~~~~~g~vPvv~~-~nn~~g~sd~e~-v~~liDa~d~~~S~~~~~~~~~~~-~~~v~~g~--~~~ 288 (501) T protein:vir:27 220 ------SDDFNEISVTTHAFGTVPITEF-LNNVDGIGDYET-ELYLIDLYDSAESDTANHMSDMAD-AILAIYGD--LAL 288 (501) T ss_pred ------CCceeeccccccCCCcccEEEe-cCCCCCCCchhh-hHHHHHHHHHHHHHHHHHHHHhcC-ceeeeecC--ccC Confidence 0011111 111111 1222 112334444433 222322222111 1111122222 22221110 012 Q ss_pred CCCccccCHHHHHHHHHHHHHhccccceEEEec-ccccccccccccccc-hhhhhhHHhhhhhhhhhhhhccCCCc--ch Q lcl|NC_015263. 286 DNNDFTLDMPMMNYFHEALSMTVPDNVGVVTSP-MEIDTVSFDKDSSTD-DSVEKATKNFWDNAGVSQILFSSDNK--TS 361 (513) Q Consensus 286 ~~~~~~vd~~~~~~~~~~ik~~Lp~gv~~v~sP-~~~d~i~ld~~~~~~-dtv~~~~~~i~~~~GiS~~Lfn~d~~--s~ 361 (513) ..+++.-+++....+ ...+.+.+....+ .++.-+.-+...... -.++-..++|+.-+++...-+.+.++ |+ T Consensus 289 ~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg 363 (501) T protein:vir:27 289 PKGMQASDMKRTRLM-----QLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNIPDMSDTNFSGNTSG 363 (501) T ss_pred CcccchhhhhhcCce-----eecccccccCCCCCcceeeeeccCCHHHHHHHHHHHHHHHHHHhCCcccCccccccCchH Confidence 233333222221111 0111111111111 011111001011111 12344456777777777655544333 33 Q ss_pred HHHHHHHHHHHHHHHHHHH----HHHHHHHH---HHhhccc--ce---EEEEEecCCCCccHHHHHHHHHHHHhcC-CcH Q lcl|NC_015263. 362 QGIAMSIATDEQFIFGVIN----QLERWLNR---YLLLNGM--SK---YFKATMLEVTHFSKKEAHDRYITDAQYG-FPV 428 (513) Q Consensus 362 ~~~~~SI~~d~~~~~~~~~----~iE~~~N~---~i~~~~~--~~---~f~~~~l~~T~fn~ke~~~~~~~~~~~G-~~~ 428 (513) .+++.....-...+....+ .|++-+.. ++..... .. ..++.|-+..+-|..+.++.+.++. | .|. T Consensus 364 ~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~v~f~~~~p~n~~e~ad~~~kl~--g~iS~ 441 (501) T protein:vir:27 364 EALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEFKDFDESLLKITFTPNLPKSLNEQVSILTGLG--GQVSQ 441 (501) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccceEEeCCCCCcCHHHHHHHHHHHh--ccCcH Confidence 3344332222222222222 22222222 2222211 11 3679999999999999999999985 6 555 Q ss_pred HHHHHHHhCC--CHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccCCCCcCCCCcccccccCCCCC Q lcl|NC_015263. 429 KVYLASLMGI--DPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETTGNKDSDETQ 503 (513) Q Consensus 429 ~~~laa~~G~--~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~~n~~~~~~~ 503 (513) .- +.+.+++ +|.+.+.++..|++..+.......+ ...+ + ..++.+++...+ ..-+..+ T Consensus 442 et-~l~~l~~v~D~~~E~eri~~E~~e~~~~~~~~~~-~~~~---~-----------~~~d~~~~~~~d-~~e~~~~ 501 (501) T protein:vir:27 442 ET-ALSLSGLVESPNEELDKINKEVSEIDFKGYSNDF-NEHV---G-----------KYTDEVKETHTD-DFERAYE 501 (501) T ss_pred HH-HHHhCCCCCCHHHHHHHHHHHHHhhhHhhhcCcc-cccc---c-----------cccCCCCCCccc-cccccCC Confidence 44 5556665 4889999999987654422221111 1111 1 111111111000 0000111 No 182 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=44.59 E-value=0.8 Score=21.11 Aligned_cols=387 Identities=13% Similarity=0.048 Sum_probs=146.4 Q ss_pred eehhhhhhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhh----------- Q lcl|NC_015263. 12 IDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQ----------- 80 (513) Q Consensus 12 ~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~----------- 80 (513) +|.|.+... ++++| ..| ......+..+-+|+... T Consensus 1 ~~~e~~~~~-----i~~~~----------------------------~~~--~~~~~~~~~~~~Yy~g~hdi~~~~~~~~ 45 (471) T protein:vir:10 1 MEIEVIKKI-----ISSQM----------------------------VKH--GKFVSQAAEAEKYYRNENDIKRKRKPAD 45 (471) T ss_pred CCHHHHHHH-----HHHHH----------------------------HHH--HHHHHHHHHHHHHhccccccccccchhh Confidence 444443211 11111 111 11122222233333222 Q ss_pred -----------------------cchHHHHHHHHhhcccccceEeeccch-hhhhhcchhHHHHHHHHHHhhcChhHHHH Q lcl|NC_015263. 81 -----------------------SQQYQRLLNFYANMPLYAYSVVPFKDI-STANENKLKKELATVTEFLSRLNPKYNFS 136 (513) Q Consensus 81 -----------------------sg~~~rlidy~~~mpt~dY~I~P~~~~-~~~~~~~~~~~y~~v~~~L~k~n~k~~~~ 136 (513) .+..+.+++-.++ |.+ ++. ....++ ....+.+...-.=++..... T Consensus 46 ~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~-----yl~---G~p~~~~~~~---~~~~~~l~~~~~n~~~~~~~ 114 (471) T protein:vir:10 46 KKGAENEAKAEDNAFRNADNRISHNWHQLLLDQKKA-----YAL---TYPPTFDVDD---KKVNDMIVDVLGDDYERISK 114 (471) T ss_pred hhcccccccccccccccccceeccchhHHHHHhhhh-----hhc---ccCceeccCC---hHHHHHHHHHHhcCHHHHHH Confidence 2233333332222 211 111 000111 11112222222235777788 Q ss_pred HHHHHHHHhcceeEEEEEcC--cceeeeecCcceeEEEEEE--CCeeEEEEEeeeccCcc--------hhccccHH-HHH Q lcl|NC_015263. 137 KIVKLAMTVDIFYGYVIDDK--ESVMIQQFPNDICKISSVS--GGVYNYVIDLDALVSAD--------IVDYYPKE-IQE 203 (513) Q Consensus 137 ~i~~~~l~~g~~~gy~i~d~--~~~~iq~lp~dyckIsg~~--nG~y~~~fD~syFd~~~--------~L~~~p~E-i~~ 203 (513) .+.+.+++.|..|.+...|. ..+-+.-++|..|.++--. ++.+.++ ++++.... .++.|.++ +.. T Consensus 115 ~~~~~~~~~G~~~~~v~~d~~~g~~~~~~~~p~~~~~i~d~~~~~~~~~~--ir~~~~~~~~~~~~~~~~~vy~~~~~~~ 192 (471) T protein:vir:10 115 QLCVNAGNAGIAWLHVWKDASDNSFRYACVDSKEVIPIYSKSLDKKSIGV--LRVYSSIDETDGKNYTVYEYWNDKECSF 192 (471) T ss_pred HHHHHHhhCCeEEEEEEeeCCCCeeEEEEEcccceEEEEcCCCCCceEEE--EEEEEeeccCCCceeEEEEEEeCCcEEE Confidence 88999999999998887652 3457777899888777543 2334444 23332110 11111110 000 Q ss_pred ----------HHHHHhhhh--hccCcccccC--eeecCCceEEEEecCccccchhhHHHHHHhHHHHHHHHHH-----Hh Q lcl|NC_015263. 204 ----------AVNKYTTMK--KGNNKSASNW--YEIQDKNSICIKINESSLTPVPPFAGTFDSIYDIHSFKDL-----RN 264 (513) Q Consensus 204 ----------~y~~Y~~~k--~~~~~~~~~W--~~L~~~kt~~ik~~~~~~~~ip~f~~v~~d~~di~~~kdL-----~~ 264 (513) ....+.... ....+....+ +.-+-.+-.++.+.. ...+.|-| .++.++.+.=++ -+ T Consensus 193 y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-~~~~~sd~----e~v~~liDa~d~~~S~~~~ 267 (471) T protein:vir:10 193 YRHEKEKPLEELETFQAISLIDTMNGDRSSDNSFKHDFGLVPFIPFKN-NEIETNDL----KPIKDLVDVYDKVFSGFVN 267 (471) T ss_pred EEecCCcccccccccccccccccccccccccccccCCCCceeEEEecc-CCCCCCch----HHHHHHHHHHHHHHHHHHH Confidence 000000000 0000000000 000000111122211 11233333 233332222221 11 Q ss_pred hHhhhhhceeeeeeeccccCCCCCccccCHHHHHHHHHHHHHhccccceEEEec-----cccccccccccc-ccchhhhh Q lcl|NC_015263. 265 DKAELQNYKLLIQKLETRSSNDNNDFTLDMPMMNYFHEALSMTVPDNVGVVTSP-----MEIDTVSFDKDS-STDDSVEK 338 (513) Q Consensus 265 ~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~~~~~~~ik~~Lp~gv~~v~sP-----~~~d~i~ld~~~-~~~dtv~~ 338 (513) ..+...+ .+++ +.=..+...++ +...++. .++.-+-.. -+++-+.-+... .....++. T Consensus 268 ~~~~~~~-~~lv--~~g~~~~~~~~----------~~~~~~~---~~~i~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ 331 (471) T protein:vir:10 268 DTDDVQE-VIFV--LTNYGGQDKQE----------FLEDLKR---YKMIKMDNDGMGDQSGVTTIAIDIPTEARNLILER 331 (471) T ss_pred HHHHhhC-ceee--eecCCccccch----------hHHHhhc---CCeEEecCCCCccCccceEEeecCChHHHHHHHHH Confidence 2222222 1121 11000111111 1111111 111100000 011111111111 12234566 Q ss_pred hHHhhhhhhhhhhhhccCC-CcchHHHHHHHHHHHHHHHH----HHHHHHHHHHH---HHhhcccceEEEEEecCCCCcc Q lcl|NC_015263. 339 ATKNFWDNAGVSQILFSSD-NKTSQGIAMSIATDEQFIFG----VINQLERWLNR---YLLLNGMSKYFKATMLEVTHFS 410 (513) Q Consensus 339 ~~~~i~~~~GiS~~Lfn~d-~~s~~~~~~SI~~d~~~~~~----~~~~iE~~~N~---~i~~~~~~~~f~~~~l~~T~fn 410 (513) ..++|+..+++-..-+.+. +.|+.+++.-...-.+.+.. |-+.|++.++. ++.... ...+.+.|-+..+.| T Consensus 332 l~~~I~~~s~tp~~~~~~~gn~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~d-~~~i~i~f~~~~p~n 410 (471) T protein:vir:10 332 TKKQIFISGQGVNPETDKLGNSSGVALKFLYSLLELKAGNMETQFRSGYATLVKMILKHLGLSD-KLKIKQTWTRNSINN 410 (471) T ss_pred HHHHHHHHhCCcCCCcccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC-CceeEEEeCCCCCCC Confidence 6677888877644433322 23333343332222222221 22233332332 222221 224889999999999 Q ss_pred HHHHHHHHHHHHhcC-CcHHHHHHHHhCC--CHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccCCCCc Q lcl|NC_015263. 411 KKEAHDRYITDAQYG-FPVKVYLASLMGI--DPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENG 487 (513) Q Consensus 411 ~ke~~~~~~~~~~~G-~~~~~~laa~~G~--~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~g 487 (513) ..+.++.+.++. | .|....+. .+++ +|.+.+.+++.|++.- .+.+.+. + +.. T Consensus 411 ~~e~~~~~~kl~--g~iS~et~~~-~~p~v~D~~~E~eri~~E~~~~--~~~~~~~----~--~~~-------------- 465 (471) T protein:vir:10 411 DTEMAQVVSTLA--TITSRENVAK-SNPIVEDWQDELRLQKAEQEGR--SEKLYDM----E--EVE-------------- 465 (471) T ss_pred HHHHHHHHHHHh--ccCchHHHHH-hCCCCCCHHHHHHHHHHHHHHH--Hhccccc----C--CCC-------------- Confidence 999999999985 5 66555554 5666 7899999999987541 0111111 1 111 Q ss_pred CCCCccc Q lcl|NC_015263. 488 RPTNETT 494 (513) Q Consensus 488 rPt~et~ 494 (513) |..|-. T Consensus 466 -~~~e~~ 471 (471) T protein:vir:10 466 -HESEVE 471 (471) T ss_pred -CccccC Confidence 111100 No 183 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=44.08 E-value=0.82 Score=21.06 Aligned_cols=393 Identities=12% Similarity=0.068 Sum_probs=177.3 Q ss_pred cCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhcchHHHHHHHHhhcccccc---------------- Q lcl|NC_015263. 36 RTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQQYQRLLNFYANMPLYAY---------------- 99 (513) Q Consensus 36 ~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~~~rlidy~~~mpt~dY---------------- 99 (513) +-|..++..=. . |..-...|..|...+...-+.++++-.||..-+.+.| T Consensus 1 ~~~~~~~~~~g--l-------------~~~~~~~~~~L~~~~~~~~~~~~~~~~Yy~G~~~~~~~~~~~p~~~r~~~~v~ 65 (474) T protein:vir:81 1 MIQQQTVRIPS--L-------------SNDENALINGLLAQIENLRWKNLLRTSYYENKRTIQYVGTLIPPQYFNLGLVL 65 (474) T ss_pred CcCCCcCcCCC--C-------------ChhHHHHHHHHHHHHHHHhhHHHHHHHHhccCCChhhccccccHHHHHHHhhc Confidence 33432222211 1 1122223333444444444444444444444444333 Q ss_pred ------------------eEeeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcceeEEEEE--cCcc- Q lcl|NC_015263. 100 ------------------SVVPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDIFYGYVID--DKES- 158 (513) Q Consensus 100 ------------------~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~~~gy~i~--d~~~- 158 (513) +..| .... .....+ .....-++....+.+.+.+++.|.-|.+... ++++ T Consensus 66 nw~~~~Vd~~a~rl~~~Gf~~~----d~~~---~~~~l~---~iw~~N~ld~~~~~~~~~al~~G~sf~~V~~~~d~~~~ 135 (474) T protein:vir:81 66 GWTGKAVDALARRCNLEGFVWP----DGDL---DSLGGT---EVVDDNHLLSEIDSAIVAAMQHGPAFLINTVGEDDEPE 135 (474) T ss_pred ChHHHHHHHHHhhhcccceECC----CCCc---cchHHH---HHHHhcChhHHHHHHHHHHHhhCceeEEEecCCCCCce Confidence 2222 0000 011122 2345666888899999999999999998875 3332 Q ss_pred eeeeecCcceeEEEEE-ECCeeEEEEEeeeccCcch---hc-cccHHHHHHHHHHhhhhhccCcccccCeeecCCceE-- Q lcl|NC_015263. 159 VMIQQFPNDICKISSV-SGGVYNYVIDLDALVSADI---VD-YYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSI-- 231 (513) Q Consensus 159 ~~iq~lp~dyckIsg~-~nG~y~~~fD~syFd~~~~---L~-~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~-- 231 (513) ..|...++..|.++-= ..+...+++-...=+.... .. ++|.++... . .++....|..=..+|.+ T Consensus 136 ~~i~~~sp~~~~~~~D~~~~~~~~al~~~~~~~~g~~~~~~ly~~~~~~~~----~-----~~~~~~~w~~~~~~~~~gv 206 (474) T protein:vir:81 136 ALIHVKDASEATGEWNRRRRGLNNLLSIIDKDKEGKVLSLALYLDNETVTA----Q-----RDKATLKWQVDRDEHVYGV 206 (474) T ss_pred eEEEEeccceEEEEEeCCCCcceeeeEEEEEcCCCcEEEEEEEeCCcEEEE----E-----EcCccceeeeccCCCCCCc Confidence 6789999999986542 3366666665443232211 11 234333211 0 11112334322222322 Q ss_pred -EEEe----cCccccchhhHHHHHHhHHHHHHHHHHH-hhHhhhhhceeeeeeeccc--cCCCCCccc-cCHHHHHHHHH Q lcl|NC_015263. 232 -CIKI----NESSLTPVPPFAGTFDSIYDIHSFKDLR-NDKAELQNYKLLIQKLETR--SSNDNNDFT-LDMPMMNYFHE 302 (513) Q Consensus 232 -~ik~----~~~~~~~ip~f~~v~~d~~di~~~kdL~-~~~~~i~n~~ii~~kip~~--~~n~~~~~~-vd~~~~~~~~~ 302 (513) ++-+ +...+.|.+-++--+.++.|.....=+. .+..+. ..+|.+ -|=+..++. -|.+....|.. T Consensus 207 PvV~~~n~~~~~~~~G~s~i~e~v~~l~da~~r~~~~~~~~~e~-------~a~pqr~i~G~~~~~~~d~d~~~~~~~~~ 279 (474) T protein:vir:81 207 PAQVLPYKPAPKRPFGQSRITKPMMGLQDAGVRELARREGHMDV-------FSYPEFWLLGADESALKNADGTIKSVWEA 279 (474) T ss_pred ceEEecccccccCcCCccccchhHHHHHHHHHHHHHHHHHHHHH-------hcchhheeecCChhhcccccccccchhhh Confidence 2222 2233345554433333333332222222 222222 223311 011111111 22233344544 Q ss_pred HHHHhc--cccceEEEecccccccc---ccccccc--chhhhhhHHhhhhhhhhhhhhcc--C-CC-cchHHHHHHHHHH Q lcl|NC_015263. 303 ALSMTV--PDNVGVVTSPMEIDTVS---FDKDSST--DDSVEKATKNFWDNAGVSQILFS--S-DN-KTSQGIAMSIATD 371 (513) Q Consensus 303 ~ik~~L--p~gv~~v~sP~~~d~i~---ld~~~~~--~dtv~~~~~~i~~~~GiS~~Lfn--~-d~-~s~~~~~~SI~~d 371 (513) .+...+ |.+= -+-.| .....+ |+..... -|.+...-.++...+||..-.|| + ++ .|+.++...-..- T Consensus 280 ~~~~i~~~~~d~-d~~~~-~~~~~~~~q~~~a~l~~~~~~l~~~~~~~a~~t~iP~~~lG~~~~~np~SaeAi~a~~~~l 357 (474) T protein:vir:81 280 RLGRIKGLPDDA-DADIP-QLARADVKQFPAASPDAHWSDINGLAKLFAREASLPDTAVAISGLSNPTSAESYDASQYEL 357 (474) T ss_pred hHHHHhcCCCcc-ccccc-ccccccccccCCCChhHHHHHHHHHHHHHHhhhCCCHHHhcccccccccHHHHHHHHHHHH Confidence 444433 2210 01111 111222 2222111 13455555566666777755654 3 44 4555676665444 Q ss_pred HHHHHHHHHHHHHHHHHHHh------hccc-------ceEEEEEecCCCCccHHHHHHHHHHHHhcC--CcHHHHHHHHh Q lcl|NC_015263. 372 EQFIFGVINQLERWLNRYLL------LNGM-------SKYFKATMLEVTHFSKKEAHDRYITDAQYG--FPVKVYLASLM 436 (513) Q Consensus 372 ~~~~~~~~~~iE~~~N~~i~------~~~~-------~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G--~~~~~~laa~~ 436 (513) -..+-+-.+.+..=+.+.+. .+.. -...++++-+...-+.-..++.+.|+++-| .+....+-..+ T Consensus 358 ~~kae~k~~~fg~~l~~~~rla~~i~~~~~~~~~~~~~~~~~v~W~d~~~~s~a~~aDa~~Kl~~a~~~~~~~~~~~~~l 437 (474) T protein:vir:81 358 IAEAEGAVDDFTPALRKAFIRALAMKNKVAIDEIPDEWKSIDAKWRDPRYLSKSAQADAGMKQLAAVPWLAETEVGLELI 437 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccchhhccceeEecCCCccCHHHHHHHHHHHHhcccCCCcHHHHHhhc Confidence 44444444444442332222 1111 124677888888889999999999998865 66677778889 Q ss_pred CCCHHHHHHHHHHHH--HhhCcccccCcccccccccccccccCCccccCCCCcCCCCc Q lcl|NC_015263. 437 GIDPVAFTGLLKVEN--EMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNE 492 (513) Q Consensus 437 G~~p~~~~~~~~~E~--e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~e 492 (513) |++|.++-....... +..+.-+-+. . -+ ... +|.+ T Consensus 438 g~t~~~i~~~~~~~~~~~~~~~~~~l~--~-----~~----~~~----------~~aq 474 (474) T protein:vir:81 438 GLTPQQARRAMADKRRVQGRGTLQALI--D-----RS----NNG----------ATAQ 474 (474) T ss_pred CCCHHHHHHHHHHHHHHhHHHHHHHHH--h-----cC----CCC----------CCCC Confidence 999999876554422 2222111111 0 01 111 1111 No 184 >protein:vir:98816 Length: 446 # NCBI annotation: hypothetical protein # Family: family:all:32558 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851097;genbank:gi:117530254;genbank:GeneID:4484480 Probab=38.27 E-value=1.1 Score=20.41 Aligned_cols=390 Identities=12% Similarity=-0.014 Sum_probs=146.3 Q ss_pred HHHHHH---HHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhcchHHHHHHHH-hhccc Q lcl|NC_015263. 21 SNKRNN---RISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQQYQRLLNFY-ANMPL 96 (513) Q Consensus 21 ~~~~~~---~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~~~rlidy~-~~mpt 96 (513) -||-.- ..-+.+++.. +....-....+.+.|.+-+ . ..+++-+=-++-+-+....+||+..++-. .++-. T Consensus 1 ~~~~~~~~p~~~~~~~~~~-~~~~~~~~~g~~~~D~~lr---~--~gg~~~~~~~l~~~m~e~D~~v~s~l~~Rk~av~~ 74 (446) T protein:vir:98 1 MNMEVRNAPTPAIRRRTIY-AMEHLGLATSYLSEDGGYK---R--AGKPTYQQLSAWDEAAQTEPIIAQGLDSIALSVLN 74 (446) T ss_pred CcccccCCCchhhhhhhhh-ccccchhhcccCCcchHhh---h--cCCChHHHHHHHHHHHhcchHHHHHHHHHHHHhhc Confidence 111100 0111111111 1111111111222222211 1 12333222345566677899999988865 45678 Q ss_pred ccceEeeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcceeEEEEEc-----CcceeeeecCccee-E Q lcl|NC_015263. 97 YAYSVVPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDIFYGYVIDD-----KESVMIQQFPNDIC-K 170 (513) Q Consensus 97 ~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~~~gy~i~d-----~~~~~iq~lp~dyc-k 170 (513) +++.|.|-.. +.-.-+..+|..+.. .+++.+ +.....|||-... ..+ ..+|.+|+ + T Consensus 75 ~~w~V~p~~~----------~~a~~v~~~l~~~~~----~~~~~~-~ldai~~G~s~~Eivw~~~~g---~~~p~~~~d~ 136 (446) T protein:vir:98 75 KVGPYQHGDK----------RIKKFIDDQLRNRAK----TWISHC-VKSIMTYGFSLSEQIYAHGAR---DNMPATVLDD 136 (446) T ss_pred CCceecCccH----------HHHHHHHHHHhhcCc----hhHHHH-HHHHHhhCceeeeEEEeeccc---ccccchhhcc Confidence 9999998321 111124555665543 334443 3455567765421 111 11222221 1 Q ss_pred EEEEECCeeEEEEEeeeccC-cchhccccHHHHHHHH------------HHhhhhhccCcccccCeeecCCceEEEEecC Q lcl|NC_015263. 171 ISSVSGGVYNYVIDLDALVS-ADIVDYYPKEIQEAVN------------KYTTMKKGNNKSASNWYEIQDKNSICIKINE 237 (513) Q Consensus 171 Isg~~nG~y~~~fD~syFd~-~~~L~~~p~Ei~~~y~------------~Y~~~k~~~~~~~~~W~~L~~~kt~~ik~~~ 237 (513) +++..-=-.++. |+. ...+..-+..+.+-.. .... +.... ..=+.||..|-+++..+. T Consensus 137 ~~~~~~~~~r~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~---g~~~~iP~~kfi~~~~~~ 207 (446) T protein:vir:98 137 IVNYHPLQVMLI-----ANDNGRIVDGDTVTASQYKSGYWVPLPPYRIGDPPK-KVDVV---GSHVRLPSHKRLFINYNT 207 (446) T ss_pred ccccccccceee-----eccCCccccccccchhhcccccccCcccchhhhhhh-hcccC---cccccccccceEEEEecC Confidence 111111001111 111 1111111111111000 0000 00000 112568998999988755 Q ss_pred cc--ccchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeeeeeccccCCCCCccccCHHHH-----HHHHHHHHHhccc Q lcl|NC_015263. 238 SS--LTPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQKLETRSSNDNNDFTLDMPMM-----NYFHEALSMTVPD 310 (513) Q Consensus 238 ~~--~~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~kip~~~~n~~~~~~vd~~~~-----~~~~~~ik~~Lp~ 310 (513) .+ +++.+.+..++-..+-...-...-..-.+.--.-+.+.|.|-+..+++ .-.-+...+ .+....+.+.-.+ T Consensus 208 ~~~~p~G~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vGkyp~ga~~~~-~~~~~~~~~~~~~~~~L~~av~~~~~d 286 (446) T protein:vir:98 208 KGNNPWGTSCLTSVLDYSIFKRAFRDMMLIALDRYGTPLIYVIVPPGNTGVV-EEAPDGTEITTTIAEQAEDALRRLSTD 286 (446) T ss_pred CCCCccccchHHHHHHHHHHHHhhHHHHHHHHhHcCCceeEEeecCCCCccc-ccchhHHHHHHHHHHHHHHHHHhcccc Confidence 44 345444444444443333333333344444444577888884322221 111222222 2233333332234 Q ss_pred cceEEEe---ccccccccccccc--ccchhhhhhHHhhhhhhhhhhhhccC------C-CcchHHHHHHHHH--HHHHHH Q lcl|NC_015263. 311 NVGVVTS---PMEIDTVSFDKDS--STDDSVEKATKNFWDNAGVSQILFSS------D-NKTSQGIAMSIAT--DEQFIF 376 (513) Q Consensus 311 gv~~v~s---P~~~d~i~ld~~~--~~~dtv~~~~~~i~~~~GiS~~Lfn~------d-~~s~~~~~~SI~~--d~~~~~ 376 (513) +.+.+.. | +-..|+|-..+ ...++-.-. =|-+.-||.+++++ + +++++.+..++.. -..++- T Consensus 287 a~~ii~~~~~P-~g~eie~~ea~~~~~~~~~~~i---~~~d~~IskaiLg~~Ltl~~~~~~~GS~ala~vh~~V~~d~~~ 362 (446) T protein:vir:98 287 SGLVLTQLSKE-QPVQVGALTTGNNFSDSFERAI---SLCDNNMLMGMGIPNLLVQNRETTFGTGRASEIQLELFDGKIN 362 (446) T ss_pred ceeeeecccCC-CCceEEeeccccCChhhHHHHH---HHHHHHHHHHHhcccccccccccccchhhhHHHHHHHHHHHHH Confidence 4333311 4 22334443322 222221111 12345566555433 1 2122223233321 122233 Q ss_pred HHHHHHHHHHHHHHh-----hccc--ceEEE-----EEecCCCCccHHHHHHHHHHHHhcCCcHHHHHHHHhCCCHHHHH Q lcl|NC_015263. 377 GVINQLERWLNRYLL-----LNGM--SKYFK-----ATMLEVTHFSKKEAHDRYITDAQYGFPVKVYLASLMGIDPVAFT 444 (513) Q Consensus 377 ~~~~~iE~~~N~~i~-----~~~~--~~~f~-----~~~l~~T~fn~ke~~~~~~~~~~~G~~~~~~laa~~G~~p~~~~ 444 (513) .-.++|+.-+|+.|= -|.- ...++ ++|--...-+.+.+++.+.++...|+-+ |.. T Consensus 363 aDa~~i~~tln~~Li~~l~~lNf~~~~~~~~~~~~~~~~~~~e~eDl~~~a~~~~~L~~~G~~~-----------p~~-- 429 (446) T protein:vir:98 363 SIFDTVIHAFTEQVIGNLIRLNFDPALYPLASNTGYITRLPGRATDLAALVEAIKQMHDMGFLV-----------DGD-- 429 (446) T ss_pred HHHHHHHHHHHHHHHHHHHHhCCCccccccccccccceeccCChhhHHHHHHHHHHHHhCCccc-----------ccc-- Confidence 567778887887442 1211 11111 2222222334455566666666666421 110 Q ss_pred HHHHHHHHhhCcccccCcccccc Q lcl|NC_015263. 445 GLLKVENEMLDLPEIMTPLSSSF 467 (513) Q Consensus 445 ~~~~~E~e~L~l~~~~~Pl~TS~ 467 (513) . .+=.|.+|| ++.+.|- T Consensus 430 -~-~~ire~~gi----P~~~~~~ 446 (446) T protein:vir:98 430 -K-DHIRSITGL----PDAISST 446 (446) T ss_pred -H-HHHHHHhCc----CCCCCCC Confidence 0 011145564 3322221 No 185 >protein:vir:95254 Length: 488 # NCBI annotation: Phage conserved protein # Family: family:all:2372 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944885;genbank:gi:158267601;genbank:GeneID:2744039 Probab=32.98 E-value=1.4 Score=19.80 Aligned_cols=427 Identities=11% Similarity=0.071 Sum_probs=162.7 Q ss_pred eeehhhhhhH-HHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhhcchHHHHHH Q lcl|NC_015263. 11 MIDVESISSY-SNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQSQQYQRLLN 89 (513) Q Consensus 11 ~~~~~~~~~~-~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~sg~~~rlid 89 (513) |-+.|.+.+- |=.|-.++|-.++- .-+. ..| ++..--.+=.+.. ++-+.+.. .+|++..++ T Consensus 1 ~~~~~~~~~gl~p~rl~~i~~~~~~-~~~~---~~~---------~~~~~~Lr~~~~~----~ly~~m~~-D~hi~s~l~ 62 (488) T protein:vir:95 1 MADITETQESLPPFRMGEVGSLGLK-VKNG---RIY---------EEPRQALRFPESI----KTFQLMMR-DPAVAASVN 62 (488) T ss_pred CCCccccCCCCCHHHHHHHHHHhhc-cccc---hhh---------ccchhhhcccchH----HHHHHHhh-ChHHHHHHH Confidence 6666665443 33333344321110 0000 000 0000000001122 23444443 788888887 Q ss_pred HHh-hcccccceEeeccch-hhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcceeEEEEEc----Ccceeeee Q lcl|NC_015263. 90 FYA-NMPLYAYSVVPFKDI-STANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDIFYGYVIDD----KESVMIQQ 163 (513) Q Consensus 90 y~~-~mpt~dY~I~P~~~~-~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~~~gy~i~d----~~~~~iq~ 163 (513) -.. ++-.+++.|.|-+.. ...+.....+- +...|. +++..+..++..++ ..++|||-.-. ...-...+ T Consensus 63 ~Rk~av~~~~w~v~p~~~~~~d~~~~~~a~~---v~~~l~--~~~~~~~~~i~~~l-da~~~G~s~~Eivw~~~~~~~~~ 136 (488) T protein:vir:95 63 IIKMFVRKVNWRFVPPKGKEQDPKMLERADF---FNSLMD--DMEHDWADFINSVM-SFCTYGFCVNEKVYKKRQGKKGK 136 (488) T ss_pred HHHHHHhcCCceEecCCCCchhHHHHHHHHH---HHHHHh--ccCccHHHHHHHHH-Hhhcccceeeeeeeecccccccc Confidence 654 455699999996533 22211111111 112232 24556778888887 67888876511 00000000 Q ss_pred cCcceeEEEEEECCeeEEE-------EEeee--ccCcchhc----cccHHHHHHHHHHhhhhhccCcccccCeeecCCce Q lcl|NC_015263. 164 FPNDICKISSVSGGVYNYV-------IDLDA--LVSADIVD----YYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNS 230 (513) Q Consensus 164 lp~dyckIsg~~nG~y~~~-------fD~sy--Fd~~~~L~----~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt 230 (513) ....| .+|.+... -.+++ |+...-+. ..+++.... ...........=.+||+.|- T Consensus 137 ~~~~~------~dg~~~~~~i~~Rpq~~~~~f~~d~d~~l~~~~~~~~~~~~~~------~~~~~~~~~~~~~~lP~~kf 204 (488) T protein:vir:95 137 YQSKF------DDGLIGWAKLPIRNQSTLDKWYFDEDFRRVTGVRQNLRNVSHI------AGAINLGERPLTRKLPRAKF 204 (488) T ss_pred ccccc------cCCeeeeeeeeecCcccccceeeccCCCceeeccccccccccc------ccccccccccccccccccce Confidence 01111 22322110 01111 22111111 011111100 00000000112256888888 Q ss_pred EEEEecCccccchhhHHHHHHhHHHHHHHHHH--HhhHhhhhh--ceeeeeeeccccCCCCCccccCHH-HHHHHHHHHH Q lcl|NC_015263. 231 ICIKINESSLTPVPPFAGTFDSIYDIHSFKDL--RNDKAELQN--YKLLIQKLETRSSNDNNDFTLDMP-MMNYFHEALS 305 (513) Q Consensus 231 ~~ik~~~~~~~~ip~f~~v~~d~~di~~~kdL--~~~~~~i~n--~~ii~~kip~~~~n~~~~~~vd~~-~~~~~~~~ik 305 (513) ++++.+.. .+-|.-.+++..++=.--+|.. +.--.=+|- +-+.+-+.|.+ -.+++..-+.. .++...+++. T Consensus 205 i~~~~~~~--~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~Er~g~g~p~~~~p~~--~~~~~~~~e~~~l~~a~~~i~~ 280 (488) T protein:vir:95 205 MLFKYDDE--YGNPEGRSPLLNAYVPWKYKVQIEEYEAVGVSRDLVGMPKIGLPPD--YLDENAEPEKKAFVQYCKTVVN 280 (488) T ss_pred EEEeecCC--CCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeEeeccC--CCCCcccHHHHHHHHHHHHHHH Confidence 88876443 2334455566665555555552 222222221 34445555532 12223322222 2244444444 Q ss_pred HhccccceEEEec--ccccc----ccccc--cc--ccchhhhhhHHhhhhhhhhhhhhccCC-----CcchHHHHHHHH- Q lcl|NC_015263. 306 MTVPDNVGVVTSP--MEIDT----VSFDK--DS--STDDSVEKATKNFWDNAGVSQILFSSD-----NKTSQGIAMSIA- 369 (513) Q Consensus 306 ~~Lp~gv~~v~sP--~~~d~----i~ld~--~~--~~~dtv~~~~~~i~~~~GiS~~Lfn~d-----~~s~~~~~~SI~- 369 (513) ++....-+.++.| ++++. ++|.. .+ ...++..=. =+-+.-||.+++|.. ..+++.+...+. T Consensus 281 ~~~~~~~ag~iiP~g~~~~~k~~~~e~~l~~~~~~~~~~~~~li---~~~d~~Isk~iLGqtLT~~~~~~Gs~Al~~vh~ 357 (488) T protein:vir:95 281 DMIANDRAGLIWPRYIDPDTKEDIFEFSLVSRQGAKAYDTGSII---DRYSKQIMMAFMSDVLAMGQSKYGSFSLADSKT 357 (488) T ss_pred HhhccchhheeeccccccccchhhhhhhccccccCCchhHHHHH---HHHHHHHHHHHhccccccccCcchhhhHHHHHH Confidence 4332111222223 22222 22221 11 111111100 123456666666542 112222322222 Q ss_pred -HHHHHHHHHHHHHHHHHHHHH-----hhccc--ceEEEEEecCCCCccHHHHHHHHHHHHhcCCcHH-----HHHHHHh Q lcl|NC_015263. 370 -TDEQFIFGVINQLERWLNRYL-----LLNGM--SKYFKATMLEVTHFSKKEAHDRYITDAQYGFPVK-----VYLASLM 436 (513) Q Consensus 370 -~d~~~~~~~~~~iE~~~N~~i-----~~~~~--~~~f~~~~l~~T~fn~ke~~~~~~~~~~~G~~~~-----~~laa~~ 436 (513) .-..++-.-..+|++-+|+.| ..|.- ...-+|.|-....-+.+..++.+.++...|+-+. .++...+ T Consensus 358 ev~~~i~~aDa~~i~~tln~~li~~l~~~Nfg~~~~~P~~~~~~~e~~Dl~~~ae~~~~L~~~G~~i~~~~~~~~i~e~~ 437 (488) T protein:vir:95 358 SLLAMSVDILLKQIKNVINRDLVAQTYALNMWDDEEHVQITYDDIETPDLEAIGSYIQKTVAVGALEVDKELSNKLREHI 437 (488) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccEEEecCcChhhHHHHHHHHHHHHhCCCccccHHHHHHHHHHh Confidence 222333355667777777632 12211 2224667766666677888889888888886543 3344445 Q ss_pred CCCHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccCCCCcCCCCcccccccCCCCCCCCCCccCCC Q lcl|NC_015263. 437 GIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNETTGNKDSDETQRAKDKPANTQ 513 (513) Q Consensus 437 G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et~~n~~~~~~~~~~d~~~~~~ 513 (513) |+.+-+ +-| +++. ..+|-.. ..++.. .++ ...++. ...++..++ T Consensus 438 gip~~~-----~~e-~~~~---~~~~~~~----------~~~~~~-----~~~--------~~~~~~-~~~~~~~~~ 481 (488) T protein:vir:95 438 GLPPAD-----ESQ-PVSE---KLSPNSQ----------SRSGDG-----YKT--------AGEGTA-KTPSAKDPS 481 (488) T ss_pred CCCCCC-----CCc-cccc---cCCCCCC----------CCCCcc-----cCC--------CcccCC-cccccccch Confidence 655321 001 1111 1111000 000000 000 000000 011111111 No 186 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=30.67 E-value=1.6 Score=19.53 Aligned_cols=416 Identities=10% Similarity=0.071 Sum_probs=159.7 Q ss_pred eeeeehhhhhhHHHHHHHHHHHHHhhccCcccccccccccchHHHHHHHhhhccChhHHHHHHHHHHHHHhh-------- Q lcl|NC_015263. 9 LSMIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKVRKIVKEYRNEGNQKTLRKVSEDLAVQ-------- 80 (513) Q Consensus 9 ~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~k~~i~~~~P~~n~~~ir~~s~~lY~~-------- 80 (513) .+|++. +..+|-+=+++... .+... ...| +..|. +|....+.|+.--.|+-.. T Consensus 1 m~~~~~-----------ik~~~~~~~~~~~~--~~~~~--~i~d--~~~i~--~~~~~~~~i~~~~~~Y~g~~~~l~~~~ 61 (505) T protein:vir:79 1 MAFWDT-----------LKNLFRKGSAAVGM--TKSLG--QIID--DPRIN--LPADEVERIARDKRYYMDDFKQVTHKN 61 (505) T ss_pred CchHHH-----------HHHHHHHhhhhhcc--hhhhh--hhhc--ccCCC--CCHHHHHHHHHHHHHhcCCCccccccc Confidence 444432 11112111111100 00000 0000 00011 1333344443332222111 Q ss_pred ------------cchHHHHHHHHhhcccccceEeeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcce Q lcl|NC_015263. 81 ------------SQQYQRLLNFYANMPLYAYSVVPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDIF 148 (513) Q Consensus 81 ------------sg~~~rlidy~~~mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~~ 148 (513) -+.-+.+.+-.+++.+-+-.- +.-. ++..++-.. ..|..=+....+..++..|...|.. T Consensus 62 ~~~~~~~~~~~slnl~~~i~~~~A~ll~~e~~~--i~~~----d~~~~e~l~---~i~~~n~f~~~~~~~~e~a~a~G~~ 132 (505) T protein:vir:79 62 SYGDTQKHELQSVNVTKLASAKLASLIFNEQCQ--VTVS----DETANDFLD---DVFQQNDFYTTFEEKLEEWIALGSG 132 (505) T ss_pred cCCCccccceeecchHHHHHHHHHhhhcCCCce--eecC----ChHHHHHHH---HHHHhccHHHHHHHHHHHHhhcCCe Confidence 122233333333322111000 1101 111222222 2244434566677888888887777 Q ss_pred eEEEEEcCcceeeeecCcceeEEEEE-ECCeeEEEEEee--eccCc-----chhccccHHHHHHH---HHHhhh------ Q lcl|NC_015263. 149 YGYVIDDKESVMIQQFPNDICKISSV-SGGVYNYVIDLD--ALVSA-----DIVDYYPKEIQEAV---NKYTTM------ 211 (513) Q Consensus 149 ~gy~i~d~~~~~iq~lp~dyckIsg~-~nG~y~~~fD~s--yFd~~-----~~L~~~p~Ei~~~y---~~Y~~~------ 211 (513) +.-..-|+..+-+...|++-.-++.. ++++-.++|=.. ..++. ..|+++-.+-.+.+ .-|++. T Consensus 133 ~~k~~~D~~~~~i~~v~ad~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~I~n~ly~~~~~~~lG 212 (505) T protein:vir:79 133 CVRPYVDSGKIKLAWATADQVYPLQADTNQVNELAIASRTTEVENHRTIYYTLLEFHQWDHGDYVITNELYRSEAAETVG 212 (505) T ss_pred EEEEEEeCCceEEEEEcCCeeEEEEEcCCCeEEEEEEEEEEEecCCcceEEEEEEEEEecCceEEEEEEEEecCCCCccC Confidence 76665566666666667764444433 334434444222 21111 11333211100100 011100 Q ss_pred hhccCcccccCeeecCCce--------EE-E------EecCccccchhhHHHHHHhHHHHHHH-HHHHhhHhhhhhceee Q lcl|NC_015263. 212 KKGNNKSASNWYEIQDKNS--------IC-I------KINESSLTPVPPFAGTFDSIYDIHSF-KDLRNDKAELQNYKLL 275 (513) Q Consensus 212 k~~~~~~~~~W~~L~~~kt--------~~-i------k~~~~~~~~ip~f~~v~~d~~di~~~-kdL~~~~~~i~n~~ii 275 (513) +...-..-..|-.|.++.+ |+ | +++..+++|++-|..+..-+-+++.. -++.+.....+. +|. T Consensus 213 ~~v~l~~~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~-~i~ 291 (505) T protein:vir:79 213 INVPLNSLEQYEGLEPQVKITGLKHPLFAFYRNKGANNKNFTSPMGMSLIDNSYTVIDAINRTHDQFVDEVKKGQR-RLI 291 (505) T ss_pred cccchhhcccccccCcceeecCCCcceEEEecCCcccccccCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhccc-cee Confidence 0000000111433333222 22 2 23445666766666655333222221 111111111112 222 Q ss_pred e-----eeeccccCCC-CCcc-ccCHHHHHHHHHHHHHhccccceEEEeccccccccccc-ccccchhhhhhHHhhhhhh Q lcl|NC_015263. 276 I-----QKLETRSSND-NNDF-TLDMPMMNYFHEALSMTVPDNVGVVTSPMEIDTVSFDK-DSSTDDSVEKATKNFWDNA 347 (513) Q Consensus 276 ~-----~kip~~~~n~-~~~~-~vd~~~~~~~~~~ik~~Lp~gv~~v~sP~~~d~i~ld~-~~~~~dtv~~~~~~i~~~~ 347 (513) + ...|-+.+.. ...+ ..+.+. ..|..... .++-+ .++.+..+- ...--++++...+.|...+ T Consensus 292 v~~~~l~~~~~~~~~~~~~~~~~fd~~~-~~y~~~~~---~~~~~------~i~~~~~~ir~e~~~~~l~~~l~~i~~~~ 361 (505) T protein:vir:79 292 VPAEWLKTGSSYGGQASETHPPMFDPDE-TVYQAMYG---DASEV------GFHDATSPIRVADYQATMDFFLREFENQT 361 (505) T ss_pred echHHhcccCCCCcccccccccCCCccc-eeeeeccC---CCCCC------ceEEecccCCHHHHHHHHHHHHHHHHHHh Confidence 2 1112111110 1111 111111 11111111 11110 011111110 0112255667778889999 Q ss_pred hhhhhhccCCCcchHHHHHHHHHHHHHHHH----HHHHHHH----HHHHHHhh---c-------------ccceEEEEEe Q lcl|NC_015263. 348 GVSQILFSSDNKTSQGIAMSIATDEQFIFG----VINQLER----WLNRYLLL---N-------------GMSKYFKATM 403 (513) Q Consensus 348 GiS~~Lfn~d~~s~~~~~~SI~~d~~~~~~----~~~~iE~----~~N~~i~~---~-------------~~~~~f~~~~ 403 (513) |+|..-|+-+..+..++ ..|+...+-.+. ..+.+|. .+..++.. . .-.....|.| T Consensus 362 g~s~~~~~~~~~~~~TA-tei~s~~~~l~~t~~~~~~~~~~al~~li~~i~~~~~~~~~~~~g~~~~~~~~~~~~i~v~f 440 (505) T protein:vir:79 362 GLSQGTFTTSPSGIQTA-TEVVTNNSQTYQTRSSYITQVEKTIKALTYAILELASVPSFYADGQARWTGDVDSLDITINF 440 (505) T ss_pred CCChhhcCCCccccchH-HHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCCCCceeEEEEe Confidence 99999997654443322 233323333332 2222222 22222210 0 0012577888 Q ss_pred cCCCCccHHHHHHHHHHHHhcC-CcHHHHHHHHhCCCHHHH---HHHHHHHHHhhCcccccCcccccccccccc Q lcl|NC_015263. 404 LEVTHFSKKEAHDRYITDAQYG-FPVKVYLASLMGIDPVAF---TGLLKVENEMLDLPEIMTPLSSSFNTSGSD 473 (513) Q Consensus 404 l~~T~fn~ke~~~~~~~~~~~G-~~~~~~laa~~G~~p~~~---~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~ 473 (513) -+.=+.++++-++++.++.+-| +|...++....|++..+. +.....|+.. -. |-. +.-|++ T Consensus 441 ~d~i~~d~~~~~~~~~~~v~~Gi~s~e~~l~~~~~~~eeea~~el~ri~~E~~~-----~~-p~~---~~~gg~ 505 (505) T protein:vir:79 441 NDGVFVDQESKRAADLQAVQAQVMPKKQFLMRNYGLDEEEADEWLAQIDAENST-----AE-PEF---NQFGGD 505 (505) T ss_pred CCCCCCCHHHHHHHHHHHHHcCCCCHHHHHHhcCCCChHHHHHHHHHHHHhccc-----cC-CCc---hhccCC Confidence 8888889999999999999999 577777787779998775 3444445421 11 111 112222 No 187 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=24.24 E-value=2.2 Score=18.70 Aligned_cols=411 Identities=15% Similarity=0.128 Sum_probs=166.0 Q ss_pred eeehhhhhhHHHHHHHHHHHHHhhccCcccccccccccchHHHH-HHHhhhccChhHHHHHHHHHHHHHhhc-------- Q lcl|NC_015263. 11 MIDVESISSYSNKRNNRISILRDDNRTPVFGAPVGSLTSSQSKV-RKIVKEYRNEGNQKTLRKVSEDLAVQS-------- 81 (513) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~s~~~s~~~s~d~~-k~~i~~~~P~~n~~~ir~~s~~lY~~s-------- 81 (513) |+| ++-+.=|-.++-|.... +-.+.+ +..|.- |....+.|+.--+|+.+.- T Consensus 1 m~~--------~~~~~~~~~~~~~~~~~----------~~~~~~~~~~i~~--~~~~~~~i~~~~~~Y~g~~~~~~~~~~ 60 (499) T protein:vir:80 1 MIN--------QIIAGVKGVMRRMGLLK----------SLKDVTDHKKVNA--NDEDYKYIDMWKRLYQGNYAEWHNLNY 60 (499) T ss_pred Chh--------HHHHHHHHHHHHhcccc----------chhhhhcCCCCcC--CHHHHHHHHHHHHHhcCCcchhhcccc Confidence 332 22222222222222111 111111 111111 4444455555444443321 Q ss_pred --------------chHHHHHHHHhhcccccceEeeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcc Q lcl|NC_015263. 82 --------------QQYQRLLNFYANMPLYAYSVVPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDI 147 (513) Q Consensus 82 --------------g~~~rlidy~~~mpt~dY~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~ 147 (513) +.-+.+++-.+++..=+-.- +.-. .+..++-+.++ |..=+....+..++..|...|. T Consensus 61 ~~~~~~~~~~~~s~n~~~~iv~~~a~~l~~ep~~--i~~~----d~~~~e~l~~~---~~~n~f~~~~~~~~~~a~~~G~ 131 (499) T protein:vir:80 61 EHNGNPVNRRQLSMNLPKVTAKYMSKLLFNEKVK--INID----DETAEEFVLNV---LKTNGFTKNMERYIEYGEAMGG 131 (499) T ss_pred ccCCCccccceeecchHHHHHHHHHHhhhCCcce--EeeC----CHHHHHHHHHH---HhhccHHHHHHHHHHHHhhcCc Confidence 22233444333322111000 1111 11222222222 3333356677888899999999 Q ss_pred eeEEEEEcCc-ceeeeecCcceeEEEEEECCee-EEEEEeeeccCcc----hhccccHHHH-HHHHHHh----hhhhcc- Q lcl|NC_015263. 148 FYGYVIDDKE-SVMIQQFPNDICKISSVSGGVY-NYVIDLDALVSAD----IVDYYPKEIQ-EAVNKYT----TMKKGN- 215 (513) Q Consensus 148 ~~gy~i~d~~-~~~iq~lp~dyckIsg~~nG~y-~~~fD~syFd~~~----~L~~~p~Ei~-~~y~~Y~----~~k~~~- 215 (513) .|....-|.+ .+-+...|++-+-++...+|.. -++| .+...... .|+.. |+. ..+..|. ..+..+ T Consensus 132 ~~~~~~~D~~~~~~i~~v~a~~~~Pi~~d~~~~~~~~f-~~~~~~~~~~y~~lE~h--~~~~~~~~~y~I~n~~~~~~~~ 208 (499) T protein:vir:80 132 FVIKVYHDGNKNVKVSFATADCMYPLSNDSENVDECLI-ANSFHKNNKYYKLLEWN--EWKGEKEEVYTVTTELYQSDDP 208 (499) T ss_pred EEEEEEECCCCcEEEEEEcCCceEEEEecCCCeEEEEE-EEEEeecCeEEEEEEEE--EecccceeeEEEEEEEEeccCc Confidence 9998887654 4677888888777665544433 3333 22222111 12211 000 0000110 000000 Q ss_pred Cccc------ccCeeecCCce--------EE-EEe------cCccccchhhHHHHHHhHHHHHHHHHHH-----hhHhhh Q lcl|NC_015263. 216 NKSA------SNWYEIQDKNS--------IC-IKI------NESSLTPVPPFAGTFDSIYDIHSFKDLR-----NDKAEL 269 (513) Q Consensus 216 ~~~~------~~W~~L~~~kt--------~~-ik~------~~~~~~~ip~f~~v~~d~~di~~~kdL~-----~~~~~i 269 (513) +..+ .-|=.+++..+ |+ |+. +.+++.|+|-|..+ .++.+.=|.. +..+.. T Consensus 209 ~~lG~~v~l~~~~~~~~~~~~~~~~~~p~f~~~~~~~~N~~~~~splG~S~~~~~----~~lid~lD~~~s~~~~e~~~~ 284 (499) T protein:vir:80 209 NELGGKVSLKLLFNDIEPVVPLPSLTRPTFIYIKPNIANNKNLTSPLGISVYANA----LDTLKTLDLMFDSYYQEFKLG 284 (499) T ss_pred cccCcccchhhhccCcCCceeecCCCccceEeecCCccccccCCCccCCchHhhH----HHHHHHHHHHHHHHHHHHHhc Confidence 0000 00222222211 11 222 33455555555544 3333332222 222222 Q ss_pred hhceeeee--eeccccCCCCCccc--cCHHHHHHHHHHHHHhccccce--EEEecccccccccccccccchhhhhhHHhh Q lcl|NC_015263. 270 QNYKLLIQ--KLETRSSNDNNDFT--LDMPMMNYFHEALSMTVPDNVG--VVTSPMEIDTVSFDKDSSTDDSVEKATKNF 343 (513) Q Consensus 270 ~n~~ii~~--kip~~~~n~~~~~~--vd~~~~~~~~~~ik~~Lp~gv~--~v~sP~~~d~i~ld~~~~~~dtv~~~~~~i 343 (513) ++ +|.+. -++. ..+.+|... -+.+. +.|. .+...-+++.. .+.+| .+. ...-.+.++...+.+ T Consensus 285 ~~-~i~v~~~~l~~-~~~~~g~~~~~~~~~~-~~~~-~~~~~~~~~~~~i~~~~~------~ir-~e~~~~~l~~~l~~i 353 (499) T protein:vir:80 285 KK-KVLVPSSFVKT-AVNLDGSTTQYFDSTD-EAFF-LYQGEQDDNGKAIKDISV------EIR-STEFIESINAMLRIY 353 (499) T ss_pred cc-ceecchhhhhc-cCCCCCCcccCCCccc-ceee-EeeccCCCCcCceeEecC------cCC-hHHHHHHHHHHHHHH Confidence 22 23321 1111 113333322 11111 1111 11111111111 11111 111 011225577777889 Q ss_pred hhhhhhhhhhccCCCcchHH---HHHHHH----HHHHHHHHHHHHHHHHHHHHH---h---h-c--c-cceEEEEEecCC Q lcl|NC_015263. 344 WDNAGVSQILFSSDNKTSQG---IAMSIA----TDEQFIFGVINQLERWLNRYL---L---L-N--G-MSKYFKATMLEV 406 (513) Q Consensus 344 ~~~~GiS~~Lfn~d~~s~~~---~~~SI~----~d~~~~~~~~~~iE~~~N~~i---~---~-~--~-~~~~f~~~~l~~ 406 (513) ...+|+|...|+.+..+..+ ++.+.. +-....-.|...|+..+..++ + . . . ......+.|-+. T Consensus 354 ~~~~g~s~~~fg~~~~g~~TAtei~s~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~v~v~f~d~ 433 (499) T protein:vir:80 354 AMQVGLSAGTFTFDENGLKTATEVVSEKSETYQTKNSHSQLIEQGIKEMIVSILEVGKLIKAYDGDTVELDTITVDFDDS 433 (499) T ss_pred HHhcCCChhhcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCccceEEEeCCC Confidence 99999999999866443222 221211 111111122223333222222 1 1 1 1 123588889888 Q ss_pred CCccHHHHHHHHHHHHhcC-CcHHHHHHHHhCCCHHHH---HHHHHHHHHhhCcccccCcccccccccccccccCCcccc Q lcl|NC_015263. 407 THFSKKEAHDRYITDAQYG-FPVKVYLASLMGIDPVAF---TGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEK 482 (513) Q Consensus 407 T~fn~ke~~~~~~~~~~~G-~~~~~~laa~~G~~p~~~---~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~ 482 (513) -+-++++-++++.++.+-| +|...+++...|++-.+. +..+..|+.. .+ |-+. ++|-.|. T Consensus 434 i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~d~ea~~el~~i~~E~~~-~~-----~~~d---~~g~~ge------- 497 (499) T protein:vir:80 434 IAQDEDTTINRYTTAKNQGMIPLKIALQRAWNITEAEADEWAEMLAKEKQA-EI-----PNND---MTGIFGE------- 497 (499) T ss_pred CCCCHHHHHHHHHHHHHcCCCCHHHHHhhcCCCChHHHHHHHHHHHHHhhc-CC-----CCCC---ccccCCC------- Confidence 8889999999999999999 677777777779986654 3333444321 21 1110 1121110 Q ss_pred CCCCcCCCCc Q lcl|NC_015263. 483 GKENGRPTNE 492 (513) Q Consensus 483 ~~~~grPt~e 492 (513) . | T Consensus 498 ------~--e 499 (499) T protein:vir:80 498 ------E--E 499 (499) T ss_pred ------C--C Confidence 1 0 No 188 >protein:vir:5839 Length: 533 # NCBI annotation: similar to portal vertex protein of head # Family: family:all:1036 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835625;genbank:gi:30044028 Probab=21.77 E-value=2.5 Score=18.36 Aligned_cols=436 Identities=12% Similarity=0.073 Sum_probs=181.6 Q ss_pred hhhhhhHHHHHHHHHHHHHhhccCccc--ccccccccchHHHHHHHh----------hhcc--ChhHHHHHHHHHHHHHh Q lcl|NC_015263. 14 VESISSYSNKRNNRISILRDDNRTPVF--GAPVGSLTSSQSKVRKIV----------KEYR--NEGNQKTLRKVSEDLAV 79 (513) Q Consensus 14 ~~~~~~~~~~~~~~~~i~~~~~~~~~~--~s~~~s~~~s~d~~k~~i----------~~~~--P~~n~~~ir~~s~~lY~ 79 (513) ..|..-|...... ....-=+++.+ +.+-+..+++ .+..++ .-++ -.+|.++|.+.=+-++. T Consensus 1 ~~~~~~w~~~de~---~~~~~~~~~~~~~~~p~~~dG~s--~i~~~~~~~~~~~~~~~~~~gg~~~n~~eLI~~YR~ma~ 75 (533) T protein:vir:58 1 MPSLEKYKKLNEA---VNFTNFLSPMYGMGAPHGAGGSS--MIPINMYHPFATAGYASRFYGGIEFNRFFLYDMYDRMDY 75 (533) T ss_pred CCCcchhhhhhHH---HHHHHhhchhhcccCccCCCCCc--cccCCCCcchhhhhhhhhhhccccccHHHHHHHHHHhhc Confidence 1222222221110 00000011110 0111111110 000000 0000 22344444444444444 Q ss_pred hcchHHHHHHHHhhcc-ccc--ceEeeccchhhhhhcchhHHHHHHHHHHhhcChhHHHHHHHHHHHHhcceeEEEEEc- Q lcl|NC_015263. 80 QSQQYQRLLNFYANMP-LYA--YSVVPFKDISTANENKLKKELATVTEFLSRLNPKYNFSKIVKLAMTVDIFYGYVIDD- 155 (513) Q Consensus 80 ~sg~~~rlidy~~~mp-t~d--Y~I~P~~~~~~~~~~~~~~~y~~v~~~L~k~n~k~~~~~i~~~~l~~g~~~gy~i~d- 155 (513) +.+.+...|+-+++=. ++| ..++-..-+.....+.++ ++ + ++-++.....+.+++..+++|..|+.++.+ T Consensus 76 ~~pEVd~AideIvneaiv~d~~~~pV~v~l~~~e~s~~iK-~k--I---~~lldf~~~~~~~fR~WYVDGriy~Hkiik~ 149 (533) T protein:vir:58 76 TDPLISTVLDIIADECTIPNENGNIVDVVTKDIELAKAIL-SY--L---DYVINIEKNAYPIIRNMIKYGDMFLHILEKG 149 (533) T ss_pred cCcchhhHHHhhhceeeEecCCCceeEeecccccccHHHH-HH--H---HHHhcchhhhhHHHHhhhhcceeEEEeccCC Confidence 4445555554443211 111 111111111111122222 22 2 567788999999999999999999999864 Q ss_pred -Ccce-eeeecCcceeEEEEEECCeeEEEEEeeeccCcchhccccHHHHHHHHHHhhhhhccCcccccCeeecCCceEEE Q lcl|NC_015263. 156 -KESV-MIQQFPNDICKISSVSGGVYNYVIDLDALVSADIVDYYPKEIQEAVNKYTTMKKGNNKSASNWYEIQDKNSICI 233 (513) Q Consensus 156 -~~~~-~iq~lp~dyckIsg~~nG~y~~~fD~syFd~~~~L~~~p~Ei~~~y~~Y~~~k~~~~~~~~~W~~L~~~kt~~i 233 (513) +.|| .++.|+|...+-+-- ..=+..||.=.+. |..+ ..+.-=+.+|.+ +|++ T Consensus 150 ~k~GI~elr~lDPr~i~~vr~------~~t~~eyyvy~~~-----------~~~~--------~s~~~~~kI~~d-aI~y 203 (533) T protein:vir:58 150 SDGTIEKFQVVSPYIFSKRYN------PETDTWYYVITDV-----------YRNV--------VSGYFNEDIPEE-DVIH 203 (533) T ss_pred cccchhhheecCCeeeEEEEe------eccceEEEeeccc-----------cccc--------ccCccccccchh-heee Confidence 4555 778888887765422 1112233321110 0001 000111344542 4443 Q ss_pred E-ecCccc---cchhhHHHHHHhHHHHHHHHHHHhhHhhhhhceeeee----eeccccCCCCCccccCHHHHHHHHHHHH Q lcl|NC_015263. 234 K-INESSL---TPVPPFAGTFDSIYDIHSFKDLRNDKAELQNYKLLIQ----KLETRSSNDNNDFTLDMPMMNYFHEALS 305 (513) Q Consensus 234 k-~~~~~~---~~ip~f~~v~~d~~di~~~kdL~~~~~~i~n~~ii~~----kip~~~~n~~~~~~vd~~~~~~~~~~ik 305 (513) - .+.... ++++++-.+ +-.+-+||.+++-+-=|+|-.. -+=++.|| =|. .+|+|+-..|. T Consensus 204 ~~SGl~d~~~~~iisyLhkA------iKp~NQLkmiEDAlVIYRisRAPeRRvFYIDVGN---lpk---~KAeqYl~~im 271 (533) T protein:vir:58 204 FSHKIDTNFFPYGRSYLESA------RAIWNQLRLMEDALMLYRVVRSVDRRVFYVDVGN---VPP---DKINEYLTNIA 271 (533) T ss_pred eeeccccCCCCceehhhhHH------HHHHHHHHHHHHHHHHHhhcCChhheEEEEeecC---CCc---cCHHHHHHHHH Confidence 2 232233 333332222 3345688888888877777654 22232232 122 24556666666 Q ss_pred Hhccccc------eEE---Eec--------------------cccccccccccccc-chhhhhhHHhhhhhhhhhhhhcc Q lcl|NC_015263. 306 MTVPDNV------GVV---TSP--------------------MEIDTVSFDKDSST-DDSVEKATKNFWDNAGVSQILFS 355 (513) Q Consensus 306 ~~Lp~gv------~~v---~sP--------------------~~~d~i~ld~~~~~-~dtv~~~~~~i~~~~GiS~~Lfn 355 (513) ....+-+ |-| ..- +++++++ +.... -+-|.==...+|.|++|--.=+. T Consensus 272 ~k~kNklvYDa~TGev~ddrk~m~~~sMlEDyWLpRReGgrgTEI~TLp--Gg~lgemeDV~YF~kkLy~ALnVP~sRl~ 349 (533) T protein:vir:58 272 MQYKRDYWVRNNQNQFLGIDNYFSIESILKDYFIPRRGDRRAVEIDILQ--GSKVDLAEDVEYMLNRLISALKVPKAFIG 349 (533) T ss_pred HhcccceEEeccCCeEeeccchhhhhhhHhhhcccccCCCccceeeecC--CCCCCcHHHHHHHHHHHHHHhCCCeeecC Confidence 5554433 222 121 2333322 21110 01122222569999999877776 Q ss_pred CCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh------hcccceEEEEEecCCCCccHHHHHHHHHHH-------H Q lcl|NC_015263. 356 SDNKTSQGIAMSIATDEQFIFGVINQLERWLNRYLL------LNGMSKYFKATMLEVTHFSKKEAHDRYITD-------A 422 (513) Q Consensus 356 ~d~~s~~~~~~SI~~d~~~~~~~~~~iE~~~N~~i~------~~~~~~~f~~~~l~~T~fn~ke~~~~~~~~-------~ 422 (513) .+.+. |....|-.|+.....|+++|..-+...|. .......|++.|....||.+-+..+.|..= - T Consensus 350 ~e~~f--gr~~eItRDEiKF~KFI~rLR~rF~~ll~~qLilk~iit~eew~~~f~~Dn~f~ElKe~Eil~~Ri~~l~~~d 427 (533) T protein:vir:58 350 YEGDV--NAKNTLATQDIKFNNTIKRIQGFFVEELERMVRMNKEFADQDFRLVMNRSNSIVEGERFAVIEQRIGIAERLK 427 (533) T ss_pred CCCCC--ccchhhhHHHHHHHHHHHHHHHHHHHHHhcccccccCcchhheeeeeeccchHHHHHHHHHHHHHHHHHHHhc Confidence 66554 45556666766654566655554443332 222244699999999999987766665432 2 Q ss_pred hcCCcHHHHHHHHhCCCHHHHHHHHHHHHHhhCcccccCcccccccccccccccCCccccCCCCcCCCCcc-----ccc- Q lcl|NC_015263. 423 QYGFPVKVYLASLMGIDPVAFTGLLKVENEMLDLPEIMTPLSSSFNTSGSDIAENAIKEKGKENGRPTNET-----TGN- 496 (513) Q Consensus 423 ~~G~~~~~~laa~~G~~p~~~~~~~~~E~e~L~l~~~~~Pl~TS~T~Sg~~~~~~~~~~~~~~~grPt~et-----~~n- 496 (513) -|.+- -+..--+|.++- +...+.+.-.++++ +-+.|. +..+++-.+.+.....|.|-... -+. T Consensus 428 pyvgk-~yi~k~ILr~td-ei~~q~e~ie~E~~--~~~~~~-------~~~~~e~~~~~~~~~~~~p~~~~~~~~~~~~~ 496 (533) T protein:vir:58 428 GWVRE-DWIYSNILQIPY-DLKPQEEVAEAAGG--GGLFDT-------GGFGEETTPADFLGERGSPIESPRGRTEFDFG 496 (533) T ss_pred chhhH-HHHHHHHhcCCh-hhhHHHHHHHHhhc--CCCCCC-------CCcccccCCcccCccccCcccCCCChhhHhcc Confidence 22221 222334677774 44443333112211 111110 11111111111112222221110 011 Q ss_pred ----ccCCCCCCCCCCccCC-----------C Q lcl|NC_015263. 497 ----KDSDETQRAKDKPANT-----------Q 513 (513) Q Consensus 497 ----~~~~~~~~~~d~~~~~-----------~ 513 (513) .+-|+..+-.++++.- + T Consensus 497 ~~~~~~~~~~~~~~~a~~~~~~~~g~~~~~~~ 528 (533) T protein:vir:58 497 TEGGEELGGELNLGGAFEEFEEETGGGEEELP 528 (533) T ss_pred cCCcccccccccccccchhhhhhcCCcccCCC Confidence 1111111111222220 0 Done!