Query lcl|NC_017974.1_cdsid_YP_006383017.1 [gene=CR3_gp002] [protein=hypothetical protein] [protein_id=YP_006383017.1] [location=1507..3030] Match_columns 507 No_of_seqs 165 out of 345 Neff 8.1 Searched_HMMs 1612 Date Thu Nov 7 15:14:53 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_2 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_2_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:95254 Length: 488 100.0 8E-127 5E-130 712.0 42.7 476 13-507 1-487 (488) 2 protein:vir:108215 Length: 469 100.0 4E-113 2E-116 637.0 43.2 452 15-507 1-469 (469) 3 protein:vir:79233 Length: 526 100.0 5E-111 3E-114 625.2 41.5 443 1-507 1-458 (526) 4 protein:vir:103860 Length: 528 100.0 2E-109 1E-112 616.2 42.9 445 1-507 1-460 (528) 5 protein:vir:99232 Length: 526 100.0 2E-109 1E-112 616.1 42.0 441 1-507 1-455 (526) 6 protein:vir:1986 Length: 512 # 100.0 3E-107 2E-110 604.3 42.5 439 1-507 1-449 (512) 7 protein:vir:77981 Length: 448 100.0 6E-104 4E-107 586.5 37.7 432 1-507 1-444 (448) 8 protein:vir:79511 Length: 448 100.0 2E-103 1E-106 584.1 38.8 438 1-498 1-448 (448) 9 protein:vir:79063 Length: 491 100.0 3E-102 2E-105 577.0 37.5 421 1-507 1-429 (491) 10 protein:vir:99853 Length: 488 100.0 9E-101 5E-104 569.1 39.3 410 7-507 1-417 (488) 11 protein:vir:107880 Length: 491 100.0 2E-100 1E-103 567.4 39.6 422 1-507 1-434 (491) 12 protein:vir:98816 Length: 446 100.0 2.3E-99 1E-102 561.3 37.1 417 1-461 1-446 (446) 13 protein:vir:78161 Length: 355 100.0 1E-85 6.3E-89 486.6 32.7 336 133-507 1-340 (355) 14 protein:vir:102727 Length: 945 99.7 1.8E-17 1.1E-20 112.3 31.0 444 1-507 64-546 (945) 15 protein:vir:3153 Length: 467 # 99.7 6.8E-17 4.2E-20 109.2 33.6 415 60-507 1-455 (467) 16 protein:vir:79772 Length: 648 99.7 9.2E-16 5.7E-19 103.0 38.5 453 1-507 1-517 (648) 17 protein:vir:1380 Length: 422 # 99.7 8.2E-17 5.1E-20 108.8 32.3 413 1-491 1-422 (422) 18 protein:vir:99452 Length: 651 99.7 2.3E-16 1.4E-19 106.3 34.2 471 1-507 1-565 (651) 19 protein:vir:93610 Length: 454 99.7 8E-16 5E-19 103.3 36.0 429 1-507 2-450 (454) 20 protein:vir:6240 Length: 457 # 99.7 3.5E-16 2.1E-19 105.4 32.2 426 1-501 1-457 (457) 21 protein:vir:102080 Length: 429 99.7 3.5E-16 2.2E-19 105.3 31.9 420 1-498 1-429 (429) 22 protein:vir:101648 Length: 518 99.7 1.7E-15 1E-18 101.6 34.7 421 13-507 1-454 (518) 23 protein:vir:105002 Length: 432 99.7 7.3E-16 4.5E-19 103.6 32.6 406 31-507 1-432 (432) 24 protein:vir:107605 Length: 432 99.7 7.3E-16 4.5E-19 103.6 32.6 406 31-507 1-432 (432) 25 protein:vir:102855 Length: 432 99.7 7.3E-16 4.5E-19 103.6 32.6 406 31-507 1-432 (432) 26 protein:vir:105064 Length: 421 99.7 9.3E-16 5.7E-19 103.0 33.1 416 1-505 1-421 (421) 27 protein:vir:1326 Length: 457 # 99.7 3.9E-16 2.4E-19 105.0 30.8 427 1-501 1-457 (457) 28 protein:vir:3843 Length: 397 # 99.7 9.4E-16 5.8E-19 103.0 32.3 396 1-501 1-397 (397) 29 protein:vir:7853 Length: 518 # 99.7 2.3E-15 1.4E-18 100.8 34.3 422 13-507 1-454 (518) 30 protein:vir:102118 Length: 409 99.7 7.7E-16 4.8E-19 103.4 31.5 400 1-493 2-409 (409) 31 protein:vir:1266 Length: 416 # 99.7 6.2E-16 3.8E-19 104.0 30.6 405 1-492 1-416 (416) 32 protein:vir:80644 Length: 551 99.7 9.1E-15 5.7E-18 97.6 36.4 441 1-507 1-533 (551) 33 protein:vir:100150 Length: 437 99.7 1.6E-15 9.8E-19 101.7 30.5 422 1-498 1-437 (437) 34 protein:vir:81152 Length: 411 99.7 3.6E-15 2.2E-18 99.8 32.5 403 1-481 1-411 (411) 35 protein:vir:100249 Length: 431 99.7 2E-15 1.2E-18 101.2 30.9 412 1-502 4-431 (431) 36 protein:vir:4454 Length: 414 # 99.7 5.6E-15 3.5E-18 98.7 33.1 405 1-503 1-414 (414) 37 protein:vir:189 Length: 424 # 99.6 1.9E-15 1.2E-18 101.3 30.1 406 1-502 10-424 (424) 38 protein:vir:96579 Length: 576 99.6 4.3E-14 2.6E-17 93.9 37.3 439 1-507 35-549 (576) 39 protein:vir:483 Length: 413 # 99.6 5.9E-15 3.7E-18 98.6 32.4 405 1-503 1-413 (413) 40 protein:vir:5737 Length: 419 # 99.6 1.5E-15 9.4E-19 101.8 29.0 409 1-507 1-418 (419) 41 protein:vir:95599 Length: 563 99.6 5.6E-14 3.5E-17 93.2 36.8 427 1-507 46-540 (563) 42 protein:vir:99312 Length: 563 99.6 5.6E-14 3.5E-17 93.2 36.8 427 1-507 46-540 (563) 43 protein:vir:1884 Length: 424 # 99.6 2.8E-15 1.7E-18 100.4 29.5 401 13-502 1-424 (424) 44 protein:vir:8418 Length: 409 # 99.6 1E-14 6.4E-18 97.2 32.2 401 1-507 1-408 (409) 45 protein:vir:98396 Length: 441 99.6 1.7E-14 1.1E-17 96.1 32.8 412 1-492 14-441 (441) 46 protein:vir:10362 Length: 432 99.6 1.7E-14 1E-17 96.1 32.2 416 1-503 7-432 (432) 47 protein:vir:1431 Length: 419 # 99.6 2.9E-14 1.8E-17 94.8 33.4 411 1-506 1-419 (419) 48 protein:vir:80796 Length: 574 99.6 9.3E-14 5.8E-17 92.0 36.1 442 1-507 27-560 (574) 49 protein:vir:63755 Length: 547 99.6 8E-14 5E-17 92.4 35.6 441 1-507 1-529 (547) 50 protein:vir:80333 Length: 419 99.6 1.6E-14 1E-17 96.2 31.7 411 1-506 1-419 (419) 51 protein:vir:81072 Length: 432 99.6 4.1E-14 2.5E-17 94.0 33.7 418 1-503 7-432 (432) 52 protein:vir:4509 Length: 424 # 99.6 2E-14 1.2E-17 95.7 31.5 399 1-489 17-424 (424) 53 protein:vir:4337 Length: 434 # 99.6 2.5E-14 1.6E-17 95.1 32.0 420 1-499 1-434 (434) 54 protein:vir:79984 Length: 441 99.6 4.8E-14 3E-17 93.6 33.0 410 1-492 14-441 (441) 55 protein:vir:9408 Length: 441 # 99.6 4.8E-14 3E-17 93.6 33.0 410 1-492 14-441 (441) 56 protein:vir:2683 Length: 412 # 99.6 2.1E-14 1.3E-17 95.6 29.9 405 1-494 1-412 (412) 57 protein:vir:960 Length: 413 # 99.6 5.4E-14 3.4E-17 93.3 32.2 397 1-491 4-413 (413) 58 protein:vir:4194 Length: 540 # 99.6 2.3E-13 1.4E-16 89.9 35.2 438 1-507 1-466 (540) 59 protein:vir:97060 Length: 432 99.6 6.1E-14 3.8E-17 93.0 32.0 414 1-503 7-432 (432) 60 protein:vir:93943 Length: 409 99.6 2.3E-14 1.4E-17 95.3 29.6 405 1-494 1-409 (409) 61 protein:vir:94426 Length: 409 99.6 4.7E-14 2.9E-17 93.7 30.7 404 1-494 1-409 (409) 62 protein:vir:96980 Length: 409 99.6 1E-13 6.3E-17 91.8 30.8 396 1-497 1-409 (409) 63 protein:vir:9359 Length: 348 # 99.5 3.7E-14 2.3E-17 94.2 27.9 342 80-494 1-348 (348) 64 protein:vir:100691 Length: 535 99.5 1E-12 6.5E-16 86.3 34.9 440 1-507 13-532 (535) 65 protein:vir:4156 Length: 542 # 99.5 1.1E-12 6.8E-16 86.2 34.8 443 1-507 1-468 (542) 66 protein:vir:81218 Length: 423 99.5 2.6E-13 1.6E-16 89.6 31.2 412 1-491 1-423 (423) 67 protein:vir:4598 Length: 416 # 99.5 5.6E-13 3.5E-16 87.8 32.5 405 1-493 1-416 (416) 68 protein:vir:81095 Length: 416 99.5 5.6E-13 3.5E-16 87.8 32.5 405 1-493 1-416 (416) 69 protein:vir:101647 Length: 460 99.5 1.7E-12 1E-15 85.2 34.2 418 13-493 1-460 (460) 70 protein:vir:7407 Length: 392 # 99.5 3.1E-13 2E-16 89.1 29.6 385 1-495 3-392 (392) 71 protein:vir:3989 Length: 392 # 99.5 4.7E-13 2.9E-16 88.2 29.4 385 1-495 3-392 (392) 72 protein:vir:1023 Length: 392 # 99.5 4.7E-13 2.9E-16 88.2 29.4 385 1-495 3-392 (392) 73 protein:vir:95378 Length: 406 99.5 7.6E-12 4.7E-15 81.6 34.3 395 1-493 1-406 (406) 74 protein:vir:94666 Length: 723 99.4 8.7E-12 5.4E-15 81.2 33.1 408 28-507 1-436 (723) 75 protein:vir:9507 Length: 395 # 99.4 4.6E-12 2.8E-15 82.8 31.1 385 1-504 1-395 (395) 76 protein:vir:101289 Length: 395 99.4 4.6E-12 2.8E-15 82.8 31.1 385 1-504 1-395 (395) 77 protein:vir:100650 Length: 395 99.4 4.6E-12 2.8E-15 82.8 31.1 385 1-504 1-395 (395) 78 protein:vir:3868 Length: 417 # 99.4 1E-11 6.2E-15 80.9 31.2 397 1-507 1-416 (417) 79 protein:vir:6210 Length: 394 # 99.4 3.4E-11 2.1E-14 78.0 33.3 386 1-507 4-394 (394) 80 protein:vir:8317 Length: 409 # 99.4 1.3E-11 8.1E-15 80.3 29.9 371 1-487 32-409 (409) 81 protein:vir:4854 Length: 386 # 99.4 4.9E-11 3.1E-14 77.1 32.4 385 1-490 1-386 (386) 82 protein:vir:80134 Length: 403 99.4 6.9E-11 4.3E-14 76.3 33.0 388 1-493 1-403 (403) 83 protein:vir:4089 Length: 395 # 99.3 3.6E-11 2.2E-14 77.9 30.7 383 1-496 1-395 (395) 84 protein:vir:9702 Length: 406 # 99.3 2.9E-11 1.8E-14 78.4 30.2 395 1-501 1-406 (406) 85 protein:vir:95965 Length: 385 99.3 2.7E-11 1.7E-14 78.5 29.9 376 1-493 1-385 (385) 86 protein:vir:4952 Length: 386 # 99.3 5.7E-11 3.6E-14 76.7 31.7 383 1-504 1-386 (386) 87 protein:vir:9641 Length: 395 # 99.3 2.7E-11 1.6E-14 78.6 29.6 382 1-489 1-395 (395) 88 protein:vir:100187 Length: 385 99.3 3.8E-11 2.3E-14 77.7 30.1 373 1-484 4-385 (385) 89 protein:vir:98643 Length: 395 99.3 8.4E-11 5.2E-14 75.8 30.8 385 1-491 1-395 (395) 90 protein:vir:100882 Length: 383 99.3 4.3E-11 2.6E-14 77.4 29.0 371 1-482 4-383 (383) 91 protein:vir:78310 Length: 376 99.3 9.1E-11 5.6E-14 75.6 29.5 369 31-490 1-376 (376) 92 protein:vir:104259 Length: 403 99.2 3.3E-10 2E-13 72.6 32.4 385 1-491 1-403 (403) 93 protein:vir:4995 Length: 384 # 99.2 9.1E-11 5.6E-14 75.6 27.6 382 1-466 1-384 (384) 94 protein:vir:94002 Length: 378 99.2 1.8E-10 1.1E-13 74.0 28.7 367 1-493 1-378 (378) 95 protein:vir:8100 Length: 466 # 99.2 5.4E-10 3.4E-13 71.4 29.8 442 1-507 4-466 (466) 96 protein:vir:93867 Length: 378 99.1 6.3E-10 3.9E-13 71.0 27.0 367 1-493 1-378 (378) 97 protein:vir:1661 Length: 378 # 99.1 8E-10 5E-13 70.5 26.8 367 1-493 1-378 (378) 98 protein:vir:4828 Length: 382 # 99.1 1.5E-09 9E-13 69.0 27.9 380 1-490 1-382 (382) 99 protein:vir:94049 Length: 532 99.1 2.2E-09 1.4E-12 68.0 33.6 455 1-507 23-531 (532) 100 protein:vir:5249 Length: 437 # 99.0 5.2E-09 3.2E-12 66.0 31.7 423 11-501 1-437 (437) 101 protein:vir:80040 Length: 461 99.0 9.3E-09 5.8E-12 64.6 32.5 434 1-481 1-461 (461) 102 protein:vir:5691 Length: 344 # 98.9 3.6E-09 2.2E-12 66.9 22.9 335 1-422 1-344 (344) 103 protein:vir:107742 Length: 537 98.9 1.9E-08 1.2E-11 62.9 36.5 454 1-506 25-537 (537) 104 protein:vir:78641 Length: 278 98.9 1.2E-08 7.6E-12 63.9 24.0 273 80-419 1-278 (278) 105 protein:vir:858 Length: 378 # 98.8 3.3E-08 2E-11 61.6 25.6 366 1-493 1-378 (378) 106 protein:vir:94869 Length: 378 98.8 4.2E-08 2.6E-11 61.0 26.5 365 1-493 1-378 (378) 107 protein:vir:96068 Length: 765 98.8 4.6E-08 2.9E-11 60.8 33.6 455 1-507 37-560 (765) 108 protein:vir:98567 Length: 340 98.8 4.7E-08 2.9E-11 60.7 25.1 334 1-423 1-340 (340) 109 protein:vir:100328 Length: 346 98.8 5.5E-08 3.4E-11 60.4 24.8 336 1-424 1-346 (346) 110 protein:vir:103971 Length: 376 98.7 4.3E-08 2.7E-11 61.0 22.3 338 1-424 26-376 (376) 111 protein:vir:79207 Length: 351 98.7 8.5E-08 5.3E-11 59.4 23.7 338 1-424 1-351 (351) 112 protein:vir:6058 Length: 344 # 98.7 8E-08 4.9E-11 59.5 23.2 335 1-422 1-344 (344) 113 protein:vir:1082 Length: 359 # 98.7 1.4E-07 8.8E-11 58.1 29.0 350 1-458 1-359 (359) 114 protein:vir:79150 Length: 368 98.6 1.2E-08 7.7E-12 63.9 17.6 344 1-441 1-368 (368) 115 protein:vir:78191 Length: 351 98.6 1.4E-07 8.8E-11 58.1 22.6 338 1-424 1-351 (351) 116 protein:vir:2013 Length: 344 # 98.6 1.5E-07 9.3E-11 58.0 22.6 338 1-422 1-344 (344) 117 protein:vir:1150 Length: 350 # 98.5 3.2E-07 2E-10 56.2 24.6 332 1-422 1-350 (350) 118 protein:vir:79647 Length: 435 98.4 1E-06 6.2E-10 53.5 27.8 408 1-482 5-435 (435) 119 protein:vir:78749 Length: 337 98.3 1.2E-06 7.2E-10 53.1 23.5 326 1-422 1-337 (337) 120 protein:vir:99563 Length: 862 98.0 7E-06 4.3E-09 48.9 35.8 458 1-507 39-596 (862) 121 protein:vir:107662 Length: 427 98.0 8.9E-06 5.5E-09 48.3 27.3 404 13-489 1-427 (427) 122 protein:vir:267 Length: 348 # 97.9 1E-05 6.5E-09 47.9 26.7 330 1-429 1-348 (348) 123 protein:vir:3743 Length: 345 # 97.8 2E-05 1.3E-08 46.3 24.6 339 1-424 1-345 (345) 124 protein:vir:4698 Length: 251 # 97.7 2.6E-05 1.6E-08 45.7 20.8 248 1-312 1-251 (251) 125 protein:vir:3780 Length: 345 # 97.7 3.1E-05 1.9E-08 45.3 24.6 340 1-424 1-345 (345) 126 protein:vir:78537 Length: 480 97.7 3.3E-05 2E-08 45.2 27.8 447 13-507 1-478 (480) 127 protein:vir:99916 Length: 504 97.6 3.3E-05 2.1E-08 45.1 28.4 450 13-503 1-504 (504) 128 protein:vir:78227 Length: 480 97.4 8.5E-05 5.3E-08 42.9 27.3 443 13-507 1-478 (480) 129 protein:vir:2341 Length: 488 # 97.2 0.00014 8.7E-08 41.7 26.7 445 13-503 1-488 (488) 130 protein:vir:105782 Length: 449 97.1 0.00015 9.6E-08 41.5 26.8 419 1-497 1-449 (449) 131 protein:vir:389 Length: 530 # 97.1 0.00016 9.9E-08 41.4 32.8 462 1-505 1-530 (530) 132 protein:vir:4223 Length: 486 # 97.1 0.00016 1E-07 41.3 26.6 434 13-503 1-486 (486) 133 protein:vir:99072 Length: 479 97.1 0.00016 1E-07 41.3 28.4 431 10-502 1-479 (479) 134 protein:vir:104338 Length: 422 97.1 0.00016 1E-07 41.3 29.7 396 13-480 1-422 (422) 135 protein:vir:98444 Length: 434 97.1 0.00018 1.1E-07 41.1 28.5 398 50-497 1-434 (434) 136 protein:vir:3420 Length: 533 # 96.9 0.00025 1.6E-07 40.3 33.1 459 11-503 1-533 (533) 137 protein:vir:98883 Length: 517 96.9 0.00026 1.6E-07 40.2 26.0 428 1-491 26-517 (517) 138 protein:vir:5839 Length: 533 # 96.9 0.00028 1.7E-07 40.1 21.9 437 1-507 4-527 (533) 139 protein:vir:7768 Length: 484 # 96.8 0.00031 1.9E-07 39.8 26.9 440 13-503 1-484 (484) 140 protein:vir:104082 Length: 485 96.8 0.00033 2.1E-07 39.6 27.4 435 13-502 1-485 (485) 141 protein:vir:5665 Length: 511 # 96.8 0.00036 2.2E-07 39.5 20.1 440 1-491 11-511 (511) 142 protein:vir:2427 Length: 485 # 96.7 0.00038 2.4E-07 39.3 26.7 434 11-502 1-485 (485) 143 protein:vir:3964 Length: 453 # 96.6 0.00047 2.9E-07 38.8 26.9 429 1-493 1-453 (453) 144 protein:vir:94101 Length: 474 96.6 0.00051 3.2E-07 38.6 30.0 422 1-499 1-474 (474) 145 protein:vir:105889 Length: 474 96.6 0.00051 3.2E-07 38.6 30.0 422 1-499 1-474 (474) 146 protein:vir:102602 Length: 456 96.6 0.00052 3.2E-07 38.6 23.5 416 13-504 1-456 (456) 147 protein:vir:105819 Length: 456 96.6 0.00052 3.2E-07 38.6 23.5 416 13-504 1-456 (456) 148 protein:vir:104500 Length: 537 96.5 0.00056 3.5E-07 38.4 22.6 446 1-505 10-537 (537) 149 protein:vir:80680 Length: 441 96.4 0.00062 3.8E-07 38.2 23.0 417 16-501 1-441 (441) 150 protein:vir:80165 Length: 651 96.3 0.00072 4.5E-07 37.8 26.9 465 11-507 1-626 (651) 151 protein:vir:95806 Length: 440 96.2 0.00082 5.1E-07 37.5 26.9 409 27-489 1-440 (440) 152 protein:vir:6382 Length: 553 # 96.2 0.00092 5.7E-07 37.2 32.5 460 1-507 1-550 (553) 153 protein:vir:96738 Length: 505 96.1 0.00096 6E-07 37.1 31.3 454 1-500 1-505 (505) 154 protein:vir:80453 Length: 535 96.1 0.00097 6E-07 37.1 30.2 453 1-507 1-534 (535) 155 protein:vir:93747 Length: 472 96.1 0.001 6.3E-07 37.0 31.4 415 1-501 1-472 (472) 156 protein:vir:8184 Length: 474 # 96.1 0.001 6.4E-07 37.0 25.0 425 18-493 1-474 (474) 157 protein:vir:5961 Length: 503 # 96.0 0.0011 7.1E-07 36.7 31.6 424 1-507 38-503 (503) 158 protein:vir:99522 Length: 470 95.9 0.0013 7.8E-07 36.5 27.3 425 1-491 1-470 (470) 159 protein:vir:7987 Length: 456 # 95.8 0.0015 9.1E-07 36.1 22.1 422 19-504 1-456 (456) 160 protein:vir:79538 Length: 502 95.8 0.0015 9.2E-07 36.1 35.8 450 1-504 11-502 (502) 161 protein:vir:101494 Length: 527 95.7 0.0017 1E-06 35.8 27.9 445 1-507 1-523 (527) 162 protein:vir:105154 Length: 525 95.6 0.0018 1.1E-06 35.6 24.2 456 1-507 1-516 (525) 163 protein:vir:102239 Length: 527 95.6 0.0018 1.1E-06 35.6 27.9 445 1-507 1-523 (527) 164 protein:vir:38 Length: 496 # N 95.4 0.0021 1.3E-06 35.3 23.1 416 1-493 25-496 (496) 165 protein:vir:1587 Length: 508 # 95.4 0.0021 1.3E-06 35.3 27.0 429 1-489 1-508 (508) 166 protein:vir:4782 Length: 522 # 95.3 0.0023 1.4E-06 35.0 23.3 439 1-500 1-522 (522) 167 protein:vir:103177 Length: 533 95.2 0.0025 1.5E-06 34.9 23.2 454 1-507 9-532 (533) 168 protein:vir:1236 Length: 483 # 95.1 0.0027 1.7E-06 34.7 30.1 426 1-495 1-483 (483) 169 protein:vir:94956 Length: 452 94.9 0.0033 2.1E-06 34.2 29.2 400 13-494 1-452 (452) 170 protein:vir:3609 Length: 452 # 94.7 0.0036 2.2E-06 34.0 25.3 416 1-493 1-452 (452) 171 protein:vir:10321 Length: 495 94.7 0.0037 2.3E-06 33.9 31.7 452 1-505 3-495 (495) 172 protein:vir:95899 Length: 474 94.5 0.0043 2.7E-06 33.6 25.7 417 1-496 1-474 (474) 173 protein:vir:96266 Length: 474 94.5 0.0043 2.7E-06 33.6 25.7 417 1-496 1-474 (474) 174 protein:vir:94805 Length: 492 94.4 0.0045 2.8E-06 33.4 26.1 425 1-507 21-491 (492) 175 protein:vir:107112 Length: 478 94.3 0.0049 3E-06 33.3 27.7 415 1-493 1-478 (478) 176 protein:vir:733 Length: 453 # 93.9 0.0061 3.8E-06 32.7 27.1 417 1-505 1-453 (453) 177 protein:vir:2500 Length: 501 # 93.6 0.007 4.4E-06 32.4 25.5 424 13-507 1-498 (501) 178 protein:vir:79043 Length: 479 93.6 0.0071 4.4E-06 32.3 28.9 410 1-491 1-479 (479) 179 protein:vir:95014 Length: 491 93.3 0.0079 4.9E-06 32.1 25.0 430 1-489 1-491 (491) 180 protein:vir:98853 Length: 219 93.2 0.0085 5.3E-06 31.9 17.5 213 169-423 1-219 (219) 181 protein:vir:97336 Length: 492 92.6 0.011 6.6E-06 31.4 28.2 410 1-495 53-492 (492) 182 protein:vir:4898 Length: 502 # 92.5 0.011 6.9E-06 31.3 28.6 424 1-507 49-499 (502) 183 protein:vir:96494 Length: 501 91.9 0.014 8.4E-06 30.8 31.2 446 1-507 1-498 (501) 184 protein:vir:105292 Length: 478 91.5 0.016 9.7E-06 30.5 30.1 418 1-500 1-478 (478) 185 protein:vir:97265 Length: 513 91.0 0.018 1.1E-05 30.2 30.9 446 1-507 1-511 (513) 186 protein:vir:106571 Length: 499 90.5 0.021 1.3E-05 29.8 28.7 431 13-507 1-493 (499) 187 protein:vir:106282 Length: 521 90.3 0.022 1.4E-05 29.7 23.0 445 1-491 13-521 (521) 188 protein:vir:97447 Length: 474 90.3 0.022 1.4E-05 29.7 28.9 412 1-493 32-474 (474) 189 protein:vir:94498 Length: 474 90.3 0.022 1.4E-05 29.7 28.9 412 1-493 32-474 (474) 190 protein:vir:3028 Length: 500 # 90.2 0.022 1.4E-05 29.7 25.0 434 1-489 1-500 (500) 191 protein:vir:9815 Length: 500 # 90.2 0.022 1.4E-05 29.7 25.0 434 1-489 1-500 (500) 192 protein:vir:95113 Length: 474 90.1 0.023 1.4E-05 29.6 30.1 416 1-501 1-474 (474) 193 protein:vir:78393 Length: 489 89.5 0.026 1.6E-05 29.3 25.6 430 1-503 1-489 (489) 194 protein:vir:78907 Length: 518 89.1 0.029 1.8E-05 29.1 31.2 441 1-490 1-518 (518) 195 protein:vir:105461 Length: 470 88.9 0.029 1.8E-05 29.0 27.3 411 13-493 1-470 (470) 196 protein:vir:7208 Length: 524 # 88.7 0.03 1.9E-05 28.9 18.8 446 1-491 1-524 (524) 197 protein:vir:9871 Length: 429 # 88.6 0.031 1.9E-05 28.8 27.9 407 1-492 1-429 (429) 198 protein:vir:95149 Length: 501 87.1 0.041 2.5E-05 28.2 31.0 430 13-507 1-499 (501) 199 protein:vir:106639 Length: 481 86.9 0.043 2.6E-05 28.1 27.3 432 1-490 6-481 (481) 200 protein:vir:95542 Length: 548 85.9 0.049 3E-05 27.8 34.2 459 1-500 1-548 (548) 201 protein:vir:99781 Length: 511 84.6 0.059 3.6E-05 27.3 28.3 419 1-502 49-511 (511) 202 protein:vir:2732 Length: 501 # 84.6 0.059 3.7E-05 27.3 32.2 417 1-496 48-501 (501) 203 protein:vir:94546 Length: 506 84.1 0.063 3.9E-05 27.2 30.1 433 1-498 31-506 (506) 204 protein:vir:104892 Length: 558 82.3 0.078 4.8E-05 26.7 23.4 467 1-505 1-558 (558) 205 protein:vir:94599 Length: 641 82.3 0.078 4.8E-05 26.7 21.7 466 9-507 1-615 (641) 206 protein:vir:94742 Length: 409 81.8 0.083 5.1E-05 26.5 29.5 373 28-458 1-409 (409) 207 protein:vir:9568 Length: 410 # 79.5 0.1 6.4E-05 26.0 27.2 385 21-480 1-410 (410) 208 protein:vir:80959 Length: 499 79.0 0.11 6.8E-05 25.9 32.1 437 1-491 1-499 (499) 209 protein:vir:101806 Length: 516 77.7 0.12 7.6E-05 25.6 21.4 444 1-491 11-516 (516) 210 protein:vir:101189 Length: 516 77.7 0.12 7.6E-05 25.6 21.4 444 1-491 11-516 (516) 211 protein:vir:9751 Length: 422 # 77.5 0.12 7.7E-05 25.6 27.4 380 28-474 1-422 (422) 212 protein:vir:81017 Length: 521 76.5 0.13 8.4E-05 25.4 24.5 451 1-491 8-521 (521) 213 protein:vir:6596 Length: 521 # 74.2 0.16 0.0001 24.9 25.0 450 1-491 8-521 (521) 214 protein:vir:79703 Length: 505 73.6 0.17 0.0001 24.8 26.0 426 1-480 14-505 (505) 215 protein:vir:102330 Length: 451 73.5 0.17 0.00011 24.8 28.2 394 16-480 1-451 (451) 216 protein:vir:1634 Length: 409 # 70.9 0.2 0.00013 24.4 28.9 372 28-458 1-409 (409) 217 protein:vir:78083 Length: 537 69.5 0.22 0.00014 24.2 34.8 413 22-500 1-537 (537) 218 protein:vir:78805 Length: 511 69.5 0.22 0.00014 24.2 27.2 425 1-507 49-508 (511) 219 protein:vir:96366 Length: 511 69.5 0.22 0.00014 24.2 27.2 425 1-507 49-508 (511) 220 protein:vir:9922 Length: 489 # 68.2 0.24 0.00015 24.0 29.7 436 13-498 1-489 (489) 221 protein:vir:106999 Length: 564 67.8 0.25 0.00015 23.9 20.4 460 1-507 1-561 (564) 222 protein:vir:8883 Length: 543 # 67.3 0.25 0.00016 23.8 25.4 452 1-500 1-543 (543) 223 protein:vir:102950 Length: 471 67.2 0.26 0.00016 23.8 28.6 410 11-496 1-471 (471) 224 protein:vir:96240 Length: 511 66.3 0.27 0.00017 23.7 28.4 423 1-507 49-508 (511) 225 protein:vir:98265 Length: 524 65.0 0.29 0.00018 23.5 23.3 449 1-491 17-524 (524) 226 protein:vir:96839 Length: 474 64.9 0.29 0.00018 23.5 29.1 414 1-501 4-474 (474) 227 protein:vir:108049 Length: 524 62.7 0.33 0.0002 23.2 24.6 448 1-491 1-524 (524) 228 protein:vir:94572 Length: 535 54.4 0.51 0.00031 22.2 21.4 447 13-507 1-535 (535) 229 protein:vir:7430 Length: 563 # 53.8 0.52 0.00032 22.1 30.9 457 1-507 1-548 (563) 230 protein:vir:103951 Length: 511 52.8 0.55 0.00034 22.0 29.6 421 1-507 49-508 (511) 231 protein:vir:1538 Length: 535 # 52.3 0.56 0.00035 22.0 25.1 450 1-507 1-528 (535) 232 protein:vir:100598 Length: 516 49.6 0.63 0.00039 21.7 21.6 446 1-494 11-516 (516) 233 protein:vir:96179 Length: 468 49.5 0.64 0.00039 21.7 29.2 411 1-491 1-468 (468) 234 protein:vir:103458 Length: 524 48.8 0.66 0.00041 21.6 22.9 446 1-491 1-524 (524) 235 protein:vir:97171 Length: 512 48.6 0.67 0.00041 21.6 30.6 424 1-507 49-509 (512) 236 protein:vir:9306 Length: 511 # 47.3 0.71 0.00044 21.4 29.8 408 1-507 49-508 (511) 237 protein:vir:78589 Length: 695 46.4 0.74 0.00046 21.3 31.2 449 1-507 1-561 (695) 238 protein:vir:3361 Length: 535 # 43.9 0.83 0.00051 21.0 25.4 451 1-507 1-531 (535) 239 protein:vir:2198 Length: 536 # 41.9 0.91 0.00056 20.8 22.0 453 1-507 1-529 (536) 240 protein:vir:6896 Length: 523 # 38.6 1.1 0.00066 20.5 24.2 447 1-491 15-523 (523) 241 protein:vir:101541 Length: 694 33.2 1.4 0.00085 19.8 33.1 451 1-507 1-561 (694) 242 protein:vir:94709 Length: 522 32.8 1.4 0.00087 19.8 27.3 442 13-496 1-522 (522) 243 protein:vir:96988 Length: 516 27.9 1.8 0.0011 19.2 22.0 442 1-497 1-516 (516) No 1 >protein:vir:95254 Length: 488 # NCBI annotation: Phage conserved protein # Family: family:all:2372 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944885;genbank:gi:158267601;genbank:GeneID:2744039 Probab=100.00 E-value=7.8e-127 Score=711.96 Aligned_cols=476 Identities=25% Similarity=0.380 Sum_probs=413.6 Q ss_pred CccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHHHhhCCceeeeCCC Q lcl|NC_017974. 13 VAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTFVEKAFKDFKVIPG 92 (507) Q Consensus 13 ~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~av~~~~~~~~~~~~ 92 (507) |+.+.+++..++|.|+++|+..+.+.+.+....++.++||+++++++|++|++|+||+++|++|+++|++++|.+++.++ T Consensus 1 ~~~~~~~~~gl~p~rl~~i~~~~~~~~~~~~~~~~~~~Lr~~~~~~ly~~m~~D~hi~s~l~~Rk~av~~~~w~v~p~~~ 80 (488) T protein:vir:95 1 MADITETQESLPPFRMGEVGSLGLKVKNGRIYEEPRQALRFPESIKTFQLMMRDPAVAASVNIIKMFVRKVNWRFVPPKG 80 (488) T ss_pred CCCccccCCCCCHHHHHHHHHHhhccccchhhccchhhhcccchHHHHHHHhhChHHHHHHHHHHHHHhcCCceEecCCC Confidence 99999999999999999999999988888888899999999999999999999999999999999999999998876644 Q ss_pred CC--hhHHHHHHHHHHHHHhccCCCHHHHHHHHHHHHhhcceeeeeeeeccCc-------cccCcceeeeeccccCchhc Q lcl|NC_017974. 93 ES--EESKKAAKFVEYCLKNMEGQTLRQFARDAATFNEYGLSVVEKVYTQVTV-------GEYIGKYKVKNLAFRPQASL 163 (507) Q Consensus 93 ~~--~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l~a~~~Gfs~~Ei~w~~~~~-------~~~~g~~~~~~l~~r~~~~i 163 (507) ++ ..+.+++++|+++++++.. +|.++|++||+|++|||||+|++|+++.+ .+.+|+|.+++|++|||.++ T Consensus 81 ~~~d~~~~~~a~~v~~~l~~~~~-~~~~~i~~~lda~~~G~s~~Eivw~~~~~~~~~~~~~~~dg~~~~~~i~~Rpq~~~ 159 (488) T protein:vir:95 81 KEQDPKMLERADFFNSLMDDMEH-DWADFINSVMSFCTYGFCVNEKVYKKRQGKKGKYQSKFDDGLIGWAKLPIRNQSTL 159 (488) T ss_pred CchhHHHHHHHHHHHHHHhccCc-cHHHHHHHHHHhhcccceeeeeeeeccccccccccccccCCeeeeeeeeecCcccc Confidence 33 3456789999999999864 69999999999999999999999987643 24589999999999999887 Q ss_pred ccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHHHHHHHHH Q lcl|NC_017974. 164 SRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYRAWREK 243 (507) Q Consensus 164 ~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~~~~fK 243 (507) .. |.|+.|++.+++.+|.+....... ..........++.||++|||+|+|+++++||||.||||.|||+|+|| T Consensus 160 ~~---f~~d~d~~l~~~~~~~~~~~~~~~----~~~~~~~~~~~~~lP~~kfi~~~~~~~~g~p~g~gLlr~~~w~~~fK 232 (488) T protein:vir:95 160 DK---WYFDEDFRRVTGVRQNLRNVSHIA----GAINLGERPLTRKLPRAKFMLFKYDDEYGNPEGRSPLLNAYVPWKYK 232 (488) T ss_pred cc---eeeccCCCceeecccccccccccc----cccccccccccccccccceEEEeecCCCCccchhhHHHHHHHHHHHH Confidence 63 678999998888888765543221 11222345678899999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccc--cceee Q lcl|NC_017974. 244 ILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQ--YSMTL 321 (507) Q Consensus 244 ~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~--~~~el 321 (507) ++++++|++|+||||+|+|++++|+++. ..++++++.++++++.++++|+++|+++|+|||.|++++++. +++++ T Consensus 233 ~~~~~~w~~f~Er~g~g~p~~~~p~~~~---~~~~~~e~~~l~~a~~~i~~~~~~~~~ag~iiP~g~~~~~k~~~~e~~l 309 (488) T protein:vir:95 233 VQIEEYEAVGVSRDLVGMPKIGLPPDYL---DENAEPEKKAFVQYCKTVVNDMIANDRAGLIWPRYIDPDTKEDIFEFSL 309 (488) T ss_pred HHHHHHHHHHHHHhcccceeEeeccCCC---CCcccHHHHHHHHHHHHHHHHhhccchhheeeccccccccchhhhhhhc Confidence 9999999999999999999999998654 456678889999999999999999999999999999999865 34555 Q ss_pred eeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC Q lcl|NC_017974. 322 KGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQTIHTQFVQRVNEIILEALNENLLPQLLALND 401 (507) Q Consensus 322 ~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~lN~ 401 (507) .++++ ++.++|.+||+|||++|||+||||||||+++++||+|+|+||++|+++++++|+++|+++||+|||+||+++|+ T Consensus 310 ~~~~~-~~~~~~~~li~~~d~~Isk~iLGqtLT~~~~~~Gs~Al~~vh~ev~~~i~~aDa~~i~~tln~~li~~l~~~Nf 388 (488) T protein:vir:95 310 VSRQG-AKAYDTGSIIDRYSKQIMMAFMSDVLAMGQSKYGSFSLADSKTSLLAMSVDILLKQIKNVINRDLVAQTYALNM 388 (488) T ss_pred ccccc-CCchhHHHHHHHHHHHHHHHHhccccccccCcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 65554 45667999999999999999999999999888999999999999999999999999999999999999999994 Q ss_pred CcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhhcCCCcCCCCCCc Q lcl|NC_017974. 402 MRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMKLLGEDTSRAGDG 481 (507) Q Consensus 402 ~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~~~~~~~~~~~~~ 481 (507) ++..++|+|+|+..+++|++++++++++|+++|++++++++++|++|+||||++.+++++. .+.++.+.++++++ T Consensus 389 --g~~~~~P~~~~~~~e~~Dl~~~ae~~~~L~~~G~~i~~~~~~~~i~e~~gip~~~~~e~~~---~~~~~~~~~~~~~~ 463 (488) T protein:vir:95 389 --WDDEEHVQITYDDIETPDLEAIGSYIQKTVAVGALEVDKELSNKLREHIGLPPADESQPVS---EKLSPNSQSRSGDG 463 (488) T ss_pred --CCCCCccEEEecCcChhhHHHHHHHHHHHHhCCCccccHHHHHHHHHHhCCCCCCCCcccc---ccCCCCCCCCCCcc Confidence 6678899999999999999999999999999999999999999999999999886665443 33333333333333 Q ss_pred ccccCcccccccccCCCCchhhhccC Q lcl|NC_017974. 482 MTKGSSGNGTGKISSTRDNSAANLDN 507 (507) Q Consensus 482 ~~~~~~~~~~~~~~~~~d~~a~~~~~ 507 (507) ..++++++++.++++|+++||.|| T Consensus 464 --~~~~~~~~~~~~~~~~~~~a~~~~ 487 (488) T protein:vir:95 464 --YKTAGEGTAKTPSAKDPSTANKAN 487 (488) T ss_pred --cCCCcccCCcccccccchhhhhcc Confidence 336778889999999999999999 No 2 >protein:vir:108215 Length: 469 # NCBI annotation: gp6 # Family: family:all:2372 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552335;genbank:gi:160700655;genbank:GeneID:5758935 Probab=100.00 E-value=3.6e-113 Score=637.01 Aligned_cols=452 Identities=19% Similarity=0.209 Sum_probs=362.7 Q ss_pred cccccccccccccccccCchhhhHH-HHhhhhcccchhccchhHHHHHHHh-hChHHHHHHHHHHHHHhhCCceeeeCCC Q lcl|NC_017974. 15 KATSEQINVSRMRMSEQGTFALAKV-QVDSERMKAEEIRWPNLIGTVESMK-QDATVATGLDMLYTFVEKAFKDFKVIPG 92 (507) Q Consensus 15 ~~~~~~~~~~~~~~~~~g~~~~~~~-~~~~~~e~~~~lr~~~~~~~y~~m~-~D~~v~s~l~~Rk~av~~~~~~~~~~~~ 92 (507) -+..+++.+|+.+.+++++.|++.. ++...++++++||+++.+++|++|+ +|+||+|+|++||++|++++|++++..+ T Consensus 1 ~~~~~~~~~p~~~~g~~~~~~~~~~~~~~~~~e~~~~lr~~~~~~ly~~m~e~D~~i~s~l~~rk~av~~~~w~v~p~~~ 80 (469) T protein:vir:10 1 MTERVKTAAPVSEAGYVFGSGVVDGWTVWDPFEQTPELQWPQSVAVYSRMDNEDSRVTSLLEAISLPIRSTPWRIRANGA 80 (469) T ss_pred CCCcccCCCCccchhhhhhcccccchhhccccccccccccccchHHHHHHHhhChHHHHHHHHHHHHHhcCCceEecCCC Confidence 2334566788888999998888654 5566788999999999999999997 5999999999999999999888765432 Q ss_pred CChhHHHHHHHHHHHHHhc--------------cCCCHHHHHHHHHH-HHhhcceeeeeeeeccCccccCcceeeeeccc Q lcl|NC_017974. 93 ESEESKKAAKFVEYCLKNM--------------EGQTLRQFARDAAT-FNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAF 157 (507) Q Consensus 93 ~~~~~~~~ae~v~~~l~~~--------------~~~~~~~~l~~~l~-a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~ 157 (507) ++++++++.++|+.. ...+|.++|.++|+ |++|||||+|++|+..+. ..+|++.+++|++ T Consensus 81 ----~~e~~~~~~~~L~~~~~~~~~~~~~~~~~~~~~w~~~l~~~l~~a~~~G~s~~Eivw~~~~~-~~dG~~~~~~l~~ 155 (469) T protein:vir:10 81 ----SDEVTEFVSRNLMVPIDGEDDVRNPGRSRGRFSWAEHLEEVTSPTLQFGHAVFEQVYRPRNQ-SPDGRFWLRKLAP 155 (469) T ss_pred ----CHHHHHHHHHHHHhhhhhhhhhhhhhhhhccccHHHHHHHHHHHhhhhCceeeeeeeecccc-cCCCceeeeeeee Confidence 356777777776532 23468888888776 899999999999998754 3589999999999 Q ss_pred cCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHHH Q lcl|NC_017974. 158 RPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCY 237 (507) Q Consensus 158 r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~ 237 (507) |||++|.+| .|+.|++ +..++|.+.. .......+.....+++||++|||+|+|+++++||||.||||.|| T Consensus 156 rp~~~i~~~---~~~~~~~-l~~~~~~~~~------~~~~~~~~~~~~~~~~lp~~k~i~~~~~~~~g~p~g~gLlr~~~ 225 (469) T protein:vir:10 156 RPQWTISKF---NVAPDGG-LESIEQIAPP------ARTRGSLYVANIAPPEIPVNRLVVYTRNKRPGQWQGKSILRSAY 225 (469) T ss_pred cCcccceee---eeccCCc-eeeeeecCcc------cccccccccCCCCccccccCcEEEEEecCCCCCcccchhHHHHH Confidence 999999764 4566653 5555554321 11122233455678899999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCccccccccc Q lcl|NC_017974. 238 RAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQY 317 (507) Q Consensus 238 ~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~ 317 (507) |+|+||++++++|+.|+||||+|++++++|. +++++++..++ +++.++++|+++++|||.|++|+ T Consensus 226 ~~~~fK~~~~~~w~~f~EryG~P~~vgky~~-------~a~~~ek~~l~----~a~~~~~~g~~a~~iip~~~~ie---- 290 (469) T protein:vir:10 226 KHWLLKDKLLRIEAATAERNGMGIPVGTASS-------ATDEDEVRKMA----ALARSVRGGINAGVGLAQGQILE---- 290 (469) T ss_pred HHHHHHHHHHHHHHHHHHHcCCcceEEecCC-------CCCHHHHHHHH----HHHHHHhcCCceEEEccCCceEE---- Confidence 9999999999999999999999999988764 45555655444 44456667899999999999765 Q ss_pred ceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 318 SMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQTIHTQFVQRVNEIILEALNENLLPQLL 397 (507) Q Consensus 318 ~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~ 397 (507) |++++|+ ..+|.+||+|||++|||+|||||||++ ++|||||+|+||++|+++++++|+++|+++||+|||++|+ T Consensus 291 ---~~ea~g~--~~~~~~li~~~d~~Isk~iLG~tlTs~-~~gGS~a~~~vh~ev~~d~~~sDa~~i~~tln~~li~~l~ 364 (469) T protein:vir:10 291 ---LLGVSGN--LPDIRRAIEGHDRSIALSGLAHFLNLD-GKGGSYALASVLEDPFTQAVHAYATSICRIANQHIIEDLV 364 (469) T ss_pred ---EeecCCC--chHHHHHHHHHHHHHHHHHhccccccc-CccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 4455544 446999999999999999999999995 5689999999999999999999999999999999999999 Q ss_pred HhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhhcCCCcCCC Q lcl|NC_017974. 398 ALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMKLLGEDTSR 477 (507) Q Consensus 398 ~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~~~~~~~~~ 477 (507) ++|+ ++...+|+|+|++.++ +.+.+++++++|+++|+++.++..++|++|+||||++.+++++.+..++.++..+++ T Consensus 365 ~lN~--g~~~~~P~~~~~~~e~-~~~~~a~~i~~l~~~G~~~~~~~~~~~~~e~~gip~~~~~~~~~~~~~~~~~~~~~~ 441 (469) T protein:vir:10 365 DINF--GVDTPAPVLTFDPIGS-RQDLTAAAVKLLYDAGVFDDDPAVKRAIRQRFNLPSELNDTPSAEPEEPAAVPNQSA 441 (469) T ss_pred HhcC--CCCCCccEEEecCCCC-cHHHHHHHHHHHHhcCCccCccccHHHHHHHhCCCCCCCCcccccchhcccCCCCCc Confidence 9994 6678899999998875 447889999999999999888888999999999999999888766666554444444 Q ss_pred CCCcccccCcccccccccCCCCchhhhccC Q lcl|NC_017974. 478 AGDGMTKGSSGNGTGKISSTRDNSAANLDN 507 (507) Q Consensus 478 ~~~~~~~~~~~~~~~~~~~~~d~~a~~~~~ 507 (507) .+++ ..+++.+++..+++.+.++...|- T Consensus 442 ~~~~--~~~~~~~~~~~~~~~~~~~~l~da 469 (469) T protein:vir:10 442 APAR--TRSSGNADARARAPKADQGVLFDA 469 (469) T ss_pred cccc--cCCCCCcccccccCCChHHhhccC Confidence 4333 335566666667777766666666 No 3 >protein:vir:79233 Length: 526 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469155;genbank:gi:157834998;genbank:GeneID:5648814 Probab=100.00 E-value=5.3e-111 Score=625.17 Aligned_cols=443 Identities=12% Similarity=0.074 Sum_probs=341.3 Q ss_pred Ccccccc-----cccccCccccccccccccccccccCchhhhHHH---HhhhhcccchhccchhHHHHHHHh-hChHHHH Q lcl|NC_017974. 1 MSRKRNR-----NRSVKVAKATSEQINVSRMRMSEQGTFALAKVQ---VDSERMKAEEIRWPNLIGTVESMK-QDATVAT 71 (507) Q Consensus 1 ~~~k~~~-----~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~---~~~~~e~~~~lr~~~~~~~y~~m~-~D~~v~s 71 (507) |++=--. +.... .++.+..+...+.+++.+++.|+|+.+ +...++.++..+ .+++|++|+ +|+||+| T Consensus 1 ~~~~~d~~g~p~~~~~~-~~~~~~~~~~~~~~~~~~~~~gltp~~l~~il~~a~~gd~~~---~~~L~edm~e~D~~i~s 76 (526) T protein:vir:79 1 MAQIVDVYGNPIRPQQL-REPQTSRLAGLAKEFAQHPAKGLTPAKLARILVEAEQGNLQA---QAELFMDMEERDAHLFA 76 (526) T ss_pred CCeeeCCCCCccCcccc-chhhhhhhhhhhhhcccCCCCCcCHHHHHHHHHHhhCCCHHH---HHHHHHHHHhhChHHHH Confidence 5543211 11111 122222334456778889999999876 555677776655 467899998 6999999 Q ss_pred HHHHHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHHHHHhhcceeeeeeeeccCccccCccee Q lcl|NC_017974. 72 GLDMLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAATFNEYGLSVVEKVYTQVTVGEYIGKYK 151 (507) Q Consensus 72 ~l~~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l~a~~~Gfs~~Ei~w~~~~~~~~~g~~~ 151 (507) +|++||++|++++|.+++..++++++++++++|+++|+++. +|.++|++||+|++|||||+||+|+.. +|.+. T Consensus 77 ~l~~Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~--~~~~~i~~~ldA~~~G~s~~Ei~w~~~-----~g~~~ 149 (526) T protein:vir:79 77 EMSKRKRAILGLDWAVEPPRNASAAEKADADYLHELLLDLE--GLEDLLLDALDGIGHGYSCIELEWALQ-----GREWM 149 (526) T ss_pred HHHHHHHHHhCCCceEecCCCCChHHHHHHHHHHHHHhccc--CHHHHHHHHHhhhhhcceeEEEEEeec-----CCcee Confidence 99999999999999888777778899999999999998764 699999999999999999999999874 57899 Q ss_pred eeeccccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccch Q lcl|NC_017974. 152 VKNLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVS 231 (507) Q Consensus 152 ~~~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~g 231 (507) +++|++|||+ +|.|+.+++..+..+ .....|+++|++|||+|+|++++++|||.| T Consensus 150 ~~~l~~r~~~------~F~~~~~~~~~l~~~-------------------~~~~~g~~l~~~k~iv~~~~~~~g~p~g~g 204 (526) T protein:vir:79 150 PLAFHHRPQS------WFQLNPEDQNELRLR-------------------DNSPAGEALQPFGWIIHRPRARSGYVARSG 204 (526) T ss_pred EEEeeeeccc------ceEeccCCCcEEEec-------------------CCCCCceeecCCceEEEeecCCcCCccccc Confidence 9999999985 456666665443322 234568899999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCccc Q lcl|NC_017974. 232 PLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTK 311 (507) Q Consensus 232 ll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~ 311 (507) |||.|||+|+||++++++|+.|+||||+|++++++| .+++++++.++++++.++ |+++++|||.|++ T Consensus 205 Llr~~~w~~~fK~~~~~~w~~F~E~yG~P~~igky~-------~~a~~~ek~~L~~av~~i------~~da~~iiP~~~~ 271 (526) T protein:vir:79 205 LFRVLAWPYLFRHYATSDLAEMLEIYGLPIRLGKYP-------PGTADEEKATLLRAVTGL------GHAAAGIIPETMA 271 (526) T ss_pred hHHHHHHHHHHHHhhHHHHHHHHHHcCCceEEEecC-------CCCCHHHHHHHHHHHHHH------hcCcEEEecCCce Confidence 999999999999999999999999999999888765 456777888888887776 5679999999997 Q ss_pred ccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhccccccc--ccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 312 DNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVG--NDKGGSYNLSESKQTIHTQFVQRVNEIILEALN 389 (507) Q Consensus 312 i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~--~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln 389 (507) |+| ++++++ +...|..||+|||++|||+||||||||+ ++++||||+|+||++|+++++++|+++|++||| T Consensus 272 ie~-------~ea~~~-~~~~f~~li~~~d~~Isk~iLGqtlTs~~~~g~~gS~a~g~vh~~v~~di~~aDa~~i~~tln 343 (526) T protein:vir:79 272 IDF-------QQAAQG-SSEPFLAMMRQSEDAISKAVLGGTLTSTTSQSGGGAFALGQVHNEVRHDILASDARQLAATLS 343 (526) T ss_pred eEE-------eecCCC-CHHHHHHHHHHHHHHHHHHHhhhhhccccccCcchhhhhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 654 455443 3457999999999999999999999986 456899999999999999999999999999999 Q ss_pred HHHHHHHHHhCCCcC-CccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhh Q lcl|NC_017974. 390 ENLLPQLLALNDMRL-PETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELM 468 (507) Q Consensus 390 ~~li~~l~~lN~~~~-~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~ 468 (507) +|||++|+++|+... +..++|+|+|+..+++|++++++++++|+++|+.++ .+|++++||||++.+++++..... T Consensus 344 ~~Li~~l~~~N~~~~~~~~~~p~~~~~~~e~eDl~~~a~~~~~L~~~G~~i~----~~~i~e~~gip~~~~~e~~l~~~~ 419 (526) T protein:vir:79 344 RDLLWPLLVLNRPGSPDVRRAPRLVFDLREQADITSMAQSIPALVNVGLEIP----SAWVYDKLGIPQPAKNEPVLRPAA 419 (526) T ss_pred HHHHHHHHHhCCCCcCCccccceEEeCCCCcccHHHHHHHHHHHHhCCCcCC----HHHHHHHhCCCCCCCchhhccccC Confidence 999999999996433 335789999999999999999999999999999887 479999999999887765533222 Q ss_pred hcCCCcCCCCCCcccccCcccccccccCCCCc---hhhhccC Q lcl|NC_017974. 469 KLLGEDTSRAGDGMTKGSSGNGTGKISSTRDN---SAANLDN 507 (507) Q Consensus 469 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~---~a~~~~~ 507 (507) +..+....+ +......+... ......++. ..++++. T Consensus 420 ~~~~~~~~~-~~~~~~~~~~~--~~~~~~~~~~d~~l~~~~~ 458 (526) T protein:vir:79 420 QPAILSRQH-GQRVAALATIV--GPRYGDQQALDKALADLPA 458 (526) T ss_pred Ccccccccc-ccccccccccc--cccCchhhHHHHHHHHHHH Confidence 211111111 11111111111 111111111 1112111 No 4 >protein:vir:103860 Length: 528 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938234;genbank:gi:38229139;genbank:GeneID:2648175 Probab=100.00 E-value=2.3e-109 Score=616.18 Aligned_cols=445 Identities=13% Similarity=0.080 Sum_probs=342.5 Q ss_pred Ccccc-----cccccccCccccccccccccccccccCchhhhHHH---HhhhhcccchhccchhHHHHHHHh-hChHHHH Q lcl|NC_017974. 1 MSRKR-----NRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQ---VDSERMKAEEIRWPNLIGTVESMK-QDATVAT 71 (507) Q Consensus 1 ~~~k~-----~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~---~~~~~e~~~~lr~~~~~~~y~~m~-~D~~v~s 71 (507) |++=- +.++. ...++....+..++..++.+++.|+|+.+ +...++.++..+ .+++|++|+ +|+||+| T Consensus 1 ~~~~~d~~g~p~~~~-~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~il~~a~~gd~~~---~~~L~~~m~e~D~~i~s 76 (528) T protein:vir:10 1 MAAIVDIYGNPLRTQ-QLRKQQTAHLAGLAKEFANHPAKGLTPAKLAHILIEAEQGHLQA---QAELFMDMEERDAHLFA 76 (528) T ss_pred CCeeECCCCCccccc-cccchhhhhhhhhhhhhcccCCCCCCHHHHHHHHHhhhCCCHHH---HHHHHHHHHhhChHHHH Confidence 55422 22222 11222223445567788899999999976 445666666544 567799997 6999999 Q ss_pred HHHHHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHHHHHhhcceeeeeeeeccCccccCccee Q lcl|NC_017974. 72 GLDMLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAATFNEYGLSVVEKVYTQVTVGEYIGKYK 151 (507) Q Consensus 72 ~l~~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l~a~~~Gfs~~Ei~w~~~~~~~~~g~~~ 151 (507) +|++||++|++++|.+++..++++++++++++|+++|+++. +|+++|.+||+|++|||||+|++|+.+ +|.+. T Consensus 77 ~l~~Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~--~f~~~i~~~lda~~~G~s~~Ei~w~~~-----~g~~~ 149 (528) T protein:vir:10 77 EMSKRKRAVLGLDWTIEPPRNASAAEKADAEYLHELLLDLE--GIEDLMLDCMDGVGHGYSAIELDWSLQ-----GREWL 149 (528) T ss_pred HHHHHHHHHhcCCceEecCCCCCHHHHHHHHHHHHHHhCCc--cHHHHHHHHHhhhhhcceeEEEEEeec-----CCcee Confidence 99999999999999888776778889999999999998764 699999999999999999999999864 57899 Q ss_pred eeeccccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccch Q lcl|NC_017974. 152 VKNLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVS 231 (507) Q Consensus 152 ~~~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~g 231 (507) ++++.+|||+ +|.|+.+++..+.++ .+...+++||++||++|+|++++++|||.| T Consensus 150 ~~~~~~r~~~------~f~~~~~~~~~l~~~-------------------~~~~~g~~l~~~k~iv~~~~~~~g~p~g~g 204 (528) T protein:vir:10 150 PQAFDHRPQS------WFQLNPDDQDELRLR-------------------DNSIAGEVLQPFGWIMHKPRSRSGYVARSG 204 (528) T ss_pred EEEeeeeccc------ceeeccCCCcEEecc-------------------CCCCCceeecCCCeEEEeecCCCCCccccc Confidence 9999999984 466777665443322 234468899999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCccc Q lcl|NC_017974. 232 PLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTK 311 (507) Q Consensus 232 ll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~ 311 (507) |||.|||+|+||++++++|+.|+||||+|++++++| .+++++++.++++++.++ |+++++|||.|++ T Consensus 205 Llr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~-------~~a~~~ek~~L~~al~~i------~~~~~~iiP~~~~ 271 (528) T protein:vir:10 205 LFRVLAWPYLFKHYSTADLAEMLEIYGLPIRLGKYP-------PGTPDEEKVTLLRAVTGL------GHAAAGIIPESMS 271 (528) T ss_pred hHHHHHHHHHHHHhhHHHHHHHHHHcCCCeEEEecC-------CCCCHHHHHHHHHHHHHH------hhCcEEEecCCce Confidence 999999999999999999999999999999888765 456777888888887766 5678999999997 Q ss_pred ccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhccccccc--ccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 312 DNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVG--NDKGGSYNLSESKQTIHTQFVQRVNEIILEALN 389 (507) Q Consensus 312 i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~--~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln 389 (507) |+ |++++++ +...|..|++|||++|||+||||||||+ ++++||||+|+||++|+++++++|+++|++||| T Consensus 272 ie-------~~ea~~~-~~~~f~~li~~~d~~Isk~iLGqtlTs~~~~g~~gS~Alg~vh~~v~~di~~aDa~~i~~tln 343 (528) T protein:vir:10 272 ID-------FQEASKG-SAEPFMAMMRWCDDSMSKAILGGTLTSQTSESGGGAYALGQVHNEVRHDLLAADARQLAATLS 343 (528) T ss_pred eE-------EeecCCC-ChhHHHHHHHHHHHHHHHHHhhhhhhccccccccchhhhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 65 4555443 3457999999999999999999999995 346799999999999999999999999999999 Q ss_pred HHHHHHHHHhCCCcC-CccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhh Q lcl|NC_017974. 390 ENLLPQLLALNDMRL-PETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELM 468 (507) Q Consensus 390 ~~li~~l~~lN~~~~-~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~ 468 (507) +|||++|+++|+... +...+|+|+|+..+++|++++++++++|+++|+.++ ++|++++||||.+.++++...... T Consensus 344 ~~li~~l~~~N~~~~~~~~~~p~~~~~~~e~eDl~~~a~~~~~L~~~G~~i~----~~~i~e~~gip~p~~~e~~~~~~~ 419 (528) T protein:vir:10 344 RDLLWPLLVLNRSGNLDARRAPRLVFDLKDRADLAAMATSLPPLVKLGVQVP----VNWVQEQLGIPLPANGEAVLGDQA 419 (528) T ss_pred HHHHHHHHHhCCCCCCCccccceEEecCCCcccHHHHHHHHHHHHhCCCCCC----HHHHHHHhCCCCCCCCcccccCCC Confidence 999999999995321 235689999999999999999999999999999877 589999999999887776543333 Q ss_pred hcCCCcCCCCCCcccccCcccccccccCCCC---chhhhccC Q lcl|NC_017974. 469 KLLGEDTSRAGDGMTKGSSGNGTGKISSTRD---NSAANLDN 507 (507) Q Consensus 469 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d---~~a~~~~~ 507 (507) +..+...++.........+ +.........+ ...+.+.. T Consensus 420 ~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~d~~~~~~~~ 460 (528) T protein:vir:10 420 GAGIAQLSRRPGPRIAALA-QVIGPRYRDQEALDQVLASLPA 460 (528) T ss_pred cccccccCccccccccccc-ccccccccccchHHHHHHHHHH Confidence 2222222211111111111 11111111111 11111111 No 5 >protein:vir:99232 Length: 526 # NCBI annotation: putative portal protein # Family: family:all:313 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950451;genbank:gi:119953652;genbank:GeneID:4643092 Probab=100.00 E-value=2.4e-109 Score=616.07 Aligned_cols=441 Identities=12% Similarity=0.068 Sum_probs=342.5 Q ss_pred Ccccc-----cccccccCccccccccccccccccccCchhhhHHH---HhhhhcccchhccchhHHHHHHHh-hChHHHH Q lcl|NC_017974. 1 MSRKR-----NRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQ---VDSERMKAEEIRWPNLIGTVESMK-QDATVAT 71 (507) Q Consensus 1 ~~~k~-----~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~---~~~~~e~~~~lr~~~~~~~y~~m~-~D~~v~s 71 (507) |++=- +.+....+ ++....+..++..++.+++.|+|+.+ +...++.++..+ .+++|++|+ +|+||+| T Consensus 1 ~~~~~d~~g~p~~~~~~~-~~~~~~~~~~~~~~~~~~~~gltp~~l~~iLr~a~~gd~~~---~~~L~e~m~e~D~~i~s 76 (526) T protein:vir:99 1 MAQIVDVYGNPIRTQQLR-EPQTSRLAGLAKEFAQHPAKGLTPAKLARILVEAEQGNLQA---QAELFMDMEERDAHLFA 76 (526) T ss_pred CCeeECCCCCcccccccc-chhhhhhhhhhhhhcccCcCCCCHHHHHHHHHhhhCCCHHH---HHHHHHHHHhhChHHHH Confidence 65533 22222211 22223344466778899999999876 555677776655 566799997 5999999 Q ss_pred HHHHHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHHHHHhhcceeeeeeeeccCccccCccee Q lcl|NC_017974. 72 GLDMLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAATFNEYGLSVVEKVYTQVTVGEYIGKYK 151 (507) Q Consensus 72 ~l~~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l~a~~~Gfs~~Ei~w~~~~~~~~~g~~~ 151 (507) +|++||++|++++|.+++..++++++++++++|+++|+++. +|+++|++||+|++|||||+|++|+.. +|.+. T Consensus 77 ~l~~Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~--~~~~~i~~~lda~~~G~s~~Eivw~~~-----~g~~~ 149 (526) T protein:vir:99 77 EMSKRKRAILGLDWAVEPPRNASAAEKADADYLHELLLDLE--GLEDLLLDALDGIGHGYSCIELEWALQ-----GREWM 149 (526) T ss_pred HHHHHHHHHhCCCceEecCCCCCHHHHHHHHHHHHHHhccc--CHHHHHHHHHHhhhhcceeEEEEEeec-----CCcee Confidence 99999999999998888776778899999999999998764 699999999999999999999999874 58899 Q ss_pred eeeccccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccch Q lcl|NC_017974. 152 VKNLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVS 231 (507) Q Consensus 152 ~~~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~g 231 (507) +++|++|||+ +|.|+.+++..+..+ .+...|+++|++|||+|+|++++++|||.| T Consensus 150 ~~~l~~r~~~------~f~~~~~~~~~l~~~-------------------~~~~~g~~l~~~k~i~~~~~~~~g~p~g~g 204 (526) T protein:vir:99 150 PLAFHHRPQS------WFQLNPEDQNELRLR-------------------DNSPAGEALQPFGWIIHRPRARSGYVARSG 204 (526) T ss_pred EEEeeeeccc------ceeeccCCCcEEEec-------------------CCCCCceeecCCCeEEEeecCCcCCccccc Confidence 9999999985 456666665443332 234568899999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCccc Q lcl|NC_017974. 232 PLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTK 311 (507) Q Consensus 232 ll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~ 311 (507) |||.|||+|+||++++++|+.|+||||+|++++++| .+++++++.++++++.++ |+++++|||.|++ T Consensus 205 Llr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~-------~~a~~~ek~~L~~av~~i------~~d~~~iiP~~~~ 271 (526) T protein:vir:99 205 LFRVLAWPYLFRHYATSDLAEMLEIYGLPIRLGKYP-------PGTADEEKATLLRAVTGL------GHAAAGIIPETMA 271 (526) T ss_pred hHHHHHHHHHHHHhhHHHHHHHHHHcCCceEEEecC-------CCCCHHHHHHHHHHHHHH------hhCcEEEecCCce Confidence 999999999999999999999999999999888765 456777888888887766 4678999999997 Q ss_pred ccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhccccccc--ccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 312 DNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVG--NDKGGSYNLSESKQTIHTQFVQRVNEIILEALN 389 (507) Q Consensus 312 i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~--~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln 389 (507) |+| ++++++ +...|..|++|||++|||+||||||||+ ++++||||+|+||++|+++++++|+++|++||| T Consensus 272 ie~-------~ea~~~-~~~~f~~li~~~d~~Isk~iLGqtlTs~~~~g~~gS~a~g~vh~~v~~di~~aDa~~i~~tln 343 (526) T protein:vir:99 272 IDF-------QQAAQG-SSEPFLAMMRQSEDAISKAVLGGTLTSTTSQSGGGAFALGQVHNEVRHDLLASDARQLAATLS 343 (526) T ss_pred eEE-------eecCCC-CHHHHHHHHHHHHHHHHHHHhhhhhccccccCcchhhhHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 654 455443 3456999999999999999999999986 456799999999999999999999999999999 Q ss_pred HHHHHHHHHhCCCcC-CccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhh Q lcl|NC_017974. 390 ENLLPQLLALNDMRL-PETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELM 468 (507) Q Consensus 390 ~~li~~l~~lN~~~~-~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~ 468 (507) +|||++|+++|+... +..++|+|+|+..+++|++++++++++|+++|+.++ .+|++++||||++.+++.+..... T Consensus 344 ~~Li~~l~~~N~~~~~~~~~~p~~~~~~~e~eDl~~~a~~~~~L~~~G~~i~----~~~i~e~~Gip~~~~~e~~l~~~~ 419 (526) T protein:vir:99 344 RDLLWPLLVLNRPGSPDVRRAPRLVFDLREQADITSMAQSIPALVNVGLEIP----SAWVYDKLGIPQPAKNEPVLRSAA 419 (526) T ss_pred HHHHHHHHHhCCCCcCCccccceEEeCCCCcccHHHHHHHHHHHHhCCCccC----HHHHHHHhCCCCCCCcccccCCCC Confidence 999999999996433 335789999999999999999999999999999877 479999999999888776544332 Q ss_pred hcCCCcCCCCCCcccccCcccccccccCCCCchh--hhccC Q lcl|NC_017974. 469 KLLGEDTSRAGDGMTKGSSGNGTGKISSTRDNSA--ANLDN 507 (507) Q Consensus 469 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~a--~~~~~ 507 (507) +..+....+ +......+. . ..+..++..+ ..++. T Consensus 420 ~~~~~~~~~-~~~~~~~~~--~--~~~~~~~~~~~d~~l~~ 455 (526) T protein:vir:99 420 QPAILSRQH-GQRVAALAT--I--VGPRYGDQQALDKALAD 455 (526) T ss_pred CCccccccc-ccccccccc--c--ccccCcchhhHHHHHHH Confidence 221111111 111111111 1 1111222211 11111 No 6 >protein:vir:1986 Length: 512 # NCBI annotation: Hypothetical protein # Family: family:all:313 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050633;genbank:gi:9633520;genbank:GeneID:2636304 Probab=100.00 E-value=3.4e-107 Score=604.29 Aligned_cols=439 Identities=13% Similarity=0.063 Sum_probs=338.4 Q ss_pred Ccccc-----cccccccCccccccccccccccccccCchhhhHHH---HhhhhcccchhccchhHHHHHHHh-hChHHHH Q lcl|NC_017974. 1 MSRKR-----NRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQ---VDSERMKAEEIRWPNLIGTVESMK-QDATVAT 71 (507) Q Consensus 1 ~~~k~-----~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~---~~~~~e~~~~lr~~~~~~~y~~m~-~D~~v~s 71 (507) |++=- +.+... ..++....+..++.+++++++.|+|+.+ +...++.++..++. ++|++|+ +|+||+| T Consensus 1 m~~~~d~~g~p~~~~~-~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~iL~~a~~gd~~~~~---~L~~dm~~~D~hi~s 76 (512) T protein:vir:19 1 MGRILDISGQPFDFDD-EMQSRSDELAMVMKRTQEHPSSGVTPNRAAQMLRDAERGDLTAQA---DLAFDMEEKDTHLFS 76 (512) T ss_pred CcceeCCCCCcccccc-ccccccchhcccchhhccccccCCCHHHHHHHHHHhhCCCHHHHH---HHHHHHHhhChHHHH Confidence 66533 222111 1222223344456788899999999876 55567777776654 4466665 6999999 Q ss_pred HHHHHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHHHHHhhcceeeeeeeeccCccccCccee Q lcl|NC_017974. 72 GLDMLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAATFNEYGLSVVEKVYTQVTVGEYIGKYK 151 (507) Q Consensus 72 ~l~~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l~a~~~Gfs~~Ei~w~~~~~~~~~g~~~ 151 (507) +|++||++|++++|.+++..++++++++++++|+++|+++. +|+++|++||+|++|||||+||+|+.. +|.+. T Consensus 77 ~l~~Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~--~f~~~~~~lldA~~~G~s~~Ei~w~~~-----~g~~~ 149 (512) T protein:vir:19 77 ELSKRRLAIQALEWRIAPARDASAQEKKDADMLNEYLHDAA--WFEDALFDAGDAILKGYSMQEIEWGWL-----GKMRV 149 (512) T ss_pred HHHHHHHHHhCCCceEecCCCCCHHHHHHHHHHHHHHhcCC--CHHHHHHHHHhhhhhcceeeeeEeeee-----CCcee Confidence 99999999999998887766678899999999999998764 699999999999999999999999864 67899 Q ss_pred eeeccccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccch Q lcl|NC_017974. 152 VKNLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVS 231 (507) Q Consensus 152 ~~~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~g 231 (507) +++|.+|||+ +|.|+.+++..+..+ .....|++||++||++|+|++++++|||.| T Consensus 150 ~~~~~~r~~~------~f~~~~~~~~~lr~~-------------------~~~~~G~~l~~~k~i~~~~~~~~g~p~g~g 204 (512) T protein:vir:19 150 PVALHHRDPA------LFCANPDNLNELRLR-------------------DASYHGLELQPFGWFMHRAKSRTGYVGTNG 204 (512) T ss_pred eeeeeeeccc------cceeccCCCcEEEec-------------------CCCCCceeecCCceEEEeccCCCCCccccc Confidence 9999999985 455666655433222 234568889999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCccc Q lcl|NC_017974. 232 PLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTK 311 (507) Q Consensus 232 ll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~ 311 (507) |||.|||+|+||++++++|+.|+||||+|++++++| .+++++++.++++++.++ |+++++|||.|++ T Consensus 205 Llr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~-------~~a~~~ek~~L~~al~~~------~~~a~~iiP~~~~ 271 (512) T protein:vir:19 205 LVRTLIWPFIFKNYSVRDFAEFLEIYGLPMRVGKYP-------TGSTNREKATLMQAVMDI------GRRAGGIIPMGMT 271 (512) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHcCCCeeEEecC-------CCCCHHHHHHHHHHHHHH------hhCcEEEecCCce Confidence 999999999999999999999999999999887755 456777888888877776 5679999999997 Q ss_pred ccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 312 DNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQTIHTQFVQRVNEIILEALNEN 391 (507) Q Consensus 312 i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~ 391 (507) |+| +++++ ++...|..|++|||++|||+||||||||+++++||||+|+||++|+++++++|+++|++|||+| T Consensus 272 ie~-------~ea~~-~~~~~y~~li~~~d~~Isk~iLGqtlTs~~g~~Gs~a~~~vh~ev~~di~~aDa~~i~~tln~~ 343 (512) T protein:vir:19 272 LDF-------QSAAD-GQSDPFMAMIGWAEKAISKAILGGTLTTEAGDKGARSLGEVHDEVRREIRNADVGQLARSINRD 343 (512) T ss_pred EEE-------eecCC-CCHHHHHHHHHHHHHHHHHHHhhhhhcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 654 45544 3446799999999999999999999999988899999999999999999999999999999999 Q ss_pred HHHHHHHhCCCcC-CccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhhc Q lcl|NC_017974. 392 LLPQLLALNDMRL-PETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMKL 470 (507) Q Consensus 392 li~~l~~lN~~~~-~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~~ 470 (507) ||++|+++|+... +..++|+|+|+..+++|++.+++.+++|+ +|+.++ ++|++++||||++.+++.......+. T Consensus 344 li~~l~~~N~~~~~~~~~~p~~~f~~~e~eDl~~~a~~~~~l~-~G~~i~----~~~i~e~~Gip~~~~~e~~~~~~~~~ 418 (512) T protein:vir:19 344 LIYPLLALNSDSTIDINRLPGIVFDTSEAGDITALSDAIPKLA-AGMRIP----VSWIQEKLHIPQPVGDEAVFTIQPVV 418 (512) T ss_pred HHHHHHHhCCCCCCCccccceEEecCCChhhHHHHHHHHHHHh-cCCCCC----HHHHHHHhCCCCCCCccccccCCCcc Confidence 9999999996432 23568999999999999999999999996 899887 58999999999987776543322111 Q ss_pred CCCcCCCCCCcccccCcccccccccCCCCchhhhccC Q lcl|NC_017974. 471 LGEDTSRAGDGMTKGSSGNGTGKISSTRDNSAANLDN 507 (507) Q Consensus 471 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~a~~~~~ 507 (507) .+. . +...++.++...+ ....-|..+..+++ T Consensus 419 ~~~-~---~~~~~~~~~~~~~--~~~~~d~~~~~~~~ 449 (512) T protein:vir:19 419 PDN-G---SQKEAALSAEDIP--QEDDIDRMGVSPED 449 (512) T ss_pred ccc-c---ccccccccccCCC--chhhHhHHhhhHHH Confidence 111 0 0111111111000 01111111111111 No 7 >protein:vir:77981 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1843 # MgeName: P23-45 # Cross-refs: genbank:acc:YP_001467939;genbank:gi:157265380;genbank:GeneID:5600471 Probab=100.00 E-value=5.9e-104 Score=586.54 Aligned_cols=432 Identities=16% Similarity=0.129 Sum_probs=319.4 Q ss_pred CcccccccccccCccccccc------cccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQ------INVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLD 74 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~------~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~ 74 (507) |+||+||.|... |++.+.. ....+.....+++.|+++ .+..++||++..+++|++|++|+||+|+|+ T Consensus 1 m~kk~~k~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~------~~~~~iLr~~~~~~ly~~m~~D~hi~s~l~ 73 (448) T protein:vir:77 1 MAKRGRKPKELV-PGPGSIDPSDVPKLEGASVPVMSTSYDVVVD------REFDELLQGKDGLLVYHKMLSDGTVKNALN 73 (448) T ss_pred CCCCCCCCcccC-Ccccccchhhhhhhccchhhhcccccccccc------cchhHhhccccchHHHHHHhhChHHHHHHH Confidence 999998876432 3332221 222344445555555544 444567888889999999999999999999 Q ss_pred HHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhc----cCCCHHHHHHHHHHHHhhcceeeeeeeeccCccccCcce Q lcl|NC_017974. 75 MLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNM----EGQTLRQFARDAATFNEYGLSVVEKVYTQVTVGEYIGKY 150 (507) Q Consensus 75 ~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~----~~~~~~~~l~~~l~a~~~Gfs~~Ei~w~~~~~~~~~g~~ 150 (507) +||++|++++|.+++ .++++.+++++++|+++|... ...+|.++|.+||+|++|||||+|++|+.. .+|++ T Consensus 74 ~Rk~av~~~~w~v~p-~~~~~~d~~~ae~v~~~l~~~~~~~~~~~f~~~i~~~lda~~~G~s~~Eivw~~~----~dg~~ 148 (448) T protein:vir:77 74 YIFGRIRSAKWYVEP-ASTDPEDIAIAAFIHAQLGIDDASVGKYPFGRLFAIYENAYIYGMAAGEIVLTLG----ADGKL 148 (448) T ss_pred HHHHHHhcCCceEec-CCCCHHHHHHHHHHHHHhhchhhhhccCCHHHHHHHHHHhhhhcceeEEEEEeec----CCCce Confidence 999999999888764 456788899999999999753 235799999999999999999999999863 36899 Q ss_pred eeeeccccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccc Q lcl|NC_017974. 151 KVKNLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGV 230 (507) Q Consensus 151 ~~~~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~ 230 (507) .+++|++|||+++. +|.|+.+++.+...+... .........++.||..||++| |++++|||||. T Consensus 149 ~~~~l~~r~~~~~~---~f~~~~~~~l~~~~~~~~------------~~~~~~~~~~~~lP~~~~i~~-~~~~~g~p~g~ 212 (448) T protein:vir:77 149 ILDKIVPIHPFNID---EVLYDEEGGPKALKLSGE------------VKGGSQFVNGLEIPIWKTVVF-LHNDDGSFTGQ 212 (448) T ss_pred eeccccccCCCccc---eeeeecCCceEEEecCCc------------ccccccCCCccccccceEEEE-ecCCcCCcccc Confidence 99999999998775 456777776443222211 011223345778999999877 55789999999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcc Q lcl|NC_017974. 231 SPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDT 310 (507) Q Consensus 231 gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~ 310 (507) ||||.|||+|+||++++++|++|+||||+|++++++|.+. .+++++ .+++.+++.|+++|+++++|||.|+ T Consensus 213 gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vgky~~ga-----~~~~~~----~~~l~~av~~i~~g~~a~~iiP~g~ 283 (448) T protein:vir:77 213 SALRAAVPHWLAKRALILLINHGLERFMIGVPTLTIPKSV-----RQGTKQ----WEAAKEIVKNFVQKPRHGIILPDDW 283 (448) T ss_pred hHHHHHHHHHHHHHhhHHHHHHHHHHcCCceeEEecCCCC-----CCCHHH----HHHHHHHHHHHhcCCceEEEecCCc Confidence 9999999999999999999999999999999999876531 222233 3455666777888999999999999 Q ss_pred cccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHH-HHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 311 KDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLS-ESKQTIHTQFVQRVNEIILEALN 389 (507) Q Consensus 311 ~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~-~vh~~v~~~~~~aD~~~i~~~ln 389 (507) +|+| +++++++ .+|.++|+|||++|||+||||||||+.+ +|+++.+ .+|.+++++++++|+++|++||| T Consensus 284 ~ie~-------~ea~~~~--~~~~~~i~~~d~~Isk~iLGqtlTs~~~-~g~~~~~~~~~~~v~~~~~~aDa~~i~~tln 353 (448) T protein:vir:77 284 KFDT-------VDLKSAM--PDAIPYLTYHDAGIARALGIDFNTVQLN-MGVQAVNIGEFVSLTQQTIISLQREFASAVN 353 (448) T ss_pred eEEE-------EecCCCc--cCHHHHHHHHHHHHHHHHhccccccccc-cchhhhhhhhHHHHHHHHHHHHHHHHHHHHH Confidence 7654 4555443 3467899999999999999999999754 3443333 35668999999999999999999 Q ss_pred HHHHHHHHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhh Q lcl|NC_017974. 390 ENLLPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMK 469 (507) Q Consensus 390 ~~li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~ 469 (507) +|||++|+++|+ ++..++|+|+|+..+++|++++|+++++|+ +++++++|||++.++.+...... T Consensus 354 ~~Li~~l~~lNf--g~~~~~P~~~f~~~e~eDl~~~a~~~~~l~------------~~~~~~~~ip~~~~~~~~~~~~~- 418 (448) T protein:vir:77 354 LYLIPKLVLPNW--PGATRFPRLTFEMEERNDFSAAANLMGMLI------------NAVKDSEDIPTELKALIDALPSK- 418 (448) T ss_pred HHHHHHHHHhcC--CCCCCCCEEEecCCChhhHHHHHHHhHHHH------------HHHHHHhcCCccCCcCCCCCchh- Confidence 999999999994 677889999999999999999999999885 36899999998766543221111 Q ss_pred cCCCcCCCCCCcccccCcccccccccCCCC-chhhhccC Q lcl|NC_017974. 470 LLGEDTSRAGDGMTKGSSGNGTGKISSTRD-NSAANLDN 507 (507) Q Consensus 470 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d-~~a~~~~~ 507 (507) ++.... .++... ++.++ .....+-+ T Consensus 419 --~~~~~~--------~~~~~~---~~~~~~~~~~~~~~ 444 (448) T protein:vir:77 419 --MRRALG--------VVDEVR---EAVRQPADSRYLYT 444 (448) T ss_pred --cccccC--------CCCCCC---chhhcchhhHHHHh Confidence 111100 000000 00110 11111111 No 8 >protein:vir:79511 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1870 # MgeName: P74-26 # Cross-refs: genbank:acc:YP_001468055;genbank:gi:157265497;genbank:GeneID:5600628 Probab=100.00 E-value=1.6e-103 Score=584.12 Aligned_cols=438 Identities=15% Similarity=0.113 Sum_probs=326.8 Q ss_pred CcccccccccccCcccccc------ccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSE------QINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLD 74 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~------~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~ 74 (507) |+||+||.+... |.+... .+...+...+.+++.|+++ .+..++||++.++++|++|++|+||+|+|+ T Consensus 1 m~k~~~k~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~------~~~~~iLr~~~~~~ly~~m~~D~hi~s~l~ 73 (448) T protein:vir:79 1 MAKRGRKPKELV-PGPGSIDPSDVPKLEGASVPVMSTSYDVVVD------REFDELLQGKDGLLVYHKMLSDGTVKNALN 73 (448) T ss_pred CCCCCCCCcccc-Ccccccccccchhhhhhhhhhcccccccccc------cchhHhhccccchHHHHHHhhChHHHHHHH Confidence 999999977542 222221 1223344455555555554 445577888889999999999999999999 Q ss_pred HHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhcc----CCCHHHHHHHHHHHHhhcceeeeeeeeccCccccCcce Q lcl|NC_017974. 75 MLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNME----GQTLRQFARDAATFNEYGLSVVEKVYTQVTVGEYIGKY 150 (507) Q Consensus 75 ~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~----~~~~~~~l~~~l~a~~~Gfs~~Ei~w~~~~~~~~~g~~ 150 (507) +||++|++++|.+++ .++++++++++++|+++|.... ..+|.++|.+||+|++|||||+|++|+.. .+|++ T Consensus 74 ~Rk~av~~~~w~v~p-~~~~~~~~~~ae~v~~~l~~~~~~~~~~~f~~~~~~~lda~~~G~s~~Eivw~~~----~~g~~ 148 (448) T protein:vir:79 74 YIFGRIRSAKWYVEP-ASTDPEDIAIAAFIHAQLGIDDASVGKYPFGRLFAIYENAYIYGMAAGEIVLTLG----ADGKL 148 (448) T ss_pred HHHHHHhcCCceEec-CCCCHHHHHHHHHHHHHhhhhhhhhccCCHHHHHHHHHHhhhhcceeEEEEeeec----CCCce Confidence 999999999988864 4667889999999999997532 35899999999999999999999999863 36899 Q ss_pred eeeeccccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccc Q lcl|NC_017974. 151 KVKNLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGV 230 (507) Q Consensus 151 ~~~~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~ 230 (507) .+++|++|||+++. +|.|+.|++........+. .......+++.||..||++|. .+++|||||. T Consensus 149 ~~~~l~~r~~~~~~---~f~~~~d~~l~~~~~~~~~------------~~~~~~~~~~~lP~~~~i~~~-~~~~g~p~g~ 212 (448) T protein:vir:79 149 ILDKIVPIHPFNID---EVLYDEEGGPKALKLSGEV------------KGGSQFVSGLEIPIWKTVVFL-HNDDGSFTGQ 212 (448) T ss_pred ecccccccCCcccc---ceeeecCCceEEeecCCcc------------cccccCCCccccccceEEEEe-cCccCCcccc Confidence 99999999998876 3567887764433222111 111233457789999998775 5789999999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcc Q lcl|NC_017974. 231 SPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDT 310 (507) Q Consensus 231 gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~ 310 (507) ||||.|||+|+||++++++|+.|+||||+|++++++|.+ +.+++++ .+++.+++.|+++|+++++|||.|+ T Consensus 213 gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vgky~~g-----a~~~~~~----~~~l~~av~~i~~g~~a~~iiP~~~ 283 (448) T protein:vir:79 213 SALRAAVPHWLAKRALILLINHGLERFMIGVPTLTIPKS-----VRQGTKQ----WEAAKEIVKNFVQKPRHGIILPDDW 283 (448) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHcCCceEEEecCCC-----CCcCHHH----HHHHHHHHHHHhcCCceEEEecCCc Confidence 999999999999999999999999999999999887753 1222333 3455666777888999999999999 Q ss_pred cccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 311 KDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQTIHTQFVQRVNEIILEALNE 390 (507) Q Consensus 311 ~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~ 390 (507) +|+| +++++++ .+|.++|+|||++|||+||||||||+.++|++.++..+|.+++++++++|+++|++|||+ T Consensus 284 ~ie~-------~ea~~~~--~~~~~~i~~~d~~Isk~iLGqtlTs~~~~g~~~~~~~~~~~v~~~~~~aDa~~i~~tln~ 354 (448) T protein:vir:79 284 KFDT-------VDLKSAM--PDAIPYLTYHDAGIARALGIDFNTVQLNMGVQAINIGEFVSLTQQTIISLQREFASAVNL 354 (448) T ss_pred eEEE-------EecCCCc--ccHHHHHHHHHHHHHHHHhhhhhccccccchhhhhhhhHHHHHHHHHHHHHHHHHHHHHH Confidence 7654 4555443 346789999999999999999999975543333333578999999999999999999999 Q ss_pred HHHHHHHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhhc Q lcl|NC_017974. 391 NLLPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMKL 470 (507) Q Consensus 391 ~li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~~ 470 (507) |||++|+++| |++..++|+|+|+..+++|++++|+++++|+.++. ..++|+++++|+|++.++++.. ++. T Consensus 355 ~li~~l~~lN--fg~~~~~P~~~f~~~e~~Dl~~~a~~~~~l~~~~~-----~~~~~~~~~~~~p~~~~~~~~~---a~~ 424 (448) T protein:vir:79 355 YLIPKLVLPN--WPSATRFPRLTFEMEERNDFSAAANLMGMLINAVK-----DSEDIPTELKALIDALPSKMRR---ALG 424 (448) T ss_pred HHHHHHHHhc--CCCcCCCcEEEecCCChHHHHHHHHHhhhhhccch-----hhHHHHHHhhcCCCCCCCcccc---ccC Confidence 9999999999 46778899999999999999999999999998752 2368999999999876654321 111 Q ss_pred CCCcCCCCCCcccccCcccccccccCCC Q lcl|NC_017974. 471 LGEDTSRAGDGMTKGSSGNGTGKISSTR 498 (507) Q Consensus 471 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 498 (507) .. .+.+++ ....+.+-.-=..--| T Consensus 425 ~~---~~~~~~-~~~~~~~~~~~~~~~~ 448 (448) T protein:vir:79 425 VV---DEVREA-VRQPADSRYLYTRRRR 448 (448) T ss_pred CC---Cccccc-ccCCccccchhhcccC Confidence 11 111110 0011110000000011 No 9 >protein:vir:79063 Length: 491 # NCBI annotation: gp3 # Family: family:all:313 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111203;genbank:gi:134288841;genbank:GeneID:4960737 Probab=100.00 E-value=3.3e-102 Score=576.97 Aligned_cols=421 Identities=13% Similarity=0.046 Sum_probs=315.8 Q ss_pred Cc------ccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhc-cchhHHHHHHHhhChHHHHHH Q lcl|NC_017974. 1 MS------RKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIR-WPNLIGTVESMKQDATVATGL 73 (507) Q Consensus 1 ~~------~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr-~~~~~~~y~~m~~D~~v~s~l 73 (507) |+ -.+.++.+... ++....+++.+..+..+++.++++.... .|| .+..+++|++|++|+||+|+| T Consensus 1 ~~~~i~~~~g~~~~~~~~~-~~~~~~ia~~~~~~~~~~~~~~~p~~~~-------il~~~~~~~~~y~~m~~D~~i~s~l 72 (491) T protein:vir:79 1 MSKGLWVSPTEFVKFGEPD-KSLSSQIATRARSIDFFALGMYLPNPDP-------VLKALGKDIRVYRELRADAHVGGCV 72 (491) T ss_pred CCCeeeCCCCCcccccccc-hhHHHHHhhhccccccccccccCcchhH-------HHhhccCCHHHHHHHhhChHHHHHH Confidence 33 33444432211 2222233444444555555555553321 222 345689999999999999999 Q ss_pred HHHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHHHHHhhcceeeeeeeeccCccccCcceeee Q lcl|NC_017974. 74 DMLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAATFNEYGLSVVEKVYTQVTVGEYIGKYKVK 153 (507) Q Consensus 74 ~~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~ 153 (507) ++|+++|++++|.+++..+ +++++++|+++|+++ +|.++|++||+|++|||||+|++|+.. +|++.++ T Consensus 73 ~~Rk~av~~~~w~i~~~~~----~~~~a~~i~e~l~~~---~~~~~i~~~lda~~~G~s~~Ei~w~~~-----~g~~~~~ 140 (491) T protein:vir:79 73 RRRKAAVKALEWGLDRGKA----KSRVAKSIADVFADL---DLSRIATEMLDAVLYGYQPMEITWGKV-----GNYIVPI 140 (491) T ss_pred HHHHHHHhCCCcEEecCCC----CHHHHHHHHHHHhcC---CHHHHHHHHHHhhhhcceeEEEEEeec-----CCeeeEE Confidence 9999999999887765432 246789999999875 699999999999999999999999864 6889999 Q ss_pred eccccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHH Q lcl|NC_017974. 154 NLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPL 233 (507) Q Consensus 154 ~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll 233 (507) +|.+|||+ +|.|+.+++.++. ...+...++++|++|||+|+|+++++||||.||| T Consensus 141 ~l~~r~~~------~f~~d~~~~l~l~-------------------~~~~~~~g~~lp~~k~i~~~~~~~~g~p~g~gLl 195 (491) T protein:vir:79 141 DVVGKPAD------WFVYDPENQLRFR-------------------SKEHWVQGEELPARKFLVPRQEATYLNPYGFPDL 195 (491) T ss_pred eeeeeccc------ceeeccCCceEEe-------------------ecCCCCCceeecCCCeEEEEecCCCCCcccchhH Confidence 99999985 4566766654332 2234467889999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCccccc Q lcl|NC_017974. 234 VGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDN 313 (507) Q Consensus 234 ~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~ 313 (507) +.|||+|+||++++++|+.|+||||+|++++++| .+++++++.++++++.++ |+++++|||.|++|+ T Consensus 196 ~~~~w~~~fK~~~~~~w~~f~E~~G~P~~igky~-------~~a~~~ek~~l~~al~~~------~~~a~~viP~~~~ie 262 (491) T protein:vir:79 196 SMCFWPTTFKKGGLKFWVQFTEKYGSPMLVGKHP-------RSASDAETNLLLDRLEDM------VQDAVAVIPDDSSIE 262 (491) T ss_pred HHHHHHHHHHHhhHHHHHHHHHHcCCCeEEEecC-------CCCCHHHHHHHHHHHHHH------hcCeEEEecCCceeE Confidence 9999999999999999999999999999888765 456677888888877766 567999999999765 Q ss_pred ccccceeeeecc-CCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 314 APQYSMTLKGID-GMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQTIHTQFVQRVNEIILEALNENL 392 (507) Q Consensus 314 ~~~~~~el~~~~-g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~l 392 (507) | ++++ .+|+...|.+||+|||++|||+||||||||+ .+||||+|+||++|+++++++|+++|+++|| +| T Consensus 263 ~-------~ea~~~~g~~~~y~~li~~~d~~Isk~iLGqtlTt~--~~gs~a~~~vh~~v~~~i~~~D~~~i~~tln-~l 332 (491) T protein:vir:79 263 I-------KEAAGKSGSADVYERLLHFCRGEVSIALLGQNQTTE--ATSTRASAQAGLEVTDDIRDGDKAIVVEAMN-ML 332 (491) T ss_pred E-------EeccCCCCChhHHHHHHHHHHHHHHHHHhhhhhccC--cccchhhHHHHHHHHHHHHHHHHHHHHHHHH-HH Confidence 5 4454 3555667999999999999999999999995 3799999999999999999999999999999 59 Q ss_pred HHHHHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhhcCC Q lcl|NC_017974. 393 LPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMKLLG 472 (507) Q Consensus 393 i~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~~~~ 472 (507) |++|+++|+ ++...|+|.|.+.++.+ +.+|+++++|+++|+.++ ++|++++||||++..++...+...+..+ T Consensus 333 i~~l~~~N~---~~~~~p~f~~~e~ee~~-~~~a~~~~~L~~~G~~i~----~~~~~e~~Gip~~~~~e~~~~~~~~~~~ 404 (491) T protein:vir:79 333 IRWICDLNF---DGAARPVFDMWEQEQVD-EIQAGRDEKLTRAGARFT----PAYFKRAYNLQDGDLDERPLPVSAVDAV 404 (491) T ss_pred HHHHHHhcC---CCCCcceEeecCcCchh-HHHHHHHHHHHhCCCccC----HHHHHHHhCCCCCCCCccccCcCccccc Confidence 999999994 45667889888776554 678999999999999877 5799999999988776654433332222 Q ss_pred CcCCCCCCcccccCcccccccccCCCCchhhhccC Q lcl|NC_017974. 473 EDTSRAGDGMTKGSSGNGTGKISSTRDNSAANLDN 507 (507) Q Consensus 473 ~~~~~~~~~~~~~~~~~~~~~~~~~~d~~a~~~~~ 507 (507) .....+ ...+.+ ..+.|..+..++. T Consensus 405 ~~~~~~----~~~~~~------~~~~d~~~~~~~~ 429 (491) T protein:vir:79 405 GAASFA----EFEAPD------QDALDAALNALSA 429 (491) T ss_pred cccccc----ccCCCC------CcchHHHHHHHHH Confidence 111110 000000 0111111111111 No 10 >protein:vir:99853 Length: 488 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164068;genbank:gi:56692600;genbank:GeneID:3192581 Probab=100.00 E-value=8.8e-101 Score=569.12 Aligned_cols=410 Identities=13% Similarity=0.072 Sum_probs=310.0 Q ss_pred ccccccCcccccccccc---ccccccccCchhhhHHHHhhhhcccchhc--cchhHHHHHHHhhChHHHHHHHHHHHHHh Q lcl|NC_017974. 7 RNRSVKVAKATSEQINV---SRMRMSEQGTFALAKVQVDSERMKAEEIR--WPNLIGTVESMKQDATVATGLDMLYTFVE 81 (507) Q Consensus 7 ~~~~~~~~~~~~~~~~~---~~~~~~~~g~~~~~~~~~~~~~e~~~~lr--~~~~~~~y~~m~~D~~v~s~l~~Rk~av~ 81 (507) ++| +..++ ++++ .+..+..+.+.++.+. .+.|+ ....+++|++|++|+||+|+|++|+++|+ T Consensus 1 v~~----~~l~~-e~at~~~~~d~~~~~~~~l~~~~--------~~il~~a~~g~~~~y~~l~~D~~i~s~l~~rk~av~ 67 (488) T protein:vir:99 1 MEK----PALGR-EIATSGDGRDITRPFISGLQVPN--------DSILQRRGGNDLRVYEEILSDAQVKTVWGQRQLAVV 67 (488) T ss_pred CCc----cchhH-HHHHHHhhhhhhccccCCCCCCC--------hHHHHhhccCCHHHHHHHhhChHHHHHHHHHHHHHh Confidence 222 11111 1111 1111222222211111 11222 23457899999999999999999999999 Q ss_pred hCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHHHHHhhcceeeeeeeeccCccccCcceeeeeccccCch Q lcl|NC_017974. 82 KAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAATFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAFRPQA 161 (507) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~r~~~ 161 (507) +++|++++ +++++++++++++|+++|+++ +|.++|++||+|++|||||+|++|+.. +|++.+++|.+|||+ T Consensus 68 ~~~w~i~p-~~~~~~~~~~ae~v~~~l~~~---~~~~~l~~~lda~~~G~s~~Ei~w~~~-----~g~~~~~~l~~r~~~ 138 (488) T protein:vir:99 68 SREWKVEA-GGDRPIDQAAAEHLEQQLQRV---GWDRVTSKMLFGVFYGYAVSELIYGRD-----DRYITLEAIKVRNRR 138 (488) T ss_pred cCCceEEc-CCCChHHHHHHHHHHHHHhCC---CHHHHHHHHHhhhhhcceeEEEEEeec-----CCeeeEeeeeeeccc Confidence 99998875 456788999999999999874 699999999999999999999999864 689999999999985 Q ss_pred hcccccceeecCCCccccccccccccccccccccccccccccCCCccccc-cccEEEEeeCCccCccccchHHHHHHHHH Q lcl|NC_017974. 162 SLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIP-INRVMLMNTGGSSSQALGVSPLVGCYRAW 240 (507) Q Consensus 162 ~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP-~~k~i~~~~~~~~~~p~G~gll~~~~~~~ 240 (507) +|.|+.+++.++..+ .....++++| +.+|++|+|+++++||||.|||+.|||+| T Consensus 139 ------~f~~d~~~~l~~~~~-------------------~~~~~g~~lp~~~~~i~~~~~~~~g~p~g~gLl~~~~w~~ 193 (488) T protein:vir:99 139 ------RFRYDQDGGLRLLTP-------------------NNMFEGEPCPAPYFWHFSTGADNDDEPYGLGLAHWLYWPV 193 (488) T ss_pred ------ceeecCCCceEEecc-------------------CCCCCccccccCceEEEEeecCCCCCcccchHHHHHHHHH Confidence 456777765433222 2334677886 56899999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCccccccccccee Q lcl|NC_017974. 241 REKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYSMT 320 (507) Q Consensus 241 ~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~~e 320 (507) +||++++++|+.|+||||+|++++++|+ .+++++++.++++++.++ |+++++|||.|++|+ T Consensus 194 ~fK~~~~~~w~~f~E~yG~P~~igky~~------~~a~~~ek~~l~~av~~~------~~~~~~viP~~~~ie------- 254 (488) T protein:vir:99 194 FFKRNGIKFWLIFLDKFGMPTAVGRYDD------KTATPEDKAKLLAALHAI------QTDSAIIMPAGMQAE------- 254 (488) T ss_pred HHHHhhHHHHHHHHHHcCCceeeeecCC------CCCCHHHHHHHHHHHHHH------hcCcEEEecCCceeE------- Confidence 9999999999999999999999887653 245666777777777766 567999999999765 Q ss_pred eeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC Q lcl|NC_017974. 321 LKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQTIHTQFVQRVNEIILEALNENLLPQLLALN 400 (507) Q Consensus 321 l~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~lN 400 (507) |++++++ +...|..||+|||++|||+||||||||+ +++||||+|+||++|+++++++|+++|+++||+|||++|+++| T Consensus 255 ~~ea~~~-~~~~~~~li~~~d~~Isk~iLGqtlts~-~~~Gs~a~~~vh~~v~~d~~~aDa~~i~~tln~~li~~l~~~N 332 (488) T protein:vir:99 255 LLEAGRS-GTADYKTLHDTMDATIAKVGLGQVASTQ-GTPGRLGNDDLQADVRLDLVKADADLICESFNLGPARWLTEWN 332 (488) T ss_pred EeecCCC-ChHHHHHHHHHHHHHHHHHHhhhhhccc-ccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC Confidence 4555544 3457999999999999999999999985 4579999999999999999999999999999999999999999 Q ss_pred CCcCCccccceEEecCcCchhHHHHHHHHHHHHhc-CCccccHHHHHHHHHHcCCCCCCCcccccchhhhcCCCcCCCCC Q lcl|NC_017974. 401 DMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAV-GYLPKTPKVINRVLEVLGIDEKIEEDISQEELMKLLGEDTSRAG 479 (507) Q Consensus 401 ~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~-G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~~~~~~~~~~~ 479 (507) |++..+|+|+|+..+++|++++++++++|+++ |+.++ ++|++++||||++.++++.... .++.... T Consensus 333 ---~~~~~~p~~~~~~~e~edl~~~a~~~~~l~~~~G~~i~----~~~i~e~~Gip~~~~~~~~~~~------~~~~~~~ 399 (488) T protein:vir:99 333 ---FPGAQPPRVYRVIEEPEDITAKAERDEKVFRMSGFRPT----RGYVQETYGVEVESTQAEATAP------TPSTEFA 399 (488) T ss_pred ---cCCcCCceeEecCCCcccHHHHHHHHHHHHhhcCCCCC----HHHHHHHcCCCCcccccccccC------CCcccCC Confidence 46678899999999999999999999999997 88776 5799999999987665543211 1111111 Q ss_pred CcccccCcccccccccCCCCchhhhccC Q lcl|NC_017974. 480 DGMTKGSSGNGTGKISSTRDNSAANLDN 507 (507) Q Consensus 480 ~~~~~~~~~~~~~~~~~~~d~~a~~~~~ 507 (507) ++.. .. ...+..+..++. T Consensus 400 ~~~~--~~--------~~~~~~~~~~~~ 417 (488) T protein:vir:99 400 EGDQ--PS--------DPAAAMAPQLAE 417 (488) T ss_pred CCCC--CC--------CchHHHHHHHHH Confidence 1000 00 000111111111 No 11 >protein:vir:107880 Length: 491 # NCBI annotation: gp29 # Family: family:all:313 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024702;genbank:gi:48696939;genbank:GeneID:2845968 Probab=100.00 E-value=1.8e-100 Score=567.42 Aligned_cols=422 Identities=14% Similarity=0.064 Sum_probs=314.9 Q ss_pred Ccccc------cccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHH Q lcl|NC_017974. 1 MSRKR------NRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLD 74 (507) Q Consensus 1 ~~~k~------~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~ 74 (507) |+..= ....... .++....++ .+.+..++++.|+++.+.+..... ....+++|++|++|+||+|+|+ T Consensus 1 m~~~i~~~~g~p~~~~~~-~~~~~~~ia-~~~~~~~~~~~~~~~~~~~~iLr~-----~~~~~~~y~~m~~D~~i~s~l~ 73 (491) T protein:vir:10 1 MSKGLWVSPTEFVTFGEP-DKSLSSQIA-TRARSIDFFALGMYLPNPDPVLKA-----LGKDIRVYRELRADAHVGGCVR 73 (491) T ss_pred CCCceeCCCCCccCcccC-ChHHHHHHH-hhhcccccccccCCccchHHHHHh-----cCCCHHHHHHHhhChHHHHHHH Confidence 55421 1111110 011111222 344566777777776655443311 2346789999999999999999 Q ss_pred HHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHHHHHhhcceeeeeeeeccCccccCcceeeee Q lcl|NC_017974. 75 MLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAATFNEYGLSVVEKVYTQVTVGEYIGKYKVKN 154 (507) Q Consensus 75 ~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~ 154 (507) +|+++|++++|.+++..+ +.+++++|+++|+++ +|.++|++||+|++|||||+|++|+.. +|++.+++ T Consensus 74 ~Rk~av~~~~w~i~~~~~----~~~~~e~v~e~l~~~---~~~~~l~~~lda~~~G~s~~Ei~w~~~-----~g~~~~~~ 141 (491) T protein:vir:10 74 RRKAAVKALEWGLDRGKA----KSRVAKSIADVFADL---DLSRIVTEMLDAVLYGYQPMEITWGKV-----GNYIVPID 141 (491) T ss_pred HHHHHHhCCCcEEecCCC----CHHHHHHHHHHHhcC---CHHHHHHHHHHhhhhcceeEEEEEeec-----CCeeEEEE Confidence 999999999877764322 346789999998865 699999999999999999999999864 67899999 Q ss_pred ccccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHH Q lcl|NC_017974. 155 LAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLV 234 (507) Q Consensus 155 l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~ 234 (507) |.+|||+ +|.|+.+++.... .......++++|++|||+|+|+++++||||.||++ T Consensus 142 l~~r~~~------~f~~d~~~~l~~~-------------------~~~~~~~g~~l~~~k~i~~~~~~~~~~p~g~gLl~ 196 (491) T protein:vir:10 142 VVGKPAD------WFVYDPENQLRFR-------------------SKDHWMQGEELPARKFLVPRQEATYLNPYGFPDLS 196 (491) T ss_pred eeeeccc------ceeeccCCceEEe-------------------cCCCCCCcceecCCCEEEEEecCCCCCcccchhHH Confidence 9999985 4556766654322 12234578899999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccc Q lcl|NC_017974. 235 GCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNA 314 (507) Q Consensus 235 ~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~ 314 (507) .|||+|+||++++++|+.|+||||+|++++++| .+++++++.++++++.++ |+++++|||.|++|++ T Consensus 197 ~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~-------~~a~~~ek~~l~~al~~~------~~~a~~viP~~~~ie~ 263 (491) T protein:vir:10 197 MCFWPTTFKKGGLKFWVQFTEKYGSPMLVGKHP-------RSASDGEKNLLLDCLEDM------VQDAVAVVPDDSSIEI 263 (491) T ss_pred HHHHHHHHHHHHHHHHHHHHHHcCCCeEEEecC-------CCCCHHHHHHHHHHHHHH------hcCcEEEecCCceeEE Confidence 999999999999999999999999999888765 456777888888877776 5679999999997654 Q ss_pred cccceeeeeccC-CCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 315 PQYSMTLKGIDG-MGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQTIHTQFVQRVNEIILEALNENLL 393 (507) Q Consensus 315 ~~~~~el~~~~g-~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~li 393 (507) +++++ +|+...|.+||+|||++|||+||||||||+ .+||||+|+||++|+++++++|+++|+++|| +|| T Consensus 264 -------~ea~~~~g~~~~y~~li~~~d~~Isk~iLGqtlTt~--~~gs~a~~~vh~~v~~di~~~D~~~i~~tln-~li 333 (491) T protein:vir:10 264 -------KEAAGKTGSADVYERLLHFCRGEVSIALLGQNQTTE--ATSTRASAQAGLEVTDDIRDGDKAVVSEAMN-MLI 333 (491) T ss_pred -------EecCCCCCChhHHHHHHHHHHHHHHHHHhhhhcccC--cccchhHHHHHHHHHHHHHHHHHHHHHHHHH-HHH Confidence 45544 445567999999999999999999999995 3799999999999999999999999999999 599 Q ss_pred HHHHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhhcCCC Q lcl|NC_017974. 394 PQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMKLLGE 473 (507) Q Consensus 394 ~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~~~~~ 473 (507) ++||++|+ ++..+|+|+|++.++.+ +.+++++++|+++|+.++ ++|++++||||++..++...+...+.... T Consensus 334 ~~l~~~N~---~~~~~p~f~~~~~~e~~-~~~a~~~~~L~~~G~~i~----~~~i~e~~Gip~~~~~~~~~~~~~~~~~~ 405 (491) T protein:vir:10 334 RWICDLNF---DGADRPVFDMWEQEQVD-EIQAGRDQKLTQAGARFT----PAYFKRAYNLQDGDLDERPLPVSAVDTVG 405 (491) T ss_pred HHHHHhcC---CCCCcceEEecCcCchh-HHHHHHHHHHHhCCCcCC----HHHHHHHhCCCCCCcCccccccCCCCCcc Confidence 99999993 45668999999876554 789999999999999877 57999999999887766543322221111 Q ss_pred cCCCCCCcccccCcccccccccCCCCchhhhc-----cC Q lcl|NC_017974. 474 DTSRAGDGMTKGSSGNGTGKISSTRDNSAANL-----DN 507 (507) Q Consensus 474 ~~~~~~~~~~~~~~~~~~~~~~~~~d~~a~~~-----~~ 507 (507) +.+ .....+.+ ..+.|.....+ +. T Consensus 406 ~~~----~~~~~~~~------~~~~d~~~~~~~~~~~~~ 434 (491) T protein:vir:10 406 AAS----FAEFEAPD------QDALDAALNTLSARDLNA 434 (491) T ss_pred ccc----ccccCCCC------CCchHHHHHHHHHHHHHH Confidence 110 00000000 01111111111 11 No 12 >protein:vir:98816 Length: 446 # NCBI annotation: hypothetical protein # Family: family:all:32558 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851097;genbank:gi:117530254;genbank:GeneID:4484480 Probab=100.00 E-value=2.3e-99 Score=561.32 Aligned_cols=417 Identities=15% Similarity=0.169 Sum_probs=300.0 Q ss_pred CcccccccccccC-ccccc-cccccccccccccCchhhhHHHHhhhhcccchhccc-----hhHHHHHHHhh-ChHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKV-AKATS-EQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWP-----NLIGTVESMKQ-DATVATG 72 (507) Q Consensus 1 ~~~k~~~~~~~~~-~~~~~-~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~-----~~~~~y~~m~~-D~~v~s~ 72 (507) |+- -++. |.|.- .+.++.. ...+++. +. ....+.||+. +.+++|++|++ |+||+|+ T Consensus 1 ~~~------~~~~~p~~~~~~~~~~~~------~~~~~~~--g~--~~~D~~lr~~gg~~~~~~~l~~~m~e~D~~v~s~ 64 (446) T protein:vir:98 1 MNM------EVRNAPTPAIRRRTIYAM------EHLGLAT--SY--LSEDGGYKRAGKPTYQQLSAWDEAAQTEPIIAQG 64 (446) T ss_pred Ccc------cccCCCchhhhhhhhhcc------ccchhhc--cc--CCcchHhhhcCCChHHHHHHHHHHHhcchHHHHH Confidence 221 1111 11110 0111110 1112222 11 2234445432 35689999975 9999999 Q ss_pred HHHHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHHHHHhhcceeeeeeeeccCccccCcceee Q lcl|NC_017974. 73 LDMLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAATFNEYGLSVVEKVYTQVTVGEYIGKYKV 152 (507) Q Consensus 73 l~~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~ 152 (507) |++||++|++++|+++ |+ ++++++||+++|+++. |+.++.+|++|++|||||+|++|++..+++.++++.- T Consensus 65 l~~Rk~av~~~~w~V~--p~----~~~~a~~v~~~l~~~~---~~~~~~~~ldai~~G~s~~Eivw~~~~g~~~p~~~~d 135 (446) T protein:vir:98 65 LDSIALSVLNKVGPYQ--HG----DKRIKKFIDDQLRNRA---KTWISHCVKSIMTYGFSLSEQIYAHGARDNMPATVLD 135 (446) T ss_pred HHHHHHHhhcCCceec--Cc----cHHHHHHHHHHHhhcC---chhHHHHHHHHHhhCceeeeEEEeecccccccchhhc Confidence 9999999999987765 32 4689999999999885 5667888999999999999999999877776555422 Q ss_pred eeccccCchhcccccceeecCCCccccccccccccccccccc-------cccccccccCCCccccccccEEEEeeCCccC Q lcl|NC_017974. 153 KNLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTAS-------EIGVGGVSTRMSDVIIPINRVMLMNTGGSSS 225 (507) Q Consensus 153 ~~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~-------~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~ 225 (507) ..+..+|.. . .+.|+.+++.+.+..+....+...... ....+.....+.++.||+.||++|+|+++++ T Consensus 136 ~~~~~~~~~----~-r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~iP~~kfi~~~~~~~~~ 210 (446) T protein:vir:98 136 DIVNYHPLQ----V-MLIANDNGRIVDGDTVTASQYKSGYWVPLPPYRIGDPPKKVDVVGSHVRLPSHKRLFINYNTKGN 210 (446) T ss_pred ccccccccc----c-eeeeccCCccccccccchhhcccccccCcccchhhhhhhhcccCcccccccccceEEEEecCCCC Confidence 222222211 0 135566666555444433222111110 1122334456678899999999999999999 Q ss_pred ccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHH--HHHHHHHHHHHhhcCCcee Q lcl|NC_017974. 226 QALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAE--MVRGLMSDAANAHSGEQSF 303 (507) Q Consensus 226 ~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~--~l~a~~~~~~n~~~G~~a~ 303 (507) ||||.||||.|||+|+||++++++|++|+||||+|++++++|++..+....+++..+.. ..+.+.+.+.++ +++++ T Consensus 211 ~p~G~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vGkyp~ga~~~~~~~~~~~~~~~~~~~~L~~av~~~--~~da~ 288 (446) T protein:vir:98 211 NPWGTSCLTSVLDYSIFKRAFRDMMLIALDRYGTPLIYVIVPPGNTGVVEEAPDGTEITTTIAEQAEDALRRL--STDSG 288 (446) T ss_pred CccccchHHHHHHHHHHHHhhHHHHHHHHhHcCCceeEEeecCCCCcccccchhHHHHHHHHHHHHHHHHHhc--cccce Confidence 99999999999999999999999999999999999999999988765554443322211 111233333332 56788 Q ss_pred Eee-----cCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhccccccc--ccchhhhhHHHHHHHHHHHH Q lcl|NC_017974. 304 FML-----PSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVG--NDKGGSYNLSESKQTIHTQF 376 (507) Q Consensus 304 ~ii-----P~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~--~~~~GS~Al~~vh~~v~~~~ 376 (507) +|| |+||+| ||+++++++ ..+|++||+|||++|||+|||||||++ ++++||||+|+||++|+.++ T Consensus 289 ~ii~~~~~P~g~ei-------e~~ea~~~~-~~~~~~~i~~~d~~IskaiLg~~Ltl~~~~~~~GS~ala~vh~~V~~d~ 360 (446) T protein:vir:98 289 LVLTQLSKEQPVQV-------GALTTGNNF-SDSFERAISLCDNNMLMGMGIPNLLVQNRETTFGTGRASEIQLELFDGK 360 (446) T ss_pred eeeecccCCCCceE-------EeeccccCC-hhhHHHHHHHHHHHHHHHHhcccccccccccccchhhhHHHHHHHHHHH Confidence 888 999875 456676654 456999999999999999999998865 45679999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCcCCcccc-----ceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHH Q lcl|NC_017974. 377 VQRVNEIILEALNENLLPQLLALNDMRLPETEM-----PYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEV 451 (507) Q Consensus 377 ~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~~-----P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~ 451 (507) +++|+++|++|||+|||+|||++|+ ++...+ |.++|+..+++|++.+|+++++|+++|+++++. ++|++|+ T Consensus 361 ~~aDa~~i~~tln~~Li~~l~~lNf--~~~~~~~~~~~~~~~~~~~e~eDl~~~a~~~~~L~~~G~~~p~~--~~~ire~ 436 (446) T protein:vir:98 361 INSIFDTVIHAFTEQVIGNLIRLNF--DPALYPLASNTGYITRLPGRATDLAALVEAIKQMHDMGFLVDGD--KDHIRSI 436 (446) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCC--CccccccccccccceeccCChhhHHHHHHHHHHHHhCCcccccc--HHHHHHH Confidence 9999999999999999999999995 333333 345777789999999999999999999987653 6899999 Q ss_pred cCCCCCCCcc Q lcl|NC_017974. 452 LGIDEKIEED 461 (507) Q Consensus 452 ~Gip~~~~~~ 461 (507) ||||++.+.. T Consensus 437 ~giP~~~~~~ 446 (446) T protein:vir:98 437 TGLPDAISST 446 (446) T ss_pred hCcCCCCCCC Confidence 9999876655 No 13 >protein:vir:78161 Length: 355 # NCBI annotation: hypothetical protein # Family: family:all:2372 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294798;genbank:gi:149882819;genbank:GeneID:5309189 Probab=100.00 E-value=1e-85 Score=486.55 Aligned_cols=336 Identities=18% Similarity=0.214 Sum_probs=263.1 Q ss_pred eeeeeeeccCccccCcceeeeeccccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCcccccc Q lcl|NC_017974. 133 VVEKVYTQVTVGEYIGKYKVKNLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPI 212 (507) Q Consensus 133 ~~Ei~w~~~~~~~~~g~~~~~~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~ 212 (507) |+||+|+.+ +|+|.+++|++|||++|.+| .|+.|++ +..+++.+ ....+++.||+ T Consensus 1 v~Eivw~~~-----~g~~~~~~l~~r~~~~~~~f---~~~~~~~-l~~~~~~~----------------~~g~~~~~lp~ 55 (355) T protein:vir:78 1 MFEQVYRIE-----NGRARLGKLAWRPPRTISRF---DVAPDGG-LVAIEQWG----------------VFGKATVRIPV 55 (355) T ss_pred CeEEEEEee-----CCeEEEeeeeecCccceeee---eeccCCc-eeEEEecC----------------CCCCCcceecc Confidence 999999864 68999999999999988754 3566664 33444422 23346788999 Q ss_pred ccEEEEeeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCch--HHHHHHHHHHH Q lcl|NC_017974. 213 NRVMLMNTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQS--SPEAEMVRGLM 290 (507) Q Consensus 213 ~k~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~--~~~~~~l~a~~ 290 (507) +|||+|+|+++++||||.|||+.|||+|+||++++++|++|+||||+|||++++|...+........ ........++. T Consensus 56 ~kfi~~~~~~~~g~p~G~gLlr~~~w~~~fK~~~~~~w~~f~Er~g~g~p~~~~~~~~~~~~~d~~~~~~~~~~~~~~l~ 135 (355) T protein:vir:78 56 DRLVVFVNEREGANWLGQSLLRQAYKNWLLKDRFLRIQALVGERNGLGVPIYQGAPLPEAIARDTARAEQWLNDQKEEGL 135 (355) T ss_pred CCEEEEEeCCCCCCccchhhHHHHHHHHHHHHhhHHHHHHHHHHcCCCceEEEecCCCCcccchhhhHHHHHHHHHHHHH Confidence 9999999999999999999999999999999999999999999999999999988754432111000 00112334577 Q ss_pred HHHHHhhcCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccc-cchhhhhHHHHH Q lcl|NC_017974. 291 SDAANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGN-DKGGSYNLSESK 369 (507) Q Consensus 291 ~~~~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~-~~~GS~Al~~vh 369 (507) .++.++++|+++++|||.||+|+ |++++++ ..+|.+||+|||++|||+||||||||+. +++||||+|+|| T Consensus 136 ~~~~~i~~g~~a~~iip~g~~ie-------~~ea~g~--~~~~~~~i~~~d~~Isk~iLGqtlTs~~~~~gGS~Alg~vh 206 (355) T protein:vir:78 136 QLAKEFRAGEAAGGYIPHGANFT-------LTGVQGK--LPEMDGPIRYHDEQIARAVLAHFLTLGGDKSTGSYALGDTF 206 (355) T ss_pred HHHHHhhCCcceeEeecCCceEE-------EeecCCC--cccHHHHHHHHHHHHHHHHhhhhhccccCCccchhhHHHHH Confidence 77788889999999999999765 4455443 3457889999999999999999999974 567999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHH Q lcl|NC_017974. 370 QTIHTQFVQRVNEIILEALNENLLPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVL 449 (507) Q Consensus 370 ~~v~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~ 449 (507) ++|+++++++|+++|+++||+|||++|+++|+ ++..++|+|+|+.+++++ +++++++++|+++|++++++.+++|++ T Consensus 207 ~~v~~~~~~aD~~~i~~~ln~~li~~l~~lN~--~~~~~~P~~~~~~~~~~~-~~~a~~~~~l~~~G~~~~~~~~~~~~~ 283 (355) T protein:vir:78 207 ASFFTGSLNAVMKHIADVTQQHVVEDLVDQNW--GPEEPAPRLVPAQLGKEQ-PVTAEAIRALVECGAFTADPELEKDLR 283 (355) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC--CCCCCCCEEEecCcChhH-HHHHHHHHHHHhCCCccccHHHHHHHH Confidence 99999999999999999999999999999994 667789999999877555 678999999999999999998899999 Q ss_pred HHcCCCCCCCcccc-cchhhhcCCCcCCCCCCcccccCcccccccccCCCCchhhhccC Q lcl|NC_017974. 450 EVLGIDEKIEEDIS-QEELMKLLGEDTSRAGDGMTKGSSGNGTGKISSTRDNSAANLDN 507 (507) Q Consensus 450 e~~Gip~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~a~~~~~ 507 (507) ++||||++.+.++. .+...+..+..+.+++++.. ++++++++.++++|+.+---+= T Consensus 284 e~~gip~p~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~a~~~~a~~~~~~~~~~ 340 (355) T protein:vir:78 284 ARYGLPAPAERDDGADAAAAKAAGRRRAKRLPGQR--QGAALPSRSPRADPPRRRGPLR 340 (355) T ss_pred HHhCCCCCCCCCcccCCccccccccccccccCCcc--ccccccccCCCCCChhhhHHHH Confidence 99999988766543 33333334444444444432 3456677767666665432211 No 14 >protein:vir:102727 Length: 945 # NCBI annotation: portal protein # Family: family:all:2446 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874016;genbank:gi:118197623;genbank:GeneID:4495919 Probab=99.75 E-value=1.8e-17 Score=112.35 Aligned_cols=444 Identities=11% Similarity=0.035 Sum_probs=213.0 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHH-hhChHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESM-KQDATVATGLDMLYTF 79 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m-~~D~~v~s~l~~Rk~a 79 (507) +=||+++=|+.++..|-+......-.++-+. .+-+....- ...+...+. +..++..+ +..+.|.+|++.+... T Consensus 64 ~~~~~~~~kk~~i~~pfkkk~~~~~~d~f~~--s~es~s~vt---sls~pdaf~-~vnVs~~~AlknsaV~scI~~IA~s 137 (945) T protein:vir:10 64 IFRKNQVLKKEKIIVPYNHQEPPFKFNLFEY--SPESLMYLP---SISDPDAFF-LINLFRKYRFNNDSKLIKVSEIPKK 137 (945) T ss_pred eehhhhHHHhhcccccccccccchhhhhhhc--cCccceecc---cccCcccee-eehhhhhhhhccHHHHHHHHHHHhh Confidence 4444444222222222221111000000000 000000000 000000010 12344444 4689999999999999 Q ss_pred HhhCCceeee--CCCCCh-hHHHH--HHHHHHHHHhccCC------CHHHHHHHHH-HHHhhcceeeeeeeeccCccccC Q lcl|NC_017974. 80 VEKAFKDFKV--IPGESE-ESKKA--AKFVEYCLKNMEGQ------TLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYI 147 (507) Q Consensus 80 v~~~~~~~~~--~~~~~~-~~~~~--ae~v~~~l~~~~~~------~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~ 147 (507) |.++++...- ..+... ..++. ..-+..+|++ ++. .|.++++.++ +.+.+|-+..++++... T Consensus 138 IAsLPlklYrr~edG~~~~~~kk~~~~hpL~~LL~r-PNp~mT~~eFwqsFl~~Lv~dLLL~GNAYieIiRd~~------ 210 (945) T protein:vir:10 138 LTSKELEIYKHIEDKHVNYYLKRIRDARNILEFLER-PDPYFSEVNSWEYLLGMVLDDILTIDRGAIVKIRDEQ------ 210 (945) T ss_pred hccCceEEEEecccCcccccccccccchHHHHHHhC-CCcccChhHHHHHHHHHHHHHHhhcCCeEEEEEECCC------ Confidence 9888766422 112111 00110 1112333431 211 2445666654 67889999999987543 Q ss_pred cceeeeeccccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccC-- Q lcl|NC_017974. 148 GKYKVKNLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSS-- 225 (507) Q Consensus 148 g~~~~~~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~-- 225 (507) |+ +..|.+.++.++. +..+.||..+...++. ........+++...|+|++....+ T Consensus 211 G~--ii~L~pLdPs~Vt----i~~ddDG~~~y~Yv~~-----------------idG~~~~~v~a~DvIlhirn~s~DG~ 267 (945) T protein:vir:10 211 GN--LVAITPVDGTTIK----PILSEDTGIVVGYVQE-----------------VDGAIVAHFDKRDVVLFRQNLTPDVY 267 (945) T ss_pred Cc--EEEEEEECCcceE----EEEcCCCcEEEEEEEe-----------------cCCceEEEecCCceEEEeccCCCCcc Confidence 32 3456666665543 2234455332211110 111223346677778888665433 Q ss_pred -ccccchHHHHHHHHHHHHHHHHHHHHHHHHHhc-CCceEEecccccC---CCCCCCchHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_017974. 226 -QALGVSPLVGCYRAWREKILIENLEVVGATKDM-GGVIELKIPSQIL---NKAAMDQSSPEAEMVRGLMSDAANAHSGE 300 (507) Q Consensus 226 -~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g-~p~~v~~~P~~~~---k~~~~~~~~~~~~~l~a~~~~~~n~~~G~ 300 (507) .++|.+.+..+....-.-....++-+.+..+.| .|--++.++.... +....-+.+.. +++.+...+..+|. T Consensus 268 ~~GyGlSPIeaa~~aI~~alAaek~aar~FskNGa~PsGILsvkg~~~~d~k~~~~LseEq~----erlKe~wee~~sG~ 343 (945) T protein:vir:10 268 MYGYSLPPIEILYKVILSDIFIDKGNLDYYRKGGSIPEGILAIEPPSYKEGDIYPQLSREQL----ESIQRQLQAIMMGD 343 (945) T ss_pred cccCCchHHHHHHHHHHHHHHHHHHHHHHHHhCCCccceEEEecCccccccccccccCHHHH----HHHHHHHHHHhCCc Confidence 356888899888777666666666666665555 3322333332211 11122223222 23333333334454 Q ss_pred cee--EeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHHHHH-HHHH Q lcl|NC_017974. 301 QSF--FMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQTIH-TQFV 377 (507) Q Consensus 301 ~a~--~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~~v~-~~~~ 377 (507) .++ ++++.|++.. ..+.+.....+.+..++..++|+++..-..--.+...+++++..+.+...+ ..-+ T Consensus 344 NnG~piVLdeGmef~---------pLs~s~~DaQfLEsrkfs~eeIArAFGVPP~lLG~~e~st~SNiEqq~~~Fv~~tL 414 (945) T protein:vir:10 344 YTQVPILSGGKFTWI---------DFKGKRRDMQFKELAEFVARKICAVYQVSPQDVGILEGSNKATAEVMASLTKAKGL 414 (945) T ss_pred ccccceecCCCceEE---------EccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcccCCCCCcchHHHHHHHHHHHHH Confidence 444 4567776532 222222333466777888889999875543223333334455444444444 5678 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCC Q lcl|NC_017974. 378 QRVNEIILEALNENLLPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEK 457 (507) Q Consensus 378 ~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~ 457 (507) ...++.|+..||+.|++.. .+ ..-+|.|+...-.|.++.++++.++++.|++.+ +.+|+.+|+|+- T Consensus 415 ~Pil~~IEqeLNrkLl~~~--------eg-~~i~fdFd~ldl~D~ksraEal~kli~sGiLTi-----NEvRe~lGLpPI 480 (945) T protein:vir:10 415 EPLMATISKGFDEVVSEFR--------NE-KDIKLWFKEDDLEKERDWWNIIQGQLNTGFRSI-----NEARMEKGLEPV 480 (945) T ss_pred HHHHHHHHHHHHHhccccc--------cC-ceeEEEecchhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCC Confidence 8899999999997653221 11 112577877777888899999999999999876 579999999976 Q ss_pred CCcccccchhhhcCC--------------CcCCCCCCcccccCcc-ccccccc-CCCCchhhhccC Q lcl|NC_017974. 458 IEEDISQEELMKLLG--------------EDTSRAGDGMTKGSSG-NGTGKIS-STRDNSAANLDN 507 (507) Q Consensus 458 ~~~~~~~~~~~~~~~--------------~~~~~~~~~~~~~~~~-~~~~~~~-~~~d~~a~~~~~ 507 (507) +.+|..........| +.....++.......+ .+....+ ..++.++.-+.. T Consensus 481 eGGD~lli~~nn~~P~d~~~ka~~ga~p~q~aq~~~dqp~~kGGe~dEns~~psE~kda~~e~~~~ 546 (945) T protein:vir:10 481 PWGDVPFSGLRNWKPEDEQAKAQQGAMPPQLAQAMADQPSQQGGGVDENSSVPSEQKNAGLEVLRN 546 (945) T ss_pred CCcceeeeccccccccccccccccCCCCcccccCCCCCCCCCCCCCCCCCCCCCcccchHHHHHHH Confidence 555432111000000 0000000000000000 0011111 112222121111 No 15 >protein:vir:3153 Length: 467 # NCBI annotation: capsid protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665924;genbank:gi:22091110;genbank:GeneID:951257 Probab=99.74 E-value=6.8e-17 Score=109.23 Aligned_cols=415 Identities=12% Similarity=0.109 Sum_probs=203.2 Q ss_pred HHHHhh-ChHHHHHHHHHHHHHhhCCceeeeCCCC--ChhHHHHHHHHHHHHHhccC-----------CCHHHHHHHHH- Q lcl|NC_017974. 60 VESMKQ-DATVATGLDMLYTFVEKAFKDFKVIPGE--SEESKKAAKFVEYCLKNMEG-----------QTLRQFARDAA- 124 (507) Q Consensus 60 y~~m~~-D~~v~s~l~~Rk~av~~~~~~~~~~~~~--~~~~~~~ae~v~~~l~~~~~-----------~~~~~~l~~~l- 124 (507) .++|.+ .+.|.+|++.+...|.++++.+....+. .....+..+.+...|..... ..+.+++..++ T Consensus 1 l~~l~~~n~~v~~ci~~ia~~ia~~p~~i~~~~~~~~~~~~~~~~~~~~~~l~~~~pn~~~~~~~~~~~t~~~~~~~~~~ 80 (467) T protein:vir:31 1 MAELLEHNETHAKCVHAKSRYVAGFGINIIPHPEAEDPDRDGEQYERVWDFWFGDDSNWQVGPMESERATATNVLQTAWT 80 (467) T ss_pred ChhhhhcCHHHHHHHHHHHHhhhcCCeEEEEccCcccccchhhhhhhHHHHhhccCCCccccchhhHhhHHHHHHHHHHH Confidence 344544 7999999999999999998877654321 11122222323332322111 13445665543 Q ss_pred HHHhhcceeeeeeeeccCccccCcceeeeeccccCchhcccccceeecCCCccccccccccccccccc--------cccc Q lcl|NC_017974. 125 TFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYT--------ASEI 196 (507) Q Consensus 125 ~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~--------~~~~ 196 (507) +.+.+|.+.+|+++...+ . +..|.+.++.++.... |+............++... .+.. T Consensus 81 ~l~l~Gn~~i~~~r~~~G------~--~~~l~~l~~~~v~~~~------d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 146 (467) T protein:vir:31 81 DYEAIGWLTIEILTQTDG------T--PTGLAYVPGHTIRKRM------DERGFVQLLEEKEKYFGVAGDRYQTNGNGDL 146 (467) T ss_pred HHHhcCCeEEEEEECCCC------c--EEEEEEeCCceeEeee------ecceeEeecCCceeeEEeccccceeecccce Confidence 577799999999986532 2 3345555555443211 1111110000000000000 0000 Q ss_pred c----ccccccCCCccccccccEEEEeeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCC Q lcl|NC_017974. 197 G----VGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILN 272 (507) Q Consensus 197 ~----~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k 272 (507) . .......+..+.+|...+|+++.....+..||.+.+..+......-....++-..|...-+.|--++.++- T Consensus 147 ~~~~~~~~~~~~~~~~~~~~~diih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~---- 222 (467) T protein:vir:31 147 DPVFVDADDGSTGTSVSNPANELIFKRNHSPLYPHYGAPDIIPAVKTIRGDSAAQDYNIDFFENDGVPRIAIIVKG---- 222 (467) T ss_pred eeeeeeeccccccceeEeccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecC---- Confidence 0 00011223455678888887776666677899999998887665555555555556554444443444321 Q ss_pred CCCCCchHHHHHHHHHHHHHHHH--------hhc--CCceeEeecCcccccccccceeeeecc-CCCcchhHHHHHHHHH Q lcl|NC_017974. 273 KAAMDQSSPEAEMVRGLMSDAAN--------AHS--GEQSFFMLPSDTKDNAPQYSMTLKGID-GMGKQYSTAQLISDRK 341 (507) Q Consensus 273 ~~~~~~~~~~~~~l~a~~~~~~n--------~~~--G~~a~~iiP~g~~i~~~~~~~el~~~~-g~g~~~~~~~li~~~d 341 (507) ..-+++....+.+.+.+...+ ..+ .....++++.|++....+ ++++..+ .+.....|.+..++.. T Consensus 223 --~~l~~e~~~~~~~~~~~~~~~~~~~~~~~~~g~~n~~~~~~l~~g~~~~~~~--~~~~~ls~~~~~d~qf~e~~~~~~ 298 (467) T protein:vir:31 223 --AELTEKGREEMRNLIEDNNEDNHRTAFIETEKIVQNEDYLNLADGADRSDVE--IRLEPLTVGIDEEASFLEFRGRNE 298 (467) T ss_pred --cCCCHHHHHHHHHHHHhhhcchhhhhhhhhcccccccccccccCCCcccccc--eeEEeccccChhhHHHHHHHHHHH Confidence 112222222222222221110 000 112235667776554433 3344332 2223345777888888 Q ss_pred HHHHHHHhcccccccccchhhhh-HH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCccccceEEecCcCc Q lcl|NC_017974. 342 KSILDRLGAGFINVGNDKGGSYN-LS-ESKQTIHTQFVQRVNEIILEALNENLLPQLLALNDMRLPETEMPYVKAGEIVD 419 (507) Q Consensus 342 ~~Isk~iLGqtLts~~~~~GS~A-l~-~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~~P~~~~~~~~~ 419 (507) ++|++++.-..--.+...+|+++ -. +.........+.-.++.|++.||+.|++.....+ ..+-+|.+..... T Consensus 299 ~~Ia~~fgVpp~~lG~~~~~~~~s~~e~~~~~f~~~~l~P~~~~ie~~ln~~l~~~~~~~~------~~~i~f~~~~l~~ 372 (467) T protein:vir:31 299 HDILKVHDVPPVIAGVVESGAFSTDAEEQRKEFAEETIQPKQHDFGELLYELVHKQGLDAP------DWTIEFELAKPDT 372 (467) T ss_pred HHHHHHhCCCHHHcccCCCCCcccCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhccC------CceEEEecchhhc Confidence 89999754433222322233332 12 2222334555777889999999987766543322 1122455566677 Q ss_pred hhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhhcCCCcCCCCCCcccccCcccccccccCCCC Q lcl|NC_017974. 420 VDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMKLLGEDTSRAGDGMTKGSSGNGTGKISSTRD 499 (507) Q Consensus 420 ~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d 499 (507) .|.++.++.+..++..|++.+ +.+|+.+|+|+-.+++..+..........+..++++...........+....-| T Consensus 373 ~d~~~~~~~~~~~~~~G~~T~-----NE~R~~~Gl~pi~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 447 (467) T protein:vir:31 373 KLQDVEIASQRVQAMQGLLTV-----NELRDEFGFEPFPEEHVYGGETLVAEVTGGSGPGGGIGDQIEQLVEDRADEIID 447 (467) T ss_pred cCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCcccccCCcccccccccccCCCCcccCcCCCCCCCcccchHh Confidence 899999999999999998776 579999999865333321111000000000000000000011000111111112 Q ss_pred chhhhccC Q lcl|NC_017974. 500 NSAANLDN 507 (507) Q Consensus 500 ~~a~~~~~ 507 (507) ...++++. T Consensus 448 ~~~~~~~~ 455 (467) T protein:vir:31 448 SYQADLET 455 (467) T ss_pred hhhhcccc Confidence 22233333 No 16 >protein:vir:79772 Length: 648 # NCBI annotation: portal protein # Family: family:all:3222 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429612;genbank:gi:156564103;genbank:GeneID:5525537 Probab=99.74 E-value=9.2e-16 Score=103.02 Aligned_cols=453 Identities=12% Similarity=0.070 Sum_probs=211.2 Q ss_pred Cccc----------------ccccccccCccccccccccccccc----------------cccCchhhhHHHHhhhhccc Q lcl|NC_017974. 1 MSRK----------------RNRNRSVKVAKATSEQINVSRMRM----------------SEQGTFALAKVQVDSERMKA 48 (507) Q Consensus 1 ~~~k----------------~~~~~~~~~~~~~~~~~~~~~~~~----------------~~~g~~~~~~~~~~~~~e~~ 48 (507) |+|| -.--|-+-+-. ..-++...|..+ ..+-..|+....+.. .. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~p~~~~~~~~~~~~~~~d~~~~~~~r~g~~~~~~~~---g~ 76 (648) T protein:vir:79 1 MARKVWGRGFWSRISLMWRDEDDDKEPLVLE-ESMQLGEAPGAMPKGGGGGGSAKRDPKMSLVKRIGLAIMDGGG---GG 76 (648) T ss_pred CccchhcchhhhhhhhhccCccccccccccc-cccccCCCccccCCCCcccccccccchhHHHHHhHHHHHhhcC---Cc Confidence 4444 22212110000 000111111111 111111110000000 00 Q ss_pred chh-ccchhHHHHHHHh-hChHHHHHHHHHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHH-H Q lcl|NC_017974. 49 EEI-RWPNLIGTVESMK-QDATVATGLDMLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAA-T 125 (507) Q Consensus 49 ~~l-r~~~~~~~y~~m~-~D~~v~s~l~~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l-~ 125 (507) ..+ .-+-.+..+.++. .+++|.+|+..+...|.+++|.+....+...+......+ .++-....+..+++..++ + T Consensus 77 ~~~~epp~d~~~l~~l~~~np~V~~aI~iia~~ia~l~~~i~~~~~~~~~~~~~~~l---l~rPn~~~t~~~f~~~l~~~ 153 (648) T protein:vir:79 77 RDFEEPEFDFNEITSAYNTEGYVRQAVDKYIEMMFKADWDFVSKNPNAVEYIRMRFT---LMAEATQIPTNQLFIEIAED 153 (648) T ss_pred cccccCCcCHHHHHHHHhcChHHHHHHHHHHHHHhhCcceEEecCCccchhhHHHHH---hhccCCCCCHHHHHHHHHHH Confidence 001 1122233333332 589999999999999999998887654321111111111 122222234556776655 4 Q ss_pred HHhhcceeeeeeeeccCcccc-------CcceeeeeccccCchhcccccceeecCCCccccccccccccccccccccccc Q lcl|NC_017974. 126 FNEYGLSVVEKVYTQVTVGEY-------IGKYKVKNLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGV 198 (507) Q Consensus 126 a~~~Gfs~~Ei~w~~~~~~~~-------~g~~~~~~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~ 198 (507) .+.||-+..|++....+.... .....+..+.|-++.++. +..+++|..+. T Consensus 154 lll~GNAYveiiRd~~G~~~~~l~~~~~~~~~~v~~l~pl~p~~v~----v~~d~~g~~~~------------------- 210 (648) T protein:vir:79 154 LVKYCNVVIAKSRAKDALPFQGMNVMGVGDSMPVAGYFPLNLASMK----VKRDKFGMIKG------------------- 210 (648) T ss_pred HHhcCCeEEEEEecCCCccchhhhhhhhccccceeeeEeecCceeE----EEEcCCCceee------------------- Confidence 668999999999865431100 000011222222222221 11222221110 Q ss_pred cccc--cCCCccccccccEEEEeeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCC Q lcl|NC_017974. 199 GGVS--TRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAM 276 (507) Q Consensus 199 ~~~~--~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~ 276 (507) +.+. .....+.++++.+|++++....+.+||.|.+..|.-..-.-.....+...|....+.|--+++++.. . T Consensus 211 Y~y~~~g~~~~~~~~~~dIIHik~~~~~d~~~GlSpi~~a~~aI~l~~aa~~~~~~fF~NGa~P~gil~~~~~------~ 284 (648) T protein:vir:79 211 WQQEQEGQDKPQKFKPEDIVHIYYKREKGRAFGTPWLLPALDDIRALRQVEENVLRLVYRNLHPLWHVKVGLE------Q 284 (648) T ss_pred eEEEecCCceeEEecCccEEEEccCCCCCCceeccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCC------c Confidence 0111 1122345677777777777778889999999999988877788888888999887777666665421 1 Q ss_pred CchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhccccccc Q lcl|NC_017974. 277 DQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVG 356 (507) Q Consensus 277 ~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~ 356 (507) ...+...++++.+.....++.. ..+.+.+..+. ++.. .+....+|.+..++..++|+.++--.-.-.+ T Consensus 285 ~~~e~~k~~~e~~~~~~~~~~i--~gg~v~~~~~~-------i~~~---~s~~dlqfle~rk~~~~eIa~aFgVPP~lLG 352 (648) T protein:vir:79 285 EGFGAEEGEVDLVRGEVENMDV--EGGMVTTERVN-------ISSI---ASNQIIDAKEYLKHFEQRAFTVLGVSELMMG 352 (648) T ss_pred cchHHHHHHHHHHHHhcccccc--cccccccceee-------cccc---CCHHHHHHHHHHHHHHHHHHHHhCCCHhHcc Confidence 1223333444444433322110 01222222221 1111 1111223556667788899998765543334 Q ss_pred ccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---hCCCcCCccccceEEecCcCchhHHHHHHHHHHHH Q lcl|NC_017974. 357 NDKGGSYNLSESKQTIHTQFVQRVNEIILEALNENLLPQLLA---LNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIG 433 (507) Q Consensus 357 ~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~---lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~ 433 (507) ...+++++-++.....+...+..-...+...++..+...+.. ++. .......-+|.|+.....|.+..++.+.+++ T Consensus 353 ~~~~ss~stae~~~~~~~~~i~~l~~~i~~~le~~~~~~ll~e~~l~~-~l~~d~~ieF~~~~Llr~D~~~~a~~~~~l~ 431 (648) T protein:vir:79 353 RGGTASRSTGDNLSSDFKDRIKALQKVMATFINEFMVKEILMEGGFDP-VLNPDDKVEFRFNEIDMDSKIKLENQAVFLY 431 (648) T ss_pred cCCCccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccc-cccccceEEEeecccchhhHHHHHHHHHHHH Confidence 334555666666666666656666666666666554443321 221 0111223467787777788888899999999 Q ss_pred hcCCccccHHHHHHHHHHcCCCCCCCcccc-----cchhhh-cCCC-cCCCCCCcccccCcccccccccCCCCchhhh-- Q lcl|NC_017974. 434 AVGYLPKTPKVINRVLEVLGIDEKIEEDIS-----QEELMK-LLGE-DTSRAGDGMTKGSSGNGTGKISSTRDNSAAN-- 504 (507) Q Consensus 434 ~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~-----~~~~~~-~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~d~~a~~-- 504 (507) ..|++.+ +.+|+.+|+|+-.+++.. +..... ..+. ...+.+.+....+++....+.....+++..| T Consensus 432 ~~GilT~-----NEaR~~lGlpPi~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~eg~~~e~~~~~~~~~~~ 506 (648) T protein:vir:79 432 EHNAISE-----DEMRELIGRDPVDDGEGRAKMHLQMVTIAQATALAALAPTPAGGSSASASGDKKKKATDNKTKPTNQH 506 (648) T ss_pred hCCCcCH-----HHHHHHhCCCCCCCCCCccccccccccchhccccccCCCCCCCCCCCCccccccccccCCCCCCCCCC Confidence 9998776 579999999865443211 000000 0000 0001111111111111101111111221111 Q ss_pred --------ccC Q lcl|NC_017974. 505 --------LDN 507 (507) Q Consensus 505 --------~~~ 507 (507) -.| T Consensus 507 g~~~~~~~~~~ 517 (648) T protein:vir:79 507 GTKTSPKKQTN 517 (648) T ss_pred CcCCCCccccc Confidence 112 No 17 >protein:vir:1380 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612832;genbank:gi:20065966;genbank:GeneID:935782 Probab=99.73 E-value=8.2e-17 Score=108.78 Aligned_cols=413 Identities=10% Similarity=0.023 Sum_probs=213.9 Q ss_pred Cc------ccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHH-HHHhhChHHHHHH Q lcl|NC_017974. 1 MS------RKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTV-ESMKQDATVATGL 73 (507) Q Consensus 1 ~~------~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y-~~m~~D~~v~s~l 73 (507) |. +|++.+...+. .+.. . +...++.-+ .+..+... . ...+. +..++-+.|.+|+ T Consensus 1 MG~f~~lf~~~~~~~~~~~-~~~~-~---~~~~~~~~~-----~~~~~g~~-----~----~~~v~~~~al~~~~v~~ci 61 (422) T protein:vir:13 1 MGFLRGLFNKKNNNDEKRS-NYDE-D---IGIDISDSN-----FWEKFGIK-----L----NFSVRGKRALKENTVYVCT 61 (422) T ss_pred CchhhhhhhccCCccchhh-hhhh-c---cccccCcch-----hhhhcccc-----C----CcccchhhhhccHHHHHHH Confidence 22 22222211111 1100 0 000111100 01111100 0 00111 2234567899999 Q ss_pred HHHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHHH-HHhhcceeeeeeeeccCccccCcceee Q lcl|NC_017974. 74 DMLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAAT-FNEYGLSVVEKVYTQVTVGEYIGKYKV 152 (507) Q Consensus 74 ~~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l~-a~~~Gfs~~Ei~w~~~~~~~~~g~~~~ 152 (507) ..+-..|.++++.+.-... ...+..+..++... --...++.++++.++. .+.+|-+.+++++... |+ + T Consensus 62 ~~ia~~iA~lp~~~~~~~~-~~~~~~~~~lL~~~--PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~------G~--~ 130 (422) T protein:vir:13 62 KIRAESIGKLSLKIYKDKE-EYKEHELYYLLRYK--PNPLMSSINFWKCLETQRTLKGNAYAYIERDRK------GK--I 130 (422) T ss_pred HHHHHhhhhCceEEEecCc-ccccchHHHHHhhh--cccCCCHHHHHHHHHHHHhhcCCeEEEEEECCC------Cc--E Confidence 9999999998766533221 12222333443311 1122345677777765 6779999999987542 32 4 Q ss_pred eeccccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchH Q lcl|NC_017974. 153 KNLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSP 232 (507) Q Consensus 153 ~~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gl 232 (507) ..|.+.++.++.. ..+.||.... . ...++.+ ....+....++++.+|++++....+.++|.|. T Consensus 131 ~~L~~i~~~~v~~----~~~~~~~~~~--~--~~~~y~~---------~~~~g~~~~~~~~eiih~~~~~~~~~~~G~s~ 193 (422) T protein:vir:13 131 IGLYPINSDNVTK----IIDDDNFLSS--L--SKVWYVV---------TDKNGKEHKLLPDEMLHFIGDITLDGLIGIKP 193 (422) T ss_pred EEEEEECCcceEE----EEcCCcceec--c--ceEEEEE---------EeCCCeEEEEcccceEEEcCCCCCCCcccccH Confidence 5677777765542 2334432110 0 0001111 11122344578888888877666777899999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccc Q lcl|NC_017974. 233 LVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKD 312 (507) Q Consensus 233 l~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i 312 (507) +..+....-.-....++-..|...-+.|--++++|- .-+++...+..+.+.+. .+-..+....++++.|+++ T Consensus 194 ~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~-------~l~~e~~~~~~~~~~~~-~~g~~n~~~~~vl~~g~~~ 265 (422) T protein:vir:13 194 LDYLRCTIENGRATQEFINKFFKNGLSIKGIVQYVG-------DLDEKAKKIFKKEFESM-SNGLENAHSISLLPFGYQF 265 (422) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCC-------CCCHHHHHHHHHHHHHH-hcCccccCCceecCCCcee Confidence 999988776666677777778876666665665542 22233333332333222 2111122345788888764 Q ss_pred cccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHH-HHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 313 NAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSES-KQTIHTQFVQRVNEIILEALNEN 391 (507) Q Consensus 313 ~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~v-h~~v~~~~~~aD~~~i~~~ln~~ 391 (507) .- .+-+.....|.+..++...+|++++.-..--.+....++++-.+- .......-+.-.++.|+..||+. T Consensus 266 ~~---------l~~~~~d~q~le~~~~~~~~Ia~~fgVpp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~l~~~ 336 (422) T protein:vir:13 266 QP---------ISLSMADAQFLENSKLTKRELAATFGMKSYHLNDLERATFNNLTEQQKDFYVTTLQSSLTVYEQEIQDK 336 (422) T ss_pred ee---------ccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 32 112222334667778889999998766553333333455554333 33444566777889999999988 Q ss_pred HHHHHHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhhcC Q lcl|NC_017974. 392 LLPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMKLL 471 (507) Q Consensus 392 li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~~~ 471 (507) |+...-... ..+-+|.++.....|+++.+++++++++.|++.+ +.+|+.+|+|+-+.+|....... .. T Consensus 337 Ll~~~~~~~------g~~i~fd~~~l~r~d~~~~~~~~~~~~~~G~~T~-----NE~R~~~gl~p~~ggD~~~~~~n-~~ 404 (422) T protein:vir:13 337 LFSQYETLQ------DVKAEFNVDTILRSDIKTRYEAYRIGIQGGFIEA-----NEARRRENLPPVEGGDRLLVNGN-MI 404 (422) T ss_pred hCChhhhcC------CceEEeechhhhcCCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCcCeeeeccC-cc Confidence 776542211 1122344445556789999999999999998876 57999999997655553221111 00 Q ss_pred CCcCCCCCCcccccCccccc Q lcl|NC_017974. 472 GEDTSRAGDGMTKGSSGNGT 491 (507) Q Consensus 472 ~~~~~~~~~~~~~~~~~~~~ 491 (507) |- ...++...+++...+. T Consensus 405 ~l--~~~~~~~~~~g~~~g~ 422 (422) T protein:vir:13 405 PI--EMAGEQYKKGGEKGGK 422 (422) T ss_pred ch--hhcccccccCCCcCCC Confidence 00 0001100111111111 No 18 >protein:vir:99452 Length: 651 # NCBI annotation: hypothetical protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919077;genbank:gi:119757035;genbank:GeneID:4606105 Probab=99.73 E-value=2.3e-16 Score=106.31 Aligned_cols=471 Identities=13% Similarity=0.084 Sum_probs=222.2 Q ss_pred CcccccccccccC--ccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhh-ChHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKV--AKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQ-DATVATGLDMLY 77 (507) Q Consensus 1 ~~~k~~~~~~~~~--~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~-D~~v~s~l~~Rk 77 (507) |+-+|.+-|+..+ ..+...+...--....+++... ++.. ...+..|-.......|.+ -+-+.+|++..+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~-----~~~~~p~~~~~~L~~~~e~~~~~~~~i~~~~ 72 (651) T protein:vir:99 1 MTDTTGETQETKVHVEGLGGEADLAKSPNSTQIPDHR---IQSH-----NVGVNPPYNPDRLAAFLELNETLATGIRKKS 72 (651) T ss_pred CCCccceeeeeEEEeecccccccccccccccccchhh---hccc-----CCCCCCCCCHHHHHHHHhcChHHHHHHHHHh Confidence 9888866655432 2221111100000011222111 1111 112222223333455655 899999999999 Q ss_pred HHHhhCCceeeeCCC--CChhHHHHHHHHHHHHHh----cc--------CCCHHHHHHHHH-HHHhhcceeeeeeeeccC Q lcl|NC_017974. 78 TFVEKAFKDFKVIPG--ESEESKKAAKFVEYCLKN----ME--------GQTLRQFARDAA-TFNEYGLSVVEKVYTQVT 142 (507) Q Consensus 78 ~av~~~~~~~~~~~~--~~~~~~~~ae~v~~~l~~----~~--------~~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~ 142 (507) ..|-+++|++++... .+++.++..+-+...++. +. ..++..++..++ |-..+||+|+|++-. .. T Consensus 73 ~~iag~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~n~~~t~~~i~~~~~~Dle~tGna~ieiIrn-~~ 151 (651) T protein:vir:99 73 RYEVGFGFDLVPAQGVDGDDASDAQREVARNFWRGRSSRWQTGPNQAKTPATPERVKELARQDYHGVGWLALEMLTD-IE 151 (651) T ss_pred hhhhccCceeeecccCCCCccchHHHHHHHHHhhccchhhcccccccCCCCCHHHHHHHHHHHHHHHhhHhhhhhhc-Cc Confidence 999999999887432 223333333333333321 10 123555655443 566789999998532 11 Q ss_pred ccccCcceee-------e------------eccccCchhccc------c-------cceeecCC--Cccccccccccccc Q lcl|NC_017974. 143 VGEYIGKYKV-------K------------NLAFRPQASLSR------T-------NPIVYNED--GSAIIGIKQSLSAF 188 (507) Q Consensus 143 ~~~~~g~~~~-------~------------~l~~r~~~~i~~------~-------~~~~~~~d--~~~l~~~~q~~~~~ 188 (507) +.-.+.+.+ . -|..+|...-.. + .++....+ ++....+...+... T Consensus 152 -g~pv~L~~lp~~~~Rv~~~~~~~~~~~~~ll~~~pn~~~~~~~~~~~~q~~~~~~~~~~~~g~~~~~~~~~~~~~~~~v 230 (651) T protein:vir:99 152 -GRPVGLAYVPARTVRVRRPQNRFDQPRHPEEGRYVDGDVADIASRGYVQIRNGNRRYFGEAGDRYRGQEVVIDESGDEP 230 (651) T ss_pred -cchhhhhhcChhheeeecccccccchhhhhhhcccccccchhHHHHHHHHHhcCcceEEEeeccccceeeeeccCCcce Confidence 110000000 0 000111000000 0 00000000 00000000000000 Q ss_pred cccccccc-------------cccccccCCCccccccccEEEEeeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 189 QNYTASEI-------------GVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGAT 255 (507) Q Consensus 189 ~~~~~~~~-------------~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~E 255 (507) ......+. .............+|.+.+|++++....+.++|.|.+..+.-....-....++...|.. T Consensus 231 ~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~eViHir~~~~~~g~~G~spl~~a~~~i~~a~~a~~~~~~~f~ 310 (651) T protein:vir:99 231 TIRYREDEESEREPIFVDRETGDVTTGDANGLENRPANELIFIPNPSILEDDYGVPDWVSAIRTISADEAAKDYNRDFFD 310 (651) T ss_pred eEEeccCcceeeeeecccceeeeEEEcCCCceeEecccceEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHh Confidence 00000000 00011122233456777777777666667789999999999888877888888888887 Q ss_pred HhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCccccc--ccccceeeeeccCCC-cchh Q lcl|NC_017974. 256 KDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDN--APQYSMTLKGIDGMG-KQYS 332 (507) Q Consensus 256 r~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~--~~~~~~el~~~~g~g-~~~~ 332 (507) ..+.|--++++|- ..-+++....+.+.+.+ ...|..-.++++.+.... .....++++..+-+. .-.. T Consensus 311 NG~~p~gil~~~~------~~ls~e~~~~lr~~~~~----~~~nagk~~vL~~~~~~~~~~~~~g~~~~pls~~~~~D~q 380 (651) T protein:vir:99 311 NDTIPRMVIKVTG------GELSEESKRDLRQMLNG----LREESHRAVVLEVEKFQSQLDEDVEIELEPMGQGISEEMD 380 (651) T ss_pred ccCCCceEEEecC------CCCCHHHHHHHHHHHHH----HhccCCceEEeecccccccccccCCceEEEcCcCchhhHH Confidence 7666666666542 11233333333233332 223433455665422111 111123344433222 1235 Q ss_pred HHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCccccce Q lcl|NC_017974. 333 TAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQTI-HTQFVQRVNEIILEALNENLLPQLLALNDMRLPETEMPY 411 (507) Q Consensus 333 ~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~~v-~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~~P~ 411 (507) |.+..++...+|++++.-...-.+....+++|..+.+... ....+.-.++.|+..||+.|+....... +. .-+ T Consensus 381 fle~r~~~~~eIa~afgVPp~~lG~~~~~~~sn~E~~~~~f~~~tL~P~~~~ie~eln~kLl~~~e~~~-----~~-~i~ 454 (651) T protein:vir:99 381 FRQFREKNEHEIAKVLEVPPVKIGVTDSANRSNSDQQDKDFALEVIQPEQHTFAEWLYQIIHQQALGVT-----DW-TIE 454 (651) T ss_pred HHHHHHHHHHHHHHHhCCCHHHhccCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccccc-----Cc-eEE Confidence 6777888899999988766544444445667766655554 4667788999999999988876644322 11 113 Q ss_pred EEecC--cCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhhcCCCcCCCCCCcccccCccc Q lcl|NC_017974. 412 VKAGE--IVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMKLLGEDTSRAGDGMTKGSSGN 489 (507) Q Consensus 412 ~~~~~--~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 489 (507) |.|+. ....|.+..++.+..+++.|++.+ +.+|+.+|+|+-.++. ......+. ...+.++..+.+ ..+ T Consensus 455 ~ef~~~~llr~D~~~~~e~~~~~i~~G~~T~-----NE~R~~lglppi~~~~-gd~~l~~~---~~~~~g~~~~gg-e~~ 524 (651) T protein:vir:99 455 YELRGADQPKQEAQLAEQRVRAMRLAGVGLV-----DEAREELGLDPLGEPY-GEMTLSEF---EAEVAGDVAGGG-ETE 524 (651) T ss_pred EEeccchhhhccHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCcc-cccccccc---ccccccccccCC-CCc Confidence 55544 445788999999999999998876 5799999998643321 11111100 011111100000 000 Q ss_pred ccccccC----C-------------------CCchhhhccC Q lcl|NC_017974. 490 GTGKISS----T-------------------RDNSAANLDN 507 (507) Q Consensus 490 ~~~~~~~----~-------------------~d~~a~~~~~ 507 (507) +..+.+. + -+-+++++.- T Consensus 525 ~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~~~~v~ss~~~~ 565 (651) T protein:vir:99 525 AVHEPPEENKIGEREWDTVKSELTTKDPIEQMQFSSSNLDE 565 (651) T ss_pred ccccCccccccccchhhhhhhhhcccchhhhhhHHHHHHHh Confidence 0000000 0 0001111111 No 19 >protein:vir:93610 Length: 454 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449295;genbank:gi:157166043;interpro:IPR006427;interpro:IPR006944;uniprot:Q6H9U6;genbank:GeneID:5580432 Probab=99.72 E-value=8e-16 Score=103.35 Aligned_cols=429 Identities=14% Similarity=0.049 Sum_probs=213.7 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTFV 80 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~av 80 (507) -+--|+.++..+...+.....-.+- .+.++ ..+.+.. +.+..+ ..+..++-+.|.+|+..+-..| T Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~-~~~~g~~--------~~g~~v-~~~~al~~~~V~~~v~~Ia~~i 66 (454) T protein:vir:93 2 WNLLRRTRKNQKSGRDVREAGWTSL-----FQAVA-EPFAGAW--------QQGVKA-DPEAVLSFHAVFACISLISQDI 66 (454) T ss_pred CCccccCcccccccccccchhhhhh-----hhhhh-hhhcchh--------hcCccc-ChHHhhccHHHHHHHHHHHHhh Confidence 1111221121111111110000000 00000 0000000 000000 1234456678999999999999 Q ss_pred hhCCceeeeCCCCChhHHHHHHHHHHHHHhccC--CCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeeeeccc Q lcl|NC_017974. 81 EKAFKDFKVIPGESEESKKAAKFVEYCLKNMEG--QTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAF 157 (507) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~--~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~ 157 (507) .+++|.+.-...+ ....++.+..-..|..-++ .++.++++.++ +.+.+|-+++++++... |+ +..|.+ T Consensus 67 A~lp~~~~~~~~~-g~~~~~~~~~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~------G~--~~~L~~ 137 (454) T protein:vir:93 67 AKMRLRLMQTDAQ-GIRRETRRGDIARLCRRPNAQQNRIQFFELWLNAKLRHGNTVVLKIRNAR------GQ--IKELRI 137 (454) T ss_pred ccCceEEEEeccC-CccchhhhHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEECCC------Cc--EEEEEE Confidence 9987765332211 1111222222223322222 24567777776 56789999999998642 32 445666 Q ss_pred cCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHHH Q lcl|NC_017974. 158 RPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCY 237 (507) Q Consensus 158 r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~ 237 (507) .++.++.. ..+.||...... ..... ........+|.+.+|++++....+.++|.+.+..+. T Consensus 138 i~~~~v~v----~~~~~g~~~y~~----------~~~~~-----~~~~~~~~~~~~eViH~k~~~~~~~~~G~sp~~~~~ 198 (454) T protein:vir:93 138 LDWNRVEP----LVADDGEVFYRI----------TPDRN-----CGITEAVTVPAREVIHDRFNCFFHPLIGLPPVYAAG 198 (454) T ss_pred EcCcceEE----EEcCCCcEEEEE----------Eeccc-----cccceeEEecCcceEEeccCCCCCCceeccHHHHHH Confidence 66654431 223344211110 00000 011123456777777777666777889999999998 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCce--eEeecCccccccc Q lcl|NC_017974. 238 RAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQS--FFMLPSDTKDNAP 315 (507) Q Consensus 238 ~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a--~~iiP~g~~i~~~ 315 (507) -..-.-....++...|...-+.|--++++|-. -+++...++.+.+ ....+|..+ .++++.|++..- T Consensus 199 ~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~-------l~~e~~~~~~~~~----~~~~~g~n~g~~~vl~~g~~~~~- 266 (454) T protein:vir:93 199 LAATQGHHIQENSTSFFRNGGRPSGVIEIPGS-------ITEENAKKLKSNW----DSGYTGENAGKTAILSNGAKYNP- 266 (454) T ss_pred HHHHHHHHHHHHHHHHHhccCCccEEEecCCC-------CCHHHHHHHHHHH----HHHhcccccCCceeccCCceEEE- Confidence 88777777777777777765555555555422 2233333332333 233334333 467788875432 Q ss_pred ccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 316 QYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQ-TIHTQFVQRVNEIILEALNENLLP 394 (507) Q Consensus 316 ~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~-~v~~~~~~aD~~~i~~~ln~~li~ 394 (507) .+-+.....|.+..++...+|++++.-..--.+...+++++-.+.+. .....-+.-.++.|+..||+.|+. T Consensus 267 --------l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~L~~ 338 (454) T protein:vir:93 267 --------TTFSPVDSQTVEQLKMTAEIVCSVFRVPAYKIGVGQPPSSDNVEALEQQYYSQCLQTLIESIELLLDEALET 338 (454) T ss_pred --------cccChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Confidence 22222223456666788889999865544333333344555433333 345566777888888888876532 Q ss_pred HHHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccch-------- Q lcl|NC_017974. 395 QLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEE-------- 466 (507) Q Consensus 395 ~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~-------- 466 (507) . ...+-+|.++.....|.+..++.+.++++.|++.+ +.+|+.+|+|+-..+|..-.. T Consensus 339 ~----------~~~~~~f~~~~ll~~D~~~r~~~~~~~~~~G~~T~-----NE~R~~~gl~pi~ggD~~~~~~~~~~~~~ 403 (454) T protein:vir:93 339 G----------ENESTEFDVTTLLRMDSERRMKTLGDAVKNTLLTP-----NEARKRENLPPLAGGDALYLQQQNYSLEA 403 (454) T ss_pred C----------CCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCCeeeeccCccchHh Confidence 1 12222444455557899999999999999998776 579999999976555432100 Q ss_pred hhhcCCCcC------CCCCCcccccCcccccccccCCCCchhhhccC Q lcl|NC_017974. 467 LMKLLGEDT------SRAGDGMTKGSSGNGTGKISSTRDNSAANLDN 507 (507) Q Consensus 467 ~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~d~~a~~~~~ 507 (507) ..+.....+ .+..++........+........|.+.+.+.+ T Consensus 404 ~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~e~~~d~~~~~~~~ 450 (454) T protein:vir:93 404 LSRRDAREDPFASSGKTASVPQAVAASDGNKAITETEHDAVKAMFRG 450 (454) T ss_pred hhccCcccCCCCCCccCCCCCCCCCCCCCCCCccCCccchhhhhhhh Confidence 011000000 00000000000000111112223444444444 No 20 >protein:vir:6240 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813694;swissprot:trembl:q859c3;genbank:gi:29366754;interpro:IPR006427;interpro:IPR006944;uniprot:Q859C3;genbank:GeneID:1258894 Probab=99.70 E-value=3.5e-16 Score=105.35 Aligned_cols=426 Identities=10% Similarity=-0.016 Sum_probs=201.2 Q ss_pred Ccccc---cccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHH Q lcl|NC_017974. 1 MSRKR---NRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLY 77 (507) Q Consensus 1 ~~~k~---~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk 77 (507) |.--. +++++.......... ++..... .+.... ..+-...+. -+..++-+.|.+|+..+- T Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~-------~~~~~~~---~~~~~~------~~~~g~~v~-~~~al~~~~v~~~i~~ia 63 (457) T protein:vir:62 1 MGFWSALFGRGHSPALDAAEGRA-------WEPYDPS---IYNLGA------TASSGERVT-PHDALQVSAVFASVRLLS 63 (457) T ss_pred Cchhhhhhccccccccccccccc-------cccchhh---hhhccc------cccCCceec-hHHhhccHHHHHHHHHHH Confidence 54322 222222111111100 0000000 000000 000000111 234566788999999999 Q ss_pred HHHhhCCceeeeCCCCChhHHHHHHH-HHHHHHhc-cCCCHHHHHHHHHH-HHhhcceeeeeeeeccCccccCcceeeee Q lcl|NC_017974. 78 TFVEKAFKDFKVIPGESEESKKAAKF-VEYCLKNM-EGQTLRQFARDAAT-FNEYGLSVVEKVYTQVTVGEYIGKYKVKN 154 (507) Q Consensus 78 ~av~~~~~~~~~~~~~~~~~~~~ae~-v~~~l~~~-~~~~~~~~l~~~l~-a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~ 154 (507) ..|.++++.+.-..+... +++... +..++..- ...++.++++.++. .+++|-+++++.+.. |. +.. T Consensus 64 ~~iA~lp~~~~~~~~~~~--~~~~~~~~~~ll~~pn~~~t~~~f~~~~~~~l~l~Gna~~~i~~~~-------g~--~~~ 132 (457) T protein:vir:62 64 ETIATLPLSTYSKRGGTR--KEIDTPEWLDFPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWAG-------PN--IAG 132 (457) T ss_pred HhHhhCceEEEEecCCcc--ccccchHHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEeCC-------Cc--EEE Confidence 999998776543322211 111111 11222211 11346677777664 678999999987752 21 233 Q ss_pred ccccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHH Q lcl|NC_017974. 155 LAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLV 234 (507) Q Consensus 155 l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~ 234 (507) |.+.++..+... ....++... ..+..+.... .........+|++.+|++++....+..+|.+.+. T Consensus 133 l~~l~p~~v~v~---~~~~~~~~~-------~~~~~y~~~~-----~g~~~~~~~~~~~eiih~r~~~~~~~~~G~sp~~ 197 (457) T protein:vir:62 133 LDVLDPTKIHVH---MVMVDGLRR-------KVFEAYDIDA-----DGNEVLLGWFTPRDVLHIPGMMLPGDFVGCSPIS 197 (457) T ss_pred EEEEcCcceEEE---EeccCCccc-------eeEEEEEEcc-----CCceeEEEeeCccceEEecCCCCCCceecccHHH Confidence 444444333211 111111000 0000000000 0000011235667777776666666689999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCC---ceeEeecCccc Q lcl|NC_017974. 235 GCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGE---QSFFMLPSDTK 311 (507) Q Consensus 235 ~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~---~a~~iiP~g~~ 311 (507) .+....-.-....++.+.|...-+.|--++++|-. -+++...+ +.+...+...|. ...++++.|++ T Consensus 198 ~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~-------ls~e~~~~----~~~~~~~~~~G~~nag~~~vl~~g~~ 266 (457) T protein:vir:62 198 YARESIGLALAAQKYGAHFFRNGAMPGAVVEVPGT-------MSEEGLAR----AREAWRAANSGVDNAHRVALLTEGAK 266 (457) T ss_pred HHHHHHHHHHHHHHHHHHHHhccCCcceEEEcCCC-------CCHHHHHH----HHHHHHHHhcCccccCcceecCCCce Confidence 99888777777888888888877776666666532 22333222 222223323332 22477888875 Q ss_pred ccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhH---HHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 312 DNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNL---SESKQTIHTQFVQRVNEIILEAL 388 (507) Q Consensus 312 i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al---~~vh~~v~~~~~~aD~~~i~~~l 388 (507) +.- .+-+..-..|.+..++...+|++++.-...-.+....+++.. .+........-+.--++.|+..| T Consensus 267 ~~~---------l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~eq~~~~f~~~~l~P~~~~ie~~l 337 (457) T protein:vir:62 267 FSK---------VAMSPDEAQFLQTRQFQVPEIARIFGVPPHLISDATNSTSWGSGLAEQNIAFTMFSLRPWLERIEAGF 337 (457) T ss_pred EEE---------ccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHH Confidence 432 222222224666667888889988754432223222333322 22222333445566777888888 Q ss_pred HHHHHHHHHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCc--ccccch Q lcl|NC_017974. 389 NENLLPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEE--DISQEE 466 (507) Q Consensus 389 n~~li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~--~~~~~~ 466 (507) |+.|+... .....+-+|.++.....|.+..++++.+++..|++.+ +.+|+.+|+|+-.++ |..... T Consensus 338 n~~L~~~~-------~~~~~~i~fd~~~l~~~d~~~r~~~~~~~~~~G~~T~-----NE~R~~~gl~pi~~g~~D~~~~~ 405 (457) T protein:vir:62 338 NRLLFAET-------ADRFRFVKFNLDEIKRGAPKERMELWSLGLQNGIYSI-----DEVRAAEDMTPLPDGLGEKYRVP 405 (457) T ss_pred HhhhcCcc-------ccCceEEEeechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCCcceeeec Confidence 87665442 1111122344445556799999999999999998776 689999999876544 221100 Q ss_pred -------------hhhcCCCcCCCC----CCcccccCcccccccccCCCCch Q lcl|NC_017974. 467 -------------LMKLLGEDTSRA----GDGMTKGSSGNGTGKISSTRDNS 501 (507) Q Consensus 467 -------------~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~d~~ 501 (507) +.+..+....++ .+.....+.+..........|.+ T Consensus 406 ~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~ 457 (457) T protein:vir:62 406 LNLGEIGEEPEPEPAPAPPAIDPPAEEPADDEEPDNAEGDPDEGETEDDDDA 457 (457) T ss_pred cccccccccccccccCCCccCCCCccCCCCCCCCCCCCCCCccccccccccC Confidence 000000000000 00000111111111111112221 No 21 >protein:vir:102080 Length: 429 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512313;genbank:gi:89152482;genbank:GeneID:3953073 Probab=99.70 E-value=3.5e-16 Score=105.33 Aligned_cols=420 Identities=12% Similarity=0.027 Sum_probs=211.4 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHH-HHHHhhChHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGT-VESMKQDATVATGLDMLYTF 79 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~-y~~m~~D~~v~s~l~~Rk~a 79 (507) |..-++--.-.+..++.......+ + ..+..+-+.. .. .+.+ -+..++.+.|.+|+..+-.. T Consensus 1 M~~~~~~f~~~~r~~~~~~~~~~~-------~-~~~~~~~g~~--~~--------~~~v~~~~al~~~~v~~~i~~ia~~ 62 (429) T protein:vir:10 1 MDSVKKFFNFEKRQTSQVIELNKD-------D-EKLLEWLGIS--PS--------TISVKGKNALKVATVFACIKILSES 62 (429) T ss_pred CchhhhhhcccccCcccccccCCC-------h-HHHHHHhcCC--CC--------cceechhhhhccHHHHHHHHHHHHh Confidence 443322110000011100000000 0 0000010000 00 0111 12345678999999999999 Q ss_pred HhhCCceeeeCCCCChhHHHHHHHHHHHHHhcc--CCCHHHHHHHHHH-HHhhcceeeeeeeeccCccccCcceeeeecc Q lcl|NC_017974. 80 VEKAFKDFKVIPGESEESKKAAKFVEYCLKNME--GQTLRQFARDAAT-FNEYGLSVVEKVYTQVTVGEYIGKYKVKNLA 156 (507) Q Consensus 80 v~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~--~~~~~~~l~~~l~-a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~ 156 (507) |.++++.+..... +...+....-+..+|+.-+ ..++.++++.++. .+.+|-+++++++... |+ +..|. T Consensus 63 ia~l~~~~~~~~~-~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~------G~--~~~L~ 133 (429) T protein:vir:10 63 VSKLPLKIYQEDE-YGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRK------GK--VQALW 133 (429) T ss_pred hccCceEEEEecC-CceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCC------Cc--EEEEE Confidence 9988776533221 1111111112344443222 2345677777665 5779999999987542 32 44566 Q ss_pred ccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHH Q lcl|NC_017974. 157 FRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGC 236 (507) Q Consensus 157 ~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~ 236 (507) +.++.++.. ..+.++. ..... .........+....+|++.+|++++....+.++|.|.+..+ T Consensus 134 ~i~~~~v~v----~~~~~~~--~~~~~------------~~~~~~~~~g~~~~~~~~evih~~~~~~~~~~~G~s~i~~~ 195 (429) T protein:vir:10 134 PIDASKVTV----YIDDVGL--LNSKT------------KMWYVVNTGGQQRVLKPEEILHFKNGITLDGLVGVPTMEYL 195 (429) T ss_pred EEcCceeEE----EEcCccc--ccccc------------eEEEEEccCCeEEEEccccEEEecCCCCCCCcccccHHHHH Confidence 666654431 1222221 10000 00011122233456788887777776667778899999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccc Q lcl|NC_017974. 237 YRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQ 316 (507) Q Consensus 237 ~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~ 316 (507) .-..-.-....++...+.+.-+.|--++++|-. -+++...+..+.+.+.-.. ..+....+++|.|+++.- T Consensus 196 ~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~-------l~~e~~~~~~~~~~~~~~g-~~n~~~~~vl~~g~~~~~-- 265 (429) T protein:vir:10 196 KSTLENSASADKFINNFYKQGLQVKGLVQYVGD-------LNEDAKKVFRENFESMSSG-LQNSHRIALMPVGYQFQP-- 265 (429) T ss_pred HHHHHHHHHHHHHHHHHHhccCCccEEEEcCCC-------CCHHHHHHHHHHHHHHhcc-ccccCceeecCCCceEEE-- Confidence 887777777777778887766655555555422 2222222222222222111 112234578888876432 Q ss_pred cceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 317 YSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQ-TIHTQFVQRVNEIILEALNENLLPQ 395 (507) Q Consensus 317 ~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~-~v~~~~~~aD~~~i~~~ln~~li~~ 395 (507) .+-+.....+.+..++..++|++++.-..--.+....|+++-.+-+. .....-+.-.++.|+..||+.|+.. T Consensus 266 -------l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~kl~~~ 338 (429) T protein:vir:10 266 -------ISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQFYTDTLQATLTMYEQEMTYKLFLD 338 (429) T ss_pred -------ccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCh Confidence 22222223455666788888999876665333333445665433333 3456667778888999999877544 Q ss_pred HHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhh----cC Q lcl|NC_017974. 396 LLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMK----LL 471 (507) Q Consensus 396 l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~----~~ 471 (507) --.- ...+-+|.++.....|+++.++++++|+..|++.+ +.+|+.+|+|+-+..|.......- .. T Consensus 339 ~~~~------~g~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~-----NE~R~~~gl~p~~ggD~~~~~~n~~~~d~~ 407 (429) T protein:vir:10 339 SELD------KGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKP-----NEARSKEDLPPEAGGDRLLVNGNMLPIDMA 407 (429) T ss_pred hhcC------CCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCcCeeeecccccchhhc Confidence 3211 11122344445567799999999999999998876 579999999875544432211110 01 Q ss_pred CCcCCCCCCcccccCcccccccccCCC Q lcl|NC_017974. 472 GEDTSRAGDGMTKGSSGNGTGKISSTR 498 (507) Q Consensus 472 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 498 (507) ++.+.+.++. .++.+.+.-+.. T Consensus 408 ~~~~~k~g~~-----~~~~~~~~~e~~ 429 (429) T protein:vir:10 408 GQAYLKGGDT-----NGEVSKEGNEGN 429 (429) T ss_pred cccccCCCCC-----CCCCCCCCCCCC Confidence 1111111111 011111100111 No 22 >protein:vir:101648 Length: 518 # NCBI annotation: gp11 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654766;genbank:gi:109302764;genbank:GeneID:4156082 Probab=99.69 E-value=1.7e-15 Score=101.59 Aligned_cols=421 Identities=10% Similarity=0.027 Sum_probs=205.2 Q ss_pred Ccccccccccccc--ccccccCc-hhhhHHHHhhhhcccchhccchhHHHHHHH-hhChHHHHHHHHHHHHHhhCCceee Q lcl|NC_017974. 13 VAKATSEQINVSR--MRMSEQGT-FALAKVQVDSERMKAEEIRWPNLIGTVESM-KQDATVATGLDMLYTFVEKAFKDFK 88 (507) Q Consensus 13 ~~~~~~~~~~~~~--~~~~~~g~-~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m-~~D~~v~s~l~~Rk~av~~~~~~~~ 88 (507) |--.....+.+|- -..+.+.. .+..+..+. +-.....+|-.+ +..+.|.+|+..+-..|.++++.+. T Consensus 1 ~~~~~~~~~~~p~~~e~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~a~~~~~V~acV~~IA~~iA~lpl~l~ 71 (518) T protein:vir:10 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGM---------QLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCM 71 (518) T ss_pred CcccCceeecCchhhhhhhhhhcccccccccce---------ecccccchhhHHHhhhHHHHHHHHHHHHhhccCceEEE Confidence 2222222222221 00000000 000011110 001112233333 3578999999999999998877654 Q ss_pred eCCCCChhHHHHHHH-HHHHHHhccC--CCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeeeeccccCchhcc Q lcl|NC_017974. 89 VIPGESEESKKAAKF-VEYCLKNMEG--QTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAFRPQASLS 164 (507) Q Consensus 89 ~~~~~~~~~~~~ae~-v~~~l~~~~~--~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~r~~~~i~ 164 (507) ...++... +..+. +..++. -++ .++.++++.++ +.+++|-+++++++... |+ +..|.+.++..+. T Consensus 72 ~~~~~~~~--~~~~~~~~~Ll~-~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~------G~--~~~L~~l~p~~v~ 140 (518) T protein:vir:10 72 FTSGDTET--EESDTGYAKLLA-DPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKS------GT--PEKLMPMHPSRVA 140 (518) T ss_pred EEcCCCce--eccchHHHHHHc-CCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCC------Cc--EEEEEEECCCceE Confidence 33222111 11111 222222 122 24567777776 46679999999887543 32 3445555554332 Q ss_pred cccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHHHHHHHHHH Q lcl|NC_017974. 165 RTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYRAWREKI 244 (507) Q Consensus 165 ~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~~~~fK~ 244 (507) +..+.++..+... +.... ......+.+|.+.+|++++....+.++|.+.+..+......-. T Consensus 141 ----v~~~~~~~~~~y~---------~~~~~------~~~~~~~~~~~~eViHir~~s~dg~~~G~spi~~a~~~i~~~~ 201 (518) T protein:vir:10 141 ----IKRNSRTGRYEYY---------FQAGA------GVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSED 201 (518) T ss_pred ----EEEcCCCCEEEEE---------EEecC------CccceEEEecCCcEEEecCCCCCcccccccHHHHHHHHHHHHH Confidence 1112221111110 00000 0111223567777776666655666789999999988887777 Q ss_pred HHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCC---ceeEeecCcccccccccceee Q lcl|NC_017974. 245 LIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGE---QSFFMLPSDTKDNAPQYSMTL 321 (507) Q Consensus 245 ~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~---~a~~iiP~g~~i~~~~~~~el 321 (507) ...++-+.|...-+.|--++++|-. -+++...+..+.+.+. ..|. ...++++.|+++.- T Consensus 202 a~~~~~~~~f~ng~~p~gil~~~~~-------ls~e~~~~~k~~~~~~----~~G~~nag~v~vL~~G~~~~~------- 263 (518) T protein:vir:10 202 SSRNATAAMWKNAGRPNLVLRHEKR-------LSEAAQQRLREQFDRA----HSGSSNTGKTMVVEEGMEPIP------- 263 (518) T ss_pred HHHHHHHHHHhcCCCccEEEecCCC-------CCHHHHHHHHHHHHHH----hcCccccCcceEcCCCceEEE------- Confidence 7888888888876666556665522 2233333222223222 2231 23477788875432 Q ss_pred eeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhC Q lcl|NC_017974. 322 KGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQ-TIHTQFVQRVNEIILEALNENLLPQLLALN 400 (507) Q Consensus 322 ~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~-~v~~~~~~aD~~~i~~~ln~~li~~l~~lN 400 (507) .+-+..-..|.+..++...+|++++--..--.+....++++-.+.+. .....-+.-.++.|+..||+.|+..+ T Consensus 264 --l~~s~~D~q~le~r~~~~~eIa~afgVPp~~lg~~~~~t~sn~eq~~~~f~~~tL~P~l~~ie~~ln~~L~~~~---- 337 (518) T protein:vir:10 264 --LQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQYW---- 337 (518) T ss_pred --ccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc---- Confidence 22222223466667788899999876544333333345665444333 33445577788899999998665432 Q ss_pred CCcCCccccceEEec--CcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCC--Ccccc------cchhh-- Q lcl|NC_017974. 401 DMRLPETEMPYVKAG--EIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKI--EEDIS------QEELM-- 468 (507) Q Consensus 401 ~~~~~~~~~P~~~~~--~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~--~~~~~------~~~~~-- 468 (507) . ... +|+|+ ..-..|.+..++++.+++..|++.+ +.+|+.+|+|+-. ..+.. .+... T Consensus 338 ---~-~~~--~~~fd~~~llr~D~~~r~~~~~~~~~~G~lT~-----NE~R~~~Gl~pie~~~gD~~~~~~n~~pl~~~~ 406 (518) T protein:vir:10 338 ---V-RKN--RMKFDIDDVIQPDWEAKSESTQKMVNSGVATP-----NEGREIMGLPRSDDPKADELYANSALQPLGATP 406 (518) T ss_pred ---c-CCc--eEEEechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCCCCeeeecccceeccccc Confidence 1 112 35554 4556899999999999999998776 5799999998643 22221 01000 Q ss_pred hcCCCcCCCCCCcccccC----cccc-ccccc----CCCCchhhhccC Q lcl|NC_017974. 469 KLLGEDTSRAGDGMTKGS----SGNG-TGKIS----STRDNSAANLDN 507 (507) Q Consensus 469 ~~~~~~~~~~~~~~~~~~----~~~~-~~~~~----~~~d~~a~~~~~ 507 (507) ....+.+.++.....+.+ .+.+ ++.-+ ..-+......-+ T Consensus 407 ~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 454 (518) T protein:vir:10 407 DGAVEGEEAPAPKRPASTPVASLDQSPPTSVPGLSPTNSDRSTDSGKT 454 (518) T ss_pred ccccCCCCCCCCCCCCccccccccccccccCCCCCccccccccccccc Confidence 000000000000000000 0000 00000 000011111111 No 23 >protein:vir:105002 Length: 432 # NCBI annotation: putative phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459967;genbank:gi:85701382;genbank:GeneID:3882143 Probab=99.69 E-value=7.3e-16 Score=103.58 Aligned_cols=406 Identities=13% Similarity=0.062 Sum_probs=208.6 Q ss_pred cCchhhhHHHHhhhhc-----c-------cchh-cc----chhHHHH-HHHhhChHHHHHHHHHHHHHhhCCceeeeCCC Q lcl|NC_017974. 31 QGTFALAKVQVDSERM-----K-------AEEI-RW----PNLIGTV-ESMKQDATVATGLDMLYTFVEKAFKDFKVIPG 92 (507) Q Consensus 31 ~g~~~~~~~~~~~~~e-----~-------~~~l-r~----~~~~~~y-~~m~~D~~v~s~l~~Rk~av~~~~~~~~~~~~ 92 (507) +|-+ .+...+-.++ + .+.+ .+ +..+.+- +..++.+.|.+|+..+-..|.++++.+..... T Consensus 1 M~~~--~r~~~~~~~~~r~~~~~~~~~~~~~~~~~~~g~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~ 78 (432) T protein:vir:10 1 MKIV--DSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPSTISVKGKNALKVATVFACIKILSESVSKLPLKIYQEDE 78 (432) T ss_pred CChH--HHHHHhcCccccCcccccccCCchHHHHHHhCCCcCccccchhhhhccHHHHHHHHHHHHhhccCceEEEEecC Confidence 2211 0000000000 0 0000 00 0111111 23567889999999999999998876533321 Q ss_pred CChhHHHHHHHHHHHHHhc--cCCCHHHHHHHHHH-HHhhcceeeeeeeeccCccccCcceeeeeccccCchhcccccce Q lcl|NC_017974. 93 ESEESKKAAKFVEYCLKNM--EGQTLRQFARDAAT-FNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAFRPQASLSRTNPI 169 (507) Q Consensus 93 ~~~~~~~~ae~v~~~l~~~--~~~~~~~~l~~~l~-a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~r~~~~i~~~~~~ 169 (507) +. ..+....-+..+|+.- +..++.++++.++. .+.+|-+.+++++... |+ +..|.+.++.++.. T Consensus 79 ~~-~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~------G~--~~~L~~i~~~~v~v---- 145 (432) T protein:vir:10 79 YG-IQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRK------GK--VQALWPIDASKVTV---- 145 (432) T ss_pred Cc-eeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCC------Cc--EEEEEEEcCceeEE---- Confidence 11 1111111233444321 22356778887765 5779999999988643 22 44566666554321 Q ss_pred eecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 170 VYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYRAWREKILIENL 249 (507) Q Consensus 170 ~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~ 249 (507) ..+.++ +..... .........+....+|+..+|++++....+..+|.|.+..+....-.-....++ T Consensus 146 ~~d~~~--~~~~~~------------~~~y~~~~~g~~~~~~~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~ 211 (432) T protein:vir:10 146 YIDDVG--LLNSKT------------KMWYVVNTGGQQRVLKPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKF 211 (432) T ss_pred EEcCcc--cccccc------------eEEEEEecCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHH Confidence 122221 110000 000111122334567888877776665666788999999998877777777777 Q ss_pred HHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccccceeeeeccCCCc Q lcl|NC_017974. 250 EVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGK 329 (507) Q Consensus 250 w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~ 329 (507) -+.+...-+.|--++++|-. -+++...+..+.+.+.-... .++...+++|.|++++- .+-+.. T Consensus 212 ~~~~~~ng~~p~gil~~~~~-------l~~e~~~~~~~~~~~~~~g~-~n~~~~~vl~~g~~~~~---------l~~~~~ 274 (432) T protein:vir:10 212 INNFYKQGLQVKGLVQYVGD-------LNEDAKKVFRENFESMSSGL-QNSHRIALMPVGYQFQP---------ISLNMS 274 (432) T ss_pred HHHHHhccCCccEEEEcCCC-------CCHHHHHHHHHHHHHHhccc-ccCCcceecCCCceEEE---------ccCChh Confidence 77887766656555555432 22222222222222221100 12234578888875432 222222 Q ss_pred chhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCccc Q lcl|NC_017974. 330 QYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQ-TIHTQFVQRVNEIILEALNENLLPQLLALNDMRLPETE 408 (507) Q Consensus 330 ~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~-~v~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~ 408 (507) ...+.+..++..++|++++.-..--.+....|+|+-.+-+. .....-+.-.++.|+..||+.|+..--. .... T Consensus 275 d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~~~~~~~~~~l~P~~~~ie~~ln~kLl~~~~~------~~g~ 348 (432) T protein:vir:10 275 DAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQFYTDTLQATLTMYEQEMTYKLFLDSEL------DKGF 348 (432) T ss_pred HHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcChhhc------CCCc Confidence 23456667788899999876654334333445666433333 3445667778888999999877644211 1112 Q ss_pred cceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhh----hcCCCcCCCCCCcccc Q lcl|NC_017974. 409 MPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELM----KLLGEDTSRAGDGMTK 484 (507) Q Consensus 409 ~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~ 484 (507) +.+|.++.....|.++.++++++++..|++.+ +++|+.+|+|+-+.+|....... ....+...+.++.... T Consensus 349 ~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~-----NE~R~~~g~~pi~ggD~~~~~~n~~~~~~~~~~~~k~~~~~~~ 423 (432) T protein:vir:10 349 YSKFNVDAILRADIKTRYEAYRTGIQGGFLKP-----NEARSKEDLPPEAGGDRLLVNGNMLPIDMAGQAYLKGGDTNGE 423 (432) T ss_pred EEEeechhhhcCCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCCeEeecccccchhhccccccCCCCCCCC Confidence 22344455667899999999999999999876 57999999987544443211111 0011111111111000 Q ss_pred cCcccccccccCCCCchhhhccC Q lcl|NC_017974. 485 GSSGNGTGKISSTRDNSAANLDN 507 (507) Q Consensus 485 ~~~~~~~~~~~~~~d~~a~~~~~ 507 (507) . +. ..|--| T Consensus 424 ~-~~-------------~~~~~~ 432 (432) T protein:vir:10 424 V-SK-------------EGNEGN 432 (432) T ss_pred C-CC-------------CCCCCC Confidence 0 00 001111 No 24 >protein:vir:107605 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338186;genbank:gi:77020175;genbank:GeneID:3703736 Probab=99.69 E-value=7.3e-16 Score=103.58 Aligned_cols=406 Identities=13% Similarity=0.062 Sum_probs=208.6 Q ss_pred cCchhhhHHHHhhhhc-----c-------cchh-cc----chhHHHH-HHHhhChHHHHHHHHHHHHHhhCCceeeeCCC Q lcl|NC_017974. 31 QGTFALAKVQVDSERM-----K-------AEEI-RW----PNLIGTV-ESMKQDATVATGLDMLYTFVEKAFKDFKVIPG 92 (507) Q Consensus 31 ~g~~~~~~~~~~~~~e-----~-------~~~l-r~----~~~~~~y-~~m~~D~~v~s~l~~Rk~av~~~~~~~~~~~~ 92 (507) +|-+ .+...+-.++ + .+.+ .+ +..+.+- +..++.+.|.+|+..+-..|.++++.+..... T Consensus 1 M~~~--~r~~~~~~~~~r~~~~~~~~~~~~~~~~~~~g~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~ 78 (432) T protein:vir:10 1 MKIV--DSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPSTISVKGKNALKVATVFACIKILSESVSKLPLKIYQEDE 78 (432) T ss_pred CChH--HHHHHhcCccccCcccccccCCchHHHHHHhCCCcCccccchhhhhccHHHHHHHHHHHHhhccCceEEEEecC Confidence 2211 0000000000 0 0000 00 0111111 23567889999999999999998876533321 Q ss_pred CChhHHHHHHHHHHHHHhc--cCCCHHHHHHHHHH-HHhhcceeeeeeeeccCccccCcceeeeeccccCchhcccccce Q lcl|NC_017974. 93 ESEESKKAAKFVEYCLKNM--EGQTLRQFARDAAT-FNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAFRPQASLSRTNPI 169 (507) Q Consensus 93 ~~~~~~~~ae~v~~~l~~~--~~~~~~~~l~~~l~-a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~r~~~~i~~~~~~ 169 (507) +. ..+....-+..+|+.- +..++.++++.++. .+.+|-+.+++++... |+ +..|.+.++.++.. T Consensus 79 ~~-~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~------G~--~~~L~~i~~~~v~v---- 145 (432) T protein:vir:10 79 YG-IQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRK------GK--VQALWPIDASKVTV---- 145 (432) T ss_pred Cc-eeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCC------Cc--EEEEEEEcCceeEE---- Confidence 11 1111111233444321 22356778887765 5779999999988643 22 44566666554321 Q ss_pred eecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 170 VYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYRAWREKILIENL 249 (507) Q Consensus 170 ~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~ 249 (507) ..+.++ +..... .........+....+|+..+|++++....+..+|.|.+..+....-.-....++ T Consensus 146 ~~d~~~--~~~~~~------------~~~y~~~~~g~~~~~~~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~ 211 (432) T protein:vir:10 146 YIDDVG--LLNSKT------------KMWYVVNTGGQQRVLKPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKF 211 (432) T ss_pred EEcCcc--cccccc------------eEEEEEecCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHH Confidence 122221 110000 000111122334567888877776665666788999999998877777777777 Q ss_pred HHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccccceeeeeccCCCc Q lcl|NC_017974. 250 EVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGK 329 (507) Q Consensus 250 w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~ 329 (507) -+.+...-+.|--++++|-. -+++...+..+.+.+.-... .++...+++|.|++++- .+-+.. T Consensus 212 ~~~~~~ng~~p~gil~~~~~-------l~~e~~~~~~~~~~~~~~g~-~n~~~~~vl~~g~~~~~---------l~~~~~ 274 (432) T protein:vir:10 212 INNFYKQGLQVKGLVQYVGD-------LNEDAKKVFRENFESMSSGL-QNSHRIALMPVGYQFQP---------ISLNMS 274 (432) T ss_pred HHHHHhccCCccEEEEcCCC-------CCHHHHHHHHHHHHHHhccc-ccCCcceecCCCceEEE---------ccCChh Confidence 77887766656555555432 22222222222222221100 12234578888875432 222222 Q ss_pred chhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCccc Q lcl|NC_017974. 330 QYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQ-TIHTQFVQRVNEIILEALNENLLPQLLALNDMRLPETE 408 (507) Q Consensus 330 ~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~-~v~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~ 408 (507) ...+.+..++..++|++++.-..--.+....|+|+-.+-+. .....-+.-.++.|+..||+.|+..--. .... T Consensus 275 d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~~~~~~~~~~l~P~~~~ie~~ln~kLl~~~~~------~~g~ 348 (432) T protein:vir:10 275 DAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQFYTDTLQATLTMYEQEMTYKLFLDSEL------DKGF 348 (432) T ss_pred HHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcChhhc------CCCc Confidence 23456667788899999876654334333445666433333 3445667778888999999877644211 1112 Q ss_pred cceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhh----hcCCCcCCCCCCcccc Q lcl|NC_017974. 409 MPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELM----KLLGEDTSRAGDGMTK 484 (507) Q Consensus 409 ~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~ 484 (507) +.+|.++.....|.++.++++++++..|++.+ +++|+.+|+|+-+.+|....... ....+...+.++.... T Consensus 349 ~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~-----NE~R~~~g~~pi~ggD~~~~~~n~~~~~~~~~~~~k~~~~~~~ 423 (432) T protein:vir:10 349 YSKFNVDAILRADIKTRYEAYRTGIQGGFLKP-----NEARSKEDLPPEAGGDRLLVNGNMLPIDMAGQAYLKGGDTNGE 423 (432) T ss_pred EEEeechhhhcCCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCCeEeecccccchhhccccccCCCCCCCC Confidence 22344455667899999999999999999876 57999999987544443211111 0011111111111000 Q ss_pred cCcccccccccCCCCchhhhccC Q lcl|NC_017974. 485 GSSGNGTGKISSTRDNSAANLDN 507 (507) Q Consensus 485 ~~~~~~~~~~~~~~d~~a~~~~~ 507 (507) . +. ..|--| T Consensus 424 ~-~~-------------~~~~~~ 432 (432) T protein:vir:10 424 V-SK-------------EGNEGN 432 (432) T ss_pred C-CC-------------CCCCCC Confidence 0 00 001111 No 25 >protein:vir:102855 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338135;genbank:gi:77020228;genbank:GeneID:3703764 Probab=99.69 E-value=7.3e-16 Score=103.58 Aligned_cols=406 Identities=13% Similarity=0.062 Sum_probs=208.6 Q ss_pred cCchhhhHHHHhhhhc-----c-------cchh-cc----chhHHHH-HHHhhChHHHHHHHHHHHHHhhCCceeeeCCC Q lcl|NC_017974. 31 QGTFALAKVQVDSERM-----K-------AEEI-RW----PNLIGTV-ESMKQDATVATGLDMLYTFVEKAFKDFKVIPG 92 (507) Q Consensus 31 ~g~~~~~~~~~~~~~e-----~-------~~~l-r~----~~~~~~y-~~m~~D~~v~s~l~~Rk~av~~~~~~~~~~~~ 92 (507) +|-+ .+...+-.++ + .+.+ .+ +..+.+- +..++.+.|.+|+..+-..|.++++.+..... T Consensus 1 M~~~--~r~~~~~~~~~r~~~~~~~~~~~~~~~~~~~g~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~ 78 (432) T protein:vir:10 1 MKIV--DSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPSTISVKGKNALKVATVFACIKILSESVSKLPLKIYQEDE 78 (432) T ss_pred CChH--HHHHHhcCccccCcccccccCCchHHHHHHhCCCcCccccchhhhhccHHHHHHHHHHHHhhccCceEEEEecC Confidence 2211 0000000000 0 0000 00 0111111 23567889999999999999998876533321 Q ss_pred CChhHHHHHHHHHHHHHhc--cCCCHHHHHHHHHH-HHhhcceeeeeeeeccCccccCcceeeeeccccCchhcccccce Q lcl|NC_017974. 93 ESEESKKAAKFVEYCLKNM--EGQTLRQFARDAAT-FNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAFRPQASLSRTNPI 169 (507) Q Consensus 93 ~~~~~~~~ae~v~~~l~~~--~~~~~~~~l~~~l~-a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~r~~~~i~~~~~~ 169 (507) +. ..+....-+..+|+.- +..++.++++.++. .+.+|-+.+++++... |+ +..|.+.++.++.. T Consensus 79 ~~-~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~------G~--~~~L~~i~~~~v~v---- 145 (432) T protein:vir:10 79 YG-IQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRK------GK--VQALWPIDASKVTV---- 145 (432) T ss_pred Cc-eeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCC------Cc--EEEEEEEcCceeEE---- Confidence 11 1111111233444321 22356778887765 5779999999988643 22 44566666554321 Q ss_pred eecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 170 VYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYRAWREKILIENL 249 (507) Q Consensus 170 ~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~ 249 (507) ..+.++ +..... .........+....+|+..+|++++....+..+|.|.+..+....-.-....++ T Consensus 146 ~~d~~~--~~~~~~------------~~~y~~~~~g~~~~~~~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~ 211 (432) T protein:vir:10 146 YIDDVG--LLNSKT------------KMWYVVNTGGQQRVLKPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKF 211 (432) T ss_pred EEcCcc--cccccc------------eEEEEEecCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHH Confidence 122221 110000 000111122334567888877776665666788999999998877777777777 Q ss_pred HHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccccceeeeeccCCCc Q lcl|NC_017974. 250 EVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGK 329 (507) Q Consensus 250 w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~ 329 (507) -+.+...-+.|--++++|-. -+++...+..+.+.+.-... .++...+++|.|++++- .+-+.. T Consensus 212 ~~~~~~ng~~p~gil~~~~~-------l~~e~~~~~~~~~~~~~~g~-~n~~~~~vl~~g~~~~~---------l~~~~~ 274 (432) T protein:vir:10 212 INNFYKQGLQVKGLVQYVGD-------LNEDAKKVFRENFESMSSGL-QNSHRIALMPVGYQFQP---------ISLNMS 274 (432) T ss_pred HHHHHhccCCccEEEEcCCC-------CCHHHHHHHHHHHHHHhccc-ccCCcceecCCCceEEE---------ccCChh Confidence 77887766656555555432 22222222222222221100 12234578888875432 222222 Q ss_pred chhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCccc Q lcl|NC_017974. 330 QYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQ-TIHTQFVQRVNEIILEALNENLLPQLLALNDMRLPETE 408 (507) Q Consensus 330 ~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~-~v~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~ 408 (507) ...+.+..++..++|++++.-..--.+....|+|+-.+-+. .....-+.-.++.|+..||+.|+..--. .... T Consensus 275 d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~~~~~~~~~~l~P~~~~ie~~ln~kLl~~~~~------~~g~ 348 (432) T protein:vir:10 275 DAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQFYTDTLQATLTMYEQEMTYKLFLDSEL------DKGF 348 (432) T ss_pred HHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcChhhc------CCCc Confidence 23456667788899999876654334333445666433333 3445667778888999999877644211 1112 Q ss_pred cceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhh----hcCCCcCCCCCCcccc Q lcl|NC_017974. 409 MPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELM----KLLGEDTSRAGDGMTK 484 (507) Q Consensus 409 ~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~ 484 (507) +.+|.++.....|.++.++++++++..|++.+ +++|+.+|+|+-+.+|....... ....+...+.++.... T Consensus 349 ~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~-----NE~R~~~g~~pi~ggD~~~~~~n~~~~~~~~~~~~k~~~~~~~ 423 (432) T protein:vir:10 349 YSKFNVDAILRADIKTRYEAYRTGIQGGFLKP-----NEARSKEDLPPEAGGDRLLVNGNMLPIDMAGQAYLKGGDTNGE 423 (432) T ss_pred EEEeechhhhcCCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCCeEeecccccchhhccccccCCCCCCCC Confidence 22344455667899999999999999999876 57999999987544443211111 0011111111111000 Q ss_pred cCcccccccccCCCCchhhhccC Q lcl|NC_017974. 485 GSSGNGTGKISSTRDNSAANLDN 507 (507) Q Consensus 485 ~~~~~~~~~~~~~~d~~a~~~~~ 507 (507) . +. ..|--| T Consensus 424 ~-~~-------------~~~~~~ 432 (432) T protein:vir:10 424 V-SK-------------EGNEGN 432 (432) T ss_pred C-CC-------------CCCCCC Confidence 0 00 001111 No 26 >protein:vir:105064 Length: 421 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006584;genbank:gi:46402090;genbank:GeneID:2777930 Probab=99.69 E-value=9.3e-16 Score=103.00 Aligned_cols=416 Identities=10% Similarity=-0.015 Sum_probs=207.0 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTFV 80 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~av 80 (507) |-..+.-++..+. .+ . + ..|..............+..+ .-+..++-+.|.+|+..+-..| T Consensus 1 m~~~~~~~~~~~~---~s-~---~------------~~w~~~~~~~~~~~~~~g~~v-t~~~al~~~~v~~~i~~Ia~~i 60 (421) T protein:vir:10 1 MFIPQMFEGKKRS---VS-G---G------------GFWEAMLGGVRSSHSKAGVMI-TPETALALSAVRACVTLLAESV 60 (421) T ss_pred CCCcchhcccccc---cC-c---c------------hhhHHHhhhhccCcccCCcee-chHHhhccHHHHHHHHHHHHhh Confidence 4333222222110 00 0 0 001111000000000001111 1234567889999999999999 Q ss_pred hhCCceeeeCCCCChhHHHHHH-HHHHHHHhccC--CCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeeeecc Q lcl|NC_017974. 81 EKAFKDFKVIPGESEESKKAAK-FVEYCLKNMEG--QTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLA 156 (507) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~ae-~v~~~l~~~~~--~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~ 156 (507) .+++|...-...+.. ..++.+ -+..+|..-++ .++.++++.++ +.+.+|-+.+++++... | .+..|. T Consensus 61 A~lp~~~~~~~~~g~-~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~------G--~~~~L~ 131 (421) T protein:vir:10 61 AQLPVELYRRDKNGG-RQRATDHPIYDLIHSQPNKKDTSFEYFEQQQGLLGLEGNCYSIIDRDGK------G--YPKELI 131 (421) T ss_pred ccCceEEEEEcCCCc-eeecccchHHHHHhhcccCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCC------C--cEEEEE Confidence 988776432211111 011111 13334432222 34567777765 57789999999887532 2 245566 Q ss_pred ccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHH Q lcl|NC_017974. 157 FRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGC 236 (507) Q Consensus 157 ~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~ 236 (507) +.++..+.. ..+.+|... +. ....+..+|.+-++++++.. .+.++|.|.+..+ T Consensus 132 ~l~~~~v~v----~~~~~g~~~----------y~------------~~~~g~~~~~~eiih~~~~~-~d~~~G~spi~~~ 184 (421) T protein:vir:10 132 PINPKKVIV----LKGPDGMPY----------YE------------IPEIGETLPMRMMHHVKVFS-LDGYIGSSPIQTN 184 (421) T ss_pred EecCceEEE----EECCCceEE----------EE------------EcCCCcEEchhhEEEecCcC-CCCcccccHHHHH Confidence 666544321 122333110 00 01122346666655555544 4568899999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccc Q lcl|NC_017974. 237 YRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQ 316 (507) Q Consensus 237 ~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~ 316 (507) ....-.-....++...+...-+.|--++++|.... ...+++...+..+...+.... ..+....+++|.|+++.- T Consensus 185 ~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~---~~~~~e~~~~~~~~~~~~~~g-~~n~~~~~vl~~g~~~~~-- 258 (421) T protein:vir:10 185 ADVLGLNLAVEEHASAVFRRGATMSGVIERPKEAP---AIKSQEKIDQLLAKWTDRYSG-INNMFSVALLQEGMSYKQ-- 258 (421) T ss_pred HHHHHHHHHHHHHHHHHHhcCCCccEEEEecCccC---ccCCHHHHHHHHHHHHHHhcC-ccccCcceecCCCceEEe-- Confidence 87776667777777888877666666666653221 111233333333333322111 011223577888875432 Q ss_pred cceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 317 YSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQ-TIHTQFVQRVNEIILEALNENLLPQ 395 (507) Q Consensus 317 ~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~-~v~~~~~~aD~~~i~~~ln~~li~~ 395 (507) .+.+.....|.+..++..++|++++.-..--.+....++++-.+.+. .....-+.-.++.|+..||+.|+.+ T Consensus 259 -------l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~f~~~tl~P~~~~ie~~ln~kL~~~ 331 (421) T protein:vir:10 259 -------MSQDNEKAQLLQSRQWGVEEVCRLYKIPPHMVQMLAKATNNNIEHQGLQFVMYTLLAWLKRHEGALQRDLLLP 331 (421) T ss_pred -------cCCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCcCCccccHHHHHHHHHHHHHHHHHHHHHHHHhhhccCc Confidence 22222333466677788899999876654333333445555433333 4444566677788888888765433 Q ss_pred HHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhhcCCCcC Q lcl|NC_017974. 396 LLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMKLLGEDT 475 (507) Q Consensus 396 l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~~~~~~~ 475 (507) -- ....+-+|..+.....|+++.++++.++++.|++.+ +.+|+.+|+|+-+.+|....... ..+..+ T Consensus 332 ~~-------~~~~~v~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~-----NE~R~~~gl~p~~ggD~~~~~~n-~~~~~~ 398 (421) T protein:vir:10 332 SE-------RRDLYIEFNVSGLLRGDQKSRYESYALGRQWGWLSV-----NDIRRMENLPPIAGGDKYLTPLN-MVDSAQ 398 (421) T ss_pred cc-------cCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCcceeeeccc-cccccc Confidence 11 011122344444556799999999999999998776 57999999997655554321111 000000 Q ss_pred CCCCCcccccCcccccccccCCCCchhhhc Q lcl|NC_017974. 476 SRAGDGMTKGSSGNGTGKISSTRDNSAANL 505 (507) Q Consensus 476 ~~~~~~~~~~~~~~~~~~~~~~~d~~a~~~ 505 (507) ..++++ .++++.++-.|+..++- T Consensus 399 ~~~~~~-------~~~~~~~~e~d~~~~~~ 421 (421) T protein:vir:10 399 IIPGDK-------KPTAQQMAEIDTILSRT 421 (421) T ss_pred cccCCC-------CcccccCcccccccccC Confidence 000110 01111111111111111 No 27 >protein:vir:1326 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047925;swissprot:trembl:q9zxb2;genbank:gi:9631143;uniprot:Q9ZXB2;genbank:GeneID:2715872 Probab=99.69 E-value=3.9e-16 Score=105.04 Aligned_cols=427 Identities=10% Similarity=-0.010 Sum_probs=202.0 Q ss_pred Cccc---ccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHH Q lcl|NC_017974. 1 MSRK---RNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLY 77 (507) Q Consensus 1 ~~~k---~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk 77 (507) |.-- .++.+........... ++.+... ...+.... .-+..+. .+..++-+.|.+|+..+- T Consensus 1 Mg~~~~l~~r~~~~~~~~~~~~~-------~~~~~~~-~~~~~~~~--------~~g~~V~-~~~al~~~~V~~~v~~Ia 63 (457) T protein:vir:13 1 MGFWSALFGRGHSPALDGIEARA-------WEPYDPS-IYNLGAVA--------ASGETVT-PHDALQVSAVFASVRLLS 63 (457) T ss_pred Cchhhhhhccccccccccccccc-------ccccchH-HHhhcccc--------cCCceec-hHHhhccHHHHHHHHHHH Confidence 5432 2222222222222211 0000000 00000000 0000111 234566788999999999 Q ss_pred HHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhcc-CCCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeeeec Q lcl|NC_017974. 78 TFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNME-GQTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVKNL 155 (507) Q Consensus 78 ~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~-~~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l 155 (507) ..|.++++.+.-..+...... ....+...++.-. ..++.++++.++ +.+++|-+.+++.+.. |. +..| T Consensus 64 ~~iA~lp~~~~~~~~~~~~~~-~~~~l~~~ln~~~n~~t~~~f~~~~~~~lll~Gna~~~i~~~~-------g~--~~~l 133 (457) T protein:vir:13 64 ETIATLPLSTYSKRGGSRKEI-VTPEWLDYPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWQG-------PN--IVGL 133 (457) T ss_pred HhhccCceEEEEecCCccccc-ccchHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEecC-------Cc--EEEE Confidence 999998776544322211111 1112233333211 124556777765 4778999999998752 11 2234 Q ss_pred cccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHH Q lcl|NC_017974. 156 AFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVG 235 (507) Q Consensus 156 ~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~ 235 (507) .+.++..+..+ ....++.. . ..+..+.. ... ........+|++.+|++++....+..+|.|.+.. T Consensus 134 ~~l~p~~v~v~---~~~~~~~~-~------~~~~~y~~---~~~--~~~~~~~~~~~~diih~~~~~~~~~~~G~s~i~~ 198 (457) T protein:vir:13 134 DVLDPTKIHVH---MVMVDGLR-R------KVFEAYDI---DAD--GNEVLLGWFTPRDVLHIPGMMLPGDFVGCSPISY 198 (457) T ss_pred EEEccCceEEE---EecCCCcc-c------eeEEEEEE---ecC--CceeeEEeeCccceEEecCCCCCCccccccHHHH Confidence 44443322211 11111100 0 00000000 000 0000112356667666666655666899999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCC---ceeEeecCcccc Q lcl|NC_017974. 236 CYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGE---QSFFMLPSDTKD 312 (507) Q Consensus 236 ~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~---~a~~iiP~g~~i 312 (507) +....-.-....++.+.|...-+.|--++++|-. -+++...+..+.+. ....|. ...+++|.|+++ T Consensus 199 ~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~-------ls~e~~~~~~~~~~----~~~~g~~nag~~~vl~~g~~~ 267 (457) T protein:vir:13 199 ARESIGLALAAQKYGSKFFANGAMPGAVVEVPGT-------MSEEGLARAREAWR----AANSGVDNAHRVALLTEGAKF 267 (457) T ss_pred HHHHHHHHHHHHHHHHHHHhcCCCcceEEEcCCC-------CCHHHHHHHHHHHH----HHhcCccccCcceecCCCceE Confidence 9888777777888888888877777666666532 22333333222222 222232 235788888754 Q ss_pred cccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhh---HHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 313 NAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYN---LSESKQTIHTQFVQRVNEIILEALN 389 (507) Q Consensus 313 ~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~A---l~~vh~~v~~~~~~aD~~~i~~~ln 389 (507) .- .+-+..-..|.+..++...+|++++--...-.+...++++. ..+........-+.-.++.|+..|| T Consensus 268 ~~---------l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~eq~~~~f~~~tl~P~~~~ie~~ln 338 (457) T protein:vir:13 268 SK---------VAMSPDEAQFLQTRQFQVPEIARIFGVPPHLISDATNSTSWGSGLAEQNIAFTMFSLRPWLERIEAGFN 338 (457) T ss_pred EE---------ccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 32 22222222355666688888998876544222322223221 1222233334455667778888888 Q ss_pred HHHHHHHHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCc--ccc-cc- Q lcl|NC_017974. 390 ENLLPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEE--DIS-QE- 465 (507) Q Consensus 390 ~~li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~--~~~-~~- 465 (507) +.|+...- ....+-+|.++.....|+++.++++.+++..|++.+ +.+|+.+|+|+-.++ +.. .+ T Consensus 339 ~~L~~~~~-------~~~~~i~fd~~~l~~~D~~~r~~~~~~~~~~G~~T~-----NE~R~~~gl~Pi~~g~~d~~~~~~ 406 (457) T protein:vir:13 339 RLLFAETA-------DRFRFVKFNLDEIKRGAPKERMELWSLGLQNGIYSI-----DEVRAAEDMTPLPDGLGEKYRVPL 406 (457) T ss_pred HhhcCccc-------cCceeEEeechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCcccceeecc Confidence 76654421 111122344445567799999999999999998876 579999999865443 211 11 Q ss_pred -----------hhhhcCCCcCCCCCCc----ccccCcccccccccCCCCch Q lcl|NC_017974. 466 -----------ELMKLLGEDTSRAGDG----MTKGSSGNGTGKISSTRDNS 501 (507) Q Consensus 466 -----------~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~d~~ 501 (507) ..++..+....+..++ ...+......+......|++ T Consensus 407 n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~d~~~~~~~~~~~~~ 457 (457) T protein:vir:13 407 NLGEVGEEPEPEPAPAPPAIEPPAEEPDEEPEPEGKPDDEGATEEDDEDDA 457 (457) T ss_pred ccccccccccccccCCCCCCCCCccccCCCCCCCCCCccccCCCCcccccC Confidence 0000000000000000 00011111111112222222 No 28 >protein:vir:3843 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050149;swissprot:trembl:q9t1f8;genbank:gi:9633041;uniprot:Q9T1F8;genbank:GeneID:1262206 Probab=99.69 E-value=9.4e-16 Score=102.96 Aligned_cols=396 Identities=11% Similarity=0.065 Sum_probs=204.5 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTFV 80 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~av 80 (507) |.=-++.++..+ ..... .+ .+.+....+ ..+. .+ ..+..++-+.|.+|+..+-..| T Consensus 1 M~~f~~~~~~~~---~~~~~--~~--~~~~~~~~~----------~~~~------~v-~~~~al~~~~V~~~v~~ia~~i 56 (397) T protein:vir:38 1 MPLLKLNKSHSQ---GFSLN--DP--DWVNFLTGG----------EAQK------YV-SADTALKNSDIFSLIMQLSGDL 56 (397) T ss_pred CcchhhhhcccC---cccCC--ch--hhhhhhcCC----------cCCc------ee-chHHhhccHHHHHHHHHHHHHH Confidence 554433221111 00000 00 000000000 0000 00 1234567888999999999999 Q ss_pred hhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeeeeccccC Q lcl|NC_017974. 81 EKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAFRP 159 (507) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~r~ 159 (507) .++++. +.. .....++.+. -...++.++++.+. +.+.+|.+++++++... | .+..|.+.+ T Consensus 57 a~~p~~--~~~------~~~~~l~~~P---N~~~s~~~f~~~~~~~lll~Gna~~~i~r~~~------g--~~~~l~~l~ 117 (397) T protein:vir:38 57 AMVRYT--SES------DRSQSIISNP---SVTANGYSFWQGMFAQLLLDGNCYAYRHKNTN------G--VDLSWEYLR 117 (397) T ss_pred hhCccc--ccc------cHHHHHHhcC---CCCCCHHHHHHHHHHHhhhcCCEEEEEEECCC------C--cEEEEEEEc Confidence 877653 321 1222222111 11235677888777 45679999999988643 2 245566666 Q ss_pred chhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHHHHH Q lcl|NC_017974. 160 QASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYRA 239 (507) Q Consensus 160 ~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~~ 239 (507) +.++. +..+.||+.+.... ... ....+....+|...+|++++....+.+||.|.+..+... T Consensus 118 ~~~v~----i~~~~~~~~~~y~~---------~~~------~~~~~~~~~~~~~eiih~~~~~~~~~~~G~s~i~~~~~~ 178 (397) T protein:vir:38 118 PSQVQ----PMLLQDGSGLIYNI---------NFD------EPAIGYMENVPAADVIHIRLLSKNGGKTGISPLSALINE 178 (397) T ss_pred CceeE----EEEcCCCceEEEEE---------Eec------cccccceeEecCccEEEecCCCCCCccccccHHHHHHHH Confidence 65443 23344553322110 000 001122345788887877777777778999999999988 Q ss_pred HHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccccce Q lcl|NC_017974. 240 WREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYSM 319 (507) Q Consensus 240 ~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~~ 319 (507) .-.-....++...+...-++|--++++|.. ..+++... ++...+...... +....++++.|+++. T Consensus 179 i~~~~~~~~~~~~~f~ng~~~~~il~~~~~-------~~~e~~~~-~~~~~~~~~~~~-n~~~~~vl~~g~~~~------ 243 (397) T protein:vir:38 179 QQIKDASNELTLKALKQSVTASAVLTIQKG-------GLLDAETR-IARSKEISKQIH-NSDGPVVIDALEDYK------ 243 (397) T ss_pred HHHHHHHHHHHHHHHhccCCccEEEEeCCC-------CCHHHHHH-HHHHHHHHhccc-ccCCceecCCCceEE------ Confidence 888888888888888887777777776532 22223222 222222222211 122346777776432 Q ss_pred eeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_017974. 320 TLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQTIHTQFVQRVNEIILEALNENLLPQLLAL 399 (507) Q Consensus 320 el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~l 399 (507) ..+.+.....|.+..++...+|++++.-..--.+.. .++++..+-........+.-.+..|+..||+.|++.+ ++ T Consensus 244 ---~l~~~~~d~~~~e~~~~~~~~Ia~afgVp~~~lg~~-~~~~~~~e~~~~~~~~~l~P~~~~ie~~ln~~l~~~~-~~ 318 (397) T protein:vir:38 244 ---PLEVKGNIASLLNQVDWTRDQIAKVYGVPDSYLNGQ-GDQQSSITQISGQYAKSLNRYVQAIVGELNDKLHANI-SA 318 (397) T ss_pred ---ecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCC-CCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhccChh-cc Confidence 233333344577788899999998765433212111 1122211212223334666778888888887765432 11 Q ss_pred CCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhhcCCCcCCCCC Q lcl|NC_017974. 400 NDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMKLLGEDTSRAG 479 (507) Q Consensus 400 N~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~~~~~~~~~~~ 479 (507) + +.| .-..|.+..++.+++|++.|++.+ +.+|+.+|+|+-..++...............+.. T Consensus 319 ~-----------~~~--~~~~d~~~~~~~~~~~~~~G~~t~-----nE~R~~lg~~p~~~~d~~~~~~~~~~~~~~~~~~ 380 (397) T protein:vir:38 319 N-----------IRF--AIDAMGDQYASTISSSVKGGTIAG-----NQARFILQNSGYLAKDLPDPEKEPQQAIQLIQQE 380 (397) T ss_pred c-----------ccc--cccCCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCccccccccccccccccccc Confidence 1 222 223577889999999999998766 5799999998654444322221111111111111 Q ss_pred CcccccCcccccccccCCCCch Q lcl|NC_017974. 480 DGMTKGSSGNGTGKISSTRDNS 501 (507) Q Consensus 480 ~~~~~~~~~~~~~~~~~~~d~~ 501 (507) .+......++.++. ||. T Consensus 381 ~g~~~~~~~~e~~~-----~~~ 397 (397) T protein:vir:38 381 GGENDGNNSDERGS-----DPE 397 (397) T ss_pred cCCCCCCCCCCCCC-----CCC Confidence 11111111111111 111 No 29 >protein:vir:7853 Length: 518 # NCBI annotation: gp10 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817460;genbank:gi:29565889;genbank:GeneID:1259085 Probab=99.68 E-value=2.3e-15 Score=100.80 Aligned_cols=422 Identities=10% Similarity=0.022 Sum_probs=204.0 Q ss_pred Cccccccccccccc--cccccC-chhhhHHHHhhhhcccchhccchhHHHHHH-HhhChHHHHHHHHHHHHHhhCCceee Q lcl|NC_017974. 13 VAKATSEQINVSRM--RMSEQG-TFALAKVQVDSERMKAEEIRWPNLIGTVES-MKQDATVATGLDMLYTFVEKAFKDFK 88 (507) Q Consensus 13 ~~~~~~~~~~~~~~--~~~~~g-~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~-m~~D~~v~s~l~~Rk~av~~~~~~~~ 88 (507) |--.....+..|-. ..+.+. +.+..+..+. +-.....++-. .++.+.|.+|+..+-..|.+++|.+. T Consensus 1 ~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~g~---------~~~~~~~~~~~~~~~~~~V~acV~~IA~~iA~lp~~l~ 71 (518) T protein:vir:78 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGM---------QLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCM 71 (518) T ss_pred CcccCceeeccchhhhhhhhhhhcccccceece---------ecccccchhhHHhhhhHHHHHHHHHHHHhhccCceEEE Confidence 22222222222210 000000 0000000000 00011122222 24679999999999999999987765 Q ss_pred eCCCCChhHHHHHHHHHHHHHhccC--CCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeeeeccccCchhccc Q lcl|NC_017974. 89 VIPGESEESKKAAKFVEYCLKNMEG--QTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAFRPQASLSR 165 (507) Q Consensus 89 ~~~~~~~~~~~~ae~v~~~l~~~~~--~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~r~~~~i~~ 165 (507) ...++..... ....+..++.+ ++ .+..+|++.++ +.+++|-+++++++... |. +..|.+.++..+. T Consensus 72 ~~~~~~~~~~-~~~~~~~Ll~~-PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~------G~--~~~L~~l~p~~Vt- 140 (518) T protein:vir:78 72 FTSGDTETEE-HDTGYAKLLAD-PCEYLDPFAFWEWVASTLDIYGETYLAIQKNKS------GT--PEKLMPMHPSRVA- 140 (518) T ss_pred EEcCCccccc-cchHHHHHHhC-CCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCC------Cc--EEEEEEECCCceE- Confidence 4332211111 11112222221 22 24556777766 45678999999987543 22 3445555554332 Q ss_pred ccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHHHHHHHHHHH Q lcl|NC_017974. 166 TNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYRAWREKIL 245 (507) Q Consensus 166 ~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~ 245 (507) +..+.++..+... +.... ......+.+|.+.+|++++....+..+|.|.+..+....-.-.. T Consensus 141 ---v~~~~~~~~~~y~---------~~~~~------~~~~~~~~~~~~eIiHir~~~~dg~~~G~Spi~~~~~~i~~~~a 202 (518) T protein:vir:78 141 ---IKRNSRTGRYEYY---------FQAGA------GVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDS 202 (518) T ss_pred ---EEEcCCCCEEEEE---------EEecC------CccceeEEecCCcEEEecCCCCCcccccccHHHHHHHHHHHHHH Confidence 1112222111110 00000 01112345677777776665555667899999999887777777 Q ss_pred HHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCC---ceeEeecCcccccccccceeee Q lcl|NC_017974. 246 IENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGE---QSFFMLPSDTKDNAPQYSMTLK 322 (507) Q Consensus 246 ~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~---~a~~iiP~g~~i~~~~~~~el~ 322 (507) ..++-+.|...-+.|--++++|-. -+++...+..+.+.+. ..|. ...++++.|++..- T Consensus 203 a~~~~~~~f~Ng~~p~gvl~~~~~-------ls~e~~~~~k~~~~~~----~~G~~nag~~~vL~~G~~~~~-------- 263 (518) T protein:vir:78 203 SRNATAAMWKNAGRPNLVLRHEKR-------LSPEAQQRLREQFDRA----HAGSSNTGKTMVVEEGMEPIP-------- 263 (518) T ss_pred HHHHHHHHHhcCCCccEEEecCCC-------CCHHHHHHHHHHHHHH----hcCcccCCceeEcCCCceEEe-------- Confidence 778888888776666666665522 2233333222222222 2231 23477788865422 Q ss_pred eccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhCC Q lcl|NC_017974. 323 GIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQT-IHTQFVQRVNEIILEALNENLLPQLLALND 401 (507) Q Consensus 323 ~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~~-v~~~~~~aD~~~i~~~ln~~li~~l~~lN~ 401 (507) .+-+..-..|.+..++...+|++++--..--.+....++|+-.+.+.. ....-+.-.++.|+..||+.|+..+ T Consensus 264 -l~~~~~d~q~le~r~~~~~eIa~afgVPp~~lg~~~~st~sn~e~~~~~f~~~tL~P~~~~ie~eln~~L~~~~----- 337 (518) T protein:vir:78 264 -LQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQYW----- 337 (518) T ss_pred -ccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc----- Confidence 222222234666677888999997765543333333456664444433 3355677788999999998765432 Q ss_pred CcCCccccceEEec--CcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCC--cccc------cchhh--h Q lcl|NC_017974. 402 MRLPETEMPYVKAG--EIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIE--EDIS------QEELM--K 469 (507) Q Consensus 402 ~~~~~~~~P~~~~~--~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~--~~~~------~~~~~--~ 469 (507) . ... +|+|+ ..-..|.+..++++.+++..|++.+ +.+|+.+|+|+-.+ .+.. .+... . T Consensus 338 --~-~~~--~~~fd~~~Llr~D~~~r~~~~~~~~~~G~lT~-----NE~R~~~gl~pie~~~gD~~~v~~n~~pl~~~~~ 407 (518) T protein:vir:78 338 --V-RKN--RMKFDIDDVIQPDWEAKSESTQKMVNSGVATP-----NEGREIMGLPRSDDPKADELYANSALQPLGATPD 407 (518) T ss_pred --c-Ccc--eEEeechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCCCceeeecccceecccccc Confidence 1 111 35554 5557899999999999999998876 57999999986432 2211 00000 0 Q ss_pred cCCCcCC--CCCCc-cccc-Ccccc-cccccCCCCchh----hhccC Q lcl|NC_017974. 470 LLGEDTS--RAGDG-MTKG-SSGNG-TGKISSTRDNSA----ANLDN 507 (507) Q Consensus 470 ~~~~~~~--~~~~~-~~~~-~~~~~-~~~~~~~~d~~a----~~~~~ 507 (507) .....+. .+.+. .+.. ..+.+ ++.-+....+++ ...-+ T Consensus 408 ~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 454 (518) T protein:vir:78 408 GAVEGEEAPAPKRPASTPVASLDQSPPASVPGLSPTNSDRSTDSGKT 454 (518) T ss_pred cccCCCCCCCCCCCCcccccccccCccccCCCCCccccccccccccc Confidence 0000000 00000 0000 00000 000000000000 11111 No 30 >protein:vir:102118 Length: 409 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699943;genbank:gi:110804051;genbank:GeneID:4206661 Probab=99.68 E-value=7.7e-16 Score=103.44 Aligned_cols=400 Identities=13% Similarity=0.083 Sum_probs=206.8 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTFV 80 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~av 80 (507) +=+|+.++++..+. .+....... .|.+..... + ..+..++-+.|.+|+..+-..| T Consensus 2 ~f~~~~~~~~~~~~--------~~~~~~~~~--~g~~~~~~~--------------v-~~~~al~~~~v~~~i~~ia~~i 56 (409) T protein:vir:10 2 LFRKGFKNQSQEIS--------IDDKKILEW--LGINPSETY--------------V-NGKSCLKQATVFGCIRILSDNI 56 (409) T ss_pred cccccccCcCCCCC--------CChHHHHHH--hcCCcCcce--------------e-chhhhhccHHHHHHHHHHHHhh Confidence 33333333321110 111000000 000000000 0 0234556788999999999999 Q ss_pred hhCCceeeeCCCCChhHHHHH-HHHHHHHHhcc--CCCHHHHHHHHHH-HHhhcceeeeeeeeccCccccCcceeeeecc Q lcl|NC_017974. 81 EKAFKDFKVIPGESEESKKAA-KFVEYCLKNME--GQTLRQFARDAAT-FNEYGLSVVEKVYTQVTVGEYIGKYKVKNLA 156 (507) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~a-e~v~~~l~~~~--~~~~~~~l~~~l~-a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~ 156 (507) .++++.+.-..+... ++. .-+..+|..-+ ..++.++++.++. .+.+|-+.+++++... |. +..|. T Consensus 57 a~lp~~~~~~~~~~~---~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~------G~--~~~L~ 125 (409) T protein:vir:10 57 SKLPIKIYQKKDGIK---RVPDHYLEYLLKLRPNPYMSSSDFWKCIEVQRNIYGNAYVALDFKKN------GE--IKGLY 125 (409) T ss_pred hhCceEEEEecCCee---eccCchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCC------Cc--EEEEE Confidence 988776532221111 111 11333443212 2346677777664 6789999999988643 22 44566 Q ss_pred ccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHH Q lcl|NC_017974. 157 FRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGC 236 (507) Q Consensus 157 ~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~ 236 (507) +.|+.++. +..+.++... -. ...++ ......+....+|.+.+|++++. ..+.++|.|.+..| T Consensus 126 ~i~~~~V~----v~~~~~~~~~--~~--~~~~y---------~~~~~~g~~~~~~~~evih~r~~-~~d~~~G~s~i~~~ 187 (409) T protein:vir:10 126 PLKSDGMK----IFVDDTGLLN--SE--NNVWY---------LYTDDLGQRHKFMSDEILHFKGL-TADGLAGLSVIELL 187 (409) T ss_pred EEcCCceE----EEEcCCcccc--cc--ceEEE---------EEEeCCceeEEeccccEEEecCc-CCCCcccccHHHHH Confidence 66665443 1223332110 00 00000 01112233456777777666654 35568999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcC---CceeEeecCccccc Q lcl|NC_017974. 237 YRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSG---EQSFFMLPSDTKDN 313 (507) Q Consensus 237 ~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G---~~a~~iiP~g~~i~ 313 (507) .-..-.-....++...+...-+.|--++++|-.+ +++. .+.+++. ..+...| ....++++.|+++. T Consensus 188 ~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l-------~~e~-~~~~~~~---~~~~~~g~~n~~~~~vl~~g~~~~ 256 (409) T protein:vir:10 188 NHLIENGKSSETYLNNFFKNGLQVKGLVQYAGDL-------NPEA-EEVFKEN---FERMSSGLKNAHRIAMLPIGYKFE 256 (409) T ss_pred HHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCCC-------CHHH-HHHHHHH---HHHHhccccccCCceecCCCceEE Confidence 9887777777777888887766665566554321 2222 2222222 2222223 22357778887543 Q ss_pred ccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 314 APQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESK-QTIHTQFVQRVNEIILEALNENL 392 (507) Q Consensus 314 ~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh-~~v~~~~~~aD~~~i~~~ln~~l 392 (507) - .+-+.....+.+..++..++|++++.-..--.+....|+++..+.+ ......-+.-.++.|+..||+.| T Consensus 257 ~---------l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~e~~~~~f~~~~l~P~~~~ie~~ln~kL 327 (409) T protein:vir:10 257 P---------ISQKLVDAQFLENSQLTIRQIASVFGVKMHQLNDLDRATHSNITEQNREFYIDTLQSILNMYELEINYKL 327 (409) T ss_pred E---------ccCChhhHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 2 2222233356677788999999987665433332234555543333 34445556677888888888765 Q ss_pred HHHHHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhhcCC Q lcl|NC_017974. 393 LPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMKLLG 472 (507) Q Consensus 393 i~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~~~~ 472 (507) +..--. +...+-+|.++.....|.++.++++.++++.|++.+ +.+|+.+|+|+-+.+|..-.... ..| T Consensus 328 ~~~~~~------~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~-----NE~R~~lgl~p~~ggD~~~~~~n-~~~ 395 (409) T protein:vir:10 328 FLISEI------KNGFYSKFNVDTILRADIKTRYESYKEAIQNGFKTP-----NEIRELEEDEPLEGGDVLLINGN-MIP 395 (409) T ss_pred cCchhc------cCCcEEEEechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCcCeeeeccC-ccc Confidence 422110 111122333445556799999999999999999877 57999999987655553211111 111 Q ss_pred CcCCCCCCcccccCccccccc Q lcl|NC_017974. 473 EDTSRAGDGMTKGSSGNGTGK 493 (507) Q Consensus 473 ~~~~~~~~~~~~~~~~~~~~~ 493 (507) -+. .++...++ |+ | T Consensus 396 ~~~--~~~~~~kg--Ge---~ 409 (409) T protein:vir:10 396 VKM--AGEQYSKG--GE---K 409 (409) T ss_pred hhh--cccccccc--CC---C Confidence 000 00000000 11 1 No 31 >protein:vir:1266 Length: 416 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690758;genbank:gi:22854998;genbank:GeneID:955213 Probab=99.68 E-value=6.2e-16 Score=103.97 Aligned_cols=405 Identities=11% Similarity=0.019 Sum_probs=212.5 Q ss_pred Ccccc-cccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKR-NRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTF 79 (507) Q Consensus 1 ~~~k~-~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~a 79 (507) |==+| -+++.... .... .... . ....+.. ... ..+..+ ..+..++-+.|.+|+..+... T Consensus 1 m~~~~~f~~~~~~~-~~~~---~~~~--------~---~~~~~~~---~~~-~~~~~v-~~~~al~~~~v~~~i~~Ia~~ 60 (416) T protein:vir:12 1 MLLERMFEKRSGSS-DHED---GFNN--------I---LLNMFGG---RKT-ASGERV-SESNSLVQPDIFACVNVLSDD 60 (416) T ss_pred CccchhcccccCcc-ccCc---cchh--------H---HHHhhcC---ccc-ccCcee-chhhhhccHHHHHHHHHHHHh Confidence 22221 11111110 0000 0000 0 0000000 000 000111 123456678899999999999 Q ss_pred HhhCCceeeeCCCCChhHHHHHH-HHHHHHHhcc--CCCHHHHHHHHHH-HHhhcceeeeeeeeccCccccCcceeeeec Q lcl|NC_017974. 80 VEKAFKDFKVIPGESEESKKAAK-FVEYCLKNME--GQTLRQFARDAAT-FNEYGLSVVEKVYTQVTVGEYIGKYKVKNL 155 (507) Q Consensus 80 v~~~~~~~~~~~~~~~~~~~~ae-~v~~~l~~~~--~~~~~~~l~~~l~-a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l 155 (507) |.++++........ .. +++.+ -+...|..-+ ..++.++++.++. .+.+|-+.+++++... | .+..| T Consensus 61 ia~l~~~~~~~~~~-~~-~~~~~~~l~~~l~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~------G--~~~~L 130 (416) T protein:vir:12 61 IAKLPIHTYKRTDG-GI-ERKPEHKSAHAVYARPNPYMTAFTWKKLMMTHVLTWGNAYSYIQFGSH------G--YPEAL 130 (416) T ss_pred hhhCceEEEEecCC-cc-ccccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCC------C--cEEEE Confidence 99988764332211 11 11111 1223332211 2346678887764 5679999999987532 2 24556 Q ss_pred cccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHH Q lcl|NC_017974. 156 AFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVG 235 (507) Q Consensus 156 ~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~ 235 (507) .+.++.++. +..+.+++.+. + .....+..+.+|...++++++.. .+.++|.|.+.. T Consensus 131 ~~l~~~~v~----v~~~~~~~~~~---------~----------~~~~~g~~~~~~~~eiih~~~~~-~~~~~G~s~i~~ 186 (416) T protein:vir:12 131 FPLRPDYTN----AYVHPTTGMLW---------Y----------QTVLNGKAIELYDYEVLHFKGLS-TDGIHGKSPIGV 186 (416) T ss_pred EEECCcceE----EEEeCCCcEEE---------E----------EEecCCeEEEecCccEEEecCcC-CCCcccccHHHH Confidence 666665443 12233332211 0 01112234567888877776554 455899999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCccccccc Q lcl|NC_017974. 236 CYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAP 315 (507) Q Consensus 236 ~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~ 315 (507) ++...-.-....++...+.+.-+.|--++++|. .-+++...+ +++-.+-+ .. ....+++|.|+++.- T Consensus 187 ~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~-------~~~~e~~~~-~~~~~~~~---~~-~~~~~vl~~g~~~~~- 253 (416) T protein:vir:12 187 VREHIGAQAAATKYNAKLYKNEATPRGILKVPA-------FLDEKPKEN-VRKEWKRV---NK-VENIAIIDYGLEYQS- 253 (416) T ss_pred HHHHHHHHHHHHHHHHHHHhcCCCCceEEecCC-------CCCHHHHHH-HHHHHHHH---hc-CCCeeecCCCceEEE- Confidence 998887777778888888887676666666553 222333333 33222222 22 234678899886532 Q ss_pred ccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 316 QYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQ-TIHTQFVQRVNEIILEALNENLLP 394 (507) Q Consensus 316 ~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~-~v~~~~~~aD~~~i~~~ln~~li~ 394 (507) .+-+.....|.+..++..++|++++--..--.+....|+++-.+.+. .....-+.-.++.|+..||+.|+. T Consensus 254 --------l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~f~~~~l~P~~~~ie~~l~~~l~~ 325 (416) T protein:vir:12 254 --------ISMPLQEAQFVESMKFNKAQISMIYKVPLHKLNELDKATFSNIEHQSIEYVRNTLQPWIVNFEQELNVKLFL 325 (416) T ss_pred --------ccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCccCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Confidence 22222333567778888999998886655333333456666544444 344666778899999999987764 Q ss_pred HHHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhh----c Q lcl|NC_017974. 395 QLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMK----L 470 (507) Q Consensus 395 ~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~----~ 470 (507) ..-... ..+-+|.++.....|.++.++++.++++.|++.+ +.+|+.+|+|+-+.+|.......- . T Consensus 326 ~~~~~~------g~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~-----NE~R~~~gl~Pi~ggd~~~~~~n~~~~~~ 394 (416) T protein:vir:12 326 DHDQKS------GHYVKFNIDSELRGDSKTQAEYLKTLHETGVLNK-----DEIRELLERNPIENGDKYISSLNYVFLDF 394 (416) T ss_pred chhhcC------CceEEeechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCcceeeeccccccccc Confidence 432111 1111233444556789999999999999998776 579999999976555532111110 0 Q ss_pred CCCcC-CCCCCcccccCcccccc Q lcl|NC_017974. 471 LGEDT-SRAGDGMTKGSSGNGTG 492 (507) Q Consensus 471 ~~~~~-~~~~~~~~~~~~~~~~~ 492 (507) ..+.+ .+++. ..+++..++.+ T Consensus 395 ~~~~~~~~~~~-~~~gge~~~~g 416 (416) T protein:vir:12 395 LEEYQRLKAGG-AMKGGDNKNEG 416 (416) T ss_pred cchhhcccccc-ccCCCCCcCCC Confidence 00000 01111 01111111111 No 32 >protein:vir:80644 Length: 551 # NCBI annotation: gp23 # Family: family:all:2446 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468463;genbank:gi:157325038;genbank:GeneID:5601615 Probab=99.67 E-value=9.1e-15 Score=97.55 Aligned_cols=441 Identities=13% Similarity=0.077 Sum_probs=199.8 Q ss_pred Cccccccccccc--------------------------Cccccccccccccccc--cccCchhhhHHHHhhhhcccchhc Q lcl|NC_017974. 1 MSRKRNRNRSVK--------------------------VAKATSEQINVSRMRM--SEQGTFALAKVQVDSERMKAEEIR 52 (507) Q Consensus 1 ~~~k~~~~~~~~--------------------------~~~~~~~~~~~~~~~~--~~~g~~~~~~~~~~~~~e~~~~lr 52 (507) |.+|-+.=+..+ ++...+........-+ ..+|+. ....++. ..+..+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~a~~~~~~~~~--~~~~~~~---~r~~~~ 75 (551) T protein:vir:80 1 MKNKLGLFESIRLVGVNKSDAVKHIEVDDNYSIAIQQREQEQISKAMNNKEVAYSQPVIGSM--SANPGFK---TKPSIR 75 (551) T ss_pred CchhhhhHHHhhhccCChhhcccccccccceeeecccccHHHHHHhhccCcceeecccccce--ecCcccc---cCcccc Confidence 111111100000 0000000000000000 001100 0000000 001111 Q ss_pred cchhH-HHHHHHhhChHHHHHHHHHHHHHhh-----------CCceeeeCCCC---ChhHHHHHHHHHHHHHhccC---- Q lcl|NC_017974. 53 WPNLI-GTVESMKQDATVATGLDMLYTFVEK-----------AFKDFKVIPGE---SEESKKAAKFVEYCLKNMEG---- 113 (507) Q Consensus 53 ~~~~~-~~y~~m~~D~~v~s~l~~Rk~av~~-----------~~~~~~~~~~~---~~~~~~~ae~v~~~l~~~~~---- 113 (507) -+..+ ++.+...+-+.|.+|+..|...|.+ ++|++.+...+ ...+.+..+.++++|..... T Consensus 76 ~~~~l~~~~~~~~~npiv~~~I~~ia~~IA~~~~~~~~~~~g~~~~i~~kd~~~~~~~~~~~~~~~i~~~l~~pn~~~~p 155 (551) T protein:vir:80 76 NNQDLHGVLKKFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIKRIESFIEKTGVDNDI 155 (551) T ss_pred ChhHHHHHHHHhhcCHHHHHHHHHHHHHHhhhhhhhhhhcCCCCceEEecccCcccChhHHHHHHHHHHHHHhcCCCCCC Confidence 11111 1222223468899999999999875 45666554321 23333333334444443211 Q ss_pred --CCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeeeeccccCchhcccccceeecCCCccccccccccccccc Q lcl|NC_017974. 114 --QTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQN 190 (507) Q Consensus 114 --~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~ 190 (507) .+|.+++..++ +.+.+|.+.+|+++... |+ +..|.+.++.++.. ..+.||..... ...++ T Consensus 156 ~~~s~~~f~~~lv~dlll~Gnay~~i~rd~~------G~--~~~L~~l~p~~V~v----~~~~~g~~~~~----~~~y~- 218 (551) T protein:vir:80 156 NRDSFSSFVKKIVRDTYMYDQVNFEKVFNRN------QS--MVRFVAKDPTTIFF----ATTADGKIPDN----GNRFV- 218 (551) T ss_pred ccchHHHHHHHHHHHHHhcCCEEEEEEECCC------Cc--EEEEEEeCCceeEE----EECCccccccC----ceEEE- Confidence 25677887766 46789999999998643 32 45566666655532 22334321100 00000 Q ss_pred cccccccccccccCCCccccccccEEEEeeCCccC---ccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecc Q lcl|NC_017974. 191 YTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSS---QALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIP 267 (507) Q Consensus 191 ~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~---~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P 267 (507) ..........++.+.+|++++....+ .+||.|.+..+......-....++-..|...-+.|--++.++ T Consensus 219 ---------~~~~g~~~~~~~~~eiiH~~~n~~~~~~~~~~G~spi~~a~~~i~~~~a~~~~~~~~f~Ng~~p~giL~~~ 289 (551) T protein:vir:80 219 ---------QVIDQKIVATFNAREMAFAVRNPRSDIYATGYGYPELEIALKQFIAHENTEAFNDRFFSHGGTTRGILQIK 289 (551) T ss_pred ---------EEeCCcEEEEEcccceEEecccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHcCCCcceEEEEc Confidence 01111223356777777776655433 578999999998888777777787788887655554444332 Q ss_pred cccCCCCCCCchHHHHHHHHHHHHHHHHhhcCC-cee--Eee-cCcccccccccceeeeeccCCCcchhHHHHHHHHHHH Q lcl|NC_017974. 268 SQILNKAAMDQSSPEAEMVRGLMSDAANAHSGE-QSF--FML-PSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKS 343 (507) Q Consensus 268 ~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~-~a~--~ii-P~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~ 343 (507) . +..-+++....+.+.+.+ ...|. .++ .++ +.|++ ++..+.+..-..|.+..++..++ T Consensus 290 ~-----~~~lt~e~~~~lk~~~~~----~~~G~~nag~~~vl~~~g~~---------~~~l~~~~~D~qfle~~~~~~~~ 351 (551) T protein:vir:80 290 A-----AQQQSQHALEIFKREWKN----SLSGINGSWQIPVVSAEDVK---------FVNMTPSARDMEFEKWLNYLINV 351 (551) T ss_pred C-----CCCCCHHHHHHHHHHHHH----HhcCccccCccccccCCCce---------EEEccCChhHHHHHHHHHHHHHH Confidence 1 111233333332222222 22232 233 234 34543 22333333333466777888899 Q ss_pred HHHHHhccccccc--------ccchh--hhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCccccceE Q lcl|NC_017974. 344 ILDRLGAGFINVG--------NDKGG--SYNLSESKQ-TIHTQFVQRVNEIILEALNENLLPQLLALNDMRLPETEMPYV 412 (507) Q Consensus 344 Isk~iLGqtLts~--------~~~~G--S~Al~~vh~-~v~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~~P~~ 412 (507) |++++.-.-.-.+ ...++ +++-.+... .....-+.-.++.|+..||+.|++.+ + ..+ +| T Consensus 352 Ia~aFgVPp~~lG~~~~~~~~~~~~~s~t~sn~e~~~~~f~~~tL~P~~~~ie~~ln~~L~~~~-------~--~~~-~f 421 (551) T protein:vir:80 352 ISALYGIDPAEINIPNNGGATGSKGGSLNEGNSAEKNQASKNKGLQPLLGFIEDFINKHIVAEF-------G--DKY-TF 421 (551) T ss_pred HHHHhcCCHHHcCcccccccccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc-------C--Cce-EE Confidence 9997643221111 11112 233333222 44566778889999999998775431 1 112 57 Q ss_pred EecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCC-CCccccc-chhhhcC--------C-------Cc- Q lcl|NC_017974. 413 KAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEK-IEEDISQ-EELMKLL--------G-------ED- 474 (507) Q Consensus 413 ~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~-~~~~~~~-~~~~~~~--------~-------~~- 474 (507) .|+.....+....++. .+++..|.+.+ +.+|+.+|+|+. +.+|... +...... + +. T Consensus 422 ~f~~~~~~~~~~~~~~-~~~~~~g~lT~-----NE~R~~~gl~P~~egGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 495 (551) T protein:vir:80 422 QFVGGDIKSELESVKI-LAEKAKVAMTV-----NEVRKELNLPGDVIGGDIPLNGVIVQRIGQLMQQEQFEHEKQQSNLQ 495 (551) T ss_pred EeeccChhhHHHHHHH-HHHHhcCCcCH-----HHHHHHhCCCCCCCCCceeecccccccccccccccCcchhhhhhccc Confidence 7887777776666654 45666787654 689999999763 3333211 0000000 0 00 Q ss_pred --CCC----CCCcccccCcccccccccCCCCchhhhccC Q lcl|NC_017974. 475 --TSR----AGDGMTKGSSGNGTGKISSTRDNSAANLDN 507 (507) Q Consensus 475 --~~~----~~~~~~~~~~~~~~~~~~~~~d~~a~~~~~ 507 (507) .+. .+......+++.+ .......|-+.++.+| T Consensus 496 ~~~~~~~~~~~~~~~~~p~~~~-~~~~~~~~~~~~~~~~ 533 (551) T protein:vir:80 496 MLQEQTGNRVSTDVEDIPDGKD-TTGDIGKDGQRKDKDN 533 (551) T ss_pred cccCcCCCCCCCCCCCCCCccc-cCCCccccccccCccc Confidence 000 0000000000000 0111223333333333 No 33 >protein:vir:100150 Length: 437 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945033;genbank:gi:38707893;genbank:GeneID:2744197 Probab=99.66 E-value=1.6e-15 Score=101.73 Aligned_cols=422 Identities=10% Similarity=0.031 Sum_probs=208.0 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTFV 80 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~av 80 (507) |++.|++....-.+... .....| ...+. ++. +..+.... ..-+..+ ..+..++-+.|.+|+..+-..| T Consensus 1 ~~~~~~~~~~~~~~~~~-~~~g~~-~s~~~-~~~----~~~~~~~~----~~~g~~v-~~~~al~~~~v~~ci~~Ia~~i 68 (437) T protein:vir:10 1 MKQGKQRALGRIKSSFL-KWLGVP-ISLTD-GSF----WSAWGGMG----SSSGETV-TADSALQLSAVWSCVRLIAETI 68 (437) T ss_pred CCcchhhhhhhhHHhhh-hhcCCc-ccCCc-hhH----HHhhcccc----cCCCcee-chHhhhccHHHHHHHHHHHHHH Confidence 76555544322111111 111111 11110 111 11111000 0000111 1244567789999999999999 Q ss_pred hhCCceeeeCCCCChhHHHHHHHHHHHHHhccC--CCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeeeeccc Q lcl|NC_017974. 81 EKAFKDFKVIPGESEESKKAAKFVEYCLKNMEG--QTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAF 157 (507) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~--~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~ 157 (507) .++++.+.....+.........-+..+|..-++ .++.++++.++ +.+++|-+.+++++.. | .+..|.+ T Consensus 69 a~lp~~~~~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~-------g--~~~~L~~ 139 (437) T protein:vir:10 69 ATLPLNLYQTKPDGTRVLAKQHRLYTVIHSQPNAENTAAEFWEVIVASMLLWGNGYARKLRSA-------G--VLIGLEL 139 (437) T ss_pred hhCceeEEEEcCCCceeeccccHHHHHhhccCCcCCCHHHHHHHHHHHHhhcCCeEEEEEecC-------C--cEEEEEE Confidence 998876432211110000000112233432122 24667777766 4577999999887742 2 2334555 Q ss_pred cCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHHH Q lcl|NC_017974. 158 RPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCY 237 (507) Q Consensus 158 r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~ 237 (507) .++..+. +..+.+|.. ... + ....+....+|.+.++++++.. .+.++|.|.+..+. T Consensus 140 l~p~~v~----i~~~~~g~~-~y~---------~---------~~~~g~~~~~~~~dIih~r~~~-~d~~~G~spi~~~~ 195 (437) T protein:vir:10 140 MLPQRTT----VKRLTSGAL-QYT---------Y---------RNVDGTVSTLAEDDVFHVRGFS-LDGLMGLTPIQYAR 195 (437) T ss_pred EcCcceE----EEECCCCeE-EEE---------E---------EecCceEEEEccccEEEecCcC-CCCcccccHHHHHH Confidence 5554332 112223321 100 0 0111223457777766666543 55689999999998 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCC--ceeEeecCccccccc Q lcl|NC_017974. 238 RAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGE--QSFFMLPSDTKDNAP 315 (507) Q Consensus 238 ~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~--~a~~iiP~g~~i~~~ 315 (507) ...-.-....++-..|.+.-+.|--++++|-. -+++...+..+.+.+. +.+.. ...++++.|++.. T Consensus 196 ~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~-------l~~e~~~~~~~~~~~~---~~g~~nag~~~vl~~g~~~~-- 263 (437) T protein:vir:10 196 EVLGNSTAANKTSASVFRNGLRPSGVLSTDQI-------LQKEKRAEIRTDLAEQ---FGGAMQAGKTMVLEAGMKYQ-- 263 (437) T ss_pred HHHHHHHHHHHHHHHHHhccCCccEEEEcCCC-------CCHHHHHHHHHHHHHH---hcCccccCcceeccCCceEE-- Confidence 77777777778888888776666666665522 2233333333333322 11111 2347778887532 Q ss_pred ccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhh--hH-HHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 316 QYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSY--NL-SESKQTIHTQFVQRVNEIILEALNENL 392 (507) Q Consensus 316 ~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~--Al-~~vh~~v~~~~~~aD~~~i~~~ln~~l 392 (507) ..+-+.....|.+..++..++|++++.-..--.+....+++ +. .+........-+.-.+..|+..||+.| T Consensus 264 -------~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~e~~~~~f~~~tl~P~~~~ie~~l~~kl 336 (437) T protein:vir:10 264 -------AITMNPGDVQLLETRAFNIEEICRWYRVPPFMVGHSEKSTSWGTGIEQQTLGFLTFTLRPWLTRIEQAARRSL 336 (437) T ss_pred -------eccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 22222233346677778889999987655433333333332 21 222233445556677788888888766 Q ss_pred HHHHHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhhcCC Q lcl|NC_017974. 393 LPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMKLLG 472 (507) Q Consensus 393 i~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~~~~ 472 (507) +..-- . ...+-+|.++..-..|.++.++++.+++..|++.+ +.+|+.+|+|+-.+++..........| T Consensus 337 l~~~e------~-~~~~~~fd~~~ll~~d~~~r~~~~~~~~~~G~~T~-----NE~R~~~gl~pi~gg~~~~~~~~~~~~ 404 (437) T protein:vir:10 337 LRPGE------R-DQFYAEFSVEGLLRADSAGRAAFYSTMTQNGLMTR-----DECRAKENLPPMGGNAAVLTVQSALLP 404 (437) T ss_pred cCccc------c-CceEEEEechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCcceEeecCcccc Confidence 44311 1 11222344445556789999999999999998876 579999999875444432111111111 Q ss_pred ----CcCCCCC---CcccccCcccccccccCCC Q lcl|NC_017974. 473 ----EDTSRAG---DGMTKGSSGNGTGKISSTR 498 (507) Q Consensus 473 ----~~~~~~~---~~~~~~~~~~~~~~~~~~~ 498 (507) ....++. ++.+.+..++...+..+-| T Consensus 405 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~ 437 (437) T protein:vir:10 405 IDKLGEHTTATAAQDALKAWLYQEEKTRATQER 437 (437) T ss_pred hhhccCcCCCcchhccccccCCCCCCCCccccC Confidence 0001111 1111111121111111112 No 34 >protein:vir:81152 Length: 411 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285809;genbank:gi:148747730;genbank:GeneID:5247195 Probab=99.66 E-value=3.6e-15 Score=99.76 Aligned_cols=403 Identities=14% Similarity=0.111 Sum_probs=210.5 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTFV 80 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~av 80 (507) |.=..+.++-.+.... ... .+ +.... .+ .. .+. -.-+..++.+.|.+|+..+-..| T Consensus 1 MG~~~~~~~~~~~~~~---~~~-----~~--~~~~~-~~--~g----~~~-------~~~~~al~~~~V~~~v~~Ia~~i 56 (411) T protein:vir:81 1 MGWWSRLTRFFRPRNE---TVD-----MT--NPLLL-QW--LG----VDP-------DTPRNQLSEATYFACLKILSESL 56 (411) T ss_pred CchHHHHHhhccCccc---ccc-----cc--hHHHH-HH--hc----Ccc-------cChhhhhccHHHHHHHHHHHHhH Confidence 4322221111110000 000 00 00000 00 00 000 01233456788999999999999 Q ss_pred hhCCceeeeCCCCChhHHHHHHHHHHHHHhc--cCCCHHHHHHHHHH-HHhhcceeeeeeeeccCccccCcceeeeeccc Q lcl|NC_017974. 81 EKAFKDFKVIPGESEESKKAAKFVEYCLKNM--EGQTLRQFARDAAT-FNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAF 157 (507) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~--~~~~~~~~l~~~l~-a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~ 157 (507) .++++.+....++... +....-+...|+.- ...++.++++.++. .+++|-+.+++++.. |. +..|.+ T Consensus 57 A~lp~~~~~~~~~~~~-~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~-------g~--~~~l~~ 126 (411) T protein:vir:81 57 GKLPLKMYQKTERGIV-KSDREELYNLLKLRPNPYMTSSVFWSTVEMNRNHYGNAYVWCQYSG-------PQ--LQALWI 126 (411) T ss_pred hhCceeEEEecCCcee-eecccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEecC-------Cc--eEEEEE Confidence 9998776433221110 00011133344322 22356778888764 678999999888751 22 334555 Q ss_pred cCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHHH Q lcl|NC_017974. 158 RPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCY 237 (507) Q Consensus 158 r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~ 237 (507) .|+.++. +..+.++ ....... .++.+ .....+....+|.+.+|++++....+..+|.+.+..+. T Consensus 127 l~~~~v~----~~~~~~~--~~~~~~~--~~~~~--------~~~~~g~~~~~~~~eiih~k~~~~~~~~~G~s~~~~~~ 190 (411) T protein:vir:81 127 LPSQYVT----IVVDDRG--LLGEKNA--IWYRY--------NDPYDGKMYVFRNDEILHFKTSVTFDGITGLSVRDVLK 190 (411) T ss_pred ECCceEE----EEEcCcc--cccccce--EEEEE--------EecCCceEEEEccccEEEEcCCCCCCCcccccHHHHHH Confidence 5554432 1122222 1110000 00000 00112234467888877777666667789999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCccccccccc Q lcl|NC_017974. 238 RAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQY 317 (507) Q Consensus 238 ~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~ 317 (507) ...-.-....++-..+...-+.|--++++|-. -+++...++.+++.+..... .+....++++.|+++.- T Consensus 191 ~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~-------l~~e~~~~~~~~~~~~~~g~-~n~g~~~vl~~g~~~~~--- 259 (411) T protein:vir:81 191 HTVDGALESQKFMNNLYKTGLTGKAVLEYTGD-------LNQEARDRLVKGFEQFANGS-KNAGKIIPVPLGMKLVP--- 259 (411) T ss_pred HHHHHHHHHHHHHHHHHhccCCCceEEEeCCC-------CCHHHHHHHHHHHHHHhcCc-cccCCceecCCCceEEE--- Confidence 88877788888888888776666666665422 22333333333333322111 11223477788875432 Q ss_pred ceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 318 SMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQ-TIHTQFVQRVNEIILEALNENLLPQL 396 (507) Q Consensus 318 ~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~-~v~~~~~~aD~~~i~~~ln~~li~~l 396 (507) .+-+.....+.+..++..++|++++.-..--.+....|+|+-.+.+. .....-+.-.++.|++.||++|+..- T Consensus 260 ------l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n~e~~~~~f~~~~l~P~~~~ie~~l~~~ll~~~ 333 (411) T protein:vir:81 260 ------LDIKLTDSQFFELKKYTALQIAAAFGIKPNQINDYEKSSYASAEAQNLAFYVDTLLYVLKQYEEEITYKILSND 333 (411) T ss_pred ------ccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcCChh Confidence 22222223466677888999999887665444444456777555444 33445566778888888887765442 Q ss_pred HHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchh----hhcCC Q lcl|NC_017974. 397 LALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEEL----MKLLG 472 (507) Q Consensus 397 ~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~----~~~~~ 472 (507) .. ....+-+|.++.....|.++.++++++++..|++.+ +.+|+.+|+|+-+.+|...... ....+ T Consensus 334 ~~------~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~g~~t~-----NE~R~~~gl~p~~ggD~~~~~~n~~pl~~~~ 402 (411) T protein:vir:81 334 LI------SQGHYFKFNVNVILRADIKTQMDSLSTAVQNGIMTP-----NEARDYLDMPADDYGNNLMANGNYIPLSMLG 402 (411) T ss_pred hc------CCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCCeeeeccCccchhhhh Confidence 11 111121333344456788999999999999998876 5799999998654444321110 11111 Q ss_pred CcCCCCCCc Q lcl|NC_017974. 473 EDTSRAGDG 481 (507) Q Consensus 473 ~~~~~~~~~ 481 (507) +..++.||. T Consensus 403 ~~~~kgGd~ 411 (411) T protein:vir:81 403 ANYGKGGDS 411 (411) T ss_pred hhhccCCCC Confidence 111111111 No 35 >protein:vir:100249 Length: 431 # NCBI annotation: gp78 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355414;genbank:gi:77864704;genbank:GeneID:3725971 Probab=99.66 E-value=2e-15 Score=101.21 Aligned_cols=412 Identities=12% Similarity=0.058 Sum_probs=204.8 Q ss_pred CcccccccccccCcccccc---ccccccccccccCch--hhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSE---QINVSRMRMSEQGTF--ALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDM 75 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~---~~~~~~~~~~~~g~~--~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~ 75 (507) |.+-|++++....+++.-. +...+. .+.-|.. +..-... ...-....+- +..+ ..+..++-+.|.+|+.. T Consensus 4 ~d~~r~~~~~~~~~~~~~~~~~~~~~~~--~~~~g~~~~~~~~~~~-~~~~~~~~~~-g~~v-~~~~al~~~~V~~ci~~ 78 (431) T protein:vir:10 4 FDFIRREKQPEAQARPHVEPSFQASTPT--TSIPGETFEGLDDPRL-KEYIRRGELN-GGTG-RETRALRNMAVLRCVTL 78 (431) T ss_pred hhhhhcCccccccccccccccccccccc--ccccccccccccchHH-HHhhccCccC-ccee-chhhhhccHHHHHHHHH Confidence 4454444433222222111 111110 1100110 0000000 0000000000 0111 12445678889999999 Q ss_pred HHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccC--CCHHHHHHHH-HHHHhhcceeeeeeeeccCccccCcceee Q lcl|NC_017974. 76 LYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEG--QTLRQFARDA-ATFNEYGLSVVEKVYTQVTVGEYIGKYKV 152 (507) Q Consensus 76 Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~--~~~~~~l~~~-l~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~ 152 (507) +-..|.++++.+.-..+... .....-+..+|..-++ .++.++++.+ .+.+++|-+++++++.. |. + T Consensus 79 Ia~~iA~lp~~v~~~~~~~~--~~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~-------g~--~ 147 (431) T protein:vir:10 79 ISGTIGMLPMNLISSDDSKQ--VLTDDPAHRLLKYKPNDWQTPMEFKSLMQLRALLDGESMARIVWSG-------NR--P 147 (431) T ss_pred HHHhhccCceEEEEecCcee--eeccchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcC-------Cc--e Confidence 99999988876533322111 1111223444443222 2455676665 45777999999998852 21 2 Q ss_pred eeccccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchH Q lcl|NC_017974. 153 KNLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSP 232 (507) Q Consensus 153 ~~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gl 232 (507) ..|.+.++.++. +..+.++... +.+. ...+..+.+|.+.++++++.. .+.++|.|. T Consensus 148 ~~L~pl~~~~v~----~~~~~~~~~~----------y~~~---------~~~g~~~~~~~~dViHir~~~-~dg~~G~sp 203 (431) T protein:vir:10 148 IRLIPMDRGSAK----GRLTSTWQIV----------YDYT---------TPTGDKIELPAREVFHLRDLS-IDGVSGVSR 203 (431) T ss_pred EEEEEEcCceeE----EEEcCCCeEE----------EEEE---------eCCceEEEEchhhEEEecCcC-CCCcccccH Confidence 345555554332 1223333211 0000 112234457777776666553 456899999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcC---CceeEeecCc Q lcl|NC_017974. 233 LVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSG---EQSFFMLPSD 309 (507) Q Consensus 233 l~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G---~~a~~iiP~g 309 (507) +..+.-..-.-....++...|...-+.|--++++|- .-+++...+..+.+. +...| ....+++|.| T Consensus 204 i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~-------~ls~e~~~~~~~~~~----~~~~g~~n~g~~~vl~~g 272 (431) T protein:vir:10 204 VKLSGNALELAEQAERAASRTFRTGVMAGGAIEVPK-------ELSDNAYGRMKASVQ----ENHTGSENAGSWMLLEEG 272 (431) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCccEEEecCC-------CCCHHHHHHHHHHHH----HHhcCccccCCceecCCC Confidence 999987777777777888888876666655665542 222333322222222 22223 2234788888 Q ss_pred ccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHH-HHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 310 TKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQ-TIHTQFVQRVNEIILEAL 388 (507) Q Consensus 310 ~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~-~v~~~~~~aD~~~i~~~l 388 (507) +++. ..+-+.....|.+.-++..++|++++.-..--.+...+++++-.+-+. .....-+.-.++.|++.| T Consensus 273 ~~~~---------~l~~~~~d~q~le~r~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~eq~~~~f~~~tL~P~~~~ie~~l 343 (431) T protein:vir:10 273 ATAK---------QFSNTAASAQQIENRNHQIEEVARMYGVPRPLLMMDDTSWGSGIEQLAIFFIQYGLSHWFVSWEQAA 343 (431) T ss_pred ceEE---------EccCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHH Confidence 7533 222233333455666777888998776554333333344555333333 333445666778888888 Q ss_pred HHHHHHHHHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCcc--ccHHHHHHHHHHcCCCCCCC--ccccc Q lcl|NC_017974. 389 NENLLPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLP--KTPKVINRVLEVLGIDEKIE--EDISQ 464 (507) Q Consensus 389 n~~li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~--~~~~~~~~i~e~~Gip~~~~--~~~~~ 464 (507) |+.|+..--. ...+.+|.++..-..|.++.++.+++++..|+.. -+ .+.+|+.+|+|+-.+ .|... T Consensus 344 n~~Ll~~~~~-------~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~~g~lT---~NE~R~~~gl~p~~~~~gD~~~ 413 (431) T protein:vir:10 344 ARAFLPEKML-------GQRQFKFNEGALLRGTLNDQAAFFSKALGAGGQSPWMK---QNEVREMLDLPRADDPVADQLR 413 (431) T ss_pred HhhccChhhc-------CCceEEEechhhhccCHHHHHHHHHHHHhcccccCccC---HHHHHHHhCCCCCCCcccccee Confidence 8766543211 1122234444455679999999999999988731 11 368999999986543 33221 Q ss_pred chhhhcCCCcCCCCCCcccccCcccccccccCCCCchh Q lcl|NC_017974. 465 EELMKLLGEDTSRAGDGMTKGSSGNGTGKISSTRDNSA 502 (507) Q Consensus 465 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~a 502 (507) ... ...+.+++. + ++.++ T Consensus 414 ~p~------n~~~~~~~~------~--------~p~~~ 431 (431) T protein:vir:10 414 NPM------TQKQKGSGD------E--------PPATT 431 (431) T ss_pred ccc------ccccCCCCC------C--------CCCCC Confidence 110 011111000 0 00111 No 36 >protein:vir:4454 Length: 414 # NCBI annotation: Portal Protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700377;genbank:gi:23505449;genbank:GeneID:955656 Probab=99.65 E-value=5.6e-15 Score=98.71 Aligned_cols=405 Identities=12% Similarity=0.051 Sum_probs=204.9 Q ss_pred Cccccc---ccccccCccccccccccccccccc-cCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHH Q lcl|NC_017974. 1 MSRKRN---RNRSVKVAKATSEQINVSRMRMSE-QGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDML 76 (507) Q Consensus 1 ~~~k~~---~~~~~~~~~~~~~~~~~~~~~~~~-~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~R 76 (507) |.--++ ++.... + +.+..+.+ .|. +..... +..+. .+..++-+.|.+|+..+ T Consensus 1 Mg~f~~lf~r~~~~~---~------~~~~~~~~~~~~-~~~~~~-------------g~~v~-~~~al~~~~v~~~i~~I 56 (414) T protein:vir:44 1 MVFFSGLFQRKSDAP---V------TTPAELADAIGL-SYDTYT-------------GKQIS-SQRAMRLTAVFSCVRVL 56 (414) T ss_pred CchhhhhhccCccCc---c------cchhhHhHhhcc-CccccC-------------Cceec-hhhhhccHHHHHHHHHH Confidence 443332 111100 0 00000000 000 000000 00010 12345788899999999 Q ss_pred HHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhcc--CCCHHHHHHHHHH-HHhhcceeeeeeeeccCccccCcceeee Q lcl|NC_017974. 77 YTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNME--GQTLRQFARDAAT-FNEYGLSVVEKVYTQVTVGEYIGKYKVK 153 (507) Q Consensus 77 k~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~--~~~~~~~l~~~l~-a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~ 153 (507) -..|.++++.+....++.. ......-+..+|..-+ ..++.++++.++. .+++|-++++++.. . |. +. T Consensus 57 a~~ia~~p~~~~~~~~~~~-~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gna~~~i~~~-~------g~--~~ 126 (414) T protein:vir:44 57 AESVGMLPCNLYHLNGSLK-QRATGERLHKLISTHPNGYMTPQEFWELVVTCLCLRGNFYAYKVKA-F------GE--VA 126 (414) T ss_pred HHHhccCceEEEEecCCce-eecccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEeC-C------Cc--EE Confidence 9999988876543322111 1111111233343212 2345677777664 66799998877543 1 22 34 Q ss_pred eccccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHH Q lcl|NC_017974. 154 NLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPL 233 (507) Q Consensus 154 ~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll 233 (507) .|.+.++.++. +.++.+++.+ +.+ ....+....+|.+.+|++++. ..+.++|.|.+ T Consensus 127 ~L~~l~~~~v~----~~~~~~~~~~----------y~~---------~~~~g~~~~~~~~evih~~~~-~~d~~~G~s~i 182 (414) T protein:vir:44 127 ELLPVDPGCVV----PKLNSSWEPV----------YQV---------TFPDGSTDVLSQEDIWHVRTL-TLDGLVGLNPI 182 (414) T ss_pred EEEEEcCceEE----EEECCCCcEE----------EEE---------EecCceEEEEccccEEEecCC-CCCCcccccHH Confidence 45555554332 1223333211 000 011122345777777777655 44568999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCccccc Q lcl|NC_017974. 234 VGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDN 313 (507) Q Consensus 234 ~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~ 313 (507) ..+....-.-....++...|...-+.|--++++|-.+ +++......+.+.+..... .+....+++|.|+++. T Consensus 183 ~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l-------~~e~~~~~~~~~~~~~~g~-~n~~~~~vl~~g~~~~ 254 (414) T protein:vir:44 183 AYAREAISLAAATEEHGARLFSNGAVTSGVLRTEQTL-------SDQAYERLKKDFEERHTGL-GNAHRPMILEMGLDWK 254 (414) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCC-------CHHHHHHHHHHHHHHhcCc-cccCcceecCCCceEE Confidence 9998776666666777777777767676666665322 2233333333333322111 0112256778887543 Q ss_pred ccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 314 APQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQ-TIHTQFVQRVNEIILEALNENL 392 (507) Q Consensus 314 ~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~-~v~~~~~~aD~~~i~~~ln~~l 392 (507) ..+-+..-..|.+..++...+|++++.-..--.+....++++-.+.+. .....-+.-.++.|++.||+.| T Consensus 255 ---------~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~l~~~~~~t~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L 325 (414) T protein:vir:44 255 ---------SMALNAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDRATFNNIEELGLGFINYSLVPYLTRIEQRINTGL 325 (414) T ss_pred ---------EccCChHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 222222233466777888899998876654333333345666544443 3345566778888888888766 Q ss_pred HHHHHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccc-hhhhcC Q lcl|NC_017974. 393 LPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQE-ELMKLL 471 (507) Q Consensus 393 i~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~-~~~~~~ 471 (507) ++.-- ....+-+|.++.....|+++.+++++++++.|++.+ +++|+.+|+|+-+.+|.... ...... T Consensus 326 ~~~~~-------~~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~-----NE~R~~~gl~p~~ggD~~~~~~n~~~~ 393 (414) T protein:vir:44 326 VRKSK-------QGVFYAKFNAGALLRGDMKSRFEAYATGINWGIYSP-----NDCRDLEDMNPRPGGDVYLTPMNMTTK 393 (414) T ss_pred CCccc-------cCceEEEEechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCcceeccccccccc Confidence 54311 011122344445556789999999999999999876 57999999987655553221 111111 Q ss_pred CCcCCCCCCcccccCcccccccccCCCCchhh Q lcl|NC_017974. 472 GEDTSRAGDGMTKGSSGNGTGKISSTRDNSAA 503 (507) Q Consensus 472 ~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~a~ 503 (507) +....+.++ ..++ +..|-++| T Consensus 394 ~~~~~~~~~-----~~~~------~~~d~~~~ 414 (414) T protein:vir:44 394 PSDGSKAGK-----QKDN------ANADETTS 414 (414) T ss_pred CCccccCCC-----CCCC------CCCCCCCC Confidence 111111111 1111 11122222 No 37 >protein:vir:189 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037699;genbank:gi:9634156;genbank:GeneID:1262529 Probab=99.65 E-value=1.9e-15 Score=101.26 Aligned_cols=406 Identities=11% Similarity=0.052 Sum_probs=203.0 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTFV 80 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~av 80 (507) |..++..-...+-- ........+. .....+.+.. ..+. ....+ .=+..++-+.|.+|+..+-..| T Consensus 10 ~~~~~g~~~~~~~~-f~~~~~~~~~-~~~~~~~~~~---~~~~---------~~~~v-~~~~al~~~~v~~cv~~Ia~~i 74 (424) T protein:vir:18 10 LRTNNGWWARLKSW-FVGGRLVTPN-QGSQTGPVSA---HGYL---------GDSSI-NDERILQISTVWRCVSLISTLT 74 (424) T ss_pred cCCCCchHHHHHhh-cccccccccc-chhhcccccc---cccc---------ccccc-cHHHhhccHHHHHHHHHHHHhh Confidence 11111110000000 0000000000 0000000000 0000 00001 0133566788999999999999 Q ss_pred hhCCceeeeCCCCC-hhHHHHHHHHHHHHHhccC--CCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeeeecc Q lcl|NC_017974. 81 EKAFKDFKVIPGES-EESKKAAKFVEYCLKNMEG--QTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLA 156 (507) Q Consensus 81 ~~~~~~~~~~~~~~-~~~~~~ae~v~~~l~~~~~--~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~ 156 (507) .++++.+.-...+. ........-+..+|+.-++ .+..++++.++ +.+++|-+.+++++... |+ +..|. T Consensus 75 A~lp~~vy~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~------G~--~~~L~ 146 (424) T protein:vir:18 75 ACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA------GD--VISLL 146 (424) T ss_pred ccCceEEEEeccCCceeeeccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCC------Cc--EEEEE Confidence 98876643221111 1000001113344432222 24556666665 57789999999887532 32 34555 Q ss_pred ccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHH Q lcl|NC_017974. 157 FRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGC 236 (507) Q Consensus 157 ~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~ 236 (507) +.++.++.. ..++..+.. . +...+....+|++.+|++++.. .+..+|.|.+..+ T Consensus 147 ~l~~~~v~v------~~~~~~~~y---------~----------~~~~g~~~~~~~~eVihir~~~-~dg~~G~spi~~~ 200 (424) T protein:vir:18 147 PLQSANMDV------KLVGKKVVY---------R----------YQRDSEYADFSQKEIFHLKGFG-FTGLVGLSPIAFA 200 (424) T ss_pred EecCcceEE------EEcCCeEEE---------E----------EEeCCeEEEeccccEEEecCcC-CCCcccccHHHHH Confidence 555544321 112211110 0 1112233467777776666554 4558999999988 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCce--eEeecCcccccc Q lcl|NC_017974. 237 YRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQS--FFMLPSDTKDNA 314 (507) Q Consensus 237 ~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a--~~iiP~g~~i~~ 314 (507) .-..-.-....++...|...-+.|--++++|-.. ..++..+.+++.. ....+|..+ .++++.|+++.- T Consensus 201 ~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~-------l~~e~~~~~~~~~---~~~~~~~nag~~~vl~~g~~~~~ 270 (424) T protein:vir:18 201 CKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKV-------LTEQQRSQVEENF---KEIAGGPVKKRLWILEAGFSTSA 270 (424) T ss_pred HHHHHHHHHHHHHHHHHHhccCCcceEEEeCCcC-------CCHHHHHHHHHHH---HHHhCCcccCCceeccCCceEEe Confidence 8777666777777778887766666666654321 1223333333322 233334333 478888875432 Q ss_pred cccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhh--h-HHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 315 PQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSY--N-LSESKQTIHTQFVQRVNEIILEALNEN 391 (507) Q Consensus 315 ~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~--A-l~~vh~~v~~~~~~aD~~~i~~~ln~~ 391 (507) .+-+.....|.+..++...+|++++.-..--.+...++++ + ..+........-+.-.++.|+..||+. T Consensus 271 ---------l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~eq~~~~f~~~tl~P~~~~ie~~ln~~ 341 (424) T protein:vir:18 271 ---------IGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRW 341 (424) T ss_pred ---------cCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 2222233356777788889999987665433333333333 2 122333444666677888888888877 Q ss_pred HHHHHHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhhcC Q lcl|NC_017974. 392 LLPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMKLL 471 (507) Q Consensus 392 li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~~~ 471 (507) |+..- .....+-+|.++..-..|.++.++.+.+++..|++.+ +.+|+.+|+|+-+.+|.......-.+ T Consensus 342 L~~~~-------~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~-----NE~R~~~gl~pi~ggD~~~~~~n~~~ 409 (424) T protein:vir:18 342 LIPSK-------DVGRLHAEHNLDGLLRGDSASRAAFMKAMGESGLRTI-----NEMRRTDNMPPLPGGDVAMRQAQYVP 409 (424) T ss_pred cCCcc-------ccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCcCeeeeccCccc Confidence 65442 1111223444455567889999999999999999877 57999999997655443211111000 Q ss_pred CCcCCCCCCcccccCcccccccccCCCCchh Q lcl|NC_017974. 472 GEDTSRAGDGMTKGSSGNGTGKISSTRDNSA 502 (507) Q Consensus 472 ~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~a 502 (507) .+...... ...++.| T Consensus 410 l~~~~~~~----------------~~~~n~a 424 (424) T protein:vir:18 410 ITDLGTNK----------------EPRNNGA 424 (424) T ss_pred hhhhhccC----------------CccccCC Confidence 00000000 0111111 No 38 >protein:vir:96579 Length: 576 # NCBI annotation: ORF012 # Family: family:all:2446 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238542;genbank:gi:66391267;genbank:GeneID:5130361 Probab=99.65 E-value=4.3e-14 Score=93.89 Aligned_cols=439 Identities=11% Similarity=0.080 Sum_probs=198.0 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhH-HHHHHHhhChHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLI-GTVESMKQDATVATGLDMLYTF 79 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~-~~y~~m~~D~~v~s~l~~Rk~a 79 (507) |.+-..+......+-.......+.|....-.|. +.+. +.+..+...+.+ .+.+.+..-+.|.+|+..|... T Consensus 35 ~~~~~~~~~~~~~~~~~~~~a~~~p~~~~~~~~------~~~~--~~p~~~~~~~~~~~~l~~~~~npiv~~~I~~ia~~ 106 (576) T protein:vir:96 35 IRNIEEKSKELNKSLYGKQQAYAEPFLEVMDTN------PEFR--TKRSYMKNSDNLHDVLKQFGNNPILNAIILTRSNQ 106 (576) T ss_pred HHHhhhhhhhhccccCCccchhhcceeeeeecC------CCcc--ccCcchhhhhhhHHHHHHhhcCHHHHHHHHHHHHH Confidence 111100000000000000000000100000010 0010 011111111111 1112222357789999999987 Q ss_pred Hhh-----------CCceeeeCCCC-ChhHHHHHHH--HHHHHHhcc------CCCHHHHHHHHH-HHHhhcceeeeeee Q lcl|NC_017974. 80 VEK-----------AFKDFKVIPGE-SEESKKAAKF--VEYCLKNME------GQTLRQFARDAA-TFNEYGLSVVEKVY 138 (507) Q Consensus 80 v~~-----------~~~~~~~~~~~-~~~~~~~ae~--v~~~l~~~~------~~~~~~~l~~~l-~a~~~Gfs~~Ei~w 138 (507) |.. +.|++.....+ .+.+.++++. ++..|..+. ..+|.++++.++ +.+.+|.+.+|++| T Consensus 107 vA~~~~~~~~~~~~~~~~i~lk~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~dlll~Gna~~~i~~ 186 (576) T protein:vir:96 107 VAMYCQPSRYNERGLGFEVRMRDLDAEPGKKEKEEIKRIENFILNTGRDKDIDRDSFQSFCRKIVRDTYTYDQVNFEKVF 186 (576) T ss_pred HHhhhhhhhhccccccceeEEecCcCccchhhhHhhhhHHhhHhhccCCCCCccccHHHHHHHHHHHHHhcCCeEEEEEE Confidence 764 45555553322 2222222222 223332211 125777888866 47889999999998 Q ss_pred eccCccccCcceeeeeccccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEE Q lcl|NC_017974. 139 TQVTVGEYIGKYKVKNLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLM 218 (507) Q Consensus 139 ~~~~~~~~~g~~~~~~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~ 218 (507) .+.+. -.+..|.+.++.++.. ..+.||..+.... .+ ...........+|....|+| T Consensus 187 ~rd~~------g~~~~L~pl~p~~V~v----~~~~dg~~~~~~~----~~----------~~~~~~~~~~~~~~~dii~~ 242 (576) T protein:vir:96 187 NKKNA------TTMDKFIAVDPSTIFY----ATDKNGKIIKGGK----RF----------VQVINKKVVASFTSREMAMG 242 (576) T ss_pred ecCCC------CceEEEEEeCCceeEE----EECCCCceeeeee----EE----------EEecCCceEEEecccceEEE Confidence 76432 2244555556654432 2344443221100 00 00111222345677788888 Q ss_pred eeCCccC---ccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHH Q lcl|NC_017974. 219 NTGGSSS---QALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAAN 295 (507) Q Consensus 219 ~~~~~~~---~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n 295 (507) ++....+ .+||.+.+..+....-.-....++-..|...-+.|--++.+|. +..-+++...+..+.+ .+ T Consensus 243 ~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~Ng~~p~giL~~~~-----~~~ls~e~~~~lr~~~----~~ 313 (576) T protein:vir:96 243 IRNPRTELSSSGYGLSEVEIAMKQFIAYNNTETFNDRFFSHGGTTRGILQIKS-----EQQQSQRALENFKREW----KS 313 (576) T ss_pred eecCCCCcccCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCC-----CCCCCHHHHHHHHHHH----HH Confidence 8776554 6789999999998888888888888888886665554554432 1112233322222222 22 Q ss_pred hhcCC-ce---eEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccc---------cchh- Q lcl|NC_017974. 296 AHSGE-QS---FFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGN---------DKGG- 361 (507) Q Consensus 296 ~~~G~-~a---~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~---------~~~G- 361 (507) ...|. .+ .++++.|+++ +..+.+.....|.+..++..++|++++.-...-.+. ..+| T Consensus 314 ~~~G~~nag~~p~vl~~G~~~---------~~ls~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~g~~~~~s 384 (576) T protein:vir:96 314 SFSGINGSWQVPVVMADDIKF---------VNMTPTANDMQFEKWLTYLINIISALYGIDPAEIGFPNRGGATGGKGGNT 384 (576) T ss_pred HhccccccccceeecCCCceE---------EeccCChhhHHHHHHHHHhHHHHHHHhCCCHHHccccccccccccccccc Confidence 22332 22 2567888643 233333334457778888899999987443211111 0112 Q ss_pred -hhhH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCccccceEEecCcCchhHHHHHHHHH--HHHhcCC Q lcl|NC_017974. 362 -SYNL-SESKQTIHTQFVQRVNEIILEALNENLLPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQ--RIGAVGY 437 (507) Q Consensus 362 -S~Al-~~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~--~L~~~G~ 437 (507) +||- .+........-+.-.++.|+..||+.|++.. +. .+ .|.|... |.+..++... .+...|+ T Consensus 385 ~t~sn~e~~~~~f~~~tL~P~~~~ie~~ln~~Ll~~~-------~~--~~-~~~f~r~---d~~~~~e~~~~~~~~~~G~ 451 (576) T protein:vir:96 385 LNEADPGKKQQQSQNKGLQPLLRFIEDLINTHIISEY-------SD--KY-VFQFVGG---DTKSELDKIKILQEEVKTY 451 (576) T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhc-------cC--ce-EEEeccC---CHHHHHHHHHHHHHHhcCc Confidence 3442 3333445556777888999999998776542 11 11 3555543 4444555443 3345687 Q ss_pred ccccHHHHHHHHHHcCCCCCCCcccccch-hhhcCCC----c---CCCCCCcccc-cCccc------ccccccC------ Q lcl|NC_017974. 438 LPKTPKVINRVLEVLGIDEKIEEDISQEE-LMKLLGE----D---TSRAGDGMTK-GSSGN------GTGKISS------ 496 (507) Q Consensus 438 ~~~~~~~~~~i~e~~Gip~~~~~~~~~~~-~~~~~~~----~---~~~~~~~~~~-~~~~~------~~~~~~~------ 496 (507) +.+ +.+|+.+|+|+-+.+|..... ....... . .....+.... ..... +..+++. T Consensus 452 lT~-----NE~R~~~gl~piegGD~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~g~ 526 (576) T protein:vir:96 452 KTV-----NEARKEKGLKPIEGGDVLLDGSFIQSMSLNTQKEQYEDTKQKERFDMIQQFLNSPDDEEPQQESTEDKVDGR 526 (576) T ss_pred cCH-----HHHHHHhCCCCCCCcceeccccccccccccccCCCCCCccccccccccccccCCCCCCCCCCCCCCCccccc Confidence 655 679999999865544421100 0000000 0 0000000000 00000 0000000 Q ss_pred ------------CCCchhhhccC Q lcl|NC_017974. 497 ------------TRDNSAANLDN 507 (507) Q Consensus 497 ------------~~d~~a~~~~~ 507 (507) .+|=+-++-+| T Consensus 527 ~~~~~~~~~~~~~~~~~~~~~~~ 549 (576) T protein:vir:96 527 ESNDPTKIDSPVGTDGQLKDQDN 549 (576) T ss_pred ccccCCCCCCccccccccCCCCc Confidence 01111111112 No 39 >protein:vir:483 Length: 413 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543090;swissprot:trembl:q8w629;genbank:gi:18249902;uniprot:Q8W629;genbank:GeneID:929685 Probab=99.65 E-value=5.9e-15 Score=98.60 Aligned_cols=405 Identities=12% Similarity=0.028 Sum_probs=203.4 Q ss_pred Cc--ccccccccccCccccccccccccccccc-cCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHH Q lcl|NC_017974. 1 MS--RKRNRNRSVKVAKATSEQINVSRMRMSE-QGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLY 77 (507) Q Consensus 1 ~~--~k~~~~~~~~~~~~~~~~~~~~~~~~~~-~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk 77 (507) |= +-.+|+....+..+. .+.+ .| .+...+.+ ..+. -+..++-+.|.+|+..+- T Consensus 1 ~~f~~~f~r~~~~~~~~~~---------~~~~~~~-~~~~~~~g-------------~~v~-~~~~l~~~~v~~~i~~Ia 56 (413) T protein:vir:48 1 MFFSGLFQRKSDAPVTTPA---------ELAEAIG-LSYDTYTG-------------KRIS-SQRAMRLTAVYSCVRVLA 56 (413) T ss_pred CccchhhccCccCCccchH---------HHHHhhh-cCcccccC-------------ceec-hhhhhccHHHHHHHHHHH Confidence 22 111111111110000 0000 00 00000000 0000 123456788999999999 Q ss_pred HHHhhCCceeeeCCCCChhHHHHHH-HHHHHHHhcc--CCCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeee Q lcl|NC_017974. 78 TFVEKAFKDFKVIPGESEESKKAAK-FVEYCLKNME--GQTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVK 153 (507) Q Consensus 78 ~av~~~~~~~~~~~~~~~~~~~~ae-~v~~~l~~~~--~~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~ 153 (507) ..|.++++.+....++.. .++.+ -+..+|+.-+ ..++.++++.++ +.+++|-+.+++++.. |. +. T Consensus 57 ~~iA~~p~~~~~~~~~~~--~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~~-------g~--~~ 125 (413) T protein:vir:48 57 ESVGMLPCSLYKISGTLK--TRVVDERLHKLVSAKPNGYMTPQEFWELVIVCLCLRGNFYAYKVKAL-------GE--VV 125 (413) T ss_pred HhhhhCceEEEEecCCcc--eeecccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCceEEEEEeCC-------Cc--EE Confidence 999998876543322111 11111 1233343212 234566777766 5677999988887642 22 34 Q ss_pred eccccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHH Q lcl|NC_017974. 154 NLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPL 233 (507) Q Consensus 154 ~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll 233 (507) .|.+.++.++. +..+.++..+-. .....+....+|...++++++.. .+.++|.|.+ T Consensus 126 ~L~~l~~~~v~----~~~~~~~~~~y~-------------------~~~~~g~~~~~~~~evih~~~~~-~d~~~G~s~i 181 (413) T protein:vir:48 126 ELLPIDPGCVE----PKLNSQWQPVYQ-------------------VTFPDGSVDVLTQDEIWHVRTLT-LDGLVGLNPI 181 (413) T ss_pred EEEEEcCceEE----EEEcCCceEEEE-------------------EEecCceEEEEccccEEEecCcC-CCCcccccHH Confidence 45555554433 122333321100 00111223356777766666554 4568999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCccccc Q lcl|NC_017974. 234 VGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDN 313 (507) Q Consensus 234 ~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~ 313 (507) ..++...-.-....++...+...-+.|--++++|.. -++++..+..+.+.+..... .+....++++.|+++. T Consensus 182 ~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~-------~~~e~~~~~~~~~~~~~~g~-~n~g~~~vl~~g~~~~ 253 (413) T protein:vir:48 182 AYAREAISLAAATEEHGARLFGNGAVTSGVLRTEQK-------LTPDAYERLKKDFEERHTGL-GNAHRPMILEMGLDWK 253 (413) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCC-------CCHHHHHHHHHHHHHHhcCc-cccCcceecCCCceEE Confidence 999987766666677777777766666556655432 22333333333333322110 1112347788887543 Q ss_pred ccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 314 APQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQ-TIHTQFVQRVNEIILEALNENL 392 (507) Q Consensus 314 ~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~-~v~~~~~~aD~~~i~~~ln~~l 392 (507) - .+-+.....|.+..++...+|+.++.-..--.+....++++-.+-+. .....-+.-.++.|++.||+.| T Consensus 254 ~---------l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n~e~~~~~f~~~~i~P~~~~ie~~l~~~L 324 (413) T protein:vir:48 254 S---------MALNAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDRATFNNIEELGLGFINYSLVPYLTRIEQRINTGL 324 (413) T ss_pred e---------ccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 2 22222333466777888899999876654233333345666444333 3444567778888888888876 Q ss_pred HHHHHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhhcCC Q lcl|NC_017974. 393 LPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMKLLG 472 (507) Q Consensus 393 i~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~~~~ 472 (507) +.+.-. ...+-+|.++.....|.++.+++++++++.|++.+ +.+|+.+|+|+-+.+|........... T Consensus 325 ~~~~~~-------~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~-----NE~R~~~g~~p~~ggD~~~~~~n~~~~ 392 (413) T protein:vir:48 325 VRESKQ-------GKFYAKFNAGALLRGDMKSRFEAYATGINWGIYSP-----NDCRDLEDMNPRPGGDVYLTPMNMTTS 392 (413) T ss_pred cCcccc-------CCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCcceeecccccccc Confidence 654211 11122333445556788999999999999999877 579999999876555542221111111 Q ss_pred CcCCCCCCcccccCcccccccccCCCCchhh Q lcl|NC_017974. 473 EDTSRAGDGMTKGSSGNGTGKISSTRDNSAA 503 (507) Q Consensus 473 ~~~~~~~~~~~~~~~~~~~~~~~~~~d~~a~ 503 (507) +......++.. .. + ..|-+++ T Consensus 393 ~~~~~~~~~~~--~~--~------~~~~~~~ 413 (413) T protein:vir:48 393 PSAGDDNGKKK--ES--G------DADKTAS 413 (413) T ss_pred ccccccCCCCC--CC--C------CccccCC Confidence 11100000000 01 1 1111111 No 40 >protein:vir:5737 Length: 419 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892048;genbank:gi:33770511;goa:Q7Y412;interpro:IPR006427;interpro:IPR006944;uniprot:Q7Y412;genbank:GeneID:1732929;interpro:IPR010994 Probab=99.64 E-value=1.5e-15 Score=101.83 Aligned_cols=409 Identities=11% Similarity=0.001 Sum_probs=207.4 Q ss_pred CcccccccccccCccccccccccccccccccC--chhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQG--TFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYT 78 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g--~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~ 78 (507) |-=.+.=++........ +..+. ..+..... +..+ ..+..++-+.|.+|+..+-. T Consensus 1 m~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~~~~-------------g~~v-~~~~al~~~~v~~~i~~ia~ 56 (419) T protein:vir:57 1 MFIPQFWKGRPSENRVN----------WQVVPGGMRSSSSQA-------------GVII-TPETALALSAVRACVTLLAE 56 (419) T ss_pred CcchhhhccCCcccccc----------ccccccccccccccC-------------Ccee-chHHhhccHHHHHHHHHHHH Confidence 54444422221111110 00000 00000000 0011 12334567789999999999 Q ss_pred HHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccC--CCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeeeec Q lcl|NC_017974. 79 FVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEG--QTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVKNL 155 (507) Q Consensus 79 av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~--~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l 155 (507) .|.++++...-...+.........-+..+|..-++ .++.++++.+. +.+.+|-+++++++... |+ +..| T Consensus 57 ~ia~lp~~~~~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~------G~--~~~L 128 (419) T protein:vir:57 57 SVAQLPCVLYRRTENGGREIAFDHPLHDLIRYQPNRKDTAFEYHEQTQGVLGLEGNSYSLIDRNGR------GD--ITEL 128 (419) T ss_pred hhccCceEEEEEcCCCceeccccchHHHHHhhccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCC------Cc--EEEE Confidence 99998776422211111001001113444432222 24567777766 56779999999987643 32 3456 Q ss_pred cccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHH Q lcl|NC_017974. 156 AFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVG 235 (507) Q Consensus 156 ~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~ 235 (507) .+.++.++.. ..+.+|. . ++ .....+..+|.+.++++++. ..+.++|.|.+.. T Consensus 129 ~pl~~~~v~v----~~~~~g~--~--------~y------------~~~~~~~~~~~~~vih~r~~-~~d~~~G~s~i~~ 181 (419) T protein:vir:57 129 IPINPHKVIV----LKGPDGM--P--------YY------------DIPSIGEILPMRMVHHIKSF-SLDGYIGTSPIQT 181 (419) T ss_pred EEEcCcceEE----EECCCce--E--------EE------------EEcCCceEEchhhEEEecCc-CCCCcccccHHHH Confidence 6666543321 1222221 1 00 01112334666665555554 4456899999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcC---CceeEeecCcccc Q lcl|NC_017974. 236 CYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSG---EQSFFMLPSDTKD 312 (507) Q Consensus 236 ~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G---~~a~~iiP~g~~i 312 (507) +....-.-....++...|...-+.|--+++.|.... ...+++... ++.+...+..+| ....++++.|+++ T Consensus 182 ~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~---~~~~~e~~~----~~~~~~~~~~~g~~nag~~~vl~~g~~~ 254 (419) T protein:vir:57 182 NPDVLGLGIAVEQHAAQVFARGTTMSGVIERPFEAK---AIASQAAVD----AILAKWTERYGGVRNAFSVGMLQEGMTY 254 (419) T ss_pred HHHHHHHHHHHHHHHHHHHHccCCccEEEEecCcCC---cccCHHHHH----HHHHHHHHHhccccccccceecCCCceE Confidence 888776667777788888877777766666653322 112222222 222222222223 1235677888754 Q ss_pred cccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHHH-HHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 313 NAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQT-IHTQFVQRVNEIILEALNEN 391 (507) Q Consensus 313 ~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~~-v~~~~~~aD~~~i~~~ln~~ 391 (507) . ..+.+.....|.+..++..++|++++.-..--.+....|+++-.+-+.. ....-+.-.++.|+..||+. T Consensus 255 ~---------~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~f~~~~l~P~~~~ie~~l~~~ 325 (419) T protein:vir:57 255 K---------QLSQDNEKAQLLQSRQYTVNEVCRLYKVPPHMIQDLQKSTNNNIEHQGLQYVIYTMLAILKRHESAMMRD 325 (419) T ss_pred E---------EcCCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 2 2222223335667778888999998766543333333455554443433 33555667777888888876 Q ss_pred HHHHHHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhhcC Q lcl|NC_017974. 392 LLPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMKLL 471 (507) Q Consensus 392 li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~~~ 471 (507) |+.+-- ....+.+|.++.....|++..+++++++++.|++.+ +.+|+.+|+|+-+.+|....... .. T Consensus 326 ll~~~~-------~~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~-----NE~R~~~gl~p~~ggD~~~~~~n-~~ 392 (419) T protein:vir:57 326 LLLPSE-------RRDFYIEFNVSSLLRGDQKSRYESYALGRQWGWLSV-----NDIRRMENLTPIPGGDKYLTPLN-MV 392 (419) T ss_pred ccCccc-------cCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCcCeeeeccc-cc Confidence 654311 111222344445556799999999999999998876 57999999987655554321111 11 Q ss_pred CCcCCCCCCcccccCcccccccccCCCCchhhhccC Q lcl|NC_017974. 472 GEDTSRAGDGMTKGSSGNGTGKISSTRDNSAANLDN 507 (507) Q Consensus 472 ~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~a~~~~~ 507 (507) +. ....+ .+.+.....+|..+.|.-+ T Consensus 393 ~~-~~~~~---------~~~~~~~~~~~~~~~~~~~ 418 (419) T protein:vir:57 393 DS-KALTG---------IGKATPQQLKDIEAILCTR 418 (419) T ss_pred cc-ccccc---------ccCCCcccCcchhhhhhcc Confidence 10 00000 0001111122222222222 No 41 >protein:vir:95599 Length: 563 # NCBI annotation: ORF014 # Family: family:all:2446 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240900;genbank:gi:66394963;genbank:GeneID:5132540 Probab=99.64 E-value=5.6e-14 Score=93.22 Aligned_cols=427 Identities=12% Similarity=0.066 Sum_probs=195.3 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHH-hhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQV-DSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTF 79 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~-~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~a 79 (507) .++.+|+++.+.+ +|......+.+ | ++. .....+...+ +.. .+.+..-+-|.+|+..|... T Consensus 46 ~~~~~~~~~~a~~-~~~~~~~~~~~------~------~~~~~~~~~~~~~l--~~~---l~~~~~n~i~~~~I~t~~~~ 107 (563) T protein:vir:95 46 LTKSLYGQQQAYA-EPFIEMMDTNP------E------FRDKRSYMKNEHNL--HDV---LKKFGNNPILNAIILTRSNQ 107 (563) T ss_pred HHhhhccCCCcch-hhhHhhhcccc------c------ccccccCCCCcccH--HHH---HHHhhcchHHHHHHHHHHHH Confidence 3333333322211 11000000000 0 000 0011111111 111 22222346677777777776 Q ss_pred Hhh-----------CCceeeeCCC-CChhHHHHH--HHHHHHHHhcc------CCCHHHHHHHHH-HHHhhcceeeeeee Q lcl|NC_017974. 80 VEK-----------AFKDFKVIPG-ESEESKKAA--KFVEYCLKNME------GQTLRQFARDAA-TFNEYGLSVVEKVY 138 (507) Q Consensus 80 v~~-----------~~~~~~~~~~-~~~~~~~~a--e~v~~~l~~~~------~~~~~~~l~~~l-~a~~~Gfs~~Ei~w 138 (507) |.. +.|++..... .++...+.+ ..++..|.... ..+|.+++..++ +.+.+|.+.+|+++ T Consensus 108 vA~~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~~lll~Gn~~~~~~~ 187 (563) T protein:vir:95 108 VAMYCQPARYSEKGLGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTGKDKDVDRDSFQTFCKKIVRDTYIYDQVNFEKVF 187 (563) T ss_pred HHHHhhhhhhhcccccceeEEeecCCCcchhhhhhhHHHHHHhhhcCCCCCCCcchHHHHHHHHHHHHHhcCCeEEEEEE Confidence 653 3345544321 112122222 22333333211 124667887766 47889999999998 Q ss_pred eccCccccCcceeeeeccccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEE Q lcl|NC_017974. 139 TQVTVGEYIGKYKVKNLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLM 218 (507) Q Consensus 139 ~~~~~~~~~g~~~~~~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~ 218 (507) .+.+. -.+..|.+.++.++.. ..+.+|...... ..++ ..........++....|+| T Consensus 188 ~rd~~------G~~~~L~pl~p~~V~v----~~~~~g~~~~~~----~~y~----------~~~~g~~~~~~~~~evI~~ 243 (563) T protein:vir:95 188 NKNNK------TKLEKFIAVDPSTIFY----ATDKKGKIIKGG----KRFV----------QVVDKRVVASFTSRELAMG 243 (563) T ss_pred EecCC------CceEEEEEeCCceeEE----EECCCCceeccc----eeEE----------EEeCCceeEEecCcceEEE Confidence 76432 2344566666554431 223333211000 0000 0011112234667777888 Q ss_pred eeCCccC---ccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHH Q lcl|NC_017974. 219 NTGGSSS---QALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAAN 295 (507) Q Consensus 219 ~~~~~~~---~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n 295 (507) +.....+ .+||.+.+..+......-....++-+.|...-+.|--++.+|.. ...+++...+ +.+...+ T Consensus 244 ~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~ng~~p~giL~~~~~-----~~ls~e~~~~----~~~~~~~ 314 (563) T protein:vir:95 244 IRNPRTELSSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRSD-----QQQSQHALEN----FKREWKS 314 (563) T ss_pred eccCCCCcccCcccchHHHHHHHHHHHHHHHHHHHHHHHHccCCCceEEEeCCC-----CCCCHHHHHH----HHHHHHH Confidence 8776544 67899999999988888888888888888876666555554321 1112322222 2222222 Q ss_pred hhcCC-ce---eEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhccccccc---------ccchh- Q lcl|NC_017974. 296 AHSGE-QS---FFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVG---------NDKGG- 361 (507) Q Consensus 296 ~~~G~-~a---~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~---------~~~~G- 361 (507) ...|. .+ .++++.|+++. ..+-+..-..|.+..++..++|++++.-..--.+ +..++ T Consensus 315 ~~~G~~nagk~~~vl~~G~~~~---------~l~~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~~~~~~ss 385 (563) T protein:vir:95 315 SLSGINGSWQIPVVMADDIKFV---------NMTPTANDMQFEKWLNYLINIISALYGIDPAEIGFPNRGGATGSKGGST 385 (563) T ss_pred HhccccccccceEEcCCCceEE---------eccCChhHHHHHHHHHHHHHHHHHHhCCCHHHccccccccccccccccc Confidence 22332 23 25678887532 2222223334667777888899987654321111 11122 Q ss_pred -hhhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCccccceEEecCcCchhHHHHHHH--HHHHHhcCC Q lcl|NC_017974. 362 -SYNLSE-SKQTIHTQFVQRVNEIILEALNENLLPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKA--IQRIGAVGY 437 (507) Q Consensus 362 -S~Al~~-vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~--~~~L~~~G~ 437 (507) ++|-.+ ........-+.--++.|+..||+.|++.+ +.. -+|.|... |.+..++. +.+++..|+ T Consensus 386 ~~~sn~e~~~~~f~~~tL~P~l~~ie~~ln~~L~~~~-------~~~---~~~~f~r~---D~~~~~e~~~~~~~~~~G~ 452 (563) T protein:vir:95 386 LNEADPGKKQQQSQNKGLQPLLRFIEDLVNRHIISEY-------GDK---YTFQFVGG---DTKSATDKLNILKLETQIF 452 (563) T ss_pred hhhccHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhc-------ccc---cEEEeccC---CHHHHHHHHHHHHHhcCCc Confidence 223222 22334555667788889999998777542 111 13555444 43444443 345678888 Q ss_pred ccccHHHHHHHHHHcCCCCCCCcccccchhh--------hcC---------------CCcCCCCCCcccccC--cccccc Q lcl|NC_017974. 438 LPKTPKVINRVLEVLGIDEKIEEDISQEELM--------KLL---------------GEDTSRAGDGMTKGS--SGNGTG 492 (507) Q Consensus 438 ~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~--------~~~---------------~~~~~~~~~~~~~~~--~~~~~~ 492 (507) +.+ +.+|+.+|+|+-..+|....... +.. ....+++++.....+ +.++.+ T Consensus 453 lT~-----NE~R~~~gl~Pi~gGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (563) T protein:vir:95 453 KTV-----NEAREEQGKKPIEGGDIILDASFLQGTAQLQQDKQYNDGKQKERLQMMMSLLEGDNDDSEEGQSTDSSNDDK 527 (563) T ss_pred cCH-----HHHHHHhCCCCCCCcceeecccccccccccccccCCCccccchhhhhcccccCCCCCCCCCCCCCCCCCCcc Confidence 765 57999999987655543211000 000 000001111000000 001111 Q ss_pred cccCCCCchhhhccC Q lcl|NC_017974. 493 KISSTRDNSAANLDN 507 (507) Q Consensus 493 ~~~~~~d~~a~~~~~ 507 (507) + ...|..-...+| T Consensus 528 ~--~~~~~~~~~~~~ 540 (563) T protein:vir:95 528 E--IGTDAQIKGDDN 540 (563) T ss_pred c--cccccccccccc Confidence 1 111111111111 No 42 >protein:vir:99312 Length: 563 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024471;genbank:gi:48696430;genbank:GeneID:2948040 Probab=99.64 E-value=5.6e-14 Score=93.22 Aligned_cols=427 Identities=12% Similarity=0.066 Sum_probs=195.3 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHH-hhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQV-DSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTF 79 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~-~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~a 79 (507) .++.+|+++.+.+ +|......+.+ | ++. .....+...+ +.. .+.+..-+-|.+|+..|... T Consensus 46 ~~~~~~~~~~a~~-~~~~~~~~~~~------~------~~~~~~~~~~~~~l--~~~---l~~~~~n~i~~~~I~t~~~~ 107 (563) T protein:vir:99 46 LTKSLYGQQQAYA-EPFIEMMDTNP------E------FRDKRSYMKNEHNL--HDV---LKKFGNNPILNAIILTRSNQ 107 (563) T ss_pred HHhhhccCCCcch-hhhHhhhcccc------c------ccccccCCCCcccH--HHH---HHHhhcchHHHHHHHHHHHH Confidence 3333333322211 11000000000 0 000 0011111111 111 22222346677777777776 Q ss_pred Hhh-----------CCceeeeCCC-CChhHHHHH--HHHHHHHHhcc------CCCHHHHHHHHH-HHHhhcceeeeeee Q lcl|NC_017974. 80 VEK-----------AFKDFKVIPG-ESEESKKAA--KFVEYCLKNME------GQTLRQFARDAA-TFNEYGLSVVEKVY 138 (507) Q Consensus 80 v~~-----------~~~~~~~~~~-~~~~~~~~a--e~v~~~l~~~~------~~~~~~~l~~~l-~a~~~Gfs~~Ei~w 138 (507) |.. +.|++..... .++...+.+ ..++..|.... ..+|.+++..++ +.+.+|.+.+|+++ T Consensus 108 vA~~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~~lll~Gn~~~~~~~ 187 (563) T protein:vir:99 108 VAMYCQPARYSEKGLGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTGKDKDVDRDSFQTFCKKIVRDTYIYDQVNFEKVF 187 (563) T ss_pred HHHHhhhhhhhcccccceeEEeecCCCcchhhhhhhHHHHHHhhhcCCCCCCCcchHHHHHHHHHHHHHhcCCeEEEEEE Confidence 653 3345544321 112122222 22333333211 124667887766 47889999999998 Q ss_pred eccCccccCcceeeeeccccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEE Q lcl|NC_017974. 139 TQVTVGEYIGKYKVKNLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLM 218 (507) Q Consensus 139 ~~~~~~~~~g~~~~~~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~ 218 (507) .+.+. -.+..|.+.++.++.. ..+.+|...... ..++ ..........++....|+| T Consensus 188 ~rd~~------G~~~~L~pl~p~~V~v----~~~~~g~~~~~~----~~y~----------~~~~g~~~~~~~~~evI~~ 243 (563) T protein:vir:99 188 NKNNK------TKLEKFIAVDPSTIFY----ATDKKGKIIKGG----KRFV----------QVVDKRVVASFTSRELAMG 243 (563) T ss_pred EecCC------CceEEEEEeCCceeEE----EECCCCceeccc----eeEE----------EEeCCceeEEecCcceEEE Confidence 76432 2344566666554431 223333211000 0000 0011112234667777888 Q ss_pred eeCCccC---ccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHH Q lcl|NC_017974. 219 NTGGSSS---QALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAAN 295 (507) Q Consensus 219 ~~~~~~~---~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n 295 (507) +.....+ .+||.+.+..+......-....++-+.|...-+.|--++.+|.. ...+++...+ +.+...+ T Consensus 244 ~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~ng~~p~giL~~~~~-----~~ls~e~~~~----~~~~~~~ 314 (563) T protein:vir:99 244 IRNPRTELSSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRSD-----QQQSQHALEN----FKREWKS 314 (563) T ss_pred eccCCCCcccCcccchHHHHHHHHHHHHHHHHHHHHHHHHccCCCceEEEeCCC-----CCCCHHHHHH----HHHHHHH Confidence 8776544 67899999999988888888888888888876666555554321 1112322222 2222222 Q ss_pred hhcCC-ce---eEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhccccccc---------ccchh- Q lcl|NC_017974. 296 AHSGE-QS---FFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVG---------NDKGG- 361 (507) Q Consensus 296 ~~~G~-~a---~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~---------~~~~G- 361 (507) ...|. .+ .++++.|+++. ..+-+..-..|.+..++..++|++++.-..--.+ +..++ T Consensus 315 ~~~G~~nagk~~~vl~~G~~~~---------~l~~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~~~~~~ss 385 (563) T protein:vir:99 315 SLSGINGSWQIPVVMADDIKFV---------NMTPTANDMQFEKWLNYLINIISALYGIDPAEIGFPNRGGATGSKGGST 385 (563) T ss_pred HhccccccccceEEcCCCceEE---------eccCChhHHHHHHHHHHHHHHHHHHhCCCHHHccccccccccccccccc Confidence 22332 23 25678887532 2222223334667777888899987654321111 11122 Q ss_pred -hhhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCccccceEEecCcCchhHHHHHHH--HHHHHhcCC Q lcl|NC_017974. 362 -SYNLSE-SKQTIHTQFVQRVNEIILEALNENLLPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKA--IQRIGAVGY 437 (507) Q Consensus 362 -S~Al~~-vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~--~~~L~~~G~ 437 (507) ++|-.+ ........-+.--++.|+..||+.|++.+ +.. -+|.|... |.+..++. +.+++..|+ T Consensus 386 ~~~sn~e~~~~~f~~~tL~P~l~~ie~~ln~~L~~~~-------~~~---~~~~f~r~---D~~~~~e~~~~~~~~~~G~ 452 (563) T protein:vir:99 386 LNEADPGKKQQQSQNKGLQPLLRFIEDLVNRHIISEY-------GDK---YTFQFVGG---DTKSATDKLNILKLETQIF 452 (563) T ss_pred hhhccHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhc-------ccc---cEEEeccC---CHHHHHHHHHHHHHhcCCc Confidence 223222 22334555667788889999998777542 111 13555444 43444443 345678888 Q ss_pred ccccHHHHHHHHHHcCCCCCCCcccccchhh--------hcC---------------CCcCCCCCCcccccC--cccccc Q lcl|NC_017974. 438 LPKTPKVINRVLEVLGIDEKIEEDISQEELM--------KLL---------------GEDTSRAGDGMTKGS--SGNGTG 492 (507) Q Consensus 438 ~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~--------~~~---------------~~~~~~~~~~~~~~~--~~~~~~ 492 (507) +.+ +.+|+.+|+|+-..+|....... +.. ....+++++.....+ +.++.+ T Consensus 453 lT~-----NE~R~~~gl~Pi~gGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (563) T protein:vir:99 453 KTV-----NEAREEQGKKPIEGGDIILDASFLQGTAQLQQDKQYNDGKQKERLQMMMSLLEGDNDDSEEGQSTDSSNDDK 527 (563) T ss_pred cCH-----HHHHHHhCCCCCCCcceeecccccccccccccccCCCccccchhhhhcccccCCCCCCCCCCCCCCCCCCcc Confidence 765 57999999987655543211000 000 000001111000000 001111 Q ss_pred cccCCCCchhhhccC Q lcl|NC_017974. 493 KISSTRDNSAANLDN 507 (507) Q Consensus 493 ~~~~~~d~~a~~~~~ 507 (507) + ...|..-...+| T Consensus 528 ~--~~~~~~~~~~~~ 540 (563) T protein:vir:99 528 E--IGTDAQIKGDDN 540 (563) T ss_pred c--cccccccccccc Confidence 1 111111111111 No 43 >protein:vir:1884 Length: 424 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037664;genbank:gi:9634122;genbank:GeneID:1262519 Probab=99.64 E-value=2.8e-15 Score=100.37 Aligned_cols=401 Identities=11% Similarity=0.042 Sum_probs=201.8 Q ss_pred CccccccccccccccccccCchhhhHH-HHhhhh---------cccchh--cc--chhHHHHHHHhhChHHHHHHHHHHH Q lcl|NC_017974. 13 VAKATSEQINVSRMRMSEQGTFALAKV-QVDSER---------MKAEEI--RW--PNLIGTVESMKQDATVATGLDMLYT 78 (507) Q Consensus 13 ~~~~~~~~~~~~~~~~~~~g~~~~~~~-~~~~~~---------e~~~~l--r~--~~~~~~y~~m~~D~~v~s~l~~Rk~ 78 (507) |..|+-+---.++. |-. .+. ..+... ...+.. -. ...+ .-+..++.+.|.+|+..+-. T Consensus 1 ~~~~~~~~~~~~~~-----g~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v-~~~~al~~~~v~~cv~~Ia~ 72 (424) T protein:vir:18 1 MEEPKYTIDLRTNN-----GWW--ARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSI-NDERILQISTVWRCVSLIST 72 (424) T ss_pred CCCCcceEeecCCC-----chH--HHHHhhhcccccccccccccccccccccccccccc-cHHHhhccHHHHHHHHHHHH Confidence 22222110000000 000 000 000000 000000 00 0001 11345678889999999999 Q ss_pred HHhhCCceeeeCCCCChhHH-HHHHHHHHHHHhccC--CCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeeee Q lcl|NC_017974. 79 FVEKAFKDFKVIPGESEESK-KAAKFVEYCLKNMEG--QTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVKN 154 (507) Q Consensus 79 av~~~~~~~~~~~~~~~~~~-~~ae~v~~~l~~~~~--~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~ 154 (507) .|.++++.+.-...+....+ ....-+..+|+.-++ .+..+++..++ +.+++|-+..++++... |. +.. T Consensus 73 ~iA~lp~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~------G~--~~~ 144 (424) T protein:vir:18 73 LTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA------GD--VIS 144 (424) T ss_pred hhccCceEEEEeecCCceeeeccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCC------Cc--EEE Confidence 99888776432211111000 000113333432222 24556666665 57789999999987543 32 345 Q ss_pred ccccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHH Q lcl|NC_017974. 155 LAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLV 234 (507) Q Consensus 155 l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~ 234 (507) |.+.++.++.. ..++..+. +. +...+....+|.+.+|++++.. .+.++|.|.+. T Consensus 145 L~pl~~~~V~v------~~~~~~~~---------y~----------~~~~g~~~~~~~~eIih~r~~~-~dg~~G~spi~ 198 (424) T protein:vir:18 145 LLPLQSANMDV------KLVGKKVV---------YR----------YQRDSEYADFSQKEIFHLKGFG-FTGLVGLSPIA 198 (424) T ss_pred EEEecCcceEE------EEcCCeEE---------EE----------EEeCCeEEEeccccEEEecCcC-CCCcccccHHH Confidence 55555544321 11221111 00 1112233467777776666544 45689999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCce--eEeecCcccc Q lcl|NC_017974. 235 GCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQS--FFMLPSDTKD 312 (507) Q Consensus 235 ~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a--~~iiP~g~~i 312 (507) .+....-.-....++-..|...-+.|--++++|-. ..+++..+.+++.. .++.+|..+ .++++.|+++ T Consensus 199 ~~~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~-------~l~~e~~~~~~~~~---~~~~~g~nag~~~vl~~g~~~ 268 (424) T protein:vir:18 199 FACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEK-------VLTEQQRSQVEENF---KEIAGGPVKKRLWILEAGFST 268 (424) T ss_pred HHHHHHHHHHHHHHHHHHHHHccCCcceEEEeCCc-------CCCHHHHHHHHHHH---HHHhCCcccCCceeccCCceE Confidence 98877766667777777888766666666665432 11223333333322 233334333 4788888764 Q ss_pred cccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhh--hH-HHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 313 NAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSY--NL-SESKQTIHTQFVQRVNEIILEALN 389 (507) Q Consensus 313 ~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~--Al-~~vh~~v~~~~~~aD~~~i~~~ln 389 (507) .- .+-+.....|.+..++..++|++++.-..--.+...++++ +- .+........-+.-.++.|+..|| T Consensus 269 ~~---------l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~eq~~~~f~~~tl~P~~~~ie~~l~ 339 (424) T protein:vir:18 269 SA---------IGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQ 339 (424) T ss_pred Ee---------cCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 32 2222223346677788889999987665433333223332 21 122233445566778888888888 Q ss_pred HHHHHHHHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhh Q lcl|NC_017974. 390 ENLLPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMK 469 (507) Q Consensus 390 ~~li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~ 469 (507) +.|+++. .....+-+|.++..-..|.++.++.+.+++..|++.+ +++|+.+|+|+-+.+|.......- T Consensus 340 ~~L~~~~-------~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~-----NE~R~~~gl~pi~gGD~~~~~~n~ 407 (424) T protein:vir:18 340 RWLIPAK-------DVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTI-----NEMRRTDNLPPLPGGDVAMRQSQY 407 (424) T ss_pred hhcCCcc-------ccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCcCeeeeccCc Confidence 7765542 1111222344444556889999999999999999876 579999999976555432111110 Q ss_pred cCCCcCCCCCCcccccCcccccccccCCCCchh Q lcl|NC_017974. 470 LLGEDTSRAGDGMTKGSSGNGTGKISSTRDNSA 502 (507) Q Consensus 470 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~a 502 (507) .+.+..... ....++.| T Consensus 408 ~~l~~~~~~----------------~~p~~~ga 424 (424) T protein:vir:18 408 VPITDLGTN----------------KEPRNNGA 424 (424) T ss_pred cchHhhhcc----------------CCCccCCC Confidence 000000000 01111111 No 44 >protein:vir:8418 Length: 409 # NCBI annotation: gp13 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818314;genbank:gi:29566750;genbank:GeneID:1260067 Probab=99.63 E-value=1e-14 Score=97.25 Aligned_cols=401 Identities=11% Similarity=-0.020 Sum_probs=204.9 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTFV 80 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~av 80 (507) |.--.+--+.....+... + .+.+...+. .+.. .+.. -..+..++-+.|.+|+..+-..| T Consensus 1 Mgl~~~~f~~~~~~~~~~------~--~~~~~~~~~-~~~~-----------~g~~-v~~~~al~~~~v~~~v~~ia~~i 59 (409) T protein:vir:84 1 MSLFTRIFSGPSEERTLT------K--ISGIPSPAE-DWAM-----------HGDR-PGANSAMTLGAFYACVTLLADTV 59 (409) T ss_pred CchhhhhhcCCCcccccc------c--ccccccccc-hhhc-----------cCcc-cchhhhhccHHHHHHHHHHHHhh Confidence 544332211110000000 0 000010000 0000 0001 11344566788999999999999 Q ss_pred hhCCceeeeCCCCC-hhHHHHHHHHHHHHHhc--cCCCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeeeecc Q lcl|NC_017974. 81 EKAFKDFKVIPGES-EESKKAAKFVEYCLKNM--EGQTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLA 156 (507) Q Consensus 81 ~~~~~~~~~~~~~~-~~~~~~ae~v~~~l~~~--~~~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~ 156 (507) .++++.+.-..+.. .....+ .++|..- ...++.++++.++ +.+.+|-++.++.++... | .+..|. T Consensus 60 A~lp~~~~~~~~~~~~~~~~l----~~lL~~~PN~~~t~~~f~~~l~~~l~l~Gn~~~~i~~~~~~-----g--~~~~L~ 128 (409) T protein:vir:84 60 ASLSIDAYRKKDNVRIPVSPA----PKLLESTPYPGLTWFDWLWMLMESLAVTGNAFGYISARDEA-----N--RPTAIM 128 (409) T ss_pred hhCceEEEEecCCcccccchH----HHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEECCC-----C--ceEEEE Confidence 99887654332211 111223 3334322 2235678888877 577899998888775332 2 233444 Q ss_pred ccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHH Q lcl|NC_017974. 157 FRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGC 236 (507) Q Consensus 157 ~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~ 236 (507) +.++..+. +....|+.+... .......+..+|.+.+|++++....+.++|.|.+..+ T Consensus 129 ~l~p~~v~----v~~~~~~~~~~~-------------------~~~~~~~g~~~~~~dvih~~~~~~~~~~~G~s~i~~~ 185 (409) T protein:vir:84 129 PIHPDCIH----VTDAKDEDGDWI-------------------EPVYRIDGKVVPNHRIMHIKRYPVAGCALGMSPIEKA 185 (409) T ss_pred EEcCceeE----EEEcCCCcceEE-------------------EEEecCCceEEchhhEEEecCCCCCcccccccHHHHH Confidence 44443321 111122211110 0011123445777888887777777778999999988 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccc Q lcl|NC_017974. 237 YRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQ 316 (507) Q Consensus 237 ~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~ 316 (507) .-..-.-....++...|...-+.|--++++|. .-+++...+..+...+.. .+....+++|.|+++.- T Consensus 186 ~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~-------~l~~e~~~~~~~~~~~~~----~n~g~~~vl~~g~~~~~-- 252 (409) T protein:vir:84 186 ASAIGLGLAAERYGLRWFRDSANPSGILSSDA-------DLTPDQVKQTQKQWIQSH----HNRRLPAVMSAGIKWQS-- 252 (409) T ss_pred HHHHHHHHHHHHHHHHHHhcCCCccEEEecCC-------CCCHHHHHHHHHHHHHHh----ccCCCeeecCCCceEEE-- Confidence 87777777777888888877676666666543 223444444444444432 23344678888876432 Q ss_pred cceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhh--hH-HHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 317 YSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSY--NL-SESKQTIHTQFVQRVNEIILEALNENLL 393 (507) Q Consensus 317 ~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~--Al-~~vh~~v~~~~~~aD~~~i~~~ln~~li 393 (507) .+-+.....|.+..++..++|++++--..--.+...+++. +. .+........-+.--++.|+..||+.|. T Consensus 253 -------~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~l~~~L~ 325 (409) T protein:vir:84 253 -------VSITPNESQFLETRSFQRSEIAMWFRIPPHMIGDVEKSTSWGTGIEEQGINFVRHTLLPWLRCIEQALDTFLP 325 (409) T ss_pred -------ccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 2222222345566677778898866443322222222222 11 1222233345566677788888886541 Q ss_pred HHHHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhhcCCC Q lcl|NC_017974. 394 PQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMKLLGE 473 (507) Q Consensus 394 ~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~~~~~ 473 (507) ...+-+|.++.....|+++.++++.++++.|++.+ +.+|+.+|+|+-+++|.-.....-...+ T Consensus 326 ------------~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~-----NE~R~~~g~~p~~ggD~~~~~~n~~~~~ 388 (409) T protein:vir:84 326 ------------RGQFVKFNVDGLMRGDVTARFTAYQMGLQNGIWSV-----NEVRAWEDAPPIPEGDIHLQPMNFVPLG 388 (409) T ss_pred ------------CCCeEEEechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCcceeeecccccccc Confidence 11222455556667899999999999999998776 5799999999765544321111101111 Q ss_pred cCCCCCCcccccCcccccccccCCCCchhhhccC Q lcl|NC_017974. 474 DTSRAGDGMTKGSSGNGTGKISSTRDNSAANLDN 507 (507) Q Consensus 474 ~~~~~~~~~~~~~~~~~~~~~~~~~d~~a~~~~~ 507 (507) ...+ .++. ..+.+..+.-.| T Consensus 389 ~~~~-~~~~-------------~~~~~~~~~~gn 408 (409) T protein:vir:84 389 YVPP-EEPA-------------QEPQPNSATEGN 408 (409) T ss_pred cCCc-cccC-------------cCCCCCCccCCC Confidence 1100 0000 000011111111 No 45 >protein:vir:98396 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918929;genbank:gi:119443691;genbank:GeneID:4594558 Probab=99.63 E-value=1.7e-14 Score=96.05 Aligned_cols=412 Identities=11% Similarity=0.026 Sum_probs=204.7 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHH-HHhhhhcccchhccchhHHHH--HHHhhChHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKV-QVDSERMKAEEIRWPNLIGTV--ESMKQDATVATGLDMLY 77 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~-~~~~~~e~~~~lr~~~~~~~y--~~m~~D~~v~s~l~~Rk 77 (507) -+||.|+++..-+.--.... .|-.+....+...+ ...... + ......| +..++.+.|.+|+..+- T Consensus 14 ~~~~~~~~~~~~~~~f~~~e-----~r~~~~~~~~~~~~~~~~~~~------~-~~~~~~~~~~~al~~~~V~acv~~Ia 81 (441) T protein:vir:98 14 KSRKQSRKELVVVGIFYKNE-----KRDLQYNEDDLQMMVQTLPGF------Q-GTKLRQYKDIEAIRHSDIFTAVMMIA 81 (441) T ss_pred ccccchhhhhhccccccccc-----cccccCCCcchHHHHHHhhcc------c-ccCccccchhhhhccHHHHHHHHHHH Confidence 67777777654332110000 00001111111111 100000 0 0011112 23467888999999999 Q ss_pred HHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccC--CCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeeee Q lcl|NC_017974. 78 TFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEG--QTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVKN 154 (507) Q Consensus 78 ~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~--~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~ 154 (507) ..|.++++.+... +....+ .-+...|..-++ .+..++++.++ +.+++|-+.+++++... |+ +.. T Consensus 82 ~~iA~lpl~~~~~-~~~~~~----~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~------G~--~~~ 148 (441) T protein:vir:98 82 SDLARMPIRVTVN-GQINYS----DRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKT------GE--PMN 148 (441) T ss_pred HhhccCceEEecC-Cccccc----chHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCC------Cc--EEE Confidence 9998887654322 211111 223334432222 23446666654 46789999999988542 32 455 Q ss_pred ccccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHH Q lcl|NC_017974. 155 LAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLV 234 (507) Q Consensus 155 l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~ 234 (507) |.+.++.++. +..+.+|...-.. .... .........+|+..+|++++.+. +..+|.|.+. T Consensus 149 L~~i~~~~v~----v~~~~~g~~~~~~----------~~~~-----~~~~~~~~~~~~~dviHir~~~~-dg~~G~spi~ 208 (441) T protein:vir:98 149 LTFRKTSEIE----LKLDARGRLYYFH----------QRID-----SNGNNIERNVKFEDMLDIKFYSL-DGINGLSLLD 208 (441) T ss_pred EEEEcCceeE----EEECCCCcEEEEE----------EEec-----cCcceeeEEEccccEEEeccCCC-CCccccCHHH Confidence 6666665443 2234444221100 0000 00111223567777777766543 4478999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcC-Cc--eeEeecCccc Q lcl|NC_017974. 235 GCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSG-EQ--SFFMLPSDTK 311 (507) Q Consensus 235 ~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G-~~--a~~iiP~g~~ 311 (507) .+.-..-.-....++-..|.+.-+.|--++++|-.+ .+++....+.+...+. ..| .. ..++++.|++ T Consensus 209 ~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~~~~------~~~e~~~~~~~~~~~~----~~G~~nag~~~vl~~g~~ 278 (441) T protein:vir:98 209 TLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVL------DNKKARDRAREEFHKS----FSGTKQAGKVVVLDESMT 278 (441) T ss_pred HHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCC------CCHHHHHHHHHHHHHH----hcCccccCcceecCCCce Confidence 888777667777788888888766666666654221 1122222222222222 223 22 2477888875 Q ss_pred ccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 312 DNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQTIHTQFVQRVNEIILEALNEN 391 (507) Q Consensus 312 i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~ 391 (507) ..- .+-+..-..|.+..++..++|++++.-..--.+....+ ++..+. +..+..-+.-.++.|+..||+. T Consensus 279 ~~~---------l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~~~-~s~~q~-~~~y~~tl~P~~~~ie~~ln~~ 347 (441) T protein:vir:98 279 FDQ---------LEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETAN-MSITDA-NLDYLSTLKPYITCVCAELNFK 347 (441) T ss_pred EEE---------ccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCCC-ccHHHH-HHHHHHHHHHHHHHHHHHHHhh Confidence 432 22222233466677888889999876543222222222 221221 1112235566777777777765 Q ss_pred HHHHHHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCccc-ccchhhhc Q lcl|NC_017974. 392 LLPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDI-SQEELMKL 470 (507) Q Consensus 392 li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~-~~~~~~~~ 470 (507) |++.. . ..+-+|..+.....|.+..++++++++..|++.+ +++|+.+|+|+-++++. ........ T Consensus 348 L~~~~--------~-~~~~~fd~~~llr~d~~~~~~~~~~~~~~G~~T~-----NE~R~~~gl~pi~gGd~~~~~~~~n~ 413 (441) T protein:vir:98 348 FNDEY--------V-NREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNI-----DEIRQRDGLAPIPGGNGSIHRVDLNH 413 (441) T ss_pred ccccc--------c-CceEEEechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCCcceEeecccc Confidence 53221 1 1122344445567889999999999999998876 58999999997655543 11111111 Q ss_pred CCC------cCCCCCCcccccCcccccc Q lcl|NC_017974. 471 LGE------DTSRAGDGMTKGSSGNGTG 492 (507) Q Consensus 471 ~~~------~~~~~~~~~~~~~~~~~~~ 492 (507) .+- ..++++.+......|++-. T Consensus 414 ~~~~~~~~~q~~~~~~~~~~~kgGe~ne 441 (441) T protein:vir:98 414 VNIELVDEYQMNKSRATDKKLKGGEENE 441 (441) T ss_pred cccccccccccccccccccccCCCCCCC Confidence 110 0011111111111111110 No 46 >protein:vir:10362 Length: 432 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858954;genbank:gi:32128419;genbank:GeneID:2648396 Probab=99.62 E-value=1.7e-14 Score=96.11 Aligned_cols=416 Identities=11% Similarity=0.053 Sum_probs=199.8 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTFV 80 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~av 80 (507) |-.--+.+..-..+.|.+...... .+ +.......-....... +..+ .-+..++-+.|.+|+..+-..| T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~---~~--~~~~~~~~~~~~~s~~------g~~v-~~~~al~~~~V~~~i~~Ia~~i 74 (432) T protein:vir:10 7 LGLLGQLKAMFVPPDPVDIGGGQT---FT--PVNATARDLGIIISDT------GAAV-NADAIMRLDAVAACVKLVSQAI 74 (432) T ss_pred cchhhhhHhhcCCccccccccccc---cc--cCcchhhhhccccccc------Cccc-chhhhhcchHHHHHHHHHHHhh Confidence 222211111111111211100000 00 0000000000000000 0011 0133567789999999999999 Q ss_pred hhCCceeeeCCCCChhHHHHHHHHHHHHHhccC--CCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeeeeccc Q lcl|NC_017974. 81 EKAFKDFKVIPGESEESKKAAKFVEYCLKNMEG--QTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAF 157 (507) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~--~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~ 157 (507) .++++.+.....+. ..+....-+..+|..-++ .++.++++.++ +.+++|.+.+++++.. |+ +..|.+ T Consensus 75 a~lp~~~y~~~~~g-~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~~-------g~--~~~L~~ 144 (432) T protein:vir:10 75 AAMPLTMYMRTPDG-RKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVTD-------GR--IESLQY 144 (432) T ss_pred hhCceeEEEecCCC-cccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEecC-------Cc--EEEEEE Confidence 99877653332211 111111113344432222 24566777665 5688999999988742 22 344555 Q ss_pred cCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHHH Q lcl|NC_017974. 158 RPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCY 237 (507) Q Consensus 158 r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~ 237 (507) .++.++. +..+.+|+.. +.+ ....+....+|.+.++++++.+. +..+|.|.+..+. T Consensus 145 l~~~~v~----v~~~~~g~~~----------y~~---------~~~~g~~~~~~~~~iih~~~~~~-dg~~G~spi~~~~ 200 (432) T protein:vir:10 145 LANDRLT----ITTDTKGNTA----------YRY---------RRTDGQMIDIPKQQIWKIMGYSL-DGENGLSAIRYGA 200 (432) T ss_pred EcCCceE----EEEcCCCcEE----------EEE---------EecCceEEEEcCccEEEecCCCC-CCcccccHHHHHH Confidence 5554332 1223333211 000 01122345677777776665543 4478999999998 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCccccccccc Q lcl|NC_017974. 238 RAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQY 317 (507) Q Consensus 238 ~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~ 317 (507) ..+-.-....++-..|...-+.|--++++|- .-+++...+ +++-..-+.| +...+++|.|+++.- T Consensus 201 ~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~-------~l~~e~~~~-~~~~~~~~~n----ag~~~vl~~g~~~~~--- 265 (432) T protein:vir:10 201 QIFGTAIAAEAQAARAFRNGQLQSVYYQIDR-------FLTDDQYDS-FAKKVSGSVE----AGRAPLLEGGMDVKS--- 265 (432) T ss_pred HHHHHHHHHHHHHHHHHhcCCCcceEEecCC-------CCCHHHHHH-HHHHHhhhhh----CCCceecCCCceEEE--- Confidence 8776666677777777776666665665542 222333322 2322222222 223578888876432 Q ss_pred ceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHH----HHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 318 SMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSE----SKQTIHTQFVQRVNEIILEALNENLL 393 (507) Q Consensus 318 ~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~----vh~~v~~~~~~aD~~~i~~~ln~~li 393 (507) .+-+.....|.+..++...+|++++.-..--.+....|+++.+. .....-..-+.-.++.|+..||+.|+ T Consensus 266 ------l~~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~~~~~t~~~~sn~e~~~~~f~~~tl~P~~~~ie~~ln~kL~ 339 (432) T protein:vir:10 266 ------LGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQLGFLSMTLSPWLRRIEQSIALNLL 339 (432) T ss_pred ------ccCChHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhhhc Confidence 22222233466677889999999775544223332233333221 12223334556677777788887664 Q ss_pred HHHHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhhcCC- Q lcl|NC_017974. 394 PQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMKLLG- 472 (507) Q Consensus 394 ~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~~~~- 472 (507) .+-- . ...+.+|.++..-..|.++.++++.++++.|++.+ +++|+.+|+|+-.+++..........| T Consensus 340 ~~~~------~-~~~~~~fd~~~ll~~d~~~r~~~~~~~~~~G~~T~-----NE~R~~~glppi~g~~~~~~~~~~~~pl 407 (432) T protein:vir:10 340 SPAE------R-RRYFADFDTSALLRADSAARSSYYSQLVNNGLMTR-----DEAREIEGLPKLGGNAAVLTVQSAMVPL 407 (432) T ss_pred Cccc------c-CceEEEeechhhhccCHHHHHHHHHHHHhCCCCCH-----HHHHHHhCCCCCCCCcceEeecCcccch Confidence 4321 0 11122333344456789999999999999998776 589999999876544432211111111 Q ss_pred Cc--CCCCCCcccccCcccccccccCCCCchhh Q lcl|NC_017974. 473 ED--TSRAGDGMTKGSSGNGTGKISSTRDNSAA 503 (507) Q Consensus 473 ~~--~~~~~~~~~~~~~~~~~~~~~~~~d~~a~ 503 (507) +. ..+.+++ ....++. ..+..++ T Consensus 408 ~~~~~~~~~~~--~~~~~~~------~~~~~~~ 432 (432) T protein:vir:10 408 DSIGLQASPEP--ASGLGNQ------QQDKVSK 432 (432) T ss_pred hhhcccCCCCC--CCCCCCc------ccccccC Confidence 00 0000000 0011110 0011111 No 47 >protein:vir:1431 Length: 419 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536360;genbank:gi:17975165;genbank:GeneID:929165 Probab=99.62 E-value=2.9e-14 Score=94.82 Aligned_cols=411 Identities=11% Similarity=0.010 Sum_probs=201.5 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTFV 80 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~av 80 (507) |==+|.......-..+.. .+- +..+...... .-...+ ..+..++-+.|.+|+..+-..| T Consensus 1 ~~~~r~~~~~~~~~~~~~------------~~~-----~~~~~g~~~s---~~~~~v-t~~~al~~~~v~~~v~~ia~~i 59 (419) T protein:vir:14 1 MFFSRQLLSNLGQTQMSA------------GGW-----VSALLGSSRS---DSGQVV-TPASALALTVLQNCVTLLAESI 59 (419) T ss_pred CcccccccccccccccCc------------chh-----hHHhhcCCCc---cCCccc-chHHhhccHHHHHHHHHHHHhh Confidence 332222111111001100 000 0000000000 000000 1234567788999999999999 Q ss_pred hhCCceeeeCCCCChhHHHHHH-HHHHHHHhccC--CCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeeeecc Q lcl|NC_017974. 81 EKAFKDFKVIPGESEESKKAAK-FVEYCLKNMEG--QTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLA 156 (507) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~ae-~v~~~l~~~~~--~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~ 156 (507) .++++.+....++.. .++.+ -+..+|..-++ .++.++++.++ +.+.+|-+++++++... |. +..|. T Consensus 60 A~lp~~~~~~~~~~~--~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~------G~--~~~l~ 129 (419) T protein:vir:14 60 AQLPIELYERSGEDR--KPATDHPLYSILKYEPNSWQTPFEYQEQSQVAVGLRGNSYSFIDRDSD------GV--IQGLY 129 (419) T ss_pred ccCceEEEEecCCcc--ccccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCC------Cc--EEEEE Confidence 998876543322111 11111 13334432222 34567777754 56789999999887532 32 44566 Q ss_pred ccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHH Q lcl|NC_017974. 157 FRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGC 236 (507) Q Consensus 157 ~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~ 236 (507) +.++..+.. ..+.|+..+ +.+. ....+|.+- |+|......+..+|.|.+..+ T Consensus 130 pl~~~~v~v----~~~~~~~~~----------y~~~-------------~~~~~~~~~-i~h~~~~~~dg~~G~s~i~~~ 181 (419) T protein:vir:14 130 PLDNEAVTV----MRGSDLKPV----------YRVR-------------GSDPMPQRL-VHHVRWMSINGYTGLSPVLLH 181 (419) T ss_pred EecCceEEE----EECCCceEE----------EEEc-------------cCcccchhh-eeEecCcCCCCcccccHHHHH Confidence 666654431 223333211 0000 011234443 445444344558999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCC-ce--eEeecCccccc Q lcl|NC_017974. 237 YRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGE-QS--FFMLPSDTKDN 313 (507) Q Consensus 237 ~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~-~a--~~iiP~g~~i~ 313 (507) ....-.-....++...+...-+.|=-++++|-... ...+++. .+ .+.+.......|. .+ .++++.|+++. T Consensus 182 ~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~---~~~~~~~-~~---~~~~~~~~~~~g~~nag~~~vl~~g~~~~ 254 (419) T protein:vir:14 182 ANAIGHAQAIQQYAGKSFMNGTALSGVIERPKDAP---ALKDQAS-VD---RITDGWNAKFGGSGNAKKVALLQEGMTFR 254 (419) T ss_pred HHHHHHHHHHHHHHHHHHhccCCccEEEEecCCCC---cccCHHH-HH---HHHHHHHHHhcCccccCCceecCCCceEE Confidence 98777667777778888877666655666543221 1112222 22 2222222222332 22 46777776543 Q ss_pred ccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 314 APQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQ-TIHTQFVQRVNEIILEALNENL 392 (507) Q Consensus 314 ~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~-~v~~~~~~aD~~~i~~~ln~~l 392 (507) ..+-+..-..+.+..++...+|++++.-..--.+...+|+++-.+.+. ..-..-+.-.++.|+..||+.| T Consensus 255 ---------~l~~~~~d~q~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~t~s~~E~~~~~f~~~~L~P~~~~ie~~l~~kl 325 (419) T protein:vir:14 255 ---------PLSMTNVDAALIDALRLSALDIARIYKIPAHMVNELERATFSNIEHQSLQFVIYTLLPWVKRHEQAKTRDL 325 (419) T ss_pred ---------EccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHhhhc Confidence 222222222355667788999999876655334434456666444333 3334566667788888888766 Q ss_pred HHHHHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhhcCC Q lcl|NC_017974. 393 LPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMKLLG 472 (507) Q Consensus 393 i~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~~~~ 472 (507) +.+-- . ...+-+|.++.....|.+..++++.++++.|++.+ +.+|+.+|+|+-+.+|.......-... T Consensus 326 l~~~~------~-~~~~i~fd~~~l~r~d~~~~~~~~~~~~~~G~~T~-----NE~R~~~gl~p~~gGD~~~~~~n~~~~ 393 (419) T protein:vir:14 326 LLPSE------R-KQYFIEYNLAGLLRGDQSSRYAAYAVGRQWGWLSI-----NDIRRLENMPPVKGGDIYLSPMNMVDA 393 (419) T ss_pred cCccc------c-CCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCcCeeeeccccccc Confidence 43311 0 11122333344456789999999999999998876 579999999976555532211110010 Q ss_pred CcCCCCCCcccccCcccccccccCCCCchhhhcc Q lcl|NC_017974. 473 EDTSRAGDGMTKGSSGNGTGKISSTRDNSAANLD 506 (507) Q Consensus 473 ~~~~~~~~~~~~~~~~~~~~~~~~~~d~~a~~~~ 506 (507) ....+...+.+ .+..+.-+...+-|+ T Consensus 394 ~~~~~~~~~~~--------~~~~~~~~e~~~~l~ 419 (419) T protein:vir:14 394 SKPQQLPVGKS--------EPTKAAIDEIGRILS 419 (419) T ss_pred cccccccCCCC--------CCccccccchhcccC Confidence 11100000000 000111111112222 No 48 >protein:vir:80796 Length: 574 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504121;genbank:gi:158079308;genbank:GeneID:5666445 Probab=99.62 E-value=9.3e-14 Score=92.02 Aligned_cols=442 Identities=11% Similarity=0.058 Sum_probs=193.7 Q ss_pred CcccccccccccCcccc--cccccccccccc----ccCchhhhH-HHHhhhhcccchhccchhHHHHHHHhhChHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKAT--SEQINVSRMRMS----EQGTFALAK-VQVDSERMKAEEIRWPNLIGTVESMKQDATVATGL 73 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~--~~~~~~~~~~~~----~~g~~~~~~-~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l 73 (507) |..|.-.++.+..-..+ +++.+..-...+ ..|...... ++......+...+ +..+.. ....+.|..++ T Consensus 27 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~l~~---~~~~~iv~~~i 101 (574) T protein:vir:80 27 MHLREIDTNVVNNEPYSMESIEKGMNGKTTAYMQPIIGEMSVNPGYKTKPSIRNSQDL--HKTLKK---FGNNIILNAII 101 (574) T ss_pred cccchhhhhhhhccCCCHHHHHHhHhhhcccccchhhhhccccccccCcCccCCcccH--HHHHHh---hccChhHHHHH Confidence 55554444444321111 111110000000 000000000 0000000111111 122222 22345667777 Q ss_pred HHHHHHHh-----------hCCceeeeCCCC-ChhHHHHH--HHHHHHHHhcc------CCCHHHHHHHHHH-HHhhcce Q lcl|NC_017974. 74 DMLYTFVE-----------KAFKDFKVIPGE-SEESKKAA--KFVEYCLKNME------GQTLRQFARDAAT-FNEYGLS 132 (507) Q Consensus 74 ~~Rk~av~-----------~~~~~~~~~~~~-~~~~~~~a--e~v~~~l~~~~------~~~~~~~l~~~l~-a~~~Gfs 132 (507) ..|+.+|. +++|.+.....+ ....++.+ .-+...|++.. ..+|.++++.++. .+.+|.+ T Consensus 102 ~~~~~~V~~~~~~i~~~ia~lp~~i~~kd~~~~~~~~~~~~~~~l~~ll~~~~~~~nP~~~s~~ef~~~lv~~lll~Gna 181 (574) T protein:vir:80 102 NTRSNQVSMYCKPARNSETGVGYEIRLKDIEAEPTSHDIANIKRIESFLENTAQFRDPNRDNFTTFCKKLVRATYMYDQV 181 (574) T ss_pred HHHHHHHHHHHHHHHhhhccCceEEEEeccCCCccchhhhhhhHHHHHHhccCCCCCCccccHHHHHHHHHHHHHhcCCe Confidence 77776664 466766543221 11111111 12333443211 1256678877764 6779999 Q ss_pred eeeeeeeccCccccCcceeeeeccccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCcccccc Q lcl|NC_017974. 133 VVEKVYTQVTVGEYIGKYKVKNLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPI 212 (507) Q Consensus 133 ~~Ei~w~~~~~~~~~g~~~~~~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~ 212 (507) .+|+++... |+ +..|.+.++.++... .+.++. . ......++. .........+|. T Consensus 182 yi~i~r~~~------G~--~~~L~pl~p~~V~v~----~d~~~~--~--~~~~~~y~~----------~~~g~~~~~~~~ 235 (574) T protein:vir:80 182 NFEKVFDKD------GN--FIKFDTVDPTTIFLA----TNGEGK--L--IKNGERFVQ----------VIDNRIVAKFNE 235 (574) T ss_pred EEEEEECCC------Cc--EEEEEEEcCceeEEE----EcCccc--c--ccCceEEEE----------EeCCceEEEEcc Confidence 999998643 22 445666666544321 111211 0 000000111 111122334677 Q ss_pred ccEEEEeeCCccC---ccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHH Q lcl|NC_017974. 213 NRVMLMNTGGSSS---QALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGL 289 (507) Q Consensus 213 ~k~i~~~~~~~~~---~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~ 289 (507) ..+|++++...++ .+||.+.+..+....-.-.....+-..|...-+.|--++.++. +..-+++.... + T Consensus 236 ~eiih~~~~~~~~~~~~~~G~spi~~a~~~i~~~~~a~~~~~~~f~ng~~p~gil~~~~-----~~~ls~e~~~~----l 306 (574) T protein:vir:80 236 RELAFAVRNPRADIEVGQYGYPELEIALKQFIAHENTEVFNDRFFSHGGTTRGILHVKT-----GQQQSQQALDI----F 306 (574) T ss_pred ccEEEEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCC-----CCCCCHHHHHH----H Confidence 7777777665543 5689999999888877777777888888876555544444331 11112332222 2 Q ss_pred HHHHHHhhcC-Ccee---EeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhccccccc--------- Q lcl|NC_017974. 290 MSDAANAHSG-EQSF---FMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVG--------- 356 (507) Q Consensus 290 ~~~~~n~~~G-~~a~---~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~--------- 356 (507) .+...+...| ..++ ++.+.|+++ +..+.+..-..|.+..++..++|+.++.-..--.+ T Consensus 307 k~~~~~~~~G~~n~g~~~vl~~~G~~~---------~~l~~s~~D~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~t~~g 377 (574) T protein:vir:80 307 RREWRSSLAGINGSWQIPVVSAEDVKF---------VNMTPSANDMQFEKWLNYLINVISALYGIDPAEINFPNNGGATG 377 (574) T ss_pred HHHHHHHhccccccccceeecCCCceE---------EEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhcccccccccc Confidence 2222222223 2233 333566542 22222333334666777788889987744321111 Q ss_pred -ccchhhhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHh Q lcl|NC_017974. 357 -NDKGGSYNLSESKQ-TIHTQFVQRVNEIILEALNENLLPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGA 434 (507) Q Consensus 357 -~~~~GS~Al~~vh~-~v~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~ 434 (507) .++..++|-.+.+. ......+.-.++.|+..||+.|++.+ . ... +|.|+.....+.....+ +.+++. T Consensus 378 s~~~~~n~sn~E~~~~~f~~~tL~P~~~~ie~~ln~~Ll~~~---~-----~~~--~~~f~~~d~~~~~~~~~-~~~~~~ 446 (574) T protein:vir:80 378 SKGGSLNEGNSKEKMQASQNKGLQPLLRFIEDTVNTYIVAEF---G-----EKY--QFQFRGGDLSAQLDKLK-IIEQEG 446 (574) T ss_pred cccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhc---C-----Cce--EEEecccchhhHHHHHH-HHHHHh Confidence 01112245444443 34455678899999999999887542 1 111 46666544433333332 335677 Q ss_pred cCCccccHHHHHHHHHHcCCCCCCCcccccch-hhh----cCCCcCCC-----CCCcccccCccccccc-ccCCCCchhh Q lcl|NC_017974. 435 VGYLPKTPKVINRVLEVLGIDEKIEEDISQEE-LMK----LLGEDTSR-----AGDGMTKGSSGNGTGK-ISSTRDNSAA 503 (507) Q Consensus 435 ~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~-~~~----~~~~~~~~-----~~~~~~~~~~~~~~~~-~~~~~d~~a~ 503 (507) .|++.+ +.+|+.+|+|+-..+|..... ... .....+.. +.....-.+.+.+.++ .+.-+..+.. T Consensus 447 ~G~lT~-----NE~R~~lgl~Pi~gGD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~ 521 (574) T protein:vir:80 447 KVFRTV-----NEIRHDKGLEPIKGGDVILNGVHIQAIGQALQEEQLEYQRSQDRLNRLLELSGGDVEQPEPEEPKDSQN 521 (574) T ss_pred CCccCH-----HHHHHHhCCCCCCCCCEeeeccceeecccccccccCCccchhccccccccccCCCCCCCCCCCCCCccc Confidence 887766 689999999876554432110 000 00000000 0000000000000000 0000100000 Q ss_pred -----------------------------------hccC Q lcl|NC_017974. 504 -----------------------------------NLDN 507 (507) Q Consensus 504 -----------------------------------~~~~ 507 (507) +.+| T Consensus 522 d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 560 (574) T protein:vir:80 522 DTDVSFQDEQQGLNGKSKKVNGKVDDNVGKDGQLKSEEN 560 (574) T ss_pred cccchhhhhhhhhccchhhhcCCcccccccccccccccc Confidence 0001 No 49 >protein:vir:63755 Length: 547 # NCBI annotation: gp14 # Family: family:all:2446 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547619;genbank:GeneID:3783506 Probab=99.62 E-value=8e-14 Score=92.38 Aligned_cols=441 Identities=14% Similarity=0.072 Sum_probs=198.4 Q ss_pred Cccc-------ccccccccCccccccc-cc--------------ccccccc--ccCchhhhHHHHhhhhcccchhccchh Q lcl|NC_017974. 1 MSRK-------RNRNRSVKVAKATSEQ-IN--------------VSRMRMS--EQGTFALAKVQVDSERMKAEEIRWPNL 56 (507) Q Consensus 1 ~~~k-------~~~~~~~~~~~~~~~~-~~--------------~~~~~~~--~~g~~~~~~~~~~~~~e~~~~lr~~~~ 56 (507) |.=- +++.-.......+... .. -.+.-++ .+|..... .++ ...+..+-+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~--~g~---~~~~~~~~~~~ 75 (547) T protein:vir:63 1 MGLFESIRLAGVNKSDAVKHIEVDDNYSIAIQQREQEQISKAMNNKEVAYSQPVIGSMSAN--PGF---KTKPSIRNNQD 75 (547) T ss_pred CchhhhhhhhcCCccccccccccccccchhhhhhhHHHHHHhhcccchhhhchhhheeecc--ccc---ccCCccCChhH Confidence 2110 1110000000000000 00 0000000 00100000 000 01111111111 Q ss_pred HHHHHHH-hhChHHHHHHHHHHHHHhh-----------CCceeeeCCC---CChhHHHHHHHHHHHHHhccC------CC Q lcl|NC_017974. 57 IGTVESM-KQDATVATGLDMLYTFVEK-----------AFKDFKVIPG---ESEESKKAAKFVEYCLKNMEG------QT 115 (507) Q Consensus 57 ~~~y~~m-~~D~~v~s~l~~Rk~av~~-----------~~~~~~~~~~---~~~~~~~~ae~v~~~l~~~~~------~~ 115 (507) +.-+-+. ..-+.|.+|+..++..|.+ ++|++++... ....+++..+.++++|..... .+ T Consensus 76 l~~l~~~~~~npiv~~~I~~~a~~ia~~~~~~~~~~~~~~~~ir~k~~~~~~~~~~~~~~~~l~~~l~~pn~~~~p~~~s 155 (547) T protein:vir:63 76 LHGVLKKFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIKRIESFIEKTGVDNDINRDS 155 (547) T ss_pred HHHHHHHhhcCHHHHHHHHHHHHHHhhhhhhhhhhccCCCceeEecccccccChhhHHHHHHHHHHHHhhCCCCCCccch Confidence 1111111 2358899999999988764 3466665432 223333333344444443211 24 Q ss_pred HHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeeeeccccCchhcccccceeecCCCccccccccccccccccccc Q lcl|NC_017974. 116 LRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTAS 194 (507) Q Consensus 116 ~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~ 194 (507) |.+++..++ +.+.+|.+++|+++... |+ +..|.+.++.++.. ..+.||..... ...+ T Consensus 156 ~~~f~~~lv~d~ll~Gn~~~~i~rd~~------G~--~~~L~~l~p~~V~~----~~~~~g~~~~~----~~~y------ 213 (547) T protein:vir:63 156 FSSFVKKIVRDTYMYDQVNFEKVFNRN------QS--MVRFVAKDPTTIFF----ATTADGKIPDN----GNRF------ 213 (547) T ss_pred HHHHHHHHHHHHHhhCCEEEEEEECCC------Cc--EEEEEEecCceeEE----EECCccccccC----ceEE------ Confidence 667887766 46789999999998643 32 44566666655532 22333321100 0000 Q ss_pred cccccccccCCCccccccccEEEEeeCCccC---ccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccC Q lcl|NC_017974. 195 EIGVGGVSTRMSDVIIPINRVMLMNTGGSSS---QALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQIL 271 (507) Q Consensus 195 ~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~---~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~ 271 (507) ...........++...+|++++....+ .+||.|.+..+......-....++-..|.+.-+.|--++.+|. T Consensus 214 ----~~~~~~~~~~~~~~~eiih~r~n~~~~~~~~~~G~Spi~~~~~~i~~~~~a~~~~~~~f~Ng~~p~giL~~~~--- 286 (547) T protein:vir:63 214 ----VQVIDQKIVATFNAREMAFAVRNPRSDIYATGYGYPELEIALKQFIAHENTEAFNDRFFSHGGTTRGILQIKA--- 286 (547) T ss_pred ----EEEcCCcEEEEeccccEEEecccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHcCCCcceEEEecC--- Confidence 001112223456777777777655443 5789999999998887778888888888876555443443321 Q ss_pred CCCCCCchHHHHHHHHHHHHHHHHhhcC-Ccee--Eee-cCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHH Q lcl|NC_017974. 272 NKAAMDQSSPEAEMVRGLMSDAANAHSG-EQSF--FML-PSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDR 347 (507) Q Consensus 272 k~~~~~~~~~~~~~l~a~~~~~~n~~~G-~~a~--~ii-P~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~ 347 (507) +..-+++....+.+.+.+ ...| ..++ .++ ..|++ ++..+.+..-..|.+..++..++|+.+ T Consensus 287 --~~~ls~e~~~~lk~~~~~----~~~G~~nagk~~vl~~~g~~---------~~~l~~~~~d~qfle~~~~~~~~Ia~a 351 (547) T protein:vir:63 287 --AQQQSQHALEIFKREWKN----SLSGINGSWQIPVVSAEDVK---------FVNMTPSARDMEFEKWLNYLINVISAL 351 (547) T ss_pred --CCCCCHHHHHHHHHHHHH----HhcCcccccccccccCCCce---------EEEcCCChhHHHHHHHHHHHHHHHHHH Confidence 111223332222222222 2223 2233 233 34443 333333333334666777788899998 Q ss_pred Hhccccccc--------ccchh--hhhHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCccccceEEecC Q lcl|NC_017974. 348 LGAGFINVG--------NDKGG--SYNLSESK-QTIHTQFVQRVNEIILEALNENLLPQLLALNDMRLPETEMPYVKAGE 416 (507) Q Consensus 348 iLGqtLts~--------~~~~G--S~Al~~vh-~~v~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~~P~~~~~~ 416 (507) +.-...-.+ ...++ +++-.+.+ ......-+.-.++.|+..||+.|++.+ .. .+ +|.|+. T Consensus 352 fgVPP~~lG~~~~~~~~~~~~~s~t~sn~e~~~~~~~~~tL~P~~~~ie~~ln~~L~~~~-------~~--~~-~~~f~~ 421 (547) T protein:vir:63 352 YGIDPAEINIPNNGGATGSKGGSLNEGNSAEKNQASKNKGLQPLLGFIEDFINKHIVAEF-------GD--KY-TFQFVG 421 (547) T ss_pred hCCCHHHcCcccccccccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc-------CC--ce-EEEeec Confidence 744321111 01112 23322222 244566778888999999998775431 11 12 577877 Q ss_pred cCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCC-CCccccc-chhhh--------cCCCc----------CC Q lcl|NC_017974. 417 IVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEK-IEEDISQ-EELMK--------LLGED----------TS 476 (507) Q Consensus 417 ~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~-~~~~~~~-~~~~~--------~~~~~----------~~ 476 (507) ....+....++. .+++..|.+.+ +.+|+.+|+|+. +.+|..- +.... ..++. .+ T Consensus 422 ~~~~~~~~~~~~-~~~~~~g~lT~-----NE~R~~~gl~P~~egGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 495 (547) T protein:vir:63 422 GDIKSELESVKI-LAEKAKVAMTV-----NEVRKELNLPGDVIGGDIPLNGVIVQRIGQLMQQEQFEHEKQQSNLQMLQE 495 (547) T ss_pred cccccHHHHHHH-HHHHhCCCcCH-----HHHHHHhCCCCCCCCCceeecccccccccccccccCCccccchhhcccccc Confidence 777776665554 45677787654 689999999763 3333211 00000 00000 00 Q ss_pred CCC----CcccccCcccccccccCCCCchhhhccC Q lcl|NC_017974. 477 RAG----DGMTKGSSGNGTGKISSTRDNSAANLDN 507 (507) Q Consensus 477 ~~~----~~~~~~~~~~~~~~~~~~~d~~a~~~~~ 507 (507) ..+ ......+++. ........|-+.++-+| T Consensus 496 ~~~~~~~~~~~~~~~~~-~~~~~~~~d~~~~~~~~ 529 (547) T protein:vir:63 496 QTGNRVSTDVEDIPDGK-DTTGDIGKDGQRKDKDN 529 (547) T ss_pred ccCCCCCCCCCCCCCCc-ccCCCcCccccccCccc Confidence 000 0000000000 00111123333333333 No 50 >protein:vir:80333 Length: 419 # NCBI annotation: gp4, phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111083;genbank:gi:134288632;genbank:GeneID:4960580 Probab=99.61 E-value=1.6e-14 Score=96.15 Aligned_cols=411 Identities=12% Similarity=0.008 Sum_probs=207.9 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTFV 80 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~av 80 (507) |--+|..++......+.... . ....+. .... ..+..+ .-+..++-+.|.+|+..+-..| T Consensus 1 m~~~~~~~~~~~~~~~~~~~---------~-------~~~~~g-~~~s---~~~~~v-~~~~al~~~~v~~cv~~ia~~i 59 (419) T protein:vir:80 1 MFFSRQLLSNLGQTQPGSGG---------W-------VSALLG-SARS---EAGQVV-TPASALSLTVLQNCVTLLAESI 59 (419) T ss_pred CCcccccccccCcCCCCcch---------h-------hHHhhc-cccc---ccCccc-ChHHhhccHHHHHHHHHHHHhh Confidence 65444433322211111100 0 000000 0000 000111 1133556788999999999999 Q ss_pred hhCCceeeeCCCCChhHHHHHH-HHHHHHHhcc--CCCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeeeecc Q lcl|NC_017974. 81 EKAFKDFKVIPGESEESKKAAK-FVEYCLKNME--GQTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLA 156 (507) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~ae-~v~~~l~~~~--~~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~ 156 (507) .++++.+.-..++.. +++.+ -+..+|...+ ..++.++++.++ +.+.+|-+++++++... |. +..|. T Consensus 60 a~lp~~~~~~~~~~~--~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~------G~--~~~L~ 129 (419) T protein:vir:80 60 AQLPVELYERSGDDR--KPATDHPLYSILKYEPNPWQTPFEYQEQSQVAVGLRGNSYSFIDRDQD------GV--IQGLY 129 (419) T ss_pred ccCceEEEEecCCCc--ccccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCC------Cc--EEEEE Confidence 998776433222111 11111 1333443222 234567777766 56789999999987542 32 45566 Q ss_pred ccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHH Q lcl|NC_017974. 157 FRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGC 236 (507) Q Consensus 157 ~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~ 236 (507) +.++.++.. ..+.++..+ +.+ .+...+|.+-++++++.+ .+.++|.|.+..+ T Consensus 130 ~i~~~~v~i----~~~~~~~~~----------y~~-------------~~~~~~~~~~i~h~~~~~-~d~~~G~s~i~~~ 181 (419) T protein:vir:80 130 PLDNEAVTV----MKGPDLKPM----------YRV-------------AGADPLPQRLVHHVRWMS-INGYTGLSPVLLH 181 (419) T ss_pred EecCceEEE----EECCCceEE----------EEE-------------cCccccchhheEEecCCC-CCCcccccHHHHH Confidence 666654431 223333211 000 011235555444444444 4558999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCC---ceeEeecCccccc Q lcl|NC_017974. 237 YRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGE---QSFFMLPSDTKDN 313 (507) Q Consensus 237 ~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~---~a~~iiP~g~~i~ 313 (507) ....-.-....++...|...-+.|--++++|-... ...+++. .+.+.+...+...|. ...++++.|+++. T Consensus 182 ~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~---~~~~~~~----~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~ 254 (419) T protein:vir:80 182 ANAIGHAQAIQQYAGKSFMNGTALSGVIERPTDAP---ALKDQAS----VDRITDGWNAKFGGSGNAKKVALLQEGMKFK 254 (419) T ss_pred HHHHHHHHHHHHHHHHHHhcCCCccEEEEecCCCC---cccCHHH----HHHHHHHHHHHhcCccccCCceecCCCceEE Confidence 88776667777888888887777766666553221 1112222 222333332223332 2347778776532 Q ss_pred ccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 314 APQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQ-TIHTQFVQRVNEIILEALNENL 392 (507) Q Consensus 314 ~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~-~v~~~~~~aD~~~i~~~ln~~l 392 (507) ..+-+.....+.+..++..++|+.++.-..--.+...+|+++-.+.+. .....-+.-.++.|+..||+.| T Consensus 255 ---------~l~~s~~d~q~~e~~~~~~~~Ia~~fgVPp~llg~~~~~t~~n~e~~~~~f~~~~l~P~~~~ie~~l~~kl 325 (419) T protein:vir:80 255 ---------PLSMTNVDAALIDALRLSALDIARIYKIPAHMVNELERATFSNIEHQSLQFVIYTLLPWVKRHEQAKTRDL 325 (419) T ss_pred ---------eccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHhhhc Confidence 222222233466777888899999876655333333455665444333 3345557778888888888877 Q ss_pred HHHHHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhhcCC Q lcl|NC_017974. 393 LPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMKLLG 472 (507) Q Consensus 393 i~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~~~~ 472 (507) +.+--. ...+.+|.++.....|+++.++.+.++++.|++.+ +.+|+.+|+|+-+.+|.......-... T Consensus 326 l~~~~~-------~~~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~-----NE~R~~~g~~p~~gGD~~~~~~n~~~~ 393 (419) T protein:vir:80 326 LLPSER-------KQYFIEYNLAGLLRGDQSSRYAAYAVGRQWGWLSI-----NDIRRLENMPPVKGGDIYLSPMNMVDA 393 (419) T ss_pred cCcccc-------CCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCcceeeeccccccc Confidence 544211 11122344445556789999999999999999876 579999999976555543211111111 Q ss_pred CcCCCCCCcccccCcccccccccCCCCchhhhcc Q lcl|NC_017974. 473 EDTSRAGDGMTKGSSGNGTGKISSTRDNSAANLD 506 (507) Q Consensus 473 ~~~~~~~~~~~~~~~~~~~~~~~~~~d~~a~~~~ 506 (507) ....+...+ ..+++. ++-+.--.-|+ T Consensus 394 ~~~~~~~~~-----~~~~~~---~~~~~~~~~l~ 419 (419) T protein:vir:80 394 SKPQPIPMG-----KTEPTK---AALDEIGRILS 419 (419) T ss_pred cccccccCC-----CCCchh---hhHHHHHhhcC Confidence 100100000 000000 00000011111 No 51 >protein:vir:81072 Length: 432 # NCBI annotation: p07 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285677;genbank:gi:148727185;genbank:GeneID:5247117 Probab=99.61 E-value=4.1e-14 Score=93.99 Aligned_cols=418 Identities=11% Similarity=0.049 Sum_probs=200.6 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTFV 80 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~av 80 (507) |.--.+.+..-..+.+....... +..+..+............+ ..+ ..+..++-+.|.+|+..+-..| T Consensus 7 mg~f~r~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~g------~~v-~~~~al~~~~V~~~i~~Ia~~i 74 (432) T protein:vir:81 7 LGLFGQLKAMFVPPDPVDIGGGQ-----TFTPVNATARDLGIIISDTG------AAV-NADAIMRLDAVAACVKLVSQAI 74 (432) T ss_pred cchhhhhhhhccccccccccccc-----ccccCccchhhhcccccccC------ccc-chHhhhccHHHHHHHHHHHHhh Confidence 33333322222111111100000 00000000000000000001 111 1244567789999999999999 Q ss_pred hhCCceeeeCCCCChhHHHHHHHHHHHHHhccC--CCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeeeeccc Q lcl|NC_017974. 81 EKAFKDFKVIPGESEESKKAAKFVEYCLKNMEG--QTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAF 157 (507) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~--~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~ 157 (507) .++++.+.....+.. .+....-+..+|..-++ .+..++++.++ +.+++|-+..++++. +|+ +..|.+ T Consensus 75 a~lp~~~y~~~~~g~-~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnayv~i~~~-------~g~--~~~L~~ 144 (432) T protein:vir:81 75 AAMPLTMYMRTPDGR-KEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT-------DGR--IESLQY 144 (432) T ss_pred hhCceeeEEecCCcc-eecccchHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec-------CCc--EEEEEE Confidence 998776432221111 00011113344432222 23556777766 578899999888774 132 344555 Q ss_pred cCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHHH Q lcl|NC_017974. 158 RPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCY 237 (507) Q Consensus 158 r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~ 237 (507) .++..+. +..+.+|.... .+. ...+....+|.+.++++++.+..+ .+|.|.+..+. T Consensus 145 l~~~~v~----v~~~~~g~~~y----------~~~---------~~~g~~~~~~~~~iih~r~~~~dg-~~G~spi~~~~ 200 (432) T protein:vir:81 145 LANDRLT----ITTDPKGNTAY----------RYR---------RTDGQMIDIPKQQIWKIMGYSLDG-ENGLSAIRYGA 200 (432) T ss_pred EcCCceE----EEECCCCcEEE----------EEE---------ecCceEEEEccccEEEecCCCCCC-cccccHHHHHH Confidence 5554332 22333432110 000 112234467777777776665444 78999999888 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCccccccccc Q lcl|NC_017974. 238 RAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQY 317 (507) Q Consensus 238 ~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~ 317 (507) -..-.-....++-+.|...-+.|--++++|- .-+++......+.+. -+.| +...+++|.|+++.- T Consensus 201 ~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~-------~l~~e~~~~~~~~~~-~~~n----ag~~~vl~~g~~~~~--- 265 (432) T protein:vir:81 201 QIFGTAIAAEAQAARAFRNGQLQSVYYQIDR-------FLTDDQYDSFAKKVS-GSVE----AGRAPLLEGGMDVKS--- 265 (432) T ss_pred HHHHHHHHHHHHHHHHHhcCCCcceEEecCC-------CCCHHHHHHHHHHHh-hhhc----CCCceecCCCceEEE--- Confidence 7776666677777777765555545555542 222333333222222 1111 224678888876432 Q ss_pred ceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHH----HHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 318 SMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSE----SKQTIHTQFVQRVNEIILEALNENLL 393 (507) Q Consensus 318 ~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~----vh~~v~~~~~~aD~~~i~~~ln~~li 393 (507) .+-+.....+.+..++..++|++++.-..--.+....|+++.+. ........-+.--++.|+..||+.|+ T Consensus 266 ------l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~sn~eq~~~~f~~~tl~P~~~~ie~~l~~kLl 339 (432) T protein:vir:81 266 ------LGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQLGFLTMTLSPWLRRIEQSIALNLL 339 (432) T ss_pred ------ccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCccccccchHHHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 22222233466677888899999876544333333334443221 12222334455567777777777665 Q ss_pred HHHHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhhcCC- Q lcl|NC_017974. 394 PQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMKLLG- 472 (507) Q Consensus 394 ~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~~~~- 472 (507) .+-- ....+.+|.++..-..|.++.++++.+++..|++.+ +.+|+.+|+|+-.+++..........| T Consensus 340 ~~~~-------~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~t~-----NE~R~~~glpp~~g~~~~~~~~~~~~pl 407 (432) T protein:vir:81 340 SPAE-------RRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTR-----DEAREIEGLPKLGGNAAVLTVQSAMVPL 407 (432) T ss_pred Cccc-------cCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCH-----HHHHHHhCCCCCCCCcceEeecCcccch Confidence 4321 011122333344456789999999999999998776 579999999876544332211111111 Q ss_pred CcCCCCCCcccccCcccccccccCCCCchhh Q lcl|NC_017974. 473 EDTSRAGDGMTKGSSGNGTGKISSTRDNSAA 503 (507) Q Consensus 473 ~~~~~~~~~~~~~~~~~~~~~~~~~~d~~a~ 503 (507) +.....+.+......++.. .+..+. T Consensus 408 ~~~~~~~~~~~~~~~~n~~------~~~~~~ 432 (432) T protein:vir:81 408 DSIGLQASPEPASGLGNQQ------QDKVSK 432 (432) T ss_pred hhhccCCCCCCCCCCCCcc------cccccC Confidence 0000000000000111110 011111 No 52 >protein:vir:4509 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599035;genbank:gi:19548993;genbank:GeneID:935206 Probab=99.61 E-value=2e-14 Score=95.73 Aligned_cols=399 Identities=10% Similarity=0.014 Sum_probs=201.9 Q ss_pred Ccccc-cccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKR-NRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTF 79 (507) Q Consensus 1 ~~~k~-~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~a 79 (507) |-.++ =+++....+.. +.+.. .....+.+.. + ..+ .-+..++-+.|.+|+..+-.. T Consensus 17 ~~~~~lf~~~~~~~~~~----~~~~~----~~~~~~~~~~--------~------~~v-s~~~al~~~~v~~cv~~Ia~~ 73 (424) T protein:vir:45 17 VLLDALFRSKSLENPST----PITGD----AVDTDGLFRA--------D------VYV-SPETAMKLAAVYSCIYVLSSS 73 (424) T ss_pred HHHHhhccccCCCCCcc----ccchh----hhhhhccccC--------C------cee-chHHhhccHHHHHHHHHHHHH Confidence 22111 01111111110 00000 0000010000 0 000 123456678899999999999 Q ss_pred HhhCCceeeeCCCCChhHHHHHH-HHHHHHHhccC--CCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeeeec Q lcl|NC_017974. 80 VEKAFKDFKVIPGESEESKKAAK-FVEYCLKNMEG--QTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVKNL 155 (507) Q Consensus 80 v~~~~~~~~~~~~~~~~~~~~ae-~v~~~l~~~~~--~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l 155 (507) |.++++.+....+... +++.+ -+.++|..-++ .+..++++.++ +.+++|-++.++++... |. +..| T Consensus 74 iA~lp~~v~~~~~~~~--~~~~~~~l~~lL~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~------G~--~~~L 143 (424) T protein:vir:45 74 LAQMPLHVMRRHKGKV--EPARDHPAFYLVHDEPNTWQTSYKWRELKQRHILGWGNGYTWVKRNRR------GE--VISL 143 (424) T ss_pred HhhCceEEEEecCCce--eecccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCC------Cc--EEEE Confidence 9999876543221111 11111 23344433222 23456666655 57789999999987543 22 3345 Q ss_pred cccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHH Q lcl|NC_017974. 156 AFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVG 235 (507) Q Consensus 156 ~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~ 235 (507) .+.++..+. ...++..+... .........+|++.++++++. ..+..+|.|.+.. T Consensus 144 ~~l~~~~v~------i~~~~~~~~y~-------------------~~~~~~~~~~~~~eVih~r~~-~~d~~~G~spi~~ 197 (424) T protein:vir:45 144 DCCMPWETT------LMNTGGRYTYG-------------------LYNEYGAFAISPDDMIHIRAL-GNNQKMGLSPIMQ 197 (424) T ss_pred EEecCceEE------EEEcCCeEEEE-------------------EEecCceEEECcccEEEecCc-CCCCcccccHHHH Confidence 555544321 11222211110 111122345777776666654 4566899999998 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCccccccc Q lcl|NC_017974. 236 CYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAP 315 (507) Q Consensus 236 ~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~ 315 (507) +....-.-....++-+.|...-+.|--++++|-. -+++......+.+.+........+...++++.|++..- T Consensus 198 ~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~-------l~~e~~~~~~~~~~~~~~g~~~n~g~~~vl~~g~~~~~- 269 (424) T protein:vir:45 198 HAETIGMGMSGQKYTESFFSGNARPAGIVSVKSG-------LNKESWGWLKDQWQKASQALRRQENKTMLLPADLDYKA- 269 (424) T ss_pred HHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCC-------CCHHHHHHHHHHHHHHhccccccCCceeEcCCCceEEE- Confidence 8877766666777777777766666666666532 22322222222232221111111224577888875432 Q ss_pred ccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 316 QYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESK-QTIHTQFVQRVNEIILEALNENLLP 394 (507) Q Consensus 316 ~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh-~~v~~~~~~aD~~~i~~~ln~~li~ 394 (507) .+-+..-..|.+..++...+|++++.-..--.+...+++++-.+-+ ......-+.-.++.|++.||+.|+. T Consensus 270 --------l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~eq~~~~f~~~tL~P~~~~ie~~ln~kLl~ 341 (424) T protein:vir:45 270 --------LTVSPVDAQIIDMMKLNRSMIAGIFNIPAHMINDLEKATFSNISAQAIQFVRYTMMPWVTNWEQELNRRLFT 341 (424) T ss_pred --------ccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Confidence 2222222345666778889999987665533333334555533333 3344555677888888888877755 Q ss_pred HHHHhCCCcCCccccceEEec--CcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhhcC- Q lcl|NC_017974. 395 QLLALNDMRLPETEMPYVKAG--EIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMKLL- 471 (507) Q Consensus 395 ~l~~lN~~~~~~~~~P~~~~~--~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~~~- 471 (507) .--... + . +|+|+ ..-..|+++.++.+.+++..|++.+ +.+|+.+|+|+-+++|..-....... T Consensus 342 ~~e~~~-----g-~--~i~fd~~~llr~d~~~r~~~~~~~~~~g~~T~-----NE~R~~~gl~pi~ggD~~~~~~n~~~~ 408 (424) T protein:vir:45 342 RAELAA-----G-Y--YVRFNLTGLLRGTPQERAQFYHFAITDGWMSR-----NEARAFEDMNPVEGLDEMLVSVNAANP 408 (424) T ss_pred hhhhcC-----C-c--EEEeechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCcceeeeccccccc Confidence 422111 1 1 24444 4446789999999999999998766 57999999997655554322111111 Q ss_pred CCcCCCCCCcccccCccc Q lcl|NC_017974. 472 GEDTSRAGDGMTKGSSGN 489 (507) Q Consensus 472 ~~~~~~~~~~~~~~~~~~ 489 (507) ..+..++ +...+...+ T Consensus 409 ~~~~~~~--~~~~~~~~~ 424 (424) T protein:vir:45 409 AGDFKPP--KNDEGKTNE 424 (424) T ss_pred ccccCCC--CCCCCCCCC Confidence 1111110 001111111 No 53 >protein:vir:4337 Length: 434 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061500;genbank:gi:9635589;genbank:GeneID:1262858 Probab=99.61 E-value=2.5e-14 Score=95.12 Aligned_cols=420 Identities=13% Similarity=0.069 Sum_probs=208.7 Q ss_pred CcccccccccccCcccccc--ccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSE--QINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYT 78 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~--~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~ 78 (507) |+|-=.+-.+.....+.+. ........++ .-..+..+... +... ...+ ..+..++-+.|.+|+..+-. T Consensus 1 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~g~---~~~~-g~~v-~~~~al~~~~V~~~i~~ia~ 70 (434) T protein:vir:43 1 MSKSLGKVLSSATSAPRSSLFGWGGKTIRLT-----DGAFWSQFLGR---ESSS-GKKV-TVDKAMKLSAVWACVRLIST 70 (434) T ss_pred CccchhhhhhhcccccchhhhcccccccccC-----chHHHHHHhcC---CccC-Ccee-chhhhhccHHHHHHHHHHHH Confidence 7665433222222222111 0000000000 00011111110 0000 0111 13456778899999999999 Q ss_pred HHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccC--CCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeeeec Q lcl|NC_017974. 79 FVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEG--QTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVKNL 155 (507) Q Consensus 79 av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~--~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l 155 (507) .|.++++...-...+....+...--+..+|..-++ .+..++++.++ +.+++|-+..++.+. .|+ +..| T Consensus 71 ~ia~lp~~~~~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~~~-------~G~--~~~L 141 (434) T protein:vir:43 71 SVAGLPLGVYERKADGSRVDARSFPLYDVVHNSPNDDMTAFQFWQAMVASMLLWGNAYAEIRRA-------AGR--PAAL 141 (434) T ss_pred hhhhCceEEEEEcCCCccccccccHHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeC-------CCc--EEEE Confidence 99998776432211111011111123344432222 24556777765 567899998877653 132 3345 Q ss_pred cccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHH Q lcl|NC_017974. 156 AFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVG 235 (507) Q Consensus 156 ~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~ 235 (507) .+.++.++. +..+.+|... +.+ ....+....+|.+.+|++++.+ .+..+|.|.+.. T Consensus 142 ~~l~p~~v~----~~~~~~g~~~----------y~~---------~~~~g~~~~~~~~eVih~~~~~-~dg~~G~spi~~ 197 (434) T protein:vir:43 142 DFLLPSRVD----LECDENGRLK----------YFY---------TTKKGARREIERTNMLHIPAFT-LDGRIGLSAIRY 197 (434) T ss_pred EEEcCcceE----EEEcCCCeEE----------EEE---------EecCceEEEEccccEEEecCcC-CCCccccCHHHH Confidence 555554432 2233343211 000 1112234567888877776654 455889999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCce--eEeecCccccc Q lcl|NC_017974. 236 CYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQS--FFMLPSDTKDN 313 (507) Q Consensus 236 ~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a--~~iiP~g~~i~ 313 (507) +....-.-....++-..|...-+.|--++++|-. -+++. .+.+++.. .+..++..+ .+++|.|+++. T Consensus 198 ~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~-------l~~e~-~~~~r~~~---~~~~g~~nag~~~vl~~g~~~~ 266 (434) T protein:vir:43 198 GVDVFGSVMSAEDAANGTFKNGLLPTVAFKVDRI-------LQPAQ-REEFREYV---KSVSGAMNSGRSPVLEQGITPE 266 (434) T ss_pred HHHHHHHHHHHHHHHHHHHhccCCcceEEecCCC-------CCHHH-HHHHHHHH---HHhcCccccCCccccCCCceEE Confidence 9888777777777778888766666666665532 22222 22333333 333333333 46778887532 Q ss_pred ccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhh--hh-HHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 314 APQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGS--YN-LSESKQTIHTQFVQRVNEIILEALNE 390 (507) Q Consensus 314 ~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS--~A-l~~vh~~v~~~~~~aD~~~i~~~ln~ 390 (507) ..+.+.....|.+..++..++|++++.-..--.+...+++ ++ ..+........-+.-.++.|+..||+ T Consensus 267 ---------~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~s~~e~~~~~f~~~~L~P~~~~ie~~ln~ 337 (434) T protein:vir:43 267 ---------TIGINPVDAQLLETREHGVIEICRWFGVPPWMIGQTDKGSNWGTGLEQQMLAFLTFSISSITNQIQQCVNK 337 (434) T ss_pred ---------EccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 2222233335677888889999998766543333322333 22 12222333445567778888888887 Q ss_pred HHHHHHHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhh-- Q lcl|NC_017974. 391 NLLPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELM-- 468 (507) Q Consensus 391 ~li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~-- 468 (507) .|+..--..+ .+-+|.++.....|.+..++.+.+++..|++.+ +.+|+.+|+|+-+++|....... T Consensus 338 kL~~~~~~~~-------~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~-----NE~R~~~gl~p~~ggD~~~~~~n~~ 405 (434) T protein:vir:43 338 RLLTAPERIR-------YYAEFSLEGFLKADSAGRAAWYSTMAQNGFMTR-----NEGRRKENLPELPGGDILTVQSNLV 405 (434) T ss_pred hcCChhhhcC-------ceEEEechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCCeEeeccCcc Confidence 6644321111 122333344456789999999999999999876 57999999997655543211110 Q ss_pred --hcCCCcCCCCCCcccccCcccccccccCCCC Q lcl|NC_017974. 469 --KLLGEDTSRAGDGMTKGSSGNGTGKISSTRD 499 (507) Q Consensus 469 --~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d 499 (507) ....+ ..+...... +......+|...- T Consensus 406 ~~~~~~~-~~~~~~~~~---~~~~~~~~~~~~~ 434 (434) T protein:vir:43 406 PIDQLGQ-SNKSQAVRA---ALMNWFSQPEPQE 434 (434) T ss_pred chhhhhc-cCCCcchhh---hhhccCCCCCCCC Confidence 00111 001000000 0000011111111 No 54 >protein:vir:79984 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430000;genbank:gi:156604055;genbank:GeneID:5525444 Probab=99.60 E-value=4.8e-14 Score=93.59 Aligned_cols=410 Identities=11% Similarity=0.036 Sum_probs=203.6 Q ss_pred CcccccccccccCc---cccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHH--HHHhhChHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVA---KATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTV--ESMKQDATVATGLDM 75 (507) Q Consensus 1 ~~~k~~~~~~~~~~---~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y--~~m~~D~~v~s~l~~ 75 (507) =+||.|+++.+-+. +........+ ..+...+-.....-. ......| +..++.+.|.+|+.. T Consensus 14 ~~~~~~~~~~~~~~lf~~~e~R~~~~~--------~~~~~~~~~~~~~~~------~~~~~~~~~~~al~~~~V~~cv~~ 79 (441) T protein:vir:79 14 KSRKQSRKELVVVGIFYKNEKRDLQYN--------EDDLQMMVQTLPGFQ------GTKLRQYKDIEAIRHSDIFTAVMM 79 (441) T ss_pred cccccchhhhhccccccccccccccCC--------CcchHHHHHHhcccC------cccccccchhhhhccHHHHHHHHH Confidence 46666666533111 1111111111 111111100000000 0011122 234677889999999 Q ss_pred HHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccC--CCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceee Q lcl|NC_017974. 76 LYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEG--QTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKV 152 (507) Q Consensus 76 Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~--~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~ 152 (507) +-..|.++++.+.-. +..... +-+...|..-++ .+..+++..+. +.+.+|-+.+++++... |+ + T Consensus 80 Ia~~iA~lp~~~~~~-~~~~~~----~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~------G~--~ 146 (441) T protein:vir:79 80 IASDLARMPIRVTVN-GQINYS----DRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKT------GE--P 146 (441) T ss_pred HHHhhccCceeeecC-cccccc----chHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECCC------Cc--E Confidence 999998887654321 211111 223334433222 23456666655 46789999999987532 32 4 Q ss_pred eeccccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchH Q lcl|NC_017974. 153 KNLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSP 232 (507) Q Consensus 153 ~~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gl 232 (507) ..|.+.++.++. +..+.+|... ... ..... ........+|+..+|++++.+ .+..+|.|+ T Consensus 147 ~~L~~i~~~~v~----v~~d~~g~~~-~~~---------~~~~~-----~~~~~~~~~~~~dvih~k~~~-~dg~~G~sp 206 (441) T protein:vir:79 147 MNLTFRKTSEIE----LKSDARGRLY-YFH---------QRIDS-----NGNNIERNVKFEDMLDIKFYS-LDGINGLSL 206 (441) T ss_pred EEEEEEcCceeE----EEECCCccEE-EEE---------EEecc-----CCceeEEEEccccEEEeccCC-CCCccccCH Confidence 556666665443 2233333211 000 00000 001112346777777777654 344789999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcC-Cc--eeEeecCc Q lcl|NC_017974. 233 LVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSG-EQ--SFFMLPSD 309 (507) Q Consensus 233 l~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G-~~--a~~iiP~g 309 (507) +..+.-..-.-....++-..|...-+.|--++++|-. ..+++....+++-.+.+ ..| .. ..+++|.| T Consensus 207 l~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~-------~~~~e~~e~~r~~~~~~---~~G~~nag~~~vl~~G 276 (441) T protein:vir:79 207 LDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGV-------LDNKKARDRAREEFHKS---FSGTKQAGKVVVLDES 276 (441) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCC-------CCCHHHHHHHHHHHHHH---hcCccccCcceecCCC Confidence 9988877766677777777888776666666665421 11222222223222221 123 22 24788888 Q ss_pred ccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 310 TKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQTIHTQFVQRVNEIILEALN 389 (507) Q Consensus 310 ~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln 389 (507) ++.. ..+-+..-..|.+..++..++|++++.-...-.+....+ ++..+. +..+..-+.-.++.|+..|| T Consensus 277 ~~~~---------~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~-~s~~q~-~~~~~~tl~P~~~~ie~eln 345 (441) T protein:vir:79 277 MTFD---------QLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETAN-MSITDA-NLDYLSTLKPYITCVCAELN 345 (441) T ss_pred ceEE---------EccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCCC-ccHHHH-HHHHHHHHHHHHHHHHHHHh Confidence 7543 222222233466777888899999876543222221112 222221 11223356667778888888 Q ss_pred HHHHHHHHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCccccc-chhh Q lcl|NC_017974. 390 ENLLPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQ-EELM 468 (507) Q Consensus 390 ~~li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~-~~~~ 468 (507) +.|++.. .+ .+-+|.++.....|.+..+++++++++.|++.+ +++|+.+|+|+-.+++... .... T Consensus 346 ~kl~~~~--------~~-~~~~fd~~~llr~D~~~~~~~~~~~i~~G~~T~-----NE~R~~~gl~Pi~ggd~~~~~~~~ 411 (441) T protein:vir:79 346 FKFNDEY--------VN-REFKFDTTEIRVVDEKTQAEIDKINIDSGKMNI-----DEIRQRDGLAPIPGGNGSIHRVDL 411 (441) T ss_pred hhccccc--------cC-ceEEeechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCCcceEeecc Confidence 7653321 11 122344455567789999999999999998876 5799999998765555311 1111 Q ss_pred hcCCCc------CCCCCCcccccCcccccc Q lcl|NC_017974. 469 KLLGED------TSRAGDGMTKGSSGNGTG 492 (507) Q Consensus 469 ~~~~~~------~~~~~~~~~~~~~~~~~~ 492 (507) ...|-+ .++.+........|+.-. T Consensus 412 n~~~~~~~~~~~~~~~~~~~~~~kgGe~~e 441 (441) T protein:vir:79 412 NHVNIELVDEYQMNKSRATDKKLKGGEENE 441 (441) T ss_pred cccccccccccccccccccccccCCCCCCC Confidence 111100 011111111111111110 No 55 >protein:vir:9408 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803386;genbank:gi:29028698;genbank:GeneID:1258164 Probab=99.60 E-value=4.8e-14 Score=93.59 Aligned_cols=410 Identities=11% Similarity=0.036 Sum_probs=203.6 Q ss_pred CcccccccccccCc---cccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHH--HHHhhChHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVA---KATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTV--ESMKQDATVATGLDM 75 (507) Q Consensus 1 ~~~k~~~~~~~~~~---~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y--~~m~~D~~v~s~l~~ 75 (507) =+||.|+++.+-+. +........+ ..+...+-.....-. ......| +..++.+.|.+|+.. T Consensus 14 ~~~~~~~~~~~~~~lf~~~e~R~~~~~--------~~~~~~~~~~~~~~~------~~~~~~~~~~~al~~~~V~~cv~~ 79 (441) T protein:vir:94 14 KSRKQSRKELVVVGIFYKNEKRDLQYN--------EDDLQMMVQTLPGFQ------GTKLRQYKDIEAIRHSDIFTAVMM 79 (441) T ss_pred cccccchhhhhccccccccccccccCC--------CcchHHHHHHhcccC------cccccccchhhhhccHHHHHHHHH Confidence 46666666533111 1111111111 111111100000000 0011122 234677889999999 Q ss_pred HHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccC--CCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceee Q lcl|NC_017974. 76 LYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEG--QTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKV 152 (507) Q Consensus 76 Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~--~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~ 152 (507) +-..|.++++.+.-. +..... +-+...|..-++ .+..+++..+. +.+.+|-+.+++++... |+ + T Consensus 80 Ia~~iA~lp~~~~~~-~~~~~~----~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~------G~--~ 146 (441) T protein:vir:94 80 IASDLARMPIRVTVN-GQINYS----DRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKT------GE--P 146 (441) T ss_pred HHHhhccCceeeecC-cccccc----chHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECCC------Cc--E Confidence 999998887654321 211111 223334433222 23456666655 46789999999987532 32 4 Q ss_pred eeccccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchH Q lcl|NC_017974. 153 KNLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSP 232 (507) Q Consensus 153 ~~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gl 232 (507) ..|.+.++.++. +..+.+|... ... ..... ........+|+..+|++++.+ .+..+|.|+ T Consensus 147 ~~L~~i~~~~v~----v~~d~~g~~~-~~~---------~~~~~-----~~~~~~~~~~~~dvih~k~~~-~dg~~G~sp 206 (441) T protein:vir:94 147 MNLTFRKTSEIE----LKSDARGRLY-YFH---------QRIDS-----NGNNIERNVKFEDMLDIKFYS-LDGINGLSL 206 (441) T ss_pred EEEEEEcCceeE----EEECCCccEE-EEE---------EEecc-----CCceeEEEEccccEEEeccCC-CCCccccCH Confidence 556666665443 2233333211 000 00000 001112346777777777654 344789999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcC-Cc--eeEeecCc Q lcl|NC_017974. 233 LVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSG-EQ--SFFMLPSD 309 (507) Q Consensus 233 l~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G-~~--a~~iiP~g 309 (507) +..+.-..-.-....++-..|...-+.|--++++|-. ..+++....+++-.+.+ ..| .. ..+++|.| T Consensus 207 l~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~-------~~~~e~~e~~r~~~~~~---~~G~~nag~~~vl~~G 276 (441) T protein:vir:94 207 LDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGV-------LDNKKARDRAREEFHKS---FSGTKQAGKVVVLDES 276 (441) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCC-------CCCHHHHHHHHHHHHHH---hcCccccCcceecCCC Confidence 9988877766677777777888776666666665421 11222222223222221 123 22 24788888 Q ss_pred ccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 310 TKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQTIHTQFVQRVNEIILEALN 389 (507) Q Consensus 310 ~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln 389 (507) ++.. ..+-+..-..|.+..++..++|++++.-...-.+....+ ++..+. +..+..-+.-.++.|+..|| T Consensus 277 ~~~~---------~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~-~s~~q~-~~~~~~tl~P~~~~ie~eln 345 (441) T protein:vir:94 277 MTFD---------QLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETAN-MSITDA-NLDYLSTLKPYITCVCAELN 345 (441) T ss_pred ceEE---------EccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCCC-ccHHHH-HHHHHHHHHHHHHHHHHHHh Confidence 7543 222222233466777888899999876543222221112 222221 11223356667778888888 Q ss_pred HHHHHHHHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCccccc-chhh Q lcl|NC_017974. 390 ENLLPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQ-EELM 468 (507) Q Consensus 390 ~~li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~-~~~~ 468 (507) +.|++.. .+ .+-+|.++.....|.+..+++++++++.|++.+ +++|+.+|+|+-.+++... .... T Consensus 346 ~kl~~~~--------~~-~~~~fd~~~llr~D~~~~~~~~~~~i~~G~~T~-----NE~R~~~gl~Pi~ggd~~~~~~~~ 411 (441) T protein:vir:94 346 FKFNDEY--------VN-REFKFDTTEIRVVDEKTQAEIDKINIDSGKMNI-----DEIRQRDGLAPIPGGNGSIHRVDL 411 (441) T ss_pred hhccccc--------cC-ceEEeechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCCcceEeecc Confidence 7653321 11 122344455567789999999999999998876 5799999998765555311 1111 Q ss_pred hcCCCc------CCCCCCcccccCcccccc Q lcl|NC_017974. 469 KLLGED------TSRAGDGMTKGSSGNGTG 492 (507) Q Consensus 469 ~~~~~~------~~~~~~~~~~~~~~~~~~ 492 (507) ...|-+ .++.+........|+.-. T Consensus 412 n~~~~~~~~~~~~~~~~~~~~~~kgGe~~e 441 (441) T protein:vir:94 412 NHVNIELVDEYQMNKSRATDKKLKGGEENE 441 (441) T ss_pred cccccccccccccccccccccccCCCCCCC Confidence 111100 011111111111111110 No 56 >protein:vir:2683 Length: 412 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075502;genbank:gi:12719431;genbank:GeneID:920150 Probab=99.59 E-value=2.1e-14 Score=95.60 Aligned_cols=405 Identities=10% Similarity=0.018 Sum_probs=200.5 Q ss_pred CcccccccccccCccccccc-cccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQ-INVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTF 79 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~-~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~a 79 (507) |+=-++.+-..+..++.... ...+ ..+....+.+...... .+ ..+..++-+.|.+|+..+-.. T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~----------~v-~~~~a~~~~~v~~~i~~ia~~ 64 (412) T protein:vir:26 1 MNVIAKENIVTRIKKKLIDNWIDQS-----TSKLYDFSPWKNRSFW----------GV-INNTLETNETIFSAITKLSNS 64 (412) T ss_pred CccchhhhhhhhhhhhHhhhhhccc-----ccccccccccCCcccc----------cc-chhhhhccHHHHHHHHHHHHh Confidence 54333322211111111100 0000 0011111111111100 00 123455778899999999999 Q ss_pred HhhCCceeeeCCCCChhHHHHHHHHHHHHHhccC--CCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeeeecc Q lcl|NC_017974. 80 VEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEG--QTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLA 156 (507) Q Consensus 80 v~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~--~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~ 156 (507) |.++++...-.. ...+..+..+ |..-++ .++.++++.++ +.+.+|-+..+++.... |. +..|. T Consensus 65 iA~lp~~~~~~~--~~~~~~~~~l----L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~------G~--~~~L~ 130 (412) T protein:vir:26 65 MASLPLKMYEDY--KVVNTEVSDL----LTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIY------HQ--PSKLF 130 (412) T ss_pred HhhCceeEeecc--ccccchHHHH----HHhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCC------Cc--EEEEE Confidence 988876543221 1222233333 332222 34667777665 56789999998876532 32 34555 Q ss_pred ccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHH Q lcl|NC_017974. 157 FRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGC 236 (507) Q Consensus 157 ~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~ 236 (507) +-|+.++. +..+.|++.+....+ ...+....+|.+.++++++....+..+|.|.+..+ T Consensus 131 ~l~~~~v~----v~~~~~~~~~~y~~~------------------~~~g~~~~~~~~evih~~~~~~~~~~~G~s~i~~~ 188 (412) T protein:vir:26 131 LLNPDVVE----MLIENQSRELYYSIH------------------AATGNKLIVHNMDMLHFKHIVASNMVQGISPIDVL 188 (412) T ss_pred EEcCceeE----EEEeCCCcEEEEEEE------------------cCCceEEEEccccEEEeCCCCCCCCcccccHHHHH Confidence 55554432 222333332221110 11122345777777777766566778899988877 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccc Q lcl|NC_017974. 237 YRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQ 316 (507) Q Consensus 237 ~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~ 316 (507) .-..-..... ..|. ...++.+--. +.+.+..-+++......+.+.+. ..+....++++.|+++. T Consensus 189 ~~~i~~~~a~-~~~~--~~~~~~~~~~------i~~~~~~l~~e~~~~~~~~~~~~----~~~~g~~~vl~~g~~~~--- 252 (412) T protein:vir:26 189 KNTTDFDNAV-RTFN--LTEMQKPDSF------MLKYGSNVGKEKRQQVLEDFKQY----YEENGGILFQEPGVEIE--- 252 (412) T ss_pred HHHHHHHHHH-HHHH--HHhcCCCCce------EEecCCCCCHHHHHHHHHHHHHH----hhcCCCeeecCCCceEE--- Confidence 6544444333 3332 2233332111 11222233344444444444333 23444567788887543 Q ss_pred cceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 317 YSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQT-IHTQFVQRVNEIILEALNENLLPQ 395 (507) Q Consensus 317 ~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~~-v~~~~~~aD~~~i~~~ln~~li~~ 395 (507) ..+-+....+|.+..++...+|++++.-..--.+...+++++..+.+.. ....-+.-.++.|++.||+.|+.. T Consensus 253 ------~l~~~~~d~q~~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~kLl~~ 326 (412) T protein:vir:26 253 ------PLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNRFYLQHTLLPIVKQYEEEFNRKLLTK 326 (412) T ss_pred ------EcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCc Confidence 2222223334666677788999998876654444344566776655554 334557778888888888766543 Q ss_pred HHHhCCCcCCccccceEEec--CcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhhcCCC Q lcl|NC_017974. 396 LLALNDMRLPETEMPYVKAG--EIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMKLLGE 473 (507) Q Consensus 396 l~~lN~~~~~~~~~P~~~~~--~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~~~~~ 473 (507) .= ..... +|.|+ .....|.++.++++.+++..|+..+ +.+|+.+|+|+-+.+|..-.... ..|- T Consensus 327 ~~------~~~~~--~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~-----NE~R~~~gl~p~~ggD~~~~~~n-~~~~ 392 (412) T protein:vir:26 327 TD------REKNR--YFKFNVKSYLRADSATQAEVYFKAVRSGYYTI-----NDIREWEDLPPVEGGDKPLISGD-LYPI 392 (412) T ss_pred cc------ccCcc--eEEeechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCcCeeeeccc-cccc Confidence 21 01111 35554 4456789999999999999999876 57999999987655554221111 1111 Q ss_pred cCCCCCCcccccCcccccccc Q lcl|NC_017974. 474 DTSRAGDGMTKGSSGNGTGKI 494 (507) Q Consensus 474 ~~~~~~~~~~~~~~~~~~~~~ 494 (507) +.........++.. +...+. T Consensus 393 ~~~~~~~~~~~gG~-~n~~e~ 412 (412) T protein:vir:26 393 DTPLELRKSLKGGD-KNVNES 412 (412) T ss_pred ccchhhcccccCCC-CCcCCC Confidence 11111000011100 111111 No 57 >protein:vir:960 Length: 413 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076614;genbank:gi:13095722;genbank:GeneID:920279 Probab=99.59 E-value=5.4e-14 Score=93.32 Aligned_cols=397 Identities=10% Similarity=0.018 Sum_probs=203.2 Q ss_pred Cccccccccccc--Ccc-ccccccccccccccccCch---hhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVK--VAK-ATSEQINVSRMRMSEQGTF---ALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLD 74 (507) Q Consensus 1 ~~~k~~~~~~~~--~~~-~~~~~~~~~~~~~~~~g~~---~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~ 74 (507) |+-||+.+.-.= ..+ +.+. .....+.++. ..+...... .+ ....|..++..+.|.+|+. T Consensus 4 ~~~~~~~~~m~~F~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~------~~----~~~~~~~~~~~~~v~~cI~ 68 (413) T protein:vir:96 4 VSEIRKDKNLKFFNNKRSPTEE-----SKAKDEIPKAPQVVMTLPNFFK------EL----ISDGYTKLSDSPEVRMAVD 68 (413) T ss_pred cchhhhhhcCCccccCCCcchh-----hhhhccccccccccccchhhHh------hh----ccchhHHHhhchHHHHHHH Confidence 777765433110 000 1000 0000011111 000000000 00 1122455667899999999 Q ss_pred HHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccC--CCHHHHHHHHHH-HHhhcceeeeeeeeccCccccCccee Q lcl|NC_017974. 75 MLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEG--QTLRQFARDAAT-FNEYGLSVVEKVYTQVTVGEYIGKYK 151 (507) Q Consensus 75 ~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~--~~~~~~l~~~l~-a~~~Gfs~~Ei~w~~~~~~~~~g~~~ 151 (507) .+...|.++++.+.....+. .+++..-+...|..-++ .++.+++..++. .+.+|.++++++....+. . T Consensus 69 ~ia~~ia~~~~~~~~~~~~~--~~~~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~r~~~g~-------~ 139 (413) T protein:vir:96 69 CIADLVSNMTIQLMQNGETG--DKRIKNDLSRVVDIEPNKYLSRKTFIQWLVRSMLLEGNGNAVVKPQVSGD-------K 139 (413) T ss_pred HHHHhhccCceEEEEecCCC--ccccccHHHHHHHhccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCC-------c Confidence 99999988877654332211 11222234444432222 346678777664 567999999988754321 2 Q ss_pred eeeccccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccC-ccccc Q lcl|NC_017974. 152 VKNLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSS-QALGV 230 (507) Q Consensus 152 ~~~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~-~p~G~ 230 (507) +..|.+.++.++.. ..++..+. +. ....+..++++.+|++++..... ..+|. T Consensus 140 ~~~L~~l~~~~v~~------~~~~~~~~---------y~------------~~~~~~~~~~~evih~k~~~~~~~~~~G~ 192 (413) T protein:vir:96 140 IIGLTPISPYKVTF------NVSDDDLD---------YS------------ITFDNKEYDPSTLLHFVLNPSIERPFIGT 192 (413) T ss_pred eEEEEEecCceeEE------EEcCCeEE---------EE------------EeecCcEEchhhEEEEeccCCCCCccccc Confidence 23455555543321 11111111 00 01122345667777777654443 45699 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcC-Cce--eEeec Q lcl|NC_017974. 231 SPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSG-EQS--FFMLP 307 (507) Q Consensus 231 gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G-~~a--~~iiP 307 (507) |.+..+.-..-.-....++...|....+.|--++++|.. -+++...++.+.+.+. ..| ..+ .++++ T Consensus 193 s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~-------l~~e~~~~~~~~~~~~----~~g~~n~g~~~vl~ 261 (413) T protein:vir:96 193 GYKVALKDIVGNLKQASVTKKGFMASEYMPNLIVSVDSD-------SDELSDEEGRENFEEM----YLKRKEAGKPWIIP 261 (413) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCC-------CCHHHHHHHHHHHHHH----hcCccccCceeeec Confidence 999999888877777888888888887777666665532 2333333333333332 223 222 36677 Q ss_pred CcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 308 SDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQTIHTQFVQRVNEIILEA 387 (507) Q Consensus 308 ~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ 387 (507) .|+.. .. ++.. -+.....+.+..++.-++|++++--..--.+.+ .++.+. .......-+.-.++.|++. T Consensus 262 ~~~~~-~~----~~~~--~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~-~~~~~~---~~~~~~~~l~P~~~~ie~~ 330 (413) T protein:vir:96 262 EGMVN-VQ----QIKP--LTLNDLAINDAVTLDKKTVAGIFGVPAFLLGVG-TYNKDE---FNNFINTKIMSIAQVIQQT 330 (413) T ss_pred CCccc-cc----cccc--CChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCC-cchHHH---HHHHHHHHHHHHHHHHHHH Confidence 77531 11 1111 111222455666788888998876554222211 223322 2334455567778888888 Q ss_pred HHHHHHHHHHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchh Q lcl|NC_017974. 388 LNENLLPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEEL 467 (507) Q Consensus 388 ln~~li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~ 467 (507) ||+.|++ +..+-+|.++.....|.++.++++.+++..|++.+ +.+|+.+|+|+-+.+|...... T Consensus 331 ln~~ll~-----------~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~-----NE~R~~~g~~p~~~gd~~~~~~ 394 (413) T protein:vir:96 331 YNKLIVE-----------EDMYFSLNPRSLYNYSLTEMVSAGAQMTQLNALRR-----NEFRNWVGMPPDAEMDDLLVLE 394 (413) T ss_pred HHHhhCC-----------CCcEEEEechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCcceeeecc Confidence 8876532 22232444555567899999999999999999876 5799999998754444321111 Q ss_pred hhcCCCcCCCCCCcccccCccccc Q lcl|NC_017974. 468 MKLLGEDTSRAGDGMTKGSSGNGT 491 (507) Q Consensus 468 ~~~~~~~~~~~~~~~~~~~~~~~~ 491 (507) . ..+-... ++ ...+-+.+| T Consensus 395 n-~~~~~~~--~~--~~~~~~~dt 413 (413) T protein:vir:96 395 N-YLQQKDL--VN--QKKLIQDET 413 (413) T ss_pred c-ccchhhc--cc--ccCCCCCCC Confidence 1 1110000 00 000111111 No 58 >protein:vir:4194 Length: 540 # NCBI annotation: putative portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071819;genbank:gi:11863102;genbank:GeneID:1257604 Probab=99.58 E-value=2.3e-13 Score=89.85 Aligned_cols=438 Identities=11% Similarity=0.059 Sum_probs=201.8 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHh-hChHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMK-QDATVATGLDMLYTF 79 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~-~D~~v~s~l~~Rk~a 79 (507) |=+-+---+....+.........+.......+ ...++. -.+..+.++. ..++|.+|+..+... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~pp------~~~~~La~~~~~n~~v~scI~~ia~~ 64 (540) T protein:vir:41 1 MFNYHLSIKSLEKYRAIKGDTDSQALKEDRFE----------EYVEPK------VHPLVLLSLLQVNPYHASACSIKAND 64 (540) T ss_pred CCCcccChhhccchhhhhccccccccccCCCC----------ccccCC------CCHHHHHHHHHhcHHHHHHHHHHHHH Confidence 11111111111111111100000100000000 001111 1122333444 578999999999999 Q ss_pred HhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeeeecccc Q lcl|NC_017974. 80 VEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAFR 158 (507) Q Consensus 80 v~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~r 158 (507) |.++++.+....+ ...++ +.+ ...++.+++..++ +.+.+|-+.+|+++... | .+..|.+. T Consensus 65 ia~~~~~i~~~~~------~~~~~----lpN-~~~t~~~f~~~~v~dlll~Gnayv~i~r~~~------G--~~~~L~~i 125 (540) T protein:vir:41 65 ILRTGYLIDGDDG------GVEEL----LRA-CRPSFEFILLQALEDLQVFNYCTLEVVRDDQ------G--EPVRLDYI 125 (540) T ss_pred HhcCCceEecCcc------chhhh----ccC-CCCCHHHHHHHHHHHHHhcCCeEEEEEECCC------C--cEEEEEEe Confidence 9988776643321 12222 222 2245778888876 57789999999988643 2 24456666 Q ss_pred CchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHHHH Q lcl|NC_017974. 159 PQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYR 238 (507) Q Consensus 159 ~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~ 238 (507) ++.++.. ..++.............+....+...............+|...+|+++.....+.+||.+.+..+.. T Consensus 126 ~~~~V~v------~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~eViHir~~~~~~~~~G~Spi~~~~~ 199 (540) T protein:vir:41 126 PAHTVRV------HRDGSRYMQTWDGIHVTYFKDYRYEGEVNPDNGEDQDGVGANEIIFIHLPSPICSYYGVPRYLSAAP 199 (540) T ss_pred CCcceEE------eEcCceeEeeecCceeeeeecccccceeeccccccceeecccceEEecCCCCCCCcccccHHHHHHH Confidence 6654421 1122111111110000000011111111122223344567777766665555677899999999998 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcC--Cce--eEeec------C Q lcl|NC_017974. 239 AWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSG--EQS--FFMLP------S 308 (507) Q Consensus 239 ~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G--~~a--~~iiP------~ 308 (507) ....-....++-..|...-+.|--++++|-.+.+.. ..+.+......+.+.+...+...| ..+ .++++ . T Consensus 200 ~i~~~~~~~~~~~~~f~Ng~~p~giL~~~g~l~~e~-~~~~~~~~~~~~~~~~~~~~~~~g~~~nag~~~vLe~~~~~~~ 278 (540) T protein:vir:41 200 SILAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDEM-ELGSDGEPTGRTVLQGLIEDNFKYLKEAPHTPLVFSIPGGDTV 278 (540) T ss_pred HHHHHHHHHHHHHHHHhccCCCceEEEeCcccCchh-ccchHHHHHHHHHHHHHHHHHhccccccccceEEEecCCCccc Confidence 888888888888888876666666666653332221 112222222223333332221111 122 23332 2 Q ss_pred cccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccc--hhhhhHHHH-HHHHHHHHHHHHHHHHH Q lcl|NC_017974. 309 DTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDK--GGSYNLSES-KQTIHTQFVQRVNEIIL 385 (507) Q Consensus 309 g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~--~GS~Al~~v-h~~v~~~~~~aD~~~i~ 385 (507) |+ +++..+-+..-..|.+..++...+|++++.-..--.+... ++++|-.+. ........+.-.++.|+ T Consensus 279 g~---------~~~pl~~~~~d~qfle~~~~~~~eIa~afgVPp~~lG~~~~~~~n~sn~eq~~~~f~~~tL~P~~~~ie 349 (540) T protein:vir:41 279 EV---------TFTPLNTSQKELSFREYAAEKKHDIAAAHMIDPYRLGITDVGPLGGNFAEVARRTYYESVVRPQQEIVS 349 (540) T ss_pred ce---------eEEecccchhHHHHHHHHHHHHHHHHHHhCCCHHHcCcccCCCCCcccHHHHHHHHHHHHHHHHHHHHH Confidence 33 2333333333445777888899999987755442222211 222332332 33445667788999999 Q ss_pred HHHHHHHHHHHHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHH-cCCCCCCCccccc Q lcl|NC_017974. 386 EALNENLLPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEV-LGIDEKIEEDISQ 464 (507) Q Consensus 386 ~~ln~~li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~-~Gip~~~~~~~~~ 464 (507) ..||+.|++. ++ .+. +|+|+...-.+ ...++.+.++++.|++.+ +.+|+. +|+++-.+.- .. T Consensus 350 ~~ln~~L~~~---~~----~~~---~i~f~~~~ll~-~D~~~~~~~lv~~G~lT~-----NE~Re~L~g~e~gdd~~-l~ 412 (540) T protein:vir:41 350 SVLTDFIQLK---LD----PGA---RFVFNEEILME-SEFVHNYALLVQCGVLTP-----SEVREKLFGLDGGPDMF-MV 412 (540) T ss_pred HHHHHhhhhc---cC----Cce---EEEecchhhcc-hHHHHHHHHHHhCCCCCH-----HHHHHHhCcCcCCCccc-cc Confidence 9999866542 21 111 46676554433 234567788999998776 467875 4775422211 11 Q ss_pred chhhhcC---CC----cCCCCCCcccccCcccccccc-----cCCCCchhhhccC Q lcl|NC_017974. 465 EELMKLL---GE----DTSRAGDGMTKGSSGNGTGKI-----SSTRDNSAANLDN 507 (507) Q Consensus 465 ~~~~~~~---~~----~~~~~~~~~~~~~~~~~~~~~-----~~~~d~~a~~~~~ 507 (507) +...... .+ ...++.+..+. .+..+++.. ....+.+-++++. T Consensus 413 p~n~~~~~~~~~~~~~~~~~~~~~~k~-~~~~~~~~~~~~~~~~~~~~~~~~~~~ 466 (540) T protein:vir:41 413 PSSIGKSAMKRQKRNYEKNQINEIKRT-YAKYKPRIQEIISSESPLEDKKKKIDE 466 (540) T ss_pred ccccccccccccccccCCCCccccccc-cchhcccccCccccccccccccccccc Confidence 1111000 00 00000000000 000000000 0011111122222 No 59 >protein:vir:97060 Length: 432 # NCBI annotation: putative head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453563;genbank:gi:84662598;genbank:GeneID:5142475 Probab=99.58 E-value=6.1e-14 Score=93.02 Aligned_cols=414 Identities=12% Similarity=0.049 Sum_probs=197.1 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTFV 80 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~av 80 (507) |-.--+.+..-..+.|........ .+ +.......-....... +..+ .-+..++-+.|.+|+..+-..| T Consensus 7 ~g~~~~~~~~~~~~~~~~~~~~~~---~~--~~~~~~~~~~~~~~~~------g~~v-~~~~a~~~~aV~~~v~~Ia~~i 74 (432) T protein:vir:97 7 LGLLGQLKAMFVPPDPVDIGGGQT---FT--PVNATARDLGIIISDT------GAAV-NADAIMRLDAVAACVKLVSQAV 74 (432) T ss_pred CchhhhhHhhcCCccccccccccc---cc--cCchhhhhhccccccc------Cccc-chHhhhcchHHHHHHHHHHHhh Confidence 222222111111122211100000 00 0000000000000000 0011 0133557789999999999999 Q ss_pred hhCCceeeeCCCCChhHHHHHHHHHHHHHhcc--CCCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeeeeccc Q lcl|NC_017974. 81 EKAFKDFKVIPGESEESKKAAKFVEYCLKNME--GQTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAF 157 (507) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~--~~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~ 157 (507) .++++.+.....+. ..+....=+..+|..-+ ..++.++++.++ +.+++|.+.+++++.. |+ +..|.+ T Consensus 75 a~lp~~~y~~~~~g-~~~~~~~pl~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~~-------g~--~~~L~~ 144 (432) T protein:vir:97 75 AAMPLMMYMRTPDG-RKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVTD-------GR--IESLQY 144 (432) T ss_pred ccCceEEEEecCCC-cccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEecC-------Cc--EEEEEE Confidence 99877653322111 10111111333443212 234566777665 6788999999988741 22 344555 Q ss_pred cCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHHH Q lcl|NC_017974. 158 RPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCY 237 (507) Q Consensus 158 r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~ 237 (507) .++..+. +..+.+|+.. +.+ ....+....+|.+.++++++.+. +..+|.|.+..+. T Consensus 145 l~p~~v~----v~~~~~g~~~----------y~~---------~~~~g~~~~~~~~~iih~r~~~~-dg~~G~spi~~~~ 200 (432) T protein:vir:97 145 LANDRLT----ITTDTKGNTA----------YRY---------RRTDGQMIDIPRQQIWKIMGYSL-DGENGLSAIRYGA 200 (432) T ss_pred EcCcceE----EEEcCCCcEE----------EEE---------EecCceEEEEccccEEEecCcCC-CCcccccHHHHHH Confidence 5554332 1223343211 000 01122334678888776665544 4478999999998 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCccccccccc Q lcl|NC_017974. 238 RAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQY 317 (507) Q Consensus 238 ~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~ 317 (507) ..+-.-....++-+.|...-+.|--++++|-. -+++.... +++-..-.. ++...++++.|+++.- T Consensus 201 ~~i~~~~a~~~~~~~~f~ng~~~~gil~~~~~-------l~~e~~~~-~~~~~~~~~----nag~~~vl~~g~~~~~--- 265 (432) T protein:vir:97 201 QIFGTAIAAEAQAARAFRNGQLQSVYYQIDRF-------LTDDQYDS-FSKKVSGSV----EAGRAPLLEGGMDVKS--- 265 (432) T ss_pred HHHHHHHHHHHHHHHHHhccCCcceeEecCCC-------CCHHHHHH-HHHHHhhhh----cCCCceecCCCceEEE--- Confidence 77766666667777777765665555555422 22333222 222221111 1224578888876432 Q ss_pred ceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHH----HHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 318 SMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLS----ESKQTIHTQFVQRVNEIILEALNENLL 393 (507) Q Consensus 318 ~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~----~vh~~v~~~~~~aD~~~i~~~ln~~li 393 (507) .+-+.....+.+..++...+|++++--..--.+....|+++.+ +........-+.-.++.|+..||+.|+ T Consensus 266 ------l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~s~~e~~~~~f~~~tl~P~~~~ie~~ln~kLl 339 (432) T protein:vir:97 266 ------LGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQLGFLTMTLSPWLRRIEQSIALNLL 339 (432) T ss_pred ------ccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCcccccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhcc Confidence 2222223346677788899999976554322232223333222 112223334555566777777776554 Q ss_pred HHHHHhCCCcCCccccceEEec--CcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhhcC Q lcl|NC_017974. 394 PQLLALNDMRLPETEMPYVKAG--EIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMKLL 471 (507) Q Consensus 394 ~~l~~lN~~~~~~~~~P~~~~~--~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~~~ 471 (507) .+--. ... +|+|+ ..-..|.++.++++.+++..|++.+ +++|+.+|+|+..+++.......... T Consensus 340 ~~~e~-------~~~--~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~-----NE~R~~~glpp~~g~~~~~~~~~~~~ 405 (432) T protein:vir:97 340 TPAER-------RRY--FADFDTSALLRADSAARSSYYSQLVNNGLMTR-----DEAREIEGLPKLGGNAAVLTVQSAMV 405 (432) T ss_pred Ccccc-------Cce--EEEeechhhhccCHHHHHHHHHHHHhCCCCCH-----HHHHHHhCCCCCCCCcceEeeccccc Confidence 43110 111 34554 4456789999999999999998776 57999999987654443221111111 Q ss_pred C-Cc--CCCCCCcccccCcccccccccCCCCchhh Q lcl|NC_017974. 472 G-ED--TSRAGDGMTKGSSGNGTGKISSTRDNSAA 503 (507) Q Consensus 472 ~-~~--~~~~~~~~~~~~~~~~~~~~~~~~d~~a~ 503 (507) | +. ..+..++ ....++.. .+..++ T Consensus 406 pl~~~~~~~~~~~--~~~~~~~~------~~~~~~ 432 (432) T protein:vir:97 406 PLDSIGLQASPEP--ASGLGNQQ------QDKVSK 432 (432) T ss_pred chhhhcccCCCCC--CCCCCCcc------cccccC Confidence 1 00 0000000 00111100 001111 No 60 >protein:vir:93943 Length: 409 # NCBI annotation: ORF010 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239936;genbank:gi:66395598;genbank:GeneID:5131009 Probab=99.58 E-value=2.3e-14 Score=95.34 Aligned_cols=405 Identities=9% Similarity=0.003 Sum_probs=199.0 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTFV 80 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~av 80 (507) |++.+=..+....-...-....+. +....+++...... .+ ..+..++-+.|.+|+..+-..| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~----------~v-~~~~~~~~~~V~~ci~~Ia~~i 62 (409) T protein:vir:93 1 MAKENIVTRIKKKLIDNWIDQSTS-------KLYDFSPWKNRSFW----------GV-INNTLETNETIFSAITKLSNSM 62 (409) T ss_pred CCccchhhhhhhhhhhhhhccccc-------cccccccccCcccc----------cc-chhhhhccHHHHHHHHHHHHhh Confidence 765544333211111000000000 00000111111000 00 1233456778999999999999 Q ss_pred hhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeeeeccccC Q lcl|NC_017974. 81 EKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAFRP 159 (507) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~r~ 159 (507) .++++...-.. ...+..+..++..- --...+..++++.++ +.+.+|-+..++++... |. +..|.+.| T Consensus 63 a~lp~~~~~~~--~~~~~~~~~lL~~~--PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~------G~--~~~L~~l~ 130 (409) T protein:vir:93 63 ASLPLKMYEDY--KVVNTEVSDLLTVS--PNNSLSSFDFINQIETIRNEKGNAYVLIERDIY------HQ--PSKLFLLN 130 (409) T ss_pred hhCceeEeecc--ccccchHHHHHhhh--cccCCCHHHHHHHHHHHHhhcCceEEEEEECCC------Cc--EEEEEEEc Confidence 98876543221 12222333333211 112234667777765 46779999998877532 22 34565655 Q ss_pred chhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHHHHH Q lcl|NC_017974. 160 QASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYRA 239 (507) Q Consensus 160 ~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~~ 239 (507) +.++. +..+.|++.+....+ ...+..+.+|.+.+|++++....+..+|.|.+..+.-. T Consensus 131 ~~~v~----~~~~~~~~~~~y~~~------------------~~~g~~~~~~~~eVih~r~~~~~~~~~G~s~i~~~~~~ 188 (409) T protein:vir:93 131 PDVVE----MLIENQSRELYYSIH------------------AATGNKLIVHNMDMLHFKHIVASNMVQGISPIDVLKNT 188 (409) T ss_pred CceeE----EEEeCCCcEEEEEEE------------------cCCceEEEEccccEEEeCCCCCCCccccccHHHHHHHH Confidence 54432 122333332221110 11123345777777777665556677899988877654 Q ss_pred HHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccccce Q lcl|NC_017974. 240 WREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYSM 319 (507) Q Consensus 240 ~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~~ 319 (507) .-......+ +. ...++.+--. +.+.+..-+++......+...+. ..+....++++.|+++. T Consensus 189 i~~~~~~~~-~~--~~~~~~~~~~------i~~~~~~l~~e~~~~~~~~~~~~----~~~~g~~~vl~~g~~~~------ 249 (409) T protein:vir:93 189 TDFDNAVRT-FN--LTEMQKPDSF------MLKYGSNVGKEKRQQVLEDFKQY----YEENGGILFQEPGVEIE------ 249 (409) T ss_pred HHHHHHHHH-HH--HHhcCCCCce------EEecCCCCCHHHHHHHHHHHHHH----hhcCCCeeecCCCceEE------ Confidence 444333322 22 2233332111 11222233344433333333332 22344567778887543 Q ss_pred eeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 320 TLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQ-TIHTQFVQRVNEIILEALNENLLPQLLA 398 (507) Q Consensus 320 el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~-~v~~~~~~aD~~~i~~~ln~~li~~l~~ 398 (507) ..+-+.....|.+..++...+|++++.-..--.+...+++++-.+.+. .....-+.-.++.|++.||+.|+...- T Consensus 250 ---~l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~l~~~Ll~~~~- 325 (409) T protein:vir:93 250 ---PLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNRFYLQHTLLPIVKQYEEEFNRKLLTKTD- 325 (409) T ss_pred ---EcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCccc- Confidence 222222233466666788888999887665334433455666554444 344556777888888888877654421 Q ss_pred hCCCcCCccccceEEec--CcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhhcCCCcCC Q lcl|NC_017974. 399 LNDMRLPETEMPYVKAG--EIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMKLLGEDTS 476 (507) Q Consensus 399 lN~~~~~~~~~P~~~~~--~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~~~~~~~~ 476 (507) ..... +|.|+ .....|.++.++++.++++.|+..+ +.+|+.+|+|+-+.+|..-.... ..+-+.. T Consensus 326 -----~~~~~--~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~-----NE~R~~~g~~p~~ggD~~~~~~n-~~~~~~~ 392 (409) T protein:vir:93 326 -----REKNR--YFKFNVKSYLRADSATQAEVYFKAVRSGYYTI-----NDIREWEDLPPVEGGDKPLISGD-LYPIDTP 392 (409) T ss_pred -----ccCcc--eEEeechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCcCeeeeccc-ccccccc Confidence 01111 35554 4446789999999999999998776 57999999987655543221111 1111111 Q ss_pred CCCCcccccCcccccccc Q lcl|NC_017974. 477 RAGDGMTKGSSGNGTGKI 494 (507) Q Consensus 477 ~~~~~~~~~~~~~~~~~~ 494 (507) .......++. ++...+. T Consensus 393 ~~~~~~~~gG-~~n~~e~ 409 (409) T protein:vir:93 393 LELRKSLKGG-DKNVNES 409 (409) T ss_pred hhhcccccCC-CCCcCCC Confidence 1000001110 0111111 No 61 >protein:vir:94426 Length: 409 # NCBI annotation: ORF009 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240003;genbank:gi:66395665;genbank:GeneID:5133086 Probab=99.58 E-value=4.7e-14 Score=93.67 Aligned_cols=404 Identities=10% Similarity=0.007 Sum_probs=198.2 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTFV 80 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~av 80 (507) |+|.+=+.+-...--......... +....+.+...... .+ ..+..++-+.|.+|+..+-..| T Consensus 1 ~~~~~~~~~~k~~~~~~~~~~~~~-------~~~~~~~~~~~~~~----------~v-~~~~a~~~~~v~~~i~~Ia~~i 62 (409) T protein:vir:94 1 MAKENIVTRIKKKLIDNWIDQSAS-------KLYDFSPWKNKSFW----------GV-INNTLETNETIFSAITKLSNSM 62 (409) T ss_pred CcccccchhhhhHHhhhhhcCCcc-------cccccccccCcccc----------cc-chhhhhccHHHHHHHHHHHHhh Confidence 776654444322110000000000 00001111111000 00 1123446778999999999999 Q ss_pred hhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeeeeccccC Q lcl|NC_017974. 81 EKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAFRP 159 (507) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~r~ 159 (507) .++++...-.. ...+..+..++..- --...+..++++.++ +.+++|-+..++++... |. +..|.+-+ T Consensus 63 a~lp~~~~~~~--~~~~~~~~~lL~~~--PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~------G~--~~~L~~l~ 130 (409) T protein:vir:94 63 ASLPLKMYEDY--KVVNTEVSDLLTVS--PNNSLSSFDFINQIETIRNEKGNAYVLIERDIY------HQ--PSKLFLLN 130 (409) T ss_pred hhCceeEeecc--cccchhHHHHHhhh--cccCCCHHHHHHHHHHHHhhcCCeEEEEEECCC------Cc--EEEEEEEc Confidence 98876543221 12222333333211 111234567777755 56789999988876532 22 34555555 Q ss_pred chhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHHHHH Q lcl|NC_017974. 160 QASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYRA 239 (507) Q Consensus 160 ~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~~ 239 (507) +.++. +..+.|++.+....+ ...+..+.+|.+.++++++....+..+|.|.+..+... T Consensus 131 ~~~v~----v~~~~~~~~~~y~~~------------------~~~g~~~~~~~~dvih~r~~~~~~~~~G~s~l~~~~~~ 188 (409) T protein:vir:94 131 PDVVE----MLIENQSRELYYSIH------------------AATGNKLIVHNMDMLHFKHIVASNMVQGISPIDVLKNT 188 (409) T ss_pred CceeE----EEEeCCCcEEEEEEE------------------cCCceEEEEccccEEEecCCCCCCccccccHHHHHHHH Confidence 54332 122333332221110 11123445777777777665555677899988877655 Q ss_pred HHHHHHHHHHHHHHHHHhcCCce-EEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccccc Q lcl|NC_017974. 240 WREKILIENLEVVGATKDMGGVI-ELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYS 318 (507) Q Consensus 240 ~~fK~~~~~~w~~f~Er~g~p~~-v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~ 318 (507) .-...... .|.. ..++.+-. +... +..-+++......+...+.. .+....++++.|+++. T Consensus 189 i~~~~~~~-~~~~--~~~~~~~~~i~~~-------~~~l~~e~~~~~~~~~~~~~----~~~g~~~vl~~g~~~~----- 249 (409) T protein:vir:94 189 TDFDNAVR-TFNL--TEMQKPDSFMLKY-------GSNVGKEKRQQVLEDFKQYY----EENGGILFQEPGVEIE----- 249 (409) T ss_pred HHHHHHHH-HHHH--HhcCCCCeeEEec-------CCCCCHHHHHHHHHHHHHHh----hcCCCeeecCCCceEE----- Confidence 44443333 2322 22333211 2222 22233334333333333322 2344567788887543 Q ss_pred eeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 319 MTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQ-TIHTQFVQRVNEIILEALNENLLPQLL 397 (507) Q Consensus 319 ~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~-~v~~~~~~aD~~~i~~~ln~~li~~l~ 397 (507) ..+.+....+|.+..++...+|++++--..--.+....++++-.+-+. .....-+.--++.|++.||+.|++..= T Consensus 250 ----~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~~~ 325 (409) T protein:vir:94 250 ----PLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNRFYLQHTLLPIVKQYEEEFNRKLLTKTD 325 (409) T ss_pred ----EcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCccc Confidence 222233333466677788899999876654333333345555444333 333445666778888888876654320 Q ss_pred HhCCCcCCccccceEEec--CcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhhcCCCcC Q lcl|NC_017974. 398 ALNDMRLPETEMPYVKAG--EIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMKLLGEDT 475 (507) Q Consensus 398 ~lN~~~~~~~~~P~~~~~--~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~~~~~~~ 475 (507) ..... .|+|+ ..-..|.++.++++.++++.|++.+ +++|+.+|+|+-+.+|....... ..|-+. T Consensus 326 ------~~~~~--~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~-----NE~R~~~g~~p~~ggD~~~~~~n-~~~~~~ 391 (409) T protein:vir:94 326 ------REKNR--YFKFNVKSYLRADSATQAEVYFKAVRSGYYTI-----NDIREWEDLPPVEGGDKPLISGD-LYPIDT 391 (409) T ss_pred ------ccCcc--eEEeechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCcCeEeeccc-cccccc Confidence 01112 35554 4446789999999999999999876 57999999987655553221111 111111 Q ss_pred CCCCCcccccCcccccccc Q lcl|NC_017974. 476 SRAGDGMTKGSSGNGTGKI 494 (507) Q Consensus 476 ~~~~~~~~~~~~~~~~~~~ 494 (507) ....+...++.. +...+. T Consensus 392 ~~~~~~~~kGG~-~n~~e~ 409 (409) T protein:vir:94 392 PLELRKSLKGGD-KNVNES 409 (409) T ss_pred chhhcccccCCC-CCcCCC Confidence 111100011111 111110 No 62 >protein:vir:96980 Length: 409 # NCBI annotation: ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239857;genbank:gi:66395516;genbank:GeneID:5133013 Probab=99.56 E-value=1e-13 Score=91.81 Aligned_cols=396 Identities=10% Similarity=0.031 Sum_probs=195.1 Q ss_pred Ccccccccccc------cCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSV------KVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLD 74 (507) Q Consensus 1 ~~~k~~~~~~~------~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~ 74 (507) |+|-+=.+|.. -..++.+ +...-+.+...... .+ ..+..++-+.|.+|+. T Consensus 1 ~~~~~~~~~~k~~~~~~~~~~~~~-------------~~~~~~~~~~~~~~----------~v-~~~~a~~~~~V~~ci~ 56 (409) T protein:vir:96 1 MAKENIVTRIKKKLIDNWIDQSAS-------------KLYDFSPWKNKSFW----------GV-INNTLETNETIFSAIT 56 (409) T ss_pred CccccchhhhhhHHhhhhhccccc-------------cccccccccCcccc----------cc-chhhHhhhHHHHHHHH Confidence 54322111110 0111110 00000011110000 00 0122446678999999 Q ss_pred HHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccC--CCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCccee Q lcl|NC_017974. 75 MLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEG--QTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYK 151 (507) Q Consensus 75 ~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~--~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~ 151 (507) .+-..|.++++...-.. ...+..+.++ |..-++ .+..++++.++ +.+.+|-+..++++... |. T Consensus 57 ~ia~~ia~lp~~~~~~~--~~~~~~l~~l----L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~------G~-- 122 (409) T protein:vir:96 57 KLSNSMASLPLKMYEDY--KVVNTEVSDL----LTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIY------HQ-- 122 (409) T ss_pred HHHHhhhhCceEEeecc--cccchhHHHH----HhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCC------Cc-- Confidence 99999998876543221 1222223333 332122 24556666655 56789999999987543 22 Q ss_pred eeeccccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccch Q lcl|NC_017974. 152 VKNLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVS 231 (507) Q Consensus 152 ~~~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~g 231 (507) +..|.+-++..+. +..+.+++.+... + ....+....+|...+|++++....+..+|.| T Consensus 123 ~~~L~~l~~~~v~----v~~~~~~~~~~y~---------~---------~~~~g~~~~~~~~evih~r~~~~~~~~~G~s 180 (409) T protein:vir:96 123 PSKLFLLNPDVVE----MLIENQSRELYYS---------I---------HAATGNKLIVHNMDMLHFKHIVASNMVQGIS 180 (409) T ss_pred EEEEEEEcCceeE----EEEeCCCcEEEEE---------E---------EcCCceEEEEccccEEEeCCCCCCCcccccc Confidence 3455555554332 1223333222111 0 0112234457777777777655566778999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCce-EEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcc Q lcl|NC_017974. 232 PLVGCYRAWREKILIENLEVVGATKDMGGVI-ELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDT 310 (507) Q Consensus 232 ll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~-v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~ 310 (507) .+..+.-..-.-....+++ ...++.+-. ++.. +..-++++.....++..+.. .+....++++.|+ T Consensus 181 ~l~~~~~~i~~~~~~~~~~---~~~~~~~~~~i~~~-------~~~l~~e~~~~~~~~~~~~~----~n~g~~~vl~~g~ 246 (409) T protein:vir:96 181 PIDVLKNTTDFDNAVRTFN---LTEMQKPDSFMLKY-------GSNVSTEKRQQVLEDFKQYY----EENGGILFQEPGV 246 (409) T ss_pred HHHHHHHHHHHHHHHHHHH---HHhcCCCceeEEec-------CCCCCHHHHHHHHHHHHHHh----hcCCCeeecCCCc Confidence 9887764433333333332 223333321 2222 22233444444444443332 2344567888887 Q ss_pred cccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHH-HHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 311 KDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQ-TIHTQFVQRVNEIILEALN 389 (507) Q Consensus 311 ~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~-~v~~~~~~aD~~~i~~~ln 389 (507) ++.- .+-+.....|.+..++..++|++++--..--.+...+++++-.+.+. .....-+.-.++.|++.|| T Consensus 247 ~~~~---------l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~s~~e~~~~~f~~~~l~P~~~~ie~~l~ 317 (409) T protein:vir:96 247 EIEP---------LPKKYVSEDIVASENLTRERVANVFQLPSIFLNARSNTNFAKNEELNRFYLQHTLLPIVKQYEEEFN 317 (409) T ss_pred eEEE---------cCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 6432 22222333466677788889999876654333333345566544444 3335556778888888888 Q ss_pred HHHHHHHHHhCCCcCCccccceEEec--CcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchh Q lcl|NC_017974. 390 ENLLPQLLALNDMRLPETEMPYVKAG--EIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEEL 467 (507) Q Consensus 390 ~~li~~l~~lN~~~~~~~~~P~~~~~--~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~ 467 (507) +.|++..- ..... +|+|+ ..-..|.+..++++.++++.|++.+ +.+|+.+|+|+-+.+|...... T Consensus 318 ~~Ll~~~~------~~~g~--~i~fd~~~ll~~d~~~~~e~~~~~~~~G~~T~-----NE~R~~~g~~pi~ggD~~~~~~ 384 (409) T protein:vir:96 318 RKLLTKTD------REKNR--YFKFNVKSYLRADSATQAEVYFKAVRSGYYTI-----NDIREWEDLPPVEGGDKPLISG 384 (409) T ss_pred hhcCCccc------ccCcc--eEEeechhhhccCHHHHHHHHHHHHhCCCCCH-----HHHHHHhCCCCCCCcceeeecc Confidence 76654321 01112 35554 4446789999999999999998776 5799999998765555422111 Q ss_pred hhcCCCcCCCCCCcccccCcccccccccCC Q lcl|NC_017974. 468 MKLLGEDTSRAGDGMTKGSSGNGTGKISST 497 (507) Q Consensus 468 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 497 (507) . ..|-+.........++...+ ..+ . T Consensus 385 n-~~~~~~~~~~~~~~~gG~~n-~~e---~ 409 (409) T protein:vir:96 385 D-LYPIDTPLELRKSLKGGDKN-VNE---S 409 (409) T ss_pred c-ccccccchhhcccccCCCCC-cCC---C Confidence 1 01110000000000111000 000 0 No 63 >protein:vir:9359 Length: 348 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803337;genbank:gi:29028648;genbank:GeneID:1258089 Probab=99.55 E-value=3.7e-14 Score=94.23 Aligned_cols=342 Identities=10% Similarity=-0.016 Sum_probs=180.1 Q ss_pred HhhCCceeeeCCCCChhHHHHHHHHHHHHHhccC--CCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeeeecc Q lcl|NC_017974. 80 VEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEG--QTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLA 156 (507) Q Consensus 80 v~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~--~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~ 156 (507) |.++++.+.- +....+..+ ..+|..-++ .++.++++.++ +.+.+|-+++.+++... |+ +..|. T Consensus 1 ia~lp~~~~~--~~~~~~~~l----~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~------G~--~~~L~ 66 (348) T protein:vir:93 1 MASLPLKMYE--DYKVVNTEV----SDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIY------HQ--PSKLF 66 (348) T ss_pred CcccceEeEe--cCcCcccHH----HHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCC------Cc--EEEEE Confidence 7777665432 222222223 344442222 34567777766 57789999998887542 33 45666 Q ss_pred ccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHH Q lcl|NC_017974. 157 FRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGC 236 (507) Q Consensus 157 ~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~ 236 (507) +.++.++. +..+.+++.+.... ....+..+.+|.+.++++++....+..+|.|.+..+ T Consensus 67 ~l~~~~v~----~~~~~~~~~~~y~~------------------~~~~g~~~~~~~~eiih~r~~~~~~~~~G~s~~~~~ 124 (348) T protein:vir:93 67 LLNPDVVE----MLIENQSRELYYSI------------------HAATGNKLIVHNMDMLHFKHIVASNMVQGISPIDVL 124 (348) T ss_pred EEcCCceE----EEEeCCCcEEEEEE------------------EcCCCeEEEEccccEEEecCCCCCCceeeccHHHHH Confidence 66665443 22233333221110 111223445777777777665556778899988877 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccc Q lcl|NC_017974. 237 YRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQ 316 (507) Q Consensus 237 ~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~ 316 (507) .-..-.-.. ..-|. + ..++.+-.. +.+.+..-+++...+..++..+.. .++...+++|.|+++. T Consensus 125 ~~~i~~~~~-~~~~~-~-~~~~~~~~~------i~~~~~~l~~e~~~~~~~~~~~~~----~n~~~~~vl~~g~~~~--- 188 (348) T protein:vir:93 125 KNTTDFDNA-VRTFN-L-TEMQKPDSF------MLKYGSNVSTEKRQQVLEDFKQYY----EENGGILFQEPGVEIE--- 188 (348) T ss_pred HHHHHHHHH-HHHHH-H-HhcCCCcee------EEecCCCCCHHHHHHHHHHHHHHh----hcCCCeeecCCCceEE--- Confidence 654433322 23333 2 223333111 112223333444444444444332 3444567788887543 Q ss_pred cceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 317 YSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQ-TIHTQFVQRVNEIILEALNENLLPQ 395 (507) Q Consensus 317 ~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~-~v~~~~~~aD~~~i~~~ln~~li~~ 395 (507) ..+-+....+|.+..++...+|++++.-...-.+...+++++..+.+. .....-+.-.++.|++.||+.|++. T Consensus 189 ------~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~~~~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~ 262 (348) T protein:vir:93 189 ------PLPKKYVSEDIVASENLTRERVANVFQLPSIFLNARSNTNFAKNEELNRFYLQHTLLPIVKQYEEEFNRKLLTK 262 (348) T ss_pred ------EcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCc Confidence 222233334577778889999999887655444444456677555444 3345567778888888888876554 Q ss_pred HHHhCCCcCCccccceEEec--CcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhhcCCC Q lcl|NC_017974. 396 LLALNDMRLPETEMPYVKAG--EIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMKLLGE 473 (507) Q Consensus 396 l~~lN~~~~~~~~~P~~~~~--~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~~~~~ 473 (507) . +.. .. -+|.|+ .....|.++.++++.+|++.|++.+ +.+|+.+|+|+-+.+|..--... ..|- T Consensus 263 ~-~~~-----~g--~~i~fd~~~l~~~d~~~~a~~~~~~~~~G~~T~-----NE~R~~~g~~p~~ggD~~~~~~n-~~~~ 328 (348) T protein:vir:93 263 T-DRE-----KN--RYFKFNVKSYLRADSATQAEVYFKAVRSGYYTI-----NDIREWEDLPPVEGGDKPLISGD-LYPI 328 (348) T ss_pred c-ccc-----Cc--ceEEeechhhhccCHHHHHHHHHHHHhCCCCCH-----HHHHHHhCCCCCCCcCeEeeccc-cccc Confidence 2 111 11 135554 4446789999999999999998876 57999999987655543211111 1111 Q ss_pred cCCCCCCcccccCcccccccc Q lcl|NC_017974. 474 DTSRAGDGMTKGSSGNGTGKI 494 (507) Q Consensus 474 ~~~~~~~~~~~~~~~~~~~~~ 494 (507) +.....+...++... .+.+. T Consensus 329 ~~~~~~~~~~~gg~~-n~~~~ 348 (348) T protein:vir:93 329 DTPLELRKSLKGGDK-NVNES 348 (348) T ss_pred ccchhhcccccCCCC-CcCCC Confidence 111000000111110 00000 No 64 >protein:vir:100691 Length: 535 # NCBI annotation: hypothetical protein # Family: family:all:2446 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164747;genbank:gi:56693160;genbank:GeneID:3197324 Probab=99.54 E-value=1e-12 Score=86.27 Aligned_cols=440 Identities=8% Similarity=0.013 Sum_probs=194.7 Q ss_pred Cccccccccc-cc-Cccccccccccccccc--cccCchhh----hHH----------------HHhhhhcccchhccchh Q lcl|NC_017974. 1 MSRKRNRNRS-VK-VAKATSEQINVSRMRM--SEQGTFAL----AKV----------------QVDSERMKAEEIRWPNL 56 (507) Q Consensus 1 ~~~k~~~~~~-~~-~~~~~~~~~~~~~~~~--~~~g~~~~----~~~----------------~~~~~~e~~~~lr~~~~ 56 (507) |+.||+..-- +. ..+- .++.+..+.+- .+++ .|+ +.. ..+..+...+.++ T Consensus 13 ~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~-~~~~~~~~~~~g~~~~~~~~~~~~~~~l~~~~~~~~~~~---- 86 (535) T protein:vir:10 13 LSNKKSTSYIELGDYDKD-IVNKAIRPGRASARDTV-DGIDIADGNVAGQYSVASISDVLSTKKLLKAYADNDIVQ---- 86 (535) T ss_pred hhhhhhhhhHHHhhhhHH-HHHhhhhhhhhhhhccc-cccccccCCcccccccCccccccCHHHHHHHhccChhHH---- Confidence 7766654320 00 0000 00000000000 0000 000 000 0111111111111 Q ss_pred HHHHHHHhhChHHHHHHHHHHHHHhhCCceeeeCCC-CChhHHHHH--HHHHHHHHhccCC------CHHHHHHHHHH-H Q lcl|NC_017974. 57 IGTVESMKQDATVATGLDMLYTFVEKAFKDFKVIPG-ESEESKKAA--KFVEYCLKNMEGQ------TLRQFARDAAT-F 126 (507) Q Consensus 57 ~~~y~~m~~D~~v~s~l~~Rk~av~~~~~~~~~~~~-~~~~~~~~a--e~v~~~l~~~~~~------~~~~~l~~~l~-a 126 (507) .+.+-+..+..+.+|+......+ +.|++.+... .+....+++ .-+...|...+.. .|.+++..++. . T Consensus 87 -~~i~t~~~~va~~~~i~~~s~~~--~~~~i~l~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~~~~~~~~~~~~~lv~d~ 163 (535) T protein:vir:10 87 -AIIRTRTNQVLTYSNPSRYNRNG--VGFKVELKDATKVMSKAQIKRAHEIEDFIYNTGSEYYEWRDTFPRLLTKIINDM 163 (535) T ss_pred -HHHHHHHHHHHHHHHHHHHhccc--CcceeEEEeccCCCcchhhhhhhHHHHHHHhCCCCCCChhHHHHHHHHHHHHHH Confidence 11222233444445555444444 3445544321 122222222 2244555433221 24456776654 4 Q ss_pred Hhhc-ceeeeeeeeccCccccCcceeeeeccccCchhcccccceeecCCCccccccccccccccccccccccccccccCC Q lcl|NC_017974. 127 NEYG-LSVVEKVYTQVTVGEYIGKYKVKNLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRM 205 (507) Q Consensus 127 ~~~G-fs~~Ei~w~~~~~~~~~g~~~~~~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~ 205 (507) +.+| .+..++++... |+ +..|.+.++.++.. ..+.++... ...++ ...... T Consensus 164 l~~~g~ay~~i~r~~~------G~--~~~L~~l~p~~V~v----~~d~~~~~~------~~~~~----------~~~~~~ 215 (535) T protein:vir:10 164 YVQDQINIERIFKNDS------NE--LDHFNAVDASKVVI----SYSPRSKDQ------PRKFE----------QFVSET 215 (535) T ss_pred HhhCCceEEEEEECCC------Cc--EEEEEEeCCceeEE----EEcCccccC------ceEEE----------EEecCc Confidence 5555 56666655432 22 44566666655431 112222100 00011 111222 Q ss_pred CccccccccEEEEeeCCc---cCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHH Q lcl|NC_017974. 206 SDVIIPINRVMLMNTGGS---SSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPE 282 (507) Q Consensus 206 ~~~~iP~~k~i~~~~~~~---~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~ 282 (507) ....++.+.+|++++... .+.+||.+.+..+.-..-.-....++-..|...-+.|--++++|-.. ....+++. T Consensus 216 ~~~~~~~~eiih~~~~~~~~~~~~~~G~Spi~~~~~~i~~~~aa~~~~~~~f~ng~~p~giL~~~~~~---~~~ls~e~- 291 (535) T protein:vir:10 216 KSVKFSERNLTFINYWNLSDTDRRGYGYSPVEASIPLIRAIYDTEQFNARFFSQGGTTRGILVIDQDG---DAQANQMM- 291 (535) T ss_pred eeEEECcccEEEEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEecCCC---CcccCHHH- Confidence 345677888887776543 33578999999998887777778888888888766666566655321 11112222 Q ss_pred HHHHHHHHHHHHHhhcCC-cee-EeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccch Q lcl|NC_017974. 283 AEMVRGLMSDAANAHSGE-QSF-FMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKG 360 (507) Q Consensus 283 ~~~l~a~~~~~~n~~~G~-~a~-~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~ 360 (507) .+++.+...+...|. .++ ..|+.+..+++. ..+.+..-..|.+..++..++|++++.-...-.+.... T Consensus 292 ---~e~lk~~~~~~~~G~~nag~~~vl~~~g~~~~-------~l~~~~~D~qfle~~~~~~~eIa~afgVPp~~lG~~~~ 361 (535) T protein:vir:10 292 ---LAGIRRQWTSQGSGLGGAWKIPILAAKDAKFV-------NMTQNSRDMEFDKFLNFMIYDTAAIFQMQPEEINFPNN 361 (535) T ss_pred ---HHHHHHHHHHHhcCcccccccccccCCCceEE-------ecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhccccC Confidence 233333333333342 223 234443333332 23333333346677778899999987554322222222 Q ss_pred hhhh------------HHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCccccceEEecCcCchhHHHHHH Q lcl|NC_017974. 361 GSYN------------LSESKQT-IHTQFVQRVNEIILEALNENLLPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSK 427 (507) Q Consensus 361 GS~A------------l~~vh~~-v~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~ 427 (507) ++|+ ..+.... .....+.-.++.|+..||+.|++.. . . . -+|.|+.....|.+..++ T Consensus 362 at~sn~~~~~~~~~~s~~E~~~~~~~~~~L~P~l~~ie~~ln~~Ll~~~-------~-~-~-~~f~f~~l~~~d~~~r~~ 431 (535) T protein:vir:10 362 GGSTGKSGTKSVNEGSTAKAKLESSKDKGLTPLLSFIEQVINDKIMRYV-------D-T-D-YRFSFTLGDAQDKLQEEQ 431 (535) T ss_pred cccccchhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhccccc-------C-C-e-EEEEeccccccCHHHHHH Confidence 2222 1122222 2234466778888888888776431 1 1 1 258888888888887777 Q ss_pred HHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccc----h--------hhhcCCCcCC------------CCCCccc Q lcl|NC_017974. 428 AIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQE----E--------LMKLLGEDTS------------RAGDGMT 483 (507) Q Consensus 428 ~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~----~--------~~~~~~~~~~------------~~~~~~~ 483 (507) +.+... .|.+. .+.+|+.+|+|+-+.+|.... . ..+..|+... +..+... T Consensus 432 ~~~~~~-~g~lT-----~NE~R~~~gl~piegGD~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~q~~~~~~~ 505 (535) T protein:vir:10 432 VWKLKL-ANGYF-----INEYRKDHGLKTVDGLDVPGFIGSAENFINATGFGQPNVPDSSDDSGSTLGERERQERIQHSK 505 (535) T ss_pred HHHHHH-cCCCC-----HHHHHHHhCCCCCCCccccccccchhhcccccccccccCCCCCCCccccCCccccCccccccc Confidence 766544 55543 368999999997655552100 0 0000011000 0000111 Q ss_pred ccCcccccccccCCC---CchhhhccC Q lcl|NC_017974. 484 KGSSGNGTGKISSTR---DNSAANLDN 507 (507) Q Consensus 484 ~~~~~~~~~~~~~~~---d~~a~~~~~ 507 (507) ....|.+..+.+..+ ..+.++.+| T Consensus 506 ~~~~g~~~~~~~~~~~~~~~~~~~~~~ 532 (535) T protein:vir:10 506 DYEKGKDDPKSPLPKPSESDDVSNNED 532 (535) T ss_pred ccccCCCCCCCCCCcCCCCCccccccc Confidence 111222222223222 223343344 No 65 >protein:vir:4156 Length: 542 # NCBI annotation: portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046965;genbank:gi:9630535;genbank:GeneID:1261709 Probab=99.53 E-value=1.1e-12 Score=86.16 Aligned_cols=443 Identities=12% Similarity=0.077 Sum_probs=199.4 Q ss_pred CcccccccccccCccccccc-cccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHh-hChHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQ-INVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMK-QDATVATGLDMLYT 78 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~-~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~-~D~~v~s~l~~Rk~ 78 (507) |=.+.=.=+....+++...+ ..... ++......+ .++ |-.+.-+..|. ..+.|.+|+..+.. T Consensus 1 ~~~~~~~i~s~~~~~~i~~~~~~s~~-----~~~~~~~~~-----~~p------p~~~~~la~l~~~n~~v~scI~~ia~ 64 (542) T protein:vir:41 1 MFNYHLSIRSLEKYKAIKREEVESQA-----LGETRFEEY-----VEP------KVNPLVLLSLLQVNPYHASACSIKAN 64 (542) T ss_pred Cccccccccccccchhhhhccccccc-----cccccCCcc-----ccC------CCCHHHHHHHHhhcHHHHHHHHHHHH Confidence 21111111111111111100 00000 010000000 011 11222334444 58999999999999 Q ss_pred HHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeeeeccc Q lcl|NC_017974. 79 FVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAF 157 (507) Q Consensus 79 av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~ 157 (507) .|.+++|.+. +... . .+...+.+ ...++.+++..++ +.+.+|.+.+|+++... |. +..|.+ T Consensus 65 ~IA~l~~~~~--~~~~----~---~l~~~lpN-~~~s~~~f~~~~v~~lll~Gnayi~i~rd~~------G~--~~~L~~ 126 (542) T protein:vir:41 65 DIIRTGYILE--GDDE----G---VVDEFIRA-CKPSFEYVLLRALEDLQVFNYCTLEVVRDDR------GD--PIRFEY 126 (542) T ss_pred HHhhCceeee--cccc----h---hhhhhcCC-CCCCHHHHHHHHHHHHhhcCCeEEEEEEcCC------Cc--EEEEEE Confidence 9988876653 2211 1 12222222 2245677877777 57889999999987643 22 344555 Q ss_pred cCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHHH Q lcl|NC_017974. 158 RPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCY 237 (507) Q Consensus 158 r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~ 237 (507) .++.++. ...|+.............+....................+|...+|++++....+.+||.|.+..+. T Consensus 127 l~~~~v~------v~~d~~~~~~~~~~~~~~~~~~y~~~~~~~~~~g~~~~~~~~~eIiHir~~~~~~~~~Glspi~~~~ 200 (542) T protein:vir:41 127 IPSHTIR------VHKDGSRYRQTWDGVNITHFKDYRYEGEINPETGEDQDSVGANELVFIHIPSPVCSYYGVPRYVSAA 200 (542) T ss_pred EcCcceE------EEEcCCeeEeeecCCcceeEEeecccccccccccccccccCcccEEEecCCCCCCCcccccHHHHHH Confidence 5554442 1222222222211111111111111111112222334456776766666666677899999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCC---CCchHHHHHHHHHHHHHHHHhhcCCceeEeec--Ccccc Q lcl|NC_017974. 238 RAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAA---MDQSSPEAEMVRGLMSDAANAHSGEQSFFMLP--SDTKD 312 (507) Q Consensus 238 ~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~---~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP--~g~~i 312 (507) -....-....++-..|...-+.|--++++|-.+.+... ..+.+....+.+.+.+............++++ .+.+- T Consensus 201 ~~i~~~~~~~~~~~~~f~Ng~~p~gIL~~~~~l~de~~~~~~~~~e~~~~lk~~~~~~~~g~~~n~gk~~vL~~~~~~~~ 280 (542) T protein:vir:41 201 PAILAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDELEEDPDGNPTGRTVIQALIEDNFKHLKEAPHTPLVFSIPGGDTV 280 (542) T ss_pred HHHHHHHHHHHHHHHHHhccCCccEEEEeCCccccccccccccCHHHHHHHHHHHHHHHhhhhcccCceeEeeccCCccc Confidence 88777777888888888776777667777654432211 11122222222222222111111111234443 11110 Q ss_pred cccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhh--hHHHHH-HHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 313 NAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSY--NLSESK-QTIHTQFVQRVNEIILEALN 389 (507) Q Consensus 313 ~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~--Al~~vh-~~v~~~~~~aD~~~i~~~ln 389 (507) .++++..+-+..-..|.+..++..++|++++.-..--.+...++|+ +-.+.+ .......+.-.++.|+..|| T Consensus 281 -----g~~~~pl~~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~t~n~sn~Eq~~~~f~~~tL~P~~~~ie~~ln 355 (542) T protein:vir:41 281 -----KVTFTPLNTSQKELSFREYAAEKKYDIAAAHMIDPYRLGIADTGPLGGNFAEVTRRTYYESVVRPQQNIISSILT 355 (542) T ss_pred -----ceeEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCcCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1233343333334456777788899999987543322222222222 322323 34456667788899999999 Q ss_pred HHHHHHHHHhCCCcCCccccceEEecCcC--chhHHHHHHHHHHHHhcCCccccHHHHHHHHHHc-CCCCCCCcccccch Q lcl|NC_017974. 390 ENLLPQLLALNDMRLPETEMPYVKAGEIV--DVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVL-GIDEKIEEDISQEE 466 (507) Q Consensus 390 ~~li~~l~~lN~~~~~~~~~P~~~~~~~~--~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~-Gip~~~~~~~~~~~ 466 (507) +.|++.. . .+. +|.|+... ..| ..+.+..+++.|++.+ +.+|+.+ |+++- ++....+. T Consensus 356 ~~L~~~~---~----~~~---~~~f~~~~ll~~d---~~~~~~~~v~~GilT~-----NE~Re~L~g~~pg-dd~~l~p~ 416 (542) T protein:vir:41 356 DFFQVKF---N----PKT---RFKFNDETLLESD---SVRNCALLVQSGVLTP-----AEARERLFGLDGG-PDIFMVPS 416 (542) T ss_pred hhccccc---C----Cce---EEEecchhhcchH---HHHHHHHHHhCCCCCH-----HHHHHhhCCCCCC-Cccccccc Confidence 8664432 1 111 35554333 333 3456778999999877 4678764 77642 22111111 Q ss_pred hhhcC---CCcCCCCCCc---ccccCccccccc----ccCCCCchhhh-ccC Q lcl|NC_017974. 467 LMKLL---GEDTSRAGDG---MTKGSSGNGTGK----ISSTRDNSAAN-LDN 507 (507) Q Consensus 467 ~~~~~---~~~~~~~~~~---~~~~~~~~~~~~----~~~~~d~~a~~-~~~ 507 (507) ..... ....+...+. ..+..+..++.. ++.....++.. .++ T Consensus 417 ~~~~~~~~~~~~n~~~~~~~~~~k~~~k~~~~~~~~~~~~~~~~~~~~~~~~ 468 (542) T protein:vir:41 417 KGAAKSVKRQERNYEKNQIREIRKIYAKYRPRFNEIISSKLSAEEKKKKIDE 468 (542) T ss_pred cccccccccCCcCCCCCchhhhhhcccccCccccccccccccchhhcccccc Confidence 11000 0000000000 000001111100 01111111111 111 No 66 >protein:vir:81218 Length: 423 # NCBI annotation: gp3, phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456733;genbank:gi:157168376;interpro:IPR006427;interpro:IPR006944;uniprot:Q9MBK2;genbank:GeneID:5580341 Probab=99.53 E-value=2.6e-13 Score=89.62 Aligned_cols=412 Identities=14% Similarity=0.058 Sum_probs=198.3 Q ss_pred Cccccccccccc-CccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHH-hhChHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVK-VAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESM-KQDATVATGLDMLYT 78 (507) Q Consensus 1 ~~~k~~~~~~~~-~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m-~~D~~v~s~l~~Rk~ 78 (507) |.=.++-.+++. ...+... . +. |...... .... ..+ +.+.+ +..+.|.+|+..+-. T Consensus 1 Mg~~~~~~~~~~~~~~~~~~----~---~~--~~~~~~~-~~~~-----------~~~-~~~~~~~~~~~v~~~i~~ia~ 58 (423) T protein:vir:81 1 MGFLQKLGLAPSVVATPEPI----E---LV--GPIFESL-KLST-----------KNM-TVEQIWEDQPHLRTVTTFIAR 58 (423) T ss_pred CchhHhhccccccccCcccc----c---cc--ccccccc-cccc-----------chh-hHHHHHHhhhHHHHHHHHHHH Confidence 322222211111 1111000 0 00 1000000 0000 011 12333 468999999999999 Q ss_pred HHhhCCceeee-C-CCCCh--hHHHHHHHHHHHHHhccC--CCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCccee Q lcl|NC_017974. 79 FVEKAFKDFKV-I-PGESE--ESKKAAKFVEYCLKNMEG--QTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYK 151 (507) Q Consensus 79 av~~~~~~~~~-~-~~~~~--~~~~~ae~v~~~l~~~~~--~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~ 151 (507) .|.++++.+.. . .+... .+..+. ..|++ ++ .++.++++.++ +.+++|-+...+....+ .++. T Consensus 59 ~ia~lp~~~~~~~~dg~~~~~~~~~~~----~ll~~-PN~~~t~~~f~~~~~~~l~l~Gna~~~i~rd~~----~~~~-- 127 (423) T protein:vir:81 59 NVASLQLQAFERVEDGGRERVREGHLA----RVCKL-ANSDMTMYDLLERTMFDLCLYDEFFWLLPGDLG----VDTP-- 127 (423) T ss_pred hHhhCceEEEEEecCCceeeeccchHH----HHhhc-CCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCC----cCcc-- Confidence 99999876421 1 22211 111222 33332 22 24567777765 56678977766543321 1121 Q ss_pred eeeccccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccch Q lcl|NC_017974. 152 VKNLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVS 231 (507) Q Consensus 152 ~~~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~g 231 (507) +-.|.|.+...+.. ....|+.+.. ++.+.. .....+..+.+|.+.+|+++.....+..+|.| T Consensus 128 ~~~l~p~~~~~v~~----~~~~~~~~~~--------~Y~~~~------~~~~~g~~~~~~~~evih~r~~~~~~~~~G~s 189 (423) T protein:vir:81 128 TLDIRPIPVSWVQR----RAYKDGWGSL--------DYIIIE------SGDNDGRSVKVPGERVIHRHGYNPKTMKRGKS 189 (423) T ss_pred eEEEeecccceeee----eeccCCCcce--------EEEEEE------ecCCCceEEEEcccceEEecCCCCCCcccccc Confidence 22244433332211 0111221110 000000 01122334567777766655444555568999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCccc Q lcl|NC_017974. 232 PLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTK 311 (507) Q Consensus 232 ll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~ 311 (507) .+..+.-..-.-....++-..|...-+.|--+++.+.... ...-+++......+.+.+....-..+....++++.|++ T Consensus 190 pi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gvi~~~~~~~--~~~l~~e~~~~~~~~~~~~~~~~~~n~g~~~vl~~g~~ 267 (423) T protein:vir:81 190 PVQSLRDILGEQIEAAIFRAQMWRNGPRPGMVIMRDPESK--AGKWDAESRTRFMANLRASFSPKSSDVGGTLLLEDGMK 267 (423) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCccc--CccCCHHHHHHHHHHHHHHhccccccCCcceecCCCce Confidence 9999998777777777888888876556655665442211 11112223333333333321111111223467788875 Q ss_pred ccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHH-HHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 312 DNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQ-TIHTQFVQRVNEIILEALNE 390 (507) Q Consensus 312 i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~-~v~~~~~~aD~~~i~~~ln~ 390 (507) +.- .+-+..-..|.+..++...+|++++.-..--.+...+++|+-.+-+. .....-+.-.++.|++.||+ T Consensus 268 ~~~---------l~~s~~d~q~~e~~~~~~~eIa~~fgVPp~~lg~~~~~t~sn~e~~~~~f~~~~L~P~~~~ie~~l~~ 338 (423) T protein:vir:81 268 AEN---------FHTTSKDEQTVETTKLSLQTVAQVYGINPTMVGQLDNANYSNVREFRKALYGDNLGSWIRIIQDVMNL 338 (423) T ss_pred EEe---------ccCChhhHHHHHHHHhhHHHHHHHhCCCHHHhcCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 432 22222223455666788889999776543223333345555434333 34445677788889999998 Q ss_pred HHHHHHHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHH-hcCCccccHHHHHHHHHHcCCCCCCCcccccchhhh Q lcl|NC_017974. 391 NLLPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIG-AVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMK 469 (507) Q Consensus 391 ~li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~-~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~ 469 (507) .|+++.-.-. ...+-+|.++.....|+++.++++.+++ +.|.+.+ +.+|+.+|+|+-+.+|.......- T Consensus 339 ~L~~~~~~~~-----~~~~~~fd~~~llr~d~~~r~~~~~~~l~~~G~~T~-----NE~R~~~gl~p~~gGD~~~~p~n~ 408 (423) T protein:vir:81 339 FLLPRVGIDN-----EKFYFEFNLEEKLRASFEEAAEIKRAAVGNVAWMTI-----NEVRAMDNLPSIDGGDDLARPLNT 408 (423) T ss_pred hhcCcccccc-----CccEEEecchhhhccCHHHHHHHHHHHHhCCCCcCH-----HHHHHHhCCCCCCCcceeeccccc Confidence 8766532111 1112233344556679999999999876 5787665 579999999876665543222111 Q ss_pred cCCCcCCCCCCcccccCccccc Q lcl|NC_017974. 470 LLGEDTSRAGDGMTKGSSGNGT 491 (507) Q Consensus 470 ~~~~~~~~~~~~~~~~~~~~~~ 491 (507) ...+...+.++. .+| T Consensus 409 ~~~~~~~~~~~~-------~~t 423 (423) T protein:vir:81 409 EFGDSEDAPGEE-------VET 423 (423) T ss_pred ccCccCCCCCCC-------CCC Confidence 111111111110 011 No 67 >protein:vir:4598 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058443;genbank:gi:9635169;genbank:GeneID:1262702 Probab=99.53 E-value=5.6e-13 Score=87.77 Aligned_cols=405 Identities=10% Similarity=0.005 Sum_probs=197.3 Q ss_pred Ccccccccc-cccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNR-SVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTF 79 (507) Q Consensus 1 ~~~k~~~~~-~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~a 79 (507) |-=-++..| ....+.. +. ....... ..........+. -+..++.+.|.+|+..+-.. T Consensus 1 Mg~f~~~~~r~~~~~~~---------------~~--~~~~~~~----~~~~~~~~~~~~-~~~al~~~~v~~cv~~Ia~~ 58 (416) T protein:vir:45 1 MGIFYKNEKRDLQYNED---------------DL--QMMVQTL----PGFQGTKLRQYK-DIEAIRHSDIFTAVMMIASD 58 (416) T ss_pred CCcccccccccccCCCc---------------ch--hHHHHHh----ccccccCccccc-hhhhhcchHHHHHHHHHHHh Confidence 433222111 1100000 00 0000000 000000001110 12335677899999999999 Q ss_pred HhhCCceeeeCCCCChhHHHHHHHHHHHHHhccC--CCHHHHHHHHHH-HHhhcceeeeeeeeccCccccCcceeeeecc Q lcl|NC_017974. 80 VEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEG--QTLRQFARDAAT-FNEYGLSVVEKVYTQVTVGEYIGKYKVKNLA 156 (507) Q Consensus 80 v~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~--~~~~~~l~~~l~-a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~ 156 (507) |.+++|.+... +.... ..-+..+|..-++ .++.++++.+.. .+.+|.+.+++++... |+ +..|. T Consensus 59 iA~~p~~~~~~-~~~~~----~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~------G~--~~~L~ 125 (416) T protein:vir:45 59 LARMPIRVTVN-GQINY----SDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKT------GE--PMNLT 125 (416) T ss_pred hccCceEEecC-ccccc----cchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCC------Cc--EEEEE Confidence 98887665422 21111 1223344443222 235567776654 5679999999887532 32 44566 Q ss_pred ccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHH Q lcl|NC_017974. 157 FRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGC 236 (507) Q Consensus 157 ~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~ 236 (507) +.++.++. +..+.+|+.. .. +....+ ........+|+..+|++++.+ .+.++|.|++..+ T Consensus 126 ~i~~~~v~----v~~~~~g~~~--~~------~~~~~~-------~~~~~~~~~~~~evihir~~~-~d~~~G~s~i~~~ 185 (416) T protein:vir:45 126 FRKTSEIE----LKSDARGRLY--YF------HQRIDS-------NGNNIERNVKFEDMLDIKFYS-LDGINGLSLLDTL 185 (416) T ss_pred EEcCceeE----EEECCCccEE--EE------EEEecC-------CCceeEEEEccccEEEeccCC-CCCccccCHHHHH Confidence 66665443 1223333211 00 000000 001112356777777766654 4558999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccc Q lcl|NC_017974. 237 YRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQ 316 (507) Q Consensus 237 ~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~ 316 (507) .-..-.-....++...|...-+.|--++++|-.+ .+++...++.+.+.+..... .++...++++.|++.. T Consensus 186 ~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~------~~~~~~~~~~~~~~~~~~g~-~nag~~~vl~~g~~~~--- 255 (416) T protein:vir:45 186 SRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVL------DNKKARDRAREEFHKSFSGT-KQAGKVVVLDESMTFD--- 255 (416) T ss_pred HHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCC------CCHHHHHHHHHHHHHHhcCc-cccCceeecCCCceeE--- Confidence 8877777777788888888777776666665211 12222222222232221110 0111247778877543 Q ss_pred cceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 317 YSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQTIHTQFVQRVNEIILEALNENLLPQL 396 (507) Q Consensus 317 ~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~li~~l 396 (507) ..+-+.....|.+..++..++|++++.-..--.+....+ ++..+... ....-+.-.++.|+..||+.|.... T Consensus 256 ------~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~-~~~~~~~~-~~~~~l~P~~~~ie~~ln~~l~~~~ 327 (416) T protein:vir:45 256 ------QLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETAN-MSITDANL-DYLSTLKPYITCVCAELNFKFNDEY 327 (416) T ss_pred ------eccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCC-ccHHHHHH-HHHHHHHHHHHHHHHHHhhhccccc Confidence 222222233466677888899999876543212211122 22222111 1233566677788888887553221 Q ss_pred HHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccc-cchhhhcCC--- Q lcl|NC_017974. 397 LALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDIS-QEELMKLLG--- 472 (507) Q Consensus 397 ~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~-~~~~~~~~~--- 472 (507) .+ .+-+|.++.....|.+..++++.+++..|++.+ +.+|+.+|+|+-++++.. ........+ T Consensus 328 --------~~-~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~~T~-----NE~R~~~gl~p~~~gd~~~~~~~~n~~~~~~ 393 (416) T protein:vir:45 328 --------VN-REFKFDTTEIRVVDEKTQAEIDKINIDSGKMNI-----DEIRQRDGLAPIPGGNGSIHRVDLNHVNIEL 393 (416) T ss_pred --------cC-ceEEEechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCCcceEeeccccccccc Confidence 11 122344455566789999999999999998876 579999999876555431 111111111 Q ss_pred -Cc--CCCCCCcccccCccccccc Q lcl|NC_017974. 473 -ED--TSRAGDGMTKGSSGNGTGK 493 (507) Q Consensus 473 -~~--~~~~~~~~~~~~~~~~~~~ 493 (507) +. ..+.+........|+. .+ T Consensus 394 ~~~~~~~~~~~~~~~~kgGe~-n~ 416 (416) T protein:vir:45 394 VDEYQMNKSRATDKKLKGGEE-NE 416 (416) T ss_pred ccccCcccccccccccCCCCC-CC Confidence 00 0111111111111111 00 No 68 >protein:vir:81095 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429872;genbank:gi:156603925;genbank:GeneID:5525315 Probab=99.53 E-value=5.6e-13 Score=87.77 Aligned_cols=405 Identities=10% Similarity=0.005 Sum_probs=197.3 Q ss_pred Ccccccccc-cccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNR-SVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTF 79 (507) Q Consensus 1 ~~~k~~~~~-~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~a 79 (507) |-=-++..| ....+.. +. ....... ..........+. -+..++.+.|.+|+..+-.. T Consensus 1 Mg~f~~~~~r~~~~~~~---------------~~--~~~~~~~----~~~~~~~~~~~~-~~~al~~~~v~~cv~~Ia~~ 58 (416) T protein:vir:81 1 MGIFYKNEKRDLQYNED---------------DL--QMMVQTL----PGFQGTKLRQYK-DIEAIRHSDIFTAVMMIASD 58 (416) T ss_pred CCcccccccccccCCCc---------------ch--hHHHHHh----ccccccCccccc-hhhhhcchHHHHHHHHHHHh Confidence 433222111 1100000 00 0000000 000000001110 12335677899999999999 Q ss_pred HhhCCceeeeCCCCChhHHHHHHHHHHHHHhccC--CCHHHHHHHHHH-HHhhcceeeeeeeeccCccccCcceeeeecc Q lcl|NC_017974. 80 VEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEG--QTLRQFARDAAT-FNEYGLSVVEKVYTQVTVGEYIGKYKVKNLA 156 (507) Q Consensus 80 v~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~--~~~~~~l~~~l~-a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~ 156 (507) |.+++|.+... +.... ..-+..+|..-++ .++.++++.+.. .+.+|.+.+++++... |+ +..|. T Consensus 59 iA~~p~~~~~~-~~~~~----~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~------G~--~~~L~ 125 (416) T protein:vir:81 59 LARMPIRVTVN-GQINY----SDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKT------GE--PMNLT 125 (416) T ss_pred hccCceEEecC-ccccc----cchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCC------Cc--EEEEE Confidence 98887665422 21111 1223344443222 235567776654 5679999999887532 32 44566 Q ss_pred ccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHH Q lcl|NC_017974. 157 FRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGC 236 (507) Q Consensus 157 ~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~ 236 (507) +.++.++. +..+.+|+.. .. +....+ ........+|+..+|++++.+ .+.++|.|++..+ T Consensus 126 ~i~~~~v~----v~~~~~g~~~--~~------~~~~~~-------~~~~~~~~~~~~evihir~~~-~d~~~G~s~i~~~ 185 (416) T protein:vir:81 126 FRKTSEIE----LKSDARGRLY--YF------HQRIDS-------NGNNIERNVKFEDMLDIKFYS-LDGINGLSLLDTL 185 (416) T ss_pred EEcCceeE----EEECCCccEE--EE------EEEecC-------CCceeEEEEccccEEEeccCC-CCCccccCHHHHH Confidence 66665443 1223333211 00 000000 001112356777777766654 4558999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccc Q lcl|NC_017974. 237 YRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQ 316 (507) Q Consensus 237 ~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~ 316 (507) .-..-.-....++...|...-+.|--++++|-.+ .+++...++.+.+.+..... .++...++++.|++.. T Consensus 186 ~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~------~~~~~~~~~~~~~~~~~~g~-~nag~~~vl~~g~~~~--- 255 (416) T protein:vir:81 186 SRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVL------DNKKARDRAREEFHKSFSGT-KQAGKVVVLDESMTFD--- 255 (416) T ss_pred HHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCC------CCHHHHHHHHHHHHHHhcCc-cccCceeecCCCceeE--- Confidence 8877777777788888888777776666665211 12222222222232221110 0111247778877543 Q ss_pred cceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 317 YSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQTIHTQFVQRVNEIILEALNENLLPQL 396 (507) Q Consensus 317 ~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~li~~l 396 (507) ..+-+.....|.+..++..++|++++.-..--.+....+ ++..+... ....-+.-.++.|+..||+.|.... T Consensus 256 ------~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~-~~~~~~~~-~~~~~l~P~~~~ie~~ln~~l~~~~ 327 (416) T protein:vir:81 256 ------QLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETAN-MSITDANL-DYLSTLKPYITCVCAELNFKFNDEY 327 (416) T ss_pred ------eccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCC-ccHHHHHH-HHHHHHHHHHHHHHHHHhhhccccc Confidence 222222233466677888899999876543212211122 22222111 1233566677788888887553221 Q ss_pred HHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccc-cchhhhcCC--- Q lcl|NC_017974. 397 LALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDIS-QEELMKLLG--- 472 (507) Q Consensus 397 ~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~-~~~~~~~~~--- 472 (507) .+ .+-+|.++.....|.+..++++.+++..|++.+ +.+|+.+|+|+-++++.. ........+ T Consensus 328 --------~~-~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~~T~-----NE~R~~~gl~p~~~gd~~~~~~~~n~~~~~~ 393 (416) T protein:vir:81 328 --------VN-REFKFDTTEIRVVDEKTQAEIDKINIDSGKMNI-----DEIRQRDGLAPIPGGNGSIHRVDLNHVNIEL 393 (416) T ss_pred --------cC-ceEEEechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCCcceEeeccccccccc Confidence 11 122344455566789999999999999998876 579999999876555431 111111111 Q ss_pred -Cc--CCCCCCcccccCccccccc Q lcl|NC_017974. 473 -ED--TSRAGDGMTKGSSGNGTGK 493 (507) Q Consensus 473 -~~--~~~~~~~~~~~~~~~~~~~ 493 (507) +. ..+.+........|+. .+ T Consensus 394 ~~~~~~~~~~~~~~~~kgGe~-n~ 416 (416) T protein:vir:81 394 VDEYQMNKSRATDKKLKGGEE-NE 416 (416) T ss_pred ccccCcccccccccccCCCCC-CC Confidence 00 0111111111111111 00 No 69 >protein:vir:101647 Length: 460 # NCBI annotation: phage portal protein # Family: family:all:26542 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112492;genbank:gi:53793592;uniprot:Q5ZGG1;genbank:GeneID:3101755 Probab=99.51 E-value=1.7e-12 Score=85.18 Aligned_cols=418 Identities=11% Similarity=-0.015 Sum_probs=196.5 Q ss_pred CccccccccccccccccccCchhhhHH-HHhhhhcccchhccchhHHHH--HHHhhChHHHHHHHHHHHHHhhCCceeee Q lcl|NC_017974. 13 VAKATSEQINVSRMRMSEQGTFALAKV-QVDSERMKAEEIRWPNLIGTV--ESMKQDATVATGLDMLYTFVEKAFKDFKV 89 (507) Q Consensus 13 ~~~~~~~~~~~~~~~~~~~g~~~~~~~-~~~~~~e~~~~lr~~~~~~~y--~~m~~D~~v~s~l~~Rk~av~~~~~~~~~ 89 (507) |+...+..+ .+ ..+.+..-...| ..+. . . ....+.....+ +..+.-+.|.+|+..+-..|.+++|.+.. T Consensus 1 ~~~~~~~~~--~~--~~~~~~~~~~~~~~~~g-~--~-~~~~~~~~~~~~~~~a~~~~~v~~~v~~ia~~iA~lp~~v~~ 72 (460) T protein:vir:10 1 MANRIIRAL--RE--LTGLDNKFNDAFIKYIG-Q--T-FTKYDNNGKTYLEQGYNINPDVYSCISQMAAKTVAVPYTIKV 72 (460) T ss_pred CchhHHHHH--hh--hhccCCCchHHHHHhhc-c--c-cCCCccchhhhhHHHHhcchHHHHHHHHHHHhhhhCceEEEe Confidence 333332221 00 001111000011 0000 0 0 00112222222 22456799999999999999999877643 Q ss_pred CCCCChhHH---------HHH-----------------HHHHHHHHhccC--CCHHHHHHHHH-HHHhhcceeeeeeeec Q lcl|NC_017974. 90 IPGESEESK---------KAA-----------------KFVEYCLKNMEG--QTLRQFARDAA-TFNEYGLSVVEKVYTQ 140 (507) Q Consensus 90 ~~~~~~~~~---------~~a-----------------e~v~~~l~~~~~--~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~ 140 (507) ...+....+ .+. +-+...|..-++ .++.++++.++ +.+++|-+..++++.. T Consensus 73 ~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~ 152 (460) T protein:vir:10 73 VKDTKAYQQLNNLNISTKGLYSFTQSLQKNRLDTKAFSETEKAFPLESPNPTQTWADIYSLYKTYMRLNGNCYFYLMSPD 152 (460) T ss_pred ccCCccchhhhhhhhhhhhhHHHHHHhhcchhhhcccchhHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecC Confidence 322211000 000 001111211111 24667878777 5778999999888754 Q ss_pred cCccccCcceeeeeccccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEee Q lcl|NC_017974. 141 VTVGEYIGKYKVKNLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNT 220 (507) Q Consensus 141 ~~~~~~~g~~~~~~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~ 220 (507) .+ ...| .+..|.+.++.++.. ..+.|+..+. .. ...........+....+|++.+|++++ T Consensus 153 ~~--~~~G--~~~~L~~l~~~~v~v----~~~~~~~~~~--~~----------~~~~~~~~~~~g~~~~~~~~evih~r~ 212 (460) T protein:vir:10 153 DG--INAG--VPSQMYVLPAHLIKI----VLKDDINLLS--TD----------SPIKSYMLIQGDQFIEFNEDEVIHTKY 212 (460) T ss_pred CC--ccCc--eeEEEEEEcCceEEE----EEcCCCceee--ee----------eeeeEEEEecCceeEEecccceEEEec Confidence 32 1122 345677777665432 2233332111 10 001111122233456788888887776 Q ss_pred CCcc-----CccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHH Q lcl|NC_017974. 221 GGSS-----SQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAAN 295 (507) Q Consensus 221 ~~~~-----~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n 295 (507) .... +..+|.|.+..+.-..-.-....++-..|...-+.|--++..+ ..-++++..+..+.+.+.... T Consensus 213 ~~~~~~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~~~~i~~~~-------~~l~~e~~~~~~~~~~~~~~g 285 (460) T protein:vir:10 213 ANPNFDLQGSHLYGMSPIRAILRNINSQNSTIDNNVKTMQNGGVFGFIHGGS-------TGLTQPQADSLKQRLTEMDKS 285 (460) T ss_pred CCCCcccccCccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceeeecC-------CCCCHHHHHHHHHHHHHHhcC Confidence 5433 4568999999998777777777777777776544443333322 222333333332333322110 Q ss_pred hhcCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhccccccc--ccchhhhhHHHHH-HHH Q lcl|NC_017974. 296 AHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVG--NDKGGSYNLSESK-QTI 372 (507) Q Consensus 296 ~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~--~~~~GS~Al~~vh-~~v 372 (507) ..++...++++.|++.. ..+.+.....+.+..++...+|++++.-..--.+ ++.+.++|-.+-+ ... T Consensus 286 -~~n~g~~~vl~~g~~~~---------~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~e~~~~~f 355 (460) T protein:vir:10 286 -PDRLSQIAGASGEIAFT---------KISLNTDELKPFDYLKYDQKAICNALGWSDKLLNNNEGGGLNTGNLEEERKRV 355 (460) T ss_pred -ccccCCceecCCCceEE---------EccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCCccccHHHHHHHH Confidence 01122346677776432 2222222334666778888999998754332222 2222334433333 344 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHc Q lcl|NC_017974. 373 HTQFVQRVNEIILEALNENLLPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVL 452 (507) Q Consensus 373 ~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~ 452 (507) ...-+.-.++.|++.||+.|++..-. ..-.+|+|+...-..+..-.+....+++.|++.+ +.+|+.+ T Consensus 356 ~~~~l~P~~~~ie~~ln~kl~~~~~~--------~~~~~i~~d~~~l~~l~~d~~~~~~~~~~g~~T~-----NE~R~~~ 422 (460) T protein:vir:10 356 VTDNIQPDLVILKQAFDKKFIKRFKG--------YENAVIEWDISELPEMQTDMVAMASWLNTIPVTP-----NEIRIAM 422 (460) T ss_pred HHHHHHHHHHHHHHHHHHhhcCcccc--------cCCceEEeecchhhhHHHHHHHHHHHHhCCCCCH-----HHHHHHh Confidence 45567778899999999877655321 1112355554433222222334445678898766 5799999 Q ss_pred CCCCCCC--cccccchhhhcCCCcCCCCCCcccccCccccccc Q lcl|NC_017974. 453 GIDEKIE--EDISQEELMKLLGEDTSRAGDGMTKGSSGNGTGK 493 (507) Q Consensus 453 Gip~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 493 (507) |+|+-.+ +|.......-...+..+ +... ..++...+ T Consensus 423 g~~pi~~~~gD~~~~~~n~~~~~~~~---~~~~--~~~~nq~~ 460 (460) T protein:vir:10 423 KYETLNQDGMDIVFMPSNKVRIDDVS---NNLI--DSAFNQNQ 460 (460) T ss_pred CCCCCCCCCCCeeeecccccchhhcc---cccC--CCcccCCC Confidence 9986432 22211110000000000 0000 00000000 No 70 >protein:vir:7407 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839924;genbank:gi:30089894;genbank:GeneID:1260681 Probab=99.51 E-value=3.1e-13 Score=89.12 Aligned_cols=385 Identities=12% Similarity=0.040 Sum_probs=189.9 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHH-HHHhhChHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTV-ESMKQDATVATGLDMLYTF 79 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y-~~m~~D~~v~s~l~~Rk~a 79 (507) |.-++..++....+.........+. +.. ...+..+.. .....+. +..++-+.|.+|+..+-.. T Consensus 3 m~~~~~~~~~~~~~~~~~~~~~~~~------~~~-~~~~~~~~~---------~~g~~v~~~~al~~~~v~~~v~~ia~~ 66 (392) T protein:vir:74 3 LPILNFINQTNDPPEAGSVQSYFPD------GND-AQIMESLLG---------DNNEWVSARAALRNSDLFSIILQLSSD 66 (392) T ss_pred chhhhhhhcccCccccccccccccc------Cch-hhhhhhccC---------CCCcccchhhhhcchHHHHHHHHHHHh Confidence 4333333322222221111111000 000 000001100 0011112 2346788999999999999 Q ss_pred HhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeeeecccc Q lcl|NC_017974. 80 VEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAFR 158 (507) Q Consensus 80 v~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~r 158 (507) |.++++.+. ... ...+++. --...++.++++.++ +.+.+|-+.+++++... |+ +..|.+. T Consensus 67 ia~lp~~~~--~~~------~~~l~~~---PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~------G~--~~~L~~i 127 (392) T protein:vir:74 67 LAIVKINAE--KKK------NQGIIDN---PSTNANKHGFWQSMFAQLLLGGEAFAYRWRNAN------GA--DMKWEYL 127 (392) T ss_pred hccCceeec--cch------hhhhhhh---cCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCC------Cc--EEEEEEE Confidence 988865543 211 0112221 111234567777766 67889999999987643 32 4456666 Q ss_pred CchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHHHH Q lcl|NC_017974. 159 PQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYR 238 (507) Q Consensus 159 ~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~ 238 (507) ++.++. +..+.++..+.... .... ........+|.+.+|++++....+..+|.|.+..+.. T Consensus 128 ~~~~v~----v~~~~~~~~~~y~~---------~~~~------~~~~~~~~~~~~evih~~~~~~~~~~~G~s~i~~~~~ 188 (392) T protein:vir:74 128 RPSQVN----TYYFEYENGMYYNI---------TFDD------PKIEPILQAPQSDLIHMKLLSIDGGKTGISPLYSLRR 188 (392) T ss_pred cCceeE----EEEcCCCceEEEEE---------EecC------CccceeEEEcCccEEEecCCCCCCccccccHHHHHHH Confidence 665443 22233443221110 0000 0011123467777776666666666899999999998 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccccc Q lcl|NC_017974. 239 AWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYS 318 (507) Q Consensus 239 ~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~ 318 (507) ..-.-....++-+.+....+.|--++++|-.. ..+++......++.. .. .++...+++|.|++++- T Consensus 189 ~i~~~~~~~~~~~~~f~ng~~p~~il~~~~~~-----~~~~~~~~~~~~~~~----~~-~n~g~~~vl~~g~~~~~---- 254 (392) T protein:vir:74 189 ESKIQRASDRLTISSLNSSLNVPGVLTVKGGG-----LLSDKDKASRSRSFM----KR-SRSGGPVVLDDLEEFTA---- 254 (392) T ss_pred HHHHHHHHHHHHHHHHhccCCCceEEEeCCCC-----CchHHHHHHHHHHHh----cc-ccCCCeeecCCCceEEE---- Confidence 88888888888888998888877777765321 122222222222211 11 11223477888875432 Q ss_pred eeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 319 MTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQTIHTQFVQRVNEIILEALNENLLPQLLA 398 (507) Q Consensus 319 ~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~ 398 (507) .+-+.....|.+..++..++|++++.-..--.+.. +.+++..+.-......-+.--++.|++.||+.|++.+ . T Consensus 255 -----l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~-~~~~~~~e~~~~~~~~~l~p~~~~ie~~l~~~l~~~~-~ 327 (392) T protein:vir:74 255 -----LEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQ-GDQQSSIQQISGMYASALNRYLRPAISELEYKLSDHI-S 327 (392) T ss_pred -----ccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCC-CCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhccchh-c Confidence 22222333567777888899999875443222221 1112222222233344455667777777777664432 2 Q ss_pred hCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHH---cCCCCCCCcccccchhhhcCCCcC Q lcl|NC_017974. 399 LNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEV---LGIDEKIEEDISQEELMKLLGEDT 475 (507) Q Consensus 399 lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~---~Gip~~~~~~~~~~~~~~~~~~~~ 475 (507) +| +...-..|...+++.+.+|+..|++.++ ..|+. .|+.+ .+....+ ..+ | T Consensus 328 ~~-------------~~~~~~~d~~~~~~~~~~l~~~g~~t~n-----ear~~~~~~g~~p--ne~r~~e-nl~--~--- 381 (392) T protein:vir:74 328 VN-------------MRPAIDPLGDNYLSTISTATRWGALAEN-----QATFVLQEAGYIP--KDLPAPE-NTN--K--- 381 (392) T ss_pred cc-------------chhhhcCCHHHHHHHHHHHHhCCCcCHH-----HHHHHHHhCCCCc--cccchhc-CCC--C--- Confidence 22 1122234667788899999999987764 34443 46642 2211110 000 0 Q ss_pred CCCCCcccccCccccccccc Q lcl|NC_017974. 476 SRAGDGMTKGSSGNGTGKIS 495 (507) Q Consensus 476 ~~~~~~~~~~~~~~~~~~~~ 495 (507) .+.|+ ++.+ .| T Consensus 382 ~~~Gd-------~~~p--~p 392 (392) T protein:vir:74 382 KTTGQ-------SNEP--VP 392 (392) T ss_pred CCCCC-------CCCC--CC Confidence 01111 1111 11 No 71 >protein:vir:3989 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116497;genbank:gi:14251130;genbank:GeneID:921299 Probab=99.49 E-value=4.7e-13 Score=88.18 Aligned_cols=385 Identities=11% Similarity=0.056 Sum_probs=190.8 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTFV 80 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~av 80 (507) |.-.+..++....+.........+. +.. ....... .+ .....+. -+..++.+.|.+|+..+-..| T Consensus 3 m~~f~~~~~~~~~~~~~~~~~~~~~------~~~----~~~~~~~-~~---~~~~~v~-~~~al~~~~v~~~i~~ia~~i 67 (392) T protein:vir:39 3 LPILNFINQTNDPPEVGSVQSYFPD------GND----AQIMESL-LG---DNNEWVS-ARAALRNSDLFSIILQLSSDL 67 (392) T ss_pred chhhhhhhccccccccccccccccc------Cch----hhhhhhh-cC---CCCceec-hHHhhccHHHHHHHHHHHHhh Confidence 5444333332222222111111110 000 0000000 00 0011111 133457889999999999999 Q ss_pred hhCCceeeeCCCCChhHHHHHHHHHHHHHh-ccCCCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeeeecccc Q lcl|NC_017974. 81 EKAFKDFKVIPGESEESKKAAKFVEYCLKN-MEGQTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAFR 158 (507) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~-~~~~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~r 158 (507) .++++.+. ... .. . .+++ -...++.++++.++ +.+.+|-+.+++++... |+ +..|.+. T Consensus 68 a~lp~~~~--~~~---~~---~----l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~------g~--~~~L~~l 127 (392) T protein:vir:39 68 AIVKINAE--KKK---NQ---G----IIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNAN------GA--DMKWEYL 127 (392) T ss_pred ccCceeec--cch---hh---h----HhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCC------Cc--EEEEEEE Confidence 88865443 211 00 1 1111 11124567777766 57889999999987542 32 4556666 Q ss_pred CchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHHHH Q lcl|NC_017974. 159 PQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYR 238 (507) Q Consensus 159 ~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~ 238 (507) ++.++. +..+.|+..+... +.... ........+|.+.+|++++....+..+|.|.+..+.. T Consensus 128 ~~~~v~----~~~~~~~~~~~y~---------~~~~~------~~~~~~~~~~~~eiih~~~~~~~~~~~G~s~i~~~~~ 188 (392) T protein:vir:39 128 RPSQVN----TYYFEYENGMYYN---------ITFDD------PKIEPILQAPQSDLIHMKLLSIDGGKTGISPLYSLRR 188 (392) T ss_pred cCceeE----EEEcCCCceEEEE---------EEecC------cccceeEEEccccEEEecCCCCCCccccccHHHHHHH Confidence 665442 1223343322111 00000 0011123467777777666666667899999999998 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccccc Q lcl|NC_017974. 239 AWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYS 318 (507) Q Consensus 239 ~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~ 318 (507) ..-.-....++-..+....+.|--++++|-. ...+++......++.. +. .++...+++|.|+++. T Consensus 189 ~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~-----~~~~~~~~~~~~~~~~----~~-~~~g~~~vl~~g~~~~----- 253 (392) T protein:vir:39 189 ESKIQRASDRLTISSLNSSLNVPGVLTVKGG-----GLLSDKDKASRSRSFM----KR-SRSGGPVVLDDLEEFT----- 253 (392) T ss_pred HHHHHHHHHHHHHHHHhccCCCceEEEeCCC-----CCchHHHHHHHHHHHh----cc-ccCCCeeecCCCceEE----- Confidence 8888788888888888887777666666532 1122222222111111 11 1222347778887543 Q ss_pred eeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 319 MTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQTIHTQFVQRVNEIILEALNENLLPQLLA 398 (507) Q Consensus 319 ~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~ 398 (507) ..+.+.....+.+..++..++|++++.-..--.+.. +.+++..+........-+.-.++.|++.||+.|++.+ . T Consensus 254 ----~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~-~~~~~~~~~~~~f~~~~l~P~~~~ie~~l~~~L~~~~-~ 327 (392) T protein:vir:39 254 ----ALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQ-GDQQSSIQQISGMYASALNRYLRPAISELEYKLSDHI-S 327 (392) T ss_pred ----EccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCC-CCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhccccc-c Confidence 222222333567777888899999875543323221 1112222223334455566677788888887664432 2 Q ss_pred hCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHc---CCCCCCCcccccchhhhcCCCcC Q lcl|NC_017974. 399 LNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVL---GIDEKIEEDISQEELMKLLGEDT 475 (507) Q Consensus 399 lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~---Gip~~~~~~~~~~~~~~~~~~~~ 475 (507) +| +...-..|...+++.+.+|+..|+...+ .+|+.+ |+.+ .+....+ .. .| T Consensus 328 ~d-------------~~~~~~~d~~~~~~~~~~l~~~g~~t~n-----E~r~~l~~~g~~p--~e~r~~e-~l--~~--- 381 (392) T protein:vir:39 328 VN-------------MRPAIDPLGDNYLSTISTATRWGALAEN-----QATFVLQEAGYIP--KDLPAPE-NT--NK--- 381 (392) T ss_pred cc-------------chhhhccCHHHHHHHHHHHHhCCCcCHH-----HHHHHHHhcCCCc--cccchhc-CC--CC--- Confidence 22 1112234667778899999999987664 344443 6642 1111100 00 00 Q ss_pred CCCCCcccccCccccccccc Q lcl|NC_017974. 476 SRAGDGMTKGSSGNGTGKIS 495 (507) Q Consensus 476 ~~~~~~~~~~~~~~~~~~~~ 495 (507) .+.|+ +..+.| T Consensus 382 ~~~Gd---------~~~p~p 392 (392) T protein:vir:39 382 KTTGQ---------SNEPVP 392 (392) T ss_pred CCCCC---------CCCCCC Confidence 01111 111111 No 72 >protein:vir:1023 Length: 392 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076677;genbank:gi:13095786;genbank:GeneID:920364 Probab=99.49 E-value=4.7e-13 Score=88.18 Aligned_cols=385 Identities=11% Similarity=0.056 Sum_probs=190.8 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTFV 80 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~av 80 (507) |.-.+..++....+.........+. +.. ....... .+ .....+. -+..++.+.|.+|+..+-..| T Consensus 3 m~~f~~~~~~~~~~~~~~~~~~~~~------~~~----~~~~~~~-~~---~~~~~v~-~~~al~~~~v~~~i~~ia~~i 67 (392) T protein:vir:10 3 LPILNFINQTNDPPEVGSVQSYFPD------GND----AQIMESL-LG---DNNEWVS-ARAALRNSDLFSIILQLSSDL 67 (392) T ss_pred chhhhhhhccccccccccccccccc------Cch----hhhhhhh-cC---CCCceec-hHHhhccHHHHHHHHHHHHhh Confidence 5444333332222222111111110 000 0000000 00 0011111 133457889999999999999 Q ss_pred hhCCceeeeCCCCChhHHHHHHHHHHHHHh-ccCCCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeeeecccc Q lcl|NC_017974. 81 EKAFKDFKVIPGESEESKKAAKFVEYCLKN-MEGQTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAFR 158 (507) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~-~~~~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~r 158 (507) .++++.+. ... .. . .+++ -...++.++++.++ +.+.+|-+.+++++... |+ +..|.+. T Consensus 68 a~lp~~~~--~~~---~~---~----l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~------g~--~~~L~~l 127 (392) T protein:vir:10 68 AIVKINAE--KKK---NQ---G----IIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNAN------GA--DMKWEYL 127 (392) T ss_pred ccCceeec--cch---hh---h----HhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCC------Cc--EEEEEEE Confidence 88865443 211 00 1 1111 11124567777766 57889999999987542 32 4556666 Q ss_pred CchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHHHH Q lcl|NC_017974. 159 PQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYR 238 (507) Q Consensus 159 ~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~ 238 (507) ++.++. +..+.|+..+... +.... ........+|.+.+|++++....+..+|.|.+..+.. T Consensus 128 ~~~~v~----~~~~~~~~~~~y~---------~~~~~------~~~~~~~~~~~~eiih~~~~~~~~~~~G~s~i~~~~~ 188 (392) T protein:vir:10 128 RPSQVN----TYYFEYENGMYYN---------ITFDD------PKIEPILQAPQSDLIHMKLLSIDGGKTGISPLYSLRR 188 (392) T ss_pred cCceeE----EEEcCCCceEEEE---------EEecC------cccceeEEEccccEEEecCCCCCCccccccHHHHHHH Confidence 665442 1223343322111 00000 0011123467777777666666667899999999998 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccccc Q lcl|NC_017974. 239 AWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYS 318 (507) Q Consensus 239 ~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~ 318 (507) ..-.-....++-..+....+.|--++++|-. ...+++......++.. +. .++...+++|.|+++. T Consensus 189 ~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~-----~~~~~~~~~~~~~~~~----~~-~~~g~~~vl~~g~~~~----- 253 (392) T protein:vir:10 189 ESKIQRASDRLTISSLNSSLNVPGVLTVKGG-----GLLSDKDKASRSRSFM----KR-SRSGGPVVLDDLEEFT----- 253 (392) T ss_pred HHHHHHHHHHHHHHHHhccCCCceEEEeCCC-----CCchHHHHHHHHHHHh----cc-ccCCCeeecCCCceEE----- Confidence 8888788888888888887777666666532 1122222222111111 11 1222347778887543 Q ss_pred eeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 319 MTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQTIHTQFVQRVNEIILEALNENLLPQLLA 398 (507) Q Consensus 319 ~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~ 398 (507) ..+.+.....+.+..++..++|++++.-..--.+.. +.+++..+........-+.-.++.|++.||+.|++.+ . T Consensus 254 ----~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~-~~~~~~~~~~~~f~~~~l~P~~~~ie~~l~~~L~~~~-~ 327 (392) T protein:vir:10 254 ----ALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQ-GDQQSSIQQISGMYASALNRYLRPAISELEYKLSDHI-S 327 (392) T ss_pred ----EccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCC-CCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhccccc-c Confidence 222222333567777888899999875543323221 1112222223334455566677788888887664432 2 Q ss_pred hCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHc---CCCCCCCcccccchhhhcCCCcC Q lcl|NC_017974. 399 LNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVL---GIDEKIEEDISQEELMKLLGEDT 475 (507) Q Consensus 399 lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~---Gip~~~~~~~~~~~~~~~~~~~~ 475 (507) +| +...-..|...+++.+.+|+..|+...+ .+|+.+ |+.+ .+....+ .. .| T Consensus 328 ~d-------------~~~~~~~d~~~~~~~~~~l~~~g~~t~n-----E~r~~l~~~g~~p--~e~r~~e-~l--~~--- 381 (392) T protein:vir:10 328 VN-------------MRPAIDPLGDNYLSTISTATRWGALAEN-----QATFVLQEAGYIP--KDLPAPE-NT--NK--- 381 (392) T ss_pred cc-------------chhhhccCHHHHHHHHHHHHhCCCcCHH-----HHHHHHHhcCCCc--cccchhc-CC--CC--- Confidence 22 1112234667778899999999987664 344443 6642 1111100 00 00 Q ss_pred CCCCCcccccCccccccccc Q lcl|NC_017974. 476 SRAGDGMTKGSSGNGTGKIS 495 (507) Q Consensus 476 ~~~~~~~~~~~~~~~~~~~~ 495 (507) .+.|+ +..+.| T Consensus 382 ~~~Gd---------~~~p~p 392 (392) T protein:vir:10 382 KTTGQ---------SNEPVP 392 (392) T ss_pred CCCCC---------CCCCCC Confidence 01111 111111 No 73 >protein:vir:95378 Length: 406 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764474;genbank:gi:115334628;genbank:GeneID:5179265 Probab=99.47 E-value=7.6e-12 Score=81.55 Aligned_cols=395 Identities=11% Similarity=0.078 Sum_probs=196.0 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTFV 80 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~av 80 (507) |.=..+-++..+...+....... . .+...... ... .-.+..++..+.|.+|+..+-..| T Consensus 1 Mg~f~~~~~~~~~~~~~~~~~~~--------~-----~~~~~~~~------~~~--~~~~~~~~~~~~v~~~i~~ia~~i 59 (406) T protein:vir:95 1 MGLFDRWRRTKRKSKIRADTGYV--------G-----LFMSGEDV------SFL--VPGYVRLSDNPEVRMAVHKIADLI 59 (406) T ss_pred Ccchhhhccccccccccccchhh--------h-----hhccCccc------Ccc--ccCHHHHhhcHHHHHHHHHHHHhh Confidence 33222111111111111100000 0 00000000 000 001345667899999999999999 Q ss_pred hhCCceeeeCCCCChhHHHHHHHHHHHHHhccC--CCHHHHHHHHHHH-Hhh--cceeeeeeeeccCccccCcceeeeec Q lcl|NC_017974. 81 EKAFKDFKVIPGESEESKKAAKFVEYCLKNMEG--QTLRQFARDAATF-NEY--GLSVVEKVYTQVTVGEYIGKYKVKNL 155 (507) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~--~~~~~~l~~~l~a-~~~--Gfs~~Ei~w~~~~~~~~~g~~~~~~l 155 (507) .+++|.+.....+. .+++..-+...|..-++ .++.++++.++.. +.+ ||+..++++... | .+..| T Consensus 60 a~~~~~~~~~~~~~--~~~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~~ll~g~g~a~~~~~~~~~------g--~~~~l 129 (406) T protein:vir:95 60 SSMTIYLMQNTEDG--DIRIRNELSRKIDITPYSLMTRKSWMYNIVYTMLLDGEGNSVVFPKYTAD------G--LIDEL 129 (406) T ss_pred ccCceEEEEecCCc--ceeecchHHHHHhhccCCCCCHHHHHHHHHHHHHhcCCceEEEEEEECCC------C--cEEEE Confidence 98877653322111 11111222333332222 3467787777753 444 677777777532 2 24556 Q ss_pred cccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCcc-CccccchHHH Q lcl|NC_017974. 156 AFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSS-SQALGVSPLV 234 (507) Q Consensus 156 ~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~-~~p~G~gll~ 234 (507) .+.++.++.. ..+.+|..+ ...+..+|++.+|++++...+ +..+|.|.+. T Consensus 130 ~~i~~~~v~~----~~~~~~~~~-------------------------~~~~~~~~~~evih~~~~~~~~~~~~G~s~i~ 180 (406) T protein:vir:95 130 VPLTPSKVNF----LDTPDGYQV-------------------------LYGGQTFNYDEVLHFIYNPDPERPYIGRGYRV 180 (406) T ss_pred EEEcCceeEE----EEcCCeEEE-------------------------EeccEEEchhHEEEeeccCCCCCCccccCHHH Confidence 6666554421 112222100 012234677777777765444 4467999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccc Q lcl|NC_017974. 235 GCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNA 314 (507) Q Consensus 235 ~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~ 314 (507) .+....-.-....++.+.+...-+.|--++++|-. -+++...+..+++.+...... ++...++++.+.+. . T Consensus 181 ~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~-------l~~e~~~~~~~~~~~~~~g~~-n~~~~~v~~~~~~~-~ 251 (406) T protein:vir:95 181 VLKDIADNLKQATATKKSFMSGKYMPSLIVKVDAA-------TAELSSEEGRNAVFKKYLQAT-EAGQPWIIPAELLE-V 251 (406) T ss_pred HHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCC-------CCHHHHHHHHHHHHHHhcccc-ccCCceeecCCCcc-c Confidence 99888877777888888888776666556655432 233333344444444332111 11223566665421 1 Q ss_pred cccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 315 PQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQTIHTQFVQRVNEIILEALNENLLP 394 (507) Q Consensus 315 ~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~li~ 394 (507) . ++.. -+.....+.+..++.-++|++++.-..--.+.+ ++. .+........-+.-.++.|++.||+.|+. T Consensus 252 ~----~~~~--~~~~d~q~~e~~~~~~~~Ia~~fgVp~~~lg~~---~~~-~~~~~~~~~~~l~P~~~~ie~~l~~~l~~ 321 (406) T protein:vir:95 252 E----QVKP--LSLKDIAINEAVELDKRTVAGMFGVPAFLLGIG---EFN-RDEYNNFINSTILPIAKGIEQELTRKLLI 321 (406) T ss_pred c----cccc--CChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCC---Cch-HHHHHHHHHHHHHHHHHHHHHHHHHhcCC Confidence 1 1111 122223466677888899998876554222221 111 12233444555666777777777765532 Q ss_pred HHHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhh----c Q lcl|NC_017974. 395 QLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMK----L 470 (507) Q Consensus 395 ~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~----~ 470 (507) +...+-+|.++.....|.++.++.+.+|+..|++.+ +++|+.+|+|+-+.++.-.....- . T Consensus 322 ----------~~~~~~~fd~~~l~~~d~~~~~~~~~~l~~~G~~t~-----NE~R~~~gl~p~~~gd~~~~~~n~~~~~~ 386 (406) T protein:vir:95 322 ----------SPDLYFKFNPRSLYAYDLKELAEVGSNMYVRGIMEG-----NEVRDWLGLSPKEGLSELVILENYIPLDK 386 (406) T ss_pred ----------CCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCcceeeeccCccchhh Confidence 122233445555556789999999999999999876 579999999865444432111110 0 Q ss_pred CCC-cCCCCCCcccccCccccccc Q lcl|NC_017974. 471 LGE-DTSRAGDGMTKGSSGNGTGK 493 (507) Q Consensus 471 ~~~-~~~~~~~~~~~~~~~~~~~~ 493 (507) ..+ .+.+.+++.. .+++.. T Consensus 387 ~~~~~~~k~g~~~~----~~~~~~ 406 (406) T protein:vir:95 387 IGDQSKLKGGDNSG----ADGQTD 406 (406) T ss_pred cccccccCCCCCCC----CCCCCC Confidence 011 0011111100 011111 No 74 >protein:vir:94666 Length: 723 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579205;genbank:gi:93007441;genbank:GeneID:5076785 Probab=99.44 E-value=8.7e-12 Score=81.21 Aligned_cols=408 Identities=11% Similarity=0.034 Sum_probs=193.1 Q ss_pred ccccCchh--hhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHHHhhCCceeeeCCCCChhHHHHHHHHH Q lcl|NC_017974. 28 MSEQGTFA--LAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTFVEKAFKDFKVIPGESEESKKAAKFVE 105 (507) Q Consensus 28 ~~~~g~~~--~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~ 105 (507) ++.+++.+ ...|..... . +-.. +..++.+.|.+|+..+-..|.++++.+.-..+....+ .-+. T Consensus 1 ~~~~~~~~g~~~~~~~~~~---~-----~~~~---~~~~~~~~V~acV~~Ia~~iA~lpl~l~~~~~~~~~~----~~l~ 65 (723) T protein:vir:94 1 MTTFPSGAGGWNAWSADSV---F-----GNGA---KGWSNSAVAYRCISMLANNAASVDLVVRGPDGELDEL----HPLS 65 (723) T ss_pred CcccccCCCcccccccccc---c-----cccH---HHHhhhHHHHHHHHHHHHhhccceeEEEcCCCccchh----hHHH Confidence 33333211 111111110 0 0000 2235678999999999999988876554222211111 1234 Q ss_pred HHHHhccC--CCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeeeeccccCchhcccccceeecCCCccccccc Q lcl|NC_017974. 106 YCLKNMEG--QTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAFRPQASLSRTNPIVYNEDGSAIIGIK 182 (507) Q Consensus 106 ~~l~~~~~--~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~r~~~~i~~~~~~~~~~d~~~l~~~~ 182 (507) ..|..-++ .+..++...++ +.+++|-+.+++++.-+. ..| .+..|.+.++.... +....++. . +. T Consensus 66 ~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~r~---~~g--~p~~l~~l~~~~~~----v~~~~~~~-~--~~ 133 (723) T protein:vir:94 66 QLWNVMPNRAMPAQVLKALSMTRLQLDGQCHLWLNYNGRT---PAG--VPDEIWYVYDRVTT----IVATRAAD-A--VP 133 (723) T ss_pred HHHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCc---ccc--ceeEEEEecCcceE----EeecCCCc-c--ce Confidence 44543222 34556777766 577899999998874211 112 23334444432111 00011110 0 00 Q ss_pred cccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCce Q lcl|NC_017974. 183 QSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVI 262 (507) Q Consensus 183 q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~ 262 (507) +....++. .....+....+|...+|++++....+..+|.|.+..+.-..-.-....++...|...-+.|-- T Consensus 134 ~~~~~~y~---------~~~~~G~~~~~~~~dIiHir~~~~~dg~~G~Spi~~a~~~i~~~~aa~~~~~~~f~NG~~p~g 204 (723) T protein:vir:94 134 QAQIIGYV---------IERTDGVRVPVLADEMLWLRFSDPYDPLAVMAPWKAARAAVDADFYAATWQRQSFKNGARPGG 204 (723) T ss_pred eeeeeEEE---------EEecCceeEEecccceEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcce Confidence 00000000 011122334577777776666555667799999998887777666677777777765444444 Q ss_pred EEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeec----------CcccccccccceeeeeccCCCcchh Q lcl|NC_017974. 263 ELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLP----------SDTKDNAPQYSMTLKGIDGMGKQYS 332 (507) Q Consensus 263 v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP----------~g~~i~~~~~~~el~~~~g~g~~~~ 332 (507) +++.| . .+++...++.+.+.+...... +..-.++++ .|++ ++..+-+..-.. T Consensus 205 iL~~~-------~-l~~e~~~~~~~~~~~~~~G~~-Nagk~~vL~g~~~~~~vl~~G~~---------~~~l~~s~~D~q 266 (723) T protein:vir:94 205 VVNLG-------D-MDEQTFTKTVAAFRSQVEGVQ-NAGRHLLIAGQGSDGGAAGKGAT---------FTSLSMSPAEMD 266 (723) T ss_pred EEEcC-------C-CCHHHHHHHHHHHHHHhhchh-hcCcceeecccccccccccCCce---------EEEccCCHHHHH Confidence 44432 1 223333333333332211000 011124443 3432 223322222234 Q ss_pred HHHHHHHHHHHHHHHHhccc-ccccccchhhhhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCccccc Q lcl|NC_017974. 333 TAQLISDRKKSILDRLGAGF-INVGNDKGGSYNLSES-KQTIHTQFVQRVNEIILEALNENLLPQLLALNDMRLPETEMP 410 (507) Q Consensus 333 ~~~li~~~d~~Isk~iLGqt-Lts~~~~~GS~Al~~v-h~~v~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~~P 410 (507) |.+.-++..++|++++.-.. +-.+ +++++-.+. .......-+.-.++.|+..||+.|++.. . .. - T Consensus 267 ~le~r~~~~~eIa~afgVPp~~i~~---~st~sN~e~~~~~f~~~tL~P~~~~ie~~ln~~Ll~~~---g----~~---~ 333 (723) T protein:vir:94 267 YINSRMHSAEEVMLAFGIRKDALLG---GSTYENQAEAKAAVWTETLIPQMEVMASITDLQLLPDI---G----WT---V 333 (723) T ss_pred HHHHHHHhHHHHHHHhCCChhHcCC---CCCcccHHHHHHHHHHHHHHHHHHHHHHHHhHhhcccc---c----Cc---e Confidence 66667788888999876663 2221 222321222 2233456677788889999998776531 1 11 1 Q ss_pred eEEecCcC--chhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccc---cchhhhcCCCcC-CCC-CCccc Q lcl|NC_017974. 411 YVKAGEIV--DVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDIS---QEELMKLLGEDT-SRA-GDGMT 483 (507) Q Consensus 411 ~~~~~~~~--~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~---~~~~~~~~~~~~-~~~-~~~~~ 483 (507) +|.|+... ..|.+..++++.+++..|++.+ +.+|+.+|+|+-+.++.. .+......|.+. .|. .++.+ T Consensus 334 ~~~f~~~~lLr~D~~~r~~~~~~~v~~G~~T~-----NE~R~~lglpPi~gGd~~~~~~p~~~~~a~~~~~~p~~~e~~~ 408 (723) T protein:vir:94 334 EWDFNSVPALQEDLEAQAGRNQGYLVNDVLMV-----DEVRATIGLDPLPGGIGQMTLTPYRAQFAPAPAPAPAVEEGAA 408 (723) T ss_pred EEeecchhhhhcCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCcccceeccccccccCCCCCCccchhhhH Confidence 46666533 5788999999999999999877 579999999876555421 111111111111 111 11111 Q ss_pred ccCc--c--cccccccCCCCchhhhccC Q lcl|NC_017974. 484 KGSS--G--NGTGKISSTRDNSAANLDN 507 (507) Q Consensus 484 ~~~~--~--~~~~~~~~~~d~~a~~~~~ 507 (507) ..-+ . .+..+.+..+-..+.-..+ T Consensus 409 ~~~~~~~~~~~~~p~~~~~~~~~~~~~~ 436 (723) T protein:vir:94 409 RMLALLERVAADRPLPELPVRATTVLHH 436 (723) T ss_pred hhhhhccccccccCcCCCCCCCCCCCCC Confidence 0000 0 0000111111111111111 No 75 >protein:vir:9507 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835554;genbank:gi:30043953;genbank:GeneID:1260535 Probab=99.44 E-value=4.6e-12 Score=82.76 Aligned_cols=385 Identities=12% Similarity=0.072 Sum_probs=176.4 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHH-HHHhhChHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTV-ESMKQDATVATGLDMLYTF 79 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y-~~m~~D~~v~s~l~~Rk~a 79 (507) |.-..+.-+.... + . ... .+.-...++ +..++-+.|.+|+..+-.. T Consensus 1 Mg~f~~lf~~~~~--~-----~--------~~~------------------~~~~~~~v~~~~~~~~~~v~~~i~~Ia~~ 47 (395) T protein:vir:95 1 MSILEKIFKTRKD--I-----T--------YML------------------DLDMIEDLSQQAYVKRLAIDSCIEFVARA 47 (395) T ss_pred CchhhhhhccCcc--c-----c--------ccc------------------cchhccccchhhhhhhHHHHHHHHHHHHh Confidence 4443221111000 0 0 000 000001111 1234568899999999999 Q ss_pred HhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHHHHH-hhcceeeeeeeeccCccccCcceeeeecccc Q lcl|NC_017974. 80 VEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAATFN-EYGLSVVEKVYTQVTVGEYIGKYKVKNLAFR 158 (507) Q Consensus 80 v~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l~a~-~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~r 158 (507) |.++++. +..+....+.....++.. +--...++.++++.++..+ +.|-++. ++.. + .+.+.+...... T Consensus 48 iA~~p~~--~~~~~~~~~~~~~~ll~~--~PN~~~t~~~f~~~~~~~lll~g~~~~-~~~~-~-----~~~~~~~~~~~~ 116 (395) T protein:vir:95 48 VAQSHFK--VLEGNRIQKNDVYYKLNI--KPNTDLSSDSFWQQVIYKLIYDNEVLI-VVSD-S-----KELLIADSFYRE 116 (395) T ss_pred hccceeE--eccCCccccchHHHHHHh--ccCcCCCHHHHHHHHHHHHhhCCceEE-EEec-C-----CCeEecCCccce Confidence 9888654 333222222233333321 1112234566777666544 4554443 2221 1 122222111111 Q ss_pred CchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHHHH Q lcl|NC_017974. 159 PQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYR 238 (507) Q Consensus 159 ~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~ 238 (507) + ....++. +... ..........+|++.+|++++....+..+|.|++..+.- T Consensus 117 ~---------~~~~~~~------------~~~~--------~~~~~~~~~~~~~~evih~~~~~~~~~~~G~spi~~~~~ 167 (395) T protein:vir:95 117 E---------YALYDDI------------FKDV--------TVKDYTYQRTFTMQEVIYLKYNNNKVTHFVESLFEDYGK 167 (395) T ss_pred e---------EeecCcc------------eeEE--------EEcCceeeeeeccccEEEEccCCCCcccccchHHHHHHH Confidence 0 0000000 0000 000111123577888887777777888999999887753 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHh-hcCCceeEeecCccccccccc Q lcl|NC_017974. 239 AWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANA-HSGEQSFFMLPSDTKDNAPQY 317 (507) Q Consensus 239 ~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~-~~G~~a~~iiP~g~~i~~~~~ 317 (507) .. .....+..+.|.+--++++|. ...+++..+.+++..+...+. .++..+.++++.|+++.-.+ T Consensus 168 ~~-------~~~~~~~~~~~~~~gii~~~~-------~~~~~e~~~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~- 232 (395) T protein:vir:95 168 IF-------GRMIGAQLKNYQIRGILKSAS-------SAYDEKNIEKLQAFTNKLFNTFNKNQLAIAPLIEGFDYEELS- 232 (395) T ss_pred HH-------HHHHHHHHhcCCCceEEEeCC-------CCCCHHHHHHHHHHHHHHhccccccCcceEEcCCCceeeecc- Confidence 32 222233344454443443332 222233333344433333222 22222333457777643221 Q ss_pred ceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 318 SMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYN-LSESKQTIHTQFVQRVNEIILEALNENLLPQL 396 (507) Q Consensus 318 ~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~A-l~~vh~~v~~~~~~aD~~~i~~~ln~~li~~l 396 (507) +...........|.+..++..++|++++--..--. +|+++ ..+........-+.--+..|+..||+.|+.+- T Consensus 233 ---~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l----~~~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~kL~~~~ 305 (395) T protein:vir:95 233 ---NGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLI----YGETADLEKNTLVFEKFCLTPLLKKIQNELNAKLITQS 305 (395) T ss_pred ---ccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHh----cCcccCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcChh Confidence 11111111112355566788888999776543212 13333 23444555556677788888888988775542 Q ss_pred HHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCc--ccc-cchhhh---c Q lcl|NC_017974. 397 LALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEE--DIS-QEELMK---L 470 (507) Q Consensus 397 ~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~--~~~-~~~~~~---~ 470 (507) -.. .. -+|.++.....|.++.++++.+++..|++.+ +++|+.+|+|+-.++ +.. .+.... . T Consensus 306 ~~~-----~~---~~f~~~~l~~~D~~~~~~~~~~~~~~G~lt~-----NE~R~~~g~~p~~~g~~d~~~~~~n~~~~~~ 372 (395) T protein:vir:95 306 MYL-----KD---TRIEIVGVNKKDPLQYAEAIDKLVSSGSFTR-----NEVRIMLGEEPSDNPELDEYLITKNYEKANS 372 (395) T ss_pred hhc-----cc---ceecchhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCCCceeeeccccccccc Confidence 111 11 1455666667889999999999999998876 579999999876544 221 111000 0 Q ss_pred CCCcCCCCCCcccccCcccccccccCCCCchhhh Q lcl|NC_017974. 471 LGEDTSRAGDGMTKGSSGNGTGKISSTRDNSAAN 504 (507) Q Consensus 471 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~a~~ 504 (507) ..+...+..+...++...++.+ | T Consensus 373 ~~~~~~~~~~~~~kgg~~~~~g-----------~ 395 (395) T protein:vir:95 373 GENDEKEKDENTLKGGDEDESG-----------D 395 (395) T ss_pred cccccCcccccccCCCCCCCCC-----------C Confidence 0011111111111111111111 0 No 76 >protein:vir:101289 Length: 395 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908829;genbank:gi:118725093;genbank:GeneID:4555860 Probab=99.44 E-value=4.6e-12 Score=82.76 Aligned_cols=385 Identities=12% Similarity=0.072 Sum_probs=176.4 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHH-HHHhhChHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTV-ESMKQDATVATGLDMLYTF 79 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y-~~m~~D~~v~s~l~~Rk~a 79 (507) |.-..+.-+.... + . ... .+.-...++ +..++-+.|.+|+..+-.. T Consensus 1 Mg~f~~lf~~~~~--~-----~--------~~~------------------~~~~~~~v~~~~~~~~~~v~~~i~~Ia~~ 47 (395) T protein:vir:10 1 MSILEKIFKTRKD--I-----T--------YML------------------DLDMIEDLSQQAYVKRLAIDSCIEFVARA 47 (395) T ss_pred CchhhhhhccCcc--c-----c--------ccc------------------cchhccccchhhhhhhHHHHHHHHHHHHh Confidence 4443221111000 0 0 000 000001111 1234568899999999999 Q ss_pred HhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHHHHH-hhcceeeeeeeeccCccccCcceeeeecccc Q lcl|NC_017974. 80 VEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAATFN-EYGLSVVEKVYTQVTVGEYIGKYKVKNLAFR 158 (507) Q Consensus 80 v~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l~a~-~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~r 158 (507) |.++++. +..+....+.....++.. +--...++.++++.++..+ +.|-++. ++.. + .+.+.+...... T Consensus 48 iA~~p~~--~~~~~~~~~~~~~~ll~~--~PN~~~t~~~f~~~~~~~lll~g~~~~-~~~~-~-----~~~~~~~~~~~~ 116 (395) T protein:vir:10 48 VAQSHFK--VLEGNRIQKNDVYYKLNI--KPNTDLSSDSFWQQVIYKLIYDNEVLI-VVSD-S-----KELLIADSFYRE 116 (395) T ss_pred hccceeE--eccCCccccchHHHHHHh--ccCcCCCHHHHHHHHHHHHhhCCceEE-EEec-C-----CCeEecCCccce Confidence 9888654 333222222233333321 1112234566777666544 4554443 2221 1 122222111111 Q ss_pred CchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHHHH Q lcl|NC_017974. 159 PQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYR 238 (507) Q Consensus 159 ~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~ 238 (507) + ....++. +... ..........+|++.+|++++....+..+|.|++..+.- T Consensus 117 ~---------~~~~~~~------------~~~~--------~~~~~~~~~~~~~~evih~~~~~~~~~~~G~spi~~~~~ 167 (395) T protein:vir:10 117 E---------YALYDDI------------FKDV--------TVKDYTYQRTFTMQEVIYLKYNNNKVTHFVESLFEDYGK 167 (395) T ss_pred e---------EeecCcc------------eeEE--------EEcCceeeeeeccccEEEEccCCCCcccccchHHHHHHH Confidence 0 0000000 0000 000111123577888887777777888999999887753 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHh-hcCCceeEeecCccccccccc Q lcl|NC_017974. 239 AWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANA-HSGEQSFFMLPSDTKDNAPQY 317 (507) Q Consensus 239 ~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~-~~G~~a~~iiP~g~~i~~~~~ 317 (507) .. .....+..+.|.+--++++|. ...+++..+.+++..+...+. .++..+.++++.|+++.-.+ T Consensus 168 ~~-------~~~~~~~~~~~~~~gii~~~~-------~~~~~e~~~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~- 232 (395) T protein:vir:10 168 IF-------GRMIGAQLKNYQIRGILKSAS-------SAYDEKNIEKLQAFTNKLFNTFNKNQLAIAPLIEGFDYEELS- 232 (395) T ss_pred HH-------HHHHHHHHhcCCCceEEEeCC-------CCCCHHHHHHHHHHHHHHhccccccCcceEEcCCCceeeecc- Confidence 32 222233344454443443332 222233333344433333222 22222333457777643221 Q ss_pred ceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 318 SMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYN-LSESKQTIHTQFVQRVNEIILEALNENLLPQL 396 (507) Q Consensus 318 ~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~A-l~~vh~~v~~~~~~aD~~~i~~~ln~~li~~l 396 (507) +...........|.+..++..++|++++--..--. +|+++ ..+........-+.--+..|+..||+.|+.+- T Consensus 233 ---~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l----~~~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~kL~~~~ 305 (395) T protein:vir:10 233 ---NGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLI----YGETADLEKNTLVFEKFCLTPLLKKIQNELNAKLITQS 305 (395) T ss_pred ---ccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHh----cCcccCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcChh Confidence 11111111112355566788888999776543212 13333 23444555556677788888888988775542 Q ss_pred HHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCc--ccc-cchhhh---c Q lcl|NC_017974. 397 LALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEE--DIS-QEELMK---L 470 (507) Q Consensus 397 ~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~--~~~-~~~~~~---~ 470 (507) -.. .. -+|.++.....|.++.++++.+++..|++.+ +++|+.+|+|+-.++ +.. .+.... . T Consensus 306 ~~~-----~~---~~f~~~~l~~~D~~~~~~~~~~~~~~G~lt~-----NE~R~~~g~~p~~~g~~d~~~~~~n~~~~~~ 372 (395) T protein:vir:10 306 MYL-----KD---TRIEIVGVNKKDPLQYAEAIDKLVSSGSFTR-----NEVRIMLGEEPSDNPELDEYLITKNYEKANS 372 (395) T ss_pred hhc-----cc---ceecchhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCCCceeeeccccccccc Confidence 111 11 1455666667889999999999999998876 579999999876544 221 111000 0 Q ss_pred CCCcCCCCCCcccccCcccccccccCCCCchhhh Q lcl|NC_017974. 471 LGEDTSRAGDGMTKGSSGNGTGKISSTRDNSAAN 504 (507) Q Consensus 471 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~a~~ 504 (507) ..+...+..+...++...++.+ | T Consensus 373 ~~~~~~~~~~~~~kgg~~~~~g-----------~ 395 (395) T protein:vir:10 373 GENDEKEKDENTLKGGDEDESG-----------D 395 (395) T ss_pred cccccCcccccccCCCCCCCCC-----------C Confidence 0011111111111111111111 0 No 77 >protein:vir:100650 Length: 395 # NCBI annotation: 77ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958604;genbank:gi:41189523;genbank:GeneID:2743796 Probab=99.44 E-value=4.6e-12 Score=82.76 Aligned_cols=385 Identities=12% Similarity=0.072 Sum_probs=176.4 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHH-HHHhhChHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTV-ESMKQDATVATGLDMLYTF 79 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y-~~m~~D~~v~s~l~~Rk~a 79 (507) |.-..+.-+.... + . ... .+.-...++ +..++-+.|.+|+..+-.. T Consensus 1 Mg~f~~lf~~~~~--~-----~--------~~~------------------~~~~~~~v~~~~~~~~~~v~~~i~~Ia~~ 47 (395) T protein:vir:10 1 MSILEKIFKTRKD--I-----T--------YML------------------DLDMIEDLSQQAYVKRLAIDSCIEFVARA 47 (395) T ss_pred CchhhhhhccCcc--c-----c--------ccc------------------cchhccccchhhhhhhHHHHHHHHHHHHh Confidence 4443221111000 0 0 000 000001111 1234568899999999999 Q ss_pred HhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHHHHH-hhcceeeeeeeeccCccccCcceeeeecccc Q lcl|NC_017974. 80 VEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAATFN-EYGLSVVEKVYTQVTVGEYIGKYKVKNLAFR 158 (507) Q Consensus 80 v~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l~a~-~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~r 158 (507) |.++++. +..+....+.....++.. +--...++.++++.++..+ +.|-++. ++.. + .+.+.+...... T Consensus 48 iA~~p~~--~~~~~~~~~~~~~~ll~~--~PN~~~t~~~f~~~~~~~lll~g~~~~-~~~~-~-----~~~~~~~~~~~~ 116 (395) T protein:vir:10 48 VAQSHFK--VLEGNRIQKNDVYYKLNI--KPNTDLSSDSFWQQVIYKLIYDNEVLI-VVSD-S-----KELLIADSFYRE 116 (395) T ss_pred hccceeE--eccCCccccchHHHHHHh--ccCcCCCHHHHHHHHHHHHhhCCceEE-EEec-C-----CCeEecCCccce Confidence 9888654 333222222233333321 1112234566777666544 4554443 2221 1 122222111111 Q ss_pred CchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHHHH Q lcl|NC_017974. 159 PQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYR 238 (507) Q Consensus 159 ~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~ 238 (507) + ....++. +... ..........+|++.+|++++....+..+|.|++..+.- T Consensus 117 ~---------~~~~~~~------------~~~~--------~~~~~~~~~~~~~~evih~~~~~~~~~~~G~spi~~~~~ 167 (395) T protein:vir:10 117 E---------YALYDDI------------FKDV--------TVKDYTYQRTFTMQEVIYLKYNNNKVTHFVESLFEDYGK 167 (395) T ss_pred e---------EeecCcc------------eeEE--------EEcCceeeeeeccccEEEEccCCCCcccccchHHHHHHH Confidence 0 0000000 0000 000111123577888887777777888999999887753 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHh-hcCCceeEeecCccccccccc Q lcl|NC_017974. 239 AWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANA-HSGEQSFFMLPSDTKDNAPQY 317 (507) Q Consensus 239 ~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~-~~G~~a~~iiP~g~~i~~~~~ 317 (507) .. .....+..+.|.+--++++|. ...+++..+.+++..+...+. .++..+.++++.|+++.-.+ T Consensus 168 ~~-------~~~~~~~~~~~~~~gii~~~~-------~~~~~e~~~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~- 232 (395) T protein:vir:10 168 IF-------GRMIGAQLKNYQIRGILKSAS-------SAYDEKNIEKLQAFTNKLFNTFNKNQLAIAPLIEGFDYEELS- 232 (395) T ss_pred HH-------HHHHHHHHhcCCCceEEEeCC-------CCCCHHHHHHHHHHHHHHhccccccCcceEEcCCCceeeecc- Confidence 32 222233344454443443332 222233333344433333222 22222333457777643221 Q ss_pred ceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 318 SMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYN-LSESKQTIHTQFVQRVNEIILEALNENLLPQL 396 (507) Q Consensus 318 ~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~A-l~~vh~~v~~~~~~aD~~~i~~~ln~~li~~l 396 (507) +...........|.+..++..++|++++--..--. +|+++ ..+........-+.--+..|+..||+.|+.+- T Consensus 233 ---~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l----~~~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~kL~~~~ 305 (395) T protein:vir:10 233 ---NGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLI----YGETADLEKNTLVFEKFCLTPLLKKIQNELNAKLITQS 305 (395) T ss_pred ---ccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHh----cCcccCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcChh Confidence 11111111112355566788888999776543212 13333 23444555556677788888888988775542 Q ss_pred HHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCc--ccc-cchhhh---c Q lcl|NC_017974. 397 LALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEE--DIS-QEELMK---L 470 (507) Q Consensus 397 ~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~--~~~-~~~~~~---~ 470 (507) -.. .. -+|.++.....|.++.++++.+++..|++.+ +++|+.+|+|+-.++ +.. .+.... . T Consensus 306 ~~~-----~~---~~f~~~~l~~~D~~~~~~~~~~~~~~G~lt~-----NE~R~~~g~~p~~~g~~d~~~~~~n~~~~~~ 372 (395) T protein:vir:10 306 MYL-----KD---TRIEIVGVNKKDPLQYAEAIDKLVSSGSFTR-----NEVRIMLGEEPSDNPELDEYLITKNYEKANS 372 (395) T ss_pred hhc-----cc---ceecchhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCCCceeeeccccccccc Confidence 111 11 1455666667889999999999999998876 579999999876544 221 111000 0 Q ss_pred CCCcCCCCCCcccccCcccccccccCCCCchhhh Q lcl|NC_017974. 471 LGEDTSRAGDGMTKGSSGNGTGKISSTRDNSAAN 504 (507) Q Consensus 471 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~a~~ 504 (507) ..+...+..+...++...++.+ | T Consensus 373 ~~~~~~~~~~~~~kgg~~~~~g-----------~ 395 (395) T protein:vir:10 373 GENDEKEKDENTLKGGDEDESG-----------D 395 (395) T ss_pred cccccCcccccccCCCCCCCCC-----------C Confidence 0011111111111111111111 0 No 78 >protein:vir:3868 Length: 417 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680485;swissprot:trembl:q8ltc2;genbank:gi:22296525;interpro:IPR006427;interpro:IPR006944;uniprot:Q8LTC2;genbank:GeneID:951699 Probab=99.41 E-value=1e-11 Score=80.87 Aligned_cols=397 Identities=12% Similarity=0.046 Sum_probs=186.3 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTFV 80 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~av 80 (507) |.=-|....-....-. ..+.+.| . .+..+.. ... ...++-+.|.+|+..+...| T Consensus 1 m~~~~~~~~~~~~~~~---------~~~~~~~---~-----------~~~~~g~-~~~--~~Al~~~~V~~cv~~ia~~i 54 (417) T protein:vir:38 1 MKLFRGLATEVDPHWA---------DHLLDSG---V-----------IPSFRGG-YLG--ISALRNSDVLTAVSIVSGDV 54 (417) T ss_pred CccccccccCCCccch---------hhhcccc---c-----------ccccCCc-eec--hhhcccHHHHHHHHHHHHhh Confidence 4332211110000000 0000000 0 0000100 000 11245678999999999999 Q ss_pred hhCCceeeeCCCCCh-hHHHHHHHHHHHHHhccC--CCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeeeecc Q lcl|NC_017974. 81 EKAFKDFKVIPGESE-ESKKAAKFVEYCLKNMEG--QTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLA 156 (507) Q Consensus 81 ~~~~~~~~~~~~~~~-~~~~~ae~v~~~l~~~~~--~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~ 156 (507) .++++.+.....+.. .... +...|...++ .++.++++.++ +.+++|.+..+++....++ .+..|. T Consensus 55 A~lp~~~~~~~~~~~~~~~~----~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~y~~i~r~~~g~-------~~~~l~ 123 (417) T protein:vir:38 55 SRFPLVITDSSTDEVIDLAN----IEYLMNTKVNKRLSAYQWKFPMMVNAILTGNAYSRIVRDPITN-------EPAMFE 123 (417) T ss_pred ccCeeEEEEcCCcceeccch----HHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCC-------EEEEEE Confidence 888766543332211 1112 2333432222 24556777655 4678999999988643221 233344 Q ss_pred ccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHH Q lcl|NC_017974. 157 FRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGC 236 (507) Q Consensus 157 ~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~ 236 (507) +.++.... +....+++ +.. .+.. ........+|...+|++++.+ .+..+|.|.+..+ T Consensus 124 ~l~p~~v~----v~~~~~~~-~~y---------~~~~--------~~~~~~~~~~~~dviH~r~~~-~d~~~G~s~l~~~ 180 (417) T protein:vir:38 124 FYAPSQTQ----VDTSDPDN-IIY---------RFTP--------YNSSMQKVCGFEDVIHWKFFS-YDTIMGRSPLLSL 180 (417) T ss_pred EeCCceEE----EEEcCCCe-EEE---------EEEE--------cCCcEEEEecCcceEEecCCC-CCCccccCHHHHH Confidence 44444332 11122221 110 0000 011122345666777777654 4447899999998 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCce--eEeecCcccccc Q lcl|NC_017974. 237 YRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQS--FFMLPSDTKDNA 314 (507) Q Consensus 237 ~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a--~~iiP~g~~i~~ 314 (507) .-..-.-....++...|...-+.|--+++.|.. -++++..++.+.+. ....|..+ .++++.|++.. T Consensus 181 ~~~i~~~~~~~~~~~~~f~ng~~p~~il~~~~~-------l~~e~~~~~~~~~~----~~~~g~n~g~~~vl~~g~~~~- 248 (417) T protein:vir:38 181 GDEIGLQESGVSTLQKFFKSGLKGSIIKAKESR-------LSAEARQKIREDFE----RAQAGADAGSPIIVDATMDYQ- 248 (417) T ss_pred HHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCC-------CCHHHHHHHHHHHH----HHhcccccCCceeccCCceEE- Confidence 877767777777778888765555555554422 23333333323332 22333222 46778887533 Q ss_pred cccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHH-HHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 315 PQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLS-ESKQTIHTQFVQRVNEIILEALNENLL 393 (507) Q Consensus 315 ~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~-~vh~~v~~~~~~aD~~~i~~~ln~~li 393 (507) ..+-+.....|.+..++..++|++++.-..--.+. .++++-. +........-+.-.++.|++.||+.|+ T Consensus 249 --------~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~--~~~~s~~e~~~~~~~~~tl~P~~~~ie~~l~~~Ll 318 (417) T protein:vir:38 249 --------PLEVDTNVLNLINSNNYSTAQIAKALRVPAYRLAQ--NSPNQSVKQLADDYIRNDLPFYFEPITSEFELKLL 318 (417) T ss_pred --------EccCCHHHHHHHHHHHhhHHHHHHHhCCCHHHhCC--CCcchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhc Confidence 22222222346667778888999866444322222 2233322 222333445677778888888887775 Q ss_pred HHHHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCccc--ccchhh--- Q lcl|NC_017974. 394 PQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDI--SQEELM--- 468 (507) Q Consensus 394 ~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~--~~~~~~--- 468 (507) .+.-.. + -.|.|+... .+ ....+.++++++.|++.+ +.+|+.+|+|+-.+++. ...... T Consensus 319 ~~~~~~------~---~~~~fd~~~-l~-~~~~~~~~~~~~~G~~T~-----NE~R~~~gl~pi~~g~~d~~~~~~n~~~ 382 (417) T protein:vir:38 319 DDAQRH------Q---YCIGFDTKS-VN-GLPIADVNTAVNGGLWTG-----NEGRAELGKKPLKDPNMDRIQSTLNTVF 382 (417) T ss_pred Chhhcc------c---ceEEechhh-hh-HHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCCCCeeeecccccc Confidence 543211 1 147776443 22 222445788899998776 57999999986544421 111000 Q ss_pred ------hcCCCcCC-CCCCcccccCcccccccccCCCCchhhhccC Q lcl|NC_017974. 469 ------KLLGEDTS-RAGDGMTKGSSGNGTGKISSTRDNSAANLDN 507 (507) Q Consensus 469 ------~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~d~~a~~~~~ 507 (507) ...++... +.+++.. ++.. +...+-+| T Consensus 383 ~d~~~~~~~~~~~~~kgg~~~~-----~~~~-------~~~~~~~~ 416 (417) T protein:vir:38 383 LDQKEAYQAEHAAELKGGDTNA-----KGNQ-------NGSGTNAN 416 (417) T ss_pred cccccccccccccccCCCCCCC-----CCCC-------cCCCCcCC Confidence 00010000 1111100 0000 00000111 No 79 >protein:vir:6210 Length: 394 # NCBI annotation: Portal protein # Family: family:all:10882 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852590;genbank:gi:31415850;genbank:GeneID:1489208 Probab=99.40 E-value=3.4e-11 Score=78.01 Aligned_cols=386 Identities=13% Similarity=0.013 Sum_probs=187.6 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTFV 80 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~av 80 (507) +++-+++.. +.+.+... .. +..+..... .+..+ .-+..++-+.|.+|+..+...| T Consensus 4 ~~~~~~~~~--~~~~~~~~--------~~--~~~~~~~~~------------~~~~v-t~~~al~~~~v~~~i~~Ia~~i 58 (394) T protein:vir:62 4 RDRFSNYLF--KKAEKRGY--------LD--NVLGKSIRY------------SGVYV-TDSNILQSSDVYELLQDISNQM 58 (394) T ss_pred hhhhhhhcc--CCCCchhh--------hh--hhhhccccc------------Ccccc-ChhhhhccHHHHHHHHHHHHhh Confidence 334333221 11111100 00 000000000 00000 0122346688999999999999 Q ss_pred hhCCceeeeCCCCChhHHHHHHH-HHHHHHhccC--CCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeeeecc Q lcl|NC_017974. 81 EKAFKDFKVIPGESEESKKAAKF-VEYCLKNMEG--QTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLA 156 (507) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~ae~-v~~~l~~~~~--~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~ 156 (507) .+++|.+....+. ++.+. +..++.+ ++ .++.+++..++ +.+++|-+.+.+.- +....+..+. T Consensus 59 A~lp~~v~~~~g~-----~~~~~~~~~Ll~~-PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~--------~~~~~~~~~~ 124 (394) T protein:vir:62 59 VLADIVVEDEFGN-----EIKDDIALQILRN-PNNYLTQSEFIKLMTNTYLLEGETFPILNG--------AQIHLASNVF 124 (394) T ss_pred cccceEEEcCCCc-----ccchhhHHHHhcc-CCCCCCHHHHHHHHHHHHHhcCCeEEEEec--------ceeeccccce Confidence 9998776543322 12222 2223322 22 23456666544 56779998876521 1111111110 Q ss_pred ccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHH Q lcl|NC_017974. 157 FRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGC 236 (507) Q Consensus 157 ~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~ 236 (507) ...+.++. .. ....+..+|.+.++++++.. .+..+|.|++..+ T Consensus 125 ------------~~~~~~~~------------~~------------~~~~~~~~~~~eiih~r~~~-~d~~~G~s~~~~~ 167 (394) T protein:vir:62 125 ------------TELDDNLV------------EH------------FNIGGHEIPPCMIRHVKNIG-ADHLRGKGILDLG 167 (394) T ss_pred ------------EEECCceE------------EE------------EeeCCEEechhheEEecCcC-CCCccccChHHHH Confidence 11111111 00 01124557777776666554 4557899999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccc Q lcl|NC_017974. 237 YRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQ 316 (507) Q Consensus 237 ~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~ 316 (507) .-..-.-....++...+...-+.|--++++|..+. .+++......+++.+..... ......+++|.|++.++.. T Consensus 168 ~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~-----~~~~~~~~~~~~~~~~~~g~-~n~g~~~vl~~g~~~~~~~ 241 (394) T protein:vir:62 168 RDTLEGVMSAEKTLTDKYKKGGLLTFLLNLDAHIN-----PQNGAQSKLINAILDQLESI-DEARSVKMIPLGKGYSIDT 241 (394) T ss_pred HHHHHHHHHHHHHHHHHHHccCCcceEEEeCCCCC-----cCHHHHHHHHHHHHHHhccc-cccCceeEeeCCCceeEEe Confidence 87766666677777788777666666666553321 12222222223332221110 0112336888888655432 Q ss_pred cceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 317 YSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQTIHTQFVQRVNEIILEALNENLLPQL 396 (507) Q Consensus 317 ~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~li~~l 396 (507) . +.+.....+.+..++..++|++++--..--.+.. .+|.+ .+........-+.-.++.|+..||+.|+.+- T Consensus 242 l-------~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~-~~sn~-e~~~~~~~~~~l~P~~~~ie~~l~~kll~~~ 312 (394) T protein:vir:62 242 L-------KSPLDDEKTLAYLNVYKKDLGKFLGINVDTYTEL-IKEDI-EKAMMYIHNKAVRPIMKNFEDHLSLLFYAQN 312 (394) T ss_pred c-------CCCcchHHHHHHHHHHHHHHHHHhCCCHHHcCCC-CCcCH-HHHHHHHHHHHHHHHHHHHHHHHhhhhcCcc Confidence 1 1222222355666788899999876554333221 12221 2223334455666777888888886554321 Q ss_pred HHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCccc-ccchhhhcCCCcC Q lcl|NC_017974. 397 LALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDI-SQEELMKLLGEDT 475 (507) Q Consensus 397 ~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~-~~~~~~~~~~~~~ 475 (507) +...-+|.|+...-.+.+..++++.+++..|++.+ +.+|+.+|+|+-.+++- ..-......+-.. T Consensus 313 ---------~~~~~~~~fd~~~~~~~~~~~~~~~~~~~~g~~T~-----NE~R~~~gl~p~~~~~gd~~~~~~n~~~~~~ 378 (394) T protein:vir:62 313 ---------SGKRIKFKINILDFVTYSNKTNIGYNLVRTAITSP-----DNVADMLGFPKQNTKESQAIYISNDVTEIGK 378 (394) T ss_pred ---------ccCceEEEechhhhcCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCCCCeeecccccccccc Confidence 11123578887777777788999999999998776 57999999987533221 1111111111111 Q ss_pred CCCCCcccccCcccccccccCCCCchhhhccC Q lcl|NC_017974. 476 SRAGDGMTKGSSGNGTGKISSTRDNSAANLDN 507 (507) Q Consensus 476 ~~~~~~~~~~~~~~~~~~~~~~~d~~a~~~~~ 507 (507) ....+...++... -+| T Consensus 379 ~~~~~~~~kgge~----------------~en 394 (394) T protein:vir:62 379 KEATDGSLGGGEE----------------NEN 394 (394) T ss_pred cccccccCCCCCC----------------CCC Confidence 1000000001000 011 No 80 >protein:vir:8317 Length: 409 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817885;genbank:gi:29566318;genbank:GeneID:1259513 Probab=99.38 E-value=1.3e-11 Score=80.25 Aligned_cols=371 Identities=11% Similarity=0.010 Sum_probs=188.0 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTFV 80 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~av 80 (507) |.+.++........ . .+.......+ |.. ..+. . +-...+. =+.+++-+.|.+|+..+-..| T Consensus 32 ~~~~~~~~~~~~~~-----~--~~~~~~~~~~--g~~--~~~~----~---~~~~~~t-~~~~~~~~~v~acV~~Ia~~i 92 (409) T protein:vir:83 32 MVEFRGPEEEPEAR-----A--LPWIRPTAWS--GYP--ESWA----T---PSWGSAQ-DKLRTLIDVAWACIDLNASVL 92 (409) T ss_pred eeeccCCCcchhhh-----h--cccccccccc--ccc--cccc----c---cCccccc-hhhHhhhHHHHHHHHHHHHhh Confidence 22221111111000 0 0000000000 000 0000 0 0000000 133556789999999999999 Q ss_pred hhCCceeeeCCCCChhHHHHHHHHHHHHHhccC--CCHHHHHHHHHHHHhhcceeeeeeeeccCccccCcceeeeecccc Q lcl|NC_017974. 81 EKAFKDFKVIPGESEESKKAAKFVEYCLKNMEG--QTLRQFARDAATFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAFR 158 (507) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~--~~~~~~l~~~l~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~r 158 (507) .++++...-. + ...+ . +...|+.-++ .++.+++..++..+..|-+..+++-+. .+|. +..|.+. T Consensus 93 A~lpl~~~~~-~-~~~~-~----~~~ll~~~PN~~~t~~~f~~~l~~~lllGnay~~~i~r~-----~~G~--~~~L~pl 158 (409) T protein:vir:83 93 SSMPIYRMRN-G-RIID-S----VAWMSNPDPEVYTSWQEFAKQLFWDFQLGEAFVLPMAHG-----SDGY--PIRFRVV 158 (409) T ss_pred ccCceEEeeC-C-cccc-c----hhhhcccCCCCCCCHHHHHHHHHHHHhhCCcEEEEEEEC-----CCCc--EEEEEEE Confidence 9887654322 1 1111 1 1223332222 356677777766666688877654321 1232 4456666 Q ss_pred CchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHHHH Q lcl|NC_017974. 159 PQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYR 238 (507) Q Consensus 159 ~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~ 238 (507) ++.++. +..+.||+.. +.+. +...+ +.+|++++....+..||.|.+..+.. T Consensus 159 ~p~~v~----v~~~~~g~~~----------y~~~--------------~~~~~-~eiiHir~~~~~~~~~G~spi~~~~~ 209 (409) T protein:vir:83 159 PPWLVN----VELKKGARRE----------YRIG--------------GLNVT-DEILHIRYQGNTADAHGHGPLESAAP 209 (409) T ss_pred CCcceE----EEEcCCceEE----------EEEc--------------cccCc-cceEEeCCCCCCCCcccccHHHHHHH Confidence 665432 2333443211 0000 11112 22343345555677899999999998 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCc-eeEeecCccccccccc Q lcl|NC_017974. 239 AWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQ-SFFMLPSDTKDNAPQY 317 (507) Q Consensus 239 ~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~-a~~iiP~g~~i~~~~~ 317 (507) ..-......++-..|...-+.|--++++| ..-+++...++.+...+ ...|+. ..+++..|++..- T Consensus 210 ~i~~~~a~~~~~~~~f~nga~p~gil~~~-------~~ls~e~~~~~~~~~~~----~~~~nag~~~il~~g~~~~~--- 275 (409) T protein:vir:83 210 RQVVIGLLQKYVQNLAETGGVPLYWLGVE-------RRLSETEAVDLMDRWIE----SRSKYAGHPALVTGGATLNQ--- 275 (409) T ss_pred HHHHHHHHHHHHHHHHhcCCCcceEeecC-------CCCCHHHHHHHHHHHHH----hhCCccCccceecCCccccc--- Confidence 88887777777777777555555555443 33334444443333332 222322 2366667765311 Q ss_pred ceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccc---cchhhhhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 318 SMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGN---DKGGSYNLSE-SKQTIHTQFVQRVNEIILEALNENLL 393 (507) Q Consensus 318 ~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~---~~~GS~Al~~-vh~~v~~~~~~aD~~~i~~~ln~~li 393 (507) .+ +-+.....|.+.-++..++|++++.-...-.+. +...+|+-.+ ........-+.-.++.|+..||+.|+ T Consensus 276 ---~~--~~s~~d~q~le~r~~~~~eIa~~fgVPp~llg~~~~~~~~tysn~eq~~~~f~~~tL~P~~~~ie~~l~~~Ll 350 (409) T protein:vir:83 276 ---AK--SMSAQDLSLMELTQFNEARIAILLGVPPFLVGLPGATGSLTYSNIEQLFSFHDRSSLRPKATAVMAALDRWAL 350 (409) T ss_pred ---cc--CCCHHHHHHHHHHHhhHHHHHHHhCCCHHHccCCCCccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhC Confidence 11 112222235556677888999988765433331 1222344323 23333344566678888888887664 Q ss_pred HHHHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhhcCCC Q lcl|NC_017974. 394 PQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMKLLGE 473 (507) Q Consensus 394 ~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~~~~~ 473 (507) +. ..+-+|.++.....|+++.++.++++++.|++.+ +++|+..|+|+...+|..... + T Consensus 351 ~~-----------~~~~~f~~~~llr~d~~~r~~~~~~~~~~G~lT~-----NE~R~~~glpp~~ggd~l~~~------g 408 (409) T protein:vir:83 351 PS-----------PQHLELNRDDYTRPSLVERATAYKIMIEAGVMEP-----NEARAMERLHSEAAAVRLSGG------G 408 (409) T ss_pred CC-----------CcEEEeehhhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCcccCCC------C Confidence 21 1222344455556888999999999999999876 579999999865554432100 0 Q ss_pred cCCCCCCcccccCc Q lcl|NC_017974. 474 DTSRAGDGMTKGSS 487 (507) Q Consensus 474 ~~~~~~~~~~~~~~ 487 (507) . T Consensus 409 -------------v 409 (409) T protein:vir:83 409 -------------V 409 (409) T ss_pred -------------C Confidence 0 No 81 >protein:vir:4854 Length: 386 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049394;genbank:gi:9632422;genbank:GeneID:1258515 Probab=99.37 E-value=4.9e-11 Score=77.09 Aligned_cols=385 Identities=12% Similarity=0.040 Sum_probs=193.6 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTFV 80 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~av 80 (507) |.--++.++....+..+... .. ... + ....... +....+ ..+..++-+.|.+|+..+...| T Consensus 1 M~~f~~~~~~~~~~~~~~~~--~~--~~~--~------~~~~~~~------~~~~~v-~~~~~~~~~~v~~~i~~ia~~i 61 (386) T protein:vir:48 1 MPIFNITNLATESPPISQGG--FF--DIT--D------PDFLSTL------NGSEWV-SAESALRNSDLFSIINQLSNDL 61 (386) T ss_pred Cccccccccccccccccccc--cc--ccc--c------chhcccc------cCCcee-chhhhhcchHHHHHHHHHHHhh Confidence 66555444433222211100 00 000 0 0000000 111111 1233467899999999999999 Q ss_pred hhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeeeeccccC Q lcl|NC_017974. 81 EKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAFRP 159 (507) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~r~ 159 (507) .++++.+. .. . ...+.... -...++.++++.++ +.+.+|-+++++++... |+ +..|.+.| T Consensus 62 a~~p~~~~--~~--~----~~~l~~~p---N~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~------g~--~~~L~~l~ 122 (386) T protein:vir:48 62 ATVKLTAS--RK--Q----LQGIIDNP---SNNANRFNFYQSIFAQMLLGGEAFAYRWRNEN------GR--DMKWEYLR 122 (386) T ss_pred ccCceeec--cc--h----hHHHhhcC---CCCCCHHHHHHHHHHHhhhcCcEEEEEEECCC------Cc--EEEEEEec Confidence 88865543 11 1 11111111 11235667888766 56779999999988642 22 34455555 Q ss_pred chhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHHHHH Q lcl|NC_017974. 160 QASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYRA 239 (507) Q Consensus 160 ~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~~ 239 (507) +..+. +..+.+|..+... +.... ...+....+|.+.+|++++....+.++|.|.+..+... T Consensus 123 ~~~v~----v~~~~~~~~~~y~---------~~~~~------~~~~~~~~~~~~evih~~~~~~~~~~~G~s~i~~~~~~ 183 (386) T protein:vir:48 123 PSQVS----FNRLDNKDGIYYN---------ITFDD------PRIPPKQHVPQGDVLHFKLLSVDGGLTSVSPLMALSRE 183 (386) T ss_pred CceeE----EEEcCCCceEEEE---------EEecC------ccccceeEecCccEEEecCCCCCCceeeccHHHHHHHH Confidence 54332 1223333222110 00000 01122345677777777766666778999999998877 Q ss_pred HHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccccce Q lcl|NC_017974. 240 WREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYSM 319 (507) Q Consensus 240 ~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~~ 319 (507) .-.-....++...+...-+.|--+++.|. ..+++......+...... .++...++++.|+++. T Consensus 184 i~~~~~~~~~~~~~~~ng~~~~~ii~~~~-------~~~~e~~~~~~~~~~~~~----~n~g~~~vl~~g~~~~------ 246 (386) T protein:vir:48 184 LNIQKASDKLTLNSLKNALNANGILKIKG-------GGLLDFKTKLSRSRQAMK----QMQGGPLVLDDLEEFT------ 246 (386) T ss_pred HHHHHHHHHHHHHHHhccCCcceEEEeCC-------CCCHHHHHHHHHHHHHhh----cCCCCceecCCCceEE------ Confidence 76667777888888887676666666543 233333333333332222 2333457778877543 Q ss_pred eeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_017974. 320 TLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQTIHTQFVQRVNEIILEALNENLLPQLLAL 399 (507) Q Consensus 320 el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~l 399 (507) ..+-+.....|.+..++..++|++++.-...-.+...+++++ .+........-+.--++.|+..||+.|++.+ ++ T Consensus 247 ---~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~-e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~-~~ 321 (386) T protein:vir:48 247 ---PLEIKSNVSQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSS-LEMSLDLYNKAVSRYLRPFLSELSQKLSCDV-DA 321 (386) T ss_pred ---EcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccH-HHHHHHHHHHHHHHHHHHHHHHHHHhhcchh-hc Confidence 222222233466777888889999875544323221122222 2222334445556667888888887765432 12 Q ss_pred CCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhhcCCCcCCCCC Q lcl|NC_017974. 400 NDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMKLLGEDTSRAG 479 (507) Q Consensus 400 N~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~~~~~~~~~~~ 479 (507) + +......|...++..+.+|+..|++.+ +.+|+.+|.+.-.+++ ...-. ..+ ..+.. T Consensus 322 ~-------------~~~~~~~d~~~~~~~~~~l~~~g~~t~-----nE~r~~lg~~~~~~~~-~~~~~---~~~-~~~~~ 378 (386) T protein:vir:48 322 D-------------ILPAVDPTGSNSVSRINSMVKSGTLAQ-----NQGLYILQQAEILPKE-LPEGE---NPN-KTTLK 378 (386) T ss_pred c-------------hhhhhccChHHHHHHHHHHHhCCCcCH-----HHHHHHhhcCCCCCcc-chhhc---CCC-CCccC Confidence 1 111122344566778889999998765 5688988876432222 11100 000 01110 Q ss_pred CcccccCcccc Q lcl|NC_017974. 480 DGMTKGSSGNG 490 (507) Q Consensus 480 ~~~~~~~~~~~ 490 (507) .|.. .+++ T Consensus 379 gGd~---~~~~ 386 (386) T protein:vir:48 379 GGEI---NGED 386 (386) T ss_pred CCCC---CCCC Confidence 0100 1111 No 82 >protein:vir:80134 Length: 403 # NCBI annotation: Phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425602;genbank:gi:155042935;genbank:GeneID:5469563 Probab=99.35 E-value=6.9e-11 Score=76.30 Aligned_cols=388 Identities=13% Similarity=0.109 Sum_probs=185.9 Q ss_pred Cccc---ccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHH Q lcl|NC_017974. 1 MSRK---RNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLY 77 (507) Q Consensus 1 ~~~k---~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk 77 (507) |.=. |++.++ .+... ..- -++..+... .....|..+...+.|.+|+..+- T Consensus 1 Mg~~~~f~~k~~~----~~~~~---~~~-~~~~~~~~~-------------------~~~~~~~~~~~~~~V~~~I~~ia 53 (403) T protein:vir:80 1 MGLFNFFRRKTRS----EPTNA---ISW-FLTQEAYDT-------------------LAIPGYTRLSDNPEVRMAVHKIA 53 (403) T ss_pred Ccccccccccccc----cccch---hhh-hcccccccc-------------------cccchhhhhhhhHHHHHHHHHHH Confidence 5422 221111 11000 000 000000000 00111334555788999999999 Q ss_pred HHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccC--CCHHHHHHHHHH-HHh--hcceeeeeeeeccCccccCcceee Q lcl|NC_017974. 78 TFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEG--QTLRQFARDAAT-FNE--YGLSVVEKVYTQVTVGEYIGKYKV 152 (507) Q Consensus 78 ~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~--~~~~~~l~~~l~-a~~--~Gfs~~Ei~w~~~~~~~~~g~~~~ 152 (507) ..|.++++.+..... +. .+++..-+..+|..-++ .+..++++.++. .+. +|++.++++|... | .+ T Consensus 54 ~~iA~~p~~~~~~~~-~g-~~~~~~~~~~lL~~~PN~~~t~~~f~~~~v~~~ll~~~Gna~i~~~~~~~------g--~~ 123 (403) T protein:vir:80 54 ELISSMTIHLMQNTD-NG-DIRIKNELSRKIDINPYSLMTRKAWMYNIVYTMLLDGEGNSVVFPKYTTS------G--LI 123 (403) T ss_pred HhhhhCceEEEEecC-Cc-eeecCChHHHHHhccCCcCCCHHHHHHHHHHHHhhcCCccEEEEEEEcCC------C--cE Confidence 999888776432211 11 11122223444442222 234566666553 333 6889999887542 2 23 Q ss_pred eeccccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCc-cccch Q lcl|NC_017974. 153 KNLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQ-ALGVS 231 (507) Q Consensus 153 ~~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~-p~G~g 231 (507) ..|.+.++.++. +..+.+|..+ +. .+..+|.+.+++++....+.+ .+|.| T Consensus 124 ~~L~~l~p~~v~----~~~~~~g~~~------------~y-------------~~~~~~~~eiih~~~~~~~~~~~~G~s 174 (403) T protein:vir:80 124 DELIPLAPSKVS----FVDTDTGYQI------------WY-------------QGKAYNYDEVLHFIVNPDPEKPYMGRG 174 (403) T ss_pred EEEEEEcCCeeE----EEEcCCceEE------------EE-------------eecccchhhEEEEeccCCCcCcccccc Confidence 445555554432 1223333111 00 112345566666665444444 45999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCccc Q lcl|NC_017974. 232 PLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTK 311 (507) Q Consensus 232 ll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~ 311 (507) .+..+.-.+-.-....++...|...-+.|--+++.|-.+ ++....+..+++.+...... .+...+++|.+.. T Consensus 175 ~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~-------~~~~~~~~~~~~~~~~~~~~-~~g~~~~~~~~~~ 246 (403) T protein:vir:80 175 YRVVLKDIVNNLKQATTTKKSFMSGKYMPSLIVKVDAAT-------AELSSEEGRNAVFKKYLEAS-EAGQPWIIPAELL 246 (403) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCC-------ChHHHHHHHHHHHHHHhhhh-hcCCeeeeccccc Confidence 888877666666666677777777666666666655322 12222223333333221111 1123356776642 Q ss_pred ccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 312 DNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQTIHTQFVQRVNEIILEALNEN 391 (507) Q Consensus 312 i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~ 391 (507) +.. ++... +.....+.+..++...+|++++--..--.+.+.+++ + ........-+.-.++.|+..||+. T Consensus 247 -~~~----~~~~l--~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~-~---~~~~f~~~~l~P~~~~ie~~l~~k 315 (403) T protein:vir:80 247 -DVE----QVKPL--SLKDLAIHETVELDKRTVAGIFGVPAFLLGVGKYDK-D---EYNNFINSTILPIAKGIEQELTRK 315 (403) T ss_pred -ccc----eeccC--CHHHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCccH-H---HHHHHHHHHHHHHHHHHHHHHHHh Confidence 111 12221 111224567778888899997765541122111111 1 122344455666777777777765 Q ss_pred HHHHHHHhCCCcCCccccceEEec--CcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhh Q lcl|NC_017974. 392 LLPQLLALNDMRLPETEMPYVKAG--EIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMK 469 (507) Q Consensus 392 li~~l~~lN~~~~~~~~~P~~~~~--~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~ 469 (507) |+. +.+. +|+|+ .....|.++.++++.+++..|++.+ +++|+.+|+|+-+.+|....... T Consensus 316 ll~----------~~~~--~~~f~~~~ll~~d~~~~~~~~~~~~~~Gi~t~-----NE~R~~~gl~p~~ggd~~~~~~n- 377 (403) T protein:vir:80 316 LLI----------SPDL--YFKFNPRSLYAYDLKELAEVGSNMYVRGLMEG-----NEVRDWLGLSPKEGLSELVILEN- 377 (403) T ss_pred ccC----------CCCc--EEEeechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCCeEeeccc- Confidence 532 1111 35554 3445788999999999999999876 57999999997554443211111 Q ss_pred cCCCc----CCCCCCcccccCccccccc Q lcl|NC_017974. 470 LLGED----TSRAGDGMTKGSSGNGTGK 493 (507) Q Consensus 470 ~~~~~----~~~~~~~~~~~~~~~~~~~ 493 (507) ..|-+ ......+...++.+ ... T Consensus 378 ~~pl~~~~~~~~~k~ge~~~~~~--~~~ 403 (403) T protein:vir:80 378 YIPLDKIGDQNKLKGGEKGGADG--QTD 403 (403) T ss_pred ccchhhccchhhccCCCCCCCCC--CCC Confidence 11100 00011111111111 110 No 83 >protein:vir:4089 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510984;swissprot:trembl:q8w606;genbank:gi:17488506;uniprot:Q8W606;genbank:GeneID:1260314 Probab=99.35 E-value=3.6e-11 Score=77.86 Aligned_cols=383 Identities=11% Similarity=0.003 Sum_probs=167.6 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHH-HHhhChHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVE-SMKQDATVATGLDMLYTF 79 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~-~m~~D~~v~s~l~~Rk~a 79 (507) |.=+.+.+.-..... .+.. .. ...... . ...++. ..++.+.|.+|+..+... T Consensus 1 Mg~~~~~~~~~~~~~--~~~~-~~--~~~~~~----------~------------~~~~~~~~~l~~~~v~~~v~~Ia~~ 53 (395) T protein:vir:40 1 MGFKSWVSGFFNEEQ--RTLN-LT--DTVWCS----------I------------PSEKLKELSIKKWAIDSCANKIANT 53 (395) T ss_pred CchHHHHHhhhcccc--cccc-cc--cchhhc----------c------------ccccchhhhhhhHHHHHHHHHHHHH Confidence 443333322211000 0000 00 000000 0 001111 124567899999999999 Q ss_pred HhhCCceeeeCCCCChhHHHHHHHHHHHHHhccC--CCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeeeecc Q lcl|NC_017974. 80 VEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEG--QTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLA 156 (507) Q Consensus 80 v~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~--~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~ 156 (507) |.+++|... ..+...... +..+|+.-++ .++.++++.++ +.+++|.+.+.+... +.+....+. T Consensus 54 ia~~p~~~~--~~~~~~~~~----~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~~~~~--------~~~~~~~~~ 119 (395) T protein:vir:40 54 LSCAEVLTY--EKGEEVRKK----NWYMFNVEANQNQNATEFWKKAIYKLVYDNEALIFMQDE--------YIYVADSFT 119 (395) T ss_pred HhhCceeec--cCCccccch----HHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEecC--------ceeecCCcc Confidence 998876543 222222222 3334442222 24556666644 466799888654321 111111111 Q ss_pred ccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHH Q lcl|NC_017974. 157 FRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGC 236 (507) Q Consensus 157 ~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~ 236 (507) ... ..+.. .. + .... .........+|+..++++++....+.+++.++...+ T Consensus 120 ~~~-~~~~~---~~-----------------~-----~~v~---~~~~~~~~~~~~~evih~r~~~~~~~~~~~~l~~~~ 170 (395) T protein:vir:40 120 KND-KSLYE---NT-----------------Y-----TEVT---LKDLTLKKEFKESEVLHLTLNNESIKSIIDGFYLLY 170 (395) T ss_pred ccc-ccccc---ce-----------------e-----eeee---ecCceeeeeeccccEEEeecCCCCccccchhHHHHH Confidence 000 00000 00 0 0000 001111234677777887877777778877776533 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcC--CceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccc Q lcl|NC_017974. 237 YRAWREKILIENLEVVGATKDMG--GVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNA 314 (507) Q Consensus 237 ~~~~~fK~~~~~~w~~f~Er~g~--p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~ 314 (507) .-. +... .....+.+. |..+++.+ ..-+++...+..+.+.+.......+....++++.|+++.- T Consensus 171 ~~~--~~~~-----~~~~~~~~~~~~~l~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vl~~g~~~~~ 236 (395) T protein:vir:40 171 GDL--LTAA-----VNKYKKLNSRKIIVKLKAM-------FGQTPEAEEKLRLMLSERMKKFLAEGDSALPVEDGMEIDE 236 (395) T ss_pred HHH--HHHH-----HHHHHhcCCCCceEEEecc-------cCCCHHHHHHHHHHHHHHHHHhhccCCceeecCCCceEEe Confidence 311 1111 111112222 33333222 2222333333333333333332233334577888876432 Q ss_pred cccceeeeeccCCCcchhHH---HHHHHHHHHHHHHHhcccccccccchhhhhH-HHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 315 PQYSMTLKGIDGMGKQYSTA---QLISDRKKSILDRLGAGFINVGNDKGGSYNL-SESKQTIHTQFVQRVNEIILEALNE 390 (507) Q Consensus 315 ~~~~~el~~~~g~g~~~~~~---~li~~~d~~Isk~iLGqtLts~~~~~GS~Al-~~vh~~v~~~~~~aD~~~i~~~ln~ 390 (507) . +.+.....+. ++-+.+-++|++++.-..--. +|+++- .+........-+.-.++.|++.||+ T Consensus 237 l---------~~~~~d~q~~e~~~~~~~~~~~Ia~~fgVPp~~l----~~~~sn~e~~~~~f~~~~L~P~~~~ie~~l~~ 303 (395) T protein:vir:40 237 L---------AGDSKIAESRDIKKMIDDVFEMVANSFNIPLGLA----KGDTVGLSEQVNSFLMFSINPIAEMFTDEGNR 303 (395) T ss_pred c---------cCChhhhhHHHHHHHHHHHHHHHHHHhCCCHHHh----cCCCcCHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2 1111111232 233334478999876543212 233432 2223334445566677788888887 Q ss_pred HHHHHHHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCC--cccccchhh Q lcl|NC_017974. 391 NLLPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIE--EDISQEELM 468 (507) Q Consensus 391 ~li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~--~~~~~~~~~ 468 (507) .|++.--... ..+-+|.++.....|.++.++++.+++..|++.+ +++|+.+|+|+-.+ .|..-.. . T Consensus 304 kLl~~~~~~~------g~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~-----NE~R~~~g~~pi~~~~gD~~~~~-~ 371 (395) T protein:vir:40 304 KFYGRDSVLE------RTYMKLDTTRIKVQDIQEIASSMDVLFHIGVNTI-----DDNLRMIGREPVMSPETQERFVT-K 371 (395) T ss_pred hcCChhhhcC------CceEEEechhhhccCHHHHHHHHHHHHhCCCCCH-----HHHHHHhCCCCCCCCCCceeeec-c Confidence 7655432211 1122344556667899999999999999998776 57999999986533 2211111 1 Q ss_pred hcCCCcCCCCCCcccccCcccccccccC Q lcl|NC_017974. 469 KLLGEDTSRAGDGMTKGSSGNGTGKISS 496 (507) Q Consensus 469 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 496 (507) ...+-+..+. . .++...++.... + T Consensus 372 n~~~~~~~~~--~-~kgge~~~~~~~-~ 395 (395) T protein:vir:40 372 NYAPLGENEE--D-LKGGDINENKGD-S 395 (395) T ss_pred cccccccccc--c-cCCCCCCCCcCC-C Confidence 0011110000 0 000010000000 0 No 84 >protein:vir:9702 Length: 406 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795464;genbank:gi:28876227;genbank:GeneID:1257772 Probab=99.35 E-value=2.9e-11 Score=78.37 Aligned_cols=395 Identities=13% Similarity=0.024 Sum_probs=186.6 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTFV 80 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~av 80 (507) |+=.+.++.......+ .+..+.. +...+ .... ..-++-+.|.+|+..+-..| T Consensus 1 m~~f~~~~~~~~~~~~---------------------~~~~~~~---~~~~~--~~~~--~~Al~~~~V~~~i~~Ia~~i 52 (406) T protein:vir:97 1 MSFFQPLGTSKVSYDD---------------------YISSVLA---GDVSQ--KYLG--VSALKNSDILTATSIIAGDI 52 (406) T ss_pred CccccccCCCCCCcch---------------------HHHHHhc---CCCCc--cccc--chhhccHHHHHHHHHHHHhh Confidence 6655432211100000 0000000 00000 0011 11245678999999999999 Q ss_pred hhCCceeeeCCCCChhHHHHHHHHHHHHHhc--cCCCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeeeeccc Q lcl|NC_017974. 81 EKAFKDFKVIPGESEESKKAAKFVEYCLKNM--EGQTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAF 157 (507) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~--~~~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~ 157 (507) .++++...-..+....+ .-+..+|+.- +..++.++++.++ +.+++|-+.++++....+ | .+..|.+ T Consensus 53 A~lp~~~~~~~g~~~~~----~~~~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gnay~~i~r~~~~-----g--~~~~L~~ 121 (406) T protein:vir:97 53 ARFPLVKKDVNGDIIHD----EDINYLLNVKSTSNASARTWKFAMAVNAILTGNSFSRILRDPKT-----N--QALQFQF 121 (406) T ss_pred hhCeeEEEecCcccccc----chHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCC-----C--eEEEEEE Confidence 88876543222221111 1244455422 2234667777665 467799999988764221 2 2345666 Q ss_pred cCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHHH Q lcl|NC_017974. 158 RPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCY 237 (507) Q Consensus 158 r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~ 237 (507) .++..+.. ....++ .+... +.. ...+....+|...+|++++.+ .+..+|.|.+..+. T Consensus 122 i~p~~v~v----~~~~~~-~~~y~---------~~~--------~~~~~~~~~~~~evih~r~~~-~dg~~G~spi~~~~ 178 (406) T protein:vir:97 122 YRPSETTV----EETDNH-EIVYT---------FTD--------MLTAKQVKCFAHDVIHWKFFS-HDTILGRSPLLSLG 178 (406) T ss_pred ECCCeeEE----EEcCCc-eEEEE---------EEe--------cCCceEEEEccccEEEecCCC-CCCcccccHHHHHH Confidence 66654431 112222 11110 000 011233457777777776654 34467999998887 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCce--eEeecCccccccc Q lcl|NC_017974. 238 RAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQS--FFMLPSDTKDNAP 315 (507) Q Consensus 238 ~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a--~~iiP~g~~i~~~ 315 (507) -..-.-....++.+.|.+. |++--.+..+ ...-+++...+..+.+.+ ...|..+ .++++.|++..-. T Consensus 179 ~~i~~~~a~~~~~~~~f~n-g~~~~~i~~~------~~~l~~e~~~~~~~~~~~----~~~g~n~g~~~vl~~g~~~~~l 247 (406) T protein:vir:97 179 DEIDLQTGGINTLIKFFKD-GFSSGILTMK------GAQLSGDARQRARQEFEK----MREGSVGGSPLVFDSTMEYTPL 247 (406) T ss_pred HHHHHHHHHHHHHHHHHhc-cCCCceEEec------CCCCCHHHHHHHHHHHHH----HhcccccCceeecCCCceEEEc Confidence 6665566666777777754 4332222222 122233333333333332 2334333 3567888754321 Q ss_pred ccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 316 QYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQTIHTQFVQRVNEIILEALNENLLPQ 395 (507) Q Consensus 316 ~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~li~~ 395 (507) +-+..-..|.+.-++..++|++++--..--.+...++++ ..+........-+.-.++.|++.||+.|+.+ T Consensus 248 ---------~~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~~~~~~~-~e~~~~~f~~~~l~P~~~~ie~~l~~kll~~ 317 (406) T protein:vir:97 248 ---------EIDTNVLQLITSNNFSTAQIAKALRVPSYKLGVNSPNQS-VAQLMEDYVTNDLPFYFDAITSELGLKTLND 317 (406) T ss_pred ---------cCCHHHHHHHHHHHhhHHHHHHHhCCCHHHcCCCCCcch-HHHHHHHHHHHHHHHHHHHHHHHHhhhhcCh Confidence 111122234455667788899876433322221112222 2233333444556667788888888766433 Q ss_pred HHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCC--cccccchhh----h Q lcl|NC_017974. 396 LLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIE--EDISQEELM----K 469 (507) Q Consensus 396 l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~--~~~~~~~~~----~ 469 (507) -- ...-++.|+.. .+++..++.+.++++.|++.+ +.+|+.+|+|+-.+ .|....... . T Consensus 318 ~~---------~~~~~i~fd~~--~~~~~~~~~~~~~~~~g~~T~-----NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~ 381 (406) T protein:vir:97 318 KD---------RRLYHIEFDTR--SVTGRNVDEIVKLVNNQILTP-----NQGLVELGKQKSTDPNMDRYQSSLNYVFLD 381 (406) T ss_pred hh---------ccceeEEEecC--ccchhhHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCCCCeEeeccCccchh Confidence 11 01113566543 356667788889999998766 57999999986544 222111100 0 Q ss_pred cCCCcCCCCCCcccccCcccccccccCCCCch Q lcl|NC_017974. 470 LLGEDTSRAGDGMTKGSSGNGTGKISSTRDNS 501 (507) Q Consensus 470 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 501 (507) ...+.+.+... ..+++..++. .|.+ T Consensus 382 ~~~~~~~~~~~-~~~gg~~~~~------~~~~ 406 (406) T protein:vir:97 382 KKEEYQDKVGI-KGKGGEVNAE------EDKS 406 (406) T ss_pred ccccccccccc-ccCCCCCCCC------CCCC Confidence 00010100000 0111111111 1111 No 85 >protein:vir:95965 Length: 385 # NCBI annotation: ORF011 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239800;genbank:gi:66395461;genbank:GeneID:5132882 Probab=99.35 E-value=2.7e-11 Score=78.51 Aligned_cols=376 Identities=12% Similarity=0.050 Sum_probs=177.1 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHH-HhhChHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVES-MKQDATVATGLDMLYTF 79 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~-m~~D~~v~s~l~~Rk~a 79 (507) |.=-.+.-+... ++ ... + .+..+ -.++.. -++.+.|.+|++.+-.. T Consensus 1 Mg~f~~~f~~~~--~~-----~~~---~--~~~~~---------------------~~~~~~~a~~~~~v~~~i~~ia~~ 47 (385) T protein:vir:95 1 MGLFDSVFKRHS--EL-----SWM---Y--DLEFL---------------------QDKSKKAYLKQIALNTVVEMVART 47 (385) T ss_pred CchhhhhhccCc--cc-----ccc---c--chhhh---------------------hccchhhhhhhHHHHHHHHHHHHH Confidence 443222211100 00 000 0 00000 001111 23568899999999999 Q ss_pred HhhCCceeeeCCCCChhHHHHHHHHHHHHHhc--cCCCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeeeecc Q lcl|NC_017974. 80 VEKAFKDFKVIPGESEESKKAAKFVEYCLKNM--EGQTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLA 156 (507) Q Consensus 80 v~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~--~~~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~ 156 (507) |.++++...-. .......+ ..+|..- ...++.++++.++ +.+.+|.+.+.+.. ++.+.+.... T Consensus 48 ia~~p~~~~~~--~~~~~~~l----~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~i~~~~--------~~~~~~~~~~ 113 (385) T protein:vir:95 48 ISQSEFRVMKN--NTKEKGTL----YYLLNVRPNRNQNAVDFWQKFIFKLIMDNEVLVVKND--------EGHFFVADDF 113 (385) T ss_pred Hcccceeeeec--CccccchH----HHHHhcccCcCCCHHHHHHHHHHHHhhcCceEEEEec--------CCCeeecccc Confidence 99887654322 22222223 3344321 2234667777755 46678998864422 1222222211 Q ss_pred ccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHH Q lcl|NC_017974. 157 FRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGC 236 (507) Q Consensus 157 ~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~ 236 (507) .++... .... ..+.... .........+|...+|++++....+..+|.|++..+ T Consensus 114 ~~~~~~-~~~~------------------~~~~~~~--------~~~~~~~~~~~~~eiih~~~~~~~~~~~G~s~~~~~ 166 (385) T protein:vir:95 114 EKEDEL-GLYS------------------HRFTNVL--------VNDFEFKRVFTMDDVIYLKYNNQKLDAFSLGLFEDY 166 (385) T ss_pred cccccc-cccc------------------ccceeee--------ecccceeeeeccccEEEecCCCCCcccccchHHHHH Confidence 111110 0000 0000000 001111234677777777777667778899998877 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHH-HHhhcCCceeEeecCccccccc Q lcl|NC_017974. 237 YRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDA-ANAHSGEQSFFMLPSDTKDNAP 315 (507) Q Consensus 237 ~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~-~n~~~G~~a~~iiP~g~~i~~~ 315 (507) .-..- ..+.. ....+.+-.++.+|.. ..-+ ++..+.+++..+.. .....+....++++.|+++.-. T Consensus 167 ~~~i~------~~~~~-~~~~~~~~g~l~~~~~-----~~~~-~e~~~~~~~~~~~~~~g~~~~~~~i~~l~~g~~~~~l 233 (385) T protein:vir:95 167 GEIFG------RMIDL-QMLNNQIRGILKVDAT-----KFYN-KEKQKELQAYIDTLFDAFQNNTIAVVPLTEGLAYEEH 233 (385) T ss_pred HHHHH------HHHHH-HHhcCCCceEEEeCCc-----cCCC-HHHHHHHHHHHHHHhhhhhhcCCceEEcCCCceeEee Confidence 64321 11111 1222333233333321 1112 22223333333222 2222223334667888765422 Q ss_pred ccceeeeecc-CCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhH-HHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 316 QYSMTLKGID-GMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNL-SESKQTIHTQFVQRVNEIILEALNENLL 393 (507) Q Consensus 316 ~~~~el~~~~-g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al-~~vh~~v~~~~~~aD~~~i~~~ln~~li 393 (507) + ..... .+-.-.+|.+..++...+|+++..-..--. +|+++- .+........-+.-.++.|+..||+.|+ T Consensus 234 ~----~~~~~~~s~~d~~~~e~~~~~~~~Ia~~fgVpp~~l----~~~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~~L~ 305 (385) T protein:vir:95 234 S----NRGAAQSAQQFSELNELKKTVLTDVARMIGVPPSLV----LGEMADLEKTIESYLQFCINPLLRKIEAELNSKFF 305 (385) T ss_pred c----ccccccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHh----cCCCcCHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Confidence 1 11111 111122477788889999999886643112 234442 2233445555667788888888888776 Q ss_pred HHHHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCC--cccccchhhhcC Q lcl|NC_017974. 394 PQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIE--EDISQEELMKLL 471 (507) Q Consensus 394 ~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~--~~~~~~~~~~~~ 471 (507) ++--..+ .+-+|.++.....|.++.++++.++++.|++.+ +.+|+.+|+|+-.+ +|..- .+.... T Consensus 306 ~~~~~~~-------~~~~fd~~~l~~~D~~~~~~~~~~~~~~g~lt~-----NE~R~~~g~~p~~~~~gd~~~-~~~n~~ 372 (385) T protein:vir:95 306 YQDEYLN-------DDMHIKVVGIDKRDPLKLSEAIDKLVASGTFTR-----NQVRIMTGEEPADDPELDKFI-ITKNLQ 372 (385) T ss_pred Chhhccc-------ceEEEechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCCCceee-ecccce Confidence 6532222 122344556667889999999999999998876 57999999986422 22211 111111 Q ss_pred CCcCCCCCCcccccCccccccc Q lcl|NC_017974. 472 GEDTSRAGDGMTKGSSGNGTGK 493 (507) Q Consensus 472 ~~~~~~~~~~~~~~~~~~~~~~ 493 (507) +-+..+. +. ...+ T Consensus 373 ~~~~~kg-------ge--~~~e 385 (385) T protein:vir:95 373 SADAFKG-------GE--SNEE 385 (385) T ss_pred ecccccC-------CC--CCCC Confidence 1111111 11 1111 No 86 >protein:vir:4952 Length: 386 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049928;genbank:gi:9632899;genbank:GeneID:1262075 Probab=99.34 E-value=5.7e-11 Score=76.73 Aligned_cols=383 Identities=12% Similarity=0.061 Sum_probs=187.9 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTFV 80 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~av 80 (507) |.=.++.+++..-+..+.. . ...+...+. ...... ...+. .+..++.+.|.+|+..+...| T Consensus 1 M~~f~~~~~~~~~~~~~~~--~--~~~~~~~~~--------~~~~~~------~~~v~-~~~al~~~~v~~~i~~ia~~i 61 (386) T protein:vir:49 1 MPIFNITNLATESPPINQE--S--FFDIADSDF--------LASLNS------SEWVS-AENALKNSDLFSIISQLSNDL 61 (386) T ss_pred CchhhhhccCCCCcccchh--h--hhhhhhccc--------cccccC------Cceec-hhhhhccHHHHHHHHHHHHHh Confidence 6554443332211111000 0 000000000 000000 00010 122456789999999999999 Q ss_pred hhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHHH-HHhhcceeeeeeeeccCccccCcceeeeeccccC Q lcl|NC_017974. 81 EKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAAT-FNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAFRP 159 (507) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l~-a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~r~ 159 (507) .++++.+. .. . ...++.+. -...++.++++.++. .+.+|-+++++++... |+ +..|.+.+ T Consensus 62 a~~p~~~~--~~--~----~~~l~~~P---N~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~------g~--~~~l~~i~ 122 (386) T protein:vir:49 62 ATAKITTS--RK--Q----LQGIVDNP---SNNANRFNFYQSIFAQMLLGGEAFAYRWRNDN------GR--DMKWEYLR 122 (386) T ss_pred hhCceeec--cc--h----hhhhhhcc---CCCCCHHHHHHHHHHHhhhcCCEEEEEEECCC------Cc--EEEEEEec Confidence 88866543 21 1 01111110 112246678887774 5679999999998643 22 33455555 Q ss_pred chhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHHHHH Q lcl|NC_017974. 160 QASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYRA 239 (507) Q Consensus 160 ~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~~ 239 (507) +.++. +..+.++..+.. .+...+ ...+....+|...+|++++....+..+|.|.+..|... T Consensus 123 ~~~v~----v~~~~~~~~~~y---------~~~~~~------~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~ 183 (386) T protein:vir:49 123 PSQVS----FNRLDNQNGLYY---------NITFDD------PHIAPKQHVPQNDILHFRLLSVDGGLTSVSPLMALGRE 183 (386) T ss_pred CceeE----EEEcCCCceEEE---------EEEEcC------ccccceeEEccccEEEecCCCCCCccccccHHHHHHHH Confidence 54332 122333322211 110000 01123346777777777666666668999999999988 Q ss_pred HHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccccce Q lcl|NC_017974. 240 WREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYSM 319 (507) Q Consensus 240 ~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~~ 319 (507) .-.-....++...+...-+.|--++++|-. ..+++.....++.. . ...++...+++|.|+++. T Consensus 184 i~~~~~~~~~~~~~~~ng~~~~~il~~~~~-------~~~~~~~~~~~~~~-~---~~~n~g~~~vl~~g~~~~------ 246 (386) T protein:vir:49 184 FNIQKASDKLTISALKNALNANGILKIKGG-------GLLDFKTKVSRSRQ-A---MKQMQGGPLVLDDLEDFT------ 246 (386) T ss_pred HHHHHHHHHHHHHHHHccCCccEEEEeCCC-------CChHHHHHHHHHHH-H---hccCCCCceecCCCceEE------ Confidence 877777888888888877777777766532 22333333323222 2 223333457778887532 Q ss_pred eeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_017974. 320 TLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQTIHTQFVQRVNEIILEALNENLLPQLLAL 399 (507) Q Consensus 320 el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~l 399 (507) ..+-+.....+.+..++...+|+++..-..--.+ +.+.+++.++.........+.--++.++..||+.|.. T Consensus 247 ---~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg-~~~~~~~~~~~~~~~~~~~i~~~l~~i~~~~~~~l~~----- 317 (386) T protein:vir:49 247 ---PLEIKSNVAQLLSQADWTTGQFAKVYGIPESIVG-GDGDQQSSLEMIYNIYFKSVSRYLRPFVSEMSKKLSC----- 317 (386) T ss_pred ---EccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhC-CCCCccchHHHHHHHHHHHHHHHHHHHHHHHHHHhcc----- Confidence 2222222334667778888999987654332222 1122333333333333444555555555556554422 Q ss_pred CCCcCCccccceEEec--CcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhhcCCCcCCC Q lcl|NC_017974. 400 NDMRLPETEMPYVKAG--EIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMKLLGEDTSR 477 (507) Q Consensus 400 N~~~~~~~~~P~~~~~--~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~~~~~~~~~ 477 (507) ++.|+ .....|...++..+.+|+..|++.+ +.+|+.++-..-.+.+ .+....+..+ ..+ T Consensus 318 -----------~~~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~-----nE~r~~l~~~~~~~~~-~~~~~~~~~~--~~~ 378 (386) T protein:vir:49 318 -----------EVDVDISPAVDPTGSNYISLINSMVKSGTLAQ-----NQGLYILQQAEILPKE-LPDGKNPNRT--SLK 378 (386) T ss_pred -----------hhcccchhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHHhhCCCCCCc-CcchhccCCC--CCC Confidence 23333 2334566788899999999998766 4678776532111111 1110000000 000 Q ss_pred CCCcccccCcccccccccCCCCchhhh Q lcl|NC_017974. 478 AGDGMTKGSSGNGTGKISSTRDNSAAN 504 (507) Q Consensus 478 ~~~~~~~~~~~~~~~~~~~~~d~~a~~ 504 (507) .| ..+ +.| T Consensus 379 gG-------d~~------------~~~ 386 (386) T protein:vir:49 379 GG-------EIN------------EQD 386 (386) T ss_pred CC-------CCC------------CCC Confidence 01 000 011 No 87 >protein:vir:9641 Length: 395 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795403;genbank:gi:28876176;genbank:GeneID:1257709 Probab=99.34 E-value=2.7e-11 Score=78.57 Aligned_cols=382 Identities=11% Similarity=0.081 Sum_probs=170.5 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHH-hhChHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESM-KQDATVATGLDMLYTF 79 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m-~~D~~v~s~l~~Rk~a 79 (507) |.=...-++.. ... .+. ...+. .. -.++.++ ++-+.|.+|+..+-.. T Consensus 1 Mgl~d~~~~~~----~~~----~~~---~~~~~-------~~--------------~~~~~~~~l~~~~v~~~i~~Ia~~ 48 (395) T protein:vir:96 1 MGILDFFSFKK----SGT----LSD---DDSGS-------TT--------------SEKLTNVVLKEDALYKCVNYLARI 48 (395) T ss_pred CcchhhhcCCC----Ccc----ccc---ccccc-------ch--------------hhhcchhhhhhHHHHHHHHHHHHh Confidence 22111111100 000 000 00000 00 0011222 3557899999999999 Q ss_pred HhhCCceeeeCCCCChhHHHHHHHHHHHHHhccC--CCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeeeecc Q lcl|NC_017974. 80 VEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEG--QTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLA 156 (507) Q Consensus 80 v~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~--~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~ 156 (507) |.++++.+.-.+.....+.. +..+|+.-++ .+..++++.++ +.+++|.+.+.+.+.. ...+.... T Consensus 49 ia~lp~~v~~~~~~~~~~~~----~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~~~~~~--------~~~~~~~~ 116 (395) T protein:vir:96 49 ISKSTFRIKAPEKLTENQKD----WLYWINTKANPNQSASQFWVEVVQKLLVDGETLIFVIPGK--------GIYVADAF 116 (395) T ss_pred hccceeEEEeCCccccccch----HHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEcCC--------ceecCCcc Confidence 99988765543322222222 3344442222 23456666654 4566899887665431 11111111 Q ss_pred ccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHH Q lcl|NC_017974. 157 FRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGC 236 (507) Q Consensus 157 ~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~ 236 (507) ++... + .+..+ .... .....-...+|+..++++++......+++.|+.... T Consensus 117 ~~~~~-~---~~~~~-----------------~~v~--------~~~~~~~~~~~~~dvih~k~~~~~~~~~~~~~~~~~ 167 (395) T protein:vir:96 117 TQDKK-L---SGNKF-----------------KVSR--------VQGQTYEKIFTFDQVIYLKNDNSDLMLKVESLWEEY 167 (395) T ss_pred ccccc-c---cccee-----------------eeee--------eccceeeeEeccCceEEecccCCccccccccccchH Confidence 11100 0 00000 0000 000011234677777777766655556655553322 Q ss_pred H------HHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcc Q lcl|NC_017974. 237 Y------RAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDT 310 (507) Q Consensus 237 ~------~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~ 310 (507) . .....+....++...+....+ .|..+.+.......+...+.+++..+.. ..|..+.++++.|+ T Consensus 168 ~~~~~~~i~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~v~~l~~g~ 237 (395) T protein:vir:96 168 GELLGHVINNQKIANQIRFTMTPPKDKV-------RERAQENSDGGRQPKSDKDFFKRTIEKI---RTESVVGIPVTANT 237 (395) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhccccc-------ccceeeccCchhhHHHHHHHHHHHHHHh---hcCCcceEEccCCc Confidence 1 111222223333333333222 2222223333333344444444443332 23444556677887 Q ss_pred cccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhH-HHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 311 KDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNL-SESKQTIHTQFVQRVNEIILEALN 389 (507) Q Consensus 311 ~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al-~~vh~~v~~~~~~aD~~~i~~~ln 389 (507) +..-.. +.-..++. -+.-.+.++...+-++|++++.-..--. +|+++- .+........-+.-.++.|++.|| T Consensus 238 ~~~~l~--~~~~d~q~-~e~~~~~~~~~~~~~eIa~~fgVPp~~l----~~~~sn~e~~~~~f~~~~L~P~~~~ie~~l~ 310 (395) T protein:vir:96 238 NYEEYG--SKNTGSVK-SYVDDIKKLKDQYMAEFAEMLGIPISLL----HGDIADNQKNYELLLEGPIESLITNIVDGLE 310 (395) T ss_pred eeEecc--cChhhhhh-hhHHHHHHHHHHHHHHHHHHhCCCHHHh----cCCCccHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 543211 10000000 0011233344556688999876544222 133432 222334445566777788888888 Q ss_pred HHHHHHHHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCC--cccccchh Q lcl|NC_017974. 390 ENLLPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIE--EDISQEEL 467 (507) Q Consensus 390 ~~li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~--~~~~~~~~ 467 (507) +.|++.--.. .+. +|.|+.....|+++.+++++++++.|+..+ +.+|+.+|+|+-.+ +|..-... T Consensus 311 ~~Ll~~~e~~-----~~~---~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~-----NE~R~~~gl~pi~~~~gD~~~~~~ 377 (395) T protein:vir:96 311 YAIFDKSETL-----EGS---FIKVTGLKNYDLFSISSQADKLISSGFVFI-----DEVREEIGLPELPDGLGKVLYMTK 377 (395) T ss_pred hhcCChhhhc-----Cce---eEeecchhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCCCceeeecc Confidence 7765432111 111 367777888899999999999999998766 57999999987533 33221111 Q ss_pred hhcCCCcCCCCCCcccccCccc Q lcl|NC_017974. 468 MKLLGEDTSRAGDGMTKGSSGN 489 (507) Q Consensus 468 ~~~~~~~~~~~~~~~~~~~~~~ 489 (507) . ..+-+ .+.|+ .....++ T Consensus 378 N-~~~~~-~~gge--~~~~~~~ 395 (395) T protein:vir:96 378 N-YESVL-ERGGE--VDEEVET 395 (395) T ss_pred c-ceech-hccCC--CCCCCCC Confidence 0 01110 11111 1111111 No 88 >protein:vir:100187 Length: 385 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025029;genbank:gi:48697262;genbank:GeneID:2948285 Probab=99.33 E-value=3.8e-11 Score=77.73 Aligned_cols=373 Identities=13% Similarity=0.113 Sum_probs=187.4 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTFV 80 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~av 80 (507) |+++...++..+.. ..... + .. +............- -+..++.+.|.+|+...-..| T Consensus 4 ~~~~~~~~~~~~~~-~~~~~---~--------~~----~~~~~~~~~~~~v~-------~~~al~~~~v~~~i~~ia~~i 60 (385) T protein:vir:10 4 LTPRNFNKRKAKNM-VYPSN---P--------AF----FTTTVGGMQLSYVS-------ALSALQNTNVYSVINRIASDV 60 (385) T ss_pred ccchhccccccccc-ccccc---h--------hh----hhhhccccCccccC-------HHHhhccHHHHHHHHHHHHHH Confidence 44432222211111 00000 0 00 00000000000000 122456788999999999999 Q ss_pred hhCCceeeeCCCCChhHHHHHHHHHHHHHh-ccCCCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeeeecccc Q lcl|NC_017974. 81 EKAFKDFKVIPGESEESKKAAKFVEYCLKN-MEGQTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAFR 158 (507) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~-~~~~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~r 158 (507) .++++++.-.+ . ...|++ -...++.++++.+. +.+++|-+.++++... .+ +.|- T Consensus 61 a~~p~~v~~~~--------~----~~ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~r~~------~~------~~p~ 116 (385) T protein:vir:10 61 ASAHFKTENTA--------T----LNRLESPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQN------LE------HIPN 116 (385) T ss_pred hhCceeeeccc--------h----hhhhhcCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcCc------ee------Eeec Confidence 98876653221 1 112321 11234667777766 4557999999886531 11 2222 Q ss_pred CchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCc--cCccccchHHHHH Q lcl|NC_017974. 159 PQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGS--SSQALGVSPLVGC 236 (507) Q Consensus 159 ~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~--~~~p~G~gll~~~ 236 (507) ++.++ ....|+.++.... . .........+|.+.+|++++... .+..+|.|.+..| T Consensus 117 ~~~~v------~~~~~~~~~~~~~---------~--------~~~~~~~~~~~~~eiihik~~~~~~~~~~~G~s~i~~~ 173 (385) T protein:vir:10 117 SDVQI------NYLPGNMGIVYTV---------L--------ESNDRPQMVLRQDQMLHFRLMPDPQYRYLIGRSPLESL 173 (385) T ss_pred CCceE------EEEEcCCceEEEE---------E--------EcCCceEEEEccccEEEeccCCCCcccccccccHHHHH Confidence 22222 1222222211100 0 01122344577777776665332 3456899999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCce--eEeecCcccccc Q lcl|NC_017974. 237 YRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQS--FFMLPSDTKDNA 314 (507) Q Consensus 237 ~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a--~~iiP~g~~i~~ 314 (507) ....-.-....++-+.+...-+.|--++++|- ...+++..+.++ +...+..+|..+ .+++|.|++++- T Consensus 174 ~~~i~~~~~~~~~~~~~~~ng~~~~gil~~~~-------~~~~~e~~~~~~---~~~~~~~~~~n~~~~~vl~~g~~~~~ 243 (385) T protein:vir:10 174 QNALNLDDKASKSNMSAMENQINPAGKLTISN-------YLSDGKDLESAR---EEFEKANTGDNSGRLMVLPDGFDYTQ 243 (385) T ss_pred HHHHHHHHHHHHHHHHHHhccCCcceEEEeCC-------CCCCHHHHHHHH---HHHHHHhCccccCCccccCCCceEEe Confidence 98887777788888888876666655665542 122222222232 222333344434 377788875432 Q ss_pred cccceeeeeccCCCcchhH-HHHHHHHHHHHHHHHhccccccc--ccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 315 PQYSMTLKGIDGMGKQYST-AQLISDRKKSILDRLGAGFINVG--NDKGGSYNLSESKQTIHTQFVQRVNEIILEALNEN 391 (507) Q Consensus 315 ~~~~~el~~~~g~g~~~~~-~~li~~~d~~Isk~iLGqtLts~--~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~ 391 (507) .+-+.....+ .+..++..++|++++.-..--.+ +.++.+++-.+.+......-+.-.++.|++.||+. T Consensus 244 ---------l~~~~~d~~~l~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~~sn~eq~~~~~~~~l~P~~~~ie~~l~~~ 314 (385) T protein:vir:10 244 ---------LEMKTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQHSNIDQIKATYLANLNSYVNPIVDELRLK 314 (385) T ss_pred ---------cCCChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 1112112222 35567778889987755432222 22223344334344444456677888888888875 Q ss_pred HHHHHHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhhcC Q lcl|NC_017974. 392 LLPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMKLL 471 (507) Q Consensus 392 li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~~~ 471 (507) |+ .+. -+|.++..-..|.++.++++.++++.|++.+ +++|+.+|.++-.+++.. .... T Consensus 315 l~----------~~~---~~f~~~~ll~~d~~~~~~~~~~~~~~G~~T~-----NE~R~~~g~~p~p~~~~~-~~~~--- 372 (385) T protein:vir:10 315 MN----------APD---LELDIKDMLDVDDSALINQVSNLAKSGVLGA-----EQAQFILTRSGFLPDNLP-EFKP--- 372 (385) T ss_pred hC----------Cce---EEeechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCccCCCCCc-cccC--- Confidence 42 111 1344445556799999999999999999876 579999988643332211 1111 Q ss_pred CCcCCCCCCcccc Q lcl|NC_017974. 472 GEDTSRAGDGMTK 484 (507) Q Consensus 472 ~~~~~~~~~~~~~ 484 (507) +....+.|+...+ T Consensus 373 ~~~~~~~g~~~dn 385 (385) T protein:vir:10 373 LTTQVKGGDEGDN 385 (385) T ss_pred cccccCCCCCCCC Confidence 1111111211111 No 89 >protein:vir:98643 Length: 395 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039921;genbank:gi:126011096;genbank:GeneID:4818479 Probab=99.31 E-value=8.4e-11 Score=75.83 Aligned_cols=385 Identities=11% Similarity=0.072 Sum_probs=170.3 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHH-HhhChHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVES-MKQDATVATGLDMLYTF 79 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~-m~~D~~v~s~l~~Rk~a 79 (507) |.=...-.+. ++... .. ...+. ... ..+.+ -+.-+.|.+|+...-.. T Consensus 1 MGlf~~~~~~----~~~~~----~~---~~~~~-------~~~--------------~~~~~~~~~~~~v~~~I~~ia~~ 48 (395) T protein:vir:98 1 MGILDFFSFK----KSGTL----SD---DDSGS-------TTS--------------EKLTNVVLKEDALYKCVNYLARI 48 (395) T ss_pred CcchhhhcCC----Ccccc----cc---cccch-------hhh--------------hhcchhhhhhHHHHHHHHHHHHH Confidence 2111111000 00000 00 00000 000 01111 13557799999999999 Q ss_pred HhhCCceeeeCCCCChhHHHHHHHHHHHHHhccC--CCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeeeecc Q lcl|NC_017974. 80 VEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEG--QTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLA 156 (507) Q Consensus 80 v~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~--~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~ 156 (507) |.++++...-.......+.. +..+|+.-++ .+..++++.+. +.+++|.+.+.++... ...+.... T Consensus 49 iA~lp~~~~~~~~~~~~~~~----~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnayi~~~~~~--------~~~~~~~~ 116 (395) T protein:vir:98 49 ISKSTFRLKTPEKLTENQKD----WLYWINTKANPNQSASQFWVEVIQKLLVDGETLIFVIPGK--------GIYVADSF 116 (395) T ss_pred HhhCceeEEecCCcccccch----HHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeCC--------ceecCCcc Confidence 99987765433222222222 3334442222 23455666644 4666899987776532 11111110 Q ss_pred ccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHH Q lcl|NC_017974. 157 FRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGC 236 (507) Q Consensus 157 ~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~ 236 (507) .+.. .+. +..+ ... ..........++++.++++++....+.+++.|+.... T Consensus 117 ~~~~-~~~---~~~~-----------------~~~--------~~~~~~~~~~~~~~evih~k~~~~~~~~~~~~~~~~~ 167 (395) T protein:vir:98 117 TQDK-KIS---GSQF-----------------KVS--------RVQGQTYEKTFTFDQVIYLKNDNSDLMSKVESLWEEY 167 (395) T ss_pred cccc-ccc---Cccc-----------------cee--------eecCceeeeEecCccEEEecCCCCCccccccchhhhH Confidence 0000 000 0000 000 0001111234566677777777666677777765532 Q ss_pred HHHH--HHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHH-HHHhhcCCceeEeecCccccc Q lcl|NC_017974. 237 YRAW--REKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSD-AANAHSGEQSFFMLPSDTKDN 313 (507) Q Consensus 237 ~~~~--~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~-~~n~~~G~~a~~iiP~g~~i~ 313 (507) .... .........-..+....+.+..+...+ .+.. .++.....++..+. ......+..+.++++.|++.. T Consensus 168 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~ 240 (395) T protein:vir:98 168 GELLGHVINNQKIANQIRFTMIPPKDKVRERAQ---ENSD----GGRQSKSDKDFFKRTVEKIRTESVVGIPVTANTNYE 240 (395) T ss_pred HHHHHHHHHHHHHHHHHHHhhcccccccccccc---ccCC----cHHHHHHHHHHHHHHHhhhhcCCcceeecCCCceeE Confidence 2211 001111111111222222222222221 1111 11222222222222 222223444455677887643 Q ss_pred ccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHH-HHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 314 APQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLS-ESKQTIHTQFVQRVNEIILEALNENL 392 (507) Q Consensus 314 ~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~-~vh~~v~~~~~~aD~~~i~~~ln~~l 392 (507) -... ...... ....-.+.++.++.-.+|++++.-..--. +|+++-. +........-+.-.++.|++.||+.| T Consensus 241 ~l~~--~~~~~~-~~~~~q~~e~~~~~~~~Ia~~fgVP~~~l----~~~~sn~e~~~~~f~~~tl~P~~~~ie~~l~~kl 313 (395) T protein:vir:98 241 EYGS--KNTGAV-KSYVDDIKKLKDQYMAEFAEMLGIPISLL----HGDIADNQKNYELLLEGPIESLITNIVDGLEYAI 313 (395) T ss_pred eccc--cccccc-ChhHHHHHHHHHHHHHHHHHHhCCCHHHh----cCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 2211 000000 11112456667777889999776644222 2444422 22234445667778888888888877 Q ss_pred HHHHHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCC--cccccchhhhc Q lcl|NC_017974. 393 LPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIE--EDISQEELMKL 470 (507) Q Consensus 393 i~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~--~~~~~~~~~~~ 470 (507) +++--... .-+|.|+.....|.++.++++.++++.|++.+ +++|+.+|+|+-.+ .|....... . T Consensus 314 l~~~~~~~--------g~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~-----NE~R~~~g~~Pi~~~~gD~~~~~~n-~ 379 (395) T protein:vir:98 314 FDKSETLQ--------GSFIKVTGLKNYDLFSISNQADKLISSGFVFI-----DEVREEIGLPELPDGLGKVLYMTKN-Y 379 (395) T ss_pred CChhhhcC--------cceeeehhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCCCceeeeccc-c Confidence 65432111 11477777888899999999999999998776 58999999986544 332211111 0 Q ss_pred CCCcCCCCCCcccccCccccc Q lcl|NC_017974. 471 LGEDTSRAGDGMTKGSSGNGT 491 (507) Q Consensus 471 ~~~~~~~~~~~~~~~~~~~~~ 491 (507) .|- ..+.|+ ..... ++ T Consensus 380 ~~~-~~~gge--~~~~~--~~ 395 (395) T protein:vir:98 380 ESV-LERGGE--VDEEV--ET 395 (395) T ss_pred eec-ccccCC--CCCCC--CC Confidence 111 011111 11111 11 No 90 >protein:vir:100882 Length: 383 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358762;genbank:gi:78000027;genbank:GeneID:3726153 Probab=99.31 E-value=4.3e-11 Score=77.44 Aligned_cols=371 Identities=13% Similarity=0.097 Sum_probs=186.2 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTFV 80 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~av 80 (507) |+++.-+|+..+-..+... + .. +...........+- -+..++-+.|.+|+..+-..+ T Consensus 4 ~~~~~~~k~~~~~~~~~~~----------~---~~---~~~~~~~~~~~~v~-------~~~~l~~~~v~~~i~~ia~~i 60 (383) T protein:vir:10 4 LTPKNFSKRNAKNMVYPSN----------P---AF---FTTTVGGMQLSYVS-------ALSALQNTNVYSVINRIASDV 60 (383) T ss_pred ccccccccccccccccccc----------h---hh---hhhhccCccccccc-------hhHhhcchHHHHHHHHHHHhh Confidence 5543222222211110000 0 00 00000000010000 122345678999999999999 Q ss_pred hhCCceeeeCCCCChhHHHHHHHHHHHHHh-ccCCCHHHHHHHHHH-HHhhcceeeeeeeeccCccccCcceeeeecccc Q lcl|NC_017974. 81 EKAFKDFKVIPGESEESKKAAKFVEYCLKN-MEGQTLRQFARDAAT-FNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAFR 158 (507) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~-~~~~~~~~~l~~~l~-a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~r 158 (507) .++++++.-.+ ....|++ -...++.++++.++. .+++|-+.++++-.. .+ +.+- T Consensus 61 a~~~~~~~~~~------------~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~~~~------~~------~~p~ 116 (383) T protein:vir:10 61 SSAHFKTENTA------------TLNRLESPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQN------LE------HIPN 116 (383) T ss_pred ccCceeecccc------------hhhhhhCCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcCc------ee------Eeec Confidence 88866543211 1112321 112346667776654 556899998875321 11 1111 Q ss_pred CchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCc--cCccccchHHHHH Q lcl|NC_017974. 159 PQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGS--SSQALGVSPLVGC 236 (507) Q Consensus 159 ~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~--~~~p~G~gll~~~ 236 (507) ++.+ +.+..++.++..... ....+....++.+.++++++... .+..+|.|.+..| T Consensus 117 ~~~~------v~~~~~~~~~~~~~~-----------------~~~~~~~~~~~~~evih~r~~~~~~~~~~~G~s~l~~~ 173 (383) T protein:vir:10 117 SDVQ------INYLPGNMGIVYTVL-----------------ESNDRPKMVLRQDQMLHFRLMPDPQYRYLIGRSPLESL 173 (383) T ss_pred Ccce------EEEEEcCCceEEEEE-----------------EcCCceEEEEcccceEEeccCCCCcccccccccHHHHH Confidence 1111 111222222111100 01112345577777776664332 2346899999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCce--eEeecCcccccc Q lcl|NC_017974. 237 YRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQS--FFMLPSDTKDNA 314 (507) Q Consensus 237 ~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a--~~iiP~g~~i~~ 314 (507) ....-.-....++-+.|...-+.|--++.+|-. ..+++..+.+++ ...+...|..+ .++++.|++++- T Consensus 174 ~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~~-------~~~~e~~~~~~~---~~~~~~~~~n~~~~~vl~~g~~~~~ 243 (383) T protein:vir:10 174 QNALNLDDKASKSNMSAMENQINPAGKLTISNY-------LSDGKDLESARE---EFEKANTGDNSGRLMVLPDGFDYTQ 243 (383) T ss_pred HHHHHHHHHHHHHHHHHHhccCCcceEEEeCCC-------CCCHHHHHHHHH---HHHHHhCccccCCccccCCCceEEe Confidence 888777777888888888877766666665422 112222222222 22333334333 477788876533 Q ss_pred cccceeeeeccCCCcchhH-HHHHHHHHHHHHHHHhccc--ccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 315 PQYSMTLKGIDGMGKQYST-AQLISDRKKSILDRLGAGF--INVGNDKGGSYNLSESKQTIHTQFVQRVNEIILEALNEN 391 (507) Q Consensus 315 ~~~~~el~~~~g~g~~~~~-~~li~~~d~~Isk~iLGqt--Lts~~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~ 391 (507) . +-+.....+ .++.++..++|+.++.-.. |...+.++.++|-.+.+...+..-+.-.++.|++.||+. T Consensus 244 l---------~~~~~d~~~l~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~~~sn~eq~~~~~~~~l~P~~~~ie~~l~~~ 314 (383) T protein:vir:10 244 L---------EMKTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQHSNIDQIKATYLANLNSYVNPIVDELRLK 314 (383) T ss_pred c---------CCChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 2 111111223 4566788899999776543 211121222333334444445556667778888888875 Q ss_pred HHHHHHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhhcC Q lcl|NC_017974. 392 LLPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMKLL 471 (507) Q Consensus 392 li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~~~ 471 (507) |+ ++ .-+|.++.....|.+..++++.++++.|++.+ +.+|+.+|+|+-..++..... . T Consensus 315 l~----------~~---~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~-----nE~R~~lg~~p~~~~d~~~~~----~ 372 (383) T protein:vir:10 315 MN----------AP---DLELDIKDMLDVDDSILINQVSNLAKSGVLGA-----EQAQFILTRSGFLPDNLPEFK----P 372 (383) T ss_pred hC----------Cc---eEEeechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCcccCCcccccC----C Confidence 42 11 12344555557899999999999999998776 579999999765544432111 1 Q ss_pred CCcCCCCCCcc Q lcl|NC_017974. 472 GEDTSRAGDGM 482 (507) Q Consensus 472 ~~~~~~~~~~~ 482 (507) +....+.|+.. T Consensus 373 ~~~~~~gGd~e 383 (383) T protein:vir:10 373 LTNETKGGDDK 383 (383) T ss_pred CcccCCCCCCC Confidence 11111111111 No 91 >protein:vir:78310 Length: 376 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468642;genbank:gi:157325220;genbank:GeneID:5601655 Probab=99.28 E-value=9.1e-11 Score=75.64 Aligned_cols=369 Identities=11% Similarity=0.061 Sum_probs=168.8 Q ss_pred cCchhhhHHHHhhhhcccchhccchhHHHH-HHHhhChHHHHHHHHHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHH Q lcl|NC_017974. 31 QGTFALAKVQVDSERMKAEEIRWPNLIGTV-ESMKQDATVATGLDMLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLK 109 (507) Q Consensus 31 ~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y-~~m~~D~~v~s~l~~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~ 109 (507) +|-.. ...+.....+.....-. .-.++ +..++-+.|.+|++.+-..+.++++.+ .......+..+.. .|. T Consensus 1 Mg~f~-~l~~~~~~~~~~~~~~~--~~~~~~~~~l~~~~v~~~i~~Ia~~ia~~p~~~--~~~~~~~~~~l~~----ll~ 71 (376) T protein:vir:78 1 MGFFS-ELFKRNKEIEWMWDLDF--LEDKTTKVYLKKMALNTCVKHIARTIAKSDFRL--KNGETSVRDKLYY----KLN 71 (376) T ss_pred Cchhh-hhhccCCccccccchhh--ccccchhhhhhhHHHHHHHHHHHHhhcccceee--ccccccccchHHH----HHh Confidence 23110 00000000000000000 00111 223466789999999999998886654 3322222222333 333 Q ss_pred hc--cCCCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeeeeccccCchhcccccceeecCCCccccccccccc Q lcl|NC_017974. 110 NM--EGQTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLS 186 (507) Q Consensus 110 ~~--~~~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~ 186 (507) .- +..++.++++.++ +.+++|.+...+++.. ...+..+.++.+..+.......+ T Consensus 72 ~~PN~~~t~~~f~~~~~~~lll~Gn~~~~~~r~~--------~~~~~~~~~~~~~~~~~~~~~~~--------------- 128 (376) T protein:vir:78 72 IRPNTDMSSSSFWEKVIYKLIYDNECLIVLSDTD--------DFLIADSYVRKEFAFFPDVFEGV--------------- 128 (376) T ss_pred hccccCCCHHHHHHHHHHHHhHcCcEEEEEEeCC--------Ceeeccceeecccceeeeeeeee--------------- Confidence 21 2234566766655 4566899887665432 22333444443322211100000 Q ss_pred cccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCc-eEEe Q lcl|NC_017974. 187 AFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGV-IELK 265 (507) Q Consensus 187 ~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~-~v~~ 265 (507) . .........+|.+.++++++....+.+++.++...+.- +.+ ..+..+ +++.+. +.+. T Consensus 129 -----~--------~~~~~~~~~~~~~evih~~~~~~~~~~~~~~~~~~~~~--~~~----~~~~~~--~~~~~~~~~~~ 187 (376) T protein:vir:78 129 -----T--------VKDYRYNRNFSMDDVIFLEYGNERLSAFTDGMFEDYGE--LFG----KMIRAQ--MRNFQIRGAVN 187 (376) T ss_pred -----e--------eecceeeeeeccccEEEeccCCCCchhhhhHHHHHHHH--HHH----HHHHHH--HhcCCCceeEE Confidence 0 00001112356777777777777766666655443321 111 111122 222222 2222 Q ss_pred cccccCCCCCCCchHHHHHHHHHHHHHHH-HhhcCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHH Q lcl|NC_017974. 266 IPSQILNKAAMDQSSPEAEMVRGLMSDAA-NAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSI 344 (507) Q Consensus 266 ~P~~~~k~~~~~~~~~~~~~l~a~~~~~~-n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~I 344 (507) + +.+..-+++. .+.+++..+... ....+..+.++++.|++..-.+. ....-..+..+|.+..++...+| T Consensus 188 ~-----~~~~~~~~e~-~~~~~~~~~~~~~g~~~~~~~v~~l~~g~~~~~l~~----~~~~~~~~~~q~~e~~~~~~~~I 257 (376) T protein:vir:78 188 F-----KMAGVADKDK-QTKLQEYIDKVYASFNNNEIAIVPQLEGFNYEEFGT----TSVNNSQSFDEVKKLRKEMIDYV 257 (376) T ss_pred E-----ccCCCCCHHH-HHHHHHHHHHHhccccccCcceEEcCCCceEEeecc----CccccchhHHHHHHHHHHHHHHH Confidence 1 1112222323 333333333222 11122233345788876543221 11111111225677778888999 Q ss_pred HHHHhcccccccccchhhhhHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCccccceEEecCcCchhHH Q lcl|NC_017974. 345 LDRLGAGFINVGNDKGGSYNLS-ESKQTIHTQFVQRVNEIILEALNENLLPQLLALNDMRLPETEMPYVKAGEIVDVDME 423 (507) Q Consensus 345 sk~iLGqtLts~~~~~GS~Al~-~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~ 423 (507) ++++.-..--. +|++|-. +........-+.-.++.|++.||+.|+.+ ....-++.++..-..|.+ T Consensus 258 a~~fgVPp~~l----~~~~s~~e~~~~~f~~~~l~P~~~~ie~~l~~kll~~----------~~~~~~~~~~~ll~~d~~ 323 (376) T protein:vir:78 258 ASILGIPSSLL----HGDMADLSNNMKAYMEYCIDPLTKKLEDELNAKLFTF----------SEFLAGEHIKIIHKKDII 323 (376) T ss_pred HHHhCCCHHHh----CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhhhCCc----------ccceecccchhhcccCHH Confidence 99876544222 1333322 22234445556667788888888766443 111122344444567889 Q ss_pred HHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhhcCCCcCCCCCCcccccCcccc Q lcl|NC_017974. 424 GFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMKLLGEDTSRAGDGMTKGSSGNG 490 (507) Q Consensus 424 ~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 490 (507) +.++++.+++..|+..+ +.+|+.+|+|+-.+++-..-- .+....+.+++... | T Consensus 324 ~~~~~~~~~~~~G~~t~-----NE~R~~lg~~p~~~g~~d~~~----~~~n~~~~~~~~e~-----g 376 (376) T protein:vir:78 324 ENAEAVDKLVASGSFNR-----NEVRELLGAERVDNPELDKYL----ITKNYQSADEGGED-----G 376 (376) T ss_pred HHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCCCceee----eccCceehhccccC-----C Confidence 99999999999998766 579999999865444211100 01111111111111 1 No 92 >protein:vir:104259 Length: 403 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006980;genbank:gi:46401881;genbank:GeneID:2777676 Probab=99.25 E-value=3.3e-10 Score=72.59 Aligned_cols=385 Identities=11% Similarity=0.032 Sum_probs=173.3 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHH-HHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKV-QVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTF 79 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~-~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~a 79 (507) |. +..+-...+.+. ++....+............+-+..++.+.|.+|+...... T Consensus 1 mg-------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~v~~cv~~Ia~~ 55 (403) T protein:vir:10 1 MG-------------------------FKSWITEKLNPGQRIIRDMEPVSHRTNRKPFTTGQAYSKIEILNRTANMVIDS 55 (403) T ss_pred Cc-------------------------chhhhhhccchhhhhhhcccccccccCCcccccHHHHHHHHHHHHHHHHHHHH Confidence 10 000000000000 0100000000000000001113334667899999999999 Q ss_pred HhhCCceeeeCCC-CChhHHHHHHHHHHHHHhccC--CCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeeeec Q lcl|NC_017974. 80 VEKAFKDFKVIPG-ESEESKKAAKFVEYCLKNMEG--QTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVKNL 155 (507) Q Consensus 80 v~~~~~~~~~~~~-~~~~~~~~ae~v~~~l~~~~~--~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l 155 (507) |.+++|.+..... ....+.....-+..+|..-++ .++.++.+.+. +.+++|-+.+++. + . .| T Consensus 56 ia~~p~~v~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gnayi~~~---~------~-----~l 121 (403) T protein:vir:10 56 AAECSYTVGDKYNIVTYANGVKTKTLDTLLNVRPNPFMDISTFRRLVVTDLLFEGCAYIYWD---G------T-----SL 121 (403) T ss_pred HhhCceeEeecccccccccccccchHHHHHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEe---C------c-----ee Confidence 9988776532211 111111111223344543222 23556776655 5777998875431 1 0 11 Q ss_pred cccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCC----ccCccccch Q lcl|NC_017974. 156 AFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGG----SSSQALGVS 231 (507) Q Consensus 156 ~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~----~~~~p~G~g 231 (507) .+.|... +....|...+... +. ...+...+...++.++... ..+.++|.+ T Consensus 122 ~~l~~~~------~~v~~~~~~~~~~---------~~-----------~~~~~~~~~~eiih~~~~~~~~~~~~~~~G~s 175 (403) T protein:vir:10 122 YHVPAAL------MQVEADANKFIKK---------FI-----------FNNQINYRVDEIIFIKDNSYVCGTNSQISGQS 175 (403) T ss_pred EeecCcc------eEEEEcCCceEEE---------EE-----------ecCceeecccceEEecccccccCCCCCccccc Confidence 2222211 1111111111110 00 0112234445544444221 246688999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCC---ceeEeecC Q lcl|NC_017974. 232 PLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGE---QSFFMLPS 308 (507) Q Consensus 232 ll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~---~a~~iiP~ 308 (507) .+..+.-..-.-....++-+.|...-+.|--+++.|- .-+++... .+++-.+ +..+|. ...++++. T Consensus 176 ~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~-------~l~~e~~~-~~~~~~~---~~~~g~~n~g~~~vl~~ 244 (403) T protein:vir:10 176 RVATVIDSLEKRSKMLNFKEKFLDNGTVIGLILETDE-------ILNKKLRE-RKQEELQ---LDYNPSTGQSSVLILDG 244 (403) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCC-------CCCHHHHH-HHHHHHH---HHhCCcccCcceeecCC Confidence 9998887776666666666666654444444444442 22233322 2332222 222232 23578888 Q ss_pred cccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 309 DTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQTIHTQFVQRVNEIILEAL 388 (507) Q Consensus 309 g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~l 388 (507) |++....+ ...+.....|.+..++..++|++++--..--.+.+. +|.. .+........-+.-.++.|++.| T Consensus 245 g~~~~~~~-------~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~-~sn~-e~~~~~f~~~tl~P~~~~ie~~l 315 (403) T protein:vir:10 245 GMKAKPYS-------QISSFKDLDFKEDIEGFNKSICLAFGVPQVLLDGGN-NANI-RPNIELFYYMTIIPMLNKLTSSL 315 (403) T ss_pred CceeEEec-------ccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCC-CcCH-HHHHHHHHHHHHHHHHHHHHHHH Confidence 87643221 111111224667778889999997765543332222 2211 22233344555566677777777 Q ss_pred HHHHHHHHHHhCCCcCCccccceEEecCc----CchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCccccc Q lcl|NC_017974. 389 NENLLPQLLALNDMRLPETEMPYVKAGEI----VDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQ 464 (507) Q Consensus 389 n~~li~~l~~lN~~~~~~~~~P~~~~~~~----~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~ 464 (507) |+.|. ++|.|+.. -..|.+..++++.++++.|++.+ +.+|+.+|+|+-.++.... T Consensus 316 ~~~L~----------------~~~~~d~~~~~~l~~D~~~~~~~~~~~~~~G~lT~-----NE~R~~~gl~pi~~~~~d~ 374 (403) T protein:vir:10 316 TFFFG----------------YKITPNTKEVAALTPDKEAEAKHLTSLVNNGIITG-----NEARSELNLEPLDDEQMNK 374 (403) T ss_pred HHhcC----------------ceeeeccchhhhcccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCcccccc Confidence 76541 12444432 24578889999999999999876 5899999998643322111 Q ss_pred ch-hhhcCCC-cCCCCCCcccccCccccc Q lcl|NC_017974. 465 EE-LMKLLGE-DTSRAGDGMTKGSSGNGT 491 (507) Q Consensus 465 ~~-~~~~~~~-~~~~~~~~~~~~~~~~~~ 491 (507) .- +...... .....+++....++.++. T Consensus 375 ~~~p~n~~~~~~~~~~~e~~~~~~~~~g~ 403 (403) T protein:vir:10 375 IRIPANVAGSATGVSGQEGGRPKGSTEGD 403 (403) T ss_pred cccccccccccccCCCCcCCCCCCCcCCC Confidence 00 0001111 011111111111111111 No 93 >protein:vir:4995 Length: 384 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049969;genbank:gi:9632941;genbank:GeneID:1262104 Probab=99.25 E-value=9.1e-11 Score=75.64 Aligned_cols=382 Identities=10% Similarity=0.011 Sum_probs=188.4 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTFV 80 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~av 80 (507) |.=-+++.++...+....... . .+..-+-.+ .+.. ...+ .-+..+.-+.|.+|+..+-..| T Consensus 1 Mglf~~~~~~~~~~~~~~~~~--~--~~~~~~~~~--~~~~------------~~~v-~~~~al~~~~V~~~i~~Ia~~i 61 (384) T protein:vir:49 1 MPIFNITNLATESPPSNQDSF--F--DITDPEFLD--ALNG------------SEWV-SAETALKNSDLFSIISQLSNDL 61 (384) T ss_pred CccccccccCcccccccchhh--c--cccchhhcc--cccC------------Ccee-chhhhhccHHHHHHHHHHHHHH Confidence 544322222222111111000 0 000000000 0000 0000 0122456788999999999999 Q ss_pred hhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeeeeccccC Q lcl|NC_017974. 81 EKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAFRP 159 (507) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~r~ 159 (507) .++++.+. .. .. ..+.+. --...++.++++.++ +.+.+|-+.+++++... |+ +..|.+.+ T Consensus 62 a~l~~~~~--~~--~~-----~~l~~~--PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~------g~--~~~L~~l~ 122 (384) T protein:vir:49 62 ATAKITTS--RK--QL-----QGIVDN--PSNNANRFNFYQSIFAQMLLGGEAFAYRWRNEN------GR--DMKWEYLR 122 (384) T ss_pred hhCceeee--cc--hh-----hhhhhc--cCCCCCHHHHHHHHHHHhhhcCCeEEEEEECCC------Cc--EEEEEEEc Confidence 88866543 11 10 111110 011234667777776 46779999999998543 22 34555555 Q ss_pred chhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHHHHH Q lcl|NC_017974. 160 QASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYRA 239 (507) Q Consensus 160 ~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~~ 239 (507) +.++.. ..+.|+..+. +.+...+ ...+....+|...+|++++....+..+|.|.+..+... T Consensus 123 ~~~v~v----~~~~~~~~~~---------y~~~~~~------~~~~~~~~~~~~eVih~~~~~~~~~~~G~s~i~~~~~~ 183 (384) T protein:vir:49 123 PSQVSF----NRLDNQNGLY---------YNITFDD------PRIPPKQHVPQGDILHFRLLSVDGGLTSVSPLMALGRE 183 (384) T ss_pred CceeEE----EEcCCCceEE---------EEEEecC------ccccceeEecCccEEEecCCCCCCceeeccHHHHHHHH Confidence 544321 1223332211 0000000 01123356777777777766666778999999999887 Q ss_pred HHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccccce Q lcl|NC_017974. 240 WREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYSM 319 (507) Q Consensus 240 ~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~~ 319 (507) .-.-....++-..+...-+.|--++++|- ....++..+..++-.... .+....++++.|++.. T Consensus 184 i~~~~~~~~~~~~~~~ng~~~~~il~~~~-------~~~~~~~~~~~~~~~~~~----~n~~~~~vl~~g~~~~------ 246 (384) T protein:vir:49 184 LNIQKASDKLTLNALKNALNANGILKIKG-------GGLLDFKTKQSRSRQAMK----QMQGGPLVLDDLEDFT------ 246 (384) T ss_pred HHHHHHHHHHHHHHHhccCCCceEEEeCC-------CCChHHHHHHHHHHHhcc----cCCccceecCCCceEE------ Confidence 77777777888888887676666666542 222233333333222221 2233456777777532 Q ss_pred eeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-H Q lcl|NC_017974. 320 TLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQTIHTQFVQRVNEIILEALNENLLPQLL-A 398 (507) Q Consensus 320 el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~-~ 398 (507) ..+-+....++.+..++..++|++++.-..--.+.. ++..+..+..++.....++.-++-+...|++.|-+.+. + T Consensus 247 ---~l~~~~~d~q~~e~~~~~~~~Ia~~fgVp~~~lg~~-~~~~~~~~~~~~~~~~~i~~~l~pi~~~i~~~l~~~l~~~ 322 (384) T protein:vir:49 247 ---PLEIKSNVAQLLSQADWTTGQFAKVYGIPESVVGGE-GDKQSSLEMIYNIYFKAVSRFLRPFVSELSKKLSCEVDAD 322 (384) T ss_pred ---EccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCC-CCccccHHHHHHHHHHHHHHHHHHHHHHHHHHhchhhhhh Confidence 222222333566777888899998775443222111 11111122223344445555555555556555544331 1 Q ss_pred hCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccch Q lcl|NC_017974. 399 LNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEE 466 (507) Q Consensus 399 lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~ 466 (507) +.....++....+|.++..-..|+....++...|...|+.+ + .+|+..|+|+-..+|....= T Consensus 323 ~~~~~~~~~~~~~~~~~~l~~~~~~t~~e~~~~l~~~g~~~-n-----e~r~~~~~~p~~gGd~~~~~ 384 (384) T protein:vir:49 323 ILPAVDPTGSNYIGLINSMVKTGTLAQNQGLYVLQQAEILP-K-----DLPEGETDSTLKGGETNEQY 384 (384) T ss_pred hhhhhhccchHHHHHHHHHhhcCcccHHHHHHHHhhCCCCC-h-----hHHHHcCCCCCCCCCCCCCC Confidence 11000111112223333344556677778888888888753 2 48888888765444432221 No 94 >protein:vir:94002 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764318;genbank:gi:115315632;genbank:GeneID:5176589 Probab=99.24 E-value=1.8e-10 Score=74.02 Aligned_cols=367 Identities=11% Similarity=-0.021 Sum_probs=172.1 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTFV 80 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~av 80 (507) |.=.++...-.+...++... ++ .-+...--.| .-+.|.+|+..+-..| T Consensus 1 Mg~f~~~~~~~~~~~~~~~~------~~----------------------~~~~~~~~~~----~~~~v~~~v~~IA~~i 48 (378) T protein:vir:94 1 MNLFGKVVSFSRGKLNNDTQ------RV----------------------TAWQNEAVEY----TSAFVTNIHNKIANEI 48 (378) T ss_pred CCccccchhcccccccCCcc------ee----------------------eeeccchhHH----HHHHHHHHHHHHHhhh Confidence 43332222111111111000 00 0000000011 2246899999999999 Q ss_pred hhCCceeeeC-CCCChhH--HH-HHHHHHHHHHhccC--CCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeee Q lcl|NC_017974. 81 EKAFKDFKVI-PGESEES--KK-AAKFVEYCLKNMEG--QTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVK 153 (507) Q Consensus 81 ~~~~~~~~~~-~~~~~~~--~~-~ae~v~~~l~~~~~--~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~ 153 (507) .++++.+.-. .++...+ .+ ...-+.++|+.-++ .+..++++.++ +.+.+|.+.+.++|..+. |.++. T Consensus 49 A~lp~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~-----g~~~~- 122 (378) T protein:vir:94 49 TKVEFNHVKYKKSDVGSDTLISMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLSAPYVDLYAVFDDNT-----GELLD- 122 (378) T ss_pred hhCceeeEEEcccCcccccccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeeCCC-----ceEEE- Confidence 9988764322 1111111 01 11224455543222 23556766655 567799998877775321 22110 Q ss_pred eccccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHH Q lcl|NC_017974. 154 NLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPL 233 (507) Q Consensus 154 ~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll 233 (507) + ++.. ....+|++.+| |....-++ -.|.|++ T Consensus 123 -l--------------~p~~--------------------------------~~~~~~~~dii-H~~~~~~~-~~g~s~l 153 (378) T protein:vir:94 123 -L--------------LFAD--------------------------------DKKEYKPEELV-RLTSPFYI-NEDTSIL 153 (378) T ss_pred -E--------------EecC--------------------------------CeeEeeeeeeE-EecCcCCc-cchhHHH Confidence 0 0100 11224444544 43333222 3467777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCcee--EeecCccc Q lcl|NC_017974. 234 VGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSF--FMLPSDTK 311 (507) Q Consensus 234 ~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~--~iiP~g~~ 311 (507) ..+.-..- . +. +.+.+=-++++|.. -+++......+++.+...+...|..++ ++++.|++ T Consensus 154 ~~~~~~i~------~----~~-~~~~~~gil~~~~~-------l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~ 215 (378) T protein:vir:94 154 DNALASIQ------T----KL-EQGKLRGLLKINAF-------LDIDNTQEYREKALTTIKNMQEGSSYNGLTPVDNKTE 215 (378) T ss_pred HHHHHHHH------H----HH-hcccccceeeeCCc-------CCHHHHHHHHHHHHHHHHHhhcccccccceecCCCce Confidence 66543211 1 11 12222223334422 222333334455555555555555554 66677765 Q ss_pred ccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 312 DNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQTIHTQFVQRVNEIILEALNEN 391 (507) Q Consensus 312 i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~ 391 (507) +. + .+-+....++. -.++..++|++++.-..--. .|+++ .+-.......-+.--++.|+..||+. T Consensus 216 ~~--~-------l~~~~~~~~~~-~~~~~~~~Ia~~fgVP~~~l----~~~~s-e~~~~~f~~~tL~P~~~~ie~~l~~~ 280 (378) T protein:vir:94 216 IV--E-------LKKDYSVLNKD-EIDLIKSELLTGYFMNENIL----LGTAS-QEQQIYFYNSTIIPLLIQLEKELTYK 280 (378) T ss_pred EE--E-------ccCChhhhhHH-HHHHHHHHHHHHhCCCHHHh----cCChH-HHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 32 2 22222222333 34777889999876643112 23444 22334455566777888888888887 Q ss_pred HHHHHHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhhcC Q lcl|NC_017974. 392 LLPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMKLL 471 (507) Q Consensus 392 li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~~~ 471 (507) |+.+--.-.+.+..-..-+.|.++.....|+++.++++.++++.|++.+ +.+|+.+|+|+-+.+|..-.... .. T Consensus 281 Ll~~~er~~g~~~~~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~-----NE~R~~~gl~p~~gGD~~~~~~n-~~ 354 (378) T protein:vir:94 281 LISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQ-----NQLLVKMGEQPIEGGDVYIANLN-AV 354 (378) T ss_pred cCChhHhhhhhhcccccceeecchhhhhcCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCCeeeeccc-cc Confidence 7765322111000011124566667778899999999999999998876 57999999997766554221111 11 Q ss_pred CCcCCCC--CCcccccCccccccc Q lcl|NC_017974. 472 GEDTSRA--GDGMTKGSSGNGTGK 493 (507) Q Consensus 472 ~~~~~~~--~~~~~~~~~~~~~~~ 493 (507) +...... .......+++++..+ T Consensus 355 ~~~~~~~~~~~~~~~~~~~e~~n~ 378 (378) T protein:vir:94 355 AVKNLSDLQGSRKDVTSTDETNNQ 378 (378) T ss_pred ccccchhhcCCcCCCCCCCCCCCC Confidence 1000000 000011122333332 No 95 >protein:vir:8100 Length: 466 # NCBI annotation: gp4 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817681;genbank:gi:29566112;genbank:GeneID:1259306 Probab=99.20 E-value=5.4e-10 Score=71.39 Aligned_cols=442 Identities=10% Similarity=0.022 Sum_probs=193.0 Q ss_pred Ccccccccccc-cCccccccc-cccccccccccCchhhhHHHHhhhhcccchhccchhH-HH-HHHHhhChHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSV-KVAKATSEQ-INVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLI-GT-VESMKQDATVATGLDML 76 (507) Q Consensus 1 ~~~k~~~~~~~-~~~~~~~~~-~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~-~~-y~~m~~D~~v~s~l~~R 76 (507) +.|-+...++. ++....... ...........+. +.+..++....-......-+... .+ -+..++-+.|.+|+..+ T Consensus 4 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~g~~~~~~~~~g~~v~~~~a~~~~~v~~~i~~I 82 (466) T protein:vir:81 4 IDRLLSTRGAAPRMSIDDYAQMLNEFAFNGIGYGF-GGGVPRIQQTLAGPSTELAPDTFVGLATQAYQANGPVFACMLVR 82 (466) T ss_pred hHHHhhccCcccccchhhhhhhhhhhhcccccccc-ccccHHHHHhhccccccccCccccccchhhhhccHHHHHHHHHH Confidence 44544444332 111111100 0000000111111 11111111100000000001111 11 23356789999999999 Q ss_pred HHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccC--CCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeee Q lcl|NC_017974. 77 YTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEG--QTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVK 153 (507) Q Consensus 77 k~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~--~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~ 153 (507) -..|.++++.+....+.. .+++.+.....|..-++ .++.++++.++ +.+++|-+.+++++...+....+-.-.+. T Consensus 83 a~~ia~lp~~~~~~~~~~--~~~~~~~~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~g~l~~~~~g~~~ 160 (466) T protein:vir:81 83 QLVFSSVRFRWQRLRDGK--PSDTFGSRDLQILETPWKGGTTQDMLSRMIQDADLAGNSYWTIVDGEFVRMRPDWVDVVV 160 (466) T ss_pred HHhhccCceEEEEecCCc--eeeccccHHHHHhhCCCCCCCHHHHHHHHHHHHHhcCCeEEEEEecCccccccccCccee Confidence 999999987665432211 11222222222222222 24567777766 57789999999987543211111112234 Q ss_pred eccccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCC-ccCccccchH Q lcl|NC_017974. 154 NLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGG-SSSQALGVSP 232 (507) Q Consensus 154 ~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~-~~~~p~G~gl 232 (507) .|.+.++.+.. +..+.|+..... +.+..+. .........+|.+.+|++++.. ..+..+|.|. T Consensus 161 ~l~~l~~~~v~----~~~~~~~~~~~~--------y~~~~~~-----~~~~~~~~~~~~~dviHir~~~~~~d~~~G~s~ 223 (466) T protein:vir:81 161 EERMVRGGRGE----LGGGQLGWRKVG--------YLYTEGG-----RQSGNESVGFLAEDVVHFAPIPDPLASYRGMSW 223 (466) T ss_pred EEEEecCcceE----EEEcCCCceEEE--------EEEEecC-----cccccceeeeccccEEEEcCCCCcccccccccH Confidence 55555554332 112233221110 0000000 0011233457777777766543 3455689999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccc Q lcl|NC_017974. 233 LVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKD 312 (507) Q Consensus 233 l~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i 312 (507) +..+....-.-....++-..+....+.|--+++.|.. -+++...+..+.+.+..... .+....++++.|+++ T Consensus 224 i~~~~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~-------l~~e~~~~~~~~~~~~~~g~-~n~g~~~vl~~g~~~ 295 (466) T protein:vir:81 224 LTPILREIRADQAMSKHQAKFFDNGATVNLVIKHNPM-------ADPAAVKKWADEVNSKHAGV-DNAWKNLNLYPGADA 295 (466) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCCC-------CCHHHHHHHHHHHHHHhcCc-cccccceEcCCCceE Confidence 9999887776677777778888776666555555432 23333333333333221110 011234678888754 Q ss_pred cccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhccccccc---ccchhhhhHHHHHH-HHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 313 NAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVG---NDKGGSYNLSESKQ-TIHTQFVQRVNEIILEAL 388 (507) Q Consensus 313 ~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~---~~~~GS~Al~~vh~-~v~~~~~~aD~~~i~~~l 388 (507) .- .+.+.....|.+..++...+|++++--..--.+ ....++|+-.+-+. .....-+.-.++.|+..| T Consensus 296 ~~---------l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lG~~~~~~~st~sn~eq~~~~f~~~tl~P~~~~ie~~l 366 (466) T protein:vir:81 296 DV---------VGSNLQEIDFKNVRGGGETRIAAAAGVPPVIVGLSEGLAAATYSNYGQARRRLADGTAHPLWQNLSGCI 366 (466) T ss_pred EE---------ccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcccccCCCccccccHHHHHHHHHHHHHHHHHHHHHHHH Confidence 32 222233345677778899999998643321111 11234454333233 344556677788888888 Q ss_pred HHHHHHHHHHhCCCcCCccccceEEecC--cCchhHHHHHH-------HHHHHHhcCCccccHHHHHHHHHHcCCCCCCC Q lcl|NC_017974. 389 NENLLPQLLALNDMRLPETEMPYVKAGE--IVDVDMEGFSK-------AIQRIGAVGYLPKTPKVINRVLEVLGIDEKIE 459 (507) Q Consensus 389 n~~li~~l~~lN~~~~~~~~~P~~~~~~--~~~~dl~~~a~-------~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~ 459 (507) |+.|+.. ..... -+|.|+. .-..|.+..++ .++.+++.|+ .+ +.+|+.... .+ T Consensus 367 ~~~L~~~--------~~~~~-~~~~f~~~~llr~d~~~r~~~~~~~~~~~~~~~~~g~-t~-----nE~r~~~~~---gd 428 (466) T protein:vir:81 367 GHVMPDM--------GPDVR-LWYDADDVPFLREDEKDAADIQKVRAETINTLITAGY-EP-----ESVVAAVNS---GD 428 (466) T ss_pred HhhcCCc--------ccCcc-eEEEecchhhhccCHHHHHHHHHHHHHHHHHHHHcCC-Ch-----hhccccccC---Cc Confidence 8765432 11111 1344443 33456665544 3677888886 23 345532211 00 Q ss_pred cccccchhhhcCCCcCCCCCCcccccCcccccccccCCCCchhhhccC Q lcl|NC_017974. 460 EDISQEELMKLLGEDTSRAGDGMTKGSSGNGTGKISSTRDNSAANLDN 507 (507) Q Consensus 460 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~a~~~~~ 507 (507) .....+... .+....+++.... .+. ++.......+..| T Consensus 429 ~~~~~~~~~--~~~~~~~~~~~~~----~~~----~~~~~~Gg~~ngn 466 (466) T protein:vir:81 429 LRLLKHTGL--TSVQLLPPGVSAS----ASS----DTPTSGGADDNGN 466 (466) T ss_pred cccccCCCc--chhhhcccccccc----cCC----CCcccCCCCcCCC Confidence 000000000 0000000000000 000 0000111122222 No 96 >protein:vir:93867 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764264;genbank:gi:115315577;genbank:GeneID:5141561 Probab=99.13 E-value=6.3e-10 Score=71.04 Aligned_cols=367 Identities=10% Similarity=-0.040 Sum_probs=171.2 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccc-hhHHHHHHHhhChHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWP-NLIGTVESMKQDATVATGLDMLYTF 79 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~-~~~~~y~~m~~D~~v~s~l~~Rk~a 79 (507) |.=.++.+.-.+-..++.. ...+-+. ..+ .| .-+.|.+|+..+... T Consensus 1 Mg~f~~~~~f~~~~~~~~~----------------------------~~~~~~~~~~~-~~----~~~~v~~~i~~Ia~~ 47 (378) T protein:vir:93 1 MNLFGKVVSFSRGKLNNDT----------------------------QRVTAWQNEAV-EY----TSAFVTNIHNKIANE 47 (378) T ss_pred CccchhhhhhhccccCCCc----------------------------ceeeecccchh-HH----HHHHHHHHHHHHHhh Confidence 3333222110000000000 0000000 011 11 224699999999999 Q ss_pred HhhCCceeeeCCC-CChhH--HH-HHHHHHHHHHhccC--CCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceee Q lcl|NC_017974. 80 VEKAFKDFKVIPG-ESEES--KK-AAKFVEYCLKNMEG--QTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKV 152 (507) Q Consensus 80 v~~~~~~~~~~~~-~~~~~--~~-~ae~v~~~l~~~~~--~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~ 152 (507) |.++++.+.-... ....+ .+ ...-+..+|+.-++ .+..++++.++ +.+.+|-+.+.+++... .|.+.. T Consensus 48 iA~lp~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~i~~~~~~~-----~g~~~~ 122 (378) T protein:vir:93 48 ITKVEFNHVKYKKSDVGSDTLISMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDDN-----TGELLD 122 (378) T ss_pred hhhCceeeEEEcccccccccccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeecC-----CceEEE Confidence 9998876432211 11100 01 11224455543222 23456666654 57779999877666422 222110 Q ss_pred eeccccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchH Q lcl|NC_017974. 153 KNLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSP 232 (507) Q Consensus 153 ~~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gl 232 (507) + ++. ..+..+|.+.+|+++ ..- +-..|.|+ T Consensus 123 --l--------------~~~--------------------------------~~~~~~~~~diih~r-~~~-~~~~~~s~ 152 (378) T protein:vir:93 123 --L--------------LFA--------------------------------DDKKEYKTEELVRLT-SPF-YINEDTSI 152 (378) T ss_pred --E--------------Eec--------------------------------CCeeEeccceeEEec-Ccc-ccchhhHH Confidence 0 000 012234555545443 222 22236677 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCcee--EeecCcc Q lcl|NC_017974. 233 LVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSF--FMLPSDT 310 (507) Q Consensus 233 l~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~--~iiP~g~ 310 (507) +..+.-.. ..+.. .|.+=-++++|.. -.++....+.++..+...+...|..++ ++++.|+ T Consensus 153 l~~~~~~i----------~~~~~-~~~~~g~l~~~~~-------l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~g~ 214 (378) T protein:vir:93 153 LDNALASI----------QTKLE-QGKLRGLLKINAF-------LDIDNTQEYREKALTTIKNMQEGSSYNGLTPVDNKT 214 (378) T ss_pred HHHHHHHH----------HHHHh-cCcccceeeeCCc-------CCHHHHHHHHHHHHHHHHHhhcccccccceEcCCCc Confidence 76554321 11222 2322223333322 123333444555666555555555554 5556666 Q ss_pred cccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 311 KDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQTIHTQFVQRVNEIILEALNE 390 (507) Q Consensus 311 ~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~ 390 (507) ++. ..+-+....++ ...++..++|++++.-..--. .|+++ .+........-+.-.++.|+..||+ T Consensus 215 ~~~---------~l~~~~~~~~~-~~~~~~~~~Ia~~fgVPp~~l----~g~~~-e~~~~~f~~~tl~P~~~~ie~~l~~ 279 (378) T protein:vir:93 215 EIV---------ELKKDYSVLNK-DEIDLIKSELLTGYFMNENIL----LGTAT-QEQQIYFYNSTIIPLLIQLEKELTY 279 (378) T ss_pred eEE---------EccCChhhhhH-HHHHHHHHHHHHHhCCCHHHh----cCCcH-HHHHHHHHHHHHHHHHHHHHHHHHh Confidence 532 22222222223 344778889999876653111 23333 1223344456677888999999998 Q ss_pred HHHHHHHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhhc Q lcl|NC_017974. 391 NLLPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMKL 470 (507) Q Consensus 391 ~li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~~ 470 (507) .|+..--.--+.+.......+|.++.....|+++.++++.+++..|++.+ +.+|+.+|+|+-+.+|.......-. T Consensus 280 kLl~~~er~~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~-----NE~R~~~gl~p~~ggD~~~~~~n~~ 354 (378) T protein:vir:93 280 KLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQ-----NQLLVKMGEQPIEGGDVYIANLNAV 354 (378) T ss_pred hcCChhHhhhhhhcccccceeeccchhhhcCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCCeeeeccccc Confidence 88754211000000011223566677788999999999999999998876 5799999999766655322111111 Q ss_pred CCCcCCCCCCc-ccccCccccccc Q lcl|NC_017974. 471 LGEDTSRAGDG-MTKGSSGNGTGK 493 (507) Q Consensus 471 ~~~~~~~~~~~-~~~~~~~~~~~~ 493 (507) +.+.......+ ....+.+++.++ T Consensus 355 ~~~~~~~~~~~~~~~~~~~e~~n~ 378 (378) T protein:vir:93 355 AVKNLSDLQGSRKDVTSTDETNNQ 378 (378) T ss_pred cccchhhhcCccCCCCCCCCCCCC Confidence 10110000000 011122333333 No 97 >protein:vir:1661 Length: 378 # NCBI annotation: unknown # Family: family:all:2379 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044950;genbank:gi:9629657;genbank:GeneID:1261302 Probab=99.11 E-value=8e-10 Score=70.45 Aligned_cols=367 Identities=10% Similarity=-0.024 Sum_probs=167.9 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccc-hhHHHHHHHhhChHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWP-NLIGTVESMKQDATVATGLDMLYTF 79 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~-~~~~~y~~m~~D~~v~s~l~~Rk~a 79 (507) |.=-++.+. -.+...........-+. +.+ .| .-+.|.+|+..+... T Consensus 1 Mg~f~~~~~----------------------------~~~~~~~~~~~~~~~~~~~~~-~~----~~~~v~~~i~~Ia~~ 47 (378) T protein:vir:16 1 MNLFGKVVS----------------------------FSRGKLNNDTQRVTAWQNEAV-EY----TSAFVTNIHNKIANE 47 (378) T ss_pred Cccchhhhh----------------------------hhcccccCCcceeeecccchh-hH----HHHHHHHHHHHHHhh Confidence 211111100 00000000000000011 111 12 335689999999999 Q ss_pred HhhCCceeeeC-CCCC--hhHHH-HHHHHHHHHHhccC--CCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceee Q lcl|NC_017974. 80 VEKAFKDFKVI-PGES--EESKK-AAKFVEYCLKNMEG--QTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKV 152 (507) Q Consensus 80 v~~~~~~~~~~-~~~~--~~~~~-~ae~v~~~l~~~~~--~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~ 152 (507) |.++++.+.-. .++. ....+ ...-+.++|+.-++ .+..++++.++ +.+++|-+.+.++|.-. .|.++ T Consensus 48 iA~l~~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~d~~-----~g~~~- 121 (378) T protein:vir:16 48 ITKVEFNHVKYKKSDVGSDTLISMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDDN-----TGELL- 121 (378) T ss_pred hhhCceeEEEEcccccccccccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeecC-----CceEE- Confidence 99988754221 1111 00001 11224445542222 24556666654 46679999988887532 12211 Q ss_pred eeccccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchH Q lcl|NC_017974. 153 KNLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSP 232 (507) Q Consensus 153 ~~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gl 232 (507) .+ ++.. ....+|.+..|+++ .. ...-.|.++ T Consensus 122 -~l--------------~~~~--------------------------------~~~~~~~~diih~r-~~-~~~~~~~s~ 152 (378) T protein:vir:16 122 -DL--------------LFAD--------------------------------DKKEYKPEELVRLT-SP-FYINEDTSI 152 (378) T ss_pred -EE--------------EecC--------------------------------CeeEecccceEEec-Cc-cCccchhHH Confidence 01 0100 01223444444443 22 112235566 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCcee--EeecCcc Q lcl|NC_017974. 233 LVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSF--FMLPSDT 310 (507) Q Consensus 233 l~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~--~iiP~g~ 310 (507) +..+.-.. ..+.. .|.+=-++++|. .-+++...+..+++.+...+...|+.++ ++++.|+ T Consensus 153 l~~~~~~i----------~~~~~-~~~~~g~l~~~~-------~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~ 214 (378) T protein:vir:16 153 LDNALASI----------QTKLE-QGKLRGLLKINA-------FLDIDNTQEYREKALTTIKNMQEGSSYNGLTPVDNKT 214 (378) T ss_pred HHHHHHHH----------HHHHh-cCccceeeEeCC-------cCCHHHHHHHHHHHHHHHHHhhcccccccceEcCCCc Confidence 65544221 11111 222222333332 2223333334455555555555565555 5566666 Q ss_pred cccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 311 KDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQTIHTQFVQRVNEIILEALNE 390 (507) Q Consensus 311 ~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~ 390 (507) ++. ..+.+....++. ..++..++|++++.-..--. .|+++- +-.......-+.-.++.|+..||+ T Consensus 215 ~~~---------~l~~~~~~~~~~-~~~~~~~~Ia~~fgVPp~~l----~g~~~e-~~~~~f~~~tl~P~~~~ie~~l~~ 279 (378) T protein:vir:16 215 EIV---------ELKKDYSVLNKD-EIDLIKSELLTGYFMNENIL----LGTASQ-EQQIYFYNSTIIPLLIQLEKELTY 279 (378) T ss_pred eEE---------EccCChhhhhHH-HHHHHHHHHHHHhCCCHHHh----cCCchH-HHHHHHHHHHHHHHHHHHHHHHHh Confidence 532 222222222333 34788889999876654212 234442 222334455667788888888888 Q ss_pred HHHHHHHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhhc Q lcl|NC_017974. 391 NLLPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMKL 470 (507) Q Consensus 391 ~li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~~ 470 (507) .|+.+--...+.......-.+|.++.....|+++.++++.+++..|++.+ +.+|+.+|+|+-+.+|..-....-. T Consensus 280 kLl~~~e~~~~~~~~~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~-----NE~R~~~g~~p~~ggD~~~~~~n~~ 354 (378) T protein:vir:16 280 KLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQ-----NQLLVKMGEQPIEGGDVYIANLNAV 354 (378) T ss_pred hcCChhhhhhhhhcccccceeeccchhhhcCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCCeEeeccccc Confidence 87755322111001111223566677778899999999999999998776 5799999999765555321111111 Q ss_pred CCCcCCC-CCCcccccCccccccc Q lcl|NC_017974. 471 LGEDTSR-AGDGMTKGSSGNGTGK 493 (507) Q Consensus 471 ~~~~~~~-~~~~~~~~~~~~~~~~ 493 (507) ..+.... .+......+.+++.++ T Consensus 355 ~~~~~~~~~~~~~~~~~~~e~~ne 378 (378) T protein:vir:16 355 AVKNLSDLQGSRKDVTSTDETNNQ 378 (378) T ss_pred cccchhhhcCccCCCCCCCCCCCC Confidence 1000000 0000011112233332 No 98 >protein:vir:4828 Length: 382 # NCBI annotation: ORF24 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038325;genbank:gi:9634651;genbank:GeneID:1262630 Probab=99.10 E-value=1.5e-09 Score=69.04 Aligned_cols=380 Identities=13% Similarity=0.044 Sum_probs=187.9 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHH-HHHhhChHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTV-ESMKQDATVATGLDMLYTF 79 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y-~~m~~D~~v~s~l~~Rk~a 79 (507) |.=.++..+... ++.... .. + .+......+ .... .+. +..++-+.|.+|+..+-.. T Consensus 1 Mg~f~~~~~~~~--~~~~~~--~~---~--~~~~~~~~~------------~~~~--~v~~~~~l~~~~v~~~i~~ia~~ 57 (382) T protein:vir:48 1 MPIFNLATESPP--DNQGGF--FD---V--VDSDFLASL------------KGNE--WVSAETALRNSDLFSIINQLSND 57 (382) T ss_pred CccccccccCCc--cccccc--cc---c--hhhhccccc------------cCCc--ccchHhhhccHHHHHHHHHHHHh Confidence 655444333211 111000 00 0 000000000 0000 111 2235678899999999999 Q ss_pred HhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeeeecccc Q lcl|NC_017974. 80 VEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAFR 158 (507) Q Consensus 80 v~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~r 158 (507) |.++++... ... . +.+... --...++.++++.++ +.+.+|-++++++.... |+ +..|.+. T Consensus 58 ia~~~~~~~--~~~--~-----~~L~~~--PN~~~t~~~f~~~l~~~l~l~Gna~~~i~rd~~------G~--~~~l~~i 118 (382) T protein:vir:48 58 LATVKLITS--RKK--L-----QGIVDN--PSNNANRFNFYQSIFAQMLLGGEAFAYRWRNEN------GR--DMKWEYL 118 (382) T ss_pred hccCceeee--cch--h-----hhhhhh--cCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCC------Cc--EEEEEEE Confidence 988866543 211 0 111111 011235678888877 57789999999876432 32 3355555 Q ss_pred CchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHHHH Q lcl|NC_017974. 159 PQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYR 238 (507) Q Consensus 159 ~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~ 238 (507) ++..+. +..+.++..+.. .+.... ...+....+|...++++++....+.++|.|.+..+.. T Consensus 119 ~~~~v~----v~~~~~~~~~~y---------~~~~~~------~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~ 179 (382) T protein:vir:48 119 RPSQVS----FNRLDNKDGIYY---------NITFDD------PRIPPKQHVPQNDVLHFRLLSVDGGMTSVSPLMALSR 179 (382) T ss_pred cCceeE----EEEcCCCCeEEE---------EEEecC------ccccceeEEcCccEEEecCCCCCCccccccHHHHHHH Confidence 554432 122333322211 000000 0112334577778777777767778999999999988 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccccc Q lcl|NC_017974. 239 AWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYS 318 (507) Q Consensus 239 ~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~ 318 (507) ..-.-....++...+...-+.|--++++|- .-.+++..+..++..+.. .++...++++.|+++. T Consensus 180 ~i~~~~~~~~~~~~~~~ng~~p~~il~~~~-------~~~~e~~~~~~~~~~~~~----~n~g~~~vl~~g~~~~----- 243 (382) T protein:vir:48 180 ELDIQKASGNLTINSLKNALNANGILKIKG-------GGLLDFKTKLSRSRQAMK----QMQGGPLVLDDLEDFT----- 243 (382) T ss_pred HHHHHHHHHHHHHHHHhccCCCceEEEeCC-------CCChHHHHHHHHHHHhhc----cCCCCeeEcCCCceEE----- Confidence 777777777888888887777766666542 222333333333333322 2333457788887543 Q ss_pred eeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 319 MTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQTIHTQFVQRVNEIILEALNENLLPQLLA 398 (507) Q Consensus 319 ~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~ 398 (507) ..+-+.....+.+..++..++|++++.-...-.+....+++ ..+.........+.--++.|++.||+.|..++ . T Consensus 244 ----~l~~~~~d~q~~e~~~~~~~~Ia~afgVp~~~lg~~~~~~~-~~~~~~~~~~~~l~p~~~~i~~~l~~~l~~~~-~ 317 (382) T protein:vir:48 244 ----PLEIKSNVSQLLKQADWTTGQFAKVYGIPDNVVGGQGDQQS-SLEMSSDLYSKAVSRYLRPFLSELSQKLSCDV-D 317 (382) T ss_pred ----EccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCccc-HHHHHHHHHHHHHHHHHHHHHHHHHHHhcChh-h Confidence 22222223346677788889999987555433332222222 23333445555666678888888887665443 1 Q ss_pred hCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhhcCCCcCCCC Q lcl|NC_017974. 399 LNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMKLLGEDTSRA 478 (507) Q Consensus 399 lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~~~~~~~~~~ 478 (507) ++ ..+.+. .|-..+...+.+|+..|+... +.+|+.++-..-.. ++.+.... +.++-+- T Consensus 318 ~~-------~~~~~~------~~~~~~~~~~~~l~~~g~~t~-----~e~r~~l~~~g~~~-~~~~~~~~---~~~~~~G 375 (382) T protein:vir:48 318 AD-------IFPAVD------PTGSNYISRINSLVKTGTLAQ-----NQGLYILQQAEILP-KELPNGEN---PNSTLKG 375 (382) T ss_pred hh-------hhhhhc------cchhHHHHHHHHHhhcCccCH-----HHHHHHHhhCCCCC-cchhhhhc---CCCCCCC Confidence 11 011111 122344556778888888765 45777663211011 11100000 0000111 Q ss_pred CCcccccCcccc Q lcl|NC_017974. 479 GDGMTKGSSGNG 490 (507) Q Consensus 479 ~~~~~~~~~~~~ 490 (507) |+. .++. T Consensus 376 Gd~-----~~~~ 382 (382) T protein:vir:48 376 GEE-----DGQD 382 (382) T ss_pred CCC-----CCCC Confidence 111 0000 No 99 >protein:vir:94049 Length: 532 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453629;genbank:gi:84662665;genbank:GeneID:5142559 Probab=99.10 E-value=2.2e-09 Score=68.03 Aligned_cols=455 Identities=11% Similarity=-0.024 Sum_probs=195.1 Q ss_pred CcccccccccccCcccccccc--ccccccccccCch----hh--hHHHHhhhhcccch--hccchhHHHHHHHhhChHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQI--NVSRMRMSEQGTF----AL--AKVQVDSERMKAEE--IRWPNLIGTVESMKQDATVA 70 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~--~~~~~~~~~~g~~----~~--~~~~~~~~~e~~~~--lr~~~~~~~y~~m~~D~~v~ 70 (507) |..||+++++..++.-....+ ..|+.+-+-.+.. |+ ...... ....... +.+-+++.+| ...+-+. T Consensus 23 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~~~~~~~-~~~~~~~~~~~~~~l~a~Y---~~~~l~r 98 (532) T protein:vir:94 23 VDAKRATHTSLGLATAHEIDPTAYSPYERNAAQNAMAMDYGLQTGRNGRN-ALSFVEATSWPGFPTLALL---AQLPEYR 98 (532) T ss_pred hhhhhhhhhhhhhhhhhhhcccccccccccccccccccccccCccccccc-ccccccccccchHHHHHHH---HcCchhh Confidence 888888888777666433322 1122111111100 11 000000 0011111 1111333333 3588888 Q ss_pred HHHHHHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHHHHHhhcceeeeeeeeccCc-cccC-- Q lcl|NC_017974. 71 TGLDMLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAATFNEYGLSVVEKVYTQVTV-GEYI-- 147 (507) Q Consensus 71 s~l~~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l~a~~~Gfs~~Ei~w~~~~~-~~~~-- 147 (507) .++++.-.-+++-.|+|....+ .+.+.+..+.++..++.+.. |..+...+-.+.+||.+.+=+.-+-.+. ..++ T Consensus 99 ~~Vd~~aed~~r~~~~i~~~~~-~~~~~~~~~~i~~~~~~l~v--~~~l~~a~~~~rlyG~a~i~i~v~~~~~~~~~~~p 175 (532) T protein:vir:94 99 TMHETPADECVRAWGKITCSSK-DELAADKATRITQKLEQYNV--RTLVRTVVIHDQAYGGAHVFPHLKMDGDSVPADAP 175 (532) T ss_pred hhhccchHHHhhCCceEeeCCc-cccchHHHHHHHHHHHhhhH--HHHHHHHHHhhhcccceEEEEEeccCCcccccccc Confidence 9999888888888888765432 22234556667777776642 4444444556889999874332211000 0010 Q ss_pred --------cceeeeeccccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEe Q lcl|NC_017974. 148 --------GKYKVKNLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMN 219 (507) Q Consensus 148 --------g~~~~~~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~ 219 (507) ++..++.|.+..+ ++...+.. -.++|... .++....+.. ..+..|-+.+++.|. T Consensus 176 ~~l~~~~I~~g~~~~l~vld~--------~~v~p~~~----~~~dp~sp---~fg~P~~y~v---~~g~~iH~SRli~f~ 237 (532) T protein:vir:94 176 LLLSPSFVQRGCLIGFATIEP--------MWLSPNAY----NATDPTLP---SFYKPDSWIA---TSGKKIHSSRIHTVV 237 (532) T ss_pred ccccccccccceeeEEEeech--------heeccccc----cccccccc---ccCCceeEEE---ccCeeeccceEEEec Confidence 0111122222211 11111110 00111110 1111111111 123467777877775 Q ss_pred eCC------ccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHH Q lcl|NC_017974. 220 TGG------SSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDA 293 (507) Q Consensus 220 ~~~------~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~ 293 (507) ... ...+.+|.+++..+|-...--.....--++.+.++.. .++++-.. + .-+.+......+.+ +.+ T Consensus 238 g~~~p~~~~~~~~~~G~Svlq~~~~~l~~~~~t~~~~~~l~~~~~~--~v~k~~~a--~---~ls~~~~~~~~~r~-~~~ 309 (532) T protein:vir:94 238 GRPVGDMLKAAYSFRGVSISQLAMPYVDNWLRTRQSVSDTVKQFSM--TNLATDMA--Q---LLAPGGAQSLDARL-QLF 309 (532) T ss_pred CCCchhhhccccccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCC--ceeeechH--H---hhcchhHHHHHHHH-HHH Confidence 332 2345579999999987764433334444455555544 34443110 0 00111222222222 222 Q ss_pred HHhhcCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhccccc--ccccchhhhhHHHHHHH Q lcl|NC_017974. 294 ANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFIN--VGNDKGGSYNLSESKQT 371 (507) Q Consensus 294 ~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLt--s~~~~~GS~Al~~vh~~ 371 (507) ...+ +....++++.+.+ +++.+..+-+| ...+++..-.+||-+. |-.+| .|.+.+|-.|.|+-... T Consensus 310 ~~~~-~n~g~~~id~~~e------~~e~~~~~lsg----l~~~l~~~~~~iAaa~-~IP~t~LfG~sp~GlnstGe~D~~ 377 (532) T protein:vir:94 310 NLYR-DNRNIGALDKGTE------EIQQTNTPLSG----LDSLQAQSQEQMAAVS-HIPLVKLLGITPNGLNASSDGEIR 377 (532) T ss_pred Hhhc-CCccceEEcCCCc------eeEEEecccCC----HHHHHHHHHHHHHhHh-CCCeeeeecCCcccccccchHHHH Confidence 2222 3345577776532 22333333332 3557788777887554 33223 23444566666776777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCccccceEEecCcCchhHH-------HHHHHHHHHHhcCCccccHHH Q lcl|NC_017974. 372 IHTQFVQRVNEIILEALNENLLPQLLALNDMRLPETEMPYVKAGEIVDVDME-------GFSKAIQRIGAVGYLPKTPKV 444 (507) Q Consensus 372 v~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~-------~~a~~~~~L~~~G~~~~~~~~ 444 (507) ..-+.+++-......-+.+.|+.-|+..-++..++. =.|+|...-..+-+ ..+++++++++.|++.+ T Consensus 378 ~yyd~I~s~Qe~~l~p~le~l~~~l~~s~~g~~~~d--~~~~f~pL~~~s~kEkAei~~~~a~a~~~~~~~Gvi~~---- 451 (532) T protein:vir:94 378 VWYDFIAGYQATNLTPLMEWIIDLIQLSEYGQIDPG--LAWEWSPLMELDDKELAEVRQLNASTDSTLMELGVIDA---- 451 (532) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCC--ceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCH---- Confidence 788888887655544444456666654332222211 14566644433333 34677888999998665 Q ss_pred HHHHHHHcCCCCCCC----------cccccchhhhcCCCcCCCC-C--Cccc-ccCcccccccc-cC-CCCchhhh--cc Q lcl|NC_017974. 445 INRVLEVLGIDEKIE----------EDISQEELMKLLGEDTSRA-G--DGMT-KGSSGNGTGKI-SS-TRDNSAAN--LD 506 (507) Q Consensus 445 ~~~i~e~~Gip~~~~----------~~~~~~~~~~~~~~~~~~~-~--~~~~-~~~~~~~~~~~-~~-~~d~~a~~--~~ 506 (507) +.+|+.++..+... .+..........++...++ . .+.+ ...+..++++. ++ ..|+.... .- T Consensus 452 -~Evr~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~ 530 (532) T protein:vir:94 452 -KMVQQRLAADPTSGYAGALGERDELDDVEEIAKQLMAAALNPPATAPQTPNPQPDSEDDQTDNQPDAQADPAQNDQPVG 530 (532) T ss_pred -HHHHHHHhcCCccccccccccccccccccchhhhhcccccCCCCCCCCCCCCCCCCCCCCCCCccCCCccccccCCCcC Confidence 45777776542110 0000000000000000000 0 0000 00000111100 00 00010000 11 Q ss_pred C Q lcl|NC_017974. 507 N 507 (507) Q Consensus 507 ~ 507 (507) | T Consensus 531 ~ 531 (532) T protein:vir:94 531 N 531 (532) T ss_pred C Confidence 1 No 100 >protein:vir:5249 Length: 437 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852754;genbank:gi:31544029;interpro:IPR006445;uniprot:Q7Y5U6;genbank:GeneID:2753529 Probab=99.02 E-value=5.2e-09 Score=66.02 Aligned_cols=423 Identities=12% Similarity=0.081 Sum_probs=185.5 Q ss_pred ccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHHHhhCCceeeeC Q lcl|NC_017974. 11 VKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTFVEKAFKDFKVI 90 (507) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~av~~~~~~~~~~ 90 (507) .++...-. +. ..-.|+-.-+.+.. ..-...+ ....++...+..+-+..++++.-.-.++..|+|..+ T Consensus 1 ~~~~D~~~-~~------~~~~g~~~~~~~~~-----~~~~~~~-~~~~l~a~Y~~~~l~~~~vd~~a~d~~r~~~~i~~~ 67 (437) T protein:vir:52 1 MKFFDGIK-SL------ALKLGSKQEQTYYS-----PSLSLTD-DLVQLEALWRDNWIANKVCIKRPEDMVRNWREIYSN 67 (437) T ss_pred CchhhhhH-hH------HhcCCCccccceee-----cCccccc-cHHHHHHHHHhCchhhHHhhcchHHhhcCCceEecC Confidence 00000000 00 00001000000000 0000000 112222233458888889988777666766776542 Q ss_pred CCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHHHHHhhcceeeeeeeeccCccc---cCcceeeeeccccCchhccccc Q lcl|NC_017974. 91 PGESEESKKAAKFVEYCLKNMEGQTLRQFARDAATFNEYGLSVVEKVYTQVTVGE---YIGKYKVKNLAFRPQASLSRTN 167 (507) Q Consensus 91 ~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l~a~~~Gfs~~Ei~w~~~~~~~---~~g~~~~~~l~~r~~~~i~~~~ 167 (507) . . +.+..+.+++.++.+.. |..+...+..+-+||=+++=+ ..++... .+..-.++.|.+.++..+.. T Consensus 68 d---~-~~~~~~~~~~~~~~l~~--~~~l~~a~~~~rl~G~a~i~i--~~d~~~~~~pl~~~~~~~~~~v~~~~~v~~-- 137 (437) T protein:vir:52 68 D---L-NSKQLDLFTKFERSLKL--RETLTKALQWSSLYGSVGLLV--VTDSQNTSAPLKPTERLKRLIILPKWKISP-- 137 (437) T ss_pred C---C-CHHHHHHHHHHHHhhcH--HHHHHHHHHhcccccceEEEE--EecCCCcccccccCCceeEEEEechhhccc-- Confidence 1 1 12333456666776642 444444444588999876533 2222110 00011233333333322210 Q ss_pred ceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeC---CccCccccchHHHHHHHHHHHHH Q lcl|NC_017974. 168 PIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTG---GSSSQALGVSPLVGCYRAWREKI 244 (507) Q Consensus 168 ~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~---~~~~~p~G~gll~~~~~~~~fK~ 244 (507) .+ ...++|.. -.++.+..+..........+.+.++++|... ....+.+|.|+|..+|....--. T Consensus 138 ------~~----~~~~dp~s---~~fg~p~~y~v~~~~~~~~iH~SRii~~~~~~~~~~~~~~~G~s~le~~~~~i~~~~ 204 (437) T protein:vir:52 138 ------TG----TKDDDVLS---PNFGRYSEYSILGGSQSITVHHSRLIILNANDAPLSDNDIWGVSDLEKIIDVLKRFD 204 (437) T ss_pred ------cc----cccccccc---cccCcceEEEEecCCcceeEccceeEEecCccCCCccccccCCchHHHHHHHHHHHH Confidence 00 00111111 1122233333333344567888888887532 24467789999999998765544 Q ss_pred HHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccccceeeeec Q lcl|NC_017974. 245 LIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYSMTLKGI 324 (507) Q Consensus 245 ~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~ 324 (507) ....-=+..+.++. +.+++++- +.+.-....++ .+....+.+...++ ....+++..+-+++ .+.. T Consensus 205 ~~~~~~~~l~~~~~--~~v~k~~~-l~~~l~~~~~~----~~~~~~~~~~~~~~-~~~~~~~d~~~~~e-------~~~~ 269 (437) T protein:vir:52 205 SASVNVGDLIFESK--IDIFKIAG-LSDKIAAGMEN----EVASVISAVQEIKS-ATNSLLLDAENEYD-------RKEL 269 (437) T ss_pred HHHHHHHHHHHHcC--CCceecch-HHHHhcCCcHH----HHHHHHHHHHHhcC-CCceEEEcCCcceE-------EEec Confidence 44444455666654 44555431 11111111111 22333333333333 34567777664433 3333 Q ss_pred cCCCcchhHHHHHHHHHHHHHHHHhcccc-cccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCc Q lcl|NC_017974. 325 DGMGKQYSTAQLISDRKKSILDRLGAGFI-NVGNDKGGSYNLSESKQTIHTQFVQRVNEIILEALNENLLPQLLALNDMR 403 (507) Q Consensus 325 ~g~g~~~~~~~li~~~d~~Isk~iLGqtL-ts~~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~ 403 (507) +-+| ...+++..-.+||.+.--..- -.+.+.+| .|.|+-.....-+.+++.......-+.+.|+..|+.--+.. T Consensus 270 ~~sg----l~~~l~~~~~~iaaa~~iP~t~L~G~s~~G-lasge~D~~~yyd~i~~~Qe~~l~p~le~l~~~i~~~~~g~ 344 (437) T protein:vir:52 270 TFTG----LKDLLTEFRNAVAGAADMPVTILFGQSVSG-LASGDEDIQNYHEAIRRLQETRLRPIFEIIDPLICNELFGG 344 (437) T ss_pred CcCC----HHHHHHHHHHHHHHHhcCchhhhcCcCccc-ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Confidence 2222 345777777888875533321 11222233 36677777777888888776544444455777665433221 Q ss_pred CCccccceEEecCcCchh-------HHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhhcCCCcCC Q lcl|NC_017974. 404 LPETEMPYVKAGEIVDVD-------MEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMKLLGEDTS 476 (507) Q Consensus 404 ~~~~~~P~~~~~~~~~~d-------l~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~~~~~~~~ 476 (507) .+++ =.|.|...-..+ .+..+++++++++.|++.+ .+..+.+++.-..+.-.++++..........+... T Consensus 345 ~~~~--~~~~f~pL~~~s~kekae~~~~~a~a~~~~~~~g~i~~-~e~r~~L~~~g~~~~i~~~~~~~~~~~~~~~~~~~ 421 (437) T protein:vir:52 345 LPAD--WWFEFVPLTTVKQEQQINMLNTFATAANTLIQNGVLNE-YQIANELRESGLFANISAEHIEELKNADEFAGNFE 421 (437) T ss_pred CCCc--ceEEeCCcCCcCHHHHHHHHHHHHHHHHHHHhcCCCCH-HHHHHHHHhcCCCCCCCccccccccCCCCCCCccC Confidence 1211 135565433333 2456778999999998777 34456666653333222222111111111111111 Q ss_pred CCCCcccccCcccccccccCCCCch Q lcl|NC_017974. 477 RAGDGMTKGSSGNGTGKISSTRDNS 501 (507) Q Consensus 477 ~~~~~~~~~~~~~~~~~~~~~~d~~ 501 (507) ++ +... + ..++.+-+. T Consensus 422 ~~-~~~~----~----~~~~~~~~~ 437 (437) T protein:vir:52 422 EP-EKME----G----AQVQNSEDQ 437 (437) T ss_pred CC-CCCC----C----CCCCCCCCC Confidence 10 0000 0 001111111 No 101 >protein:vir:80040 Length: 461 # NCBI annotation: gp3 # Family: family:all:297 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468707;genbank:gi:157325287;genbank:GeneID:5601731 Probab=98.96 E-value=9.3e-09 Score=64.61 Aligned_cols=434 Identities=13% Similarity=0.044 Sum_probs=196.1 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhH-HHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAK-VQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTF 79 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~-~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~a 79 (507) |.+.++-+++.-..+..+..-- +.-+|. |... .......-....+-+..+..+| +.++-+..++++--.- T Consensus 1 ~~~~~~a~~~~~~~~a~~~~~~-----~~~~g~-~~~~d~~~~~~~~~~~~~~~~~l~~lY---~~~~l~r~iVd~~a~d 71 (461) T protein:vir:80 1 MYSIDKAKQAKIDSKIVNRNDF-----MVGHGK-ANSRDKLTRQTPGNGQKLDLKACENLY---ASNSIAMNIVDIISED 71 (461) T ss_pred CccchhhhhhhhhhhhhhhhHH-----HhhcCC-cchhhhhhccccCcccccCHHHHHHHH---HhCCccchhhccchHH Confidence 8876665554432222211100 011111 1100 0011100000011112222333 3577777788877777 Q ss_pred HhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHHHHHhhcceeeeeeeeccCc-------cccCcce-e Q lcl|NC_017974. 80 VEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAATFNEYGLSVVEKVYTQVTV-------GEYIGKY-K 151 (507) Q Consensus 80 v~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l~a~~~Gfs~~Ei~w~~~~~-------~~~~g~~-~ 151 (507) .++-.|.|..+ +++..+.+++.++.+.. |..+...+-.+.+||.+.+=+.-..... ....+.+ . T Consensus 72 ~~r~g~~i~~~------~~~~~~~~~~~~~~l~~--~~~l~~~~~~~rl~G~a~i~i~v~d~~~~~~~~~~pl~~~~~~~ 143 (461) T protein:vir:80 72 MVRAGWSLKTD------NKEMKKNIESKWRKLKT--KDRFQKLYADKRLYGDGFLSIGVVSSNREQADLSTAIDPKTIKS 143 (461) T ss_pred hhcCCeeeecC------CHHHHHHHHHHHHHhhH--HHHHHHHHHhhcccccEEEEEEeecCCccccCccCCcccccccc Confidence 77766666432 23344456666665542 5555565667999999876443221100 0011111 1 Q ss_pred eeeccccCchhcccccceeecCCCcccccccccccccccccccccc--ccccccCCCccccccccEEEEeeCCccCcccc Q lcl|NC_017974. 152 VKNLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIG--VGGVSTRMSDVIIPINRVMLMNTGGSSSQALG 229 (507) Q Consensus 152 ~~~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~--~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G 229 (507) +..|.+-.+..+.. ...+.|-.. -....|..+......... ............|-+.+++.+....-.+..+| T Consensus 144 ~~~l~~~~~~~i~~---~~~~~dp~s--p~fg~P~~y~i~~~~~~~~~~~~~~~~~~~~~iH~SRii~~~~~~~~~~~~G 218 (461) T protein:vir:80 144 IPYINTFNTQKVTQ---LYLNQDMFS--EHFGEVEFFEVNRVSQLGEEILSGTTASTSEQIHRSRIIHEQGLRFEGETKG 218 (461) T ss_pred eeEEEeccccccch---hhhcccCcC--cccccceEEEEeccccccccccccccCccceEEccccEEEecCCCCCccccC Confidence 22222211111100 001111000 000011111110000000 00011122335577788888887777788889 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCc Q lcl|NC_017974. 230 VSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSD 309 (507) Q Consensus 230 ~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g 309 (507) .|++..+|....--.....-=++.+.++... +++++. + .... .+... ++.+.+..++.+ .+.+++..+ T Consensus 219 ~S~le~~~~~l~~~~~~~~~~~~l~~~~~~~--v~k~~~-l---~~~~-~~~~~----~~~~~~~~~~~~-~g~~~~d~~ 286 (461) T protein:vir:80 219 RSIFESLYDIITVMDTSLWSVGQILYDFAFK--VYKTDD-I---DALN-KDDKA----NLTAMLDFMFRT-EALAIIKGD 286 (461) T ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHhCCC--ceecch-H---Hhhh-chHHH----HHHHHHHHhcCC-ceEEEEcCC Confidence 9999999987755555555555666666544 555431 1 1111 11111 122222233333 345666665 Q ss_pred ccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhccccc-ccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 310 TKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFIN-VGNDKGGSYNLSESKQTIHTQFVQRVNEIILEAL 388 (507) Q Consensus 310 ~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLt-s~~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~l 388 (507) -++ +.+..+-+| ...+++..-.+||-+.--..-- .+++ .|.+|.|+-...+.-+.+++..+....-+ T Consensus 287 e~~-------e~~~~~lsg----l~~~l~~~~~~iaa~s~iP~t~L~G~s-~g~~asge~D~~~yyd~i~~~qe~~l~p~ 354 (461) T protein:vir:80 287 EQL-------TKESTNVSG----MKDLLDYGWDYLAGAVRMPKTVLKGQE-AGTLTGAQYDVMNYYARVSSIQENRLRPQ 354 (461) T ss_pred cce-------EEEecCcCC----HHHHHHHHHHHHhhhhcCCeeeeeccc-CCccccchHHHHHHHHHHHHHHHHHHHHH Confidence 433 333333332 3457777777887765333311 1233 36677777777888888888887655555 Q ss_pred HHHHHHHHHHhCCCcCCcccc----ceEEecCcCchhHH-------HHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCC Q lcl|NC_017974. 389 NENLLPQLLALNDMRLPETEM----PYVKAGEIVDVDME-------GFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEK 457 (507) Q Consensus 389 n~~li~~l~~lN~~~~~~~~~----P~~~~~~~~~~dl~-------~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~ 457 (507) .+.|++-|+.--+...+...+ =.|.|...-+.+-+ ..|++++++++.|++.+ .+..+.++.++|+.+. T Consensus 355 le~l~~~i~~s~~~~~~~~~p~~~~~~i~f~~L~~~s~kekAe~~~~~a~a~~~~~~~g~is~-~e~r~~l~~~~~~~~~ 433 (461) T protein:vir:80 355 LEYLTRLLMWASDDCGPSIDPDSFEWAIEFNPLWNLDSKTDAEVRKLTAEADQIYIVNGVLDP-DEVKETRFGRFGLENS 433 (461) T ss_pred HHHHHHHHHHHhcccccccCccccceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCH-HHHHHHHHHhcCCCCC Confidence 455777666422222221111 13667655443333 34567888999998776 3455667778888543 Q ss_pred CCc--cc-ccchhhhcCC-CcCCCCCCc Q lcl|NC_017974. 458 IEE--DI-SQEELMKLLG-EDTSRAGDG 481 (507) Q Consensus 458 ~~~--~~-~~~~~~~~~~-~~~~~~~~~ 481 (507) ..- .. .......... ..+...++| T Consensus 434 ~~~~~~~~~~~~~~~~~~~~~~~e~~~g 461 (461) T protein:vir:80 434 SKFSGDSAEIDKLAKLVYDAYAKKNADG 461 (461) T ss_pred ccCCCCCchhhhhhhhccccccccCCCC Confidence 211 10 0000000000 001111111 No 102 >protein:vir:5691 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839850;genbank:gi:30065705;genbank:GeneID:1260599 Probab=98.91 E-value=3.6e-09 Score=66.87 Aligned_cols=335 Identities=12% Similarity=0.114 Sum_probs=145.8 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhc---ccchhccchhHHHHHHH-hhChHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERM---KAEEIRWPNLIGTVESM-KQDATVATGLDML 76 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e---~~~~lr~~~~~~~y~~m-~~D~~v~s~l~~R 76 (507) |+|||.++.++.....+.......-+..++ +...++...+..+.+ ++.--.-|=...-..++ +..+|.+|+|.-+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~~a~~~h~s~i~~k 79 (344) T protein:vir:56 1 MSKKKGKTPQPAAKTMTASAPKMEAFTFGE-PVPVLDRRDILDYVECISNGRWYEPPVSFTGLAKSLRAAVHHSSPIYVK 79 (344) T ss_pred CCCCCCCCCchhhHHhhcCCCceEEEEcCC-ceeecCcchhhhHHHhhhcCccccCCCCHHHHHHHHhhhhhhCccceeh Confidence 999998766554333222222222222221 001111111111111 11100001111112233 3588999999887 Q ss_pred HHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHHHHHhhcceeeeeeeeccCccccCcceeeeecc Q lcl|NC_017974. 77 YTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAATFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLA 156 (507) Q Consensus 77 k~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~ 156 (507) +..+.+. +.+.+.-+.. .|..+ +++-+.+|.+.+|++.... | .+..|. T Consensus 80 ~n~l~~~---~~Pnp~~t~~------------------~f~~~---~~d~ll~Gnay~~~~rn~~------G--~~~~L~ 127 (344) T protein:vir:56 80 RNILAST---FIPHPWLSQQ------------------DFSRF---VLDFLVFGNAFLEKRYSTT------G--KVIRLE 127 (344) T ss_pred hhhHHhh---cCCCCCCCHH------------------HHHHH---HHHHHhcCCeEEEEEECCC------C--cEEEEE Confidence 7776542 5554433221 12222 3455678999999987542 3 234566 Q ss_pred ccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHH Q lcl|NC_017974. 157 FRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGC 236 (507) Q Consensus 157 ~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~ 236 (507) +.|...+. ...++.. ++. ....+.....+++.+|+++...-.+..||.+.+..+ T Consensus 128 pl~~~~v~------~~~~~~~----------~~~----------~~~~g~~~~~~~~dIiHir~~~~~~~~~Gls~~~~a 181 (344) T protein:vir:56 128 TSPAKYTR------RGVEEDV----------YWW----------VPSFNEPTAFAPGSVFHLLEPDINQELYGLPEYLSA 181 (344) T ss_pred EeCCceeE------EeecCCE----------EEE----------EecCCeEEEEcCccEEEECCCCCCCCcccccHHHHH Confidence 66654332 1122211 110 011223345666665555433334567899998888 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEee--cCcccccc Q lcl|NC_017974. 237 YRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFML--PSDTKDNA 314 (507) Q Consensus 237 ~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~ii--P~g~~i~~ 314 (507) ......-.....+-..|.+.-+.|=-++++| +..-++++...+.+++.+ ... .|+.-.+++ |.|-+- T Consensus 182 ~~si~l~~~a~~~~~~~f~NGa~pg~Il~~~------d~~ls~e~~~~lk~~~~~-~~g--~~~~r~l~l~~p~g~~~-- 250 (344) T protein:vir:56 182 LNSAWLNESATLFRRKYYENGAHAGYIMYVT------DAVQDRNDIEMLRENMVK-SKG--RNNFKNLFLYAPQGKAD-- 250 (344) T ss_pred HHHHHHHHHHHHHHHHHHhccCCCceEEEec------CCCCCHHHHHHHHHHHHH-hcC--CCCccceEEecCCCCcc-- Confidence 8777766666666666665444444444433 111233333333333332 221 122222333 433211 Q ss_pred cccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhccccccc--ccchhhhhHHHHHHHHH-HHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 315 PQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVG--NDKGGSYNLSESKQTIH-TQFVQRVNEIILEALNEN 391 (507) Q Consensus 315 ~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~--~~~~GS~Al~~vh~~v~-~~~~~aD~~~i~~~ln~~ 391 (507) .++++..+-+..-..|.+.-++-..+|+.+.--.---.+ ...+|+++-.+....++ ..-+.--++.+++ +|.. T Consensus 251 ---G~~~~pis~~~~d~qf~e~k~~s~~eIa~afrVPp~llGi~~~~t~~~~n~eq~~~~f~~~tL~Pl~~~ie~-~n~~ 326 (344) T protein:vir:56 251 ---GIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQLMGGKPENVGSLGDIEKVAKVFVRNELIPLQDRIRE-INGW 326 (344) T ss_pred ---ceeEEEcCCChHHHHHHHHHHhhHHHHHHHhCCCHHHhccCCCCCCccccHHHHHHHHHHHHHHHHHHHHHH-HHhh Confidence 133444444444445677777778888887654432222 22344454333222221 1111222333332 3332 Q ss_pred HHHHHHHhCCCcCCccccceEEecCcCchhH Q lcl|NC_017974. 392 LLPQLLALNDMRLPETEMPYVKAGEIVDVDM 422 (507) Q Consensus 392 li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl 422 (507) |...++. |.-...+..|- T Consensus 327 l~~~~~~-------------F~~y~l~~~~~ 344 (344) T protein:vir:56 327 IGQEVIR-------------FKNYSLDTDNG 344 (344) T ss_pred hcccccc-------------CCCccccccCC Confidence 3222222 22122222221 No 103 >protein:vir:107742 Length: 537 # NCBI annotation: gp28 # Family: family:all:297 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024875;genbank:gi:48697517;genbank:GeneID:2948359 Probab=98.89 E-value=1.9e-08 Score=62.91 Aligned_cols=454 Identities=8% Similarity=-0.068 Sum_probs=186.6 Q ss_pred Cccc-------------ccccccccCccc--------cccccccccccccc-cCchhhhHHH-Hhh-hhcccchhccchh Q lcl|NC_017974. 1 MSRK-------------RNRNRSVKVAKA--------TSEQINVSRMRMSE-QGTFALAKVQ-VDS-ERMKAEEIRWPNL 56 (507) Q Consensus 1 ~~~k-------------~~~~~~~~~~~~--------~~~~~~~~~~~~~~-~g~~~~~~~~-~~~-~~e~~~~lr~~~~ 56 (507) |+-. +..+.......+ .+...+..+..+.. ..+.+..... ... ........++ -. T Consensus 25 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~ 103 (537) T protein:vir:10 25 VGIFGAGDDEKPFTRAQLVHQTMMAIRDHAIAMMPKVDGSHPDMAMDGLDVEGGTFSAYANPNLSEGLVLWYAQQAF-IG 103 (537) T ss_pred cCCCcccchhhHHHHHHhhhhccCCCCCccCcccccccccccchhccccccchhhhhhhccccccchhhhhccccCC-cc Confidence 1110 000000000000 00001111111100 0000000000 000 0000000011 12 Q ss_pred HHHHHHHhhChHHHHHHHHHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHHHHHhhcceeeee Q lcl|NC_017974. 57 IGTVESMKQDATVATGLDMLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAATFNEYGLSVVEK 136 (507) Q Consensus 57 ~~~y~~m~~D~~v~s~l~~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l~a~~~Gfs~~Ei 136 (507) +.++...+..+-+..++++.-.-.++-.|.|..++++. .+.+..+.++..++.+.. |..+...+-.+.+||-+.+=+ T Consensus 104 ~~l~a~Y~~~~l~r~iVd~~A~d~~r~~~~i~~~~~~~-~~~~~~~~l~~~~~~l~~--~~~l~~a~~~~rlyG~~~i~i 180 (537) T protein:vir:10 104 HQMCALIATHWLVNKACSQMPRDAMRKGYKIISDDGNE-LDPKDAKFIDRYDRAFNI--KKHAIQFVRKGRIFGIRIALF 180 (537) T ss_pred HHHHHHHHhCchhhhhhhhhhHHhhcCCceeecCCccc-ccHHHHHHHHHHHHHhhH--HHHHHHHHHhcccccceEEEE Confidence 23333334688999999988777777777777654332 233445667777777642 444444444678899876523 Q ss_pred eeeccCcccc--------CcceeeeeccccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCcc Q lcl|NC_017974. 137 VYTQVTVGEY--------IGKYKVKNLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDV 208 (507) Q Consensus 137 ~w~~~~~~~~--------~g~~~~~~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~ 208 (507) .=........ .++..++.|....+. +....- ...+.++|... .++.+..+. ..+. T Consensus 181 ~v~~~D~~~~~~Pl~~~~i~kg~~k~l~vidp~--------~~~~~~--~~~~~~dp~sp---~fg~P~~y~----v~g~ 243 (537) T protein:vir:10 181 KVDSPDPYYYEKPFNIDGVMPGAYKGIVQIDPY--------WCAPLL--DAQASSNPVSM---HFYEPTYWL----INGK 243 (537) T ss_pred eecCcCCcccccccccccccccceeEEEEechh--------hccccc--chhhhccCCcc---ccCCceeee----ecCe Confidence 2211100000 011112222211111 110000 00011111110 011111111 1223 Q ss_pred ccccccEEEEeeCC------ccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHH Q lcl|NC_017974. 209 IIPINRVMLMNTGG------SSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPE 282 (507) Q Consensus 209 ~iP~~k~i~~~~~~------~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~ 282 (507) .+=+.+++.+.... ...+.+|.++|..||-...--.....--+..+.++... ++++..- ..-..++. T Consensus 244 ~iH~SRli~f~g~~~p~~~~~~~~~~G~Svlq~~~~~l~~~~~t~~~~~~l~~~~~~~--v~k~~~~----~~l~~~~~- 316 (537) T protein:vir:10 244 KYHRSHLAIYINDEVVDFLKPSYIYGGVPLPQQIMERVYAAERTANEGPMLAMTKRQT--VLKVDAA----QVLANKQQ- 316 (537) T ss_pred EecceeEEEecCCCCchhhhcccCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCc--eeeechH----HhhcCHHH- Confidence 44456666654321 23456799999999877644333444445555565544 4443210 01111111 Q ss_pred HHHHHHHHHHHHHhhcCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhccccc--ccccch Q lcl|NC_017974. 283 AEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFIN--VGNDKG 360 (507) Q Consensus 283 ~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLt--s~~~~~ 360 (507) +....+.+...+ ++...+++..+.+ +++.+..+-+| ...+++..-.+||-+.-- .+| .|.+.+ T Consensus 317 ---~~~r~~~~~~~r-~n~g~~~id~e~e------~~e~~~~~lsg----l~~~l~~~~~~iAa~~~I-P~t~L~G~sp~ 381 (537) T protein:vir:10 317 ---FDETMSWWTATR-DNYQVRVVDKDNE------DVVQIDTTLND----LDKVIMNQYQLVCAIART-PAPKMLGTVPT 381 (537) T ss_pred ---HHHHHHHHHhhc-CCcceeEecCCCc------eeEEEeccCCC----HHHHHHHHHHHHHhhhCC-CceeeccCCcc Confidence 222222222222 2334466665422 23333333333 245777777777776422 222 244445 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCccccceEEecCcCchhHHH-------HHHHHHHHH Q lcl|NC_017974. 361 GSYNLSESKQTIHTQFVQRVNEIILEALNENLLPQLLALNDMRLPETEMPYVKAGEIVDVDMEG-------FSKAIQRIG 433 (507) Q Consensus 361 GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~-------~a~~~~~L~ 433 (507) |..|.|+-.....-+.+++-...|...+++ |+.-|+..-+ +++.. -.|.|...-..+-++ .++++++++ T Consensus 382 GlnatGe~D~~~yyd~I~~~Qe~l~p~l~~-l~~ll~~~~~--~~~~~-~~i~f~pL~~~s~kEkAei~~~~a~a~~~~~ 457 (537) T protein:vir:10 382 GFNSTGDYEEASYHEECESTQDDMRPLIDR-HHQLVCRSHL--RKRIR-VKVEFPPMDAPKESERADTFLKKMQAAKLAF 457 (537) T ss_pred ccccchhHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhcC--CCCcc-eEEEeCCCCCCCHHHHHHHHHHHHHHHHHHH Confidence 777778877888888888887777777764 6666665442 22211 245666555444444 456788999 Q ss_pred hcCCccccHHHHHHHHHHcC---------CCCCCCcccccch--hhhcCCCcCCC-CCCcccccCcccccccccCCCCch Q lcl|NC_017974. 434 AVGYLPKTPKVINRVLEVLG---------IDEKIEEDISQEE--LMKLLGEDTSR-AGDGMTKGSSGNGTGKISSTRDNS 501 (507) Q Consensus 434 ~~G~~~~~~~~~~~i~e~~G---------ip~~~~~~~~~~~--~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~d~~ 501 (507) +.|++.++ .+|+.++ |.+..+.+..... .....+....+ ++.+....+...........++.. T Consensus 458 ~~G~i~~~-----Evr~~L~~~~~~g~~~l~~~~~~ed~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 532 (537) T protein:vir:10 458 EMGAVDGV-----DVNEYLRMDPTLGFTSITPAMRPTDAEDIDVDDEGKPVRIIEDQPAPSEMFGATSSGESANDPRDSG 532 (537) T ss_pred HcCCCCHH-----HHHHHHhccCccccccccCCCChhhhhcccCCccCCcCCCCCCCCCccccCCCCccccccCCCccCc Confidence 99987663 4555543 3221111111000 00001111111 111000111111111223345555 Q ss_pred hhhcc Q lcl|NC_017974. 502 AANLD 506 (507) Q Consensus 502 a~~~~ 506 (507) |++.+ T Consensus 533 a~~~~ 537 (537) T protein:vir:10 533 AAFED 537 (537) T ss_pred cccCC Confidence 55555 No 104 >protein:vir:78641 Length: 278 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429941;genbank:gi:156603995;genbank:GeneID:5525387 Probab=98.86 E-value=1.2e-08 Score=63.95 Aligned_cols=273 Identities=7% Similarity=-0.041 Sum_probs=145.9 Q ss_pred HhhCCceeeeCCCCChhHHHHHHHHHHHHHhc--cCCCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeeeecc Q lcl|NC_017974. 80 VEKAFKDFKVIPGESEESKKAAKFVEYCLKNM--EGQTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLA 156 (507) Q Consensus 80 v~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~--~~~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~ 156 (507) |.++++...-.. ...+..+. ..|... ...++.++++.++ +.+.+|-++++++.... |. +..|. T Consensus 1 ia~l~~~~~~~~--~~~~~~l~----~lL~~~PN~~~t~~~f~~~~~~~ll~~Gna~~~i~r~~~------G~--~~~l~ 66 (278) T protein:vir:78 1 MASLPLKMYEDY--KVVNTEVS----DLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIY------HQ--PSKLF 66 (278) T ss_pred CccceeEEEecC--cccccHHH----HHHHhcCCCCCCHHHHHHHHHHHHhhcCCEEEEEEECCC------Cc--EEEEE Confidence 777765543222 11122233 333321 2235677888877 57789999999887532 32 34565 Q ss_pred ccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHH Q lcl|NC_017974. 157 FRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGC 236 (507) Q Consensus 157 ~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~ 236 (507) +.++.++. +..+.++..+... + ....+....+|.+-++++++....+.++|.|.+..| T Consensus 67 ~l~~~~v~----v~~~~~~~~~~y~---------~---------~~~~g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~ 124 (278) T protein:vir:78 67 LLNPDVVE----MLIENQSRELYYS---------I---------HAATGNKLIVHNMDMLHFKHIVASNMVQGISPIDVL 124 (278) T ss_pred EECCceeE----EEEcCCCceEEEE---------E---------EcCCceEEEEccccEEEECCCCCCCCeeeccHHHHH Confidence 65555443 1223343322111 0 011223345677776666665556778999999998 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCC-ceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCccccccc Q lcl|NC_017974. 237 YRAWREKILIENLEVVGATKDMGG-VIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAP 315 (507) Q Consensus 237 ~~~~~fK~~~~~~w~~f~Er~g~p-~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~ 315 (507) ....-....... |.. .+++.+ --+++.| ..-+++...+..+.+.+.. .+....+++|.|+++.- T Consensus 125 ~~~i~~~~~~~~-~~~--~~~~~~~~~i~~~~-------~~l~~e~~~~~~~~~~~~~----~~~g~~~vl~~g~~~~~- 189 (278) T protein:vir:78 125 KNTTDFDNAVRT-FNL--TEMQKPDSFMLKYG-------SNVGKEKRQQVLEDFKQYY----EENGGILFQEPGVEIEP- 189 (278) T ss_pred HHHHHHHHHHHH-HHH--HHhcCCCcEEEEeC-------CCCCHHHHHHHHHHHHHHh----ccCCCceecCCCceEEE- Confidence 766554444333 332 233332 2222222 2233444444444443332 33344677788875432 Q ss_pred ccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 316 QYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQ-TIHTQFVQRVNEIILEALNENLLP 394 (507) Q Consensus 316 ~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~-~v~~~~~~aD~~~i~~~ln~~li~ 394 (507) .+-+....++.+..++..++|++++.-...-.+...+++|+-.+.+. ......+.-.++.|++.||+.|++ T Consensus 190 --------l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~sn~~~~~~~~~~~~l~P~~~~i~~~ln~~L~~ 261 (278) T protein:vir:78 190 --------LPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNRFYLQHTLLPIVKQYEEEFNRKLLT 261 (278) T ss_pred --------ccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCC Confidence 22233344577788899999999887665444444456666544444 555667888999999999988765 Q ss_pred HHHHhCCCcCCccccceEEecCcCc Q lcl|NC_017974. 395 QLLALNDMRLPETEMPYVKAGEIVD 419 (507) Q Consensus 395 ~l~~lN~~~~~~~~~P~~~~~~~~~ 419 (507) +--...+ . +|.|+...= T Consensus 262 ~~e~~~g------~--~~~f~~~~l 278 (278) T protein:vir:78 262 KTDREKI------G--ILNLTLNLI 278 (278) T ss_pred hhHhcCC------c--eEEEecccC Confidence 4211111 1 244442221 No 105 >protein:vir:858 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047117;genbank:gi:9630570;genbank:GeneID:1261758 Probab=98.83 E-value=3.3e-08 Score=61.62 Aligned_cols=366 Identities=11% Similarity=-0.031 Sum_probs=156.7 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTFV 80 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~av 80 (507) |+--.+...--+ .++.. ++ ++. +. -+.+.+. +.-+.|.+|+..+-..| T Consensus 1 M~~f~k~~~~~~-~~~~~-------------~~---~~~--~~--------~~~~~~~-----~~~~~v~~~v~~ia~~i 48 (378) T protein:vir:85 1 MNLFGKVVSFSR-GKLNN-------------DT---QRV--TA--------WQNEAVE-----YTSAFVTNIHNKIANEI 48 (378) T ss_pred Cchhhhhhhhhh-ccccc-------------CC---cce--ee--------eeccchh-----hhhHHHHHHHHHHHHhH Confidence 443222110000 00000 00 000 00 0011111 12355899999999999 Q ss_pred hhCCceeeeCCCC-C--hhHHH-HHHHHHHHHHhccC--CCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeee Q lcl|NC_017974. 81 EKAFKDFKVIPGE-S--EESKK-AAKFVEYCLKNMEG--QTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVK 153 (507) Q Consensus 81 ~~~~~~~~~~~~~-~--~~~~~-~ae~v~~~l~~~~~--~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~ 153 (507) .++++...-.... . ....+ ...-+..+|+.-++ .+..++...++ +.+.+|-+.+.+++... .|.+.- T Consensus 49 A~lp~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~~~-----~g~~~~- 122 (378) T protein:vir:85 49 TKVEFNHVKYKKSDVGSDTLISMAGSDLDEVLNWSYKGEHNSMEFWQKVIKKLLCTRYVDLYPIFDSE-----TGELLD- 122 (378) T ss_pred hhCceeEEEEeccccccccccccccchHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEeecCC-----CceEEE- Confidence 9988754322111 0 00011 11224445542222 23445555544 56678998887776532 222210 Q ss_pred eccccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHH Q lcl|NC_017974. 154 NLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPL 233 (507) Q Consensus 154 ~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll 233 (507) .++..++ ....++..|+|+.. -+.+. +.+.+ T Consensus 123 ---------------~~~~~~~--------------------------------~~~~~~dvih~~~~-~~~~~-~~~~~ 153 (378) T protein:vir:85 123 ---------------LLFANDK--------------------------------KEYKPEELVRLVSP-FYINE-DTSIL 153 (378) T ss_pred ---------------EEecCCC--------------------------------EEEcccceEEEecC-cCccc-hhhHH Confidence 0111111 11223344544422 11111 12333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCcee--EeecCccc Q lcl|NC_017974. 234 VGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSF--FMLPSDTK 311 (507) Q Consensus 234 ~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~--~iiP~g~~ 311 (507) ..+.-. .. .+. +.|.+=-+++.|-. -+.+......+.+.+...+...|..++ ++++.|++ T Consensus 154 ~~a~~~------~~----~~~-~~~~~~g~l~~~~~-------l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~ 215 (378) T protein:vir:85 154 DNALAS------IQ----TKL-EQGKLRGLLKINAF-------LDIDNTQEYREKALATIKNMQEGSSYNGLTPVDNKTE 215 (378) T ss_pred HHHHHH------HH----HHH-hcCCcceEEEeCCc-------CCHHHHHHHHHHHHHHHHHhhcccccccceecCCCce Confidence 322211 11 111 23332223333321 122222333344444444444444443 56677765 Q ss_pred ccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 312 DNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQTIHTQFVQRVNEIILEALNEN 391 (507) Q Consensus 312 i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~ 391 (507) +. ..+-+....++ ...++..++|++++.-..--. +||++. +-.......-+.-.++.|+..||+. T Consensus 216 ~~---------~l~~~~~~~~~-~~~~~~~~~Ia~~fgVPp~~l----~~s~~e-~~~~~f~~~tL~P~~~~ie~~l~~k 280 (378) T protein:vir:85 216 IV---------ELKKDYSVLNK-DEIELIKSELLTGYFMNENIL----LGTATQ-EQQIYFYNSTIIPLLIQLEKELTYK 280 (378) T ss_pred EE---------eccCChhhhhH-HHHHHHHHHHHHHhCCCHHHh----cCCchH-HHHHHHHHHHHHHHHHHHHHHHHhh Confidence 42 22222222233 345788889999876653111 244442 1122344455566677777777776 Q ss_pred HHHHHHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhhcC Q lcl|NC_017974. 392 LLPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMKLL 471 (507) Q Consensus 392 li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~~~ 471 (507) |+.+--.--+.+.....-..|..+.....|+++.++.+.+++..|++.+ +.+|+.+|+|+-+.+|.......-.. T Consensus 281 Ll~~~er~~~~~~~~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~-----NE~R~~lgl~p~~gGD~~~~~~N~~~ 355 (378) T protein:vir:85 281 LISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQ-----NQLLVKMGEQPIEGGDIYIANLNAVA 355 (378) T ss_pred cCChhhhhhhhhccccceeeecchhhhhcCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCCeEeecccccc Confidence 6543110000000001112345556667899999999999999999876 57999999987666553221111110 Q ss_pred C---CcCCCCCCcccccCccccccc Q lcl|NC_017974. 472 G---EDTSRAGDGMTKGSSGNGTGK 493 (507) Q Consensus 472 ~---~~~~~~~~~~~~~~~~~~~~~ 493 (507) . ....+...+ ...++++..+ T Consensus 356 ~~~~~~~~~~~~~--~~~~~e~~n~ 378 (378) T protein:vir:85 356 VKNLSDLQGSRKD--VASTDETNNQ 378 (378) T ss_pred cccchhhcCccCC--CCCCCCCCCC Confidence 0 000011100 0112222222 No 106 >protein:vir:94869 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762515;genbank:gi:115304214;genbank:GeneID:5141182 Probab=98.80 E-value=4.2e-08 Score=61.03 Aligned_cols=365 Identities=10% Similarity=-0.041 Sum_probs=160.7 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccch-hccchhHHHHHHHhhChHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEE-IRWPNLIGTVESMKQDATVATGLDMLYTF 79 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~-lr~~~~~~~y~~m~~D~~v~s~l~~Rk~a 79 (507) |.--.+.... .+........+. .-+.+.+. | .-+.|.+|+..+-.. T Consensus 1 M~if~~~~~~----------------------------~~~~~~~~~~~~~~~~~~~~~-~----~~~~v~~~v~~Ia~~ 47 (378) T protein:vir:94 1 MNLFGKVVSF----------------------------SRGKLNNDTQRVTAWQNEAVE-Y----TSAFVTNIHNKIANE 47 (378) T ss_pred CchhHHhHhh----------------------------hhcccccCcceeeeeecchhh-h----hhHHHHHHHHHHHHh Confidence 2222211100 000000000000 00111111 1 225789999999999 Q ss_pred HhhCCceeeeC-CCC--ChhHHH-HHHHHHHHHHhccC--CCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceee Q lcl|NC_017974. 80 VEKAFKDFKVI-PGE--SEESKK-AAKFVEYCLKNMEG--QTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKV 152 (507) Q Consensus 80 v~~~~~~~~~~-~~~--~~~~~~-~ae~v~~~l~~~~~--~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~ 152 (507) |.++++...-. ..+ .....+ ...-+..+|..-++ .+..++...++ +.+++|.+.+-.+|... .|.+.- T Consensus 48 iA~lp~~~~~~~~~~~~~~~~~~~~~~~l~~lLn~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~~~-----~g~~~~ 122 (378) T protein:vir:94 48 ITKVEFNHVKYKKSDVGSDTLISMAGSDLDEVLNWSSKGERNSMEFWQKVIKKLLTTRYIDLYPIFDSE-----TGELLD 122 (378) T ss_pred HhhCceeeeeecccccccccccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeeCC-----CCcEEE Confidence 99887653211 111 100001 11223445543222 23456666554 46678988776666532 222210 Q ss_pred eeccccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchH Q lcl|NC_017974. 153 KNLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSP 232 (507) Q Consensus 153 ~~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gl 232 (507) + ++..+ +..+|.+.+++++ ..-+.+. +.++ T Consensus 123 --~--------------~~~~~--------------------------------~~~~~~~dvih~~-~~~~~~~-~~~~ 152 (378) T protein:vir:94 123 --L--------------LFAND--------------------------------KKEYKPEELVRLT-SPFYINE-DTSI 152 (378) T ss_pred --E--------------EEecC--------------------------------cEEechhceeeec-CcCCccc-chhH Confidence 0 01111 1223444444333 2222222 3455 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCcee--EeecCcc Q lcl|NC_017974. 233 LVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSF--FMLPSDT 310 (507) Q Consensus 233 l~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~--~iiP~g~ 310 (507) +..+.-..- .+. +.|.+=-+++.|..+ +.+......+++.+...+...|..++ ++++.|+ T Consensus 153 ~~~~~~~~~----------~~~-~~~~~~g~l~~~~~l-------~~~~~~~~~e~~~~~~~~~~~~~n~~~~~vl~~g~ 214 (378) T protein:vir:94 153 LDNALASIQ----------TKL-EQGKLRGLLKINAFL-------DIDNTQEYREKALATIKNMQEGSSYNGLTPVDNKT 214 (378) T ss_pred HHHHHHHHH----------HHH-hhCCcccceeeCCcC-------CHHHHHHHHHHHHHHHHHhhcccccccceeccCCc Confidence 555432111 111 122221233444322 22223334455555555555555554 5666666 Q ss_pred cccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 311 KDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQTIHTQFVQRVNEIILEALNE 390 (507) Q Consensus 311 ~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~ 390 (507) ++. ..+.+....+. ...++..++|++++.-..--. .|+++- +-.......-+.-.++.|+..||+ T Consensus 215 ~~~---------~l~~~~~~~~~-~~~~~~~~~Ia~~fgvPp~~l----~g~~~e-~~~~~f~~~tl~P~~~~ie~~l~~ 279 (378) T protein:vir:94 215 EIV---------ELKKDYSVLNK-DEIDLIKSELLTGYFMNENIL----LGTATQ-EQQIYFYNSTIIPLLIQLEKELTY 279 (378) T ss_pred eEE---------EccCChHHhhH-HHHHHHHHHHHHHhCCCHHHh----cCCchH-HHHHHHHHHHHHHHHHHHHHHHHh Confidence 532 22222222233 345788889999876643111 233331 111223344566677888888887 Q ss_pred HHHHHHHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhhc Q lcl|NC_017974. 391 NLLPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMKL 470 (507) Q Consensus 391 ~li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~~ 470 (507) .|+..--..-+....-.....|.++.....|.++.++++.++++.|++.+ +.+|+.+|+|+-+.++.......-. T Consensus 280 ~Ll~~~e~~~g~~~~~~~~~~f~~~~l~~~d~~~~~e~~~~~~~~G~~t~-----NE~R~~~g~~p~~ggd~~~~~~n~~ 354 (378) T protein:vir:94 280 KLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQ-----NQLLVKMGEQPIEGGDVYIANLNAV 354 (378) T ss_pred hcCChhHhhhhhhhcccceeEeecchhhhcCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCCeeeeccccc Confidence 77644211111001111123455566677899999999999999998776 5799999998765555321111100 Q ss_pred C---CCcCCCCCCcccccCccccccc Q lcl|NC_017974. 471 L---GEDTSRAGDGMTKGSSGNGTGK 493 (507) Q Consensus 471 ~---~~~~~~~~~~~~~~~~~~~~~~ 493 (507) . .....+..++ ..+++++..+ T Consensus 355 ~~~~~~~~~~~~~~--~~~~~e~~n~ 378 (378) T protein:vir:94 355 AVKNLSDLQGNRKD--VTSTDETNNQ 378 (378) T ss_pred chhcchhcccccCC--CCCCCCCCCC Confidence 0 0111111111 1122223222 No 107 >protein:vir:96068 Length: 765 # NCBI annotation: conserved hypothetical protein ORF017 # Family: family:all:297 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294434;genbank:gi:149408331;genbank:GeneID:5237187 Probab=98.79 E-value=4.6e-08 Score=60.79 Aligned_cols=455 Identities=10% Similarity=0.002 Sum_probs=178.3 Q ss_pred Ccccccccc---cccCccc-ccc-ccccccc--ccccc-CchhhhHHHHhh---hhcccchhc-------cchhHHHHHH Q lcl|NC_017974. 1 MSRKRNRNR---SVKVAKA-TSE-QINVSRM--RMSEQ-GTFALAKVQVDS---ERMKAEEIR-------WPNLIGTVES 62 (507) Q Consensus 1 ~~~k~~~~~---~~~~~~~-~~~-~~~~~~~--~~~~~-g~~~~~~~~~~~---~~e~~~~lr-------~~~~~~~y~~ 62 (507) |.|-.+..- ....+.+ ..+ ...+|.. .+... +.-.......+. .....+.+. .--.+.++.. T Consensus 37 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~gyql~al 116 (765) T protein:vir:96 37 MIKLGKIRGWNVEPEKAPVIRSVKDFLEPGLSVAMDSAYGDGPTPAAKAAAGGQNPYVVPTMLQDWYNSQGFIGYQACAI 116 (765) T ss_pred chhHHHHhhcccccccCCCCCCCCcccCcccceeccccccccccchHHHhhhccCccchhhHHHhhhcccCCccHHHHHH Confidence 222211000 0000000 000 0001100 00000 000000001110 000111110 0112333333 Q ss_pred HhhChHHHHHHHHHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHHHHHhhcceeeeeeeeccC Q lcl|NC_017974. 63 MKQDATVATGLDMLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAATFNEYGLSVVEKVYTQVT 142 (507) Q Consensus 63 m~~D~~v~s~l~~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l~a~~~Gfs~~Ei~w~~~~ 142 (507) .+...-+..++++.-.-.++-.|++..++ ++.+.+..+.++..++.+.. |..+...+-.+.+||-+.+=+.-.-.. T Consensus 117 Y~~~~l~rkiVd~pAeDa~R~g~~I~~~~--~e~~~~~~~~l~~~~~rl~v--~~~l~ea~~~~RlyGga~i~i~i~~~D 192 (765) T protein:vir:96 117 ISQHWLVDKACSMSGEDAARNGWELKSDG--RKLSDEQSALIARRDMEFRV--KDNLVELNRFKNVFGVRIALFVVESDD 192 (765) T ss_pred HHhCchhhhhhhcchHHhhcCCceeecCc--cccCHHHHHHHHHHHHHhhH--HHHHHHHHHHhhhceeeEEEEEecccC Confidence 34588888888888666666666665432 22334555667777777641 444444444588998665322211000 Q ss_pred ccccC--------cceeeeeccccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCcccccccc Q lcl|NC_017974. 143 VGEYI--------GKYKVKNLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINR 214 (507) Q Consensus 143 ~~~~~--------g~~~~~~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k 214 (507) ...++ ++..++.|... .+++..... .....++|... .++....+.. .+..|=+.| T Consensus 193 ~~~l~~PL~~~~I~kg~~kgl~vl--------dp~~~~~~~--v~e~~~Dp~sp---~fg~P~~y~i----~g~~IH~SR 255 (765) T protein:vir:96 193 PDYYEKPFNPDGIAPGSYKGISQI--------DPYWAMPQL--TAESTADPSAE---HFYEPDFWII----SGKKYHRSH 255 (765) T ss_pred cchhhccccccccccceeeEEEEe--------chhhccccc--chhcccccccc---ccCcceeeee----cCceeccce Confidence 00000 00111111111 111111100 00011111111 0111111111 122344556 Q ss_pred EEEEeeCC------ccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHH Q lcl|NC_017974. 215 VMLMNTGG------SSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRG 288 (507) Q Consensus 215 ~i~~~~~~------~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a 288 (507) ++.+.... ...+.+|.++|..||-...--.....-=++.+.++... ++++... +. ...++ .+.+. T Consensus 256 li~~~g~~lpd~lk~~~~~~G~Svlq~~yd~I~~~~~t~~~~a~Ll~k~~~~--v~k~~~~--~~--l~~~~---~l~~r 326 (765) T protein:vir:96 256 LVVVRGPQPPDILKPTYIFGGIPLTQRIYERVYAAERTANEAPLLAMSKRTS--TIHVDVE--KA--IANED---AFNAR 326 (765) T ss_pred EEEecCCCchhhhccccCccCccHHHHHHHHHHHHHHHHHHHHHHHHHhccc--eeeechH--hh--hccHH---HHHHH Confidence 66653222 34456799999999877654444444445566665543 4443211 11 11122 22233 Q ss_pred HHHHHHHhhcCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhccc-ccccccchhhhhHHH Q lcl|NC_017974. 289 LMSDAANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGF-INVGNDKGGSYNLSE 367 (507) Q Consensus 289 ~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqt-Lts~~~~~GS~Al~~ 367 (507) +..+. ..+ ++...++|..+-+++ .+..+-+| ...+++..-.+||-+.--.. .-.+++-.|.+|.|+ T Consensus 327 ~~~~~-~~r-~n~g~~~id~ee~~e-------~~s~~lsg----l~d~l~~~~~~iAaas~IP~t~LfGqsp~GlnATGe 393 (765) T protein:vir:96 327 LAFWI-ANR-DNHGVKVIGIDETME-------QFDTNLSD----FDSVIMNQYQLVAAIAKTPATKLLGTSPKGFNATGE 393 (765) T ss_pred HHHHH-Hhc-CCceeEEecCCccee-------EEecccCC----HHHHHHHHHHHHHhhhCCCeeeeccCCcccccCcch Confidence 33222 222 344566676654433 33333332 34577777777776653322 112333467777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCccccceEEecCcCchhH-------HHHHHHHHHHHhcCCccc Q lcl|NC_017974. 368 SKQTIHTQFVQRVNEIILEALNENLLPQLLALNDMRLPETEMPYVKAGEIVDVDM-------EGFSKAIQRIGAVGYLPK 440 (507) Q Consensus 368 vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl-------~~~a~~~~~L~~~G~~~~ 440 (507) --..+.-+.+++.......-+.+.|+..|+... ..+.. =.|.|...-..+- +..|++++++++.|++.+ T Consensus 394 ~D~~nYyD~I~s~Qe~~l~p~le~L~~li~~s~--~i~~d--~~i~FnpL~~~sekEkAei~~k~Aea~~~~~~~Gvis~ 469 (765) T protein:vir:96 394 HETISYHEELESIQEHIFDPLLERHYLLLAKSE--SIDVQ--LEIVWNPVDSTTSQQQAELNNKKAATDEIYINSGVVSP 469 (765) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc--CCCCc--ceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCH Confidence 777788888888876654444455777776542 12222 1456655443332 345677899999998766 Q ss_pred cHHHHHHHHHHc------CCCCCCCcccc-----cchhhhcCCCcCCCC----CCcc---cccCcccc-----------c Q lcl|NC_017974. 441 TPKVINRVLEVL------GIDEKIEEDIS-----QEELMKLLGEDTSRA----GDGM---TKGSSGNG-----------T 491 (507) Q Consensus 441 ~~~~~~~i~e~~------Gip~~~~~~~~-----~~~~~~~~~~~~~~~----~~~~---~~~~~~~~-----------~ 491 (507) + .+|+++ |+....+++.. .++........+... +++. +..++.++ . T Consensus 470 d-----EvR~~L~~~~~~g~~~l~d~~~e~~~~~~pe~~~~~~~~~~~~~~~~~e~~~~~a~p~~~eg~~~~~~~~p~~~ 544 (765) T protein:vir:96 470 D-----EVRERLRDDPRSGYNRLTDDQAETEPGMSPENLAELEKAGAQSAKAKGEAERAEAQAGAVEGAGDPVPAAPRGT 544 (765) T ss_pred H-----HHHHHHhccccCCCCCCCccccccccCCCccccccccCCCcccccccCccccccCCCCccCCCCcccccCCccc Confidence 3 455554 33211111110 000000000000000 0000 00000001 0 Q ss_pred ccccCCCCchhhhccC Q lcl|NC_017974. 492 GKISSTRDNSAANLDN 507 (507) Q Consensus 492 ~~~~~~~d~~a~~~~~ 507 (507) .+.+.....++...+| T Consensus 545 ~p~~~~~~~~~g~~~~ 560 (765) T protein:vir:96 545 KPLAKAAEEGAGEAAT 560 (765) T ss_pred CCccccccccCccccC Confidence 0000111111111222 No 108 >protein:vir:98567 Length: 340 # NCBI annotation: gp1 # Family: family:all:196 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958056;genbank:gi:41057353;genbank:GeneID:2744238 Probab=98.79 E-value=4.7e-08 Score=60.74 Aligned_cols=334 Identities=12% Similarity=0.116 Sum_probs=142.1 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhh---hcccchhccchhHHHHHHH-hhChHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSE---RMKAEEIRWPNLIGTVESM-KQDATVATGLDML 76 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~---~e~~~~lr~~~~~~~y~~m-~~D~~v~s~l~~R 76 (507) |+|||+++++....++.... ..+..++ +...++...+..+ ..++....-|=.+.-..++ ...+|..|+|.-+ T Consensus 1 m~~~~~~~~~~~~~~~~~~~---~~~~~~~-p~~~~~~~~~~~~~~~~~~~~~~~pp~~~~~la~l~~a~~~h~s~i~~k 76 (340) T protein:vir:98 1 MSKRKPRKAVAMTASAPQKM---EAFTFGE-PVPVLDKRDILDYVECISNGKWYEPPVSFSGLAKSLRSAVHHSSPIYVK 76 (340) T ss_pred CCCCCCCccccccccCccce---eEEEcCC-ceeecCcchhhhhhhhhhcCceecCCCCHHHHHHHHHhccccchhhhhh Confidence 99877666554444332211 1111211 0000111011111 1111111111111222333 3589999999988 Q ss_pred HHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHHHHHhhcceeeeeeeeccCccccCcceeeeecc Q lcl|NC_017974. 77 YTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAATFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLA 156 (507) Q Consensus 77 k~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~ 156 (507) +..+.+. +.+.+.-+.. .|..++ ++-+.+|-+.+|+++... |. +..|. T Consensus 77 ~n~l~~~---~~Pn~~lt~~------------------~f~~~~---~d~ll~Gnay~~~~rn~~------G~--~~~L~ 124 (340) T protein:vir:98 77 RNVLAST---YIPHPLLSRQ------------------DFSRFA---LDYLVFGNAFLEQRHSVT------GQ--LIKLL 124 (340) T ss_pred hhHHhhc---cCCCCCCCHH------------------HHHHHH---HHHHhcCCeEEEEEECCC------Cc--EEEEE Confidence 7777552 4544432211 122333 355668999999987543 22 34455 Q ss_pred ccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHH Q lcl|NC_017974. 157 FRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGC 236 (507) Q Consensus 157 ~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~ 236 (507) +.|..++. ...++.. ++. ....+....++++.+++++...-.+..||.+.+..+ T Consensus 125 pl~~~~vr------~~~~~~~----------~~~----------~~~~~~~~~~~~~eViHir~~~~~~~~~Gls~~~~a 178 (340) T protein:vir:98 125 TSPAKYTR------RGVDDSV----------FWF----------VENFTQPHEFAPDTVFHLLEPDINQEIYGLPEYLSA 178 (340) T ss_pred EeCCceEE------EcccCcE----------EEE----------EecCCeEEEEccccEEEEcCCCCCCCcccccHHHHH Confidence 55543221 1222211 111 011123345666676555543334567999988888 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccc Q lcl|NC_017974. 237 YRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQ 316 (507) Q Consensus 237 ~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~ 316 (507) .-....-.....+-..|.+--+.|=-+++++ +..-++++...+.+++ +............+..|.|.+- T Consensus 179 ~~si~l~~aa~~~~~~~f~NGa~pg~il~~~------~~~ls~e~~~~lk~~~-~~~~G~~n~~~~~vl~~~g~~~---- 247 (340) T protein:vir:98 179 LNSAWLNESATLFRRKYYQNGAHAGYIMYVT------DPAQSATDVESLRDAM-RNSKGLGNFKNLFFYSPNGKPD---- 247 (340) T ss_pred HHHHHHHHHHHHHHHHHHhccCCCceEEEec------CCCCCHHHHHHHHHHH-HHhcCccccCceeEecCCCCcc---- Confidence 7766665555555566665433344444433 1122233333333333 2221111111111223333211 Q ss_pred cceeeeeccCCCcchhHHHHHHHHHHHHHHHHhccccccc--ccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 317 YSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVG--NDKGGSYNLSESKQTIHTQFVQRVNEIILEALNENLLP 394 (507) Q Consensus 317 ~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~--~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~li~ 394 (507) .++++..+-+..-.+|.+.-++-..+|+.+..-..--.+ .+.+|+++-.+.... ..+..+|.- ++. T Consensus 248 -g~~~~pls~~~~d~qf~e~k~~~~~eIa~a~~VPp~llGi~~~~t~~~sn~e~~~~----------~f~~~~l~P-l~~ 315 (340) T protein:vir:98 248 -GIKIVPLSEVATKDDFFNIKKASAADLMDAHRVPFQLMGGKPENIGSLGDVEKVAK----------VFVRNELSP-LQD 315 (340) T ss_pred -ceEEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHHHhcccCCCCCccccHHHHHH----------HHHHHHHHH-HHH Confidence 233444444444445777777778888887654432222 123444443322222 233333321 222 Q ss_pred HHHHhCCCcCCccccceEEecCcCchhHH Q lcl|NC_017974. 395 QLLALNDMRLPETEMPYVKAGEIVDVDME 423 (507) Q Consensus 395 ~l~~lN~~~~~~~~~P~~~~~~~~~~dl~ 423 (507) .+.++|-.-+ ... +.|++..-.+.+ T Consensus 316 ~iee~n~~L~--~e~--~rF~~~~l~~~d 340 (340) T protein:vir:98 316 RFREVNDWLG--MEV--IRFKEYTLDNPE 340 (340) T ss_pred HHHHHHhccc--ccc--cccCccccccCC Confidence 2333442111 122 333322222211 No 109 >protein:vir:100328 Length: 346 # NCBI annotation: capsid portal protein Q # Family: family:all:196 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655469;genbank:gi:109289937;genbank:GeneID:4157371 Probab=98.77 E-value=5.5e-08 Score=60.36 Aligned_cols=336 Identities=8% Similarity=0.071 Sum_probs=146.9 Q ss_pred Cccccccccccc-CccccccccccccccccccCchhhhHHHHh----hhhcccchhccchhHHHHHHH-hhChHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVK-VAKATSEQINVSRMRMSEQGTFALAKVQVD----SERMKAEEIRWPNLIGTVESM-KQDATVATGLD 74 (507) Q Consensus 1 ~~~k~~~~~~~~-~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~----~~~e~~~~lr~~~~~~~y~~m-~~D~~v~s~l~ 74 (507) |+||+++.++.. .++++... .-+..++ +...++...+. ...+.+.-...|=...-..++ ...+|.+++|. T Consensus 1 m~~~~~~~~~~~~~~~~~~~~---~~~~~~~-p~~~~~~~~~~~~~~~~~~~~~~~~pp~~~~~la~l~~~~~~h~~~i~ 76 (346) T protein:vir:10 1 MKKQLRKNLTQNDRLQPQAQT---EIFSFGD-PIPVLDRADILNYLECSAMYEKWYNPPMSFDGLAKSLRSSTHHESAII 76 (346) T ss_pred CCcccCCCCCcccccccccCe---EEEecCC-cceecCchhHHHHHHHhhcCCceEecCCCHHHHHHHHHhhhhcchhhh Confidence 999987766443 22222111 1122221 11111111111 111222111111111222333 34788888988 Q ss_pred HHHHHHhhCCceee-eCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHHHHHhhcceeeeeeeeccCccccCcceeee Q lcl|NC_017974. 75 MLYTFVEKAFKDFK-VIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAATFNEYGLSVVEKVYTQVTVGEYIGKYKVK 153 (507) Q Consensus 75 ~Rk~av~~~~~~~~-~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~ 153 (507) .++..+..+ +. |.+.-+ ..++..-+++-+.+|-+.+|+++... |. +. T Consensus 77 ~k~n~l~~l---~~~Pn~~~t---------------------~~~f~~~~~d~ll~Gnay~~i~r~~~------G~--~~ 124 (346) T protein:vir:10 77 TKANILLST---CEVDSRYLS---------------------RRDLSSFVKDYLVFGNAYFEVVRNRL------GQ--VQ 124 (346) T ss_pred hhhhhHHHH---HhCCCCCCC---------------------HHHHHHHHHHHHhcCCeEEEEEEcCC------Cc--EE Confidence 877777553 22 222211 22232334455679999999987543 22 33 Q ss_pred eccccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHH Q lcl|NC_017974. 154 NLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPL 233 (507) Q Consensus 154 ~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll 233 (507) .|.+.|+.++.. ..+.|+ +.... ....+....+|+..+|+++...-.+..||.+.+ T Consensus 125 ~L~pl~~~~v~~----~~~~~~--~~~~~------------------~~~~g~~~~~~~~dIih~r~~~~~~~~~G~~~~ 180 (346) T protein:vir:10 125 RIESPLAKYVRK----GLEAGQ--FYYVP------------------QRFDHQEHEFAKGSIYHLLEPDINQDIYGLPQY 180 (346) T ss_pred EEEEecCCceEE----EEcCCe--EEEEE------------------EccCCeEEEEecccEEEecCCCCCCCeeeccHH Confidence 566666544321 111111 10000 001122345677666655544445678999998 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCC-ceeEeecCcccc Q lcl|NC_017974. 234 VGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGE-QSFFMLPSDTKD 312 (507) Q Consensus 234 ~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~-~a~~iiP~g~~i 312 (507) ..+.-....-.....+-..+...-+.|--+++++ +..-++++... +++..+.... .|+ ..-++++.|.+- T Consensus 181 ~~a~~si~l~~~a~~~~~~~~~NG~~~~~il~~~------d~~l~~e~~~~-i~~~~~~~~g--~~n~~~~~vl~~~~~~ 251 (346) T protein:vir:10 181 LSALQSAWLNESATLFRRKYFLNGAHAGFVFYMS------DASQKQEDVEN-IRQQLKQSKG--VGNFKNLFVHAPNGKK 251 (346) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeC------CCCCCHHHHHH-HHHHHHHhcC--ccccCceeEecCCCCc Confidence 8888777777777777777776544444344432 11122333333 3332232221 122 122444444332 Q ss_pred cccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhccccccc--ccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 313 NAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVG--NDKGGSYNLSESKQTIHTQFVQRVNEIILEALNE 390 (507) Q Consensus 313 ~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~--~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~ 390 (507) + .++++..+-+..-..|.+.-++-..+|+.+..-..--.+ .+.+|+++-.+.... ..+..+|.- T Consensus 252 ~----gi~~~pis~~~~d~qf~e~k~~~~~~I~~af~VPp~llG~~~~~~~~~s~~e~~~~----------~f~~~~l~P 317 (346) T protein:vir:10 252 D----GIQIIPIADVSAKDEFFNIKNVSRDDVLAAHRVPPQLMGIIPNNTGGFGNVADAAE----------VFFITEIEP 317 (346) T ss_pred c----ceeEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHHHHH----------HHHHHHHHH Confidence 1 133444444444445667777888889887764432222 223344543332222 222333321 Q ss_pred HHHHHHHHhCCCcCCccccceEEecCcCchhHHH Q lcl|NC_017974. 391 NLLPQLLALNDMRLPETEMPYVKAGEIVDVDMEG 424 (507) Q Consensus 391 ~li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~ 424 (507) ++..+.++|..-+ . . .+.|+..+-.+.++ T Consensus 318 -~~~~iee~n~~L~-~-e--~i~F~~~~ll~~~~ 346 (346) T protein:vir:10 318 -LQERLKEFNQWLG-Q-E--VIKFKPSKLLQRTQ 346 (346) T ss_pred -HHHHHHHHHhhcc-c-c--eeeechhhhcccCC Confidence 2222333331111 1 1 23443222111111 No 110 >protein:vir:103971 Length: 376 # NCBI annotation: pbsx family phage portal protein # Family: family:all:196 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293752;genbank:gi:72537722;genbank:GeneID:3608098 Probab=98.70 E-value=4.3e-08 Score=60.97 Aligned_cols=338 Identities=13% Similarity=0.127 Sum_probs=143.5 Q ss_pred CcccccccccccCcccccccccc-----ccccccccCchhhhHHHHhhhhc---ccchhccchhHHHHHHH-hhChHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINV-----SRMRMSEQGTFALAKVQVDSERM---KAEEIRWPNLIGTVESM-KQDATVAT 71 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~-----~~~~~~~~g~~~~~~~~~~~~~e---~~~~lr~~~~~~~y~~m-~~D~~v~s 71 (507) |+|+|.++++...+.+....... .-+..++ +...++...+..+.+ ++....-|=...-.-++ ...+|..+ T Consensus 26 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~fg~-p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~La~~~~~~~~h~s 104 (376) T protein:vir:10 26 MSKRRSRAPRTFAAAPNPSAGSAAPARAEVFTFDD-PTPVMNRAEILDYVECWSNGEWFEPPVSFAGLAKSFRASTHHSS 104 (376) T ss_pred chhccCCCcccchhhhhHhhhccCcceeEEEEcCC-ceeccCcchhhhhhhhhhcCceecCCCCHHHHHHHHhhhHHhhh Confidence 99998887766555443211100 0111111 000111111111111 11100001111111233 35889999 Q ss_pred HHHHHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHHHHHhhcceeeeeeeeccCccccCccee Q lcl|NC_017974. 72 GLDMLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAATFNEYGLSVVEKVYTQVTVGEYIGKYK 151 (507) Q Consensus 72 ~l~~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l~a~~~Gfs~~Ei~w~~~~~~~~~g~~~ 151 (507) +|..++.-+.+ .+.+.+.-+.. .|..+ +++.+.+|.+.+|++.... | . T Consensus 105 ~l~~k~n~l~~---~~~Pnp~lT~~------------------~f~~~---v~d~ll~Gnay~~~~rn~~------G--~ 152 (376) T protein:vir:10 105 ALFFKANVLAS---TFRPHRWLSRH------------------AFERW---ALDFLTFGNGYLERRRNMV------G--G 152 (376) T ss_pred hHHHHhHHHHh---ccCCCCCCCHH------------------HHHHH---HHHHHhcCCeEEEEEECCC------C--C Confidence 99988777654 35554432211 12233 3455678999999987543 2 2 Q ss_pred eeeccccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccch Q lcl|NC_017974. 152 VKNLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVS 231 (507) Q Consensus 152 ~~~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~g 231 (507) +..|.+.|+.++.. ..|+.....+ ........++.+.+++++...-.+..||.+ T Consensus 153 ~~~L~pl~~~~vr~------~~d~~~~~~~--------------------~~~~~~~~~~~~eViHir~~~~~~~~yGls 206 (376) T protein:vir:10 153 TLRLEPALAKYVRR------KADFNGFVYV--------------------NGWQERHEFEPDSVFQLVRPDINQEVYGLP 206 (376) T ss_pred EEEEEEeCCcceEE------EeeCCeEEEE--------------------EcCCeEEEEccccEEEecCCCCCCCccccc Confidence 44566666543321 1121111100 011223345666666555444456788998 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCccc Q lcl|NC_017974. 232 PLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTK 311 (507) Q Consensus 232 ll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~ 311 (507) .+..+.-....-.....+-..|.+--+.|=-+++++ +..-++++...+.+++. ......+.....+..|.|.+ T Consensus 207 ~~~~a~~si~l~~aa~~f~~~~f~NGa~pggIl~~~------d~~l~~e~~~~lr~~~~-~~~G~~N~~~~~vl~~~g~~ 279 (376) T protein:vir:10 207 EYLSSLHSAWLNESSTLFRRKYYENGSHAGFILYMT------DAAQKQDDVDNMRDALK-NAKGPGNFRNVFMYAPGGKK 279 (376) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEec------CCCCCHHHHHHHHHHHH-HhcCccccCceeEecCCCCc Confidence 888777666655555555455554333333334332 11223333333333332 22111111111122333321 Q ss_pred ccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhccccccc--ccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 312 DNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVG--NDKGGSYNLSESKQTIHTQFVQRVNEIILEALN 389 (507) Q Consensus 312 i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~--~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln 389 (507) - .++++..+-+..-.+|.+.-++-..+|+.+.--.---.| +..+|+++-.+.... ..+..+|. T Consensus 280 ~-----Gi~~~pls~~~~d~qf~e~k~~~~~eIa~af~VPp~llGi~~~~t~~~sn~eq~~~----------~f~~~~L~ 344 (376) T protein:vir:10 280 D-----GIQLIPVSEVAAKDEFFNIKNVTRDDLLAAHRVPPQLLGIVPSNSGGFGTPDTAAR----------VFGRNEIR 344 (376) T ss_pred c-----ceEEEEccCCHHHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHHHHH----------HHHHHHHH Confidence 1 123444444444456777778888889887654432222 122344443332222 22222222 Q ss_pred HHHHHHHHHhCCCcCCccccceEEecC--cCchhHHH Q lcl|NC_017974. 390 ENLLPQLLALNDMRLPETEMPYVKAGE--IVDVDMEG 424 (507) Q Consensus 390 ~~li~~l~~lN~~~~~~~~~P~~~~~~--~~~~dl~~ 424 (507) - ++..+-++|.. . +..+ +.|+. ...-|.++ T Consensus 345 P-l~~~ieeln~~-L-~~~~--~~F~~~~Llr~d~ka 376 (376) T protein:vir:10 345 P-LQARFAELNDW-L-GEEV--VRFDDYEIPPAPVAA 376 (376) T ss_pred H-HHHHHHHHHhh-c-cccc--cccChhHhhcccccC Confidence 1 22222233311 1 1222 33332 22222222 No 111 >protein:vir:79207 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111036;genbank:gi:134288763;genbank:GeneID:4960726 Probab=98.69 E-value=8.5e-08 Score=59.36 Aligned_cols=338 Identities=13% Similarity=0.127 Sum_probs=145.2 Q ss_pred CcccccccccccCccccccccc-----cccccccccCchhhhHHHHhhhhc---ccchhccchhHH-HHHHHhhChHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQIN-----VSRMRMSEQGTFALAKVQVDSERM---KAEEIRWPNLIG-TVESMKQDATVAT 71 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~-----~~~~~~~~~g~~~~~~~~~~~~~e---~~~~lr~~~~~~-~y~~m~~D~~v~s 71 (507) |+|||.++++...+.+...... ...+..++ +...++...+..+.+ ++.--.-|=.+. +-+-....+|..+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~~~~~~h~~ 79 (351) T protein:vir:79 1 MSKRRSRAPRTFAAAPNPSAGSAAPARAEVFTFDD-PTPVMNRAEILDYVECWSNGEWFEPPVSFAGLAKSFRASTHHSS 79 (351) T ss_pred CCCCCCCCCCCCCCCCchhhhhcccceeEEEEcCC-ceeecCcchhhhhhhhhhcCceecCCCCHHHHHHHHhhhHhhhh Confidence 9999888776655543321110 01111111 000111111111111 110000011111 1222345899999 Q ss_pred HHHHHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHHHHHhhcceeeeeeeeccCccccCccee Q lcl|NC_017974. 72 GLDMLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAATFNEYGLSVVEKVYTQVTVGEYIGKYK 151 (507) Q Consensus 72 ~l~~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l~a~~~Gfs~~Ei~w~~~~~~~~~g~~~ 151 (507) +|..++..+.+ .+.+.+.-+.. .|..+ +++-+.+|-+.+|++.... | . T Consensus 80 ~l~~k~n~l~~---~~~Pnp~~t~~------------------~f~~~---v~d~ll~Gnay~~~~r~~~------G--~ 127 (351) T protein:vir:79 80 ALFFKANVLAS---TFRPHRWLSRH------------------AFERW---ALDFLTFGNGYLERRRNMV------G--G 127 (351) T ss_pred hhhhhhhHHhh---cccCCCCCCHH------------------HHHHH---HHHHHhcCCeEEEEEECCC------C--C Confidence 99887777755 24554432211 12223 3455678999999987542 2 2 Q ss_pred eeeccccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccch Q lcl|NC_017974. 152 VKNLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVS 231 (507) Q Consensus 152 ~~~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~g 231 (507) +..|.+.++.++.. ..++.+.... ...+....++++.+|+++...-.+..||.+ T Consensus 128 ~~~L~~l~~~~v~~------~~~~~~~~~~--------------------~~~g~~~~~~~~eIihir~~~~~~~~yGl~ 181 (351) T protein:vir:79 128 TLRLEPALAKYVRR------KADFSGFVYV--------------------NGWQERHEFEPDSVFQLVRPDINQEVYGLP 181 (351) T ss_pred EEEEEEeCCcceee------eecCCeEEEE--------------------ecCceEEEEcCccEEEeCCCCCCCCccccc Confidence 34566666544321 1111111000 011223345666655444433356788998 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCccc Q lcl|NC_017974. 232 PLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTK 311 (507) Q Consensus 232 ll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~ 311 (507) .+..+......-.....+-..|.+--+.|=-+++++ +..-++++...+.+++ +............+.+|.|.+ T Consensus 182 ~~~~a~~si~l~~~a~~~~~~~f~NGa~pg~il~~~------~~~ls~e~~~~lk~~~-~~~~G~~N~~~~~v~~~~g~~ 254 (351) T protein:vir:79 182 EYLSSLHSAWLNESSTLFRRKYYENGSHAGFILYMT------DAAQKQDDVDNMRDAL-KNAKGPGNFRNVFMYAPGGKK 254 (351) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEec------CCCCCHHHHHHHHHHH-HHhcCccccCceeEecCCCCc Confidence 888887776665555555555554333333333332 1122333333333333 222211111112223343332 Q ss_pred ccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhccccccc--ccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 312 DNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVG--NDKGGSYNLSESKQTIHTQFVQRVNEIILEALN 389 (507) Q Consensus 312 i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~--~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln 389 (507) . .++++..+-+..-..|.+.-++-..+|+.+..-.---.| +..+|+++-.+.... ..+..+|. T Consensus 255 ~-----gi~~~pl~~~~~d~ef~e~k~~s~~eI~~a~~VPp~llGi~~~~t~~~~n~e~~~~----------~f~~~~l~ 319 (351) T protein:vir:79 255 D-----GIQLIPVSEVAAKDEFFNIKNVTRDDLLAAHRVPPQLLGIVPSNSGGFGTPDTAAR----------VFGRNEIR 319 (351) T ss_pred c-----ceEEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHHHHH----------HHHHHHHH Confidence 1 133444444444445777778888889987644332222 122343443222222 23333332 Q ss_pred HHHHHHHHHhCCCcCCccccceEEecCcC--chhHHH Q lcl|NC_017974. 390 ENLLPQLLALNDMRLPETEMPYVKAGEIV--DVDMEG 424 (507) Q Consensus 390 ~~li~~l~~lN~~~~~~~~~P~~~~~~~~--~~dl~~ 424 (507) -++..|-++|.. . +..+ +.|+..+ .-|.++ T Consensus 320 -Pl~~~ie~ln~~-l-g~~~--~~F~~~~llr~d~~a 351 (351) T protein:vir:79 320 -PLQARFAELNDW-L-GDEV--VTFDDYEIPPAPVAA 351 (351) T ss_pred -HHHHHHHHHHhh-c-Ccce--eeeChhhhccccccC Confidence 122333344421 1 2222 4444322 222221 No 112 >protein:vir:6058 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878199;genbank:gi:33438898;genbank:GeneID:1457733 Probab=98.68 E-value=8e-08 Score=59.50 Aligned_cols=335 Identities=13% Similarity=0.126 Sum_probs=147.4 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhh---hccc----chhccchhHHHHHHHhhChHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSE---RMKA----EEIRWPNLIGTVESMKQDATVATGL 73 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~---~e~~----~~lr~~~~~~~y~~m~~D~~v~s~l 73 (507) |+|||.++..+...+.+..+....-+..++ +...++......+ ..++ |...+..+ -+-.+..+|..|+| T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~f~~-p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~l---a~~~~a~~~h~~~i 76 (344) T protein:vir:60 1 MSKKKGKTLQPAAKKMTASAPKMEAFTFGE-PVPVLDRRDILDYVECISNGRWYEPPISFTGL---AKSLRAAVHHSSPI 76 (344) T ss_pred CCcccCCCCCchHHhhcCCcCcEEEEEcCC-ceeecCCcchhHHHHhhhcCccccCCCCHHHH---HHHHHhhhhhccch Confidence 999988776554333322221111122221 0000111111111 1111 11111112 22233588889998 Q ss_pred HHHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHHHHHhhcceeeeeeeeccCccccCcceeee Q lcl|NC_017974. 74 DMLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAATFNEYGLSVVEKVYTQVTVGEYIGKYKVK 153 (507) Q Consensus 74 ~~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~ 153 (507) .-++..+.+. +++.+.-+.. .|..++ ++-+.+|-+.+|++.... |. +. T Consensus 77 ~~k~n~l~~~---~~Pn~~~t~~------------------~f~~~~---~d~ll~Gnay~~i~rn~~------G~--~~ 124 (344) T protein:vir:60 77 YVKRNILAST---FIPHPWLSQQ------------------DFSRFV---LDFLVFGNAFLEKRYSTT------GK--VI 124 (344) T ss_pred hhhhhHHHhh---ccCCCCCCHH------------------HHHHHH---HHHHhcCCeEEEEEECCC------Cc--EE Confidence 8877777552 4554432221 133233 345568999999987542 32 34 Q ss_pred eccccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHH Q lcl|NC_017974. 154 NLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPL 233 (507) Q Consensus 154 ~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll 233 (507) .|.+.|..++. +..++... +.. ...+....+|++.+++++...-.+..||.+.+ T Consensus 125 ~L~~l~~~~vr------~~~~~~~~----------~~v----------~~~~~~~~~~~~eIiHir~~~~~~~~yGlsp~ 178 (344) T protein:vir:60 125 RLETSPAKYTR------RGVEEDVY----------WWV----------PSFNEPTAFAPGSVFHLLEPDINQELYGLPEY 178 (344) T ss_pred EEEEcCcceEE------EeecCCeE----------EEE----------ccCCeEEEEcCccEEEEcCCCCCCCcccccHH Confidence 56666654332 11222110 000 01123345666666554433334567899998 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCccccc Q lcl|NC_017974. 234 VGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDN 313 (507) Q Consensus 234 ~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~ 313 (507) ..+.-....-.....+-..|.+.-+.|=-++++| +..-++++...+.+++.+ .....+|....+.+|.|-+- T Consensus 179 ~~a~~si~l~~~a~~~~~~~f~NG~~pg~il~~~------~~~ls~e~~~~ik~~~~~-~~g~~~~r~~~l~~p~g~~~- 250 (344) T protein:vir:60 179 LSALNSAWLNESATLFRRKYYENGAHAGYIMYVT------DAVQDRNDIEMLRENMVK-SKGRNNFKNLFLYAPQGKAD- 250 (344) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCCceEEEec------CcCCCHHHHHHHHHHHHH-hcCCCCCcceEEecCCCCcc- Confidence 8887776665555555556665434444444433 112233333333333322 22111111122233433211 Q ss_pred ccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhccccccc--ccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 314 APQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVG--NDKGGSYNLSESKQTIHTQFVQRVNEIILEALNEN 391 (507) Q Consensus 314 ~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~--~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~ 391 (507) .++++..+-+..-..|.+.-++-..+|+.+.--.---.+ .+.+|+++-.+-... ..+..+|. - T Consensus 251 ----g~~~~pis~~~~d~qf~e~k~~~~~eIa~af~VPp~llGi~~~~t~~~~n~e~~~~----------~f~~~~L~-P 315 (344) T protein:vir:60 251 ----GIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQLMGGKPENVGSLGDIEKVAK----------VFVRNELI-P 315 (344) T ss_pred ----ceeEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHHHhcccCCCCCccccHHHHHH----------HHHHHHHH-H Confidence 133444444444445677777888889987754432222 223444553322222 22233331 1 Q ss_pred HHHHHHHhCCCcCCccccceEEecCcCchhH Q lcl|NC_017974. 392 LLPQLLALNDMRLPETEMPYVKAGEIVDVDM 422 (507) Q Consensus 392 li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl 422 (507) ++..|.++|. +-+...-+|.....+..|- T Consensus 316 l~~~~e~ln~--~lg~~~i~F~~~~l~~~d~ 344 (344) T protein:vir:60 316 LQDRIREING--WLGQEVIRFKNYSLDTDNG 344 (344) T ss_pred HHHHHHHHHH--hcCCcccccCccccCCCCC Confidence 2222333442 1122333455555555553 No 113 >protein:vir:1082 Length: 359 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076736;genbank:gi:13095846;genbank:GeneID:920394 Probab=98.65 E-value=1.4e-07 Score=58.13 Aligned_cols=350 Identities=11% Similarity=0.046 Sum_probs=167.4 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTFV 80 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~av 80 (507) |+=...-++.. .+.+..-. ......+.+ .....+ .-+..++-+.|.+|+..+...| T Consensus 1 M~~~~~f~~r~-~~~~~~~~--------~~~~~~~~~--------------~~~~~v-~~~~al~~~av~~cv~~ia~~i 56 (359) T protein:vir:10 1 MSILNPFERRS-SITPNNYY--------PFMVQNGSI--------------VPNSLV-DATEALKNSDLYAVTSLISSDI 56 (359) T ss_pred Ccccchhhccc-cCCCCcch--------hhhhccccc--------------cCCccc-CHHHhhcchHHHHHHHHHHHhh Confidence 65543221110 01110000 000000000 000000 0122456688999999999999 Q ss_pred hhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeeeeccccC Q lcl|NC_017974. 81 EKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAFRP 159 (507) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~r~ 159 (507) .++++ +. ++....++.+- -...+..++++.+. +.+++|-+..++++... | .+..|.+.| T Consensus 57 a~~p~--~~-------~~~~~~L~~~P---N~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~------g--~~~~l~~l~ 116 (359) T protein:vir:10 57 AGTRF--IG-------NQVFTSVLNNP---SHLTNAFSFWQTAILNLLLNGNVFLAILKGDN------S--LMKELRLIP 116 (359) T ss_pred hcCcc--cc-------chHHHHHhhcc---cccCCHHHHHHHHHHhccccCceEEEEEECCC------C--eEEEEEEeC Confidence 87754 21 11122222110 01123456666665 56678999988887532 2 244455555 Q ss_pred chhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCc----cCccccchHHHH Q lcl|NC_017974. 160 QASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGS----SSQALGVSPLVG 235 (507) Q Consensus 160 ~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~----~~~p~G~gll~~ 235 (507) +.++.. ..+ ++ .+.+.. . .........+|.+.++++++... .+..+|.|.+.. T Consensus 117 ~~~v~i----~~~-~~-~~~y~~---------~--------~~~~~~~~~~~~~evih~~~~~~~~~~~dg~~G~spi~~ 173 (359) T protein:vir:10 117 SNAITI----DLT-DD-TLTYEV---------N--------QFDDYPSAKYNASEMIHVKIMAYGVDTLHNLVGHSPLES 173 (359) T ss_pred CceEEE----EEc-CC-eEEEEE---------E--------ecCCceEEEEcccceEEeccCCCCCCccCccccccHHHH Confidence 543321 111 22 111110 0 01112344567777776665432 234579999998 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCce--eEeecCccccc Q lcl|NC_017974. 236 CYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQS--FFMLPSDTKDN 313 (507) Q Consensus 236 ~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a--~~iiP~g~~i~ 313 (507) +....-......++...+...-+.|--++++| ....+++..+.+++ ......+|..+ .++++.|++.. T Consensus 174 ~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~-------~~~l~~e~~~~~~~---~~~~~~~~~n~g~~~vl~~g~~~~ 243 (359) T protein:vir:10 174 LTSEIGQQKEANRLSLSTLKGALNPTSVVKVP-------QGTLSSEAKDSIRK---EFEKANGGNNSGRVMVLDQSADFS 243 (359) T ss_pred HHHHHHHHHHHHHHHHHHHhccCCcceEEEeC-------CCCCCHHHHHHHHH---HHHHHhCccccCCceecCCCccee Confidence 88877777777777777776544454455444 22222222222332 22223344434 47788887543 Q ss_pred ccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccc--cchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 314 APQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGN--DKGGSYNLSESKQTIHTQFVQRVNEIILEALNEN 391 (507) Q Consensus 314 ~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~--~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~ 391 (507) - .+-+.....|.+..++...+|++++.-..--.+. +...+++. + ++.....+.--+..+.+.|+.. T Consensus 244 ~---------l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~--~-e~~~~~~l~~~l~p~~~~l~~~ 311 (359) T protein:vir:10 244 T---------VSINADVANYLNSMNWGRTQIAKAFGVSDSYLNGTGDQQSSLDQ--I-KDLYVNALNRFIEPLISELRIK 311 (359) T ss_pred e---------ecCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCcccccHHH--H-HHHHHHHHHHHHHHHHHHHHHH Confidence 2 2222222346667788888999987554322221 11223322 2 1222222222333444444443 Q ss_pred HHHHHHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCC Q lcl|NC_017974. 392 LLPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKI 458 (507) Q Consensus 392 li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~ 458 (507) |.+.+ .++. .. .+.++ .+.+...+.+++..|++.+ +++|+.+|+|+=- T Consensus 312 l~~~~-~~~~-----~~--~~~~d------~~~~~~~~~~~~~~G~~t~-----NE~R~~l~~~pv~ 359 (359) T protein:vir:10 312 CDSSI-GVDM-----SP--ITDYS------NSVFKADILNWVKEGIIEP-----TEAKTLLESKGII 359 (359) T ss_pred hhhhh-cccc-----hh--hhhcC------HHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCC Confidence 33322 2221 10 12222 2445566788999998766 5799999997543 No 114 >protein:vir:79150 Length: 368 # NCBI annotation: bacteriophage gpQ # Family: family:all:196 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165254;genbank:gi:145708079;genbank:GeneID:5247161 Probab=98.64 E-value=1.2e-08 Score=63.94 Aligned_cols=344 Identities=13% Similarity=0.088 Sum_probs=137.7 Q ss_pred CcccccccccccC---ccccccccc----------cccccccccCchhhhHHHHhh---hhcccchhccchhHHHHHHHh Q lcl|NC_017974. 1 MSRKRNRNRSVKV---AKATSEQIN----------VSRMRMSEQGTFALAKVQVDS---ERMKAEEIRWPNLIGTVESMK 64 (507) Q Consensus 1 ~~~k~~~~~~~~~---~~~~~~~~~----------~~~~~~~~~g~~~~~~~~~~~---~~e~~~~lr~~~~~~~y~~m~ 64 (507) |+|||++...... +.+...... ..-+..++-.. .+....+.. ....+...+-|=.+....++. T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~fg~p~~-~~~~~~~~~~~~~~~~~~~~~~pi~~~~la~~~ 79 (368) T protein:vir:79 1 MSRNKTRRAARAASAHVRTANTDAPTEHHTDRAAQAEVFSFGDPVE-VLDRRELLDYVECMRMGQWYEPPMPWDGLARSF 79 (368) T ss_pred CCccccccchhccCcccccccccCcchhhccccCceEEEEcCCcee-ecchhhHHHHHHHHhccchhccCcCHHHHHHHH Confidence 9999977754322 121111100 00011111000 011110110 011111111111111112222 Q ss_pred -hChHHHHHHHHHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHHHHHhhcceeeeeeeeccCc Q lcl|NC_017974. 65 -QDATVATGLDMLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAATFNEYGLSVVEKVYTQVTV 143 (507) Q Consensus 65 -~D~~v~s~l~~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l~a~~~Gfs~~Ei~w~~~~~ 143 (507) ..+|-.+++..+...+ .+ .+.+.+.-+ ..++-.-+.+-+.+|-+.+|++.... T Consensus 80 ~~~~~h~~~~~~~~n~l-~l--~~~Pn~~~t---------------------~~~f~~l~~d~ll~Gnay~~~~r~~~-- 133 (368) T protein:vir:79 80 RAAAHHSSAVYVKRNIL-VS--TFIPHPLLS---------------------RATFERLVLDWQVFGNAYLERRENVL-- 133 (368) T ss_pred hhccccchhhhhhcchh-hh--hcCCCcCCC---------------------HHHHHHHHHHHhhcCCeEEEEEEcCC-- Confidence 3555555555443332 22 122222211 22232334466678999999987542 Q ss_pred cccCcceeeeeccccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCc Q lcl|NC_017974. 144 GEYIGKYKVKNLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGS 223 (507) Q Consensus 144 ~~~~g~~~~~~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~ 223 (507) |. +..|.+.++.++. ...|+...... ...+....+|...+++++...- T Consensus 134 ----G~--~~~L~~l~~~~v~------~~~~~~~~~~~--------------------~~~~~~~~~~~~dIihir~~~~ 181 (368) T protein:vir:79 134 ----GG--TIRLDTPLAKYVR------RGLDLNTYFFV--------------------QNWQQPYTFAAGSVFHLQEPDI 181 (368) T ss_pred ----CC--EEEEEEeCcccce------eeccCCEEEEE--------------------ecCCeEEEEccccEEEecCCCC Confidence 22 3445555554331 12222211111 1122344566666655554333 Q ss_pred cCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCcee Q lcl|NC_017974. 224 SSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSF 303 (507) Q Consensus 224 ~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~ 303 (507) .+..||.+.+..+......-.....+-..|...-+.|=-++++| +..-++++...+.+++ + +..+...++ T Consensus 182 ~~~~yGlsp~~~a~~si~l~~aa~~~~~~~~~NGa~~~gil~~~------~~~l~~e~~~~lk~~~-~---~~~G~~N~g 251 (368) T protein:vir:79 182 NQEVYGLPEYLSALNATWLNESATLFRRRYYKNGSHAGFILYMT------DAAQKQEDVDTLREAM-K---SAKGPGNFR 251 (368) T ss_pred CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeC------CCCCCHHHHHHHHHHH-H---HhcCCcccC Confidence 55679999998888777666666666566665444344344432 1122333333332333 2 222222222 Q ss_pred --Eee-cCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhccccccc--ccchhhhhHHHHHHHHHHHHHH Q lcl|NC_017974. 304 --FML-PSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVG--NDKGGSYNLSESKQTIHTQFVQ 378 (507) Q Consensus 304 --~ii-P~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~--~~~~GS~Al~~vh~~v~~~~~~ 378 (507) +++ |.|.+- .++++..+-+..-..|.+.-++-.++|+.+..-...-.+ .+.+|+++-.+... T Consensus 252 ~~~vl~~~g~~~-----g~~~~pls~~~~d~qf~e~k~~~~~eIa~af~VPp~llGi~~~~t~~~sn~e~~~-------- 318 (368) T protein:vir:79 252 NLFMYAPNGKKD-----GIQLLPVSEVAAKDEFWNIKNVTRDDQLAAHRVPPQLMGIIPNNTGGFGDVEKAA-------- 318 (368) T ss_pred ceeEecCCCCcc-----ceeEEEcCCCHHHHHHHHHHHHhHHHHHHHhCCCHHHccccCCCCCccccHHHHH-------- Confidence 333 333221 123444444444445777778888999998754432222 12233344222222 Q ss_pred HHHHHHHHHHHHHHHHHHHHhCCCcCCccccceEEecCcC--chhHHHHHHHHHHHHhcCCcccc Q lcl|NC_017974. 379 RVNEIILEALNENLLPQLLALNDMRLPETEMPYVKAGEIV--DVDMEGFSKAIQRIGAVGYLPKT 441 (507) Q Consensus 379 aD~~~i~~~ln~~li~~l~~lN~~~~~~~~~P~~~~~~~~--~~dl~~~a~~~~~L~~~G~~~~~ 441 (507) ..++..+|.- ++..|.++|-.-++ . .+.|+..+ ..|.+..|+. |. -+. T Consensus 319 --~~f~~~~l~P-l~~~ie~ln~~l~~--e--~~rF~~~~l~~~D~~a~a~~-------~~-rsa 368 (368) T protein:vir:79 319 --MVFARNEVKP-LQDRLLAINDWIGD--E--VVRFAPYALGGHDQPAAAPG-------GQ-RSA 368 (368) T ss_pred --HHHHHHHHHH-HHHHHHHHHhccCc--c--eeeechhHhhcccccccCCc-------cc-ccC Confidence 2333333321 33333344421111 1 23443211 2222222211 11 110 No 115 >protein:vir:78191 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111155;genbank:gi:134288732;genbank:GeneID:4960651 Probab=98.60 E-value=1.4e-07 Score=58.13 Aligned_cols=338 Identities=14% Similarity=0.128 Sum_probs=141.5 Q ss_pred CcccccccccccCccccccccc-----cccccccccCchhhhHHHHhhhhc---ccchhccchhHH-HHHHHhhChHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQIN-----VSRMRMSEQGTFALAKVQVDSERM---KAEEIRWPNLIG-TVESMKQDATVAT 71 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~-----~~~~~~~~~g~~~~~~~~~~~~~e---~~~~lr~~~~~~-~y~~m~~D~~v~s 71 (507) |+|||.++++...+.+...... ...+..++ +...++...+..+.+ ++.--.-|=... +-+-....+|..+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~~~~~~h~~ 79 (351) T protein:vir:78 1 MSKRRSRAPRTFAAAPNPSAGSAAPARAEVFTFDD-PTPVMNRAEILDYVECWSNGEWFEPPVSFAGLAKSFRASTHHSS 79 (351) T ss_pred CCCCCCCCCCCCCCCCchhhhhcccceeEEEEcCC-ceeecCcchhhhhhhhhccCceecCCCCHHHHHHHHhhhHhhhh Confidence 9999888776655543321110 01111111 000111111111111 110000011111 1222335899999 Q ss_pred HHHHHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHHHHHhhcceeeeeeeeccCccccCccee Q lcl|NC_017974. 72 GLDMLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAATFNEYGLSVVEKVYTQVTVGEYIGKYK 151 (507) Q Consensus 72 ~l~~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l~a~~~Gfs~~Ei~w~~~~~~~~~g~~~ 151 (507) +|..++..+.+ .+.+.+.-+. .++-.-+++.+.+|-+.+|++-... | . T Consensus 80 ~l~~k~n~l~~---~~~Pn~~~t~---------------------~~f~~~~~d~ll~Gnay~~~~rn~~------G--~ 127 (351) T protein:vir:78 80 ALFFKANVLAS---TFRPHRWLSR---------------------HAFERWALDFLTFGNGYLERRRNMV------G--G 127 (351) T ss_pred hhhhhhhHHhh---cccCCCCCCH---------------------HHHHHHHHHHHhcCCeEEEEEECCC------C--C Confidence 99887777654 2444443221 1222233456678999999886532 2 2 Q ss_pred eeeccccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccch Q lcl|NC_017974. 152 VKNLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVS 231 (507) Q Consensus 152 ~~~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~g 231 (507) +..|.+.|..++... .+.++ . .. .........+|+..+++++...-.+..||.+ T Consensus 128 ~~~L~pl~~~~v~~~----~~~~~--~----------~~----------~~~~~~~~~~~~~eVihir~~~~~~~~yGl~ 181 (351) T protein:vir:78 128 TLRLEPALAKYVRRK----ADFSG--F----------VY----------VNGWQERHEFAPDSVFQLVRPDINQEVYGLP 181 (351) T ss_pred EEEEEEecCcceEEe----eeCCe--E----------EE----------EecCCeEEEEccccEEEEcCCCCCCCccccc Confidence 345666665433211 11111 0 00 0011223446666655444333346778998 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCccc Q lcl|NC_017974. 232 PLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTK 311 (507) Q Consensus 232 ll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~ 311 (507) .+..+.-....-.....+-..|..--+.|=-+++++ +..-++++...+.+++ +......+.....+..|.|.+ T Consensus 182 ~~~~a~~si~l~~~a~~~~~~~f~NGa~pggIl~~~------~~~ls~e~~~~lr~~~-~~~~G~~N~~~~~v~~~~g~~ 254 (351) T protein:vir:78 182 EYLSSLHSAWLNESSTLFRRKYYENGSHAGFILYMT------DAAQKQDDVDNMRDAL-KNAKGPGNFRNVFMYAPGGKK 254 (351) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEec------CCCCCHHHHHHHHHHH-HHhcCcccccceeeecCCCCc Confidence 888887766655555555455554333333333322 1122333333333333 222211111111122344332 Q ss_pred ccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhccccccc--ccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 312 DNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVG--NDKGGSYNLSESKQTIHTQFVQRVNEIILEALN 389 (507) Q Consensus 312 i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~--~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln 389 (507) . .++++..+-+..-..|.+.-++-..+|+.+..-..--.+ .+.+|+++-.+.... ..+..+|. T Consensus 255 ~-----g~k~~pls~~~~d~qf~e~k~~~~~eIa~a~~VPp~llGi~~~~t~~~sn~e~~~~----------~f~~~~l~ 319 (351) T protein:vir:78 255 D-----GIQLIPVSEVAAKDEFFNIKNVTRDDLLAAHRVPPQLLGIVPSNSGGFGTPDTAAR----------VFGRNEIR 319 (351) T ss_pred c-----ceeEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHHHHH----------HHHHHHHH Confidence 1 123444444434445777777778889887654432222 122344442222222 22233332 Q ss_pred HHHHHHHHHhCCCcCCccccceEEecCcC--chhHHH Q lcl|NC_017974. 390 ENLLPQLLALNDMRLPETEMPYVKAGEIV--DVDMEG 424 (507) Q Consensus 390 ~~li~~l~~lN~~~~~~~~~P~~~~~~~~--~~dl~~ 424 (507) - ++..|.++|.. . +..+ |.|+..+ ..|.++ T Consensus 320 P-~~~~iee~n~~-l-~~~~--~~F~~~~Llr~d~ka 351 (351) T protein:vir:78 320 P-LQARFAELNDW-L-GDEV--VRFDDYEIPPAPVAA 351 (351) T ss_pred H-HHHHHHHHHhh-c-Cccc--eecChhhhccccccC Confidence 1 22223334421 1 1122 4444333 222222 No 116 >protein:vir:2013 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046757;genbank:gi:9630328;genbank:GeneID:1261529 Probab=98.60 E-value=1.5e-07 Score=58.01 Aligned_cols=338 Identities=12% Similarity=0.099 Sum_probs=143.4 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhc---ccchhccchhHHHHHHH-hhChHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERM---KAEEIRWPNLIGTVESM-KQDATVATGLDML 76 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e---~~~~lr~~~~~~~y~~m-~~D~~v~s~l~~R 76 (507) |+|||.++.++.....+........+..++ +...++...+..+.+ ++.--.-|=...-..++ ...+|..++|.-+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~-p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~~a~~~h~~~i~~k 79 (344) T protein:vir:20 1 MSKKKGKTPQPAAKTMTASGPKMEAFTFGE-PVPVLDRRDILDYVECISNGRWYEPPVSFTGLAKSLRAAVHHSSPIYVK 79 (344) T ss_pred CCcccCCCCcchhhhhhccCCceEEEEcCC-ceEecCcchhhhhhhhhhcCceecCCCCHHHHHHHHhhhhhhCccceeh Confidence 999988766554333222211111222221 000111111111111 11100001111111233 3588999998877 Q ss_pred HHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHHHHHhhcceeeeeeeeccCccccCcceeeeecc Q lcl|NC_017974. 77 YTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAATFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLA 156 (507) Q Consensus 77 k~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~ 156 (507) +..+.+ .+.+.+.-+.. .|..+ +++-+.+|-+.+|++.... | .+..|. T Consensus 80 ~n~l~~---~~~Pn~~lt~~------------------~f~~~---~~d~ll~Gnay~~i~rn~~------G--~~~~L~ 127 (344) T protein:vir:20 80 RNILAS---TFIPHPWLSQQ------------------DFSRF---VLDFLVFGNAFLEKRYSTT------G--KVIRLE 127 (344) T ss_pred hhhHHH---hccCCCCCCHH------------------HHHHH---HHHHHhcCCeEEEEEECCC------C--cEEEEE Confidence 776654 24544432211 13223 3355568999999987532 2 245566 Q ss_pred ccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHH Q lcl|NC_017974. 157 FRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGC 236 (507) Q Consensus 157 ~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~ 236 (507) +.|..++. ...++... +.. ...+....++++.+++++...-.+..||.+.+..+ T Consensus 128 pl~~~~vr------~~~~~~~~----------~~~----------~~~~~~~~~~~~eIiHir~~~~~~~~yGls~~~~a 181 (344) T protein:vir:20 128 TSPAKYTR------RGVEEDVY----------WWV----------PSFNEPTAFAPGSVFHLLEPDINQELYGLPEYLSA 181 (344) T ss_pred EcCCceeE------eeecCCEE----------EEE----------ccCCeEEEEcCccEEEeCCCCCCCCcccccHHHHH Confidence 66654332 11222111 000 01122345666666555543334667899988887 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccc Q lcl|NC_017974. 237 YRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQ 316 (507) Q Consensus 237 ~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~ 316 (507) .-....-.....+-..|.+.-+.|=-++++| +..-++++...+.+++.+ .....+|....+.+|.|-+- T Consensus 182 ~~si~l~~~a~~~~~~~f~NGa~p~~Il~~~------d~~l~~e~~~~ik~~~~~-~~g~~n~r~l~l~~p~g~~~---- 250 (344) T protein:vir:20 182 LNSAWLNESATLFRRKYYENGAHAGYIMYVT------DAVQDRNDIEMLRENMVK-SKGRNNFKNLFLYAPQGKAD---- 250 (344) T ss_pred HHHHHHHHHHHHHHHHHHhccCCCceEEEec------CcCCCHHHHHHHHHHHHH-hcCCCCccceEEecCCCCcc---- Confidence 7766665555555556665434344444432 122233333333333322 21111111112223433211 Q ss_pred cceeeeeccCCCcchhHHHHHHHHHHHHHHHHhccccccc--ccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 317 YSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVG--NDKGGSYNLSESKQTIHTQFVQRVNEIILEALNENLLP 394 (507) Q Consensus 317 ~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~--~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~li~ 394 (507) .++++..+-+..-..|.+.-++-..+|+.+..-.---.+ .+.+|+++-.+....+ .+..+|. -++. T Consensus 251 -gi~~~pis~~~~d~qf~e~k~~s~~eIa~af~VPp~llGi~~~~t~~~~n~e~~~~~----------f~~~~l~-P~~~ 318 (344) T protein:vir:20 251 -GIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQLMGGKPENVGSLGDIEKVAKV----------FVRNELI-PLQD 318 (344) T ss_pred -ceeEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHHHhccCCCCCCccccHHHHHHH----------HHHHHHH-HHHH Confidence 133444444444445777778888889987754332222 2234445433322222 2222221 1222 Q ss_pred HHHHhCCCcCCccccceEEecCcCchhH Q lcl|NC_017974. 395 QLLALNDMRLPETEMPYVKAGEIVDVDM 422 (507) Q Consensus 395 ~l~~lN~~~~~~~~~P~~~~~~~~~~dl 422 (507) .|-++|..-+. ..-+|.+...+..|- T Consensus 319 ~~e~in~~lg~--~~i~F~~~~l~~~d~ 344 (344) T protein:vir:20 319 RIREINGWLGQ--EVIRFKNYSLDTDND 344 (344) T ss_pred HHHHHHHhcCC--cccccCccccccCCC Confidence 22233311111 111233333333331 No 117 >protein:vir:1150 Length: 350 # NCBI annotation: predicted capsid packaging protein # Family: family:all:196 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490599;genbank:gi:17313219;genbank:GeneID:927315 Probab=98.54 E-value=3.2e-07 Score=56.19 Aligned_cols=332 Identities=12% Similarity=0.095 Sum_probs=140.3 Q ss_pred CcccccccccccCccc----ccccc----ccccccccccCchhhhHHHHhhhhc---ccchhccchhHHHHHHH-hhChH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKA----TSEQI----NVSRMRMSEQGTFALAKVQVDSERM---KAEEIRWPNLIGTVESM-KQDAT 68 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~----~~~~~----~~~~~~~~~~g~~~~~~~~~~~~~e---~~~~lr~~~~~~~y~~m-~~D~~ 68 (507) |+|||++++....+.+ ..... ....+..++-. ..++...+..+.+ ++.-..-|=...-.-++ +..+| T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~-~v~~~~~~~~y~~~~~~~~~~~pp~~~~~la~~~~~~~~ 79 (350) T protein:vir:11 1 MSKRRSHRRQQPVTVQSAQEGEFIPRQGGRAEAFTFGDPM-PVLDGRGILDYLECWPNGRWYEPPLSMEGLAKSVGSSVY 79 (350) T ss_pred CCccccCCCcCccccCCcchhhhccccccceEEEEeCCce-eecCcchhhHHHHHhhcCccccCCCCHHHHHHHHhhhhh Confidence 9999877665432221 11100 01111222110 0111111111111 11110001111111223 34788 Q ss_pred HHHHHHHHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHHHHHhhcceeeeeeeeccCccccCc Q lcl|NC_017974. 69 VATGLDMLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAATFNEYGLSVVEKVYTQVTVGEYIG 148 (507) Q Consensus 69 v~s~l~~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l~a~~~Gfs~~Ei~w~~~~~~~~~g 148 (507) .+++|..++..+.+ .+.+.+.-+.. .|..++ ++-+.+|-+.+|++.... | T Consensus 80 h~~~l~~k~n~l~~---~~~Pn~~~t~~------------------~f~~~v---~d~ll~Gnay~~~~rn~~------G 129 (350) T protein:vir:11 80 LQSGLKFKRNMLAK---TFIPHRLLSRA------------------TFEQFS---LDWLTFGSAYLEQPRSRL------G 129 (350) T ss_pred hccchhhhhhhhhh---cccCCCCCCHH------------------HHHHHH---HHHHhcCCeEEEEEEcCC------C Confidence 88888877666544 24444432211 133333 455678999999986532 2 Q ss_pred ceeeeeccccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccc Q lcl|NC_017974. 149 KYKVKNLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQAL 228 (507) Q Consensus 149 ~~~~~~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~ 228 (507) . +..|.+.|+.++. ...++.. ++.. ...+....+|+..+++++...-.+..| T Consensus 130 ~--~~~L~~l~~~~vr------~~~~~~~----------~~~~----------~~~~~~~~~~~~eVihir~~~~~~~~y 181 (350) T protein:vir:11 130 T--RMPLQAPLAKYMR------RGTDLET----------FYQV----------RSWKDEHEFEKGSVIQLREADINQEIY 181 (350) T ss_pred C--EEEEEEeCCceeE------eeecCCe----------EEEE----------eeCCeEEEECcccEEEeCCCCCCCCcc Confidence 2 3456565554332 1222211 1100 112233456777766555443345678 Q ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCcee--Ee- Q lcl|NC_017974. 229 GVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSF--FM- 305 (507) Q Consensus 229 G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~--~i- 305 (507) |.+.+..+.-....-.....+-..|...-+.|=-++++| +..-++++..++.+++. .. .++..++ ++ T Consensus 182 Gls~~~~a~~si~l~~~a~~~~~~~f~NGa~~~gil~~~------~~~ls~e~~~~l~~~~~-~~---~G~~N~~~~~v~ 251 (350) T protein:vir:11 182 GVPEWFCALQSALLNESATLFRRKYYNNGSHAGFILYMT------DAAQNEEDIDALRTALK-TA---KGPGNFRNLFVY 251 (350) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEec------CCCCCHHHHHHHHHHHH-Hh---cCccccCceeee Confidence 999888887776665555555555654433343344433 11223333333333332 22 2221121 22 Q ss_pred ecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhccccccc--ccchhhhhHHHHHHHHH-HHHHHHHHH Q lcl|NC_017974. 306 LPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVG--NDKGGSYNLSESKQTIH-TQFVQRVNE 382 (507) Q Consensus 306 iP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~--~~~~GS~Al~~vh~~v~-~~~~~aD~~ 382 (507) .|.|.+- .++++..+-+..-.+|.+.-++-..+|+.+..-.---.| .+.+|+++-.+....++ ..-+.--++ T Consensus 252 ~~~g~~~-----g~~~~pl~~~~~d~qf~e~k~~~~~eIa~a~~VPp~llGi~~~~t~~~sn~e~~~~~f~~~~L~P~~~ 326 (350) T protein:vir:11 252 APNGKKE-----GIQLIPVSEVAAKDEFGSIKNISRDDQLAGLRVYPQLMGVVPQNAGGFGSISDAAAVWASLELAPMQT 326 (350) T ss_pred cCCCCcc-----ceEEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcCCHHHHHHHHHHHHHHHHHH Confidence 2332211 233444444444445777778888889887754332222 12234444333222221 122222333 Q ss_pred HHHHHHHHHHHHHHHHhCCCcCCccccceEEecCcCchhH Q lcl|NC_017974. 383 IILEALNENLLPQLLALNDMRLPETEMPYVKAGEIVDVDM 422 (507) Q Consensus 383 ~i~~~ln~~li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl 422 (507) .+++ +|+.|.+.. +.|.+-.-.+| T Consensus 327 ~ie~-ln~~l~~~~---------------~~F~~~~~~~l 350 (350) T protein:vir:11 327 RLQQ-VNEMIGEEV---------------VRFAQFDAPGL 350 (350) T ss_pred HHHH-HHhhcCccc---------------cccCcccccCC Confidence 3332 333222111 22332222222 No 118 >protein:vir:79647 Length: 435 # NCBI annotation: PorT # Family: family:all:297 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285520;genbank:gi:148734503;genbank:GeneID:5220005 Probab=98.37 E-value=1e-06 Score=53.48 Aligned_cols=408 Identities=11% Similarity=0.056 Sum_probs=168.5 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTFV 80 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~av 80 (507) |+||+.+ ..++..-..... +..|+....... ...+-+.++..+| +.++-+..++++--.-. T Consensus 5 m~~~~~~---~~~~D~~~~~~~------~~~g~~~~~~~~-------~~~~~~~~l~~~Y---~~~~l~~~~Vd~~aed~ 65 (435) T protein:vir:79 5 MSDKVKA---ITKEDGYNEIFG------SKDGTFRPNAFY-------MQRAAFKALSQFY---EEDGMARRIVDVIPEEM 65 (435) T ss_pred ccccccc---chhhcchhhhhc------ccccccccCccc-------CCcCCHHHHHHHH---hcCchhhhhhccchHHh Confidence 9998422 222221111010 001111000000 0001112222222 35777788888776666 Q ss_pred hhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHHHHHhhcceeeeeeeeccCccccC---cceeeeeccc Q lcl|NC_017974. 81 EKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAATFNEYGLSVVEKVYTQVTVGEYI---GKYKVKNLAF 157 (507) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l~a~~~Gfs~~Ei~w~~~~~~~~~---g~~~~~~l~~ 157 (507) ++..|+|. . +.. .+ .++..++.+. -|..+...+-.+.+||++.+=+.-. ++..... -.-.++.|.+ T Consensus 66 ~r~g~~i~--g-~~~-~~----~~~~~~~~l~--~~~~l~~a~~~~rl~G~~~i~i~~~-d~~~~~~Pl~~~g~i~~i~v 134 (435) T protein:vir:79 66 VTPGFKVD--G-VKN-EK----SFKSRWDELR--LNAKIIDALSWSRLFGGSAILAVVA-DNKMLKSPVKPGAQLEDIRV 134 (435) T ss_pred hcCCceec--C-CCh-HH----HHHHHHHHhh--HHHHHHHHHHhhhccccEEEEEEec-CCCCcccccccCCceeeEEe Confidence 66666553 2 111 11 2444445443 1444444455688999987533321 1111100 0001222222 Q ss_pred cCchhcccccceeecCCCccccccccccccccccccccccccccc--cCCCccccccccEEEEee------CCccCcccc Q lcl|NC_017974. 158 RPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVS--TRMSDVIIPINRVMLMNT------GGSSSQALG 229 (507) Q Consensus 158 r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~--~~~~~~~iP~~k~i~~~~------~~~~~~p~G 229 (507) ..+..+.. ..++ ++|... .++.+..+... ....+..|-+.|++.+.. ....++++| T Consensus 135 ~d~~~i~~---~~~~----------~dp~sp---~fg~P~~y~v~~~~~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~G 198 (435) T protein:vir:79 135 YDRYQITI---HERE----------TNARSV---RYGEPKLYKISPGGDIPEFFVHYSRICIIDGERVSNEKRRQNDGWG 198 (435) T ss_pred echhhccc---hhhc----------cCCccc---ccCcceEEEEecCCCCCceEEcceeEEEecCCcchhhhccccCccc Confidence 22211110 0011 111111 11111111111 112345676777777652 234678899 Q ss_pred chHH-HHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecC Q lcl|NC_017974. 230 VSPL-VGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPS 308 (507) Q Consensus 230 ~gll-~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~ 308 (507) .++| +.||....--.....-=++.+.|+... +++++- +...-.. .+........+. .+...++...+.+++.. T Consensus 199 ~S~l~e~~~~~l~~~~~~~~~~~~l~~~~~~~--v~~~~~-l~~~~~~--~~~~~~~~~r~~-~~~~~~~~~~~~~i~~~ 272 (435) T protein:vir:79 199 ASILNKRLIEAIVDYNYCQELATQLLRRKQQA--VWKARD-LALMCDD--EEGRYAARLRLA-QVDDESGVGKAIGIDAT 272 (435) T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHhcCc--cccchh-HHHhhcC--ccchHHHHHHHH-HHHHhcCCCCceeEecC Confidence 9965 788876654444444445666665543 444321 1111111 111111111221 11222222333344444 Q ss_pred cccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhccccc-ccccchhhhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 309 DTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFIN-VGNDKGGSYNLSESKQTIHTQFVQRVNEIILEA 387 (507) Q Consensus 309 g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLt-s~~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ 387 (507) +-+++. +..+-+| ...+++..-.+||.+.--..-- .|++.+|-.|.|+-.....-+.+++-......- T Consensus 273 ~e~~e~-------~~~~lsg----l~~~~~~~~~~iaaa~~IP~t~L~G~s~~glnstgd~d~~~yyd~i~~~Qe~~l~p 341 (435) T protein:vir:79 273 DEEYEV-------LNSDVSG----VPEFLQEKIDRIVALTGIHEIIIKNKNTGGVSASQNTALETFYKLIDRKRVEDYKP 341 (435) T ss_pred CcceEE-------EecccCC----HHHHHHHHHHHHHhhhCCCeeeeccCCccccccchhHHHHHHHHHHHHHHHHHHHH Confidence 333333 3332222 3567788888888765443311 233344545556666677777777766544333 Q ss_pred HHHHHHHHHHHhCCCcCCccccceEEecCcCchhH-------HHHHHHHHHHHhcCCccccHHHHHHHHH---HcCCCCC Q lcl|NC_017974. 388 LNENLLPQLLALNDMRLPETEMPYVKAGEIVDVDM-------EGFSKAIQRIGAVGYLPKTPKVINRVLE---VLGIDEK 457 (507) Q Consensus 388 ln~~li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl-------~~~a~~~~~L~~~G~~~~~~~~~~~i~e---~~Gip~~ 457 (507) +.+.|+.-++ ++ ++- .|+|...-..+- +..|++++++++.|++.++ ++.+.++. .+|+... T Consensus 342 ~l~~l~~li~-~s----~d~---~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~g~i~~~-e~r~~L~~~~~~~~~~~~ 412 (435) T protein:vir:79 342 ILEFLLPFMI-SE----TEW---SIEFEPLSVPSDKDKAEIMAKNVESVVKLKAEQAINLK-ETRDTLRSICPDLKIMDN 412 (435) T ss_pred HHHHHHHHhh-cC----CCC---eEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHH-HHHHHHHHhccccCCCCc Confidence 3333444332 11 111 345543333222 4567888899999987663 33444443 3344321 Q ss_pred CCcccccchhhhcCCCcCCCCCCcc Q lcl|NC_017974. 458 IEEDISQEELMKLLGEDTSRAGDGM 482 (507) Q Consensus 458 ~~~~~~~~~~~~~~~~~~~~~~~~~ 482 (507) ...+ .+.+....++...+.|++. T Consensus 413 ~~~~--~~~~~d~~~~~~~e~g~~~ 435 (435) T protein:vir:79 413 DNIE--LPEPEDLDPEPGQEGGLNK 435 (435) T ss_pred cccc--CCccccCCCCCCCCCCCCC Confidence 1111 0111111111111222211 No 119 >protein:vir:78749 Length: 337 # NCBI annotation: putative portal protein # Family: family:all:196 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285643;genbank:gi:148727149;genbank:GeneID:5220095 Probab=98.35 E-value=1.2e-06 Score=53.13 Aligned_cols=326 Identities=13% Similarity=0.121 Sum_probs=145.1 Q ss_pred CcccccccccccCcccccc-ccccccccccccCchhhhHHHHhh-----hhcccchhccchhHHHHHHH-hhChHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSE-QINVSRMRMSEQGTFALAKVQVDS-----ERMKAEEIRWPNLIGTVESM-KQDATVATGL 73 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~-~~~~~~~~~~~~g~~~~~~~~~~~-----~~e~~~~lr~~~~~~~y~~m-~~D~~v~s~l 73 (507) |+|+|+++.+.....+.+. ...-|..-++ +..-+.-...+. ..|++ .-...+ .++ +..+|..++| T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~p~~~~~--~~~~~~~~~~~~~~~~~~~~pP--~~~~~L----a~l~~~~~~h~~~L 72 (337) T protein:vir:78 1 MTKRQQQPAQAAASSPRPSVVFSMPEAIDP--TAWMTDYTGVFYNPYGEYYQPP--IDRKGL----AKVARANAHHGAIL 72 (337) T ss_pred CCCcccCcccccccCceeEEEecCcccccC--cchhHhhhhhhhccCcceecCC--CCHHHH----HHHhhcchhhhhHH Confidence 9998887654443332221 1111110010 000000001110 01111 111112 233 3488999999 Q ss_pred HHHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHHHHHhhcceeeeeeeeccCccccCcceeee Q lcl|NC_017974. 74 DMLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAATFNEYGLSVVEKVYTQVTVGEYIGKYKVK 153 (507) Q Consensus 74 ~~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~ 153 (507) ..|...+.+. +.+. .++ |..++ ++-+.+|.+.+|+++... | .+. T Consensus 73 ~~k~N~~~~~---f~~~----------~~~------------~~~~~---~d~ll~GNay~~~~rn~~------G--~~~ 116 (337) T protein:vir:78 73 MARRNMVAGR---FTNQ----------RAT------------ITAFV---HNYLQFGDGGLLKLRNSF------G--QVV 116 (337) T ss_pred Hhhhcccccc---CcCc----------HHH------------HHHHH---HHHHhhCCeEEEEEECCC------C--cEE Confidence 9888766542 2211 011 33333 355568999999988642 2 245 Q ss_pred eccccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHH Q lcl|NC_017974. 154 NLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPL 233 (507) Q Consensus 154 ~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll 233 (507) .|.|.|+.++. ...|++. ++. ........+|++.+++.+...-.+..||.+.+ T Consensus 117 ~L~pl~~~~v~------~~~d~~~----------~~~-----------~~~~~~~~~~~~eIiHik~~~~~~~~~Gls~~ 169 (337) T protein:vir:78 117 GLHPLSSVYLR------RREDGCF----------VYL-----------QQGKPNLIYRPDDVIWLAQYDPEQQVYGMPDY 169 (337) T ss_pred EEEEeCCceeE------eeeCCeE----------EEE-----------EcCCceEEECCccEEEECCCCCCCCcccccHH Confidence 66666664432 2233321 110 11123445677776544433334567899888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCccccc Q lcl|NC_017974. 234 VGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDN 313 (507) Q Consensus 234 ~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~ 313 (507) ..+.-....-.....+-..|..--+.|--++++| +..-++++..++.+++ +.......+....+.+|.|.+- T Consensus 170 ~~a~~si~l~~aa~~~~~~~f~NGa~p~~il~~~------~~~l~~e~~~~lk~~~-~~~~G~~n~~~~~v~~~~g~~~- 241 (337) T protein:vir:78 170 LGGLQSALLNQDATLFRRRYFLNGAHMGFIFYAT------DPNMDDDTEEEMKEMI-ANSKGVGNFRSMFVNIPDGKPD- 241 (337) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcC------CCCCCHHHHHHHHHHH-HHhcCcccccceEEEcCCCCcc- Confidence 8877766665555555555554333333343332 1122233333332333 2222111111122333444321 Q ss_pred ccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhccccccc---ccchhhhhHHHHHHH-HHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 314 APQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVG---NDKGGSYNLSESKQT-IHTQFVQRVNEIILEALN 389 (507) Q Consensus 314 ~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~---~~~~GS~Al~~vh~~-v~~~~~~aD~~~i~~~ln 389 (507) .++++..+-+..-..|.+.-++-..+|+.+..-.---.| ++.+|+++-.+-+.. ....-+.-.++.|++.+| T Consensus 242 ----Gi~~~pis~~~~d~qfle~k~~s~~eIa~a~~VPp~llGi~~~~~~~~~~n~e~~~~~f~~~~L~P~~~~ie~~~n 317 (337) T protein:vir:78 242 ----GIKLIPVGDIATKDEFAAIKGITAQDVLTAHRYPPALAGIIPTNGGGGLGDPEKYDATYARNEVLPLCELVQDAIN 317 (337) T ss_pred ----ceeEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHHcccccCCCcCccccHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 133444444444445666677777888887654321111 222444543333333 233444556666666666 Q ss_pred HHHHHHHHHhCCCcCCccccceEEecCcCchhH Q lcl|NC_017974. 390 ENLLPQLLALNDMRLPETEMPYVKAGEIVDVDM 422 (507) Q Consensus 390 ~~li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl 422 (507) +.+++.. . .+.|+.....-+ T Consensus 318 ~~ll~~~-----------~--~~~f~~~~~~~~ 337 (337) T protein:vir:78 318 SAGLPRA-----------L--WVTFRETIGAAV 337 (337) T ss_pred hhcCChh-----------h--ceeccccccccC Confidence 5433221 1 123333222222 No 120 >protein:vir:99563 Length: 862 # NCBI annotation: minor head protein-like protein # Family: family:all:297 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039808;genbank:gi:126011058;genbank:GeneID:4818258 Probab=98.02 E-value=7e-06 Score=48.85 Aligned_cols=458 Identities=11% Similarity=0.030 Sum_probs=177.1 Q ss_pred Cccccc-ccccccCc----cccccccc------cccccccc--------c-------Cc---hhhhHHH-Hhh-hhcccc Q lcl|NC_017974. 1 MSRKRN-RNRSVKVA----KATSEQIN------VSRMRMSE--------Q-------GT---FALAKVQ-VDS-ERMKAE 49 (507) Q Consensus 1 ~~~k~~-~~~~~~~~----~~~~~~~~------~~~~~~~~--------~-------g~---~~~~~~~-~~~-~~e~~~ 49 (507) |++.|. -.-.++.+ +++...+. .+..+... . +. .++..+- ... ...... T Consensus 39 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~a~~~~~~~~~~~~~Dgl~n~~~~lG~~~~~s~ 118 (862) T protein:vir:99 39 LARTRQNWPVQKEKPNPIIRSVKDFPFVEISDSVNAKSVSGKNFAMDSAVRSAIKAITGFAMDDGGGAPVPIGAEGKQSS 118 (862) T ss_pred HHhhcccCCcccccCCCCCCcccccccccccccccchhhhhhhhcchhhcchhhhhhhhhhhhcchhhhhhccccccccc Confidence 444332 11111111 11111110 00000000 0 00 0000000 000 000000 Q ss_pred ----h-hc-c-----chhHHHHHHHhhChHHHHHHHHHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHH Q lcl|NC_017974. 50 ----E-IR-W-----PNLIGTVESMKQDATVATGLDMLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQ 118 (507) Q Consensus 50 ----~-lr-~-----~~~~~~y~~m~~D~~v~s~l~~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~ 118 (507) . ++ + --.+.++...+..+-+..++++.-.-+++..|+|.........+.+..+.+++.++.+. ..+ T Consensus 119 y~~~~~~~~~~~~~~f~gyql~alY~~~~larkiVd~pAeDatR~g~~I~~~~d~~e~~~e~~~~ie~~~~rL~---v~~ 195 (862) T protein:vir:99 119 YAVPEALQDWYLSQGFIGHQACALIAQHWLVDKACSLAGEDAIRNGWHLKSLGEGEEIDEESLEKFKAIDVEFK---VKE 195 (862) T ss_pred cccchhccccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCceEeecCcccccCHHHHHHHHHHHHHhh---HHH Confidence 0 00 0 01123334344688999999999998888888887654333334455666777777664 334 Q ss_pred HHHHHH-HHHhhcceeeeeeeeccCccccCcce--------eeeeccccCchhcccccceeecCCCcccccccccccccc Q lcl|NC_017974. 119 FARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKY--------KVKNLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQ 189 (507) Q Consensus 119 ~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~--------~~~~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~ 189 (507) .|.+++ .+.+||-+.+=++-.......+.-.+ .++.|.... +++...... ..+.++|... T Consensus 196 ~l~eair~~RLyGga~ililv~~~D~~~LsqPLn~e~I~kG~lkgl~vlD--------p~w~~p~~v--~~~~~Dp~sp- 264 (862) T protein:vir:99 196 NLIEFNRFKNVFGIRVAIFVVDSEDPDYYEKPFNPDGITPGSYRGISQID--------PYWMMPMLT--AESTADPSSQ- 264 (862) T ss_pred HHHHHHHhcccccceEEEEEecCcCchhhhcCcCcccccccceeEEEEec--------hhhhccccc--cccccccccc- Confidence 444544 48889865431221111000000000 111111111 111111000 0011111110 Q ss_pred ccccccccccccccCCCccccccccEEEEeeCC------ccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceE Q lcl|NC_017974. 190 NYTASEIGVGGVSTRMSDVIIPINRVMLMNTGG------SSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIE 263 (507) Q Consensus 190 ~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~------~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v 263 (507) .++....+. ..+..|=+.|++.+.... ...+++|.|+|..||-...--......=++.+.++. +.+ T Consensus 265 --~yGkP~~y~----I~g~~IH~SRliif~g~~vpd~lk~ay~f~G~SvLe~iyd~L~~~d~t~~saa~Ll~ka~--l~v 336 (862) T protein:vir:99 265 --FFYEPEFWI----ISGQKYHRSHLIIARGPQPADILKPTYIFGGIPLVQRIYERVYAAERTANEAPLLAMNKR--TTA 336 (862) T ss_pred --ccCCceeee----ecCeeeccceeEEecCCCchhhhhccCCccCccHHHHHHHHHHHHHHHHHHHHHHHHHhc--cce Confidence 001111111 112233345555554322 344578999999998765433333333444555544 345 Q ss_pred EecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHH Q lcl|NC_017974. 264 LKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKS 343 (507) Q Consensus 264 ~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~ 343 (507) +++..- . ... .++ .+.+.+.. +...+ ++...++|..+-++ +.+..+-+| ...+++..-.+ T Consensus 337 ~ktd~l--~-~l~-~ed---~l~~r~~~-~~~~r-dN~Gi~liD~eEe~-------e~ls~slSG----L~dll~~~~q~ 396 (862) T protein:vir:99 337 IHTDTA--K-AIA-NED---KFIQRLMF-WVRYR-DNHAVKVLGTDETM-------EQFDTSLAD----FDAVIMGQYQL 396 (862) T ss_pred eechhH--h-hhc-cHH---HHHHHHHH-HHhcc-CcceeEEecCCCce-------eEEecccCC----hHHHHHHHHHH Confidence 554211 0 001 111 22222222 21222 33445666665433 333333232 24567777778 Q ss_pred HHHHHhccccc--ccccchhhhhHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhCCCcCCccccceEEecCcCch Q lcl|NC_017974. 344 ILDRLGAGFIN--VGNDKGGSYNLSESKQTIHTQFVQRVNE-IILEALNENLLPQLLALNDMRLPETEMPYVKAGEIVDV 420 (507) Q Consensus 344 Isk~iLGqtLt--s~~~~~GS~Al~~vh~~v~~~~~~aD~~-~i~~~ln~~li~~l~~lN~~~~~~~~~P~~~~~~~~~~ 420 (507) ||-+.--.. | .|.+..|-+|.|+-...+.-+.+++... .|...|++ |+. |+.+-.. .+.+ + .|+|...-.. T Consensus 397 IAaas~IP~-tiLfGqspaGlnATGE~D~~nYyD~I~s~QE~~L~P~Ler-L~~-li~~~lg-~~~d-~-~ieFnpL~~~ 470 (862) T protein:vir:99 397 VASIAKTPA-TKLLGTAPKGFNSTGEFETISYHEELESIQEHVYMPFLQR-HYL-ISRLSLG-IQHE-I-DVVMEPVASM 470 (862) T ss_pred HHhhhCCCc-eeecccCcccccCchHHHHHHHHHHHHHHHHHHHHHHHHH-HHH-HHHHhcC-CCCc-c-eEEeCCCCCC Confidence 887643322 2 1233346666666666667777777653 34455543 333 3222110 1211 1 3556544433 Q ss_pred hHH-------HHHHHHHHHHhcCCccccHHHHHHHHH--HcCCCCCCCcccc----c-ch-----hhhcCCCcCCCC--- Q lcl|NC_017974. 421 DME-------GFSKAIQRIGAVGYLPKTPKVINRVLE--VLGIDEKIEEDIS----Q-EE-----LMKLLGEDTSRA--- 478 (507) Q Consensus 421 dl~-------~~a~~~~~L~~~G~~~~~~~~~~~i~e--~~Gip~~~~~~~~----~-~~-----~~~~~~~~~~~~--- 478 (507) +-+ ..|++++++++.|++.++ ++.+.++. .+|++.-.+++.. . +. .....+..+.+. T Consensus 471 sekEkAEi~kk~Aea~~~lv~sGvispd-EvR~~L~~~~~~g~~~l~ded~E~d~~~~~e~~~~~e~~g~a~~~ap~de~ 549 (862) T protein:vir:99 471 TAQQQADLNKTKAEGGKVLIDGGVISPD-EERNRIRDDKRSGYNRLTKEDAEETPGASPENLAAYQKAGAAQETASAKET 549 (862) T ss_pred CHHHHHHHHHHHHHHHHHHHhcCCCCHH-HHHHHHHhcCCcCCCCCCcccccccCCCCcccccccccCCccccccccccc Confidence 323 345778899999987663 22333332 2343311111100 0 00 000000000000 Q ss_pred ----------CCcc--------cccCcccccccccCCCCchhhhccC Q lcl|NC_017974. 479 ----------GDGM--------TKGSSGNGTGKISSTRDNSAANLDN 507 (507) Q Consensus 479 ----------~~~~--------~~~~~~~~~~~~~~~~d~~a~~~~~ 507 (507) ++.. +.+...-.+....++.++.+++.++ T Consensus 550 ~aga~~~~~e~d~~~~p~~~~~~~g~~~~~t~~~~a~~p~~~~~~~~ 596 (862) T protein:vir:99 550 QAGAAVTTAEGDQPNVQMVPSMKPGQMVGPEVGITAPMPEDDAPVAG 596 (862) T ss_pred ccccCCccccCCcccccccCCCCCCCccccccccccCCCccccccCc Confidence 0000 0000000111222344555555555 No 121 >protein:vir:107662 Length: 427 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003893;genbank:gi:45686310;genbank:GeneID:2773002 Probab=97.96 E-value=8.9e-06 Score=48.27 Aligned_cols=404 Identities=12% Similarity=0.023 Sum_probs=162.5 Q ss_pred CccccccccccccccccccCchhhhHHH-H-hhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHHHhhCCceeeeC Q lcl|NC_017974. 13 VAKATSEQINVSRMRMSEQGTFALAKVQ-V-DSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTFVEKAFKDFKVI 90 (507) Q Consensus 13 ~~~~~~~~~~~~~~~~~~~g~~~~~~~~-~-~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~av~~~~~~~~~~ 90 (507) |+.-+. .|+...- + ......+. .+....+.++...+.++-+..++++--.-.++..|+|. T Consensus 1 ~~~~~~---------------d~~~~~~~~~~~~~~~~~-~~~~~~~~l~a~Y~~~~l~~~~Vd~~aed~~r~g~~i~-- 62 (427) T protein:vir:10 1 MKIVKH---------------DGYNDIFNGGADGSPKPF-FMSDASYHVGSFYNDNATAKRIVDVIPEEMVTAGFKMS-- 62 (427) T ss_pred CCcccc---------------chHHHHhhcCCCCcccCc-cccCchHHHHHHHHcCchhhhhhccchHHhhcCCcccc-- Confidence 111100 0110000 0 00000111 11112334444445578888888777666666655553 Q ss_pred CCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHHHHHhhcceeeeeeeeccCccc---cCcceeeeeccccCchhccccc Q lcl|NC_017974. 91 PGESEESKKAAKFVEYCLKNMEGQTLRQFARDAATFNEYGLSVVEKVYTQVTVGE---YIGKYKVKNLAFRPQASLSRTN 167 (507) Q Consensus 91 ~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l~a~~~Gfs~~Ei~w~~~~~~~---~~g~~~~~~l~~r~~~~i~~~~ 167 (507) . +. +. +.++..++.+.. |..+..-+-.+.+||.+.+=+.-+- .... ..+.-.++.|.+.++..+.. T Consensus 63 g-~~-~~----~~~~~~~~~l~~--~~~l~~a~~~~rl~G~a~i~i~v~d-~~~l~~p~~~~g~l~~l~v~d~~~~~~-- 131 (427) T protein:vir:10 63 G-VK-DE----KEFKSLWDSYKL--DSSLVDLLCWARLYGGAAMVAIIKD-NRMLTSQAKPGAKLEGVRVYDRFAITV-- 131 (427) T ss_pred C-cc-HH----HHHHHHHHHhhH--HHHHHHHHHhccccceeEEEEEecC-CCccccccCCCcceeEEEEechhcccc-- Confidence 2 11 11 124445555431 4444444446999999986433221 1100 01111233333332222211 Q ss_pred ceeecCCCccccccccccccccccccccccccccc--cCCCccccccccEEEEeeC------CccCccccchHHH-HHHH Q lcl|NC_017974. 168 PIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVS--TRMSDVIIPINRVMLMNTG------GSSSQALGVSPLV-GCYR 238 (507) Q Consensus 168 ~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~--~~~~~~~iP~~k~i~~~~~------~~~~~p~G~gll~-~~~~ 238 (507) ..++ ++|... .++.+..+... .......|-+.+++++... ....+++|.|+|. ++|. T Consensus 132 -~~~~----------~dp~s~---~fg~P~~y~v~~~~~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~~~~~~ 197 (427) T protein:vir:10 132 -EKRV----------TNARSP---RYGEPEIYKVSPGDNMQPYLIHHSRVFIADGERVAQQARKQNQGWGASVLNKSLID 197 (427) T ss_pred -cccc----------cCcccc---ccCcceEEEEecCCCCcceEEccccEEEecCCCchhhhcccCCcccchhhhHHHHH Confidence 0111 111110 11111111111 1122345667777776422 3466788999775 5555 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccccc Q lcl|NC_017974. 239 AWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYS 318 (507) Q Consensus 239 ~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~ 318 (507) ...--.....-=+..+.|+... +++++ .+...... .+........+.... ..++...+.+++.++-+++.+ T Consensus 198 ~i~~~~~~~~~~~~l~~k~~~~--v~k~~-~l~~~~~~--~~~~~~~~~r~~~~~-~~~~~~~~~~l~~~~e~~e~~--- 268 (427) T protein:vir:10 198 AICDYDYCESLATQILRRKQQA--VWKVK-GLAEMCDD--DDAQYAARLRLAQVD-DNSGVGRAIGIDAETEEYDVL--- 268 (427) T ss_pred HHHHHHHHHHHHHHHHHHhccc--cccch-hHHHHhcC--ccchHHHHHHHHHHH-HhcCcccceeeecCCCceeEE--- Confidence 4433223333334445555433 44432 11111111 111111222222221 112222233444444433332 Q ss_pred eeeeeccCCCcchhHHHHHHHHHHHHHHHHhccccc-ccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 319 MTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFIN-VGNDKGGSYNLSESKQTIHTQFVQRVNEIILEALNENLLPQLL 397 (507) Q Consensus 319 ~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLt-s~~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~ 397 (507) ..+-+| ...+++..-.+||.+.--..-- .|++.+|-.|.|+-.....-+.+++.......-+-+.|++-++ T Consensus 269 ----~~~lsg----l~~~~~~~~~~iaaa~~IP~t~L~G~sp~Glnstgd~D~~nyyd~i~~~Qe~~l~p~l~~l~~~i~ 340 (427) T protein:vir:10 269 ----NSDISG----VPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIV 340 (427) T ss_pred ----ecccCC----hHHHHHHHHHHHHhhhCCCeeeeccCCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 222222 3457777777888765333211 1333445556666666777777777664433333333444433 Q ss_pred HhCCCcCCccccceEEecC------cCchhH-HHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcc-cccchhhh Q lcl|NC_017974. 398 ALNDMRLPETEMPYVKAGE------IVDVDM-EGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEED-ISQEELMK 469 (507) Q Consensus 398 ~lN~~~~~~~~~P~~~~~~------~~~~dl-~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~-~~~~~~~~ 469 (507) ++ + .+ .++|.. .+..|+ +..|++++++++.|++.++ ++.+.++...+.....+.. ...+... T Consensus 341 -~s----~--~~-~~~f~pL~~~s~kEkaei~~~~a~a~~~~~~~gvi~~~-e~r~~L~~~~~~~~~~~~~~~~~e~~~- 410 (427) T protein:vir:10 341 -DE----E--EW-SIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLE-EARDTLRSIAPEFKLKDGNNINIREPE- 410 (427) T ss_pred -cC----C--Cc-EEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHH-HHHHHHHhhhccccCCCCccccccccc- Confidence 22 1 11 344432 222222 4567899999999987774 4456666443332211111 1111111 Q ss_pred cCCCcCCCCCCcccccCccc Q lcl|NC_017974. 470 LLGEDTSRAGDGMTKGSSGN 489 (507) Q Consensus 470 ~~~~~~~~~~~~~~~~~~~~ 489 (507) ...++.++.+++..+. + T Consensus 411 ~~~e~~p~~~e~~~d~---~ 427 (427) T protein:vir:10 411 ETTEPEPGLGEKLEDE---N 427 (427) T ss_pred hhcCCCCCCCCCCCCC---C Confidence 0111111111111111 0 No 122 >protein:vir:267 Length: 348 # NCBI annotation: putative capsid portal protein # Family: family:all:196 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536647;genbank:gi:17975125;genbank:GeneID:929081 Probab=97.93 E-value=1e-05 Score=47.89 Aligned_cols=330 Identities=10% Similarity=0.069 Sum_probs=144.7 Q ss_pred CcccccccccccCccccccccccccccccccCchhhh------HHHHh-----hhhcccchhccchhHHHHHHH-hhChH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALA------KVQVD-----SERMKAEEIRWPNLIGTVESM-KQDAT 68 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~------~~~~~-----~~~e~~~~lr~~~~~~~y~~m-~~D~~ 68 (507) |+++...+.+...+...+. +..+.-+...++ -.... ...|+. +- ..-..++ ...+| T Consensus 1 ~~~~~~~~~~~~~~~~~~~------~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~epp--~~----~~~La~l~~~n~~ 68 (348) T protein:vir:26 1 MTEQLIHSHTTDGTESKSV------YSFDPNPEPVDTNSWMTRYCELFYNDFDDYWEPP--IS----LKGLAEIANANGY 68 (348) T ss_pred CCccccchhhccccCCceE------EEecCCCeeecCcchHHHHHHHHhcCCCccccCC--CC----HHHHHHHHhhhhh Confidence 8876665555443333221 111100111111 01111 111111 11 1222334 35899 Q ss_pred HHHHHHHHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHHHHHhhcceeeeeeeeccCccccCc Q lcl|NC_017974. 69 VATGLDMLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAATFNEYGLSVVEKVYTQVTVGEYIG 148 (507) Q Consensus 69 v~s~l~~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l~a~~~Gfs~~Ei~w~~~~~~~~~g 148 (507) ..++|..++.-+.+ .+.+.+.-+. .++..-+++-+.+|-+.+|++.... | T Consensus 69 h~~~i~~k~N~l~~---~~~Pn~~~t~---------------------~~f~~~~~d~ll~Gnay~~~~rn~~------G 118 (348) T protein:vir:26 69 HGSLLKARANYVAG---RFMNGGGLPM---------------------YKMNSACWDYFGLGMSAFVKIRSYL------K 118 (348) T ss_pred hhhhHhhhhhHHhh---cccCCCCCCH---------------------HHHHHHHHHHHhcCCeEEEEEEcCC------C Confidence 99999988887755 3554443221 1122223455678999999986542 2 Q ss_pred ceeeeeccccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccc Q lcl|NC_017974. 149 KYKVKNLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQAL 228 (507) Q Consensus 149 ~~~~~~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~ 228 (507) .+..|.+.|..+.. ...|+.. ++ ....+....++++.+++++...-.+..| T Consensus 119 --~~~~L~~l~~~~v~------~~~d~~~----------~~-----------~~~~g~~~~f~~~dIiHir~~~~~~~~~ 169 (348) T protein:vir:26 119 --NVIALEPLPMVHMR------KRKNGDF----------VQ-----------LLRNNEQKVFKAKDVIFIPQYDPQQQIY 169 (348) T ss_pred --cEEEEEEecCceeE------eeecCcE----------EE-----------EEecCeEEEEcCccEEEEcCCCCCCCcc Confidence 23455566553321 1222210 00 0111234456677766555433345678 Q ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCc---eeEe Q lcl|NC_017974. 229 GVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQ---SFFM 305 (507) Q Consensus 229 G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~---a~~i 305 (507) |.+.+..+.-....-.....+=..|..--+.|=-+++.+ +..-++++..++.+++ + ...++.. ..+. T Consensus 170 Gls~~~~a~~si~l~~~a~~~~~~~f~NGa~pg~Il~~~------~~~ls~e~~~~lk~~~-~---~~~G~~n~~~~~vl 239 (348) T protein:vir:26 170 GLPDYLGSIQSSLLNRDATLFRRRYYLNGAHMGFIFYAT------DPNLSEADEKALKEKI-A---SSKGIGNFRSMFVN 239 (348) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEec------CCCCCHHHHHHHHHHH-H---HhcCcccccceeEE Confidence 998888877766665555555556654333344444332 1122233333332333 2 2222111 1233 Q ss_pred ecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhccccccc--ccchhhhhHHHHHHHH-HHHHHHHHHH Q lcl|NC_017974. 306 LPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVG--NDKGGSYNLSESKQTI-HTQFVQRVNE 382 (507) Q Consensus 306 iP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~--~~~~GS~Al~~vh~~v-~~~~~~aD~~ 382 (507) +|.|-+- .++++..+-+..-..|.+.-++-..+|+.+..-.---.| ...+|+++-.+....+ ...-+.-.++ T Consensus 240 ~~~g~~~-----Gi~~~pis~~~~d~qf~e~k~~t~~dIa~af~VPp~llGi~~~~~~~~sn~e~~~~~f~~~~l~P~~~ 314 (348) T protein:vir:26 240 IPNGKEK-----GIQLIPVGDIATKDEFERIKNITAQDIFVGHRFPAGMGGMLPQQGANVPDPLKVSQVYDFYEVIPVCK 314 (348) T ss_pred cCCCCcc-----ceeEEEccCChhHHHHHHHHHhhHHHHHHHhCCCHHHccccCCCCCccccHHHHHHHHHHHHHHHHHH Confidence 3433221 133444444444445666666667778886554332222 1223444433222222 2233344555 Q ss_pred HHHHHHHHHHHHHHHHhCCCcCCccccceEEecCcCchhHHHHHHHH Q lcl|NC_017974. 383 IILEALNENLLPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAI 429 (507) Q Consensus 383 ~i~~~ln~~li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~ 429 (507) .|++.||+.| ..+... +|.|+..-..|... +.++ T Consensus 315 ~ie~~ln~~l----------~~~~~~--~~~fdl~~~~e~~~-~~a~ 348 (348) T protein:vir:26 315 RFMDAVNNDP----------EIPDNL--KLKFNLNPGVESAN-GSAV 348 (348) T ss_pred HHHHHHhhhh----------CCCCcc--EEEEecCcccccch-hhcC Confidence 5555565432 012222 35554332222111 1222 No 123 >protein:vir:3743 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043484;genbank:gi:9628619;genbank:GeneID:1261113 Probab=97.77 E-value=2e-05 Score=46.29 Aligned_cols=339 Identities=14% Similarity=0.053 Sum_probs=140.8 Q ss_pred Ccccccccccc-cCccccccccccccccccc-cCchhhhHHHHhhhhcccchhccchhHHHHHHH-hhChHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSV-KVAKATSEQINVSRMRMSE-QGTFALAKVQVDSERMKAEEIRWPNLIGTVESM-KQDATVATGLDMLY 77 (507) Q Consensus 1 ~~~k~~~~~~~-~~~~~~~~~~~~~~~~~~~-~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m-~~D~~v~s~l~~Rk 77 (507) |+|.+...... .++.+.+. ..+..++ .++..+ .+.....-..++--.-|-.+.-.-++ +..+|-+|+|.-++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~-~y~~~~~~~~~~~~epp~~~~~la~~~~~~~~h~~~i~~k~ 75 (345) T protein:vir:37 1 MKTNVKTDNKKGIVIAPIND----RTFSLSEITASPAL-DYVGIGFDENYNCYLPPVNRHALAKLPHQNAQHGGILHSRA 75 (345) T ss_pred CCccccccchhhhcCCCceE----EEeecCCcccchhh-cccceeeecCCccccCCCCHHHHHHHhhcchhhcchhhhhh Confidence 65543332211 22222111 0111111 011111 00000000111111111111112223 45899999998888 Q ss_pred HHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHHHHHhhcceeeeeeeeccCccccCcceeeeeccc Q lcl|NC_017974. 78 TFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAATFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAF 157 (507) Q Consensus 78 ~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~ 157 (507) .-+.. .+.+.+.-+.. .|..++ ++-+.+|-+.+|++.... |. +..|.+ T Consensus 76 n~l~~---~~~Pn~~~t~~------------------~f~~~v---~d~ll~Gnay~~i~rn~~------G~--~~~L~p 123 (345) T protein:vir:37 76 NMVSA---TYEGGKALSKM------------------EMRALC---LNLIQFGDVGLLKVRNGF------GQ--VVRLVP 123 (345) T ss_pred hHHhh---ccCCCCCCCHH------------------HHHHHH---HHHHhcCCeEEEEEECCC------CC--EEEEEE Confidence 87754 35554433321 122333 455568999999987543 22 345555 Q ss_pred cCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHHH Q lcl|NC_017974. 158 RPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCY 237 (507) Q Consensus 158 r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~ 237 (507) .|+.++. ...|+......+. ......+....+|++.+++++...-.+..||.+.+..+. T Consensus 124 l~~~~vr------~~~d~~~~~~~~~---------------~~~~~~g~~~~~~~~eViHir~~~~~~~~~Gl~~~~~a~ 182 (345) T protein:vir:37 124 LSSLYLR------VHKDGGYSYLMKK---------------SLYDTAQEIYRYDAKDIIFIKLYDPMQQVYGSPDYVGGI 182 (345) T ss_pred ecCceeE------EeecCCeeEEEee---------------eeeccCceEEEEccccEEEEcCCCCCCCcccchHHHHHH Confidence 5554332 1222221111000 001112233456777766555433345678988777766 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCccccccccc Q lcl|NC_017974. 238 RAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQY 317 (507) Q Consensus 238 ~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~ 317 (507) -....-.....+-..|...-+.|=-+++++ +..-++++...+.+++.+ .+....+....+.+|.|.+- T Consensus 183 ~si~l~~~a~~~~~~~f~NGa~~~~Il~~t------~~~l~~e~~~~lk~~~~~-~~g~~n~~~~~i~~~~g~~~----- 250 (345) T protein:vir:37 183 QSALLNSDATVFRRRYFSNGAHMGFILYST------DPDLTEEMEEEIARKISE-SKGVGNFRSMFVNIAGGHPD----- 250 (345) T ss_pred HHHHHHHHHHHHHHHHHhccCCcceEEEeC------CCCCCHHHHHHHHHHHHH-hcCccccCceeEecCCCCcc----- Confidence 665555555555555554333333333322 112233333333333332 22211111222344444221 Q ss_pred ceeeeeccCCCcchhHHHHHHHHHHHHHHHHhccccccc--ccchhhhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 318 SMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVG--NDKGGSYNLSESKQTIH-TQFVQRVNEIILEALNENLLP 394 (507) Q Consensus 318 ~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~--~~~~GS~Al~~vh~~v~-~~~~~aD~~~i~~~ln~~li~ 394 (507) .++++..+-+..-.+|.+.-++-..+|+.+.--..--.+ ...+|+++-.+-...++ ..-+.--++.|++.+|+ T Consensus 251 G~~~~pl~~~~~d~qf~e~k~~~~~dI~~a~~VPp~liGi~~~~t~~~s~~e~~~~~f~~~~l~P~~~~ie~~ln~---- 326 (345) T protein:vir:37 251 GLKVIPIGDTGTKDEFANIKNISAQDVLTAHRFPAGLSGIIPTNTGGLGDPLKYREVYHYDEVMPLQEIIAETINQ---- 326 (345) T ss_pred ceeEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHhccccCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHhhh---- Confidence 133444444444445667777777788887654432222 12344454333333322 22233455555555553 Q ss_pred HHHHhCCCcCCccccceEEecCcCchhHHH Q lcl|NC_017974. 395 QLLALNDMRLPETEMPYVKAGEIVDVDMEG 424 (507) Q Consensus 395 ~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~ 424 (507) +.+ +++. ..+.|+..+ +.. T Consensus 327 -~~e-----~~~~--~~i~F~~~~---l~k 345 (345) T protein:vir:37 327 -DPE-----IKNL--LKIKFREQN---FAK 345 (345) T ss_pred -hhc-----cCCc--ceEEECchh---hcC Confidence 111 1121 235665332 111 No 124 >protein:vir:4698 Length: 251 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061630;genbank:gi:9635717;genbank:GeneID:1262980 Probab=97.71 E-value=2.6e-05 Score=45.71 Aligned_cols=248 Identities=10% Similarity=-0.004 Sum_probs=127.9 Q ss_pred Cc--ccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHH Q lcl|NC_017974. 1 MS--RKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYT 78 (507) Q Consensus 1 ~~--~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~ 78 (507) |. +|+.+ +...-+ +.... +..+... .... .....+ .-+..++-+.|.+|+..+.. T Consensus 1 MglF~~~~~-r~~~~~-~~~~~--------~~~~~~~----~~~~--------~~~~~v-~~~~al~~~~v~~~i~~ia~ 57 (251) T protein:vir:46 1 MGIFYKNEK-RDLQYN-EDDLQ--------MMVQTLP----SFQG--------TKLRQY-KDIEAIRHSDIFTAVMMIAS 57 (251) T ss_pred CCccccccc-cccCCC-ccchh--------hhhhhhc----cccC--------cCccee-chhhhhccHHHHHHHHHHHH Confidence 54 33221 111111 10000 0000000 0000 000001 12334567889999999999 Q ss_pred HHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHHH-HHhhcceeeeeeeeccCccccCcceeeeeccc Q lcl|NC_017974. 79 FVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAAT-FNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAF 157 (507) Q Consensus 79 av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l~-a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~ 157 (507) .|-++++..... +....+..+..++..- --...++.++++.+.. .+++|-+.++++.... | .+..|.+ T Consensus 58 ~iA~lp~~~~~~-~~~~~~~~~~~ll~~~--Pn~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~------G--~~~~L~~ 126 (251) T protein:vir:46 58 DLARMPIRVTVN-GQINYSDRIVNLLNTR--PNPMYNGYIFKLVVFVSALLTSHGYIEITRDKT------G--EPMNLTF 126 (251) T ss_pred hHhhCceEEeeC-ccccccchHHHHHhcc--CCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCC------C--cEEEEEE Confidence 999987665432 2222222333443311 1122346677777664 6889999999877532 3 2556777 Q ss_pred cCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHHH Q lcl|NC_017974. 158 RPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCY 237 (507) Q Consensus 158 r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~ 237 (507) .++.++. +..+.||...-... ... ....+....+|++.+|++++.+ .+..+|.|++..+. T Consensus 127 i~~~~v~----v~~~~~g~~~~~~~--------~~~-------~~~~g~~~~~~~~diiH~r~~~-~dg~~G~spi~~~~ 186 (251) T protein:vir:46 127 RKTSEIE----LKSDARGRLYYFHQ--------RID-------SNGNNIERNVKFEDMLDIKFYS-LDGINGLSLLDTLS 186 (251) T ss_pred ECCceEE----EEECCCCcEEEEEE--------Eec-------cCCcceeEEECCccEEEecCcC-CCCeeecCHHHHHH Confidence 7766543 22334443211000 000 0111233567888877777654 44579999999998 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccc Q lcl|NC_017974. 238 RAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKD 312 (507) Q Consensus 238 ~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i 312 (507) -..-.-....++...+...-+.|--++++|-.+ .+++...++.+... ....|.+-++.++.||+. T Consensus 187 ~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l------~~~e~~~~~~~~~~----~~~~g~~n~g~~~~gm~~ 251 (251) T protein:vir:46 187 RTIESDNNGKDFLNNFLRNGTHAGGILKMKGVL------DNKKARDRAREEFP----KVLVELNKLGKLSYSMNQ 251 (251) T ss_pred HHHHHHHHHHHHHHHHHHccCCCcEEEEeCCCC------CCHHHHHHHHHHHH----HHhcCcccccccccccCC Confidence 888777888888888888766676666665221 12222222222222 233454445667778874 No 125 >protein:vir:3780 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536820;genbank:gi:17981829;genbank:GeneID:929208 Probab=97.67 E-value=3.1e-05 Score=45.33 Aligned_cols=340 Identities=13% Similarity=0.030 Sum_probs=143.7 Q ss_pred Ccccccccccc-cCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHH-hhChHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSV-KVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESM-KQDATVATGLDMLYT 78 (507) Q Consensus 1 ~~~k~~~~~~~-~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m-~~D~~v~s~l~~Rk~ 78 (507) |.+++.+.++. .++.+.+.. -+..++.-...+..+..--....++--.-|=...-.-++ +..+|-+++|..++. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~----~f~~~~~~~~~~~~y~~~~~~~~~~~~epp~~~~~la~l~~~~~~h~~~i~~k~n 76 (345) T protein:vir:37 1 MKTNVKTDNKKGIVIAPINDR----TFSLNEISASPALDYVGIGFDENYNCYLPPVNRHALAKLPHQNAQHGGILHSRAN 76 (345) T ss_pred CCCCccccchhhcccCcceeE----EeecCCcccccchhhhhhhhcCCccccCCCCCHHHHHHHhhcccccccceeeech Confidence 76655444332 222222211 111221110001111100000111000001011111233 458899999987777 Q ss_pred HHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHHHHHhhcceeeeeeeeccCccccCcceeeeecccc Q lcl|NC_017974. 79 FVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAATFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAFR 158 (507) Q Consensus 79 av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~r 158 (507) .+.+ .+.+.+.-+ ..++..-+++.+.+|.+.+|++.... | .+..|.+. T Consensus 77 ~l~~---~~~Pn~~lt---------------------~~~f~~~~~d~ll~Gnay~~~~rn~~------G--~~~~L~pl 124 (345) T protein:vir:37 77 MVSS---LYEGGKALS---------------------RMDMRALCLNLIQFGDVGLLKVRNGF------G--QVVRLVPL 124 (345) T ss_pred HHHh---hccCCCCCC---------------------HHHHHHHHHHHHhcCCeEEEEEEcCC------C--cEEEEEEE Confidence 6654 244443322 12222233455678999999987543 2 24456666 Q ss_pred CchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHHHH Q lcl|NC_017974. 159 PQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYR 238 (507) Q Consensus 159 ~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~ 238 (507) |+..+. ...|+.....++ .+ .....+....+|++.+++++...-.+..||.+.+..+.- T Consensus 125 ~~~~vr------~~~d~~~~~~~~----------~~-----~~~~~g~~~~~~~~dVihir~~~~~~~~~Gls~~~~a~~ 183 (345) T protein:vir:37 125 SSLYLR------VRKDGGYSYLMK----------KS-----LYDTAQEIYRYDAKDIIFIKLYDPMQQVYGSPDYVGGIQ 183 (345) T ss_pred cCceeE------EEEeCCeeEEEE----------Ee-----EecCCceEEEEccccEEEecCCCCCCCcccccHHHHHHH Confidence 554331 112221111000 00 001122334567666655554333456689998888877 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccccc Q lcl|NC_017974. 239 AWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYS 318 (507) Q Consensus 239 ~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~ 318 (507) ....-.....+=..|.+.-+.|=-+++++ +..-++++...+.+++.+ ......+....+..|.|.+- . T Consensus 184 si~l~~~a~~~~~~~f~NG~~p~~Il~~~------d~~l~~e~~~~lk~~~~~-~~g~~n~~~~~i~~p~g~~~-----G 251 (345) T protein:vir:37 184 SALLNSDATVFRRRYFSNGAHMGFILYST------DPDLTEEMEEEIARKISE-SKGVGNFRSMFVNIANGHPD-----G 251 (345) T ss_pred HHHHHHHHHHHHHHHHhccCCcceEEEec------CCCCCHHHHHHHHHHHHH-hcCcccccceEEEcCCCccc-----c Confidence 76666555555556654333343344432 111233333333333332 22111111222334443321 1 Q ss_pred eeeeeccCCCcchhHHHHHHHHHHHHHHHHhccccccc--ccchhhhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 319 MTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVG--NDKGGSYNLSESKQTIH-TQFVQRVNEIILEALNENLLPQ 395 (507) Q Consensus 319 ~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~--~~~~GS~Al~~vh~~v~-~~~~~aD~~~i~~~ln~~li~~ 395 (507) ++++..+-+..-..|.+.-++...+|+.+.--.---.| ...+|+++-.+.+..++ ..-+.-.++.|++.+|+. T Consensus 252 ~~~~pls~~~~d~qf~e~k~~~~~dIa~a~~VPp~llGi~~~~~~~~~~~e~~~~~f~~~~l~P~~~~ie~~ln~~---- 327 (345) T protein:vir:37 252 LKVIPIGDTGTKDEFANIKNISAQDVLTAHRFPAGLSGIIPTNTGGLGDPLKYREVYHYDEVMPLQEIIAETINQD---- 327 (345) T ss_pred eEEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCccCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHhhhh---- Confidence 23444444444445667777888889887644332222 12234444333333222 233344555666666541 Q ss_pred HHHhCCCcCCccccceEEecCcCchhHHH Q lcl|NC_017974. 396 LLALNDMRLPETEMPYVKAGEIVDVDMEG 424 (507) Q Consensus 396 l~~lN~~~~~~~~~P~~~~~~~~~~dl~~ 424 (507) .. .++.. .+.|+..+ +.+ T Consensus 328 -~~-----~~~~~--~i~F~~~~---L~~ 345 (345) T protein:vir:37 328 -PE-----IKNLL--KIKFREQN---FAK 345 (345) T ss_pred -cc-----CCCcc--eEEecchh---hcC Confidence 11 12222 35554322 221 No 126 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=97.65 E-value=3.3e-05 Score=45.19 Aligned_cols=447 Identities=9% Similarity=-0.015 Sum_probs=157.2 Q ss_pred CccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHHHhhCCceeeeCCC Q lcl|NC_017974. 13 VAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTFVEKAFKDFKVIPG 92 (507) Q Consensus 13 ~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~av~~~~~~~~~~~~ 92 (507) |...... ...-..++..--.....+..+.+-...++.- ...+.+++. + +..+.+-=+..|....=.+.+.+- T Consensus 1 ~~t~~d~----i~~L~~~~~~~~~r~~~~~~Yy~G~~~i~~~-~~~~~~~~~-~--~~~~~n~~~~ivd~~~~~l~~~g~ 72 (480) T protein:vir:78 1 MTTYHEH----VERLQGLLARDLPNLLEAEAYRNGTRRLKTI-GIGAPPELA-Y--LDVQPGWVATYLRTLSDRLDIEGF 72 (480) T ss_pred CCCHHHH----HHHHHHHHHHHHHHHHHHHHHHhccccchhc-ccccchhhh-h--hhhhcchHHHHHHHHHhhhccCce Confidence 1111000 0000000000000000000111111111100 000001110 1 001111111111110000111111 Q ss_pred CChhHHHHHHHHHHHHHhccCCCHHHHHHHH-HHHHhhcceeeeeeeeccCc-cccCcceeeeeccccCchhcccccc-- Q lcl|NC_017974. 93 ESEESKKAAKFVEYCLKNMEGQTLRQFARDA-ATFNEYGLSVVEKVYTQVTV-GEYIGKYKVKNLAFRPQASLSRTNP-- 168 (507) Q Consensus 93 ~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~-l~a~~~Gfs~~Ei~w~~~~~-~~~~g~~~~~~l~~r~~~~i~~~~~-- 168 (507) ..+++.+..+.+.++++. ..|...+.++ .++.-||.| ++++|..... ...+|...+.-+.|+. ++..|.+ T Consensus 73 ~~~~d~~~~~~l~~i~~~---N~~~~~~~~~~~~a~~~G~a-y~~v~~~~~~~~d~~~~~~i~~~~p~~--~~~i~D~~~ 146 (480) T protein:vir:78 73 RISEDSEGLEELWNWWQA---NDLDEESVLGHDDSLTFGRA-YITVSHPDVESGDPAGIPLIRVESPLY--MYAELDPRN 146 (480) T ss_pred ecCCCchhHHHHHHHHHh---cCHHHHHHHHHHHHhhcCce-EEEeecCccccCCCCCeeEEEEEcccc--eEEEEcCCC Confidence 111222333445555543 2466666665 578889997 4688853211 1234555554443321 1111100 Q ss_pred ---------eeecCCCccc-ccccc-cccccccccccccccc--ccccCCCccccccccEEEEeeCCccCccccchHHHH Q lcl|NC_017974. 169 ---------IVYNEDGSAI-IGIKQ-SLSAFQNYTASEIGVG--GVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVG 235 (507) Q Consensus 169 ---------~~~~~d~~~l-~~~~q-~~~~~~~~~~~~~~~~--~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~ 235 (507) ++...|..+. ..+.- .+.....+........ ..........++.-.++.|.+..+.+.|+|.|-+.. T Consensus 147 ~~~~~~~i~~~~~~d~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~n~~~~~~~~G~sdi~~ 226 (480) T protein:vir:78 147 TRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISP 226 (480) T ss_pred ccceEEEEEEEEeecCCcceEEEEEEeCCeEEEEEecCCCcccccccccccccCCCCcceEEeecccccCCccCccchhH Confidence 0111111110 00000 0000011100000000 000111112233334577788888899999887654 Q ss_pred HHHHHHHH-HHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccc Q lcl|NC_017974. 236 CYRAWREK-ILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNA 314 (507) Q Consensus 236 ~~~~~~fK-~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~ 314 (507) -..+.+-. +..+...+..++.|..|.++++ |-......++.....+.+.. | ....+ .|.+.+ T Consensus 227 ~i~~l~Da~~~~~s~~~~~~~~~a~p~~~i~-----G~~~~~~~~~~~~~~~~~~~--------~--~~~~~-~~~~~~- 289 (480) T protein:vir:78 227 ELRKVTDAASRTLMNLQSASQILGTPLRVIS-----GVTTDELTNDGENTTLDIYY--------G--RILTL-ASEAAK- 289 (480) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcchhhhhh-----CCCccccccccccchhhhhh--------h--hhccC-CCCCce- Confidence 22121111 3344555667787887776653 10000001111111111111 1 11122 222222 Q ss_pred cccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccc-c-c----hhhhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 315 PQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGN-D-K----GGSYNLSESKQTIHTQFVQRVNEIILEAL 388 (507) Q Consensus 315 ~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~-~-~----~GS~Al~~vh~~v~~~~~~aD~~~i~~~l 388 (507) +...+.. +. ..++++++.-|....-...+.... + . .+.-|+ .....-....++.-.+.+...| T Consensus 290 ------~~~~~~~-~~---~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~Sg~Al-~~~~~~l~~k~~~~~~~f~~~l 358 (480) T protein:vir:78 290 ------ISEFKAA-EL---RNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAI-IATDSRIVKMAERKGRIFGGAW 358 (480) T ss_pred ------EEecCcc-CH---HHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHH-HHHHHHHHHHHHHHHHHHHHHH Confidence 2222222 12 234444444443332111111110 1 0 111122 1222333344444555556666 Q ss_pred HHHHHHHHHHhCCCcCCcc-ccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccc---c Q lcl|NC_017974. 389 NENLLPQLLALNDMRLPET-EMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDIS---Q 464 (507) Q Consensus 389 n~~li~~l~~lN~~~~~~~-~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~---~ 464 (507) .+ +++-++.+++...... .--.+.|......++.+.++.+.+|+.+|..+.+ .+.+++.+|+.+.+.++.. . T Consensus 359 ~~-~~rl~~~~~~~~~~~~~~~i~v~w~~~~~~s~~~~ad~~~kl~~~g~~~~s---~et~~~~lg~~~d~~~e~~~~~~ 434 (480) T protein:vir:78 359 ER-AMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIP---KEQARIDLGYTATQREQMRDWDK 434 (480) T ss_pred HH-HHHHHHHHcCCCccccceeeeEEecCCCCCCHHHHHHHHHHHHHhcccCCC---HHHHHhcCCCCHhHHHHHHHHHH Confidence 54 6677777764332222 2246788888888999999999999998865544 3568888998643222111 0 Q ss_pred chhh---hcCCCcCCCCCCcccccCcccccccccCCCCchhhhccC Q lcl|NC_017974. 465 EELM---KLLGEDTSRAGDGMTKGSSGNGTGKISSTRDNSAANLDN 507 (507) Q Consensus 465 ~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~a~~~~~ 507 (507) ++.. .....+....++.......++. ...+...+++.+-.+ T Consensus 435 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~ 478 (480) T protein:vir:78 435 QETEDMIDTLYSTTKAQADATPKPTVTET--KTETQTSPSGFNRTK 478 (480) T ss_pred HHHHHHHHHhhccccCCCccccCCCCCCC--CCccCCCcccCCCcC Confidence 0000 0000000000000000111111 111222333333344 No 127 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=97.64 E-value=3.3e-05 Score=45.12 Aligned_cols=450 Identities=10% Similarity=-0.051 Sum_probs=149.7 Q ss_pred Ccccc-ccccccccccccccCchhh-hHHHHhhhhcccchhccchhHHHHHHHhhChH----HHHHHH-------HHHHH Q lcl|NC_017974. 13 VAKAT-SEQINVSRMRMSEQGTFAL-AKVQVDSERMKAEEIRWPNLIGTVESMKQDAT----VATGLD-------MLYTF 79 (507) Q Consensus 13 ~~~~~-~~~~~~~~~~~~~~g~~~~-~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~----v~s~l~-------~Rk~a 79 (507) |.... +....+.+ ...+...-. +..++.... .....|..++.+.|+--..-.+ +--.+. ==+.. T Consensus 1 ~~~~~~~~~~~~~~--~~~l~~~e~~~i~~L~~~~-~~~~~r~~~l~~YY~G~~~i~~~~~~~p~~~~~~~~v~n~~~~i 77 (504) T protein:vir:99 1 MTEETTSASKFTFR--IPELNDDVVDKVNGLYQQL-VDRTPRNLLRASFYDGKYAIRQIGNLIPPEYLRTATVLGWSAKA 77 (504) T ss_pred CCccCCcccccccc--cCCCCHHHHHHHHHHHHHH-HHHhHHHHHHHHHHhccccchhccccccHHHHHHhhccCcHHHH Confidence 22211 11111111 111111100 000111000 0000111222233321100001 111111 11112 Q ss_pred HhhCC----c-eeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHH-HHHHhhcceeeeeeeeccCccccCcceeee Q lcl|NC_017974. 80 VEKAF----K-DFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDA-ATFNEYGLSVVEKVYTQVTVGEYIGKYKVK 153 (507) Q Consensus 80 v~~~~----~-~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~-l~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~ 153 (507) |..+. . -|..+ +++..+.. +.++... ..|.....++ .+|+-||.|. ++||....+ .....+. T Consensus 78 Vd~~a~rl~~~Gf~~~-d~~~~~~~----l~~i~~~---N~ld~~~~~~~~~a~iyG~af-~~v~~~~d~---~~~~~I~ 145 (504) T protein:vir:99 78 VDTLARRCNLESFVWP-DGDYGSIG----GPDVWDE---NFFATKANNAMVSSLIHGPAF-LINTEGGAG---EPDSLIH 145 (504) T ss_pred HHHHHhhhccceeeCC-CCChhhHH----HHHHHHh---cChhhHHHHHHHHHHhhCcee-EEEecCCCC---CceeEEE Confidence 22110 0 12222 22222222 3333332 2355444443 4688899976 688864321 1122232 Q ss_pred eccccCchhcccccc-----------eeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCC Q lcl|NC_017974. 154 NLAFRPQASLSRTNP-----------IVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGG 222 (507) Q Consensus 154 ~l~~r~~~~i~~~~~-----------~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~ 222 (507) -+.|+. ++.-|.+ +..+.+|.......-.+.....+.....+............+| ++-|++.. T Consensus 146 ~~sP~~--~~~iyD~~~~~~~~a~~~~~~d~~g~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~gvP---vV~~~n~~ 220 (504) T protein:vir:99 146 VKSAMQ--ATGEWNSRRNAMDSLLSITSRDAEGHPTGIALYEDGVTVTADMDDDGDWHADVRTHKLGVP---VEVLPYKP 220 (504) T ss_pred Eeccce--eEEEEeCCCCceeEEEEEEEecCCCeEEEEEEEcCCcEEEEEEcCCceeeeccccCCCCcc---eEEecccc Confidence 222221 1110110 1112222211100011111111100000001111111122244 57788888 Q ss_pred ccCccccchHHH-HHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCc Q lcl|NC_017974. 223 SSSQALGVSPLV-GCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQ 301 (507) Q Consensus 223 ~~~~p~G~gll~-~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~ 301 (507) +.+.|+|.+-+. .+....=-=+..+..-+.-.|-|.+|..++. +.......+.+......+++... T Consensus 221 ~~~~~~G~sei~~~v~~l~Da~~~~~~~~~~~~e~~a~p~r~i~---G~~~~~~~~~d~~~~~~~~~~~~---------- 287 (504) T protein:vir:99 221 REDRPLGSSRITRPVMSLQQRALKGCIRMDGHADVYSFPQLILL---GADAKNFRNKDGSMKPAWQIALA---------- 287 (504) T ss_pred cCccccCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhc---cCCccccccccccccchhhhhhh---------- Confidence 888999977542 2211110001122222334455555554442 11000000011111111222111 Q ss_pred eeEeecCcccccccc-cceeeeeccCCCcchhHHHHHHHHHHHHHHH------Hhcccccccccchhh-hhHHHHHHHHH Q lcl|NC_017974. 302 SFFMLPSDTKDNAPQ-YSMTLKGIDGMGKQYSTAQLISDRKKSILDR------LGAGFINVGNDKGGS-YNLSESKQTIH 373 (507) Q Consensus 302 a~~iiP~g~~i~~~~-~~~el~~~~g~g~~~~~~~li~~~d~~Isk~------iLGqtLts~~~~~GS-~Al~~vh~~v~ 373 (507) ....+|.+.+..+.+ ...++...+.+ +...|...++.+-.+||.. .||.. + +...+| -|+ .....-. T Consensus 288 ~i~~~~~~~~~~~~~~~~~~~~q~~~~-~l~~~~~~l~~~i~~~a~~t~~P~~~lG~~--~-~~n~sSa~Ai-~~~~~~L 362 (504) T protein:vir:99 288 RVFALPDDEDEPDAARARADVKQFPAS-SPQPHIEMLEQIAMMFSGETSIPVESLGFS--N-RANPTSADAY-IASREDL 362 (504) T ss_pred hhhcCCCccccccccCccceeeecCCC-ChHHHHHHHHHHHHHHHhhhCCCHHHhccc--c-cccccHHHHH-HHHHHHH Confidence 112334332221110 01222222222 2223444444444444431 22211 1 111122 232 2233333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHh--CCCcCCcc-ccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHH Q lcl|NC_017974. 374 TQFVQRVNEIILEALNENLLPQLLAL--NDMRLPET-EMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLE 450 (507) Q Consensus 374 ~~~~~aD~~~i~~~ln~~li~~l~~l--N~~~~~~~-~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e 450 (507) ...++.-.+.+...+.+ +++-.+.+ |....+.. .-.++.|......++.+.|+++.||+.+|.....+ .+.+.+ T Consensus 363 ~~ka~~k~~~f~~~l~~-~~rla~~~~~~~~~~~~~~~~~~v~w~d~~~~s~a~~aDa~~Kl~~ag~~l~~~--~~~l~~ 439 (504) T protein:vir:99 363 IAEAEGATDDWSPAFRR-SMIRALAIKNGLDRIPPEWKTIDSKFRSPLYLSKAAQADAGAKMLGAGPEWLKE--TEVGLE 439 (504) T ss_pred HHHHHHHHHHHHHHHHH-HHHHHHHHhcCCCccccccccceeEecCCCccCHHHHHHHHHHHHhhccccccc--hHHHHh Confidence 34445555666777743 55554444 22111221 22467788888999999999999999998532211 245778 Q ss_pred HcCCCCCCCcc---cccchh-----hhcCCCcCCCCCCcc-cccCccccccc---ccCCCCchhh Q lcl|NC_017974. 451 VLGIDEKIEED---ISQEEL-----MKLLGEDTSRAGDGM-TKGSSGNGTGK---ISSTRDNSAA 503 (507) Q Consensus 451 ~~Gip~~~~~~---~~~~~~-----~~~~~~~~~~~~~~~-~~~~~~~~~~~---~~~~~d~~a~ 503 (507) .+|+++.+-+. +...+. .......+.+.+++. ...++++.++. ..++||++.- T Consensus 440 ~lg~~~~ei~r~~~e~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~e~a~~~~~~~~~~p~~~~ 504 (504) T protein:vir:99 440 LLGLTPQQAKRALAERRRASSVSIIEALNRRQQEAATAGEDQDQGAGEPPANEPPAALGRPTLVG 504 (504) T ss_pred hcCCCHHHHHHHHHHHHHHhhHHHHHHHhcccCCCCCCCCCCCcCCCCCCCCCCCccCCCcccCC Confidence 88996442211 000000 111111111111110 01111111111 1122333332 No 128 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=97.36 E-value=8.5e-05 Score=42.89 Aligned_cols=443 Identities=9% Similarity=-0.009 Sum_probs=157.4 Q ss_pred CccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHHHhhCCceeeeCCC Q lcl|NC_017974. 13 VAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTFVEKAFKDFKVIPG 92 (507) Q Consensus 13 ~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~av~~~~~~~~~~~~ 92 (507) |......-... ..++..--.+...+..+.+-...++.. ...+.+++. +-.+..-+- +..|....=...+.+- T Consensus 1 ~~t~~~~i~~L----~~~~~~~~~r~~~l~~Yy~G~~~i~~~-~~~~~~~~~-~~~~~~n~~--~~ivd~~~~~l~~~g~ 72 (480) T protein:vir:78 1 MTTYHEHVERL----QGLLARDLPNLLEAEAYRNGTRRLKTI-GIGAPPELA-YLDVQPGWV--ATYLRTLSDRLDIEGF 72 (480) T ss_pred CCCHHHHHHHH----HHHHHHHHHHHHHHHHHHhcccccccc-ccccchhHh-hhhhhcchH--HHHHHHHHhhhccCce Confidence 11111100000 001110000000111111111111110 001111111 111111111 1111110000111111 Q ss_pred CChhHHHHHHHHHHHHHhccCCCHHHHHHHH-HHHHhhcceeeeeeeeccCc-cccCcceeeeeccccCchhcccccc-- Q lcl|NC_017974. 93 ESEESKKAAKFVEYCLKNMEGQTLRQFARDA-ATFNEYGLSVVEKVYTQVTV-GEYIGKYKVKNLAFRPQASLSRTNP-- 168 (507) Q Consensus 93 ~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~-l~a~~~Gfs~~Ei~w~~~~~-~~~~g~~~~~~l~~r~~~~i~~~~~-- 168 (507) ..+++.+..+.+.++++. ..+...+.++ .+|..||.| ++++|..... ...+|...+.-+.|. .++..|.+ T Consensus 73 ~~~~d~~~~~~l~~i~~~---N~~d~~~~~~~~~a~~~G~a-y~~v~~~~~~~~d~~g~~~i~~~~p~--~~~~~~D~~~ 146 (480) T protein:vir:78 73 RISEDSEGLEELWNWWQA---NDLDEESVLGHDDSLTFGRS-YITVSHPDVESGDPAGIPLIRVESPL--YMYAELDPRN 146 (480) T ss_pred ecCCCchhHHHHHHHHHh---cCHHHHHHHHHHHHhhcCce-EEEEecCccccCCCCCeeEEEEEccc--ceEEEEcCCC Confidence 111222334445555443 2466666665 578899986 5688863211 123455555443322 12211110 Q ss_pred ---------eeecCCCccc-ccccc-cccccccccc--ccccccccccCCCccccccccEEEEeeCCccCccccchHHHH Q lcl|NC_017974. 169 ---------IVYNEDGSAI-IGIKQ-SLSAFQNYTA--SEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVG 235 (507) Q Consensus 169 ---------~~~~~d~~~l-~~~~q-~~~~~~~~~~--~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~ 235 (507) ++...|..+. ..+.- .+.....+.. +...............++.-.++.|++..+.+.|+|.|-+.. T Consensus 147 ~~~~~~~i~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~n~~~~~~~~G~s~i~~ 226 (480) T protein:vir:78 147 TRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISP 226 (480) T ss_pred ccceEEEEEEEEeecCCCceEEEEEEeCCeEEEEEecCCCccccccccccccCCCCCcceEEeecccccCCccCcccchh Confidence 0011111100 00000 0000000000 000000000111112234445677888888899999987764 Q ss_pred HHHHHHH-HHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccc Q lcl|NC_017974. 236 CYRAWRE-KILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNA 314 (507) Q Consensus 236 ~~~~~~f-K~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~ 314 (507) -..+.+- =+..+..++..++-|..|.+++. +.. .....++.....+.... | .... ..|.+.++ T Consensus 227 ~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~---G~~--~~~~~~~~~~~~~~~~~--------~--~~~~-~~~~~~~~ 290 (480) T protein:vir:78 227 ELRKVTDAASRTLMNLQSASQILGTPLRVIS---GVT--TDELTNDGENTTLDIYY--------G--RILT-LASEAAKI 290 (480) T ss_pred hHHHHHHHHHHHHHHHHHHHHhhcchhhhhh---cCC--ccccccccccchhhhhh--------h--hhcc-CCCCCceE Confidence 2222221 14456677788888888876653 110 00001111111111111 1 1111 12323222 Q ss_pred cccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccc-c-ch---hh-hhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 315 PQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGN-D-KG---GS-YNLSESKQTIHTQFVQRVNEIILEAL 388 (507) Q Consensus 315 ~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~-~-~~---GS-~Al~~vh~~v~~~~~~aD~~~i~~~l 388 (507) ...+.. +...|...++.+-.+|+... + +.... + .+ +| -|+ .....-....++.-.+.+...| T Consensus 291 -------~~~~~~-~~~~~~~~l~~~i~~~~~~~--~-~p~~~~g~~~~n~~Sg~Al-k~~~~~l~~ka~~~~~~f~~~l 358 (480) T protein:vir:78 291 -------SEFKAA-ELRNFAEEMEVFRKEAASIT--G-LPPQYLSSSSENPASAEAI-IATDSRIVKMAERKGRIFGGAW 358 (480) T ss_pred -------EecCcc-CHHHHHHHHHHHHHHHhccc--C-CChHHhccccCcchHHHHH-HHHHHHHHHHHHHHHHHHHHHH Confidence 222222 22234444444444444321 1 11100 1 01 11 122 1122223333344445555566 Q ss_pred HHHHHHHHHHhCCCcCCcc-ccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccc---c Q lcl|NC_017974. 389 NENLLPQLLALNDMRLPET-EMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDIS---Q 464 (507) Q Consensus 389 n~~li~~l~~lN~~~~~~~-~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~---~ 464 (507) .+ +++-++.+.+...... .--.+.|......++.+.++.+.+|+.+|..+.. .+.+++.+|+.+.+.++.. . T Consensus 359 ~~-~~~l~~~~~g~~~~~~~~~i~v~f~~~~~~s~~~~ad~~~kl~~~g~~~~s---~et~~~~lg~~~d~~~~~~~~~~ 434 (480) T protein:vir:78 359 ER-AMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIP---KEQARIDLGYTATQREQMRDWDK 434 (480) T ss_pred HH-HHHHHHHHcCCCccccceeeeEEecCCCCCCHHHHHHHHHHHHHhccccCC---HHHHHhcCCCCHhHHHHHHHHHH Confidence 43 5666777764322222 1124578778888989999999999998854433 3567888888643222111 1 Q ss_pred chhh---hcCC----CcCCCCCCcccccCcccccccccCCCCchhhhccC Q lcl|NC_017974. 465 EELM---KLLG----EDTSRAGDGMTKGSSGNGTGKISSTRDNSAANLDN 507 (507) Q Consensus 465 ~~~~---~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~d~~a~~~~~ 507 (507) +... .... ......+++. +++.+ .++..-++..+-++ T Consensus 435 e~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~--~~~~~~~~~~~~~~ 478 (480) T protein:vir:78 435 QETEDMIDTLYSTTKAQADATPKPT----VTETK--TETQTSPSGFNRTK 478 (480) T ss_pred HHHHHHHHHhhccccccCCCCCCCC----CCCCC--CccccccCCCCccc Confidence 1100 0011 0000000000 11100 01111122222223 No 129 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=97.18 E-value=0.00014 Score=41.69 Aligned_cols=445 Identities=11% Similarity=-0.031 Sum_probs=161.5 Q ss_pred Cccccccccccccccc-cccCchhhhHHHHhhhhcccchhccchhHHHHHHHh----hChHHHHHHHHHHHHHhhCCcee Q lcl|NC_017974. 13 VAKATSEQINVSRMRM-SEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMK----QDATVATGLDMLYTFVEKAFKDF 87 (507) Q Consensus 13 ~~~~~~~~~~~~~~~~-~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~----~D~~v~s~l~~Rk~av~~~~~~~ 87 (507) ++..........-.++ .++...-........+.+-...+... ...+.+++. ......-++.+....+.-.++.+ T Consensus 1 ~~~~~~~d~~~~i~~L~~~~~~~~~r~~~~~~Yy~g~~~i~~~-~~~~~~~~~~~~~~~n~~~~ivd~~a~~l~~~Gf~~ 79 (488) T protein:vir:23 1 MAETESIDPEKLRDQLLDAFENKQNELKSSKAYYDAERRPDAI-GLAVPLDMRKYLAHVGYPRTYVDAIAERQELEGFRI 79 (488) T ss_pred CCcccCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhc-CcccchhhhhhhhhcchHHHHHHHHHHhhhccceec Confidence 3332221111000000 00000000000000111111111100 000111111 11111111121111110001211 Q ss_pred eeC---CCCChhHHHHHHHHHHHHHhccCCCHHHHHHHH-HHHHhhcceeeeeeeeccCcc---ccCcceeeeeccccCc Q lcl|NC_017974. 88 KVI---PGESEESKKAAKFVEYCLKNMEGQTLRQFARDA-ATFNEYGLSVVEKVYTQVTVG---EYIGKYKVKNLAFRPQ 160 (507) Q Consensus 88 ~~~---~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~-l~a~~~Gfs~~Ei~w~~~~~~---~~~g~~~~~~l~~r~~ 160 (507) -.+ +.....+.+..+.+.+++.. ..|.....++ .++.-||.| ++++|...... ..++...++.+.|+ T Consensus 80 ~~~~~~~~~~~~d~~~~~~l~~i~~~---N~~~~~~~~~~~~a~i~G~a-~~~v~~~~~~~~~~~~~~~~~i~~~~p~-- 153 (488) T protein:vir:23 80 PSANGEEPESGGENDPASELWDWWQA---NNLDIEATLGHTDALIYGTA-YITISMPDPEVDFDVDPEVPLIRVEPPT-- 153 (488) T ss_pred cCCcccccccccchhHHHHHHHHHHh---cChhHHHHHHHHHHhhcCce-EEEEecCCcccccCCCCCcceEEEeccc-- Confidence 111 11122344556666666543 2466666654 468889997 56888643221 11233333322211 Q ss_pred hhccccc----------ceeecCCCcccccccc-ccccccccccccccccccccCCCccccccccEEEEeeCCccCcccc Q lcl|NC_017974. 161 ASLSRTN----------PIVYNEDGSAIIGIKQ-SLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALG 229 (507) Q Consensus 161 ~~i~~~~----------~~~~~~d~~~l~~~~q-~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G 229 (507) .++..|. .+++..++.....+.- .+.....+.......... ......++.-.++.|+++.+.+.|+| T Consensus 154 ~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~--~~~~h~~g~vPvv~f~n~~~~~~~~G 231 (488) T protein:vir:23 154 ALYAEVDPRTRKVLYAIRAIYGADGNEIVSATLYLPDTTMTWLRAEGEWEAP--TSTPHGLEMVPVIPISNRTRLSDLYG 231 (488) T ss_pred eeEEEEecCCCceEEEEEEEEecCCCcEEEEEEEecCcEEEEEecCCceEec--cccccCCCCcceEEeccccccCCcCC Confidence 1111111 1122233322221111 111111111111111011 11122233334577888888889999 Q ss_pred chHHHHHHHHHHH-HHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecC Q lcl|NC_017974. 230 VSPLVGCYRAWRE-KILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPS 308 (507) Q Consensus 230 ~gll~~~~~~~~f-K~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~ 308 (507) .|-+.......+- =+..+..++...+-|..|.++++ +.........+......+++ +..+..+++. T Consensus 232 ~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~---G~~~~~~~~~~~~~~~~~~~----------~~~~v~~~~~ 298 (488) T protein:vir:23 232 TSEISPELRSVTDAAAQILMNMQGTANLMAIPQRLIF---GAKPEELGINAETGQRMFDA----------YMARILAFEG 298 (488) T ss_pred ccchhhhHHHHHHHHHHHHHHHHHHHHHhhhHHHHHh---CCCcccccccccccchhhhh----------hhhhhccCCC Confidence 8877543322221 13345566667777777766654 11000000001000111111 1223455666 Q ss_pred cccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccc--c----c-hhhhhHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 309 DTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGN--D----K-GGSYNLSESKQTIHTQFVQRVN 381 (507) Q Consensus 309 g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~--~----~-~GS~Al~~vh~~v~~~~~~aD~ 381 (507) |.+.++. ..+.++ ...++++++.-|-+.. +.+-+... + . .+.-|+ .....-....++.-. T Consensus 299 g~~~~~~-------q~~~~~----~~~~~~~l~~~i~~~~-~~~~~p~~~~g~~~~n~~Sg~Al-~~~~~~l~~k~~~~~ 365 (488) T protein:vir:23 299 GEGAHAE-------QFSAAE----LRNFVDALDALDRKAA-SYSGLPPQYLSSSSDNPASAEAI-KAAESRLVKKVERKN 365 (488) T ss_pred CCCceeE-------ecCCCC----hHHHHHHHHHHHHHHh-cccCCCHHHhccccCcchHHHHH-HHHHHHHHHHHHHHH Confidence 6544332 222221 2334555554443322 11111100 0 0 111222 222223333334444 Q ss_pred HHHHHHHHHHHHHHHHHhCCCc-CCcc-ccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCC Q lcl|NC_017974. 382 EIILEALNENLLPQLLALNDMR-LPET-EMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIE 459 (507) Q Consensus 382 ~~i~~~ln~~li~~l~~lN~~~-~~~~-~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~ 459 (507) +.+...|. ++++-++.+.... .+.. .--.++|....+.++.+.++++.+|++.|..+.+ .+.+++.+|+-+... T Consensus 366 ~~f~~~l~-~~~~l~~~~~~~~~~~~~~~~i~v~f~~~~~~s~~~~ada~~kl~~~g~~~~s---~et~~~~l~~~~d~~ 441 (488) T protein:vir:23 366 KIFGGAWE-QAMRLAYKMVKGGDIPTEYYRMETVWRDPSTPTYAAKADAAAKLFANGAGLIP---RERGWVDMGYTIVER 441 (488) T ss_pred HHHHHHHH-HHHHHHHHHhcCCCcchhhccceEEecCCCCCCHHHHHHHHHHHHhcccccCC---HHHHHHhCCCCchHH Confidence 55566664 4566666653211 1111 1235688888889999999999999998853332 356888888743222 Q ss_pred cccc---cch-------hhhcCCCcCCCCCCcccccCcccccccccCCCCchhh Q lcl|NC_017974. 460 EDIS---QEE-------LMKLLGEDTSRAGDGMTKGSSGNGTGKISSTRDNSAA 503 (507) Q Consensus 460 ~~~~---~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~a~ 503 (507) ++.. .+. ........+.+...+ ..+++. +..+-+.+| T Consensus 442 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~-----~~~~e~~~a 488 (488) T protein:vir:23 442 EQMRQWLEQDQKQGLGLIGSLYGASTPEGKPG--EAPVGE-----PPAPEPDAA 488 (488) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCCcccCC--CCCCCC-----CCCCCCCCC Confidence 1111 000 001111111111111 111111 112222222 No 130 >protein:vir:105782 Length: 449 # NCBI annotation: gp5 # Family: family:all:6783 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224143;genbank:gi:62362218;genbank:GeneID:3342535 Probab=97.14 E-value=0.00015 Score=41.47 Aligned_cols=419 Identities=11% Similarity=0.024 Sum_probs=144.6 Q ss_pred CcccccccccccCccccccccccccccccc-cCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSE-QGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTF 79 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~-~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~a 79 (507) |++|--- ++.-+ ..+......|..+.. ....|..+...+...-....+...+++..|+ .+.-...++++--.. T Consensus 1 ~~~~~~~--~~~~~-~~~~~~~~~rd~l~~~~~glg~~r~~~~~~~g~~~~~~~~~l~~~Yr---~~~ia~~iVd~~~d~ 74 (449) T protein:vir:10 1 MTDKLTL--AVNHA-LNDARMARARMGLMVPTMGLDNKRHSAWCEYGFPELVTYENLYSLYR---RGGIAHGAVEKLVGK 74 (449) T ss_pred CchhhHH--HHhhh-cchhHHHHHHHHHHHHHhcCCcccchhhhhcCCcccCCHHHHHHHHh---cCchhHHHHHhhhhh Confidence 8877211 00000 000000001100100 0011222222332222222233334444443 355555555544333 Q ss_pred HhhCCceeeeCCCCChhHHHH---HHHHHHHHHhccCCCHHHHHHHHHHHHhhcceeeeeeeeccCccccCc---ceeee Q lcl|NC_017974. 80 VEKAFKDFKVIPGESEESKKA---AKFVEYCLKNMEGQTLRQFARDAATFNEYGLSVVEKVYTQVTVGEYIG---KYKVK 153 (507) Q Consensus 80 v~~~~~~~~~~~~~~~~~~~~---ae~v~~~l~~~~~~~~~~~l~~~l~a~~~Gfs~~Ei~w~~~~~~~~~g---~~~~~ 153 (507) +.+ .|...+++.+.++.+.. ..-+++.+.. .-|..+....-.+.+||++++=+.-+ ++.....- .-.++ T Consensus 75 ~~~-~~~~i~~g~~~~~~~~~~~~e~~~~~l~~~---~~~~~l~ea~~~~rl~Gga~i~i~v~-d~~~l~~Pl~~~~~i~ 149 (449) T protein:vir:10 75 CWQ-TNPEIIEGDDADDSEDETSWEKKSKQVFTN---RLWRSFAEADRRRLVGRYAGILLHIR-DEKDWNLPATKGRGLQ 149 (449) T ss_pred hhh-cCcccccCccccchhhhHHHHHHHHHHHHH---HHHHHHHHHHHhhhccCcEEEEEEec-CCCCCCcccccCccee Confidence 322 23323333222211111 1112222211 01444433334577899998633222 22111110 01222 Q ss_pred eccccCchhcccccceeecCCCcccccccccccccccccccccccccccc-----CCCccccccccEEEEeeCCccCccc Q lcl|NC_017974. 154 NLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVST-----RMSDVIIPINRVMLMNTGGSSSQAL 228 (507) Q Consensus 154 ~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~-----~~~~~~iP~~k~i~~~~~~~~~~p~ 228 (507) +|.+.....+.. -. +..+|.. -.++.+..+.... ...+..|=+.+++.+...+ .. T Consensus 150 ~i~v~~~~~i~~---~~----------~~~dp~s---p~yg~P~~y~v~~~~~g~~~~~~~iH~SRl~~~~~~~----~~ 209 (449) T protein:vir:10 150 KVSVSWAGSLKV---AE----------WDTGINS---KTYGQPKLWKYTERLPNGSSRRVDIHPDRVFILGDYS----ED 209 (449) T ss_pred eEEeeccccCCh---hh----------hhcCCCC---CCCCCceEEEEeeeccCCCccceeeccceeEeecCCC----CC Confidence 222221111100 00 1111111 1112222221111 1122234444545443222 12 Q ss_pred cchHHHHHHHHHH-HHHH---HHHHHHHHHHH-----hcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcC Q lcl|NC_017974. 229 GVSPLVGCYRAWR-EKIL---IENLEVVGATK-----DMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSG 299 (507) Q Consensus 229 G~gll~~~~~~~~-fK~~---~~~~w~~f~Er-----~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G 299 (507) |.++|+++|-..+ +-+. .-.-|..-+.| +-.-+-+..+..-+ ....++. .+++.+.+..+..| T Consensus 210 g~~~L~~~yn~l~~~~~~~~~~a~~~l~~~~rq~~~~~~~~~~~~~l~~~~-----~~~~e~~---~~~~~~~~~~~~~~ 281 (449) T protein:vir:10 210 AIGFLEPAYNAFVSLEKVEGGSGESFLKNAARQLNVNFEKEIDFTNLASLY-----GVSIDEL---QDKFNEVAGEINRG 281 (449) T ss_pred ChhHHHHHHHHhhhHHHhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhHHh-----hCCchHH---HHHHHHHHHHHhcc Confidence 7789999885321 1010 00111111000 00000011111000 0111111 12222222223345 Q ss_pred CceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhccccc--ccccchhhhhHHHHHHHHHHHHH Q lcl|NC_017974. 300 EQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFIN--VGNDKGGSYNLSESKQTIHTQFV 377 (507) Q Consensus 300 ~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLt--s~~~~~GS~Al~~vh~~v~~~~~ 377 (507) .++.+ +-.+-+. +.+..+-+| ...+++..=.++|-++ |-.+| .|.+.+|-.|-++ .....+.+ T Consensus 282 ~~~~~-i~~~~d~-------~~~~~~~sg----l~d~l~~~~q~iaaa~-~IP~t~L~Gqsp~glnst~D--~~nyyd~i 346 (449) T protein:vir:10 282 NDVLM-TTQGATV-------TPLVTSVAD----PTATYNVNLQTAAAGV-DIPTRILIGNQQAERSSTED--QKYFNARC 346 (449) T ss_pred chhee-ecCCcce-------EEEecccCC----hhHHHHHHHHHHHHHh-CCCeeeeeccCccccccchh--HHHHHHHH Confidence 55433 4343332 222332222 2234555445566544 22222 1222233333232 45566777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhCCCcCCccccceEEecCcCchhHHHH-------HHHHHHHHhcCCccccHHHHHHHHH Q lcl|NC_017974. 378 QRVNEIILEALNENLLPQLLALNDMRLPETEMPYVKAGEIVDVDMEGF-------SKAIQRIGAVGYLPKTPKVINRVLE 450 (507) Q Consensus 378 ~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~-------a~~~~~L~~~G~~~~~~~~~~~i~e 450 (507) .+-...+...|.+ |+..|+.......++ . =.|+|...-..+-++. |++++++++.|....- ..+.+|+ T Consensus 347 ~~~Q~~l~p~le~-l~~~l~~s~~g~~~~-d-~~i~f~pL~~~t~kEkAei~k~~A~a~~~~~~ag~~~~~--~~~EiR~ 421 (449) T protein:vir:10 347 QSRRVDLSFEIED-FCDKLIELKIIDAVA-K-KAVIWDDLNEQTGTEKLTNAKTMGEINQTMLGSGDNPAF--SREEIRT 421 (449) T ss_pred HHHHHhhhHHHHH-HHHHHHHhhcCCCCC-c-eeEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHccccCCc--CHHHHHH Confidence 7665556666643 667666554211111 1 1456655544443333 6677788888743221 1367999 Q ss_pred HcCCCCCCCcccccchhhhcCCCcCCCCCCcccccCcccccccccCC Q lcl|NC_017974. 451 VLGIDEKIEEDISQEELMKLLGEDTSRAGDGMTKGSSGNGTGKISST 497 (507) Q Consensus 451 ~~Gip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 497 (507) .+|..++..+.. + +...+ .++.+..++| T Consensus 422 ~~~~~~~~~~~~-~--------~e~~d----------e~~~~~d~~a 449 (449) T protein:vir:10 422 AAGYDNDDEEPL-G--------EEDGD----------EEDKATDSAA 449 (449) T ss_pred HhcccCCCCCCC-C--------CCCCc----------cccccCCcCC Confidence 999865432211 0 00000 1111111122 No 131 >protein:vir:389 Length: 530 # NCBI annotation: gp4 # Family: family:all:47 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046899;genbank:gi:9630468;genbank:GeneID:1261643 Probab=97.13 E-value=0.00016 Score=41.40 Aligned_cols=462 Identities=10% Similarity=0.041 Sum_probs=175.4 Q ss_pred CcccccccccccCccccccccccccccccccC-chh--hhHHHHhhhhcccchhc--cchhHHHHHHHh-hChHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQG-TFA--LAKVQVDSERMKAEEIR--WPNLIGTVESMK-QDATVATGLD 74 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g-~~~--~~~~~~~~~~e~~~~lr--~~~~~~~y~~m~-~D~~v~s~l~ 74 (507) |+ .+..-.+...........+.+-+ +.+ ...|... ......+++ +..+..--+++. .++++.++++ T Consensus 1 ~~-------~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~w~~~-~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~av~ 72 (530) T protein:vir:38 1 MK-------IPSLVGPDGKTSLREYAGYHGGGGGFGGQLRGWNPP-SESADAALLPNYSRGNARADDLVRNNGYAANAVQ 72 (530) T ss_pred Cc-------cceeecCccccchHHHhhhhcccCCCCCcccccccC-CCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHH Confidence 21 11111111000000000010000 000 0000000 000000111 112222234454 5999999999 Q ss_pred HHHHHHhhCCceeeeCCC------CChhHHHHHHHHHHHHHh--------c---cCCCHHHHHHHHHHH-Hhhcceeeee Q lcl|NC_017974. 75 MLYTFVEKAFKDFKVIPG------ESEESKKAAKFVEYCLKN--------M---EGQTLRQFARDAATF-NEYGLSVVEK 136 (507) Q Consensus 75 ~Rk~av~~~~~~~~~~~~------~~~~~~~~ae~v~~~l~~--------~---~~~~~~~~l~~~l~a-~~~Gfs~~Ei 136 (507) .....|-+..+...+.+. +...+++..+-|+..++. . ...+|..+...++.+ +.-|=.++-+ T Consensus 73 ~~~~nvVG~Gi~~~~~p~~~~l~~~~~~~~~~~~~ie~~w~~W~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~~~ 152 (530) T protein:vir:38 73 LHQDHIVGSFFRLSYRPSWRYLGINEEDSRAFSRDVEAAWNEYAEDDFCGIDAERKRTFTMMIREGVAMHAFNGELCVQA 152 (530) T ss_pred HHHHHhhCCCceeeeccchhhcCCCHhHHHHHHHHHHHHHHHhhcCCCcEEeeeccCCHHHHHHHHHHHHhhCCceEEEe Confidence 999999888776665432 223345555555555532 0 123577777666654 4467777777 Q ss_pred eeeccCccccCcceeeeeccccCchhccc----------ccceeecCCCccccccccccccccccccccccccccccCCC Q lcl|NC_017974. 137 VYTQVTVGEYIGKYKVKNLAFRPQASLSR----------TNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMS 206 (507) Q Consensus 137 ~w~~~~~~~~~g~~~~~~l~~r~~~~i~~----------~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~ 206 (507) .|....+..++- +|...+++-|.. .+-+.|+.+|+-+-++-. ..+.+......+..... T Consensus 153 ~~~~~~g~~~~~-----~lq~ie~d~l~~~~~~~~~~~i~~GIe~d~~Gr~~aY~i~------~~~~~~~~~~~~~~~~~ 221 (530) T protein:vir:38 153 TWDSDSTRLFRT-----QFKMVSPKRVSNPNNIGDTRNCRAGVKINDSGAALGYYVS------DDGYPGWMAQNWTYIPR 221 (530) T ss_pred eeccCCCCccce-----EEEEechhhcCCCCCCCCCCeeEeeeEECCCCceEEEEEe------eccCCCccccccceeee Confidence 787653322221 122222222221 122444555543332211 00001111111111111 Q ss_pred ccccccccEEEEeeCC-ccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCce-EEecccc---cCCCCCCCchHH Q lcl|NC_017974. 207 DVIIPINRVMLMNTGG-SSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVI-ELKIPSQ---ILNKAAMDQSSP 281 (507) Q Consensus 207 ~~~iP~~k~i~~~~~~-~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~-v~~~P~~---~~k~~~~~~~~~ 281 (507) ...+|.. -|+|.+.. +.+..-|.+.|.++.....--.......++-. |-..-+. +++.+.. .+........++ T Consensus 222 ~~~v~a~-~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a-~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~ 299 (530) T protein:vir:38 222 ELPGGRP-SFIHVFEPMEDGQTRGANAFYSVMEQMKMLDTLQNTQLQSA-IVKAMYAATIESELDTQSAMDFILGADNKE 299 (530) T ss_pred eeccChh-HeEeeccccCCCcccCCchHHHHHHHHHHHhHHHHHHHHHH-HHhhhheeeeeccCCccccccccccCCccc Confidence 2223333 36666654 47888899999888765443322222222111 1111111 1221110 000000000000 Q ss_pred HHHHHHHHHHH------HHHhhcCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhccc--- Q lcl|NC_017974. 282 EAEMVRGLMSD------AANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGF--- 352 (507) Q Consensus 282 ~~~~l~a~~~~------~~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqt--- 352 (507) ....+...... ...+..+......++.|.+|++. ..+..+. .|..+.+..-+.|+..+ |-+ T Consensus 300 ~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~-------~p~~p~~--~~~~f~~~~lr~iaagl-Gi~ye~ 369 (530) T protein:vir:38 300 QQSKLTGWLGEMAAYYSAAPVRLGGARVPHLLPGDSLNLQ-------SAQDTDN--GYSTFEQSLLRYIAAGL-GVSYEQ 369 (530) T ss_pred ccccccccchhhhhcccccceeccCceeeecCCCCeeeee-------CCCCCCC--CHHHHHHHHHHHHHhhc-CCCHHH Confidence 00000000000 00011122245566778766553 3333333 36678888888988877 433 Q ss_pred ccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---hCCC-cCCcc----------ccc--eEEecC Q lcl|NC_017974. 353 INVGNDKGGSYNLSESKQTIHTQFVQRVNEIILEALNENLLPQLLA---LNDM-RLPET----------EMP--YVKAGE 416 (507) Q Consensus 353 Lts~~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~---lN~~-~~~~~----------~~P--~~~~~~ 416 (507) ||. +-.+.|||.+-.-..-+...++.....+...+-+-+...+++ +++. ..|.. .+. .+.... T Consensus 370 lt~-D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~av~~G~i~~p~~~~~~~~~~~~a~~~~~w~~p~ 448 (530) T protein:vir:38 370 LSR-NYSQMSYSTARASANESWAYFMGRRKFVASRQACQMFLCWLEEAIVRRVVTLPSKARFSFQEARTAWGNANWIGSG 448 (530) T ss_pred Hhc-ccccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHcCCccCCCCCCCCchhhHHhhhceeeecCC Confidence 333 323446665544444444445555555544443334333332 2321 11110 011 233344 Q ss_pred cCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCc-ccccch---hhhcCCCcCCCCCCcccccCcccccc Q lcl|NC_017974. 417 IVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEE-DISQEE---LMKLLGEDTSRAGDGMTKGSSGNGTG 492 (507) Q Consensus 417 ~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~-~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~ 492 (507) ....|..+-+++....+++|+.... +.+++ .|.....-- +...+. ....++.+..+.....+ ....+. T Consensus 449 ~~~iDP~Ke~~a~~~~i~~G~~s~~----~~~a~-~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~~---~~~~~~ 520 (530) T protein:vir:38 449 RMAIDGLKEVQEAVMLIEAGLSTYE----KECAK-RGDDYQEIFAQQVRESMERRAAGLNPPAWAAAAFEA---GVKKSN 520 (530) T ss_pred ccccChHHHHHHHHHHHHcCCCCHH----HHHHH-cCCCHHHHHHHHHHHHHHHHHcCCCCCCCcccccCC---CCCCCC Confidence 5567777778888888999986552 22333 243211000 000000 00001000000000000 000000 Q ss_pred cccCCCCchhhhc Q lcl|NC_017974. 493 KISSTRDNSAANL 505 (507) Q Consensus 493 ~~~~~~d~~a~~~ 505 (507) ..+...+.+. T Consensus 521 ---~~~~d~~~~a 530 (530) T protein:vir:38 521 ---EEEQDGARAA 530 (530) T ss_pred ---CCCCCCCCCC Confidence 0000000000 No 132 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=97.12 E-value=0.00016 Score=41.35 Aligned_cols=434 Identities=10% Similarity=-0.005 Sum_probs=153.3 Q ss_pred CccccccccccccccccccCchh--hh-HHHHhhhhcccchhccchhHHHHHH---Hh-----hChHH---HHHHHHHHH Q lcl|NC_017974. 13 VAKATSEQINVSRMRMSEQGTFA--LA-KVQVDSERMKAEEIRWPNLIGTVES---MK-----QDATV---ATGLDMLYT 78 (507) Q Consensus 13 ~~~~~~~~~~~~~~~~~~~g~~~--~~-~~~~~~~~e~~~~lr~~~~~~~y~~---m~-----~D~~v---~s~l~~Rk~ 78 (507) ++-+. .-+.+.-... +. ....+.. ..-|..++.+.|+- +. .+..+ .++.+==+. T Consensus 1 ~~~~~--------~~~~e~~~~~~~~~~l~~~~~~----~~~r~~~l~~YY~G~~~i~~~~~~~~~~~~~~~~v~n~~~~ 68 (486) T protein:vir:42 1 MTAPL--------PGMEEIEDPAVVREEMISAFED----ASKDLASNTSYYDAERRPEAIGVTVPREMQQLLAHVGYPRL 68 (486) T ss_pred CCCCC--------CCCCCcccHHHHHHHHHHHHHH----HHHHHHHHHHHhcccCcchhcccccchhHhhhhhccchHHH Confidence 21111 1111111100 00 0111110 00011111111210 00 00000 001000111 Q ss_pred HHhhC-----CceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHH-HHHHhhcceeeeeeeeccCccc---cCcc Q lcl|NC_017974. 79 FVEKA-----FKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDA-ATFNEYGLSVVEKVYTQVTVGE---YIGK 149 (507) Q Consensus 79 av~~~-----~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~-l~a~~~Gfs~~Ei~w~~~~~~~---~~g~ 149 (507) .|..+ .--|.++ +++..+.. +.+++.. ..|.....++ .+|.-||.| ++++|....+.. .++. T Consensus 69 iVd~~~~~l~~~g~~~~-~~~~~~~~----~~~i~~~---N~~d~~~~~~~~~a~~~G~a-y~~v~~~e~~~~~~~~~~~ 139 (486) T protein:vir:42 69 YVDSVAERQAVEGFRLG-DADEADEE----LWQWWQA---NNLDIEAPLGYTDAYVHGRS-FITISKPDPQLDLGWDQNV 139 (486) T ss_pred HHHHHHhhhcccceecC-CCchhHHH----HHHHHHh---cChhHHHHHHHHHHhhcCce-EEEEecCCcccccccCCCe Confidence 11110 0012222 22222233 3444432 2355555554 468889997 668987543321 2333 Q ss_pred eeeeeccccCchhcccccc----------eeecCCCccccccccc-cccccccccccccccccccCCCccccccccEEEE Q lcl|NC_017974. 150 YKVKNLAFRPQASLSRTNP----------IVYNEDGSAIIGIKQS-LSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLM 218 (507) Q Consensus 150 ~~~~~l~~r~~~~i~~~~~----------~~~~~d~~~l~~~~q~-~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~ 218 (507) ..+.-+.|+ .++.-|.+ ++++.++..+....-. +.....+........ ........++.--++.| T Consensus 140 ~~i~~~~p~--~~~~i~d~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~--~~~~~~h~~g~vPvv~~ 215 (486) T protein:vir:42 140 PIIRVEPPT--RMHAEIDPRINRVSKAIRVAYDKEGNEIQAATLYTPMETIGWFRADGEWA--EWFNVPHGLGVVPVVPL 215 (486) T ss_pred eEEEEeccc--ceEEEEeCCCCCeEEEEEEEEecCCCeEEEEEEEcCCcEEEEEecCCcEE--eecceecCCCCceEEEe Confidence 333322222 11111111 1222333222211111 111111111000000 00111122233345778 Q ss_pred eeCCccCccccchHHHHHHHHHH-HHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhh Q lcl|NC_017974. 219 NTGGSSSQALGVSPLVGCYRAWR-EKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAH 297 (507) Q Consensus 219 ~~~~~~~~p~G~gll~~~~~~~~-fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~ 297 (507) +++.+.+.|+|.|-+..-....+ -=+..+...+...+-|..|..++.- .........++.....+++ T Consensus 216 ~n~~~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~e~~a~p~~~i~G---~~~~~~~~~~~~~~~~~~~--------- 283 (486) T protein:vir:42 216 PNRTRLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFG---IKPEEIGVDSETGQTLFDA--------- 283 (486) T ss_pred ccccccCCCCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchHHHhhc---CCccccccccccccchhhh--------- Confidence 88888888999887764222211 1122344555566767766665541 0000000000000000110 Q ss_pred cCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccc-c-c----hhhhhHHHHHHH Q lcl|NC_017974. 298 SGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGN-D-K----GGSYNLSESKQT 371 (507) Q Consensus 298 ~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~-~-~----~GS~Al~~vh~~ 371 (507) ..++-.++|. .+.++ ...+.+ +...++++++.-|.+.-....++... + . .+.-|+. .... T Consensus 284 -~~~~~~~~~~-~~~~~-------~q~~~~----~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~-~~~~ 349 (486) T protein:vir:42 284 -YLARILAFED-AEGKI-------QQFSAA----ELANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIR-AAES 349 (486) T ss_pred -hhchhcccCC-CCceE-------Eeeccc----CHHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHH-HHHH Confidence 0111222332 22222 222222 13446777777765543222222110 0 0 1112222 1222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHh-CCCcCCcc-ccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHH Q lcl|NC_017974. 372 IHTQFVQRVNEIILEALNENLLPQLLAL-NDMRLPET-EMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVL 449 (507) Q Consensus 372 v~~~~~~aD~~~i~~~ln~~li~~l~~l-N~~~~~~~-~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~ 449 (507) -....++.-.+.+...|.+ +++-++.+ |....+.+ .--++.|....+.++.+.|+++.+|++.|.-+.+ .+-++ T Consensus 350 ~l~~ka~~~~~~f~~~l~~-~~~l~~~~~~~~~~~~d~~~i~v~w~~~~~~s~~~~ad~~~kl~~~~~g~~s---~et~~ 425 (486) T protein:vir:42 350 RLIKKVERKNLMFGGAWEE-AMRIAYRIMKGGDVPPDMLRMETVWRDPSTPTYAAKADAATKLYGNGQGVIP---RERAR 425 (486) T ss_pred HHHHHHHHHHHHHHHHHHH-HHHHHHHHhcCCCccccceeeeEEecCCCCCCHHHHHHHHHHHHhcccCCCC---HHHHH Confidence 2233334444555666643 45554454 32111211 1235688888889999999999999998653332 35577 Q ss_pred HHcCCCCCCCccccc--c-h------hhhcCCCcCCCCCCcccccCcccccccccCCCCchhh Q lcl|NC_017974. 450 EVLGIDEKIEEDISQ--E-E------LMKLLGEDTSRAGDGMTKGSSGNGTGKISSTRDNSAA 503 (507) Q Consensus 450 e~~Gip~~~~~~~~~--~-~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~a~ 503 (507) +.+|+-+...++... + . ....+....... ++. ..++++.....++....++. T Consensus 426 ~~lg~~~d~~~e~~~~~~e~~~~~~~~~~~~~~~~~~~-~~~-~~~~~~~~~~~~~~~~~~~~ 486 (486) T protein:vir:42 426 IDMGYSVKEREEMRRWDEEEAAMGLGLLGTMVDADPTV-PGS-PSPTAPPKPQPAIESSGGDA 486 (486) T ss_pred hcCCCChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCCC-CCC-CCCCCCCCCCcccCCCCCCC Confidence 888885432222110 0 0 001111111111 100 00111111111112222222 No 133 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=97.12 E-value=0.00016 Score=41.33 Aligned_cols=431 Identities=8% Similarity=0.002 Sum_probs=148.1 Q ss_pred cccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHH------------------H---hhChH Q lcl|NC_017974. 10 SVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVES------------------M---KQDAT 68 (507) Q Consensus 10 ~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~------------------m---~~D~~ 68 (507) .+..|.- .+.... ..++ +-....+.+. .+.+ |..++.+.|+- + ..-.. T Consensus 1 ~~~~p~~---~l~~~~--~~~~--~~~~l~~~~~--~~~~--r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~n~ 69 (479) T protein:vir:99 1 MIDLPDE---DLSSEG--LAKY--LETKVFPKMN--TECE--RLDDFEAWTKNGQEVPDLATRHKNKEREVLQQLSRKPW 69 (479) T ss_pred CccCCcc---cCChhH--HHHH--HHHHHHHHHH--HHhH--HHHHHHHHHhcCCcccccccccCChhHHHHHHHhhcCc Confidence 1222210 000000 0000 0000000110 0000 11111111211 0 00011 Q ss_pred HHHHHHHHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHH-HHHHhhcceeeeeeeeccCccccC Q lcl|NC_017974. 69 VATGLDMLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDA-ATFNEYGLSVVEKVYTQVTVGEYI 147 (507) Q Consensus 69 v~s~l~~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~-l~a~~~Gfs~~Ei~w~~~~~~~~~ 147 (507) ..-+++..-..+ ..--|... +.... +.+.+++.. ..+.....++ .++.-||. .++++|........+ T Consensus 70 ~~~iVd~~~~~l--~~~gf~~~--d~~~~----~~~~~i~~~---N~~d~~~~~~~~~a~~~G~-af~~v~~~~~~~d~~ 137 (479) T protein:vir:99 70 MGLMVNSFAQQL--IVDGYRKT--GTNEN----AKGWDTWRL---NQMDKQQFWLNRAVLTFGY-AFIKVTSGISPLDGT 137 (479) T ss_pred HHHHHHHHHhhc--ccccccCC--Cchhh----HHHHHHHHh---cChhHHHHHHHHHHhhcCc-eEEEEecCCCCcCCC Confidence 111111111100 01112221 11122 223444432 2356666664 47888998 467898533222334 Q ss_pred cceeeeeccccCchhcccccceeecCCCc--ccccccccccc-ccccccccccc------cccccCCCccccccccEEEE Q lcl|NC_017974. 148 GKYKVKNLAFRPQASLSRTNPIVYNEDGS--AIIGIKQSLSA-FQNYTASEIGV------GGVSTRMSDVIIPINRVMLM 218 (507) Q Consensus 148 g~~~~~~l~~r~~~~i~~~~~~~~~~d~~--~l~~~~q~~~~-~~~~~~~~~~~------~~~~~~~~~~~iP~~k~i~~ 218 (507) |...+.-+.|+. ++.-| .+.... .+..++..... ...++...... ...........++..-++.| T Consensus 138 g~~~i~~~~p~~--~~~iy----dd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f 211 (479) T protein:vir:99 138 TVARIKCIDPRD--AFAIW----EDPYWDEWPKYLLERQPNGQYWWWTEEDYSIFEFKQGKFIYRETVSHDYGHIPFVRY 211 (479) T ss_pred CceEEEEechhh--eEEEe----cCCcccceeeEEEeecCceeEEEEecceEEEEEecCCceeeccccccCCCCcceEEe Confidence 555444433221 11101 011000 00011111000 00011100000 00000111112233346777 Q ss_pred eeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhc Q lcl|NC_017974. 219 NTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHS 298 (507) Q Consensus 219 ~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~ 298 (507) ++..+. .++|.|.+..+.-..=-=+..+...+...+.|..|..++. +.. ...+..... ....+ T Consensus 212 ~n~~~~-~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~---G~~--~~~~~~~~~-----~~~~~------ 274 (479) T protein:vir:99 212 VNVMDL-RGVCYGDVEPLVTVAKAIDKTGLDILLVQHHQSFQIRWAT---GLM--LPEGANADQ-----EKMRF------ 274 (479) T ss_pred ecCCCc-CcCCcchhHHHHHHHHHHHHHHHHHHHHHHHhhchhhhhc---CCC--cccccccch-----hcccc------ Confidence 887766 4578998876443222223455666677788888776653 111 111111110 00111 Q ss_pred CCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhccccccc-ccchhhhhHHHHHHHHHHHHH Q lcl|NC_017974. 299 GEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVG-NDKGGSYNLSESKQTIHTQFV 377 (507) Q Consensus 299 G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~-~~~~GS~Al~~vh~~v~~~~~ 377 (507) ....++...|.+.++. ..+. .+...|...++.+-.+|+...--..-..+ .+..+.-|+. ....-....+ T Consensus 275 -~~~~i~~~~~~~~~~~-------q~~~-~~~~~~~~~l~~~i~~i~~~t~~p~~~~g~~~n~Sg~Al~-~~~~~l~~ka 344 (479) T protein:vir:99 275 -AQESMLISQNEKASFG-------AIPA-APLDGLLNAYKESLLEFLALAQLPPHIAGQIVNVAADALA-AGTRQTMQKL 344 (479) T ss_pred -ccccceeecCCCceEE-------Eecc-cchHHHHHHHHHHHHHHhccCCCCHHHcccccchHHHHHH-HHHHHHHHHH Confidence 0112333444443332 2222 12222444444444455432111100011 0111112222 2222233333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhCCCcCCccc-cceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHc-CCC Q lcl|NC_017974. 378 QRVNEIILEALNENLLPQLLALNDMRLPETE-MPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVL-GID 455 (507) Q Consensus 378 ~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~-~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~-Gip 455 (507) +.-.+.+...|.+ +++.++.+.+...+... --.+.|......++.+.++++.+|+.+|.+ +. +.+.+.+ |+. T Consensus 345 ~~~~~~f~~al~~-~~~l~~~~~~~~~~~~~~~i~~~w~~~~~~s~~~~ad~~~kl~~ag~i-s~----et~l~~l~gv~ 418 (479) T protein:vir:99 345 FEKQATWKASHNQ-TMRLVNKIEGRTEEATDLDFTITWQDVTIQSLAQFADAWAKMVESLKI-PA----EGVWDMIPNLD 418 (479) T ss_pred HHHHHHHHHHHHH-HHHHHHHHcCCCccccceeeeEEecCCCCCCHHHHHHHHHHHHhcCCC-CH----HHHHHhcCCCC Confidence 4444555566643 55666666543222111 124567777778889999999999999863 32 3344444 886 Q ss_pred CCCCcccc----cc----hhhhcCCCcCCC-----CCCccc-ccCcccccccccCCCCchh Q lcl|NC_017974. 456 EKIEEDIS----QE----ELMKLLGEDTSR-----AGDGMT-KGSSGNGTGKISSTRDNSA 502 (507) Q Consensus 456 ~~~~~~~~----~~----~~~~~~~~~~~~-----~~~~~~-~~~~~~~~~~~~~~~d~~a 502 (507) .++-+... .. .....+.....+ ..++.+ ...+++.++...+-.-..+ T Consensus 419 ~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 479 (479) T protein:vir:99 419 QSTVNGWKEIYDREGDFGKYMRKLQNGPDPAEQRGGPNGATNMQQANNKTGEPASLNKSGA 479 (479) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCCCCCCCCCCCCCCCCcchhccCCCCC Confidence 44222110 00 001111110011 111111 1111222221111111111 No 134 >protein:vir:104338 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398967;genbank:gi:81343951;genbank:GeneID:3778870 Probab=97.12 E-value=0.00016 Score=41.32 Aligned_cols=396 Identities=12% Similarity=0.037 Sum_probs=160.6 Q ss_pred CccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHHHhhCCceeeeCCC Q lcl|NC_017974. 13 VAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTFVEKAFKDFKVIPG 92 (507) Q Consensus 13 ~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~av~~~~~~~~~~~~ 92 (507) |.+..+-.-. -++ |+.+.+. +. ...... ...++...+.++-+..++++--.-.++-.|+|. + T Consensus 1 ~~~~D~~~n~----~~g--g~~~~~~---~~---~~~~~~---~~~l~a~Y~~~~l~~~~Vd~~aed~~r~g~~i~---~ 62 (422) T protein:vir:10 1 MVKTDSYANI----FLG--GSDGSEI---YG---SLQNQA---PTILASLYADNALVRRIIDTIPETALAAGFHID---G 62 (422) T ss_pred CccchhhHHH----HcC--CCCCccc---cC---cccccC---HHHHHHHHHhChhhHHHHhhhhHHHhcCCcccc---C Confidence 3332221000 000 1111000 00 000011 122222234588888888887776777666653 2 Q ss_pred CChhHHHHHHHHHHHHHhccCCCHHHHHHHHHHHHhhcceeeeeeeeccCccc-----cCcceeeeeccccCchhccccc Q lcl|NC_017974. 93 ESEESKKAAKFVEYCLKNMEGQTLRQFARDAATFNEYGLSVVEKVYTQVTVGE-----YIGKYKVKNLAFRPQASLSRTN 167 (507) Q Consensus 93 ~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l~a~~~Gfs~~Ei~w~~~~~~~-----~~g~~~~~~l~~r~~~~i~~~~ 167 (507) ++. ++ .+++.++.+.. |..+...+-.+.+||++.+=+.-. ++... ..| .++.|.+.++..+. T Consensus 63 ~~~--~~---~~~~~~~~l~~--~~~l~~a~~~~rl~G~a~i~i~v~-d~~~~~~Pl~~~g--~~~~l~v~d~~~i~--- 129 (422) T protein:vir:10 63 IDD--EP---AFWSRWDDLEM--TQNINDAWSWARLFGGAAIVAIVK-DNRALTSPVREGA--ELETVRVYDRTQVK--- 129 (422) T ss_pred CCH--HH---HHHHHHHHhhH--HHHHHHHHHhhccccceEEEEEec-CCCCccccccccC--ceeeEEeecccccc--- Confidence 221 11 13444555431 444444455689999998533321 11111 111 12222222221111 Q ss_pred ceeecCCCccccccccccccccccccccccccccccCC--CccccccccEEEEeeC------CccCccccchHHH-HHHH Q lcl|NC_017974. 168 PIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRM--SDVIIPINRVMLMNTG------GSSSQALGVSPLV-GCYR 238 (507) Q Consensus 168 ~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~--~~~~iP~~k~i~~~~~------~~~~~p~G~gll~-~~~~ 238 (507) +..++. +|... .++.+..+...... ....|=+.+++.+... ....+++|.|+|. .||. T Consensus 130 ~~~~~~----------dp~s~---~fg~P~~y~v~~~~~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~~~~~~ 196 (422) T protein:vir:10 130 VQTREE----------NPRNA---RFGEPLTYRITTNESDMFYDVHYSRIHIIDGERIPNVMRRQNDGWGRSVLSSDILD 196 (422) T ss_pred chhccc----------Ccccc---ccCcceEEEEecCCCCcceeeccceeEEeCCCCchhhhcccCCcccchhHHHHHHH Confidence 011111 11110 11122221111111 2234555666666422 3456778999775 4776 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccccc Q lcl|NC_017974. 239 AWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYS 318 (507) Q Consensus 239 ~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~ 318 (507) ...--.....-=++.+.|+.. .+++++. +... ..+ .+...+.+..+... ...++...+.+++..+.+++ T Consensus 197 ~i~~~~~~~~~~~~l~~~~~~--~v~~~~~-l~~~-~~~-~~~~~~~~~r~~~~-~~~~~~~~~~~l~~~~e~~e----- 265 (422) T protein:vir:10 197 SIKDYTNCERLATQLLKRKQQ--AVWKAKG-LAEL-CDD-SEGFGAARLRLAQV-DNNSGVGQAIGIDAESEEYS----- 265 (422) T ss_pred HHHHHHHHHHHHHHHHHHhcc--ccccchh-HHHh-cCC-ccchHHHHHHHHHH-HHhcCCccceeEecCCcceE----- Confidence 554434444444555666543 3454431 0000 011 11122222222222 22222233334444444433 Q ss_pred eeeeeccCCCcchhHHHHHHHHHHHHHHHHhccccc-ccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 319 MTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFIN-VGNDKGGSYNLSESKQTIHTQFVQRVNEIILEALNENLLPQLL 397 (507) Q Consensus 319 ~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLt-s~~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~ 397 (507) .+..+-+| ...+++..-.+||-+.--..-- .|++.+|-.|.|+-.....-+.+++.......-+.+.|++-++ T Consensus 266 --~~~~~lsg----l~~~~~~~~~~iaaa~~IP~t~L~G~s~~Glnatgd~d~~~yyd~i~~~Qe~~l~p~l~~l~~~i~ 339 (422) T protein:vir:10 266 --VLNSDIGG----IDAFLDKKFDRIVALSGIHEIILKNKNVGGVSSSQNTALETFHKLVDRKRNAELLPILEFLIPFIV 339 (422) T ss_pred --EEecccCC----hHHHHHHHHHHHHhhhCCCeeeeccCCcccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 33333332 3557777778888654332210 1233345455566666777777777765433333333544443 Q ss_pred HhCCCcCCccccceEEecC------cCchhH-HHHHHHHHHHHhcCCccccHHHHHHHHHH---cCCCCCCCc-ccccch Q lcl|NC_017974. 398 ALNDMRLPETEMPYVKAGE------IVDVDM-EGFSKAIQRIGAVGYLPKTPKVINRVLEV---LGIDEKIEE-DISQEE 466 (507) Q Consensus 398 ~lN~~~~~~~~~P~~~~~~------~~~~dl-~~~a~~~~~L~~~G~~~~~~~~~~~i~e~---~Gip~~~~~-~~~~~~ 466 (507) ++ .+ -.|+|.. .+..|. +..|++++++++.|++.++ +..+.+++. .|+.....+ +..... T Consensus 340 -~s-----~~--~~~~f~pL~~~sekekaei~~~~a~a~~~~~~~g~i~~~-e~r~~L~~~~~~~~~~~~~~~~~~~~~~ 410 (422) T protein:vir:10 340 -NA-----EE--WSVEFNPLAQESSKDKAEILEKNVNSIAALIAAGAMDID-EARDTLRTIAPEVKINDGSVETEVTISE 410 (422) T ss_pred -cc-----CC--cEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHH-HHHHHhhhhcccccCCCCCCccccchhh Confidence 22 11 1244432 222222 4567889999999987763 333344332 222211111 111000 Q ss_pred hhhcCCCcCCCCCC Q lcl|NC_017974. 467 LMKLLGEDTSRAGD 480 (507) Q Consensus 467 ~~~~~~~~~~~~~~ 480 (507) ..+ +....|.++ T Consensus 411 ~~~--~~~~~~~~d 422 (422) T protein:vir:10 411 TSN--DPLEVPTDD 422 (422) T ss_pred cCC--CCCCCCCCC Confidence 000 000111111 No 135 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=97.08 E-value=0.00018 Score=41.11 Aligned_cols=398 Identities=10% Similarity=-0.050 Sum_probs=151.4 Q ss_pred hhccchhHHHHHHHh---hChHHHHHHHHHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHH-HH Q lcl|NC_017974. 50 EIRWPNLIGTVESMK---QDATVATGLDMLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDA-AT 125 (507) Q Consensus 50 ~lr~~~~~~~y~~m~---~D~~v~s~l~~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~-l~ 125 (507) .|- +..-.-|..+. .-.+..-++..-...+.-- -|..+. ...+.. +.+++++ ..|.....++ .+ T Consensus 1 ~l~-~~~~~~~~~~~~~~v~n~~~~ivd~~~~~l~~~--gf~~~d--~~~~~~----~~~i~~~---N~~d~~~~~~~~~ 68 (434) T protein:vir:98 1 MLP-KNAEQAFLDFQRKARTNFCGLIANASVHRLLAL--GVTGPD--GEPDTR----ASRWWQA---NRLDSRQKLVWRM 68 (434) T ss_pred CCC-CCccHHHHHhhhhhhccchHHHHHHHHhhhccC--ceecCC--CchHHH----HHHHHHh---cChhHHHHHHHHH Confidence 000 00001111111 1112222222211111111 133221 222223 3344432 2466666664 57 Q ss_pred HHhhcceeeeeeeeccCccccCcc--eeeeeccccC---------chhcccccceeecCCCccccccccccccccccccc Q lcl|NC_017974. 126 FNEYGLSVVEKVYTQVTVGEYIGK--YKVKNLAFRP---------QASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTAS 194 (507) Q Consensus 126 a~~~Gfs~~Ei~w~~~~~~~~~g~--~~~~~l~~r~---------~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~ 194 (507) |.-||.|. +++|........++. ..+.-+.|+. ...+...+.+..+.++....... .+.....+... T Consensus 69 a~i~G~ay-~~v~~~~~~~~~~~~~~~~I~~~~p~~~~~i~D~~~~~~~~ai~~~~~~~~~~~~~~~~-~~~~~~~~~~~ 146 (434) T protein:vir:98 69 AMAQSAGY-MLVGAHPTRTEDNGRPSPLITMEHPSECIVEYDPETGEPLVGLKVWHNDIDGFGYARVF-FDDTSFPYRTR 146 (434) T ss_pred HhhcCceE-EEEecCCCcccccCCceeEEEEeccceeEEEEeCCCCceEEEEEEEEeccCCceEEEEE-EeCcEEEEEEe Confidence 88999775 588864433222221 2222222210 00000011111111221100000 00000000000 Q ss_pred ccccccc------------ccCCCccccccccEEEEeeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCce Q lcl|NC_017974. 195 EIGVGGV------------STRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVI 262 (507) Q Consensus 195 ~~~~~~~------------~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~ 262 (507) ....... ........+..--++.|+++...+. .|.|-+..+....=.=+..+...+...+-|..|.+ T Consensus 147 ~~~~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~N~~~~~~-~g~sd~e~vi~liDa~~~~~s~~~~~~~~~a~p~~ 225 (434) T protein:vir:98 147 ERTGARLPWGPDSWVYTGTADSGDVHDLGGMQLVEFARMPDLGE-DPEPEFAGVLDIQDRVNLGILNRMAASRFSGFRQK 225 (434) T ss_pred eccccccccccccceecccccccccCCCCccceEEeccCCCcCc-CCcchhhhHHHHHHHHHHHHHHHHHHHHHhcchhh Confidence 0000000 0000011122223455666665544 48888877554433334456677778888888776 Q ss_pred EEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHH Q lcl|NC_017974. 263 ELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKK 342 (507) Q Consensus 263 v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~ 342 (507) +++- .......+...... ...++ ...+. .++...++.+.++. +.+.+ +...|...++.|=. T Consensus 226 ~i~G---~~~~~~~~~~~~~~----~~~~~---~~~~~-~~i~~~~~~~~~~~-------q~~~~-~~~~~~~~l~~~i~ 286 (434) T protein:vir:98 226 WIKG---HKFAKRTDPATGMT----VVDQP---FVPSP-SAVWASEGENTQFG-------QLDAT-DLSGFLKEHASDVR 286 (434) T ss_pred hhcC---CCcccccccccccc----hhhhh---hhccc-cccccCCCCCceEE-------EecCc-chHHHHHHHHHHHH Confidence 6641 00000001100000 00111 11111 22333334333332 22222 22234445555555 Q ss_pred HHHHHHhcc--cccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCccccceEEecCcCch Q lcl|NC_017974. 343 SILDRLGAG--FINVGNDKGGSYNLSESKQTIHTQFVQRVNEIILEALNENLLPQLLALNDMRLPETEMPYVKAGEIVDV 420 (507) Q Consensus 343 ~Isk~iLGq--tLts~~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~~P~~~~~~~~~~ 420 (507) +||...--. .+. +.....|-..-.....-....++.-.+.+...|. ++++.++.+++... +..-.++.|....+. T Consensus 287 ~~~~~~~~p~~~~~-~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~-~~~rl~~~~~g~~~-~~~~~~v~w~~~~~~ 363 (434) T protein:vir:98 287 DMLTISQTPTYLYA-TDLVNISADTIGALDILHVAKVREHIASFSEGLE-SVLALAAAQAGVPE-DYTEAEVRWANPAHV 363 (434) T ss_pred HHhcccCCCHHHhc-cccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHhcCCCh-hheeeeEEecCCCCC Confidence 554332111 010 0101122212233444444445555566667774 46777777775322 222356888889999 Q ss_pred hHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhh------hcCCCc-CCCCCCcccccCccccccc Q lcl|NC_017974. 421 DMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELM------KLLGED-TSRAGDGMTKGSSGNGTGK 493 (507) Q Consensus 421 dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~------~~~~~~-~~~~~~~~~~~~~~~~~~~ 493 (507) ++.+.|+++.+|+.+|+ + .+.+++.+|+++.+-+....+..+ ...+.. ..+++.....+++. T Consensus 364 s~~~~ada~~kl~~~g~--~----~e~~~~~lg~~~~e~~r~~~e~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~----- 432 (434) T protein:vir:98 364 TMAVKADAATKLKSIGY--P----LDVIAEELDESPARVRRIVAGAASQALLAASLLPAPGAPSAGNVPDSGGAV----- 432 (434) T ss_pred CHHHHHHHHHHHHhcCC--c----HHHHHHhCCCCHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCCcccCCC----- Confidence 99999999999999885 3 256888899864321111111000 001110 11111100011111 Q ss_pred ccCC Q lcl|NC_017974. 494 ISST 497 (507) Q Consensus 494 ~~~~ 497 (507) ++ T Consensus 433 --dg 434 (434) T protein:vir:98 433 --DG 434 (434) T ss_pred --CC Confidence 11 No 136 >protein:vir:3420 Length: 533 # NCBI annotation: capsid component # Family: family:all:47 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040583;genbank:gi:9626247;genbank:GeneID:2703526 Probab=96.93 E-value=0.00025 Score=40.32 Aligned_cols=459 Identities=12% Similarity=0.046 Sum_probs=179.0 Q ss_pred ccCcccccccc---cccccc---ccccCc-hh--hhHHHHhhhhcccchhc--cchhHHHHHHHh-hChHHHHHHHHHHH Q lcl|NC_017974. 11 VKVAKATSEQI---NVSRMR---MSEQGT-FA--LAKVQVDSERMKAEEIR--WPNLIGTVESMK-QDATVATGLDMLYT 78 (507) Q Consensus 11 ~~~~~~~~~~~---~~~~~~---~~~~g~-~~--~~~~~~~~~~e~~~~lr--~~~~~~~y~~m~-~D~~v~s~l~~Rk~ 78 (507) .++|.+..... ..++.. +..-++ .+ ...|+.. ......+++ +..+..--++|. .++++.++++.... T Consensus 1 ~~~p~~~~~~~~~~~~~~~~~~~y~~~a~~~~~~~~~w~p~-~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~~ 79 (533) T protein:vir:34 1 MKTPTIPTLLGPDGMTSLREYAGYHGGGSGFGGQLRSWNPP-SESVDAALLPNFTRGNARADDLVRNNGYAANAIQLHQD 79 (533) T ss_pred CCCchhhhhhcccccchHHHHHhhhhccCCCCCcccccccC-CCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHH Confidence 33333221110 001000 000000 00 0001000 000001111 111222234454 59999999999999 Q ss_pred HHhhCCceeeeCCC------CChhHHHHHHHHHHHHHh-----------ccCCCHHHHHHHHHHH-Hhhcceeeeeeeec Q lcl|NC_017974. 79 FVEKAFKDFKVIPG------ESEESKKAAKFVEYCLKN-----------MEGQTLRQFARDAATF-NEYGLSVVEKVYTQ 140 (507) Q Consensus 79 av~~~~~~~~~~~~------~~~~~~~~ae~v~~~l~~-----------~~~~~~~~~l~~~l~a-~~~Gfs~~Ei~w~~ 140 (507) .|-+..+...+.+. +...+++..+.|+..++. -...+|..+...++.+ +.-|=+++-+.|.. T Consensus 80 nvVG~Gi~~~~~p~~~~lg~~~~~~~~~~~~ie~~w~~w~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~~~~ 159 (533) T protein:vir:34 80 HIVGSFFRLSHRPSWRYLGIGEEEARAFSREVEAAWKEFAEDDCCCIDVERKRTFTMMIREGVAMHAFNGELFVQATWDT 159 (533) T ss_pred HhhCCCceeeeccchhhcCCChhHHHHHHHHHHHHHHHhhcCccceeccccccCHHHHHHHHHHHHHhCCceEEEeeecc Confidence 99888777665542 223445555556554432 1122577777776665 45787888788876 Q ss_pred cCccccCcceeeeeccccCchhcccc----------cceeecCCCccccccccccccccccccccccccccccCCCcccc Q lcl|NC_017974. 141 VTVGEYIGKYKVKNLAFRPQASLSRT----------NPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVII 210 (507) Q Consensus 141 ~~~~~~~g~~~~~~l~~r~~~~i~~~----------~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~i 210 (507) ..+..++-++ ...+++-|... +-+.|+.+|+-+-++-.. .+.+......+........+ T Consensus 160 ~~g~~~~~~l-----q~ie~d~l~~~~~~~~~~~i~~GIe~d~~Gr~~aY~i~~------~~~~~~~~~~~~~~~~~~~v 228 (533) T protein:vir:34 160 SSSRLFRTQF-----RMVSPKRISNPNNTGDSRNCRAGVQINDSGAALGYYVSE------DGYPGWMPQKWTWIPRELPG 228 (533) T ss_pred CCCCccceEE-----EEechhhcCCCCCCCCCCceEeeeEECCCCCeEEEEEee------cCCCCccccccceeeeeecc Confidence 5433322222 22222222211 123444444433322210 00011001111111112233 Q ss_pred ccccEEEEeeCC-ccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCce-EEecccc-------cCCCCCCCchHH Q lcl|NC_017974. 211 PINRVMLMNTGG-SSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVI-ELKIPSQ-------ILNKAAMDQSSP 281 (507) Q Consensus 211 P~~k~i~~~~~~-~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~-v~~~P~~-------~~k~~~~~~~~~ 281 (507) |.. -|+|.+.. +.+..-|.+.|.++.....-........++-. |-..-+. +++.+.. ++.....+..+. T Consensus 229 ~a~-~VlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a-~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~ 306 (533) T protein:vir:34 229 GRA-SFIHVFEPVEDGQTRGANVFYSVMEQMKMLDTLQNTQLQSA-IVKAMYAATIESELDTQSAMDFILGANSQEQRER 306 (533) T ss_pred Chh-HeeeeccccCCCcccCCchHHHHHHHHHHHHHHHHHHHHHH-HHhhhheeeeecCCCcccccccccCCCccccccc Confidence 333 46666654 48888899999888765433222222221111 1111111 1221110 000000000100 Q ss_pred HHHHHHHHHH--HHHHhhcCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhccc---cccc Q lcl|NC_017974. 282 EAEMVRGLMS--DAANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGF---INVG 356 (507) Q Consensus 282 ~~~~l~a~~~--~~~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqt---Lts~ 356 (507) .......... ....+..+......++.|.+|++. ..+..+. .|..+.+..-+.|+..+ |-+ ||. T Consensus 307 ~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~-------~~~~p~~--~~~~f~~~~lr~iAagl-Gi~ye~lt~- 375 (533) T protein:vir:34 307 LTGWIGEIAAYYAAAPVRLGGAKVPHLMPGDSLNLQ-------TAQDTDN--GYSVFEQSLLRYIAAGL-GVSYEQLSR- 375 (533) T ss_pred ccccchhhhhccCcceeeccCceeeecCCCCeeeec-------CCCCCCC--CHHHHHHHHHHHHHhhc-CCCHHHHhh- Confidence 0000000000 000011122245567788776553 3333333 36778888999998877 433 333 Q ss_pred ccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---hCCCc-CCcc----------cc--ceEEecCcCch Q lcl|NC_017974. 357 NDKGGSYNLSESKQTIHTQFVQRVNEIILEALNENLLPQLLA---LNDMR-LPET----------EM--PYVKAGEIVDV 420 (507) Q Consensus 357 ~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~---lN~~~-~~~~----------~~--P~~~~~~~~~~ 420 (507) +-...|||.+-.-..-+....+.....+...+-+-+...+++ +++.- .|.. .+ ..+........ T Consensus 376 D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~ail~G~i~~p~~~~~~~~~~~~~~~~~~w~~p~~~~i 455 (533) T protein:vir:34 376 NYAQMSYSTARASANESWAYFMGRRKFVASRQASQMFLCWLEEAIVRRVVTLPSKARFSFQEARSAWGNCDWIGSGRMAI 455 (533) T ss_pred hcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCCCccCCCchhhHHhhhceeeccCCcccc Confidence 323456665544443344444444444444443333333332 34311 1110 01 12333444556 Q ss_pred hHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCC-cccccc-hhhh--cCCCcCCCCCCcccccCcccccccccC Q lcl|NC_017974. 421 DMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIE-EDISQE-ELMK--LLGEDTSRAGDGMTKGSSGNGTGKISS 496 (507) Q Consensus 421 dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~-~~~~~~-~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~ 496 (507) |..+-+++....+++|+.... +.+++ .|.....- ++...+ .... .++.+..+. ....++........ T Consensus 456 DP~Ke~~a~~~~i~~G~~s~~----~~~a~-~G~D~~ev~~q~a~e~~~~~~~gl~~~~~~~----~~~~s~~~~~~~~~ 526 (533) T protein:vir:34 456 DGLKEVQEAVMLIEAGLSTYE----KECAK-RGDDYQEIFAQQVRETMERRAAGLKPPAWAA----AAFESGLRQSTEEE 526 (533) T ss_pred ChHHHHHHHHHHHHcCCCCHH----HHHHH-cCCCHHHHHHHHHHHHHHHHhcCCCCCCCCC----cCccCCCCCCCCCC Confidence 777778888888999986552 22332 24421100 000000 0000 000000000 00000000000001 Q ss_pred CCCchhh Q lcl|NC_017974. 497 TRDNSAA 503 (507) Q Consensus 497 ~~d~~a~ 503 (507) +-|..+| T Consensus 527 ~~~~~~~ 533 (533) T protein:vir:34 527 KSDSRAA 533 (533) T ss_pred cccCCCC Confidence 1111111 No 137 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=96.91 E-value=0.00026 Score=40.21 Aligned_cols=428 Identities=11% Similarity=0.096 Sum_probs=154.8 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHh-hhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVD-SERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTF 79 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~-~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~a 79 (507) +..+....-...+-.-... .+.-+. |.. ++ .+ ...+.....|....+.+ -..|. .+--.. T Consensus 26 ~~~~~~i~~~~~~~~~I~~----w~~~Y~--g~~---~~-~~~~~~~~~~~~~~~~sl~~------~~~i~---~~~A~L 86 (517) T protein:vir:98 26 INDHEKINIDPNELARIER----NLRQYE--GDY---PQ-VEYINSQGKIQERDYMTLNL------RKLSA---DVLSGL 86 (517) T ss_pred hhcCCceecCHHHHHHHHH----HHHHhc--CCC---cc-cccccccccccccceeecCc------HHHHH---HHhhhh Confidence 3333333222211110000 000000 000 00 00 00000000000000000 00111 111122 Q ss_pred HhhCCceeeeCCCC-----ChhHHHHHHHHHHHHHhccCCCHHHHHHH-HHHHHhhcceeeeeeeeccCccccCcceeee Q lcl|NC_017974. 80 VEKAFKDFKVIPGE-----SEESKKAAKFVEYCLKNMEGQTLRQFARD-AATFNEYGLSVVEKVYTQVTVGEYIGKYKVK 153 (507) Q Consensus 80 v~~~~~~~~~~~~~-----~~~~~~~ae~v~~~l~~~~~~~~~~~l~~-~l~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~ 153 (507) |.+-.-.+.+++.+ +......+++++++++.. .|...+.. +.+++..|=.++=+.|..+ .+.+ T Consensus 87 l~~e~~~i~v~d~~~~~~~~~~~~~~~e~l~~i~~~n---~f~~~~~~~~e~a~a~G~~a~k~~~d~~-------~~~I- 155 (517) T protein:vir:98 87 VFNEQCEVYVSDAKDEEKKDNSFKTAHEFIQHVFQHN---KFIKNLSDYLEPTFALGGLTVRPYVDNG-------EIEF- 155 (517) T ss_pred hcCCcceEEecccccccccccchhHHHHHHHHHHHhc---cHHHHHHHHHHHHhhhCCEEEEEEEeCC-------eeEE- Confidence 22222345555422 223355788999988753 35555444 4567778888876666532 2222 Q ss_pred eccccCchhcccccceeecCCC----------------ccccc--cccc-ccc--------------ccccccccccccc Q lcl|NC_017974. 154 NLAFRPQASLSRTNPIVYNEDG----------------SAIIG--IKQS-LSA--------------FQNYTASEIGVGG 200 (507) Q Consensus 154 ~l~~r~~~~i~~~~~~~~~~d~----------------~~l~~--~~q~-~~~--------------~~~~~~~~~~~~~ 200 (507) ...++.++. |..++.++ +.... .+-| +.. +.......++... T Consensus 156 --~~v~ad~~~---Pl~~~~~~v~~~ai~~~~~~~~~~~~~~Yt~lE~H~~~~~~~~~~~y~I~n~ly~s~~~~~lG~~v 230 (517) T protein:vir:98 156 --SWALANAFY---PLRSNSNGISEGVMKSVTTKVIGNKTVYYTLLEFHEWEKTEEGESLYVITNELYKSDNEGEIGKRI 230 (517) T ss_pred --EEEcCCeeE---EEEecCCCeEEEEEEEEEEEeecCCceEEEEEEEEecCceeccCCcEEEEEEEEecCCCccccccc Confidence 112222221 11111111 10000 0000 000 0000000000000 Q ss_pred -----cccCCCcccccc-cc--EEEEee----CCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEeccc Q lcl|NC_017974. 201 -----VSTRMSDVIIPI-NR--VMLMNT----GGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPS 268 (507) Q Consensus 201 -----~~~~~~~~~iP~-~k--~i~~~~----~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~ 268 (507) +......+.++. .+ |.+++. ....++|+|.|.+..+.-..-.-+..+.-|+.-++. | .. -+.+|. T Consensus 231 ~L~~~~e~l~~~~~~~g~~~Plf~y~~~p~~N~~~~~splG~S~~~~a~~~~d~lD~~~s~~~~e~~~-g-~~-~i~vp~ 307 (517) T protein:vir:98 231 PLEELYEGMQEKTYIQGLSRPLFNYLKPSGFNNINPHSPLGLGITDNSVSTLKKINDTYDQFWWEIKM-G-QR-TVFVSD 307 (517) T ss_pred cccccccCCCcceeECCCCcceEEEecCCcccccccCCCCCCchhhhhHHHHHHHHHHHHHHHHHHHh-C-Cc-ceecCh Confidence 000011111111 11 223322 223468999999998876554444444444443332 1 11 223454 Q ss_pred ccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCccccc-ccccceeeeeccCCCcchhHHHHHHHHHHHHH-H Q lcl|NC_017974. 269 QILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDN-APQYSMTLKGIDGMGKQYSTAQLISDRKKSIL-D 346 (507) Q Consensus 269 ~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~-~~~~~~el~~~~g~g~~~~~~~li~~~d~~Is-k 346 (507) .+.+............... .....+..+..+..-. +..++.+|. ...|.+.++.+=++|+ . T Consensus 308 ~~l~~~~~~~g~~~~~~~d----------~~~~~y~~~~~~~~~~~i~~~~~~iR-------~e~~~~~~~~~L~~i~~~ 370 (517) T protein:vir:98 308 VMLRTVPDESGMPPPQVFD----------PDVNVYKSIRMGTDEEFVKDVTHDIR-------TEQYKEAINQALRTLEME 370 (517) T ss_pred hhhccccCCCCcccCCCCC----------cccceeeeccCCCCCCceeeeccccc-------hHHHHHHHHHHHHHHHHH Confidence 4432221110000000000 0000000010000000 000000000 0123344444444443 3 Q ss_pred HHhc-ccccccccchhhhhHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHh-------CCCcCCccccceEEecC Q lcl|NC_017974. 347 RLGA-GFINVGNDKGGSYNLSESKQT--IHTQFVQRVNEIILEALNENLLPQLLAL-------NDMRLPETEMPYVKAGE 416 (507) Q Consensus 347 ~iLG-qtLts~~~~~GS~Al~~vh~~--v~~~~~~aD~~~i~~~ln~~li~~l~~l-------N~~~~~~~~~P~~~~~~ 416 (507) +-++ +|++. +..|...+-+|-.+ -.-.-+.+-.+.+...| ++|++.++.+ |+. .+...-+.+.|+. T Consensus 371 ~Gls~~t~~~--~~~~~kTATEi~s~~~~~~~t~~~~~~~~~~aL-~~lv~~i~~l~~~~~~~~~~-~~~~~~v~v~f~D 446 (517) T protein:vir:98 371 LKLSVGTFSF--DGRSMKTATEIVSENDLTYRTRNDHVYEVEQFI-KGLVISVLELAKTYKLFGGE-IPSAEHIGVDFDD 446 (517) T ss_pred hCCCcccccc--cccccccHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhcCCC-CCCCcceEEEcCC Confidence 3344 33332 22333212233221 11123344555566666 4577666532 211 1222336789999 Q ss_pred cCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhhcCCCcCCCCCCcccccCccccc Q lcl|NC_017974. 417 IVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMKLLGEDTSRAGDGMTKGSSGNGT 491 (507) Q Consensus 417 ~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (507) .-.+|.++.++.+.+++.+|++.. +.++.+.||+.+.+..++.+.........+..+........+.|.+. T Consensus 447 ~i~~D~~~~~~~~~~~v~aG~ms~----~~~i~~~~g~~eeeA~~e~~~i~~E~~~~~~~~~~~~~~~~~~gd~e 517 (517) T protein:vir:98 447 GVFQDRSALLRFYGQAKTFGFIPT----VEAIQRIFKVPKKTAEQWLEEIRKDQIELDPVTISQRAQKRMFGDEE 517 (517) T ss_pred CCCCCHHHHHHHHHHHHhcCCCCH----HHHHHHhCCCChHHHHHHHHHHHHhccccCCCCccccccCCCCCCCC Confidence 989999999999999999998654 57899999997554433333322222111111111111111222211 No 138 >protein:vir:5839 Length: 533 # NCBI annotation: similar to portal vertex protein of head # Family: family:all:1036 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835625;genbank:gi:30044028 Probab=96.88 E-value=0.00028 Score=40.07 Aligned_cols=437 Identities=12% Similarity=0.096 Sum_probs=167.3 Q ss_pred Cccccc----------ccccccCccccccccc--cccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHh-hCh Q lcl|NC_017974. 1 MSRKRN----------RNRSVKVAKATSEQIN--VSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMK-QDA 67 (507) Q Consensus 1 ~~~k~~----------~~~~~~~~~~~~~~~~--~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~-~D~ 67 (507) |.+=++ ..+.-.++.|.....+ ++....+..++.+... .+.. +..---.++++.|++|. .++ T Consensus 4 ~~~w~~~de~~~~~~~~~~~~~~~~p~~~dG~s~i~~~~~~~~~~~~~~~--~~~g---g~~~n~~eLI~~YR~ma~~~p 78 (533) T protein:vir:58 4 LEKYKKLNEAVNFTNFLSPMYGMGAPHGAGGSSMIPINMYHPFATAGYAS--RFYG---GIEFNRFFLYDMYDRMDYTDP 78 (533) T ss_pred cchhhhhhHHHHHHHhhchhhcccCccCCCCCccccCCCCcchhhhhhhh--hhhc---cccccHHHHHHHHHHhhccCc Confidence 111000 0111122222221111 1110011111111111 1110 11111257899999995 699 Q ss_pred HHHHHHHHHHHHHhhCCc---eeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHHH-HHhhcceeeeeeeeccCc Q lcl|NC_017974. 68 TVATGLDMLYTFVEKAFK---DFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAAT-FNEYGLSVVEKVYTQVTV 143 (507) Q Consensus 68 ~v~s~l~~Rk~av~~~~~---~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l~-a~~~Gfs~~Ei~w~~~~~ 143 (507) .|.++++....-+....- .+.+.-.+.+-++.+.+-|..+|+ |.....++.. -.-.|-..+.++=+. T Consensus 79 EVd~AideIvneaiv~d~~~~pV~v~l~~~e~s~~iK~kI~~lld------f~~~~~~~fR~WYVDGriy~Hkiik~--- 149 (533) T protein:vir:58 79 LISTVLDIIADECTIPNENGNIVDVVTKDIELAKAILSYLDYVIN------IEKNAYPIIRNMIKYGDMFLHILEKG--- 149 (533) T ss_pred chhhHHHhhhceeeEecCCCceeEeecccccccHHHHHHHHHHhc------chhhhhHHHHhhhhcceeEEEeccCC--- Confidence 999999988775532110 011111122223333333332221 2222112111 122333333332111 Q ss_pred cccCcceeeeeccccCchhcccccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeC-C Q lcl|NC_017974. 144 GEYIGKYKVKNLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTG-G 222 (507) Q Consensus 144 ~~~~g~~~~~~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~-~ 222 (507) .+-.|.+|....|.++.++.-..- -..++.+... ..........+.||.+..+.+.++ . T Consensus 150 ----~k~GI~elr~lDPr~i~~vr~~~t-------------~~eyyvy~~~---~~~~~s~~~~~kI~~daI~y~~SGl~ 209 (533) T protein:vir:58 150 ----SDGTIEKFQVVSPYIFSKRYNPET-------------DTWYYVITDV---YRNVVSGYFNEDIPEEDVIHFSHKID 209 (533) T ss_pred ----cccchhhheecCCeeeEEEEeecc-------------ceEEEeeccc---ccccccCccccccchhheeeeeeccc Confidence 112233333334444433321100 0111222211 112223445688997765544444 3 Q ss_pred ccCccccchHHHHHHHHHHHHHHHHHHHHHH-----HHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHh- Q lcl|NC_017974. 223 SSSQALGVSPLVGCYRAWREKILIENLEVVG-----ATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANA- 296 (507) Q Consensus 223 ~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f-----~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~- 296 (507) ....+++.|.|+++..++=--+.....-+.+ -||- ++++. +|..+. ....+.|+.++.-.+|= T Consensus 210 d~~~~~iisyLhkAiKp~NQLkmiEDAlVIYRisRAPeRR-----vFYID--VGNlpk----~KAeqYl~~im~k~kNkl 278 (533) T protein:vir:58 210 TNFFPYGRSYLESARAIWNQLRLMEDALMLYRVVRSVDRR-----VFYVD--VGNVPP----DKINEYLTNIAMQYKRDY 278 (533) T ss_pred cCCCCceehhhhHHHHHHHHHHHHHHHHHHHhhcCChhhe-----EEEEe--ecCCCc----cCHHHHHHHHHHhcccce Confidence 4567999999999988875555544333333 2332 44432 222222 12233455444433220 Q ss_pred hcCCceeEe-----------------ec-----CcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhccccc Q lcl|NC_017974. 297 HSGEQSFFM-----------------LP-----SDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFIN 354 (507) Q Consensus 297 ~~G~~a~~i-----------------iP-----~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLt 354 (507) .-.+..|=| +| .|++|++ ..|+ ... -..-|+|..+.+-+++--..-- T Consensus 279 vYDa~TGev~ddrk~m~~~sMlEDyWLpRReGgrgTEI~T---------LpGg-~lg-emeDV~YF~kkLy~ALnVP~sR 347 (533) T protein:vir:58 279 WVRNNQNQFLGIDNYFSIESILKDYFIPRRGDRRAVEIDI---------LQGS-KVD-LAEDVEYMLNRLISALKVPKAF 347 (533) T ss_pred EEeccCCeEeeccchhhhhhhHhhhcccccCCCccceeee---------cCCC-CCC-cHHHHHHHHHHHHHHhCCCeee Confidence 001111111 11 1344432 2232 233 3467999999999998877643 Q ss_pred ccccchhhhhHHHHH-HHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCccccceEEecC----cCchhHHHHHHH Q lcl|NC_017974. 355 VGNDKGGSYNLSESK-QTI-HTQFVQRVNEIILEALNENLLPQLLALNDMRLPETEMPYVKAGE----IVDVDMEGFSKA 428 (507) Q Consensus 355 s~~~~~GS~Al~~vh-~~v-~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~~P~~~~~~----~~~~dl~~~a~~ 428 (507) .+... |....+++. .++ |...++.....+...|.+||| |++.. ..+.+ ++.|.. .|-.|.+.+.++ T Consensus 348 l~~e~-~fgr~~eItRDEiKF~KFI~rLR~rF~~ll~~qLi-----lk~ii-t~eew-~~~f~~Dn~f~ElKe~Eil~~R 419 (533) T protein:vir:58 348 IGYEG-DVNAKNTLATQDIKFNNTIKRIQGFFVEELERMVR-----MNKEF-ADQDF-RLVMNRSNSIVEGERFAVIEQR 419 (533) T ss_pred cCCCC-CCccchhhhHHHHHHHHHHHHHHHHHHHHHhcccc-----cccCc-chhhe-eeeeeccchHHHHHHHHHHHHH Confidence 33222 222223332 222 445555566666666666553 56432 22333 344422 233455555666 Q ss_pred HHHHHhcCCccccHHHHHHHHHH-cCCCCCCCcccccch------------------hhhcCCCcCCCCCCcccccCccc Q lcl|NC_017974. 429 IQRIGAVGYLPKTPKVINRVLEV-LGIDEKIEEDISQEE------------------LMKLLGEDTSRAGDGMTKGSSGN 489 (507) Q Consensus 429 ~~~L~~~G~~~~~~~~~~~i~e~-~Gip~~~~~~~~~~~------------------~~~~~~~~~~~~~~~~~~~~~~~ 489 (507) +..|..+--. +...||++. |.++....++...-+ +++..+....|...+....-.+. T Consensus 420 i~~l~~~dpy----vgk~yi~k~ILr~tdei~~q~e~ie~E~~~~~~~~~~~~~e~~~~~~~~~~~~p~~~~~~~~~~~~ 495 (533) T protein:vir:58 420 IGIAERLKGW----VREDWIYSNILQIPYDLKPQEEVAEAAGGGGLFDTGGFGEETTPADFLGERGSPIESPRGRTEFDF 495 (533) T ss_pred HHHHHHhcch----hhHHHHHHHHhcCChhhhHHHHHHHHhhcCCCCCCCCcccccCCcccCccccCcccCCCChhhHhc Confidence 6666543211 223566543 566532111100000 00001111111100000000011 Q ss_pred ccccccCCCCchhhhc----------------cC Q lcl|NC_017974. 490 GTGKISSTRDNSAANL----------------DN 507 (507) Q Consensus 490 ~~~~~~~~~d~~a~~~----------------~~ 507 (507) +++ +..-.-++-|+ += T Consensus 496 ~~~--~~~~~~~~~~~~~a~~~~~~~~g~~~~~~ 527 (533) T protein:vir:58 496 GTE--GGEELGGELNLGGAFEEFEEETGGGEEEL 527 (533) T ss_pred ccC--CcccccccccccccchhhhhhcCCcccCC Confidence 111 11011111111 11 No 139 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=96.83 E-value=0.00031 Score=39.83 Aligned_cols=440 Identities=10% Similarity=0.003 Sum_probs=150.0 Q ss_pred CccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHH--hhC------hHH---HHHHHHHHHHHh Q lcl|NC_017974. 13 VAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESM--KQD------ATV---ATGLDMLYTFVE 81 (507) Q Consensus 13 ~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m--~~D------~~v---~s~l~~Rk~av~ 81 (507) |..+.....+..+.. ........+. +..+ |..++-+.|+-- ..+ ... .++.+==+..|. T Consensus 1 ~~~~~~~~~~~~~~~------~~~~l~~~~~--~~~~--rl~~l~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd 70 (484) T protein:vir:77 1 MTSPLQKQENVDPEK------AREEMLNLFT--ERTQ--DLGDNTAYYESERRPDAVGVTVPQQMQKLLAHVGYPRLYID 70 (484) T ss_pred CCCcccccCCCCHHH------HHHHHHHHHH--HHHH--HHHHHHHHHhccccchhcccccchhHHhhhhhcCcHHHHHH Confidence 333322221111100 0000001110 0111 111111112110 000 000 000000011111 Q ss_pred hCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHH-HHHHhhcceeeeeeeeccCcccc-C--cceeeeeccc Q lcl|NC_017974. 82 KAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDA-ATFNEYGLSVVEKVYTQVTVGEY-I--GKYKVKNLAF 157 (507) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~-l~a~~~Gfs~~Ei~w~~~~~~~~-~--g~~~~~~l~~ 157 (507) .+.=...+.+-..+.+.+..+-+.+++.. ..|.....++ .+|.-||.| +++||....+... . +...+.-+.+ T Consensus 71 ~~~~~l~~~g~~~~~~~~~~~~l~~i~~~---N~~d~~~~~~~~~a~~~G~a-~~~v~~~~~~~~~~~~~~~~~i~~~~p 146 (484) T protein:vir:77 71 AIAARQELEGFRLGGADKADEQLWDWWQA---NDLDIESTLGHTDSLVHGRS-YITISKPDPNIDPGVDPEVPIIRVEPP 146 (484) T ss_pred HHHhhhccCceecCCcchhHHHHHHHHHh---cCHhHHHHHHHHHHhhcCce-EEEEecCCCCcccccccccceEEEecc Confidence 11000011110111112222334444432 2466666664 468889996 5688865432110 0 0111211111 Q ss_pred c---------CchhcccccceeecCCCccccccccc-cccccccccccccccccccCCCccccccccEEEEeeCCccCcc Q lcl|NC_017974. 158 R---------PQASLSRTNPIVYNEDGSAIIGIKQS-LSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQA 227 (507) Q Consensus 158 r---------~~~~i~~~~~~~~~~d~~~l~~~~q~-~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p 227 (507) + ....+..++ +++..++.....+.-. +.....+.............. ..++.--++.|+++.+.+.| T Consensus 147 ~~~~~~~D~~~~~~~~a~~-~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~--~~~g~vPvv~f~N~~~~~~~ 223 (484) T protein:vir:77 147 TNLYAQIDPRTRQVMRAIR-AIEDEEGNEVIGATLYLPNNTVIWNREDGQWVQVANVA--HNLEMVPVIPIPNRTRLSDL 223 (484) T ss_pred ceeEEEecCCCCceEEEEE-EEEeecCCcEEEEEEEecCeEEEEEecCCceEeecccc--CCCCCcceEEeccccccCcc Confidence 1 011111111 1222222222111100 000001101010000111111 12222235778888899999 Q ss_pred ccchHHHHHHHHHHHH-HHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEee Q lcl|NC_017974. 228 LGVSPLVGCYRAWREK-ILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFML 306 (507) Q Consensus 228 ~G~gll~~~~~~~~fK-~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~ii 306 (507) .|.|-+.......+-. +..+..++..++-|..|.+++.- .........++.....++. +.+...++ T Consensus 224 ~G~s~i~~~v~~L~Da~~~~~s~~~~~~~~~a~p~~~i~G---~~~~~~~~~~~~~~~~~~~----------~~~~~~~~ 290 (484) T protein:vir:77 224 YGTTEITPELRSVTDAAARTLMLMQATAELMGVPQRLLFG---VKGEELGVDPETGQTLFDA----------YLARILAF 290 (484) T ss_pred CCcccchHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHhC---CCcchhcccccccchhhhh----------hhhhhccc Confidence 9988775433232211 34555666777777766665531 0000000000000011111 11122334 Q ss_pred cCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhccccccc----cc-c-hhhhhHHHHHHHHHHHHHHHH Q lcl|NC_017974. 307 PSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVG----ND-K-GGSYNLSESKQTIHTQFVQRV 380 (507) Q Consensus 307 P~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~----~~-~-~GS~Al~~vh~~v~~~~~~aD 380 (507) |. .+.++. ..+.+ +. ..++++++.-|.+.--...++-. .+ . .+.-|+. ....-....++.- T Consensus 291 ~~-~~~~~~-------q~~~~-~~---e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~-~~~~~l~~ka~~k 357 (484) T protein:vir:77 291 ED-HESKAQ-------QFSAA-EL---RNFVDALDALDRKAAAYTGLPPYYLSFSSENPASAEAIR-SSESRLVKTVERK 357 (484) T ss_pred CC-CCceeE-------eecCC-Ch---HHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHH-HHHHHHHHHHHHH Confidence 42 222222 22211 12 23455555554433211111100 00 0 1112222 2222223333444 Q ss_pred HHHHHHHHHHHHHHHHHHh-CCCcCCcc-ccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCC Q lcl|NC_017974. 381 NEIILEALNENLLPQLLAL-NDMRLPET-EMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKI 458 (507) Q Consensus 381 ~~~i~~~ln~~li~~l~~l-N~~~~~~~-~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~ 458 (507) .+.+...|.+ +++-++.+ |....+.. .--.+.|....+.++.+.++++.+|+++|.-+.+ .+-+.+.+|+-+.. T Consensus 358 ~~~f~~~l~~-~~~l~~~~~~~~~~~~~~~~i~v~w~~~~~~s~~~~ad~~~kl~~~g~gi~s---~et~~~~l~~~~~~ 433 (484) T protein:vir:77 358 NKIFGGAWEQ-AMRVAYKVMNGGDIPPEYYRMESIWRDPSTPTYAAKADAATKLYNNGQGVIP---KERARIDMGYSITE 433 (484) T ss_pred HHHHHHHHHH-HHHHHHHHhCCCCcccccccceEEecCCCCCCHHHHHHHHHHHHhccCCCCC---HHHHHhcCCCChhH Confidence 4555556643 55555544 32111111 1235678888889999999999999998864443 34577788885432 Q ss_pred Cccccc---ch-------hhhcCCCcCCCCCCcccccCcccccccccCCCCchhh Q lcl|NC_017974. 459 EEDISQ---EE-------LMKLLGEDTSRAGDGMTKGSSGNGTGKISSTRDNSAA 503 (507) Q Consensus 459 ~~~~~~---~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~a~ 503 (507) .++... ++ .....+..+...+++ ...+.....+++-..+++ T Consensus 434 ~~e~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~ 484 (484) T protein:vir:77 434 REEMRKWDEEEQAQGLGLMGTMFGTDPSGGGNP----DNPETPEPQPNPAEEAAA 484 (484) T ss_pred HHHHHHHHHHHHHHHHHHHhhhccccccCCCCC----CCCCcccccCCCccccCC Confidence 222110 00 000001111111111 011111112222222222 No 140 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=96.80 E-value=0.00033 Score=39.65 Aligned_cols=435 Identities=10% Similarity=-0.012 Sum_probs=148.0 Q ss_pred CccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhC---------h---HHHHHHHHHHHHH Q lcl|NC_017974. 13 VAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQD---------A---TVATGLDMLYTFV 80 (507) Q Consensus 13 ~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D---------~---~v~s~l~~Rk~av 80 (507) |..+..- +.+.-........+....+ ....|..++.+.|+- ..+ . ....+.+==+..| T Consensus 1 ~~~~i~~--------~~~~~~~~~~~~~l~~~~~-~~~~r~~~~~~Yy~G-~~~i~~~~~~~~~~~~~~~~~~n~~~~iv 70 (485) T protein:vir:10 1 MTAPLPG--------QEEIEDPAIARDEMVSAFE-DSTQNLKTNTSYYEA-ERRPEAIGVTVPIQMQSLLAHVGYPRLYV 70 (485) T ss_pred CCCCCCC--------CCCCCCHHHHHHHHHHHHH-HHHHHHHHHHHHHhc-CCcchhcCCCCChhhhhhhhhcCcHHHHH Confidence 2221110 0001111100001000000 000011111111110 000 0 0000000001111 Q ss_pred hhC-----CceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHH-HHHHhhcceeeeeeeeccCcc---ccCccee Q lcl|NC_017974. 81 EKA-----FKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDA-ATFNEYGLSVVEKVYTQVTVG---EYIGKYK 151 (507) Q Consensus 81 ~~~-----~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~-l~a~~~Gfs~~Ei~w~~~~~~---~~~g~~~ 151 (507) ..+ .--|.+. +++. ..+.+.+++.. ..|......+ .+|+-||.| ++++|.-..+. ..++... T Consensus 71 d~~~~~l~~~g~~~~-~~~~----~~~~~~~i~~~---N~~d~~~~~~~~~a~i~G~a-y~~v~~~e~~~~~~~~~~~~~ 141 (485) T protein:vir:10 71 DSIAERQAVEGFRFG-DADE----ADEELWQWWQA---NNLDIEAPLGYTDAYVHGRS-YITISRPDPQIDLGWDPNTPI 141 (485) T ss_pred HHHHhhhcccceecC-CCch----hHHHHHHHHHh---cCHhHHHHHHHHHHhhcCce-EEEEeeCCcccccccCCCeeE Confidence 110 0012222 2222 23334444432 2466665554 468889988 56888754321 1233333 Q ss_pred eeeccccCchhccccc----------ceeecCCCccccccc-cccccccccccccccccccccCCCccccccccEEEEee Q lcl|NC_017974. 152 VKNLAFRPQASLSRTN----------PIVYNEDGSAIIGIK-QSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNT 220 (507) Q Consensus 152 ~~~l~~r~~~~i~~~~----------~~~~~~d~~~l~~~~-q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~ 220 (507) +.-+.|+ .++.-|. .+++..++....... -.+.....+...... ..........++.--++.|.+ T Consensus 142 i~~~~p~--~~~~~~D~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~--~~~~~~~~~~~g~vPvv~~~n 217 (485) T protein:vir:10 142 IRVEPPT--RMYAEIDPRIGRVSKAIRVAYDAEGNEIQAATLYTPNDIFGWYRVENE--WQEWFNNPHGLGVVPVVPIPN 217 (485) T ss_pred EEEEccc--eeEEEEcCCCCceeEEEEEEEeeCCCeEEEEEEEeCCeEEEEEEcCCc--eEEeccccCCCCcccEEEecc Confidence 3322221 1111000 011111111111110 001111111110000 000111122233334577888 Q ss_pred CCccCccccchHHHHHHHHHH-HHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcC Q lcl|NC_017974. 221 GGSSSQALGVSPLVGCYRAWR-EKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSG 299 (507) Q Consensus 221 ~~~~~~p~G~gll~~~~~~~~-fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G 299 (507) ..+.+.|+|.|-+..-....+ --+..+...+...+-|..|..++.- .........++.... ..+. . T Consensus 218 ~~~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G---~~~~~~~~~~~~~~~----~~~~------~ 284 (485) T protein:vir:10 218 RTRLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFG---IKPEEIGVDPETGQT----LFDA------Y 284 (485) T ss_pred ccccCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHhhcchHHHHhc---CCcccccccccccch----hhhh------c Confidence 888888999886653222211 1133455566677777777665541 000000001111111 1111 1 Q ss_pred CceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhccccccc------ccchhhhhHHHHHHHHH Q lcl|NC_017974. 300 EQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVG------NDKGGSYNLSESKQTIH 373 (507) Q Consensus 300 ~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~------~~~~GS~Al~~vh~~v~ 373 (507) ..+ +....+.+.+|.+ .+.+ +. ..++++++.-|.+......++.. +...+.-|+.. ...-. T Consensus 285 ~~~-i~~~~~~d~k~~q-------~~~~-~~---~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~Sg~Al~~-~~~~l 351 (485) T protein:vir:10 285 LAR-ILAFEDAEGKIQQ-------FSAA-EL---ANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRA-AESRL 351 (485) T ss_pred ccc-eeccCCCCceEEe-------eccc-ch---HHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHH-HHHHH Confidence 112 2233333333322 2221 12 23455555444333211111110 00012223222 22222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhCC-CcCCcc-ccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHH Q lcl|NC_017974. 374 TQFVQRVNEIILEALNENLLPQLLALND-MRLPET-EMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEV 451 (507) Q Consensus 374 ~~~~~aD~~~i~~~ln~~li~~l~~lN~-~~~~~~-~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~ 451 (507) ...++.-.+.+...|.+ +++-++.+.. ...+.. .--.+.|....+.++.+.++++.+|++.|..+.. .+-+++. T Consensus 352 ~~k~~~k~~~f~~~l~~-~~~l~~~~~~~~~~~~~~~~i~v~w~~~~~~~~~~~ada~~kl~~ag~~~~s---~et~~~~ 427 (485) T protein:vir:10 352 IKKVERKNSIFGGAWEE-AMRLAYRMMKGGDVPPDMLRMETVWRDPSTPTYAAKADAASKLYNGGTGVIP---RERARKD 427 (485) T ss_pred HHHHHHHHHHHHHHHHH-HHHHHHHHhCCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCC---HHHHHHh Confidence 33334444555566643 5555555432 111111 1224678888899999999999999998843332 3456788 Q ss_pred cCCCCCCCccccc--chhh-------hcCCCcCCCCCCcccccCcccccccccCCCCchh Q lcl|NC_017974. 452 LGIDEKIEEDISQ--EELM-------KLLGEDTSRAGDGMTKGSSGNGTGKISSTRDNSA 502 (507) Q Consensus 452 ~Gip~~~~~~~~~--~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~a 502 (507) +|+.+...++... ++.. ..+..+.....+......+++.++ .....+.| T Consensus 428 lg~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~ 485 (485) T protein:vir:10 428 MGYSIAEREEMRRWDEEEAAMGLGLIGTMVDPNPTVPGSPSPAPAPKPAA--LESGGDAA 485 (485) T ss_pred CCCCHhHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCCCCccccccCcC--CCCCCCCC Confidence 8886442221110 0000 001111110000000000000000 01111111 No 141 >protein:vir:5665 Length: 511 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899604;genbank:gi:34419591;genbank:GeneID:2546036 Probab=96.76 E-value=0.00036 Score=39.49 Aligned_cols=440 Identities=12% Similarity=0.076 Sum_probs=172.1 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHh----hhhcccchhccchhHHHHHHHhhChHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVD----SERMKAEEIRWPNLIGTVESMKQDATVATGLDML 76 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~----~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~R 76 (507) -.+++-++++...+.|.....++. . +.+..+.....++ ...+.. .+-.++++.|++|..++.|-++++.. T Consensus 11 ~~~~~~~~~~~S~~~p~~~DGa~~-i---~~~~~~~~~~g~~~~~~~~~~~~--~~~~eLI~~YR~ma~~pEvd~Av~eI 84 (511) T protein:vir:56 11 DIQKIEKNPVRSFSAPDNVDGAKE-I---HTNLLAPQLGHAIIPSDAQSEGT--IPVKELIKSYRALAEYHEVDDAIQEI 84 (511) T ss_pred hhhhhccCCcccccCCCCCCCceE-E---ecccccceecceeccccccccCc--cchHHHHHHHHHHhhccchhhHHHHh Confidence 112222222222333332222111 0 0011111111111 112211 12237999999999999999999988 Q ss_pred HHHHhhCCc---eeeeCCCCChhHHH----HHHHHHHHHHhccCCCHHHHHHHHHHHHhhcceeeeeeeeccCccc---- Q lcl|NC_017974. 77 YTFVEKAFK---DFKVIPGESEESKK----AAKFVEYCLKNMEGQTLRQFARDAATFNEYGLSVVEKVYTQVTVGE---- 145 (507) Q Consensus 77 k~av~~~~~---~~~~~~~~~~~~~~----~ae~v~~~l~~~~~~~~~~~l~~~l~a~~~Gfs~~Ei~w~~~~~~~---- 145 (507) ...+...+- .+.+.-..++-++. +.+-++..| .+|+.--+||..+- -|-.++.-. T Consensus 85 vne~iv~d~~~~pV~l~ld~~~~s~~iK~kI~eeF~~Il-------------~ll~F~~~~~~~fR-~WYVDgRi~fHki 150 (511) T protein:vir:56 85 VDEAIVYENDKEVVWLNLDNTDFSENIKAKINEEFDRVV-------------SLLQMRKHGYKWFR-KWYVDSRIYFHKI 150 (511) T ss_pred hcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHH-------------HHhccchhhhHHHh-hhhhcceEEEEEE Confidence 776532210 01111111222233 333333333 34444444444332 121111000 Q ss_pred cCcceeeeeccccCchhcccccceeec-CCCcccccccccccccccccccccc-cccccc---CCCccccccccEEEEee Q lcl|NC_017974. 146 YIGKYKVKNLAFRPQASLSRTNPIVYN-EDGSAIIGIKQSLSAFQNYTASEIG-VGGVST---RMSDVIIPINRVMLMNT 220 (507) Q Consensus 146 ~~g~~~~~~l~~r~~~~i~~~~~~~~~-~d~~~l~~~~q~~~~~~~~~~~~~~-~~~~~~---~~~~~~iP~~k~i~~~~ 220 (507) .+.+-.+..|..-.|+++.+++-...+ .+|..+ +... ..++.+.+.... ...... ...++.||.+ -|+|+| T Consensus 151 id~k~GI~eLr~lDPr~i~~vr~i~~~~~~~~~v--~~~~-~ey~~Y~~~~~~~~~~~~~~~~~~~~vkI~~d-aI~y~h 226 (511) T protein:vir:56 151 LDKDNNIIELRPLNPMKMELVREIQKETIDGVEV--VKGT-LEYYVYKQSDYKMPSWMSATNRAQTSFRIPKD-AIVFAH 226 (511) T ss_pred eccccceeehhhcCcccchhhhhhhccccccccc--ccce-eeeeEecCCCcccCcccccccccccceeechh-heeeec Confidence 011113445555556666665544432 222222 1111 122222221111 111111 2255777765 477787 Q ss_pred CCc----cCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHh Q lcl|NC_017974. 221 GGS----SSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANA 296 (507) Q Consensus 221 ~~~----~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~ 296 (507) .+- .++++..|.|.++..++==-+.....-+.+-=--.-.=.++++. +|..+ .....+.|+.++.-..|- T Consensus 227 SGL~d~~~~~g~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYID--VGnLP----k~KAeqYl~~iM~k~kNk 300 (511) T protein:vir:56 227 SGLMRGCADDPYIIGYLDRAIKPANQLKMLEDALVIYRLARAPERRVFYVD--VGNLP----TQKAQQYVNGIMQNVKNR 300 (511) T ss_pred ccceeccCCCCeeeccchhhhHHHHhhHHHHhhHHHHhhhccccceEEEEe--cCCCC----chhHHHHHHHHHHhcCce Confidence 653 57788999999999887544443333333210011111244442 22222 223334555544332220 Q ss_pred -hcCCceeE--------------eecC-----cccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhccccccc Q lcl|NC_017974. 297 -HSGEQSFF--------------MLPS-----DTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVG 356 (507) Q Consensus 297 -~~G~~a~~--------------iiP~-----g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~ 356 (507) .-.+..|- -+|. |+||.+. .|+.+.. -..=|+|..+.+-+++--..--.+ T Consensus 301 lVYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTL---------pGgqnlg-em~DV~YF~kKLy~aLnVP~SRl~ 370 (511) T protein:vir:56 301 VVYDTQTGQVKNTTNAMSMLEDYYLPRREGSKGTEVSTL---------PGGQSLG-DIEDVLYFNRKLYKAMRIPTSRAA 370 (511) T ss_pred EEEeccCceeccchhhhhhHhhhcccccCCCCccceeec---------cccCCcC-hHHHHHHHHHHHHHHhCCCccccc Confidence 00111111 1222 5555432 2332232 234589999999998877653332 Q ss_pred -ccchhhhhH---HHHH-HHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCccc----cceEEecCc------Cch Q lcl|NC_017974. 357 -NDKGGSYNL---SESK-QTI-HTQFVQRVNEIILEALNENLLPQLLALNDMRLPETE----MPYVKAGEI------VDV 420 (507) Q Consensus 357 -~~~~GS~Al---~~vh-~~v-~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~----~P~~~~~~~------~~~ 420 (507) ++.+++..+ +++. .|+ |...+......+...|..-|-..|+-=+.. ..+. .+.+.|+.. |-. T Consensus 371 ~e~q~~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLilKgii--t~eeW~~i~~~I~~~f~~Dn~f~ElK 448 (511) T protein:vir:56 371 SEDQTGGINFGQGAEITRDELKFTKFVKRLQTKFETVITDPLKHQLIVNNII--TEEEWDANHEKLYVVFNQDSYFEEAK 448 (511) T ss_pred CCCCccccccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCC--CHHHHHHHhhcceEEeeecchHHHHH Confidence 211222222 2222 232 223344444444444443333333321211 1111 122333322 334 Q ss_pred hHHHHHHHHHHHHhcCCccccHHHHHHHHHH-cCCCCCCCcccccchhhhcCCCcCCCCCCcccccCccccc Q lcl|NC_017974. 421 DMEGFSKAIQRIGAVGYLPKTPKVINRVLEV-LGIDEKIEEDISQEELMKLLGEDTSRAGDGMTKGSSGNGT 491 (507) Q Consensus 421 dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~-~Gip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (507) +.+.+.+++..|..+--.+-.-.+.+|+++. |.+.+.+-.++. ..-.....++--+.. .+|. T Consensus 449 e~Eil~~Rl~~l~~~dpyvGky~S~~yi~k~ILr~tDeei~~~~-k~I~~E~k~~~~~~~--------e~~f 511 (511) T protein:vir:56 449 ELEILNSRMNAMRDIQDYAGKYYSHKYIQKNILRLSDDQITAMQ-SEIDEEETNPRFQQD--------DQGF 511 (511) T ss_pred HHHHHHHHHHHHHHhcchhccccchHHHHHHHhccCHHHHHHHH-HHHHHhhcCCCCCCc--------ccCC Confidence 5455556666665442111112235777654 566422111111 111111111111110 1111 No 142 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=96.72 E-value=0.00038 Score=39.32 Aligned_cols=434 Identities=11% Similarity=0.005 Sum_probs=154.3 Q ss_pred ccCccccccccccccccccccCchhhh-HHHHhhhhcccchhccchhHHHHHH---H-----hhChHH---HHHHHHHHH Q lcl|NC_017974. 11 VKVAKATSEQINVSRMRMSEQGTFALA-KVQVDSERMKAEEIRWPNLIGTVES---M-----KQDATV---ATGLDMLYT 78 (507) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~~~g~~~~~-~~~~~~~~e~~~~lr~~~~~~~y~~---m-----~~D~~v---~s~l~~Rk~ 78 (507) ...|-+...... .....+. ....+. ...+ |..++-+.|+- + .....+ ..+.+==+. T Consensus 1 ~~~~i~~~~~~~--------~~~~~~~~L~~~~~--~~~~--r~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ 68 (485) T protein:vir:24 1 MTAPLPGQEEIA--------DPAIARDEMVSAFE--DQNQ--NLRSNTSYYEAERRPEAIGVTVPVQMQSLLAHVGYPRL 68 (485) T ss_pred CCCCCCCCCccc--------chHHHHHHHHHHHH--HHHH--HHHHHHHHHhccCchhhcCcccchhhhhhhhccchHHH Confidence 111111111110 0111110 011110 0000 11111111210 0 000000 000000011 Q ss_pred HHhhC-----CceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHH-HHHHhhcceeeeeeeeccCccc---cCcc Q lcl|NC_017974. 79 FVEKA-----FKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDA-ATFNEYGLSVVEKVYTQVTVGE---YIGK 149 (507) Q Consensus 79 av~~~-----~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~-l~a~~~Gfs~~Ei~w~~~~~~~---~~g~ 149 (507) .|... .--|.+. +++..+.. +.+++.. ..|..+..++ .++.-||.| ++++|..+.+.. ..+. T Consensus 69 ivd~~~~~l~~~g~~~~-~~~~~~~~----l~~i~~~---N~~d~~~~~~~~~a~i~G~a-y~~v~~~~~~~~~~~~~~~ 139 (485) T protein:vir:24 69 YVDSIAERQAVEGFRLG-DADEADEE----LWQWWQA---NNLDIEAPLGYTDAYVHGRS-YITISRPDPQIDLGWDPNV 139 (485) T ss_pred HHHHHhhhhccCceecC-CCchhHHH----HHHHHHh---cChhHHHHHHHHHHhhcCce-EEEEecCCcccccccCCCc Confidence 11111 0012222 22222233 3444332 2365555554 468889997 568887543211 1222 Q ss_pred eeeeeccccCchhccccc----------ceeecCCCccccccccc-cccccccccccccccccccCCCccccccccEEEE Q lcl|NC_017974. 150 YKVKNLAFRPQASLSRTN----------PIVYNEDGSAIIGIKQS-LSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLM 218 (507) Q Consensus 150 ~~~~~l~~r~~~~i~~~~----------~~~~~~d~~~l~~~~q~-~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~ 218 (507) ..++-+.|+ .++.-|. .+++..++......+-. +.....+...... ..........++.--++.| T Consensus 140 ~~i~~~~p~--~~~~i~D~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~--~~~~~~~~h~~g~vPvv~f 215 (485) T protein:vir:24 140 PLIRVEPPT--RMYAEIDPRIGRPAKAIRVAYDAEGNEIQAATLYTPNETFGWFRAEGE--WVEWFSDPHGLGAVPVVPL 215 (485) T ss_pred ceEEEeccc--eeEEEeeCCcCceeEEEEEEEeecCCeEEEEEEEcCCcEEEEEecCCc--eEeecccccCCCcccEEEe Confidence 233322221 1111110 01112222211111100 0011111110100 1011111222333345778 Q ss_pred eeCCccCccccchHHHHHHHHHHH-HHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhh Q lcl|NC_017974. 219 NTGGSSSQALGVSPLVGCYRAWRE-KILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAH 297 (507) Q Consensus 219 ~~~~~~~~p~G~gll~~~~~~~~f-K~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~ 297 (507) ++..+.+.|+|.|-+.......+- -+..+...+...+-|..|.+++. +.........++... .+.+. T Consensus 216 ~n~~~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~---G~~~~~~~~~~~~~~----~~~~~----- 283 (485) T protein:vir:24 216 PNRTRLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIF---GIKPEEIGVDPETGQ----TLFDA----- 283 (485) T ss_pred ccCcccCCcCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhc---cCCcccccccccccc----chhhh----- Confidence 888888889999877643322221 13445566667788888877654 110000000110000 01111 Q ss_pred cCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccc-cc-----hhhhhHHHHHHH Q lcl|NC_017974. 298 SGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGN-DK-----GGSYNLSESKQT 371 (507) Q Consensus 298 ~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~-~~-----~GS~Al~~vh~~ 371 (507) +..+...+| +.+.++. ..+.+ + ...++++++.-|.+.--...++... +. .+.-|+ ..... T Consensus 284 -~~~~i~~~~-~~~~~~~-------q~~~~-~---~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al-~~~~~ 349 (485) T protein:vir:24 284 -YLARILAFE-DAEGKIQ-------QFSAA-E---LANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAI-RAAES 349 (485) T ss_pred -cccceeccC-CCCceEE-------eeccc-c---hHHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHH-HHHHH Confidence 111222333 3333332 22221 1 2345666665555443221122110 00 111222 22233 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHh-CCCcCC-ccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHH Q lcl|NC_017974. 372 IHTQFVQRVNEIILEALNENLLPQLLAL-NDMRLP-ETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVL 449 (507) Q Consensus 372 v~~~~~~aD~~~i~~~ln~~li~~l~~l-N~~~~~-~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~ 449 (507) -....++.-.+.+...|++ +++-++.+ |....+ +..--.+.|....+.++.+.++.+.+|+..|....+ .+-++ T Consensus 350 ~l~~ka~~~~~~f~~~l~~-~~~l~~~~~~~~~~~~d~~~i~v~f~~~~~~s~~~~ad~~~kl~~~g~~~~s---~et~~ 425 (485) T protein:vir:24 350 RLIKKVERKNAIFGGAWEE-AMRLAYRLMKGGDVPPDMLRMETVWRDPSTPTYAAKADAATKLYGNGQGVIP---RERAR 425 (485) T ss_pred HHHHHHHHHHHHHHHHHHH-HHHHHHHHhcCCCCccccceeeEEecCCCCCCHHHHHHHHHHHHhcccccCC---HHHHH Confidence 3333444455555666653 55555555 321111 112346788888888999999999999998843222 34577 Q ss_pred HHcCCCCCCCccccc--chh-------hhcCCCcCCCCCCcccccCcccccccccC-CCCchh Q lcl|NC_017974. 450 EVLGIDEKIEEDISQ--EEL-------MKLLGEDTSRAGDGMTKGSSGNGTGKISS-TRDNSA 502 (507) Q Consensus 450 e~~Gip~~~~~~~~~--~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~d~~a 502 (507) +.+|+.+...++... ++. ...+.+. .+..++.. . .++.....++ .+++.| T Consensus 426 ~~l~~~~d~~~e~~~~~ee~~~~~~~~~~~~~~~-~~~~~~~~-~-~~e~~~~~~~~~~~~~a 485 (485) T protein:vir:24 426 KDMGYSIAEREEMRRWDEEEAAMGLGLLGTMVDA-DPTVPGSP-N-PTPAPKPQPAIEGGDSA 485 (485) T ss_pred hhCCCCHhHHHHHHHHHHHHhhhhhhHHHhhccc-CCCCCCCC-C-CCCCCCCccCCCCCCCC Confidence 888886432222110 000 0011111 11111110 0 1111111111 122233 No 143 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=96.61 E-value=0.00047 Score=38.84 Aligned_cols=429 Identities=9% Similarity=-0.038 Sum_probs=160.2 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHH-HHhhhhccc-chhccchhHHHH--HHHhhChHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKV-QVDSERMKA-EEIRWPNLIGTV--ESMKQDATVATGLDML 76 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~-~~~~~~e~~-~~lr~~~~~~~y--~~m~~D~~v~s~l~~R 76 (507) |.-++|.- -.+|.-..........-...+-.. +.+. ....+.+.. +.+..+.--..- .++ ..+...-++.+. T Consensus 1 ~~~~~~~~--~~~p~d~~~~~~~l~~~i~~~~~~-~~r~~~~~~yy~g~~~i~~~~~~~~~~~~~ki-~~n~~~~ivd~~ 76 (453) T protein:vir:39 1 MKYKPPKL--MTFPKDEPITNEVVTKFMEKHRLE-VARYEYLKNMYRGIMAIDAEPTKDLWKPDNRL-TVNFTKYIVDTF 76 (453) T ss_pred CeecCCcc--eEcCCCCCCCHHHHHHHHHHHHHH-HHHHHHHHHHhhccCchhcCCCccccCcccee-ecchHHHHHHHH Confidence 54443332 233333221111100000000000 0000 000011000 011110000000 000 122333333333 Q ss_pred HHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHH-HHHHhhcceeeeeeeeccCccccCcceeeeec Q lcl|NC_017974. 77 YTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDA-ATFNEYGLSVVEKVYTQVTVGEYIGKYKVKNL 155 (507) Q Consensus 77 k~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~-l~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l 155 (507) ..-+.+-+..+. ..++++.+.+.+++... .|...+.++ .++.-||.+. +++|... +|.+.+.-+ T Consensus 77 ~~~l~g~~~~~~------~~d~~~~~~l~~i~~~N---~~~~~~~~~~~~~~~~G~~~-~~v~~d~-----~g~~~i~~~ 141 (453) T protein:vir:39 77 TGYFNGIPVKKS------HSDKETLSKLQEFDNLN---DMEDEESELAKMACIYGRAF-ELLYQNE-----ETQTNVIYN 141 (453) T ss_pred hhhhcccCceec------cCChHHHHHHHHHHHhc---ChhHHHHHHHHHHhhcCeEE-EEEEecC-----CCceEEEEE Confidence 333333332222 12344556677776642 355555554 4688899754 5777532 355554444 Q ss_pred cccCchhcccccc-----------eeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCcc Q lcl|NC_017974. 156 AFRPQASLSRTNP-----------IVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSS 224 (507) Q Consensus 156 ~~r~~~~i~~~~~-----------~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~ 224 (507) .|+. .+.-|.. ++...+......+ -.+...+.+............. ...++.--++.|+ T Consensus 142 ~p~~--~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~~-yt~~~i~~~~~~~~~~~~~~~~--~~~~g~vPvv~~~----- 211 (453) T protein:vir:39 142 TPEN--MFMVYDDTIKQEPLFAVRYGYDDDYKLYGEV-YTKETTYALNGTMGFYNMTEQA--PNPFDDLPVVEFY----- 211 (453) T ss_pred cccc--eEEEecCCCCCeEEEEEEEEEeCCeEEEEEE-EeCCeEEEEEecCCceeeeccc--ccCCCceeEEEec----- Confidence 3321 1111100 0001111000000 0011111111110000000000 1111111123332 Q ss_pred CccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeE Q lcl|NC_017974. 225 SQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFF 304 (507) Q Consensus 225 ~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~ 304 (507) +++.|.|.+..+....=--+..+..++..++.+..|+++++- ...+++... .+.. + ..+ T Consensus 212 n~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~p~~~~~g--------~~~~~~~~~----~~~~-------~--~~~ 270 (453) T protein:vir:39 212 FNEERMSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFLG--------AAVEEEDLK----NIRS-------N--RVI 270 (453) T ss_pred CCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeec--------CCCCchhhh----hhhh-------c--cee Confidence 356788888775544434566778888889999999888751 111111111 1110 0 112 Q ss_pred eecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhccccccccc-chhhhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 305 MLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGND-KGGSYNLSESKQTIHTQFVQRVNEI 383 (507) Q Consensus 305 iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~-~~GS~Al~~vh~~v~~~~~~aD~~~ 383 (507) .++.+.. ......++++..+.. ...+...++.+.+.|...-....++.+.. ..+.-|+. ....-....+..-.+. T Consensus 271 ~~~~~~~-~~~~~~~~~lt~~~~--~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Al~-~~~~~l~~ka~~~~~~ 346 (453) T protein:vir:39 271 NYYGESS-EAKNVDVKFLEKPDS--DSQTENLLDRLTKLIFQTTMVANISDESFGSSSGVSLA-YKLQAMSNLALSFQRK 346 (453) T ss_pred eecCCCC-CCCCCceeEEeecCC--HHHHHHHHHHHHHHHHHHhCCcccccccccCChHHHHH-HHHHHHHHHHHHHHHH Confidence 2222111 111223344443322 23467788999998877543333333221 11112221 1111222333334455 Q ss_pred HHHHHHHHHHHHHHHhCCCcCCcc--ccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCC-CCCCCc Q lcl|NC_017974. 384 ILEALNENLLPQLLALNDMRLPET--EMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGI-DEKIEE 460 (507) Q Consensus 384 i~~~ln~~li~~l~~lN~~~~~~~--~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gi-p~~~~~ 460 (507) +...+. ++++.++++....+... .-..+.|....+.|+.+.++++.+|+ |+ ++ .+.+.+.++. +.+. . T Consensus 347 ~~~~l~-~~~~li~~~~~~~~~~~~~~~i~v~f~~~~p~~~~~~a~~~~kl~--g~-is----~et~l~~l~~v~D~~-~ 417 (453) T protein:vir:39 347 FQSSLN-SRYKLYCELSTNVSNKEAWKDIEYTFTRNEPKDIKEQAETANILM--GI-TS----QETALSVISVIPDVQ-A 417 (453) T ss_pred HHHHHH-HHHHHHHHHHhccCCccccccceEEeCCCCCcCHHHHHHHHHHHh--cc-CC----hHHHHHhCCCCCCHH-H Confidence 555664 35566655532222111 22368888899999999999999984 65 44 2456666764 3221 1 Q ss_pred cccc--chhhhc--CCCcCCCCCCcccccCccccccc Q lcl|NC_017974. 461 DISQ--EELMKL--LGEDTSRAGDGMTKGSSGNGTGK 493 (507) Q Consensus 461 ~~~~--~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~ 493 (507) +... .+.... ......+..++....+.. ...+ T Consensus 418 E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~-~~~e 453 (453) T protein:vir:39 418 EMEKIKKEEASTAIFDKDKQPSEKGTDTVVPE-TNEE 453 (453) T ss_pred HHHHHHHHHHHHHHHHHhccCCCCCCCCCCCC-cCCC Confidence 1110 010000 000111111111111110 0011 No 144 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=96.56 E-value=0.00051 Score=38.64 Aligned_cols=422 Identities=9% Similarity=0.005 Sum_probs=161.0 Q ss_pred CcccccccccccCccccccccccccccccccCch---hhhHHHHhhhhcccc-hhccc--hhHHHHHH------H----- Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTF---ALAKVQVDSERMKAE-EIRWP--NLIGTVES------M----- 63 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~---~~~~~~~~~~~e~~~-~lr~~--~~~~~y~~------m----- 63 (507) |+=-|=... -+.....+.....-+..+-.. ..+..+-........ .+.++ ..+..|.. . T Consensus 1 ~~~~~~~~~----~~~~~~~~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 76 (474) T protein:vir:94 1 MTLYKLIDD----IEAQGILPKHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVN 76 (474) T ss_pred CchHHHHhh----ccccCCCHHHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcc Confidence 110000000 000000000000000000000 000000000000000 00000 00001100 0 Q ss_pred --hhChHHHHHHHHHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHH-HHHHHhhcceeeeeeeec Q lcl|NC_017974. 64 --KQDATVATGLDMLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARD-AATFNEYGLSVVEKVYTQ 140 (507) Q Consensus 64 --~~D~~v~s~l~~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~-~l~a~~~Gfs~~Ei~w~~ 140 (507) .-.+...-++.....-+.+-+..+.+++ ++..++++.+++.+++... .+...... ..++.-||.+ ++++|.. T Consensus 77 ~ki~~n~~~~ivd~~~~yl~g~pv~~~~~~-~~~~~e~~~~~l~~~~~~n---~~~~~~~~~~~~~~~~G~a-~~~~~~d 151 (474) T protein:vir:94 77 NKLNNSFDSEIVDTRVGYLHGVPVTYDLDE-NAEKNEKLKKFITNFAIRN---SVDDEDSEIGKMAAICGYG-ARLAYID 151 (474) T ss_pred cccccchHHHHHHhHhhheeccceeEeeCC-CCcchHHHHHHHHHHHhhc---CHhHHHHHHHHHHhhcCeE-EEEEEeC Confidence 0133444444444444545454554433 2345667888888877653 35555555 4568889975 5688753 Q ss_pred cCccccCcceeeeeccccCchhcccccceeecCCCccccccccc------------------cccccccccccccccccc Q lcl|NC_017974. 141 VTVGEYIGKYKVKNLAFRPQASLSRTNPIVYNEDGSAIIGIKQS------------------LSAFQNYTASEIGVGGVS 202 (507) Q Consensus 141 ~~~~~~~g~~~~~~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~------------------~~~~~~~~~~~~~~~~~~ 202 (507) . +|.+.+..+.|+. ++. .++..+..+..++.. +.....+....... ... T Consensus 152 ~-----~~~~~~~~i~p~~--~~~-----v~d~~~~~~~~i~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~-~~~ 218 (474) T protein:vir:94 152 T-----NGDIRIKNIDPYN--VIF-----VGDNILEPTYSLRYFYEKDDDNGTDYVYAEFYDNAYYYVFRGEGIDA-LQE 218 (474) T ss_pred C-----CCeeEEEEEcccc--eEE-----EEcCCCceEEEEEEEEEeeCCCceEEEEEEEEcCceEEEEeecCCCc-ccc Confidence 3 4555555444331 111 111111111111100 00011111100000 000 Q ss_pred cCCCccccccccEEEEeeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHH Q lcl|NC_017974. 203 TRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPE 282 (507) Q Consensus 203 ~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~ 282 (507) .......++.-.++.|+ .|+.|.|.+..+-...=-=+..+...+..++.+..|+++++- . ..+++.. T Consensus 219 ~~~~~~~~g~vPvv~~~-----n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g---~-----~~~~~~~ 285 (474) T protein:vir:94 219 VGRYEHLFDYNPLFGVP-----NNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLRG---M-----GMSEEMI 285 (474) T ss_pred cccccCCCCccceEEec-----CCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcc---C-----CCCchhh Confidence 00001111111123332 467789988875444333345667778888888888877651 1 1111111 Q ss_pred HHHHHHHHHHHHHhhcCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhh Q lcl|NC_017974. 283 AEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGS 362 (507) Q Consensus 283 ~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS 362 (507) ..+.. ..+..+.+.+.+++ ++..... ...+..+++.+.+.|...--+..++.+. .++ T Consensus 286 ----~~~~~--------~~~i~~~~~~~~~~-------~l~~~~~--~~~~~~~~~~l~~~I~~~s~~p~~~~~~--~~~ 342 (474) T protein:vir:94 286 ----QETQK--------SGAFELFDKDMDVK-------YLTKDVN--DTMIENHLDRIEKNIMRFAKSVNFNSDE--FNG 342 (474) T ss_pred ----hhhhh--------cceeEecCCCCcee-------EEeccCC--HHHHHHHHHHHHHHHHHHhCCccccccc--ccc Confidence 11110 11233446565543 3333322 2346778899999998754433333321 222 Q ss_pred hhHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHh-CC--CcCCcc--ccceEEecCcCchhHHHHHHHHHHHH Q lcl|NC_017974. 363 YNLSESKQTIH----TQFVQRVNEIILEALNENLLPQLLAL-ND--MRLPET--EMPYVKAGEIVDVDMEGFSKAIQRIG 433 (507) Q Consensus 363 ~Al~~vh~~v~----~~~~~aD~~~i~~~ln~~li~~l~~l-N~--~~~~~~--~~P~~~~~~~~~~dl~~~a~~~~~L~ 433 (507) ...|. .-... ...+..-.+.+...|. ++++.++.+ +. ....+. .-..+.|...-+.|..+.++++.+|+ T Consensus 343 n~Sg~-Al~~~~~~l~~k~~~~~~~~~~~l~-~~~~li~~~l~~~~~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl~ 420 (474) T protein:vir:94 343 NVPII-GMKLKLMALENKCMTFERKMTAMLR-YQFKVILSALKRKGYNLDDDSYLNLIFKFTRNIPVNKLEESQVLINLK 420 (474) T ss_pred cchHH-HHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhhccCCCCccccccceEEeCCCCCCCHHHHHHHHHHHh Confidence 22222 22222 2222333344555553 355555543 21 111111 22468899888999999999999984 Q ss_pred hcCCccccHHHHHHHHHHcCC-CCCCCc-ccccch---hhhcCCCcCCCCCCcccccCcccccccccCCCC Q lcl|NC_017974. 434 AVGYLPKTPKVINRVLEVLGI-DEKIEE-DISQEE---LMKLLGEDTSRAGDGMTKGSSGNGTGKISSTRD 499 (507) Q Consensus 434 ~~G~~~~~~~~~~~i~e~~Gi-p~~~~~-~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d 499 (507) |+ ++ .+.+.+.+++ +.+..+ +....+ .....++... +.. .+.... +.-| T Consensus 421 --g~-iS----~et~~~~l~~v~d~~~E~eri~~E~~e~~~~~~~~~~----~~~---~~~~~~---~~s~ 474 (474) T protein:vir:94 421 --GQ-VS----ERTRLGQSQLVDDVDYELDEMEKESLEFNDKLPDIDE----GDA---NDKSQN---NQSE 474 (474) T ss_pred --cc-Cc----hHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhcccccC----CCc---CCCCcc---ccCC Confidence 65 34 2456666654 322111 000011 1111111110 000 000000 0001 No 145 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=96.56 E-value=0.00051 Score=38.64 Aligned_cols=422 Identities=9% Similarity=0.005 Sum_probs=161.0 Q ss_pred CcccccccccccCccccccccccccccccccCch---hhhHHHHhhhhcccc-hhccc--hhHHHHHH------H----- Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTF---ALAKVQVDSERMKAE-EIRWP--NLIGTVES------M----- 63 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~---~~~~~~~~~~~e~~~-~lr~~--~~~~~y~~------m----- 63 (507) |+=-|=... -+.....+.....-+..+-.. ..+..+-........ .+.++ ..+..|.. . T Consensus 1 ~~~~~~~~~----~~~~~~~~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 76 (474) T protein:vir:10 1 MTLYKLIDD----IEAQGILPKHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVN 76 (474) T ss_pred CchHHHHhh----ccccCCCHHHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcc Confidence 110000000 000000000000000000000 000000000000000 00000 00001100 0 Q ss_pred --hhChHHHHHHHHHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHH-HHHHHhhcceeeeeeeec Q lcl|NC_017974. 64 --KQDATVATGLDMLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARD-AATFNEYGLSVVEKVYTQ 140 (507) Q Consensus 64 --~~D~~v~s~l~~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~-~l~a~~~Gfs~~Ei~w~~ 140 (507) .-.+...-++.....-+.+-+..+.+++ ++..++++.+++.+++... .+...... ..++.-||.+ ++++|.. T Consensus 77 ~ki~~n~~~~ivd~~~~yl~g~pv~~~~~~-~~~~~e~~~~~l~~~~~~n---~~~~~~~~~~~~~~~~G~a-~~~~~~d 151 (474) T protein:vir:10 77 NKLNNSFDSEIVDTRVGYLHGVPVTYDLDE-NAEKNEKLKKFITNFAIRN---SVDDEDSEIGKMAAICGYG-ARLAYID 151 (474) T ss_pred cccccchHHHHHHhHhhheeccceeEeeCC-CCcchHHHHHHHHHHHhhc---CHhHHHHHHHHHHhhcCeE-EEEEEeC Confidence 0133444444444444545454554433 2345667888888877653 35555555 4568889975 5688753 Q ss_pred cCccccCcceeeeeccccCchhcccccceeecCCCccccccccc------------------cccccccccccccccccc Q lcl|NC_017974. 141 VTVGEYIGKYKVKNLAFRPQASLSRTNPIVYNEDGSAIIGIKQS------------------LSAFQNYTASEIGVGGVS 202 (507) Q Consensus 141 ~~~~~~~g~~~~~~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~------------------~~~~~~~~~~~~~~~~~~ 202 (507) . +|.+.+..+.|+. ++. .++..+..+..++.. +.....+....... ... T Consensus 152 ~-----~~~~~~~~i~p~~--~~~-----v~d~~~~~~~~i~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~-~~~ 218 (474) T protein:vir:10 152 T-----NGDIRIKNIDPYN--VIF-----VGDNILEPTYSLRYFYEKDDDNGTDYVYAEFYDNAYYYVFRGEGIDA-LQE 218 (474) T ss_pred C-----CCeeEEEEEcccc--eEE-----EEcCCCceEEEEEEEEEeeCCCceEEEEEEEEcCceEEEEeecCCCc-ccc Confidence 3 4555555444331 111 111111111111100 00011111100000 000 Q ss_pred cCCCccccccccEEEEeeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHH Q lcl|NC_017974. 203 TRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPE 282 (507) Q Consensus 203 ~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~ 282 (507) .......++.-.++.|+ .|+.|.|.+..+-...=-=+..+...+..++.+..|+++++- . ..+++.. T Consensus 219 ~~~~~~~~g~vPvv~~~-----n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g---~-----~~~~~~~ 285 (474) T protein:vir:10 219 VGRYEHLFDYNPLFGVP-----NNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLRG---M-----GMSEEMI 285 (474) T ss_pred cccccCCCCccceEEec-----CCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcc---C-----CCCchhh Confidence 00001111111123332 467789988875444333345667778888888888877651 1 1111111 Q ss_pred HHHHHHHHHHHHHhhcCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhh Q lcl|NC_017974. 283 AEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGS 362 (507) Q Consensus 283 ~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS 362 (507) ..+.. ..+..+.+.+.+++ ++..... ...+..+++.+.+.|...--+..++.+. .++ T Consensus 286 ----~~~~~--------~~~i~~~~~~~~~~-------~l~~~~~--~~~~~~~~~~l~~~I~~~s~~p~~~~~~--~~~ 342 (474) T protein:vir:10 286 ----QETQK--------SGAFELFDKDMDVK-------YLTKDVN--DTMIENHLDRIEKNIMRFAKSVNFNSDE--FNG 342 (474) T ss_pred ----hhhhh--------cceeEecCCCCcee-------EEeccCC--HHHHHHHHHHHHHHHHHHhCCccccccc--ccc Confidence 11110 11233446565543 3333322 2346778899999998754433333321 222 Q ss_pred hhHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHh-CC--CcCCcc--ccceEEecCcCchhHHHHHHHHHHHH Q lcl|NC_017974. 363 YNLSESKQTIH----TQFVQRVNEIILEALNENLLPQLLAL-ND--MRLPET--EMPYVKAGEIVDVDMEGFSKAIQRIG 433 (507) Q Consensus 363 ~Al~~vh~~v~----~~~~~aD~~~i~~~ln~~li~~l~~l-N~--~~~~~~--~~P~~~~~~~~~~dl~~~a~~~~~L~ 433 (507) ...|. .-... ...+..-.+.+...|. ++++.++.+ +. ....+. .-..+.|...-+.|..+.++++.+|+ T Consensus 343 n~Sg~-Al~~~~~~l~~k~~~~~~~~~~~l~-~~~~li~~~l~~~~~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl~ 420 (474) T protein:vir:10 343 NVPII-GMKLKLMALENKCMTFERKMTAMLR-YQFKVILSALKRKGYNLDDDSYLNLIFKFTRNIPVNKLEESQVLINLK 420 (474) T ss_pred cchHH-HHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhhccCCCCccccccceEEeCCCCCCCHHHHHHHHHHHh Confidence 22222 22222 2222333344555553 355555543 21 111111 22468899888999999999999984 Q ss_pred hcCCccccHHHHHHHHHHcCC-CCCCCc-ccccch---hhhcCCCcCCCCCCcccccCcccccccccCCCC Q lcl|NC_017974. 434 AVGYLPKTPKVINRVLEVLGI-DEKIEE-DISQEE---LMKLLGEDTSRAGDGMTKGSSGNGTGKISSTRD 499 (507) Q Consensus 434 ~~G~~~~~~~~~~~i~e~~Gi-p~~~~~-~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d 499 (507) |+ ++ .+.+.+.+++ +.+..+ +....+ .....++... +.. .+.... +.-| T Consensus 421 --g~-iS----~et~~~~l~~v~d~~~E~eri~~E~~e~~~~~~~~~~----~~~---~~~~~~---~~s~ 474 (474) T protein:vir:10 421 --GQ-VS----ERTRLGQSQLVDDVDYELDEMEKESLEFNDKLPDIDE----GDA---NDKSQN---NQSE 474 (474) T ss_pred --cc-Cc----hHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhcccccC----CCc---CCCCcc---ccCC Confidence 65 34 2456666654 322111 000011 1111111110 000 000000 0001 No 146 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=96.55 E-value=0.00052 Score=38.60 Aligned_cols=416 Identities=8% Similarity=-0.009 Sum_probs=144.1 Q ss_pred CccccccccccccccccccCchhhhHH-HHhhhhcccchhc-c----chhHHHHHHHhhChHHHHHHHHHHHHHhhCCce Q lcl|NC_017974. 13 VAKATSEQINVSRMRMSEQGTFALAKV-QVDSERMKAEEIR-W----PNLIGTVESMKQDATVATGLDMLYTFVEKAFKD 86 (507) Q Consensus 13 ~~~~~~~~~~~~~~~~~~~g~~~~~~~-~~~~~~e~~~~lr-~----~~~~~~y~~m~~D~~v~s~l~~Rk~av~~~~~~ 86 (507) |..-+..... .. -..++... .... .+..+.+-...+. . ++.++............-++......+..-+ T Consensus 1 ~~~~t~~~~~-~~-l~~~~~~~-~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~~~-- 75 (456) T protein:vir:10 1 MTASTPAEWL-PV-LTKRIDDG-MSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNG-- 75 (456) T ss_pred CCCCCHHHHH-HH-HHHHHHHH-HHHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhccCC-- Confidence 1111000000 00 00000000 0000 0000111111000 0 0111110000112222222332222222222 Q ss_pred eeeCCC-CChhHHHHHHHHHHHHHhccCCCHHHHHHHH-HHHHhhcceeeeeeeeccCccccCcceeeeeccccCchhcc Q lcl|NC_017974. 87 FKVIPG-ESEESKKAAKFVEYCLKNMEGQTLRQFARDA-ATFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAFRPQASLS 164 (507) Q Consensus 87 ~~~~~~-~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~-l~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~r~~~~i~ 164 (507) |.+... +...+..+.+ .+.. ..+..+..++ .+|.-||.+ ++++|... +|...++.+.|.. ++. T Consensus 76 ~~~~~~~d~~~~~~~~~----i~~~---N~~d~~~~~~~~~a~i~G~a-y~~v~~d~-----~g~~~i~~~~p~~--~~~ 140 (456) T protein:vir:10 76 ITVGGSADSDLALRARR----IWRD---NRMDSVCKQWVKYGLDFGES-YLTCWRRD-----DGTATITADSPET--MVV 140 (456) T ss_pred eecCCCCCcchHHHHHH----HHHh---cChhhHHHHHHHHHhhcCee-EEEEeeCC-----CCceEEEEEccce--eEE Confidence 333222 1222223333 3332 2355555554 568889997 57999643 3444444333221 111 Q ss_pred ccc-----------ceeecCCCcccccccccccccccc--------ccccc-------cccccccCCCccccccccEEEE Q lcl|NC_017974. 165 RTN-----------PIVYNEDGSAIIGIKQSLSAFQNY--------TASEI-------GVGGVSTRMSDVIIPINRVMLM 218 (507) Q Consensus 165 ~~~-----------~~~~~~d~~~l~~~~q~~~~~~~~--------~~~~~-------~~~~~~~~~~~~~iP~~k~i~~ 218 (507) -|. .++...|+.....+.-.+.....+ ..... .............+|+ ++ T Consensus 141 i~d~~~~~~~~~~i~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p---vv- 216 (456) T protein:vir:10 141 SVDPLQPWRIRAAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPP---VV- 216 (456) T ss_pred EEcCCCCcceEEEEEEEEecCCceeEEEEEeccceeEEEEEEEEeecccceeeeecCCceeeccccCCCCCcee---EE- Confidence 111 011122222111110000000000 00000 0000000001111111 11 Q ss_pred eeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhc Q lcl|NC_017974. 219 NTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHS 298 (507) Q Consensus 219 ~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~ 298 (507) ...|+.|.|.+..+....=--+..+...+...+-+..|..++.-. .......++....+.... . +.. T Consensus 217 ----~~~N~~g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~-----~~~~~~~d~~g~~~~~~~-~---~~~ 283 (456) T protein:vir:10 217 ----VYQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKST-----EHGLPNVDENGNAIDYAS-I---FEA 283 (456) T ss_pred ----EecCCCCCchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhcc-----Ccccccccccccccchhh-h---hhh Confidence 125778888888765432222333444445555555555444310 000000000000011100 0 111 Q ss_pred CCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccc-cc-cchhhhhHHHHHHHHHHHH Q lcl|NC_017974. 299 GEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINV-GN-DKGGSYNLSESKQTIHTQF 376 (507) Q Consensus 299 G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts-~~-~~~GS~Al~~vh~~v~~~~ 376 (507) +..+...+|.|.++. ++.+ ++...|...++.+-.+|+..---..... +. +..+.-|+ +....-.... T Consensus 284 ~~~~~~~~~~~~~~~------q~~~----~~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~N~Sg~Ai-~~~~~~l~~k 352 (456) T protein:vir:10 284 APGALWELPPGVDIW------ESQA----NDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGA-HNIEKGFLFK 352 (456) T ss_pred hccccccCCCCcceE------Eecc----cChhHHHHHHHHHHHHHHhccCCChHHhcccccChHHHHH-HHHHHHHHHH Confidence 222334467766532 2221 1222355566666666654221111000 00 11111122 2222233334 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCC Q lcl|NC_017974. 377 VQRVNEIILEALNENLLPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDE 456 (507) Q Consensus 377 ~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~ 456 (507) +..-.+.+...|. ++++-++.+++. .+..-.++.|....+.++.+.|+++.+|+.+|+... .-+++.+|+.+ T Consensus 353 ~~~~~~~f~~~l~-~~~rl~~~~~g~--~~~~~~~v~w~~~~~~~~~~~ada~~kl~~~gi~~~-----~~~~~~lg~~~ 424 (456) T protein:vir:10 353 CEDRLSIAKIGLE-AILVKALQIEGE--SVEDTVDVSFESPDRVTLGEKYSAASLAKAAGESWA-----SIRRNILNYNA 424 (456) T ss_pred HHHHHHHHHHHHH-HHHHHHHHhcCC--CcccceeEEecCCCCcCHHHHHHHHHHHHHcCCChH-----HHHHhhCCCCH Confidence 4445556666664 466777777642 222234678888888999999999999999987322 33567788853 Q ss_pred CCCcc----cccchhhhcCCCcCCCCCCcccccCcccccccccCCCCchhhh Q lcl|NC_017974. 457 KIEED----ISQEELMKLLGEDTSRAGDGMTKGSSGNGTGKISSTRDNSAAN 504 (507) Q Consensus 457 ~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~a~~ 504 (507) .+-+. ...++.....+++...+. .+ ++. T Consensus 425 ~~i~~~e~er~~~e~~~~~~~~~~~~~--------~~------------~~~ 456 (456) T protein:vir:10 425 DQIKQDDLDRAREQITLFAGNPVQRPQ--------ED------------GSR 456 (456) T ss_pred HHHHHHHHHHHHHHHHHHhhhhhhcCC--------CC------------CCC Confidence 21110 000111111111111110 00 000 No 147 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=96.55 E-value=0.00052 Score=38.60 Aligned_cols=416 Identities=8% Similarity=-0.009 Sum_probs=144.1 Q ss_pred CccccccccccccccccccCchhhhHH-HHhhhhcccchhc-c----chhHHHHHHHhhChHHHHHHHHHHHHHhhCCce Q lcl|NC_017974. 13 VAKATSEQINVSRMRMSEQGTFALAKV-QVDSERMKAEEIR-W----PNLIGTVESMKQDATVATGLDMLYTFVEKAFKD 86 (507) Q Consensus 13 ~~~~~~~~~~~~~~~~~~~g~~~~~~~-~~~~~~e~~~~lr-~----~~~~~~y~~m~~D~~v~s~l~~Rk~av~~~~~~ 86 (507) |..-+..... .. -..++... .... .+..+.+-...+. . ++.++............-++......+..-+ T Consensus 1 ~~~~t~~~~~-~~-l~~~~~~~-~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~~~-- 75 (456) T protein:vir:10 1 MTASTPAEWL-PV-LTKRIDDG-MSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNG-- 75 (456) T ss_pred CCCCCHHHHH-HH-HHHHHHHH-HHHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhccCC-- Confidence 1111000000 00 00000000 0000 0000111111000 0 0111110000112222222332222222222 Q ss_pred eeeCCC-CChhHHHHHHHHHHHHHhccCCCHHHHHHHH-HHHHhhcceeeeeeeeccCccccCcceeeeeccccCchhcc Q lcl|NC_017974. 87 FKVIPG-ESEESKKAAKFVEYCLKNMEGQTLRQFARDA-ATFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAFRPQASLS 164 (507) Q Consensus 87 ~~~~~~-~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~-l~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~r~~~~i~ 164 (507) |.+... +...+..+.+ .+.. ..+..+..++ .+|.-||.+ ++++|... +|...++.+.|.. ++. T Consensus 76 ~~~~~~~d~~~~~~~~~----i~~~---N~~d~~~~~~~~~a~i~G~a-y~~v~~d~-----~g~~~i~~~~p~~--~~~ 140 (456) T protein:vir:10 76 ITVGGSADSDLALRARR----IWRD---NRMDSVCKQWVKYGLDFGES-YLTCWRRD-----DGTATITADSPET--MVV 140 (456) T ss_pred eecCCCCCcchHHHHHH----HHHh---cChhhHHHHHHHHHhhcCee-EEEEeeCC-----CCceEEEEEccce--eEE Confidence 333222 1222223333 3332 2355555554 568889997 57999643 3444444333221 111 Q ss_pred ccc-----------ceeecCCCcccccccccccccccc--------ccccc-------cccccccCCCccccccccEEEE Q lcl|NC_017974. 165 RTN-----------PIVYNEDGSAIIGIKQSLSAFQNY--------TASEI-------GVGGVSTRMSDVIIPINRVMLM 218 (507) Q Consensus 165 ~~~-----------~~~~~~d~~~l~~~~q~~~~~~~~--------~~~~~-------~~~~~~~~~~~~~iP~~k~i~~ 218 (507) -|. .++...|+.....+.-.+.....+ ..... .............+|+ ++ T Consensus 141 i~d~~~~~~~~~~i~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p---vv- 216 (456) T protein:vir:10 141 SVDPLQPWRIRAAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPP---VV- 216 (456) T ss_pred EEcCCCCcceEEEEEEEEecCCceeEEEEEeccceeEEEEEEEEeecccceeeeecCCceeeccccCCCCCcee---EE- Confidence 111 011122222111110000000000 00000 0000000001111111 11 Q ss_pred eeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhc Q lcl|NC_017974. 219 NTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHS 298 (507) Q Consensus 219 ~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~ 298 (507) ...|+.|.|.+..+....=--+..+...+...+-+..|..++.-. .......++....+.... . +.. T Consensus 217 ----~~~N~~g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~-----~~~~~~~d~~g~~~~~~~-~---~~~ 283 (456) T protein:vir:10 217 ----VYQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKST-----EHGLPNVDENGNAIDYAS-I---FEA 283 (456) T ss_pred ----EecCCCCCchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhcc-----Ccccccccccccccchhh-h---hhh Confidence 125778888888765432222333444445555555555444310 000000000000011100 0 111 Q ss_pred CCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccc-cc-cchhhhhHHHHHHHHHHHH Q lcl|NC_017974. 299 GEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINV-GN-DKGGSYNLSESKQTIHTQF 376 (507) Q Consensus 299 G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts-~~-~~~GS~Al~~vh~~v~~~~ 376 (507) +..+...+|.|.++. ++.+ ++...|...++.+-.+|+..---..... +. +..+.-|+ +....-.... T Consensus 284 ~~~~~~~~~~~~~~~------q~~~----~~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~N~Sg~Ai-~~~~~~l~~k 352 (456) T protein:vir:10 284 APGALWELPPGVDIW------ESQA----NDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGA-HNIEKGFLFK 352 (456) T ss_pred hccccccCCCCcceE------Eecc----cChhHHHHHHHHHHHHHHhccCCChHHhcccccChHHHHH-HHHHHHHHHH Confidence 222334467766532 2221 1222355566666666654221111000 00 11111122 2222233334 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCC Q lcl|NC_017974. 377 VQRVNEIILEALNENLLPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDE 456 (507) Q Consensus 377 ~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~ 456 (507) +..-.+.+...|. ++++-++.+++. .+..-.++.|....+.++.+.|+++.+|+.+|+... .-+++.+|+.+ T Consensus 353 ~~~~~~~f~~~l~-~~~rl~~~~~g~--~~~~~~~v~w~~~~~~~~~~~ada~~kl~~~gi~~~-----~~~~~~lg~~~ 424 (456) T protein:vir:10 353 CEDRLSIAKIGLE-AILVKALQIEGE--SVEDTVDVSFESPDRVTLGEKYSAASLAKAAGESWA-----SIRRNILNYNA 424 (456) T ss_pred HHHHHHHHHHHHH-HHHHHHHHhcCC--CcccceeEEecCCCCcCHHHHHHHHHHHHHcCCChH-----HHHHhhCCCCH Confidence 4445556666664 466777777642 222234678888888999999999999999987322 33567788853 Q ss_pred CCCcc----cccchhhhcCCCcCCCCCCcccccCcccccccccCCCCchhhh Q lcl|NC_017974. 457 KIEED----ISQEELMKLLGEDTSRAGDGMTKGSSGNGTGKISSTRDNSAAN 504 (507) Q Consensus 457 ~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~a~~ 504 (507) .+-+. ...++.....+++...+. .+ ++. T Consensus 425 ~~i~~~e~er~~~e~~~~~~~~~~~~~--------~~------------~~~ 456 (456) T protein:vir:10 425 DQIKQDDLDRAREQITLFAGNPVQRPQ--------ED------------GSR 456 (456) T ss_pred HHHHHHHHHHHHHHHHHHhhhhhhcCC--------CC------------CCC Confidence 21110 000111111111111110 00 000 No 148 >protein:vir:104500 Length: 537 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214665;genbank:gi:61806306;genbank:GeneID:3294555 Probab=96.51 E-value=0.00056 Score=38.42 Aligned_cols=446 Identities=13% Similarity=0.084 Sum_probs=171.3 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhc-cchhHHHHHHHhhChHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIR-WPNLIGTVESMKQDATVATGLDMLYTF 79 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr-~~~~~~~y~~m~~D~~v~s~l~~Rk~a 79 (507) +.+.+...++.....|.+...+.+- -.+ |..| .+.. .+. .++ ..++++.|++|..++.|-++++..... T Consensus 10 i~~~~~~~~~~s~~~~~~~dg~~~~-~~~--~~~g---~~~~--~e~--~~~~~~eLI~~YR~ma~~pEvd~Av~eIVne 79 (537) T protein:vir:10 10 LQRAKKVPKGPSFVQKDSLDGSQPI-VGG--GYFG---YSVD--FDG--TIRNDHELITRYREMVLNPECDSAVDDVVNE 79 (537) T ss_pred eecccccccCCcccCCCccccccee-ecc--cccc---cccc--ccc--ccchHHHHHHHHHHHhhccchhhHHHHhhcc Confidence 4444444444444333332222211 111 1111 1111 111 112 357899999999999999999988876 Q ss_pred HhhCCce---eeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHHHHHhhcceee-----------eeeeeccCccc Q lcl|NC_017974. 80 VEKAFKD---FKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAATFNEYGLSVV-----------EKVYTQVTVGE 145 (507) Q Consensus 80 v~~~~~~---~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l~a~~~Gfs~~-----------Ei~w~~~~~~~ 145 (507) +...+-. +.++-...+.++.+.+.|.+- |.. |..+|+.--+||..+ .++-.... - T Consensus 80 aiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eE--------F~~-Il~ll~F~~~~~e~fR~WYVDgRi~fhKiid~k~--p 148 (537) T protein:vir:10 80 TICGNFDDVPISIDLHNLKQSEKIKKLIRSE--------FDE-ILRLLDFDNRAYEIFRRWYVDGRLFFHKVIDPKK--P 148 (537) T ss_pred eeEecCCCceEEEEecccccchHHHHHHHHH--------HHH-HHHHhccchhhhHHHhhheeeeEEEEEEEEeCCC--c Confidence 5322110 111111222233333333322 222 224444444554443 33322110 1 Q ss_pred cCcceeeeeccccCchhcccccceeecCC-Cccccc----cccccccccccccccccccccccCCCccccccccEEEEee Q lcl|NC_017974. 146 YIGKYKVKNLAFRPQASLSRTNPIVYNED-GSAIIG----IKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNT 220 (507) Q Consensus 146 ~~g~~~~~~l~~r~~~~i~~~~~~~~~~d-~~~l~~----~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~ 220 (507) -+|-..++ ...|.+|.+++-+....+ +..... +.+.-..++.+.+ ... ..+...++.||. .-|+|+| T Consensus 149 k~GI~ELr---~lDPr~i~~vR~i~~~~~~~~~~~~~~~~v~~~~~eyf~ynp---~g~-~~~~~~~vkI~~-dAI~y~h 220 (537) T protein:vir:10 149 RQGLVELR---YVDPRKIRKVTEYEAKRPEALRTQDLNQQLTQQSASYFLYNP---KGL-KNSTNQGMKIAP-DSIAYCH 220 (537) T ss_pred cccceeee---eeCCccceeeEeecccCCccceEEecceeeeecccceeeecc---ccc-cccCCCceeccH-hheeeec Confidence 12333333 334445544443222111 111100 0111111111111 111 223456788887 5688888 Q ss_pred CC--ccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhh- Q lcl|NC_017974. 221 GG--SSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAH- 297 (507) Q Consensus 221 ~~--~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~- 297 (507) .. ...+++..|.|+++..++==-+.....-+.+-=--.-.=.++|+. +|..+ .....+.|+.++.-..|-. T Consensus 221 SGl~d~n~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYID--VGnLP----k~KAeqYlr~iM~k~KNklV 294 (537) T protein:vir:10 221 SGIQDLNKNMVLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYID--VGNLP----KNKAEQYLREVMGRYRNKLV 294 (537) T ss_pred ccceeCCCCeeeeeehhhhHHHHhhHHHHhhHHHHhhhccccceEEEEe--cCCCC----chhHHHHHHHHHHhccceEE Confidence 43 455678889999999887544443333333310001111244442 22222 2233345555544332210 Q ss_pred cCCceeE--------------eecC-----cccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhccccccccc Q lcl|NC_017974. 298 SGEQSFF--------------MLPS-----DTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGND 358 (507) Q Consensus 298 ~G~~a~~--------------iiP~-----g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~ 358 (507) -.+..|- -+|. |++|.+. .|+.+.. -..=|+|..+.+-+++--..--.+.. T Consensus 295 YDa~TGev~ddrk~msMlEDyWLPRReGgrgTEItTL---------pGgqnlg-em~DV~YF~kKLy~aLnVP~SRl~~e 364 (537) T protein:vir:10 295 YDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTL---------PGGQNLG-ELEDVKYFQKKLYKALNVPSSRLETE 364 (537) T ss_pred EeccCceecccchhhhhhhhhcccccCCCcccceeec---------cccCCcC-hHHHHHHHHHHHHHHhCCCccccCCC Confidence 0111111 1222 5555432 2332232 23458999999999887765333322 Q ss_pred chhhhhHH---HH-HHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCccc----cceEEecCc------CchhHH Q lcl|NC_017974. 359 KGGSYNLS---ES-KQTI-HTQFVQRVNEIILEALNENLLPQLLALNDMRLPETE----MPYVKAGEI------VDVDME 423 (507) Q Consensus 359 ~~GS~Al~---~v-h~~v-~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~----~P~~~~~~~------~~~dl~ 423 (507) ++ ..+| ++ ..|+ |...+......+...|..-|-..|+-=|.. ..+. ...+.|+.. |-.+.+ T Consensus 365 ~~--f~~Gr~~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLilKgii--t~eeW~~i~~~I~~~f~~Dn~f~ElKe~E 440 (537) T protein:vir:10 365 TT--FNIGRAAEITRDEVKFQKFIARLRKRFSELFVDLLKTQLILKGIC--SIEEWEEMKEHIQFDFIADNYFTELKEIE 440 (537) T ss_pred Cc--ccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCC--CHHHHHHHhhcceEEeeecchHHHHHHHH Confidence 22 2222 22 2222 223344444444444443333333221211 1111 122333322 234445 Q ss_pred HHHHHHHHHHhcCCccccHHHHHHHHHH-cCCCCCCCcc----------------ccc--------chhhhcCCCcCCCC Q lcl|NC_017974. 424 GFSKAIQRIGAVGYLPKTPKVINRVLEV-LGIDEKIEED----------------ISQ--------EELMKLLGEDTSRA 478 (507) Q Consensus 424 ~~a~~~~~L~~~G~~~~~~~~~~~i~e~-~Gip~~~~~~----------------~~~--------~~~~~~~~~~~~~~ 478 (507) .+.+++..|..+-=.+-.-...+||++. |.+.+.+-.+ +.. ...++.++....|. T Consensus 441 il~~Rl~~l~~~dpyvGky~s~dyi~k~ILr~tDeeI~~~~k~I~~E~k~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~ 520 (537) T protein:vir:10 441 IRNERMNEVAQMDPYVGKYFSANYIRTKVLKQTESEIKEIDKEIKQEIADGVIMDPQAMQAMEMGIGDEEPVPEGGEEPQ 520 (537) T ss_pred HHHHHHHHHHHhhhhhhcccchHHHHHHHhccCHHHHHHHHHHHHHHhhCCCCCCcccccccccCCCCcccCCCCCCCcc Confidence 5555666555432111111124566543 4543211000 000 00000001100111 Q ss_pred CCcccccCcccccccccCCCCchhhhc Q lcl|NC_017974. 479 GDGMTKGSSGNGTGKISSTRDNSAANL 505 (507) Q Consensus 479 ~~~~~~~~~~~~~~~~~~~~d~~a~~~ 505 (507) +..++++. |+.+ .---+ T Consensus 521 ----~~~~~~~~----~~~~--~~~~~ 537 (537) T protein:vir:10 521 ----TDPNSAVS----PADQ--KRGEL 537 (537) T ss_pred ----cCCccCCC----CCCc--cCCCC Confidence 11111111 1111 11111 No 149 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=96.44 E-value=0.00062 Score=38.17 Aligned_cols=417 Identities=11% Similarity=0.017 Sum_probs=150.1 Q ss_pred ccccccccccccccccCchhhhHH-HHhhhhcccchhcc-chhHHHHHHHhhChHHHHHHHHHHHHHhhCCceeeeCCCC Q lcl|NC_017974. 16 ATSEQINVSRMRMSEQGTFALAKV-QVDSERMKAEEIRW-PNLIGTVESMKQDATVATGLDMLYTFVEKAFKDFKVIPGE 93 (507) Q Consensus 16 ~~~~~~~~~~~~~~~~g~~~~~~~-~~~~~~e~~~~lr~-~~~~~~y~~m~~D~~v~s~l~~Rk~av~~~~~~~~~~~~~ 93 (507) .+.........-..++... .... .+..+.+-.+.++. +.. .-+++ +|-.+. .+-=+..|....=.+.+.+-. T Consensus 1 ~~~~~~~~i~~l~~~~~~~-~~r~~~l~~Yy~G~~~i~~~~~~--~~~~~-~~~k~~--~n~~~~ivd~~~~~l~~~g~~ 74 (441) T protein:vir:80 1 MNSDELALIEGMYDRIQRL-SSWHCCIEGYYEGSNRVRDLGVA--IPPEL-QRVQTV--VSWPGIAVDALEERLDWLGWT 74 (441) T ss_pred CCccHHHHHHHHHHHHHHH-HHHHHHHHHHHhcCCcchhcCcc--cchhh-hhhhhh--cchHHHHHHHHHhhhcccccc Confidence 1110000000000011100 0000 00111111111110 000 00111 111111 111111221110011222212 Q ss_pred ChhHHHHHHHHHHHHHhccCCCHHHHHHHH-HHHHhhcceeeeeeeeccCccccCcceeeeeccccCchhccccc----- Q lcl|NC_017974. 94 SEESKKAAKFVEYCLKNMEGQTLRQFARDA-ATFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAFRPQASLSRTN----- 167 (507) Q Consensus 94 ~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~-l~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~r~~~~i~~~~----- 167 (507) .+++.. +.+++.. ..|..++.++ .++..||.| ++++|... +|...++.+.|+. ++.-|. T Consensus 75 ~~d~~~----l~~i~~~---n~~~~~~~~~~~~~~~~G~a-~~~v~~d~-----~g~~~i~~~~p~~--~~~i~d~~~~~ 139 (441) T protein:vir:80 75 NGDGYG----LDGVYAA---NRLATASCDVHLDALIFGLS-FVAIIPHG-----DGTVSVRPQSPKN--CTGKFSADGSR 139 (441) T ss_pred CCChHH----HHHHHHh---cCHHHHHHHHHHHHhhcCee-EEEEEeCC-----CCceEEEEEccce--EEEEEeCCCCc Confidence 222233 3333332 3466776665 468889987 56888633 4555554443321 110000 Q ss_pred -----ceeecCCCccccccc-cccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHHHHHHH Q lcl|NC_017974. 168 -----PIVYNEDGSAIIGIK-QSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYRAWR 241 (507) Q Consensus 168 -----~~~~~~d~~~l~~~~-q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~~~~ 241 (507) .+++..++. ..... -.+.....+....... ..........++.--++.|.+..+.+.|+|.|-+..-+...+ T Consensus 140 ~~~~~~~~~~~~~~-~~~~~vy~~~~~~~~~~~~~~~-~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~l~~~v~~li 217 (441) T protein:vir:80 140 LDAGLVVQQTCDPE-VVEAELLLPDVIVQVERRGSRE-WVEVDRIPNVLGAVPLVPIVNRRRTSRIDGRSEITRSIRAYT 217 (441) T ss_pred eeEEEEEEEEecCc-eEEEEEEecCeEEEEEEcCCcc-eeeccccccCCCceeEEEeeccccCCccCCcccchhhHHHHH Confidence 001111111 00000 0011111000000000 000111111222233566777888899999886543222222 Q ss_pred -HHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCccccccccccee Q lcl|NC_017974. 242 -EKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYSMT 320 (507) Q Consensus 242 -fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~~e 320 (507) -=+..+..++...+.++.|.++++ |-....... . ..+ . .......+|.+.+-+. ++ T Consensus 218 Da~~~~~s~~~~~~~~~~~~~~~i~-----G~~~~~~~~-~---~~~----~------~~~~i~~~~~~~~~~~----~~ 274 (441) T protein:vir:80 218 DEAVRTLLGQSVNRDFYAYPQRWVT-----GVSADEFSQ-P---GWV----L------SMASVWAVDKDDDGDT----PN 274 (441) T ss_pred HHHHHHHHHHHHHHHhhcCceeeee-----cCCcccccc-c---hhh----h------cccccccCCCCCCCCc----ce Confidence 124455667778888998887765 110001111 1 001 0 1113344555443222 22 Q ss_pred eeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccc-cchh----h-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 321 LKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGN-DKGG----S-YNLSESKQTIHTQFVQRVNEIILEALNENLLP 394 (507) Q Consensus 321 l~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~-~~~G----S-~Al~~vh~~v~~~~~~aD~~~i~~~ln~~li~ 394 (507) +...+.+ + ...++++++.-|.+..-...+.... +..| | -|+. ....-....+..-.+.+...|. ++++ T Consensus 275 ~~~~~~~-~---~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~~~~Sg~Al~-~~~~~l~~k~~~~~~~f~~~l~-~~~~ 348 (441) T protein:vir:80 275 VGSFPVN-S---PTPYSDQMRLLAQLTAGEAAVPERYFGFITSNPPSGEALA-AEESRLVKRAERRQTSFGQGWL-SVGF 348 (441) T ss_pred eEecCcc-c---hHHHHHHHHHHHHHHhcccCCCHHHhccCCCcchHHHHHH-HHHHHHHHHHHHHHHHHHHHHH-HHHH Confidence 3232222 1 2345566655554433222121110 1111 1 1211 1222222333333444445553 3445 Q ss_pred HHHHhCCCcCC-cc--ccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhhcC Q lcl|NC_017974. 395 QLLALNDMRLP-ET--EMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMKLL 471 (507) Q Consensus 395 ~l~~lN~~~~~-~~--~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~~~ 471 (507) -++.+-+.... .. .-..++|....+.++.+.++++.+|+..|....+ .+.+++.+|.++.+-++...+. . . T Consensus 349 l~~~~~~~~~~~~~~~~~i~~~f~~~~~~~~~e~ad~~~kl~~~g~~~~s---~~~~~~~l~~~~~e~~~~~~e~-~--e 422 (441) T protein:vir:80 349 LAAKALDSRVDEADFFGDVGLRWRDASTPTRAATADAVTKLVGAGILPAD---SRTVLEMLGLDDVQVEAVMRHR-A--E 422 (441) T ss_pred HHHHHhcCCCcccccceeeeEEeCCCCCcCHHHHHHHHHHHHhcCccccc---HHHHHHhCCCCHHHHHHHHHHH-H--H Confidence 55554211111 11 1236789999999999999999999999986543 2346788888643221111000 0 0 Q ss_pred CCcCCCCCCcccccCcccccccccCCCCch Q lcl|NC_017974. 472 GEDTSRAGDGMTKGSSGNGTGKISSTRDNS 501 (507) Q Consensus 472 ~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 501 (507) ..+. .+.. .+.....+|.. T Consensus 423 ~~~~------~~~~-----~~~~~~~~~~~ 441 (441) T protein:vir:80 423 SSDP------LAVL-----AGAISRQTNEV 441 (441) T ss_pred HHHH------HHHH-----hhhhhcccccC Confidence 0000 0000 00000111111 No 150 >protein:vir:80165 Length: 651 # NCBI annotation: portal protein # Family: family:all:1548 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285799;genbank:gi:148747833;genbank:GeneID:5220441 Probab=96.35 E-value=0.00072 Score=37.82 Aligned_cols=465 Identities=12% Similarity=0.049 Sum_probs=185.3 Q ss_pred ccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHH----H---------------------hh Q lcl|NC_017974. 11 VKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVES----M---------------------KQ 65 (507) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~----m---------------------~~ 65 (507) .+++.++....-........+++.....+.-+..+.+.-+..|-+.++.|.. | .- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~w~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~rs~~~ 80 (651) T protein:vir:80 1 MKLATTTTDKNRQTYDETHDVSSYVKKEYKRFCDARQVCEETWLEAWGMYLSTPEAQDYLRDQVLRSVGDVNADWRHKIT 80 (651) T ss_pred CcccccccchhhhhhhhhHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHhhcccHHHHHhhccccccccCCCCCCCCcccc Confidence 3333333211001111111233344444444444333322233222222211 0 12 Q ss_pred ChHHHHHHHHHHHHHhhC----CceeeeCCC-CChhHHHHHHHHHHHHHh-ccCCCHHHHHHHH-HHHHhhcceeeeeee Q lcl|NC_017974. 66 DATVATGLDMLYTFVEKA----FKDFKVIPG-ESEESKKAAKFVEYCLKN-MEGQTLRQFARDA-ATFNEYGLSVVEKVY 138 (507) Q Consensus 66 D~~v~s~l~~Rk~av~~~----~~~~~~~~~-~~~~~~~~ae~v~~~l~~-~~~~~~~~~l~~~-l~a~~~Gfs~~Ei~w 138 (507) ++.|...++.+...++.. +--|.+.|. +..++++.++.|+.++.. +...+|...+..+ .|++.+|.+++=+.| T Consensus 81 ~~~v~~~ve~~~~~l~~~~~~~~~~~~~~p~~~~d~a~~~~~~~~~~~~~~l~~~~~~~~~~~~~~d~l~~G~~i~kv~w 160 (651) T protein:vir:80 81 TGKAFEAIETIHAYLMSATFPNKNWFDVVPAKPGQDNLLVSRLIKRYVQDKLTEGKFRAAYANFLRQLLITGNSVLALPW 160 (651) T ss_pred ChhHHHHHHHHHHHHHHhhcCCCceeEeccCCchhHHHHHHHHHHHHHHHHhhccCcHHHHHHHHHhhcccCceEEEEee Confidence 357888888888877764 111444443 334456677778877764 3345688777775 689999999887777 Q ss_pred eccC----------------ccccC---------cceeeeecccc----Cch--hcccc----cceeecCC-------Cc Q lcl|NC_017974. 139 TQVT----------------VGEYI---------GKYKVKNLAFR----PQA--SLSRT----NPIVYNED-------GS 176 (507) Q Consensus 139 ~~~~----------------~~~~~---------g~~~~~~l~~r----~~~--~i~~~----~~~~~~~d-------~~ 176 (507) .... +..+. |...+..+.|. ++. .+... +.++...+ |. T Consensus 161 e~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~i~~v~p~~~~~dp~a~~~~d~~~v~~~~~t~~~l~~l~~~g~ 240 (651) T protein:vir:80 161 RVETAEVKKKVQVRTPLFEDEPTFEVVSEEREVKSSPDFEVLDMFDCFYDPNVTDPNRGAFIRKLTKTKADILNLLSEGY 240 (651) T ss_pred cceeeeeehheeccccccccccceeeeccceeeeceeEEEEecHHHeeecCCCcCccccceeeeeeeeHHHHHHHHhccc Confidence 6431 00111 11111111100 000 00000 00000000 00 Q ss_pred c-----cccccc------------------------c-ccccccc---cc----cccccccccc-CCCc----ccccc-- Q lcl|NC_017974. 177 A-----IIGIKQ------------------------S-LSAFQNY---TA----SEIGVGGVST-RMSD----VIIPI-- 212 (507) Q Consensus 177 ~-----l~~~~q------------------------~-~~~~~~~---~~----~~~~~~~~~~-~~~~----~~iP~-- 212 (507) . ...+.. . .+....+ .. +......... .+.. ...|. T Consensus 241 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~v~v~E~~~~~d~e~~~~~~~~v~~~g~~il~~~~~~~~~ 320 (651) T protein:vir:80 241 YYGVDPLDVVEHKCKDTSDTKQDMLSTFQGVTTSLWSPHQNVELLEYWGDIHLENKTYHDVVVTIMGNEVLRFEQNPYWC 320 (651) T ss_pred ccchhhHHHHhhhccccccCCccccccccCCCccccccccceEEEEEEEEeeccCCceEEEEEEEcCcEEecccccCCCC Confidence 0 000000 0 0000000 00 0000000000 0000 12222 Q ss_pred -ccEEEEeeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHH Q lcl|NC_017974. 213 -NRVMLMNTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMS 291 (507) Q Consensus 213 -~k~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~ 291 (507) .-|++++.....+..||.|....+...-...+...+..+..+.+...|...+ +.. ....++ .+ T Consensus 321 ~~Pf~~~~~~~~~~~~yG~g~~~~~~~~q~~ln~l~~~~ld~~~~~~~~~~~v--~~d----~~~~~~--------~l-- 384 (651) T protein:vir:80 321 GRPFVIGTYIPTARQPYAMGALQPNLGMLHELNIITNQRLDNLELAIDQMYTL--RSD----GLLQPE--------DV-- 384 (651) T ss_pred CCCeeeecceecCccccCCChHHHHhHHHHHHHHHHHHHHHHHHHHhCCcEEe--cCC----ccccHH--------Hh-- Confidence 3588999999999999999999999999999999999988888877765432 211 001111 01 Q ss_pred HHHHhhcCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccc-h-hhhhHHHHH Q lcl|NC_017974. 292 DAANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDK-G-GSYNLSESK 369 (507) Q Consensus 292 ~~~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~-~-GS~Al~~vh 369 (507) .+ +. ++++--+...++ ..+.. +.........++++++..|..+.+-..+..+.+. + +.--+++|+ T Consensus 385 --~~-~p----g~vi~~~~~~~~-----~~l~~-~~~~~~~~~~~l~~l~~~~~~~~gv~~~~~g~~~~~~~~~TAteI~ 451 (651) T protein:vir:80 385 --YT-EP----GKVFLVSDHGDL-----QPLAN-QSSNFSITYQESSFLESTIDKNFGTGNYVGANAARSGERVTAAEVA 451 (651) T ss_pred --hc-CC----CceEEecCCCCc-----eeecc-CcccchhHHHHHHHHHHHHHHHhcCChHHhCCCccchhhccHHHHH Confidence 01 11 122222222111 11111 1111222346899999999887765544443221 1 111123333 Q ss_pred H--HHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCccccceEE--------ecCcCch------------------h Q lcl|NC_017974. 370 Q--TIHTQFVQRVNEIILEALNENLLPQLLALNDMRLPETEMPYVK--------AGEIVDV------------------D 421 (507) Q Consensus 370 ~--~v~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~~P~~~--------~~~~~~~------------------d 421 (507) . +.....+..-.+.+..++-+.|+..++.++..++...+.|++. |...... . T Consensus 452 ~~~~~~~~~l~~v~~~l~~e~l~pl~~r~l~l~~~~~~~~~~~ri~~~~~~~~~~~~i~~~dl~~~~~iv~~g~~~~~~r 531 (651) T protein:vir:80 452 AVREAGGNRLSGIHKHIEETSLLVLLEKVMHLVQQFTDQPGMVRVAGDEAGAYEYYELDVEDLQKEVRLVPIGSDHVIER 531 (651) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcccceeecccccccccccccCccceeeeeeeeeccHHHHHHH Confidence 3 2334455556666666666667777777775444433333331 1001111 1 Q ss_pred HHHHHHHHHHHHhcCCcccc-------HHHHHHHHHHcCCCCCCCcccccchhhhcCCCc--CCCC-CCcccccCccccc Q lcl|NC_017974. 422 MEGFSKAIQRIGAVGYLPKT-------PKVINRVLEVLGIDEKIEEDISQEELMKLLGED--TSRA-GDGMTKGSSGNGT 491 (507) Q Consensus 422 l~~~a~~~~~L~~~G~~~~~-------~~~~~~i~e~~Gip~~~~~~~~~~~~~~~~~~~--~~~~-~~~~~~~~~~~~~ 491 (507) ...+++.+ .+..++...+. ......+.+..|++.+..--..++..++..++. ...+ ..+......- . T Consensus 532 ~~~~~~l~-~~~q~~~~~p~~~~~~~~~~~~~~l~~~~g~~~~~~~l~~~~q~~~~~~~~~~~~q~~~~~~~a~~~~--~ 608 (651) T protein:vir:80 532 KQYIEDRL-TFIQAVAQVPEMGQLVDYKRILVDLLQHWGFEEPEAYLKQQDQQAPANPQEALLSQAKDVGGQAMSNM--L 608 (651) T ss_pred HHHHHHHH-HHHHhhccCCccchhhhHHHHHHHHHHHcCCCCcHHhcCCCccchhhhhhHHHHhhHHHHHHHHHHHH--H Confidence 11222222 22332222221 112234667889875422100000111000000 0000 0000000000 0 Q ss_pred cccc--CCCCchhhhccC Q lcl|NC_017974. 492 GKIS--STRDNSAANLDN 507 (507) Q Consensus 492 ~~~~--~~~d~~a~~~~~ 507 (507) ..+. .+--...+.+.. T Consensus 609 ~~~~~~~~~~~~~~~~~~ 626 (651) T protein:vir:80 609 QNQLQADGGTQMMSEMYG 626 (651) T ss_pred HHHHHHHHHHHHHHHHHH Confidence 0000 000000011111 No 151 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=96.25 E-value=0.00082 Score=37.49 Aligned_cols=409 Identities=9% Similarity=-0.080 Sum_probs=156.4 Q ss_pred cccccCchhhhHHH-Hhhhhccc-chh-ccchhHHHHHH-H-hhChHHHHHHHHHHHHHhhCCceeeeCCCCChhHHHHH Q lcl|NC_017974. 27 RMSEQGTFALAKVQ-VDSERMKA-EEI-RWPNLIGTVES-M-KQDATVATGLDMLYTFVEKAFKDFKVIPGESEESKKAA 101 (507) Q Consensus 27 ~~~~~g~~~~~~~~-~~~~~e~~-~~l-r~~~~~~~y~~-m-~~D~~v~s~l~~Rk~av~~~~~~~~~~~~~~~~~~~~a 101 (507) -+..+-..-...+. +..+.+.. +.+ +......-... - .......-++.+...-+.+-+..|.+.. ..+.+.. T Consensus 1 ~~~~~~~~~~~r~~~l~~yy~g~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~~---~~~~~~~ 77 (440) T protein:vir:95 1 MLAAFLGSQKQRLAILASYAQGDNFSILSGHRRLDDEKADYRVRHKWGGYISSFATGYVIGNPVSIGVME---GGSADQL 77 (440) T ss_pred ChhhHHHHHHHHHHHHHHHhccCCcccccccccccccCCcceeecchHHHHHHhhhhheeccCceEeeCC---CccHHHH Confidence 01110000000110 00111000 000 00000000000 0 0123334455555555555555554432 2334555 Q ss_pred HHHHHHHHhccCCCHHHHHHHH-HHHHhhcceeeeeeeeccCccccCcceeeeeccccCchhcccccceeecCCC--ccc Q lcl|NC_017974. 102 KFVEYCLKNMEGQTLRQFARDA-ATFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAFRPQASLSRTNPIVYNEDG--SAI 178 (507) Q Consensus 102 e~v~~~l~~~~~~~~~~~l~~~-l~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~r~~~~i~~~~~~~~~~d~--~~l 178 (507) +++.+++..- .+......+ .++.-||.+. +++|... +|.+.+.-+.|+. .+. .+++.+ ..+ T Consensus 78 ~~l~~~~~~n---~~~~~~~~~~~~~~~~G~a~-~~~~~d~-----~~~~~i~~~~p~~--~~~-----~~d~~~~~~~~ 141 (440) T protein:vir:95 78 STIKDIEWQN---DINALNSDLAFDASVYGRAY-EYHFRDK-----DKVDRVVLISPLE--MFV-----IRDLTVEQNII 141 (440) T ss_pred HHHHHHHHhc---CHhHHHHHHHHHHhhcCeEE-EEEEecC-----CCceEEEEEcccc--eEE-----EEcCCCCCceE Confidence 6676665542 355555444 4677899975 4666532 4555544433321 111 111111 011 Q ss_pred ccccc---ccc-ccccccccccc-ccccc------cCCCccccccc--cEEEEeeCCccCccccchHHHHHHHHHHHHHH Q lcl|NC_017974. 179 IGIKQ---SLS-AFQNYTASEIG-VGGVS------TRMSDVIIPIN--RVMLMNTGGSSSQALGVSPLVGCYRAWREKIL 245 (507) Q Consensus 179 ~~~~q---~~~-~~~~~~~~~~~-~~~~~------~~~~~~~iP~~--k~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~ 245 (507) -.++. .-. ....++..... ..... ......+-|.. -++.|+ ++..|.|.+..+....=-=+. T Consensus 142 ~~i~~~~~~~~~~~~vyt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~g~sd~e~v~~lida~~~ 216 (440) T protein:vir:95 142 AAVHLPIYADKVNMTVYTKDKVITYKPYSNNSVRLVVDDVKKHSYNDVPVVEWW-----NNRFRMGDYESEISLIDAYDA 216 (440) T ss_pred EEEEEEEecCceEEEEEeCCeEEEEEEecCCccceeecceeeccCceeeEEEee-----CCCCCCCchhhhHHHHHHHHH Confidence 11110 000 00011111000 00000 00011111111 123333 356788888876654444455 Q ss_pred HHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCccccc--ccccceeeee Q lcl|NC_017974. 246 IENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDN--APQYSMTLKG 323 (507) Q Consensus 246 ~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~--~~~~~~el~~ 323 (507) .+..++..++.|..|+++++-.... ...+.+........ ..+.++...... .....++++. T Consensus 217 ~~s~~~~~~~~~~~~~~v~~g~~~~----~~~~~e~~~~~~~~-------------~~~~~~~~~~~~~~~~~~~~~~lt 279 (440) T protein:vir:95 217 GQSDTANYMSDLNDAMLLVKGDLDG----IKLSPEDAAKMKDA-------------NMLFLKTGISTTGQQTTADASYIY 279 (440) T ss_pred HHHHHHHHHHHhhcceeeeeccccc----CCCCccchhhhhhc-------------cceecccccccccCCCCcceeEEe Confidence 6788888899999998887632110 01111111111111 111111111000 0011233333 Q ss_pred ccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_017974. 324 IDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQTIH----TQFVQRVNEIILEALNENLLPQLLAL 399 (507) Q Consensus 324 ~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~~v~----~~~~~aD~~~i~~~ln~~li~~l~~l 399 (507) .+. ....+..+++.+.+.|...--...++.+.. ++...|. ..+.. ...+..-.+.+...+. ++++.++.+ T Consensus 280 ~~~--~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~--~~n~Sg~-Al~~~~~~l~~k~~~k~~~~~~~l~-~~~~li~~~ 353 (440) T protein:vir:95 280 KQY--DVNGTEAYKNRLANDIHRFSRIPNLDDDRF--NSTSSGI-ALLYKMIGLEQVRKDKETYFTKALR-RRYELISNI 353 (440) T ss_pred ecC--CHHHHHHHHHHHHHHHHHHhCCcccccccc--cccchHH-HHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHH Confidence 322 223467788999998877654444443221 1111122 12111 1222223344455553 344554432 Q ss_pred -CCCcCCc--cccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCC-cccccchhhhcCCCcC Q lcl|NC_017974. 400 -NDMRLPE--TEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIE-EDISQEELMKLLGEDT 475 (507) Q Consensus 400 -N~~~~~~--~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~-~~~~~~~~~~~~~~~~ 475 (507) +...+.. ..-..+.|....+.+..+.++++.+| .|+ ++. +.+.+.++.-.+.. .+....+......+.. T Consensus 354 ~~~~~~~~~~~~~v~i~f~~~~p~~~~~~ad~~~kl--~g~-iS~----et~~~~l~~~d~~~E~~ri~~E~~~~~~~~~ 426 (440) T protein:vir:95 354 HKAINGPVIEANKLTFTFHPNIPQDVWTEIKAYIEA--GGE-ISQ----ETLMENASFTDYKTEHSRILKQGGSSDLEIG 426 (440) T ss_pred HhhcCCcccccccceEEeCCCCCCCHHHHHHHHHHH--hcc-CcH----HHHHHhCCCCCcHHHHHHHHHHHHHhhhhHH Confidence 2111222 23357889999999999999999998 465 442 34555565432211 1111111111111111 Q ss_pred CCCCCcccccCccc Q lcl|NC_017974. 476 SRAGDGMTKGSSGN 489 (507) Q Consensus 476 ~~~~~~~~~~~~~~ 489 (507) ...++....+...| T Consensus 427 ~~~~~~~~~~~~~e 440 (440) T protein:vir:95 427 QIVGDADVGQADTE 440 (440) T ss_pred hhccCCCCCCcCCC Confidence 11111111001111 No 152 >protein:vir:6382 Length: 553 # NCBI annotation: portal protein Lambda B # Family: family:all:47 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918995;genbank:gi:34610170;genbank:GeneID:2559575 Probab=96.17 E-value=0.00092 Score=37.22 Aligned_cols=460 Identities=10% Similarity=0.064 Sum_probs=180.8 Q ss_pred CcccccccccccCccccccccccccccccccCchh--hhHHHHh---hhhcccchhccchhHHHHHHHh-hChHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFA--LAKVQVD---SERMKAEEIRWPNLIGTVESMK-QDATVATGLD 74 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~--~~~~~~~---~~~e~~~~lr~~~~~~~y~~m~-~D~~v~s~l~ 74 (507) |.+...+.-+...+.............+..-...+ ...|+.. ...+-... +..+-.--+++. .++++.++++ T Consensus 1 m~~~~~r~~~~~a~~~~~~~~~~~~~~y~gA~~~~r~~~~w~~~~~s~~~~~~~~--~~~lr~RaRdL~rNn~~a~~av~ 78 (553) T protein:vir:63 1 MTKVTVRKLSEVTSGRPEQSASLGGGGLEGASRLSRETVSWNPSLRSPDALINPL--KRIADARGRDMADNDGFTNGAVG 78 (553) T ss_pred CcchhhhhhcccccccchhhhhhhcccccccccCCCcccccccCCCChHHHHHHH--HHHHHHHHHHHHhcChHHHHHHH Confidence 66555444322211110000000000000000000 0001100 00110111 111222234443 5999999999 Q ss_pred HHHHHHhhCCceeeeCCC-------CChhHHHHHHHHHHHHHhc-----------cCCCHHHHHHHHHHH-Hhhcceeee Q lcl|NC_017974. 75 MLYTFVEKAFKDFKVIPG-------ESEESKKAAKFVEYCLKNM-----------EGQTLRQFARDAATF-NEYGLSVVE 135 (507) Q Consensus 75 ~Rk~av~~~~~~~~~~~~-------~~~~~~~~ae~v~~~l~~~-----------~~~~~~~~l~~~l~a-~~~Gfs~~E 135 (507) .....|-+..+...+.++ +...+++..+.|+..++.- ...+|+.+...++.+ +.-|=+++- T Consensus 79 ~~~~nvVG~Gi~~~~~~~~~~l~g~~~~~~~~~~~~ie~~w~~wa~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~~ 158 (553) T protein:vir:63 79 YQRDSIVGAQYRLNSMPDINVIPGATEEWAEEYQTIVEAKFELYAESLACYIDNAAISTFTGLIRLGVVGYVKTGEVLAT 158 (553) T ss_pred HHHHhhccCCceeeeccchhhhcCCCHHHHHHHHHHHHHHHHHhcCCccceeeccccCCHHHHHHHHHHHHHhCCceEEE Confidence 999999888877766542 2233445555555444321 122577777777654 557878877 Q ss_pred eeeeccCccccCcceeeeeccccCchhccc----------ccceeecCCCccccccccccccccccccccccc-----cc Q lcl|NC_017974. 136 KVYTQVTVGEYIGKYKVKNLAFRPQASLSR----------TNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGV-----GG 200 (507) Q Consensus 136 i~w~~~~~~~~~g~~~~~~l~~r~~~~i~~----------~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~-----~~ 200 (507) +.|....+..++-++ ...+++-|.. .+-+.|+.+|+-+-++... .++++... .. T Consensus 159 ~~~~~~~~~~~~~~l-----q~ie~drl~~~~~~~~~~~i~~GVE~d~~Gr~vaY~i~~------~hPgd~~~~~~~~~~ 227 (553) T protein:vir:63 159 AEWDRAANRPYATCF-----QMVSTDRLSNPYQQLDTPTLRRGVQYDKRGRPQGYWIQV------AHPGDLYQMAPDMYK 227 (553) T ss_pred eeeccCCCCcccceE-----EEechhhcCCCCCCCCCCeeEeeeEECCCCceEEEEeec------cCCCccccccccccc Confidence 888765433222121 1122222211 1234555555544333211 11111110 00 Q ss_pred cccCCCccccccccEEEEeeCC-ccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCce-EEeccccc---CCC-C Q lcl|NC_017974. 201 VSTRMSDVIIPINRVMLMNTGG-SSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVI-ELKIPSQI---LNK-A 274 (507) Q Consensus 201 ~~~~~~~~~iP~~k~i~~~~~~-~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~-v~~~P~~~---~k~-~ 274 (507) +........+|... |+|.+.. +.+..-|.+.|.++.....-........++-. |-..-+. +++.+..- ... . T Consensus 228 ~~r~~~~~~v~a~~-vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~daeL~~a-~i~A~~a~fi~~~~~~~~~~~~~~ 305 (553) T protein:vir:63 228 WKFVQQSKPWGRRQ-VIHILEPREPDQSRGIADIVSGLKDMRMAKRFKEMSLQNA-VINASYAAAIESELPPEFIHSQMS 305 (553) T ss_pred eeeeccccccChhH-heecccccCCCcccCCchHHHHHHHHHHHhHHHHHHHHHH-HHhhhheeeeecCCChhhhhhhcc Confidence 01111123455544 5666654 57888899999888765433322222222221 1111111 22211100 000 0 Q ss_pred CCCchHHHH----HHHHHHHHH---HHHhhcCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHH Q lcl|NC_017974. 275 AMDQSSPEA----EMVRGLMSD---AANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDR 347 (507) Q Consensus 275 ~~~~~~~~~----~~l~a~~~~---~~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~ 347 (507) ...+..+.. ......... ..++..+......++.|.++++. ..+..+. .|..|.+..-+.|+.. T Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~-------~p~~p~~--~~~~F~~~~lr~iaag 376 (553) T protein:vir:63 306 GGSPNADMVGIFGKYMDALKAYVGGANNIQIDGAKIPHLFPGTKLNLK-------PMGTPGG--VGSEFEASLNRHLASA 376 (553) T ss_pred cccccccccccccccccccccccccccceeecCceeeecCCCCeeeec-------CCCCCCC--CHHHHHHHHHHHHHhh Confidence 000000000 000000000 00011112244556777765543 3332222 3677888888998887 Q ss_pred Hhc--ccccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---hCCC-cCCccc------------- Q lcl|NC_017974. 348 LGA--GFINVGNDKGGSYNLSESKQTIHTQFVQRVNEIILEALNENLLPQLLA---LNDM-RLPETE------------- 408 (507) Q Consensus 348 iLG--qtLts~~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~---lN~~-~~~~~~------------- 408 (507) +.- +.||. +-.+.|||.+-.-..-+....+.....+...+-+-+...+++ +++. ..|+.. T Consensus 377 lGi~Ye~lt~-D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~a~l~G~i~~p~~~~~~~~~~p~~~~a 455 (553) T protein:vir:63 377 FGMSYEEFTR-DFSKANYSSIQAGIAMTRRFLEGRKKMCADRLATEFFTLWLEEAIAAGEVPMPPGQTRDLFYQPLMKEA 455 (553) T ss_pred cCCCHHHHhh-hcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCcccchhhcchhhhhh Confidence 632 22433 323345654443333333444444444444444433333332 3321 112110 Q ss_pred c--ceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHH----------------HHHHHHcCCCCCCCcccccchhhhc Q lcl|NC_017974. 409 M--PYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVI----------------NRVLEVLGIDEKIEEDISQEELMKL 470 (507) Q Consensus 409 ~--P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~----------------~~i~e~~Gip~~~~~~~~~~~~~~~ 470 (507) + ..+........|..+-+++....+.+|+.....+.. ....+++||+.+.+... T Consensus 456 ~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~t~~~~~a~~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~-------- 527 (553) T protein:vir:63 456 LSKCEWIGASQGQIDQLKETQAAVMRIDAGLSTYEREIARLGGDFRKSFAQRAREDALLKKYGLTFNLSAKR-------- 527 (553) T ss_pred hhceeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCCCCcc-------- Confidence 0 022333344567677778888888999865521110 11223344432211100 Q ss_pred CCCcCCCCCCcccccCcccccccccCCCCchhhhccC Q lcl|NC_017974. 471 LGEDTSRAGDGMTKGSSGNGTGKISSTRDNSAANLDN 507 (507) Q Consensus 471 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~a~~~~~ 507 (507) +.+ ++ ..++...+.++.+++-++ T Consensus 528 ------~~~-------~~-~~~~~~~~~~~~~~~~~~ 550 (553) T protein:vir:63 528 ------SLG-------DG-RDAATGIAEDPAAAQTSQ 550 (553) T ss_pred ------ccC-------CC-cccCCCCCCCCCCCCccc Confidence 000 00 000111111111111111 No 153 >protein:vir:96738 Length: 505 # NCBI annotation: putative phage-related protein # Family: family:all:47 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039817;genbank:gi:126010916;genbank:GeneID:5076248 Probab=96.13 E-value=0.00096 Score=37.12 Aligned_cols=454 Identities=11% Similarity=0.048 Sum_probs=177.6 Q ss_pred Ccccccccccc-cCccccccccccccccccccCchhhhHHHHhh-------hhcccchhc--cchhHHHHHHHh-hChHH Q lcl|NC_017974. 1 MSRKRNRNRSV-KVAKATSEQINVSRMRMSEQGTFALAKVQVDS-------ERMKAEEIR--WPNLIGTVESMK-QDATV 69 (507) Q Consensus 1 ~~~k~~~~~~~-~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~-------~~e~~~~lr--~~~~~~~y~~m~-~D~~v 69 (507) |.|-+.+..-. ++..+....- ..+.+..-.+..+....+... ......+++ +..+..--+++. .++++ T Consensus 1 ~~r~~~~~~~~dr~i~~~~~~~-~~~~~~~~~~y~aa~~~r~~~~w~~~~~~~s~~~~i~~~~~~lr~RaRdL~rNn~~a 79 (505) T protein:vir:96 1 MKRAEKKPSLAQRMVNWAWYRY-VEPQKNAARAFEAARRDRLGKAWLRRASRLSADEEIYADLASLVQRAREQSINNPYA 79 (505) T ss_pred CCCCccccchhhcccchhhhhh-HHHHHHhhhhcccccCCCccccccCCCCCCChHHHHHHHHHHHHHHHHHHHhcChHH Confidence 65544333211 1111110000 000000000000000000000 000001111 111222234454 59999 Q ss_pred HHHHHHHHHHHhh-CCceeeeCCC--CChhHHHHHHHHHHHHHhc---------cCCCHHHHHHHHHHH-Hhhcceeeee Q lcl|NC_017974. 70 ATGLDMLYTFVEK-AFKDFKVIPG--ESEESKKAAKFVEYCLKNM---------EGQTLRQFARDAATF-NEYGLSVVEK 136 (507) Q Consensus 70 ~s~l~~Rk~av~~-~~~~~~~~~~--~~~~~~~~ae~v~~~l~~~---------~~~~~~~~l~~~l~a-~~~Gfs~~Ei 136 (507) .++++.....|-+ ..+.+++.+. +...+++.++.|+..++.- ...+|..+...++.+ +.-|=+++-+ T Consensus 80 ~~av~~~~~nvVG~~Gi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~~~D~~g~~~f~~lq~l~~r~~~~dGE~f~~~ 159 (505) T protein:vir:96 80 KRFYQLLKNNVIGPKGMTFQSRVKRRNGKPDDRANTLIEGNWQQWIKKGNCDVTGRYHFVTLLHLWMETLARDGEVLVRE 159 (505) T ss_pred HHHHHHHHHHhcCCCcceeeecCCcccccccHHHHHHHHHHHHHhcCCcCcceeccCCHHHHHHHHHHHHhhCCceEEEE Confidence 9999999999987 4666665442 1233556666666655431 112477766666654 3356555544 Q ss_pred eeeccCccccCcceeeeeccccCchhccc------------ccceeecCCCccccccccccccccccccccccccccccC Q lcl|NC_017974. 137 VYTQVTVGEYIGKYKVKNLAFRPQASLSR------------TNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTR 204 (507) Q Consensus 137 ~w~~~~~~~~~g~~~~~~l~~r~~~~i~~------------~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~ 204 (507) .|..+ +.++-. |...+++-|.. .+-+.|+.+|+-+-++.. ..++++......... T Consensus 160 ~~~~~--~~~~~~-----lqliepd~l~~~~n~~~~~~~~i~~GIe~d~~Gr~~aY~i~------~~hPgd~~~~~~~~~ 226 (505) T protein:vir:96 160 HRGYP--NKWGYA-----LQILECDRLDLNYNADLQNGNRIRMSIELDAWERPVAYHLL------VNHPGDNSYCYHYAG 226 (505) T ss_pred eecCC--CCcceE-----EEEechhhcCCCCCcccCCcCeEEeceEECCCCceEEEEEe------ecCCCcccccccccc Confidence 44332 222211 22222222221 123344555443332211 111222111111122 Q ss_pred CCccccccccEEEEeeC-CccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHH Q lcl|NC_017974. 205 MSDVIIPINRVMLMNTG-GSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEA 283 (507) Q Consensus 205 ~~~~~iP~~k~i~~~~~-~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~ 283 (507) .....+|... |+|.+. .+.+..-|.+.|.++.....-........++-..--.+=..+++.+..-......+.... T Consensus 227 ~~~~rvpa~~-vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~-- 303 (505) T protein:vir:96 227 QTYERVPADE-IIHTFVPWRPHQNRGIPWTHASMVELHHIGEYRKSEMIAAELGAKKVGFYEQDPEAYDQPPEDDQGE-- 303 (505) T ss_pred ccccccCHhH-hhhhhcccCCccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCccCCCccccccCc-- Confidence 2344566554 556654 457788899999988765543333333333222211111113332222111111111111 Q ss_pred HHHHHHHHHHHHhhcCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhccc---ccccccch Q lcl|NC_017974. 284 EMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGF---INVGNDKG 360 (507) Q Consensus 284 ~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqt---Lts~~~~~ 360 (507) . . ..+..| ....++.|.+|++.+ .+..+ ..|..|.+..-++|+..+ |-+ ||. +-.+ T Consensus 304 -~---~----~~l~pG--~i~~L~pGe~i~~~~-------~~~p~--~~~~~f~~~~lr~iaagl-gi~ye~lt~-D~s~ 362 (505) T protein:vir:96 304 -I---V----EEVEAG--TYQLLPYGIRFKEHK-------IDHPH--TNFGAFVKSSLRGVAAGM-GPAYNRLAH-DLEG 362 (505) T ss_pred -c---c----cccCCc--eeeecCCCCeeeeeC-------CCCCC--CCHHHHHHHHHHHHHhhc-CCCHHHHhc-cccc Confidence 0 1 122233 456678887766533 33322 236778888899988877 422 333 2223 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---hCCC-cCCc---cccce--EEecCcCchhHHHHHHHHHH Q lcl|NC_017974. 361 GSYNLSESKQTIHTQFVQRVNEIILEALNENLLPQLLA---LNDM-RLPE---TEMPY--VKAGEIVDVDMEGFSKAIQR 431 (507) Q Consensus 361 GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~---lN~~-~~~~---~~~P~--~~~~~~~~~dl~~~a~~~~~ 431 (507) .|||.+-.-..-+....+.....+...+-+-+...+++ +++. ..|+ ..+-. +........|..+-+++... T Consensus 363 ~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~w~~p~~~~iDP~Ke~~a~~~ 442 (505) T protein:vir:96 363 VNFSSLRSGELDERDLYKLLQFFVVTELLERVAGNLISMSLLTQALPLNMVDIDRLSQYAFQPRGWDWVDPAKDSKAHSE 442 (505) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCccchhhceeeeccCCccccChHHHHHHHHH Confidence 45554433332223333333333333333323222222 3321 1111 12222 22233344677777888888 Q ss_pred HHhcCCccccHHHHHHHHHHcCCCCCCC-cccccch-hhhcCCCcCCCCCCcccccCcccccccccCCCCc Q lcl|NC_017974. 432 IGAVGYLPKTPKVINRVLEVLGIDEKIE-EDISQEE-LMKLLGEDTSRAGDGMTKGSSGNGTGKISSTRDN 500 (507) Q Consensus 432 L~~~G~~~~~~~~~~~i~e~~Gip~~~~-~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 500 (507) .+++|+.... +.+++ .|.....- ++...+. ..+...- .+..+.....++. .......+.|+ T Consensus 443 ~i~~G~~t~~----~~~a~-~G~D~~~v~~q~a~e~~~~~~~Gl--~~~~~~~~~~~~~-~~~~~~~~~d~ 505 (505) T protein:vir:96 443 SIKNRTRSRS----SIIRA-AGDDPEDVFDEIAWEEQLMRDKGV--NPTPPEQESKDAT-TDEEDDSASDD 505 (505) T ss_pred HHHcCCCCHH----HHHHH-cCCCHHHHHHHHHHHHHHHHHcCC--CCCCCCCCCCCCC-CCCCCCCCCCC Confidence 9999986552 33444 35532100 0000000 0000000 0000000000000 00000111111 No 154 >protein:vir:80453 Length: 535 # NCBI annotation: BcepGomrgp05 # Family: family:all:584 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210225;genbank:gi:146329917;genbank:GeneID:5123562 Probab=96.13 E-value=0.00097 Score=37.11 Aligned_cols=453 Identities=13% Similarity=0.054 Sum_probs=164.7 Q ss_pred CcccccccccccCccccc--cccccccc--ccc----ccCch--hhhHHHHhhhhcccch---------hcc-------c Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATS--EQINVSRM--RMS----EQGTF--ALAKVQVDSERMKAEE---------IRW-------P 54 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~--~~~~~~~~--~~~----~~g~~--~~~~~~~~~~~e~~~~---------lr~-------~ 54 (507) |+|||-.-+.-..-+.+. ..+.+|.. .+. .++.. ....|......-.+.. |-. . T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~dV~~~hp~y~a~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~~~~~~ 80 (535) T protein:vir:80 1 MARKRTTIRRDVQSKVLIPPQAPPTSGLGPSLPNVGYQRVEFGEMLPKWRKIMDCLSGQEAIKAKREEYLPMPSVDSRDE 80 (535) T ss_pred CCcchhhhhhhhhhhcccCCCCcCCCCCCCCCCCCCcCCHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCcccCCc Confidence 999874221111000000 01111110 011 11111 1112222211111110 100 1 Q ss_pred hhHHHHHHHhh----ChHHHHHHHHHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHHH-HHhh Q lcl|NC_017974. 55 NLIGTVESMKQ----DATVATGLDMLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAAT-FNEY 129 (507) Q Consensus 55 ~~~~~y~~m~~----D~~v~s~l~~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l~-a~~~ 129 (507) +.-+.|+.-+. =.++...++.-...|.+.+..+.++ ..+..|++.+ +..+.++..++++++. ++.| T Consensus 81 E~~~~Y~~rl~rA~~~n~~~~tl~~l~G~vfrk~p~~~~p-------~~l~~l~~d~--D~~G~~L~~f~~~~~~~~l~~ 151 (535) T protein:vir:80 81 EQRRRYETYLQRAIFYNVTARTLDGMMGQVFSRDPIRQLP-------PALEAIVEDI--DGEGVSLDQQAKKALGYTMGF 151 (535) T ss_pred CCHHHHHHHHhhccCCChhHHHHHHHhchhhcCCcceecc-------HHHHHHHhcc--CCCCCCHHHHHHHHHHHHHhc Confidence 22233554332 3455555555555554433222211 2233333222 2445678899988775 6679 Q ss_pred cceeeeeeeeccCcc-----------------------------ccCcceeeeeccccCchhcccccceeecCCCccccc Q lcl|NC_017974. 130 GLSVVEKVYTQVTVG-----------------------------EYIGKYKVKNLAFRPQASLSRTNPIVYNEDGSAIIG 180 (507) Q Consensus 130 Gfs~~Ei~w~~~~~~-----------------------------~~~g~~~~~~l~~r~~~~i~~~~~~~~~~d~~~l~~ 180 (507) |.+.+=+.|-..+.. ..+|+-.|..+..|-. +...+|+.+-.. T Consensus 152 G~~~iLVD~P~~~~~~t~ade~~~~~rPy~~~y~ae~IinW~~~~v~G~~~Lt~v~lrE~--------~~~~dd~f~~~~ 223 (535) T protein:vir:80 152 GRAAIFTDYPNVGRPVTVLEQKLGLYRPTITLVHPTSIINWRTKLVGGKSVISLVVIQEN--------VLAQDDGFETTY 223 (535) T ss_pred CeEEEEEeecCCCCcccHHHHHhcCCCcEEEEechhhccCccccccCCccceeEEEEEEE--------EEecCCCcccce Confidence 988764445322110 0112111111111110 011122222222 Q ss_pred cccccccc---------cccc--ccccccc----ccccCCCccccccccEEEEeeCCccCccccchHHHHHHHHHHHHHH Q lcl|NC_017974. 181 IKQSLSAF---------QNYT--ASEIGVG----GVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYRAWREKIL 245 (507) Q Consensus 181 ~~q~~~~~---------~~~~--~~~~~~~----~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~ 245 (507) +.|..-.. ..+. ....... ......++..++.=-|+++. ....+-..+...|..++..-+---. T Consensus 224 ~~q~RvL~~~~~G~y~v~~~~~~~~~~~~~~~~~~~~~~~g~~~l~~IPfv~~~-~~~~~~~~~~pPLl~LA~lni~Hy~ 302 (535) T protein:vir:80 224 VQQWRVLQLNAEGNYQVERWRRETQEEMYYSYSKHVPTDGNGNPFKEIPFQFIG-PLDNNADIDHPPLLDLCEVNIGHYR 302 (535) T ss_pred eEEEEEEEecCCceEEEEEEEeecCCccccccceeecccCCCcccCeeEEEEee-cCCCCCCCCccchHHHHHHHHHHhh Confidence 22211100 0000 0000000 00001112222211234332 2223333455555555544332111 Q ss_pred HHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccccceeeeecc Q lcl|NC_017974. 246 IENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYSMTLKGID 325 (507) Q Consensus 246 ~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~ 325 (507) ...+.-.-+..-+.|++++.-+ .++..+ ...+ ...+.-|+.+++.+|++....+. +.+ T Consensus 303 ~ssd~~~il~~~~~P~l~i~G~--------~~~~~~------~~~~-~~~i~iG~~~~~~lP~~~~~~~~-------e~~ 360 (535) T protein:vir:80 303 NSADYEEMAFVAGQPTAFFTGL--------TKDWVE------DVFK-DFKVHLGSRAIIPLPQGATAGIL-------QIT 360 (535) T ss_pred chhHHHHHHHHhcCceeeeecC--------chhhhh------cCCC-CcceEecCcccccCCCCCCccee-------eec Confidence 2222323333445777776511 000000 0000 01133478888899998875543 333 Q ss_pred CCCcchhHHHHHHHHHHHHHHHHhcccccccc-cchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcC Q lcl|NC_017974. 326 GMGKQYSTAQLISDRKKSILDRLGAGFINVGN-DKGGSYNLSESKQTIHTQFVQRVNEIILEALNENLLPQLLALNDMRL 404 (507) Q Consensus 326 g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~-~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~ 404 (507) +++-. ...++-...+|..+ .+..+..+. ..+.+.|.- .......++.+-+..+++.++ ++++++.++-+... T Consensus 361 ~~~~a---~~~l~~~e~qM~~l-Ga~ll~~~~~~~Ta~~a~~--~~~~~~S~L~~~a~~le~al~-~aL~~~A~w~G~~~ 433 (535) T protein:vir:80 361 PNSVP---FEAMTHKESQMIAM-GANLLVKSGGNRTFGEAQQ--EEASEQSILSACTKNVSMAFR-KALRWANQFQTGIV 433 (535) T ss_pred cchhH---HHHHHHHHHHHHHH-HHHhhccCcccccHHHHHH--HHHHHhHHHHHHHHHHHHHHH-HHHHHHHHHcCCcc Confidence 33322 23445555555442 122222211 123333322 223335667788899999996 48899888753211 Q ss_pred CccccceEEec--C-cCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCC--cccccchhhhcCCCcCCCCC Q lcl|NC_017974. 405 PETEMPYVKAG--E-IVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIE--EDISQEELMKLLGEDTSRAG 479 (507) Q Consensus 405 ~~~~~P~~~~~--~-~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~--~~~~~~~~~~~~~~~~~~~~ 479 (507) +..-+.|... . ....|.. .++++.++...|.+.. ....+++ ++.|+..+.. +++. .....+..+....++ T Consensus 434 -~~~~~~i~~n~dF~~~~ld~~-~~~all~~~~~G~Is~-et~~~~L-~r~gvl~~~~~~eee~-~ri~~E~~~~~~~~g 508 (535) T protein:vir:80 434 -NDETVEYNLNTDFPAARLTPN-ERAELILEWQQGAITF-KEMRAGL-RRAGVASEDDAKAETE-GKATVEFIAKTAAAG 508 (535) T ss_pred -CCCceEEEeccccccccCCHH-HHHHHHHHHhcCCCCH-HHHHHHH-HhCCCCCcccchHHHH-HHHHhhhhhccccCC Confidence 1222334322 1 2222333 3456667788887543 3334445 4567754322 1111 100001011111111 Q ss_pred CcccccCcccccccccCCCCchhhhccC Q lcl|NC_017974. 480 DGMTKGSSGNGTGKISSTRDNSAANLDN 507 (507) Q Consensus 480 ~~~~~~~~~~~~~~~~~~~d~~a~~~~~ 507 (507) .....++. ++.+.++.--+.-+|.+- T Consensus 509 ~~~d~~~~--g~~~~~~~~~~~~~~~~~ 534 (535) T protein:vir:80 509 KVGDAASG--GTNKAKLNNGNGGGNQAG 534 (535) T ss_pred CCCCCCCC--CCCcCcccCCccccccCC Confidence 11111111 111111111111222222 No 155 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=96.10 E-value=0.001 Score=37.01 Aligned_cols=415 Identities=11% Similarity=0.001 Sum_probs=156.6 Q ss_pred CcccccccccccCcccccccccc--ccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHH--------------- Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINV--SRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESM--------------- 63 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~--~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m--------------- 63 (507) |.= .++..+..--.+ .+.........-..-...+. +.. -|..++.+.|+-- T Consensus 1 ~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~--~~~--~~~~~~~~YY~g~~~i~~~~~~~~~~~~ 68 (472) T protein:vir:93 1 MYP--------SQPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHL--EKL--PEISIGQEYYEQRPDIVKEPKPVDATGA 68 (472) T ss_pred CCC--------CCCcchhhhhceeeecCchhhHHHHHHHHHHHHH--HHH--HHHHHHHHHhccccccccccchhhcccc Confidence 111 111111100000 00000000000000000110 000 0111222222110 Q ss_pred ----hh-----ChHHHHHHHHHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHH-HHHHhhccee Q lcl|NC_017974. 64 ----KQ-----DATVATGLDMLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDA-ATFNEYGLSV 133 (507) Q Consensus 64 ----~~-----D~~v~s~l~~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~-l~a~~~Gfs~ 133 (507) +. .+...-++.+...-+.+-+..+. .+++++.+++++++.+ .+.+.+.++ .++.-||.+ T Consensus 69 ~~~~~~~~ri~~n~~~~ivd~~~~~l~g~~~~~~------~~d~~~~~~l~~~~~n----~~~~~~~~~~~~~~~~G~~- 137 (472) T protein:vir:93 69 VDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFK------HTDDEVVKRIDEVLGN----RFDDKLHSVLTGASNKGIE- 137 (472) T ss_pred ccccccccccccchHHHHHHHHhhhhcccCeeec------cCChHHHHHHHHHHhc----cHHHHHHHHHHHHhhcCeE- Confidence 00 12222223333333322222221 2345567778777642 466666665 567889985 Q ss_pred eeeeeeccCccccCcceeeeeccccCchhcccccceeecCC-Ccccccccccc----cccccccccccccc--------- Q lcl|NC_017974. 134 VEKVYTQVTVGEYIGKYKVKNLAFRPQASLSRTNPIVYNED-GSAIIGIKQSL----SAFQNYTASEIGVG--------- 199 (507) Q Consensus 134 ~Ei~w~~~~~~~~~g~~~~~~l~~r~~~~i~~~~~~~~~~d-~~~l~~~~q~~----~~~~~~~~~~~~~~--------- 199 (507) ++++|..+ +|.+.+.-+.|+. .+.-|. +.. +..+..++... .....++....... T Consensus 138 ~~~v~~d~-----d~~~~i~~~~p~~--~~~i~d----~~~~~~~~~~ir~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 206 (472) T protein:vir:93 138 WLHPYLDE-----EGEFKLFRVPAEQ--GIPIWT----DKEHEELEAFIRMYKLENETKVEYWDKVTVNYYVYENGSLIP 206 (472) T ss_pred EEEEEECC-----CCceEEEEEcccc--eEEEEc----CCCCCceEEEEEEEEeecceeEEEEecCeEEEEEEecCeeee Confidence 45787532 4555554443321 111110 010 11111110000 00000000000000 Q ss_pred -------ccccCCCccccccccEEEEeeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCC Q lcl|NC_017974. 200 -------GVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILN 272 (507) Q Consensus 200 -------~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k 272 (507) ..........++.--++.|+ +|+.|.|.+..+-...=--+..+..++..++-+..|+++++- . T Consensus 207 ~~~~~~~~~~~~~~~~~~~~vPvv~~~-----nn~~g~s~~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~~~g---~-- 276 (472) T protein:vir:93 207 DYSNNLENSKTHFSTGSWGKIPFIPFK-----NNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTN---Y-- 276 (472) T ss_pred cccccccccccccccCCCCCcceEEec-----CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeEeec---C-- Confidence 00000000111111223332 367889999874433323345667777778888888887751 1 Q ss_pred CCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhccc Q lcl|NC_017974. 273 KAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGF 352 (507) Q Consensus 273 ~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqt 352 (507) ...+..+.. +.+. ....+.+|.+.+.++. ..+. ....+..+++.+.+.|...--... T Consensus 277 --~~~~~~~~~---~~~~---------~~~~~~~~~~~~~~~l-------~~~~--~~~~~~~~~~~l~~~i~~~s~~p~ 333 (472) T protein:vir:93 277 --DDQELPEFK---RLLR---------YYGAIKVSDNGGVDTI-------QVEV--PVENSKKYLDELYQKIMLFGQAVD 333 (472) T ss_pred --CcccchhhH---HHHh---------hccccccCCCCcceeE-------eecC--CHHHHHHHHHHHHHHHHHHhCCCC Confidence 111111111 1111 1123456777655442 2221 223467788888888877654443 Q ss_pred ccccccchhhhhHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCccccceEEecCcCchhHHHHHHHH Q lcl|NC_017974. 353 INVGNDKGGSYNLSES---KQTIHTQFVQRVNEIILEALNENLLPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAI 429 (507) Q Consensus 353 Lts~~~~~GS~Al~~v---h~~v~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~ 429 (507) ++.+.. |+...|.. ...-....+..-.+.+...+ +++++.++.+.+... +..--.+.|....+.|..+.++++ T Consensus 334 ~~~~~~--~~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l-~~~~~li~~~~~~~~-~~~~i~v~f~~~~p~~~~~~~~~~ 409 (472) T protein:vir:93 334 FSSDKF--GSAPSGVALEFLYTNLNLKADKLARKAKVAI-QELLWFVFEHFDIKG-EHKDVDISFNYNKVANTELQVQTA 409 (472) T ss_pred CCcccc--ccCchHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhCCCc-ccceeeEEeCCCCCCCHHHHHHHH Confidence 443321 11111211 11222223334445555566 346677777653221 112235778888899999999999 Q ss_pred HHHHhcCCccccHHHHHHHHHHcC-CCCCCCc-ccccc---hhhhcCCCcCCCCCCcccccC-cccccccccCCCCch Q lcl|NC_017974. 430 QRIGAVGYLPKTPKVINRVLEVLG-IDEKIEE-DISQE---ELMKLLGEDTSRAGDGMTKGS-SGNGTGKISSTRDNS 501 (507) Q Consensus 430 ~~L~~~G~~~~~~~~~~~i~e~~G-ip~~~~~-~~~~~---~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~d~~ 501 (507) .+|+ |+ ++ .+.+.+.++ ++.+..+ +.... ..+..+.. ..++.+... .+++.. ...+. T Consensus 410 ~k~~--gi-is----~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~----~~~~~~d~~~~~~~~~----~~~~e 472 (472) T protein:vir:93 410 QQSM--GI-VS----HETVLENHPFVEDLQAELERIEQEQMEYNKQLPN----LDDGGADGAQQQERSN----NKESE 472 (472) T ss_pred HHHh--cc-Cc----hHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhccC----cCcccCCCCCCCCCCC----cccCC Confidence 9974 65 33 234555554 4322111 00000 01111110 000001000 000000 00000 No 156 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=96.09 E-value=0.001 Score=36.97 Aligned_cols=425 Identities=9% Similarity=-0.069 Sum_probs=145.4 Q ss_pred ccccccccccccccCchhh-hHHHHhhhhcccchhccchhHHHHHHHhhChHH----H-------HHHHHHHHHHhhCC- Q lcl|NC_017974. 18 SEQINVSRMRMSEQGTFAL-AKVQVDSERMKAEEIRWPNLIGTVESMKQDATV----A-------TGLDMLYTFVEKAF- 84 (507) Q Consensus 18 ~~~~~~~~~~~~~~g~~~~-~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v----~-------s~l~~Rk~av~~~~- 84 (507) -+...+.+ ...+..... +..++.... ...-.|...+.+.|+--....++ - .+++==+.+|..+. T Consensus 1 ~~~~~~~~--~~gl~~~~~~~~~~L~~~~-~~~~~~~~~~~~Yy~G~~~~~~~~~~~p~~~r~~~~v~nw~~~~Vd~~a~ 77 (474) T protein:vir:81 1 MIQQQTVR--IPSLSNDENALINGLLAQI-ENLRWKNLLRTSYYENKRTIQYVGTLIPPQYFNLGLVLGWTGKAVDALAR 77 (474) T ss_pred CcCCCcCc--CCCCChhHHHHHHHHHHHH-HHHhhHHHHHHHHhccCCChhhccccccHHHHHHHhhcChHHHHHHHHHh Confidence 01111111 111111100 000111000 00000101111112110000111 1 11111122222210 Q ss_pred ---c-eeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHH-HHHHHhhcceeeeeeeeccCccccCcceeeeeccccC Q lcl|NC_017974. 85 ---K-DFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARD-AATFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAFRP 159 (507) Q Consensus 85 ---~-~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~-~l~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~r~ 159 (507) . -|..+ +++.++.. +.+++.. ..+...... ..+|+-||.|. ..||.-..+ ++...+.- ++ T Consensus 78 rl~~~Gf~~~-d~~~~~~~----l~~iw~~---N~ld~~~~~~~~~al~~G~sf-~~V~~~~d~---~~~~~i~~---~s 142 (474) T protein:vir:81 78 RCNLEGFVWP-DGDLDSLG----GTEVVDD---NHLLSEIDSAIVAAMQHGPAF-LINTVGEDD---EPEALIHV---KD 142 (474) T ss_pred hhcccceECC-CCCccchH----HHHHHHh---cChhHHHHHHHHHHHhhCcee-EEEecCCCC---CceeEEEE---ec Confidence 0 13322 22222222 2333332 235544444 45789999996 578853321 11222221 12 Q ss_pred ch-hccccc-----------ceeecCCCccccccccccccccccccccccc-cccccCCCccccccccEEEEeeCCccCc Q lcl|NC_017974. 160 QA-SLSRTN-----------PIVYNEDGSAIIGIKQSLSAFQNYTASEIGV-GGVSTRMSDVIIPINRVMLMNTGGSSSQ 226 (507) Q Consensus 160 ~~-~i~~~~-----------~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~-~~~~~~~~~~~iP~~k~i~~~~~~~~~~ 226 (507) +. ++.-|. .+..+.+|......--.+.....+....... ........+..+| ++.|+++.+.+. T Consensus 143 p~~~~~~~D~~~~~~~~al~~~~~~~~g~~~~~~ly~~~~~~~~~~~~~~~~w~~~~~~~~~gvP---vV~~~n~~~~~~ 219 (474) T protein:vir:81 143 ASEATGEWNRRRRGLNNLLSIIDKDKEGKVLSLALYLDNETVTAQRDKATLKWQVDRDEHVYGVP---AQVLPYKPAPKR 219 (474) T ss_pred cceEEEEEeCCCCcceeeeEEEEEcCCCcEEEEEEEeCCcEEEEEEcCccceeeeccCCCCCCcc---eEEecccccccC Confidence 21 111011 1112233321110100011111100000000 0011111222234 678889999999 Q ss_pred cccchHH-HHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEe Q lcl|NC_017974. 227 ALGVSPL-VGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFM 305 (507) Q Consensus 227 p~G~gll-~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~i 305 (507) |+|.+-+ +.+-...=-=+..+-.-+.-.|=|.+|-.++. +.......+.+......+++...- ... T Consensus 220 ~~G~s~i~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~---G~~~~~~~d~d~~~~~~~~~~~~~----------i~~ 286 (474) T protein:vir:81 220 PFGQSRITKPMMGLQDAGVRELARREGHMDVFSYPEFWLL---GADESALKNADGTIKSVWEARLGR----------IKG 286 (474) T ss_pred cCCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhheee---cCChhhcccccccccchhhhhHHH----------Hhc Confidence 9987633 23321110011122222334455555554443 111111111111111223332221 123 Q ss_pred ecCccccccccc-ceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccc----c---ccc-hhhhhHHHHHHHHHHHH Q lcl|NC_017974. 306 LPSDTKDNAPQY-SMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINV----G---NDK-GGSYNLSESKQTIHTQF 376 (507) Q Consensus 306 iP~g~~i~~~~~-~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts----~---~~~-~GS~Al~~vh~~v~~~~ 376 (507) +|.+.+..+... ..++...+. ++. ..+++.++.-|.. +.+.|-+. | ... .+.-|+...+... ... T Consensus 287 ~~~d~d~~~~~~~~~~~~q~~~-a~l---~~~~~~l~~~~~~-~a~~t~iP~~~lG~~~~~np~SaeAi~a~~~~l-~~k 360 (474) T protein:vir:81 287 LPDDADADIPQLARADVKQFPA-ASP---DAHWSDINGLAKL-FAREASLPDTAVAISGLSNPTSAESYDASQYEL-IAE 360 (474) T ss_pred CCCcccccccccccccccccCC-CCh---hHHHHHHHHHHHH-HHhhhCCCHHHhcccccccccHHHHHHHHHHHH-HHH Confidence 444433222110 011111111 122 2344544443322 22222111 1 111 1223433333322 333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCcCCccc-----cceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHH Q lcl|NC_017974. 377 VQRVNEIILEALNENLLPQLLALNDMRLPETE-----MPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEV 451 (507) Q Consensus 377 ~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~-----~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~ 451 (507) .+.-.+.+...+. ++++..+.+.+....+.. --++.|...+...+.+.|+++.||+++|..+++ .+-+++. T Consensus 361 ae~k~~~fg~~l~-~~~rla~~i~~~~~~~~~~~~~~~~~v~W~d~~~~s~a~~aDa~~Kl~~a~~~~~~---~~~~~~~ 436 (474) T protein:vir:81 361 AEGAVDDFTPALR-KAFIRALAMKNKVAIDEIPDEWKSIDAKWRDPRYLSKSAQADAGMKQLAAVPWLAE---TEVGLEL 436 (474) T ss_pred HHHHHHHHHHHHH-HHHHHHHHHhCCCCccccchhhccceeEecCCCccCHHHHHHHHHHHHhcccCCCc---HHHHHhh Confidence 4445566777774 577777777532211111 124577778888889999999999999975554 2456788 Q ss_pred cCCCCCCCccc---ccchhhhcCCCcCCCCCCcccccCccccccc Q lcl|NC_017974. 452 LGIDEKIEEDI---SQEELMKLLGEDTSRAGDGMTKGSSGNGTGK 493 (507) Q Consensus 452 ~Gip~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 493 (507) +|+.+.+-+.. ...+.+..+- +.+.+...+.+++. T Consensus 437 lg~t~~~i~~~~~~~~~~~~~~~~-------~~l~~~~~~~~~aq 474 (474) T protein:vir:81 437 IGLTPQQARRAMADKRRVQGRGTL-------QALIDRSNNGATAQ 474 (474) T ss_pred cCCCHHHHHHHHHHHHHHhHHHHH-------HHHHhcCCCCCCCC Confidence 89964322110 0000000000 00111111111221 No 157 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=96.00 E-value=0.0011 Score=36.71 Aligned_cols=424 Identities=9% Similarity=-0.011 Sum_probs=156.3 Q ss_pred Cccccccccc---ccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRS---VKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLY 77 (507) Q Consensus 1 ~~~k~~~~~~---~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk 77 (507) |... ++++- .+.-.-.......++......+... . ....+..| + ..+...-++.+.. T Consensus 38 i~~~-~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~------~--~~~~~~~r----------i-~~n~~~~ivd~~~ 97 (503) T protein:vir:59 38 IDEH-NPEPLLKGVRYYMCENDIEKKRRTYYDAAGQQL------V--DDTKTNNR----------T-SHAWHKLFVDQKT 97 (503) T ss_pred HHhh-cHHHHHHHHHHhccccchhhccchhcccccccc------c--ccccccce----------e-ecchHHHHHHHHH Confidence 2211 11000 0000000000000000000000000 0 00000000 0 1122222333333 Q ss_pred HHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHH-HHHHhhcceeeeeeeeccCccccCcceeeeecc Q lcl|NC_017974. 78 TFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDA-ATFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLA 156 (507) Q Consensus 78 ~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~-l~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~ 156 (507) .-+.+-+..+. .+++++.++++.++++ .|.+.+..+ .++.-||.+. +.+|.. .+|++.+.-+. T Consensus 98 ~yl~g~~~~~~------~~d~~~~~~l~~~~~n----~~~~~~~~~~~~~~~~G~~~-~~v~~d-----~dg~~~i~~~~ 161 (503) T protein:vir:59 98 QYLVGEPVTFT------SDNKTLLEYVNELADD----DFDDILNETVKNMSNKGIEY-WHPFVD-----EEGEFDYVIFP 161 (503) T ss_pred hhhhcCCeeec------cCcHHHHHHHHHHHhc----CHHHHHHHHHHHHhhCCeEE-EEEeec-----CCCceEEEEEc Confidence 33333333332 2234566777776642 466665554 4577799986 466643 24566555444 Q ss_pred ccCchhcccccc------------eee-cCCCcccccccc-ccccccccccccccccccc----------cCCCcccccc Q lcl|NC_017974. 157 FRPQASLSRTNP------------IVY-NEDGSAIIGIKQ-SLSAFQNYTASEIGVGGVS----------TRMSDVIIPI 212 (507) Q Consensus 157 ~r~~~~i~~~~~------------~~~-~~d~~~l~~~~q-~~~~~~~~~~~~~~~~~~~----------~~~~~~~iP~ 212 (507) |+. .+.-|.. +.. +.++.....++- .+.....+........... ....+...+. T Consensus 162 p~~--~~~i~d~~~~~~~~~~ir~~~~~~~~~~~~~~~evy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 239 (503) T protein:vir:59 162 AEE--MIVVYKDNTRRDILFALRYYSYKGIMGEETQKAELYTDTHVYYYEKIDGVYQMDYSYGENNPRPHMTKGGQAIGW 239 (503) T ss_pred cce--eEEEEeCCCCCceEEEEEEEEEecCCCceEEEEEEEeCCcEEEEEEcCCcccccccccccccccceeecceeccC Confidence 332 1110000 000 011111110000 0000000000000000000 0000111111 Q ss_pred c--cEEEEeeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHH Q lcl|NC_017974. 213 N--RVMLMNTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLM 290 (507) Q Consensus 213 ~--k~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~ 290 (507) . .++.| ..|+.|.|.+..+-...=.=+..+...+..++.+..|+++++- . . .+...+. ...+ T Consensus 240 ~~vPiv~~-----~nn~~~~sd~~~~~~liDa~d~~~s~~~~~~~~~~~~~~v~~g---~---~-~~~~~~~---~~~~- 303 (503) T protein:vir:59 240 GRVPIIPF-----KNNEEMVSDLKFYKDLIDNYDSITSSTMDSFSDFQQIVYVLKN---Y---D-GENPKEF---TANL- 303 (503) T ss_pred CccceEEe-----cCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhcCCeeEeec---C---C-ccccchh---hhhh- Confidence 1 12322 2467789988875433322344667777778899999888751 1 1 1111111 1111 Q ss_pred HHHHHhhcCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHH--- Q lcl|NC_017974. 291 SDAANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSE--- 367 (507) Q Consensus 291 ~~~~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~--- 367 (507) .....+.+|.+.++++. ..+. ....+...++.+.+.|.+.-.+..++.+.. |+.+.|. T Consensus 304 --------~~~~~~~~~~~~~~~~l-------~~~~--~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~--~~~~Sg~Ai~ 364 (503) T protein:vir:59 304 --------RYHSVIKVSGDGGVDTL-------RAEI--PVDSAAKELERIQDELYKSAQAVDNSPETI--GGGATGPALE 364 (503) T ss_pred --------hcccceeccCCCcceeE-------eccC--CHHHHHHHHHHHHHHHHHHhcccCCCcccc--cccccHHHHH Confidence 11234556776654432 2222 223467788999988877765544443222 1111121 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---CCCc-CCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHH Q lcl|NC_017974. 368 SKQTIHTQFVQRVNEIILEALNENLLPQLLAL---NDMR-LPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPK 443 (507) Q Consensus 368 vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~l---N~~~-~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~ 443 (507) ....-....+..-.+.+...|. ++++.++.+ .+.. .....--.+.|....+.|..+.++++.+|+++|++.. T Consensus 365 ~~~~~l~~k~~~~~~~~~~~l~-~~~~~i~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~~~~~~~~kl~~~GiiS~--- 440 (503) T protein:vir:59 365 NLYALLDLKANMAERKIRAGLR-LFFWFFAEYLRNTGKGDFNPDKELTMTFTRTRIQNDSEIVQSLVQGVTGGIMSK--- 440 (503) T ss_pred HHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhccCcccccccceeEEeCCCCCCCHHHHHHHHHHHHhCCCCch--- Confidence 1122222233334445555564 355554442 2111 0111224788999999999999999999999996432 Q ss_pred HHHHHHHHcCC-CCCCCc-ccccchh---hhcCCCcCCCCCCcccccCcccccccccCCCCchhhhccC Q lcl|NC_017974. 444 VINRVLEVLGI-DEKIEE-DISQEEL---MKLLGEDTSRAGDGMTKGSSGNGTGKISSTRDNSAANLDN 507 (507) Q Consensus 444 ~~~~i~e~~Gi-p~~~~~-~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~a~~~~~ 507 (507) +.+.+.++. +.+..+ +....+. +........+. .....++.....+....+..+=.++ T Consensus 441 --et~l~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~g~~~ 503 (503) T protein:vir:59 441 --ETAVARNPFVQDPEEELARIEEEMNQYAEMQGNLLDDE----GGDDDLEEDDPNAGAAESGGAGQVS 503 (503) T ss_pred --HHHHHhCCCCCCHHHHHHHHHHHHHHHHhhhccccCcc----CCCCCCCcCCCCCCcccCCCCCCcC Confidence 345555543 211111 0000000 10000000000 0000000000000000000000000 No 158 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=95.92 E-value=0.0013 Score=36.48 Aligned_cols=425 Identities=8% Similarity=-0.042 Sum_probs=158.6 Q ss_pred Cc---cccccccccc---CccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHH-----------H Q lcl|NC_017974. 1 MS---RKRNRNRSVK---VAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVES-----------M 63 (507) Q Consensus 1 ~~---~k~~~~~~~~---~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~-----------m 63 (507) |. .+|-...+.. +|...+..+.... .....+... ..+ |..++.+.|+- - T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~-----------~~i~~~~~~-~~~--~~~~l~~Yy~g~~~i~~~~~~~~ 66 (470) T protein:vir:99 1 MKDINYGRDKVTGNSSFIFPKGEKLTSNELL-----------GFIAYNETV-LKP--RYRENMKLYLGKHKILTAPEKET 66 (470) T ss_pred CccccCCcccccCCceEEeCCCCCcCHHHHH-----------HHHHHHHHh-hHH--HHHHHHHHhccccccccCccccc Confidence 32 2222222221 3322222111110 000111000 000 11111222211 0 Q ss_pred hh-----ChHHHHHHHHHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHH-HHHhhcceeeeee Q lcl|NC_017974. 64 KQ-----DATVATGLDMLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAA-TFNEYGLSVVEKV 137 (507) Q Consensus 64 ~~-----D~~v~s~l~~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l-~a~~~Gfs~~Ei~ 137 (507) .. .....-++.....-+.+-+..|.+.. +.+..+.+.+++.. ..|...+.++. ++.-||.+ ++++ T Consensus 67 ~~~~ki~~n~~~~Ivd~~~~~l~g~p~~~~~~~-----d~~~~~~l~~~~~~---n~~~~~~~~~~~~~~~~G~~-~~~v 137 (470) T protein:vir:99 67 GADNRIVVNSAKYVVDVYNGYFCGIEPKLALLN-----DSSKIDEIARWNRQ---ENFFDTINEISKQCDIFGRS-IASI 137 (470) T ss_pred CCcceeecchHHHHHHHHhhhhccCCeeEeeCC-----chhHHHHHHHHHHh---cCHhHHHHHHHHHHHhcCee-EEEE Confidence 01 12333334444444444444444322 12223344444443 34666655554 57779976 5677 Q ss_pred eeccCccccCcceeeeeccccCchhcccccc-----------eeecCCC-cccccc-ccccccccccccccccccccccC Q lcl|NC_017974. 138 YTQVTVGEYIGKYKVKNLAFRPQASLSRTNP-----------IVYNEDG-SAIIGI-KQSLSAFQNYTASEIGVGGVSTR 204 (507) Q Consensus 138 w~~~~~~~~~g~~~~~~l~~r~~~~i~~~~~-----------~~~~~d~-~~l~~~-~q~~~~~~~~~~~~~~~~~~~~~ 204 (507) |... +|.+.+.-+.|+. .+.-|.. ++...++ ...... ...+...+.+............. T Consensus 138 ~~d~-----dg~~~i~~~~p~~--~~~i~d~~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 210 (470) T protein:vir:99 138 YQGE-----DARPHLMYSSPNH--AFIIYDDTVQRQPLAFVHYQIDNSNNWTDAYGVIQYADKFYKFKGYDIEEDTNAAG 210 (470) T ss_pred EeCC-----CCeEEEEEEccce--eEEEEcCCCCcceEEEEEEEEEecCCeeEEEEEEEecCeEEEEEeccccccccccc Confidence 7532 4566555443331 1111100 0000111 000000 00011111111111010000000 Q ss_pred CCccccccccEEEEeeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHH Q lcl|NC_017974. 205 MSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAE 284 (507) Q Consensus 205 ~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~ 284 (507) .....+..-.++.|. .++.|.|.+..+-...=-=+..+..++..++.+..|+++++- .+ .++++... T Consensus 211 ~~~~~~g~vPvv~~~-----n~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g---~~-----~~~~~~g~ 277 (470) T protein:vir:99 211 YAINPYGLVPAVEFF-----ENEERQGIFDSIKTLINALDKVISQKANQVEYFDNAYMYMIG---FK-----LPEDDEGN 277 (470) T ss_pred ccccCCCccceEeec-----CCCCCCcchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeec---CC-----cccccccc Confidence 111111111123332 466788988875443323344667788888989999888751 11 11111111 Q ss_pred HHHHHHHHHHHhhcCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhh Q lcl|NC_017974. 285 MVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYN 364 (507) Q Consensus 285 ~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~A 364 (507) .+..+. ....+.+|....- ...+++++..+. ....+...++.+.+.|...-....++.+.. +| .. T Consensus 278 ~~~~~~---------~~~~~~~~~~~~~--~~~~~~~l~~~~--~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~-n~ 342 (470) T protein:vir:99 278 PKFDFK---------NNRVLYVSQLDPD--TNPQIGFIAKPD--ADQMQENLIQHLTDFIFMMAMVPNIQDKNF-AG-NS 342 (470) T ss_pred hhhhhh---------hcceeeecCCCCC--CCCcceEEeecC--ChHHHHHHHHHHHHHHHHHhCCcccccccc-cc-Cc Confidence 111111 1122333321100 011233443322 223456788888888877765544443221 12 11 Q ss_pred HHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-CCCcCC--ccccceEEecCcCchhHHHHHHHHHHHHhcCCc Q lcl|NC_017974. 365 LSES---KQTIHTQFVQRVNEIILEALNENLLPQLLAL-NDMRLP--ETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYL 438 (507) Q Consensus 365 l~~v---h~~v~~~~~~aD~~~i~~~ln~~li~~l~~l-N~~~~~--~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~ 438 (507) .|.. ...-....+..-.+.+...|. ++++.++.+ +..... ...-..+.|....+.|..++++++.+|+ |+ T Consensus 343 Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~-~~~~li~~~~~~~~~~~~~~~~i~v~f~~~~p~~~~e~a~~~~kl~--gi- 418 (470) T protein:vir:99 343 SGVALQYKLFAMKNKADSKERKFDKSLM-QLYRIVLATLFNNKQDQELWSELDFKFTRNLPEDMASAIDNAKNAE--GI- 418 (470) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhccCCcccccccceEEeCCCCCcCHHHHHHHHHHHh--cc- Confidence 1211 111222233334455555563 455555543 211111 1223578899999999999999999985 65 Q ss_pred cccHHHHHHHHHHcCCCCCCCc-ccccchhhhc--CCCcCCCCCCcccccCccccc Q lcl|NC_017974. 439 PKTPKVINRVLEVLGIDEKIEE-DISQEELMKL--LGEDTSRAGDGMTKGSSGNGT 491 (507) Q Consensus 439 ~~~~~~~~~i~e~~Gip~~~~~-~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~ 491 (507) ++. +.+.+.++.-.+..+ +....+.... .....+...+.......+++. T Consensus 419 is~----et~l~~l~~vd~~~E~eri~~E~~~~~~~~~~~~~~~d~~~~d~~~ee~ 470 (470) T protein:vir:99 419 VSK----KTQLGMIPDIEPDAEMKQIAKEKADAIKQTQQLSMPIDILKRDNNAEEE 470 (470) T ss_pred CCH----HHHHHhCCCCCHHHHHHHHHHHHHHHHHHHHhhcCCCCcCCCCCCccCC Confidence 442 345555544322111 0011111100 011111111111100111100 No 159 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=95.78 E-value=0.0015 Score=36.11 Aligned_cols=422 Identities=9% Similarity=-0.044 Sum_probs=141.7 Q ss_pred cccccccccc----cccCchhhhHHHHhhhhcccchhc-c----chhHHHHHHHhhChHHHHHHHHHHHHHhhCCceeee Q lcl|NC_017974. 19 EQINVSRMRM----SEQGTFALAKVQVDSERMKAEEIR-W----PNLIGTVESMKQDATVATGLDMLYTFVEKAFKDFKV 89 (507) Q Consensus 19 ~~~~~~~~~~----~~~g~~~~~~~~~~~~~e~~~~lr-~----~~~~~~y~~m~~D~~v~s~l~~Rk~av~~~~~~~~~ 89 (507) ..+.+|..-+ .++...-.....+..+.+-...++ . +..++............-++.+...-+..-+ |.+ T Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g--~~~ 78 (456) T protein:vir:79 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNG--ITV 78 (456) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccCChhhcCcccChhhchhhhhhhcchHHHHHHHHHhhhccCC--eec Confidence 1111111000 000000000000000111111110 0 0010000110111222223333332222222 333 Q ss_pred CCCC-ChhHHHHHHHHHHHHHhccCCCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeeeeccccCchhccccc Q lcl|NC_017974. 90 IPGE-SEESKKAAKFVEYCLKNMEGQTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAFRPQASLSRTN 167 (507) Q Consensus 90 ~~~~-~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~r~~~~i~~~~ 167 (507) ...+ ...+..+ .+.+.. ..|..+..+++ ++.-||.+ ++++|... +|...+..+.|.. ++.-|+ T Consensus 79 ~~~~d~~~~~~~----~~~~~~---n~~d~~~~~~~~~a~~~G~a-~~~~~~~e-----dg~~~i~~~~p~~--~~~i~d 143 (456) T protein:vir:79 79 GGSADSDLALRA----RRIWRD---NRMDSVCKQWVKYGLDFGES-YLTCWRRD-----DGTATITADSPET--MVVSVD 143 (456) T ss_pred CCCCCccHHHHH----HHHHHh---cChhHHHHHHHHHHhhcCee-EEEEeeCC-----CCceEEEEeccce--eEEEEc Confidence 2222 2222333 444433 23566666654 68889986 57888643 4444444333221 111111 Q ss_pred c-----------eeecCCCccccccccccccccc-----cccccccccccccCCCc---cc-cc-cccEEEEeeCCccCc Q lcl|NC_017974. 168 P-----------IVYNEDGSAIIGIKQSLSAFQN-----YTASEIGVGGVSTRMSD---VI-IP-INRVMLMNTGGSSSQ 226 (507) Q Consensus 168 ~-----------~~~~~d~~~l~~~~q~~~~~~~-----~~~~~~~~~~~~~~~~~---~~-iP-~~k~i~~~~~~~~~~ 226 (507) + ++...|+.......-.+..... +................ +. .| ....+.+. .+.| T Consensus 144 ~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv---~~~N 220 (456) T protein:vir:79 144 PLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVV---VYQN 220 (456) T ss_pred CCCCCceEEEEEEEEecCCceeEEEEEcCCceEEEEEEEEeeccccceeeeccCCceeecccccCCCCceeEE---EecC Confidence 0 0111111100000000000000 00000000000000000 00 00 01111111 1357 Q ss_pred cccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEee Q lcl|NC_017974. 227 ALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFML 306 (507) Q Consensus 227 p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~ii 306 (507) +.|.|.+..+-...=--+..+..-+..++-+..|..++.- ..... ...+... ..+..... +..+..+...+ T Consensus 221 ~~~~gd~e~v~~liD~~~~~~s~~~~~~~~~a~~~~~~~G-~~~~~-~~~d~~g---~~i~~~~~----~~~~~~~~~~~ 291 (456) T protein:vir:79 221 PDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKS-SEHRL-PKVDENG---NAIDYASI----FEAAPGALWEL 291 (456) T ss_pred CCCCchhhhhHHHHHHHHHHHHHHHHHHHHHhhHHHHHhc-CCccc-ccccccc---cccchhhh----hhhhccccccC Confidence 7888888775432111112222333344444444444320 00000 0000000 01111111 11122234456 Q ss_pred cCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccc-cc-cchhhhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 307 PSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINV-GN-DKGGSYNLSESKQTIHTQFVQRVNEII 384 (507) Q Consensus 307 P~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts-~~-~~~GS~Al~~vh~~v~~~~~~aD~~~i 384 (507) |.|.++. +|..+ +...|...++..-.+|+...--..-.. +. +..++-|+ +....-....++.-.+.+ T Consensus 292 ~~~~~~~------q~~~~----~~~~~~~~l~~~i~~i~~~t~~p~~~~~~~~~N~Sg~Al-~~~~~~l~~k~~~~~~~f 360 (456) T protein:vir:79 292 PPGVDIW------ESQTN----DFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGA-HNIEKGFLFKCEDRLSIA 360 (456) T ss_pred CCCccee------eeccc----ChHHHHHHHHHHHHHHHhhcCCChhHhcccccCcHHHHH-HHHHHHHHHHHHHHHHHH Confidence 7766431 22222 122355566666666654321110000 01 11111222 222233334444455666 Q ss_pred HHHHHHHHHHHHHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCccccc Q lcl|NC_017974. 385 LEALNENLLPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQ 464 (507) Q Consensus 385 ~~~ln~~li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~ 464 (507) ...|. ++++.++.+.+. .+..-.++.|....+.++.+.|+++.+|+.+|+... .-.++.+|+.+.+-+ . T Consensus 361 ~~~l~-~~~~l~~~~~g~--~~~~~i~v~w~~~~~~s~~~~ada~~kl~~~G~~~~-----~~~~~~lg~~~~~i~---~ 429 (456) T protein:vir:79 361 KIGLE-AILVKALQIEGE--SVEDTVDVSFESPDRVTLGEKYSAASLAKAAGESWA-----SIRRNILNYNADQIK---Q 429 (456) T ss_pred HHHHH-HHHHHHHHhcCC--CccccceEEeCCCCCcCHHHHHHHHHHHHhcCCChH-----HHHHhcCCCCHHHHH---H Confidence 77775 477777777742 222234677888888888999999999999998332 235677888532111 1 Q ss_pred chhhhcCCCcCCCCCCcccccCcccccccccCCCCchhhh Q lcl|NC_017974. 465 EELMKLLGEDTSRAGDGMTKGSSGNGTGKISSTRDNSAAN 504 (507) Q Consensus 465 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~a~~ 504 (507) .+......+ ...-..++.+ .+++.+|. T Consensus 430 ~e~~r~~~e-~~~~~~~~~~------------~~~~~~~~ 456 (456) T protein:vir:79 430 DDLDRAREQ-ITLFAGNPVQ------------RPQEDGSR 456 (456) T ss_pred HHHHHHHHH-HHHHhhhHhh------------cCCCCCCC Confidence 010000000 0000000000 11111111 No 160 >protein:vir:79538 Length: 502 # NCBI annotation: putative portal protein # Family: family:all:47 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272517;genbank:gi:148609386;genbank:GeneID:5204374 Probab=95.77 E-value=0.0015 Score=36.10 Aligned_cols=450 Identities=13% Similarity=-0.002 Sum_probs=176.4 Q ss_pred CcccccccccccCcccc-ccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHh-hChHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKAT-SEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMK-QDATVATGLDMLYT 78 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~-~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~-~D~~v~s~l~~Rk~ 78 (507) ++=++..++........ -......+ +.+-.+. .......... ....| ..--+++. .++++.+++..... T Consensus 11 ~sP~~~~~R~~ar~~~~~y~aa~~~r-~~~~~~~--~~s~~~~~~~-~~~~l-----r~RaRdl~rNn~~a~~av~~~~~ 81 (502) T protein:vir:79 11 FSPGWKAARLRSRAVIQAYEAVKTTR-THKARRE--NRTADQLSQY-GAVSL-----REQARYLDNNHDLVIGVFDKLEE 81 (502) T ss_pred cChHHHHHHHhhHHHHhhccccCccc-ccCCCCC--CCChHHHHHH-HHHHH-----HHHHHHHHhcChHHHHHHHHHHH Confidence 22111111100000000 00000011 0000000 0001111100 11111 12224443 59999999999999 Q ss_pred HHhhC-CceeeeCCC-CC-hhHHHHHHHHHHHHHhc-------cCCCHHHHHHHHHHH-HhhcceeeeeeeeccCccccC Q lcl|NC_017974. 79 FVEKA-FKDFKVIPG-ES-EESKKAAKFVEYCLKNM-------EGQTLRQFARDAATF-NEYGLSVVEKVYTQVTVGEYI 147 (507) Q Consensus 79 av~~~-~~~~~~~~~-~~-~~~~~~ae~v~~~l~~~-------~~~~~~~~l~~~l~a-~~~Gfs~~Ei~w~~~~~~~~~ 147 (507) .|-+. .+.+.+.+. .+ ..+++.++.|++.++.- ...+|..+...++.+ +.-|=.++-++|.+... ..+ T Consensus 82 nvVG~ggi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~~~~~~~-~~~ 160 (502) T protein:vir:79 82 RVVGKNGIIVEPHPVLRNGAIARDLAAEIRTRWSEWSVSPEVTGQFTRPMLERLMLRTWLRDGEVFAQMVSGRINS-LTP 160 (502) T ss_pred hhccCCceeeeeccCCCChhHHHHHHHHHHHHHHHhhcCcCccccCCHHHHHHHHHHHHHhCCceEEEEeecccCc-cCC Confidence 99876 455555442 12 23344555555544311 113577777766654 45788888888865321 112 Q ss_pred cceeeeeccccCchhcccc--------cceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEe Q lcl|NC_017974. 148 GKYKVKNLAFRPQASLSRT--------NPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMN 219 (507) Q Consensus 148 g~~~~~~l~~r~~~~i~~~--------~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~ 219 (507) |.-..-+|...+++-|... +-+.|+.+|+-+-++-. ..++++ .....-..+|... |+|. T Consensus 161 g~~~~l~lq~iepd~l~~~~~~~~~i~~GVe~d~~Gr~~aY~i~------~~hPgd------~~~~~~~rvpA~~-vlH~ 227 (502) T protein:vir:79 161 SAGVHFWLEALEPDFIPMTSDESNRLNQGVFVDDWGRPEKYLVY------KSRPVS------GRQMETKEVDAER-MLHL 227 (502) T ss_pred CcccceEEEEecchhcCCCCCCCCeeEeeeEECCCCceEEEEEe------ecCCCC------CcccceeEechhh-eEEe Confidence 2222222233333333211 12344455443333211 111111 0112335677765 5555 Q ss_pred eC-CccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCce-EEeccc--ccCCCCCCCchHHHHHHHHHHHHHHHH Q lcl|NC_017974. 220 TG-GSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVI-ELKIPS--QILNKAAMDQSSPEAEMVRGLMSDAAN 295 (507) Q Consensus 220 ~~-~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~-v~~~P~--~~~k~~~~~~~~~~~~~l~a~~~~~~n 295 (507) +. .+.+..-|.+.|.++.....-........++-. |-..-+. +++.+. ........+..... . .. T Consensus 228 f~~~r~gQ~RGis~lapvl~~l~~l~~~~dael~~a-~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~------~----~~ 296 (502) T protein:vir:79 228 KFVRRLHQMRGTSLLSGVLIRLSALKEYEDSELTAA-RIAAALGMYIRKGDGQSYEPDGNGSKENER------E----LT 296 (502) T ss_pred ecccCCccccCCchHHHHHHHHHHHhHHHHHHHHHH-HHhhhheeeeecCCCcccccccCCCCCccc------c----cc Confidence 54 558888899999988766543333332222222 1122221 223221 11110001111000 0 01 Q ss_pred hhcCCceeE-eecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhc--ccccccccchhhhhHHHHHHHH Q lcl|NC_017974. 296 AHSGEQSFF-MLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGA--GFINVGNDKGGSYNLSESKQTI 372 (507) Q Consensus 296 ~~~G~~a~~-iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLG--qtLts~~~~~GS~Al~~vh~~v 372 (507) +..| +.+ .++.|.++++. ..+..+. .|..+++..-++|+..+.- +.||. +- +++||.+-.-..- T Consensus 297 l~pG--~i~~~L~pGe~i~~~-------~p~~p~~--~~~~f~~~~lr~iaaglGi~ye~lt~-D~-s~nySs~R~~~~e 363 (502) T protein:vir:79 297 IQPG--IIYDDLKPGEEIGMV-------KSDRPNP--NLETFRNGQLRAVAAGSRLSFSSTAR-NY-NGTYSAQRQELVE 363 (502) T ss_pred ccCC--ccccccCCCceeeee-------CCCCCCC--CHHHHHHHHHHHHHhhcCCCHHHHhc-cc-cchHHHHHHHHHH Confidence 1112 122 35677766543 3333322 4678899999999887622 23443 33 3477755444444 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH---hCCCc-CCc----cccceEEe--cCcCchhHHHHHHHHHHHHhcCCccccH Q lcl|NC_017974. 373 HTQFVQRVNEIILEALNENLLPQLLA---LNDMR-LPE----TEMPYVKA--GEIVDVDMEGFSKAIQRIGAVGYLPKTP 442 (507) Q Consensus 373 ~~~~~~aD~~~i~~~ln~~li~~l~~---lN~~~-~~~----~~~P~~~~--~~~~~~dl~~~a~~~~~L~~~G~~~~~~ 442 (507) +....+....++...+-+-+...+++ +++.. .|. ..+....+ ......|..+-+++....+++|+.... T Consensus 364 ~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~~W~~p~~~~iDP~Ke~~a~~~~i~~Gl~t~~- 442 (502) T protein:vir:79 364 STDGYLILQDWFIGAVTRPMYRAWLKQAVASGVIRLPRDLDRSSLYTAVYSGPVMPWIDPVKEAEAWKIQIRGGAATES- 442 (502) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCCCCchhhcceeeecCCccccChHHHHHHHHHHHHcCCCCHH- Confidence 44455555555554444433333332 33211 111 11222222 334456777777888888899986552 Q ss_pred HHHHHHHHHcCCCCCCC-cccccch---hhhcCCCcCCCCCCcccccCcccccccccCCCCchhhh Q lcl|NC_017974. 443 KVINRVLEVLGIDEKIE-EDISQEE---LMKLLGEDTSRAGDGMTKGSSGNGTGKISSTRDNSAAN 504 (507) Q Consensus 443 ~~~~~i~e~~Gip~~~~-~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~a~~ 504 (507) +.+++ .|.....- ++...+. ....++-+..+...... ++..++...++.-++.... T Consensus 443 ---~~~a~-~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~~--~~~~~~~~e~~~~~~~~e~ 502 (502) T protein:vir:79 443 ---DWVRA-GGRNPDDVKRRRKAEIDENRKLDLVFDTDPASDKGG--SSAATKRQEPQHTDDQSEE 502 (502) T ss_pred ---HHHHH-cCCCHHHHHHHHHHHHHHHHHcCCCCCCCCCCCCCC--CCCCCCCCCCCCCCCCCCC Confidence 22322 24421100 0000000 00000000000000000 0000000000000000000 No 161 >protein:vir:101494 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655388;genbank:gi:109522576;genbank:GeneID:4157566 Probab=95.66 E-value=0.0017 Score=35.83 Aligned_cols=445 Identities=11% Similarity=0.026 Sum_probs=183.4 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhh--ChHHHHHHHHHH- Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQ--DATVATGLDMLY- 77 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~--D~~v~s~l~~Rk- 77 (507) |...|+-= ..++| ++-++ . .+-++-...+-++-..|++|+++-. -.++.++++-+. T Consensus 1 ~~~~~~~~---~~~~~---------~~~g~---~------~~p~~v~~~d~~Rl~aY~l~~~~y~n~~~~~~~~lrg~~~ 59 (527) T protein:vir:10 1 MGQDKRQY---GSTQQ---------LRAGE---A------NFPNAVTDFDKARLASYRLYEDMYLTNTSDYQVILRGGDE 59 (527) T ss_pred CCcccccc---CCCcC---------cCCcc---c------cCcccCCHHHHHHHHHHHHHHHHhcCchhheeeecCCccc Confidence 43222111 11111 11111 0 0000011111233445666666543 135555544333 Q ss_pred -----------HHHhhCCceeeeCCCC---ChhHHHHHHHHHHHHHhccCCCHHHHHHH-HHHHHhhcceeeeeeeeccC Q lcl|NC_017974. 78 -----------TFVEKAFKDFKVIPGE---SEESKKAAKFVEYCLKNMEGQTLRQFARD-AATFNEYGLSVVEKVYTQVT 142 (507) Q Consensus 78 -----------~av~~~~~~~~~~~~~---~~~~~~~ae~v~~~l~~~~~~~~~~~l~~-~l~a~~~Gfs~~Ei~w~~~~ 142 (507) ..|....-.|.+++.+ +..+.++.+.++.+.+. ..|+....+ --++..-|=.|+=+.|.... T Consensus 60 ~~~r~~~~ps~~~~~~~~~~~~~~g~~~~~~~~~e~v~~~lr~~~~~---e~l~~~~~~~~r~~~vlGDg~f~l~wD~~k 136 (527) T protein:vir:10 60 GDQRPIYVPNGEKLIEAKMRFLGQGLKWEFSKKDAKVDDAIKVLFDR---ENWEQKFESLKRWTEIRGDYVLLLIGDDEK 136 (527) T ss_pred cccceeeehhhHHhhCCcceeeccCccccccchhHHHHHHHHHHHHH---hhhHHHHHHHHHhhhhhcceeEEEeeccCC Confidence 2445555667766643 23344555555555432 234433222 23688889999999998532 Q ss_pred ccccCcceeeeeccccC---------chhccccc---ceeecCC-Ccccccccc-------cccc------c-----ccc Q lcl|NC_017974. 143 VGEYIGKYKVKNLAFRP---------QASLSRTN---PIVYNED-GSAIIGIKQ-------SLSA------F-----QNY 191 (507) Q Consensus 143 ~~~~~g~~~~~~l~~r~---------~~~i~~~~---~~~~~~d-~~~l~~~~q-------~~~~------~-----~~~ 191 (507) . ..+++.+...-|+- .+.+..++ .+....| .++....+- +..+ . ..| T Consensus 137 ~--~~~R~~v~~~DP~~~f~~ed~d~~~~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt~~~w 214 (527) T protein:vir:10 137 D--EGSRLSLHEVDPSTYFPYEDPRYPGQVLGVYLVDEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYTEELY 214 (527) T ss_pred C--cCCCceEeecCcceeeeeecCCCCCceeeEEEeeeccCCccccccceehhhhhhhhhcCcccccccCcceeeeecee Confidence 1 11233332221110 00000000 0000001 100000000 0000 0 000 Q ss_pred cccccc-----------ccccccC--CCcccccc--ccEEEEeeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 192 TASEIG-----------VGGVSTR--MSDVIIPI--NRVMLMNTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATK 256 (507) Q Consensus 192 ~~~~~~-----------~~~~~~~--~~~~~iP~--~k~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er 256 (507) ..++.. +...... ....+.|. =-++.|...+..+..+|.|-|..+.-..---+..+.+....++- T Consensus 215 ~lg~w~d~~e~p~~~~~~~~~~~~~~l~~lp~pi~fiPvV~~~t~p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~ 294 (527) T protein:vir:10 215 EPGKWDDRPESPLEPDDIKKLSTLTEEEPLPEQITTLPVFHFRGHPIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVF 294 (527) T ss_pred eccccccccccccchhhhhhhcCceeeecccCCCCccceEeecCCCccccccChhhHhHHHHHHHHHhhhhhHHHHHHHH Confidence 000000 0000000 01112222 13455677788899999999988877666666777777777887 Q ss_pred hcCCceEEe-cccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccccceeeeeccCCCcchhHHH Q lcl|NC_017974. 257 DMGGVIELK-IPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQ 335 (507) Q Consensus 257 ~g~p~~v~~-~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~ 335 (507) .|.||.++. +++. . ..+... .. ++ |..+..=+|++.++.+ .++..+...|.. T Consensus 295 sG~Pi~~~tg~~~v-d--~~G~~~-----------~~--~V--gPG~iweL~e~ak~~~---------v~~~~~la~~~~ 347 (527) T protein:vir:10 295 GGLGFYATDSAPPR-D--SRGNMV-----------PW--TI--SPLGMVEHGQNNKIYR---------VNGVASLEPSQT 347 (527) T ss_pred hCCceeeecccccc-c--ccCCcC-----------cc--cc--CCceeEecCCCcceee---------ccchhhhHHHHH Confidence 888887775 2221 0 000000 00 11 2212233566554432 233334445788 Q ss_pred HHHHHHHHHHHHHhccccccc--c-cch-hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH----H-HHHHHHhCCCcCCc Q lcl|NC_017974. 336 LISDRKKSILDRLGAGFINVG--N-DKG-GSYNLSESKQTIHTQFVQRVNEIILEALNEN----L-LPQLLALNDMRLPE 406 (507) Q Consensus 336 li~~~d~~Isk~iLGqtLts~--~-~~~-GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~----l-i~~l~~lN~~~~~~ 406 (507) .++++.+.|+-.---.-...+ + +.. +.+|+ +....-....++.....+..+. ++ + +.||-.+-+..+-+ T Consensus 348 h~~~L~~~l~~vA~~PavA~G~vD~s~~~SG~AL-eL~L~PLlar~~rk~L~~~~vq-rq~~~~~~~~~L~aye~v~~~d 425 (527) T protein:vir:10 348 HMTKAEEAMQQTKGIPDIAVGVVDAAVAESGIAL-DLKLSAILSSCAEQELELKSVL-KQFFYNLVTQWLPAYEGVGIDD 425 (527) T ss_pred HHHHHHHHHHHhhcCCeeeeccccCCcCcHHHHH-HHHHHHHHHHHHHHHHHHHHHH-HHhhhhhHHHHHHHhhhcccCC Confidence 899999988765433333333 1 111 22343 2333333333333332222222 22 2 23433322211222 Q ss_pred c---ccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcc-cccchhhhcCCCcCCCCCCcc Q lcl|NC_017974. 407 T---EMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEED-ISQEELMKLLGEDTSRAGDGM 482 (507) Q Consensus 407 ~---~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~-~~~~~~~~~~~~~~~~~~~~~ 482 (507) . ..-+++|...-+.|.++..+.+.+|+++|++.. .-..+.+.+.-|+..+..+- ..... ...++.+...+.++. T Consensus 426 ~~~~~~v~ivf~p~lP~D~~avie~v~tL~~aGi~S~-~tAv~~L~~~~g~eD~E~E~~~I~~e-ra~~a~a~a~A~~~~ 503 (527) T protein:vir:10 426 ADKKLTVTITFRDPKPVNSEKRFNQLLQLWEAGLIPA-KKLTEELSKIMGFELTEEDFKQATED-KKTQGIAQAEAADPF 503 (527) T ss_pred CccccceEEEecccCCCCHHHHHHHHHHHHHcCchhH-HHHHHHHHhccCCCChHHHHHHHHHH-HHHHhHHhhhhcCch Confidence 1 122789999999999999999999999999654 22223344444664332221 11111 111111111111111 Q ss_pred cccCcccccccccCCCCchhhhccC Q lcl|NC_017974. 483 TKGSSGNGTGKISSTRDNSAANLDN 507 (507) Q Consensus 483 ~~~~~~~~~~~~~~~~d~~a~~~~~ 507 (507) .+...+++ ...+...-+.-| T Consensus 504 ~a~~~~~~-----g~~~~~~d~~~~ 523 (527) T protein:vir:10 504 GAQMAAEQ-----GIPDEEDDQALN 523 (527) T ss_pred hhhhcccc-----CCCCCCcccccC Confidence 11101110 011111112223 No 162 >protein:vir:105154 Length: 525 # NCBI annotation: conserved phage-related protein # Family: family:all:6660 # MgeID: mge:1466 # MgeName: C-St # Cross-refs: genbank:acc:YP_398597;genbank:gi:80159853;genbank:GeneID:3772992 Probab=95.58 E-value=0.0018 Score=35.61 Aligned_cols=456 Identities=17% Similarity=0.183 Sum_probs=207.9 Q ss_pred CcccccccccccCccccccc-------cccccccccccCchhhhHHHHhhh--hc-------ccchhc-c---c-hhHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQ-------INVSRMRMSEQGTFALAKVQVDSE--RM-------KAEEIR-W---P-NLIGT 59 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~-------~~~~~~~~~~~g~~~~~~~~~~~~--~e-------~~~~lr-~---~-~~~~~ 59 (507) |+|-|..+....+-.-...+ ++.-..+++.+....-+-..+|+. .. ..+-|+ | | +.+.- T Consensus 1 ~~~~~~~~~~~~t~~k~~~~~e~~~~~~n~~~~~y~ty~~~~~~f~~gfv~~~~~ng~i~~v~~~~l~~~f~npd~~~~~ 80 (525) T protein:vir:10 1 MTRTKGSKNKSTTIEKQSLQIEQLQEHINELERQYNTYDDVVDAFIDGFVMDLCNNGKIKTVNLDTLQLWFNNPDKYINN 80 (525) T ss_pred CCCCcCCcccccchhhhhhhHHHHHHHHhhhhhhcchhhhHHHHHHHHHHHHhhcCCceeeeeHHHHHhhhcChHHHHHH Confidence 88876544332221111111 111122344443322222233321 00 011111 0 1 11111 Q ss_pred HHHH-----hhChHHHHHHHHHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHh-ccCCCHHHHHHHHHHHHhhccee Q lcl|NC_017974. 60 VESM-----KQDATVATGLDMLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKN-MEGQTLRQFARDAATFNEYGLSV 133 (507) Q Consensus 60 y~~m-----~~D~~v~s~l~~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~-~~~~~~~~~l~~~l~a~~~Gfs~ 133 (507) .+.+ -.|+.|....+-. .++-.+++.+.+-.. ....++--..++-.|+. +.- ..+-+++|--+-..=.+ T Consensus 81 i~~l~~y~yi~~~~v~ql~~li-~~lp~l~y~i~~~~~-~k~~~~~~s~~n~~l~k~i~h---k~ltrdll~q~a~~gtl 155 (525) T protein:vir:10 81 IVNLLTYYYIIDGNVFQLYDLI-FSLPPLDYQIKVLKR-DKDYKEDLSTINLYLEKKIQH---KQLTRDLLVQLAHSGTL 155 (525) T ss_pred HHHHHHHhhhhcchHHHHHHHH-HhcCCcceeehhhhh-ccchhhHHHHHHHHHHHhHHH---HHHHHHHHHHhhccCce Confidence 1222 1477776655543 334445555554322 12233333445544442 211 12233333221111111 Q ss_pred eeeeeeccC----------------ccccCcceee-eeccccCchhcccccceeecCCCcccc------cccccccc-cc Q lcl|NC_017974. 134 VEKVYTQVT----------------VGEYIGKYKV-KNLAFRPQASLSRTNPIVYNEDGSAII------GIKQSLSA-FQ 189 (507) Q Consensus 134 ~Ei~w~~~~----------------~~~~~g~~~~-~~l~~r~~~~i~~~~~~~~~~d~~~l~------~~~q~~~~-~~ 189 (507) + -.|=-.. .+...|.++. .++. .++ .+.++-+.++ .+.|+.-. |. T Consensus 156 i-g~wlg~~~~py~~vf~~~kyvfp~~r~~g~~v~vid~~-----~f~-----~~~~~~r~~~~~~lsp~i~~~~y~~~~ 224 (525) T protein:vir:10 156 I-GTWLGSKREPYFNVFNNLKYVFPYGRAKGKMVAVIDLQ-----WFD-----EMSELERKLTFENLSPLITENKYKKWK 224 (525) T ss_pred e-EeeecCCCCcchhhhhhhhhhccccccCCceEEEEehH-----Hhh-----hhhHHHHHHHHHhhchhhhhhhhhHHh Confidence 1 1221110 0111222221 0110 000 1112222111 11111111 11 Q ss_pred ccccccccccccccCCCccccccccEEEEeeCCccCccc-cchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEeccc Q lcl|NC_017974. 190 NYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQAL-GVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPS 268 (507) Q Consensus 190 ~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~-G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~ 268 (507) .+.+.+ .....-++||..+.++.+...-+.||- |.++..+.+.....|+......-..+.|-..+|.+.++-- T Consensus 225 ~~~~~~------~~~~r~i~LP~e~t~~lr~~tl~rnqrlG~s~vtp~l~dI~hk~klrd~EqsIA~kii~a~avLk~gg 298 (525) T protein:vir:10 225 EYNGEN------EDALRYIMLPISKTLVARIHTLSRNQRLGIPYGTQTLFDIQHKQKLRDLEQSIADKIIKAMAVLKFRG 298 (525) T ss_pred hccccc------chhheeeecccceeEEeeecccccCcccCcchhhhHHHHHHHHHHHHHHHHHHHHHhhhhheeeeecc Confidence 111100 112235678999999888877777776 8888888888999999988888889999988888887533 Q ss_pred ccCCCCCCCchHHHHHHHHHHHHHHHH-hhc--CCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHH Q lcl|NC_017974. 269 QILNKAAMDQSSPEAEMVRGLMSDAAN-AHS--GEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSIL 345 (507) Q Consensus 269 ~~~k~~~~~~~~~~~~~l~a~~~~~~n-~~~--G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Is 345 (507) .-|. ...-++..++..+..+..+..- +.+ | -+.+.||.=..++|++. ++. . ...+- +-.+..++-|. T Consensus 299 ~~gn-~mk~p~~~kqkil~gVk~aleK~~kdK~G-i~vi~~Pdfa~~efp~i----k~~--~-~glDg-~K~d~I~~DI~ 368 (525) T protein:vir:10 299 KDDN-DSKVKESAKRKVLAGVKRALEKGVKDKNG-IACIAMPDFATFEFPEI----KNG--D-KTLDP-KKYDSIDNDIT 368 (525) T ss_pred ccCc-cccCchHHHHHHHHHHHHHHhcccccccC-eEEEeccceeecccccc----cCc--c-cCCCc-hhhhhhhhhhh Confidence 3332 2223444556666655555422 222 2 13345598887777642 221 1 11121 23466777774 Q ss_pred H-HHhcccccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCccccc--eEEecCcCchhH Q lcl|NC_017974. 346 D-RLGAGFINVGNDKGGSYNLSESKQTIHTQFVQRVNEIILEALNENLLPQLLALNDMRLPETEMP--YVKAGEIVDVDM 422 (507) Q Consensus 346 k-~iLGqtLts~~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~~P--~~~~~~~~~~dl 422 (507) - .-|.+.|++ |.+|.||.+.+..+++-..+---.+.|.++-| +|+.|++. ++.-. .|.++...+.++ T Consensus 369 ~A~GlS~sL~n--GdggNyAtaslnld~fykkigVm~e~Iee~y~-kL~d~Vl~-------~~k~~nyifnydkd~pi~~ 438 (525) T protein:vir:10 369 NATGISQVLTN--GTKGNYASAKLNLDVFYKKIGVMLEIIEEIYN-QLIDIILG-------EEKGCNYIFQYNKDTPIER 438 (525) T ss_pred hhhccceeeec--CCCCceeeeeeeHHHHHHHHHHHHHHHHHHHH-HHHhhhcC-------cccCcceEEecCCCchhhh Confidence 4 446677877 55799999999999998888888888887665 57777643 22211 356788888899 Q ss_pred HHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccc-cchhhhcCCCcCCCCCCcc-cccCcccccccccCCCCc Q lcl|NC_017974. 423 EGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDIS-QEELMKLLGEDTSRAGDGM-TKGSSGNGTGKISSTRDN 500 (507) Q Consensus 423 ~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~-~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~d~ 500 (507) ++..+.+=+|.+.|+... ++-+..|+.....-+.+ -+.....+-+..-|+--.+ -.+.+++.-+. |+-- T Consensus 439 kkk~d~LIkL~d~g~s~k------~vldl~gis~e~y~E~s~yEtE~lkl~EKi~pp~~~~v~SGk~~n~iG~-P~~d-- 509 (525) T protein:vir:10 439 EKKLDTLIKLEAQGYSAK------YVLDILGISSEEYFEESIYEIEKLKLREKIMPPLNTNVLSGKDGNDIGS-PKLD-- 509 (525) T ss_pred hhhhhhhhhhhccchhhh------hhhhhhccCcchHHHHHHHHHHHHHHhhhccccccceeeeccccccccC-CccC-- Confidence 988888889988887533 44455566532111110 0000000000000000000 00011111110 0000 Q ss_pred hhhhccC Q lcl|NC_017974. 501 SAANLDN 507 (507) Q Consensus 501 ~a~~~~~ 507 (507) ...+.+| T Consensus 510 d~~~~da 516 (525) T protein:vir:10 510 DSDSSDA 516 (525) T ss_pred CCcchhh Confidence 1111111 No 163 >protein:vir:102239 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655205;genbank:gi:109522785;genbank:GeneID:4157478 Probab=95.57 E-value=0.0018 Score=35.60 Aligned_cols=445 Identities=11% Similarity=0.022 Sum_probs=183.1 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhh--ChHHHHHHHHHH- Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQ--DATVATGLDMLY- 77 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~--D~~v~s~l~~Rk- 77 (507) |...|+-= ..++| ++-++ . .+-++-...+-++-..|++|+++-. -.++.+++.-+. T Consensus 1 ~~~~~~~~---~~~~~---------~~~g~---~------~~p~~v~~~d~~Rl~aY~l~~~~y~n~~~~~~~~lrg~~~ 59 (527) T protein:vir:10 1 MGQDKRQY---GSTQQ---------LRAGE---A------NFPNAVTDFDKARLASYRLYEDMYLTNTSDYQVILRGGDE 59 (527) T ss_pred CCcccccc---CCCcC---------cCCcc---c------cCcccCCHHHHHHHHHHHHHHHHhcCchhheeeecCCccc Confidence 43222111 11111 11111 0 0000011111233445666666543 135555544333 Q ss_pred -----------HHHhhCCceeeeCCCC---ChhHHHHHHHHHHHHHhccCCCHHHHHHH-HHHHHhhcceeeeeeeeccC Q lcl|NC_017974. 78 -----------TFVEKAFKDFKVIPGE---SEESKKAAKFVEYCLKNMEGQTLRQFARD-AATFNEYGLSVVEKVYTQVT 142 (507) Q Consensus 78 -----------~av~~~~~~~~~~~~~---~~~~~~~ae~v~~~l~~~~~~~~~~~l~~-~l~a~~~Gfs~~Ei~w~~~~ 142 (507) ..|....-.|.+++.+ +..+.++.+.++.+.+. ..|+....+ --++..-|=.|+=+.|.... T Consensus 60 ~~~r~~~~ps~~~~~~~~~~~~~~g~~~~~~~~~e~v~~~lr~~~~~---e~l~~~~~~~~r~~~vlGDg~f~l~wD~~k 136 (527) T protein:vir:10 60 GDQRPIYVPNGEKLIEAKMRFLGQGLKWEFSKKDAKVDDAIRVLFDR---ENWEQKFESLKRWTEIRGDYVLLLIGDDEK 136 (527) T ss_pred cccceeeehhhHHhhCCcceeeccCccccccchhHHHHHHHHHHHHH---hhhHHHHHHHHHhhhhhcceeEEEeeccCC Confidence 2445555667776643 23344555555554432 234433222 23688889999999998532 Q ss_pred ccccCcceeeeeccccC---------chhccccc---ceeecCC-Ccccccccc-------cccc------c-----ccc Q lcl|NC_017974. 143 VGEYIGKYKVKNLAFRP---------QASLSRTN---PIVYNED-GSAIIGIKQ-------SLSA------F-----QNY 191 (507) Q Consensus 143 ~~~~~g~~~~~~l~~r~---------~~~i~~~~---~~~~~~d-~~~l~~~~q-------~~~~------~-----~~~ 191 (507) . ..+++.+...-|+- .+.+..++ .+....| .++....+- +..+ . ..| T Consensus 137 ~--~~~R~~v~~~DP~~~f~~ed~d~~~~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt~~~w 214 (527) T protein:vir:10 137 D--EGSRLSLHEVDPSTYFPYEDPRYPGQVLGVYLVDEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYTEELY 214 (527) T ss_pred C--cCCCceEeecCcceeeeeecCCCCCceeeEEEeeeccCCccccccceehhhhhhhhhcCcccccccCcceeeeecee Confidence 1 11233332221110 00000000 0000001 100000000 0000 0 000 Q ss_pred cccccc-----------ccccccC--CCcccccc--ccEEEEeeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 192 TASEIG-----------VGGVSTR--MSDVIIPI--NRVMLMNTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATK 256 (507) Q Consensus 192 ~~~~~~-----------~~~~~~~--~~~~~iP~--~k~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er 256 (507) ..++.. +...... ....+.|. =-++.|...+..+..+|.|-|..+.-..---+..+.+....++- T Consensus 215 ~lg~w~d~~e~p~~~~~~~~~~~~~~l~~lp~pi~fiPvV~~~t~p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~ 294 (527) T protein:vir:10 215 EPGKWDDRPESPLEPDDIKKLSTLTEEEPLPEQITTLPVFHFRGHPIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVF 294 (527) T ss_pred eccccccccccccchhhhhhhcCceeeecccCCCCccceEeecCCCccccccChhhHhHHHHHHHHHhhhhhHHHHHHHH Confidence 000000 0000000 01112222 13455677788899999999988877666666777777777887 Q ss_pred hcCCceEEe-cccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccccceeeeeccCCCcchhHHH Q lcl|NC_017974. 257 DMGGVIELK-IPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQ 335 (507) Q Consensus 257 ~g~p~~v~~-~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~ 335 (507) .|.||.++. +++. . ..+... .. ++ |..+..=+|++.++.+ .++..+...|.. T Consensus 295 sG~Pi~~~tg~~~v-d--~~G~~~-----------~~--~V--gPG~iweL~e~ak~~~---------v~~~~~la~~~~ 347 (527) T protein:vir:10 295 GGLGFYATDSAPPR-D--SRGNMV-----------PW--TI--SPLGMVEHGQNNKIYR---------VNGVASLEPSQT 347 (527) T ss_pred hCCceeeecccccc-c--ccCCcC-----------cc--cc--CCceeEecCCCcceee---------ccchhhhHHHHH Confidence 888887775 2221 0 000000 00 11 2212233566554432 233334445788 Q ss_pred HHHHHHHHHHHHHhccccccc--c-cch-hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH----H-HHHHHHhCCCcCCc Q lcl|NC_017974. 336 LISDRKKSILDRLGAGFINVG--N-DKG-GSYNLSESKQTIHTQFVQRVNEIILEALNEN----L-LPQLLALNDMRLPE 406 (507) Q Consensus 336 li~~~d~~Isk~iLGqtLts~--~-~~~-GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~----l-i~~l~~lN~~~~~~ 406 (507) .++++.+.|+-.---.-...+ + +.. +.+|+ +....-....++.....+..+. ++ + +.||-.+-+..+-+ T Consensus 348 h~~~L~~~l~~vA~~PavA~G~vD~s~~~SG~AL-eL~L~PLlar~~rk~L~~~~Vq-rq~~~~~~~~~L~aye~v~~~d 425 (527) T protein:vir:10 348 HMNKAEEAMQQTKGIPDIAVGVVDAAVAESGIAL-DLKLSAILSSCAEQELELKSVL-KQFFYNLVTQWLPAYEGVGIDD 425 (527) T ss_pred HHHHHHHHHHHhhcCCeeeeccccCCcCcHHHHH-HHHHHHHHHHHHHHHHHHHHHH-HHhhhhhHHHHHHHhhhcccCC Confidence 899999988765433333333 1 111 22343 2333333333333332222222 22 2 23433322211222 Q ss_pred c---ccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcc-cccchhhhcCCCcCCCCCCcc Q lcl|NC_017974. 407 T---EMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEED-ISQEELMKLLGEDTSRAGDGM 482 (507) Q Consensus 407 ~---~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~-~~~~~~~~~~~~~~~~~~~~~ 482 (507) . ..-+++|...-+.|.++..+.+.+|+++|++.. .-..+.+.+.-|+..+..+- ...... ..++.+...+.++. T Consensus 426 ~~~~~~v~ivf~p~lP~D~~avie~v~tL~~aGiiS~-etAv~~L~~~~g~eD~E~E~~~I~~er-a~~a~a~a~a~~~~ 503 (527) T protein:vir:10 426 ADKKLTVTITFRDPKPVNNEKRFAQLLELWEAGLIPA-KKLTEELSKIMGFELTEEDFRQATEDK-KTQGIAQAEAADPF 503 (527) T ss_pred CccccceEEEecccCCCCHHHHHHHHHHHHHcCchhH-HHHHHHHHhccCCCchHHHHHHHHHHH-HHHhHHhhhhcCch Confidence 1 122789999999999999999999999999654 22223344444664332221 111111 11111111111111 Q ss_pred cccCcccccccccCCCCchhhhccC Q lcl|NC_017974. 483 TKGSSGNGTGKISSTRDNSAANLDN 507 (507) Q Consensus 483 ~~~~~~~~~~~~~~~~d~~a~~~~~ 507 (507) .+...+++ ...+...-+.-| T Consensus 504 ~a~~~~~~-----g~~~~~~d~~~~ 523 (527) T protein:vir:10 504 GAQMAAEQ-----GIPDEEDDQALN 523 (527) T ss_pred hhhhcccc-----CCCCCCcccccC Confidence 11101110 011111112223 No 164 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=95.44 E-value=0.0021 Score=35.31 Aligned_cols=416 Identities=12% Similarity=0.105 Sum_probs=142.3 Q ss_pred CcccccccccccCcccccccc--------ccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQI--------NVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATG 72 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~--------~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~ 72 (507) +..+++..-...+..-..... ...+ +....+..+.+. ..-.+..++.+ T Consensus 25 ~~~~~~~~~~~~~~~~i~~~~~yy~g~~~~~~~-~~~~~~~~~~~~--------~~~~~n~~k~i--------------- 80 (496) T protein:vir:38 25 VKDHKKVNANDEDYKYIDMWKRLYQGHYAEWHN-LNYEHNGNPVNR--------RQLSMNLPKVT--------------- 80 (496) T ss_pred HHhcCCCcCCHHHHHHHHHHHHHhcCCCchhhc-chhccCCCcccc--------ceeecchHHHH--------------- Confidence 333332222111100000000 0000 000000000000 00001111111 Q ss_pred HHHHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCccee Q lcl|NC_017974. 73 LDMLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYK 151 (507) Q Consensus 73 l~~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~ 151 (507) ..+--.-+++-+..|.++ +++..++++++++. ..|.+.+.+++ .|..+|-+.+=+.|.. +|.+. T Consensus 81 ~~~~a~~l~~~p~~i~~~------d~~~~e~l~~~~~~---n~f~~~~~~~~~~a~~~G~~~~~~~~D~------~~~~~ 145 (496) T protein:vir:38 81 AKYMSKLLFNEKVKINID------DKAAEEFVLNVLKT---NGFTKNMERYIEYGEAMGGFVIKVYHDG------NKNVK 145 (496) T ss_pred HHHHhhhhhCCcceEeeC------ChHHHHHHHHHHhc---cCHHHHHHHHHHHHhhhCcEEEEEEEcC------CCcEE Confidence 111112223333333332 34667788888764 23666666654 6888998776444432 34444 Q ss_pred eeeccccCchhcccccceeecC--------------CCcccccccccc----cc------ccccccccccc-ccc----c Q lcl|NC_017974. 152 VKNLAFRPQASLSRTNPIVYNE--------------DGSAIIGIKQSL----SA------FQNYTASEIGV-GGV----S 202 (507) Q Consensus 152 ~~~l~~r~~~~i~~~~~~~~~~--------------d~~~l~~~~q~~----~~------~~~~~~~~~~~-~~~----~ 202 (507) +.-+ +++++. |++++. ++.....++-+- .. +.......++. ... . T Consensus 146 i~~v---~~~~~~---P~~~~~~~~~~~~f~~~~~~~~~~y~~le~h~~~~~~~~I~~~~y~~~~~~~~g~~v~~~~~~~ 219 (496) T protein:vir:38 146 VSFA---TADCMY---PLSNDSENVDECVIANSFHKNNKYYTLLEWNEWQGDVYTVTTELYQSDDPNELGTKVSLTLLFD 219 (496) T ss_pred EEEE---cccceE---EEEecCCcEEEEEEEEEEEeCCeEEEEEEEEEEeCceEEEEEEEEecCCccccCcccccccccc Confidence 3322 222221 111111 110000000000 00 00000000000 000 0 Q ss_pred c-----CCCccccccccEEEEe----eCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCC Q lcl|NC_017974. 203 T-----RMSDVIIPINRVMLMN----TGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNK 273 (507) Q Consensus 203 ~-----~~~~~~iP~~k~i~~~----~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~ 273 (507) . ...++.-| -|+.++ .....++|+|.|.+..|-...-.=+..+..|+.-++. | -..+.+|..+.+. T Consensus 220 ~~~~~~~~~~~~~~--~f~~~~~~~~N~~~~~~p~G~Sd~~~~~~lid~ld~~~s~~~~~~~~-~--~~~i~v~~~~l~~ 294 (496) T protein:vir:38 220 DIEPVVPLPDFTRP--TFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKL-G--KKKVLVPSSFVKT 294 (496) T ss_pred ccccceeecCCCcc--eEEEecCCcccccccCCcCCCchHhhHHHHHHHHHHHHHHHHHHHhh-c--ccceecchHHhhc Confidence 0 00011111 122232 2346778999999998865433333344444443432 2 2222333322211 Q ss_pred CCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccc Q lcl|NC_017974. 274 AAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFI 353 (507) Q Consensus 274 ~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtL 353 (507) ......... .... .......++..+.. +... .++. .+..--...|...++.+-++|+..+.-+.- T Consensus 295 ~~~~~g~~~-~~~~----------~~~~~~~~~~~~~~-~~~~-~i~~--~~~~i~~e~~~~~l~~~l~~i~~~~g~~~~ 359 (496) T protein:vir:38 295 AVNLDGSTT-QYFD----------STDEAFFLYQGDQD-DNGK-AIKD--ISVEIRSTEFIESINAMLRIYAMQVGLSAG 359 (496) T ss_pred cCCCCCccc-cCCC----------CccceEEEeecCCC-cccc-ccee--eccccCHHHHHHHHHHHHHHHHHhhCCChh Confidence 111000000 0000 00000111111000 0000 0000 000000112344444444555544311111 Q ss_pred cccccchhhhhHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHh-------CCCcCCccccceEEecCcCchhHHH Q lcl|NC_017974. 354 NVGNDKGGSYNLSESKQTIHT--QFVQRVNEIILEALNENLLPQLLAL-------NDMRLPETEMPYVKAGEIVDVDMEG 424 (507) Q Consensus 354 ts~~~~~GS~Al~~vh~~v~~--~~~~aD~~~i~~~ln~~li~~l~~l-------N~~~~~~~~~P~~~~~~~~~~dl~~ 424 (507) +-+.+.+|.....++...... ..+..-.+.+...|. ++++.++.+ ++...+ ..-+.+.|+..-+.|..+ T Consensus 360 ~f~~~~~g~~tAtei~~~~~~l~~~~~~~~~~~~~~l~-~l~~~il~~~~~~~~~~g~~~~-~~~i~v~f~d~i~~d~~~ 437 (496) T protein:vir:38 360 TFTFDENGLKTATEVVSEKSETYQTKNSHSQLIEQGIK-EMIVSILEVGKFIEAYSGEVVE-LDTITVDFDDSIAQDEDT 437 (496) T ss_pred hcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhcCCCCC-ccceEEEeCCCCCCCHHH Confidence 112122232222233222211 122333444555563 465555532 322222 223678999888999899 Q ss_pred HHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhhcCCCcCCCCCCcccccCccccccc Q lcl|NC_017974. 425 FSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMKLLGEDTSRAGDGMTKGSSGNGTGK 493 (507) Q Consensus 425 ~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 493 (507) .++.+.+++.+|++.. +.++.+.+|+++.+.+++....... .+. +.+..+.. +..++ .+ T Consensus 438 ~~~~~~~~~~~GiiS~----et~l~~~~~~~d~ea~~el~ri~~E-~~~-~~~~~d~~--~~~~~--~e 496 (496) T protein:vir:38 438 TINRYTNAKNQGMIPL----KIALQRAWNITEAEADEWAEMLAKE-KQA-EMPNNDMN--GIFGE--EE 496 (496) T ss_pred HHHHHHHHHhcCCCCH----HHHHHhcCCCChHHHHHHHHHHHHh-hhc-cCcccccc--CCCCC--CC Confidence 9999999999998553 5677787888643322221111111 000 00100000 01111 00 No 165 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=95.42 E-value=0.0021 Score=35.27 Aligned_cols=429 Identities=11% Similarity=0.042 Sum_probs=155.7 Q ss_pred Cccccccccc----c---cCccccccc-----cccccccccccCchhhhHHHHhhhhcccchhc-cc-hhHHHHHHHhhC Q lcl|NC_017974. 1 MSRKRNRNRS----V---KVAKATSEQ-----INVSRMRMSEQGTFALAKVQVDSERMKAEEIR-WP-NLIGTVESMKQD 66 (507) Q Consensus 1 ~~~k~~~~~~----~---~~~~~~~~~-----~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr-~~-~~~~~y~~m~~D 66 (507) |+=..+.|.- . .+.++.+.. +++++..+.. +- .++..- ....+.+- ++ .... ....+.- T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~r---i~--~~~~~y-~g~~~~~~~~~~~~~~-~~~~~~s 73 (508) T protein:vir:15 1 MGLIQRIKDLFWKGAAATGVTGSLSKITDDPRISIDPDEYVR---IQ--TDLDYY-SDKLQYIHYQASDGIK-KKRLKNT 73 (508) T ss_pred CChHHHHHHHHHHHHHHhccccchHHhhcccccccCHHHHHH---HH--HHHHHh-cCCCcccccccCCCCc-cccceee Confidence 3321111111 1 111111111 1112111110 00 011110 00000000 00 0000 0000000 Q ss_pred hHHHHHHHHHHHH-HhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHH-HHHHHhhcceeeeeeeeccCcc Q lcl|NC_017974. 67 ATVATGLDMLYTF-VEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARD-AATFNEYGLSVVEKVYTQVTVG 144 (507) Q Consensus 67 ~~v~s~l~~Rk~a-v~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~-~l~a~~~Gfs~~Ei~w~~~~~~ 144 (507) .-+...+-+.... |.+-.-.+.+.+ +....+++++++++- .|...+.. +.++..+|-.++=..|.. T Consensus 74 ln~~~~i~~~~A~lv~~e~~~i~v~~-----~~~~~e~l~~il~~n---~f~~~~~~~~e~a~a~G~~~~k~~~d~---- 141 (508) T protein:vir:15 74 INMAKTAARRIASVVFNEKAEIHVKD-----NNEADKFLNDVLEDN---DFKNKFEEALEKGVALGGFAMRPYIDG---- 141 (508) T ss_pred cchHHHHHHHHHhhhhCCCceEEeCC-----chHHHHHHHHHHHhc---cHHHHHHHHHHHHhhcCceEEEEEEeC---- Confidence 1122222222222 222222344432 123345777777652 35554444 457888898888666642 Q ss_pred ccCcceeeeeccccCchhcccccceeecCCC----------------cccccc--ccc------ccc-----cccccccc Q lcl|NC_017974. 145 EYIGKYKVKNLAFRPQASLSRTNPIVYNEDG----------------SAIIGI--KQS------LSA-----FQNYTASE 195 (507) Q Consensus 145 ~~~g~~~~~~l~~r~~~~i~~~~~~~~~~d~----------------~~l~~~--~q~------~~~-----~~~~~~~~ 195 (507) +.+.+.- .++.++. |+.++.++ ...... +-+ +.. +....... T Consensus 142 ---~~~~i~~---v~ad~~~---P~~~d~~~~~~~af~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~ 212 (508) T protein:vir:15 142 ---NHIKIAW---VRADQFY---PLQSNTNDISEAAIASRTQRTESNQTKYYTLLEFHQWQDNGSYQITNELYKSDSPDI 212 (508) T ss_pred ---CeeEEEE---EcCCeeE---EEEEcCCCeEEEEEEEEEEeecCCCceEEEEEEEEEEecCcceEEEEEEEecCCchh Confidence 2222221 1111111 11111110 000000 000 000 00000000 Q ss_pred ccccc-cccCC-----------CccccccccEEEEee----CCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_017974. 196 IGVGG-VSTRM-----------SDVIIPINRVMLMNT----GGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMG 259 (507) Q Consensus 196 ~~~~~-~~~~~-----------~~~~iP~~k~i~~~~----~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~ 259 (507) ++... ..... .++.-| -|++++. ....++|+|.|.+..|.-..-.=+..+..|+.-+ +.|- T Consensus 213 lG~~v~l~~~~e~~~l~~~~~~~g~~~p--~f~y~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~-~~~~ 289 (508) T protein:vir:15 213 VGNQVPLSTLPVYKELAPQVTISGLQRP--LFAYFKTPGANNINIESPLGLGVVDNAKHVLDDINDTHDQFIWEI-RLGQ 289 (508) T ss_pred cCcccchhhcccccCCCcceEecCCCcc--eeEEecCCccccccCCCCcCCchHhhhHHHHHHHHHHHHHHHHHH-Hhcc Confidence 00000 00000 111111 1334332 2345789999999988755444444444455444 3333 Q ss_pred CceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHH Q lcl|NC_017974. 260 GVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISD 339 (507) Q Consensus 260 p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~ 339 (507) + -+.+|..+.+++..... .+ ..+...+..++.+..-. ..++.....-. ...|...++. T Consensus 290 ~--~i~v~~~~l~~d~~~~~-----~~----------~~~~~~~~~~~~~~~~~---~~i~~~~~~ir--~e~~~~~~~~ 347 (508) T protein:vir:15 290 K--HIAVQPGMLRFDDEHKP-----TF----------DTEQNVYVGVLSDDNNG---LGVKDMTTPIR--TVQYKDAIDH 347 (508) T ss_pred c--ceeechHHhcCCCCCcc-----cc----------CCCCeeEEeccCCCCCC---CceeEeecccC--hHHHHHHHHH Confidence 2 33345544443221100 00 11222233333221100 00111111100 0124444555 Q ss_pred HHHHHHHHH-hcccccccccchhhhhHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHH---hCCCcCCc------- Q lcl|NC_017974. 340 RKKSILDRL-GAGFINVGNDKGGSYNLSESK--QTIHTQFVQRVNEIILEALNENLLPQLLA---LNDMRLPE------- 406 (507) Q Consensus 340 ~d~~Isk~i-LGqtLts~~~~~GS~Al~~vh--~~v~~~~~~aD~~~i~~~ln~~li~~l~~---lN~~~~~~------- 406 (507) +-+.|...+ ++.. |-+.+.+|....-++. ..-...-+..-.+.+...|. +|++.++. ++..+.++ T Consensus 348 ~l~~~~~~~gls~~-~f~~~~~~~~TAtei~s~~~~~~~t~~~~~~~~~~al~-~lv~~il~l~~~~~~~~~g~~~~~~~ 425 (508) T protein:vir:15 348 FIKEFEVQIGLSTG-TFSYSNDGVKTATEVVSNNSMTYQTRSSYLTMVEKAID-ELCQSIFELANAGALFDDGKPLFTLD 425 (508) T ss_pred HHHHHHHHhCCCch-hcccccCccccHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhccccccccccccc Confidence 555554444 3321 2222223322112221 12223333445566677774 57666554 33221111 Q ss_pred ----cccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhhcCCCcCCCC-CCc Q lcl|NC_017974. 407 ----TEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMKLLGEDTSRA-GDG 481 (507) Q Consensus 407 ----~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~~~~~~~~~~-~~~ 481 (507) ..-+.|.|+..-..|.++.++.+.+++.+|++.. +.++.+.+|+++.+.+++....... .+...... ..+ T Consensus 426 ~~~~~~~v~v~f~D~i~~d~~~~~~~~~~~v~aGi~s~----e~~i~~~~g~~deea~~el~ri~~E-~~~~~~~~~~~~ 500 (508) T protein:vir:15 426 SASQPLDIECHFDDGVFVNKDKQLEEDAKVLAIGALSK----QTFLQRNYGMTDEQAAEELAKIQSE-APTDTFEGGRSA 500 (508) T ss_pred cccCCcceEEEeCCCCCCCHHHHHHHHHHHHhcCCCCH----HHHHHhcCCCChHHHHHHHHHHHHh-ccccCccccccc Confidence 1124688888888898888999999999998554 5788888998754333332221111 11111111 112 Q ss_pred ccccCccc Q lcl|NC_017974. 482 MTKGSSGN 489 (507) Q Consensus 482 ~~~~~~~~ 489 (507) ...++.|| T Consensus 501 ~~~g~~ge 508 (508) T protein:vir:15 501 ILNGGDGE 508 (508) T ss_pred cCCCCCCC Confidence 22333343 No 166 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=95.32 E-value=0.0023 Score=35.04 Aligned_cols=439 Identities=11% Similarity=0.021 Sum_probs=150.0 Q ss_pred Cccccc----ccccc--cCcccc-ccc----cccccccccccCchhhhHHHHhhhhcccchhccchhH-HHHHHHhhChH Q lcl|NC_017974. 1 MSRKRN----RNRSV--KVAKAT-SEQ----INVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLI-GTVESMKQDAT 68 (507) Q Consensus 1 ~~~k~~----~~~~~--~~~~~~-~~~----~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~-~~y~~m~~D~~ 68 (507) |+=--+ .+|.. -..++. .+. ++.++..+.. + ..++.. +.-..+.+...... ....+-+...- T Consensus 1 m~~~~~~k~~~~k~~~~~~~~~~~~i~~~~~i~~~~~~~~~---i--~~~~~~-y~g~~~~~~~~~~~~~~~~~~~~sln 74 (522) T protein:vir:47 1 MSLFQKVKDFFSRGRYYMQTSNLNSILEHPKIAVTQEEYDR---I--KRNLVY-YQSKWDDVQYKNTDGDIKSRPMNHLP 74 (522) T ss_pred CchHHHHHHHHHHHHHHhhcccchhccccCCCCCCHHHHHH---H--HHHHHH-hcCCcccccccccCcchhcccceecc Confidence 111000 00000 011111 100 1111111110 0 011111 01111111100000 00000001112 Q ss_pred HHHHHHHHHHHH-hhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHH-HHHHhhcceeeeeeeeccCcccc Q lcl|NC_017974. 69 VATGLDMLYTFV-EKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDA-ATFNEYGLSVVEKVYTQVTVGEY 146 (507) Q Consensus 69 v~s~l~~Rk~av-~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~-l~a~~~Gfs~~Ei~w~~~~~~~~ 146 (507) +...+-+....+ .+-.-.+.+ .+.+..++++++|++. .|...+... ..|...|=.++=..|..+ T Consensus 75 l~~~i~~~~A~lv~~e~~~i~v------~d~~~~~~l~~~l~~n---~f~~~~~~~~e~a~a~G~~a~k~~~d~~----- 140 (522) T protein:vir:47 75 IARTASKKIASLVYNEQATITT------KNEILQKFLDDMLTND---RFNKNFERYLESCLALGGLAMRPYIDGD----- 140 (522) T ss_pred hHHHHHHHHhhhhcCCcceeec------CChHHHHHHHHHHhhc---chHHHHHHHHHHhhccCCEEEEEEEcCC----- Confidence 223332222222 111112222 2356777888888753 355555554 457778877776666532 Q ss_pred CcceeeeeccccCchhcccccceeecCCCc----------------cccc--cccc-----------------cc----- Q lcl|NC_017974. 147 IGKYKVKNLAFRPQASLSRTNPIVYNEDGS----------------AIIG--IKQS-----------------LS----- 186 (507) Q Consensus 147 ~g~~~~~~l~~r~~~~i~~~~~~~~~~d~~----------------~l~~--~~q~-----------------~~----- 186 (507) .+.+. ..+...+.. ..++.++. .... .+.+ +- T Consensus 141 --~~~i~---~v~ad~~~P---~~~~~~~~~e~a~~~~~~~~~~~~~~~yt~lE~he~~~~~~~~~~~~~~~~~~~I~n~ 212 (522) T protein:vir:47 141 --KVRVA---FIQAPVFFP---LESNTQDVSSAAILTKTIKSEGRKNVYYTLVEFHEWVTADGQETGSTNDKKYYRITNE 212 (522) T ss_pred --ceEEE---EEcCCceEE---EEEcCCceEEEEEEEEEEeecccceeEEEEEEEeeecccccccccccccCCceEEEEE Confidence 22211 111111111 11111110 0000 0000 00 Q ss_pred cccccccccccccc-------cccCCCcccccc-cc--EEEEee----CCccCccccchHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 187 AFQNYTASEIGVGG-------VSTRMSDVIIPI-NR--VMLMNT----GGSSSQALGVSPLVGCYRAWREKILIENLEVV 252 (507) Q Consensus 187 ~~~~~~~~~~~~~~-------~~~~~~~~~iP~-~k--~i~~~~----~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~ 252 (507) .+.......++... +........++. .+ |++|+. ....++|+|.|.+..|.-..-.-+. .|.+ T Consensus 213 ly~~~~~~~lG~~v~l~~~~e~~~l~~~~~~~~~~~Plf~y~~~~~~N~~~~~splG~S~~~~~~~~id~lD~---~~s~ 289 (522) T protein:vir:47 213 LYRSDVNDVLGQRVNLSELDKYKNLEPVTVFENLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINR---SYDE 289 (522) T ss_pred EeecCCCcccCccccccccccccCCCCceEeCCCCcceEEEecCCcccccccCCCcCCchhhhhHHHHHHHHH---HHHH Confidence 00000000000000 001011111111 11 333332 2345789999999988754433333 3334 Q ss_pred HHHHhcCCceEEecccccCCCCCCCchHH--HHHHHHHHHHHHHHhhcCCceeEeecCc----ccccccccceeeeeccC Q lcl|NC_017974. 253 GATKDMGGVIELKIPSQILNKAAMDQSSP--EAEMVRGLMSDAANAHSGEQSFFMLPSD----TKDNAPQYSMTLKGIDG 326 (507) Q Consensus 253 f~Er~g~p~~v~~~P~~~~k~~~~~~~~~--~~~~l~a~~~~~~n~~~G~~a~~iiP~g----~~i~~~~~~~el~~~~g 326 (507) ++.-+=.+=.-+.+|..+.+....-.... ...... .+...+..+..+ ..|...+ .++ . T Consensus 290 ~~~e~~~g~~~i~v~~~~l~~~~~~~~g~~~~~~~fd----------~~~~~f~~~~~~~~~~~~i~~~~--~~i----r 353 (522) T protein:vir:47 290 FMWEVRMGQRRVIVPEHLTQRQYQRPDGTIDFRPRFD----------VEQNVYMQIGGSSMDAGGITDLT--SPI----R 353 (522) T ss_pred HHHHHHhccceeecchHHhccCCCCCCcccccccccC----------cccceEeecCCCCCCCCcceeec--ccc----C Confidence 33321111112334443333211100000 000000 000111111111 1111111 111 1 Q ss_pred CCcchhHHHHHHHHHHHHHH-HHhcccccccccchhhhhHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC-- Q lcl|NC_017974. 327 MGKQYSTAQLISDRKKSILD-RLGAGFINVGNDKGGSYNLSESKQ--TIHTQFVQRVNEIILEALNENLLPQLLALND-- 401 (507) Q Consensus 327 ~g~~~~~~~li~~~d~~Isk-~iLGqtLts~~~~~GS~Al~~vh~--~v~~~~~~aD~~~i~~~ln~~li~~l~~lN~-- 401 (507) . ..|...++.+-+.|+. +.++.. |-+.+.+|...+.++.. .-...-+..-.+.+..+| ++|+..++.+-. T Consensus 354 ~---e~~~~~~~~~l~~i~~~~gls~~-tf~~~~~~~kTAtEi~s~~~~~~~t~~~~~~~~~~al-~~lv~~i~~l~~~~ 428 (522) T protein:vir:47 354 A---NDYILAISEGLKLFEMQIGVSSG-MFTFDGQGMKTATEIVSENSDTYQMRSSIVALVEQSI-KELCVSMCELGKAV 428 (522) T ss_pred h---HHHHHHHHHHHHHHHHHhCCCcc-ccCccccccccHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhhh Confidence 1 1244455555555543 334432 22222223222223321 112223344556666777 467777775421 Q ss_pred ----CcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhhcCCCcCCC Q lcl|NC_017974. 402 ----MRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMKLLGEDTSR 477 (507) Q Consensus 402 ----~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~~~~~~~~~ 477 (507) ...+...-+.+.|+..-.+|.++.++.+.+++.+|++.. +.++.+.+|+++.+..++.........+.. + T Consensus 429 ~~~~~~~~~~~~i~v~f~D~i~~D~~~~~~~~~~~v~aG~~s~----e~~i~~~~g~~eeea~~el~ri~~E~~~~~--~ 502 (522) T protein:vir:47 429 GVYSGEIPELDDISVNLDDGVFTDRHAELDYWAKMVAAGFSTK----KRAIGKTLNISGVEAEKELNAINSELLPMN--D 502 (522) T ss_pred hhccCCCCCcceeEEEcCCCCCCCHHHHHHHHHHHHhcCCCCH----HHHHHhcCCCChHHHHHHHHHHHHhhccCC--C Confidence 111222336788998888898888999999999998654 578899999975433333322222111110 0 Q ss_pred CCCcccccCcccccccccCCCCc Q lcl|NC_017974. 478 AGDGMTKGSSGNGTGKISSTRDN 500 (507) Q Consensus 478 ~~~~~~~~~~~~~~~~~~~~~d~ 500 (507) ....... ++.+...++.-.- T Consensus 503 ~~~~~~~---~~~~~~~~~d~~~ 522 (522) T protein:vir:47 503 AELAIYG---MHDQNEEKADDKG 522 (522) T ss_pred CCCCCCC---CCCcccccCCCCC Confidence 0000000 0000000000000 No 167 >protein:vir:103177 Length: 533 # NCBI annotation: gp131 # Family: family:all:1036 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717798;genbank:gi:113200635;genbank:GeneID:4239186 Probab=95.23 E-value=0.0025 Score=34.86 Aligned_cols=454 Identities=13% Similarity=0.099 Sum_probs=178.8 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhc-cchhHHHHHHHhhChHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIR-WPNLIGTVESMKQDATVATGLDMLYTF 79 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr-~~~~~~~y~~m~~D~~v~s~l~~Rk~a 79 (507) +.+++...+.+..+.|.....+++- ...|..| .... .+. .++ ..++++.|++|..++.|-++++..... T Consensus 9 i~~~~~~~~~~s~~~~~~~dg~~~i---~~~~~~~---~~~~--~e~--~~~~~~eLI~~YR~ma~~pEvd~Av~eIVne 78 (533) T protein:vir:10 9 LERAKKAPKGPSFVQKDNLDGSQPV---SGGGYYG---YTVD--FDG--QVRNEYQLISRYREMVLQPECDSAVDDIVNE 78 (533) T ss_pred cccccccccCCCCCCCCccccccee---ecccccc---eeee--ccc--ccchHHHHHHHHHHHhhccchhhHHHHhhcc Confidence 4444444444333333332222211 0111111 1111 111 122 457999999999999999999988876 Q ss_pred HhhCCce---eeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHHHHHhhccee-----------eeeeeeccCccc Q lcl|NC_017974. 80 VEKAFKD---FKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAATFNEYGLSV-----------VEKVYTQVTVGE 145 (507) Q Consensus 80 v~~~~~~---~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l~a~~~Gfs~-----------~Ei~w~~~~~~~ 145 (507) +...+-. +.++-...+.++.+.+.|.+- |.. |..+|+.--+||.. +.++-.... . T Consensus 79 aiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eE--------F~~-Il~ll~F~~~~~e~fR~WYVDgRi~fHkiid~~~--p 147 (533) T protein:vir:10 79 TICGNFDDVPVSVELSNLKVSDKIKKLIREE--------FGE-ILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPDN--P 147 (533) T ss_pred eeeecCCCceEEEEecccccchHHHHHHHHH--------HHH-HHHHhccchhhhHHHhhhhhcceEEEEEEecCCC--c Confidence 5422111 111112222333333333222 222 22344444444443 333332210 1 Q ss_pred cCcceeeeeccccCchhcccccceee-cCCCcc-c---cccccccccccccccccccccccccCCCccccccccEEEEee Q lcl|NC_017974. 146 YIGKYKVKNLAFRPQASLSRTNPIVY-NEDGSA-I---IGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNT 220 (507) Q Consensus 146 ~~g~~~~~~l~~r~~~~i~~~~~~~~-~~d~~~-l---~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~ 220 (507) -+|-. .|....|+++.+++-... ..++-. . ......-..++.+.+ ... ..+...++.||. .-|+|+| T Consensus 148 k~GI~---ELr~lDPr~i~~vr~i~~~~~~~~~~~~~~~~v~~~~~eyf~Ynp---~g~-~~~~~~~vkI~~-dAI~y~h 219 (533) T protein:vir:10 148 QGGLI---ELRYIDPRKIRKINETEQKRPEQLRGLPLNQQLSPKSAEYFLYDP---KGL-KNSTTQGLKIAP-DSICYVH 219 (533) T ss_pred cccce---eeeeccccceeeeeeeeccCCCccceeecchhhhccceeeeeecc---ccc-cccCCCceecch-hheeeee Confidence 12333 333444445555443321 122211 0 001111111222221 111 223456788887 5688888 Q ss_pred CCc--cCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhh- Q lcl|NC_017974. 221 GGS--SSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAH- 297 (507) Q Consensus 221 ~~~--~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~- 297 (507) ..- .++..=.|.|.++..++==-+.....-+.+-=--.-.=.++|+. +|..+ .....+.|+.++.-..|-. T Consensus 220 SGl~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYID--VGnLP----k~KAeqYlr~iM~k~KNklV 293 (533) T protein:vir:10 220 SGIMDLNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYID--VGNLP----KNKAEQYLREVMGRYRNKLV 293 (533) T ss_pred ccceeCCCCceeccchHhHHHHHhhHHHHhhHHHHhhhccccceEEEEe--cCCCC----chhHHHHHHHHHHhccceEE Confidence 543 22233348888888776444433333333210001111244442 22222 2233345555544332210 Q ss_pred cCCceeE--------------eecC-----cccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhccccccccc Q lcl|NC_017974. 298 SGEQSFF--------------MLPS-----DTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGND 358 (507) Q Consensus 298 ~G~~a~~--------------iiP~-----g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~ 358 (507) -.+..|- -+|. |++|.+. .|+.+.. -..=|+|..+.+-+++--..--.+.. T Consensus 294 YDa~TGev~ddrk~msMlEDyWLPRReGgrgTEItTL---------pGgqnLg-em~DV~YF~kKLY~aLnVP~SRl~~e 363 (533) T protein:vir:10 294 YDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEITTL---------PGGQNLG-ELEDVKYFQKKLYKSLNVPGSRLETE 363 (533) T ss_pred EeccCceecccchhhhhHhhhcccccCCCCccceeec---------cccCCcC-hHHHHHHHHHHHHHHhCCCccccCCC Confidence 0111111 1222 5555442 2332232 23458999999999887765333322 Q ss_pred chhhhhHH---HH-HHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCccc----cceEEecCc------CchhHH Q lcl|NC_017974. 359 KGGSYNLS---ES-KQTI-HTQFVQRVNEIILEALNENLLPQLLALNDMRLPETE----MPYVKAGEI------VDVDME 423 (507) Q Consensus 359 ~~GS~Al~---~v-h~~v-~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~----~P~~~~~~~------~~~dl~ 423 (507) |...+| ++ ..|+ |...+......+...|..-|-..|+-=|.. ..+. ...+.|+.. |-.+.+ T Consensus 364 --~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLiLKgii--t~eeW~~i~~~I~~~f~~Dn~f~ElKe~E 439 (533) T protein:vir:10 364 --TTFNVGRAAEITRDEVKFQKFVARLRKRFSELFTDLLKTQLVLKGVI--SIEEWDQMKEHIQYDYIADNYFAELKEIE 439 (533) T ss_pred --CcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCC--CHHHHHHHhhcceEeeeecchHHHHHHHH Confidence 122222 22 2222 223344444444444443333333221211 1111 122333322 334555 Q ss_pred HHHHHHHHHHhcCCccccHHHHHHHHHH-cCCCCCCCcccc------------cchhhhcCCCcCCCCCCcccccCcccc Q lcl|NC_017974. 424 GFSKAIQRIGAVGYLPKTPKVINRVLEV-LGIDEKIEEDIS------------QEELMKLLGEDTSRAGDGMTKGSSGNG 490 (507) Q Consensus 424 ~~a~~~~~L~~~G~~~~~~~~~~~i~e~-~Gip~~~~~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~~~ 490 (507) .+.+++..|..+--.+-.-.+.+|+++. |.+.+.+-.++. +.+.+...+ ...++++...+..++ T Consensus 440 il~~Rl~~l~~~dpyvGky~S~dyi~k~ILr~tDeei~~~~kqI~~E~k~~~~~~p~~~~~~--~~~~~~~~~~~~~~~- 516 (533) T protein:vir:10 440 IRNERMNQVATMDPFVGKYFSVEYMRRQVLKQTDVEMKEIDKQIESEMESGIIADPAAEMDP--AMAAGDPDAGGAPAE- 516 (533) T ss_pred HHHHHHHHHHHhhhhhccccchHHHHHHHhccCHHHHHHHHHHHHHHHhCCCCCCCcchhhH--HhcCCCCCcCCcccc- Confidence 5556666665541111111234666543 555321110000 000000000 011112222222222 Q ss_pred cccccCCCCchhhhccC Q lcl|NC_017974. 491 TGKISSTRDNSAANLDN 507 (507) Q Consensus 491 ~~~~~~~~d~~a~~~~~ 507 (507) +-+|..++|+.-..|- T Consensus 517 -~~~~~~~~~~~~~~~~ 532 (533) T protein:vir:10 517 -EVAPEGPDPSDERKAE 532 (533) T ss_pred -cCCCCCCCcchhhccC Confidence 2235667776666666 No 168 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=95.13 E-value=0.0027 Score=34.67 Aligned_cols=426 Identities=10% Similarity=-0.013 Sum_probs=162.1 Q ss_pred CcccccccccccCcc-cccccc--ccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHH---Hh---------- Q lcl|NC_017974. 1 MSRKRNRNRSVKVAK-ATSEQI--NVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVES---MK---------- 64 (507) Q Consensus 1 ~~~k~~~~~~~~~~~-~~~~~~--~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~---m~---------- 64 (507) |+.---+.-.+..+- |+.+.. .+...... .-.+. ....-+........-|.-++.+.|+- ++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~e~~~-~~i~~~i~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~ 78 (483) T protein:vir:12 1 MAQALIKGGNILYPSQPTQTEIFDAIVRTNNK-PETLE-EMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATG 78 (483) T ss_pred CccchhcCCceeecCcchhhhhhhcccccCCc-hhhHH-HHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccc Confidence 554333333222222 111110 00000000 00000 00111100000000011222222211 00 Q ss_pred -----------hChHHHHHHHHHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHH-HHHHhhcce Q lcl|NC_017974. 65 -----------QDATVATGLDMLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDA-ATFNEYGLS 132 (507) Q Consensus 65 -----------~D~~v~s~l~~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~-l~a~~~Gfs 132 (507) -.....-++.+...-+.+-+..+ +.+++++.+++++++++ .+.+.+.++ .++.-||.+ T Consensus 79 ~~~~~~~~~ki~~n~~k~Ivd~~~~~l~G~p~~~------~~~d~~~~~~l~~~~~n----~~~~~~~~~~~~~~~~G~~ 148 (483) T protein:vir:12 79 AVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAF------KHTDDEVVKRIDEVLGN----RFDDKLHSVLTGASNKGIE 148 (483) T ss_pred cccccccccccccchHHHHHHHHhhhhcccCcee------ccCChHHHHHHHHHHhc----cHHHHHHHHHHHHhhCCeE Confidence 01222222222222222222222 22345566777777642 466666664 567889986 Q ss_pred eeeeeeeccCccccCcceeeeeccccCchhcccccceeecCC-Ccccccccccc----cccccccccccccc-------- Q lcl|NC_017974. 133 VVEKVYTQVTVGEYIGKYKVKNLAFRPQASLSRTNPIVYNED-GSAIIGIKQSL----SAFQNYTASEIGVG-------- 199 (507) Q Consensus 133 ~~Ei~w~~~~~~~~~g~~~~~~l~~r~~~~i~~~~~~~~~~d-~~~l~~~~q~~----~~~~~~~~~~~~~~-------- 199 (507) ++++|... +|.+.++-+.|+. ++.-|. +.. +..+-.++... .....++....... T Consensus 149 -y~~v~~d~-----d~~~~i~~~~p~~--~~~v~d----~~~~~~~~~~ir~~~~~~~~~~~~y~~~~v~~~~~~~~~~~ 216 (483) T protein:vir:12 149 -WLHPYLDE-----EGEFKLFRVPAEQ--GIPIWT----DKEHEELEAFIRMYKLENETKVEYWDKVTVNYYVYENGSLI 216 (483) T ss_pred -EEEEEEcC-----CCceEEEEEcccc--eEEEEc----CCCCCceEEEEEEEEeecceEEEEEecCeEEEEEEeCCeee Confidence 45777532 4566555444331 111110 000 11111111000 00000000000000 Q ss_pred --------ccccCCCccccccccEEEEeeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccC Q lcl|NC_017974. 200 --------GVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQIL 271 (507) Q Consensus 200 --------~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~ 271 (507) ..........+..--++.|+ +|+.|.|.+..+-...=--+..+..++..++-|..|+++++- . T Consensus 217 ~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g---~- 287 (483) T protein:vir:12 217 PDYSNNLENSKTHFSTGSWGKIPFIPFK-----NNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTN---Y- 287 (483) T ss_pred ecccccccccccccccCCCCccceEEec-----CCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeeeec---C- Confidence 00000000011111123332 367788998875444334455678888888889998887751 1 Q ss_pred CCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcc Q lcl|NC_017974. 272 NKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAG 351 (507) Q Consensus 272 k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGq 351 (507) ..+...+... .+. ....+.++.|.++++. ..+. ....+..+++.+.+.|...--.. T Consensus 288 ---~~~~~~~~~~---~~~---------~~~~~~~~~~~~~~~l-------~~~~--~~~~~~~~~~~l~~~I~~~s~~p 343 (483) T protein:vir:12 288 ---DDQELPEFKR---LLR---------YYGAIKVSDNGGVDTI-------QVEV--PVENSKKYLDELYQKIMLFGQAV 343 (483) T ss_pred ---CcccchhHHH---hhh---------hccccccCCCCcceEE-------eecC--CHHHHHHHHHHHHHHHHHHhCCC Confidence 1111111111 111 1123455676655442 2222 12346778888888887765444 Q ss_pred cccccccchhhhhHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCccccceEEecCcCchhHHHHHHH Q lcl|NC_017974. 352 FINVGNDKGGSYNLSES---KQTIHTQFVQRVNEIILEALNENLLPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKA 428 (507) Q Consensus 352 tLts~~~~~GS~Al~~v---h~~v~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~ 428 (507) .++.+.. |+...|.. ...-....+..-.+.+...+. ++++.++.+.+... +..-..+.|....+.|..+.+++ T Consensus 344 ~~~~~~~--~~n~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~-~~~~li~~~~~~~~-~~~~i~v~f~~~~p~~~~~~a~~ 419 (483) T protein:vir:12 344 DFSSDKF--GSAPSGVALEFLYTNLNLKADKLARKAKVAIQ-ELLWFVFEHFDIKG-EHKDVDISFNYNKVANTELQVQT 419 (483) T ss_pred CCCcccc--ccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhcCCC-ccceeeEEeCCCCCCCHHHHHHH Confidence 4443221 11111211 111122223334444555553 46666666653222 22234688899999999999999 Q ss_pred HHHHHhcCCccccHHHHHHHHHHcC-CCCCCCc-ccccc---hhhhcCCCcCCCCCCcccccCccccccccc Q lcl|NC_017974. 429 IQRIGAVGYLPKTPKVINRVLEVLG-IDEKIEE-DISQE---ELMKLLGEDTSRAGDGMTKGSSGNGTGKIS 495 (507) Q Consensus 429 ~~~L~~~G~~~~~~~~~~~i~e~~G-ip~~~~~-~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 495 (507) +.+|+ |+ ++ ++.+.+.++ ++.+..+ +.... ..+....+. ...+..........+.++.. T Consensus 420 ~~kl~--Gi-iS----~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~-~~~~~d~~~~~~~~~~~e~e 483 (483) T protein:vir:12 420 AQQSM--GI-VS----HETVLENHPFVEDLQAELERIEQEQMEYNKQLPNL-DDGGADGAQQQERSNNKESE 483 (483) T ss_pred HHHHh--cc-Cc----hHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhcccc-cccccCCcccCCCCCcccCC Confidence 99984 65 33 234555554 3322111 00000 111111110 00000000000111111111 No 169 >protein:vir:94956 Length: 452 # NCBI annotation: putative phage structural protein # Family: family:all:584 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239276;genbank:gi:66392058;genbank:GeneID:5076601 Probab=94.86 E-value=0.0033 Score=34.17 Aligned_cols=400 Identities=13% Similarity=0.033 Sum_probs=162.2 Q ss_pred CccccccccccccccccccCchhhhHHHHhhhh--------cccc-hh-cc-chhHHHHHHHhh----ChHHHHHHHHHH Q lcl|NC_017974. 13 VAKATSEQINVSRMRMSEQGTFALAKVQVDSER--------MKAE-EI-RW-PNLIGTVESMKQ----DATVATGLDMLY 77 (507) Q Consensus 13 ~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~--------e~~~-~l-r~-~~~~~~y~~m~~----D~~v~s~l~~Rk 77 (507) |+ +.++.. ++.. ....|...... +.+. .| ++ .+.-+-|+.-+. -.++...++.-. T Consensus 1 m~------V~~~hp---~y~a-~~~~W~~~rd~~~G~~~~r~~g~~YLpk~~~E~~~~Y~~rl~rA~~~n~~~~t~~~~~ 70 (452) T protein:vir:94 1 MP------IETKHP---EYLA-YENDWIDCRVASLGQREVKKKGVRFLPKLSGQTDDMYNAYKQRALFYSITSKTLSALS 70 (452) T ss_pred CC------CCCcCH---HHHH-HHHHHHHHHHHhcChHHHHcCCcccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHHh Confidence 22 000100 0000 01111111110 0000 00 11 122334544332 456666666666 Q ss_pred HHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeeeecc Q lcl|NC_017974. 78 TFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLA 156 (507) Q Consensus 78 ~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~ 156 (507) ..|.+.+..+.++ ....++ ..+..+.+++.++++++ .++.||.+.+=+.|-..+..-| +. T Consensus 71 G~vf~k~p~~~~p-------~~l~~~----~~D~~G~~L~~~~~~~~~~~l~~G~~~ilVD~p~~g~rPy--------~~ 131 (452) T protein:vir:94 71 GMVLDQPPVITHP-------DAMSKY----FEDQSGIQFYEVFTRAVEETLLMGRVGVFIDRPLTGGDPY--------IS 131 (452) T ss_pred chhhcCCceeccc-------HHHHHH----HhcccCCCHHHHHHHHHHHHHhcCeEEEEEeeccCCCceE--------EE Confidence 6665555444322 122222 22456788999999987 5888998887666654432111 12 Q ss_pred ccCchhcccccceeecCCCcc-------------------ccccccc------cccccc--ccccccccc----ccccCC Q lcl|NC_017974. 157 FRPQASLSRTNPIVYNEDGSA-------------------IIGIKQS------LSAFQN--YTASEIGVG----GVSTRM 205 (507) Q Consensus 157 ~r~~~~i~~~~~~~~~~d~~~-------------------l~~~~q~------~~~~~~--~~~~~~~~~----~~~~~~ 205 (507) ...+..|..|+ ++.+|+. .....|. +-.+.. +........ ...... T Consensus 132 ~~~~~~Ii~W~---~~~~g~l~~v~lre~~~~~d~~d~f~~~~~~~yRvL~l~~g~~~v~~~~~~~~~~~~~~~~~~~~~ 208 (452) T protein:vir:94 132 VYTTENILNWE---EDEDGRLLMVVLREFYTVRDTADRYVQNIRVRYRCLELVDGLLQITVHETQDGKVWELAKTSTIQN 208 (452) T ss_pred EechhhhcCcc---ccccCCeeEEEEEEEEEEecCCCcccceeEEEEEEEEEeCCeEEEEEEEccCCceeeeccceeecC Confidence 22333343333 1222221 0011110 000000 000000000 000000 Q ss_pred CccccccccEEEEeeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHH Q lcl|NC_017974. 206 SDVIIPINRVMLMNTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEM 285 (507) Q Consensus 206 ~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~ 285 (507) ++..++.=-|+++-.. ..+--.+.+.|..++..-+---....+.-+-+..-+.|++++. | .++.+ T Consensus 209 ~~~~l~~IP~v~~~~~-~~~~~~~~pPLl~LA~ln~~hy~~~sd~~~~l~~~~~P~l~~~-----g----~~~~~----- 273 (452) T protein:vir:94 209 VGVTMDYIPFFCITPS-GLSMTPAKPPMIDIVDINYSHYRTSADLEHGRHFTGLPTPWIT-----G----AESQS----- 273 (452) T ss_pred CCcccceeEEEEEcCC-CCCCCCCccchHHHHHHHHHHhcchhHHHHHHHHcccceeEee-----c----CcCCC----- Confidence 1111111113333222 2222234545555554322111122333334444467777764 1 11111 Q ss_pred HHHHHHHHHHhhcCCceeEeecC-cccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccc-cchhhh Q lcl|NC_017974. 286 VRGLMSDAANAHSGEQSFFMLPS-DTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGN-DKGGSY 363 (507) Q Consensus 286 l~a~~~~~~n~~~G~~a~~iiP~-g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~-~~~GS~ 363 (507) .++.|+.+++.+|+ |.+.. +++.+|++ ....+..++....+|.. +.+..+..+. +...+- T Consensus 274 ---------~i~iG~~~~~~lpe~~~~~~-------yie~~g~~-i~~~~~~l~~le~~m~~-~Ga~ll~~~~~~~~s~e 335 (452) T protein:vir:94 274 ---------TMHIGSTKAWVIPEVAAKVG-------FLEFTGQG-LQSLEKALSEKQAQLAS-LSARLIDNSTRGSEATE 335 (452) T ss_pred ---------ceEecccccccCCCCCCcce-------EEccCchh-HHHHHHHHHHHHHHHHH-HHHHhhccCCCcchHHH Confidence 12347888899996 76543 44555543 22345566666666522 2223333211 112222 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCccccceEEec--Cc-CchhHHHHHHHHHHHHhcCCccc Q lcl|NC_017974. 364 NLSESKQTIHTQFVQRVNEIILEALNENLLPQLLALNDMRLPETEMPYVKAG--EI-VDVDMEGFSKAIQRIGAVGYLPK 440 (507) Q Consensus 364 Al~~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~~P~~~~~--~~-~~~dl~~~a~~~~~L~~~G~~~~ 440 (507) |. .....-...++.+-+..+++.++ ++++++..+-+. +. -.+|... .. ...| ...++++.++...|.+.. T Consensus 336 a~-~~~~~~~~s~L~~~a~~~e~al~-~~l~~~a~w~g~---~~-~~~v~~n~dF~~~~~~-~~~~~al~~~~~~G~is~ 408 (452) T protein:vir:94 336 TV-KLRYMSETASLKSVTRAVEALLN-KAYSCIMDMESM---GG-TLNIKLNSAFLDSKLT-AAELKAWVEAYLSGGISK 408 (452) T ss_pred HH-HHHHHHhhHHHHHHHHHHHHHHH-HHHHHHHHHcCC---CC-ceEEEeccccccccCC-HHHHHHHHHHHhcCCCcH Confidence 21 12233334677788888899996 588988887642 21 1234322 11 2223 344566777889997543 Q ss_pred cHHHHHHHHHHcCCCCCCCcccccchhhhcCCCcCCCCCCcccccCcccccccc Q lcl|NC_017974. 441 TPKVINRVLEVLGIDEKIEEDISQEELMKLLGEDTSRAGDGMTKGSSGNGTGKI 494 (507) Q Consensus 441 ~~~~~~~i~e~~Gip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 494 (507) ....++++ +.|++.+..+++......+.+ ..++..++ +++.++. T Consensus 409 -~t~~~~L~-~~gvl~~~~e~~~i~~E~~~~--~~~~~~~~------~~~~~~~ 452 (452) T protein:vir:94 409 -EIYIHALK-VGKVLPPPGESMGVIPDPPAP--EPSPSNTP------PNPSSKA 452 (452) T ss_pred -HHHHHHHH-hCCCCCCccCHHHHHHHhhcc--CcccCCCC------CCCccCC Confidence 22233443 368876554443222221111 11111111 1111111 No 170 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=94.75 E-value=0.0036 Score=33.98 Aligned_cols=416 Identities=10% Similarity=0.008 Sum_probs=159.3 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHH----------------Hh Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVES----------------MK 64 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~----------------m~ 64 (507) |.-|+.+- .-.++-...... ...+-..-+. ...+ |..++.+.|+- .+ T Consensus 1 ~~~~~~~~--~~~~~~~~~~~~-----------~i~~~i~~~~--~~~~--r~~~~~~Yy~g~~~i~~~~~~~~~~~~~k 63 (452) T protein:vir:36 1 MKYKPPKL--MTFSKDEPITVE-----------VVTKFMEKHK--LEVA--RYEYLKNMYLGIMAIDDEPAKDSWKPDNR 63 (452) T ss_pred CcccCcee--EEcCCccCCCHH-----------HHHHHHHHHH--HHHH--HHHHHHHHhccccccccCccccccCccce Confidence 32221110 011111000000 0000001000 0000 11111111211 00 Q ss_pred -hChHHHHHHHHHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHH-HHHhhcceeeeeeeeccC Q lcl|NC_017974. 65 -QDATVATGLDMLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAA-TFNEYGLSVVEKVYTQVT 142 (507) Q Consensus 65 -~D~~v~s~l~~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~ 142 (507) ..+...-++.+...-+.+-+..+.+ .+.++.+++.++++.. .|...+..+. ++.-||.+. +++|... T Consensus 64 i~~n~~~~ivd~~~~~l~g~~~~~~~------~d~~~~~~l~~~~~~n---~~~~~~~~~~~~~~~~G~~~-~~v~~d~- 132 (452) T protein:vir:36 64 LAVNFTKYIVDTFTGYFNGIPVKKSH------SDKEILTKLQEFDNLN---DMEDEESELAKMACIYGRAF-EFLYQDE- 132 (452) T ss_pred eecchHHHHHHHHhhhhcccCceeec------CChhHHHHHHHHHhhc---ChhHHHHHHHHHHHhcCeEE-EEEEecC- Confidence 1233444444444444444333332 2334556677766542 3656555544 677899754 5777532 Q ss_pred ccccCcceeeeeccccCchhccccc-----------ceeecCCCccccccccccccccccccccccccccccCCCccccc Q lcl|NC_017974. 143 VGEYIGKYKVKNLAFRPQASLSRTN-----------PIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIP 211 (507) Q Consensus 143 ~~~~~g~~~~~~l~~r~~~~i~~~~-----------~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP 211 (507) +|.+.+.-+.|+. .+.-|. .++.+.++.....+. .+...+.+......... .......++ T Consensus 133 ----~g~~~i~~~~p~~--~~~v~d~~~~~~~~~~i~~~~~~~~~~~~~vy-t~~~i~~~~~~~~~~~~--~~~~~~~~g 203 (452) T protein:vir:36 133 ----DTQTNVVYNSPEN--MFMVYDDTVKQEPLFAVRYGVDEDKKLQGEVY-TLLETIKISGENDEISF--GEGTYNPYP 203 (452) T ss_pred ----CCeeEEEEEcccc--eEEEEcCCCCCceEEEEEEEEecCceEEEEEE-ecCeEEEEEEcCCceEE--ecceeccCC Confidence 4555554433321 111000 011111111110000 00000000000000000 000011111 Q ss_pred cccEEEEeeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHH Q lcl|NC_017974. 212 INRVMLMNTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMS 291 (507) Q Consensus 212 ~~k~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~ 291 (507) .--++.|+ +++.|.|.+..+....=--+..+..++..++.+..|+++++ | ...+.+.. ..+. T Consensus 204 ~iPvv~~~-----n~~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~p~~~~~-----g---~~~~~~~~----~~~~- 265 (452) T protein:vir:36 204 DLPVVEFY-----FNEERMSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFL-----G---AAVEEEDL----KNIR- 265 (452) T ss_pred cccEEEec-----CCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeEee-----c---CCcCchhh----hhhh- Confidence 11133332 35668888876554433335567888888999999988875 1 11111111 1111 Q ss_pred HHHHhhcCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccc-hhhhhHHHHHH Q lcl|NC_017974. 292 DAANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDK-GGSYNLSESKQ 370 (507) Q Consensus 292 ~~~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~-~GS~Al~~vh~ 370 (507) .+ ..+.++.+.+- ....++++..+. ....+...++.+.+.|...--+..++.+..+ .+.-|+. ... T Consensus 266 ------~~--~~~~~~~~~~~--~~~~~~~l~~~~--~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Al~-~~~ 332 (452) T protein:vir:36 266 ------SN--RVINYYADGEG--KNVDVKFLEKPD--SDSQTENLLDRLTKLIFQTTMVANISDESFGSSSGVSLA-YKL 332 (452) T ss_pred ------hc--ceEEecCCCCc--cCCcceeEeecC--CHHHHHHHHHHHHHHHHHHhCccccCcccccCCcHHHHH-HHH Confidence 11 23444443211 111233433322 2334677888888888776544444332211 1112211 111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCcc--ccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHH Q lcl|NC_017974. 371 TIHTQFVQRVNEIILEALNENLLPQLLALNDMRLPET--EMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRV 448 (507) Q Consensus 371 ~v~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~--~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i 448 (507) .-....+..-.+.+...|. ++++.++.+....+... .-..+.|....+.|..++++.+.++ .|+ ++ .+.+ T Consensus 333 ~~l~~k~~~~~~~~~~~l~-~~~~li~~~~~~~~~~~~~~~i~i~f~~~~p~d~~~~a~~~~k~--~g~-iS----~et~ 404 (452) T protein:vir:36 333 QAMSNLALSFQRKFQSSLN-SRYKLFCELSTNVSNKDSWKDIEYTFTRNEPKDIKEQAETANIL--MGI-TS----QETA 404 (452) T ss_pred HHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhccCCccccccceEEeCCCCCcCHHHHHHHHHHH--hcc-CC----hHHH Confidence 2222233334445555564 46666666543222222 2246888888999999999999987 465 33 2456 Q ss_pred HHHcCCCCCCCccccc--chhhh--cCCCcCCCCCCcccccCccccccc Q lcl|NC_017974. 449 LEVLGIDEKIEEDISQ--EELMK--LLGEDTSRAGDGMTKGSSGNGTGK 493 (507) Q Consensus 449 ~e~~Gip~~~~~~~~~--~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~ 493 (507) .+.++.-.....+... .+... .......+..++ .....+.+..+ T Consensus 405 ~~~~~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~-~~~~~~~~~~e 452 (452) T protein:vir:36 405 LSVISVIPDVQAEMEKIKKEEASTAIFDKDKQPSEKG-TDTVVSETNEE 452 (452) T ss_pred HHhCCCCCCHHHHHHHHHHHHHHHHHHHhhccCCCCc-ccccCccccCC Confidence 6667642211111110 00000 000111111111 11111111111 No 171 >protein:vir:10321 Length: 495 # NCBI annotation: ORF23 # Family: family:all:47 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758916;genbank:gi:27311190;genbank:GeneID:956137 Probab=94.70 E-value=0.0037 Score=33.90 Aligned_cols=452 Identities=12% Similarity=0.039 Sum_probs=166.5 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhc--cchhHHHHHHHh-hChHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIR--WPNLIGTVESMK-QDATVATGLDMLY 77 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr--~~~~~~~y~~m~-~D~~v~s~l~~Rk 77 (507) |..+.-...+......... ..+. -++.+. .++.........+++ +..+-.--+++. .++++.++++... T Consensus 3 ~~~~~~~a~~~~~~~~~~~------~~y~-aa~~~~-~~~~~~~~s~d~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~ 74 (495) T protein:vir:10 3 MTPSGYQSLASGLLVPVGA------SAYE-GASGGH-RWQDIGDYGPDTAVASGIQTLRARSHHNVRNNPWATNAVATWV 74 (495) T ss_pred cccccccccchhhhhHHHh------hhhh-ccccCc-ccCCCCCCChhHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHH Confidence 2222110000000000000 0000 000000 000000000011111 111222234443 5999999999999 Q ss_pred HHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhcc---CCCHHHHHHHHHHH-HhhcceeeeeeeeccCccccCcceeee Q lcl|NC_017974. 78 TFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNME---GQTLRQFARDAATF-NEYGLSVVEKVYTQVTVGEYIGKYKVK 153 (507) Q Consensus 78 ~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~---~~~~~~~l~~~l~a-~~~Gfs~~Ei~w~~~~~~~~~g~~~~~ 153 (507) ..|-+..+...+.+.+..-.+++.++.+.+-++-+ ..+|+.+...++.+ +--|=++.=+.|.... +|.-..- T Consensus 75 ~~vVG~Gi~p~~~~~~~~~~~~ie~~w~~wa~~~D~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~----~g~~~~~ 150 (495) T protein:vir:10 75 AAAVGNGLTPRWRMKEQELRQELQELWGDWVNEADFDEVQSFYGLQALVVRTVINSGEAFVIKKPRPLS----EGLSVPL 150 (495) T ss_pred HhhcCCCcccccCCchHHHHHHHHHHHHHhhcCcccccccCHHHHHHHHHHHHHhCCceEEEEeecccC----CCCccce Confidence 99988776554433222222333333333332211 13577776666654 3356665555565322 2211111 Q ss_pred eccccCchhccc-------------ccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEee Q lcl|NC_017974. 154 NLAFRPQASLSR-------------TNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNT 220 (507) Q Consensus 154 ~l~~r~~~~i~~-------------~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~ 220 (507) +|...+++-|.. .+-+.|+.+|+-+-++-. ..++++... ......-+.||... |+|.+ T Consensus 151 ~lqliepd~l~~~~~~~~~~~g~~i~~GIe~d~~Gr~vaY~i~------~~hpgd~~~--~~~~~~~~rvpA~~-vlH~f 221 (495) T protein:vir:10 151 QLQIIEPDMLASDIPDETLPSGGYVKGGIRFSNGGKRKAYCFY------RNHPAESSL--IGDPVDTVWIKAEH-VLHVT 221 (495) T ss_pred EEEEechhhcCCCCCCCCCCCCCEEEeceEECCCCceEEEEEe------ecCCCcccc--cccccceeeechhh-eEecc Confidence 122222222221 123444555543333311 111111110 11112335577665 56777 Q ss_pred CCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCce-EEe--cccccCCCCCCCchHHHHHHHHHHHHHHHHhh Q lcl|NC_017974. 221 GGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVI-ELK--IPSQILNKAAMDQSSPEAEMVRGLMSDAANAH 297 (507) Q Consensus 221 ~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~-v~~--~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~ 297 (507) ..+.+.--|.+.|.++...-.+..+. ...++ ..|-..-+. +++ .|...+......+..+.. -... .++. T Consensus 222 ~~r~gQ~RGis~la~i~~l~~l~~y~-dael~-~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~--~~~~----~~l~ 293 (495) T protein:vir:10 222 VLTVRSDAGAPWFQLLLRLNELDQYE-DAELV-RKKTAALFAAFIQEATADSTGGPTIGQPKRSKG--GKRI----TGLN 293 (495) T ss_pred ccCCCcccCcchhHHHHHHHHhhHHH-HHHHH-HHHHhhhheeeeecCCCccccccccCccccccC--cccc----eecC Confidence 77788888888776543222222211 11111 112222221 222 121111111110110000 0000 1122 Q ss_pred cCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhc--ccccccccchhhhhHHHHHHHHHHH Q lcl|NC_017974. 298 SGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGA--GFINVGNDKGGSYNLSESKQTIHTQ 375 (507) Q Consensus 298 ~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLG--qtLts~~~~~GS~Al~~vh~~v~~~ 375 (507) .| ....++.|.+|++.+ .+..+. .|..+....-+.|+..+.- +.||. +-.+.+||.+-.-..-+.. T Consensus 294 pG--~i~~L~pGe~i~~~~-------p~~p~~--~~~~f~~~~lr~iaaglGi~Ye~ltg-D~s~~nYSS~R~~~~e~~r 361 (495) T protein:vir:10 294 PG--TLQYLQPGQEVKFSN-------PADVGT--TYEPWLRYQLLSIAKGYGITYEMLTG-DLRGVNYSSIRAGLLEFRR 361 (495) T ss_pred Cc--eeeecCCCCeeeeeC-------CCCCCC--CHHHHHHHHHHHHHhhcCCCHHHHhc-ccccccHHHHHHHHHHHHH Confidence 22 456678888766543 333222 3667888888888877632 11332 3234456644333333333 Q ss_pred HHHHHH-HHHHHHHHHHHHHHHHH---hCCC-cCCcc-----ccc--eEEecCcCchhHHHHHHHHHHHHhcCCccccHH Q lcl|NC_017974. 376 FVQRVN-EIILEALNENLLPQLLA---LNDM-RLPET-----EMP--YVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPK 443 (507) Q Consensus 376 ~~~aD~-~~i~~~ln~~li~~l~~---lN~~-~~~~~-----~~P--~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~ 443 (507) ..+... .++...+-+-+...+++ +++. ..|+. .+- .+........|..+-+++....+++|+.... T Consensus 362 ~~~~~q~~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~~~w~~p~~~~vDP~Ke~~A~~~~i~~G~~s~~-- 439 (495) T protein:vir:10 362 LCQQVQHHMIIHQFCRPVGRWFMDFAVASGAVVIPDYLQRRRYYNRVSWRTPRWEEVDPLKKHLADLGDVRAGFAPIS-- 439 (495) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCchhhhHhhhccccccCCccccChHHHHHHHHHHHHcCCCCHH-- Confidence 333333 23444443333333333 3321 11210 111 2223344556777778888889999986553 Q ss_pred HHHHHHHHcCCCCCCC-cccccc-hh--hhcCCCcCCCCCCcccccCcccccccccCCCCchhhhc Q lcl|NC_017974. 444 VINRVLEVLGIDEKIE-EDISQE-EL--MKLLGEDTSRAGDGMTKGSSGNGTGKISSTRDNSAANL 505 (507) Q Consensus 444 ~~~~i~e~~Gip~~~~-~~~~~~-~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~a~~~ 505 (507) +.+++ .|.....- .+.+.+ .. ...++-+..|. ....++.... ...++..-|. T Consensus 440 --~~~a~-~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~p~----~~~~~~~~~~---~~~~~~~~~e 495 (495) T protein:vir:10 440 --DKQAE-RGYDMEELFDMISDANQLIDEYDLRLDSDPR----YVNGSGAEQK---SVMEAALNNE 495 (495) T ss_pred --HHHHH-cCCCHHHHHHHHHHHHHHHHHcCCCCCCCCC----cCCCccCCCC---CCCCCCCCCC Confidence 23333 24421100 000000 00 00000000000 0000000000 0000000001 No 172 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=94.49 E-value=0.0043 Score=33.57 Aligned_cols=417 Identities=12% Similarity=0.037 Sum_probs=155.9 Q ss_pred CcccccccccccCcccccccccccccccc-ccCchh---hhHHHHhhhhcccchhccchhHHHHH---HHh--------- Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMS-EQGTFA---LAKVQVDSERMKAEEIRWPNLIGTVE---SMK--------- 64 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~-~~g~~~---~~~~~~~~~~e~~~~lr~~~~~~~y~---~m~--------- 64 (507) |..|=+ +|-+......... ++. +...+. ..-...+. +..+ |..++.+.|+ +++ T Consensus 1 ~~~~~~------~~~~~~~~~~~~~-~~~~~~~~~~~~i~~~i~~~~--~~~~--~~~~l~~Yy~g~~~i~~~~~~~~~~ 69 (474) T protein:vir:95 1 MINIIR------MPWDKPYGEEVVE-QMKPKVETQEEMIIRLINNHK--QKLK--DINVGQKYYDKDNDINYQAYKQDLH 69 (474) T ss_pred Cccccc------CCCCCCCCcchhh-hccccccchHHHHHHHHHHHH--HHHH--HHHHHHHHhcccCccccccchhhhc Confidence 665422 2222111110000 011 111110 01111111 0000 1112222222 110 Q ss_pred -------hChHHH-----HHHHHHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHH-HHHhhcc Q lcl|NC_017974. 65 -------QDATVA-----TGLDMLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAA-TFNEYGL 131 (507) Q Consensus 65 -------~D~~v~-----s~l~~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l-~a~~~Gf 131 (507) .|-.|. -++.....-+.+-+..+ +..+.++.+++.++++ ..|.+.+.++. ++.-||. T Consensus 70 ~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~------~~~~~~~~~~l~~~~~----n~~~~~~~~l~~~~~~~G~ 139 (474) T protein:vir:95 70 GNIDYTKPDWRITTNFHQNLVDQKVSYVAGKPVTY------AHDDDKVLDVIHQVLD----TRWDNKLIDILTAASNKGI 139 (474) T ss_pred ccccccccccccccchHHHHHHhhhhhhcccCcee------ccCChHHHHHHHHHHh----ccHHHHHHHHHHHHhhCCe Confidence 011121 22222222222222222 2233455667777664 24666666544 5777999 Q ss_pred eeeeeeeeccCccccCcceeeeeccccCchhcccccceeecCCCccc-cccccc---c-ccccccccccccccccc---- Q lcl|NC_017974. 132 SVVEKVYTQVTVGEYIGKYKVKNLAFRPQASLSRTNPIVYNEDGSAI-IGIKQS---L-SAFQNYTASEIGVGGVS---- 202 (507) Q Consensus 132 s~~Ei~w~~~~~~~~~g~~~~~~l~~r~~~~i~~~~~~~~~~d~~~l-~~~~q~---~-~~~~~~~~~~~~~~~~~---- 202 (507) +. +++|... +|.+.++-+.|+ .++. ++-+.+.+.+ -.++.. . .....++.....-.... T Consensus 140 ~~-~~~~~d~-----~~~~~i~~~~p~--~~~~----v~d~~~~~~~~a~ir~~~~~~~~~~~vy~~~~i~~~~~~~~~~ 207 (474) T protein:vir:95 140 DW-LQVYINE-----DGELKLFRVPAE--QAIP----IWTDKEREQLNAFIRIFTFNGETKVEYWTAETVTYYVYENGGL 207 (474) T ss_pred EE-EEeeeCC-----CCceEEEEEccc--ceEE----EEcCCCCCceEEEEEEEeecCeeEEEEEeCCeEEEEEEcCCce Confidence 64 6777532 455655544332 1111 1111110110 000000 0 00000000000000000 Q ss_pred ------------cCCCccccccccEEEEeeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEeccccc Q lcl|NC_017974. 203 ------------TRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQI 270 (507) Q Consensus 203 ------------~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~ 270 (507) .......++.--++.| ..|+.|.|.+..+-...=--+..+..++..++.|..|+++++ +. T Consensus 208 ~~~~~~~~~~~~~~~~~~~~~~vPvv~~-----~nn~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~---g~ 279 (474) T protein:vir:95 208 IPDFYYGDEHIQTHFSTGSWERVPFIAF-----KNNPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILR---GY 279 (474) T ss_pred eeccccccccccCcccccCCCccceEEe-----cCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhc---CC Confidence 0000001111112222 246778898877443332335577888888999999888765 11 Q ss_pred CCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhc Q lcl|NC_017974. 271 LNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGA 350 (507) Q Consensus 271 ~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLG 350 (507) + .+...+ ....+. ....+.++.+.++++ +..+.. ...+..+++.+.+.|...--. T Consensus 280 ~----~~~~~~---~~~~~~---------~~~~i~~~~~~~~~~-------l~~~~~--~~~~~~~~~~l~~~I~~~s~~ 334 (474) T protein:vir:95 280 E----GEDLSE---FMEGLK---------YYKAINVSSDGGVET-------IQVEVP--VASTKEYLDMMRAYIVEFGQG 334 (474) T ss_pred C----cccccc---hhhhhh---------ccceeeccCCCceeE-------EeccCC--HHHHHHHHHHHHHHHHHHhCC Confidence 1 111111 111111 112344666665444 322222 234677889988888776554 Q ss_pred ccccccccchhhhhHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCccccceEEecCcCchhHHHHH Q lcl|NC_017974. 351 GFINVGNDKGGSYNLSESKQTIH----TQFVQRVNEIILEALNENLLPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFS 426 (507) Q Consensus 351 qtLts~~~~~GS~Al~~vh~~v~----~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a 426 (507) ..++.+ ..|+...|.. .+.. ...+..-.+.+...|. ++++.++.+.+..+ +..-..+.|....+.+..+.+ T Consensus 335 p~~~~~--~~~~n~Sg~A-lk~~~~~l~~k~~~~~~~~~~~l~-~~~~~i~~~~g~~~-d~~~i~i~f~~~~p~~~~e~a 409 (474) T protein:vir:95 335 VDFQTD--KFGSATSGIA-LKFLYTNLNLKANKLKNKANVALQ-ELMQFILDFNKIKL-DAKEIEITFNFNVMVNDLEQS 409 (474) T ss_pred cCcccc--ccccccHHHH-HHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhCCCc-ccceeeEEecCCCccCHHHHH Confidence 444432 2222222221 2111 1222223334555553 46677777654322 222346788888888877777 Q ss_pred HHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccc--cchhhhcCCCcCCCCCCccccc-CcccccccccC Q lcl|NC_017974. 427 KAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDIS--QEELMKLLGEDTSRAGDGMTKG-SSGNGTGKISS 496 (507) Q Consensus 427 ~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~ 496 (507) +.++ .+|.+ + ++.+.+.++.-.....+.. ..+............+.+.... ..+++.++... T Consensus 410 ~~~~---~~gii-S----~et~~~~lp~v~D~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~ 474 (474) T protein:vir:95 410 QIGA---QSQYL-S----KETLVRHHPWVDDPKAELERLDEEQLELNKQLPNLDDGGADGAQQQQQSENNQSK 474 (474) T ss_pred HHHH---HcCCC-C----hHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhccccccccCCCCCCcCCCCccccC Confidence 6544 46764 3 2445566654221111111 1111101111001111000000 00111111111 No 173 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=94.49 E-value=0.0043 Score=33.57 Aligned_cols=417 Identities=12% Similarity=0.037 Sum_probs=155.9 Q ss_pred CcccccccccccCcccccccccccccccc-ccCchh---hhHHHHhhhhcccchhccchhHHHHH---HHh--------- Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMS-EQGTFA---LAKVQVDSERMKAEEIRWPNLIGTVE---SMK--------- 64 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~-~~g~~~---~~~~~~~~~~e~~~~lr~~~~~~~y~---~m~--------- 64 (507) |..|=+ +|-+......... ++. +...+. ..-...+. +..+ |..++.+.|+ +++ T Consensus 1 ~~~~~~------~~~~~~~~~~~~~-~~~~~~~~~~~~i~~~i~~~~--~~~~--~~~~l~~Yy~g~~~i~~~~~~~~~~ 69 (474) T protein:vir:96 1 MINIIR------MPWDKPYGEEVVE-QMKPKVETQEEMIIRLINNHK--QKLK--DINVGQKYYDKDNDINYQAYKQDLH 69 (474) T ss_pred Cccccc------CCCCCCCCcchhh-hccccccchHHHHHHHHHHHH--HHHH--HHHHHHHHhcccCccccccchhhhc Confidence 665422 2222111110000 011 111110 01111111 0000 1112222222 110 Q ss_pred -------hChHHH-----HHHHHHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHH-HHHhhcc Q lcl|NC_017974. 65 -------QDATVA-----TGLDMLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAA-TFNEYGL 131 (507) Q Consensus 65 -------~D~~v~-----s~l~~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l-~a~~~Gf 131 (507) .|-.|. -++.....-+.+-+..+ +..+.++.+++.++++ ..|.+.+.++. ++.-||. T Consensus 70 ~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~------~~~~~~~~~~l~~~~~----n~~~~~~~~l~~~~~~~G~ 139 (474) T protein:vir:96 70 GNIDYTKPDWRITTNFHQNLVDQKVSYVAGKPVTY------AHDDDKVLDVIHQVLD----TRWDNKLIDILTAASNKGI 139 (474) T ss_pred ccccccccccccccchHHHHHHhhhhhhcccCcee------ccCChHHHHHHHHHHh----ccHHHHHHHHHHHHhhCCe Confidence 011121 22222222222222222 2233455667777664 24666666544 5777999 Q ss_pred eeeeeeeeccCccccCcceeeeeccccCchhcccccceeecCCCccc-cccccc---c-ccccccccccccccccc---- Q lcl|NC_017974. 132 SVVEKVYTQVTVGEYIGKYKVKNLAFRPQASLSRTNPIVYNEDGSAI-IGIKQS---L-SAFQNYTASEIGVGGVS---- 202 (507) Q Consensus 132 s~~Ei~w~~~~~~~~~g~~~~~~l~~r~~~~i~~~~~~~~~~d~~~l-~~~~q~---~-~~~~~~~~~~~~~~~~~---- 202 (507) +. +++|... +|.+.++-+.|+ .++. ++-+.+.+.+ -.++.. . .....++.....-.... T Consensus 140 ~~-~~~~~d~-----~~~~~i~~~~p~--~~~~----v~d~~~~~~~~a~ir~~~~~~~~~~~vy~~~~i~~~~~~~~~~ 207 (474) T protein:vir:96 140 DW-LQVYINE-----DGELKLFRVPAE--QAIP----IWTDKEREQLNAFIRIFTFNGETKVEYWTAETVTYYVYENGGL 207 (474) T ss_pred EE-EEeeeCC-----CCceEEEEEccc--ceEE----EEcCCCCCceEEEEEEEeecCeeEEEEEeCCeEEEEEEcCCce Confidence 64 6777532 455655544332 1111 1111110110 000000 0 00000000000000000 Q ss_pred ------------cCCCccccccccEEEEeeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEeccccc Q lcl|NC_017974. 203 ------------TRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQI 270 (507) Q Consensus 203 ------------~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~ 270 (507) .......++.--++.| ..|+.|.|.+..+-...=--+..+..++..++.|..|+++++ +. T Consensus 208 ~~~~~~~~~~~~~~~~~~~~~~vPvv~~-----~nn~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~---g~ 279 (474) T protein:vir:96 208 IPDFYYGDEHIQTHFSTGSWERVPFIAF-----KNNPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILR---GY 279 (474) T ss_pred eeccccccccccCcccccCCCccceEEe-----cCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhc---CC Confidence 0000001111112222 246778898877443332335577888888999999888765 11 Q ss_pred CCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhc Q lcl|NC_017974. 271 LNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGA 350 (507) Q Consensus 271 ~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLG 350 (507) + .+...+ ....+. ....+.++.+.++++ +..+.. ...+..+++.+.+.|...--. T Consensus 280 ~----~~~~~~---~~~~~~---------~~~~i~~~~~~~~~~-------l~~~~~--~~~~~~~~~~l~~~I~~~s~~ 334 (474) T protein:vir:96 280 E----GEDLSE---FMEGLK---------YYKAINVSSDGGVET-------IQVEVP--VASTKEYLDMMRAYIVEFGQG 334 (474) T ss_pred C----cccccc---hhhhhh---------ccceeeccCCCceeE-------EeccCC--HHHHHHHHHHHHHHHHHHhCC Confidence 1 111111 111111 112344666665444 322222 234677889988888776554 Q ss_pred ccccccccchhhhhHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCccccceEEecCcCchhHHHHH Q lcl|NC_017974. 351 GFINVGNDKGGSYNLSESKQTIH----TQFVQRVNEIILEALNENLLPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFS 426 (507) Q Consensus 351 qtLts~~~~~GS~Al~~vh~~v~----~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a 426 (507) ..++.+ ..|+...|.. .+.. ...+..-.+.+...|. ++++.++.+.+..+ +..-..+.|....+.+..+.+ T Consensus 335 p~~~~~--~~~~n~Sg~A-lk~~~~~l~~k~~~~~~~~~~~l~-~~~~~i~~~~g~~~-d~~~i~i~f~~~~p~~~~e~a 409 (474) T protein:vir:96 335 VDFQTD--KFGSATSGIA-LKFLYTNLNLKANKLKNKANVALQ-ELMQFILDFNKIKL-DAKEIEITFNFNVMVNDLEQS 409 (474) T ss_pred cCcccc--ccccccHHHH-HHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhCCCc-ccceeeEEecCCCccCHHHHH Confidence 444432 2222222221 2111 1222223334555553 46677777654322 222346788888888877777 Q ss_pred HHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccc--cchhhhcCCCcCCCCCCccccc-CcccccccccC Q lcl|NC_017974. 427 KAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDIS--QEELMKLLGEDTSRAGDGMTKG-SSGNGTGKISS 496 (507) Q Consensus 427 ~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~ 496 (507) +.++ .+|.+ + ++.+.+.++.-.....+.. ..+............+.+.... ..+++.++... T Consensus 410 ~~~~---~~gii-S----~et~~~~lp~v~D~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~ 474 (474) T protein:vir:96 410 QIGA---QSQYL-S----KETLVRHHPWVDDPKAELERLDEEQLELNKQLPNLDDGGADGAQQQQQSENNQSK 474 (474) T ss_pred HHHH---HcCCC-C----hHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhccccccccCCCCCCcCCCCccccC Confidence 6544 46764 3 2445566654221111111 1111101111001111000000 00111111111 No 174 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=94.40 E-value=0.0045 Score=33.43 Aligned_cols=425 Identities=11% Similarity=0.028 Sum_probs=160.2 Q ss_pred Cccccccccccc--Ccccccc---ccccccccccccCchhhhHH-HHhhhhcc-cchhccchhH-H--HHHHHh-----h Q lcl|NC_017974. 1 MSRKRNRNRSVK--VAKATSE---QINVSRMRMSEQGTFALAKV-QVDSERMK-AEEIRWPNLI-G--TVESMK-----Q 65 (507) Q Consensus 1 ~~~k~~~~~~~~--~~~~~~~---~~~~~~~~~~~~g~~~~~~~-~~~~~~e~-~~~lr~~~~~-~--~y~~m~-----~ 65 (507) |-.-.+-..-.. +...... .......-...+-. -.... .+..+.+- .+.+.++.-+ . ..+..+ . T Consensus 21 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~-~~~r~~~l~~YY~g~~~I~~~~~~~~~~~~~~~~~~~~ri~ 99 (492) T protein:vir:94 21 LYPSQPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHLE-KLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMI 99 (492) T ss_pred eecCccchhhhhhcccccCCchhhHHHHHHHHHHHHHH-HHHHHHHHHHHhccccccccccccccccccccccccccccc Confidence 111111100000 0000000 00000000000000 00000 01111100 0111111000 0 000000 1 Q ss_pred ChHHHHHHHHHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHH-HHHHhhcceeeeeeeeccCcc Q lcl|NC_017974. 66 DATVATGLDMLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDA-ATFNEYGLSVVEKVYTQVTVG 144 (507) Q Consensus 66 D~~v~s~l~~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~-l~a~~~Gfs~~Ei~w~~~~~~ 144 (507) .+...-++.+...-+.+-+..+. .+++++.+++++++++ ++.+.+.++ .++.-||.+. +++|..+ T Consensus 100 ~n~~k~Ivd~~~~yl~G~p~~~~------~~d~~~~~~l~~~~~n----~~~~~~~~~~~~a~~~G~a~-~~v~~d~--- 165 (492) T protein:vir:94 100 TNFHANLVDQKVSYIVGKPIAFK------HTDDEVVKRIDEVLGN----RFDDKLHSVLTGASNKGIEW-LHPYLDE--- 165 (492) T ss_pred cchHHHHHHHHHhhhcccCceec------cCchHHHHHHHHHHhc----cHHHHHHHHHHHHhhCCeEE-EEEEecC--- Confidence 23333444444444444333332 2234566777777642 455655554 4677899875 4777532 Q ss_pred ccCcceeeeeccccCchhcccccceeecCC-Ccccccccccc----ccccccccccccccc----------------ccc Q lcl|NC_017974. 145 EYIGKYKVKNLAFRPQASLSRTNPIVYNED-GSAIIGIKQSL----SAFQNYTASEIGVGG----------------VST 203 (507) Q Consensus 145 ~~~g~~~~~~l~~r~~~~i~~~~~~~~~~d-~~~l~~~~q~~----~~~~~~~~~~~~~~~----------------~~~ 203 (507) +|.+.+.-+.|+- ++. ++-+.. +..+-.++-.. .....++........ +.. T Consensus 166 --dg~~~~~~~~p~~--~~~----v~d~~~~~~~~a~ir~~~~~~~~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~ 237 (492) T protein:vir:94 166 --EGEFKLFRVPAEQ--GIP----IWTDKEHEELEAFIRMYKLENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKT 237 (492) T ss_pred --CCceEEEEEcccc--eEE----EEcCCCCCceEEEEEEEeeccceeEEEEecCeEEEEEEecCeeeeccccccccccc Confidence 4555554443321 111 110111 11111111000 000000000000000 000 Q ss_pred CCCccccccccEEEEeeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHH Q lcl|NC_017974. 204 RMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEA 283 (507) Q Consensus 204 ~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~ 283 (507) ......++.--++.|. +|+.|.|.+..+....=.-+..+...+..++.+..|+++++- + ..+...+. T Consensus 238 ~~~~~~~g~vPvv~~~-----nn~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g---~----~~~~~~~~- 304 (492) T protein:vir:94 238 HFSTGSWGKIPFIPFK-----NNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLKN---Y----DDQELPEF- 304 (492) T ss_pred cccccCCCccceEEec-----CCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeec---C----Ccccchhh- Confidence 0000011111123332 356788888875544333344667788888889998887751 1 11111111 Q ss_pred HHHHHHHHHHHHhhcCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhh Q lcl|NC_017974. 284 EMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSY 363 (507) Q Consensus 284 ~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~ 363 (507) ...+. ....+.++.+.+.++. ..+.. ...+...++++.+.|.+.--...++.+.. |+. T Consensus 305 --~~~~~---------~~~~~~~~~~~~~~~l-------~~~~~--~~~~~~~~~~l~~~I~~~s~~p~~~~~~~--~~n 362 (492) T protein:vir:94 305 --KRLLR---------YYGAIKVSDNGGVDTI-------QVEVP--VENSKKYLDELYQKIMLFGQAVDFSSDKF--GSA 362 (492) T ss_pred --HHHHh---------hccceecCCCCcceeE-------eccCC--HHHHHHHHHHHHHHHHHHhCCcCCCcccc--ccC Confidence 11111 1134556777655442 22221 22356788898888887765544444321 111 Q ss_pred hHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccc Q lcl|NC_017974. 364 NLSES---KQTIHTQFVQRVNEIILEALNENLLPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPK 440 (507) Q Consensus 364 Al~~v---h~~v~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~ 440 (507) ..|+. ...-....+..-.+.+...|. ++++.++.+.+... +..--.+.|....+.|..+.++++.+|+ |+ ++ T Consensus 363 ~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~-~~~~li~~~~~~~~-~~~~i~v~f~~~~p~~~~e~~~~~~kl~--gi-iS 437 (492) T protein:vir:94 363 PSGVALEFLYTNLNLKADKLARKAKVAIQ-ELLWFVFEHFDIKG-EHKDVDISFNYNKVANTELQVQTAQQSM--GI-VS 437 (492) T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhcCCc-ccceeeEEecCCCCCCHHHHHHHHHHHh--cc-Cc Confidence 21221 112222333444455555563 46666776653222 2222357888888999999999999985 65 44 Q ss_pred cHHHHHHHHHHcCC-CCCCCcccc--cc---hhhhcCCCcCCCCCCcccccCcccccccccCCCCchhhhccC Q lcl|NC_017974. 441 TPKVINRVLEVLGI-DEKIEEDIS--QE---ELMKLLGEDTSRAGDGMTKGSSGNGTGKISSTRDNSAANLDN 507 (507) Q Consensus 441 ~~~~~~~i~e~~Gi-p~~~~~~~~--~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~a~~~~~ 507 (507) .+.+.+.++. +.+.. +.. .. ..+...++. .+... ++++ -+..+-+..| T Consensus 438 ----~et~~~~l~~v~d~~~-E~eri~~E~~~~~~~~~~~----~~~~~------~~~~----~~~~~~~~e~ 491 (492) T protein:vir:94 438 ----HETVLENHPFVEDLQA-ELERIEQEQMEYNKQLPNL----DDGGA------DSAQ----QQERSNNKES 491 (492) T ss_pred ----hHHHHHhCCCCCCHHH-HHHHHHHHHHHHHhhcccc----ccccC------CCCc----cccCCccccC Confidence 2456666654 32211 110 00 111111110 00000 0000 0111111111 No 175 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=94.28 E-value=0.0049 Score=33.27 Aligned_cols=415 Identities=11% Similarity=0.045 Sum_probs=155.1 Q ss_pred Cc---ccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHH---HH----------- Q lcl|NC_017974. 1 MS---RKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVE---SM----------- 63 (507) Q Consensus 1 ~~---~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~---~m----------- 63 (507) |. +..-+.-...+-+.......+. ...-.+....+. +..+ |..++.+.|+ ++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~i~~~i~~~~--~~~~--r~~~~~~Yy~g~~~i~~~~~~~~~~~ 69 (478) T protein:vir:10 1 MISINWPWDKPYHEQVVEQIKPKYETQ-------EEMILRLVREHK--ENID--NITMGERYYNHHPDILDAPFKRDVNG 69 (478) T ss_pred CccccccCCchhhhHHHHHhhhccCCh-------HHHHHHHHHHHH--HHHH--HHHHHHHHhcccccccccchhhhccc Confidence 22 1111111111100000000000 000000000000 0000 0011111111 00 Q ss_pred h----------hChHHHHHHHHHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHH-HHHHhhcce Q lcl|NC_017974. 64 K----------QDATVATGLDMLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDA-ATFNEYGLS 132 (507) Q Consensus 64 ~----------~D~~v~s~l~~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~-l~a~~~Gfs 132 (507) . ..+...-++.+...-+.+-+..+. ..+.++.+.+.+++++ .|.+.+.++ .++.-||.+ T Consensus 70 ~~~~~~~~~ki~~n~~k~ivd~~~~yl~g~p~~~~------~~~~~~~~~l~~~~~n----~~~~~~~~~~~~~~~~G~~ 139 (478) T protein:vir:10 70 DYDETKPDWRMYTNYHQNLVDQKVAYAVANPVTFG------VDNDKALKQIQHTLNH----KWDDKLVDILTAASNKGIE 139 (478) T ss_pred ccccccccceeccchHHHHHHHHhhhhcccCceee------cCChHHHHHHHHHHhc----cHHHHHHHHHHHHhhCCeE Confidence 0 022333333333333333333332 2234455666666542 466666654 568889986 Q ss_pred eeeeeeeccCccccCcceeeeeccccCchhcccccceeecCCCcccc-cccccc----ccccccccccccccccc----- Q lcl|NC_017974. 133 VVEKVYTQVTVGEYIGKYKVKNLAFRPQASLSRTNPIVYNEDGSAII-GIKQSL----SAFQNYTASEIGVGGVS----- 202 (507) Q Consensus 133 ~~Ei~w~~~~~~~~~g~~~~~~l~~r~~~~i~~~~~~~~~~d~~~l~-~~~q~~----~~~~~~~~~~~~~~~~~----- 202 (507) . +++|... +|.+.+.-+.|+. .+. ++-+.+.+.+. .++... .....++.....-.... T Consensus 140 ~-~~v~~d~-----~~~~~~~~~~p~~--~~~----v~d~~~~~~~~~~ir~~~~~~~~~~~~y~~~~i~~~~~~~~~~~ 207 (478) T protein:vir:10 140 W-VQPYVDE-----EGEFKTFRVPAEQ--AVP----IWTNKERDELQAFIRVYELDGAERVEYWTKDDVTFYELKEGQLI 207 (478) T ss_pred E-EEEEecC-----CCceEEEEEcccc--eEE----EEcCCCCCceEEEEEEEeeeCceEEEEEeCCcEEEEEecCCeee Confidence 5 5777532 4555555443331 111 11111111111 110000 00000000000000000 Q ss_pred -------c------CCCccccccc--cEEEEeeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecc Q lcl|NC_017974. 203 -------T------RMSDVIIPIN--RVMLMNTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIP 267 (507) Q Consensus 203 -------~------~~~~~~iP~~--k~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P 267 (507) . ...+..-+.. -++.|+ .++.|.|.+..+....=-=+..+..++..++.+..|+++++- T Consensus 208 ~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~g~sd~e~v~~liDa~~~~~S~~~~~~~~~~~~~~~~~g- 281 (478) T protein:vir:10 208 PDFYRSEDHIQPHYYQGNKLMSWGRVPFIPFK-----NNPQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILKG- 281 (478) T ss_pred ccccccccccccceecccccccCCcceEEEec-----cCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCcceeeec- Confidence 0 0001111111 123333 356788988874433322344667788888888888887651 Q ss_pred cccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeec--CcccccccccceeeeeccCCCcchhHHHHHHHHHHHHH Q lcl|NC_017974. 268 SQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLP--SDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSIL 345 (507) Q Consensus 268 ~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP--~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Is 345 (507) . ..++..+ ....+. +. .++.++ +|.+ ++++..+. ....+...++.+.+.|. T Consensus 282 --~----~~~~~~~---~~~~~~--------~~-~~~~~~~~~~~~-------~~~l~~~~--~~~~~~~~~~~l~~~I~ 334 (478) T protein:vir:10 282 --Y----EGEDMKD---FMHNLK--------YY-KAISVAGESGSG-------VDTIKVEV--PIDSVKEYTKMLRDYII 334 (478) T ss_pred --C----Ccccccc---hhhhhh--------hC-ceeEecCCCCCc-------ceEEeecC--CHHHHHHHHHHHHHHHH Confidence 1 1111111 111111 11 233343 2333 33333332 22346778888888887 Q ss_pred HHHhcccccccccchhhhhHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCccccceEEecCcCchhH Q lcl|NC_017974. 346 DRLGAGFINVGNDKGGSYNLSES---KQTIHTQFVQRVNEIILEALNENLLPQLLALNDMRLPETEMPYVKAGEIVDVDM 422 (507) Q Consensus 346 k~iLGqtLts~~~~~GS~Al~~v---h~~v~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl 422 (507) ..--...++.+.. +| ...|.. ...-....+..-.+.+...|. ++++-++.+.+.... ..-..+.|....+.|. T Consensus 335 ~~s~~p~~~~~~~-~~-n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~-~~~~li~~~~~~~~d-~~~i~i~f~~~~p~~~ 410 (478) T protein:vir:10 335 EFGQGVDFQQDKF-GN-SPSGIALKFMYSNLDLKANKLKNKTLTALQ-ELLQYIIDFYRLDVR-VQDIEITFNFNVMVNE 410 (478) T ss_pred HHhCCcCcCcccc-cc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhCCCcc-cccceEEeCCCCCCCH Confidence 7654444443221 11 111211 112222233334455666664 466777776543222 2234688888888999 Q ss_pred HHHHHHHHHHHhcCCccccHHHHHHHHHHcC-CCCCCCc-ccccch---hhhcCCCcCCCCCCcccccCccccccc Q lcl|NC_017974. 423 EGFSKAIQRIGAVGYLPKTPKVINRVLEVLG-IDEKIEE-DISQEE---LMKLLGEDTSRAGDGMTKGSSGNGTGK 493 (507) Q Consensus 423 ~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~G-ip~~~~~-~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~ 493 (507) .+.++.+.++ .|. ++. +.+.+.++ +..+..+ +....+ .....++.....-+.... .+..+.++ T Consensus 411 ~e~~~~~~~~--~g~-iS~----et~i~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~d~~~~-~~~d~~~e 478 (478) T protein:vir:10 411 LENSQIAMNS--TGL-LSK----ETILGNHSWVQDPVAEMERIEQENIELNQQLPDIEEGLNDEQQR-QSEDNQSE 478 (478) T ss_pred HHHHHHHHHH--hCC-CCh----HHHHHhCCCCCCHHHHHHHHHHHHHHHHHhccccCCCCcccccc-cCcCCCCC Confidence 9999998887 465 342 34445554 3322111 001111 111111111110000000 01111111 No 176 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=93.88 E-value=0.0061 Score=32.73 Aligned_cols=417 Identities=10% Similarity=-0.007 Sum_probs=157.5 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHH---H-------------h Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVES---M-------------K 64 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~---m-------------~ 64 (507) |+-+.+|- ..+++-.+...... ......+. +..+ |..++.+.|+- + + T Consensus 1 ~~~~~~~~--~~~~~~~~~~~~~i-----------~~~i~~~~--~~~~--r~~~~~~yy~g~~~i~~~~~~~~~~~~~k 63 (453) T protein:vir:73 1 MNLKPIKL--MTYSRDEEITDKVV-----------NDFMKKHQ--EEVE--RYEYLGNMYKGIMEISSQKAKDSWKPDNR 63 (453) T ss_pred Ccccccee--eeccccccCCHHHH-----------HHHHHHHH--HHHH--HHHHHHHHhccccchhcCCCCCccCccce Confidence 33322221 11111111100000 00000000 0000 11111111110 0 0 Q ss_pred -hChHHHHHHHHHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHH-HHHhhcceeeeeeeeccC Q lcl|NC_017974. 65 -QDATVATGLDMLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAA-TFNEYGLSVVEKVYTQVT 142 (507) Q Consensus 65 -~D~~v~s~l~~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~ 142 (507) -.+...-++.....-+.+-+..+. .++.++.+++.+++... .|......+. ++.-||.+. +.+|..+ T Consensus 64 i~~n~~~~ivd~~~~~l~g~~~~~~------~~d~~~~~~l~~~~~~n---~~~~~~~~~~~~~~~~G~~~-~~v~~d~- 132 (453) T protein:vir:73 64 LTNNFAKYIVDTFVGYFNGIPIKKT------HDDKSVLEAMQLFDNLN---DMEDEESELAKIACVYGRAY-ELMYQNE- 132 (453) T ss_pred eecchHHHHHHHhhhhhcccCceee------cCChHHHHHHHHHHHhc---ChhHHHHHHHHHHHhcCeEE-EEEEeCC- Confidence 123333333333333333332232 12344556777776542 3555555544 677899764 5777532 Q ss_pred ccccCcceeeeeccccCchhccccc-----------ceeecCCCccccccccccccccccccccccccccccCCCccccc Q lcl|NC_017974. 143 VGEYIGKYKVKNLAFRPQASLSRTN-----------PIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIP 211 (507) Q Consensus 143 ~~~~~g~~~~~~l~~r~~~~i~~~~-----------~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP 211 (507) +|...+.-+.|+. ++.-|. .++++.++.....+.. +.....+............ ....+. T Consensus 133 ----~~~~~i~~~~p~~--~~~v~dd~~~~~~~~~i~~~~~~~~~~~~~vyt-~~~i~~~~~~~~~~~~~~~--~~~~~g 203 (453) T protein:vir:73 133 ----STESEVIYCSPLN--VFMVYDDSIKQKPLFAVYYGFDEEGNLSGTVYT-LLETISITGKAGEVKFGES--TYNVYS 203 (453) T ss_pred ----CCceEEEEEcccc--eEEEEeCCCCceeEEEEEEEEecCceEEEEEEe-CCeEEEEEecCCceEEccc--eeccCC Confidence 4555554433221 111110 1122222221111000 0000011000000000000 001111 Q ss_pred cccEEEEeeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHH Q lcl|NC_017974. 212 INRVMLMNTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMS 291 (507) Q Consensus 212 ~~k~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~ 291 (507) .-.++.|+ .++.|.|.+..+-...=-=+..+..++..++.|..|.++++- ...+++........-.. T Consensus 204 ~vPvv~~~-----n~~~g~s~~~~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g--------~~~~~~~~~~~~~~~~~ 270 (453) T protein:vir:73 204 DLPIVEYN-----FNEERQSIFEPVHSLINSYNKVTSEKANDVEYFSDQYLVFLG--------AEVDEEDAKNIKDNRLI 270 (453) T ss_pred ceeEEEec-----CCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHhccceeeeec--------CCCCchhhhcccccccc Confidence 11123333 356788888765433323355678888889999999888751 11111111111110000 Q ss_pred HHHHhhcCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHH---H Q lcl|NC_017974. 292 DAANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSE---S 368 (507) Q Consensus 292 ~~~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~---v 368 (507) ..... ...+....+.+.++ +++..+. ....+..+++.+.+.|...--+..++.+. .| .+.|. . T Consensus 271 ~~~~~--~~~~~~~~~~~~d~-------~~l~~~~--~~~~~~~~~~~l~~~I~~~s~~p~~~~~~--~g-n~Sg~Al~~ 336 (453) T protein:vir:73 271 NFFDK--NSNGQGTNAAKVDV-------KFLDKPD--SDVQTENLLNRLERSIFQFTMAANISDEN--FG-NSSGVALAY 336 (453) T ss_pred ccccc--ccccccccccCcee-------EEeeecC--CHHHHHHHHHHHHHHHHHHhCCcccCccc--cc-CccHHHHHH Confidence 00000 01112222333333 3332222 22346778899999997755444444322 12 11121 1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCC--ccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHH Q lcl|NC_017974. 369 KQTIHTQFVQRVNEIILEALNENLLPQLLALNDMRLP--ETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVIN 446 (507) Q Consensus 369 h~~v~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~--~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~ 446 (507) ...-....+..-.+.+...|. ++++.++.+....+. +..-..+.|....+.|..+.++++.+++ |+ ++. + T Consensus 337 ~~~~l~~ka~~~~~~~~~~l~-~~~~li~~~~~~~~~~~~~~~i~v~f~~~~p~~~~~~a~~~~k~~--gi-is~----e 408 (453) T protein:vir:73 337 KLQAMSNLALSFQRKFQSALN-RRYSLWSSLSTNASNKDAWKDIEYTFTRNEPKDIKEQAETANILK--GI-TSE----E 408 (453) T ss_pred HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhccCCccccccceEEeCCCCCCCHHHHHHHHHHHh--cc-CcH----H Confidence 111122223333344555554 455555555211111 1223478899999999999999999986 65 442 3 Q ss_pred HHHHHcCC-CCCCCc-ccccchhhhcCCCcCCCCCCcccccCcccccccccCCCCchhhhc Q lcl|NC_017974. 447 RVLEVLGI-DEKIEE-DISQEELMKLLGEDTSRAGDGMTKGSSGNGTGKISSTRDNSAANL 505 (507) Q Consensus 447 ~i~e~~Gi-p~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~a~~~ 505 (507) .+.+.++. +.+..+ +....+........ ..+ ...++|..--|| T Consensus 409 t~~~~~~~~~d~~~E~~ri~~E~~~~~~~~-----------~~~-----~~~~~~~~~~~~ 453 (453) T protein:vir:73 409 TALSVISVIPDVQAEMEKIKKKKLLQLSLT-----------RTS-----NLVRMKQMRGNL 453 (453) T ss_pred HHHHhCCCCCCHHHHHHHHHHHHHHHHHHH-----------Hhc-----cCCcchhhhcCC Confidence 34455544 322111 00000000000000 000 112233333333 No 177 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=93.59 E-value=0.007 Score=32.39 Aligned_cols=424 Identities=9% Similarity=0.020 Sum_probs=148.4 Q ss_pred CccccccccccccccccccCchhh----------hHHHHhhhhcccchhccchhHHHHHH------Hh--hChHHH---- Q lcl|NC_017974. 13 VAKATSEQINVSRMRMSEQGTFAL----------AKVQVDSERMKAEEIRWPNLIGTVES------MK--QDATVA---- 70 (507) Q Consensus 13 ~~~~~~~~~~~~~~~~~~~g~~~~----------~~~~~~~~~e~~~~lr~~~~~~~y~~------m~--~D~~v~---- 70 (507) |..+.......|...+ +++..-. ..+..+. ...+ |..++.+.|+- +- .++... T Consensus 1 ~~~~~~~~~~~~~~~~-~~p~~~~~~~~~~~l~~~l~~~~~--~~~~--rl~~l~~YY~G~~~~~~~~~~~~~~~~~~~~ 75 (501) T protein:vir:25 1 MTVPVDVIADAPAADV-EFPEDSMSREQLGALVADMWRLHI--SERQ--WLDRIYEYTKGLRGRPEVPEGASDEVKELAK 75 (501) T ss_pred CcccchhhhccCcccc-cCCcccCChHHHHHHHHHHHHHHH--HHHH--HHHHHHHHHhcCCCchhccccCChhhhhhHh Confidence 3332221111111111 1111100 0111111 0111 11122222221 00 011110 Q ss_pred -HHHHHHHHHHhhC-----CceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHH-HHHHHhhcceeeeeeeeccCc Q lcl|NC_017974. 71 -TGLDMLYTFVEKA-----FKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARD-AATFNEYGLSVVEKVYTQVTV 143 (507) Q Consensus 71 -s~l~~Rk~av~~~-----~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~-~l~a~~~Gfs~~Ei~w~~~~~ 143 (507) ++.+==+..|..+ .--|..+.+ ..... +.+++.. ..|.....+ ..++.-||.|. +++|..+. T Consensus 76 ~~v~n~~~~ivd~~a~~l~~~gf~~~d~--~~~~~----l~~i~~~---N~~d~~~~~~~~~a~i~G~ay-~~v~~de~- 144 (501) T protein:vir:25 76 LSVKNVLSLVRDSFAQNLSVVGYRNALA--KENDP----AWEMWQR---NRMDARQAEVHRPALTYGASY-VTVTPTDE- 144 (501) T ss_pred hhhcChHHHHHHHHHhhhcccceecCCc--cchHH----HHHHHHh---cChhHHHHHHHHHHhhcCceE-EEEecCCC- Confidence 1100001111110 001222211 12222 3333332 235665555 45788899975 78886432 Q ss_pred cccCcceeeeeccccCchhcccc-c------ce-----eecC---CCcccccccccccccccccccccccc------c-- Q lcl|NC_017974. 144 GEYIGKYKVKNLAFRPQASLSRT-N------PI-----VYNE---DGSAIIGIKQSLSAFQNYTASEIGVG------G-- 200 (507) Q Consensus 144 ~~~~g~~~~~~l~~r~~~~i~~~-~------~~-----~~~~---d~~~l~~~~q~~~~~~~~~~~~~~~~------~-- 200 (507) |. .+.-+.|+ .++.-| . +. +... +..... ..-.+...+.+........ . T Consensus 145 ----~~-~i~~~sp~--~~~~iy~D~~~~~~~~~ai~~~~~~~~~~~~~~~-~~y~~~~~~~~~~~~~~~~~~~~~~~~~ 216 (501) T protein:vir:25 145 ----GP-VFRTRSPR--QILAVYADPSVDAWPQYALETWVAQKDAKPHRRG-VLYDDTYMYELDLGEVVLGDAGGGQATQ 216 (501) T ss_pred ----CC-eEEEeccc--cEEEEEecCCCCcceeEEEEEEeeccccCcceeE-EEecCeeEEEEecCceeeeecccccccc Confidence 21 12222222 111000 0 00 0000 000000 0000000111110000000 0 Q ss_pred -----------cccCCCccccccccEEEEeeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccc Q lcl|NC_017974. 201 -----------VSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQ 269 (507) Q Consensus 201 -----------~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~ 269 (507) .........++.--++-|....+ .++.|.|-+..+....=-=+..+...+...+-|..|..++. + T Consensus 217 ~~~~~~~~~~~~~~~~~~~~~~~vPiv~f~N~~~-~~~~g~sdie~v~~l~Da~~~~~s~~~~~~e~~a~p~~~i~---G 292 (501) T protein:vir:25 217 QPVNVREVTDVIEHGATFEGKPVCPVVRFVNGRD-ADDMIVGEVAPLILLQQAINSVNFDRLIVSRFGANPQRVIS---G 292 (501) T ss_pred ccccccccccccccccccCCccceeeEeccCccc-cCccccchhhhhHHHHHHHHHHHHHHHHHHHhhccHHHHHh---C Confidence 00000111122223344444443 36778887765432211123344455667776666654442 1 Q ss_pred cCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHh Q lcl|NC_017974. 270 ILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLG 349 (507) Q Consensus 270 ~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iL 349 (507) ...+..+ .++ + .. +.+...+|.+.++. ++.+ ++...|...++.+-.+||+.-- T Consensus 293 ----~~~~~~~----~~~----~------~~-~~i~~~~~~~~~~~----q~~~----~~~~~~~~~l~~~i~~i~~~s~ 345 (501) T protein:vir:25 293 ----WTGSKAE----VLK----A------SA-LRVWTFEDPEVKAQ----AFPP----ASVEPYNLILEEMLQHVAMVAQ 345 (501) T ss_pred ----CCCCccc----hhh----h------cc-cceeccCCCCceEE----Eecc----cChHHHHHHHHHHHHHHHhhcC Confidence 1111111 111 1 01 12333344443332 2221 1222355666777777766432 Q ss_pred ccccccc--ccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCcc-ccceEEecCcCchhHHHHH Q lcl|NC_017974. 350 AGFINVG--NDKGGSYNLSESKQTIHTQFVQRVNEIILEALNENLLPQLLALNDMRLPET-EMPYVKAGEIVDVDMEGFS 426 (507) Q Consensus 350 GqtLts~--~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~-~~P~~~~~~~~~~dl~~~a 426 (507) -...+.+ .+..+.-|+ .....-....++.-.+.+...|. ++++-++.+.+...+.. .-..+.|....+.++.+.| T Consensus 346 ~P~~~~~~~~~N~Sg~Al-~~~~~~l~~ka~~k~~~f~~~l~-~~~rl~~~~~~~~~~~~~~~i~v~w~~~~~~s~~~~a 423 (501) T protein:vir:25 346 ISPAQVTGKMINVSAEAL-AAAEANQQRKLAAKRESFGESWE-QLLRLAAEMDDDPDTAADSGAEVLWRDTEARSFGAVV 423 (501) T ss_pred CChhhhccccCChHHHHH-HHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhCCCccccceeeeEEecCCCCCCHHHHH Confidence 2222111 011111222 22333344444555666677775 46666666664222111 1246788889999999999 Q ss_pred HHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCccccc---chhh-----hcCCCcCCCCCCcccccCcccccccccCCC Q lcl|NC_017974. 427 KAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQ---EELM-----KLLGEDTSRAGDGMTKGSSGNGTGKISSTR 498 (507) Q Consensus 427 ~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~---~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 498 (507) +++.+|+.+|+ +. +.-+.+..|+.+++-+.... ++.+ ...+....+..+ .++...+.. T Consensus 424 da~~kl~~~gi--s~---et~~~~~~g~~~~~ie~~~~~~~e~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~ 489 (501) T protein:vir:25 424 DGITKLASAGI--PI---EHLLSMVPGMTQQTIQAIKDSLRGGEVKSLVDKLLSNEPAPVPP---------PPPQAAAQA 489 (501) T ss_pred HHHHHHHhcCC--CH---HHHHHHcCCCCHHHHHHHHHHHHHHhHHHHHHHhhccCcCCCCC---------CCCCCCccc Confidence 99999999985 32 23355667886432111100 0000 001110000000 000001111 Q ss_pred CchhhhccC Q lcl|NC_017974. 499 DNSAANLDN 507 (507) Q Consensus 499 d~~a~~~~~ 507 (507) ++.+.+-.+ T Consensus 490 ~~~~~~~~~ 498 (501) T protein:vir:25 490 LNEGGVNGN 498 (501) T ss_pred cccccCCCC Confidence 222222111 No 178 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=93.56 E-value=0.0071 Score=32.35 Aligned_cols=410 Identities=8% Similarity=-0.044 Sum_probs=155.8 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhH---HHHhhhhcccchhccchhHHHHHH---Hh---------- Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAK---VQVDSERMKAEEIRWPNLIGTVES---MK---------- 64 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~---~~~~~~~e~~~~lr~~~~~~~y~~---m~---------- 64 (507) |- .+.+-........ + .+...... ..-+......+ |..++.+.|+. ++ T Consensus 1 ~~-------~~~~~~~~~~~~~-----~--~~~~~~~~~~~i~~~~~~~~~~--~~~~~~~yy~g~~~i~~~~~~~~~~~ 64 (479) T protein:vir:79 1 ML-------NIYISETDLIKVQ-----L--KKESTINLVKVIEHYILKHRPE--KYKQGEEYYYGNTDVNNKRRYYLLDG 64 (479) T ss_pred CC-------CceecccceEeec-----c--ccCChhHHHHHHHHHHhhhhHH--HHHHHHHHhccCCccccccccccccc Confidence 00 0000000000000 0 00000000 11111111111 01112222210 00 Q ss_pred -------------hChHHHHHHHHHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHH-HHHhhc Q lcl|NC_017974. 65 -------------QDATVATGLDMLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAA-TFNEYG 130 (507) Q Consensus 65 -------------~D~~v~s~l~~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l-~a~~~G 130 (507) ..+...-++.....-+.+-+..|. ..++++.++++.++++ +|.+.+.++. ++..|| T Consensus 65 ~~~~~~~~~~~ki~~~~~~~Ivd~~~~~l~g~p~~~~------~~~~~~~~~~~~~~~n----~~~~~~~~~~~~~~~~G 134 (479) T protein:vir:79 65 AKVDDFTKVNNKAINNYHKLLVDQKVGYSVGNPIVFN------ADDDNLTKLLNDLLGE----EFDDTITELYLNASNKG 134 (479) T ss_pred ccccccccCcceeecchHHHHHHHHHhhhhcCCceec------cCCHHHHHHHHHHHhc----CHHHHHHHHHHHHHhcC Confidence 011122233333344433333332 2234456667666542 4666656654 677899 Q ss_pred ceeeeeeeeccCccccCcceeeeeccccCchhcccc------------ccee-ecCCCccccccccc-cccccccccccc Q lcl|NC_017974. 131 LSVVEKVYTQVTVGEYIGKYKVKNLAFRPQASLSRT------------NPIV-YNEDGSAIIGIKQS-LSAFQNYTASEI 196 (507) Q Consensus 131 fs~~Ei~w~~~~~~~~~g~~~~~~l~~r~~~~i~~~------------~~~~-~~~d~~~l~~~~q~-~~~~~~~~~~~~ 196 (507) .+. +++|... +|++.++-+.|+. .+.-| +.+. .+.++..+..+.-+ +.....+..... T Consensus 135 ~~~-~~v~~d~-----~~~~~i~~~~p~~--~~~v~d~~~~~~~~~~ir~y~~~~~~~~~~~~~e~y~~~~i~~~~~~~~ 206 (479) T protein:vir:79 135 VEW-LHPYINR-----KGEFKYVIIPAEE--AIPIWDSKRQRELVAFIRFYYIEDIDGNKIKRVEYYTENDITYFIERGN 206 (479) T ss_pred eEE-EEEEeCC-----CCceEEEEEccce--eEEEEeCCCCCceEEEEEEEEEeecCCceEEEEEEEeCCcEEEEEecCC Confidence 775 5777532 3455544433221 11000 0000 01111111100000 000000000000 Q ss_pred cccc---cc--------------cCCCccccccccEEEEeeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_017974. 197 GVGG---VS--------------TRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMG 259 (507) Q Consensus 197 ~~~~---~~--------------~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~ 259 (507) .... .. .......++.--++.|+ +|++|.|.+..+....=-=+..+..++..++.+.. T Consensus 207 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~-----nn~~g~sd~~~v~~liDa~d~~~S~~~~~~~~~~~ 281 (479) T protein:vir:79 207 SFIQEFLYDEYGKMTDIQEGHFRINNKEQGWGKVPFIPFK-----NNEKCVSDLTFYKSLIDIYDNNISTLADNLDEIQE 281 (479) T ss_pred cccccccccccccccccccccccccccccCCCcccEEEec-----CCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhC Confidence 0000 00 00000011111223332 46778888876554433335566789999999999 Q ss_pred CceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHH Q lcl|NC_017974. 260 GVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISD 339 (507) Q Consensus 260 p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~ 339 (507) |+++++- . . ....++. ...+. .+ ..+.++.+.+.++ +..+. ....+...++. T Consensus 282 ~~~v~~g---~---~-~~~~~~~---~~~~~-------~~--~~i~~~~~~~~~~-------l~~~~--~~~~~~~~~~~ 333 (479) T protein:vir:79 282 VIYVLKE---Y---P-GTSLQEF---IDNIR-------YY--KSIKVDGGGGVDK-------LEINI--PVEAKKELLDR 333 (479) T ss_pred ceeeeec---C---C-ccccccc---hhhhh-------hc--cceecCCCCcceE-------EeccC--CHHHHHHHHHH Confidence 9888651 1 1 1111111 11111 11 2344566655443 33332 22346778899 Q ss_pred HHHHHHHHHhcccccccccchhhhhHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-CCCcCC--ccccceEE Q lcl|NC_017974. 340 RKKSILDRLGAGFINVGNDKGGSYNLSES---KQTIHTQFVQRVNEIILEALNENLLPQLLAL-NDMRLP--ETEMPYVK 413 (507) Q Consensus 340 ~d~~Isk~iLGqtLts~~~~~GS~Al~~v---h~~v~~~~~~aD~~~i~~~ln~~li~~l~~l-N~~~~~--~~~~P~~~ 413 (507) +.+.|...--+..++.+. .|. +.|.. ...-....+..-.+.+...|. ++++.++.+ +...+. +..-+.+. T Consensus 334 l~~~i~~~s~~p~~~~~~--~gn-~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~-~~~~li~~~~~~~~~~~~~~~~i~i~ 409 (479) T protein:vir:79 334 LEKNIIIFGQGVNPESQN--TGD-KSGVALKFLYSLLDLKCSKTEKKFKKAIR-ELLWFVCEYLKISGNKSYDYKTVQIT 409 (479) T ss_pred HHHHHHHHhCcccccccc--ccc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhccCCCccccccceEE Confidence 999988876665554432 222 11211 111122223333344445553 355555543 211111 22345788 Q ss_pred ecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCC-CCCCCc-ccccchhhhcCCCcCCCCCCcccccCccccc Q lcl|NC_017974. 414 AGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGI-DEKIEE-DISQEELMKLLGEDTSRAGDGMTKGSSGNGT 491 (507) Q Consensus 414 ~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gi-p~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (507) |....+.|.++.++++.+|+ |. ++ .+.+.+.++. +.+..+ +....+......... ..+ +.. ....+++ T Consensus 410 f~~~~p~~~~~~a~~~~kl~--g~-iS----~et~l~~l~~v~d~~~E~~ri~~E~~~~~~~~~-~~~-~~~-~~~~~e~ 479 (479) T protein:vir:79 410 FNHSMIINEAEKIDMAAKST--GI-VS----DETIVSNHPWVEDVNDELERLKKQEDTQKEYDD-LIP-NNQ-DGVIDET 479 (479) T ss_pred eCCCCCcCHHHHHHHHHHHh--cc-Cc----HHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHh-ccC-ccc-CCCcCcC Confidence 99899999999999999984 65 34 2445566654 222111 000111110010000 000 000 0011111 No 179 >protein:vir:95014 Length: 491 # NCBI annotation: structural protein # Family: family:all:584 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224035;genbank:gi:62327322;genbank:GeneID:5176842 Probab=93.34 E-value=0.0079 Score=32.10 Aligned_cols=430 Identities=11% Similarity=0.026 Sum_probs=161.6 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhccc--------chhcc---chhHHHHHHHhh---- Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKA--------EEIRW---PNLIGTVESMKQ---- 65 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~--------~~lr~---~~~~~~y~~m~~---- 65 (507) |---+.++..+..+-|.-.+. ...|......-.+ ..|.+ ...-+-|+.-+. T Consensus 1 ~~~~~~~~~~V~~~hp~y~a~--------------~~~W~~ird~~~G~~~~~~r~~yl~~~~~~~~e~~Y~~rl~rA~~ 66 (491) T protein:vir:95 1 MLTANGQGSGVKTKHREWLHY--------------APKWQKVRHALAGDLVGYLRNVGLNEPDKAYGEARQAEYEAGGIV 66 (491) T ss_pred CcccCCccCCCCccCHHHHHH--------------HHHHHHHHHHhcCcchhhcccCCCcCCCCCCCHHHHHHHHhcccC Confidence 333332222222222221110 0111111000000 00110 011122554332 Q ss_pred ChHHHHHHHHHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHHH-HHhhcceeeeeeeeccCcc Q lcl|NC_017974. 66 DATVATGLDMLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAAT-FNEYGLSVVEKVYTQVTVG 144 (507) Q Consensus 66 D~~v~s~l~~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l~-a~~~Gfs~~Ei~w~~~~~~ 144 (507) =+++...++.-...|.+.+..+.++ ..+..|++.+ +..+.++..++++++. ++.||.+.+=+.+-..++ T Consensus 67 ~n~~~~tl~~l~G~vfrk~p~~~~p-------~~l~~l~~d~--D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~- 136 (491) T protein:vir:95 67 YNFTRRTLSGMVGSVMRKEPEINIP-------KELEYLLKNA--DGSGVGLIQHAQDTLMEIDSVGRGGLLVDAPETAA- 136 (491) T ss_pred CChHHHHHHHHhchhhcCCceeecc-------HHHHHHHhcc--CCCCCCHHHHHHHHHHHHHHcCeEEEEEecCCCcc- Confidence 3566666666555665554433321 2233333322 3455688899988765 777998876554422211 Q ss_pred cc-----CcceeeeeccccCchhcccccceee-----------------c--CCCcccccccccccc---------cccc Q lcl|NC_017974. 145 EY-----IGKYKVKNLAFRPQASLSRTNPIVY-----------------N--EDGSAIIGIKQSLSA---------FQNY 191 (507) Q Consensus 145 ~~-----~g~~~~~~l~~r~~~~i~~~~~~~~-----------------~--~d~~~l~~~~q~~~~---------~~~~ 191 (507) .. .+... --|....+..|..|+.-.. . .|+.+-....|..-. ...+ T Consensus 137 ~T~Ade~~~~~r-Py~~~~~~~~IinW~~~~v~g~~~L~~v~l~E~~~~~d~~~~f~~~~~~qyRvL~l~~~g~~~~~v~ 215 (491) T protein:vir:95 137 ATAAEQNAGLLN-PTIAFYTTENIVNWRLTRVGSVNRVTMVVLRETWEYHEPGNEFETKYGEQYRVLDIDTDGNYRQRLF 215 (491) T ss_pred cCHHHHHHhcCC-cEEEEechhhhcCceeeeeCCceeeeEEEEEEeEEeecCCCCcccceEEEEEEEeecCCCceEEEEE Confidence 00 00000 0011122222322221000 0 011111111111000 0000 Q ss_pred ccccccccc-----cccCCCccccccccEEEEeeCCccCccccchHHHHHHHHHHHHHH-HHHHHHHHHHHhcCCceEEe Q lcl|NC_017974. 192 TASEIGVGG-----VSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYRAWREKIL-IENLEVVGATKDMGGVIELK 265 (507) Q Consensus 192 ~~~~~~~~~-----~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~-~~~~w~~f~Er~g~p~~v~~ 265 (507) .....+... .....++..++.=-|+++-. ...+-..+...|..++..-+ ++| ...+.-+-+-.-+.|++++. T Consensus 216 r~~~~g~~~~~~~~~~~~~g~~~l~~IPfv~~~~-~~~~~~~~~pPLl~LA~lni-~Hy~~ssd~~~~l~~~~~P~l~~~ 293 (491) T protein:vir:95 216 RFDAEGGAQEEVVEIYPDLGESLRGVIPFTFIGA-TNNDATIDDAPLLPLAELNI-GHYRNSADNEESSFVVGQPTLFIY 293 (491) T ss_pred EEcCCCcceeeeeeeeecCCCcccCeeEEEEEec-CCCCCCCCcCchHHHHHHHH-HHhhhhhHHHHHHHHcccceeeee Confidence 000000000 00011122222112333322 22333334555555554322 222 11222222333356666653 Q ss_pred cccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHH Q lcl|NC_017974. 266 IPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSIL 345 (507) Q Consensus 266 ~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Is 345 (507) - ..+..++. +.... ...++-|+.++..+|.+.+.. ++++++++ .. ++.++-...+|. T Consensus 294 G--------~d~~~~~~---~~~~~--~~~i~~g~~~~~~lP~~~~~~-------~ie~~~~~--~~-~~~l~~~e~qm~ 350 (491) T protein:vir:95 294 P--------GDNLTPQS---FKEAN--PNGIKFGSRCGHNLGYGGSAQ-------LIQAGENN--LA-RQNMLDKEQQAI 350 (491) T ss_pred c--------CcccCcch---hhccC--cceeEecCcCCcCCCCCCccc-------eeecCcch--HH-HHHHHHHHHHHH Confidence 1 00011110 01000 112345777888888876544 34444432 22 333443333433 Q ss_pred HHHhcccccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCccccceEEe--cC-cCchhH Q lcl|NC_017974. 346 DRLGAGFINVGNDKGGSYNLSESKQTIHTQFVQRVNEIILEALNENLLPQLLALNDMRLPETEMPYVKA--GE-IVDVDM 422 (507) Q Consensus 346 k~iLGqtLts~~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~~P~~~~--~~-~~~~dl 422 (507) . +..+.++.+..++++-+.. .......++.+-+..+++.++ +++++++.+-+. +...-+.|.. +. ....|. T Consensus 351 ~-~Ga~l~~~~~~~Ta~~~~~--~~~~~~S~L~~~a~~~e~al~-~~l~~~a~w~G~--~~~~~v~i~~n~dF~~~~~~~ 424 (491) T protein:vir:95 351 Q-IGAQLITPSQQITAESARI--QRGADTSVMATIARNVSQAYT-DALRWVAMMLGK--PEDSEVEFQLNMDFFLQPMTA 424 (491) T ss_pred H-HHHHhccCCcchhHHHHHH--HHHHhhHHHHHHHHHHHHHHH-HHHHHHHHHcCC--CCCCceEEEeecccccccCCH Confidence 2 2223333322345554433 333346777888899999997 478999988532 1111122321 21 122232 Q ss_pred HHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhhc--CCCcCCCCCCccc-ccCccc Q lcl|NC_017974. 423 EGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMKL--LGEDTSRAGDGMT-KGSSGN 489 (507) Q Consensus 423 ~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~~--~~~~~~~~~~~~~-~~~~~~ 489 (507) ...+++-++...|.+.. .....++ ++.||++...+++........ .+..++-+++... +.+..+ T Consensus 425 -~~~~all~~~~~G~is~-~t~~~~L-~~~~vl~~~~e~~~~~ie~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (491) T protein:vir:95 425 -QDRAAWMADINAGLLPA-TAYYAAL-RKAGVTDWTDEDILNAIEDAPLPSGAVTQVAGEIPQAAQQQQE 491 (491) T ss_pred -HHHHHHHHHHhcCCCCH-HHHHHHH-HhCCCCCccHHHHHHHHHhcCCCCCccccccccchhhhhhccC Confidence 23556667778887543 3334444 456888665544332222211 1122222222211 111111 No 180 >protein:vir:98853 Length: 219 # NCBI annotation: hypothetical protein # Family: family:all:196 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654729;genbank:gi:109302914;genbank:GeneID:4156058 Probab=93.18 E-value=0.0085 Score=31.94 Aligned_cols=213 Identities=13% Similarity=0.026 Sum_probs=96.2 Q ss_pred eeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 169 IVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYRAWREKILIEN 248 (507) Q Consensus 169 ~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~~ 248 (507) .....||.. ++.... ......+....++++.+++++.....+..||.+.+..+......-....+ T Consensus 1 ~r~~~dg~~----------~y~~~~-----~~~~~~g~~~~~~~~eilH~r~~~~~~~~~Glspi~~a~~~i~~~~aa~~ 65 (219) T protein:vir:98 1 MRVCKDGNY----------KYLMKK-----SLYDTKSEIYEYNKNDVIFIKLYDPMQQVYGSPDYVGGITSALLNSDATI 65 (219) T ss_pred CceeecCeE----------EEEEec-----ceecCCceeEEeccccEEEecCCCCCCCcceecHHHHHHHHHHHHHHHHH Confidence 111223211 000000 00111233456677776555543334566899998887766655454555 Q ss_pred HHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCce-eEeec--Ccccccccccceeeeecc Q lcl|NC_017974. 249 LEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQS-FFMLP--SDTKDNAPQYSMTLKGID 325 (507) Q Consensus 249 ~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a-~~iiP--~g~~i~~~~~~~el~~~~ 325 (507) +-..|....++|--++++|- ..-+++...+..+.+ +. .+++..+ .++++ .|.+- -++++..+ T Consensus 66 ~~~~~f~Ng~~p~gil~~~~------~~l~~e~~~~~~~~~-~~---~~g~~n~~~~~l~~~gg~~~-----G~~~~~~~ 130 (219) T protein:vir:98 66 FRRRYYSNGAHMGFILYSTD------PDMTEEMEDEIAERI-RD---SKGVGNFRSMFVNIAGGHPD-----GLKVIPIG 130 (219) T ss_pred HHHHHHhcCCCCceEEEeCC------CCCCHHHHHHHHHHH-HH---hcCcccccceeEecCCCCcc-----ceeEEEcc Confidence 55567766666655554431 111233333332233 22 2222221 23332 22110 12333333 Q ss_pred CCCcchhHHHHHHHHHHHHHHHHhccccccc--ccchhhhhH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC Q lcl|NC_017974. 326 GMGKQYSTAQLISDRKKSILDRLGAGFINVG--NDKGGSYNL-SESKQTIHTQFVQRVNEIILEALNENLLPQLLALNDM 402 (507) Q Consensus 326 g~g~~~~~~~li~~~d~~Isk~iLGqtLts~--~~~~GS~Al-~~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~lN~~ 402 (507) -+..-..|.+.-++-..+|+.+.--.---.+ +..+++++- .+.....-..-+.--++.|++.||+++ + T Consensus 131 ~~~~d~qfle~rk~~~~eIa~~fgVPp~~lG~~~~~~~~~sn~eq~~~~f~~~tL~P~~~~ie~~ln~~~---~------ 201 (219) T protein:vir:98 131 DTGQKDEFANIKNISAQDVLTSHRFPPGLSGIIPVNTAGLGDPLKIREAYQADEVLPLQEIIAESINSDY---E------ 201 (219) T ss_pred CCHHHHHHHHHHHhhHHHHHHHhCCCHHHcccccCCCCCccCHHHHHHHHHHHHHHHHHHHHHHHhhhhh---c------ Confidence 3333334556666667888887754332222 112233432 222233333444455566666666531 1 Q ss_pred cCCccccceEEecCcCchhHH Q lcl|NC_017974. 403 RLPETEMPYVKAGEIVDVDME 423 (507) Q Consensus 403 ~~~~~~~P~~~~~~~~~~dl~ 423 (507) .+.. .++.|+.....|+. T Consensus 202 -~~~~--~~~~F~~~~~~d~~ 219 (219) T protein:vir:98 202 -IKSA--LKVNFKQPEKRDKN 219 (219) T ss_pred -CCCc--cEEeecCcccccCC Confidence 1111 25888888888866 No 181 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=92.61 E-value=0.011 Score=31.39 Aligned_cols=410 Identities=10% Similarity=-0.007 Sum_probs=153.1 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTFV 80 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~av 80 (507) +++-+ .+..+.-+... .=.+++. +...+....... ..+. ..+ ..++ -.....-++.....-+ T Consensus 53 i~~~~--~~~~r~~~l~~-------YY~g~~~-i~~~~~~~~~~~-~~~~-~~~-~~ri-----~~n~~k~Ivd~~~~yl 114 (492) T protein:vir:97 53 IKQHL--EKLPEISIGQE-------YYEQRPD-IVKEPKPVDATG-AVDP-LKP-DDRM-----ITNFHANLVDQKVSYI 114 (492) T ss_pred HHHHH--HHHHHHHHHHH-------HhcccCc-cccccccccccc-cccc-ccc-cccc-----ccchHHHHHHHHhhhh Confidence 11100 00000000000 0000000 000000000000 0000 000 0000 0111111222222223 Q ss_pred hhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHH-HHHHhhcceeeeeeeeccCccccCcceeeeeccccC Q lcl|NC_017974. 81 EKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDA-ATFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAFRP 159 (507) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~-l~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~r~ 159 (507) .+-+..+.. +++++.+++++++++ ++.+.+.++ .++.-||.+ ++++|..+ +|.+.+.-+.|+. T Consensus 115 ~g~p~~~~~------~d~~~~~~l~~~~~n----~~~~~~~~~~~~~~~~G~a-~~~v~~d~-----dg~~~~~~~~p~~ 178 (492) T protein:vir:97 115 VGKPIAFKH------TDDEVVKRIDEVLGN----RFDDKLHSVLTGASNKGIE-WLHPYLDE-----EGEFKLFRVPAEQ 178 (492) T ss_pred cccCceecc------CchHHHHHHHHHHhc----cHHHHHHHHHHHHhhcCeE-EEEEEecC-----CCceEEEEEcccc Confidence 333333322 234566777777642 466666654 467789974 56888532 4566555444331 Q ss_pred chhcccccceeecCCCcccccccccc----cccccccccccccc----------------ccccCCCccccccccEEEEe Q lcl|NC_017974. 160 QASLSRTNPIVYNEDGSAIIGIKQSL----SAFQNYTASEIGVG----------------GVSTRMSDVIIPINRVMLMN 219 (507) Q Consensus 160 ~~~i~~~~~~~~~~d~~~l~~~~q~~----~~~~~~~~~~~~~~----------------~~~~~~~~~~iP~~k~i~~~ 219 (507) ++.-|.. ...+..+-.++... .....++....... ..........++.--++.|+ T Consensus 179 --~~~i~d~---~~~~~~~~~vr~~~~~~~~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~ 253 (492) T protein:vir:97 179 --GIPIWTD---KEHEELEAFIRMYKLENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFK 253 (492) T ss_pred --eEEEEcC---CCCCceEEEEEEEeeccceeEEEEecCeEEEEEEecCeeeecccccccccccccccCCCCCcceEEec Confidence 1111100 00111111111000 00000000000000 00000000001111123332 Q ss_pred eCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcC Q lcl|NC_017974. 220 TGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSG 299 (507) Q Consensus 220 ~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G 299 (507) +|+.|.|.+..+-...=--+..+...+..++.+..|+++++- . ..++..+. ...+. T Consensus 254 -----nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~~~g---~----~~~~~~~~---~~~~~--------- 309 (492) T protein:vir:97 254 -----NNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLKN---Y----DDQELPEF---KRLLR--------- 309 (492) T ss_pred -----CCCCCCCchHhHHHHHHHHHHHHHHHHHHHHHhccceeeeec---C----CcccchhH---HHHHh--------- Confidence 356788888875443333445677778888888888887651 1 11111111 11111 Q ss_pred CceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHH---HHHHHHHH Q lcl|NC_017974. 300 EQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSES---KQTIHTQF 376 (507) Q Consensus 300 ~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~v---h~~v~~~~ 376 (507) ....+.++.+.+.++ +..+. ....+..+++++.+.|.+.--...++.+.. +| ...|.. ...-.... T Consensus 310 ~~~~~~~~~~~~~~~-------l~~~~--~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~-~~-n~Sg~Al~~~~~~l~~k 378 (492) T protein:vir:97 310 YYGAIKVSDNGGVDT-------IQVEV--PVENSKKYLDELYQKIMLFGQAVDFSSDKF-GS-APSGVALEFLYTNLNLK 378 (492) T ss_pred hccceecCCCCccee-------EeccC--CHHHHHHHHHHHHHHHHHHhCCCCCCcccc-cc-CcHHHHHHHHHHHHHHH Confidence 112455677765444 22222 223467788999888888755444443221 11 111221 11112223 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCC-C Q lcl|NC_017974. 377 VQRVNEIILEALNENLLPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGI-D 455 (507) Q Consensus 377 ~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gi-p 455 (507) +..-.+.+...+. ++++.++.+.+.... ..--.+.|....+.|..+.++++.+|+ |+ ++ .+.+.+.++. + T Consensus 379 a~~~~~~f~~~l~-~~~~li~~~~~~~~~-~~~i~v~f~~~~p~~~~e~a~~~~kl~--G~-iS----~et~l~~l~~v~ 449 (492) T protein:vir:97 379 ADKLARKAKVAIQ-ELLWFVFEHFDIKGE-HKDVDISFNYNKVANTELQVQTAQQSM--GI-VS----HETVLENHPFVE 449 (492) T ss_pred HHHHHHHHHHHHH-HHHHHHHHHhcCCcc-cceeeEEecCCCCCCHHHHHHHHHHHh--cc-Cc----hHHHHHhCCCCC Confidence 3334444555553 356666665432222 122357888888999999999999984 65 34 2445566654 3 Q ss_pred CCCCcccc--cch---hhhcCCCcCCCCCCcccccCccccccccc Q lcl|NC_017974. 456 EKIEEDIS--QEE---LMKLLGEDTSRAGDGMTKGSSGNGTGKIS 495 (507) Q Consensus 456 ~~~~~~~~--~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 495 (507) .+.. +.. ..+ .+...++ ....+........+.+..+.. T Consensus 450 d~~~-Eleri~~E~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~e 492 (492) T protein:vir:97 450 DLQA-ELERIEQEQTEYNKQLPN-LDDGGADSAQQQERSNNKESE 492 (492) T ss_pred CHHH-HHHHHHHHHHHHHHhhhc-cccCCCCCCcccccccccccC Confidence 2211 110 000 0111111 000000000000011111100 No 182 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=92.51 E-value=0.011 Score=31.30 Aligned_cols=424 Identities=9% Similarity=0.002 Sum_probs=158.9 Q ss_pred CcccccccccccCccccccccccccccccccCc--hhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGT--FALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYT 78 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~--~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~ 78 (507) |++-+.+ +.++...- .++-+ |. ..++.. . ..+. ..+ . . -....+-.-+...... T Consensus 49 i~~h~~~----~~~rl~~l------~~yY~-g~~~~i~~~~--~----~~~~-~~~-~---~--ki~~n~~k~Ivd~~~~ 104 (502) T protein:vir:48 49 INHHKLR----QAPRIQEL------LDYAR-GENHDVLKSG--R----RKDN-EMA-D---K--RAVHNYGRMISKFKTG 104 (502) T ss_pred HHHHHHH----HHHHHHHH------HHHhc-CCCccccccc--c----cccc-ccc-c---c--eeecchHHHHHHHHhh Confidence 1111100 00000000 00000 00 000000 0 0000 000 0 0 0012333344444445 Q ss_pred HHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeeeeccc Q lcl|NC_017974. 79 FVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAF 157 (507) Q Consensus 79 av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~ 157 (507) -+.+-+..|... +++....+.+++.+++.. ..|...+.+++ ++.-||.+ ++++|... +|.+.+.-+.| T Consensus 105 yl~g~p~~~~~~--d~~~~~~~~~~l~~~~~~---N~~~~~~~~~~~~~~~~G~a-~~~v~~de-----dg~~~i~~~~p 173 (502) T protein:vir:48 105 YLAGNPIRVEYD--DNEDNSQNDDAIKRIGRI---NDIDTHNRNLIRDLSQTGRA-YEVIYRSE-----YDETRIKRLSP 173 (502) T ss_pred hhcccCeeEecC--CccchhHHHHHHHHHHhh---cCHhHHHHHHHHHHhhcCeE-EEEEEeCC-----CCceEEEEEcc Confidence 555555445443 334445677778877654 24777766654 57779975 46888643 45555554433 Q ss_pred cCch----------hcccccceeecCCCccccccc-cccccccccccccccccccccCCCcc-ccccccEEEEeeCCccC Q lcl|NC_017974. 158 RPQA----------SLSRTNPIVYNEDGSAIIGIK-QSLSAFQNYTASEIGVGGVSTRMSDV-IIPINRVMLMNTGGSSS 225 (507) Q Consensus 158 r~~~----------~i~~~~~~~~~~d~~~l~~~~-q~~~~~~~~~~~~~~~~~~~~~~~~~-~iP~~k~i~~~~~~~~~ 225 (507) +.-. .+..++.+....+......+. -.+...+.+...... .......... .|| ++.|. + T Consensus 174 ~~~~~vydd~~~~~~~~~ir~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~-~~~~~~~~~~g~vP---vv~~~-----n 244 (502) T protein:vir:48 174 LETFVIYDNSLEDNSIAAVRYYNRGTLQNAKDVVEIYTNQHIYTLDASDSF-NEISVTPHAFGTVP---ITEFL-----N 244 (502) T ss_pred cceEEEEcCCCCCceEEEEEEEEEeecCCcEEEEEEEeCCeEEEEEeCCce-eeccceecCCCccc---eEEec-----C Confidence 2110 000011000000000000000 000001100000000 0000000011 122 23332 4 Q ss_pred ccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEe Q lcl|NC_017974. 226 QALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFM 305 (507) Q Consensus 226 ~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~i 305 (507) |+.|.|.+..+....=--...+..++..++.|..|+++++-. .. ...++....+.. +.++ T Consensus 245 n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~---~~----~~~~~~~~~~~~-------------~~~~ 304 (502) T protein:vir:48 245 NADGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGD---LA----LPQGMQASDMKR-------------TRLM 304 (502) T ss_pred CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecC---cc----cccccchhhhhh-------------ccee Confidence 678899998755443334557788888899999998887621 10 111111111110 1111 Q ss_pred -ecCc--ccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHH-H--HHHHHHHHH Q lcl|NC_017974. 306 -LPSD--TKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESK-Q--TIHTQFVQR 379 (507) Q Consensus 306 -iP~g--~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh-~--~v~~~~~~a 379 (507) ++.. .........++++..+.. ...+..+++.+.+.|.+.--...++.+.. ++.+.|..- . .-....+.. T Consensus 305 ~~~~~~~~~~~~~~~d~~~l~~~~~--~~~~~~~~~~L~~~I~~~s~~p~~~~~~~--~~n~Sg~Alk~~~~~l~~k~~~ 380 (502) T protein:vir:48 305 QLKPPKSADGKEGTVKAEYLTKSYD--VSGAEAYKTRLNKDIHVFTNTPDMSDNHF--SGNASGEALKYKLFGLDQDRVD 380 (502) T ss_pred eccccccccccccCcceeEeeecCC--HHHHHHHHHHHHHHHHHHhCCCCcCcccc--ccCchHHHHHHHHHHHHHHHHH Confidence 1110 000001112333333221 22356788999999987654444443321 112212211 1 112222333 Q ss_pred HHHHHHHHHHHHHHHHHHHhCCCcC--C--ccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCC Q lcl|NC_017974. 380 VNEIILEALNENLLPQLLALNDMRL--P--ETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGID 455 (507) Q Consensus 380 D~~~i~~~ln~~li~~l~~lN~~~~--~--~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip 455 (507) -.+.+...|.+ +++-++.+-...+ . +..-..+.|....+.|..+.++++.+|. |. ++ .+.+.+.++.- T Consensus 381 ~~~~~~~~l~~-~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~e~a~~~~kl~--g~-iS----~et~l~~l~~v 452 (502) T protein:vir:48 381 TQSQFTQGLKR-RYRLAARIGSLVNEFKDFDESRLKITFTPNLPKSLYEQVSILNDLG--GQ-VS----QETALSLSGLV 452 (502) T ss_pred HHHHHHHHHHH-HHHHHHHHHhhcccccccccccceEEeCCCCCcCHHHHHHHHHHHh--cc-Cc----HHHHHHhCCCC Confidence 34555555643 4455544311111 1 1123578899999999999999999984 65 44 24566666542 Q ss_pred CCCCcccc--cchhhhcCCCcCCCCCCcccccCcccccccccCCCCchhhhccC Q lcl|NC_017974. 456 EKIEEDIS--QEELMKLLGEDTSRAGDGMTKGSSGNGTGKISSTRDNSAANLDN 507 (507) Q Consensus 456 ~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~a~~~~~ 507 (507) .....+.. ..+......+........ ....+.+. ........-.| T Consensus 453 ~D~~~E~~ri~~E~~~~~~~~~~~~~~~--~~~~~~d~-----~~e~~~~~~~~ 499 (502) T protein:vir:48 453 ENPTEELDKINEESSKIDFKGYPSYFYD--NVGKYTDE-----VKETHTDDFER 499 (502) T ss_pred CCHHHHHHHHHHHHHhhhhhcccccccc--cccccCCC-----ccCCCCcCcCC Confidence 21111111 001110000000000000 00000000 00011111111 No 183 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=91.93 E-value=0.014 Score=30.82 Aligned_cols=446 Identities=9% Similarity=0.010 Sum_probs=161.3 Q ss_pred Cc--------------cccccccccc-Cccccccccccccc-----cccccCchhhhHHH-Hhhhhc--ccchhccchhH Q lcl|NC_017974. 1 MS--------------RKRNRNRSVK-VAKATSEQINVSRM-----RMSEQGTFALAKVQ-VDSERM--KAEEIRWPNLI 57 (507) Q Consensus 1 ~~--------------~k~~~~~~~~-~~~~~~~~~~~~~~-----~~~~~g~~~~~~~~-~~~~~e--~~~~lr~~~~~ 57 (507) |. .+|-.+.+.. .-.+.-.....+.. -...+-..-...+. ...+.+ +...+...... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~r~~~~~~yY~g~~~~i~~~~~~~ 80 (501) T protein:vir:96 1 MEQTLFTDSTGQERVLNLRFHRESRIRYRADNLEELMVNNWELLKNFINHHKLRQAPRIQELLDYARGENHDVLKSGRRK 80 (501) T ss_pred CceeeeeecccceeccccccchhHHhhhcccccccccCChHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcccCccccC Confidence 11 1111111110 00000000000000 00000000000000 000000 00011100000 Q ss_pred HHH-HH-HhhChHHHHHHHHHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHH-HHHhhcceee Q lcl|NC_017974. 58 GTV-ES-MKQDATVATGLDMLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAA-TFNEYGLSVV 134 (507) Q Consensus 58 ~~y-~~-m~~D~~v~s~l~~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l-~a~~~Gfs~~ 134 (507) .-. .. -....+..-++.+...-+.+-+..+... +..+.+++.+++.++++. ..|...+.++. ++.-||.+. T Consensus 81 ~~~~~~~ri~~n~~k~Ivd~~~~yl~g~p~~~~~~--~~~~~~~~~~~l~~~~~~---n~~~~~~~~~~~~~~~~G~a~- 154 (501) T protein:vir:96 81 DNEMADKRAVHNYGRMISKFKTGYLAGNPIRVEYD--DNDDNSQNDDAIKRIGRI---NDLDSLNRTLIRDLSQTGRAY- 154 (501) T ss_pred ccccccceeecchHHHHHHHHhhhhcccCeeEeeC--CccchhHHHHHHHHHHHh---cCHHHHHHHHHHHHhhcCeEE- Confidence 000 00 0012333344444444444444444433 234456677778877764 24666666654 577799765 Q ss_pred eeeeeccCccccCcceeeeeccccCchhcccccceeecCC-Ccccccccccc--------cccccccccccc-----ccc Q lcl|NC_017974. 135 EKVYTQVTVGEYIGKYKVKNLAFRPQASLSRTNPIVYNED-GSAIIGIKQSL--------SAFQNYTASEIG-----VGG 200 (507) Q Consensus 135 Ei~w~~~~~~~~~g~~~~~~l~~r~~~~i~~~~~~~~~~d-~~~l~~~~q~~--------~~~~~~~~~~~~-----~~~ 200 (507) +.+|... +|.+.+.-+.|+. ++. ++.+.. +..+..++-.. .....++....- ... T Consensus 155 ~~v~~de-----dg~~~i~~~~p~~--~~~----v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~ 223 (501) T protein:vir:96 155 EVIYRSE-----YDETRIKRLSPLE--TFV----IYDNSLEDNSIAAVRYYNRGTLQSAKDVVEIYTDEHIYTLDASDDF 223 (501) T ss_pred EEEEEcC-----CCceEEEEEccce--eEE----EEcCCCCCceEEEEEEEEeecCCCcEEEEEEEcCCcEEEEeeCCCc Confidence 5777532 4556555443331 111 111111 11111111100 000011111000 000 Q ss_pred cccCCCccccccccEEEEeeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchH Q lcl|NC_017974. 201 VSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSS 280 (507) Q Consensus 201 ~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~ 280 (507) .........+..--++.| ..|+.|.|.+..+-...=.-+..+..++..++.+..|+++++- .. ..+.+ T Consensus 224 ~~~~~~~~~~g~vPvv~~-----~nn~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~G---~~----~~~~~ 291 (501) T protein:vir:96 224 NEISVTTHAFGTVPITEY-----LNNIDGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYG---DL----ALPKG 291 (501) T ss_pred eeccccccCCCccceEEe-----cCCccCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeec---cc----ccCcc Confidence 000011111111112333 2467899999876444333455777888888999999888751 11 11111 Q ss_pred HHHHHHHHHHHHHHHhhcCCceeEeecCc--ccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhccccccccc Q lcl|NC_017974. 281 PEAEMVRGLMSDAANAHSGEQSFFMLPSD--TKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGND 358 (507) Q Consensus 281 ~~~~~l~a~~~~~~n~~~G~~a~~iiP~g--~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~ 358 (507) +....+... ..+.++.. ....-...+++++..+.. ...+..+++.+.+.|...--...++.+. T Consensus 292 ~~~~~~~~~------------~~~~~~~~~~~~~~~~~~~~~~l~~~~~--~~~~~~~~~~l~~~I~~~s~~p~~~~~~- 356 (501) T protein:vir:96 292 MQASDMKRT------------RLMQLKPPKSADGKEGTVKAEYLTKSYD--VSGAEAYKTRLNRDIHIFTNTPDMSDTN- 356 (501) T ss_pred cchhhhhhc------------CeeeecccccccccccCcceeeEeccCC--HHHHHHHHHHHHHHHHHHhCCcccCccc- Confidence 111111100 01111110 000001112333333222 2245778888888887765444444322 Q ss_pred chhhhhHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-C--CC-cCCccccceEEecCcCchhHHHHHHHHHH Q lcl|NC_017974. 359 KGGSYNLSES---KQTIHTQFVQRVNEIILEALNENLLPQLLAL-N--DM-RLPETEMPYVKAGEIVDVDMEGFSKAIQR 431 (507) Q Consensus 359 ~~GS~Al~~v---h~~v~~~~~~aD~~~i~~~ln~~li~~l~~l-N--~~-~~~~~~~P~~~~~~~~~~dl~~~a~~~~~ 431 (507) .|+...|.. ...-....+..-.+.+...|. ++++.++.+ + .. ...+..-..+.|....+.|..+.++++.+ T Consensus 357 -~~~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~-~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~n~~e~ad~~~k 434 (501) T protein:vir:96 357 -FSGNTSGEALKYKLFGLDQDRVDTQSQFTKGLK-RRYRLAARIGSLVNEFKDFDESLLKITFTPNLPKSLNEQVSILTG 434 (501) T ss_pred -ccccchHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhcccccccccccceEEeCCCCCcCHHHHHHHHHH Confidence 111211211 111222223333455555553 355555443 1 11 01112235688999999999999999999 Q ss_pred HHhcCCccccHHHHHHHHHHcCC-CCCCCccc--ccchhhhcCCCcCCCCCCcccccCcccccccccCCCCchhhhccC Q lcl|NC_017974. 432 IGAVGYLPKTPKVINRVLEVLGI-DEKIEEDI--SQEELMKLLGEDTSRAGDGMTKGSSGNGTGKISSTRDNSAANLDN 507 (507) Q Consensus 432 L~~~G~~~~~~~~~~~i~e~~Gi-p~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~a~~~~~ 507 (507) |+ |. ++. +.+.+.++. +.+.. +. ...+......+........ ..++.+ ....+..+.+..| T Consensus 435 l~--g~-iS~----et~~~~l~~v~D~~~-E~~ri~~E~~~~~~~~~~~~~~~----~~~~~~---~~~~e~~~d~~e~ 498 (501) T protein:vir:96 435 LG--GQ-VSQ----ETALSLSGLVESPNE-ELDKINKEMSEIDFKGYSNDFNE----HVGKYT---DEVKETHTDDFER 498 (501) T ss_pred Hh--cc-Cch----HHHHHhCCCCCCHHH-HHHHHHHHHHHhhccccccchhh----cccccC---CcCCCCCCCcccc Confidence 85 64 442 345555543 32211 11 1111111000000000000 000000 0112222222333 No 184 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=91.48 E-value=0.016 Score=30.48 Aligned_cols=418 Identities=11% Similarity=0.027 Sum_probs=156.5 Q ss_pred Cc---ccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHH---HHh---------- Q lcl|NC_017974. 1 MS---RKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVE---SMK---------- 64 (507) Q Consensus 1 ~~---~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~---~m~---------- 64 (507) |. +-.-+..+..+-+. + ..........-..-...+. ...+ |..++.+.|+ +++ T Consensus 1 ~~~~~~~~~~~~~~e~~~~--~-----~~~~~~~~~~i~~~i~~~~--~~~~--~~~~~~~yY~g~~~i~~~~~~~~~~~ 69 (478) T protein:vir:10 1 MISINWPWDKPYHEQVVEQ--I-----KPKYETQEEMILRLVREHK--ENID--NITMGERYYNHHPDILDAPPKRDVNG 69 (478) T ss_pred CccccCCCCchhHHHHHHH--H-----hhccCCcHHHHHHHHHHHH--HHHH--HHHHHHHHhcCCCchhcccccccccc Confidence 11 11111111000000 0 0000000000000001110 0000 1111122221 000 Q ss_pred -----------hChHHHHHHHHHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHH-HHHhhcce Q lcl|NC_017974. 65 -----------QDATVATGLDMLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAA-TFNEYGLS 132 (507) Q Consensus 65 -----------~D~~v~s~l~~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l-~a~~~Gfs 132 (507) -.....-++.+...-+.+-+..+. ..+.+..+.+.+++++ .|.+.+.++. ++.-||.+ T Consensus 70 ~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~------~~~d~~~~~l~~~~~n----~~~~~~~~~~~~~~~~G~~ 139 (478) T protein:vir:10 70 DYDETKPDWRMYTNYHQNLVDQKVAYAVANPVTFG------VDNDKALKQIQHTLNH----KWDDKLVDILTAASNKGIE 139 (478) T ss_pred ccccccccceeccchHHHHHHHHHhhhccCCeeee------cCChHHHHHHHHHHhc----CHHHHHHHHHHHHHhcCeE Confidence 011222222222222222222222 2234455667766642 4666666554 67789987 Q ss_pred eeeeeeeccCccccCcceeeeeccccCchhcccccceeecCCCcccc-cccccc----cccccccccccccccc------ Q lcl|NC_017974. 133 VVEKVYTQVTVGEYIGKYKVKNLAFRPQASLSRTNPIVYNEDGSAII-GIKQSL----SAFQNYTASEIGVGGV------ 201 (507) Q Consensus 133 ~~Ei~w~~~~~~~~~g~~~~~~l~~r~~~~i~~~~~~~~~~d~~~l~-~~~q~~----~~~~~~~~~~~~~~~~------ 201 (507) . +++|... +|.+.+.-+.|+ ..+ +++-+...+.+. .++... .....++.....-... T Consensus 140 ~-~~~~~d~-----~g~~~~~~~~p~--~~~----~i~d~~~~~~~~~~v~~~~~~~~~~~~~y~~~~i~~~~~~~~~~~ 207 (478) T protein:vir:10 140 W-VQPYVDE-----EGEFKTFRVPAE--QAV----PIWTNKERDELQAFIRVYELDGAERVEYWTKDDVTYYELKEGQLI 207 (478) T ss_pred E-EEEEecC-----CCeeEEEEEccc--ceE----EEEcCCCCCceEEEEEEEEecCceEEEEEeCCeEEEEEEcCCeee Confidence 6 4777532 455554433322 111 111111111111 110000 0000000000000000 Q ss_pred --------------ccCCCccccccccEEEEeeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecc Q lcl|NC_017974. 202 --------------STRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIP 267 (507) Q Consensus 202 --------------~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P 267 (507) ........++.-.++.|+ .+++|.|.+..+....=--+..+..++..++.+..|+++++- T Consensus 208 ~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~-----n~~~g~sd~~~v~~liDa~~~~~S~~~~~~~~~~~p~~~~~g- 281 (478) T protein:vir:10 208 PDFYRSDDHIQPHYYQGNKLMSWGRVPFIPFK-----NNPQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILKG- 281 (478) T ss_pred ccccccccccccceecccccccCCccceEEec-----cCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCceeeeec- Confidence 000000111111233332 478899998875444434455778888888999999887651 Q ss_pred cccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeec--CcccccccccceeeeeccCCCcchhHHHHHHHHHHHHH Q lcl|NC_017974. 268 SQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLP--SDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSIL 345 (507) Q Consensus 268 ~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP--~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Is 345 (507) . . .++..+ ....+. .+ ..+.++ +|.+ ++++.... ....+...++.+.+.|. T Consensus 282 --~---~-~~~~~~---~~~~~~-------~~--~~~~~~~~~~~~-------~~~l~~~~--~~~~~~~~~~~l~~~i~ 334 (478) T protein:vir:10 282 --Y---E-GEDMKD---FMHNLK-------YY--KAISVAGESGSG-------VDTIKVEV--PIDSVKEYTKMLRDYII 334 (478) T ss_pred --C---C-ccccch---hhhhhh-------hc--ceEEecCCCCCc-------ceEEeecC--ChHHHHHHHHHHHHHHH Confidence 1 1 111111 111111 11 223343 2332 23333222 22346778899998888 Q ss_pred HHHhcccccccccchhhhhHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCccccceEEecCcCchhH Q lcl|NC_017974. 346 DRLGAGFINVGNDKGGSYNLSES---KQTIHTQFVQRVNEIILEALNENLLPQLLALNDMRLPETEMPYVKAGEIVDVDM 422 (507) Q Consensus 346 k~iLGqtLts~~~~~GS~Al~~v---h~~v~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl 422 (507) +.-.+..++.+. .|+...|.. ...-....+..-.+.+...|. ++++.++.+.+... +..-+.+.|....+.|. T Consensus 335 ~~s~~p~~~~~~--~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~-~~~~li~~~~g~~~-~~~~i~i~f~~~~p~d~ 410 (478) T protein:vir:10 335 EFGQGVDFQQDK--FGNSPSGIALKFMYSNLDLKANKLKNKTLTALQ-ELLQYIIDFYRLDV-KVQDIEITFNFNVMVNE 410 (478) T ss_pred HHhCccccCccc--cccccHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhCCCc-ccccceEEecCCCCCCH Confidence 876555444322 122222221 111122223334455566664 46677777654222 22335788999999999 Q ss_pred HHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccc--cchhhhcCCCcCCCCCCcccccCcccccccccCCCCc Q lcl|NC_017974. 423 EGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDIS--QEELMKLLGEDTSRAGDGMTKGSSGNGTGKISSTRDN 500 (507) Q Consensus 423 ~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 500 (507) .+.|+++.+| .|+ ++ .+.+.+.++.-...+.+.. ..+.. ...+..+....+........+. ...+- T Consensus 411 ~e~a~~~~kl--~g~-iS----~et~~~~l~~v~D~~~E~~ri~~E~~-~~~~~~~~~~~~~~~~~~~~~~----~~~~~ 478 (478) T protein:vir:10 411 LENSQIAMNS--TGL-LS----KETILSNHAWVEDPVAEMERIEQENI-ELNQQLPDIEEGLNGEQQRQSE----NNQPE 478 (478) T ss_pred HHHHHHHHHH--hCC-CC----hHHHHHhCCCCCCHHHHHHHHHHHHH-HHHhhccccccccCCCCCCCCC----CCCCC Confidence 9999999987 465 44 2456677765221111111 00000 0001011111111100000000 00000 No 185 >protein:vir:97265 Length: 513 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:584 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294521;genbank:gi:149408242;genbank:GeneID:5237130 Probab=91.04 E-value=0.018 Score=30.18 Aligned_cols=446 Identities=13% Similarity=0.018 Sum_probs=163.5 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccc----------hhcc-chhHHHHHHHh----h Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAE----------EIRW-PNLIGTVESMK----Q 65 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~----------~lr~-~~~~~~y~~m~----~ 65 (507) |+-+..+. +.++. .++- .....|.....+-.+. .-+. .+.-+-|+.=+ . T Consensus 1 m~~~~~~~------------v~~~h---~~y~-a~~~~W~~ird~~~G~~~~r~~g~~YLPk~~~E~~~~Y~~rl~rA~~ 64 (513) T protein:vir:97 1 MADKDPKS------------PATTS---GAYD-QMLPRWHVIETLLGGTEAMREAGETYLPRHQEETDKGYQERLASAVL 64 (513) T ss_pred CCCCCCCC------------CCcCC---HHHH-HHHHHHHHHHHHhcChHHHHhhcccCCCCCCCCCHHHHHHHHhcccC Confidence 44432111 11111 1110 0011111111000000 0011 12233354422 2 Q ss_pred ChHHHHHHHHHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHHH-HHhhcceeeeeeeeccCcc Q lcl|NC_017974. 66 DATVATGLDMLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAAT-FNEYGLSVVEKVYTQVTVG 144 (507) Q Consensus 66 D~~v~s~l~~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l~-a~~~Gfs~~Ei~w~~~~~~ 144 (507) -+++...++.-...|.+.+..+ .. +. ..+..+.+.+.. +..+.++..++++++. ++.||.+.+=+.|-..++. T Consensus 65 ~n~~~~tl~~l~G~vf~k~p~~--~~-~~--p~~~~~~l~~d~-D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~ 138 (513) T protein:vir:97 65 LNMVEQTLDTLSGKPFSEPIKL--NE-DV--PKAIEETILPDV-DLQGNNLDVFARQWFREGMAKALCHVLIDMPRPAPR 138 (513) T ss_pred CChHHHHHHHHhhhhhhcCccc--Cc-Cc--hHHHHHHHhhcc-CCCCCCHHHHHHHHHHHHHhcCeEEEEEecCCCCCc Confidence 4566666665555554432211 11 11 122332221111 2445688999999876 8889988764444221110 Q ss_pred ccCcce---------e-eeeccccCchhcccccc----------------eeecCCCcccccccccc----ccccccccc Q lcl|NC_017974. 145 EYIGKY---------K-VKNLAFRPQASLSRTNP----------------IVYNEDGSAIIGIKQSL----SAFQNYTAS 194 (507) Q Consensus 145 ~~~g~~---------~-~~~l~~r~~~~i~~~~~----------------~~~~~d~~~l~~~~q~~----~~~~~~~~~ 194 (507) .++.. . .--+....+..|..|+. ....+||.+-....|.. ..+..+... T Consensus 139 -~~~~~~T~Ade~~~~~rPy~~~~~~e~IinW~~~~v~G~~~L~~v~l~E~~~~~Dgf~~~~~~q~rvL~~g~~~v~r~~ 217 (513) T protein:vir:97 139 -EDGQPRTLADDRREGLRPYWVMIKPECLLFARSEVINGVEVLQHVRIIEHYMEQDGFAEVCKRRIRVLEPGLVQLWEPV 217 (513) T ss_pred -cchhHHhHHHHHhhccCceEEEecHhhhcCcceeccCcceeeeeEEEEEEEeecCCCcceEEEEEEEEeCceEEEEEee Confidence 00000 0 00011122233333321 11133443322222111 111111110 Q ss_pred ccc---c-cccccCCCccccccccEEEEeeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEeccccc Q lcl|NC_017974. 195 EIG---V-GGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQI 270 (507) Q Consensus 195 ~~~---~-~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~ 270 (507) ... . ........+..|+.=-|+++... ..+-..+...|..++..-+---....+.-..+..-+.|++++.- + T Consensus 218 ~~~~~~~~e~~~~~~g~~~l~~IP~v~~~~~-~~~~~~~~pPLl~LA~ln~~hy~~~Sd~~~il~~~~~P~l~~~G---~ 293 (513) T protein:vir:97 218 KKSNAQKEEWALADEWATGLNYVPLVTFYAD-RQGFMMGKPPLLDLAHLNVAHWQSASDQRHILTVSRFPILACSG---A 293 (513) T ss_pred cCCCccccceEEecCCCCcCCceeEEEEecC-CCCCCCCccchHHHHHHHHHHHhhhhhHHHHHHhcccceeeeec---C Confidence 000 0 00000111222222223443322 33333455555555543322212333333344444677777651 1 Q ss_pred CCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecC-cccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHh Q lcl|NC_017974. 271 LNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPS-DTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLG 349 (507) Q Consensus 271 ~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~-g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iL 349 (507) .++..+ .+..|+.+++.+|. |.+.. +++.+|++ ....+..++...++|..+ . T Consensus 294 -----~~~~~~-------------~i~iG~~~~~~lpe~~~~~~-------yie~~g~~-i~~~~~~l~~le~qm~~~-G 346 (513) T protein:vir:97 294 -----SGEDSD-------------PVVVGPNKVLYNPDPAGRFY-------YVEHTGQA-IAAGRTDLKDLEEQMAGY-G 346 (513) T ss_pred -----CcCCCC-------------ceEeeccccccCCCCCCcce-------eeccCchh-HHHHHHHHHHHHHHHHHH-H Confidence 111100 12347778888995 66544 44454442 223455666666776332 2 Q ss_pred ccccccc-ccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCccccceEEecC-cCchhH-HHHH Q lcl|NC_017974. 350 AGFINVG-NDKGGSYNLSESKQTIHTQFVQRVNEIILEALNENLLPQLLALNDMRLPETEMPYVKAGE-IVDVDM-EGFS 426 (507) Q Consensus 350 GqtLts~-~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~~P~~~~~~-~~~~dl-~~~a 426 (507) ...|... ..++++.+ .........++.+-+..+++.++ ++++++..+-+. +..-++|.+.. ....++ ...+ T Consensus 347 a~ll~~~~~~~Ta~a~--~~~~~~~~S~L~~~a~~le~al~-~~l~~~a~wlg~---~~~~~~v~in~dF~~~~~~~~~~ 420 (513) T protein:vir:97 347 AEFLKRKTGGQTATAR--ALDSAEATSDLSAMTGLFEDALA-QALDITADWLRL---GPNGGTVELVKDYDLEEMDAPGL 420 (513) T ss_pred HHhhccCCccccHHHH--HHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhCC---CCCccEEEeccccCcccCCHHHH Confidence 3333321 12344433 33444455667778888999996 588998887531 21123343311 112222 2345 Q ss_pred HHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCccc----ccchhhhcC---CCcCCCCCC--cccccCcccccccccCC Q lcl|NC_017974. 427 KAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDI----SQEELMKLL---GEDTSRAGD--GMTKGSSGNGTGKISST 497 (507) Q Consensus 427 ~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~----~~~~~~~~~---~~~~~~~~~--~~~~~~~~~~~~~~~~~ 497 (507) +++.++...|.+.. ....++++++-=|++..+.+. ......+.. ..+..+..+ +....+++++..+..+. T Consensus 421 ~al~~a~~~G~is~-~t~~~~L~r~gvl~~d~d~~~~~e~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~ 499 (513) T protein:vir:97 421 QALQVAREKRDISR-KTYLNGLRLRGVLPEDFDEDEDWEELMEEISEAMGRAGLDLDPAQKNPPEGGEGEGEGEGEGGEG 499 (513) T ss_pred HHHHHHHhCCCCCH-HHHHHHHHhccCCCccCCHHHHHHHHHHhhhhccCCCCccccccCCCCCCCCCCCCCCCCCCCCC Confidence 67777788887544 223344544333332222111 111111110 001111110 00111112221111000 Q ss_pred --CCchhhhccC Q lcl|NC_017974. 498 --RDNSAANLDN 507 (507) Q Consensus 498 --~d~~a~~~~~ 507 (507) +-+.-+|--- T Consensus 500 ~~~~~~~~~~~~ 511 (513) T protein:vir:97 500 GEGGEGGGNPGG 511 (513) T ss_pred CCccccCCCCCC Confidence 0000000000 No 186 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=90.45 E-value=0.021 Score=29.81 Aligned_cols=431 Identities=8% Similarity=0.007 Sum_probs=153.4 Q ss_pred CccccccccccccccccccCchhhh-HHHHhhhhcccchhccchhHHHHHH----------------Hh-hChHHHHHHH Q lcl|NC_017974. 13 VAKATSEQINVSRMRMSEQGTFALA-KVQVDSERMKAEEIRWPNLIGTVES----------------MK-QDATVATGLD 74 (507) Q Consensus 13 ~~~~~~~~~~~~~~~~~~~g~~~~~-~~~~~~~~e~~~~lr~~~~~~~y~~----------------m~-~D~~v~s~l~ 74 (507) |+=..+.....- ..+.....+. -..-+. ...+ |..++.+.|+- -+ ...+..-++. T Consensus 1 ~~~~~~~~~~~~---~~~~~~~~i~~~i~~~~--~~~~--~~~~l~~Yy~g~~~i~~~~~~~~~~~~~ki~~n~~~~Iv~ 73 (499) T protein:vir:10 1 MAVVIDKDLLDD---VNEPNIEAINYAIRELQ--NRKK--RLDKLSDYYNGKQEIEKHEFDNATVEAANVMVNHAKYITD 73 (499) T ss_pred CccchhhhHHhh---hhcCCHHHHHHHHHHHH--HHHH--HHHHHHHHhccccchhcCCcCcCCCCcceeecchHHHHHH Confidence 211111100000 0000000000 000000 0000 00111111110 00 1334444555 Q ss_pred HHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHH-HHHHhhcceeeeeeeeccCcccc------- Q lcl|NC_017974. 75 MLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDA-ATFNEYGLSVVEKVYTQVTVGEY------- 146 (507) Q Consensus 75 ~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~-l~a~~~Gfs~~Ei~w~~~~~~~~------- 146 (507) +...-+.+-+..+..+ +.+..+.+.+++... .+...+.++ .++.-||.+ ++++|....+... T Consensus 74 ~~~~~l~g~p~~~~~~------~~~~~~~l~~~~~~n---~~~~~~~~~~~~~~~~G~~-~~~v~~~~~g~~~~~~~~~~ 143 (499) T protein:vir:10 74 MNVGFMTGNPVKYVAE------KGKNIDDILEVFNQI---DIHKHDIELEKDLSVFGYG-YELLYLKKTDPISVRDELGN 143 (499) T ss_pred HHhhhhcccCceeecC------ChhHHHHHHHHHhhc---CHhHHHHHHHHHHHhcCce-EEEEEecccccccccccccc Confidence 5555554444444332 223344455555432 355554444 468889975 4577765432110 Q ss_pred -----CcceeeeeccccCchhccccc-----------ceee--cCCCcc-ccccc-ccccccccccccccc---cccccc Q lcl|NC_017974. 147 -----IGKYKVKNLAFRPQASLSRTN-----------PIVY--NEDGSA-IIGIK-QSLSAFQNYTASEIG---VGGVST 203 (507) Q Consensus 147 -----~g~~~~~~l~~r~~~~i~~~~-----------~~~~--~~d~~~-l~~~~-q~~~~~~~~~~~~~~---~~~~~~ 203 (507) ...+.+..+.|+. ++.-+. .++. +.++.. ...++ -.+.....+...... ...... T Consensus 144 ~~~~~~~~~~~~~v~p~~--~~~v~~d~~~~~~~~~i~~~~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~ 221 (499) T protein:vir:10 144 EKLTPNTELKIEVIDPRA--TVVVCDDTVEHDPLFAVFTQEKKDLEGNTNGYSITVYMPQRIVEYRTKTTMEVSANDPIV 221 (499) T ss_pred cccccccceEEEEEcccc--eEEEecCCCCcceEEEEEEEEEeecCCCceEEEEEEEeCCeEEEEEecCCccccCcceec Confidence 0122233332221 110000 0000 011100 00000 000000111000000 000000 Q ss_pred CCCccccccccEEEEeeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHH Q lcl|NC_017974. 204 RMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEA 283 (507) Q Consensus 204 ~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~ 283 (507) ......+..--++.|+ +++.|.|.+..+-...=-=+..+..++..++.+..|+++++ |.. ...+.+.. T Consensus 222 ~~~~~~~g~vPvv~~~-----n~~~~~~d~e~v~~liD~~~~~~S~~~~~~~~~~~~~lv~~-----G~~--~~~~~~~~ 289 (499) T protein:vir:10 222 YDGENLFGAVPIIEFR-----NNEERQGDFEQLISLIDAYNLLQTDRISDKEAFVDALLVTF-----GFG--LGDDKDDI 289 (499) T ss_pred ccccCCCCccceEEec-----CCCCCCCchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeee-----cCc--cccccchh Confidence 0001111111234443 35678888877554433335567888888999999988876 111 11111111 Q ss_pred HHHHHHHHHHHHhhcCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhh Q lcl|NC_017974. 284 EMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSY 363 (507) Q Consensus 284 ~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~ 363 (507) ..+. .| ....++.+... +++++.... ....+..+++.+.+.|...--...++.+. .++. T Consensus 290 ~~~~----------~~--~~~~~~~~~~~-----d~~~l~~~~--~~~~~~~~~~~l~~~I~~~s~~p~~~~~~--~~gn 348 (499) T protein:vir:10 290 QRLK----------RG--AIEAPPREEGA-----DIEWLTKSF--DETQVNLLSQSIENDIHKISYVPNMNDEK--FMGN 348 (499) T ss_pred hhhh----------hc--ceeccCCCCCC-----cceEEeccC--CHHHHHHHHHHHHHHHHHHhCcccCCchh--hccc Confidence 1111 11 12222222111 223333222 22346788999999987754333333322 1111 Q ss_pred hHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCC--ccccceEEecCcCchhHHHHHHHHHHHHhcCCc Q lcl|NC_017974. 364 NLSE---SKQTIHTQFVQRVNEIILEALNENLLPQLLALNDMRLP--ETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYL 438 (507) Q Consensus 364 Al~~---vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~--~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~ 438 (507) ..|. ....-....+..-.+.+...++ ++++.++.+-...+. +..-..+.|....+.|..+.++.+.+| .|. T Consensus 349 ~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~-~~~~li~~~~~~~~~~~d~~~i~i~f~~~~p~n~~e~~~~~~kl--~g~- 424 (499) T protein:vir:10 349 VSGEAMKFKLFGLENLLSIKQRYFFDGLR-RRLKLIQTIVNIKGANDDASGCKISLVANIPSNLSDVVNNVKNA--DGI- 424 (499) T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhccCCccccccceEEeCCCCCCCHHHHHHHHHHH--hcc- Confidence 1111 1112222333444455666664 466666664211111 112347889999999999999999998 465 Q ss_pred cccHHHHHHHHHHcCC-CCCCCc-ccccchhh---h---cCCCcCCCCCCcccccCcccccccccCCCCchhhhccC Q lcl|NC_017974. 439 PKTPKVINRVLEVLGI-DEKIEE-DISQEELM---K---LLGEDTSRAGDGMTKGSSGNGTGKISSTRDNSAANLDN 507 (507) Q Consensus 439 ~~~~~~~~~i~e~~Gi-p~~~~~-~~~~~~~~---~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~a~~~~~ 507 (507) ++. +.+.+.++. +.+..+ +...++.. + .......+...+ ..+.....++......+.|..- T Consensus 425 iS~----et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~ 493 (499) T protein:vir:10 425 IPR----KYTYSWLPDVDNPQDVIDEMNQQDAETIKKNQEALRGQDPDRLE----LEDKQDDSSENDKEAGSNHNQS 493 (499) T ss_pred CCh----HHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCC----CCCCCcccCCCCCCCccccccC Confidence 342 344455533 221111 00000000 0 000000000000 0000000011111111111111 No 187 >protein:vir:106282 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944108;genbank:gi:38640152;genbank:GeneID:2658030 Probab=90.26 E-value=0.022 Score=29.70 Aligned_cols=445 Identities=12% Similarity=0.079 Sum_probs=168.2 Q ss_pred Ccccccc------cccccCccccccccccc-ccccc-ccCchhhhHHHHhhhhcccchhc-cchhHHHHHHHhhChHHHH Q lcl|NC_017974. 1 MSRKRNR------NRSVKVAKATSEQINVS-RMRMS-EQGTFALAKVQVDSERMKAEEIR-WPNLIGTVESMKQDATVAT 71 (507) Q Consensus 1 ~~~k~~~------~~~~~~~~~~~~~~~~~-~~~~~-~~g~~~~~~~~~~~~~e~~~~lr-~~~~~~~y~~m~~D~~v~s 71 (507) +++.-++ .++..++.|.....++. ...+. .....|. ..-+...+. .++ -.++++.|++|..++.|-+ T Consensus 13 ~k~~e~~~~~~~~~~~~s~~~p~~~dGa~~I~~~~~~~~~~~~~--~~~~~~~~~--~~~n~~eLI~~YR~ma~~pEvd~ 88 (521) T protein:vir:10 13 MKDDEKRVQSDLSDRIDSFAVPDTADGAIEVDKQIDTTAPKTAI--VQSVLGYAP--KIQNTKDLINQYRSLSKYHEVDN 88 (521) T ss_pred hhhhhhHHhhhhccCccccccccCCCCceeeccCCCccccccch--hhhhhcccc--ccchHHHHHHHHHHHhhccchhh Confidence 2211111 12222222222211100 00000 0000010 011111111 112 3578999999999999999 Q ss_pred HHHHHHHHHhhCCc---eeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHHHHHhhcceee-----------eee Q lcl|NC_017974. 72 GLDMLYTFVEKAFK---DFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAATFNEYGLSVV-----------EKV 137 (507) Q Consensus 72 ~l~~Rk~av~~~~~---~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l~a~~~Gfs~~-----------Ei~ 137 (507) +++.....+...+- .+.++-..++.++.+.+-|.+- |.. |..+|+.--+||..+ .++ T Consensus 89 Av~eIvneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~ee--------F~~-Il~ll~F~~~~~~~fR~WYVDgRi~fHki 159 (521) T protein:vir:10 89 AIDEIINDAIVQEDNRDTVYLDLDKTDWNESVKEMVREE--------FRT-ILKLLKFEREGKRHFRRWYVDSRIYFHKM 159 (521) T ss_pred HHHhhhcceEEecCCCceEEEEecCcccchHHHHHHHHH--------HHH-HHHHhccchhhhHHHhhheeeeeEEEEEE Confidence 99988776532210 0111111222223333333222 222 224444444444443 332 Q ss_pred eeccCccccCcceeeeeccccCchhcccccceeecCCCccccccccccccccccccccccccc-cccCCCccccccccEE Q lcl|NC_017974. 138 YTQVTVGEYIGKYKVKNLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGG-VSTRMSDVIIPINRVM 216 (507) Q Consensus 138 w~~~~~~~~~g~~~~~~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~-~~~~~~~~~iP~~k~i 216 (507) -.... .-+|-.. |....|.+|.+.+......++ +...+... ..++.+.+....... ......++.||. .-| T Consensus 160 id~~~--pk~GI~E---lr~lDPr~i~~vr~i~k~~~~-~~~v~~~~-~e~f~Y~~~~~~~~~~~g~~~~~vkI~~-daI 231 (521) T protein:vir:10 160 IDPAR--PKDGIKE---LRLLDPRNVEYYRVNLKSNEN-GNDVYKGV-KEFFTYGATEDNRYNISGNSNNLVQIPI-DAI 231 (521) T ss_pred eeCCC--cccccee---eeeeCCcceeeeeeecCCCCC-cchhhccc-eeeeeeccCCCceecCCCCCCcceeech-hhe Confidence 22110 1123333 333444455555443332221 11111111 122222221111111 112344567887 678 Q ss_pred EEeeCCc--cCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHH Q lcl|NC_017974. 217 LMNTGGS--SSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAA 294 (507) Q Consensus 217 ~~~~~~~--~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~ 294 (507) +|+|..- .+.++..|.|.++..++==-+.....-+.+-=--.-.=.++++. +|..+ .....+.|+.++.-.. T Consensus 232 ~y~hSGL~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYID--vGnlp----k~KAeqYl~~iM~k~k 305 (521) T protein:vir:10 232 VYSHSGKVDIDGKTIVGYLHNVIKPANQLKMLEDAMVIYRITRAPERRVFYID--VGTMP----NKKATQHLNNVMQGLK 305 (521) T ss_pred eeecccceeCCCCceeccchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEe--cCCCC----chhHHHHHHHHHHhcC Confidence 8998543 45788899999999887544443333333210001111244442 22222 2233345555443322 Q ss_pred Hh-hcCCceeE--------------eecC-----cccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhccccc Q lcl|NC_017974. 295 NA-HSGEQSFF--------------MLPS-----DTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFIN 354 (507) Q Consensus 295 n~-~~G~~a~~--------------iiP~-----g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLt 354 (507) |- .-.+..|- -+|. |++|.+. .|+.+.. -..=|+|..+.+-+++--..-- T Consensus 306 NklVYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEI~TL---------pggqnlg-em~DV~YF~kkLy~aLnVP~sR 375 (521) T protein:vir:10 306 NRVVYDSSTGKVKNSSNNLAMTEDYWLMRRDGKATTEVSTL---------PGAQSMG-EMDDVRWFNRKLYESMKIPLSR 375 (521) T ss_pred ceEEEeccCceeccchhhhhhHhhhcccccCCCCccceeec---------cccCCcC-hHHHHHHHHHHHHHHhCCCccc Confidence 20 00111111 1222 5555442 2332232 2345899999999988776533 Q ss_pred ccccchhhhhH---HHHHH-HH-HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCccc----cceEEecCc------Cc Q lcl|NC_017974. 355 VGNDKGGSYNL---SESKQ-TI-HTQFVQRVNEIILEALNENLLPQLLALNDMRLPETE----MPYVKAGEI------VD 419 (507) Q Consensus 355 s~~~~~GS~Al---~~vh~-~v-~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~----~P~~~~~~~------~~ 419 (507) .+.. ++...+ +++.. |+ |...+......+...|..-|-..|+-=|.. ..+. .+.+.|+.. |- T Consensus 376 l~~e-~~~f~~Gr~~EItRDEikF~KFI~rLR~rFs~~f~~~L~~qLilKgii--t~eew~~i~~~I~~~f~~Dn~f~El 452 (521) T protein:vir:10 376 LPQE-GAGVTFGAGNDITRDELQFTKYIRGLQQQFEPIFLNPLRTNLMLKGKM--SVSEWEEQAENIKVVFSKDSYYEEI 452 (521) T ss_pred cCCC-CCceecccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCC--CHHHHHHHhhcceEEeeecchHHHH Confidence 3322 111111 22222 22 223344444444444443333333221211 1111 122333322 23 Q ss_pred hhHHHHHHHHHHHHhcCC--ccccHHHHHHHHHH-cCCCCCCCcccccchhhhcCCCcCCCCCCcccccCccccc Q lcl|NC_017974. 420 VDMEGFSKAIQRIGAVGY--LPKTPKVINRVLEV-LGIDEKIEEDISQEELMKLLGEDTSRAGDGMTKGSSGNGT 491 (507) Q Consensus 420 ~dl~~~a~~~~~L~~~G~--~~~~~~~~~~i~e~-~Gip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (507) .+.+.+.+++..|..+-- .+-.-...+|+++. |.+++.+-.++. ..-.....++--+ ++.+. .++. T Consensus 453 Ke~eil~~R~~~l~~~dp~~yvGky~s~dyi~k~ILr~tDeeik~~~-k~I~~E~~~~~~~--~p~~e---~~df 521 (521) T protein:vir:10 453 KDVEILERRVNLVQTLASAEVTGKYLSHEYVMKNILRMSDEDIKTER-EKIDGELKDSVYK--NPEDP---MEEF 521 (521) T ss_pred HHHHHHHHHHHHHHhhcCccccccccchHHHHHHHhcCCHhHHHHHH-HHHHHhhhCCCCC--CCcch---hhcC Confidence 444445555555544311 11112335777654 677532111110 0111111110000 00000 0001 No 188 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=90.26 E-value=0.022 Score=29.69 Aligned_cols=412 Identities=13% Similarity=0.055 Sum_probs=149.9 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTFV 80 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~av 80 (507) +.+.-++.+. +.+...... +.=.+++. +..+..+.... ...+.+.+. .++ -.....-++.....-+ T Consensus 32 i~~~i~~~~~-~~~~~~~~~----~YY~g~~~-i~~~~~~~~~~--~~~~~~~~~-~ki-----~~n~~k~Ivd~~~~~l 97 (474) T protein:vir:97 32 IVRLIDDHRK-QLDKITVGQ----RYYDKDND-IVKQMKKVDVH--GNIDYDKPD-WRI-----TTNFHQNLVDQKVSYV 97 (474) T ss_pred HHHHHHHHHH-HHHHHHHHH----HHhccccc-hhcccchhccc--cccccccCc-cee-----ecchHHHHHHHHHhhh Confidence 1111111000 000000000 00000000 00000000000 000000000 000 0111111222222233 Q ss_pred hhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHH-HHHHhhcceeeeeeeeccCccccCcceeeeeccccC Q lcl|NC_017974. 81 EKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDA-ATFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAFRP 159 (507) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~-l~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~r~ 159 (507) .+-+..+.. .++.+.+++++++++ .|.+.+.++ .++.-||.+ ++++|... +|.+.+.-+.|+ T Consensus 98 ~g~p~~~~~------~d~~~~~~l~~~~~n----~~~~~~~e~~~~~~~~G~~-~~~~~~d~-----~~~~~i~~~~p~- 160 (474) T protein:vir:97 98 ASKPVTYSC------EDENVLKVIHDVLDT----RWDNKLIDILTATSNKGID-WLQVYINE-----NGEMKLFRVPAE- 160 (474) T ss_pred hcCCceecc------CcHHHHHHHHHHHhc----cHHHHHHHHHHHHhhcCce-EEEEEecC-----CCeeEEEEEccc- Confidence 333333322 234556677776642 366655554 457779975 56777533 455555443332 Q ss_pred chhcccccceeecCC-Ccccccccccc----cccccccccccccccccc----------------CCCccccccccEEEE Q lcl|NC_017974. 160 QASLSRTNPIVYNED-GSAIIGIKQSL----SAFQNYTASEIGVGGVST----------------RMSDVIIPINRVMLM 218 (507) Q Consensus 160 ~~~i~~~~~~~~~~d-~~~l~~~~q~~----~~~~~~~~~~~~~~~~~~----------------~~~~~~iP~~k~i~~ 218 (507) ++. +++.+.+ +..+-.++... .....++.....-..... ......+..-.++.| T Consensus 161 --~~~---~v~d~~~~~~~~~~ir~~~~~~~~~~~~yt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~ 235 (474) T protein:vir:97 161 --QAI---PIWVDKEREELKSFIRYYKFNNEEKVEFWTDTTVTYYVLENGGLIPDYYYGANHVQSHFSNGNWGRVPFIAF 235 (474) T ss_pred --ceE---EEEcCCCCCceEEEEEEEEecCeEEEEEEeCCeEEEEEEcCCccccccccCcCcccccccccCCCccceEEe Confidence 111 1111111 11111111000 000001110000000000 000011111123333 Q ss_pred eeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhc Q lcl|NC_017974. 219 NTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHS 298 (507) Q Consensus 219 ~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~ 298 (507) + .|++|.|.+..+....=--+..+..++..++.+..|+++++- . ..++..+ ....+ . T Consensus 236 ~-----nn~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g---~----~~~~~~~---~~~~~-------~- 292 (474) T protein:vir:97 236 K-----NNPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKG---Y----EGEDLEE---FMRGL-------K- 292 (474) T ss_pred c-----CCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeec---C----Ccccchh---hhhhh-------h- Confidence 2 467899998875443333355778888888999999888751 1 1111111 11111 1 Q ss_pred CCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHH---HHHHHH Q lcl|NC_017974. 299 GEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESK---QTIHTQ 375 (507) Q Consensus 299 G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh---~~v~~~ 375 (507) ....+.++.|.++++ +.... ....+...++.+.+.|...--+..++.+.. |+...|..- ..-... T Consensus 293 -~~~~i~~~~~~~~~~-------l~~~~--~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~--~~n~Sg~Al~~~~~~l~~ 360 (474) T protein:vir:97 293 -YYKAINVDGDGGVET-------IQVEV--PVSSTKEYIDLMRVYIMEFGQGVDFQTDKF--GSAPSGIALKFLYGNLDL 360 (474) T ss_pred -ccceeeccCCCceeE-------EeecC--CHHHHHHHHHHHHHHHHHHhCccccCcccc--ccccHHHHHHHHHHHHHH Confidence 113355666655443 33222 223466788888888877654443443222 111112111 111222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcC-C Q lcl|NC_017974. 376 FVQRVNEIILEALNENLLPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLG-I 454 (507) Q Consensus 376 ~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~G-i 454 (507) .+..-.+.+...|. ++++.++.+.+... +..--.+.|....+.+..+.++.+.+ +|.+ + .+.+.+.++ + T Consensus 361 k~~~k~~~~~~~l~-~~~~li~~~~~~~~-d~~~i~v~f~~~~p~~~~e~a~~~~~---~g~i-S----~et~l~~l~~v 430 (474) T protein:vir:97 361 KANKLKNKATVAIQ-ELISFIIDFNNLKT-DVKDIEISFNFNRMMNDAEQSQIIAQ---SQYL-S----RETLVKSSPLV 430 (474) T ss_pred HHHHHHHHHHHHHH-HHHHHHHHHhCCCc-ccceeeEEeccCcccCHHHHHHHHHH---cCCC-C----HHHHHHhCCCC Confidence 22333345555664 46677777653221 11223567877778887777766544 5753 3 244556664 4 Q ss_pred CCCCCccc-ccc---hhhhcCCCcCCC-CCCcccccCccccccc Q lcl|NC_017974. 455 DEKIEEDI-SQE---ELMKLLGEDTSR-AGDGMTKGSSGNGTGK 493 (507) Q Consensus 455 p~~~~~~~-~~~---~~~~~~~~~~~~-~~~~~~~~~~~~~~~~ 493 (507) +.+..+-. ... ..+...+..... ...+.....+++..++ T Consensus 431 ~D~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 474 (474) T protein:vir:97 431 DDYKAELERIEQEQMEYNKQLPNLDDGGADGAQQQEGSNNKESE 474 (474) T ss_pred CCHHHHHHHHHHHHHHHHhhccccCCCCCCCcccCCCCcccccC Confidence 33221100 001 111111111000 0000001111111111 No 189 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=90.26 E-value=0.022 Score=29.69 Aligned_cols=412 Identities=13% Similarity=0.055 Sum_probs=149.9 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTFV 80 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~av 80 (507) +.+.-++.+. +.+...... +.=.+++. +..+..+.... ...+.+.+. .++ -.....-++.....-+ T Consensus 32 i~~~i~~~~~-~~~~~~~~~----~YY~g~~~-i~~~~~~~~~~--~~~~~~~~~-~ki-----~~n~~k~Ivd~~~~~l 97 (474) T protein:vir:94 32 IVRLIDDHRK-QLDKITVGQ----RYYDKDND-IVKQMKKVDVH--GNIDYDKPD-WRI-----TTNFHQNLVDQKVSYV 97 (474) T ss_pred HHHHHHHHHH-HHHHHHHHH----HHhccccc-hhcccchhccc--cccccccCc-cee-----ecchHHHHHHHHHhhh Confidence 1111111000 000000000 00000000 00000000000 000000000 000 0111111222222233 Q ss_pred hhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHH-HHHHhhcceeeeeeeeccCccccCcceeeeeccccC Q lcl|NC_017974. 81 EKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDA-ATFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAFRP 159 (507) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~-l~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~r~ 159 (507) .+-+..+.. .++.+.+++++++++ .|.+.+.++ .++.-||.+ ++++|... +|.+.+.-+.|+ T Consensus 98 ~g~p~~~~~------~d~~~~~~l~~~~~n----~~~~~~~e~~~~~~~~G~~-~~~~~~d~-----~~~~~i~~~~p~- 160 (474) T protein:vir:94 98 ASKPVTYSC------EDENVLKVIHDVLDT----RWDNKLIDILTATSNKGID-WLQVYINE-----NGEMKLFRVPAE- 160 (474) T ss_pred hcCCceecc------CcHHHHHHHHHHHhc----cHHHHHHHHHHHHhhcCce-EEEEEecC-----CCeeEEEEEccc- Confidence 333333322 234556677776642 366655554 457779975 56777533 455555443332 Q ss_pred chhcccccceeecCC-Ccccccccccc----cccccccccccccccccc----------------CCCccccccccEEEE Q lcl|NC_017974. 160 QASLSRTNPIVYNED-GSAIIGIKQSL----SAFQNYTASEIGVGGVST----------------RMSDVIIPINRVMLM 218 (507) Q Consensus 160 ~~~i~~~~~~~~~~d-~~~l~~~~q~~----~~~~~~~~~~~~~~~~~~----------------~~~~~~iP~~k~i~~ 218 (507) ++. +++.+.+ +..+-.++... .....++.....-..... ......+..-.++.| T Consensus 161 --~~~---~v~d~~~~~~~~~~ir~~~~~~~~~~~~yt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~ 235 (474) T protein:vir:94 161 --QAI---PIWVDKEREELKSFIRYYKFNNEEKVEFWTDTTVTYYVLENGGLIPDYYYGANHVQSHFSNGNWGRVPFIAF 235 (474) T ss_pred --ceE---EEEcCCCCCceEEEEEEEEecCeEEEEEEeCCeEEEEEEcCCccccccccCcCcccccccccCCCccceEEe Confidence 111 1111111 11111111000 000001110000000000 000011111123333 Q ss_pred eeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhc Q lcl|NC_017974. 219 NTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHS 298 (507) Q Consensus 219 ~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~ 298 (507) + .|++|.|.+..+....=--+..+..++..++.+..|+++++- . ..++..+ ....+ . T Consensus 236 ~-----nn~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g---~----~~~~~~~---~~~~~-------~- 292 (474) T protein:vir:94 236 K-----NNPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKG---Y----EGEDLEE---FMRGL-------K- 292 (474) T ss_pred c-----CCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeec---C----Ccccchh---hhhhh-------h- Confidence 2 467899998875443333355778888888999999888751 1 1111111 11111 1 Q ss_pred CCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHH---HHHHHH Q lcl|NC_017974. 299 GEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESK---QTIHTQ 375 (507) Q Consensus 299 G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh---~~v~~~ 375 (507) ....+.++.|.++++ +.... ....+...++.+.+.|...--+..++.+.. |+...|..- ..-... T Consensus 293 -~~~~i~~~~~~~~~~-------l~~~~--~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~--~~n~Sg~Al~~~~~~l~~ 360 (474) T protein:vir:94 293 -YYKAINVDGDGGVET-------IQVEV--PVSSTKEYIDLMRVYIMEFGQGVDFQTDKF--GSAPSGIALKFLYGNLDL 360 (474) T ss_pred -ccceeeccCCCceeE-------EeecC--CHHHHHHHHHHHHHHHHHHhCccccCcccc--ccccHHHHHHHHHHHHHH Confidence 113355666655443 33222 223466788888888877654443443222 111112111 111222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcC-C Q lcl|NC_017974. 376 FVQRVNEIILEALNENLLPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLG-I 454 (507) Q Consensus 376 ~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~G-i 454 (507) .+..-.+.+...|. ++++.++.+.+... +..--.+.|....+.+..+.++.+.+ +|.+ + .+.+.+.++ + T Consensus 361 k~~~k~~~~~~~l~-~~~~li~~~~~~~~-d~~~i~v~f~~~~p~~~~e~a~~~~~---~g~i-S----~et~l~~l~~v 430 (474) T protein:vir:94 361 KANKLKNKATVAIQ-ELISFIIDFNNLKT-DVKDIEISFNFNRMMNDAEQSQIIAQ---SQYL-S----RETLVKSSPLV 430 (474) T ss_pred HHHHHHHHHHHHHH-HHHHHHHHHhCCCc-ccceeeEEeccCcccCHHHHHHHHHH---cCCC-C----HHHHHHhCCCC Confidence 22333345555664 46677777653221 11223567877778887777766544 5753 3 244556664 4 Q ss_pred CCCCCccc-ccc---hhhhcCCCcCCC-CCCcccccCccccccc Q lcl|NC_017974. 455 DEKIEEDI-SQE---ELMKLLGEDTSR-AGDGMTKGSSGNGTGK 493 (507) Q Consensus 455 p~~~~~~~-~~~---~~~~~~~~~~~~-~~~~~~~~~~~~~~~~ 493 (507) +.+..+-. ... ..+...+..... ...+.....+++..++ T Consensus 431 ~D~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 474 (474) T protein:vir:94 431 DDYKAELERIEQEQMEYNKQLPNLDDGGADGAQQQEGSNNKESE 474 (474) T ss_pred CCHHHHHHHHHHHHHHHHhhccccCCCCCCCcccCCCCcccccC Confidence 33221100 001 111111111000 0000001111111111 No 190 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=90.20 E-value=0.022 Score=29.66 Aligned_cols=434 Identities=12% Similarity=0.085 Sum_probs=156.3 Q ss_pred Cccccccccccc------Cccccc-----cccccccccccccCchhhhHHHHhhhhcccchhccchhHH-HHHHHhhChH Q lcl|NC_017974. 1 MSRKRNRNRSVK------VAKATS-----EQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIG-TVESMKQDAT 68 (507) Q Consensus 1 ~~~k~~~~~~~~------~~~~~~-----~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~-~y~~m~~D~~ 68 (507) |+=.-+.|.-.+ +.+... ..+++++..+.. + ..++..- .-..+.++...... ...+=..--- T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~---i--~~~~~~Y-~g~~~~~~~~~~~~~~~~~~~~sln 74 (500) T protein:vir:30 1 MGVIQKIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDR---I--TTNLKYY-KSDWDSVLYLNTDGETKKRDLNHLP 74 (500) T ss_pred CchHHHHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHH---H--HHHHHHh-cCCCCCcccccCCCCcccCceeecc Confidence 222111111110 111111 011112111110 0 0011100 00111111000000 0000000001 Q ss_pred HHHHHHHH-HHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHH-HHHHhhcceeeeeeeeccCcccc Q lcl|NC_017974. 69 VATGLDML-YTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDA-ATFNEYGLSVVEKVYTQVTVGEY 146 (507) Q Consensus 69 v~s~l~~R-k~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~-l~a~~~Gfs~~Ei~w~~~~~~~~ 146 (507) +...+-+. -..|.+-.-.+.+ .+++..+++++++++. .|...+... ..|..+|=.++=+.|.. T Consensus 75 l~~~i~~~~A~lv~~e~~~i~~------~d~~~~~~l~~il~~n---~f~~~~~~~~e~a~a~G~~~~k~~~d~------ 139 (500) T protein:vir:30 75 IARTAAKKIASLVFNEQAEIKV------DDDAANEFISETLKND---RFNKNFERYLESCLALGGLAMRPYVDG------ 139 (500) T ss_pred hHHHHHHHHhhhhcCCcceEec------CChHHHHHHHHHHhhc---cHHHHHHHHHHHHhhcCCEEEEEEEeC------ Confidence 11222211 1122222222333 2356778888888753 355555554 45888888877666652 Q ss_pred CcceeeeeccccCchhcccccceeecCCC----------------cccccc--ccc------ccc-----cccccccccc Q lcl|NC_017974. 147 IGKYKVKNLAFRPQASLSRTNPIVYNEDG----------------SAIIGI--KQS------LSA-----FQNYTASEIG 197 (507) Q Consensus 147 ~g~~~~~~l~~r~~~~i~~~~~~~~~~d~----------------~~l~~~--~q~------~~~-----~~~~~~~~~~ 197 (507) +.+.+. ..++.++. |+.++.++ ...... +-| +.. +.......++ T Consensus 140 -~~~~I~---~v~ad~~~---P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG 212 (500) T protein:vir:30 140 -DKVRVA---FVQAPVFL---PLQSNTQDVSSAAVVIKSVKTINGKEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKVG 212 (500) T ss_pred -CceEEE---EEcCCeeE---EEEEcCCCeEEEEEEEEEeeeecCCceEEEEEEEEEEeCCceeEEEEEEEecccccccC Confidence 222221 11222221 11111111 110000 000 000 0000000000 Q ss_pred ccc-----cccCC-----CccccccccEEEEe----eCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceE Q lcl|NC_017974. 198 VGG-----VSTRM-----SDVIIPINRVMLMN----TGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIE 263 (507) Q Consensus 198 ~~~-----~~~~~-----~~~~iP~~k~i~~~----~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v 263 (507) ... +.+.. .++.-| =|.+++ .....++|+|.|.+..|.-..-.=+..+..|+.-++. |- - . T Consensus 213 ~~v~l~~~~~~l~~~~~~~~~~~p--~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~-g~-~-~ 287 (500) T protein:vir:30 213 SRVPLSEVYKDLKDEAKVTDVTRP--IFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKM-GQ-R-R 287 (500) T ss_pred cccccccccCCcCcceEeccCCCc--cEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHh-Cc-c-e Confidence 000 00000 111111 133332 2345678999999999886655445555555554442 22 1 2 Q ss_pred EecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHH Q lcl|NC_017974. 264 LKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKS 343 (507) Q Consensus 264 ~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~ 343 (507) +.+|..+.+........+.. ..... .........++.+..-. ..++.....-- ...|...++.+=++ T Consensus 288 i~v~~~~l~~~~~~~~g~~~--~~~~~------d~~~~~~~~~~~~~~~~---~~i~~~~~~ir--~e~~~~~l~~~l~~ 354 (500) T protein:vir:30 288 VAVPESLTALTVRTTDGDVV--PRPRF------ESDQNVYIRMGGRDLDS---SAIQDLTTPIR--ADDYIKAINEGLSL 354 (500) T ss_pred eeechHHhcccCCCCCcccc--CCccc------CCCcceEEEcCCCCCcC---cceeEeccccC--hHHHHHHHHHHHHH Confidence 33454433322111100000 00000 00011111222110000 00111100000 11244445555555 Q ss_pred HHHH-HhcccccccccchhhhhHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CCcCCccccceEEe Q lcl|NC_017974. 344 ILDR-LGAGFINVGNDKGGSYNLSESK--QTIHTQFVQRVNEIILEALNENLLPQLLALN------DMRLPETEMPYVKA 414 (507) Q Consensus 344 Isk~-iLGqtLts~~~~~GS~Al~~vh--~~v~~~~~~aD~~~i~~~ln~~li~~l~~lN------~~~~~~~~~P~~~~ 414 (507) |+.. -+++. |.+.+.+|...+-++. +.-...-+.+-.+.+..+|. +|++.++.+. ....+...-+.+.| T Consensus 355 i~~~~gls~~-~~~~~~~g~~TAtei~s~~~~~~~t~~~~~~~~~~al~-~lv~~il~~~~~~~~~~~~~~~~~~v~v~f 432 (500) T protein:vir:30 355 FEMQIGVSAG-LFSFDGKSMKTATEIVSENSDTYQMRNSIVALVEQSLK-ELVISIFEIAKAYDLYQSEVPSMDNISISL 432 (500) T ss_pred HHHHhCCCcc-ccccCcCccccHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhhcCCCCCCCcceEEEe Confidence 5433 23322 2222222322112221 12222333445566666774 5777776432 11112222246889 Q ss_pred cCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhhcCCCcCCCCCCcccccCccc Q lcl|NC_017974. 415 GEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMKLLGEDTSRAGDGMTKGSSGN 489 (507) Q Consensus 415 ~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 489 (507) +..-..|.++.++.+.+++.+|++.. +.++.+.+|+++.+.+++.....+..++...+...+ ..-.|+ T Consensus 433 ~d~i~~d~~~~~~~~~~~v~aGi~s~----~~~i~~~~g~~eeea~~~l~~i~~E~~~~~~~~~~~---~~~~g~ 500 (500) T protein:vir:30 433 DDGVFTDRDAELDYWIKVVNAGFGTR----EMAIQKVLNVTEEKAQEIAAEINTGIVDEINQQRTD---THLYGE 500 (500) T ss_pred CCCCCCCHHHHHHHHHHHHHcCCCCH----HHHHHhcCCCCHHHHHHHHHHHHHhccccCCCCCcc---ccccCC Confidence 88888888888999999999998654 568888899975543333333333333322222111 111222 No 191 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=90.20 E-value=0.022 Score=29.66 Aligned_cols=434 Identities=12% Similarity=0.085 Sum_probs=156.3 Q ss_pred Cccccccccccc------Cccccc-----cccccccccccccCchhhhHHHHhhhhcccchhccchhHH-HHHHHhhChH Q lcl|NC_017974. 1 MSRKRNRNRSVK------VAKATS-----EQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIG-TVESMKQDAT 68 (507) Q Consensus 1 ~~~k~~~~~~~~------~~~~~~-----~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~-~y~~m~~D~~ 68 (507) |+=.-+.|.-.+ +.+... ..+++++..+.. + ..++..- .-..+.++...... ...+=..--- T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~---i--~~~~~~Y-~g~~~~~~~~~~~~~~~~~~~~sln 74 (500) T protein:vir:98 1 MGVIQKIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDR---I--TTNLKYY-KSDWDSVLYLNTDGETKKRDLNHLP 74 (500) T ss_pred CchHHHHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHH---H--HHHHHHh-cCCCCCcccccCCCCcccCceeecc Confidence 222111111110 111111 011112111110 0 0011100 00111111000000 0000000001 Q ss_pred HHHHHHHH-HHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHH-HHHHhhcceeeeeeeeccCcccc Q lcl|NC_017974. 69 VATGLDML-YTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDA-ATFNEYGLSVVEKVYTQVTVGEY 146 (507) Q Consensus 69 v~s~l~~R-k~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~-l~a~~~Gfs~~Ei~w~~~~~~~~ 146 (507) +...+-+. -..|.+-.-.+.+ .+++..+++++++++. .|...+... ..|..+|=.++=+.|.. T Consensus 75 l~~~i~~~~A~lv~~e~~~i~~------~d~~~~~~l~~il~~n---~f~~~~~~~~e~a~a~G~~~~k~~~d~------ 139 (500) T protein:vir:98 75 IARTAAKKIASLVFNEQAEIKV------DDDAANEFISETLKND---RFNKNFERYLESCLALGGLAMRPYVDG------ 139 (500) T ss_pred hHHHHHHHHhhhhcCCcceEec------CChHHHHHHHHHHhhc---cHHHHHHHHHHHHhhcCCEEEEEEEeC------ Confidence 11222211 1122222222333 2356778888888753 355555554 45888888877666652 Q ss_pred CcceeeeeccccCchhcccccceeecCCC----------------cccccc--ccc------ccc-----cccccccccc Q lcl|NC_017974. 147 IGKYKVKNLAFRPQASLSRTNPIVYNEDG----------------SAIIGI--KQS------LSA-----FQNYTASEIG 197 (507) Q Consensus 147 ~g~~~~~~l~~r~~~~i~~~~~~~~~~d~----------------~~l~~~--~q~------~~~-----~~~~~~~~~~ 197 (507) +.+.+. ..++.++. |+.++.++ ...... +-| +.. +.......++ T Consensus 140 -~~~~I~---~v~ad~~~---P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG 212 (500) T protein:vir:98 140 -DKVRVA---FVQAPVFL---PLQSNTQDVSSAAVVIKSVKTINGKEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKVG 212 (500) T ss_pred -CceEEE---EEcCCeeE---EEEEcCCCeEEEEEEEEEeeeecCCceEEEEEEEEEEeCCceeEEEEEEEecccccccC Confidence 222221 11222221 11111111 110000 000 000 0000000000 Q ss_pred ccc-----cccCC-----CccccccccEEEEe----eCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceE Q lcl|NC_017974. 198 VGG-----VSTRM-----SDVIIPINRVMLMN----TGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIE 263 (507) Q Consensus 198 ~~~-----~~~~~-----~~~~iP~~k~i~~~----~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v 263 (507) ... +.+.. .++.-| =|.+++ .....++|+|.|.+..|.-..-.=+..+..|+.-++. |- - . T Consensus 213 ~~v~l~~~~~~l~~~~~~~~~~~p--~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~-g~-~-~ 287 (500) T protein:vir:98 213 SRVPLSEVYKDLKDEAKVTDVTRP--IFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKM-GQ-R-R 287 (500) T ss_pred cccccccccCCcCcceEeccCCCc--cEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHh-Cc-c-e Confidence 000 00000 111111 133332 2345678999999999886655445555555554442 22 1 2 Q ss_pred EecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHH Q lcl|NC_017974. 264 LKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKS 343 (507) Q Consensus 264 ~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~ 343 (507) +.+|..+.+........+.. ..... .........++.+..-. ..++.....-- ...|...++.+=++ T Consensus 288 i~v~~~~l~~~~~~~~g~~~--~~~~~------d~~~~~~~~~~~~~~~~---~~i~~~~~~ir--~e~~~~~l~~~l~~ 354 (500) T protein:vir:98 288 VAVPESLTALTVRTTDGDVV--PRPRF------ESDQNVYIRMGGRDLDS---SAIQDLTTPIR--ADDYIKAINEGLSL 354 (500) T ss_pred eeechHHhcccCCCCCcccc--CCccc------CCCcceEEEcCCCCCcC---cceeEeccccC--hHHHHHHHHHHHHH Confidence 33454433322111100000 00000 00011111222110000 00111100000 11244445555555 Q ss_pred HHHH-HhcccccccccchhhhhHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CCcCCccccceEEe Q lcl|NC_017974. 344 ILDR-LGAGFINVGNDKGGSYNLSESK--QTIHTQFVQRVNEIILEALNENLLPQLLALN------DMRLPETEMPYVKA 414 (507) Q Consensus 344 Isk~-iLGqtLts~~~~~GS~Al~~vh--~~v~~~~~~aD~~~i~~~ln~~li~~l~~lN------~~~~~~~~~P~~~~ 414 (507) |+.. -+++. |.+.+.+|...+-++. +.-...-+.+-.+.+..+|. +|++.++.+. ....+...-+.+.| T Consensus 355 i~~~~gls~~-~~~~~~~g~~TAtei~s~~~~~~~t~~~~~~~~~~al~-~lv~~il~~~~~~~~~~~~~~~~~~v~v~f 432 (500) T protein:vir:98 355 FEMQIGVSAG-LFSFDGKSMKTATEIVSENSDTYQMRNSIVALVEQSLK-ELVISIFEIAKAYDLYQSEVPSMDNISISL 432 (500) T ss_pred HHHHhCCCcc-ccccCcCccccHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhhcCCCCCCCcceEEEe Confidence 5433 23322 2222222322112221 12222333445566666774 5777776432 11112222246889 Q ss_pred cCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhhcCCCcCCCCCCcccccCccc Q lcl|NC_017974. 415 GEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMKLLGEDTSRAGDGMTKGSSGN 489 (507) Q Consensus 415 ~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 489 (507) +..-..|.++.++.+.+++.+|++.. +.++.+.+|+++.+.+++.....+..++...+...+ ..-.|+ T Consensus 433 ~d~i~~d~~~~~~~~~~~v~aGi~s~----~~~i~~~~g~~eeea~~~l~~i~~E~~~~~~~~~~~---~~~~g~ 500 (500) T protein:vir:98 433 DDGVFTDRDAELDYWIKVVNAGFGTR----EMAIQKVLNVTEEKAQEIAAEINTGIVDEINQQRTD---THLYGE 500 (500) T ss_pred CCCCCCCHHHHHHHHHHHHHcCCCCH----HHHHHhcCCCCHHHHHHHHHHHHHhccccCCCCCcc---ccccCC Confidence 88888888888999999999998654 568888899975543333333333333322222111 111222 No 192 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=90.12 E-value=0.023 Score=29.61 Aligned_cols=416 Identities=11% Similarity=0.054 Sum_probs=157.7 Q ss_pred CcccccccccccCccccccc---cccccccccccCc-hhhhHHHHhhhhcccchhccchhHHHHHH---Hh--------- Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQ---INVSRMRMSEQGT-FALAKVQVDSERMKAEEIRWPNLIGTVES---MK--------- 64 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~---~~~~~~~~~~~g~-~~~~~~~~~~~~e~~~~lr~~~~~~~y~~---m~--------- 64 (507) |.++=+ +|-+..-. +..-.... +.-. .-..-...+. .+.+ |..++.+.|+. ++ T Consensus 1 ~~~~~~------~~~~~~~~~~~~~~~~~~~-~~~~~~i~~~i~~~~--~~~~--~~~~~~~Yy~g~~~i~~r~~~~~~~ 69 (474) T protein:vir:95 1 MFNIIR------MPWDKPYGEEVVEQLKPQF-ETQEEMIIRLIDDHR--KQLD--KITVGQRYYDKDNDIVKQMKKVDVY 69 (474) T ss_pred Ccceee------cCCCCchhhHHHHhhhhcc-CChHHHHHHHHHHHH--HHHH--HHHHHHHHhcccCchhccccccccc Confidence 554322 22221100 00000000 0000 0000000000 0000 00111111110 00 Q ss_pred ------------hChHHHHHHHHHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHH-HHHHHhhcc Q lcl|NC_017974. 65 ------------QDATVATGLDMLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARD-AATFNEYGL 131 (507) Q Consensus 65 ------------~D~~v~s~l~~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~-~l~a~~~Gf 131 (507) -.....-++.+...-+.+-+..+. .+++++.+++++++++ .|...+.. ..++.-||. T Consensus 70 ~~~~~~~~~~ki~~n~~~~Ivd~~~~~l~g~p~~~~------~~d~~~~~~l~~~~~n----~~~~~~~e~~~~~~~~G~ 139 (474) T protein:vir:95 70 GNIDYDKPDWRITTNFHQNLVDQKVSYVASKPVTYS------CEDESVLKIIHDVLDT----RWDNKLIDILTATSNKGI 139 (474) T ss_pred cccccccccceeccchHHHHHHHHHhhhccCCceec------cCchHHHHHHHHHHhc----cHHHHHHHHHHHHhhcCc Confidence 012222223333333333333332 2335566777777652 36665555 446778997 Q ss_pred eeeeeeeeccCccccCcceeeeeccccCchhcccccceeecCCCccccc-cccc----ccccccccccccccccc----- Q lcl|NC_017974. 132 SVVEKVYTQVTVGEYIGKYKVKNLAFRPQASLSRTNPIVYNEDGSAIIG-IKQS----LSAFQNYTASEIGVGGV----- 201 (507) Q Consensus 132 s~~Ei~w~~~~~~~~~g~~~~~~l~~r~~~~i~~~~~~~~~~d~~~l~~-~~q~----~~~~~~~~~~~~~~~~~----- 201 (507) +. +++|..+ +|++.+.-+.|. ..+ +++.+.+.+.+.. ++.. ......++.....-... T Consensus 140 ~~-~~v~~d~-----~~~~~i~~~~p~--~~~----~v~d~~~~~~~~~~i~~~~~~~~~~~~~y~~~~~~~~~~~~~~~ 207 (474) T protein:vir:95 140 DW-LQVYINE-----NGEMKLFRVPAE--QAI----PIWVDKEREELKSFIRYYKFNNEEKVEFWTDTTVTYYVLENGGL 207 (474) T ss_pred EE-EEEEecC-----CCceEEEEEccc--ceE----EEEcCCCCCceEEEEEEEEEcCeeEEEEEeCCeEEEEEEcCCcc Confidence 54 5777532 455555544332 111 1111111111111 1000 00000000000000000 Q ss_pred -----------ccCCCccccccccEEEEeeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEeccccc Q lcl|NC_017974. 202 -----------STRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQI 270 (507) Q Consensus 202 -----------~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~ 270 (507) ........+..-.++.|+ .|+.|.|.+..+-...=--+..+..++..++.+..|+++++- + T Consensus 208 ~~~~~~~~~~~~~~~~~~~~g~iPvv~~~-----nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g---~ 279 (474) T protein:vir:95 208 IPDYYYGANHIQSHFSNGNWGRVPFIAFK-----NNPEEVSDIWMYKSLIDAIDKRLSDAQNMFDESVELIYILKG---Y 279 (474) T ss_pred ccccccCcccccccccccCCCccceEeec-----CCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeec---C Confidence 000000011111123332 468899988875444333355778888888989999887651 1 Q ss_pred CCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhc Q lcl|NC_017974. 271 LNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGA 350 (507) Q Consensus 271 ~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLG 350 (507) + .++..+ ....+ .....+.++.+.++++. .... ....+...++.+.+.|...-.+ T Consensus 280 ~----~~~~~~---~~~~~---------~~~~~i~~~~~~~~~~l-------~~~~--~~~~~~~~~~~l~~~i~~~s~~ 334 (474) T protein:vir:95 280 E----GQDLEE---FMRGL---------KYYKAINVDGDGGVETI-------QVEV--PVSSTKEYIDLMRAYIMEFGQG 334 (474) T ss_pred C----cccchh---hhhhh---------hccceeeccCCCceeEE-------eecC--CHHHHHHHHHHHHHHHHHHhCC Confidence 1 111111 01111 11123446666654432 2222 2234677889999998876555 Q ss_pred ccccccccchhhhhHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCccccceEEecCcCchhHHHHHH Q lcl|NC_017974. 351 GFINVGNDKGGSYNLSES---KQTIHTQFVQRVNEIILEALNENLLPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSK 427 (507) Q Consensus 351 qtLts~~~~~GS~Al~~v---h~~v~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~ 427 (507) ..++.+.. |+...|.. ...-....+..-.+.+...| +++++.++.+.+... +..-..+.|....+.|..+.++ T Consensus 335 p~~~~~~~--~~n~Sg~Alk~~~~~l~~k~~~k~~~~~~~l-~~~~~li~~~~g~~~-d~~~i~v~f~~~~p~d~~e~a~ 410 (474) T protein:vir:95 335 VDFQTDKF--GSAPSGIALKFLYGNLDLKANKLKNKATVAI-QELIGFIIDFNNLKM-DVKDIEISFNFNRMMNDAEQSQ 410 (474) T ss_pred cccccccc--cccchHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhCCCc-ccceeeEEeccCCCcCHHHHHH Confidence 44444322 11111111 11112222333445566666 457777777764221 2223457788888888777776 Q ss_pred HHHHHHhcCCccccHHHHHHHHHHcC-CCCCCCc-ccccchh---hhcCCCcCCCCCCcccccCcccccccccCCCCch Q lcl|NC_017974. 428 AIQRIGAVGYLPKTPKVINRVLEVLG-IDEKIEE-DISQEEL---MKLLGEDTSRAGDGMTKGSSGNGTGKISSTRDNS 501 (507) Q Consensus 428 ~~~~L~~~G~~~~~~~~~~~i~e~~G-ip~~~~~-~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 501 (507) ++ +.+|++ + .+.+.+.++ ++.+..+ +...++. ...+.. ....+ .... .....+...++. T Consensus 411 ~~---~~~g~i-S----~et~i~~l~~v~d~~~E~~ri~~E~~~~~~~~~~-~~~~~---~d~~---~~~~~~~~~~~~ 474 (474) T protein:vir:95 411 II---AQSQYL-S----RETLVKSSPLVDDYKAELERIEQEQMEYNKQLPN-LDDGG---ADGA---QQQERSNDKESE 474 (474) T ss_pred HH---HhcCCC-c----hHHHHHhCCCCCCHHHHHHHHHHHHHHHHhcccc-ccccc---CCCC---cCCCCCccCCCC Confidence 55 456864 3 234445554 3322111 0001111 111111 11100 0000 000001111111 No 193 >protein:vir:78393 Length: 489 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110831;genbank:gi:134288592;genbank:GeneID:5179656 Probab=89.46 E-value=0.026 Score=29.25 Aligned_cols=430 Identities=10% Similarity=0.042 Sum_probs=157.1 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhh--------cccchh----ccchhHHHHHHHhh--- Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSER--------MKAEEI----RWPNLIGTVESMKQ--- 65 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~--------e~~~~l----r~~~~~~~y~~m~~--- 65 (507) |---+.++..+..+-|.-.+ ....|...... .....+ +.+ .-+-|+.-+. T Consensus 1 ~~~~~~~~~~V~~~hp~y~a--------------~~~~W~~ird~~~G~~~~~~r~~yl~~~~~~~-~e~~Y~~rl~rA~ 65 (489) T protein:vir:78 1 MLTENGQGSGVKTKHREWLH--------------YAPKWQKVRHALAGELVSYLRNVGLNEPDKAY-GEARQAEYEAGGI 65 (489) T ss_pred CccCCCccCCCCccCHHHHH--------------HHHHHHHHHHHhcCcccccccCCCCCCCCCCC-ChHHHHHHHhccc Confidence 33333332222222222111 01111111100 010111 111 1122544332 Q ss_pred -ChHHHHHHHHHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHHH-HHhhcceeeeeeeeccCc Q lcl|NC_017974. 66 -DATVATGLDMLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAAT-FNEYGLSVVEKVYTQVTV 143 (507) Q Consensus 66 -D~~v~s~l~~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l~-a~~~Gfs~~Ei~w~~~~~ 143 (507) =+++...++.-...|.+.+-.+.++ ..+..|++.+ +..+.++..++++++. ++.||.+.+=+.+-..++ T Consensus 66 ~~n~~~~tl~~l~G~vfrk~p~~~~p-------~~l~~l~~d~--D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~ 136 (489) T protein:vir:78 66 VYNFTRRTLSGMVGSVMRKEPEINIP-------KELEYLLKNA--DGSGVGLIQHAQDTLMEIDSVGRGGLLVDAPETGA 136 (489) T ss_pred cCChHHHHHHHHhchhhcCCcceecc-------HHHHHHHhcc--CCCCCCHHHHHHHHHHHHHhcCeEEEEEeeCCCCC Confidence 3455555555555554443223221 2233333322 3455688899988765 777998876555432211 Q ss_pred ccc-----CcceeeeeccccCchhcccccceee-----------------cC--CCcccccccccccc---------ccc Q lcl|NC_017974. 144 GEY-----IGKYKVKNLAFRPQASLSRTNPIVY-----------------NE--DGSAIIGIKQSLSA---------FQN 190 (507) Q Consensus 144 ~~~-----~g~~~~~~l~~r~~~~i~~~~~~~~-----------------~~--d~~~l~~~~q~~~~---------~~~ 190 (507) .. .+... --|....+..|..|+.-.. .+ |+.+-....|..-. ... T Consensus 137 -~T~ade~~~~~r-Py~~~~~~~~IinW~~~~v~G~~~Lt~v~lrE~~~~~d~~~~f~~~~~~q~RvL~~~~~g~~~~~~ 214 (489) T protein:vir:78 137 -ATAAEQNAGLLN-PTIAFYTTENIVNWRLTRVGSVNRVTMVVLRETWEYNEPGNEFETKYGEQYRVLDIDSDGNYRQRL 214 (489) T ss_pred -cCHHHHHHhcCC-cEEEEechhhhcCceeeeeCCccceeEEEEEEeEEeecCCCCccceeEEEEEEEecCCCcceEEEE Confidence 00 00000 0011122222322221000 00 11111111110000 000 Q ss_pred cc---cccccccc--cccCCCccccccccEEEEeeCCccCccccchHHHHHHHHHHHHHH-HHHHHHHHHHHhcCCceEE Q lcl|NC_017974. 191 YT---ASEIGVGG--VSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYRAWREKIL-IENLEVVGATKDMGGVIEL 264 (507) Q Consensus 191 ~~---~~~~~~~~--~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~-~~~~w~~f~Er~g~p~~v~ 264 (507) +. .+...... .....++..++.=-|+++-.. ..+--.+...|..++..-+ ++| ...+.-+-+-.-+.|++++ T Consensus 215 ~r~~~~g~~~~~~~~~~~~~g~~~l~~IPfv~~~~~-~~~~~~~~pPLl~LA~lni-~Hy~~ssd~~~~l~~~~~P~l~i 292 (489) T protein:vir:78 215 FRFDAEGGAQEDVVEIYPDLGESLRGVIPFTFIGAT-NNDATIDDAPLLPLAELNI-GHYRNSADNEESSFVVGQPTLFI 292 (489) T ss_pred EEeecCCcccceeeEEeccCCCCccCeeeEEEEecC-CCCCCCCcCchHHHHHHHH-HHhhhhhHHHHHHHHcccceeee Confidence 00 00000000 000111212221123333222 2222334444555543322 222 1222223333345676665 Q ss_pred ecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHH Q lcl|NC_017974. 265 KIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSI 344 (507) Q Consensus 265 ~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~I 344 (507) . +.. +..++. +.... ...++.|+.++..+|.+....+ ++.++++ .. ++.++-...+| T Consensus 293 ~-G~d-------~~~~~~---~~~~~--~~~i~~g~~~~~~lp~~~~~~~-------ie~~~~~--~~-r~~l~~le~qm 349 (489) T protein:vir:78 293 Y-PGE-------NLTPQA---FKEAN--PNGIKFGSRRGHNLGYGGSAQL-------IQAGENN--LA-RQNMLDKEQQA 349 (489) T ss_pred e-cCc-------cCCccc---ccccC--ccceeeCCcccccCCCCCCcce-------eccCcch--HH-HHHHHHHHHHH Confidence 4 110 011110 00000 1224457778888888875443 3343332 22 33333334443 Q ss_pred HHHHhcccccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCccccceEEe--cC-cCchh Q lcl|NC_017974. 345 LDRLGAGFINVGNDKGGSYNLSESKQTIHTQFVQRVNEIILEALNENLLPQLLALNDMRLPETEMPYVKA--GE-IVDVD 421 (507) Q Consensus 345 sk~iLGqtLts~~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~~P~~~~--~~-~~~~d 421 (507) . .+....++.+..++++-+.. .......++.+-+..+++.++ +++++++.+-+. ++..-+.|.. +. ....| T Consensus 350 ~-~lGa~l~~~~~~~Ta~~~~~--~~~~~~S~L~~~a~~~e~al~-~~l~~~a~w~G~--~~~~~~~i~~n~dF~~~~~d 423 (489) T protein:vir:78 350 I-QIGAQLITPTQQITAQSARI--QRGADTSVMATIARNVSQAYT-DALRWVAVMLGK--PEDTEVEFRLNMDFFLEPMT 423 (489) T ss_pred H-HHhhhhccCCcchhHHHHHH--HHHHhhHHHHHHHHHHHHHHH-HHHHHHHHHcCC--CCCCceEEEeecccCcccCC Confidence 3 22234444332345554433 333346778888899999997 588999997532 1111122311 11 22233 Q ss_pred HHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhhcCCCcCCCCCCcccccCcccccccccCCCCch Q lcl|NC_017974. 422 MEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMKLLGEDTSRAGDGMTKGSSGNGTGKISSTRDNS 501 (507) Q Consensus 422 l~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 501 (507) . ...+++..+...|.+.. .....+++ +-||..+..++...+.....++ ...+.++..|+.+.+. T Consensus 424 ~-~~~~al~~~~~~G~is~-~t~~~~L~-~~gv~d~~~e~~~~ei~~~~~~-------------~~~~~~g~~~~~~q~~ 487 (489) T protein:vir:78 424 A-QDRAAWMADINAGLLPA-TAYYAALR-KAGVTDWTDADIKDAVADQPLP-------------VATEVQGEIPQSAQQQ 487 (489) T ss_pred H-HHHHHHHHHHhcCCCCH-HHHHHHHH-hCCCCCccHHHHHHHHhhcCCC-------------cccCCcccCCCCcccc Confidence 2 23456667778887543 33344554 4577654433322111111000 0111122222221111 Q ss_pred hh Q lcl|NC_017974. 502 AA 503 (507) Q Consensus 502 a~ 503 (507) .+ T Consensus 488 ~~ 489 (489) T protein:vir:78 488 EK 489 (489) T ss_pred cC Confidence 11 No 194 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=89.06 E-value=0.029 Score=29.05 Aligned_cols=441 Identities=8% Similarity=0.016 Sum_probs=151.6 Q ss_pred Cccccccccccc---Cccccccccccccc--cccccCchhh--hHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVK---VAKATSEQINVSRM--RMSEQGTFAL--AKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGL 73 (507) Q Consensus 1 ~~~k~~~~~~~~---~~~~~~~~~~~~~~--~~~~~g~~~~--~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l 73 (507) |.=--+.|+.++ .-+|........+. ++.+ +...- ...+......+++. .+.. .-..-.-+.+.+ T Consensus 1 ~~~~~~~~~~i~~w~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~w~~~~~-~~~~------~~~~~~~l~~~i 72 (518) T protein:vir:78 1 MGVWSVMTRFIKGWLNGKPNGSEPELIPKYLPLVP-DNQKEWSKDSYLTSLWAQGYV-PTVH------DKLMNSGTGNEI 72 (518) T ss_pred CcchhhHHHHHHHhhcCCCCccchhccHHHhhhcc-cchhhhhhhhhhhhhcccCCC-Cccc------cccccCChHHHH Confidence 222222222221 11111000000000 0000 00000 00000000000000 0000 000000122222 Q ss_pred HH-HHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHH-HHHHHHHhhcceeeeeeeeccCccccCccee Q lcl|NC_017974. 74 DM-LYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFA-RDAATFNEYGLSVVEKVYTQVTVGEYIGKYK 151 (507) Q Consensus 74 ~~-Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l-~~~l~a~~~Gfs~~Ei~w~~~~~~~~~g~~~ 151 (507) -+ --..|.+-.-.+.+.+.+..+++.+.++++++|++.. |...+ .-+.++...|=.++=..|. +|++. T Consensus 73 ~~~~A~ll~~e~~~i~v~~~~~~d~e~~~~~l~~il~~n~---f~~~~~~~~e~a~a~G~~~~k~~~d-------~~~~~ 142 (518) T protein:vir:78 73 VVVAAEYISGKPLSIDVTGVNGSKDENLTKQLKEALRIDN---FDSKSVKIVELAGGSGVSAVKINIL-------NGRPS 142 (518) T ss_pred HHHHHHhhcCCCceEEecCccccCcHHHHHHHHHHHHhcc---HHHHHHHHHHHhhccCceEEEEEEE-------CCeeE Confidence 22 2222333334566654443445567788888887633 44444 4445677788777755553 23333 Q ss_pred eeeccccCchhcccccceeecCCCcccccccccccc----cc----c------------ccccccc--cc---------- Q lcl|NC_017974. 152 VKNLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSA----FQ----N------------YTASEIG--VG---------- 199 (507) Q Consensus 152 ~~~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~----~~----~------------~~~~~~~--~~---------- 199 (507) +.- .+..++.. .+.+.+-..+.-+...+.. .+ . ...+.+. +. T Consensus 143 i~~---v~ad~~~P---~~~~g~~~~~~f~~~~~~~~k~~~y~~lE~he~~~~~~~~~~~~~~~I~n~ly~~~~~~~v~~ 216 (518) T protein:vir:78 143 ISV---HSSSQFWI---DFKNNEPFRFNFFEEIPTSNKADIYYLVESREIKQWDKEGKKLSGGFVTYSVIKIDGDKTTPI 216 (518) T ss_pred EEE---EcCCeeEE---EeecCcEEEEEEEEEeecCCcceeEEEEEeeccccccceeecccceeEEEEEeeecCcccccc Confidence 322 22222211 1111110011000000000 00 0 0000000 00 Q ss_pred -------------ccccCCCccccc---cccEEEEeeC-----CccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_017974. 200 -------------GVSTRMSDVIIP---INRVMLMNTG-----GSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDM 258 (507) Q Consensus 200 -------------~~~~~~~~~~iP---~~k~i~~~~~-----~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g 258 (507) .+........++ +.-|+.|... ...++|+|.|.+..|.-..-.=+..+..|+.-++ .| T Consensus 217 ~~~~~~~~l~~~~~~~~~~e~~~~~tg~~~~~~~~~~n~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~-~g 295 (518) T protein:vir:78 217 SAERLPEQITSYLHTNDIQLNHSVSIGLKSMGAYLINNSPSNTRYPHLNLGESDLSQCTNYLFAVDYFFTVYMREGE-KT 295 (518) T ss_pred cccccccccccccccccCccceeeccCCccceEEeeccccccccccCCCcCcchHhhhhHHHHHHHHHHHHHHHHHH-hC Confidence 000000000011 1124455432 2357899999999887554444444444554444 22 Q ss_pred CCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCccc--ccccccceeeeeccCCCcchhHHHH Q lcl|NC_017974. 259 GGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTK--DNAPQYSMTLKGIDGMGKQYSTAQL 336 (507) Q Consensus 259 ~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~--i~~~~~~~el~~~~g~g~~~~~~~l 336 (507) -. .+.+|..+.+............. +..+.+....+..++. .+.++ .++.....-. ...|... T Consensus 296 ~~--~i~v~~~~l~~~~~~~~~~~~~~----------fd~~~~~y~~i~~~~~~~~~~~~-~i~~~~~~Ir--~e~~~~~ 360 (518) T protein:vir:78 296 KT--KIAASERMFRKKVNKSTDKEEWS----------MNVDEDYFMQFKGTLDAGAKLND-MIQFMQGDFR--DGSYRET 360 (518) T ss_pred Cc--eeeechhHhccCCCCCCCccccc----------cCCCCceEEEecCcCCCCCcccc-ceeeeecccC--hHHHHHH Confidence 22 33445433322111100000000 0001122222221110 00000 0000000000 1124444 Q ss_pred HHHHHHHHHHHH-hc-ccccccccchhhhhHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHhCCCc-------CC Q lcl|NC_017974. 337 ISDRKKSILDRL-GA-GFINVGNDKGGSYNLSESKQTIH--TQFVQRVNEIILEALNENLLPQLLALNDMR-------LP 405 (507) Q Consensus 337 i~~~d~~Isk~i-LG-qtLts~~~~~GS~Al~~vh~~v~--~~~~~aD~~~i~~~ln~~li~~l~~lN~~~-------~~ 405 (507) ++.+=++|...+ ++ +|+.. + +|-....++..+-. -.-+..-...+...| ++|+..++.+=..+ .+ T Consensus 361 ~~~~l~~~~~~~G~s~~tfg~--~-~~~~TATei~s~~~~~~~t~~~~~~~~e~al-~~l~~~i~~l~~~~~~~~~~~~~ 436 (518) T protein:vir:78 361 MEYFAQKAVSKSGYNPATFNL--G-NREVKATEIWSLQDATVRKIEKKKRLIQNVY-EQMLWDFLYLLTGGTNNKEKAIM 436 (518) T ss_pred HHHHHHHHHHhhCCChhhcCc--c-cccccHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhhcCccccccC Confidence 444444443333 22 23322 1 11111223332221 122334445555555 45766665531111 11 Q ss_pred -ccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHc-CCCCCCCcccccchhhhcCCCcCCCCCCccc Q lcl|NC_017974. 406 -ETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVL-GIDEKIEEDISQEELMKLLGEDTSRAGDGMT 483 (507) Q Consensus 406 -~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~-Gip~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 483 (507) +..-+.|.|+..-.+|.++.++.+++++.+|++.. +.++++.+ +..+.+..++......... ...++.++... T Consensus 437 ~~~~~v~i~f~D~i~~D~~~~~~~~~~~v~aGimS~----e~~i~~~~~~~~deea~~e~~ri~~E~~-~~~~~~p~~~~ 511 (518) T protein:vir:78 437 RDEIRVIIEFPDPMSVNLNELSSTLNNMNSALAMSV----EEKVKLIHPKWEDEEIQAEVKRIYLENA-IGEVPDPEAIG 511 (518) T ss_pred CCceeEEEEeCCCCCCCHHHHHHHHHHHHhcCCCCH----HHHHHHhCCCCCHHHHHHHHHHHHHHhc-ccCCCCCcccc Confidence 11236788999999999999999999999998654 45677653 6643322222211111111 01111111111 Q ss_pred ccCcccc Q lcl|NC_017974. 484 KGSSGNG 490 (507) Q Consensus 484 ~~~~~~~ 490 (507) ...+.-| T Consensus 512 g~~~~~g 518 (518) T protein:vir:78 512 GMETKGG 518 (518) T ss_pred CCCCCCC Confidence 1111111 No 195 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=88.95 E-value=0.029 Score=29.00 Aligned_cols=411 Identities=9% Similarity=-0.003 Sum_probs=152.4 Q ss_pred CccccccccccccccccccCchhhhHH-HHhhhhccc-chhccchh-HHHHHH-----H-hhChHHHH-----HHHHHHH Q lcl|NC_017974. 13 VAKATSEQINVSRMRMSEQGTFALAKV-QVDSERMKA-EEIRWPNL-IGTVES-----M-KQDATVAT-----GLDMLYT 78 (507) Q Consensus 13 ~~~~~~~~~~~~~~~~~~~g~~~~~~~-~~~~~~e~~-~~lr~~~~-~~~y~~-----m-~~D~~v~s-----~l~~Rk~ 78 (507) +-.-+-.. ....-...+-.. +... .+..+.+.. +.+.++.. ...... . ..+..|.. .+.+... T Consensus 1 ~~~~~~~~--~i~~~~~~~~~~-~~~~~~~~~Yy~g~~~I~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~ 77 (470) T protein:vir:10 1 MELDALKK--LIQNTSTSRNDL-INNYKQAVNYYENKTDITTRNNGKAKLNKEGKKDPLRSADNRIPSNFYQLLVDQEAG 77 (470) T ss_pred CchHHHHH--HHHHHHHHHHHH-HHHHHHHHHHhccccchhccccchhcccccccccccccCCcccccchHHHHHHhhhh Confidence 00000000 000000000000 0000 000000000 11111000 000000 0 00112222 2222233 Q ss_pred HHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHH-HHHHhhcceeeeeeeeccCccccCcceeeeeccc Q lcl|NC_017974. 79 FVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDA-ATFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAF 157 (507) Q Consensus 79 av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~-l~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~ 157 (507) -+.+-+..|.+. +.+..+.+.+.++. .|.+.+.++ .++.-+|.+.. .+|-.. +|.+.+.-+.| T Consensus 78 yl~G~p~~~~~~------d~~~~~~l~~~~~~----~~~~~~~~l~~~~~~~G~a~~-~~y~d~-----~~~~~~~~~~p 141 (470) T protein:vir:10 78 YVASVFPDIDVG------KDADNKKIIDVLGD----DRALTLNGLLVDSSNAGRAWL-HYWIDE-----DGNFRYGIIQP 141 (470) T ss_pred heeccceeeecC------chHHHHHHHHHHhh----hHHHHHHHHHHHHhhcCeeEE-EEEecC-----CCceEEEEEcc Confidence 333333333222 23344455555542 355555554 35777898865 556422 34555443332 Q ss_pred cCchhcccccceeecCCCccccc-ccccc----------ccccccccccccccccccC---------------------- Q lcl|NC_017974. 158 RPQASLSRTNPIVYNEDGSAIIG-IKQSL----------SAFQNYTASEIGVGGVSTR---------------------- 204 (507) Q Consensus 158 r~~~~i~~~~~~~~~~d~~~l~~-~~q~~----------~~~~~~~~~~~~~~~~~~~---------------------- 204 (507) . ..+. ++-+.+.+.++. ++-.. .....++............ T Consensus 142 ~--~~~~----v~d~~~~~~~~a~ir~y~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 215 (470) T protein:vir:10 142 D--QITP----IYATTLDNKLLGILRSYKQLDPDSGKYFTVHEYWTDKEAQFFRTNATDSTVIEPYNIITSYDLSAGYET 215 (470) T ss_pred c--ceEE----EEcCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCcEEEEEeecCcceecccccccccccccccccc Confidence 2 1111 110111011110 00000 0000000000000000000 Q ss_pred --CC--ccccccccEEEEeeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchH Q lcl|NC_017974. 205 --MS--DVIIPINRVMLMNTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSS 280 (507) Q Consensus 205 --~~--~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~ 280 (507) .. ...+..-.++.|+ .|..|.|.+..+-...=-=+..+..++..++.++.|+++++- . ..+... T Consensus 216 ~~~~~~~~~~g~vPvv~~~-----nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lvl~g---~----~~~~~~ 283 (470) T protein:vir:10 216 GQSNTLKHNFGRVPFIEFS-----KNKYRLPELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTN---Y----GGADLH 283 (470) T ss_pred ccccccccCCCeeeEEEee-----cCCCCCCchhHHHHHHHHHHHHHHHHHHHHHHhcCcceeeec---C----Cccccc Confidence 00 0000001123333 356788999876554444466788899999999999988761 0 111111 Q ss_pred HHHHHHHHHHHHHHHhhcCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccch Q lcl|NC_017974. 281 PEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKG 360 (507) Q Consensus 281 ~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~ 360 (507) + ....+. ....+.++..-+ -.+..++++...... ..+...++++.+.|.+.--+..++... . T Consensus 284 ~---~~~~~~---------~~~~i~~~~~~~--~~~~~~~~lt~~~~~--~~~~~~~~~L~~~I~~~s~~p~~~~~~--~ 345 (470) T protein:vir:10 284 Q---FMNDLR---------KYKSIKINNTGN--GDNSGVDKLQIDIPV--EARDDALKITRKNIFLFGQGIDPANFE--S 345 (470) T ss_pred h---hhhhhh---------hcCeEeccCCCC--CcCceeEEEeecCCh--HHHHHHHHHHHHHHHHHhCCCCCCccc--c Confidence 1 111111 112344443211 011234444444332 245778999999998776665554422 2 Q ss_pred hhhhHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHH-hCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhc Q lcl|NC_017974. 361 GSYNLSESKQTIHTQFVQR----VNEIILEALNENLLPQLLA-LNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAV 435 (507) Q Consensus 361 GS~Al~~vh~~v~~~~~~a----D~~~i~~~ln~~li~~l~~-lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~ 435 (507) | .+ +.+.-+.+...... -.+.+...|. ++++.+++ +|.. ..+..-..+.|....+.|..+.++.+.++ . T Consensus 346 g-n~-Sg~Alk~~~~~l~~k~~~~~~~~~~~l~-~~~~~i~~~l~~~-~~d~~~i~i~f~~~~p~d~~e~~~~~~~~--~ 419 (470) T protein:vir:10 346 S-NA-SGVAIKMLYSHLELKAAKTQTYFEHAIN-ELVRAIMRYLNFS-DADKRHISQHWTRTKVEDSLTKAQIVSTV--A 419 (470) T ss_pred c-cc-hHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhccc-CcccceeeEEeccCCCCCHHHHHHHHHHH--h Confidence 2 22 22222222222222 3334445553 35555554 3321 11223457889999999999999999887 4 Q ss_pred CCccccHHHHHHHHHHcCCCCCCCcccc--cchhhhcCCCcCCCCCCcccccCccccccc Q lcl|NC_017974. 436 GYLPKTPKVINRVLEVLGIDEKIEEDIS--QEELMKLLGEDTSRAGDGMTKGSSGNGTGK 493 (507) Q Consensus 436 G~~~~~~~~~~~i~e~~Gip~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 493 (507) |. ++ ++-+.+.++.-.....+.. ..+.....+..+. ..+.. ..+.+..+ T Consensus 420 g~-iS----~et~l~~~p~v~D~~~E~eri~~E~~e~~~~~~~-~~~~~---~~~~dde~ 470 (470) T protein:vir:10 420 NY-SS----KEAVAKANPIVDDWQQELKDLAKDKEENDPYSNQ-ADELN---GKGVNDEQ 470 (470) T ss_pred cc-Cc----HHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhcc-ccccC---CCCCCCCC Confidence 64 44 2445566653211111111 1111111111111 11110 11111111 No 196 >protein:vir:7208 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049782;genbank:gi:9632594;genbank:GeneID:1258582 Probab=88.75 E-value=0.03 Score=28.90 Aligned_cols=446 Identities=12% Similarity=0.072 Sum_probs=161.6 Q ss_pred Cc------------------ccccccccccCcccccccccccccccccc-CchhhhHHHHhh-hhcc--cchh-ccchhH Q lcl|NC_017974. 1 MS------------------RKRNRNRSVKVAKATSEQINVSRMRMSEQ-GTFALAKVQVDS-ERMK--AEEI-RWPNLI 57 (507) Q Consensus 1 ~~------------------~k~~~~~~~~~~~~~~~~~~~~~~~~~~~-g~~~~~~~~~~~-~~e~--~~~l-r~~~~~ 57 (507) |+ .++-+.+....+.|.....+.. . ++ +..++.++++.. ...- ...+ -..+++ T Consensus 1 m~~~~L~~~~~w~~~de~~~~~~~~~~~~S~~~p~~~Dga~e---~-~~~~~~~a~~~~g~~~~~~g~~e~~~~~~~eLI 76 (524) T protein:vir:72 1 MKFNVLSLFAPWAKMDERNFKDQEKEDLVSITAPKLDDGARE---F-EVSSNEAASPYNAAFQTIFGSYEPGMKTTRELI 76 (524) T ss_pred CCCchhhHhhccccCcchhhhhhhccCCccccCccCCCCcee---e-eecccccccccceeeeehhcccccccchHHHHH Confidence 11 1111111212222222111100 0 00 000111112111 1111 1111 235799 Q ss_pred HHHHHHhhChHHHHHHHHHHHHHhhCCc---eeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHHHHHhhcceee Q lcl|NC_017974. 58 GTVESMKQDATVATGLDMLYTFVEKAFK---DFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAATFNEYGLSVV 134 (507) Q Consensus 58 ~~y~~m~~D~~v~s~l~~Rk~av~~~~~---~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l~a~~~Gfs~~ 134 (507) +.|++|..++.|-++++.....+...+- .+.+.-.+.+-++.+.+.|.+- |.. |..+|+.--+||..+ T Consensus 77 ~~YR~ma~~pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~ee--------F~~-Il~ll~F~~~~~~~f 147 (524) T protein:vir:72 77 DTYRNLMNNYEVDNAVSEIVSDAIVYEDDTEVVALNLDKSKFSPKIKNMMLDE--------FSD-VLNHLSFQRKGSDHF 147 (524) T ss_pred HHHHHHhhccchhhHHHHhhcceeEecCCCceEEEEecCcCcchHHHHHHHHH--------HHH-HHHHhccchhhhHHH Confidence 9999999999999999988776532211 0111111122223333333222 222 224444444444443 Q ss_pred -----------eeeeeccCccccCcceeeeeccccCchhcccccceeecCCCccccccccccccccccccccccc---cc Q lcl|NC_017974. 135 -----------EKVYTQVTVGEYIGKYKVKNLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGV---GG 200 (507) Q Consensus 135 -----------Ei~w~~~~~~~~~g~~~~~~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~---~~ 200 (507) .++-.... .-+|-. .|....|.++.+++-+....++.. ..+. .-..++.+.++..+. .. T Consensus 148 R~WYVDgRi~fhKiid~k~--pk~GI~---Elr~lDPr~i~~vr~i~~~~~~~~-~vi~-~~~e~f~Y~~~~~~y~~~g~ 220 (524) T protein:vir:72 148 RRWYVDSRIFFHKIIDPKR--PKEGIK---ELRRLDPRQVQYVREIITETEAGT-KIVK-GYKEYFIYDTAHESYACDGR 220 (524) T ss_pred hhheeeeEEEEEEEEeCCC--ccccce---eeeeeCCccceeeeeeccCCCccc-hhhc-chhhheeeccCccccccCcc Confidence 33322110 012333 333334445555544433322221 1111 111222222221111 01 Q ss_pred cccCCCccccccccEEEEeeCCccCc--cccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCc Q lcl|NC_017974. 201 VSTRMSDVIIPINRVMLMNTGGSSSQ--ALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQ 278 (507) Q Consensus 201 ~~~~~~~~~iP~~k~i~~~~~~~~~~--p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~ 278 (507) ......++.||.+ -|+|+|..-.+- -.=.|.|.++..++==-+.....-+.+-=--.-.=.++++. +|..+ T Consensus 221 ~~~~~~~ikI~~d-AI~y~hSGL~d~~~~~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYID--vGnlP---- 293 (524) T protein:vir:72 221 MYEAGTKIKIPKA-AVVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAVVIYRITRAPDRRVWYVD--TGNMP---- 293 (524) T ss_pred ccCCCcceecchh-heeeeeccceeCCCCceeccchhhhHhHHhhhHHHhhHHHHhhhccccceEEEEe--cCCCC---- Confidence 1123456777765 488888543111 12247888888776433333332222210001111244442 22222 Q ss_pred hHHHHHHHHHHHHHHHHhh-cCCcee--------------EeecC-----cccccccccceeeeeccCCCcchhHHHHHH Q lcl|NC_017974. 279 SSPEAEMVRGLMSDAANAH-SGEQSF--------------FMLPS-----DTKDNAPQYSMTLKGIDGMGKQYSTAQLIS 338 (507) Q Consensus 279 ~~~~~~~l~a~~~~~~n~~-~G~~a~--------------~iiP~-----g~~i~~~~~~~el~~~~g~g~~~~~~~li~ 338 (507) .....+.|+.++.-..|=. -.+..| .-+|. |+||.+. .|+.+... ..=|+ T Consensus 294 k~KAeqYl~~im~k~KNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTL---------pGgqnlge-m~DV~ 363 (524) T protein:vir:72 294 ARKAAEHMQHVMNTMKNRVVYDASTGKIKNQQHNMSMTEDYWLQRRDGKAVTEVDTL---------PGADNTGN-MEDIR 363 (524) T ss_pred chhHHHHHHHHHHhcCceeEEeCCCCeeccchhhhhhHhhhcccccCCCcccceeec---------cccCCcCh-HHHHH Confidence 2233345555444332200 000111 11222 5555442 23323322 34589 Q ss_pred HHHHHHHHHHhcccccccccch----hhhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCccc----c Q lcl|NC_017974. 339 DRKKSILDRLGAGFINVGNDKG----GSYNLSESKQTI-HTQFVQRVNEIILEALNENLLPQLLALNDMRLPETE----M 409 (507) Q Consensus 339 ~~d~~Isk~iLGqtLts~~~~~----GS~Al~~vh~~v-~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~----~ 409 (507) |..+.+-+++--..--.+.+.+ +.++..=+..|+ |...+......+...|..-|-..|+-=+.. ..+. . T Consensus 364 YF~kkLy~aLnVP~sRl~~d~~~~f~~gr~~EItRDEikF~KFI~rLR~rFs~~f~~~Lk~qLilKgii--t~eew~~i~ 441 (524) T protein:vir:72 364 WFRQALYMALRVPLSRIPQDQQGGVMFDSGTSITRDELTFAKFIRELQHKFEEVFLDPLKTNLLLKGII--TEDEWNDEI 441 (524) T ss_pred HHHHHHHHHhCCchhhcCCCCCccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCC--CHHHHHHHh Confidence 9999998888776533322211 123322222233 223344444444444443333333321211 1111 1 Q ss_pred ceEEecCc------CchhHHHHHHHHHHHHhcCCccccHHHHHHHHHH-cCCCCCCCcccccchhhhcCCCcCCCCCCcc Q lcl|NC_017974. 410 PYVKAGEI------VDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEV-LGIDEKIEEDISQEELMKLLGEDTSRAGDGM 482 (507) Q Consensus 410 P~~~~~~~------~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~-~Gip~~~~~~~~~~~~~~~~~~~~~~~~~~~ 482 (507) +.+.|+.. |-.+.+.+.+++..|..+--.+-.-...+|+++. |.+.+.+-.++. ..-.....++--+.++ T Consensus 442 ~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~~~-k~I~~E~k~~~~~~~~-- 518 (524) T protein:vir:72 442 NNIKIEFHRDSYFAELKEAEILERRINMLTMAEPFIGKYISHRTAMKDILQMTDEEIEQEA-KQIEEESKEARFQDPD-- 518 (524) T ss_pred hcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHHHHHHhccCHHHHHHHH-HHHHHHhhcCCCCCCc-- Confidence 23333332 2244444555665554431111111235677654 566422111111 0011111110000000 Q ss_pred cccCccccc Q lcl|NC_017974. 483 TKGSSGNGT 491 (507) Q Consensus 483 ~~~~~~~~~ 491 (507) ...++. T Consensus 519 ---~~~~~f 524 (524) T protein:vir:72 519 ---QEQEDF 524 (524) T ss_pred ---hhhhcC Confidence 000111 No 197 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=88.59 E-value=0.031 Score=28.83 Aligned_cols=407 Identities=8% Similarity=-0.002 Sum_probs=157.0 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHH-HHhhhhcc-cchhccchhHHH--HHHHhhChHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKV-QVDSERMK-AEEIRWPNLIGT--VESMKQDATVATGLDML 76 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~-~~~~~~e~-~~~lr~~~~~~~--y~~m~~D~~v~s~l~~R 76 (507) |+.+. + ..-..++-.. .... ....+.+- .+.+....-... -.. ...+...-++... T Consensus 1 l~~~~---------------l---~~~i~~~~~~-~~r~~~l~~yy~g~~~il~~~~~~~~~~~~k-i~~n~~~~ivd~~ 60 (429) T protein:vir:98 1 MTKDL---------------L---SELIQKHRSF-NLSYSAYKQLYEGDHAILQQKQKEQYKPDNR-LVVNFAKYIVDTF 60 (429) T ss_pred CCHHH---------------H---HHHHHHHHHH-HHHHHHHHHHhccccccccccccccCCCcce-eecchHHHHHHHH Confidence 11000 0 0000000000 0000 00000000 011100000000 000 0123344444444 Q ss_pred HHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeeeec Q lcl|NC_017974. 77 YTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVKNL 155 (507) Q Consensus 77 k~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l 155 (507) ..-+.+-+..+.. +++++.+.+.++++.. .+...+.++. ++.-||.+ ++++|.. .+|.+.++-+ T Consensus 61 ~~~l~g~~~~~~~------~~~~~~~~l~~~~~~n---~~~~~~~~~~~~~~~~G~~-~~~v~~d-----~~g~~~~~~~ 125 (429) T protein:vir:98 61 NGYFIGVPVQTSH------ENKQVSNYLELLDGYN---DQDDNNAELSKICSIYGHG-YELVFND-----ENAEAGITYL 125 (429) T ss_pred hhhhcccCceeec------CChHHHHHHHHHHhhc---CHhHHHHHHHHHHhhcCeE-EEEEEec-----CCCcEEEEEE Confidence 4444444333332 2334555666666542 3555555543 57779975 5577753 2466655544 Q ss_pred cccCchhcccccceeecCC-Ccccccccc--ccc---ccccccccccccccccc----CCCcccccccc--EEEEeeCCc Q lcl|NC_017974. 156 AFRPQASLSRTNPIVYNED-GSAIIGIKQ--SLS---AFQNYTASEIGVGGVST----RMSDVIIPINR--VMLMNTGGS 223 (507) Q Consensus 156 ~~r~~~~i~~~~~~~~~~d-~~~l~~~~q--~~~---~~~~~~~~~~~~~~~~~----~~~~~~iP~~k--~i~~~~~~~ 223 (507) .|+. .+. ++-+.. ...+..++- ... ....++........... .....+-|..+ ++.| T Consensus 126 ~p~~--~~~----v~dd~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~----- 194 (429) T protein:vir:98 126 TPLE--AFI----VYDDSIRQKPLFAVRYFYNKGGVLEGSYSDASNITYFKDGEKGIEIGESEPHPFDGVPMIEY----- 194 (429) T ss_pred cccc--eEE----EEeCCCCCceEEEEEEEEecCceEEEEEEeCceEEEEEecCCceEecccccccCCccceEEe----- Confidence 3331 111 110100 101111100 000 00001110000000000 00011111111 2222 Q ss_pred cCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCcee Q lcl|NC_017974. 224 SSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSF 303 (507) Q Consensus 224 ~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~ 303 (507) .+++.|.|.+..+....=--+..+..++...+.+..|+++++ | ...+++.. ..+. . ... T Consensus 195 ~n~~~g~sd~e~v~~liD~~d~~~s~~~~~~~~~~~p~~~i~-----g---~~~~~~~~----~~~~-------~--~~~ 253 (429) T protein:vir:98 195 VENEERQSLLASVVTLINAFNKAISEKANDVEYFADAYLKIL-----G---AELDDETL----KSLR-------D--TRI 253 (429) T ss_pred cCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeee-----c---CCCCcchh----hhHh-------h--Cce Confidence 246788999987655544445577888888999999988875 1 11111111 1111 0 123 Q ss_pred EeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccch-hhhhHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 304 FMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKG-GSYNLSESKQTIHTQFVQRVNE 382 (507) Q Consensus 304 ~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~-GS~Al~~vh~~v~~~~~~aD~~ 382 (507) +.+|.+.. ...+++++..+.. ...+...++.+.+.|.+.--+..++.+..+. +.-|+ .....-....+..-.+ T Consensus 254 ~~~~~~~~---~~~~~~~l~~~~~--~~~~~~~~~~l~~~i~~~s~~p~~~~~~~gn~Sg~Al-~~~~~~l~~k~~~~~~ 327 (429) T protein:vir:98 254 INLKDTDA---QQLTVEFLQKPDA--DATQEHLLDRLENLIFRTAMVANISDESFGTASGIAL-RYRLQAMDNLAKTKER 327 (429) T ss_pred eeccCCCC---CCcceeEEeecCC--HHHHHHHHHHHHHHHHHHhCccccCccccccchHHHH-HHHHHHHHHHHHHHHH Confidence 44444321 1123444443332 2346678889988887776555444322111 11122 1112222233333445 Q ss_pred HHHHHHHHHHHHHHHHhCCCcCCccc--cceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCC-CCCCC Q lcl|NC_017974. 383 IILEALNENLLPQLLALNDMRLPETE--MPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGI-DEKIE 459 (507) Q Consensus 383 ~i~~~ln~~li~~l~~lN~~~~~~~~--~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gi-p~~~~ 459 (507) .+...+. ++++-++.+-...+.... --.+.|....+.|..++++++.+| .|+ ++ .+.+.+.++. +.+.. T Consensus 328 ~~~~~l~-~~~~li~~~~~~~~~~~d~~~i~v~f~~~~p~~~~~~a~~~~kl--~g~-is----~et~~~~l~~v~d~~~ 399 (429) T protein:vir:98 328 KFMSGMN-RRYKLIASYPTSKIGPKDWIGIKYKFTRNLPANLLEESQIAGNL--AGI-VS----EETQVGVLSIVENPQK 399 (429) T ss_pred HHHHHHH-HHHHHHHHHhccCCCccccccceEEeCCCCCcCHHHHHHHHHHH--hcc-Cc----hHHHHHhCCCCCCHHH Confidence 5556664 355556554211111211 135789999999999999999998 455 44 2456677764 32211 Q ss_pred c-ccccchhhhcCCCcCCCCCCcccccCcccccc Q lcl|NC_017974. 460 E-DISQEELMKLLGEDTSRAGDGMTKGSSGNGTG 492 (507) Q Consensus 460 ~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 492 (507) + +....+.... .+.++.. .....+.++.. T Consensus 400 E~~ri~~E~~~~-~~~~~~~---~~~~~~~~~~~ 429 (429) T protein:vir:98 400 EIERKNSDKSTL-ISRQAGG---LNGQNTTTILE 429 (429) T ss_pred HHHHHHHHHHHH-HHHHHhh---hcCCCCCCCCC Confidence 1 0001111100 0000100 00000111110 No 198 >protein:vir:95149 Length: 501 # NCBI annotation: hypothetical protein ORF007 # Family: family:all:584 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293414;genbank:gi:148912835;genbank:GeneID:5228224 Probab=87.09 E-value=0.041 Score=28.19 Aligned_cols=430 Identities=9% Similarity=-0.006 Sum_probs=153.9 Q ss_pred CccccccccccccccccccCchhhhHHHHhhhhcccch-hcc-chh-HHHHHHHh----hChHHHHHHHH---------H Q lcl|NC_017974. 13 VAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEE-IRW-PNL-IGTVESMK----QDATVATGLDM---------L 76 (507) Q Consensus 13 ~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~-lr~-~~~-~~~y~~m~----~D~~v~s~l~~---------R 76 (507) ||..... +. ++. .....|......-.+.. .|. ++. +--+..+. .|..+..-+.. - T Consensus 1 m~~V~~~-----hp---~y~-~~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~e~~~~e~~~~Y~~rl~rA~~~n~~~~t 71 (501) T protein:vir:95 1 MPNVSFI-----RP---ELG-KLLPLYYLIRDAIAGEPTVKGARTTYLPMPNAEDQSKENKARYEAYLKRAVFYNVARRT 71 (501) T ss_pred CCCCCCC-----CH---HHH-HHHHHHHHHHHHhcChHHHHhcccccCcCCCCCCCcccchHHHHHHhhccccCchHHHH Confidence 4322111 10 111 11223333322222221 111 000 00001010 12223222111 1 Q ss_pred HHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHHH-HHhhcceeeeeeeeccCc-cc-c-----Cc Q lcl|NC_017974. 77 YTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAAT-FNEYGLSVVEKVYTQVTV-GE-Y-----IG 148 (507) Q Consensus 77 k~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l~-a~~~Gfs~~Ei~w~~~~~-~~-~-----~g 148 (507) ..++.+. -|.-+|.=+ ....+..|++.+ +..+.++..++++++. ++.||.+.+=+.+-..+. +. . -+ T Consensus 72 ~~~l~G~--vf~k~p~~~-~p~~l~~l~~d~--D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~~~t~a~~~~~ 146 (501) T protein:vir:95 72 LFGLVGQ--VFMRDPVVK-VPALLNPLVANA--TGSGINLTQLAKRAVSLNLAYSRAGLLVDYPTTEAEGGASIADLEAG 146 (501) T ss_pred HHHHhhh--hhcCCccee-CcHHHHHHHhcc--CCCCCCHHHHHHHHHHHHHhcCeEEEEEeecCCCCcccccHHHHHhc Confidence 2222222 222222110 112333333222 2345678899988775 667998876444421110 00 0 00 Q ss_pred ceeeeeccccCchhcccccce----------------eecCCC-ccccccccccc---------ccccccccccc----- Q lcl|NC_017974. 149 KYKVKNLAFRPQASLSRTNPI----------------VYNEDG-SAIIGIKQSLS---------AFQNYTASEIG----- 197 (507) Q Consensus 149 ~~~~~~l~~r~~~~i~~~~~~----------------~~~~d~-~~l~~~~q~~~---------~~~~~~~~~~~----- 197 (507) ... --|....+..|..|+.- +...|+ .+-....|..- ....|...... T Consensus 147 ~~r-Py~~~~~~~~IinW~~~~v~g~~~l~~v~l~E~~~~~d~~f~~~~~~q~RvL~~~~~g~~~~~v~r~~~~~~~~~~ 225 (501) T protein:vir:95 147 RIR-PTLYVYSPTEIINWRTTDRGAEEVLSLVVLFETWCAADDGFEMKTSGQFRVLRLDEEGYYVHEIWREPQPTKADGS 225 (501) T ss_pred cCC-cEEEEecHhhhcCcceeccCCceeeeEEEEEEEEeecCCCcccceeEEEEEEeeCCCceEEEEEEEecCCcccCcc Confidence 000 00111122222222210 001111 11111111000 00001100000 Q ss_pred ----------ccccccCCCccccccccEEEEeeCCccCccccchHHHHHHHHHHHHHHHH-HHHHHHHHHhcCCceEEec Q lcl|NC_017974. 198 ----------VGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYRAWREKILIE-NLEVVGATKDMGGVIELKI 266 (507) Q Consensus 198 ----------~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~-~~w~~f~Er~g~p~~v~~~ 266 (507) ........++..++.=-|+++......-. .+...|..++.. -.++|-. .+.-+-+-.-+.|+++++- T Consensus 226 ~~~~~~~~~~~~~~~~~~g~~~l~~IPfv~~~~~~~~~~-~~~pPLl~lA~l-ni~hy~~ssd~~~~l~~~~~P~l~i~G 303 (501) T protein:vir:95 226 KIPKGNYQQYVVYKPTDAQGKRLTEIPFMFIGSENNDSN-PDNPNFYDLASL-NMAHYRNSADYEESCYIVGQPTPVLIG 303 (501) T ss_pred eecCCcccccceeeeeccCCCcCCeeeEEEEecCCCCCC-CCccchHHHHHH-HHHHHhhhhHHHHHHHHcccceeeeeC Confidence 00000001112222112444332222212 233334444422 2232222 2222333344677777652 Q ss_pred ccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHH Q lcl|NC_017974. 267 PSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILD 346 (507) Q Consensus 267 P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk 346 (507) +- ++..+ .+.. ..+.-|+.+++.+|+|.+.. +++.++++ . .+..++....+|.. T Consensus 304 ~~--------~~~~~------~~~~--~~i~~G~~~~~~lP~~~~~~-------~ie~~~~~--i-~~~~l~~l~~~m~~ 357 (501) T protein:vir:95 304 LT--------EEWVT------NVLK--GSVNFGSRGGIPLPVGADAK-------LLQASENT--M-LKEAMDTKERQMVA 357 (501) T ss_pred Cc--------ccccc------cCCC--CceeecccccccCCCCCcee-------EEecChhh--H-HHHHHHHHHHHHHH Confidence 10 00000 0000 12334788899999988654 44444432 2 35567777777654 Q ss_pred HHhccccccc-ccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCccccceEEe--cCcCchhHH Q lcl|NC_017974. 347 RLGAGFINVG-NDKGGSYNLSESKQTIHTQFVQRVNEIILEALNENLLPQLLALNDMRLPETEMPYVKA--GEIVDVDME 423 (507) Q Consensus 347 ~iLGqtLts~-~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~~P~~~~--~~~~~~dl~ 423 (507) +- ...++.. ..++++-+..+ ..-...++.+-+..+++.++ +++++++.+-+. .+.. ++|.+ +......-. T Consensus 358 ~G-a~ll~~~~~~~Ta~~~~~~--~~~~~S~L~~~a~~le~al~-~~l~~~a~w~g~--~~~~-~~v~i~~df~~~~~~~ 430 (501) T protein:vir:95 358 LG-AKLVEQKEVQRTATEAELE--AASEGSTLSSATKNVSAAFE-WALKWAARWVGQ--ADSG-VKFELNTDFDIARMTP 430 (501) T ss_pred HH-HhhccCCccchhHHHHHHH--HHHHhHHHHHHHHHHHHHHH-HHHHHHHHHcCC--CCCc-eEEEEecccccccCCH Confidence 32 3333332 22345544332 33335677888899999997 488999887642 1221 23332 222222123 Q ss_pred HHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhhcCCCcCCCCCCcccccCcccccccccCCCCchhh Q lcl|NC_017974. 424 GFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMKLLGEDTSRAGDGMTKGSSGNGTGKISSTRDNSAA 503 (507) Q Consensus 424 ~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~a~ 503 (507) ..++++.++...|.+.. .....+++ +.|++.+...++..............+. .+.... + -+... T Consensus 431 ~~~~al~~~~~~G~is~-~t~~~~L~-~~~v~~~~~~~e~e~i~~~~~~~~~~~~---~~~~~~---~-------~~gg~ 495 (501) T protein:vir:95 431 DERRSLVEEWQKGAITF-EEMRTGLR-KAGVATEDDSKAKEKIAKDTAEAMALAT---PANVPG---D-------GSGGD 495 (501) T ss_pred HHHHHHHHHHhCCCCcH-HHHHHHHH-hCCCCChhHHHHHHHHHhhhcCcccccc---cCCCCC---C-------Ccccc Confidence 44677788888888544 33344454 4688765333221111100000000000 000000 0 00111 Q ss_pred hccC Q lcl|NC_017974. 504 NLDN 507 (507) Q Consensus 504 ~~~~ 507 (507) |+.| T Consensus 496 ~~~~ 499 (501) T protein:vir:95 496 NVGN 499 (501) T ss_pred cccC Confidence 2223 No 199 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=86.87 E-value=0.043 Score=28.10 Aligned_cols=432 Identities=10% Similarity=0.016 Sum_probs=155.1 Q ss_pred Ccccccccccc-----cCccccc-cccccccccccccCchhhhHHH-Hhhhhc--ccchhccchhHHHHH---HH-hhCh Q lcl|NC_017974. 1 MSRKRNRNRSV-----KVAKATS-EQINVSRMRMSEQGTFALAKVQ-VDSERM--KAEEIRWPNLIGTVE---SM-KQDA 67 (507) Q Consensus 1 ~~~k~~~~~~~-----~~~~~~~-~~~~~~~~~~~~~g~~~~~~~~-~~~~~e--~~~~lr~~~~~~~y~---~m-~~D~ 67 (507) |+.-+-+.... .+++... ........-+.++-+.-...+. ...+.+ +.+.+.+.....-+. .. ...+ T Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~yY~g~~~~i~~~~~~~~~~~~~~~~ki~~n 85 (481) T protein:vir:10 6 INNINTKFSPLANDDFVVSDLAELLKEENLRNFISRHQTEQVPRLEMLESYYLNRNTDILAGERRLQKYGDKADHRAVHN 85 (481) T ss_pred eehhchhcccccCceeeeecchhhcCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCccccccccccccceeecc Confidence 22221111111 1111110 0000000001111001111111 111111 111111111100000 00 1234 Q ss_pred HHHHHHHHHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHH-HHHHhhcceeeeeeeeccCcccc Q lcl|NC_017974. 68 TVATGLDMLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDA-ATFNEYGLSVVEKVYTQVTVGEY 146 (507) Q Consensus 68 ~v~s~l~~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~-l~a~~~Gfs~~Ei~w~~~~~~~~ 146 (507) +..-++.+...-+.+-+..+.+. +.+..+.+.+++... .|...+.++ .++.-+|.+. +++|... T Consensus 86 ~~~~ivd~~~~~l~g~~~~~~~~------d~~~~~~l~~~~~~n---~~~~~~~~~~~~~~~~G~~~-~~~~~d~----- 150 (481) T protein:vir:10 86 YAKYVSRFIVGYLTGNPITITHQ------DNQTNDKIIELNDLN---DADEVNSDLALNLSIYGRAY-EIVYRDF----- 150 (481) T ss_pred hHHHHHHHHHhhhccCCceEecC------ChhHHHHHHHHHHhc---ChhHHHHHHHHHHHhcCeEE-EEEEeCC----- Confidence 44444554444454444444432 223344555555442 355555554 4677899665 4666532 Q ss_pred CcceeeeeccccCchhccccc------------ceee-cCCCccccccccc-cccccccccccccccccccCCCcc-ccc Q lcl|NC_017974. 147 IGKYKVKNLAFRPQASLSRTN------------PIVY-NEDGSAIIGIKQS-LSAFQNYTASEIGVGGVSTRMSDV-IIP 211 (507) Q Consensus 147 ~g~~~~~~l~~r~~~~i~~~~------------~~~~-~~d~~~l~~~~q~-~~~~~~~~~~~~~~~~~~~~~~~~-~iP 211 (507) +|.+.+.-+.|+. .+..|. .+.. +.++..+..+.-. +.....+................+ .|| T Consensus 151 dg~~~i~~~~p~~--~~~v~d~~~~~~~~~~i~~~~~~~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~g~vP 228 (481) T protein:vir:10 151 EDRDTFKVLDPKS--TFVVYDQTLDKKVVAGVRYFEKQDKDKVPVQHVEVYTTDKIYYIEIKGGTYHRVEEVEHYYNDVP 228 (481) T ss_pred CCeEEEEEEcccc--eEEEEcCCCCCceEEEEEEEEEeeCCCceEEEEEEEecCeEEEEEecCCceeecccccccCCcee Confidence 4555555443321 111000 0000 1111111110000 000000000000000000000000 122 Q ss_pred cccEEEEeeCCccCccccchHHHHHHHHHHH-HHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHH Q lcl|NC_017974. 212 INRVMLMNTGGSSSQALGVSPLVGCYRAWRE-KILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLM 290 (507) Q Consensus 212 ~~k~i~~~~~~~~~~p~G~gll~~~~~~~~f-K~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~ 290 (507) ++.|. .++.|.|.+..+- ..+- =+..+...+..++.+..|+.+++- ..... ++ . . ..+. T Consensus 229 ---vv~~~-----n~~~g~~~~~~v~-~lida~~~~~s~~~~~~~~~~~~~~~~~g---~~~~~---~~-~-~---~~~~ 288 (481) T protein:vir:10 229 ---IIEYL-----NDQFKQGDFENVI-ALIDLYDSAQSDTANYMTDLNDAMLAIIG---NVDLD---SE-D-A---KAFR 288 (481) T ss_pred ---EEEee-----cCCCCCCchhhHH-HHHHHHHHHHHHHHHHHHHhcCceeEeec---CcCCC---cc-c-h---hhhh Confidence 22222 3567888887533 3332 233556677778888988887652 11111 11 1 0 1111 Q ss_pred HHHHHhhcCCceeEeecCcccccc--cccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccch--hhhhHH Q lcl|NC_017974. 291 SDAANAHSGEQSFFMLPSDTKDNA--PQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKG--GSYNLS 366 (507) Q Consensus 291 ~~~~n~~~G~~a~~iiP~g~~i~~--~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~--GS~Al~ 366 (507) . + ..+.+|.+..... ...+++++..+.. ...+...++.+.+.|...--...++.+..++ +.-|+ T Consensus 289 ~-------~--~~~~~~~~~~~~~~~~~~~~~~l~~~~~--~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al- 356 (481) T protein:vir:10 289 D-------A--NMIHLEPGTNANGSEGKAEVKYVYKQYD--VAGVEAYKKRLQNDIHKYTNTPDLNDEQFSGVQSGESM- 356 (481) T ss_pred h-------c--cceeccccccccCCCCCcceeEEeecCC--HHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHH- Confidence 0 0 1122222211110 0112334333322 2345677888888887765444444332111 11222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-CCCcCCc--cccceEEecCcCchhHHHHHHHHHHHHhcCCccccHH Q lcl|NC_017974. 367 ESKQTIHTQFVQRVNEIILEALNENLLPQLLAL-NDMRLPE--TEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPK 443 (507) Q Consensus 367 ~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~l-N~~~~~~--~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~ 443 (507) +....-....+..-.+.+...+. ++++.++.+ |...... ..-..+.|....+.|..+.++++.+|+ |. ++. T Consensus 357 ~~~~~~l~~k~~~~~~~~~~~l~-~~~~li~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~~a~~~~kl~--g~-is~-- 430 (481) T protein:vir:10 357 KYKLFGLEQVRAIKERLFKKGLM-KRYKLLLNNVNLTGLKQHNYAELTITFTPNLPKSMMESINAFNALS--GG-VSE-- 430 (481) T ss_pred HHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhccCCCccccceeeEEeCCCCCcCHHHHHHHHHHHh--cc-CCh-- Confidence 11122223333334455666664 456666554 2111111 123478899999999999999999985 64 442 Q ss_pred HHHHHHHHcCC-CCCCCcccc--cchhhhcCCCcCC----CCCCcccccCcccc Q lcl|NC_017974. 444 VINRVLEVLGI-DEKIEEDIS--QEELMKLLGEDTS----RAGDGMTKGSSGNG 490 (507) Q Consensus 444 ~~~~i~e~~Gi-p~~~~~~~~--~~~~~~~~~~~~~----~~~~~~~~~~~~~~ 490 (507) +.+.+.++. ..+. ++.. ..+.....+.... .+.+.......++| T Consensus 431 --et~~~~l~~i~d~~-~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~dd~~g 481 (481) T protein:vir:10 431 --STRLSLLDFIDNPK-EELEKMQEEEAQREKQADKRGYGEAFENHLNVDDSNG 481 (481) T ss_pred --HHHHHhCCCCCCHH-HHHHHHHHHHHHHHhhhhhccCCccCCCCCCCCCCCC Confidence 345555654 2211 1110 0111000000000 00000011112222 No 200 >protein:vir:95542 Length: 548 # NCBI annotation: Putative portal protein # Family: family:all:47 # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293348;genbank:gi:148912769;genbank:GeneID:5228194 Probab=85.94 E-value=0.049 Score=27.76 Aligned_cols=459 Identities=11% Similarity=-0.013 Sum_probs=177.2 Q ss_pred Ccc----------cccccccccCc-cccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHh-hChH Q lcl|NC_017974. 1 MSR----------KRNRNRSVKVA-KATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMK-QDAT 68 (507) Q Consensus 1 ~~~----------k~~~~~~~~~~-~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~-~D~~ 68 (507) |+= .+--++..... ...-......+ +..-... ... .. .+-... +..+..--++|- .+++ T Consensus 1 Mn~iDr~i~~~sP~~a~~R~~ar~~~~~y~aa~~~r-~~~~~~~-~~s-~~----~~i~~~--~~~lr~RaRdL~rNn~~ 71 (548) T protein:vir:95 1 MNLIDRLLEPLAPELVARRLAAREAIQAYEAARPGR-THKAKRQ-PLG-AD----TSLQKS--AVSMREQCRKLDEDHDL 71 (548) T ss_pred CchHHhHhhhcchHHHHHHHHhHHHhccccccCccc-cccccCC-CCC-hH----HHHHHH--HHHHHHHHHHHHhcChH Confidence 211 11100000000 00000000000 0000000 000 00 000000 111222234443 5999 Q ss_pred HHHHHHHHHHHHhhC-CceeeeCC-C-CChhHHHHHHHHHHHHHhc-------cCCCHHHHHHHHHHH-Hhhcceeeeee Q lcl|NC_017974. 69 VATGLDMLYTFVEKA-FKDFKVIP-G-ESEESKKAAKFVEYCLKNM-------EGQTLRQFARDAATF-NEYGLSVVEKV 137 (507) Q Consensus 69 v~s~l~~Rk~av~~~-~~~~~~~~-~-~~~~~~~~ae~v~~~l~~~-------~~~~~~~~l~~~l~a-~~~Gfs~~Ei~ 137 (507) +.++++.....|-+. .+.+++.+ + +...+++..+.|+..++.- ...+|..+...++.+ +.-|=+++-+. T Consensus 72 a~~av~~~~~nvVG~~G~~i~p~~l~~d~~~a~~l~~~ie~~w~~Wa~~~D~~g~~~f~~lq~l~~R~~~~dGE~f~~~~ 151 (548) T protein:vir:95 72 VTGLLDRLEERVVGGSGIGVEPLPLRLDGSVHAELAMEIRSAWAEWSLSPETSGELTRPQVERLMCRTWLRDGEGLAQKL 151 (548) T ss_pred HHHHHHHHHHhccCccccceeeeecCCCHHHHHHHHHHHHHHHHHhhcCccccccCCHHHHHHHHHHHHHhCCceEEEee Confidence 999999988888652 34444433 2 2233444555555444321 123577777766654 55788888888 Q ss_pred eeccCccccCcceeeeeccccCchhccc---------ccceeecCCCccccccccccccccccccccccccccccCCCcc Q lcl|NC_017974. 138 YTQVTVGEYIGKYKVKNLAFRPQASLSR---------TNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDV 208 (507) Q Consensus 138 w~~~~~~~~~g~~~~~~l~~r~~~~i~~---------~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~ 208 (507) |........ |....-+|...+++-|.. ..-+.|+.+|+-+-++- ...++++... ......-. T Consensus 152 ~~~~~~~~~-g~~~~~~lqliepd~l~~~~~~~~~~i~~GIE~D~~Grp~aY~i------~~~hPgd~~~--~~~~~~~~ 222 (548) T protein:vir:95 152 MGRVPNYTF-ATSVPFALELLEPDYLPFSYNNLSKGIVQGIERDTWRRKRAYHL------LKDHPGNLQT--LGGSLAVK 222 (548) T ss_pred ecccccccC-CcccceEEEEechhhcCCCCCCCCCceeeeeEECCCCceEEEEE------eecCCCcccc--ccccccee Confidence 876542111 111111122222222211 12234444443332221 1111221111 11122345 Q ss_pred ccccccEEEEee-CCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHH Q lcl|NC_017974. 209 IIPINRVMLMNT-GGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVR 287 (507) Q Consensus 209 ~iP~~k~i~~~~-~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~ 287 (507) .+|... |+|.+ ..+.+..-|.++|.++.....--.......++-..--.+=..+++.+..-+........ +.. T Consensus 223 rvpA~~-VlHif~~~r~gQ~RGvs~lapvl~~l~~l~~y~dael~~aki~A~~a~fi~~~~~~~~~~~~~~~-~~~---- 296 (548) T protein:vir:95 223 RVEAER-IIHIAYRKRIGQNRGVPMLHAVLIRLADLKDYEESERVAARISAALAMYIKKGNPDSYTVEPGKD-RKN---- 296 (548) T ss_pred eechhH-heecccccCCccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCCccccCCCCcc-ccc---- Confidence 577765 55555 45578888999999887655433333332222221111111122322110000000000 000 Q ss_pred HHHHHHHHhhcCCceeE-eecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhc--ccccccccchhhhh Q lcl|NC_017974. 288 GLMSDAANAHSGEQSFF-MLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGA--GFINVGNDKGGSYN 364 (507) Q Consensus 288 a~~~~~~n~~~G~~a~~-iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLG--qtLts~~~~~GS~A 364 (507) ... .+..| ..+ .++.|.++++. ..+..+ ..|..|.+..-+.|+..+.- +.||. +- .+||| T Consensus 297 ~~~----~~~pG--~iv~~L~pGe~i~~~-------~p~~p~--~~~~~f~~~~lr~IAaglGipYe~ltg-D~-s~nYS 359 (548) T protein:vir:95 297 RTI----PIAPG--MVFDDLEPGEDVGMI-------ESNRPN--PFLEGFRNGQLRMIGAGTRSTYSSVSR-AY-DGTYS 359 (548) T ss_pred ccc----cccCC--ccccccCCCceeeec-------CCCCCC--CCHHHHHHHHHHHHHhhcCCCHHHHhc-cc-chhHH Confidence 000 01112 222 25667765543 333222 23677889999999887622 23443 33 35777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---hCCC-cCCc----cccceEEe--cCcCchhHHHHHHHHHHHHh Q lcl|NC_017974. 365 LSESKQTIHTQFVQRVNEIILEALNENLLPQLLA---LNDM-RLPE----TEMPYVKA--GEIVDVDMEGFSKAIQRIGA 434 (507) Q Consensus 365 l~~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~---lN~~-~~~~----~~~P~~~~--~~~~~~dl~~~a~~~~~L~~ 434 (507) .+-.-..-+....+....++...+-+-+...+++ +++. ..|. ..+-...+ ......|..+-+++...++. T Consensus 360 S~R~~l~e~~r~~~~~q~~~i~~~~~Pi~~~wle~a~l~G~i~lP~~~~~~~~~~~~W~~P~~~~iDP~Kea~A~~~~i~ 439 (548) T protein:vir:95 360 AQRQELVEGWLGYDLLQHEFIDYWCRPVYRSWLQMYLLARKERLPADVDHRTLYAAVYQGPVMPWINPMHEANAWELLVK 439 (548) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCCCCchhheeeeeecCCccccChHHHHHHHHHHHH Confidence 5544443333444444444443343333333322 3321 1111 11112222 22334676777778888888 Q ss_pred cCCccccHHHH----------------HHHHHHcCCCCCCCcccccch-hhhcCCCcCCCCCCcccccCcc--------- Q lcl|NC_017974. 435 VGYLPKTPKVI----------------NRVLEVLGIDEKIEEDISQEE-LMKLLGEDTSRAGDGMTKGSSG--------- 488 (507) Q Consensus 435 ~G~~~~~~~~~----------------~~i~e~~Gip~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~--------- 488 (507) +|+.....+.. ....+.+||+-+.+....... .+.....++.....+.+..+.+ T Consensus 440 ~Gl~T~~~~~a~~G~D~~ev~~q~a~E~~~~~~~GL~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 519 (548) T protein:vir:95 440 AGFADEAEVARARGRDPRELKKSRETEIKANRAAGLVFSSDAYHQLVKSGMDPVEAVQKVYLGVGKMLTADEARELVNRY 519 (548) T ss_pred cCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCCcccccccccccCCCCchhhhccccccccccchhHHhhccC Confidence 88764432211 123466888643221110000 0000000010111111111100 Q ss_pred -----------------cccccccCCCCc Q lcl|NC_017974. 489 -----------------NGTGKISSTRDN 500 (507) Q Consensus 489 -----------------~~~~~~~~~~d~ 500 (507) -+..-+|..+|| T Consensus 520 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 548 (548) T protein:vir:95 520 GAGLPVPGPDFPNESNNGGADGQPSNPDP 548 (548) T ss_pred CCCCcCCCCCCCcccccCCCCCCCCCCCC Confidence 011122333333 No 201 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=84.65 E-value=0.059 Score=27.33 Aligned_cols=419 Identities=8% Similarity=-0.001 Sum_probs=150.0 Q ss_pred Cccccccccc-----ccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRS-----VKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDM 75 (507) Q Consensus 1 ~~~k~~~~~~-----~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~ 75 (507) +++-+.+.+. -+.-......+........+. .+.-| . -.....-++.. T Consensus 49 i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~----------------~~~~k---i--------~~n~~k~Iv~~ 101 (511) T protein:vir:99 49 IEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEY----------------MADNR---V--------AHDYASYISDF 101 (511) T ss_pred HHHHHHhhHHHHHHHHHHhcccCccccccCcccccc----------------cCcce---e--------ecchHHHHHHH Confidence 2211111000 000000000000000000000 00000 0 01222222233 Q ss_pred HHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeeee Q lcl|NC_017974. 76 LYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVKN 154 (507) Q Consensus 76 Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~ 154 (507) ...-+.+-+..|.. .++++.+++.++++.- .+......+. ++.-||.+ ++++|... +|.+.++- T Consensus 102 ~~~yl~g~p~~~~~------~d~~~~~~l~~~~~~n---~~~~~~~~~~~~~~i~G~a-~~~vy~de-----d~~~~i~~ 166 (511) T protein:vir:99 102 INGYFLGNPIQYQD------DDKDVLEAIEAFNDLN---DVESHNRSLGLDLSIYGKA-YELMIRNQ-----DDETRLYK 166 (511) T ss_pred HHhhhcccCceeec------CchHHHHHHHHHHhhc---CHhHHHHHHHHHHHhcCee-EEEEEeCC-----CCceEEEE Confidence 33333344333332 1234556777776542 3555655554 57779965 56777532 45566554 Q ss_pred ccccCchhcccccceeecCC--Cccccccccccc------------ccccccccccccc-----------ccccCCCccc Q lcl|NC_017974. 155 LAFRPQASLSRTNPIVYNED--GSAIIGIKQSLS------------AFQNYTASEIGVG-----------GVSTRMSDVI 209 (507) Q Consensus 155 l~~r~~~~i~~~~~~~~~~d--~~~l~~~~q~~~------------~~~~~~~~~~~~~-----------~~~~~~~~~~ 209 (507) +.|+. ++. .|+.. +..+..++.... ....++.....-. .......... T Consensus 167 ~~p~~--~~~-----vyd~~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 239 (511) T protein:vir:99 167 SDAMS--TFV-----IYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHS 239 (511) T ss_pred Eccce--eEE-----EEcCCCCCceEEEEEEEEeeecccCccceEEEEEEEeCCcEEEEEecCCccccccccccccccCC Confidence 44331 111 11111 111111111000 0001111110000 0000000111 Q ss_pred cccccEEEEeeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHH- Q lcl|NC_017974. 210 IPINRVMLMNTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRG- 288 (507) Q Consensus 210 iP~~k~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a- 288 (507) +..-.++.|+ .|+.|.|.+..+-...=--+..+..++..++.|..|+++++-- . ..+.++.....+. T Consensus 240 ~g~vPvv~~~-----nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~---~----~~~~~~~~~~~~~~ 307 (511) T protein:vir:99 240 FERMPITEFS-----NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGN---L----NLDPVEVRKQKEAN 307 (511) T ss_pred CCccceEEec-----CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhchhhhhccC---c----ccCchhhccccccc Confidence 1111133333 3567888888765443334556778888889899888876511 1 0111111100000 Q ss_pred HHHHHHHhhcCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccch--hhhhHH Q lcl|NC_017974. 289 LMSDAANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKG--GSYNLS 366 (507) Q Consensus 289 ~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~--GS~Al~ 366 (507) +........ ....+.-...+. +++++..+. ....+..+++++.+.|...--...++.+...+ +.-|+. T Consensus 308 ~~~~~~~~~-~~~~~~~~~~~~-------d~~~l~~~~--~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~gn~Sg~Alk 377 (511) T protein:vir:99 308 VLFLEPTVY-ADSEGRETEGSV-------DGGYIYKQY--DVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMK 377 (511) T ss_pred ceecccccc-cccccccCCCCc-------ceeEEeecC--CHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHH Confidence 000000000 000000011122 233333322 22346778899999887766555444432111 111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---CCCc-CCc-cccceEEecCcCchhHHHHHHHHHHHHhcCCcccc Q lcl|NC_017974. 367 ESKQTIHTQFVQRVNEIILEALNENLLPQLLAL---NDMR-LPE-TEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKT 441 (507) Q Consensus 367 ~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~l---N~~~-~~~-~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~ 441 (507) ....-....+..-.+.+...|++ +++.++.+ +... .+. ..-.++.|....+.|..+.++.+.+|+ |+ ++. T Consensus 378 -~~~~~l~~ka~~k~~~~~~~l~~-~~~li~~~~~~~~~~~~~~~~~~i~i~f~~~~p~n~~e~~~~~~kl~--Gi-iS~ 452 (511) T protein:vir:99 378 -YKLFGLEQRTKTKEGLFTKGLRR-RAKLLETILKNTRSIDVSKDFNTVRYVYNRNLPKSLIEELKAYIDSG--GK-ISQ 452 (511) T ss_pred -HHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhcCCcccccccccceEEeCCCCCcCHHHHHHHHHHHh--cc-CCH Confidence 11112222233334555556643 44554443 2111 111 112478898888999999999999885 65 342 Q ss_pred HHHHHHHHHHcCC-CCCCCc-ccccchh---hhcCCCcCCCCCCcccccCcccccccccCCCCchh Q lcl|NC_017974. 442 PKVINRVLEVLGI-DEKIEE-DISQEEL---MKLLGEDTSRAGDGMTKGSSGNGTGKISSTRDNSA 502 (507) Q Consensus 442 ~~~~~~i~e~~Gi-p~~~~~-~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~a 502 (507) +.+.+.++. +.+..+ +....+. .+..-......+++.........+. +..|... T Consensus 453 ----et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~d~~e 511 (511) T protein:vir:99 453 ----TTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKNMYQDPRNINDDEQDDSTK---DSIDKKE 511 (511) T ss_pred ----HHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCCCCCCCCc---CcccccC Confidence 345555533 322111 0000000 0000000000000000000000000 0001111 No 202 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=84.57 E-value=0.059 Score=27.31 Aligned_cols=417 Identities=8% Similarity=0.012 Sum_probs=157.3 Q ss_pred CcccccccccccCcc------ccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAK------ATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLD 74 (507) Q Consensus 1 ~~~k~~~~~~~~~~~------~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~ 74 (507) +++-+.+ +..+..+ .....+...+ +..+.+ .+.-| ....+..-++. T Consensus 48 i~~~~~~-~~~r~~~l~~yY~g~~~~i~~~~-~~~~~~---------------~~~~k-----------i~~n~~k~Ivd 99 (501) T protein:vir:27 48 INHHKLR-QAPRIQELLDYARGENHDVLQFG-RRKDRE---------------MADKR-----------AVHNYGRMISK 99 (501) T ss_pred HHHHHHH-HHHHHHHHHHHhcCCCccccccC-ccCccc---------------cccce-----------eccchHHHHHH Confidence 1100000 0000000 0000000000 000000 00000 01233333344 Q ss_pred HHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeee Q lcl|NC_017974. 75 MLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVK 153 (507) Q Consensus 75 ~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~ 153 (507) ....-+.+-+..+... +.+..+.+.+++.+++... .|...+.++. ++.-||.+ ++++|... +|.+.++ T Consensus 100 ~~~~yl~g~p~~~~~~--d~~~~~~~~~~l~~~~~~n---~~~~~~~~~~~~~~~~G~a-~~~vy~de-----d~~~~i~ 168 (501) T protein:vir:27 100 FKTGYLAGNPIRVEYD--DNDNNSQNDDTIKRIGRIN---DIDSHNRTLIRDLSQTGRA-YEVIYRNE-----YDETRIK 168 (501) T ss_pred HHhhhhcccCeeEecC--CccchHHHHHHHHHHHHhc---ChhHHHHHHHHHHhhCCeE-EEEEEeCC-----CCceEEE Confidence 4444444444444433 2334456677777776542 4666766654 57779986 56787643 4555555 Q ss_pred eccccCchh----------cccccceeecCCCccccccc-cccccccccccccccccccccCCCcc-ccccccEEEEeeC Q lcl|NC_017974. 154 NLAFRPQAS----------LSRTNPIVYNEDGSAIIGIK-QSLSAFQNYTASEIGVGGVSTRMSDV-IIPINRVMLMNTG 221 (507) Q Consensus 154 ~l~~r~~~~----------i~~~~~~~~~~d~~~l~~~~-q~~~~~~~~~~~~~~~~~~~~~~~~~-~iP~~k~i~~~~~ 221 (507) .+.|+.-.. +..++.+....+......+. -.+...+.+...... .......... .|| ++.| T Consensus 169 ~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~~~~vyt~~~v~~~~~~~~~-~~~~~~~~~~g~vP---vv~~--- 241 (501) T protein:vir:27 169 RLNPLETFVIYDNSLEDNSIAAVRYYNRGTLQNAKDVVEIYTNEHIYTLDASDDF-NEISVTTHAFGTVP---ITEF--- 241 (501) T ss_pred EEccceeEEEecCCCCCceEEEEEEEEeeecCCcEEEEEEEeCCeEEEEEeCCce-eeccccccCCCccc---EEEe--- Confidence 443331100 00000000000000000000 000011000000000 0000001111 122 2333 Q ss_pred CccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCc Q lcl|NC_017974. 222 GSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQ 301 (507) Q Consensus 222 ~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~ 301 (507) .+|+.|.|.+..+....=--+..+..++...+.+..|+++++-. .. ...++....+... T Consensus 242 --~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~---~~----~~~~~~~~~~~~~------------ 300 (501) T protein:vir:27 242 --LNNVDGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGD---LA----LPKGMQASDMKRT------------ 300 (501) T ss_pred --cCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecC---cc----CCcccchhhhhhc------------ Confidence 24678899998755444344556777888888889988887621 11 1111111111110 Q ss_pred eeEeecCccccc--ccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHH---HHHHHHHH Q lcl|NC_017974. 302 SFFMLPSDTKDN--APQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSES---KQTIHTQF 376 (507) Q Consensus 302 a~~iiP~g~~i~--~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~v---h~~v~~~~ 376 (507) ..+.++.+.... .....++++..+- ....+..+++.+.+.|.+.--...++.+.. +| .+.|.. ...-.... T Consensus 301 ~~~~~~~~~~~~~~~~~~~~~~l~~~~--~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~-~~-n~Sg~Al~~~~~~l~~k 376 (501) T protein:vir:27 301 RLMQLKPPKSADGKEGTVKAEYLTKSY--DVSGAEAYKTRLNRDIHIFTNIPDMSDTNF-SG-NTSGEALKYKLFGLDQD 376 (501) T ss_pred CceeecccccccCCCCCcceeeeeccC--CHHHHHHHHHHHHHHHHHHhCCcccCcccc-cc-CchHHHHHHHHHHHHHH Confidence 011111110000 0001223333222 122367788999999888765555554322 11 111211 11112233 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHh---CCCcC-CccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHc Q lcl|NC_017974. 377 VQRVNEIILEALNENLLPQLLAL---NDMRL-PETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVL 452 (507) Q Consensus 377 ~~aD~~~i~~~ln~~li~~l~~l---N~~~~-~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~ 452 (507) +..-.+.+...|. ++++-++++ +.... .+..-..+.|....+.+..+.++++.+|+ |+ ++ .+-+.+.+ T Consensus 377 a~~~~~~~~~~l~-~~~~li~~~~~~~~~~~~~d~~~i~v~f~~~~p~n~~e~ad~~~kl~--g~-iS----~et~l~~l 448 (501) T protein:vir:27 377 RVDTQSQFTQGLK-RRYRLAARIGSLVNEFKDFDESLLKITFTPNLPKSLNEQVSILTGLG--GQ-VS----QETALSLS 448 (501) T ss_pred HHHHHHHHHHHHH-HHHHHHHHHHhhcccccccccccceEEeCCCCCcCHHHHHHHHHHHh--cc-Cc----HHHHHHhC Confidence 3334455556664 355555543 21111 11122468899999999999999999984 65 33 23455555 Q ss_pred C-CCCCCCc-ccccchhh-----hcCCCcCCCCCCcccc--cCcccccccccC Q lcl|NC_017974. 453 G-IDEKIEE-DISQEELM-----KLLGEDTSRAGDGMTK--GSSGNGTGKISS 496 (507) Q Consensus 453 G-ip~~~~~-~~~~~~~~-----~~~~~~~~~~~~~~~~--~~~~~~~~~~~~ 496 (507) + ++.+..+ +....+.. ...++-....+..... .+.+.+...+.+ T Consensus 449 ~~v~D~~~E~eri~~E~~e~~~~~~~~~~~~~~~~~~d~~~~~~~d~~e~~~~ 501 (501) T protein:vir:27 449 GLVESPNEELDKINKEVSEIDFKGYSNDFNEHVGKYTDEVKETHTDDFERAYE 501 (501) T ss_pred CCCCCHHHHHHHHHHHHHhhhHhhhcCccccccccccCCCCCCccccccccCC Confidence 3 3322111 00000100 0111111111111110 011111111111 No 203 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=84.14 E-value=0.063 Score=27.17 Aligned_cols=433 Identities=9% Similarity=0.009 Sum_probs=149.9 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTFV 80 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~av 80 (507) .++.+.+++ ++...-. +.=.+++..+ ........ ....+..| ...+...-++.+...-+ T Consensus 31 i~~~~~~~~----~r~~~l~----~YY~g~~~~i-~~~~~~~~-~~~~~~~k-----------i~~n~~~~Iv~~~~~~l 89 (506) T protein:vir:94 31 ITHHFNYQR----PRLEMLD----DYYQGYNLKI-LDKQSRRH-EDGKADHR-----------ATHSFAKYIADFQTSYS 89 (506) T ss_pred HHHHHHHHH----HHHHHHH----HHhcCCCccc-cccccccc-cccCCcce-----------eecchHHHHHHHhhhhh Confidence 111111111 0000000 0000000000 00000000 00000000 12344444455555555 Q ss_pred hhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHH-HHHHHhhcceeeeeeeeccCccccCcceeeeeccccC Q lcl|NC_017974. 81 EKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARD-AATFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAFRP 159 (507) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~-~l~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~r~ 159 (507) .+-+..|.+. ++...+.+.++++.. .|...+.. ..++..+|.+. +.+|.. .+|.+.+.-+.|+. T Consensus 90 ~G~p~~~~~~------d~~~~~~l~~~~~~N---~~~~~~~~~~~~~~~~G~a~-~~v~~d-----ed~~~~i~~~~p~~ 154 (506) T protein:vir:94 90 VGNPINVKLP------DDGSNSGFDTFNKAN---DVDAENYDLFLDMSRYGRAY-EYVYRG-----EDNEEHLAKLDPLD 154 (506) T ss_pred cccCceeecC------cchHHHHHHHHHhcc---CHhHHHHHHHHHHHhcCeEE-EEEEec-----CCCeeEEEEEcccc Confidence 4444444332 223445666666542 35555444 44677799864 577753 24566655443331 Q ss_pred chhcccccceeecC-CCcccccccc---------------------ccccccccccccccccccccCCCcc-ccccccEE Q lcl|NC_017974. 160 QASLSRTNPIVYNE-DGSAIIGIKQ---------------------SLSAFQNYTASEIGVGGVSTRMSDV-IIPINRVM 216 (507) Q Consensus 160 ~~~i~~~~~~~~~~-d~~~l~~~~q---------------------~~~~~~~~~~~~~~~~~~~~~~~~~-~iP~~k~i 216 (507) .+. ++-+. ++..+..++. .+.....+...............+. .|| ++ T Consensus 155 --~~~----v~dd~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~g~vP---vv 225 (506) T protein:vir:94 155 --TFV----IYSTDVDPKPIMAVRYHQIELVDDNQVSTINYVPETWTADTYTLYNPTPIMGKMQVDTTKPITTFP---VV 225 (506) T ss_pred --eEE----EecCCCCCceEEEEEEEeeeeccCCceeEEEEEEEEEeCceEEEeccccCccceeccccccCCccc---eE Confidence 111 11010 0111111110 0000100000000000000000111 122 23 Q ss_pred EEeeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEe-cccccCCC------CCCCchHHHHHHHHHH Q lcl|NC_017974. 217 LMNTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELK-IPSQILNK------AAMDQSSPEAEMVRGL 289 (507) Q Consensus 217 ~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~-~P~~~~k~------~~~~~~~~~~~~l~a~ 289 (507) .|+. +..|.|.+..+-...=-=+..+..++..++.+..|+++++ .+....+. ......+.....-... T Consensus 226 ~~~n-----~~~~~sd~e~~~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 300 (506) T protein:vir:94 226 EFKN-----SNFRLGDFENVLPLIDLYDAAQSDTANYMTDLNEAMLIIQGDIDTLFEGSDMMNTIDPNDEDAMAKLAKDK 300 (506) T ss_pred EecC-----CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhHHHHHhcCccccccchhccccccccccccccccccch Confidence 3332 3456666665543332234456667777776666665554 11110000 0000000000000000 Q ss_pred HHHHHHhhcCCceeEeecCcccccccc--cceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHH Q lcl|NC_017974. 290 MSDAANAHSGEQSFFMLPSDTKDNAPQ--YSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSE 367 (507) Q Consensus 290 ~~~~~n~~~G~~a~~iiP~g~~i~~~~--~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~ 367 (507) ......+. ..-.+.++.+......+ ..++++..+. ....+...++.+.+.|...--...++.+.. ++...|. T Consensus 301 ~~~~~~~~--~~~~~~~~~~~~~~~~~~~~d~~~l~~~~--~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~--~~n~Sg~ 374 (506) T protein:vir:94 301 LELIKEMK--DANMLLLKSGMTVNGTQTSVDAKYINKTY--DVVGSEAYKKRVAGDIHKFSHTPDLTDENF--ASNSSGV 374 (506) T ss_pred hHHHhhhh--hcCeeeecccccccCccccccceeeeecC--CHHHHHHHHHHHHHHHHHHhCccccccccc--cccchHH Confidence 01111111 11223334333222111 1222222222 223456788999999987765555444321 1122121 Q ss_pred HHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHh----CCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCcc Q lcl|NC_017974. 368 SKQTIH----TQFVQRVNEIILEALNENLLPQLLAL----NDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLP 439 (507) Q Consensus 368 vh~~v~----~~~~~aD~~~i~~~ln~~li~~l~~l----N~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~ 439 (507) ..... ...+..-.+.+.+.|. ++++.++++ |...-.+..-..+.|....+.|..+.++++.+|+ |+ + T Consensus 375 -Aik~~~~~l~~k~~~k~~~~~~~l~-~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~e~a~~~~kl~--g~-i 449 (506) T protein:vir:94 375 -AMQYKVLGTVELASTKRRMFERGLY-ARYQIISDIENSIHGDWTFDPQELTFTFRDNLPADNISQIKALVQAG--AT-L 449 (506) T ss_pred -HHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhcCCccccccccceEEeCCCCCcCHHHHHHHHHHHh--cc-C Confidence 11111 1222223334444553 355555543 2111111223578899999999999999999984 65 4 Q ss_pred ccHHHHHHHHHHcCC-CCCCCc-ccccchhhhcCCCcCCCCCCcccccCcccccccccCCC Q lcl|NC_017974. 440 KTPKVINRVLEVLGI-DEKIEE-DISQEELMKLLGEDTSRAGDGMTKGSSGNGTGKISSTR 498 (507) Q Consensus 440 ~~~~~~~~i~e~~Gi-p~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 498 (507) +. +.+.+.++. +.+..+ +....+..............+...........+..+.+ T Consensus 450 S~----et~~~~lp~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~ 506 (506) T protein:vir:94 450 PQ----KYLYQQLPGVTNPQDIVDMMKEQSANGDYSFDQNGVISNDGQTNTTATQTDEEVR 506 (506) T ss_pred Ch----HHHHHhCCCCCCHHHHHHHHHHHHHHHhhcchhhcCCCcccCccccccccccCCC Confidence 42 445555533 322111 01111111111110000001111111111111111222 No 204 >protein:vir:104892 Length: 558 # NCBI annotation: T4-like capsid assembly protein # Family: family:all:1036 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214363;genbank:gi:61806003;genbank:GeneID:3294412 Probab=82.29 E-value=0.078 Score=26.65 Aligned_cols=467 Identities=14% Similarity=0.074 Sum_probs=163.1 Q ss_pred Cc--------cccccccc-ccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHH Q lcl|NC_017974. 1 MS--------RKRNRNRS-VKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVAT 71 (507) Q Consensus 1 ~~--------~k~~~~~~-~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s 71 (507) |+ ++....+. ...+.|.....+++ +.+.|..... ...+ ....-..++++.|++|..++.|-+ T Consensus 1 m~~lfgf~~~~~~~~~~~~~s~~~p~~ddg~~~------~~~~g~~~~~--~~~~-~~~~~~~eLI~~YR~ma~~pEvd~ 71 (558) T protein:vir:10 1 MAKLFGFSIEETQKKSTSIISPVPKNNEDGVDN------FISSGFYGQY--VDIE-GAYRSEYDLIRRYREMALHPEADG 71 (558) T ss_pred CcchhcchhhhhhhhccCCccccCCCccccccc------eeccceeeee--eccc-chhhhHHHHHHHHHHHhhccchhh Confidence 33 22222222 22222222211111 1111111111 1111 111123579999999999999999 Q ss_pred HHHHHHHHHhhCCc---eeeeCCCCChhH----HHHHHHHHHHHHhccCCCHHHHHHHHHH-HHhhcceeeeeeeeccCc Q lcl|NC_017974. 72 GLDMLYTFVEKAFK---DFKVIPGESEES----KKAAKFVEYCLKNMEGQTLRQFARDAAT-FNEYGLSVVEKVYTQVTV 143 (507) Q Consensus 72 ~l~~Rk~av~~~~~---~~~~~~~~~~~~----~~~ae~v~~~l~~~~~~~~~~~l~~~l~-a~~~Gfs~~Ei~w~~~~~ 143 (507) +++....-+...+- .+.++-..++.+ +++.+-++..|+-|+ |.....++.. -.-.|--.+.++-.... T Consensus 72 Av~eIVneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~ll~---F~~~~~e~fR~WYVDgRiyfHKiid~k~- 147 (558) T protein:vir:10 72 AIEDVVNEAIVSDLYDSPVEVELSNLNASNTLKKKIREEFRYIKEMMD---FDKKSHEIFRNWYVDGRVFYLKVIDTKN- 147 (558) T ss_pred HHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhc---cchhhhHHHhhheeeeEEEEEEEEeCCC- Confidence 99988776532211 011111112222 233333333333332 2221111111 11123333333332110 Q ss_pred cccCcceeeeeccccCchhcccccceeec-CCCccccccccc------c--ccccccccccc---cccccccCCCccccc Q lcl|NC_017974. 144 GEYIGKYKVKNLAFRPQASLSRTNPIVYN-EDGSAIIGIKQS------L--SAFQNYTASEI---GVGGVSTRMSDVIIP 211 (507) Q Consensus 144 ~~~~g~~~~~~l~~r~~~~i~~~~~~~~~-~d~~~l~~~~q~------~--~~~~~~~~~~~---~~~~~~~~~~~~~iP 211 (507) .-+|-..++ ...|.+|.+++-.... .|+.....++.. + .-++.+..... ......+...++.|| T Consensus 148 -pk~GI~ELr---~lDPr~i~~Vr~i~~~~~~~~~~~~~~~~~~~~~~~~~~eyy~Y~~~~~~~~~~~~~~~~~~~vkI~ 223 (558) T protein:vir:10 148 -PQEGIQDLR---YIDPLKIKFIRQEKRKPGNQDPAIRVRSEQDVVPNPEFEEFYIYTPKVQHPTGMVGQMGGKNSIKIA 223 (558) T ss_pred -ccccceeee---eeCcccceeeeeeccccccccceeeeecccceeeccceeEeeeecCCcccccccceeecCCCceeec Confidence 112333333 3344455544433222 122111111111 1 11122222111 111112233455665 Q ss_pred cccEEEEeeCCccC--ccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHH Q lcl|NC_017974. 212 INRVMLMNTGGSSS--QALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGL 289 (507) Q Consensus 212 ~~k~i~~~~~~~~~--~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~ 289 (507) .+ -|+|+|..--+ .-.=.|.|.++..++==-+.....-+.+-=--.-.=.++++. +|..+ .....+.|+.+ T Consensus 224 ~d-AI~y~hSGL~d~~~~~i~syLhkAIKp~NQLkmlEDAlVIYRitRAPERRvFYID--VGnLP----k~KAeqYlr~i 296 (558) T protein:vir:10 224 KD-SITMCTSGLVDRNKNRVLSYLHKAIKALNQLRMIEDSLVIYRLSRAPERRIFYID--VGNLP----KVKAEQYLKEV 296 (558) T ss_pred hh-heeeecccceecCCCeeeecchHhhHhHHhhHHHHhhHHHHhhhccccceEEEEe--cCCCC----chhHHHHHHHH Confidence 44 57788763211 112247888888776433333333222210001111244442 22222 22333455555 Q ss_pred HHHHHHhh-cCCceeE--------------eecC-----cccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHh Q lcl|NC_017974. 290 MSDAANAH-SGEQSFF--------------MLPS-----DTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLG 349 (507) Q Consensus 290 ~~~~~n~~-~G~~a~~--------------iiP~-----g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iL 349 (507) +.-..|-. -.+..|- -+|. |++|.+. .|+.+... ..=|+|..+.+-+++- T Consensus 297 M~k~KNklVYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTL---------pGgqnLge-m~DV~YF~kKLy~aLn 366 (558) T protein:vir:10 297 MSRYRNKLVYDANTGEVRDDRKFMSMMEDFWLPRREGGRGTEITTL---------PGGQNLGE-LSDVDYFQKKLYRALG 366 (558) T ss_pred HHhccceEEEeccCceecccchhhhhHhhhcccccCCCCccceeec---------cccCCcch-HHHHHHHHHHHHHHhC Confidence 44332210 0111111 1222 5555442 23333332 3458999999998887 Q ss_pred cccccccccchhhhhHH---HHH-HHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCC--ccccceEEecCc----- Q lcl|NC_017974. 350 AGFINVGNDKGGSYNLS---ESK-QTI-HTQFVQRVNEIILEALNENLLPQLLALNDMRLP--ETEMPYVKAGEI----- 417 (507) Q Consensus 350 GqtLts~~~~~GS~Al~---~vh-~~v-~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~--~~~~P~~~~~~~----- 417 (507) -..--.+.. |...+| ++. .|+ |...+......+...|..-|-..|+-=|...-. +.-.+.+.|+.. T Consensus 367 VP~SRl~~e--~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f 444 (558) T protein:vir:10 367 VPESRIAAE--GGFNLGRSSEILRDELKFAKFVGRLRKRFAAMFNDMLKTQLVLKNIVTPEDWKTMEDHIQYDFLYDNQF 444 (558) T ss_pred CCccccCCC--CcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchH Confidence 765333322 122222 222 222 223344444444444443333333321211000 011122333322 Q ss_pred -CchhHHHHHHHHHHHHhcCCccccHHHHHHHHHH-cCCCCC---------------------CCcccccchhhhc---- Q lcl|NC_017974. 418 -VDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEV-LGIDEK---------------------IEEDISQEELMKL---- 470 (507) Q Consensus 418 -~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~-~Gip~~---------------------~~~~~~~~~~~~~---- 470 (507) |-.+.+.+.+++..|..+--.+-.-.+.+||++. |.+.+. ..+++....+.++ T Consensus 445 ~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~ILr~tDeeI~~~~kqI~~E~k~~~~~~p~~~~~~~~~~~~~~~~~ 524 (558) T protein:vir:10 445 AELKESELMEGRLGMLATIEPYIGKYYSTEYVRKRVLRQTDMEIEEIDTQIEDEIQKGIIPDPSQIDPITGEPLPQEGDP 524 (558) T ss_pred HHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHHHHHHHHHHHHHHHhCCCCCCccccChhhccccCccCCc Confidence 2344445555655554431111111224566543 444311 1111100000000 Q ss_pred -CCCcCCCCCCcccccCcccccccccCCCCchhhhc Q lcl|NC_017974. 471 -LGEDTSRAGDGMTKGSSGNGTGKISSTRDNSAANL 505 (507) Q Consensus 471 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~a~~~ 505 (507) +++.+.++.++...+++.+..+ ...++.+-+-+ T Consensus 525 ~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~ 558 (558) T protein:vir:10 525 AMEGMGEQPVDPDLEAQAQAVDA--QYSKDTKKAEL 558 (558) T ss_pred hhccCCCCCcccccccchhhhhh--hhhhhhhhhcC Confidence 0111111111111111111111 11223333333 No 205 >protein:vir:94599 Length: 641 # NCBI annotation: PfWMP4_39 # Family: family:all:1548 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762669;genbank:gi:115304377;genbank:GeneID:5142299 Probab=82.29 E-value=0.078 Score=26.65 Aligned_cols=466 Identities=12% Similarity=0.105 Sum_probs=168.1 Q ss_pred ccccCcccccccccccccccc--ccCchhhhHHHHhhhhcccchhccchhHHHHHH-------H---------------- Q lcl|NC_017974. 9 RSVKVAKATSEQINVSRMRMS--EQGTFALAKVQVDSERMKAEEIRWPNLIGTVES-------M---------------- 63 (507) Q Consensus 9 ~~~~~~~~~~~~~~~~~~~~~--~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~-------m---------------- 63 (507) -.+.+|-+.-........+++ .+++..++.+.......+..+-+|.++++.|.. + T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~R~~~e~~W~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r 80 (641) T protein:vir:94 1 MTIEMPTPIIEDKESAKRKLSTDRIGGVVISKWQESRDKRNTVENNWDETYELYRASAIDRQNTRARNFQTTGADDADWR 80 (641) T ss_pred CccCCCcccccCCcchhhcCCchhHHHHHHHHHHHHHHhhcchHHHHHHHHHHhhcchhhhhhcccccccccccchhccc Confidence 223333332222222222333 233444555666666666667677666554411 0 Q ss_pred --hhChHHHHHHHHHHHHHhhC-----CceeeeCCCCChhHHHHHHHHHHHHHh-ccCCCHHHHHHHHH-HHHhhcceee Q lcl|NC_017974. 64 --KQDATVATGLDMLYTFVEKA-----FKDFKVIPGESEESKKAAKFVEYCLKN-MEGQTLRQFARDAA-TFNEYGLSVV 134 (507) Q Consensus 64 --~~D~~v~s~l~~Rk~av~~~-----~~~~~~~~~~~~~~~~~ae~v~~~l~~-~~~~~~~~~l~~~l-~a~~~Gfs~~ 134 (507) ..|+++...++.....+.+. +| |+..|. ++++.+.|+++.++++. +....|.+.+..++ +++.+|..+. T Consensus 81 ~ki~~~~~~~~~~~l~s~Lm~~~~p~~~w-f~~~p~-~~ed~~~A~~~~~~~~~~l~~~~~~~~~~~~~~d~~~~g~~iv 158 (641) T protein:vir:94 81 HRINTGHTFEVVETLVAYFKGATFPSDDW-FDLKGM-VPELADAARVVKQLTKTKLEAASIRDIFETYVRNLVLYGVSTY 158 (641) T ss_pred ccccchhHHHHHHHHhhHHhhhhcCCCce-EEEecC-CCChHHHHHHHHHHHHHHHhhcchHHHHHHHHHHHhhcCceEE Confidence 12455555555554554432 22 233332 23344556665555542 11234666666665 6777888776 Q ss_pred eeeeeccC----------ccccCcce---------eeeeccccCch-----------------------hccc-ccceee Q lcl|NC_017974. 135 EKVYTQVT----------VGEYIGKY---------KVKNLAFRPQA-----------------------SLSR-TNPIVY 171 (507) Q Consensus 135 Ei~w~~~~----------~~~~~g~~---------~~~~l~~r~~~-----------------------~i~~-~~~~~~ 171 (507) ..-|.... .+..-+.+ ..-++.+.++. ++.. ...-+| T Consensus 159 ~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~r~~~v~~~di~~dps~~~~~~~f~~~r~t~~t~~~l~~eg~~ 238 (641) T protein:vir:94 159 RLGWDTSMERQFKRTFVETGDIFGGWEDVAVNRQRSELRIEPLSPYDVWLDTSGGKNTGTFVRLRHTREELHELVTSGYY 238 (641) T ss_pred EeehhhHHHHhhhhhcccchhhcccccccceecccceeeEEecchhheeecCCCCcccccceehhhhHHHHHHHHhcCCC Confidence 65554210 00000000 00000000000 0000 000001 Q ss_pred cCCCccccccccc---------------ccccccccc-c----c-cccc-ccccCCCccc-----cc---cccEEEEeeC Q lcl|NC_017974. 172 NEDGSAIIGIKQS---------------LSAFQNYTA-S----E-IGVG-GVSTRMSDVI-----IP---INRVMLMNTG 221 (507) Q Consensus 172 ~~d~~~l~~~~q~---------------~~~~~~~~~-~----~-~~~~-~~~~~~~~~~-----iP---~~k~i~~~~~ 221 (507) +.|.........+ .+.+..+.. + + .... ......+... .+ ..-|+++++. T Consensus 239 ~~d~v~~~~~~~~~~~~~d~~~d~~~~~~~~~~~~e~~gd~~~d~~~~~~~~~~~~g~~il~~~~~~~~d~~Pf~~~r~~ 318 (641) T protein:vir:94 239 DLDLTQVEQYVDYKFADPDTPKDVNGTDTSGWDIIEYYGPLLVEGVQFWCVHAVFYGKQLIRLSDSKYWCGSPFVTTTLL 318 (641) T ss_pred ChhhcchhhcccccccccccccccccccccccceeeeeeeeccCCCceeeEEEEEeCCEEeecccccccCcCCeEEecce Confidence 1000000000000 000000000 0 0 0000 0000011111 11 1258999999 Q ss_pred CccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCc Q lcl|NC_017974. 222 GSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQ 301 (507) Q Consensus 222 ~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~ 301 (507) ...++.||.|....|..-..-.+...+.-+..++.-..|..... +..+.+ + .+++.+ T Consensus 319 ~~~~~~YG~gp~~~~l~dqk~ln~l~r~~ld~~~~~~~p~~~~~-~~~~~~-----~---------------~~l~~~-- 375 (641) T protein:vir:94 319 PDRDSVYGMSVLHPNLGALHVLNVLTNGRLDNLVLHINKMWTLV-EDGILK-----R---------------EDVKAK-- 375 (641) T ss_pred ecCCcccCCChHHHHHHHHHHHHHHHHHHHHHHHHHhCCeeeec-cccccc-----c---------------ceeecc-- Confidence 99999999999999999988888888888888877655543321 100000 0 001111 Q ss_pred eeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhccccccccc-chhhh-hHHHHHH--HHHHHHH Q lcl|NC_017974. 302 SFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGND-KGGSY-NLSESKQ--TIHTQFV 377 (507) Q Consensus 302 a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~-~~GS~-Al~~vh~--~v~~~~~ 377 (507) .|+++..+..- .+..+. .++.........+++++..|.+++....+.++.+ ..|.. -+.+|.. +.....+ T Consensus 376 PG~ii~~~~~~-----~v~pl~-~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~TAtEV~~~~~e~~~~l 449 (641) T protein:vir:94 376 PGAVFKVAQHG-----SLQPID-MGRQDFVVTYQEAQVQESSVYRNTSTGPLIGNAAPRGGERVTAAEIQGVRDAGGNRL 449 (641) T ss_pred CCcceeeCCCC-----cceeec-CCccccchhHHHHHHHHHHHHHhhhhhhhhcccccccchhccHHHHHHHHHHHHHHH Confidence 12222222111 111111 1222222234678999888888876655433221 22221 1223322 1112223 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhCCCcCCcc-----------------ccc-eE--Eec--CcCchhHHHHHHHHHHHH-- Q lcl|NC_017974. 378 QRVNEIILEALNENLLPQLLALNDMRLPET-----------------EMP-YV--KAG--EIVDVDMEGFSKAIQRIG-- 433 (507) Q Consensus 378 ~aD~~~i~~~ln~~li~~l~~lN~~~~~~~-----------------~~P-~~--~~~--~~~~~dl~~~a~~~~~L~-- 433 (507) ..-.+.+++.+-..|+.+++.+|..+.... ..| .+ .|+ .........-+..++.|. T Consensus 450 ~~i~r~l~~e~l~pll~~~~~~~~~~~~~p~i~R~~~~~~~~~~~~~~~p~~L~~~~~iv~l~~~q~~~~~~~i~~l~~~ 529 (641) T protein:vir:94 450 SSVHTHIEDSSTLPLLNKVFSLLQQFYVTPETIRMYVPEEQMDGFFEVSPEYLHYPYKFLALGANYVVERERMVTDLLQL 529 (641) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhccchhhhhhhchhhhcccCCCCCccceeeeeeEeecchhHHHHHHHHHHHHHHH Confidence 334444554444555555555542111000 001 11 111 111111111122222221 Q ss_pred -h-cCCccc-------cHHHHHHHHHHcCCCCCCCc---ccccchhhhcCCCcCCCCCCcccccCcccccc-cccCC-CC Q lcl|NC_017974. 434 -A-VGYLPK-------TPKVINRVLEVLGIDEKIEE---DISQEELMKLLGEDTSRAGDGMTKGSSGNGTG-KISST-RD 499 (507) Q Consensus 434 -~-~G~~~~-------~~~~~~~i~e~~Gip~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~-~d 499 (507) + +|. .| ...+...+.+..|++.+..- +..++.+.+..+......- -..+.+.+-.+. .+.+. .. T Consensus 530 ~~~~a~-~P~v~d~~d~~~~~~~~~~~~g~~~p~~~ir~~~~~~~~~~~~~~~~q~~~-~~~a~~~~~~~~~~a~~~~~~ 607 (641) T protein:vir:94 530 LDISGR-VPQIGQSLDYALILEDLLRQMRFTDPMRYIKKAEAPPAAPPIAPAEPGALP-PEMMNSVGGGLNDQAIAGMTP 607 (641) T ss_pred HHHhhc-ChhhhhcCCHHHHHHHHHHHhCCCCchhhccCccCchhHHHHHHHHHHHHH-HHHHHHHHhhhHHHHHHHhhH Confidence 1 221 11 11122445666677644221 0000000000000000000 000000000000 00000 00 Q ss_pred chhhhccC Q lcl|NC_017974. 500 NSAANLDN 507 (507) Q Consensus 500 ~~a~~~~~ 507 (507) .....+.| T Consensus 608 ~~~~~~~~ 615 (641) T protein:vir:94 608 EDVSDLAS 615 (641) T ss_pred HHHHHHHH Confidence 00000111 No 206 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=81.76 E-value=0.083 Score=26.51 Aligned_cols=373 Identities=10% Similarity=0.000 Sum_probs=136.1 Q ss_pred ccccCchhh-hHHHHhhhhcccchhccchhHHHHHHHhhChH----HHHHHHHH--------HHHHhhCCceeeeCCCCC Q lcl|NC_017974. 28 MSEQGTFAL-AKVQVDSERMKAEEIRWPNLIGTVESMKQDAT----VATGLDML--------YTFVEKAFKDFKVIPGES 94 (507) Q Consensus 28 ~~~~g~~~~-~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~----v~s~l~~R--------k~av~~~~~~~~~~~~~~ 94 (507) +.+-.-.-| ..+..+. .|...+.+.|+--....+ +--.+..+ +.+|..+.=.+.+++-.. T Consensus 1 ~~~~~i~~L~~~~~~~~-------~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~iVds~a~rl~~~Gf~~ 73 (409) T protein:vir:94 1 MTEKGIGYLRFKLSVHK-------RRAEMRYDQYAMKYVDRFKGITIPQALSQQYRSILGWCAKGVDSLADRLVFREFEN 73 (409) T ss_pred CCHHHHHHHHHHHHHHh-------HHHHHHHHHhcccCchhhcChhhhHHHHHHHhhhcchhHHHHHHhHhhcccCcccC Confidence 111110000 0000000 011111222221101111 11111111 112222111223333122 Q ss_pred hhHHHHHHHHHHHHHhccCCCHHHHHHHH-HHHHhhcceeeeeeeeccCccccCcceeeeeccccCchhcccccc----- Q lcl|NC_017974. 95 EESKKAAKFVEYCLKNMEGQTLRQFARDA-ATFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAFRPQASLSRTNP----- 168 (507) Q Consensus 95 ~~~~~~ae~v~~~l~~~~~~~~~~~l~~~-l~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~r~~~~i~~~~~----- 168 (507) + +.+ +.+++.. ..|.....++ .+|+-||.|.. .+|..+ +|...+.-+.|+. ++.-|.+ T Consensus 74 ~-d~~----l~~i~~~---N~ld~~~~~~~~~aliyG~sf~-~v~~~~-----dg~~~i~~~sp~~--~~~i~D~~~~~~ 137 (409) T protein:vir:94 74 D-DFT----VNEIFEE---NNPDIFFDSAVLSSLIASCSFT-YISKGE-----NDAVRLQVIEAVN--ATGIIDPITGLL 137 (409) T ss_pred C-chH----HHHHHHh---cChhHHHHHHHHHHHHhcceeE-EEecCC-----CCceEEEEeccce--EEEEEecCCCce Confidence 1 222 2333332 2354444443 46889999765 788643 3443333222211 1111111 Q ss_pred -----eeecCCCccccccc-cccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHH-HHHHHHHH Q lcl|NC_017974. 169 -----IVYNEDGSAIIGIK-QSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPL-VGCYRAWR 241 (507) Q Consensus 169 -----~~~~~d~~~l~~~~-q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll-~~~~~~~~ 241 (507) ++............ ..+.....+.. ..+...... ...+..-++-|.++.+.+.|+|.|-+ +.+-..-= T Consensus 138 ~~a~~~~~~d~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~----n~~g~vPvV~f~n~~~~~~~~G~s~I~e~v~~l~d 212 (409) T protein:vir:94 138 TEGYAVLERDENNNVVLEAHFLPDRTDYYYR-DSRNNISIA----NPTGHPLLVPIIHRPDAVRPFGRSRITRSGMYWQS 212 (409) T ss_pred eeeEEEEEecCCCceEEEEEEecCcEEEEEe-cCceeEeee----CCCCCcceEEeccccccccccCccccchhHHHHHH Confidence 01111111000000 00110000000 000111111 11222336778888888899998755 32321111 Q ss_pred HHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccccceee Q lcl|NC_017974. 242 EKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYSMTL 321 (507) Q Consensus 242 fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~~el 321 (507) -=+..+..-+.-.|=|.+|-.++. ++ +++......+++... ....+|.+.+-+. +++ T Consensus 213 a~~r~~~~~~~~~e~~a~pqr~i~---G~------d~d~~~~~~~~~~~~----------~i~~~~~d~dg~~----~~v 269 (409) T protein:vir:94 213 NAKRTLERADVTAEFYSFPQKYVT---GL------SDDAEPMETWKATVS----------SMLQFTKDEDGDK----PTL 269 (409) T ss_pred HHHHHHHHHHHHHHHhcChhheeE---ec------CCCCcccchhhhhHH----------HhhcCCCCCCCCC----ceE Confidence 112222233344555666655553 11 011111111222211 2345665533221 122 Q ss_pred eeccCCCcchhHHHHHHHHHHHHHHHHhcccccc-c-c-cc-hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 322 KGIDGMGKQYSTAQLISDRKKSILDRLGAGFINV-G-N-DK-GGSYNLSESKQTIHTQFVQRVNEIILEALNENLLPQLL 397 (507) Q Consensus 322 ~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts-~-~-~~-~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~ 397 (507) .+.+++ +...|-..++.+-.++|-. .+=.... + . .. .+.-|+...+..+ ....+.-.+.+...+. ++++-.+ T Consensus 270 ~q~~~~-~l~~~~~~l~~~~~~~a~~-t~lP~~~lg~~~~NpsSa~Al~a~~~~L-~~~a~~k~~~fg~~~~-~~~rla~ 345 (409) T protein:vir:94 270 GQFTQP-SMSPFTEQLRTAAAGFAGE-TGLTLDDLGFVSDNPSSVEAIKASHENL-RLAGRKAQRSLGAGLL-NVAYLAA 345 (409) T ss_pred EecCCC-ChhHHHHHHHHHHHHHhhh-cCCCHHHhccccCchhHHHHHHHHHHHH-HHHHHHHHHHHHHHHH-HHHHHHH Confidence 222222 2223434444444444431 1100000 0 0 01 1222333222222 2333334455666674 4666555 Q ss_pred Hh-CCCc-CCcc-ccceEEecCcC---chhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCC Q lcl|NC_017974. 398 AL-NDMR-LPET-EMPYVKAGEIV---DVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKI 458 (507) Q Consensus 398 ~l-N~~~-~~~~-~~P~~~~~~~~---~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~ 458 (507) .+ +... .+++ .-.++.|.... ...+.+.|+++.||+++|..+.+ .+.+++.+|+.++. T Consensus 346 ~i~~~~~~~~~~~~~~~v~W~p~~~~~~~~~a~~aDa~~Kl~~ag~~~~~---~~~~~~~lG~~~~d 409 (409) T protein:vir:94 346 CLRDDAPYLREQFRKTKPKWEPLFEADASMLSLIGDGAIKLNQAIPEFIN---KDTIRDLTGIEGGE 409 (409) T ss_pred HHhCCCCccccccccceEEeccCCCcchHHHHHHHHHHHHHHHhcccccc---hhHHHHHcCCCCCC Confidence 44 3211 1111 11356677443 34456778899999999975544 24689999997654 No 207 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=79.54 E-value=0.1 Score=25.99 Aligned_cols=385 Identities=8% Similarity=-0.057 Sum_probs=140.5 Q ss_pred cccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHHHhhCCceeeeCCCCChhHHHH Q lcl|NC_017974. 21 INVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTFVEKAFKDFKVIPGESEESKKA 100 (507) Q Consensus 21 ~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~av~~~~~~~~~~~~~~~~~~~~ 100 (507) ++..+.++. ....+.+-...++. -.+.+-+++..+- .++++-=+.+|..+.=++.+++-..+ +.. T Consensus 1 l~~~~~r~~----------~~~~yY~g~~~~~~-~~~~~p~~~~~~~--~~v~nw~~~~Vds~a~rl~~~Gf~~~-d~~- 65 (410) T protein:vir:95 1 MNLYQSRVN----------LRYKHYAMQHYEAP-TGITIPAHIRAKY--QAVLGWAAKGVDSLADRLIFRAFAND-DFN- 65 (410) T ss_pred CCcchhhHH----------HHHHHhcCCCCccc-cchhccHHHHhHH--HhhcchhHHHHHHhHhhhccccccCC-Cch- Confidence 111111100 01111111111110 0111222222111 11222222233222112333331111 122 Q ss_pred HHHHHHHHHhccCCCHHHHHHH-HHHHHhhcceeeeeeeeccCccccCcceeeeeccccCchhcccccceeecCCCcccc Q lcl|NC_017974. 101 AKFVEYCLKNMEGQTLRQFARD-AATFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAFRPQASLSRTNPIVYNEDGSAII 179 (507) Q Consensus 101 ae~v~~~l~~~~~~~~~~~l~~-~l~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~r~~~~i~~~~~~~~~~d~~~l~ 179 (507) +.+++.. ..|.....+ ..+|+-||.|.. .||.-. +|.-.+. .+++.... ..+++..+.+. T Consensus 66 ---l~~i~~~---N~ld~~~~~~~~~al~~G~sf~-~v~~~~-----d~~~~i~---~~sP~~~~----~i~Dp~~~~~~ 126 (410) T protein:vir:95 66 ---VTEIFDR---NNPDIFFDSAILSALIGSCSFV-YISKGE-----DDEVRLQ---VIESSNAT----GVIDPITGLLV 126 (410) T ss_pred ---HHHHHhh---cChHHHHHHHHHHHHHhCceeE-EEecCC-----CCceEEE---EEcccceE----EEEeCCCCceE Confidence 3333332 345555555 457889999765 788633 3333332 22222111 01111111110 Q ss_pred c-c--cc-----ccccccccccccccccccccCC--CccccccccEEEEeeCCccCccccchHH-HHHHHHHHHHHHHHH Q lcl|NC_017974. 180 G-I--KQ-----SLSAFQNYTASEIGVGGVSTRM--SDVIIPINRVMLMNTGGSSSQALGVSPL-VGCYRAWREKILIEN 248 (507) Q Consensus 180 ~-~--~q-----~~~~~~~~~~~~~~~~~~~~~~--~~~~iP~~k~i~~~~~~~~~~p~G~gll-~~~~~~~~fK~~~~~ 248 (507) . + .. .+.....+.............. ....++...++.|++..+.+.|+|.|-+ +.+-...=-=+..+. T Consensus 127 ~al~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvV~f~n~~~l~~~~G~s~I~~~v~~l~da~~r~~~ 206 (410) T protein:vir:95 127 EGYAVLARDDYNRPTLEAYFEPNATHFIPKDGEPYSVTNETGIPLLVPVIHRPDAVRPFGRSRITRAGMYYQKYAKRTLE 206 (410) T ss_pred EEEEEEEecCCCeEEEEEEEeCCcEEEEeeCCccccccCCCCCcceEEecccccCCccCCccccchhHHHHHHHHHHHHH Confidence 0 0 00 0001111111111000000000 1122333456788888888899998743 433222111112222 Q ss_pred HHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccccceeeeeccCCC Q lcl|NC_017974. 249 LEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMG 328 (507) Q Consensus 249 ~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g 328 (507) .-+.-.|=|.+|-.++. ++ .......+ .+++. .+....+|.+.+-+. .++.+.+++ T Consensus 207 ~~~~~~e~~a~pqr~i~---G~--d~d~~~~~----~~~~~----------~~~i~~~~~~~~~~~----~~v~q~~~~- 262 (410) T protein:vir:95 207 RADITAEFYSWPQKYIL---GL--DPDAEPME----KWKAT----------VSSLLTISSSDKGVK----PSVGQFTTA- 262 (410) T ss_pred HHHHHHHHhcchhheee---cc--CCCCCcCc----hhhhh----------hhhheeccCCCCCCc----ceEEecCCC- Confidence 33344455666655543 11 11111111 11211 113456676543221 122222222 Q ss_pred cchhHHHHHHHHHHHHHHH------Hhcccccccccc-hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC Q lcl|NC_017974. 329 KQYSTAQLISDRKKSILDR------LGAGFINVGNDK-GGSYNLSESKQTIHTQFVQRVNEIILEALNENLLPQLLALND 401 (507) Q Consensus 329 ~~~~~~~li~~~d~~Isk~------iLGqtLts~~~~-~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~lN~ 401 (507) +...|...++-+-.+||-. .||..+ .. .+.-|+...+..+ ....+.-.+.+...+. ++++-.+.+=. T Consensus 263 ~l~~~~~~l~~l~~~~a~~s~lP~~~lg~~~----~NpsSa~Al~a~~~~L-~~ka~~k~~~fg~~l~-~~~rla~~i~~ 336 (410) T protein:vir:95 263 SMSPFTEQLRTAAAGFAGEMGLTLDDLGFVS----DNPSSVEAIKASHENL-RLAGRKAQRSLGAGLL-NVAYVAACLRD 336 (410) T ss_pred ChHHHHHHHHHHHHHHhhhcCCCHHHhcccc----CchhHHHHHHHHHHHH-HHHHHHHHHHHHHHHH-HHHHHHHHHhc Confidence 2223434444444444322 112110 01 1223333333332 2333344555667774 46666555421 Q ss_pred CcC--Ccc-ccceEEec---CcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhhcCCCcC Q lcl|NC_017974. 402 MRL--PET-EMPYVKAG---EIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMKLLGEDT 475 (507) Q Consensus 402 ~~~--~~~-~~P~~~~~---~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~~~~~~~ 475 (507) ... +.. .-..+.|. ..+...+.+.|+++.||+++|--+.+ .+-+++.+|+.+.+ ...........+ T Consensus 337 ~~~~~~~~~~~~~v~W~p~~d~~~~s~a~~aDa~~Kl~~a~~g~~~---~~~~~~~lg~~~~~---~~~~~~~e~~~~-- 408 (410) T protein:vir:95 337 EFRYTRSQFVRTAVKWEPLFEADANTMTMIGDGVVKLNQALPGYIN---AETIRDLTGIAGDM---SAKPVVSEGGSN-- 408 (410) T ss_pred CCCCcccccceeeEEeeecCCcchhhHHHHHHHHHHHHHhccCCcc---HHHHHHhcCCChHH---HHHHHHHHHHhC-- Confidence 111 111 11234454 45555678889999999998532232 24588999996432 111111111111 Q ss_pred CCCCC Q lcl|NC_017974. 476 SRAGD 480 (507) Q Consensus 476 ~~~~~ 480 (507) |+ T Consensus 409 ---g~ 410 (410) T protein:vir:95 409 ---GE 410 (410) T ss_pred ---CC Confidence 11 No 208 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=78.99 E-value=0.11 Score=25.86 Aligned_cols=437 Identities=11% Similarity=0.056 Sum_probs=152.3 Q ss_pred Ccccc--ccccccc-CccccccccccccccccccCchhhhH---HHHhhhhcccchhccch-hHHH---HHHHhhChHHH Q lcl|NC_017974. 1 MSRKR--NRNRSVK-VAKATSEQINVSRMRMSEQGTFALAK---VQVDSERMKAEEIRWPN-LIGT---VESMKQDATVA 70 (507) Q Consensus 1 ~~~k~--~~~~~~~-~~~~~~~~~~~~~~~~~~~g~~~~~~---~~~~~~~e~~~~lr~~~-~~~~---y~~m~~D~~v~ 70 (507) |-.+= ..|.-.+ |--..+++..+...... +...-... ++.. +.-..+.+.... .... -+... ...+. T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~-~~~~~~~~i~~~~~~-Y~g~~~~~~~~~~~~~~~~~~~~~~-s~n~~ 77 (499) T protein:vir:80 1 MINQIIAGVKGVMRRMGLLKSLKDVTDHKKVN-ANDEDYKYIDMWKRL-YQGNYAEWHNLNYEHNGNPVNRRQL-SMNLP 77 (499) T ss_pred ChhHHHHHHHHHHHHhccccchhhhhcCCCCc-CCHHHHHHHHHHHHH-hcCCcchhhccccccCCCcccccee-ecchH Confidence 54421 1111111 11011111111111111 00000000 1100 000001100000 0000 00000 01111 Q ss_pred HH-HHHHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCc Q lcl|NC_017974. 71 TG-LDMLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIG 148 (507) Q Consensus 71 s~-l~~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g 148 (507) .. ..+--.-|++-+..+.+. +++..++++++++. ..|...+..++ .|..+|-+++=+.|.. +| T Consensus 78 ~~iv~~~a~~l~~ep~~i~~~------d~~~~e~l~~~~~~---n~f~~~~~~~~~~a~~~G~~~~~~~~D~------~~ 142 (499) T protein:vir:80 78 KVTAKYMSKLLFNEKVKINID------DETAEEFVLNVLKT---NGFTKNMERYIEYGEAMGGFVIKVYHDG------NK 142 (499) T ss_pred HHHHHHHHHhhhCCcceEeeC------CHHHHHHHHHHHhh---ccHHHHHHHHHHHHhhcCcEEEEEEECC------CC Confidence 11 122222333332334432 35677788888764 23666666655 5888999998666643 34 Q ss_pred ceeeeeccccCchhcccccceeecCCCcccccc---cccccc--------------------------cccccccccccc Q lcl|NC_017974. 149 KYKVKNLAFRPQASLSRTNPIVYNEDGSAIIGI---KQSLSA--------------------------FQNYTASEIGVG 199 (507) Q Consensus 149 ~~~~~~l~~r~~~~i~~~~~~~~~~d~~~l~~~---~q~~~~--------------------------~~~~~~~~~~~~ 199 (507) ++.+.-+ ++.++. |++++. + .+..+ ...... +........+.. T Consensus 143 ~~~i~~v---~a~~~~---Pi~~d~-~-~~~~~~f~~~~~~~~~~y~~lE~h~~~~~~~~~y~I~n~~~~~~~~~~lG~~ 214 (499) T protein:vir:80 143 NVKVSFA---TADCMY---PLSNDS-E-NVDECLIANSFHKNNKYYKLLEWNEWKGEKEEVYTVTTELYQSDDPNELGGK 214 (499) T ss_pred cEEEEEE---cCCceE---EEEecC-C-CeEEEEEEEEEeecCeEEEEEEEEEecccceeeEEEEEEEEeccCccccCcc Confidence 4444322 222221 122221 1 11100 000000 000000000000 Q ss_pred c-----cccCCCcccc---ccccEEEEee----CCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecc Q lcl|NC_017974. 200 G-----VSTRMSDVII---PINRVMLMNT----GGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIP 267 (507) Q Consensus 200 ~-----~~~~~~~~~i---P~~k~i~~~~----~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P 267 (507) . +........+ +.--|+.|+. ....++|+|.|.+..|--..--=+..+..|+.-++. | .- .+.+| T Consensus 215 v~l~~~~~~~~~~~~~~~~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~~-~-~~-~i~v~ 291 (499) T protein:vir:80 215 VSLKLLFNDIEPVVPLPSLTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKL-G-KK-KVLVP 291 (499) T ss_pred cchhhhccCcCCceeecCCCccceEeecCCccccccCCCccCCchHhhHHHHHHHHHHHHHHHHHHHHh-c-cc-ceecc Confidence 0 0011111111 1111455543 234678999999998865543334444445544542 1 11 11233 Q ss_pred cccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHH Q lcl|NC_017974. 268 SQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDR 347 (507) Q Consensus 268 ~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~ 347 (507) ..+.+.......+. ...... ...-+..++...+-. .. .++. .+..=-...|...++.+=++|+.. T Consensus 292 ~~~l~~~~~~~g~~-~~~~~~----------~~~~~~~~~~~~~~~-~~-~i~~--~~~~ir~e~~~~~l~~~l~~i~~~ 356 (499) T protein:vir:80 292 SSFVKTAVNLDGST-TQYFDS----------TDEAFFLYQGEQDDN-GK-AIKD--ISVEIRSTEFIESINAMLRIYAMQ 356 (499) T ss_pred hhhhhccCCCCCCc-ccCCCc----------ccceeeEeeccCCCC-cC-ceeE--ecCcCChHHHHHHHHHHHHHHHHh Confidence 32222111000000 000000 000111111110000 00 0000 000000012334445444555544 Q ss_pred H-hcccccccccchhhhhHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHh----CCCcC--CccccceEEecCcC Q lcl|NC_017974. 348 L-GAGFINVGNDKGGSYNLSESKQTIHT--QFVQRVNEIILEALNENLLPQLLAL----NDMRL--PETEMPYVKAGEIV 418 (507) Q Consensus 348 i-LGqtLts~~~~~GS~Al~~vh~~v~~--~~~~aD~~~i~~~ln~~li~~l~~l----N~~~~--~~~~~P~~~~~~~~ 418 (507) + +++. +-+.+.+|....-++..+-.. ..+..-.+.+...|. +|++.++.+ ....+ .+..-+.+.|+..- T Consensus 357 ~g~s~~-~fg~~~~g~~TAtei~s~~~~l~~~~~~~~~~~~~~l~-~l~~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i 434 (499) T protein:vir:80 357 VGLSAG-TFTFDENGLKTATEVVSEKSETYQTKNSHSQLIEQGIK-EMIVSILEVGKLIKAYDGDTVELDTITVDFDDSI 434 (499) T ss_pred cCCChh-hcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhccccCCCCCccceEEEeCCCC Confidence 4 2221 112222232212222222111 122333455555663 465555543 11111 12234678899888 Q ss_pred chhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchhhhcCCCcCCCCCCcccccCccccc Q lcl|NC_017974. 419 DVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEELMKLLGEDTSRAGDGMTKGSSGNGT 491 (507) Q Consensus 419 ~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (507) ..|.++.++.+.+++.+|++.. +.++.+.+|.++.+.+++.........+. .+..+. .+-.|+.. T Consensus 435 ~~d~~~~~~~~~~~~~~Gi~S~----et~l~~~~~~~d~ea~~el~~i~~E~~~~--~~~~d~--~g~~ge~e 499 (499) T protein:vir:80 435 AQDEDTTINRYTTAKNQGMIPL----KIALQRAWNITEAEADEWAEMLAKEKQAE--IPNNDM--TGIFGEEE 499 (499) T ss_pred CCCHHHHHHHHHHHHHcCCCCH----HHHHhhcCCCChHHHHHHHHHHHHHhhcC--CCCCCc--cccCCCCC Confidence 8898889999999999998654 57788889987543322222211111110 011110 01111111 No 209 >protein:vir:101806 Length: 516 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238883;genbank:gi:66391958;genbank:GeneID:3416633 Probab=77.67 E-value=0.12 Score=25.59 Aligned_cols=444 Identities=13% Similarity=0.092 Sum_probs=164.8 Q ss_pred Cccccc------ccccccCccccccccccccccccccCchhhhH-HHHh-h-hhcccchh-ccchhHHHHHHHhhChHHH Q lcl|NC_017974. 1 MSRKRN------RNRSVKVAKATSEQINVSRMRMSEQGTFALAK-VQVD-S-ERMKAEEI-RWPNLIGTVESMKQDATVA 70 (507) Q Consensus 1 ~~~k~~------~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~-~~~~-~-~~e~~~~l-r~~~~~~~y~~m~~D~~v~ 70 (507) +++..+ +.++...+.|.....++ ++.+.+.++ +.++ . ........ --.++++.|++|..++.|- T Consensus 11 ~~~d~~~~~~~~~~~~~s~~~p~~~dGa~------~i~~~~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~pEvd 84 (516) T protein:vir:10 11 DRVDQNEYDERLKLGHESIATPKKDDGAT------EIETREGEATYNAVMQQFFGIDNNISGTKDLINTYRQLINNPEVE 84 (516) T ss_pred cchhhhHHhhhhcCCcCcccCCCCCCCce------eeecCCCcccccceeeeeeccccccchHHHHHHHHHHHhhccchh Confidence 222221 12222233332222111 111101111 1111 0 00111112 2257899999999999999 Q ss_pred HHHHHHHHHHhhCCc---eeeeCCCCChhHH----HHHHHHHHHHHhccCCCHHHHHHHHHH-HHhhcceeeeeeeeccC Q lcl|NC_017974. 71 TGLDMLYTFVEKAFK---DFKVIPGESEESK----KAAKFVEYCLKNMEGQTLRQFARDAAT-FNEYGLSVVEKVYTQVT 142 (507) Q Consensus 71 s~l~~Rk~av~~~~~---~~~~~~~~~~~~~----~~ae~v~~~l~~~~~~~~~~~l~~~l~-a~~~Gfs~~Ei~w~~~~ 142 (507) ++++.....+...+- .+.+.-.+.+-++ ++.+-++..|+-|+ |..-..++.. -.-.|--.+.++-.. T Consensus 85 ~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~ik~kI~eeF~~Il~ll~---F~~~~~~~fR~WYVDgRi~fhKiid~-- 159 (516) T protein:vir:10 85 RAVANIVNEAIVYERGHKVVSLDLDDTDFGSNVKEKILEEFDEVCRLLD---ASRKLDTLFRRWYVDSRIFFHKIMPN-- 159 (516) T ss_pred hHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhc---cchhhhHHHhhhhhcceEEEEEEecC-- Confidence 999988776532210 0111111122223 34444444444332 2211111111 111222233322221 Q ss_pred ccccCcceeeeeccccCchhcccccceee-cCCCccccccccccccccccccccccccc---cccCCCccccccccEEEE Q lcl|NC_017974. 143 VGEYIGKYKVKNLAFRPQASLSRTNPIVY-NEDGSAIIGIKQSLSAFQNYTASEIGVGG---VSTRMSDVIIPINRVMLM 218 (507) Q Consensus 143 ~~~~~g~~~~~~l~~r~~~~i~~~~~~~~-~~d~~~l~~~~q~~~~~~~~~~~~~~~~~---~~~~~~~~~iP~~k~i~~ 218 (507) ..+|-..++.| .|.+|.+++-... +.+|..+.. .-..++.+..+...... ..+....+.||.+ -|+| T Consensus 160 --~k~GI~Elr~l---DPr~i~~vR~i~~~~~~~~~v~~---~~~e~~~Y~~~~~~~~~~g~~~~~~~~ikI~~d-AI~y 230 (516) T protein:vir:10 160 --PKKGIAELRRL---DPRFMEYYREIVTSDIGGTTIVK---GYREFFIYTTGNEGYSYNGRIFEPNTRIKIPRS-AVVY 230 (516) T ss_pred --ccccceeeeee---CCcceeeEeeecccccccchhhh---hhhheeeeccCccccccccceeCCCcceeechh-heee Confidence 11333333333 3344544443322 222222111 11122222222211110 1122345666654 4888 Q ss_pred eeCCc---cCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHH Q lcl|NC_017974. 219 NTGGS---SSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAAN 295 (507) Q Consensus 219 ~~~~~---~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n 295 (507) +|..- .+..+ .|.|.++..++==-+.....-+.+-=--.-.=.++++. +|..+ .....+.|+.++.-..| T Consensus 231 ~hSGL~d~~~~~i-~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYID--vGnlP----k~KAeqYl~~im~k~kN 303 (516) T protein:vir:10 231 ASSGLMDCSDRGI-IGYLHNAVKPANQLKLLEDAMVIYRITRAPERRVFYID--VGNMN----NRKATEYVNGIMQSLKN 303 (516) T ss_pred ecccceeCCCCce-eeeehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEe--cCCCC----chhHHHHHHHHHHhcCc Confidence 88543 33444 78899888876444433333333210001111244442 22222 22333455554443322 Q ss_pred hh-cCCceeE--------------eecC-----cccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccc Q lcl|NC_017974. 296 AH-SGEQSFF--------------MLPS-----DTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINV 355 (507) Q Consensus 296 ~~-~G~~a~~--------------iiP~-----g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts 355 (507) =. -.+..|= -+|. |+||.+. .|+.+... ..=|+|..+.+-+++--..--. T Consensus 304 klvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTL---------pGgqnlge-m~DV~YF~kkLy~aLnVP~sRl 373 (516) T protein:vir:10 304 RVVYDSNTGTVKNQKRNLSMTEDYWLMRRDGKSVTEVSSL---------PGAQTMGD-MDDVRWFNKKLYEALRIPLSRI 373 (516) T ss_pred eeEEeCCCCeeccchhhhhhHhhhcccccCCCCccceeec---------cccCCcCh-HHHHHHHHHHHHHHhCCCcccc Confidence 00 0001111 1222 5655442 23322322 3458999999999887766333 Q ss_pred cccchhh----hhHHHHHH-HH-HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCccc----cceEEecCc------Cc Q lcl|NC_017974. 356 GNDKGGS----YNLSESKQ-TI-HTQFVQRVNEIILEALNENLLPQLLALNDMRLPETE----MPYVKAGEI------VD 419 (507) Q Consensus 356 ~~~~~GS----~Al~~vh~-~v-~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~----~P~~~~~~~------~~ 419 (507) +..++++ ++ +++.. |+ |...+......+...|..-|-..|+-=|.. ..+. .+.+.|+.. |- T Consensus 374 ~~e~~~~~~~Gr~-~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgii--t~eew~~i~~~I~~~f~~Dn~f~El 450 (516) T protein:vir:10 374 PRDDGGMVIGGQD-TAITRDELDFRKFVVQLQHDFEEIFLDPLKTNLIYKRII--TEDEWDEQINNIKVNFHQDSYYTEL 450 (516) T ss_pred cCCCCceeecccc-chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCC--CHHHHHHHhhcceEEeeecchHHHH Confidence 3222222 33 33333 33 233444444444555543333333322211 1111 123333332 22 Q ss_pred hhHHHHHHHHHHHHhcCCccccHHHHHHHHHH-cCCCCCCCcccccchhhhcCCCcCCCCCCcccccCccccc Q lcl|NC_017974. 420 VDMEGFSKAIQRIGAVGYLPKTPKVINRVLEV-LGIDEKIEEDISQEELMKLLGEDTSRAGDGMTKGSSGNGT 491 (507) Q Consensus 420 ~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~-~Gip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (507) .+.+.+.+++..|..+-=.+-.-...+|+++. |.++..+-.++. ..-.....++--+.++... +. T Consensus 451 Ke~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~e~-k~I~~E~~~~~~~~p~~~~------~f 516 (516) T protein:vir:10 451 KDIETLRLRVDALSQIEPYVGKYVSHDYVMKNILQMTEEQIAQEE-KQIEQEAGIKRFQNPENED------DF 516 (516) T ss_pred HHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhcCCHhhHHHHH-HHHHHhhhCCCCCCCCccc------cC Confidence 44444555555554431111112235677654 677533221111 0111111111000000000 00 No 210 >protein:vir:101189 Length: 516 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932511;genbank:gi:37651637;genbank:GeneID:2610682 Probab=77.67 E-value=0.12 Score=25.59 Aligned_cols=444 Identities=13% Similarity=0.092 Sum_probs=164.8 Q ss_pred Cccccc------ccccccCccccccccccccccccccCchhhhH-HHHh-h-hhcccchh-ccchhHHHHHHHhhChHHH Q lcl|NC_017974. 1 MSRKRN------RNRSVKVAKATSEQINVSRMRMSEQGTFALAK-VQVD-S-ERMKAEEI-RWPNLIGTVESMKQDATVA 70 (507) Q Consensus 1 ~~~k~~------~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~-~~~~-~-~~e~~~~l-r~~~~~~~y~~m~~D~~v~ 70 (507) +++..+ +.++...+.|.....++ ++.+.+.++ +.++ . ........ --.++++.|++|..++.|- T Consensus 11 ~~~d~~~~~~~~~~~~~s~~~p~~~dGa~------~i~~~~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~pEvd 84 (516) T protein:vir:10 11 DRVDQNEYDERLKLGHESIATPKKDDGAT------EIETREGEATYNAVMQQFFGIDNNISGTKDLINTYRQLINNPEVE 84 (516) T ss_pred cchhhhHHhhhhcCCcCcccCCCCCCCce------eeecCCCcccccceeeeeeccccccchHHHHHHHHHHHhhccchh Confidence 222221 12222233332222111 111101111 1111 0 00111112 2257899999999999999 Q ss_pred HHHHHHHHHHhhCCc---eeeeCCCCChhHH----HHHHHHHHHHHhccCCCHHHHHHHHHH-HHhhcceeeeeeeeccC Q lcl|NC_017974. 71 TGLDMLYTFVEKAFK---DFKVIPGESEESK----KAAKFVEYCLKNMEGQTLRQFARDAAT-FNEYGLSVVEKVYTQVT 142 (507) Q Consensus 71 s~l~~Rk~av~~~~~---~~~~~~~~~~~~~----~~ae~v~~~l~~~~~~~~~~~l~~~l~-a~~~Gfs~~Ei~w~~~~ 142 (507) ++++.....+...+- .+.+.-.+.+-++ ++.+-++..|+-|+ |..-..++.. -.-.|--.+.++-.. T Consensus 85 ~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~ik~kI~eeF~~Il~ll~---F~~~~~~~fR~WYVDgRi~fhKiid~-- 159 (516) T protein:vir:10 85 RAVANIVNEAIVYERGHKVVSLDLDDTDFGSNVKEKILEEFDEVCRLLD---ASRKLDTLFRRWYVDSRIFFHKIMPN-- 159 (516) T ss_pred hHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhc---cchhhhHHHhhhhhcceEEEEEEecC-- Confidence 999988776532210 0111111122223 34444444444332 2211111111 111222233322221 Q ss_pred ccccCcceeeeeccccCchhcccccceee-cCCCccccccccccccccccccccccccc---cccCCCccccccccEEEE Q lcl|NC_017974. 143 VGEYIGKYKVKNLAFRPQASLSRTNPIVY-NEDGSAIIGIKQSLSAFQNYTASEIGVGG---VSTRMSDVIIPINRVMLM 218 (507) Q Consensus 143 ~~~~~g~~~~~~l~~r~~~~i~~~~~~~~-~~d~~~l~~~~q~~~~~~~~~~~~~~~~~---~~~~~~~~~iP~~k~i~~ 218 (507) ..+|-..++.| .|.+|.+++-... +.+|..+.. .-..++.+..+...... ..+....+.||.+ -|+| T Consensus 160 --~k~GI~Elr~l---DPr~i~~vR~i~~~~~~~~~v~~---~~~e~~~Y~~~~~~~~~~g~~~~~~~~ikI~~d-AI~y 230 (516) T protein:vir:10 160 --PKKGIAELRRL---DPRFMEYYREIVTSDIGGTTIVK---GYREFFIYTTGNEGYSYNGRIFEPNTRIKIPRS-AVVY 230 (516) T ss_pred --ccccceeeeee---CCcceeeEeeecccccccchhhh---hhhheeeeccCccccccccceeCCCcceeechh-heee Confidence 11333333333 3344544443322 222222111 11122222222211110 1122345666654 4888 Q ss_pred eeCCc---cCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHH Q lcl|NC_017974. 219 NTGGS---SSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAAN 295 (507) Q Consensus 219 ~~~~~---~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n 295 (507) +|..- .+..+ .|.|.++..++==-+.....-+.+-=--.-.=.++++. +|..+ .....+.|+.++.-..| T Consensus 231 ~hSGL~d~~~~~i-~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYID--vGnlP----k~KAeqYl~~im~k~kN 303 (516) T protein:vir:10 231 ASSGLMDCSDRGI-IGYLHNAVKPANQLKLLEDAMVIYRITRAPERRVFYID--VGNMN----NRKATEYVNGIMQSLKN 303 (516) T ss_pred ecccceeCCCCce-eeeehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEe--cCCCC----chhHHHHHHHHHHhcCc Confidence 88543 33444 78899888876444433333333210001111244442 22222 22333455554443322 Q ss_pred hh-cCCceeE--------------eecC-----cccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccc Q lcl|NC_017974. 296 AH-SGEQSFF--------------MLPS-----DTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINV 355 (507) Q Consensus 296 ~~-~G~~a~~--------------iiP~-----g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts 355 (507) =. -.+..|= -+|. |+||.+. .|+.+... ..=|+|..+.+-+++--..--. T Consensus 304 klvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTL---------pGgqnlge-m~DV~YF~kkLy~aLnVP~sRl 373 (516) T protein:vir:10 304 RVVYDSNTGTVKNQKRNLSMTEDYWLMRRDGKSVTEVSSL---------PGAQTMGD-MDDVRWFNKKLYEALRIPLSRI 373 (516) T ss_pred eeEEeCCCCeeccchhhhhhHhhhcccccCCCCccceeec---------cccCCcCh-HHHHHHHHHHHHHHhCCCcccc Confidence 00 0001111 1222 5655442 23322322 3458999999999887766333 Q ss_pred cccchhh----hhHHHHHH-HH-HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCccc----cceEEecCc------Cc Q lcl|NC_017974. 356 GNDKGGS----YNLSESKQ-TI-HTQFVQRVNEIILEALNENLLPQLLALNDMRLPETE----MPYVKAGEI------VD 419 (507) Q Consensus 356 ~~~~~GS----~Al~~vh~-~v-~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~----~P~~~~~~~------~~ 419 (507) +..++++ ++ +++.. |+ |...+......+...|..-|-..|+-=|.. ..+. .+.+.|+.. |- T Consensus 374 ~~e~~~~~~~Gr~-~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgii--t~eew~~i~~~I~~~f~~Dn~f~El 450 (516) T protein:vir:10 374 PRDDGGMVIGGQD-TAITRDELDFRKFVVQLQHDFEEIFLDPLKTNLIYKRII--TEDEWDEQINNIKVNFHQDSYYTEL 450 (516) T ss_pred cCCCCceeecccc-chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCC--CHHHHHHHhhcceEEeeecchHHHH Confidence 3222222 33 33333 33 233444444444555543333333322211 1111 123333332 22 Q ss_pred hhHHHHHHHHHHHHhcCCccccHHHHHHHHHH-cCCCCCCCcccccchhhhcCCCcCCCCCCcccccCccccc Q lcl|NC_017974. 420 VDMEGFSKAIQRIGAVGYLPKTPKVINRVLEV-LGIDEKIEEDISQEELMKLLGEDTSRAGDGMTKGSSGNGT 491 (507) Q Consensus 420 ~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~-~Gip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (507) .+.+.+.+++..|..+-=.+-.-...+|+++. |.++..+-.++. ..-.....++--+.++... +. T Consensus 451 Ke~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~e~-k~I~~E~~~~~~~~p~~~~------~f 516 (516) T protein:vir:10 451 KDIETLRLRVDALSQIEPYVGKYVSHDYVMKNILQMTEEQIAQEE-KQIEQEAGIKRFQNPENED------DF 516 (516) T ss_pred HHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhcCCHhhHHHHH-HHHHHhhhCCCCCCCCccc------cC Confidence 44444555555554431111112235677654 677533221111 0111111111000000000 00 No 211 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=77.53 E-value=0.12 Score=25.56 Aligned_cols=380 Identities=7% Similarity=-0.044 Sum_probs=134.4 Q ss_pred ccccCchhhhH-HHHhhhhcccchhccchhHHHHH--------------HHhhChHHHHHHHHHHHHHhhCCceeeeCCC Q lcl|NC_017974. 28 MSEQGTFALAK-VQVDSERMKAEEIRWPNLIGTVE--------------SMKQDATVATGLDMLYTFVEKAFKDFKVIPG 92 (507) Q Consensus 28 ~~~~g~~~~~~-~~~~~~~e~~~~lr~~~~~~~y~--------------~m~~D~~v~s~l~~Rk~av~~~~~~~~~~~~ 92 (507) |.+.+-..|.. +..+. .|..++.+.|+ ++..... ++++==+.+|..+.=++.+.+- T Consensus 1 m~~~~i~~L~~~~~~~~-------~r~~~~~~yy~g~~~~~~~~~~~p~~~~~~~~--~v~nw~~~~Vd~~a~rl~~~Gf 71 (422) T protein:vir:97 1 MNYMGMGYLRRKLALFK-------TGVDKRYRYYAMDDRDDTRSIVMPNNVREMYR--SVLEWTAKGVDSLADRIIFREF 71 (422) T ss_pred CChHHHHHHHHHHHHHH-------HHHHHHHHHHhcCCChhhcCccccHHHHHHHH--hhcchhHHHHHHHHhcccccee Confidence 21111111100 00000 01011111111 1111100 1111111122221112233321 Q ss_pred CChhHHHHHHHHHHHHHhccCCCHHHHHHH-HHHHHhhcceeeeeeeeccCccccCcceeeeeccccCchhcc-cccce- Q lcl|NC_017974. 93 ESEESKKAAKFVEYCLKNMEGQTLRQFARD-AATFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAFRPQASLS-RTNPI- 169 (507) Q Consensus 93 ~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~-~l~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~r~~~~i~-~~~~~- 169 (507) ..+ +.++ .+++.. ..+...... ..+|+-||.|.. .||.-. .+|...+. .+++.... -|.+. T Consensus 72 ~~~-d~~l----~~~w~~---N~ld~~~~~~~~~al~~G~sf~-~v~~~~----~~~~p~i~---~~sp~~~~~i~D~~~ 135 (422) T protein:vir:97 72 TND-DFNA----WEIFKA---NNPDIFFDTAIQSALIASCCFV-YIMPGA----EDGLPKMQ---VIEASKATGILDPTT 135 (422) T ss_pred eCC-chhH----HHHHHh---cChHHHHHHHHHHHHHhcceeE-EEeeCC----CCCeeEEE---EechhhEEEEEeCCC Confidence 111 2222 333332 235544444 357889999765 777532 13333222 22222111 11110 Q ss_pred --------ee--cCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHH-HHHHH Q lcl|NC_017974. 170 --------VY--NEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPL-VGCYR 238 (507) Q Consensus 170 --------~~--~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll-~~~~~ 238 (507) ++ +.+|.... ..-.+..... .....+......+ ..+.--++.|.++.+.+.|+|.|-+ +.+-. T Consensus 136 ~~~~~a~~~~~~~~~~~~~~-~~~~~~~~~~-~~~~~~~~~~~~~----~~g~vPvv~~~n~~~~~~~~G~s~I~e~v~~ 209 (422) T protein:vir:97 136 FLLTEGYAILESDSNGNPTL-EAYFTDKDIW-YYPKKGKPYNIKN----PTGHPLLVPIIHRPDAVRPFGRSRITKAGMY 209 (422) T ss_pred CcceeeEEEEEecCCCcEEE-EEEEcCceEE-EEcCCCccccccC----CCCCcceEEecccCCCccccCccccchhHHH Confidence 11 11111100 0000000000 0000000011111 1122235777888888999998754 32221 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccccc Q lcl|NC_017974. 239 AWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYS 318 (507) Q Consensus 239 ~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~ 318 (507) ..=-=+..+..-+...|=|.+|..++. ++ .......+ .+++.. +....+|.+.+-+. T Consensus 210 l~da~~r~~~~~~~~~e~~a~pqr~i~---G~--d~d~~~~~----~~~~~~----------~~i~~~~~de~~~~---- 266 (422) T protein:vir:97 210 HQKAAKRTLERAEVTAEFYSFPQKYVL---GM--DPDAKPME----KWRATV----------STLLEISKDEDGDK---- 266 (422) T ss_pred HHHHHHHHHHHHHHHHHHhcchhhhhc---cc--CcccccCc----hhhhhh----------hhhhccCCCCCCCc---- Confidence 111111122223334455555554442 11 11011111 111111 12345665543221 Q ss_pred eeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccc--c----c-hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 319 MTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGN--D----K-GGSYNLSESKQTIHTQFVQRVNEIILEALNEN 391 (507) Q Consensus 319 ~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~--~----~-~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~ 391 (507) +++...+++ +...|...++.+-.+|| +.+-+... | . .+.-|+ .....-....++.-.+.+...+. + T Consensus 267 ~~v~q~~~~-~l~~~~~~l~~~~~~~a----~~s~lP~~~lg~~~~NpsSa~Ai-~a~~~~L~~ka~~k~~~fg~~l~-~ 339 (422) T protein:vir:97 267 PTVGQFTTA-SMAPFMEHLKMYASLFA----GGSGLTLDDLGFPSDNPSSVESI-KAAHENLRAAGRKAQRSFSSGFL-N 339 (422) T ss_pred ceeeecCCC-ChhHHHHHHHHHHHHHh----cccCCCHHHhccccCchhHHHHH-HHHHHHHHHHHHHHHHHHHHHHH-H Confidence 122222222 22223333333333333 22211111 1 0 111222 22333333444555566677775 4 Q ss_pred HHHHHHHhCCCcCC-cccc--ceEEecCcCchh---HHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccc Q lcl|NC_017974. 392 LLPQLLALNDMRLP-ETEM--PYVKAGEIVDVD---MEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQE 465 (507) Q Consensus 392 li~~l~~lN~~~~~-~~~~--P~~~~~~~~~~d---l~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~ 465 (507) +++-.+.+...... ...+ ..+.|....+.+ +.+.|+++.||+++|--..+ .+-+++.+|+..+.++-+. T Consensus 340 ~~rla~~~~~~~~~~~~~~~~~~~~w~p~~~~~~~s~a~~aDa~~Kl~~a~~~~~~---~~~~~~~lg~~~~~~~~~~-- 414 (422) T protein:vir:97 340 VAYIAVCLRDEFPYLRNQFMDTVIKWEPLFEADANMLTLVGDGAIKLNQAIPGFMD---ADVIRDLTGVKGADKPIPA-- 414 (422) T ss_pred HHHHHHHHhcCCcccchhhccceEEEccCCCCChHHHHHHHHHHHHHHhhcccccc---HHHHHHHcCCCchhHHHHH-- Confidence 66666666532111 0111 246666443444 66778899999998643333 2568899999544322211 Q ss_pred hhhhcCCCc Q lcl|NC_017974. 466 ELMKLLGED 474 (507) Q Consensus 466 ~~~~~~~~~ 474 (507) .....++. T Consensus 415 -~~~~~~d~ 422 (422) T protein:vir:97 415 -ITEVTTDG 422 (422) T ss_pred -HHhhhccC Confidence 11111111 No 212 >protein:vir:81017 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469501;genbank:gi:157311458;genbank:GeneID:5602316 Probab=76.50 E-value=0.13 Score=25.36 Aligned_cols=451 Identities=12% Similarity=0.069 Sum_probs=172.6 Q ss_pred Cc--------c--cccccccccCccccccccccc-ccccc--ccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhCh Q lcl|NC_017974. 1 MS--------R--KRNRNRSVKVAKATSEQINVS-RMRMS--EQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDA 67 (507) Q Consensus 1 ~~--------~--k~~~~~~~~~~~~~~~~~~~~-~~~~~--~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~ 67 (507) |+ + ++-+.++...+.|.....++. +.... ...+.|....... .+.. ..-..++++.|++|..++ T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~s~~~P~~~dGa~~i~~~~~~~~~~~gg~~~~~~~--~e~~-~~~~~eLI~~YR~ma~~p 84 (521) T protein:vir:81 8 LARWADFDNDKYEEQIKDKAESIAAPKNNDGATEVEINDNLPASAWNSLTQQFYS--TDQK-ISTTKQLVNTYRGLMNNH 84 (521) T ss_pred hHhhcCchhhhHHhhhccCccccccCCCCCCceEecccCCCcceeecceeeeecc--cccc-hhhHHHHHHHHHHHhhcc Confidence 22 2 122233333333333222211 00000 0000111111111 1111 112357999999999999 Q ss_pred HHHHHHHHHHHHHhhCCc---eeeeCCCCChhHH----HHHHHHHHHHHhccCCCHHHHHHHHHH-HHhhcceeeeeeee Q lcl|NC_017974. 68 TVATGLDMLYTFVEKAFK---DFKVIPGESEESK----KAAKFVEYCLKNMEGQTLRQFARDAAT-FNEYGLSVVEKVYT 139 (507) Q Consensus 68 ~v~s~l~~Rk~av~~~~~---~~~~~~~~~~~~~----~~ae~v~~~l~~~~~~~~~~~l~~~l~-a~~~Gfs~~Ei~w~ 139 (507) .|-++++.....+...+- .+.++-.+++-.+ ++.+-++..|+-|+ |..-..++.. -.-.|--.+.++.. T Consensus 85 Evd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll~---F~~~~~~~fR~WYVDgRi~fhkiid 161 (521) T protein:vir:81 85 EVENAVQNIVNDAIVFEEGHEVVSLNLEATGFSESVKERIHEEFKDLLNTIQ---FDRRGQDMFRRWYVDSRIFFHKIIG 161 (521) T ss_pred chhhHHHHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhc---cchhhhHHHhhhhhcceEEEEEEEc Confidence 999999988776532211 0111111122222 34444444444333 2222222221 22245555556554 Q ss_pred ccCccccCcceeeeeccccCchhcccccceee-cCCCcccccccccccccccccccccccc---ccccCCCccccccccE Q lcl|NC_017974. 140 QVTVGEYIGKYKVKNLAFRPQASLSRTNPIVY-NEDGSAIIGIKQSLSAFQNYTASEIGVG---GVSTRMSDVIIPINRV 215 (507) Q Consensus 140 ~~~~~~~~g~~~~~~l~~r~~~~i~~~~~~~~-~~d~~~l~~~~q~~~~~~~~~~~~~~~~---~~~~~~~~~~iP~~k~ 215 (507) ..+ -+|-..++.|- |.+|.+.+-... +.++..+. ..-..++.+.++..... ...+...++.||.+ - T Consensus 162 ~~p---k~GI~Elr~lD---Pr~i~~vr~i~k~~~~~~~v~---~~~~e~f~Y~~~~~~~~~~g~~~~~~~~vkI~~d-A 231 (521) T protein:vir:81 162 KNP---KDGIVELRQLD---PRNLEYVREIITEDTPEGKIY---KATKEYFIYTVGNSSYCAGGQVFSPNSRVKIPRS-A 231 (521) T ss_pred CCc---cccceeeeeeC---CcceeeeeeecccccCcccee---cceeeeeeeecCCccccccceeecCCcceeechh-h Confidence 221 24444444443 344544443322 22222221 11112233222222211 11233445666654 5 Q ss_pred EEEeeCCc--cCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHH Q lcl|NC_017974. 216 MLMNTGGS--SSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDA 293 (507) Q Consensus 216 i~~~~~~~--~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~ 293 (507) |+|+|..- .++..=.|-|.++..++==-+.....-+.+-=--.-.=.++++. +|..+ .....+.|+.++.-. T Consensus 232 I~y~hSGl~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYID--vGnlp----k~KAeqYl~~im~k~ 305 (521) T protein:vir:81 232 ITYAHSGLMDCDDKYIIGYLHRAVKPANQLKLLEDAMVVYRITRAPERRVFFID--TGNMN----NRKAAQHMNSVAQSF 305 (521) T ss_pred eeeeeccceeCCCCeeeecchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEe--cCCCC----chhHHHHHHHHHHhc Confidence 77777543 22223347888888776433333333222210001111244442 22222 233344555555444 Q ss_pred HHh-hcCCcee--------Ee------ecC-----cccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccc Q lcl|NC_017974. 294 ANA-HSGEQSF--------FM------LPS-----DTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFI 353 (507) Q Consensus 294 ~n~-~~G~~a~--------~i------iP~-----g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtL 353 (507) .|= .-.+..| +. +|. |+||.+. .|+.+.. -..=|+|..+.+-+++--..- T Consensus 306 kNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTL---------pGgqnlg-em~DV~YF~kkLy~aLnVP~s 375 (521) T protein:vir:81 306 KNRVVYDASTGKLKNQQANLSMTEDYWLQRRDGKAITDVTTL---------PGASGMS-DIDDIRYFNRKLYEALRVPLS 375 (521) T ss_pred CceeEeecccccccccccccchhhhhcccccCCCcccceeec---------ccCCCCC-hHHHHHHHHHHHHHHhCCccc Confidence 330 0011111 11 222 5665442 2332333 234589999999998877653 Q ss_pred ccc-ccch---hhhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCccc----cceEEecCc------C Q lcl|NC_017974. 354 NVG-NDKG---GSYNLSESKQTIH-TQFVQRVNEIILEALNENLLPQLLALNDMRLPETE----MPYVKAGEI------V 418 (507) Q Consensus 354 ts~-~~~~---GS~Al~~vh~~v~-~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~----~P~~~~~~~------~ 418 (507) -.+ ++.+ +.++..=+..|+. ...+......+...|..-|-..|+-=+.. ..+. .+.+.|+.. | T Consensus 376 Rl~~e~~~~~~~Gr~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgii--t~eew~~i~~~I~~~f~~Dn~f~E 453 (521) T protein:vir:81 376 RSNLSDANMVIGGDGSEITRDELEFSKFIRTRQSQFSEVLRDPLKYNLILKNVI--TEDDWDREINNIKVVFHRDSYYTE 453 (521) T ss_pred cccCCCCcceeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCC--CHHHHHHHhhcceEEEeecchHHH Confidence 332 1211 2233222223332 23344444444444443333333322211 1111 123333332 2 Q ss_pred chhHHHHHHHHHHHHhcCCccccHHHHHHHHHH-cCCCCCCCcccccchhhhcCCCcCCCCCCcccccCccccc Q lcl|NC_017974. 419 DVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEV-LGIDEKIEEDISQEELMKLLGEDTSRAGDGMTKGSSGNGT 491 (507) Q Consensus 419 ~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~-~Gip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (507) -.+.+.+.+++..|..+--.+-.-...+|+++. |.+.+.+-.++. ..-.....++--+.++ . ..++. T Consensus 454 lKe~Eil~~R~~~l~~~dpyvGky~s~dyi~k~ILr~tDeei~~~~-k~I~~E~~~~~~~~p~--~---~~~~f 521 (521) T protein:vir:81 454 VKDAEILERRIGLIERITPYIGKYFSNQTVMRDILKYTDDQMDTEK-KQIEEEANDPRFKQTP--D---EIEDF 521 (521) T ss_pred HHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHHHHHHHH-HHHHHHhhCCCCCCCc--c---cccCC Confidence 344445555665554431111111235677654 566422111111 1111111111111100 0 01111 No 213 >protein:vir:6596 Length: 521 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891727;genbank:gi:33620636;genbank:GeneID:1725288 Probab=74.23 E-value=0.16 Score=24.94 Aligned_cols=450 Identities=11% Similarity=0.063 Sum_probs=173.6 Q ss_pred Cc----------ccccccccccCccccccccccc-cccccccCc---hhhhHHHHhhhhcccchhccchhHHHHHHHhhC Q lcl|NC_017974. 1 MS----------RKRNRNRSVKVAKATSEQINVS-RMRMSEQGT---FALAKVQVDSERMKAEEIRWPNLIGTVESMKQD 66 (507) Q Consensus 1 ~~----------~k~~~~~~~~~~~~~~~~~~~~-~~~~~~~g~---~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D 66 (507) |+ .++-++++...+.|.....+++ +.+.. .++ .|....... .+.. ..-..++++.|++|..+ T Consensus 8 ~~~~~~~d~~~~~e~~~~~~~s~~~p~~~dGa~~i~~~~~-~~~~~~~g~~~~~~~--~e~~-~~~~~eLI~~YR~ma~~ 83 (521) T protein:vir:65 8 LARWADFDNDKYEEQIKDKAESIAAPKNNDGATEVEINDN-SPASSWNSLTQQFYS--TDQK-ISTTKQLVNTYRGLMNN 83 (521) T ss_pred hhhccCchhhHHHhhhccCCCcccCCCCCCCceeecccCC-ccccccccceeeecc--ccch-hhhHHHHHHHHHHHhhc Confidence 22 1222333444444443322221 10100 000 111111111 1111 11235799999999999 Q ss_pred hHHHHHHHHHHHHHhhCCc---eeeeCCCCChhHH----HHHHHHHHHHHhccCCCHHHHHHHHHH-HHhhcceeeeeee Q lcl|NC_017974. 67 ATVATGLDMLYTFVEKAFK---DFKVIPGESEESK----KAAKFVEYCLKNMEGQTLRQFARDAAT-FNEYGLSVVEKVY 138 (507) Q Consensus 67 ~~v~s~l~~Rk~av~~~~~---~~~~~~~~~~~~~----~~ae~v~~~l~~~~~~~~~~~l~~~l~-a~~~Gfs~~Ei~w 138 (507) +.|-++++.....+...+- .+.+.-.+++-.+ ++.+-++..|+-|+ |.....++.. -.-.|--.+.++. T Consensus 84 pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll~---F~~~~~~~fR~WYVDgRi~fhkii 160 (521) T protein:vir:65 84 HEVENAVQNIVNDAIVFEEGHEVVSLNLEATGFSESVKERIHEEFKDLLNTIQ---FDRRGQDMFRRWYVDSRIFFHKII 160 (521) T ss_pred cchhhHHHHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhc---cchhhhHHHhhhhhcceeEEEEEE Confidence 9999999988776532210 0111111122222 34444444444333 2222222221 2224555555655 Q ss_pred eccCccccCcceeeeeccccCchhcccccceee-cCCCcccccccccccccccccccccccc---ccccCCCcccccccc Q lcl|NC_017974. 139 TQVTVGEYIGKYKVKNLAFRPQASLSRTNPIVY-NEDGSAIIGIKQSLSAFQNYTASEIGVG---GVSTRMSDVIIPINR 214 (507) Q Consensus 139 ~~~~~~~~~g~~~~~~l~~r~~~~i~~~~~~~~-~~d~~~l~~~~q~~~~~~~~~~~~~~~~---~~~~~~~~~~iP~~k 214 (507) ...+ -+|-..++.|- |.+|.+.+-... +.++.... ..-..++.+..+..... ...+...++.||.+ T Consensus 161 d~~p---k~GI~ELr~lD---Pr~i~~vr~i~k~~~~~~~v~---~~~~e~f~Y~~~~~~~~~~g~~~~~~~~vkI~~d- 230 (521) T protein:vir:65 161 GKNP---KDGIVELRQLD---PRNLEYVREIITEDTPEGKIY---KATKEYFIYTVGNSSYCAGGQVFSPNSRVKIPRS- 230 (521) T ss_pred cCCc---cccceeeeeeC---CcceeeeeeecccccCCccee---cceeeeeeeecCCcceeccceeecCCcceeechh- Confidence 4221 24444444443 344544443322 22222221 11112223222222111 11233445666654 Q ss_pred EEEEeeCCc--cCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHH Q lcl|NC_017974. 215 VMLMNTGGS--SSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSD 292 (507) Q Consensus 215 ~i~~~~~~~--~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~ 292 (507) -|+|+|..- .++..=.|-|.++..++==-+.....-+.+-=--.-.=.++++. +|..+ .....+.|+.++.- T Consensus 231 AI~y~hSGl~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYID--vGnlP----k~KAeqYl~~im~k 304 (521) T protein:vir:65 231 AITYAHSGLMDCDDKYIIGYLHRAVKPANQLKLLEDAMVVYRITRAPERRVFFID--TGNMN----NRKAAQHMNSVAQS 304 (521) T ss_pred heeeeeccceeCCCCeeeecchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEe--cCCCC----chhHHHHHHHHHHh Confidence 477777543 22333348888888776443433333332210001111244442 22222 22334455555544 Q ss_pred HHHh-hcCCcee--------E------eecC-----cccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhccc Q lcl|NC_017974. 293 AANA-HSGEQSF--------F------MLPS-----DTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGF 352 (507) Q Consensus 293 ~~n~-~~G~~a~--------~------iiP~-----g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqt 352 (507) ..|= .-.+..| + -+|. |+||.+. .|+.+.. -..=|+|..+.+-+++--.. T Consensus 305 ~kNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTL---------pGgqnlg-em~DV~YF~kkLy~aLnVP~ 374 (521) T protein:vir:65 305 FKNRVVYDASTGKLKNQQANLSMTEDYWLQRRDGKAITDVTTL---------PGASGMS-DIDDIRYFNRKLYEALRVPL 374 (521) T ss_pred cCceeEeecccccccccccccchhhhhcccccCCCCccceeec---------ccCCCcC-hHHHHHHHHHHHHHHhCCCc Confidence 4330 0011111 1 1222 5665542 2332332 23458999999999887765 Q ss_pred cccc-ccchh---hhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCccc----cceEEecCc------ Q lcl|NC_017974. 353 INVG-NDKGG---SYNLSESKQTIH-TQFVQRVNEIILEALNENLLPQLLALNDMRLPETE----MPYVKAGEI------ 417 (507) Q Consensus 353 Lts~-~~~~G---S~Al~~vh~~v~-~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~----~P~~~~~~~------ 417 (507) --.+ ++.+| .++..=+..|+. ...+......+...|..-|-..|+-=+.. ..+. .+.+.|+.. T Consensus 375 sRl~~e~~~~~~~gr~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgii--t~eew~~i~~~I~~~f~~Dn~f~ 452 (521) T protein:vir:65 375 SRSNLSDANMVIGGDGSEITRDELEFSKFIRTLQSQFSEVLRDPLKYNLILKNVI--TEDDWDREINNIKVVFHRDSYYT 452 (521) T ss_pred eeccCCCCcceeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCC--CHHHHHHHhhcceEEeeecchHH Confidence 3321 22222 233222223332 23344444444444443333333322211 1111 123333332 Q ss_pred CchhHHHHHHHHHHHHhcCCccccHHHHHHHHHH-cCCCCCCCcccccchhhhcCCCcCCCCCCcccccCccccc Q lcl|NC_017974. 418 VDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEV-LGIDEKIEEDISQEELMKLLGEDTSRAGDGMTKGSSGNGT 491 (507) Q Consensus 418 ~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~-~Gip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (507) |-.+.+.+.+++..|..+--.+-.-.+.+|+++. |.+.+.+-.++. ..-.....++--+.. .. ..++. T Consensus 453 ElKe~Eil~~R~~~l~~~dpyvGky~S~dyi~k~ILr~tDeei~~~~-k~I~~E~~~~~~~~p--~~---~~~~f 521 (521) T protein:vir:65 453 EVKDAEILERRIGLIERITPYIGKYFSNQTVMRDILKYTDDQMDTEK-KQIEEEANDPRFKQT--PD---EIEDF 521 (521) T ss_pred HHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHHHHHHHH-HHHHHhhhCCCCCCC--cc---cccCC Confidence 2344445555665554431111112235777654 566422111110 111111111111100 00 01111 No 214 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=73.60 E-value=0.17 Score=24.83 Aligned_cols=426 Identities=12% Similarity=0.072 Sum_probs=150.8 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHH-HHHHHhhChHHHHHHHHH-HH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIG-TVESMKQDATVATGLDML-YT 78 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~-~y~~m~~D~~v~s~l~~R-k~ 78 (507) |.+|-+..+..+-..- ...+.+++..+ ..+ ..++.. +.-..+.+.....-. ....-..---+...+-+. -. T Consensus 14 ~~~~~~~~~~~~~i~d-~~~i~~~~~~~---~~i--~~~~~~-Y~g~~~~l~~~~~~~~~~~~~~~slnl~~~i~~~~A~ 86 (505) T protein:vir:79 14 GSAAVGMTKSLGQIID-DPRINLPADEV---ERI--ARDKRY-YMDDFKQVTHKNSYGDTQKHELQSVNVTKLASAKLAS 86 (505) T ss_pred hhhhhcchhhhhhhhc-ccCCCCCHHHH---HHH--HHHHHH-hcCCCccccccccCCCccccceeecchHHHHHHHHHh Confidence 3333333332210000 00111111111 111 111111 111111110000000 000000000122222222 22 Q ss_pred HHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHH-HHHHhhcceeeeeeeeccCccccCcceeeeeccc Q lcl|NC_017974. 79 FVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDA-ATFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAF 157 (507) Q Consensus 79 av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~-l~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~ 157 (507) .|.+-+-.|.++ +.+..++++++++.- .|...+... ..|..+|=.++=+.|.. |.+.+. . T Consensus 87 ll~~e~~~i~~~------d~~~~e~l~~i~~~n---~f~~~~~~~~e~a~a~G~~~~k~~~D~-------~~~~i~---~ 147 (505) T protein:vir:79 87 LIFNEQCQVTVS------DETANDFLDDVFQQN---DFYTTFEEKLEEWIALGSGCVRPYVDS-------GKIKLA---W 147 (505) T ss_pred hhcCCCceeecC------ChHHHHHHHHHHHhc---cHHHHHHHHHHHHhhcCCeEEEEEEeC-------CceEEE---E Confidence 222222223332 356778888888652 366665554 46777888877666642 222222 1 Q ss_pred cCchhcccccceeecCCCcccc-cc---cc--------------c-----c-----ccccccccccccccc-------cc Q lcl|NC_017974. 158 RPQASLSRTNPIVYNEDGSAII-GI---KQ--------------S-----L-----SAFQNYTASEIGVGG-------VS 202 (507) Q Consensus 158 r~~~~i~~~~~~~~~~d~~~l~-~~---~q--------------~-----~-----~~~~~~~~~~~~~~~-------~~ 202 (507) .++.++. |.+++.++.... -. .+ + + ..+.......++... +. T Consensus 148 v~ad~~~---P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~~ 224 (505) T protein:vir:79 148 ATADQVY---PLQADTNQVNELAIASRTTEVENHRTIYYTLLEFHQWDHGDYVITNELYRSEAAETVGINVPLNSLEQYE 224 (505) T ss_pred EcCCeeE---EEEEcCCCeEEEEEEEEEEEecCCcceEEEEEEEEEecCceEEEEEEEEecCCCCccCcccchhhccccc Confidence 2222221 111221111000 00 00 0 0 000000000000000 00 Q ss_pred cCCCccc---cccccEEEEee----CCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCC Q lcl|NC_017974. 203 TRMSDVI---IPINRVMLMNT----GGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAA 275 (507) Q Consensus 203 ~~~~~~~---iP~~k~i~~~~----~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~ 275 (507) .....+. ++..=|.+++. .....+|+|.|.+..+--..-.=+..+..|+.-.+. |-. -+.+|..+.+... T Consensus 225 ~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~-g~~--~i~v~~~~l~~~~ 301 (505) T protein:vir:79 225 GLEPQVKITGLKHPLFAFYRNKGANNKNFTSPMGMSLIDNSYTVIDAINRTHDQFVDEVKK-GQR--RLIVPAEWLKTGS 301 (505) T ss_pred ccCcceeecCCCcceEEEecCCcccccccCCccCCchhhhhHHHHHHHHHHHHHHHHHHHh-ccc--ceeechHHhcccC Confidence 0000000 11112334432 334578999999998875443333333334333331 211 2233433322111 Q ss_pred CCch---HHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHH-hcc Q lcl|NC_017974. 276 MDQS---SPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRL-GAG 351 (507) Q Consensus 276 ~~~~---~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~i-LGq 351 (507) .... ........ .....+..+..+..-. .++.....- -...|.+.++.+=++|+... +++ T Consensus 302 ~~~~~~~~~~~~~fd----------~~~~~y~~~~~~~~~~----~i~~~~~~i--r~e~~~~~l~~~l~~i~~~~g~s~ 365 (505) T protein:vir:79 302 SYGGQASETHPPMFD----------PDETVYQAMYGDASEV----GFHDATSPI--RVADYQATMDFFLREFENQTGLSQ 365 (505) T ss_pred CCCcccccccccCCC----------ccceeeeeccCCCCCC----ceEEecccC--CHHHHHHHHHHHHHHHHHHhCCCh Confidence 1000 00000000 0000111111100000 011111100 01124444455555554433 332 Q ss_pred cccccccchhhhhHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC---CCcCC-------c--cccceEEecCc Q lcl|NC_017974. 352 FINVGNDKGGSYNLSESK--QTIHTQFVQRVNEIILEALNENLLPQLLALN---DMRLP-------E--TEMPYVKAGEI 417 (507) Q Consensus 352 tLts~~~~~GS~Al~~vh--~~v~~~~~~aD~~~i~~~ln~~li~~l~~lN---~~~~~-------~--~~~P~~~~~~~ 417 (507) . |.+.+..|....-++. +.-...-+..-.+.+...| ++|++.++.+- ..+.. + ..-+.|.|+.. T Consensus 366 ~-~~~~~~~~~~TAtei~s~~~~l~~t~~~~~~~~~~al-~~li~~i~~~~~~~~~~~~g~~~~~~~~~~~~i~v~f~d~ 443 (505) T protein:vir:79 366 G-TFTTSPSGIQTATEVVTNNSQTYQTRSSYITQVEKTI-KALTYAILELASVPSFYADGQARWTGDVDSLDITINFNDG 443 (505) T ss_pred h-hcCCCccccchHHHHHHHHhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhcccccccccccCCCCceeEEEEeCCC Confidence 2 2222223332222332 1222333444555667777 45777776532 11110 0 11245788888 Q ss_pred CchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccchh---hhcCCCcCCCCCC Q lcl|NC_017974. 418 VDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQEEL---MKLLGEDTSRAGD 480 (507) Q Consensus 418 ~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~~~---~~~~~~~~~~~~~ 480 (507) -..|.++.++...+++.+|+... +.++.+.+|+++.+..++..... +...|+...-.|+ T Consensus 444 i~~d~~~~~~~~~~~v~~Gi~s~----e~~l~~~~~~~eeea~~el~ri~~E~~~~~p~~~~~gg~ 505 (505) T protein:vir:79 444 VFVDQESKRAADLQAVQAQVMPK----KQFLMRNYGLDEEEADEWLAQIDAENSTAEPEFNQFGGD 505 (505) T ss_pred CCCCHHHHHHHHHHHHHcCCCCH----HHHHHhcCCCChHHHHHHHHHHHHhccccCCCchhccCC Confidence 88888888899999999998654 57888899997543333222211 1112222221122 No 215 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=73.47 E-value=0.17 Score=24.80 Aligned_cols=394 Identities=10% Similarity=-0.019 Sum_probs=154.6 Q ss_pred ccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHH---H----------------h-----hChHHHH Q lcl|NC_017974. 16 ATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVES---M----------------K-----QDATVAT 71 (507) Q Consensus 16 ~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~---m----------------~-----~D~~v~s 71 (507) .+.. ....-+.-+. +..+ |..++.+.|+- + . -.....- T Consensus 1 l~~~--------------~i~~~i~~~~--~~~~--r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~ 62 (451) T protein:vir:10 1 MELE--------------KIRAIISADA--ARRQ--EILQAKSYYYNKNDILKKGVVVQNRDENPLRNADNRISHNFHEI 62 (451) T ss_pred CCHH--------------HHHHHHHHHH--HHHH--HHHHHHHHhcccCccccccccccccccccccccccccccchHHH Confidence 0000 0000000000 0000 00111111111 0 0 0122222 Q ss_pred HHHHHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHH-HHHHhhcceeeeeeeeccCcc---ccC Q lcl|NC_017974. 72 GLDMLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDA-ATFNEYGLSVVEKVYTQVTVG---EYI 147 (507) Q Consensus 72 ~l~~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~-l~a~~~Gfs~~Ei~w~~~~~~---~~~ 147 (507) ++.....-+.+-+..+.. ..+++..++++.+++ ..+.+....+ .++.-||.+. +++|...... ... T Consensus 63 Ivd~~~~yl~G~p~~~~~-----~~~~~~~~~~~~~~~----n~~~~~~~~~~~~~~~~G~a~-~~~y~de~~~~~~~~~ 132 (451) T protein:vir:10 63 LVDEKASYMFTYPVLFDI-----DNNKELNEKVTDVLG----NEFTRKAKNLAIEASNCGSAW-LHYWIDEEYSGEQVTN 132 (451) T ss_pred HHHhhhhheecccceeec-----CCcHHHHHHHHHHhc----cCHHHHHHHHHHHHhhcCeEE-EEEeecCCcccccccc Confidence 333333333333333322 223455566766654 2466666665 4577799776 4666533210 112 Q ss_pred cceeeeeccccCchhcccccc-----------eeec-CC--Ccc----cccccc-ccccccccccccccccccccCCCc- Q lcl|NC_017974. 148 GKYKVKNLAFRPQASLSRTNP-----------IVYN-ED--GSA----IIGIKQ-SLSAFQNYTASEIGVGGVSTRMSD- 207 (507) Q Consensus 148 g~~~~~~l~~r~~~~i~~~~~-----------~~~~-~d--~~~----l~~~~q-~~~~~~~~~~~~~~~~~~~~~~~~- 207 (507) |.+.++-+.|+ .++.-|.. ++.. .+ +.. +..+.- .+.....+................ T Consensus 133 ~~~~~~~i~p~--~~~~vydd~~~~~~~~~ir~~~~~~~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~ 210 (451) T protein:vir:10 133 QTFKYGVVNTE--EIIPIYRNGIERELEAVIRYYIQLEDVKGQIQKQAYTYVEFWTDKILDKYKFFGVSCCGSQIEHITV 210 (451) T ss_pred cceeEEEEccc--ceEEEEcCCCCCceEEEEEEEEeeecccccccceEEEEEEEEeCCeEEEEEecccCccccccccccc Confidence 44444444333 12211100 0000 00 000 000000 000000000000000000000001 Q ss_pred -cccccccEEEEeeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHH Q lcl|NC_017974. 208 -VIIPINRVMLMNTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMV 286 (507) Q Consensus 208 -~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l 286 (507) ..++.-.++.|. .|..|.|.+..+-...=-=+..+...+..++.+..|+++++ ++ ...+.. +.. T Consensus 211 ~~~~g~vPvv~~~-----nn~~~~~d~e~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~---g~---~~~~~~-~~~--- 275 (451) T protein:vir:10 211 QHRFNSVPFVEFS-----NNIKKQSDLSKYKKILDLYDRVMSGFANDLEDIQQIIYILE---NF---GGEDTS-EFL--- 275 (451) T ss_pred cCCCCeeeEEEec-----cCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhccceeeee---cC---Ccccch-hhH--- Confidence 111111123332 35567888877554443345567788888888988888765 11 111111 111 Q ss_pred HHHHHHHHHhhcCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHH Q lcl|NC_017974. 287 RGLMSDAANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLS 366 (507) Q Consensus 287 ~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~ 366 (507) ..+ . ....+.++...+- .+..++++..... ...+...++++.+.|...--+..++.+. .| .+.| T Consensus 276 ~~~-------~--~~~~i~~~~~~~~--~~~~~~~l~~~~~--~~~~~~~~~~l~~~I~~~s~~p~~~~~~--~g-n~Sg 339 (451) T protein:vir:10 276 KEL-------K--RYKTIKTETDSEG--DSGGLKTMQIEIP--TEARKIILEILKKQIYESGQGLQQDTEN--FG-NASG 339 (451) T ss_pred HHH-------h--hCCeEEecCcCCc--cCCcceEEeecCC--HHHHHHHHHHHHHHHHHHhCcccccccc--cc-cccH Confidence 111 1 1123444432211 1123445444332 2346778999999998876555444322 22 2222 Q ss_pred HHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCccccceEEecCcCchhHHHHHHHHHHHHhcCCccccHH Q lcl|NC_017974. 367 ESKQ---TIHTQFVQRVNEIILEALNENLLPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPK 443 (507) Q Consensus 367 ~vh~---~v~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~ 443 (507) ..-. .-....+..-.+.+...| +++++.++.+.+.. +..-..+.|....+.|..+.++++.+|+ |. ++. T Consensus 340 ~Alk~~~~~l~~k~~~k~~~f~~~l-~~~~~li~~~~~~~--d~~~i~i~f~~~~p~n~~e~~~~~~kl~--g~-iS~-- 411 (451) T protein:vir:10 340 VALKFFYRKLELKSGLLETEFRTSF-DKLIKAILYFLGVT--DYKKIQQTYTRNMMSNDLEDADIATKSV--GI-IPT-- 411 (451) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhCCC--CccceeEEecCCCCCCHHHHHHHHHHHh--cc-Cch-- Confidence 2111 112222333344555566 34666777654321 2223467899999999999999999985 64 442 Q ss_pred HHHHHHHHcCCCCCCCccc-ccchhh----hcCCCcCCCCCC Q lcl|NC_017974. 444 VINRVLEVLGIDEKIEEDI-SQEELM----KLLGEDTSRAGD 480 (507) Q Consensus 444 ~~~~i~e~~Gip~~~~~~~-~~~~~~----~~~~~~~~~~~~ 480 (507) +.+.+.++.-....++. ...... ....+.-++.++ T Consensus 412 --et~~~~~p~v~d~~~e~~~~~ee~~~~~~~~~~~~~~~~~ 451 (451) T protein:vir:10 412 --KIILRHHPWVDDVEEAEKLYLEEKKIQASKVSDDYNNFTE 451 (451) T ss_pred --HHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcCCCCC Confidence 44555565532111111 000000 001111111111 No 216 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=70.85 E-value=0.2 Score=24.38 Aligned_cols=372 Identities=9% Similarity=-0.006 Sum_probs=140.5 Q ss_pred ccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhCh----HHHHHHHHHHHHHhhCCc----------eeeeCCCC Q lcl|NC_017974. 28 MSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDA----TVATGLDMLYTFVEKAFK----------DFKVIPGE 93 (507) Q Consensus 28 ~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~----~v~s~l~~Rk~av~~~~~----------~~~~~~~~ 93 (507) |..-. + .++.... .....|..++.+.|+--.... .+-..+..+.+.+ ++| .+.+++-. T Consensus 1 ~~~~~---i--~~L~~~~-~~~~~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v--~nw~~~iVds~a~rl~~~Gf~ 72 (409) T protein:vir:16 1 MTEKG---I--GYLRFKL-SVHKRRAEMRYEQYAMKHVDRFKGITIPQALSQQYRSI--LGWCAKGVDSLADRLVFREFE 72 (409) T ss_pred CCHHH---H--HHHHHHH-HHHhHHHHHHHHHHhccCchhhcchhhhHHHHHHHhhh--cChhHHHHHHhHhhccccccc Confidence 22111 1 0111100 001112223333344211121 2223333333333 233 12223211 Q ss_pred ChhHHHHHHHHHHHHHhccCCCHHHHHHH-HHHHHhhcceeeeeeeeccCccccCcceeeeeccccCchhcccccc---- Q lcl|NC_017974. 94 SEESKKAAKFVEYCLKNMEGQTLRQFARD-AATFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAFRPQASLSRTNP---- 168 (507) Q Consensus 94 ~~~~~~~ae~v~~~l~~~~~~~~~~~l~~-~l~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~r~~~~i~~~~~---- 168 (507) . .+.. +.++++. ..|.....+ ..+|+-||.|.. .+|..+ +|...+.-+.|+. .+.-|.+ T Consensus 73 ~-~d~~----l~~i~~~---N~ld~~~~~~~~~al~yG~sf~-~v~~~~-----dg~~~i~~~sP~~--~~~i~D~~~~~ 136 (409) T protein:vir:16 73 N-DDFT----VNEIFEE---NNPDIFFDSTVLSALIASCSFT-YISKGE-----NDAVRLQVIEATN--ATGIIDPITGL 136 (409) T ss_pred C-cchH----HHHHHHh---cChhHHHHHHHHHHHHhCceeE-EEecCC-----CCceEEEEEcccc--eEEEeeccccc Confidence 1 1222 3333332 235554444 447889999765 888643 4444333222221 1111111 Q ss_pred ------eeec-CCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCCccCccccchHH-HHHHHHH Q lcl|NC_017974. 169 ------IVYN-EDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPL-VGCYRAW 240 (507) Q Consensus 169 ------~~~~-~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll-~~~~~~~ 240 (507) ++.. .++.........+.....+.. ..+. +... ...++...++-|.++.+.+.|+|.|-+ +.+-... T Consensus 137 ~~~a~~~~~~d~~~~~~~~~~~~~~~~~~~~~-~~~~--~~~~--~~~~g~vPvV~f~n~~~~~~~~G~seI~~~v~~l~ 211 (409) T protein:vir:16 137 LTEGYAVLERDENNNVVLEAHFLPDRTDYYYR-DSRN--NISI--ANPTGNPLLVPIIHRPDAVRPFGRSRITRSGMYWQ 211 (409) T ss_pred ceeeeEEEEecCCCceEEEEEEecCcEEEEEe-cCcc--ccce--ecCCCCcceEEecccccccccCCccccchhHHHHH Confidence 1111 111110000000110000000 0000 0011 122223346778888888899998744 3222111 Q ss_pred HHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCccccccccccee Q lcl|NC_017974. 241 REKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYSMT 320 (507) Q Consensus 241 ~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~~e 320 (507) =-=+..+..-+.-.|=|.+|-.++. +.. ......+ .+++.. +....+|.+.+-+. .+ T Consensus 212 da~~r~~~~~~~~~e~~a~pqr~i~---G~d--~d~~~~~----~~~~~~----------~~i~~~~~d~~g~~----~~ 268 (409) T protein:vir:16 212 SNAKRTLERADVTAEFYSFPQKYVT---GLS--DDAEPME----TWKATV----------SSMLQFTKDEDGDK----PT 268 (409) T ss_pred HHHHHHHHHHHHHHHHhcChhheeE---ecC--CCCCccc----hhhhhh----------hHhhccCCCCCCCC----ce Confidence 1111122222334455566655553 110 0011111 122211 12355665433111 12 Q ss_pred eeeccCCCcchhHHHHHHHHHHHHHHHHhcccccc-c-c-cc-hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 321 LKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINV-G-N-DK-GGSYNLSESKQTIHTQFVQRVNEIILEALNENLLPQL 396 (507) Q Consensus 321 l~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts-~-~-~~-~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~li~~l 396 (507) +.+.+++ +...|...++-+-.++|-. .+=.+.. + . .. .+.-|+..-+. -.....+.-.+.+...+. ++.+-. T Consensus 269 v~q~~~~-~l~~~~~~l~~~~~~~a~~-s~lP~~~lg~~~~NpsSa~Ai~a~~~-~L~~ka~~k~~~fg~~l~-~~~rla 344 (409) T protein:vir:16 269 LGQFTQP-SMSPFTEQLRTAAAGFAGE-TGLTLDDLGFVSDNPSSVEAIKASHE-NLRLAGRKAQRSLGAGLL-NVAYLA 344 (409) T ss_pred EEecCCC-ChhHHHHHHHHHHHHHhhh-cCCCHHHcccccCchhHHHHHHHHHH-HHHHHHHHHHHHHHHHHH-HHHHHH Confidence 2222222 2223444444444444432 1100100 0 0 01 12223332222 222233334455666664 466665 Q ss_pred HHh-CC-CcCCcc-ccceEEecCcCc---hhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCC Q lcl|NC_017974. 397 LAL-ND-MRLPET-EMPYVKAGEIVD---VDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKI 458 (507) Q Consensus 397 ~~l-N~-~~~~~~-~~P~~~~~~~~~---~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~ 458 (507) +.+ +. ...+++ .--++.|....+ ..+.+.|+++.||+.+|....+ .+.+++.+|+.... T Consensus 345 ~~~~~~~~~~~~~~~~~~v~W~~~~~~~~~s~a~~aDa~~Kl~~a~~~~~~---~~v~~~~~g~~~~d 409 (409) T protein:vir:16 345 ACLRDDVPYLREQFSKTKPKWEPLFEADASMLSLIGDGAIKLNQAIPEFIN---KDTIRDLTGIKGAE 409 (409) T ss_pred HHHhcCCCccchhhccceEEecCCCCcchhhHHHHHHHHHHHHhhcccccc---hhHHHHhccCCCCC Confidence 554 31 111111 112456765543 4467888999999999865543 24678999997654 No 217 >protein:vir:78083 Length: 537 # NCBI annotation: gp3 # Family: family:all:125 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468787;genbank:gi:157325368;genbank:GeneID:5601845 Probab=69.47 E-value=0.22 Score=24.16 Aligned_cols=413 Identities=8% Similarity=-0.012 Sum_probs=155.4 Q ss_pred ccccccccccCchh---hhHHHHhhhhcccchhccchhHHHHHH---Hh------------------------hChHHHH Q lcl|NC_017974. 22 NVSRMRMSEQGTFA---LAKVQVDSERMKAEEIRWPNLIGTVES---MK------------------------QDATVAT 71 (507) Q Consensus 22 ~~~~~~~~~~g~~~---~~~~~~~~~~e~~~~lr~~~~~~~y~~---m~------------------------~D~~v~s 71 (507) -+|+.-..++-.++ ..-+.-+......+.+ ..+.+.|+- ++ ......- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~--~~~~~YY~g~h~Il~r~~~~~~~~~~~~~d~~~~nnki~~nf~k~ 78 (537) T protein:vir:78 1 MTSPLLNKPIDQLGGLLNTEITTYMASNHIKWA--HIGENYYNQENDIEKSRIFYMNDKGQLREDNYASNVKISHGFFTE 78 (537) T ss_pred CCcccccccHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHhcccchhhhcccccccccccccccccccccccccchHHH Confidence 11111111111111 0101011000000000 011111110 00 1223333 Q ss_pred HHHHHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHH-HHHHHhhcceeeeeeeeccCccccCcce Q lcl|NC_017974. 72 GLDMLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARD-AATFNEYGLSVVEKVYTQVTVGEYIGKY 150 (507) Q Consensus 72 ~l~~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~-~l~a~~~Gfs~~Ei~w~~~~~~~~~g~~ 150 (507) .+.+...-+.+-+..|... +..++++.+.+++++. ..|.+...+ +.++.-||.+ .|++|... +|.+ T Consensus 79 Ivd~~~~yl~G~Pv~~~~~---d~~~~e~~~~l~~~~~----~~~~~~~~el~~~~s~~G~a-y~~~y~de-----~~~~ 145 (537) T protein:vir:78 79 LVDQLAQYLLSNGVEVKVK---DEDNTQLDEILQEYFD----EDFQATIDTLVTNASKKGFE-GIFARTTS-----EGKL 145 (537) T ss_pred HHHHHhhhhcccCceeecC---cchhHHHHHHHHHHhh----ccHHHHHHHHHHHHhhcCee-EEEeeecC-----CCce Confidence 4444444455555455432 2334566666766653 235555444 4567789986 55777533 4566 Q ss_pred eeeeccccCchhcccccceeecCCCcccccc-ccc--------------ccccccccccccccccc-----------ccC Q lcl|NC_017974. 151 KVKNLAFRPQASLSRTNPIVYNEDGSAIIGI-KQS--------------LSAFQNYTASEIGVGGV-----------STR 204 (507) Q Consensus 151 ~~~~l~~r~~~~i~~~~~~~~~~d~~~l~~~-~q~--------------~~~~~~~~~~~~~~~~~-----------~~~ 204 (507) .++.+.|.. .| + .|+..+ .+... +.. ...+..++......... ... T Consensus 146 ~~~~i~p~~--~~----p-v~d~~~-~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~i~~y~~~~~~~~~~~~~~~~ 217 (537) T protein:vir:78 146 KFQTVDGLT--LI----P-VFDDYG-VLKMIIRWYSEIRYSTKQQSTETIWHADVWNEEAVCYYIQDDEGVSTTYKLDEA 217 (537) T ss_pred EEEEEccce--eE----E-EEcCCC-CceeEEEEEeeeeccccccCcceEEEEEEEcCCcEEEEEecCCccccccccccc Confidence 655544321 11 1 112111 11100 000 00000011100000000 000 Q ss_pred CCcccc-------------------------cccc--EEEEeeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_017974. 205 MSDVII-------------------------PINR--VMLMNTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKD 257 (507) Q Consensus 205 ~~~~~i-------------------------P~~k--~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~ 257 (507) .....+ |..+ ++.|+ .|..|.|.+..+-...=-=+..+..-+..++.| T Consensus 218 ~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~f~-----nn~~~~sd~e~v~~LiDayd~~~S~~an~~~~~ 292 (537) T protein:vir:78 218 YNPNPAPHVLAIEESTDADFEDTDGYQVLGRSYSKFPFQLLY-----NNKDGMSDVKRVKSIIDDYDVMNCFLSNNLQDF 292 (537) T ss_pred ccccccceeeeccccccccccccccccccccCCcceeEEEec-----cCccCCCchhhhHHHHHHHHHHHHhhhhHHHHh Confidence 000000 0011 22222 356678888876555444455667778888999 Q ss_pred cCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeec-CcccccccccceeeeeccCCCcchhHHHH Q lcl|NC_017974. 258 MGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLP-SDTKDNAPQYSMTLKGIDGMGKQYSTAQL 336 (507) Q Consensus 258 g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP-~g~~i~~~~~~~el~~~~g~g~~~~~~~l 336 (507) .-|+++++- + ..+..++. ...+. +.. .+-++ .+.++ +++..... ...+..+ T Consensus 293 ~~~ilvi~g------~-~~~~~~~~---~~~l~--------~~~-~i~v~~d~~~v-------~~l~~~~~--~~~~e~~ 344 (537) T protein:vir:78 293 SEAIYVVKG------F-SGDSTDKL---RQNIK--------AKK-MIGVNGDNAGM-------EIQTVSIP--YEARKAK 344 (537) T ss_pred cCceeeeec------C-CCccchhH---HHHHh--------hcC-ceeecCCCCce-------eEEEecCC--HHHHHHH Confidence 999888761 1 11111111 11111 111 12232 33333 33333322 2235678 Q ss_pred HHHHHHHHHHHHhcccccccccchhhhhHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHH---hCCCcCCcccc Q lcl|NC_017974. 337 ISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQTIHTQFVQR----VNEIILEALNENLLPQLLA---LNDMRLPETEM 409 (507) Q Consensus 337 i~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~~v~~~~~~a----D~~~i~~~ln~~li~~l~~---lN~~~~~~~~~ 409 (507) ++++.+.|-+. +++..+.+. .+|.+.| +..+.+...... -.+.+...|.+ +++.++. +.+..--+..- T Consensus 345 ld~L~~~I~~~--s~~~~~~~~-~~gn~SG-vAlk~~~~~l~~ka~~ke~~f~~~l~~-~~~~i~~~~~~~~~~~~d~~~ 419 (537) T protein:vir:78 345 MDIDVENIYRS--GMGFNSTAV-GDGNVTN-VVIKSRYTLLAMKARKMETSLRKVLRW-CADMVVSDIALRGLGEYDSND 419 (537) T ss_pred HHHHHHHHHHh--cCCCCCccc-cccCCcH-HHHHHHHhhHHHHHHHHHHHHHHHHHH-HHHHHHHHHhhcCCcccccce Confidence 89999988764 343333222 2333323 333333222222 33334444533 3333433 22110012234 Q ss_pred ceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCC---------------------Cccccc---- Q lcl|NC_017974. 410 PYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKI---------------------EEDISQ---- 464 (507) Q Consensus 410 P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~---------------------~~~~~~---- 464 (507) ..+.|...-+.|..+.++.+.+|++.|.+.. +-+.+.++.-... .+...+ T Consensus 420 i~i~f~~~~P~n~~e~a~~~~~l~~~giiS~-----eT~l~~~p~vdd~e~ek~~~ee~~~~~~~~~~~~~~~~~~~~~~ 494 (537) T protein:vir:78 420 ICFEIEPHVLANELDIATTRKTEAETEALKI-----GNIMTVAPRIGDDETLKLIAEELDLDYNELKDALAEQDAQSLDV 494 (537) T ss_pred eeEEeccCCCCCHHHHHHHHHHHHhcCcchH-----HHHHHhCCCCCCHHHHHHHHHHHHhhhhhhhhhhhhhcccccCc Confidence 5788999999999999999999999986432 2222333221100 000000 Q ss_pred -----chhhhcCCCcCCCCCCcc-cccCcccccccccC-CCCc Q lcl|NC_017974. 465 -----EELMKLLGEDTSRAGDGM-TKGSSGNGTGKISS-TRDN 500 (507) Q Consensus 465 -----~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~-~~d~ 500 (507) +.......++.+++.+.. +.+...+++...|. .|.+ T Consensus 495 ~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~ 537 (537) T protein:vir:78 495 SPDVQAMLDGLPVNANQPPVDPNQPVADPNVVPPTDPNAVPQT 537 (537) T ss_pred CcchhhhcCCCCCCCCCCCCCccCCCCCCCCCCCCCCccCCCC Confidence 000000000011111110 00001111111111 1111 No 218 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=69.45 E-value=0.22 Score=24.16 Aligned_cols=425 Identities=8% Similarity=-0.025 Sum_probs=149.1 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTFV 80 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~av 80 (507) +++-+.+. .++..... +.-.+.+.- +....... .+..+..| . -.....-++.....-+ T Consensus 49 i~~~~~~~----~~r~~~l~----~Yy~g~~~i--l~~~~~~~-~~~~~~~k---i--------~~n~~k~Iv~~~~~yl 106 (511) T protein:vir:78 49 IEHHMDYQ----RPRLKVLS----DYYEGKTKN--LVELTRRK-EEYMADNR---V--------AHDYASYISDFINGYF 106 (511) T ss_pred HHHHHHhh----hHHHHHHH----HHhhccCcc--ccccCccc-ccccCcce---e--------ecchHHHHHHHHhhhh Confidence 11111000 00000000 000000000 00000000 00000000 0 0122222223333333 Q ss_pred hhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeeeeccccC Q lcl|NC_017974. 81 EKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAFRP 159 (507) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~r~ 159 (507) .+-+..|... ++++.+++.++++.. .+..+..++. ++.-||.+ ++++|... +|.+.++-+.|+. T Consensus 107 ~g~p~~~~~~------d~~~~~~l~~~~~~n---~~~~~~~~~~~~~~~~G~a-~~~vy~d~-----dg~~~i~~~~p~~ 171 (511) T protein:vir:78 107 LGNPIQYQDD------DKDVLEAIEAFNDLN---DVESHNRSLGLDLSIYGKA-YELMIRNQ-----DDETRLYKSDAMS 171 (511) T ss_pred cccCceeecC------chHHHHHHHHHHhhc---ChhHHHHHHHHHHHhcCee-EEEEEeCC-----CCceEEEEEcccc Confidence 3443334321 234556677776542 3555555544 57779974 56888532 4666665544331 Q ss_pred chhcccccceeecCC--Ccccccccccc------------cccccccccccccc-----------ccccCCCcccccccc Q lcl|NC_017974. 160 QASLSRTNPIVYNED--GSAIIGIKQSL------------SAFQNYTASEIGVG-----------GVSTRMSDVIIPINR 214 (507) Q Consensus 160 ~~~i~~~~~~~~~~d--~~~l~~~~q~~------------~~~~~~~~~~~~~~-----------~~~~~~~~~~iP~~k 214 (507) ++. .|+.+ +..+..++-.. ..+..++.....-. ..........+..-. T Consensus 172 --~~~-----v~dd~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vP 244 (511) T protein:vir:78 172 --TFI-----IYDNTVERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTNRTNGLKLTPRENSFESHSFERMP 244 (511) T ss_pred --eEE-----EEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccCcCcccc Confidence 111 11111 11111111000 00011111110000 000011111111112 Q ss_pred EEEEeeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHH Q lcl|NC_017974. 215 VMLMNTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAA 294 (507) Q Consensus 215 ~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~ 294 (507) ++.|+ .++.|.|.+..+-...=--+..+..++..++.+..|+++++-- . ..+.++.......-.-. T Consensus 245 vv~~~-----n~~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~---~----~~~~~~~~~~~~~~~~~-- 310 (511) T protein:vir:78 245 ITEFS-----NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGN---L----NLDPVEVRKQKEANVLF-- 310 (511) T ss_pred eEEec-----CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecC---c----cCCchhhccccccccee-- Confidence 33333 3567888888764433333456777888889999998887520 0 01111110000000000 Q ss_pred HhhcCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccch--hhhhHHHHHHHH Q lcl|NC_017974. 295 NAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKG--GSYNLSESKQTI 372 (507) Q Consensus 295 n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~--GS~Al~~vh~~v 372 (507) ......+.+.+.. .....+++++..+- ....+..+++++.+.|...--...++.+..++ +.-|+. ....- T Consensus 311 ----~~~~~~~~~~~~~-~~~~~~~~~l~~~~--~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~-~~~~~ 382 (511) T protein:vir:78 311 ----LEPTVYVDAEGRE-TEGSVDGGYIYKQY--DVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMK-YKLFG 382 (511) T ss_pred ----ccccceecccccc-CCCCcceeEEeecC--CHHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHH-HHHHH Confidence 0000000011100 00111233333322 12335778899988888765555554432211 111211 11111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHh---CCC-cCCcc-ccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHH Q lcl|NC_017974. 373 HTQFVQRVNEIILEALNENLLPQLLAL---NDM-RLPET-EMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINR 447 (507) Q Consensus 373 ~~~~~~aD~~~i~~~ln~~li~~l~~l---N~~-~~~~~-~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~ 447 (507) ....+..-.+.+...|.+ +++.++.+ +.. ..+.. .-.++.|....+.|..+.++++.+|+ |+ ++. +. T Consensus 383 l~~ka~~~~~~f~~~l~~-~~~li~~~~~~~~~~~~~~~~~~i~~~f~~~~p~n~~e~~d~~~kl~--G~-iS~----et 454 (511) T protein:vir:78 383 LEQRTKTKEGLFTKGLRR-RAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSG--GK-ISQ----TT 454 (511) T ss_pred HHHHHHHHHHHHHHHHHH-HHHHHHHHHHhcCCCccccccccceEEeCCCCCcCHHHHHHHHHHHh--cc-CCh----HH Confidence 222233333455555643 44554443 211 11111 12478899999999999999999985 65 342 34 Q ss_pred HHHHcCC-CCCCCc-ccccchhhhcCCCcCCCCCCcccccCcccccccccCCCCchhhhccC Q lcl|NC_017974. 448 VLEVLGI-DEKIEE-DISQEELMKLLGEDTSRAGDGMTKGSSGNGTGKISSTRDNSAANLDN 507 (507) Q Consensus 448 i~e~~Gi-p~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~a~~~~~ 507 (507) +.+.++. +.+..+ +...++............+. ...... + ...++.+.+-+. T Consensus 455 ~l~~l~~v~d~~~El~ri~~E~~~~~~~~~~~~~~--~~~~~~-~-----~~~~~~~~~~~~ 508 (511) T protein:vir:78 455 LMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYK--DPRDIN-D-----DEQDDDTKDTVD 508 (511) T ss_pred HHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhcccc--CCCCCC-C-----CCCCCCccCccc Confidence 5555544 221111 00011111100000000000 000000 0 000111111111 No 219 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=69.45 E-value=0.22 Score=24.16 Aligned_cols=425 Identities=8% Similarity=-0.025 Sum_probs=149.1 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTFV 80 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~av 80 (507) +++-+.+. .++..... +.-.+.+.- +....... .+..+..| . -.....-++.....-+ T Consensus 49 i~~~~~~~----~~r~~~l~----~Yy~g~~~i--l~~~~~~~-~~~~~~~k---i--------~~n~~k~Iv~~~~~yl 106 (511) T protein:vir:96 49 IEHHMDYQ----RPRLKVLS----DYYEGKTKN--LVELTRRK-EEYMADNR---V--------AHDYASYISDFINGYF 106 (511) T ss_pred HHHHHHhh----hHHHHHHH----HHhhccCcc--ccccCccc-ccccCcce---e--------ecchHHHHHHHHhhhh Confidence 11111000 00000000 000000000 00000000 00000000 0 0122222223333333 Q ss_pred hhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeeeeccccC Q lcl|NC_017974. 81 EKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAFRP 159 (507) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~r~ 159 (507) .+-+..|... ++++.+++.++++.. .+..+..++. ++.-||.+ ++++|... +|.+.++-+.|+. T Consensus 107 ~g~p~~~~~~------d~~~~~~l~~~~~~n---~~~~~~~~~~~~~~~~G~a-~~~vy~d~-----dg~~~i~~~~p~~ 171 (511) T protein:vir:96 107 LGNPIQYQDD------DKDVLEAIEAFNDLN---DVESHNRSLGLDLSIYGKA-YELMIRNQ-----DDETRLYKSDAMS 171 (511) T ss_pred cccCceeecC------chHHHHHHHHHHhhc---ChhHHHHHHHHHHHhcCee-EEEEEeCC-----CCceEEEEEcccc Confidence 3443334321 234556677776542 3555555544 57779974 56888532 4666665544331 Q ss_pred chhcccccceeecCC--Ccccccccccc------------cccccccccccccc-----------ccccCCCcccccccc Q lcl|NC_017974. 160 QASLSRTNPIVYNED--GSAIIGIKQSL------------SAFQNYTASEIGVG-----------GVSTRMSDVIIPINR 214 (507) Q Consensus 160 ~~~i~~~~~~~~~~d--~~~l~~~~q~~------------~~~~~~~~~~~~~~-----------~~~~~~~~~~iP~~k 214 (507) ++. .|+.+ +..+..++-.. ..+..++.....-. ..........+..-. T Consensus 172 --~~~-----v~dd~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vP 244 (511) T protein:vir:96 172 --TFI-----IYDNTVERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTNRTNGLKLTPRENSFESHSFERMP 244 (511) T ss_pred --eEE-----EEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccCcCcccc Confidence 111 11111 11111111000 00011111110000 000011111111112 Q ss_pred EEEEeeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHH Q lcl|NC_017974. 215 VMLMNTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAA 294 (507) Q Consensus 215 ~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~ 294 (507) ++.|+ .++.|.|.+..+-...=--+..+..++..++.+..|+++++-- . ..+.++.......-.-. T Consensus 245 vv~~~-----n~~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~---~----~~~~~~~~~~~~~~~~~-- 310 (511) T protein:vir:96 245 ITEFS-----NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGN---L----NLDPVEVRKQKEANVLF-- 310 (511) T ss_pred eEEec-----CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecC---c----cCCchhhccccccccee-- Confidence 33333 3567888888764433333456777888889999998887520 0 01111110000000000 Q ss_pred HhhcCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccch--hhhhHHHHHHHH Q lcl|NC_017974. 295 NAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKG--GSYNLSESKQTI 372 (507) Q Consensus 295 n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~--GS~Al~~vh~~v 372 (507) ......+.+.+.. .....+++++..+- ....+..+++++.+.|...--...++.+..++ +.-|+. ....- T Consensus 311 ----~~~~~~~~~~~~~-~~~~~~~~~l~~~~--~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~-~~~~~ 382 (511) T protein:vir:96 311 ----LEPTVYVDAEGRE-TEGSVDGGYIYKQY--DVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMK-YKLFG 382 (511) T ss_pred ----ccccceecccccc-CCCCcceeEEeecC--CHHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHH-HHHHH Confidence 0000000011100 00111233333322 12335778899988888765555554432211 111211 11111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHh---CCC-cCCcc-ccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHH Q lcl|NC_017974. 373 HTQFVQRVNEIILEALNENLLPQLLAL---NDM-RLPET-EMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINR 447 (507) Q Consensus 373 ~~~~~~aD~~~i~~~ln~~li~~l~~l---N~~-~~~~~-~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~ 447 (507) ....+..-.+.+...|.+ +++.++.+ +.. ..+.. .-.++.|....+.|..+.++++.+|+ |+ ++. +. T Consensus 383 l~~ka~~~~~~f~~~l~~-~~~li~~~~~~~~~~~~~~~~~~i~~~f~~~~p~n~~e~~d~~~kl~--G~-iS~----et 454 (511) T protein:vir:96 383 LEQRTKTKEGLFTKGLRR-RAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSG--GK-ISQ----TT 454 (511) T ss_pred HHHHHHHHHHHHHHHHHH-HHHHHHHHHHhcCCCccccccccceEEeCCCCCcCHHHHHHHHHHHh--cc-CCh----HH Confidence 222233333455555643 44554443 211 11111 12478899999999999999999985 65 342 34 Q ss_pred HHHHcCC-CCCCCc-ccccchhhhcCCCcCCCCCCcccccCcccccccccCCCCchhhhccC Q lcl|NC_017974. 448 VLEVLGI-DEKIEE-DISQEELMKLLGEDTSRAGDGMTKGSSGNGTGKISSTRDNSAANLDN 507 (507) Q Consensus 448 i~e~~Gi-p~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~a~~~~~ 507 (507) +.+.++. +.+..+ +...++............+. ...... + ...++.+.+-+. T Consensus 455 ~l~~l~~v~d~~~El~ri~~E~~~~~~~~~~~~~~--~~~~~~-~-----~~~~~~~~~~~~ 508 (511) T protein:vir:96 455 LMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYK--DPRDIN-D-----DEQDDDTKDTVD 508 (511) T ss_pred HHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhcccc--CCCCCC-C-----CCCCCCccCccc Confidence 5555544 221111 00011111100000000000 000000 0 000111111111 No 220 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=68.18 E-value=0.24 Score=23.97 Aligned_cols=436 Identities=9% Similarity=-0.031 Sum_probs=149.9 Q ss_pred CccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHH---HHh---------------hChHHHHHHH Q lcl|NC_017974. 13 VAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVE---SMK---------------QDATVATGLD 74 (507) Q Consensus 13 ~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~---~m~---------------~D~~v~s~l~ 74 (507) +.+......... ..-..........-+. .++.+ |..++.+.|+ ++. ..+...-++. T Consensus 1 ~~~~~~~~~~~~---~~~~~~~~~~~i~~~~-~~~~~--r~~~~~~yy~g~~~i~~~~~~~~~~~~~~ki~~n~~~~iv~ 74 (489) T protein:vir:99 1 MLQEDFEAIDYE---SKLWIDQLKNYISRFK-AEQLE--RLKELKRYYLGDNNIKYRPAKTDKYAADNRIASDFAKYITV 74 (489) T ss_pred CCccceeeeCCC---CCCCHHHHHHHHHHHH-HHHHH--HHHHHHHHhcccCccccccccccccCCcceeecchHHHHHH Confidence 222211111000 0000000001111111 00111 1111222221 000 0122222233 Q ss_pred HHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHH-HHHHHhhcceeeeeeeeccCccccCcceeee Q lcl|NC_017974. 75 MLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARD-AATFNEYGLSVVEKVYTQVTVGEYIGKYKVK 153 (507) Q Consensus 75 ~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~-~l~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~ 153 (507) +...-+.+-+..+. ..++.+.+++++++... .|...... ..++.-||.+.. ++|-.. ....++++.+. T Consensus 75 ~~~~~l~g~~~~~~------~~d~~~~~~l~~~~~~n---~~~~~~~~~~~~~~~~G~~~~-~v~~~~-~~d~~~~~~i~ 143 (489) T protein:vir:99 75 FEQGYMLGVPVEYK------NENKDLQAAIDLMSVRN---NEDYHNVKIKTDLSIYGRAYE-LLTVEK-IDDKKTEVKLY 143 (489) T ss_pred HHhhhhccCCceee------cCChhHHHHHHHHHhhc---ChhHHHHHHHHHHhhCCeEEE-EEeecc-CcCCCcceEEE Confidence 22233323222222 22345566777776542 35544333 456778997754 555321 11234566665 Q ss_pred eccccCchhcccccceeecCCCc-cccccccc---------ccccccccccccccccc---------ccCCCcccccccc Q lcl|NC_017974. 154 NLAFRPQASLSRTNPIVYNEDGS-AIIGIKQS---------LSAFQNYTASEIGVGGV---------STRMSDVIIPINR 214 (507) Q Consensus 154 ~l~~r~~~~i~~~~~~~~~~d~~-~l~~~~q~---------~~~~~~~~~~~~~~~~~---------~~~~~~~~iP~~k 214 (507) -+.|+.- + +++.+.... .+..++.. ......++......... ........+..-. T Consensus 144 ~~~p~~~--~----~v~dd~~~~~~~~~i~~~~~~~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~g~vP 217 (489) T protein:vir:99 144 QLPAEQT--F----VIYDDTYQRNSLMAVHFYDIDYGSGKRKQIIKAYTSDTIYTYEDYNLETKGMRLKDYEGHFFKGVP 217 (489) T ss_pred EEcccce--E----EEEcCCCCCceEEEEEEEEEecCCCceEEEEEEEeCCcEEEEEecCCCcccceecccccccCCcee Confidence 5543321 1 011000000 11000000 00011111111100000 0000011111112 Q ss_pred EEEEeeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHH Q lcl|NC_017974. 215 VMLMNTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAA 294 (507) Q Consensus 215 ~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~ 294 (507) ++.|+ +++.|.|.+..+....=-=+..+..++..++-+..++++++- .........+.......-..... T Consensus 218 vv~~~-----n~~~~~s~~~~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~g-----~~~~~~~~~~~~~~~~~~~~~~~ 287 (489) T protein:vir:99 218 VNEYA-----NNEERTGAYESVLDNIDAYDLSQSELANFQQDSVNALLVIAG-----NAYTGADENDYLDDGRLNPNGRL 287 (489) T ss_pred EEEee-----cCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhhcc-----CCcccccchhhhhhccccccccc Confidence 34343 345677888765432222244557777777777777766541 10111111111110000000000 Q ss_pred Hh--hcCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHHHH-- Q lcl|NC_017974. 295 NA--HSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSESKQ-- 370 (507) Q Consensus 295 n~--~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~-- 370 (507) .+ .....-.+.++.+......+.+++++..... ...+...++++.+.|.+.--+..++.+. .++.+.|..-. T Consensus 288 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~--~~~~~~~~~~l~~~i~~~s~~p~~~~~~--~~~n~Sg~Al~~~ 363 (489) T protein:vir:99 288 AISIGFKKAQVLILDDNPNPNGVKPQAYFLKKEYD--TAGSEAYKNRLVADILRFTFTPDTQDMK--FSGVQSGESMKYK 363 (489) T ss_pred ccccccccceeeeeccccCccccccceeeeeecCC--hHHHHHHHHHHHHHHHHHhCCccccccc--ccccchHHHHHHH Confidence 00 0000011222222222222223333333221 2235667888888888765444444322 12222222211 Q ss_pred -HHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC-CcCC---cc--ccceEEecCcCchhHHHHHHHHHHHHhcCCccccHH Q lcl|NC_017974. 371 -TIHTQFVQRVNEIILEALNENLLPQLLALND-MRLP---ET--EMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPK 443 (507) Q Consensus 371 -~v~~~~~~aD~~~i~~~ln~~li~~l~~lN~-~~~~---~~--~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~ 443 (507) .-....+..-.+.+...|. ++++-++.+-. .... .. .-..+.|...-+.|..+.++++.+|+ |+ ++. T Consensus 364 ~~~l~~k~~~k~~~~~~~l~-~~~~li~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~~~~~~kl~--gi-is~-- 437 (489) T protein:vir:99 364 LMASDNYREKQERLFKKGLM-RRLRLAANIWAIKGNEATTYSLVNDTSIVFTPNLPQNDNEIVTAAQNLY--GI-VSD-- 437 (489) T ss_pred HHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhhcCCccccccccccceEEeCCCCCcCHHHHHHHHHHHh--cc-CCH-- Confidence 1112222333344555553 45555554321 1111 11 12467888889999999999999985 65 342 Q ss_pred HHHHHHHHc-CCCCCCCcccc---cchhhhcCCCcCCCCCCcccccCcccccccccCCC Q lcl|NC_017974. 444 VINRVLEVL-GIDEKIEEDIS---QEELMKLLGEDTSRAGDGMTKGSSGNGTGKISSTR 498 (507) Q Consensus 444 ~~~~i~e~~-Gip~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 498 (507) +.+.+.+ ++......++. ..+........+...... ..+ ...+.+..| T Consensus 438 --et~~~~l~~v~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~----~~~-~~~~~~~~p 489 (489) T protein:vir:99 438 --QTIFEILNTVTGVDAEAELKRLKEEADKKQSLPEPRLVGD----ASG-QEEPTAEKP 489 (489) T ss_pred --HHHHHhcCCCCchhHHHHHHHHHHHHHHHhccccccccCC----CCC-CcCCCCCCC Confidence 3344443 44322111111 111111111111111100 011 111111112 No 221 >protein:vir:106999 Length: 564 # NCBI annotation: portal vertex protein gp20 # Family: family:all:1036 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195138;genbank:gi:58532915;interpro:IPR010823;uniprot:Q5GQN4;genbank:GeneID:3260496 Probab=67.76 E-value=0.25 Score=23.91 Aligned_cols=460 Identities=13% Similarity=0.069 Sum_probs=164.5 Q ss_pred Cc--------ccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchh-ccchhHHHHHHHhhChHHHH Q lcl|NC_017974. 1 MS--------RKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEI-RWPNLIGTVESMKQDATVAT 71 (507) Q Consensus 1 ~~--------~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~l-r~~~~~~~y~~m~~D~~v~s 71 (507) |+ +++ .+++..+..|........ ...|..|. +....-+-.. -..++++.|++|..++.|-+ T Consensus 1 m~~lfgf~i~~~~-~~~~~S~vpp~~~~~~~~----i~~g~~g~-----~v~~~g~~~~~n~~eLI~~YR~ma~~pEVd~ 70 (564) T protein:vir:10 1 MSQLFGFLINEKE-GQKGQSPVPPNDEASVST----VAGGYFGT-----YVDTSGGQNSRNEYELIRRYRDMSLHPEVDS 70 (564) T ss_pred Ccchhcceeeeec-cCCCCCcccCCcCCChhh----hhccccce-----eeecccccchhhHHHHHHHHHHHhhccchhh Confidence 43 222 222222222222111100 01111111 1111111111 23579999999999999999 Q ss_pred HHHHHHHHHhhCCc---eeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHHHHHhhcceee-----------eee Q lcl|NC_017974. 72 GLDMLYTFVEKAFK---DFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAATFNEYGLSVV-----------EKV 137 (507) Q Consensus 72 ~l~~Rk~av~~~~~---~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l~a~~~Gfs~~-----------Ei~ 137 (507) +++....-+....- .+.+.-...+-++.+.+-|.+- |.. |..+|+.--+||..+ .++ T Consensus 71 Av~eIVneaIv~d~~~~pV~vdL~~~~~s~siK~kI~eE--------F~~-Il~ll~F~~~~~e~fR~WYVDgRi~fHki 141 (564) T protein:vir:10 71 AIDEIVNEFVVNDGDDKPVEVDLQNLEIGSGVKKKIRDE--------FNR-ILRMMNFNVNAHEIIRNWYVDGRSHYHKV 141 (564) T ss_pred HHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHH--------HHH-HHHHhccchhhhHHHhhhhhcceEEEEEE Confidence 99988775421110 0111111222233322222221 222 223444444444433 333 Q ss_pred eeccCccccCcceeeeeccccCchhcccccceeecCC--Cccccccccc------cccccccccc--------ccccccc Q lcl|NC_017974. 138 YTQVTVGEYIGKYKVKNLAFRPQASLSRTNPIVYNED--GSAIIGIKQS------LSAFQNYTAS--------EIGVGGV 201 (507) Q Consensus 138 w~~~~~~~~~g~~~~~~l~~r~~~~i~~~~~~~~~~d--~~~l~~~~q~------~~~~~~~~~~--------~~~~~~~ 201 (507) -... +..-.+.+|....|.++.+++-.....| +..+...... ...++.+.+. ..+. .. T Consensus 142 id~~-----~pk~GI~eLr~lDPr~i~~vr~i~~~~~~~~~~v~k~~~~~~~y~~~~Eyy~Ynp~~~~g~~~~~~~~-~~ 215 (564) T protein:vir:10 142 IDLD-----NPKKGILELRYIDSLKIRKVRQKLKDVDPNRKEIEKGTALQYDYGDFIEYYIYNPKGFAGNIPMVTGS-MD 215 (564) T ss_pred eeCC-----ChhhhhhhhhhhcccceeeeeeeccccccccceeeeeeeeeccccccccceeeccccccCcccccccc-cc Confidence 3211 1111244455555556655553333222 2222111110 0011111110 0000 00 Q ss_pred ccCCCccccccccEEEEeeCCc--cCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCch Q lcl|NC_017974. 202 STRMSDVIIPINRVMLMNTGGS--SSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQS 279 (507) Q Consensus 202 ~~~~~~~~iP~~k~i~~~~~~~--~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~ 279 (507) .....++.||.+ .|+|+|..- .++..=.|.|+++..++==-+.....-+.+-=--.-.=.++++. +|..+ . T Consensus 216 ~~~~~~ikI~~d-aI~y~hSGL~d~~~~~i~gyLhkAIKp~NQLkmlEDAlVIYRitRAPeRRvFYID--VGnLP----k 288 (564) T protein:vir:10 216 WSNQEGIKIASD-AIAQSTSGLMDLNKKMTLSFLHKAIKSLNQLRMIEDSLVIYRLSRAPERRIFYID--VGNLP----K 288 (564) T ss_pred cccccceeechh-hcceecccceeCCCCceeccchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEe--cCCCC----c Confidence 112346677654 577777643 23333467888888776444433333333210001111244442 22222 2 Q ss_pred HHHHHHHHHHHHHHHHhh-cCCceeE--------------eecC-----cccccccccceeeeeccCCCcchhHHHHHHH Q lcl|NC_017974. 280 SPEAEMVRGLMSDAANAH-SGEQSFF--------------MLPS-----DTKDNAPQYSMTLKGIDGMGKQYSTAQLISD 339 (507) Q Consensus 280 ~~~~~~l~a~~~~~~n~~-~G~~a~~--------------iiP~-----g~~i~~~~~~~el~~~~g~g~~~~~~~li~~ 339 (507) ....+.|+.++.-..|-. -.+..|- -+|. |+||.+. .|+.+... ..=|+| T Consensus 289 ~KAeqYlr~iM~k~KNklVYDa~TGevrddrk~msMlEDyWLPRReGgrgTEItTL---------pGgqnLge-m~DV~Y 358 (564) T protein:vir:10 289 VKAEQYLRDVMSRYRNKLVYDGQTGEIRDDKKHMSMLEDFWLPRREGGRGTEITTL---------PGGQNLGE-LKDVEY 358 (564) T ss_pred hhHHHHHHHHHHhcCceEEEeccCceecccchhhhhHhhhcccccCCCcccceeec---------cccCCcch-HHHHHH Confidence 233345555444332200 0111111 1222 5655442 23333332 345899 Q ss_pred HHHHHHHHHhcccccccccchhhhhH---HHHH-HHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCccc----cc Q lcl|NC_017974. 340 RKKSILDRLGAGFINVGNDKGGSYNL---SESK-QTI-HTQFVQRVNEIILEALNENLLPQLLALNDMRLPETE----MP 410 (507) Q Consensus 340 ~d~~Isk~iLGqtLts~~~~~GS~Al---~~vh-~~v-~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~----~P 410 (507) ..+.+-+++--..--.+.. +++..+ +++. .|+ |...+......+...|..-|-..|+-=|.. ..+. .+ T Consensus 359 F~kKLY~aLnVP~SRl~~e-~~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLiLKgii--t~eeW~~i~~ 435 (564) T protein:vir:10 359 FKKKLYNSLNLPPSRLTDD-NKAFNLGKSTEILRDELKFTKFIGRLRKRFAQLFHDILKTQLILKGII--TPEDWDDMEE 435 (564) T ss_pred HHHHHHHHhCCCcccccCC-CceeecccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCC--CHHHHHHHhh Confidence 9999999887765333322 211211 2222 222 233344444444444443333333321211 1111 12 Q ss_pred eEEecCc------CchhHHHHHHHHHHHHhcCCccccHHHHHHHHHH-cCCCC---------------------CCCccc Q lcl|NC_017974. 411 YVKAGEI------VDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEV-LGIDE---------------------KIEEDI 462 (507) Q Consensus 411 ~~~~~~~------~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~-~Gip~---------------------~~~~~~ 462 (507) .+.|+.. |-.+.+.+.+++..|..+--.+-.-.+.+||++. |.+.+ |.+.++ T Consensus 436 ~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~ILr~tDeei~~~~kqI~~E~k~~~~~~P~e~~~ 515 (564) T protein:vir:10 436 HIQYDFLFDNHFNELKEQEMQLQRVNLATQMDPFVGKYFSTEYIRRKILMQTENEFKEIDKQMKSDIESGLAIDPIQVNM 515 (564) T ss_pred cceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCchhhhc Confidence 2333322 3345455556666665541111111124555433 34421 111110 Q ss_pred ccchhhhcCCCcCCCCCCcccccCc---ccccccccCCCCchhhhccC Q lcl|NC_017974. 463 SQEELMKLLGEDTSRAGDGMTKGSS---GNGTGKISSTRDNSAANLDN 507 (507) Q Consensus 463 ~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~d~~a~~~~~ 507 (507) .. .++..+.+-.|...++....+ ......++..++++-.+... T Consensus 516 ~~--~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~ 561 (564) T protein:vir:10 516 LD--DMEKQNQAFAPELQAAQDDLAAEREIKKLNSAPKPPPSQQSKSQ 561 (564) T ss_pred CC--CccCCCCcCCcchhhhccccccccChhhhccCCCCCCCCCCcCc Confidence 00 111111111111111111000 00000000111222222211 No 222 >protein:vir:8883 Length: 543 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813772;genbank:gi:29366727;genbank:GeneID:1258836 Probab=67.33 E-value=0.25 Score=23.85 Aligned_cols=452 Identities=12% Similarity=0.080 Sum_probs=172.0 Q ss_pred CcccccccccccCcccccccccccccc----ccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMR----MSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDML 76 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~----~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~R 76 (507) |+.-++.+-+.+..+..-.....-|.. +.++... +..+++. .+.+ ..+.+.-.+ -|++-.-++++- T Consensus 1 ~~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y--~lP~~~~--~~~~-~~~~~~~~~-----~dst~~~a~~~L 70 (543) T protein:vir:88 1 MAETKREGLAEEGAKAVYERLKNDRVPYETRAENCAKV--TIPSLFP--KDSD-NSSTDYTTP-----WQAVGARGLNNL 70 (543) T ss_pred CcccccCcchHHHHHHHHHHHHHHHhHHHHHHHHHHHH--hccccCC--CCCC-ccccccccc-----ccchHHHHHHHH Confidence 666554433333322111111100100 0000000 0000000 0000 011111111 244444445544 Q ss_pred HHHHhhC-----CceeeeCCCC---------ChhHHHHHHHHHH----HHHhccCCCHHHHHHHHHH-HHhhccee---- Q lcl|NC_017974. 77 YTFVEKA-----FKDFKVIPGE---------SEESKKAAKFVEY----CLKNMEGQTLRQFARDAAT-FNEYGLSV---- 133 (507) Q Consensus 77 k~av~~~-----~~~~~~~~~~---------~~~~~~~ae~v~~----~l~~~~~~~~~~~l~~~l~-a~~~Gfs~---- 133 (507) -..+.+. +| |+...++ ..+..++.+++++ +...+....|+..+.+++. -+.+|-++ T Consensus 71 aa~l~~~ltP~~~W-F~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~ly~~ 149 (543) T protein:vir:88 71 SAKVMLALFPLQSW-MKLKVSEWQAKQLVSDPSQLAVVEQGLGMVERILMSYMEANSYRVTLFELIRQLALAGTALIYLP 149 (543) T ss_pred HHHHHHhhcCCCcc-cccccChHHHhcccCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeeec Confidence 4444432 22 2222211 1122344444432 2233444567766655543 33455544 Q ss_pred -----------------eeeeeeccCccccCcceee-----eeccccCchhcccccceeecCCCccccccccccc--c-- Q lcl|NC_017974. 134 -----------------VEKVYTQVTVGEYIGKYKV-----KNLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLS--A-- 187 (507) Q Consensus 134 -----------------~Ei~w~~~~~~~~~g~~~~-----~~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~--~-- 187 (507) .+.++..+..|.-+-.+.- +.|...-+.++.. ....+++...-+.+.-.|+ . T Consensus 150 ~~~~~~~~~~~~~~~pl~~y~v~~d~~G~v~~i~r~~~~~~~~l~~~~~~~v~~--~~~~~p~~~~~v~~~V~pr~~~~~ 227 (543) T protein:vir:88 150 PPDASSNSYNPMKLYTLHNHVVQRDAFGNVLQIVTLDKVAYAALPEDVRNSLSG--GQEYKPEQELEVYTHIYIDDESGD 227 (543) T ss_pred cCccccceecceEEeEcceEEEeeCCCCCeeeeeeeeeccHHHHhHHhhHHHHH--HhhcCCccceEEEEEEEeecCCCc Confidence 3334443333222211111 1111111111100 0011111111111100011 0 Q ss_pred ccccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecc Q lcl|NC_017974. 188 FQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIP 267 (507) Q Consensus 188 ~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P 267 (507) +..+...+ + .......+....-..-|++.|+...+|+.||.|....++.-..--+...+.-+..+++---|+.. +| T Consensus 228 ~~~~~~~~-~-~~v~~~~~~~~~~e~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~~--v~ 303 (543) T protein:vir:88 228 FLSYQEIE-G-VEVDGSDGQYPQDALPWIAVRWTKRDGEHYGRSHVEEYLGDLNSLESLNEAMIKFAMISSKVVGL--VN 303 (543) T ss_pred cccccccc-C-eeeecCCCccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCcee--ec Confidence 00010000 0 01111111221223468999999999999999999999998877777778888888876555422 22 Q ss_pred cc-cCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHH Q lcl|NC_017974. 268 SQ-ILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILD 346 (507) Q Consensus 268 ~~-~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk 346 (507) +. .++ ..+ +..|. -+.++|...+ .+..+....++....-...|+.+...|.. T Consensus 304 ~~g~~~----------------~~~----~~~~~-~g~~v~g~~~------~v~~~~~~~~~~~~~~~~~i~~~~~rI~~ 356 (543) T protein:vir:88 304 PNGITQ----------------VRR----LVKAQ-TGDFVAGRKA------DIEFLQLEKTADFTVAKSVADAIEARLSY 356 (543) T ss_pred cccccc----------------hhh----cccCC-CceeecCCCC------cceeeecccccchhHHHHHHHHHHHHHHH Confidence 21 111 011 11111 2334443221 12233333334444567799999999999 Q ss_pred HHhcccccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh--CC----CcC---Ccccc-ceEEec- Q lcl|NC_017974. 347 RLGAGFINVGNDKGGSYNLSESKQTIHTQFVQRVNEIILEALNENLLPQLLAL--ND----MRL---PETEM-PYVKAG- 415 (507) Q Consensus 347 ~iLGqtLts~~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~l--N~----~~~---~~~~~-P~~~~~- 415 (507) +++...++..+ ++.--..||+....+.. . -.-=+-..|+..++-||+.. +. +-+ |++.+ |.++.. T Consensus 357 af~~~~~~~~~--~~r~TAtEV~~r~~E~~-~-~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~~v~~~~vs~l 432 (543) T protein:vir:88 357 VFMLNSAVQRS--GERVTAEEIRYVASELE-D-TLGGVYSILSQELQLPIVRVLLNQLQATQQIPNLPQEAVEPTVTTGA 432 (543) T ss_pred HHhhhhhccCC--CCcccHHHHHHHHHHHH-H-HHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhceeeeEEecH Confidence 99887654322 22222345444322211 1 11122233444455555431 10 011 12111 222211 Q ss_pred --CcCchhHHHHHHHHHHHHhcC---Cc--cccHHHHHHHHHHcCCCCC----CCcccccch---------------hhh Q lcl|NC_017974. 416 --EIVDVDMEGFSKAIQRIGAVG---YL--PKTPKVINRVLEVLGIDEK----IEEDISQEE---------------LMK 469 (507) Q Consensus 416 --~~~~~dl~~~a~~~~~L~~~G---~~--~~~~~~~~~i~e~~Gip~~----~~~~~~~~~---------------~~~ 469 (507) .....++..+...++.+..++ +. +..+...+++.+.+|+|.. .+++..+.. ... T Consensus 433 ~~l~r~~~~~~l~~~~~~v~~~~~p~vld~id~d~~~~~~a~~~Gv~~~~i~r~~~e~~~~~~q~~~q~~~~~~~~~~~~ 512 (543) T protein:vir:88 433 EALGRGQDLDKLTQFLNAVATVSQLNGDPDLNVNNIKLRLANAIGIDTAGLLLTEAEKAQAQSQEMLKQGGLNAAAGIGS 512 (543) T ss_pred HHHHHHHHHHHHHHHHHHHHhccchhhhccCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 011234444444444443332 11 2223455778889999532 222211100 000 Q ss_pred cCCCcCCCCCCcccccCcccccccccCCCCc Q lcl|NC_017974. 470 LLGEDTSRAGDGMTKGSSGNGTGKISSTRDN 500 (507) Q Consensus 470 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 500 (507) ..+.+.+..++.+..+.+..+.+..|+++.- T Consensus 513 ~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~ 543 (543) T protein:vir:88 513 GVAAQATASPEAMESAMDTAGVQPGPIATQV 543 (543) T ss_pred chhhhhccChHHHHHHhhhcCCCCCCCCCCC Confidence 0111122222222222234455555666555 No 223 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=67.22 E-value=0.26 Score=23.83 Aligned_cols=410 Identities=10% Similarity=0.022 Sum_probs=151.5 Q ss_pred ccCccccccccccccccccccCchhhhHHHHhhhhccc-chhccchhHHHH-----HHH-----------hhChHHHHHH Q lcl|NC_017974. 11 VKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKA-EEIRWPNLIGTV-----ESM-----------KQDATVATGL 73 (507) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~-~~lr~~~~~~~y-----~~m-----------~~D~~v~s~l 73 (507) ..+...... +.. -...+-........+..+.+.. +.+..+...... ... ...+-..-.+ T Consensus 1 ~~~e~~~~~---i~~-~~~~~~~~~~~~~~~~~Yy~g~hdi~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv 76 (471) T protein:vir:10 1 MEIEVIKKI---ISS-QMVKHGKFVSQAAEAEKYYRNENDIKRKRKPADKKGAENEAKAEDNAFRNADNRISHNWHQLLL 76 (471) T ss_pred CCHHHHHHH---HHH-HHHHHHHHHHHHHHHHHHhccccccccccchhhhhcccccccccccccccccceeccchhHHHH Confidence 000000000 000 0000000000000001111000 111110000000 000 0011122222 Q ss_pred HHHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceee Q lcl|NC_017974. 74 DMLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKV 152 (507) Q Consensus 74 ~~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~ 152 (507) .+...-+.+-+..+. .++.++.++++.+++ .+|.....++. ++.-||.+. +.+|... .+|++.+ T Consensus 77 d~~~~yl~G~p~~~~------~~~~~~~~~l~~~~~----n~~~~~~~~~~~~~~~~G~~~-~~v~~d~----~~g~~~~ 141 (471) T protein:vir:10 77 DQKKAYALTYPPTFD------VDDKKVNDMIVDVLG----DDYERISKQLCVNAGNAGIAW-LHVWKDA----SDNSFRY 141 (471) T ss_pred HhhhhhhcccCceec------cCChHHHHHHHHHHh----cCHHHHHHHHHHHHhhCCeEE-EEEEeeC----CCCeeEE Confidence 222333323222222 234456666776654 34777666654 577799665 5676532 2466666 Q ss_pred eeccccCchhcccccceeecC-CCcccccccccc------------------cccccccccccccccc------------ Q lcl|NC_017974. 153 KNLAFRPQASLSRTNPIVYNE-DGSAIIGIKQSL------------------SAFQNYTASEIGVGGV------------ 201 (507) Q Consensus 153 ~~l~~r~~~~i~~~~~~~~~~-d~~~l~~~~q~~------------------~~~~~~~~~~~~~~~~------------ 201 (507) .-+.|+- .+ +++.+. ++..+..++... .....+.......... T Consensus 142 ~~~~p~~--~~----~i~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~vy~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~ 215 (471) T protein:vir:10 142 ACVDSKE--VI----PIYSKSLDKKSIGVLRVYSSIDETDGKNYTVYEYWNDKECSFYRHEKEKPLEELETFQAISLIDT 215 (471) T ss_pred EEEcccc--eE----EEEcCCCCCceEEEEEEEEeeccCCCceeEEEEEEeCCcEEEEEecCCccccccccccccccccc Confidence 5554432 11 111111 011111111000 0000000000000000 Q ss_pred ------ccCCCccccccccEEEEeeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCC Q lcl|NC_017974. 202 ------STRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAA 275 (507) Q Consensus 202 ------~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~ 275 (507) ........+..--++.|+ +|..|.|.+..+-...=-=+..+..++..++.+..|+++++- . . T Consensus 216 ~~~~~~~~~~~~~~~g~iPvv~~~-----n~~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g---~----~ 283 (471) T protein:vir:10 216 MNGDRSSDNSFKHDFGLVPFIPFK-----NNEIETNDLKPIKDLVDVYDKVFSGFVNDTDDVQEVIFVLTN---Y----G 283 (471) T ss_pred ccccccccccccCCCCceeEEEec-----cCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhCceeeeec---C----C Confidence 000000011111123332 245577888765443333355678888889999999888762 1 1 Q ss_pred CCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccc Q lcl|NC_017974. 276 MDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINV 355 (507) Q Consensus 276 ~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts 355 (507) .+...+ .+.. +.. ...+.++....-. ...++++..+.. ...+...++++.+.|...--+..++. T Consensus 284 ~~~~~~---~~~~-------~~~--~~~i~~~~~~~~~--~~~~~~l~~~~~--~~~~~~~~~~l~~~I~~~s~tp~~~~ 347 (471) T protein:vir:10 284 GQDKQE---FLED-------LKR--YKMIKMDNDGMGD--QSGVTTIAIDIP--TEARNLILERTKKQIFISGQGVNPET 347 (471) T ss_pred ccccch---hHHH-------hhc--CCeEEecCCCCcc--CccceEEeecCC--hHHHHHHHHHHHHHHHHHhCCcCCCc Confidence 111111 1111 111 1223333221100 113444444332 23467788999999877654443333 Q ss_pred cccchhhhhHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCccccceEEecCcCchhHHHHHHHHHH Q lcl|NC_017974. 356 GNDKGGSYNLSESKQTIH----TQFVQRVNEIILEALNENLLPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFSKAIQR 431 (507) Q Consensus 356 ~~~~~GS~Al~~vh~~v~----~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~ 431 (507) + +.| .+.| +.-+.+ ...+..-.+.+...| +++++.++.+.+.. +..-..+.|....+.|..+.++.+.+ T Consensus 348 ~--~~g-n~Sg-~Alk~~~~~l~~k~~~~~~~~~~~l-~~~~~li~~~~~~~--d~~~i~i~f~~~~p~n~~e~~~~~~k 420 (471) T protein:vir:10 348 D--KLG-NSSG-VALKFLYSLLELKAGNMETQFRSGY-ATLVKMILKHLGLS--DKLKIKQTWTRNSINNDTEMAQVVST 420 (471) T ss_pred c--ccc-CccH-HHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhccC--CCceeEEEeCCCCCCCHHHHHHHHHH Confidence 2 222 2222 112222 223334445556666 45667777655322 22233688998999999999999999 Q ss_pred HHhcCCccccHHHHHHHHHHcCC-CCCCCc-ccccchhhhcCCCcCCCCCCcccccCcccccccccC Q lcl|NC_017974. 432 IGAVGYLPKTPKVINRVLEVLGI-DEKIEE-DISQEELMKLLGEDTSRAGDGMTKGSSGNGTGKISS 496 (507) Q Consensus 432 L~~~G~~~~~~~~~~~i~e~~Gi-p~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 496 (507) | .|. ++ ++.+.+.++. ..+..+ +....+... ..+..+..++ +++++.- + T Consensus 421 l--~g~-iS----~et~~~~~p~v~D~~~E~eri~~E~~~-~~~~~~~~~~------~~~~~e~--~ 471 (471) T protein:vir:10 421 L--ATI-TS----RENVAKSNPIVEDWQDELRLQKAEQEG-RSEKLYDMEE------VEHESEV--E 471 (471) T ss_pred H--hcc-Cc----hHHHHHhCCCCCCHHHHHHHHHHHHHH-HHhcccccCC------CCCcccc--C Confidence 7 464 44 2445565543 211111 000001100 0110000000 0000000 0 No 224 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=66.31 E-value=0.27 Score=23.71 Aligned_cols=423 Identities=7% Similarity=-0.038 Sum_probs=151.7 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTFV 80 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~av 80 (507) +++.+.+.+ .+..+... .-.+.+.- ...... .. .+..+..| . -.....-++.....-+ T Consensus 49 i~~~~~~~~-~r~~~l~~-------Yy~g~~~i-~~~~~~-~~-~~~~~~~k---i--------~~n~~k~Iv~~~~~yl 106 (511) T protein:vir:96 49 IEHHMDYQR-PRLKVLSD-------YYEGKTKN-LVELTR-RK-EEYMADNR---V--------AHDYASYISDFINGYF 106 (511) T ss_pred HHHHHHhhH-HHHHHHHH-------HhcccCcc-ccccCc-Cc-ccccCcce---e--------ecchHHHHHHHHHhhh Confidence 111111100 00000000 00000000 000000 00 00000000 0 0122222333333444 Q ss_pred hhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeeeeccccC Q lcl|NC_017974. 81 EKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAFRP 159 (507) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~r~ 159 (507) .+-+..|... +.++.+++.++++.. .|.....++. ++.-||. +++++|... +|.+.++-+.|+. T Consensus 107 ~g~p~~~~~~------~~~~~~~l~~~~~~n---~~~~~~~~~~~~~~i~G~-a~~~vy~de-----d~~~~i~~~~p~~ 171 (511) T protein:vir:96 107 LGNPIQYQDD------DKDVLEAIEAFNDLN---DVESHNRSLGLDLSIYGK-AYELMIRNQ-----DDETRLYKSDAMS 171 (511) T ss_pred ccCCceeecC------chHHHHHHHHHHhhc---CHHHHHHHHHHHHHhcCe-eEEEEEeCC-----CCceEEEEEccce Confidence 4444444322 234556677776542 3666655554 5777997 456888532 4566655443321 Q ss_pred chhcccccceeecCC--Ccccccccccc------------cccccccccccccc-----------ccccCCCcccccccc Q lcl|NC_017974. 160 QASLSRTNPIVYNED--GSAIIGIKQSL------------SAFQNYTASEIGVG-----------GVSTRMSDVIIPINR 214 (507) Q Consensus 160 ~~~i~~~~~~~~~~d--~~~l~~~~q~~------------~~~~~~~~~~~~~~-----------~~~~~~~~~~iP~~k 214 (507) ++. .|+.. +..+..++... ..+..++.....-. ..........+..-. T Consensus 172 --~~~-----vydd~~~~~~~~~vr~~~~~~~d~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~vP 244 (511) T protein:vir:96 172 --TFV-----IYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMP 244 (511) T ss_pred --eEE-----EEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccccCCcee Confidence 111 11111 11111111100 00001111110000 000000011111111 Q ss_pred EEEEeeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHH-HHHHHH Q lcl|NC_017974. 215 VMLMNTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVR-GLMSDA 293 (507) Q Consensus 215 ~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~-a~~~~~ 293 (507) ++.|+ .|..|.|.+..+....=--+..+..++..++.+..|+++++-- ... +..+...... .+.... T Consensus 245 vv~~~-----nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~---~~~----~~~~~~~~~~~~~~~~~ 312 (511) T protein:vir:96 245 ITEFS-----NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGN---LNL----DPVEVRKQKEANVLFLE 312 (511) T ss_pred eEEec-----CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecC---ccC----Cchhhcccccccceecc Confidence 33333 3567888888775554444567788888899999998887621 110 1111000000 000000 Q ss_pred HHhhcCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccch-hh-hhHHHHHHH Q lcl|NC_017974. 294 ANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKG-GS-YNLSESKQT 371 (507) Q Consensus 294 ~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~-GS-~Al~~vh~~ 371 (507) .. ....+.....+ ...+++++..+. ....+...++++.+.|...--...++.+..++ .| -|+ ..... T Consensus 313 ~~---~~~~~~~~~~~-----~~~~~~~l~~~~--~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~Al-~~~~~ 381 (511) T protein:vir:96 313 PT---VYADSEGRETE-----GSVDGGYIYKQY--DVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAM-KYKLF 381 (511) T ss_pred cc---cccccccccCC-----CCcceeEEeecC--CHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHH-HHHHH Confidence 00 00000000010 011233333222 22345778888889887766555554432111 11 111 11111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHh---CCCc-CCc-cccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHH Q lcl|NC_017974. 372 IHTQFVQRVNEIILEALNENLLPQLLAL---NDMR-LPE-TEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVIN 446 (507) Q Consensus 372 v~~~~~~aD~~~i~~~ln~~li~~l~~l---N~~~-~~~-~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~ 446 (507) -....+..-.+.+...|++ +++.++.+ ++.. +.. ..-.++.|....+.|..+.++++.+| .|+ ++. + T Consensus 382 ~l~~k~~~k~~~~~~~l~~-~~~li~~~~~~~~~~~~~~d~~~i~~~f~~~~p~n~~e~~~~~~kl--~G~-iS~----e 453 (511) T protein:vir:96 382 GLEQRTKTKEGLFTKGLRR-RAKLLETILKNTWSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGK-ISQ----T 453 (511) T ss_pred HHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhhcCcccccccccceEEeCCCCCCCHHHHHHHHHHH--hcc-CCh----H Confidence 2222233334555556643 44444443 2111 111 12357889988899999999999887 465 342 3 Q ss_pred HHHHHcCC-CCCCCcccc--cchhhhcCCCcCCCCCCcccccCcccccccccCCCCchhhhccC Q lcl|NC_017974. 447 RVLEVLGI-DEKIEEDIS--QEELMKLLGEDTSRAGDGMTKGSSGNGTGKISSTRDNSAANLDN 507 (507) Q Consensus 447 ~i~e~~Gi-p~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~a~~~~~ 507 (507) .+.+.++. +.+. .+.. .++.....................+ ..++.+.+-.. T Consensus 454 t~l~~l~~v~D~~-~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~ 508 (511) T protein:vir:96 454 TLMSLFSFFQDPE-LEVKKIEEDEKESIKKAQKGIYKDPRDINDD--------EQDDDTKDTVD 508 (511) T ss_pred HHHHhCCCCCCHH-HHHHHHHHHHHHHHHHHhhccccCCCCCCCC--------CCCCccccccc Confidence 45566654 2221 1110 0110000000000000000000000 00000000000 No 225 >protein:vir:98265 Length: 524 # NCBI annotation: gp20 portal vertex of the head # Family: family:all:1036 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239198;genbank:gi:66391673;genbank:GeneID:3416367 Probab=65.00 E-value=0.29 Score=23.53 Aligned_cols=449 Identities=12% Similarity=0.087 Sum_probs=166.1 Q ss_pred Ccc------cccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhc-cchhHHHHHHHhhChHHHHHH Q lcl|NC_017974. 1 MSR------KRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIR-WPNLIGTVESMKQDATVATGL 73 (507) Q Consensus 1 ~~~------k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr-~~~~~~~y~~m~~D~~v~s~l 73 (507) ++. ++-++++...+.|.....++. .. +..++..................+ ..++++.|++|..++.|-+++ T Consensus 17 ~~~d~~~~~~~~~~~~~s~~~p~~~dGa~~-i~-~~~~~~~~~g~~~~~y~~~e~~~~~~~eLI~~YR~ma~~pEvd~Av 94 (524) T protein:vir:98 17 AREDEIELEQQLKNDTGSVAPPKNNDGAYE-IE-TDLNNQKYAGVFQQFYSGQDPAIQNKEQLINTYRGIMSYPEVENAV 94 (524) T ss_pred hhhhhhhHhhhhcCCcccccCCCCCCCcee-ec-CCCCcceecceeeeeccccccccchHHHHHHHHHHHhhccchhhHH Confidence 111 111111222222222111100 00 000010000000000111111123 468999999999999999999 Q ss_pred HHHHHHHhhCCc---eeeeCCCCChhHH----HHHHHHHHHHHhccCCCHHHHHHHHHH-HHhhcceeeeeeeeccCccc Q lcl|NC_017974. 74 DMLYTFVEKAFK---DFKVIPGESEESK----KAAKFVEYCLKNMEGQTLRQFARDAAT-FNEYGLSVVEKVYTQVTVGE 145 (507) Q Consensus 74 ~~Rk~av~~~~~---~~~~~~~~~~~~~----~~ae~v~~~l~~~~~~~~~~~l~~~l~-a~~~Gfs~~Ei~w~~~~~~~ 145 (507) +.....+....- .+.+.-.+++-++ ++.+-++..|+-|+ |..-..++.. -.-.|--.+.++..... T Consensus 95 ~eIVneaIv~~~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll~---F~~~~~~~fR~WYVDgRi~fhkiid~~~--- 168 (524) T protein:vir:98 95 SEIIDDAIVNEQGKDIITMDLAKTNFSKAIQDKIVEEFDNVLNIYD---FDNMGARLFRDWYVDSRIYFHKIMHKDE--- 168 (524) T ss_pred HhhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhc---cchhhhHHHhhhhhcceeEEEEEEcCCC--- Confidence 988776532110 0111111222223 34444444444333 2222222221 22245555566655322 Q ss_pred cCcceeeeeccccCchhcccccceeec-CCCcccccccccccccccccccccc---ccccccCCCccccccccEEEEeeC Q lcl|NC_017974. 146 YIGKYKVKNLAFRPQASLSRTNPIVYN-EDGSAIIGIKQSLSAFQNYTASEIG---VGGVSTRMSDVIIPINRVMLMNTG 221 (507) Q Consensus 146 ~~g~~~~~~l~~r~~~~i~~~~~~~~~-~d~~~l~~~~q~~~~~~~~~~~~~~---~~~~~~~~~~~~iP~~k~i~~~~~ 221 (507) ..| +..|....|.+|.+++-.... .|++.- +.+.-.-++.+...... .....+..+++.||.+- |+|+|. T Consensus 169 ~kG---I~ELr~lDPr~i~~vr~~~~~~~~~~~~--v~~~~~e~f~Y~~~~~~~~~~g~~~~~~~~ikI~~dA-Ivy~hS 242 (524) T protein:vir:98 169 SKG---IRELRQLDPRCMELIRESITETLDGGVK--VFRGYREFFVYSAPKAGYTYNGQIYQANQKIKIPRSA-IVYAHS 242 (524) T ss_pred Ccc---eeeeeeeCCccceeeeeccccccccchh--hccceeeeeeeccCCCccccccceecCCCceeechhh-eeeecc Confidence 124 333444444455554422222 122211 11111112222211111 11112334567787665 778876 Q ss_pred CccCcccc---chHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhh- Q lcl|NC_017974. 222 GSSSQALG---VSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAH- 297 (507) Q Consensus 222 ~~~~~p~G---~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~- 297 (507) .-.+ ++ .|.|.++..++==-+.....-+.+-=--.-.=.++++. +|..+ .....+.|+.++.-..|=. T Consensus 243 GL~d--~~~~iisyLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYID--vGnlP----k~KAeqYl~~im~k~kNklv 314 (524) T protein:vir:98 243 GLED--CSNNIIGYLHRAVKPANQLRLLEDAMVIYRITRAPERRVFYID--VGQMG----GNKATQYVNNIAQGLKNRVV 314 (524) T ss_pred Cccc--CCCCeeeehhHhhHhHHhhHHHHhhHHHHhhhccccceEEEEe--cCCCC----chhHHHHHHHHHHhcCceeE Confidence 5432 22 37888888776433333332222210001111244442 22222 2233345555444332100 Q ss_pred cCCceeE--------------eecC-----cccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhccccccccc Q lcl|NC_017974. 298 SGEQSFF--------------MLPS-----DTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGND 358 (507) Q Consensus 298 ~G~~a~~--------------iiP~-----g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~ 358 (507) -.+..|- -+|. |++|.+. .|+.+... ..=|+|..+.+-+++--..--.+.+ T Consensus 315 YDa~TGevrddrk~msMlEDyWLpRReGgrgTEItTL---------pggqnlge-m~DV~YF~kkLy~aLnVP~sRl~~~ 384 (524) T protein:vir:98 315 YDARTGTVKNQQNNLSMTEDYWLMRRDGKAITEVSTL---------PGGQNFSD-MDDIKWFNRKLYEALRVPLSRMPRD 384 (524) T ss_pred eeccCceeeccccccchhhhhcccccCCCCccceeec---------cccCCcCh-HHHHHHHHHHHHHHhCCCceeccCC Confidence 0001111 1222 5665542 23322322 3458999999999887765333221 Q ss_pred chhhhhHH---HHH-HHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCccc----cceEEecCc------CchhHH Q lcl|NC_017974. 359 KGGSYNLS---ESK-QTI-HTQFVQRVNEIILEALNENLLPQLLALNDMRLPETE----MPYVKAGEI------VDVDME 423 (507) Q Consensus 359 ~~GS~Al~---~vh-~~v-~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~----~P~~~~~~~------~~~dl~ 423 (507) .|+..+| ++. .|+ |...+......+...|..-|-..|+-=+. -..+. .+.+.|+.. |-.+.+ T Consensus 385 -~~~f~~Gr~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgi--it~eew~~i~~~I~~~f~~Dn~f~ElKe~E 461 (524) T protein:vir:98 385 -DGGMQIGGGGEITRDELKFSKFIRTLQIQFSPVLSDPLKTNLIAKKI--ITEDEWEENVSKISFVFQQDSYYAEVKDIE 461 (524) T ss_pred -CCccccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcC--CCHHHHHHHhhcceEEEeecchHHHHHHHH Confidence 1222222 222 222 22334444444444444333333332221 11111 122333332 234444 Q ss_pred HHHHHHHHHHhcCCccccHHHHHHHHHH-cCCCCCCCcccccc-hhhhcCCCcCCCCCCcccccCccccc Q lcl|NC_017974. 424 GFSKAIQRIGAVGYLPKTPKVINRVLEV-LGIDEKIEEDISQE-ELMKLLGEDTSRAGDGMTKGSSGNGT 491 (507) Q Consensus 424 ~~a~~~~~L~~~G~~~~~~~~~~~i~e~-~Gip~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (507) .+.+++..|..+--.+-.-...+|+++. |.+.+.+-.+.... ......+--..|. + ..++. T Consensus 462 il~~R~~~l~~~dpyvGky~s~dyi~k~ILr~tDeei~~~~k~I~~E~k~~~~~~p~----~---e~~~f 524 (524) T protein:vir:98 462 ILERRLNLMSQVEGVVGKYVSHKYIMKEILRMSDEDIDEQAKLIEEESKEERFKNPE----A---EEENF 524 (524) T ss_pred HHHHHHHHHHHhccccccccchHHHHHHHhccCHHHHHHHHHHHHHHHhCCCCcCCc----c---ccccC Confidence 5555655554431111112335677654 66642211111000 0000011111110 0 00011 No 226 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=64.90 E-value=0.29 Score=23.51 Aligned_cols=414 Identities=11% Similarity=0.057 Sum_probs=154.0 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHH---H-------------- Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVES---M-------------- 63 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~---m-------------- 63 (507) +++=-++.++...-+-......... ..-......+. ...+.+ .++.+.|+- + T Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~-------~~i~~~i~~~~--~~~~~~--~~~~~Yy~g~~~i~~~~~~~~~~~~~~ 72 (474) T protein:vir:96 4 IFWPNEKPYHERVVEQIKPKYETQE-------EMIIRLINDHK--PKIDDI--TVGERYYNHDPDVLRLAPKLDNKGEID 72 (474) T ss_pred eccCCCchhhhhHHHHhhhccCChH-------HHHHHHHHHHH--HHHHHH--HHHHHHhccCCcchhccchhccccccc Confidence 1111111111111100000000000 00000001110 000000 011111100 0 Q ss_pred --h-----hChHHHHHHHHHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHH-HHHHhhcceeee Q lcl|NC_017974. 64 --K-----QDATVATGLDMLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDA-ATFNEYGLSVVE 135 (507) Q Consensus 64 --~-----~D~~v~s~l~~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~-l~a~~~Gfs~~E 135 (507) + -.+...-++.+...-+.+-+..+.+ .+++..+.+.+++.+ ++.+.+.++ .++.-+|.+. + T Consensus 73 ~~~~~~ki~~n~~~~Ivd~~~~~l~g~p~~~~~------~d~~~~~~l~~~~~n----~~~~~~~~~~~~~~~~G~~~-~ 141 (474) T protein:vir:96 73 PLKPDWRMFTNYHQNLVDQKVAYAVANPVTFSS------DDDKSLKTIQEVLNH----KWDDKLVDILTAASNKGIEW-L 141 (474) T ss_pred ccccchhcccchHHHHHHhhhhhhcccCceeec------CchHHHHHHHHHHhc----CHHHHHHHHHHHHHhcCeeE-E Confidence 0 0122222333333333343333322 233455566666542 355555544 4677799865 5 Q ss_pred eeeeccCccccCcceeeeeccccCchhcccccceeecCCCcccc-ccccccc----cccccccccccccccc-------- Q lcl|NC_017974. 136 KVYTQVTVGEYIGKYKVKNLAFRPQASLSRTNPIVYNEDGSAII-GIKQSLS----AFQNYTASEIGVGGVS-------- 202 (507) Q Consensus 136 i~w~~~~~~~~~g~~~~~~l~~r~~~~i~~~~~~~~~~d~~~l~-~~~q~~~----~~~~~~~~~~~~~~~~-------- 202 (507) ++|.. .+|++.+.-+.|+. .+ +++-+...+.+. .++.... ....++.....-.... T Consensus 142 ~~y~d-----~~~~~~i~~~~p~~--~~----~v~d~~~~~~~~~~vr~~~~~~~~~~~~yt~~~v~~~~~~~~~~~~~~ 210 (474) T protein:vir:96 142 QPYID-----ENGEFKTFRVPAEQ--AI----PIWTNKERDTLKAFIRYYRLDGAERVEYWTDSDVTYYEYQDGILIPDY 210 (474) T ss_pred EEEec-----CCCceEEEEEcccc--eE----EEEcCCCCCceEEEEEEEeecCceEEEEEeCCeEEEEEecCCceeecc Confidence 77753 24566655544331 11 111111111111 1111000 0000100000000000 Q ss_pred ----------cCCCcccccccc--EEEEeeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEeccccc Q lcl|NC_017974. 203 ----------TRMSDVIIPINR--VMLMNTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQI 270 (507) Q Consensus 203 ----------~~~~~~~iP~~k--~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~ 270 (507) ......+-+..+ ++.|+ +|+.|.|.+..+-...=-=+..+..++..++.+..|+++++ +. T Consensus 211 ~~~~~~~~~~~~~~~~~~~~g~iPvv~~~-----nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~---g~ 282 (474) T protein:vir:96 211 YHGEEHIQSHYYVGNKRVSWGRVPFIPFK-----NNPQEMSDLFMYKTIIDAMDKRLSDTQNTFDESTELIYILK---GY 282 (474) T ss_pred ccccccccccccccccccCCCceeEEEec-----cCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeee---cC Confidence 000011111111 23332 36778898887544333334567888999999999988865 11 Q ss_pred CCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeec-CcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHh Q lcl|NC_017974. 271 LNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLP-SDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLG 349 (507) Q Consensus 271 ~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP-~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iL 349 (507) + .++..+ ... ++. ....+.+| .|.+++ ++..+.. ...+...++.+.+.|...-- T Consensus 283 ~----~~~~~~---~~~-------~~~--~~~~i~~~~~~~~~~-------~l~~~~~--~~~~~~~~~~l~~~i~~~s~ 337 (474) T protein:vir:96 283 E----GQDLDE---FMR-------NLK--YYKAINVDGDGSGVD-------TIQIEVP--VQSSKEYLDMLRDYVIEFGQ 337 (474) T ss_pred C----cccccc---hhh-------hhh--cCceEEecCCCCcee-------EEeecCC--hHHHHHHHHHHHHHHHHHhC Confidence 1 111111 011 111 11234444 344433 3333222 22356788888888888665 Q ss_pred cccccccccchhhhhHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCccccceEEecCcCchhHHHH Q lcl|NC_017974. 350 AGFINVGNDKGGSYNLSESKQTI----HTQFVQRVNEIILEALNENLLPQLLALNDMRLPETEMPYVKAGEIVDVDMEGF 425 (507) Q Consensus 350 GqtLts~~~~~GS~Al~~vh~~v----~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~ 425 (507) +..++.+ +.|+...|.. .+. ....+..-.+.+...|. ++++.++.+.+.... ..-..+.|....+.|..++ T Consensus 338 ~p~~~~~--~~~~n~Sg~A-l~~~~~~l~~k~~~k~~~~~~~l~-~~~~~i~~~~~~~~~-~~~i~i~f~~~~p~~~~e~ 412 (474) T protein:vir:96 338 GVDFQQD--KFGNSPSGIA-LKFMYSNLDLKANKLKNKTLTALQ-ELLQYIIDFYKLNIK-VQDVEITFNFNVMVNELEQ 412 (474) T ss_pred Ccccccc--ccccccHHHH-HHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhCCCcc-cceeeEEeccCCCcCHHHH Confidence 5554432 2233322322 222 22233334445666664 477777776543222 1224677888888887776 Q ss_pred HHHHHHHHhcCCccccHHHHHHHHHHcC-CCCCCCcc-cccchhhhcCCCcCCCCCCcccccCcccccccccCCCCch Q lcl|NC_017974. 426 SKAIQRIGAVGYLPKTPKVINRVLEVLG-IDEKIEED-ISQEELMKLLGEDTSRAGDGMTKGSSGNGTGKISSTRDNS 501 (507) Q Consensus 426 a~~~~~L~~~G~~~~~~~~~~~i~e~~G-ip~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 501 (507) ++. ++.+|.+ + .+.+.+.++ ++.+..+- ....+.. ...+...+.. +...+... + ..+.+- T Consensus 413 ~~~---~~~ag~i-S----~et~~~~~~~v~d~~~E~~ri~~E~~-e~~~~~~~~~-~~~~~~~~-d-----~~~e~~ 474 (474) T protein:vir:96 413 SQI---GVQSQYL-S----KETVVTNHPWVDDPVAELERIEQDNI-DFNKQLPPLE-GDANGRAQ-D-----NESETN 474 (474) T ss_pred HHH---HHhcCCC-c----hHHHHHhCCCCCCHHHHHHHHHHHHH-HHHhcccccc-cccccccC-C-----CcccCC Confidence 664 4567753 3 234555554 33221110 0000101 1111111110 00000000 0 001111 No 227 >protein:vir:108049 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595296;genbank:gi:161622602;genbank:GeneID:5783768 Probab=62.70 E-value=0.33 Score=23.22 Aligned_cols=448 Identities=11% Similarity=0.042 Sum_probs=166.1 Q ss_pred Ccccccc--------------------cccccCccccccccccc-cccccccCchhhhHHHHhhhhcccchhccchhHHH Q lcl|NC_017974. 1 MSRKRNR--------------------NRSVKVAKATSEQINVS-RMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGT 59 (507) Q Consensus 1 ~~~k~~~--------------------~~~~~~~~~~~~~~~~~-~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~ 59 (507) |++-.|. .++...+.|.....++. ..........+..+ +.....+.. .....++++. T Consensus 1 ~~~~~~~~~lf~f~~~~de~~~~~~~~~~~~S~~~p~~~dGa~~I~~~~~~~~~~~~~q-~~y~~~e~~-~~~~~eLI~~ 78 (524) T protein:vir:10 1 MANFNTILSFLKPWANEDEKEYKQQINNNLESVTAPKLDDGAREIETQEQNIPYNALMQ-QMFGSNEPE-VKNTRELIDT 78 (524) T ss_pred CCchhhHHHHhhhhhcchhhhhhhhhccCCCccccCCCCCCceeeccCcccccchhhhh-hhhhcccch-hhhHHHHHHH Confidence 3322221 12222222222111111 00000000011100 111111211 1123579999 Q ss_pred HHHHhhChHHHHHHHHHHHHHhhCCc---eeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHHHHHhhcceee-- Q lcl|NC_017974. 60 VESMKQDATVATGLDMLYTFVEKAFK---DFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAATFNEYGLSVV-- 134 (507) Q Consensus 60 y~~m~~D~~v~s~l~~Rk~av~~~~~---~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l~a~~~Gfs~~-- 134 (507) |++|..++.|-++++.....+...+- .+.+.-..++-++.+.+-|.+- |.. |..+|+.--+||..+ T Consensus 79 YR~ma~~pEvd~Av~eIVneaiv~d~~~~pV~l~Ld~~~~s~siK~kI~ee--------F~~-Il~ll~F~~~~~~~fR~ 149 (524) T protein:vir:10 79 YRNLMNNYEVDNAVQEIVSDAIVYEDDKEVVALNLDGTDFSQSIKDKILAE--------FSE-VLNLLNFQRKGTDHFQR 149 (524) T ss_pred HHHHhhccchhhHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHH--------HHH-HHHHhccchhhhHHHhh Confidence 99999999999999988776532211 0111111122223333333222 222 224444444555444 Q ss_pred ---------eeeeeccCccccCcceeeeeccccCchhcccccceeecCCCcccccccccccccccccccccc---ccccc Q lcl|NC_017974. 135 ---------EKVYTQVTVGEYIGKYKVKNLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIG---VGGVS 202 (507) Q Consensus 135 ---------Ei~w~~~~~~~~~g~~~~~~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~---~~~~~ 202 (507) .++-.... .-+|- ..|....|.+|.+++-+....++. ...+.... .++.+..+... ..... T Consensus 150 WYVDgRi~fHkiid~~~--pk~GI---~Elr~lDPr~i~~vr~i~~~~~~~-~~vi~~~~-e~f~Y~~~~~~~~~~~~~~ 222 (524) T protein:vir:10 150 WYVDSRIFFHKIINPKK--MKDGV---QELRRLDPRQVQYIREIVTRMEDG-VKIVDGYR-EFFVYDTGHESYCADGRIY 222 (524) T ss_pred heeeceEEEEEEeeCCC--ccccc---eeeeeeCCccceeeeeecccCccc-chhhcchh-hheeecCCCcccccCccee Confidence 22222110 11233 333344445565554443333221 12122111 12222222111 11122 Q ss_pred cCCCccccccccEEEEeeCCccCc--cccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchH Q lcl|NC_017974. 203 TRMSDVIIPINRVMLMNTGGSSSQ--ALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSS 280 (507) Q Consensus 203 ~~~~~~~iP~~k~i~~~~~~~~~~--p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~ 280 (507) +...++.||.+- |+|+|..-.+- -.=.|.|.++..++==-+.....-+.+-=--.-.=.++++. +|..+ .. T Consensus 223 ~~~~~ikI~~dA-Ivy~~SGL~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYID--VGnlP----k~ 295 (524) T protein:vir:10 223 SAGTKVKIPRAA-VVYAHSGLLDCCGKNIIGYLQRAIKPANQLKLMEDAMVIYRITRAPDRRVFYID--TGNMP----SR 295 (524) T ss_pred cCCcceecchhh-eeeeccCcccCCCCceeccchHhhHHHHhhHHHHhhHHHHhhhccccceEEEEe--cCCCC----ch Confidence 445677787665 77777643211 12247888888776444433333332210001111244442 22222 22 Q ss_pred HHHHHHHHHHHHHHHhhc-CCcee--------------EeecC-----cccccccccceeeeeccCCCcchhHHHHHHHH Q lcl|NC_017974. 281 PEAEMVRGLMSDAANAHS-GEQSF--------------FMLPS-----DTKDNAPQYSMTLKGIDGMGKQYSTAQLISDR 340 (507) Q Consensus 281 ~~~~~l~a~~~~~~n~~~-G~~a~--------------~iiP~-----g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~ 340 (507) ...+.|+.++.-..|=.- .+..| .-+|. |+||.+. .|+.+.. -..=|+|. T Consensus 296 KAeqYl~~im~k~kNKlvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTL---------pGgqnlg-em~DV~YF 365 (524) T protein:vir:10 296 KAAAQMQHIMNTMKNRVVYDASTGKIKNQQHNMSMTEDYWLQRRDGKAVTEVDTM---------PGATGMS-DMDDVLYF 365 (524) T ss_pred hHHHHHHHHHHhcCceeEEeccCCeeccchhhhhhHhhhcccccCCCCccceeec---------cccCCcC-hHHHHHHH Confidence 333455555533322000 00011 11222 5665542 2332232 23458999 Q ss_pred HHHHHHHHhcccccccccch----hhhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCccc----cce Q lcl|NC_017974. 341 KKSILDRLGAGFINVGNDKG----GSYNLSESKQTIH-TQFVQRVNEIILEALNENLLPQLLALNDMRLPETE----MPY 411 (507) Q Consensus 341 d~~Isk~iLGqtLts~~~~~----GS~Al~~vh~~v~-~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~----~P~ 411 (507) .+.+-+++--..--.+..+. +.++..=+..|+. ...+......+...|..-|-..|+-=+.. ..+. .+. T Consensus 366 ~kkLy~aLnVP~sRl~~e~~~~f~~gr~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgii--t~eew~~i~~~ 443 (524) T protein:vir:10 366 RTALYRALRIPESRIPSESNSGVMFDAGTAITRDELKFAKWIRQLQNKFEEIFLDPLKTNLILKKII--TEDEWEREINN 443 (524) T ss_pred HHHHHHHhCCCchhccCCCCccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCC--CHHHHHHHhhc Confidence 99999988776533321111 2233222222332 23344444444444443333333321211 1111 123 Q ss_pred EEecCc------CchhHHHHHHHHHHHHhcCCccccHHHHHHHHHH-cCCCCCCCcccccchhhhcCCCcCCCCCCcccc Q lcl|NC_017974. 412 VKAGEI------VDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEV-LGIDEKIEEDISQEELMKLLGEDTSRAGDGMTK 484 (507) Q Consensus 412 ~~~~~~------~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~-~Gip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 484 (507) +.|+.. |-.+.+.+.+++..|..+--.+-.-...+|+++. |.+.+.+-.++. ..-.....++--+.++ T Consensus 444 I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~~~-k~I~~E~k~~~~~~~~---- 518 (524) T protein:vir:10 444 IKVTFNRDSYFSEMKDAEIMERRINMLTMAEPFIGKYISHQTAMKDFLQMTDEEINQEA-KQIEEESKEARFQNPD---- 518 (524) T ss_pred ceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHHHHHHhccCHHHHHHHH-HHHHHHhhcCCCCCCC---- Confidence 333332 2244445555665554431111111235677654 566422111110 0011111110000000 Q ss_pred cCccccc Q lcl|NC_017974. 485 GSSGNGT 491 (507) Q Consensus 485 ~~~~~~~ 491 (507) ...++. T Consensus 519 -~~~~~f 524 (524) T protein:vir:10 519 -EEEEDF 524 (524) T ss_pred -hhhhcC Confidence 001111 No 228 >protein:vir:94572 Length: 535 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919010;genbank:gi:119637774;genbank:GeneID:5179332 Probab=54.35 E-value=0.51 Score=22.21 Aligned_cols=447 Identities=13% Similarity=0.073 Sum_probs=157.3 Q ss_pred CccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHH-------------HHH--hhChHHHHHHHHHH Q lcl|NC_017974. 13 VAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTV-------------ESM--KQDATVATGLDMLY 77 (507) Q Consensus 13 ~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y-------------~~m--~~D~~v~s~l~~Rk 77 (507) |+..+.+.. .. .....+.+..........+-+|.++.+.. ... .-|++-.-++++-- T Consensus 1 ~~~~~~~~~-~~-------~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~La 72 (535) T protein:vir:94 1 MASSQKREG-FA-------ENGAKAVYDALKNDRNSYETRAENCAKYTIPSLFPKDSDNASTDYTTPWQAVGARGLNNLA 72 (535) T ss_pred CCchhhhhh-HH-------HHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCccccccCCcccccHHHHHHHHH Confidence 222222110 00 00000111111111011111111111100 000 12444444555444 Q ss_pred HHHhhC-----CceeeeCCCC---------ChhHHHHHHHHHHH----HHhccCCCHHHHHHHHHH-HHhhcceeee--- Q lcl|NC_017974. 78 TFVEKA-----FKDFKVIPGE---------SEESKKAAKFVEYC----LKNMEGQTLRQFARDAAT-FNEYGLSVVE--- 135 (507) Q Consensus 78 ~av~~~-----~~~~~~~~~~---------~~~~~~~ae~v~~~----l~~~~~~~~~~~l~~~l~-a~~~Gfs~~E--- 135 (507) ..+.+. +| |+....+ ..+..++.+++.++ ...+....|+..+.+++. -+.+|-++.= T Consensus 73 a~l~~~ltP~~~W-F~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~ 151 (535) T protein:vir:94 73 SKLMLALFPMQTW-MKLTISEFEAKQLVAQPAELAKVEEGLSMVERILMNYIESNSYRVTLFETLKQLVVAGNALLYIPE 151 (535) T ss_pred HHHHhhhcCCCCc-cccccChhhhhccccchhHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcEeEeecc Confidence 444432 34 3332221 11223455554432 233455667776666543 3456665442 Q ss_pred ---------------eeeeccCccccCccee-----eeeccccCchhcccccceeecCCC-ccccc-ccccccc--cccc Q lcl|NC_017974. 136 ---------------KVYTQVTVGEYIGKYK-----VKNLAFRPQASLSRTNPIVYNEDG-SAIIG-IKQSLSA--FQNY 191 (507) Q Consensus 136 ---------------i~w~~~~~~~~~g~~~-----~~~l~~r~~~~i~~~~~~~~~~d~-~~l~~-~~q~~~~--~~~~ 191 (507) .++..+..|.-+-.+. .+.|...-+..+.. ...++.|. ..+.. +...... |..+ T Consensus 152 ~~~~~~~f~~~pl~~y~v~~d~~G~vd~i~r~~~~~~~~l~~~~~~~~~~--~~~~~~~~~v~v~~~v~~~~~~~~~~~~ 229 (535) T protein:vir:94 152 PEGTYNPMKLYRLSSYVVQRDAFGTVLQIVTLDKTAYAALPEDVRNSMDS--SQEHKGDEMIDVYTHIYLDEESGEYLKY 229 (535) T ss_pred CcCcccceEEEEcCeEEEeeCCCCCeEEEEeeeeccHHHhhHHHHHHHHh--ccccCCCceeEEEEEEEeeCCCCcEEEE Confidence 2222222211111000 00010000000000 00000000 00000 0011000 0000 Q ss_pred ccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccC Q lcl|NC_017974. 192 TASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQIL 271 (507) Q Consensus 192 ~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~ 271 (507) ...+ ........+.-..-...|++.|+...+|+.||.|....++.-..--+...+.-+...+.-.-|+..+ -|-.+. T Consensus 230 ~e~~--g~~~~~~~~~~g~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv-~p~g~~ 306 (535) T protein:vir:94 230 EEID--GVEVEGTDASYPVDACPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLV-NPAGIT 306 (535) T ss_pred EEec--CeeeccccccCccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCCccc-cccccc Confidence 0000 0000000001111224589999999999999999999999887666666666677766543332111 111111 Q ss_pred CCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcc Q lcl|NC_017974. 272 NKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAG 351 (507) Q Consensus 272 k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGq 351 (507) +. ..+ ..+ ..+.++|-..+ .+..+..+.++....-...|+.+...|+.+++.. T Consensus 307 ~~-------------~~~-------~~~-~~g~~v~g~~~------~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~ 359 (535) T protein:vir:94 307 QV-------------RRL-------TKA-QTGDFVSGRPE------DISFLQLEKAADFSVARAVSEQIEGRLSYAFMLN 359 (535) T ss_pred ch-------------hhc-------ccC-CCceeecCCcc------cceeeecccccchhHHHHHHHHHHHHHHHHHhHh Confidence 10 011 111 12445553321 2223333333444445678999999999999865 Q ss_pred cccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh--C-----CCcCC--ccccceEEecC-----c Q lcl|NC_017974. 352 FINVGNDKGGSYNLSESKQTIHTQFVQRVNEIILEALNENLLPQLLAL--N-----DMRLP--ETEMPYVKAGE-----I 417 (507) Q Consensus 352 tLts~~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~l--N-----~~~~~--~~~~P~~~~~~-----~ 417 (507) .++..+ ++.--..||+....+.. ..- -=+-..|+..++-||+.. + +. .| +...-+..+-. . T Consensus 360 ~~~~~d--~~rvTAtEV~~r~~E~~-~~L-Gpv~~rl~~ElL~Pli~r~~~il~r~g~-lP~~p~~~v~~~~vs~la~l~ 434 (535) T protein:vir:94 360 SAVQRT--GERVTAEEIRYVASELE-DTL-GGVYSILSQELQLPMVRVLLKQLQATNQ-IPELPKEAVEPTISTGMEALG 434 (535) T ss_pred hhccCC--CCCccHHHHHHHHHHHH-HHh-hhHHHHHHHHHHHHHHHHHHHHHHhCCC-CCCCChhhccceEeehHHHHH Confidence 544322 22122345544332221 111 122223444455555431 1 11 11 11111122211 1 Q ss_pred CchhHHHHHHHHHHHHhcCCc-----cccHHHHHHHHHHcCCCC----CCCcccccchhhhcCC----CcCCCCCCcccc Q lcl|NC_017974. 418 VDVDMEGFSKAIQRIGAVGYL-----PKTPKVINRVLEVLGIDE----KIEEDISQEELMKLLG----EDTSRAGDGMTK 484 (507) Q Consensus 418 ~~~dl~~~a~~~~~L~~~G~~-----~~~~~~~~~i~e~~Gip~----~~~~~~~~~~~~~~~~----~~~~~~~~~~~~ 484 (507) ...++..+...++.|..+|=. +..+...+++.+.+|+|. +.+++..+.....+.. ......+..... T Consensus 435 r~~~~~~l~~~~~~laq~~P~~ld~~id~d~~~~~~a~~~Gvp~~~i~rs~eev~~~~~q~~~~~~~~~~~~~~g~~~~~ 514 (535) T protein:vir:94 435 RGQDLDKLERCIAAWSALAPMQGDPDINIATIKLRIANAIGIDTSGILKTPEEKQQEMAEAAQGTAMQNAAASAGAGAGT 514 (535) T ss_pred HHHHHHHHHHHHHHHHhhChHHhhhcCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc Confidence 224566666666766665521 222345677889999983 2222222111000000 000011111110 Q ss_pred cCcccccccccCCCCchhhhccC Q lcl|NC_017974. 485 GSSGNGTGKISSTRDNSAANLDN 507 (507) Q Consensus 485 ~~~~~~~~~~~~~~d~~a~~~~~ 507 (507) .++.+..+. ++-.+...=.-| T Consensus 515 ~~~~~~~~~--~~~~~~~g~~~~ 535 (535) T protein:vir:94 515 MATASPENM--KAAAAQAGMAPN 535 (535) T ss_pred ccccChHHH--HHHHHHhccCCC Confidence 000000000 000000000001 No 229 >protein:vir:7430 Length: 563 # NCBI annotation: gp7 # Family: family:all:6920 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818545;genbank:gi:29566982;genbank:GeneID:1260216 Probab=53.77 E-value=0.52 Score=22.14 Aligned_cols=457 Identities=10% Similarity=0.044 Sum_probs=173.9 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhh--hcccc----hhccchhHHHHHHHhhChHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSE--RMKAE----EIRWPNLIGTVESMKQDATVATGLD 74 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~--~e~~~----~lr~~~~~~~y~~m~~D~~v~s~l~ 74 (507) |...++.--..+.+=|...+--++ .....-+..++.... .-+.. .+|......+| |+--.-.+. T Consensus 1 m~~~~~q~~p~~~~fp~~~a~wV~-----~~D~~RlaaY~ly~d~y~n~~~el~~il~G~dr~~~~-----~ps~r~~V~ 70 (563) T protein:vir:74 1 MPYNHKQYDPAKPFLRGGDDNIVD-----ENDKNRVRAYDLYENIYLNSAETLKLVLRGDDSVPIL-----MPSGRKIVE 70 (563) T ss_pred CCccccccCCCcccccccccccCC-----HHHHHHHHHHHHHHHhhcCchhhhhhhcCCCceeeec-----cchHHHHHH Confidence 544333322222222211111111 111111122221111 11111 12222111111 222222334 Q ss_pred HHHHHHhhCCceeeeCCCC--ChhHHHHHHHHHHHHHhccCCCHHHH-HHHHHHHHhhcceeeeeeeeccCccccCccee Q lcl|NC_017974. 75 MLYTFVEKAFKDFKVIPGE--SEESKKAAKFVEYCLKNMEGQTLRQF-ARDAATFNEYGLSVVEKVYTQVTVGEYIGKYK 151 (507) Q Consensus 75 ~Rk~av~~~~~~~~~~~~~--~~~~~~~ae~v~~~l~~~~~~~~~~~-l~~~l~a~~~Gfs~~Ei~w~~~~~~~~~g~~~ 151 (507) + ...+++-+.+|.+++.+ .+..+.+..+++.+.+. ..|... ..--.+|..-|=.|+=+.|.... ...+++. T Consensus 71 ~-~~~~Lg~~~~~~Ve~~~~de~~~~avq~~Lr~~~~~---e~l~~~~~~~~r~a~vlGDgvf~l~wDp~K--~~g~R~r 144 (563) T protein:vir:74 71 A-VHRFLGVGFDYLVEPDMGDEGIRQSLNAYFRTTFKR---EAIKAKFTSNKRWGLIRGDAHFYIHADPNK--KAGERIS 144 (563) T ss_pred H-HHHhcCCCcEEecCccccCcchHHHHHHHHHHHHHH---hhhHHHHHHHHHhhhhhcceeEEEeecccc--ccCCCce Confidence 4 44566778889888754 22333333333333322 123322 22234688899999999998531 1123444 Q ss_pred eeeccccC------chhccccc------ceeecCC-Ccccccccc-----ccccc----cccccc--cc----------- Q lcl|NC_017974. 152 VKNLAFRP------QASLSRTN------PIVYNED-GSAIIGIKQ-----SLSAF----QNYTAS--EI----------- 196 (507) Q Consensus 152 ~~~l~~r~------~~~i~~~~------~~~~~~d-~~~l~~~~q-----~~~~~----~~~~~~--~~----------- 196 (507) +..+-|+. +.+..-.. .+...+| .+.+.+.+. ++.+. +.++.. .+ T Consensus 145 v~~vDP~~~fp~~dpd~v~g~~~v~v~~~~~~pdd~~~~~~r~~~~~~~lndeg~~~~~~~~dae~w~lg~wd~r~~~~~ 224 (563) T protein:vir:74 145 VDEVDPRQIFLIEDGSTVVGFHMVDIVQDFRSPDDPSKKLARRRTFRRVRNDEGMFTGRISSELTHWTLGNWDDRGAISD 224 (563) T ss_pred EeecCCceeeeccCCCCcccceeeecccCCCCCcchhccceeeeeeeeeeCCCCCccceeeeccchhccccccccCccch Confidence 44333220 00000000 0000001 111211110 00000 000000 00 Q ss_pred cc--cc---cc-cCC-Ccccc--ccc--cEEEEeeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEe Q lcl|NC_017974. 197 GV--GG---VS-TRM-SDVII--PIN--RVMLMNTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELK 265 (507) Q Consensus 197 ~~--~~---~~-~~~-~~~~i--P~~--k~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~ 265 (507) .. .. .. ... +...+ |.. -+++|+..+..+..+|.|.|..+.-..---+....+....++-.|.||.++- T Consensus 225 ~~~~~~~~~~~~~~d~e~~~LP~pi~~iPiv~~~tip~~~s~WG~S~La~ll~~~~eLn~~~Td~s~i~~~tG~pi~vl~ 304 (563) T protein:vir:74 225 EQARRKEQVRSAQHDEEEEELPEPISQLPLYRWRNKPPQNSSWGTSQLEGMETLAYALNQSLTDEDATIVFQGLGMYVTN 304 (563) T ss_pred hhhcccchhhhhhhhchhhhccccccCccEEEcCCCCCcccccchhhHHHHHHHHHHHhhhhhHHHHHHHhcCCCeEEec Confidence 00 00 00 000 11112 222 2445666778899999999998887777777777778888888888887765 Q ss_pred -cccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHH-H Q lcl|NC_017974. 266 -IPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKK-S 343 (507) Q Consensus 266 -~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~-~ 343 (507) .++.-+ ..++.. . -|+ |..+.+=+|...+-. -|...+|..+...++..+++++. . T Consensus 305 ~~~p~d~--~~g~~~-----------~--w~v--gpG~i~El~~~~~~g------~l~~v~g~~~l~~~q~Hm~~l~era 361 (563) T protein:vir:74 305 ASAPVDP--NTGELT-----------D--WNI--GPMQIVEIAGNRNDN------YFERVSGVQDVSPFQDHMKWIDEKG 361 (563) T ss_pred ccccccc--cccccc-----------c--ccc--CCceeEeccCCcccc------ceeeecchhhhHHHHHHHHHHHHHH Confidence 221111 011000 0 011 111222223221111 13334454433345555666654 3 Q ss_pred HHHH-----HhcccccccccchhhhhHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHhC----C-CcCCc Q lcl|NC_017974. 344 ILDR-----LGAGFINVGNDKGGSYNLSESKQTIHTQFVQRVN-------EIILEALNENLLPQLLALN----D-MRLPE 406 (507) Q Consensus 344 Isk~-----iLGqtLts~~~~~GS~Al~~vh~~v~~~~~~aD~-------~~i~~~ln~~li~~l~~lN----~-~~~~~ 406 (507) |+-. +--+|+-++.. -+++|| +++..-....++... ++...-+.+.|++.+-.+- + ..++. T Consensus 362 l~~~s~tPavA~G~vD~~~~-~SGiAL-eL~L~PL~a~~~ek~l~l~~~mr~~r~~~~~~lL~~~erl~~~g~~~~~~g~ 439 (563) T protein:vir:74 362 IAEGSGTPEVAIGRVDVTSA-ESGISL-ELQLKPLLAANEEKELEMIVVMDQFLHDWMTMWLPAYESDFQEQDGSRPFAS 439 (563) T ss_pred HHhhccCcceeecccccccc-cchhhh-hhhhhHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhHhhhhcccccccc Confidence 3321 11111111111 233554 333333333333333 3333333344443333321 0 00112 Q ss_pred cccc-----eEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCCcccccc---------hhhhcCC Q lcl|NC_017974. 407 TEMP-----YVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIEEDISQE---------ELMKLLG 472 (507) Q Consensus 407 ~~~P-----~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~~~~~~~---------~~~~~~~ 472 (507) +..| .++|...-+.|.++..+-+..|+..|++.. .-..+.+.+. |.|-+.-+++.+. ..++..+ T Consensus 440 ~~~~~~~~v~ivf~p~~P~d~~~vv~~~~tl~~aGiiSr-etAv~~L~~~-g~~~pdae~e~~~ie~~~i~~~~~a~a~a 517 (563) T protein:vir:74 440 ADLLNECSVVCIFADPMPVNKTQVTQDTLLLQQAHLILR-KMAVAKLRSI-GWEYPEVDDQGNALTDDDIADMLLAEAEA 517 (563) T ss_pred cccCCceEEEEEeCCCCCccHHHHHHHHHHHHHcCchhH-HHHHHHHHhC-CCCCCcHHHHHhhcCHHHHHHHHHHHhhc Confidence 2222 356999999999999998999999998654 2112233332 6654432222111 1111111 Q ss_pred CcCCCCCCcccccCcccccccccCCCCchhh---hccC Q lcl|NC_017974. 473 EDTSRAGDGMTKGSSGNGTGKISSTRDNSAA---NLDN 507 (507) Q Consensus 473 ~~~~~~~~~~~~~~~~~~~~~~~~~~d~~a~---~~~~ 507 (507) + +.-+..+. +++-.. ....|+.-. ..-| T Consensus 518 d----~~~~~~a~--~~~g~~-~~~~dd~g~p~~~~~~ 548 (563) T protein:vir:74 518 D----ASLGLSAM--DNGGAG-EQQFDDQGNPIDQFGN 548 (563) T ss_pred c----Ccccceec--ccCCCC-cccccccCCchhHcCC Confidence 1 11111111 111100 011111111 1111 No 230 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=52.77 E-value=0.55 Score=22.03 Aligned_cols=421 Identities=7% Similarity=-0.037 Sum_probs=149.5 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTFV 80 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~av 80 (507) +++.+.+. .++..... +.-.+.+. + +....... .+..+..| .-.....-++.....-+ T Consensus 49 i~~~~~~~----~~r~~~l~----~Yy~g~~~-i-~~~~~~~~-~~~~~~~k-----------i~~n~~k~Iv~~~~~yl 106 (511) T protein:vir:10 49 IEHHMDYQ----RPRLKVLS----DYYEGKTK-N-LVELTRRK-EEYMADNR-----------VAHDYASYISDFINGYF 106 (511) T ss_pred HHHHHHhh----HHHHHHHH----HHhcccCc-c-ccccCccc-ccccCcce-----------eecchHHHHHHHHhhhh Confidence 22111110 00000000 00000000 0 00000000 00000000 00122223333333444 Q ss_pred hhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeeeeccccC Q lcl|NC_017974. 81 EKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAFRP 159 (507) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~r~ 159 (507) .+-+..+... +.++.+++.++++.. .+.....++. ++.-||.+ ++++|... +|.+.++-+.|+. T Consensus 107 ~g~p~~~~~~------d~~~~~~l~~~~~~n---~~~~~~~~~~~~~~i~G~a-y~~vy~de-----dg~~~i~~~~p~~ 171 (511) T protein:vir:10 107 LGNPIQYQDD------DKDVLEAIEAFNDLN---DVESHNRSLGLDLSIYGKA-YEIMIRNQ-----DDETRLYKSDAMS 171 (511) T ss_pred cccCceeecC------chHHHHHHHHHHhhc---CHHHHHHHHHHHHHhcCee-EEEEEeCC-----CCceEEEEEccce Confidence 4444344321 234556677766542 3665655544 57779974 56888532 4566665543331 Q ss_pred chhcccccceeecCC--Ccccccccccc------------ccccccccccccccc-----------cccCCCcccccccc Q lcl|NC_017974. 160 QASLSRTNPIVYNED--GSAIIGIKQSL------------SAFQNYTASEIGVGG-----------VSTRMSDVIIPINR 214 (507) Q Consensus 160 ~~~i~~~~~~~~~~d--~~~l~~~~q~~------------~~~~~~~~~~~~~~~-----------~~~~~~~~~iP~~k 214 (507) ++. .|+.. +..+..++... .....++.....-.. .........+..-. T Consensus 172 --~~~-----vydd~~~~~~~~~vr~~~~~~~d~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~vP 244 (511) T protein:vir:10 172 --TFV-----IYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMP 244 (511) T ss_pred --eEE-----EEcCCCCCceEEEEEEEEeeecccCccceEEEEEEEeCCcEEEEEecCCCcccccccccccccccCccee Confidence 111 11111 11111111100 000111111100000 00000011111111 Q ss_pred EEEEeeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHH-HHHHH Q lcl|NC_017974. 215 VMLMNTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRG-LMSDA 293 (507) Q Consensus 215 ~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a-~~~~~ 293 (507) ++.|+ .+..|.|.+..+....=--+..+..++..++.+..|+++++-- . ..+.++....... +.... T Consensus 245 vv~f~-----nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~---~----~~~~~~~~~~~~~~~~~~~ 312 (511) T protein:vir:10 245 ITEFS-----NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGN---L----NLDPVEVRKQKEANVLFLE 312 (511) T ss_pred EEEec-----CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecc---c----cCCchhhccchhccceecc Confidence 23333 3456888888775544334556777888889999998887621 0 0111111000000 00000 Q ss_pred HHhhcCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccch--hhhhHHHHHHH Q lcl|NC_017974. 294 ANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKG--GSYNLSESKQT 371 (507) Q Consensus 294 ~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~--GS~Al~~vh~~ 371 (507) .+ ....+.....+. ..+++++..+. ....+..+++.+.+.|...--...++.+...+ +.-|+. .... T Consensus 313 ~~---~~~~~~~~~~~~-----~~d~~~l~~~~--~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~-~~~~ 381 (511) T protein:vir:10 313 PT---VYADSEGRETEG-----SVDGGYIYKQY--DVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMK-YKLF 381 (511) T ss_pred cc---cccccccccCCC-----CcceeEEeecC--CHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHH-HHHH Confidence 00 000000000100 11233333221 22346778888888887765444444432111 111211 1111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHh---CCC-cCCcc-ccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHHH Q lcl|NC_017974. 372 IHTQFVQRVNEIILEALNENLLPQLLAL---NDM-RLPET-EMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVIN 446 (507) Q Consensus 372 v~~~~~~aD~~~i~~~ln~~li~~l~~l---N~~-~~~~~-~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~ 446 (507) -....+..-.+.+...|.+ +++.++.+ ... ..+.. .-..+.|....+.|..+.++++.+|+ |+ ++ .+ T Consensus 382 ~l~~k~~~k~~~f~~~l~~-~~~li~~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~--G~-iS----~e 453 (511) T protein:vir:10 382 GLEQRTKTKEGLFTKGLRR-RAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSG--GK-IS----QT 453 (511) T ss_pred HHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhhCCcccccccceeeEEeCCCCCcCHHHHHHHHHHHh--cc-Cc----HH Confidence 1222233333444555543 44444443 211 11111 12467899989999999999999985 65 33 24 Q ss_pred HHHHHcCC-CCCCCc-ccccchhhhcCCC---cCCCCCCcccccCcccccccccCCCCchhhhccC Q lcl|NC_017974. 447 RVLEVLGI-DEKIEE-DISQEELMKLLGE---DTSRAGDGMTKGSSGNGTGKISSTRDNSAANLDN 507 (507) Q Consensus 447 ~i~e~~Gi-p~~~~~-~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~d~~a~~~~~ 507 (507) .+.+.++. +.+..+ +...++....... ......++.... ..++.+.+-.. T Consensus 454 t~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~~ 508 (511) T protein:vir:10 454 TLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDD-----------EQDDDTKDTVD 508 (511) T ss_pred HHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhhcccCCCCCCCC-----------CCCCcccCccc Confidence 45566654 322111 0001111100000 000000000000 00111111111 No 231 >protein:vir:1538 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052106;swissprot:trembl:q9t110;genbank:gi:9634032;uniprot:Q9T110;genbank:GeneID:1262384 Probab=52.29 E-value=0.56 Score=21.97 Aligned_cols=450 Identities=11% Similarity=0.065 Sum_probs=167.2 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhh-hhc----ccc-hhccchhHHHHHHHhhChHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDS-ERM----KAE-EIRWPNLIGTVESMKQDATVATGLD 74 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~-~~e----~~~-~lr~~~~~~~y~~m~~D~~v~s~l~ 74 (507) |++-+..+-+...++..-.....-|. . -.+.|+-.. ... ..+ .....+.-.+| |++-.-+++ T Consensus 1 m~~~~~~~~~~~~~k~r~~~l~~~R~------~-~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~-----dst~~~a~~ 68 (535) T protein:vir:15 1 MADSKRTGLGEDGAKATYDRLTNDRR------A-YETRAENCAQYTIPSLFPKESDNESTDYTTPW-----QAVGARGLN 68 (535) T ss_pred CCccchhccchHHHHHHHHHHHHHhh------H-HHHHHHHHHHHhcccccCCCCCcccccccccc-----cccHHHHHH Confidence 55544433333322211111100000 0 011111000 000 000 00111111122 444444444 Q ss_pred HHHHHHhhC-----CceeeeCCCC---------ChhHHHHHHHHHHH----HHhccCCCHHHHHHHHHH-HHhhcceeee Q lcl|NC_017974. 75 MLYTFVEKA-----FKDFKVIPGE---------SEESKKAAKFVEYC----LKNMEGQTLRQFARDAAT-FNEYGLSVVE 135 (507) Q Consensus 75 ~Rk~av~~~-----~~~~~~~~~~---------~~~~~~~ae~v~~~----l~~~~~~~~~~~l~~~l~-a~~~Gfs~~E 135 (507) +--..+.+. +| |+...++ ..+..++.++++.+ ...+....|+..+.+++. -+.+|-++.= T Consensus 69 ~Laa~l~~~ltP~~~W-F~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~ 147 (535) T protein:vir:15 69 NLASKLMLALFPMQSW-MKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFECLKQLIVAGNALLY 147 (535) T ss_pred HHHHHHHHhhcCCCcc-cccccChHHHhccCCCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeEE Confidence 444444432 23 2222211 11223455554433 223344567776666554 4457765432 Q ss_pred eee------------------eccCccccCcceeeeecc-----ccCchhcccccceeecCCC-ccccc-ccccccc--c Q lcl|NC_017974. 136 KVY------------------TQVTVGEYIGKYKVKNLA-----FRPQASLSRTNPIVYNEDG-SAIIG-IKQSLSA--F 188 (507) Q Consensus 136 i~w------------------~~~~~~~~~g~~~~~~l~-----~r~~~~i~~~~~~~~~~d~-~~l~~-~~q~~~~--~ 188 (507) +.+ ..++.|.-+..++--++. ..-+..+.+- ......+. ..+.. +...... + T Consensus 148 ~~~~~~~~~~f~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~-~~~~~~~~~v~v~~~v~~~~~~~~~ 226 (535) T protein:vir:15 148 LPEPEGSYNPMKLYRLSSYVVQRDAYGNVLQIVTRDQIAFGALPEDVRSAVEKA-GGEKKMDEMVDVYTHVYLDEESGDY 226 (535) T ss_pred eecCCCCceeeEEEEcCeeEEeeCCCCCeeEEEEeEeecHHHHHHHHhHhhhcc-ccccCCCCceeEEEEEEEecCCCcE Confidence 222 222222111111100110 0000000000 00000000 00000 0000000 0 Q ss_pred cccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEeccc Q lcl|NC_017974. 189 QNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPS 268 (507) Q Consensus 189 ~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~ 268 (507) ..+...+... .....++...-...|++.|+...+|+.||.|....++.-..--+...+.-+..+++---|+..+ -|- T Consensus 227 ~~~~e~~g~~--~~~~~~~~~~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~lv-~~~ 303 (535) T protein:vir:15 227 LKYEEVEDVE--IDGSDATYPTDAMPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLV-NPA 303 (535) T ss_pred EEEEEeeCcc--ccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceee-ccc Confidence 0010000000 0000011111223579999999999999999999999988777778788888888765554221 111 Q ss_pred ccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHH Q lcl|NC_017974. 269 QILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRL 348 (507) Q Consensus 269 ~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~i 348 (507) ..++. .+.. .+| -+.++|...+ .+..+..+.++....-...|+.+...|+.++ T Consensus 304 g~~~~----------------~~l~---~~~--~g~~v~g~~~------~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af 356 (535) T protein:vir:15 304 GITQP----------------RRLT---KAQ--TGDFVPGRRE------DIDFLQLEKQADFTVAKAVSDQIEARLSYAF 356 (535) T ss_pred ccccc----------------hhcc---cCC--ceeeecCCcc------cceeeecccccchhHHHHHHHHHHHHHHHHH Confidence 11110 0110 012 1334443221 1223333333444456778999999999998 Q ss_pred hcccccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh--CCC----cCC--ccccceEEecC---- Q lcl|NC_017974. 349 GAGFINVGNDKGGSYNLSESKQTIHTQFVQRVNEIILEALNENLLPQLLAL--NDM----RLP--ETEMPYVKAGE---- 416 (507) Q Consensus 349 LGqtLts~~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~l--N~~----~~~--~~~~P~~~~~~---- 416 (507) +...++..++ +.--..||+....+.. .. .-=+-..|+..++-||+.. +.. -.| +...-+++|.. T Consensus 357 ~~~~~~~~~~--~r~TAtEV~~r~~E~~-~~-LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~~~v~~~yis~La~ 432 (535) T protein:vir:15 357 MLNSAVQRTG--ERVTAEEIRYVASELE-DT-LGGVYSILSQELQLPLVRVLLKQLQATSQIPELPKEAVEPTISTGLEA 432 (535) T ss_pred hhhhcccCCC--ccccHHHHHHHHHHHH-HH-HhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCccceeEEEecHHHH Confidence 8776543221 2122344444322211 11 1122333444555555531 110 111 11111233311 Q ss_pred -cCchhHHHHHHHHHHHHhcCCc-----cccHHHHHHHHHHcCCCCC---CCcccccchhhhc-----CCCcCCCCCCcc Q lcl|NC_017974. 417 -IVDVDMEGFSKAIQRIGAVGYL-----PKTPKVINRVLEVLGIDEK---IEEDISQEELMKL-----LGEDTSRAGDGM 482 (507) Q Consensus 417 -~~~~dl~~~a~~~~~L~~~G~~-----~~~~~~~~~i~e~~Gip~~---~~~~~~~~~~~~~-----~~~~~~~~~~~~ 482 (507) ....++..+...++.+..++-. +..+...+++.+.+|+|.. ..+++.+...+.. +....+..+.+. T Consensus 433 aqr~~~~~~l~~~~~~la~~~P~~ld~~id~d~~~~~~a~~~Gvp~~~i~~~~eev~~~~~q~~~~~~~~~~a~~~g~~~ 512 (535) T protein:vir:15 433 IGRGQDLDKLERCISAWAALAPMQGDPDINLAVIKLRIANAIGIDTSGILLTDEQKQALMMQDAAQTGIENAAATGGAGV 512 (535) T ss_pred HHHHHHHHHHHHHHHHHHhcChhhhhccCCHHHHHHHHHHHcCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 1234555666666666655422 2223556788999999842 1222211111000 000000111111 Q ss_pred cccCcccccccccCCCCchhhhccC Q lcl|NC_017974. 483 TKGSSGNGTGKISSTRDNSAANLDN 507 (507) Q Consensus 483 ~~~~~~~~~~~~~~~~d~~a~~~~~ 507 (507) ++.. .+.++.. .-++| T Consensus 513 ~~~~--------~~~p~~~-~~~~~ 528 (535) T protein:vir:15 513 GALA--------TSSPEAM-QGAAA 528 (535) T ss_pred cchh--------ccChHHH-HHHHh Confidence 1000 0111111 12222 No 232 >protein:vir:100598 Length: 516 # NCBI annotation: gp20 head portal vertex protein # Family: family:all:1036 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656382;genbank:gi:109290133;genbank:GeneID:4156576 Probab=49.63 E-value=0.63 Score=21.67 Aligned_cols=446 Identities=13% Similarity=0.096 Sum_probs=160.5 Q ss_pred Cc------ccccccccccCccccccccccccccccccCchhhhHHHHh-hh-hcccchhc-cchhHHHHHHHhhChHHHH Q lcl|NC_017974. 1 MS------RKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVD-SE-RMKAEEIR-WPNLIGTVESMKQDATVAT 71 (507) Q Consensus 1 ~~------~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~-~~-~e~~~~lr-~~~~~~~y~~m~~D~~v~s 71 (507) ++ .++-+.++...+.|.....++. . +.+... ..+.++ .. ......++ -.++++.|++|...+.|-+ T Consensus 11 ~~~d~~~~~~~~~~~~~s~~~p~~~DGa~~-i---~~~~~~-~~~~g~~~~~~d~~~~~~~~~~LI~~YR~ma~~pEvd~ 85 (516) T protein:vir:10 11 DRVDQNEYDERLKQGHESIATPKKDDGATE-I---EAREGE-SSYNALMQQFFGIDNNISGTKDLINTYRQLTNNPEVER 85 (516) T ss_pred cchhhHHHHhhhcCCCCcccCCCCccCcee-e---ecCccc-ccccceeeeeecccCccccHHHHHHHHHHhhhccchhH Confidence 11 1111222333333333222111 0 011000 011111 00 01111122 2478999999999999999 Q ss_pred HHHHHHHHHhhCCc---eeeeCCCCChhHHH----HHHHHHHHHHhccCCCHHHHHHHHHH-HHhhcceeeeeeeeccCc Q lcl|NC_017974. 72 GLDMLYTFVEKAFK---DFKVIPGESEESKK----AAKFVEYCLKNMEGQTLRQFARDAAT-FNEYGLSVVEKVYTQVTV 143 (507) Q Consensus 72 ~l~~Rk~av~~~~~---~~~~~~~~~~~~~~----~ae~v~~~l~~~~~~~~~~~l~~~l~-a~~~Gfs~~Ei~w~~~~~ 143 (507) +++.....+...+- .+.++-.+.+-++. +.+-++..|+-|+ |.....++.. -.-.|--.+.++-.. T Consensus 86 Av~eIvneaiv~d~~~~pV~l~l~~~e~s~sik~kI~eeF~~Il~ll~---F~~~~~~~fR~WYVDgRi~fhKiid~--- 159 (516) T protein:vir:10 86 AVANIVNEAVVYEKGHKVVSLDLDDTEFSSSIKDKILEEFDEICRLLD---ASRKLDTLFRRWYIDSRIFFHKIMPN--- 159 (516) T ss_pred HHHHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhc---cchhhhHHHHhhhhcceEEEEEEecC--- Confidence 99988776532210 11122122222333 3333333333332 1111111111 111222233322221 Q ss_pred cccCcceeeeeccccCchhcccccceeec-CCCccccccccccccccccccccccccc---cccCCCccccccccEEEEe Q lcl|NC_017974. 144 GEYIGKYKVKNLAFRPQASLSRTNPIVYN-EDGSAIIGIKQSLSAFQNYTASEIGVGG---VSTRMSDVIIPINRVMLMN 219 (507) Q Consensus 144 ~~~~g~~~~~~l~~r~~~~i~~~~~~~~~-~d~~~l~~~~q~~~~~~~~~~~~~~~~~---~~~~~~~~~iP~~k~i~~~ 219 (507) ..+|-..++.| .|.+|.+++-.... .+|...+ ... .-++.+..+...... .-+....+.||.+ -|+|+ T Consensus 160 -~k~GI~elr~l---DPr~i~~vR~i~~~~~~~~~v~--~~~-~e~~~Y~~~~~~~~~~g~~~~~~~~ikI~~d-aI~y~ 231 (516) T protein:vir:10 160 -PKEGIVELRRL---DPRHVEYYREIVTSDVGGTSVV--KGY-REFFVYTTGNEGYAYNGRLFEPNTRIKIPRS-AIVYA 231 (516) T ss_pred -cccceeeeeee---CCcceeeEEeeecccCcchhhh--hce-eeeeeeecCccceeccccccCCCCceecchh-heeee Confidence 11333333333 33345444433222 2222111 111 112222222111110 0122234566554 57778 Q ss_pred eCCc--cCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhh Q lcl|NC_017974. 220 TGGS--SSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAH 297 (507) Q Consensus 220 ~~~~--~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~ 297 (507) |..- .+...=.|.|.++..++==-+.....-+.+-=--.-.=.++++. +|..+ .....+.|+.++.-..|=. T Consensus 232 hSGl~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYID--VGnLP----k~KAeqYl~~iM~k~KNkl 305 (516) T protein:vir:10 232 HSGLQDCSDRGIVGYLHNAVKPANQLKLLEDALVIYRITRAPERRVFYID--VGNMP----NRKATEYVNGIMQSLKNRV 305 (516) T ss_pred ecCcccCCCCceeceehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEe--cCCCC----chhHHHHHHHHHHhcCcee Confidence 7532 11122257888888776433333332222210001111244442 22222 2233345555444332200 Q ss_pred -cCCcee--------------EeecC-----cccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccc Q lcl|NC_017974. 298 -SGEQSF--------------FMLPS-----DTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGN 357 (507) Q Consensus 298 -~G~~a~--------------~iiP~-----g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~ 357 (507) -.+..| .-+|. |+||.+. .|+.+.. -..=|+|..+.+-+++--..--.+. T Consensus 306 vYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTL---------pGgqnlg-em~DV~YF~kkLy~aLnVP~SRl~~ 375 (516) T protein:vir:10 306 VYDSNTGTVKNQKRNLSMTEDYWLMRRDGKSVTEVTSL---------PGAQTMG-EMDDVRWFNKKLYEALRIPLSRMPR 375 (516) T ss_pred EEeCCCCeeccchhhhhhHhhhcccccCCCcccceeec---------cccCCcC-hHHHHHHHHHHHHHHhCCCcccccC Confidence 000111 11222 5655442 2332232 2345899999999988776633332 Q ss_pred cchhh----hhHHHHHH-HHH-HHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCC--ccccceEEecCc------CchhHH Q lcl|NC_017974. 358 DKGGS----YNLSESKQ-TIH-TQFVQRVNEIILEALNENLLPQLLALNDMRLP--ETEMPYVKAGEI------VDVDME 423 (507) Q Consensus 358 ~~~GS----~Al~~vh~-~v~-~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~--~~~~P~~~~~~~------~~~dl~ 423 (507) .++++ ++ +++.. |+. ...+......++..|..-|-..|+-=+...-. +.-.+.+.|+.. |-.+.+ T Consensus 376 e~~~~~~~Gr~-~EItRDEiKF~KFI~rLR~rFs~lF~~~L~~qLilKgIit~eeW~~i~~~I~~~f~~Dn~f~ElKe~E 454 (516) T protein:vir:10 376 DDGGMVIGGQD-MAITRDELDFRKFIVQLQHNFEEIFLDPLKTNLIYKKIILESEWEEQINNIKVNFHQDSYYTELKDIE 454 (516) T ss_pred CCCceeecccc-chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCHHHHHHHhhcceEEeeecchHHHHHHHH Confidence 22222 33 34333 332 33444444434444433332333221211000 011123333332 224444 Q ss_pred HHHHHHHHHHhcCCccccHHHHHHHHHH-cCCCCCCCcccccchhhhcCCCc--CCCCCCcccccCcccccccc Q lcl|NC_017974. 424 GFSKAIQRIGAVGYLPKTPKVINRVLEV-LGIDEKIEEDISQEELMKLLGED--TSRAGDGMTKGSSGNGTGKI 494 (507) Q Consensus 424 ~~a~~~~~L~~~G~~~~~~~~~~~i~e~-~Gip~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~ 494 (507) .+.+++..|..+-=.+-.-...+|+++. |.++..+-.++. ..-.+...++ ..|..+ . .+ T Consensus 455 il~~Rl~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~~~-k~I~~E~~~~~~~~p~~e--~---------~f 516 (516) T protein:vir:10 455 TLRQRVDALSQIEPYVGKYVSHDYVMKNILQMTDEQIAQEE-KQIEKEANVKRFQNPENE--D---------DF 516 (516) T ss_pred HHHHHHHHHHHhhhhhccccchHHHHHHHhcCCHhHHHHHH-HHHHHhhhCCCCCCCCcc--c---------cC Confidence 4555555554431111112235677654 677533221111 0111111111 111110 0 00 No 233 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=49.54 E-value=0.64 Score=21.66 Aligned_cols=411 Identities=11% Similarity=0.027 Sum_probs=154.9 Q ss_pred Cccc-ccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHH--------------H-- Q lcl|NC_017974. 1 MSRK-RNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVES--------------M-- 63 (507) Q Consensus 1 ~~~k-~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~--------------m-- 63 (507) |.+. .+..|..-.-...++... .++. ...-......+. +..+ |..++.+.|+- . T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~---~~~~--~~~i~~~i~~~~--~~~~--~~~~~~~yY~g~~~i~~~~~~~~~~~~~ 71 (468) T protein:vir:96 1 MIDIFWPNEKPYHERVVEQIKPQ---YETQ--EEMILRLITKHK--ENVE--DITVGERYYNHQPDVLFNAPKRNVKGEI 71 (468) T ss_pred CccccCCcCceeehheeeccccc---ccCc--HHHHHHHHHHHH--HHHH--HHHHHHHHhcCCCccccccccccccccc Confidence 6555 333332211111000000 0000 000000000110 0000 00011111110 0 Q ss_pred --------hhChHHHHHHHHHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHH-HHHHhhcceee Q lcl|NC_017974. 64 --------KQDATVATGLDMLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDA-ATFNEYGLSVV 134 (507) Q Consensus 64 --------~~D~~v~s~l~~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~-l~a~~~Gfs~~ 134 (507) .-.+...-+......-+.+-+..+. ..++++.+.+.+++++ +|.+.+.++ .++.-||.+. T Consensus 72 ~~~~~~~ki~~n~~~~Iv~~~~~~l~g~p~~~~------~~d~~~~~~l~~~~~n----~~~~~~~~~~~~~~~~G~~~- 140 (468) T protein:vir:96 72 DPFKPDWRMYTNYHQNLVDQKVAYAVANPVTYG------TEDEKSLKTIQEVLNH----KWDDKLVDILTAASNKGVEW- 140 (468) T ss_pred cccccccccccchHHHHHHHHHhhhccCCceec------cCChHHHHHHHHHHhc----CHHHHHHHHHHHHhhcCeEE- Confidence 0122333333333333333322222 2234556667776642 455555554 5677899975 Q ss_pred eeeeeccCccccCcceeeeeccccCchhcccccceeecCCCcccc-ccccc----cccccccccccccccc--------- Q lcl|NC_017974. 135 EKVYTQVTVGEYIGKYKVKNLAFRPQASLSRTNPIVYNEDGSAII-GIKQS----LSAFQNYTASEIGVGG--------- 200 (507) Q Consensus 135 Ei~w~~~~~~~~~g~~~~~~l~~r~~~~i~~~~~~~~~~d~~~l~-~~~q~----~~~~~~~~~~~~~~~~--------- 200 (507) +++|... +|.+.+.-+.|+. .+ +++.+.+.+.+. .++.. -.....++.....-.. T Consensus 141 ~~v~~d~-----~~~~~i~~~~p~~--~~----~v~~~~~~~~~~~~ir~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 209 (468) T protein:vir:96 141 IQPYVDE-----QGEFKTFRVPAEQ--AI----PIWTNKERDELKAFIRLYELDGGERVEYWTANDVTFYELKDGQLIPD 209 (468) T ss_pred EEEEEcC-----CCceEEEEEcccc--eE----EEEcCCCCCceEEEEEEEEecCceEEEEEeCCeEEEEEEcCCceeec Confidence 4677532 4555555443321 11 111111111111 01000 0000000000000000 Q ss_pred -----------cccCCCccccccccEEEEeeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccc Q lcl|NC_017974. 201 -----------VSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQ 269 (507) Q Consensus 201 -----------~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~ 269 (507) .........+..-.++.|+ .++.|.|.+..+....=--+..+..++..++.+..|+++++- T Consensus 210 ~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~-----n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g--- 281 (468) T protein:vir:96 210 YYQGEEHVQAHYYVGNKSMSWNRVPFIPFK-----NNPQEVSDLFMYKTIIDAMDKRLSDTQNTFDEATELIYVLKG--- 281 (468) T ss_pred ccccccccccceeeccccccCCcccEEEec-----CCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeec--- Confidence 0000001111111223332 367788988875443333355678888888999998887651 Q ss_pred cCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHh Q lcl|NC_017974. 270 ILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLG 349 (507) Q Consensus 270 ~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iL 349 (507) . ..++..+ ....+ ..+ ..+.++.+.. ..++++..+.. ...+...++.+.+.|...-. T Consensus 282 ~----~~~~~~~---~~~~~-------~~~--~~i~~~~d~~-----~~~~~l~~~~~--~~~~~~~~~~l~~~I~~~s~ 338 (468) T protein:vir:96 282 Y----EGEDLEE---FMYNL-------KYY--KAINVDGDGS-----GGVDTIQIDVP--VQSAKEYLDMLRDYVIEFGQ 338 (468) T ss_pred C----Cccccch---hhhhh-------hcC--ceEEecCCCC-----CcceEEeecCC--hHHHHHHHHHHHHHHHHHhC Confidence 1 1111111 11111 111 2344443211 12334433322 23466788999999888754 Q ss_pred cccccccccchhhhhHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCccccceEEecCcCchhHHHHH Q lcl|NC_017974. 350 AGFINVGNDKGGSYNLSESKQ---TIHTQFVQRVNEIILEALNENLLPQLLALNDMRLPETEMPYVKAGEIVDVDMEGFS 426 (507) Q Consensus 350 GqtLts~~~~~GS~Al~~vh~---~v~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~~P~~~~~~~~~~dl~~~a 426 (507) +..++.+ +.|+.+.|..-. .-....+..-.+.+.+.|. ++++.++.+.+... +..-..+.|....+.|..+.+ T Consensus 339 ~p~~~~~--~~~~n~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~-~~~~li~~~~g~~~-d~~~i~i~f~~~~p~d~~e~a 414 (468) T protein:vir:96 339 GVDFQQD--KFGNSPSGIALKFMYSNLDLKANKLKNKTLTALQ-ELLQYIIDFYKLSI-KVQDVEITFNFNVMVNELEQS 414 (468) T ss_pred ccccccc--ccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhCCCc-ccceeeEEecCCCCcCHHHHH Confidence 4444332 222222222211 1122223334455566664 46777777653222 222356788888888877766 Q ss_pred HHHHHHHhcCCccccHHHHHHHHHHcCC-CCCCCcccc--cchhhhcCCCcCCCCCCcccccCccccc Q lcl|NC_017974. 427 KAIQRIGAVGYLPKTPKVINRVLEVLGI-DEKIEEDIS--QEELMKLLGEDTSRAGDGMTKGSSGNGT 491 (507) Q Consensus 427 ~~~~~L~~~G~~~~~~~~~~~i~e~~Gi-p~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (507) +.+ +.+|++ + ++.+.+.++. ..+. .+.. ..+... ..+. .++.......+++ T Consensus 415 ~~~---~~~g~i-S----~et~i~~l~~v~D~~-~E~~ri~~E~~~-~~~~----~~~~~~~~~~~~~ 468 (468) T protein:vir:96 415 QIG---VNSQYL-S----KETVVTNHPWVDDPV-AEMERIDQEELA-LPSI----EEGLNGKENNEPT 468 (468) T ss_pred HHH---HhcCCC-c----hHHHHHhCCCCCCHH-HHHHHHHHHHHH-HHHH----hhccCCCCCCCCC Confidence 654 456764 3 2334455533 2211 1111 111110 0010 0111111111111 No 234 >protein:vir:103458 Length: 524 # NCBI annotation: portal vertex of the head # Family: family:all:1036 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803110;genbank:gi:116326390;genbank:GeneID:4405487 Probab=48.83 E-value=0.66 Score=21.58 Aligned_cols=446 Identities=12% Similarity=0.072 Sum_probs=161.7 Q ss_pred Cc------------------ccccccccccCcccccccccccccccccc-CchhhhHHHHhh-hhcc--cchh-ccchhH Q lcl|NC_017974. 1 MS------------------RKRNRNRSVKVAKATSEQINVSRMRMSEQ-GTFALAKVQVDS-ERMK--AEEI-RWPNLI 57 (507) Q Consensus 1 ~~------------------~k~~~~~~~~~~~~~~~~~~~~~~~~~~~-g~~~~~~~~~~~-~~e~--~~~l-r~~~~~ 57 (507) |+ .++-+.+....+.|.....+.. . ++ +..++.++++.. ...- ...+ -..+++ T Consensus 1 m~~~~L~~~~~w~~~de~~~~~~~~~~~~S~~~p~~~Dga~e---~-~~~~~~~a~~~~g~~~~~~g~~e~~~~~~~eLI 76 (524) T protein:vir:10 1 MKFNVLSLFAPWAKMDERNFKDQEKEDLVSITAPKLDDGARE---F-EVSSNEAASPYNAAFQTIFGSYEPGMKTTRELI 76 (524) T ss_pred CCCchhhHhhccccCcchhhhhhhccCCccccCccCCCCcee---e-eecccccccccceeeeehhcccccccchHHHHH Confidence 11 1111111212222222111100 0 00 000111112111 1111 1111 235799 Q ss_pred HHHHHHhhChHHHHHHHHHHHHHhhCCc---eeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHHHHHhhcceee Q lcl|NC_017974. 58 GTVESMKQDATVATGLDMLYTFVEKAFK---DFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAATFNEYGLSVV 134 (507) Q Consensus 58 ~~y~~m~~D~~v~s~l~~Rk~av~~~~~---~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l~a~~~Gfs~~ 134 (507) +.|++|..++.|-++++.....+...+- .+.+.-.+.+-++.+.+.|.+- |.. |..+|+.--+||..+ T Consensus 77 ~~YR~ma~~pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~ee--------F~~-Il~ll~F~~~~~~~f 147 (524) T protein:vir:10 77 DTYRNLMNNYEVDNAVSEIVSDAIVYEDDTEVVALNLDKSKFSPKIKNMMLDE--------FND-VLNHLSFQRKGSDHF 147 (524) T ss_pred HHHHHHhhccchhhHHHHhhcceeEecCCCceEEEEecCcCcchHHHHHHHHH--------HHH-HHHHhccchhhhHHH Confidence 9999999999999999988776532211 0111111122223333333222 222 224444444444443 Q ss_pred -----------eeeeeccCccccCcceeeeeccccCchhcccccceeecCCCccccccccccccccccccccccc---cc Q lcl|NC_017974. 135 -----------EKVYTQVTVGEYIGKYKVKNLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGV---GG 200 (507) Q Consensus 135 -----------Ei~w~~~~~~~~~g~~~~~~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~---~~ 200 (507) .++-.... .-+|-. .|....|.++.+++-+....++.. ..+. .-..++.+.++..+. .. T Consensus 148 R~WYVDgRi~fhKiid~k~--pk~GI~---Elr~lDPr~i~~vr~i~~~~~~~~-~vi~-~~~e~f~Y~~~~~~y~~~g~ 220 (524) T protein:vir:10 148 RRWYVDSRIFFHKIIDPKR--PKEGIK---ELRRLDPRQVQYVREIITETEAGT-KIVK-GYKEYFIYDTAHESYACDGR 220 (524) T ss_pred hhheeeeEEEEEEEeeCCC--ccccce---eeeeeCCccceeeeeeccCCCccc-hhhc-chhhheeeccCccccccCcc Confidence 33322110 012333 333344445555544433322221 1111 111222222221111 01 Q ss_pred cccCCCccccccccEEEEeeCCccCc--cccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCc Q lcl|NC_017974. 201 VSTRMSDVIIPINRVMLMNTGGSSSQ--ALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQ 278 (507) Q Consensus 201 ~~~~~~~~~iP~~k~i~~~~~~~~~~--p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~ 278 (507) ......++.||.+ -|+|+|..-.+- -.=.|.|.++..++==-+.....-+.+-=--.-.=.++++. +|..+ T Consensus 221 ~~~~~~~ikI~~d-AI~y~hSGL~d~~~~~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYID--vGnlP---- 293 (524) T protein:vir:10 221 MYEAGTKIKIPKA-AIVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAVVIYRITRAPDRRVWYVD--TGNMP---- 293 (524) T ss_pred ccCCCcceecchh-heeeeeccceeCCCCceeccchhhhHHHHhhhHHHhhHHHHhhhccccceEEEEe--cCCCC---- Confidence 1123456777765 488888543111 12247888888776433333332222210001111244442 22222 Q ss_pred hHHHHHHHHHHHHHHHHhh-cCCcee--------------EeecC-----cccccccccceeeeeccCCCcchhHHHHHH Q lcl|NC_017974. 279 SSPEAEMVRGLMSDAANAH-SGEQSF--------------FMLPS-----DTKDNAPQYSMTLKGIDGMGKQYSTAQLIS 338 (507) Q Consensus 279 ~~~~~~~l~a~~~~~~n~~-~G~~a~--------------~iiP~-----g~~i~~~~~~~el~~~~g~g~~~~~~~li~ 338 (507) .....+.|+.++.-..|=. -.+..| .-+|. |++|.+. .|+.+... ..=|+ T Consensus 294 k~KAeqYl~~im~k~KNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTL---------pGgqnlge-m~DV~ 363 (524) T protein:vir:10 294 ARKAAEHMQHVMNTMKNRVVYDASTGKIKNQQHNMSMTEDYWLQRRDGKAVTEVDTL---------PGADNTGN-MEDVR 363 (524) T ss_pred chhHHHHHHHHHHhcCceeEEeCCCCeeccchhhhhhHhhhcccccCCCcccceeec---------cccCCcCh-HHHHH Confidence 2233345555444332200 000111 11222 5555442 23323322 34589 Q ss_pred HHHHHHHHHHhcccccccccch----hhhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCccc----c Q lcl|NC_017974. 339 DRKKSILDRLGAGFINVGNDKG----GSYNLSESKQTI-HTQFVQRVNEIILEALNENLLPQLLALNDMRLPETE----M 409 (507) Q Consensus 339 ~~d~~Isk~iLGqtLts~~~~~----GS~Al~~vh~~v-~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~----~ 409 (507) |..+.+-+++--..--.+.+.+ +.++..=+..|+ |...+......+...|..-|-..|+-=|.. ..+. . T Consensus 364 YF~kkLy~aLnVP~sRl~~d~~~~f~~gr~~EItRDEikF~KFI~rLR~rFs~~f~~~Lk~qLilKgii--t~eew~~i~ 441 (524) T protein:vir:10 364 WFRQALYMALRVPLSRIPQDQQGGVMFDSGTSITRDELTFAKFIRELQHKFEEVFLDPLKTNLLLKGII--TEDEWNDEI 441 (524) T ss_pred HHHHHHHHHhCCchhhcCCCCCccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCC--CHHHHHHHh Confidence 9999998888776533322211 123322222233 223344444444444443333333321211 1111 1 Q ss_pred ceEEecCc------CchhHHHHHHHHHHHHhcCCccccHHHHHHHHHH-cCCCCCCCcccccchhhhcCCCcCCCCCCcc Q lcl|NC_017974. 410 PYVKAGEI------VDVDMEGFSKAIQRIGAVGYLPKTPKVINRVLEV-LGIDEKIEEDISQEELMKLLGEDTSRAGDGM 482 (507) Q Consensus 410 P~~~~~~~------~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~-~Gip~~~~~~~~~~~~~~~~~~~~~~~~~~~ 482 (507) +.+.|+.. |-.+.+.+.+++..|..+--.+-.-...+|+++. |.+.+.+-.++. ..-.....++--+.++ T Consensus 442 ~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~~~-k~I~~E~k~~~~~~~~-- 518 (524) T protein:vir:10 442 NNIKIEFHRDSYFTELKEAEILERRINMLTMAEPFIGKYISHRTAMKDILQMTDEEIEQEA-KQIEEESKEARFQDPD-- 518 (524) T ss_pred hcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHHHHHHhccCHHHHHHHH-HHHHHHhhcCCCCCCc-- Confidence 23333332 2244444555665554431111111235677654 566422111111 0011111100000000 Q ss_pred cccCccccc Q lcl|NC_017974. 483 TKGSSGNGT 491 (507) Q Consensus 483 ~~~~~~~~~ 491 (507) ...++. T Consensus 519 ---~~~~~f 524 (524) T protein:vir:10 519 ---QEQEDF 524 (524) T ss_pred ---hhhhcC Confidence 000111 No 235 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=48.55 E-value=0.67 Score=21.55 Aligned_cols=424 Identities=9% Similarity=-0.012 Sum_probs=150.1 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDMLYTFV 80 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~av 80 (507) |++...++. +...... +.-.+.+. + +....... .+..+..| . -.....-++.....-+ T Consensus 49 i~~~~~~~~----~r~~~l~----~YY~g~~~-i-~~~~~~~~-~~~~~~~k---i--------~~n~~k~Ivd~~~~yl 106 (512) T protein:vir:97 49 IEHHMDYQR----PRLKVLS----DYYEGKTK-N-LVELTRRK-EEYMADNR---V--------AHDYASYISDFINGYF 106 (512) T ss_pred HHHHHHhhH----HHHHHHH----HHhcccCc-c-ccccCccc-ccccCcce---e--------ecchHHHHHHHHhhhh Confidence 222111110 0000000 00000000 0 00000000 00000000 0 0112222223333333 Q ss_pred hhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeeeeccccC Q lcl|NC_017974. 81 EKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVKNLAFRP 159 (507) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~~l~~r~ 159 (507) .+-+..|.. .++++.+++.++++.. .|.....++. ++.-||.+ ++++|... +|.+.+.-+.|+. T Consensus 107 ~g~p~~~~~------~d~~~~~~l~~~~~~n---~~~~~~~~~~~~~~i~G~a-y~~vy~de-----d~~~~i~~~~p~~ 171 (512) T protein:vir:97 107 LGNPIQCQD------DDKDVLEAIEAFNDLN---DVESHNRSLGLDLSIYGKA-YELMIRNQ-----DDETRLYKSDAMS 171 (512) T ss_pred cccCceecc------CChHHHHHHHHHHhhc---CHHHHHHHHHHHHHhcCeE-EEEEEeCC-----CCceEEEEEcccc Confidence 333333322 1234556677776542 3666655554 57779975 46777532 4566665544332 Q ss_pred chhcccccceeecCC--Cccccccccc------------cccccccccccccccc-----------cccCCCcccccccc Q lcl|NC_017974. 160 QASLSRTNPIVYNED--GSAIIGIKQS------------LSAFQNYTASEIGVGG-----------VSTRMSDVIIPINR 214 (507) Q Consensus 160 ~~~i~~~~~~~~~~d--~~~l~~~~q~------------~~~~~~~~~~~~~~~~-----------~~~~~~~~~iP~~k 214 (507) ++. + |+.+ +..+-.++-. ......++.....-.. .........+..-- T Consensus 172 --~~~----i-yd~~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vP 244 (512) T protein:vir:97 172 --TFV----I-YDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMP 244 (512) T ss_pred --eEE----E-EcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccccCcccc Confidence 111 1 1111 1111111100 0001111111110000 00000111111111 Q ss_pred EEEEeeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHH-HHHHH Q lcl|NC_017974. 215 VMLMNTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRG-LMSDA 293 (507) Q Consensus 215 ~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a-~~~~~ 293 (507) ++.|+ .|+.|.|.+..+-...=--+..+..++..++.+..|+++++- ... .+. .+......+ +.... T Consensus 245 vv~~~-----nn~~~~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G---~~~---~~~-~~~~~~~~~~~~~~~ 312 (512) T protein:vir:97 245 ITEFS-----NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKG---NLN---LDP-VEVRKQKEANVLFLE 312 (512) T ss_pred eEeec-----CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeec---Ccc---CCc-hhhhhhhhccccccc Confidence 23333 356788888876544433355677788888999999888751 110 111 111110000 00000 Q ss_pred HHhhcCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhhhhHHHH---HH Q lcl|NC_017974. 294 ANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGSYNLSES---KQ 370 (507) Q Consensus 294 ~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~v---h~ 370 (507) .+...+ .+..++.+. ..+++++..+- ....+..+++++.+.|.+.--...++.+..+ | ...|.. .. T Consensus 313 ~~~~~~--~~~~~~~~~-----~~d~~~l~~~~--~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~-g-n~Sg~Al~~~~ 381 (512) T protein:vir:97 313 PTVYEN--RDTGIETEG-----SVDGGYIYKQY--DVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-G-TQSGEAMKYKL 381 (512) T ss_pred ccchhh--cccccCCCC-----CcceEEEeecC--CHHHHHHHHHHHHHHHHHHhCCcccCccccc-c-cchHHHHHHHH Confidence 000000 000111111 11233333322 2223667889999998876555545443211 1 111211 11 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHh---CCC-cCCc-cccceEEecCcCchhHHHHHHHHHHHHhcCCccccHHHH Q lcl|NC_017974. 371 TIHTQFVQRVNEIILEALNENLLPQLLAL---NDM-RLPE-TEMPYVKAGEIVDVDMEGFSKAIQRIGAVGYLPKTPKVI 445 (507) Q Consensus 371 ~v~~~~~~aD~~~i~~~ln~~li~~l~~l---N~~-~~~~-~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~ 445 (507) .-....+..-.+.+...|++ +++.++.+ ... ..+. ..-..+.|...-+.+..+.++++.+|+ |+ ++. T Consensus 382 ~~l~~ka~~k~~~f~~~l~~-~~~li~~~~~~~~~~~~~~d~~~i~~~f~~~~p~~~~e~~~~~~kl~--gi-iS~---- 453 (512) T protein:vir:97 382 FGLEQRTKTKEGLFTKGLRR-RAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSG--GK-ISQ---- 453 (512) T ss_pred HHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhcCCcccccccccceEEeCCCCCcCHHHHHHHHHHHh--cc-Cch---- Confidence 12222233334445555643 44555443 111 1111 112468898888999999999999984 65 442 Q ss_pred HHHHHHcCC-CCCCCc-ccccchhhhcCCCcCCCCCCcccccCcccccccccCCCCchhhhccC Q lcl|NC_017974. 446 NRVLEVLGI-DEKIEE-DISQEELMKLLGEDTSRAGDGMTKGSSGNGTGKISSTRDNSAANLDN 507 (507) Q Consensus 446 ~~i~e~~Gi-p~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~a~~~~~ 507 (507) +.+.+.++. +.+..+ +....+....+.......+.... ..++. ..++.+.+-+. T Consensus 454 et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~~~~~~~---~~~~~-----~~~~~~~~~~~ 509 (512) T protein:vir:97 454 TTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPR---DINDD-----EQDDDTKDTVD 509 (512) T ss_pred HHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhcccCCCC---CCCCC-----CCCCCcccccc Confidence 445566654 322111 00011111000000000000000 00000 00000000000 No 236 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=47.30 E-value=0.71 Score=21.41 Aligned_cols=408 Identities=9% Similarity=0.012 Sum_probs=149.9 Q ss_pred CcccccccccccCcc------ccccccccccccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAK------ATSEQINVSRMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLD 74 (507) Q Consensus 1 ~~~k~~~~~~~~~~~------~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~ 74 (507) +++-+.+.+ .++.+ .....+..+.....+.. ...| . -.....-++. T Consensus 49 i~~~~~~~~-~r~~~l~~Yy~g~~~il~~~~~~~~~~~----------------~~~k---i--------~~n~~k~Iv~ 100 (511) T protein:vir:93 49 IEHHMDYQR-PRLKVLSDYYEGKTKNLVELTRRKEEYM----------------ADNR---V--------AHDYASYISD 100 (511) T ss_pred HHHHHHhhH-HHHHHHHHHhcccCccccccCcCccccc----------------Ccce---e--------ecchHHHHHH Confidence 111111000 00000 00000000000000000 0000 0 0122222233 Q ss_pred HHHHHHhhCCceeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHH-HHHhhcceeeeeeeeccCccccCcceeee Q lcl|NC_017974. 75 MLYTFVEKAFKDFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAA-TFNEYGLSVVEKVYTQVTVGEYIGKYKVK 153 (507) Q Consensus 75 ~Rk~av~~~~~~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l-~a~~~Gfs~~Ei~w~~~~~~~~~g~~~~~ 153 (507) ....-+.+-+..+. . .+.++.+++.++++.. .|.....++. ++.-||.+ ++++|... +|.+.++ T Consensus 101 ~~~~yl~g~p~~~~--~----~d~~~~~~l~~~~~~n---~~~~~~~~~~~~~~~~G~a-y~~vy~de-----~~~~~i~ 165 (511) T protein:vir:93 101 FINGYFLGNPIQYQ--D----DDKDVLEVIEAFNDLN---DVESHNRSLGLDLSIYGKA-YELMIRNQ-----DDETRLY 165 (511) T ss_pred HHhhhhcccCeeec--c----CChHHHHHHHHHHhhc---CHhHHHHHHHHHHHhcCee-EEEEEeCC-----CCceEEE Confidence 33333333333332 2 1234556677766542 3666655554 57779965 55777532 4556655 Q ss_pred eccccCchhcccccceeecCC--Ccccccccccc------------cccccccccccccc-----------ccccCCCcc Q lcl|NC_017974. 154 NLAFRPQASLSRTNPIVYNED--GSAIIGIKQSL------------SAFQNYTASEIGVG-----------GVSTRMSDV 208 (507) Q Consensus 154 ~l~~r~~~~i~~~~~~~~~~d--~~~l~~~~q~~------------~~~~~~~~~~~~~~-----------~~~~~~~~~ 208 (507) -+.|+. ++. .|+.+ +..+-.++... .....++.....-. ......... T Consensus 166 ~~~p~~--~~~-----vydd~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~ 238 (511) T protein:vir:93 166 KSDAMS--TFV-----IYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESH 238 (511) T ss_pred EEccce--eEE-----EEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccc Confidence 544331 111 11111 11111111000 00001111110000 000001111 Q ss_pred ccccccEEEEeeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHH Q lcl|NC_017974. 209 IIPINRVMLMNTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRG 288 (507) Q Consensus 209 ~iP~~k~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a 288 (507) .+..-.++.|+ .|+.|.|.+..+-...=-=+..+..++..++.|..|+++++-- .. .+.++.....+ T Consensus 239 ~~g~vPvv~~~-----nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~---~~----~~~~~~~~~~~- 305 (511) T protein:vir:93 239 SFERMPITEFS-----NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGN---LN----LDPVEVRKQKE- 305 (511) T ss_pred CCCccceEEec-----CCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhhCcceeeecC---cc----cCchhhccccc- Confidence 11111234443 3567888888765443333456788888899999998887521 10 11111000000 Q ss_pred HHHHHHHhhcCCceeEeecCcccc------cccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccchhh Q lcl|NC_017974. 289 LMSDAANAHSGEQSFFMLPSDTKD------NAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDKGGS 362 (507) Q Consensus 289 ~~~~~~n~~~G~~a~~iiP~g~~i------~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS 362 (507) ...+.++.+..+ .....+++++..+. ....+..+++++.+.|.+.--...++.+..+ | T Consensus 306 ------------~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~--~~~~~~~~~~~L~~~I~~~s~~P~~~~~~~~-~- 369 (511) T protein:vir:93 306 ------------ANVLFLEPTVYADSEGRETEGSVDGGYIYKQY--DVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-G- 369 (511) T ss_pred ------------ccceecccccccccccccCCCCcceeEEeecC--CHHHHHHHHHHHHHHHHHHhCCccccccccc-c- Confidence 000111110000 00011233433322 2234677889999998876655555443221 1 Q ss_pred hhHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---CCCc-CCc-cccceEEecCcCchhHHHHHHHHHHHHh Q lcl|NC_017974. 363 YNLSES---KQTIHTQFVQRVNEIILEALNENLLPQLLAL---NDMR-LPE-TEMPYVKAGEIVDVDMEGFSKAIQRIGA 434 (507) Q Consensus 363 ~Al~~v---h~~v~~~~~~aD~~~i~~~ln~~li~~l~~l---N~~~-~~~-~~~P~~~~~~~~~~dl~~~a~~~~~L~~ 434 (507) ...|.. ...-....+..-.+.+...|. ++++.++++ ++.. .+. ..-.++.|....+.|..+.++++.+|. T Consensus 370 n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~-~~~~li~~~l~~~~~~~~~~d~~~i~~~f~~~~p~n~~e~~~~~~kl~- 447 (511) T protein:vir:93 370 TQSGEAMKYKLFGLEQRTKTKEGLFTKGLR-RRAKLLETILKNTWSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSG- 447 (511) T ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhccCcccccccccceEEeCCCCCCCHHHHHHHHHHHh- Confidence 111211 111112222333344555554 344555543 2110 111 112478898888999999999999983 Q ss_pred cCCccccHHHHHHHHHHcCC-CCCCCcccc--cchhhhcCCC---cCCCCCCcccccCcccccccccCCCCchhhhccC Q lcl|NC_017974. 435 VGYLPKTPKVINRVLEVLGI-DEKIEEDIS--QEELMKLLGE---DTSRAGDGMTKGSSGNGTGKISSTRDNSAANLDN 507 (507) Q Consensus 435 ~G~~~~~~~~~~~i~e~~Gi-p~~~~~~~~--~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~d~~a~~~~~ 507 (507) |+ ++. +.+.+.++. +.+. .+.. ..+....... ......++.. +. ..++.+.+-+. T Consensus 448 -g~-iS~----et~~~~l~~v~d~~-~E~~ri~~E~~~~~~~~~~~~~~~~~~~~------~~-----~~~~~~~~~~~ 508 (511) T protein:vir:93 448 -GK-ISQ----TTLMSLFSFFQDPE-LEVKKIEEDEKESIKKAQKGIYKDPRDIN------DD-----EQDDDTKDTVD 508 (511) T ss_pred -cc-Cch----HHHHHhCCCCCCHH-HHHHHHHHHHHHHHHHHhhhcccCCCCCC------CC-----CCCCccccccc Confidence 65 442 445566644 2221 1110 0110000000 0000000000 00 00111111111 No 237 >protein:vir:78589 Length: 695 # NCBI annotation: NUDIX hydrolase # Family: family:all:297 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294854;genbank:gi:149882917;genbank:GeneID:5291060 Probab=46.36 E-value=0.74 Score=21.31 Aligned_cols=449 Identities=11% Similarity=0.024 Sum_probs=158.6 Q ss_pred Ccccccccccc-----cCccc---------ccccccc----ccccccccCchhhhHHHHhhhhcccchhccc-------- Q lcl|NC_017974. 1 MSRKRNRNRSV-----KVAKA---------TSEQINV----SRMRMSEQGTFALAKVQVDSERMKAEEIRWP-------- 54 (507) Q Consensus 1 ~~~k~~~~~~~-----~~~~~---------~~~~~~~----~~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~-------- 54 (507) |+|++-||+.- ++|.. .++..++ ++..++..|. ++.....--+|+-|.+|-. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (695) T protein:vir:78 1 MSRRNAKKRTQLAHTGRRPEVAKAAALAAAATIATATAAQPVPADMGRRGA--LNALDAAPVAEPSPSLRLARQFEVDVS 78 (695) T ss_pred CCccchhhhhhhhhcCCCcchhhhhhhhhhhhhhhhccccccchhhccccc--ccccccccccCCCcccccceeceeccc Confidence 98877666521 11110 0111000 1111211111 0000000011111111110 Q ss_pred ---------------------h-----------hHHHHHHHhhChHHHHHHHHHHHHHhhCCceeee------------- Q lcl|NC_017974. 55 ---------------------N-----------LIGTVESMKQDATVATGLDMLYTFVEKAFKDFKV------------- 89 (507) Q Consensus 55 ---------------------~-----------~~~~y~~m~~D~~v~s~l~~Rk~av~~~~~~~~~------------- 89 (507) . .|.+...|..-+.+.++....-....+- |--.. T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~Gy~~la~laQ~~eyr~~~~~ia~e~~R~-w~~~~~~~~e~~~~~g~~ 157 (695) T protein:vir:78 79 NYTPRERRAASYALDFNGTSMDALSFVTSSGFPGFPTLVLLAQLPEYRAMHEVLADECIRT-WGEAIGGTKEKADTSGLA 157 (695) T ss_pred cCCccccchhhhhhcccccccccchhhhccCcchHHHHHHHhhccchhhHHHHHHHHhhcc-cceeccccchhhhhhccc Confidence 0 1222233444555666666665555442 41111 Q ss_pred -CCCC-ChhHHHHHHHHHHHHHhccCCCHHHHHHHHHHHHhhcceeeeeeeeccCcc-ccCcceeeeeccccCchhccc- Q lcl|NC_017974. 90 -IPGE-SEESKKAAKFVEYCLKNMEGQTLRQFARDAATFNEYGLSVVEKVYTQVTVG-EYIGKYKVKNLAFRPQASLSR- 165 (507) Q Consensus 90 -~~~~-~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l~a~~~Gfs~~Ei~w~~~~~~-~~~g~~~~~~l~~r~~~~i~~- 165 (507) ..+. +..+.+..+.|++.++++.- |..+..-+-.+-+||=+++=+.= ++.. ..+.. |.+|+- +|.+ T Consensus 158 ~~~~~~~~~d~dqi~~L~~e~erL~V--~~~l~eaik~aRlfGGa~~~i~i--~gdd~~l~~P-----L~~~~~-~I~kG 227 (695) T protein:vir:78 158 AGGNAASTSDGDQLKQINDEIERLRI--RDAVRTTVIHDQAFGRAHPYFKI--KGDDQIMDTP-----LVPRPY-TVPKG 227 (695) T ss_pred ccccccccccHHHHHHHHHHHHHHHH--HHHHHHHHHhhccccceEEEEEe--ccCccccccc-----cccccc-cccCc Confidence 1111 11222455567777776542 34444444568999999732222 2110 01111 111110 1111 Q ss_pred -------ccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCC------ccCccccchH Q lcl|NC_017974. 166 -------TNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGG------SSSQALGVSP 232 (507) Q Consensus 166 -------~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~------~~~~p~G~gl 232 (507) ..+++..++.... .+|... .++....+.. .+..|=..+++.|+-.. -.-+.+|.++ T Consensus 228 slKGl~ViDp~~vtP~~~n~----~dP~sp---dfgkP~~y~V----~G~kIH~SRL~~f~g~plPd~LKp~y~~~GiSv 296 (695) T protein:vir:78 228 SFQGLRVVEPYWVTPNNYNS----INPVAD---DFYKPSTWWM----IGTEVHATRLHTIVSRPVGDMLKPTYSFAGISM 296 (695) T ss_pred ceeeeEeecccccccchhhh----ccchhh---ccCCCceEEE----eceEEeeeeEEEecCCCchhhhhcccccCcccH Confidence 0111222211000 011110 0111111110 01123233333333221 1235678888 Q ss_pred HHHHHHHHH---HHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCc Q lcl|NC_017974. 233 LVGCYRAWR---EKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSD 309 (507) Q Consensus 233 l~~~~~~~~---fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g 309 (507) ...++..+. --+.....- .+ ++-. .++++-+ .+ .-.+... .. +..-.+++..+++ .....+|..+ T Consensus 297 ~q~~~e~V~~~~rT~~~v~~L-i~--~~~v--~~lk~dl--a~-~L~~g~~--~~-l~~R~eli~~~Rs-n~G~~llDk~ 364 (695) T protein:vir:78 297 TQLAMPYIDNWLRTRQSVSDI-VK--QFSV--SGILMDL--AQ-ALMPGAN--VD-LSMRAELINRYRD-NRNILFLDKA 364 (695) T ss_pred HHHHHHHHHHHHHHHhHHHHH-HH--hhhh--HHHHHHH--HH-hhcChhH--HH-HHHHHHHHHHhcC-ccceEEEecC Confidence 888774432 222222211 11 1100 0111000 00 0001111 11 2222333334443 3344566532 Q ss_pred ccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhccccc--ccccchhhhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 310 TKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFIN--VGNDKGGSYNLSESKQTIHTQFVQRVNEIILEA 387 (507) Q Consensus 310 ~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLt--s~~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ 387 (507) . -++.++ ..+=+| ...++...-.+||-+. +-.+| .|.+-.|=+|.|+.-..+.-+.+++........ T Consensus 365 ~-Eefeq~-----stslSG----LddVi~qf~q~VAgaa-~IPltkLfGqSPkGlNATGE~D~rnYYD~I~s~Qe~~L~p 433 (695) T protein:vir:78 365 T-EEFFQF-----NTPLSG----LDALQAQAQEQMSAVS-HIPLIKLLGITPTGLNASSEGEIRVWYDYVRAYQRNALQQ 433 (695) T ss_pred C-cceEEE-----ecccCC----HHHHHHHHHHHHHhhh-cCchhhhhccCCccccccchhhHHHHHHHHHHHHHHHHHH Confidence 2 233322 222222 3456666666666543 22222 123345666777777777777777765443333 Q ss_pred HHHHHHHHHHHhCCCcCCccccce--EEecCcCchhHH-------HHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCC Q lcl|NC_017974. 388 LNENLLPQLLALNDMRLPETEMPY--VKAGEIVDVDME-------GFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKI 458 (507) Q Consensus 388 ln~~li~~l~~lN~~~~~~~~~P~--~~~~~~~~~dl~-------~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~ 458 (507) +=+.|+.-| .+.- ++ ...|. |+|...-+-+-+ +.|++++.+++.|++.++ .++.++.-.+.. T Consensus 434 ~L~rl~~ii-~rS~-~G--~idpdi~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~-----evr~rL~~d~~s 504 (695) T protein:vir:78 434 LMNDVIVMI-QLSL-FG--AVDPSIKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPD-----QVAARLNTEPDG 504 (695) T ss_pred HHHHHHHHH-HHHh-cC--CCCCcceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHH-----HHHHHHhcCCCc Confidence 323333333 3321 11 12233 455443333223 345778889999998874 466665443211 Q ss_pred C--------cccccchhh--hcCCCcCCCCCCcccccCcccccccccCCCCchhhhccC Q lcl|NC_017974. 459 E--------EDISQEELM--KLLGEDTSRAGDGMTKGSSGNGTGKISSTRDNSAANLDN 507 (507) Q Consensus 459 ~--------~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~a~~~~~ 507 (507) . +++..+... ........+.+++...+++++. + .-+.-+++.+|+.- T Consensus 505 ~Y~~~~D~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~-~g~~~~~~~~~~~~ 561 (695) T protein:vir:78 505 PYAGKLDANDDPGVPADDDIDGVLTYVQRLAEGGDTGAPGGA-R-AGATAPPTVANVNA 561 (695) T ss_pred ccccccccccCCCcCccchhhhhHhhhcCcccccccCCCCCC-C-CCCCCCCceeeeec Confidence 1 110000000 0000011111122111222110 0 01122333333322 No 238 >protein:vir:3361 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523332;genbank:gi:17570823;genbank:GeneID:927409 Probab=43.85 E-value=0.83 Score=21.03 Aligned_cols=451 Identities=11% Similarity=0.061 Sum_probs=167.0 Q ss_pred CcccccccccccCccccccccccccccccccCchhhhHHHHhh-hhc----ccc-hhccchhHHHHHHHhhChHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRMRMSEQGTFALAKVQVDS-ERM----KAE-EIRWPNLIGTVESMKQDATVATGLD 74 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~-~~e----~~~-~lr~~~~~~~y~~m~~D~~v~s~l~ 74 (507) |++-+....+...++..-.....-|. . -.+.|+-.. ... ..+ .....+...+| |++-.-++. T Consensus 1 m~~~~~~~~~~~~~~~r~~~l~~~R~------~-~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~-----dst~~~a~~ 68 (535) T protein:vir:33 1 MADSKRTGLGEDGAKATYDRLTNDRR------A-YETRAENCAQYTIPSLFPKESDNESTDYTTPW-----QAVGARGLN 68 (535) T ss_pred CChhhhhccChhHHHHHHHHHHHHhh------H-HHHHHHHHHHHhcccccCCCCCcccccccccc-----cccHHHHHH Confidence 54444333332222211111000000 0 011111000 000 000 00111111112 444444455 Q ss_pred HHHHHHhhC-----CceeeeCCCC---------ChhHHHHHHHHHHH----HHhccCCCHHHHHHHHHH-HHhhcceeee Q lcl|NC_017974. 75 MLYTFVEKA-----FKDFKVIPGE---------SEESKKAAKFVEYC----LKNMEGQTLRQFARDAAT-FNEYGLSVVE 135 (507) Q Consensus 75 ~Rk~av~~~-----~~~~~~~~~~---------~~~~~~~ae~v~~~----l~~~~~~~~~~~l~~~l~-a~~~Gfs~~E 135 (507) +--..+.+. +| |+...++ ..+..++.++++.+ ...+....|+..+.+++. -+.+|-++.= T Consensus 69 ~Laa~l~~~ltP~~~W-F~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~ 147 (535) T protein:vir:33 69 NLASKLMLALFPMQSW-MKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFECLKQLIVAGNALLY 147 (535) T ss_pred HHHHHHHHhhcCCCcc-cccccChHHHhccccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeEE Confidence 444444432 23 2222211 11223444544332 233444567776666543 3456765433 Q ss_pred e------------------eeeccCccccCcceeeeecc-----ccCchhcccccceeecCCCcccccc-cccccc--cc Q lcl|NC_017974. 136 K------------------VYTQVTVGEYIGKYKVKNLA-----FRPQASLSRTNPIVYNEDGSAIIGI-KQSLSA--FQ 189 (507) Q Consensus 136 i------------------~w~~~~~~~~~g~~~~~~l~-----~r~~~~i~~~~~~~~~~d~~~l~~~-~q~~~~--~~ 189 (507) + ++..+..|.-+..++--++. .+-+..+.+-.......+...+..+ ...... +. T Consensus 148 ~~~~~~~~~~f~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~k~~~~~~~v~~~v~~~~~~~~~~ 227 (535) T protein:vir:33 148 LPEPEGSYNPMKLYRLSSYVVQRDAYGNVLQIVTRDQIAFGALPEDVRSAVEKSGGEKKMDEMVDVYTHVYLDEESGDYL 227 (535) T ss_pred eecCCCCceeeEEEEcCeeEEeeCCCCCeeEEEeeEeecHHHHHHHhhhhhcccccccccccCCeEEEEEEeeCCCCcEE Confidence 2 22222222111111101111 0000000000000000011111111 110000 11 Q ss_pred ccccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEeccc- Q lcl|NC_017974. 190 NYTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPS- 268 (507) Q Consensus 190 ~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~- 268 (507) .+...+.. ......++...-...|++.|+...+|+.||.|....++.-..--+...+.-+..+++---|+.. +|+ T Consensus 228 ~~~~~~~~--~~~~~~~~~~~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~l--v~~~ 303 (535) T protein:vir:33 228 KYEEVEDV--EIDGSDATYPTDAMPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGL--VNPA 303 (535) T ss_pred EEEEEeCc--cccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCcee--eccc Confidence 11000000 0000001111122357999999999999999999999998877777878888888876555422 221 Q ss_pred ccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHH Q lcl|NC_017974. 269 QILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRL 348 (507) Q Consensus 269 ~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~i 348 (507) ...+ ..+.. .+| -+.++|...+ .+..+..+.++....-...|+.+...|+.++ T Consensus 304 g~~~----------------~~~~~---~~~--~g~~v~g~~~------~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af 356 (535) T protein:vir:33 304 GITQ----------------PRRLT---KAQ--TGDFVPGRRE------DIDFLQLEKQADFTVAKAVSDQIEARLSYAF 356 (535) T ss_pred cccc----------------hhhcc---cCC--ceeeecCCcc------cceeeecccccchhHHHHHHHHHHHHHHHHH Confidence 1111 01110 012 1334443221 1223333334444456779999999999998 Q ss_pred hcccccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh--CCC----cCC--ccccceEEecC---- Q lcl|NC_017974. 349 GAGFINVGNDKGGSYNLSESKQTIHTQFVQRVNEIILEALNENLLPQLLAL--NDM----RLP--ETEMPYVKAGE---- 416 (507) Q Consensus 349 LGqtLts~~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~l--N~~----~~~--~~~~P~~~~~~---- 416 (507) +...++..++ +.--..||+....+.. .. .-=+-..|+..++-||+.. +.. -.| +...-+++|.. T Consensus 357 ~~~~~~~~~~--~r~TAtEV~~r~~E~~-~~-LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~~~v~~~yis~La~ 432 (535) T protein:vir:33 357 MLNSAVQRTG--ERVTAEEIRYVASELE-DT-LGGVYSILSQELQLPLVRVLLKQLQATSQIPELPKEAVEPTISTGLEA 432 (535) T ss_pred hhhhcccCCC--ccccHHHHHHHHHHHH-HH-HhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCccceeEEEecHHHH Confidence 8776543221 2122244444322211 11 1122333444455555531 110 111 11111233311 Q ss_pred -cCchhHHHHHHHHHHHHhcCCc-----cccHHHHHHHHHHcCCCCC---CCcccccchhhh-----cCCCcCCCCCCcc Q lcl|NC_017974. 417 -IVDVDMEGFSKAIQRIGAVGYL-----PKTPKVINRVLEVLGIDEK---IEEDISQEELMK-----LLGEDTSRAGDGM 482 (507) Q Consensus 417 -~~~~dl~~~a~~~~~L~~~G~~-----~~~~~~~~~i~e~~Gip~~---~~~~~~~~~~~~-----~~~~~~~~~~~~~ 482 (507) ....++..+...++.+..++-. +..+...+++.+.+|+|.. ..+++.+..... .+.+..+..+.+. T Consensus 433 aqr~~~~~~l~~~~~~la~~~P~~~d~~id~d~~~~~~a~~~Gvp~~~i~~~~ee~~~~~~q~~~~~~~~~~~~~~g~~~ 512 (535) T protein:vir:33 433 IGRGQDLDKLERCISAWAALAPMQGDPDINLAVIKLRIANAIGIDTSGILLTDEQKQALMMQDAAQTGVENAAAAGGAGV 512 (535) T ss_pred HHHHHHHHHHHHHHHHHHhhChhhhhccCCHHHHHHHHHHHcCCCHhHhcCCHHHHHHHHHHHHHHHHHHHHHHhhhhhh Confidence 1234555666666666655421 2224556788999999842 122221111100 0011111111111 Q ss_pred cccCcccccccccCCCCchh--hhccC Q lcl|NC_017974. 483 TKGSSGNGTGKISSTRDNSA--ANLDN 507 (507) Q Consensus 483 ~~~~~~~~~~~~~~~~d~~a--~~~~~ 507 (507) +. .+. ..++... +|.+- T Consensus 513 ~~-~~~-------~~~~~~~~~~~~~g 531 (535) T protein:vir:33 513 GA-LAT-------SSPEAMQGAAAKAG 531 (535) T ss_pred cc-hhh-------cCChhHHHHHHhcc Confidence 11 011 1121111 11111 No 239 >protein:vir:2198 Length: 536 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041995;swissprot:sw:p03728;genbank:gi:9627467;goa:P03728;uniprot:P03728;genbank:GeneID:1261033 Probab=41.94 E-value=0.91 Score=20.82 Aligned_cols=453 Identities=12% Similarity=0.075 Sum_probs=157.4 Q ss_pred CcccccccccccCccccccccccccc----cccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSEQINVSRM----RMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLDML 76 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~~~~~~~~----~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~R 76 (507) |+.+|- ..+....+..-.....-|. ++.++-.. +..+++.....+ ...+.-.+ -|++-.-++++- T Consensus 1 m~~~~~-~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~--~lP~~~~~~~~~---~~~~~~~~-----~dst~~~a~~~L 69 (536) T protein:vir:21 1 MAEKRT-GLAEDGAKSVYERLKNDRAPYETRAQNCAQY--TIPSLFPKDSDN---ASTDYQTP-----WQAVGARGLNNL 69 (536) T ss_pred Ccchhh-chhHHHHHHHHHHHHHHhhHHHHHHHHHHHH--hcccccCCCCCc---cccccccc-----ccccHHHHHHHH Confidence 666331 1111111111100000000 00000000 000000000000 00000011 244444455555 Q ss_pred HHHHhhC-----CceeeeCCCCC---------hhHHHHHHHHHH----HHHhccCCCHHHHHHHHHH-HHhhccee---- Q lcl|NC_017974. 77 YTFVEKA-----FKDFKVIPGES---------EESKKAAKFVEY----CLKNMEGQTLRQFARDAAT-FNEYGLSV---- 133 (507) Q Consensus 77 k~av~~~-----~~~~~~~~~~~---------~~~~~~ae~v~~----~l~~~~~~~~~~~l~~~l~-a~~~Gfs~---- 133 (507) -..+.+. +| |+....+. ....++.++++. +...+....|+..+.+++. -+.+|-.+ T Consensus 70 aa~l~~~ltP~~~W-Frl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~ 148 (536) T protein:vir:21 70 ASKLMLALFPMQTW-MRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLP 148 (536) T ss_pred HHHHHHhhcCCCcc-cccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEEe Confidence 4444433 23 22211110 111233333322 2223444567766655543 33456443 Q ss_pred ---------------eeeeeeccCccccCcceeeeeccc-----cCchhcc-cccceeecCCCcccc-cccccccc-ccc Q lcl|NC_017974. 134 ---------------VEKVYTQVTVGEYIGKYKVKNLAF-----RPQASLS-RTNPIVYNEDGSAII-GIKQSLSA-FQN 190 (507) Q Consensus 134 ---------------~Ei~w~~~~~~~~~g~~~~~~l~~-----r~~~~i~-~~~~~~~~~d~~~l~-~~~q~~~~-~~~ 190 (507) .+.++..+..|.-+..++--++.. +-+.... ...... ..+...+. .+...+.. .+. T Consensus 149 e~~~~~~~~f~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~~l~~~fg~~~~~~~~~~~-~~~~v~v~~~v~~~~~~~~~~ 227 (536) T protein:vir:21 149 EPEGSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKK-ADETIDVYTHIYLDEDSGEYL 227 (536) T ss_pred eCCCCceeeEEEEEcCeEEEeeCCCCCeeEEeeeeeccHHHHHHhhhhhhcccccccc-cccceeEEEEEEEecCCCcEE Confidence 333444333332221111111111 1111000 000000 00010111 11111110 000 Q ss_pred cccccccccccccCCCccccccccEEEEeeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEeccccc Q lcl|NC_017974. 191 YTASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQI 270 (507) Q Consensus 191 ~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~ 270 (507) ++....+. ......+....-..-|++.|+...+|+.||.|....++.-..--+...+.-+...+.-.-|+..+ -|-.+ T Consensus 228 ~~~e~~g~-~v~~~~g~~~f~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv-~p~g~ 305 (536) T protein:vir:21 228 RYEEVEGM-EVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLV-NPAGI 305 (536) T ss_pred EEeccCCe-eeccccCccccccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccc-Ccccc Confidence 11100000 00000111112223579999999999999999999999887666666666666666543332111 12111 Q ss_pred CCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhc Q lcl|NC_017974. 271 LNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGA 350 (507) Q Consensus 271 ~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLG 350 (507) .+ ...++ .+| .|.++|...+ .+..+..+.++....-...|+.+...|+++++. T Consensus 306 ~~----------------~~~~~---~~~--~g~~v~g~~~------~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~ 358 (536) T protein:vir:21 306 TQ----------------PRRLT---KAQ--TGDFVTGRPE------DISFLQLEKQADFTVAKAVSDAIEARLSFAFML 358 (536) T ss_pred cc----------------hhhhc---cCC--CcceecCCcc------cceeeeccccccchHHHHHHHHHHHHHHHHHhh Confidence 10 01111 112 2445663321 122223333333333467899999999999987 Q ss_pred ccccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh--C-----CCc--CCccc-cceEEec---Cc Q lcl|NC_017974. 351 GFINVGNDKGGSYNLSESKQTIHTQFVQRVNEIILEALNENLLPQLLAL--N-----DMR--LPETE-MPYVKAG---EI 417 (507) Q Consensus 351 qtLts~~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~l--N-----~~~--~~~~~-~P~~~~~---~~ 417 (507) ..++..+ +..--..||+....+.. .. .-=+-..|+..++-||+.. + +.. .|++. -|.++-. .. T Consensus 359 ~~l~~~~--~~r~TAtEV~~r~~E~~-~~-LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~~~v~~~~vs~l~~l~ 434 (536) T protein:vir:21 359 NSAVQRT--GERVTAEEIRYVASELE-DT-LGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIG 434 (536) T ss_pred hhcccCC--CCCccHHHHHHHHHHHH-HH-hhHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCChhhccceEEecHHHHH Confidence 6554322 11122345544332221 11 1112223334444444431 1 111 01111 1222111 11 Q ss_pred CchhHHHHHHHHHHHHhcCC-----ccccHHHHHHHHHHcCC-CCC--CCcccccchhh-----hcCCCcCCCCCCcccc Q lcl|NC_017974. 418 VDVDMEGFSKAIQRIGAVGY-----LPKTPKVINRVLEVLGI-DEK--IEEDISQEELM-----KLLGEDTSRAGDGMTK 484 (507) Q Consensus 418 ~~~dl~~~a~~~~~L~~~G~-----~~~~~~~~~~i~e~~Gi-p~~--~~~~~~~~~~~-----~~~~~~~~~~~~~~~~ 484 (507) ...++..+...++.+..++= .+..+...+++.+.+|+ |.. ..++++....+ ..........+.+++. T Consensus 435 r~~~~~~l~~~~~~la~~~Pe~ld~~id~d~~~~~~a~~~Gv~p~~~irt~eev~~~r~q~~~~~~~~~~a~~~~~~~~~ 514 (536) T protein:vir:21 435 RGQDLDKLERCVTAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAA 514 (536) T ss_pred HHHHHHHHHHHHHHHHhhchhhhcccCCHHHHHHHHHHHcCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22456666666666666551 12334566788899999 531 11221111000 0000000000000100 Q ss_pred cCcccccccccCCCCchhhhccC Q lcl|NC_017974. 485 GSSGNGTGKISSTRDNSAANLDN 507 (507) Q Consensus 485 ~~~~~~~~~~~~~~d~~a~~~~~ 507 (507) . ...++...++=.++ T Consensus 515 ~--------~~~~~~~~~~~~~~ 529 (536) T protein:vir:21 515 Q--------ATASPEAMAAAADS 529 (536) T ss_pred H--------HhcChhhHHhhhhc Confidence 0 00011111111111 No 240 >protein:vir:6896 Length: 523 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861872;genbank:gi:32453663;genbank:GeneID:1494298 Probab=38.63 E-value=1.1 Score=20.45 Aligned_cols=447 Identities=11% Similarity=0.076 Sum_probs=163.1 Q ss_pred Cccccccc----ccccCccccccccccc---cccccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHH Q lcl|NC_017974. 1 MSRKRNRN----RSVKVAKATSEQINVS---RMRMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGL 73 (507) Q Consensus 1 ~~~k~~~~----~~~~~~~~~~~~~~~~---~~~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l 73 (507) ...|+.++ .+...+.|.....++. ....+..+..|. ..+.++..+.. ..-..++++.|++|..++.|-+++ T Consensus 15 ~de~~~~~~~~~~~~S~~~p~~dDGa~~i~~~~~~~~~~~~~~-~q~~y~~~e~~-~~~~~eLI~~YR~ma~~pEvd~Av 92 (523) T protein:vir:68 15 MDERDYKDQEKENLESITSPKLDDGAKEYEVSENEAQQTYNAM-FQRMFGSQEPG-LKSTRELIDTYRNLMTNYEVDNAV 92 (523) T ss_pred hhhhhhhhhhhccCCCccccCCCCcceeeeccccccccccchh-hhhhhhccccc-cchHHHHHHHHHHHhhccchhhHH Confidence 11221111 1112222222211110 000011011111 01112222111 112357999999999999999999 Q ss_pred HHHHHHHhhCCc---eeeeCCCCChhHHHHHHHHHHHHHhccCCCHHHHHHHHHHHHhhcceee-----------eeeee Q lcl|NC_017974. 74 DMLYTFVEKAFK---DFKVIPGESEESKKAAKFVEYCLKNMEGQTLRQFARDAATFNEYGLSVV-----------EKVYT 139 (507) Q Consensus 74 ~~Rk~av~~~~~---~~~~~~~~~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l~a~~~Gfs~~-----------Ei~w~ 139 (507) +.....+...+- .+.++-..++-++.+.+.|.+- |.. |..+|+.--+||..+ .++-. T Consensus 93 ~eIVneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~ee--------F~~-Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid 163 (523) T protein:vir:68 93 SEIVSDAIVYEDDTEVVSINLDNTKFSPNIKSMMLDE--------FNE-VLNHLSFQRKGSDHFRRWYVDSRIFFHKIID 163 (523) T ss_pred HHhhcceeeecCCCceEEEEecccccchHHHHHHHHH--------HHH-HHHHhccchhhhHHHHhheeeeEEEEEEEee Confidence 988776532210 1111112222233333333222 222 224444444444443 33332 Q ss_pred ccCccccCcceeeeeccccCchhcccccceeecCCCccccccccccccccccccccccc---cccccCCCccccccccEE Q lcl|NC_017974. 140 QVTVGEYIGKYKVKNLAFRPQASLSRTNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGV---GGVSTRMSDVIIPINRVM 216 (507) Q Consensus 140 ~~~~~~~~g~~~~~~l~~r~~~~i~~~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~---~~~~~~~~~~~iP~~k~i 216 (507) ... .-+|-..++.| .|.++.+++.+....+ .+...+.. -..++.+.+..... ........++.||.+ -| T Consensus 164 ~k~--pk~GI~Elr~l---DPr~i~~vr~i~~~~~-~g~~vi~~-~~e~f~Y~~~~~~~~~~g~~~~~~~~ikI~~d-AI 235 (523) T protein:vir:68 164 PKR--PKEGIKELRRL---DPRQVQYVREVITTTE-AGVKIVKG-YKEYFIYDTSHESYACDGRIYEAGTKIKIPKA-AI 235 (523) T ss_pred CCC--ccccceeeeee---CCcceeEEEeecCCCC-cchhhhhh-hhhheeeccccccccccccccCCCcceecchh-he Confidence 110 11233333333 4445555444333222 11111211 11122222221110 011223456777765 48 Q ss_pred EEeeCCccCc--cccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHH Q lcl|NC_017974. 217 LMNTGGSSSQ--ALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAA 294 (507) Q Consensus 217 ~~~~~~~~~~--p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~ 294 (507) +|+|..-.+- -.=.|-|.++..++==-+.....-+.+-=--.-.=.++++. +|..+ .....+.|+.++.-.. T Consensus 236 ~y~hSGL~d~~~~~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYID--vGnlP----k~KAeqYl~~im~k~k 309 (523) T protein:vir:68 236 VYAHSGLVDCCGKNIIGYLHRAIKPANQLKLLEDAVVIYRITRAPDRRVWYVD--TGNMP----SRKAAEHMQHVMNTMK 309 (523) T ss_pred eeeeccceeCCCCceeccchhhhHHHHhhHHHHhhHHHHhhhccccceEEEEe--cCCCC----chhHHHHHHHHHHhhc Confidence 8888543111 12247888888776433333332222210001111244442 22222 2233445555544333 Q ss_pred Hhhc-CCcee--------------EeecC-----cccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhccccc Q lcl|NC_017974. 295 NAHS-GEQSF--------------FMLPS-----DTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFIN 354 (507) Q Consensus 295 n~~~-G~~a~--------------~iiP~-----g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLt 354 (507) |=.- .+..| .-+|. |+||.+. .|+.+... ..=|+|..+.+-+++--..-- T Consensus 310 NKlvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTL---------pGgqnlge-m~DV~YF~kkLy~aLnVP~sR 379 (523) T protein:vir:68 310 NRIAYDATTGKIKNQQHIMSMTEDYWLQRRDGKAVTEVDTL---------PGADNTGN-MEDVRWFRNALYMALRIPITR 379 (523) T ss_pred ceeEEeccCCeeccchhhhhhHhhhcccccCCCcccceeec---------cccCCcCh-HHHHHHHHHHHHHHhCCccee Confidence 2100 00111 11222 5665442 23323322 345899999999988776533 Q ss_pred ccccchhhhhH---HHHH-HHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCccc----cceEEecCc------Cc Q lcl|NC_017974. 355 VGNDKGGSYNL---SESK-QTI-HTQFVQRVNEIILEALNENLLPQLLALNDMRLPETE----MPYVKAGEI------VD 419 (507) Q Consensus 355 s~~~~~GS~Al---~~vh-~~v-~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~~----~P~~~~~~~------~~ 419 (507) .+. ..|+..+ +++. .|+ |...+......+...|..-|-..|+-=+.. ..+. .+.+.|+.. |- T Consensus 380 l~~-~~~~f~~Gr~~EItRDEikF~KFI~rLR~rFs~lf~~~Lk~qLilKgii--t~eew~~i~~~I~~~f~~Dn~f~El 456 (523) T protein:vir:68 380 IPS-DQGGIQFDAGTSITRDELSFGKFIRELQHKFEEIFLDPLKTNLILKGII--TEDEWNDEINNIKIKFHRDSYFSEL 456 (523) T ss_pred ecC-CCcceecccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCC--CHHHHHHHhhcceEeeeecchHHHH Confidence 321 1222222 2222 222 223344444444444443332333221111 1111 122333322 23 Q ss_pred hhHHHHHHHHHHHHhcCCccccHHHHHHHHHH-cCCCCCCCcccccchhhhcCCCcCCCCCCcccccCccccc Q lcl|NC_017974. 420 VDMEGFSKAIQRIGAVGYLPKTPKVINRVLEV-LGIDEKIEEDISQEELMKLLGEDTSRAGDGMTKGSSGNGT 491 (507) Q Consensus 420 ~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~-~Gip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (507) .+.+.+.+++..|..+--.+-.-...+|+++. |.+.+.+-.++. ..-.....++--+.++ ...++. T Consensus 457 Ke~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~~~-kqI~~E~k~~~~~~p~-----~e~~~f 523 (523) T protein:vir:68 457 KDAEILERRINMLQMAEPFIGKYISHRTAMKDILQMSDEEIEQEA-KQIEEESKEARFQDPD-----QEQEDF 523 (523) T ss_pred HHHHHHHHHHHHHHHhhhhhcccchhHHHHHHHhccCHHHHHHHH-HHHHHHhhcCCCCCCc-----hhhhcC Confidence 44445555665554431111111235677654 566422111111 0011111100000000 000111 No 241 >protein:vir:101541 Length: 694 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958122;genbank:gi:41057668;genbank:GeneID:2716798 Probab=33.25 E-value=1.4 Score=19.84 Aligned_cols=451 Identities=11% Similarity=0.026 Sum_probs=155.9 Q ss_pred Ccccccccccc-----cCccccc---------c--ccccc-cccccccCc---------------hhhhHH--------- Q lcl|NC_017974. 1 MSRKRNRNRSV-----KVAKATS---------E--QINVS-RMRMSEQGT---------------FALAKV--------- 39 (507) Q Consensus 1 ~~~k~~~~~~~-----~~~~~~~---------~--~~~~~-~~~~~~~g~---------------~~~~~~--------- 39 (507) |+|++-||+.- ++|.... + +-+.| +......|+ .+++.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (694) T protein:vir:10 1 MSRRNAKKRTQLARTGRRPEVAKAAALAAAATIATAAAQPVPADFARRGALNALDAAPVAEPSPSLRLARQFEVDVSNYT 80 (694) T ss_pred CCccchhhHHHHhhcCCCcchhhhhhhhhhhhhhhcCCCcccCCccccccchhhcccccCCCCcchhhhhhccccccCCC Confidence 88876665521 1111100 0 00000 111111111 111100 Q ss_pred ----HHhhhhc-----ccchhcc-----chhHHHHHHHhhChHHHHHHHHHHHHHhhCCceeee--------------CC Q lcl|NC_017974. 40 ----QVDSERM-----KAEEIRW-----PNLIGTVESMKQDATVATGLDMLYTFVEKAFKDFKV--------------IP 91 (507) Q Consensus 40 ----~~~~~~e-----~~~~lr~-----~~~~~~y~~m~~D~~v~s~l~~Rk~av~~~~~~~~~--------------~~ 91 (507) +...+.. ..+.|-| .-.|.+...|..-+.+.++....-....+- |--.. .. T Consensus 81 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~Gy~~la~laQ~~eyr~~~~~ia~e~~R~-w~~~~~~~~e~~~~~g~~~~~ 159 (694) T protein:vir:10 81 PRERRAASYALDFNGTSMDALSFVTSSGFPGFPTLVLLAQLPEYRAMHEVLADECIRT-WGEAIGGTKEKADTSGLAAGG 159 (694) T ss_pred ccccchhhhhhccCcccccchhhhhccCcchHHHHHHHhhccchhhHHHHHHHHhhcc-cceeccccchhhhhhcccccc Confidence 0000000 0000100 001222233444555666666655555442 41111 11 Q ss_pred CC-ChhHHHHHHHHHHHHHhccCCCHHHHHHHHHHHHhhcceeeeeeeeccCcc-ccCcceeeeeccccCchhccc---- Q lcl|NC_017974. 92 GE-SEESKKAAKFVEYCLKNMEGQTLRQFARDAATFNEYGLSVVEKVYTQVTVG-EYIGKYKVKNLAFRPQASLSR---- 165 (507) Q Consensus 92 ~~-~~~~~~~ae~v~~~l~~~~~~~~~~~l~~~l~a~~~Gfs~~Ei~w~~~~~~-~~~g~~~~~~l~~r~~~~i~~---- 165 (507) +. +..+.+..+.|++.++++.- |..+..-+-.+-+||=+++=+.= ++.. ..+.. |.+|+- +|.+ T Consensus 160 ~~~~~~d~dqi~~L~~e~erl~V--~~~l~eaik~aRlfGGa~~~i~I--~gdd~~l~~P-----L~~~~~-~I~kGslK 229 (694) T protein:vir:10 160 NAASTSDGDQLKQINDEIERLRI--RDAVRTTVIHDQAFGRAHPYFKI--KGDDQIMDTP-----LVPRPY-TVPKGSFQ 229 (694) T ss_pred cccccccHHHHHHHHHHHHHHHH--HHHHHHHHHhhccccceEEEEEe--ecCccccccc-----cccccc-cccCccee Confidence 11 11222455567777776542 34444444568999999742322 1110 01111 111110 1111 Q ss_pred ----ccceeecCCCccccccccccccccccccccccccccccCCCccccccccEEEEeeCC------ccCccccchHHHH Q lcl|NC_017974. 166 ----TNPIVYNEDGSAIIGIKQSLSAFQNYTASEIGVGGVSTRMSDVIIPINRVMLMNTGG------SSSQALGVSPLVG 235 (507) Q Consensus 166 ----~~~~~~~~d~~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~------~~~~p~G~gll~~ 235 (507) ..+++..++.... .+|... .++....+.. .+..|=..+++.|+-.. -.-+.+|.++... T Consensus 230 Gl~ViDp~~vtP~~~n~----~dP~sp---dfgkP~~y~V----~G~~IH~SRL~~f~g~plPd~LKp~y~~~G~Sv~q~ 298 (694) T protein:vir:10 230 GLRVVEPYWVTPNNYNS----INPVAD---DFYKPSTWWM----IGTEVHATRLHTIVSRPVGDMLKPTYSFAGISMTQL 298 (694) T ss_pred eeEeecccccccchhhh----ccchhh---ccCCCceEEE----eceEEeeeeEEEecCCCchhhhhcccccCcccHHHH Confidence 0111222211000 011110 0111111110 01123233333333221 1235678888888 Q ss_pred HHHHHH---HHHHHHHHHHHHHHHhcCCceEEecccccCCCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccc Q lcl|NC_017974. 236 CYRAWR---EKILIENLEVVGATKDMGGVIELKIPSQILNKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKD 312 (507) Q Consensus 236 ~~~~~~---fK~~~~~~w~~f~Er~g~p~~v~~~P~~~~k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i 312 (507) ++..+. --+.....- .+ ++-. .++++-+ .+ .-.+... .. +..-.+++..+++ .....+|..+. - T Consensus 299 ~~e~V~~~~rT~~~v~~L-i~--~~~v--~~lk~dl--a~-~L~~g~~--~~-l~~R~eli~~~Rs-n~G~~llDk~~-E 365 (694) T protein:vir:10 299 AMPYIDNWLRTRQSVSDI-VK--QFSV--SGILMDL--AQ-ALMPGAN--VD-LSMRAELINRYRD-NRNILFLDKAT-E 365 (694) T ss_pred HHHHHHHHHHHHhHHHHH-HH--hhhh--HHHHHHH--HH-hhcChhH--HH-HHHHHHHHHHhcC-ccceEEEecCC-c Confidence 774432 222222211 11 1100 0000000 00 0001111 11 2222333334443 33445665322 2 Q ss_pred cccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhccccc--ccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_017974. 313 NAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFIN--VGNDKGGSYNLSESKQTIHTQFVQRVNEIILEALNE 390 (507) Q Consensus 313 ~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLt--s~~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~ 390 (507) ++.++ ..+=+| ...++...-.+||-+. +-.+| .|.+-.|=+|.|+.-..+.-+.+++........+=+ T Consensus 366 efeq~-----stslSG----LddVi~qf~q~VAgaa-~IPltkLfGqSPkGlNATGE~D~rnYYD~I~s~Qe~~L~p~L~ 435 (694) T protein:vir:10 366 EFFQF-----NTPLSG----LDALQAQAQEQMSAVS-HIPLIKLLGITPTGLNASSEGEIRVWYDYVRAYQRNALQQLMN 435 (694) T ss_pred ceEEE-----ecccCC----HHHHHHHHHHHHHhhh-cCchhhhhccCcccccccchhhHHHHHHHHHHHHHHHHHHHHH Confidence 33322 222222 3456666666666543 22222 123345666777777777777777765443333323 Q ss_pred HHHHHHHHhCCCcCCccccce--EEecCcCchhHH-------HHHHHHHHHHhcCCccccHHHHHHHHHHcCCCCCCC-- Q lcl|NC_017974. 391 NLLPQLLALNDMRLPETEMPY--VKAGEIVDVDME-------GFSKAIQRIGAVGYLPKTPKVINRVLEVLGIDEKIE-- 459 (507) Q Consensus 391 ~li~~l~~lN~~~~~~~~~P~--~~~~~~~~~dl~-------~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gip~~~~-- 459 (507) .|+.-| .+.- ++ ...|. |+|...-+-+-+ +.|++++.+++.|++.++ .++.++.-.+... T Consensus 436 rl~~ii-~rS~-~G--~idp~i~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~-----evr~rL~~d~~s~Y~ 506 (694) T protein:vir:10 436 DVIVMI-QLSL-FG--AVDPSIKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPD-----QVAARLNTEPDGPYA 506 (694) T ss_pred HHHHHH-HHHh-cC--CCCCcceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHH-----HHHHHHhcCCCcccc Confidence 333333 3321 11 12233 455443332223 346778889999998874 4666654432111 Q ss_pred ------cccccchhh--hcCCCcCCCCCCcccccCcccccccccCCCCchhhhc-cC Q lcl|NC_017974. 460 ------EDISQEELM--KLLGEDTSRAGDGMTKGSSGNGTGKISSTRDNSAANL-DN 507 (507) Q Consensus 460 ------~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~a~~~-~~ 507 (507) +++..+... ........+.+++...+++++ +. .-+.-+++.+|+ .| T Consensus 507 ~~~D~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~-~g~~~~~~v~~~~~~ 561 (694) T protein:vir:10 507 GKLDANDDPGVPADDDIDGVLTYVQRLAEGGDTGAPGG-AR-AGATAPPTVANVNAN 561 (694) T ss_pred cccccccCCCcCccchhhhhHhhhcCcccccccCCCCc-cc-ccccCCCcccccccc Confidence 110000000 000001111112211222221 10 011112222222 12 No 242 >protein:vir:94709 Length: 522 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338118;genbank:gi:77118196;genbank:GeneID:3707732 Probab=32.79 E-value=1.4 Score=19.78 Aligned_cols=442 Identities=13% Similarity=0.063 Sum_probs=161.3 Q ss_pred CccccccccccccccccccC---chhhhHHHHhh-hh------cccchhccchhHHHHHHHhhChHHHHHHHHHHHHHhh Q lcl|NC_017974. 13 VAKATSEQINVSRMRMSEQG---TFALAKVQVDS-ER------MKAEEIRWPNLIGTVESMKQDATVATGLDMLYTFVEK 82 (507) Q Consensus 13 ~~~~~~~~~~~~~~~~~~~g---~~~~~~~~~~~-~~------e~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~av~~ 82 (507) |++.+...-.....++...- +.=.+.|+-.. .. .... ....+.-.+ -|++-.-++++--..+++ T Consensus 1 ~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~-~~~~~~~~~-----~dst~~~a~~~Las~l~~ 74 (522) T protein:vir:94 1 MAEREGFAAEGAKAVYDRLKNGRQPYETRAQNCAAVTIPSLFPKESD-NSSTEYTTP-----WQAVGARCLNNLAAKLML 74 (522) T ss_pred CcccchhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCC-ccccccccc-----ccccHHHHHHHHHHHHHh Confidence 33322211111111111000 00001111000 00 0000 000011111 255555555555555554 Q ss_pred C-----CceeeeCCCC---------ChhHHHHHHHHHH----HHHhccCCCHHHHHHHHHH-HHhhcceee--------- Q lcl|NC_017974. 83 A-----FKDFKVIPGE---------SEESKKAAKFVEY----CLKNMEGQTLRQFARDAAT-FNEYGLSVV--------- 134 (507) Q Consensus 83 ~-----~~~~~~~~~~---------~~~~~~~ae~v~~----~l~~~~~~~~~~~l~~~l~-a~~~Gfs~~--------- 134 (507) . +| |+....+ ..+..++.+++.+ +...+....|+..+.+++. -+.+|-++. T Consensus 75 ~ltP~~~W-Frl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~ 153 (522) T protein:vir:94 75 ALFPQSPW-MRLTVSEYEAKTLSQDSEAAARVDEGLAMVERVLMAYMETNSFRVPLFEALKQLIVSGNCLLYIPEPEQGT 153 (522) T ss_pred hcCCCCcc-cccccchhhhhccCcccchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcEeEeeeccCCCc Confidence 3 23 2221111 1112234344332 2334445567776666553 345666542 Q ss_pred ----------eeeeeccCccccCcceeeeeccccC-chhccc-ccceeecCCC-cccc-ccccccccccccccccccccc Q lcl|NC_017974. 135 ----------EKVYTQVTVGEYIGKYKVKNLAFRP-QASLSR-TNPIVYNEDG-SAII-GIKQSLSAFQNYTASEIGVGG 200 (507) Q Consensus 135 ----------Ei~w~~~~~~~~~g~~~~~~l~~r~-~~~i~~-~~~~~~~~d~-~~l~-~~~q~~~~~~~~~~~~~~~~~ 200 (507) +.++..+..|.-+..+.--++..+. +..+.+ ...-.+..+. ..+. .+......+..+...+... T Consensus 154 ~~~~~~~pl~~y~v~~d~~G~vd~i~r~~~~~~~~l~~~~~~~~~~~~~~p~~~v~v~~~v~~~~~~~~~~~~~~g~~-- 231 (522) T protein:vir:94 154 YSPMRMYRLVSYVVQRDAFGNILQIVTIDKVAFSALPEDVKSQLNADDYEPDTELEVYTHIYRQDDEYLRYEEVEGIE-- 231 (522) T ss_pred eeeEEEEEcceEEEeeCCCcCeEEEeeeeeccHHhcchHHHHHHhcccCCccceEEEEEEEEeeCCceeEEeeccCce-- Confidence 2222222211111111000010000 000000 0000000000 0000 0011111111110000000 Q ss_pred cccCCCccccccccEEEEeeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEeccc-ccCCCCCCCch Q lcl|NC_017974. 201 VSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPS-QILNKAAMDQS 279 (507) Q Consensus 201 ~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~-~~~k~~~~~~~ 279 (507) .....+....-...|++.|+...+|+.||.|....+..-..--+...+.-+..+++--.|... +|+ .+++.. T Consensus 232 ~~~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~~--v~~~g~~~~~----- 304 (522) T protein:vir:94 232 VTGTDGSYPLTACPYIPVRMVRLDGEDYGRSYCEEYLGDLNSLETITEAITKMAKVASKVVGL--VNPNGITQPR----- 304 (522) T ss_pred ecccCCCCccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCcee--ecccccccch----- Confidence 000011111222458999999999999999999999998877777888888888876655422 222 111110 Q ss_pred HHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcccccccccc Q lcl|NC_017974. 280 SPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAGFINVGNDK 359 (507) Q Consensus 280 ~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~ 359 (507) .+. .+| -+.++|...+ ++..+.....+....-...|+.+...|+.+++...++..+ T Consensus 305 --------~~~------~~~--~g~~v~g~~~------~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~-- 360 (522) T protein:vir:94 305 --------RLN------KAA--TGEFVAGRVE------DINFLQLTKGQDFTIAKSVADAIEQRLGWAFLLNSAVQRN-- 360 (522) T ss_pred --------hee------ccC--CceeecCCcc------cceeeecccccchhHHHHHHHHHHHHHHHHHhhhhhccCC-- Confidence 010 112 2345554322 1222233223333334678999999999999887655432 Q ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--hCCC----cCCc--cccceEEecC-----cCchhHHHHH Q lcl|NC_017974. 360 GGSYNLSESKQTIHTQFVQRVNEIILEALNENLLPQLLA--LNDM----RLPE--TEMPYVKAGE-----IVDVDMEGFS 426 (507) Q Consensus 360 ~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~--lN~~----~~~~--~~~P~~~~~~-----~~~~dl~~~a 426 (507) ++..-..||+....+.. .. .-=+-..|+..++-||+. ++.. -.|. ...-++.|-. ....++..+. T Consensus 361 ~~r~TAtEV~~r~~E~~-~~-LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~~v~v~~~s~La~~qr~~~~~~l~ 438 (522) T protein:vir:94 361 AERVTAEEIRYVAGELE-AT-LGGVYSVQSQELQLPIVRVLMNQLQSAGMIPDLPKEAVEPTVSTGLEALGRGQDLEKLT 438 (522) T ss_pred CccccHHHHHHHHHHHH-HH-HhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCcccEEeeEecHHHHHHHHHHHHHHH Confidence 22222345544332221 11 112223344445555543 1100 1111 1111222211 1224555555 Q ss_pred HHHHHHHhcCCc-----cccHHHHHHHHHHcCCCCC----CCcccccc---hhhh-cCCCcCCCCCCccc-ccCcccccc Q lcl|NC_017974. 427 KAIQRIGAVGYL-----PKTPKVINRVLEVLGIDEK----IEEDISQE---ELMK-LLGEDTSRAGDGMT-KGSSGNGTG 492 (507) Q Consensus 427 ~~~~~L~~~G~~-----~~~~~~~~~i~e~~Gip~~----~~~~~~~~---~~~~-~~~~~~~~~~~~~~-~~~~~~~~~ 492 (507) ..++.+..++=. +..+...+++.+.+|+|.+ .+++.... .... .+.......+.+.. ...++.+.. T Consensus 439 ~~~~~ia~l~P~~~~~~id~d~~~~~~a~~~Gv~~~~ivr~~ee~~~~~~q~~~~~~~~~~~~~~~~~~~a~~~~~~~~~ 518 (522) T protein:vir:94 439 QAVNMMTGLQPLSQDPDINLPTLKLRLLNALGIDTAGLLLTQDEKIQRMAEQSSQQAVVQGASAAGANMGAAVGQGAGED 518 (522) T ss_pred HHHHHHHhccchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcccchh Confidence 555655555422 2223555788999999532 22221110 0000 00001111111111 111111222 Q ss_pred cccC Q lcl|NC_017974. 493 KISS 496 (507) Q Consensus 493 ~~~~ 496 (507) ...+ T Consensus 519 ~~~~ 522 (522) T protein:vir:94 519 MAQA 522 (522) T ss_pred hhcC Confidence 1112 No 243 >protein:vir:96988 Length: 516 # NCBI annotation: 29 # Family: family:all:481 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654130;genbank:gi:108862014;genbank:GeneID:5075937 Probab=27.94 E-value=1.8 Score=19.19 Aligned_cols=442 Identities=11% Similarity=0.019 Sum_probs=152.3 Q ss_pred CcccccccccccCcccccc--ccccccc----cccccCchhhhHHHHhhhhcccchhccchhHHHHHHHhhChHHHHHHH Q lcl|NC_017974. 1 MSRKRNRNRSVKVAKATSE--QINVSRM----RMSEQGTFALAKVQVDSERMKAEEIRWPNLIGTVESMKQDATVATGLD 74 (507) Q Consensus 1 ~~~k~~~~~~~~~~~~~~~--~~~~~~~----~~~~~g~~~~~~~~~~~~~e~~~~lr~~~~~~~y~~m~~D~~v~s~l~ 74 (507) |+..+|-.-.....+.... ++..-|. ++.++...-+ ...+. .++.. +..+ .+ -|++-.-++. T Consensus 1 ~~~~~~~~~~~~~~~l~~r~~~L~~~R~~~e~~w~e~a~~~l--P~~~~--~~~~~-~~~~--~~-----~dstg~~a~~ 68 (516) T protein:vir:96 1 MKQSIDLEYGGKRSKIPKLWEKFSNKRSSFLDRAKHYSKLTL--PYLMN--DKGDN-ETSQ--NG-----WQGVGAQATN 68 (516) T ss_pred CcchhhhhhhhhHHHHHHHHHHHHHHhhHHHHHHHHHHHhhc--ccccC--CCCCc-cccC--Cc-----ccchHHHHHH Confidence 4443332222111111000 0000000 0000000000 00000 00000 0000 11 2555555555 Q ss_pred HHHHHHhhC------CceeeeCCCCC---------hhHHHHHHHHHHH----HHhccCCCHHHHHHHHH-HHHhhcceee Q lcl|NC_017974. 75 MLYTFVEKA------FKDFKVIPGES---------EESKKAAKFVEYC----LKNMEGQTLRQFARDAA-TFNEYGLSVV 134 (507) Q Consensus 75 ~Rk~av~~~------~~~~~~~~~~~---------~~~~~~ae~v~~~----l~~~~~~~~~~~l~~~l-~a~~~Gfs~~ 134 (507) +--..+++. +| |+...++. .+..++.++++++ ...+....|+..+.+++ +-+.+|..++ T Consensus 69 ~LAa~l~~~ltpp~~~W-F~L~~~~~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l 147 (516) T protein:vir:96 69 HLANKLAQVLFPAQRSF-FRVDLTAQGEKVLNQRGLKKTELATIFAQVETRAMKELEQRQFRPAVVEAFKHLIVAGSCML 147 (516) T ss_pred HHHHHHHhhhcCCCCcc-cccccChhHHhhccccCchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEeE Confidence 544444432 33 33322211 1222444443322 22344456777776644 4556776653 Q ss_pred ee--------------eeeccCccccCcce-----eeeeccccCchhcccccce-eecCCC--ccccccccccccccccc Q lcl|NC_017974. 135 EK--------------VYTQVTVGEYIGKY-----KVKNLAFRPQASLSRTNPI-VYNEDG--SAIIGIKQSLSAFQNYT 192 (507) Q Consensus 135 Ei--------------~w~~~~~~~~~g~~-----~~~~l~~r~~~~i~~~~~~-~~~~d~--~~l~~~~q~~~~~~~~~ 192 (507) -+ ++..+..|.-+-.+ .++.|..+-+......... ...++. .....+...+.++..++ T Consensus 148 ~~d~~~~~~~~pl~~y~v~~d~~G~v~~i~rr~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~ 227 (516) T protein:vir:96 148 YKPSKGAISAIPMHHYVVNRDTNGDLLDIILLQEKALRTFDPATRAVVEVGLKGKKCKEDDSVKLYTHAKYLGDGFWELK 227 (516) T ss_pred EecCCCCEEEEEcCeEEEeeCCCCCeeeehhhhHhhHHHHHHhhhhhhhhhhhhhhcCCCCceEEEEeeeeeCCceeEEE Confidence 22 11111111000000 0000100000000000000 000000 00000111111111111 Q ss_pred cccccccccccCCCccccccccEEEEeeCCccCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEeccc-ccC Q lcl|NC_017974. 193 ASEIGVGGVSTRMSDVIIPINRVMLMNTGGSSSQALGVSPLVGCYRAWREKILIENLEVVGATKDMGGVIELKIPS-QIL 271 (507) Q Consensus 193 ~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~~~p~G~gll~~~~~~~~fK~~~~~~w~~f~Er~g~p~~v~~~P~-~~~ 271 (507) ....+...... ++...-..-|++.|+...+|+.||.|....+..-..--+...+.-+..++.-..|+. .+|+ .++ T Consensus 228 ~~~d~~~~~~e--s~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~l~~~~~a~~~~~--lv~p~g~~ 303 (516) T protein:vir:96 228 QSADDIPVGKV--SKIKSEKLPFIPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSEAVARGAALMADIKY--LIRPGAQT 303 (516) T ss_pred EEeCceeeccc--cccccccCCeeeeeeeecCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCcc--ccCccccc Confidence 10000000011 111112335799999999999999999999998876666666666777665333321 1221 111 Q ss_pred CCCCCCchHHHHHHHHHHHHHHHHhhcCCceeEeecCcccccccccceeeeeccCCCcchhHHHHHHHHHHHHHHHHhcc Q lcl|NC_017974. 272 NKAAMDQSSPEAEMVRGLMSDAANAHSGEQSFFMLPSDTKDNAPQYSMTLKGIDGMGKQYSTAQLISDRKKSILDRLGAG 351 (507) Q Consensus 272 k~~~~~~~~~~~~~l~a~~~~~~n~~~G~~a~~iiP~g~~i~~~~~~~el~~~~g~g~~~~~~~li~~~d~~Isk~iLGq 351 (507) + . .++..|. .+.++|...+ + +.....+..+....-...|+.+...|+.+++.. T Consensus 304 ~----------------~----~~l~~~~-~g~i~~g~~~-~-----v~~~q~~~~~d~~~~~~~i~~~~~rI~~af~~~ 356 (516) T protein:vir:96 304 D----------------V----DHFVNSG-TGEVVTGVEE-D-----IHIVQLGKYADLTPISAVLEVYTRRIGVVFMME 356 (516) T ss_pred c----------------h----hhhccCC-CceeecCCcc-c-----ceeeecCcccchhHHHHHHHHHHHHHHHHHhhh Confidence 1 0 1111122 2455664331 1 222222222233334578999999999999987 Q ss_pred cccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--hCCCcCCccccceEEec--CcC-------ch Q lcl|NC_017974. 352 FINVGNDKGGSYNLSESKQTIHTQFVQRVNEIILEALNENLLPQLLA--LNDMRLPETEMPYVKAG--EIV-------DV 420 (507) Q Consensus 352 tLts~~~~~GS~Al~~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~--lN~~~~~~~~~P~~~~~--~~~-------~~ 420 (507) .|...++ ..--..||+....+.. .. .-=+-.-|+..++-||+. ++- .+-.+|..... ... .. T Consensus 357 ~l~~r~~--~rvTAtEV~~r~~E~~-~~-LGpv~~rl~~Ell~Pli~r~l~~---~~p~lp~~~v~~~~vs~l~~l~r~~ 429 (516) T protein:vir:96 357 TMTRRDA--ERVTAVEIQRDALEIE-QN-MGGVYSLFATTMQSPVAMWGLLE---AGESFTSDLVDPVIITGIEALGRMA 429 (516) T ss_pred hhccCCC--ccccHHHHHHHHHHHH-HH-hhhHHHHHHHHHHHHHHHHHHHh---cCCCCccccccceeechHHHHHHHH Confidence 6655332 2122345554332221 11 112222234444445543 221 11123322111 111 12 Q ss_pred hHHHHHHHHHHH---HhcC----CccccHHHHHHHHHHcCCCCC---CCcccccchhhhc----CCCcCCCCCCcccccC Q lcl|NC_017974. 421 DMEGFSKAIQRI---GAVG----YLPKTPKVINRVLEVLGIDEK---IEEDISQEELMKL----LGEDTSRAGDGMTKGS 486 (507) Q Consensus 421 dl~~~a~~~~~L---~~~G----~~~~~~~~~~~i~e~~Gip~~---~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~ 486 (507) ++..+...++.+ +... -.+..+...+++.+.+|+|.. .+++......+.. .....+..+.+.. +. T Consensus 430 ~~~~i~~~~~~i~~~~~~~p~v~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~~~~~~~~q~~~~~a~~~~~~~~-~~ 508 (516) T protein:vir:96 430 ELDKLANFAQYMSLPLQWPEPVLAAVKWPDYMDWVRGQISAELPFLKSAEEMAQEQEAQMQAQQAQMLEEGVAKAVP-GV 508 (516) T ss_pred HHHHHHHHHHHHHHHhcCChhHHhcCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhhh-HH Confidence 222222233322 2111 111223455788999999953 2222211100000 0000011111111 01 Q ss_pred cccccccccCC Q lcl|NC_017974. 487 SGNGTGKISST 497 (507) Q Consensus 487 ~~~~~~~~~~~ 497 (507) ++ ...| ++ T Consensus 509 ~~-~~~~--~~ 516 (516) T protein:vir:96 509 IQ-QELK--EA 516 (516) T ss_pred hh-cccc--cC Confidence 11 0001 01 Done!