Query lcl|NC_016071.1_cdsid_YP_004893881.1 [gene=75] [protein=hypothetical protein] [protein_id=YP_004893881.1] [location=complement(57429..58979)]
Match_columns 516
No_of_seqs 175 out of 338
Neff 7.9
Searched_HMMs 1612
Date Thu Nov 7 14:58:48 2013
Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_74 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_74_vs_rec_db.hhr

No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 protein:vir:95254 Length: 488 100.0 8E-130 5E-133 728.2 42.3 479 11-516 1-487 (488)
2 protein:vir:108215 Length: 469 100.0 2E-115 1E-118 649.4 43.1 453 1-516 1-469 (469)
3 protein:vir:79233 Length: 526 100.0 2E-111 9E-115 628.1 39.9 432 1-516 1-458 (526)
4 protein:vir:99232 Length: 526 100.0 1E-109 7E-113 618.0 40.5 432 1-516 1-455 (526)
5 protein:vir:103860 Length: 528 100.0 4E-109 2E-112 614.9 41.0 434 1-516 1-460 (528)
6 protein:vir:1986 Length: 512 # 100.0 5E-107 3E-110 603.6 40.0 428 1-516 1-449 (512)
7 protein:vir:79511 Length: 448 100.0 6E-106 4E-109 597.4 39.9 440 1-508 1-448 (448)
8 protein:vir:77981 Length: 448 100.0 1E-105 9E-109 595.4 39.2 440 1-510 1-448 (448)
9 protein:vir:98816 Length: 446 100.0 3E-104 2E-107 588.6 36.6 417 1-470 3-446 (446)
10 protein:vir:99853 Length: 488 100.0 8E-104 5E-107 585.8 38.5 413 1-516 6-425 (488)
11 protein:vir:79063 Length: 491 100.0 1E-101 9E-105 573.5 40.0 422 1-516 1-429 (491)
12 protein:vir:107880 Length: 491 100.0 9E-102 6E-105 574.5 39.0 423 1-516 1-434 (491)
13 protein:vir:78161 Length: 355 100.0 9.7E-88 6E-91 497.6 33.6 336 133-516 1-353 (355)
14 protein:vir:102727 Length: 945 99.8 3.5E-19 2.2E-22 121.8 33.6 431 1-516 76-546 (945)
15 protein:vir:93610 Length: 454 99.8 2.2E-17 1.3E-20 111.9 36.5 432 1-516 1-450 (454)
16 protein:vir:79772 Length: 648 99.8 5.4E-17 3.3E-20 109.8 38.0 452 1-516 1-517 (648)
17 protein:vir:105064 Length: 421 99.8 1.8E-17 1.1E-20 112.4 32.5 412 1-509 3-421 (421)
18 protein:vir:3153 Length: 467 # 99.8 3.5E-17 2.1E-20 110.8 33.8 415 58-516 1-455 (467)
19 protein:vir:99452 Length: 651 99.7 5.4E-17 3.3E-20 109.8 33.8 482 1-516 1-549 (651)
20 protein:vir:1266 Length: 416 # 99.7 3.8E-17 2.3E-20 110.6 32.6 406 1-501 3-416 (416)
21 protein:vir:3843 Length: 397 # 99.7 2.8E-16 1.7E-19 105.8 33.7 391 1-505 1-397 (397)
22 protein:vir:1380 Length: 422 # 99.7 1.8E-16 1.1E-19 106.8 32.1 394 29-502 1-422 (422)
23 protein:vir:7853 Length: 518 # 99.7 7.4E-16 4.6E-19 103.5 34.8 427 11-516 1-454 (518)
24 protein:vir:101648 Length: 518 99.7 9.2E-16 5.7E-19 103.0 35.1 427 11-516 1-454 (518)
25 protein:vir:80333 Length: 419 99.7 3.7E-16 2.3E-19 105.2 31.2 411 1-515 1-419 (419)
26 protein:vir:63755 Length: 547 99.7 8.2E-16 5.1E-19 103.3 33.1 438 1-516 31-529 (547)
27 protein:vir:6240 Length: 457 # 99.7 3.2E-16 2E-19 105.6 30.8 422 1-510 1-457 (457)
28 protein:vir:102855 Length: 432 99.7 7.6E-16 4.7E-19 103.5 32.9 408 26-510 1-432 (432)
29 protein:vir:107605 Length: 432 99.7 7.6E-16 4.7E-19 103.5 32.9 408 26-510 1-432 (432)
30 protein:vir:105002 Length: 432 99.7 7.6E-16 4.7E-19 103.5 32.9 408 26-510 1-432 (432)
31 protein:vir:189 Length: 424 # 99.7 2.3E-16 1.4E-19 106.3 29.9 415 1-511 1-424 (424)
32 protein:vir:96579 Length: 576 99.7 5.1E-15 3.2E-18 98.9 36.7 439 1-516 31-531 (576)
33 protein:vir:5737 Length: 419 # 99.7 1.6E-15 9.9E-19 101.7 33.8 410 1-513 3-419 (419)
34 protein:vir:8418 Length: 409 # 99.7 1.5E-15 9.3E-19 101.9 33.4 401 1-516 1-408 (409)
35 protein:vir:4337 Length: 434 # 99.7 9.9E-16 6.1E-19 102.9 32.1 421 1-505 1-434 (434)
36 protein:vir:1431 Length: 419 # 99.7 1.5E-15 9.4E-19 101.8 32.5 409 1-515 2-419 (419)
37 protein:vir:1884 Length: 424 # 99.7 7E-16 4.3E-19 103.7 30.5 417 1-511 1-424 (424)
38 protein:vir:100150 Length: 437 99.7 2.7E-15 1.7E-18 100.5 33.2 421 1-507 1-437 (437)
39 protein:vir:483 Length: 413 # 99.7 5.9E-16 3.7E-19 104.1 29.2 405 1-512 3-413 (413)
40 protein:vir:4454 Length: 414 # 99.7 3.4E-15 2.1E-18 99.9 32.8 405 1-512 1-414 (414)
41 protein:vir:81152 Length: 411 99.7 1.4E-15 8.5E-19 102.1 30.5 403 1-509 1-411 (411)
42 protein:vir:1326 Length: 457 # 99.7 3E-15 1.9E-18 100.2 31.9 425 1-510 1-457 (457)
43 protein:vir:80644 Length: 551 99.7 7.1E-15 4.4E-18 98.2 33.5 441 1-516 23-542 (551)
44 protein:vir:100249 Length: 431 99.7 5.4E-15 3.4E-18 98.8 32.8 411 1-504 1-431 (431)
45 protein:vir:102080 Length: 429 99.7 5.5E-15 3.4E-18 98.8 32.8 419 1-511 1-429 (429)
46 protein:vir:102118 Length: 409 99.6 8.4E-15 5.2E-18 97.8 33.2 401 1-499 3-409 (409)
47 protein:vir:99312 Length: 563 99.6 1.6E-14 1E-17 96.2 34.4 439 1-516 29-557 (563)
48 protein:vir:95599 Length: 563 99.6 1.6E-14 1E-17 96.2 34.4 439 1-516 29-557 (563)
49 protein:vir:10362 Length: 432 99.6 1.4E-14 8.8E-18 96.5 33.7 414 1-513 9-432 (432)
50 protein:vir:80796 Length: 574 99.6 1.4E-14 8.9E-18 96.5 33.5 439 1-516 27-535 (574)
51 protein:vir:9359 Length: 348 # 99.6 2.7E-15 1.7E-18 100.5 29.4 342 78-501 1-348 (348)
52 protein:vir:2683 Length: 412 # 99.6 8.2E-15 5.1E-18 97.8 31.7 406 1-503 1-412 (412)
53 protein:vir:98396 Length: 441 99.6 2.5E-14 1.6E-17 95.1 34.2 420 1-511 14-441 (441)
54 protein:vir:93943 Length: 409 99.6 1.1E-14 7.1E-18 97.0 30.8 401 1-503 1-409 (409)
55 protein:vir:97060 Length: 432 99.6 2.8E-14 1.8E-17 94.8 32.9 414 1-513 9-432 (432)
56 protein:vir:4194 Length: 540 # 99.6 7.3E-14 4.5E-17 92.6 35.0 440 1-516 6-466 (540)
57 protein:vir:79984 Length: 441 99.6 5.6E-14 3.5E-17 93.2 33.9 412 1-511 13-441 (441)
58 protein:vir:9408 Length: 441 # 99.6 5.6E-14 3.5E-17 93.2 33.9 412 1-511 13-441 (441)
59 protein:vir:4598 Length: 416 # 99.6 4.4E-14 2.7E-17 93.8 33.2 408 1-511 4-416 (416)
60 protein:vir:81095 Length: 416 99.6 4.4E-14 2.7E-17 93.8 33.2 408 1-511 4-416 (416)
61 protein:vir:96980 Length: 409 99.6 4.4E-14 2.8E-17 93.8 33.2 398 1-504 1-409 (409)
62 protein:vir:101647 Length: 460 99.6 8.6E-14 5.4E-17 92.2 34.5 416 1-502 1-460 (460)
63 protein:vir:94426 Length: 409 99.6 2.9E-14 1.8E-17 94.8 31.6 396 1-503 1-409 (409)
64 protein:vir:4509 Length: 424 # 99.6 9.5E-14 5.9E-17 92.0 33.9 400 1-503 18-424 (424)
65 protein:vir:81218 Length: 423 99.6 4.4E-14 2.7E-17 93.8 31.7 411 1-502 1-423 (423)
66 protein:vir:81072 Length: 432 99.6 5.1E-14 3.2E-17 93.5 32.1 414 1-513 9-432 (432)
67 protein:vir:960 Length: 413 # 99.6 3.7E-13 2.3E-16 88.8 34.2 392 1-500 4-413 (413)
68 protein:vir:3868 Length: 417 # 99.5 4E-13 2.5E-16 88.6 32.2 403 1-516 1-416 (417)
69 protein:vir:4156 Length: 542 # 99.5 1.4E-12 8.6E-16 85.6 33.8 438 1-516 6-467 (542)
70 protein:vir:95378 Length: 406 99.5 8.3E-13 5.1E-16 86.8 32.1 395 1-503 1-406 (406)
71 protein:vir:100691 Length: 535 99.5 2.9E-12 1.8E-15 83.8 33.8 443 1-516 13-532 (535)
72 protein:vir:7407 Length: 392 # 99.5 4.1E-13 2.6E-16 88.5 29.2 380 1-506 1-392 (392)
73 protein:vir:9702 Length: 406 # 99.5 1.5E-12 9.3E-16 85.4 32.0 391 1-510 4-406 (406)
74 protein:vir:1023 Length: 392 # 99.5 6.7E-13 4.1E-16 87.3 29.4 380 1-501 3-392 (392)
75 protein:vir:3989 Length: 392 # 99.5 6.7E-13 4.1E-16 87.3 29.4 380 1-501 3-392 (392)
76 protein:vir:6210 Length: 394 # 99.5 7E-12 4.4E-15 81.7 33.3 384 1-516 1-394 (394)
77 protein:vir:100187 Length: 385 99.4 9.6E-13 6E-16 86.5 28.0 372 1-499 4-385 (385)
78 protein:vir:101289 Length: 395 99.4 3.6E-12 2.2E-15 83.4 29.8 387 1-516 1-395 (395)
79 protein:vir:9507 Length: 395 # 99.4 3.6E-12 2.2E-15 83.4 29.8 387 1-516 1-395 (395)
80 protein:vir:100650 Length: 395 99.4 3.6E-12 2.2E-15 83.4 29.8 387 1-516 1-395 (395)
81 protein:vir:80134 Length: 403 99.4 9.8E-12 6E-15 80.9 32.1 393 1-504 1-403 (403)
82 protein:vir:94666 Length: 723 99.4 1.1E-11 7.1E-15 80.6 31.4 418 1-516 1-456 (723)
83 protein:vir:4952 Length: 386 # 99.4 2.1E-11 1.3E-14 79.2 32.7 380 1-513 1-386 (386)
84 protein:vir:104259 Length: 403 99.4 2.5E-11 1.6E-14 78.7 32.9 386 1-500 3-403 (403)
85 protein:vir:95965 Length: 385 99.4 1.8E-11 1.1E-14 79.5 30.0 372 1-502 1-385 (385)
86 protein:vir:100882 Length: 383 99.4 7.7E-12 4.8E-15 81.5 27.3 370 1-502 4-383 (383)
87 protein:vir:4854 Length: 386 # 99.3 8.4E-11 5.2E-14 75.8 32.7 381 1-503 1-386 (386)
88 protein:vir:4089 Length: 395 # 99.3 1.1E-10 6.6E-14 75.3 32.0 379 1-509 1-395 (395)
89 protein:vir:94002 Length: 378 99.3 2.3E-11 1.4E-14 78.9 27.6 366 29-502 1-378 (378)
90 protein:vir:98643 Length: 395 99.3 1.6E-10 9.9E-14 74.3 30.9 384 1-498 1-395 (395)
91 protein:vir:8317 Length: 409 # 99.3 1.1E-10 6.9E-14 75.1 29.3 383 1-484 12-409 (409)
92 protein:vir:9641 Length: 395 # 99.3 1.3E-10 7.8E-14 74.9 29.5 373 1-498 1-395 (395)
93 protein:vir:4995 Length: 384 # 99.2 1.2E-10 7.6E-14 74.9 28.5 378 1-475 1-384 (384)
94 protein:vir:78310 Length: 376 99.2 2.1E-10 1.3E-13 73.7 29.4 365 1-494 1-376 (376)
95 protein:vir:93867 Length: 378 99.2 9.5E-11 5.9E-14 75.5 27.4 364 29-502 1-378 (378)
96 protein:vir:1661 Length: 378 # 99.2 1.7E-10 1.1E-13 74.1 27.3 364 29-502 1-378 (378)
97 protein:vir:8100 Length: 466 # 99.2 9.3E-10 5.7E-13 70.1 31.8 434 1-516 3-466 (466)
98 protein:vir:80040 Length: 461 99.1 1.3E-09 8.2E-13 69.3 32.0 432 1-490 1-461 (461)
99 protein:vir:5249 Length: 437 # 99.1 1.9E-09 1.2E-12 68.4 35.4 423 9-502 1-437 (437)
100 protein:vir:1082 Length: 359 # 99.1 2.8E-09 1.7E-12 67.5 28.5 348 1-467 1-359 (359)
101 protein:vir:4828 Length: 382 # 99.0 6.1E-09 3.8E-12 65.6 31.8 379 1-513 1-382 (382)
102 protein:vir:94869 Length: 378 99.0 8.9E-09 5.5E-12 64.7 28.2 362 26-502 1-378 (378)
103 protein:vir:78641 Length: 278 98.9 3.6E-09 2.2E-12 66.9 23.6 275 78-428 1-278 (278)
104 protein:vir:858 Length: 378 # 98.9 1.7E-08 1E-11 63.2 27.7 364 26-511 1-378 (378)
105 protein:vir:107742 Length: 537 98.9 1.8E-08 1.1E-11 63.0 35.7 451 1-516 48-535 (537)
106 protein:vir:100328 Length: 346 98.8 2.7E-08 1.7E-11 62.1 23.2 335 1-439 1-346 (346)
107 protein:vir:5691 Length: 344 # 98.7 1E-07 6.3E-11 58.9 23.1 330 1-431 1-344 (344)
108 protein:vir:98567 Length: 340 98.6 2.1E-07 1.3E-10 57.1 26.0 329 1-432 1-340 (340)
109 protein:vir:1150 Length: 350 # 98.5 3.2E-07 2E-10 56.2 23.3 330 1-431 1-350 (350)
110 protein:vir:79647 Length: 435 98.5 3.3E-07 2.1E-10 56.1 31.4 408 1-491 5-435 (435)
111 protein:vir:78749 Length: 337 98.5 3.5E-07 2.2E-10 56.0 22.8 323 1-431 1-337 (337)
112 protein:vir:6058 Length: 344 # 98.5 4.3E-07 2.7E-10 55.5 22.8 330 1-431 1-344 (344)
113 protein:vir:94049 Length: 532 98.5 5E-07 3.1E-10 55.1 33.5 455 1-516 23-531 (532)
114 protein:vir:79150 Length: 368 98.5 2E-07 1.2E-10 57.3 19.8 341 1-444 1-368 (368)
115 protein:vir:107662 Length: 427 98.4 6.2E-07 3.9E-10 54.6 29.6 413 7-513 1-427 (427)
116 protein:vir:96068 Length: 765 98.4 7.1E-07 4.4E-10 54.3 34.1 451 1-516 43-560 (765)
117 protein:vir:104338 Length: 422 98.4 9.3E-07 5.8E-10 53.7 30.8 405 11-489 1-422 (422)
118 protein:vir:103971 Length: 376 98.4 9.9E-07 6.1E-10 53.5 24.1 334 1-433 26-376 (376)
119 protein:vir:79207 Length: 351 98.4 1E-06 6.4E-10 53.4 24.6 332 1-433 1-351 (351)
120 protein:vir:267 Length: 348 # 98.3 1.5E-06 9.4E-10 52.5 25.8 330 1-438 1-348 (348)
121 protein:vir:78191 Length: 351 98.3 1.6E-06 9.9E-10 52.4 23.8 331 1-433 1-351 (351)
122 protein:vir:4698 Length: 251 # 98.3 8.3E-07 5.1E-10 53.9 18.9 245 1-320 1-251 (251)
123 protein:vir:3420 Length: 533 # 98.3 1.9E-06 1.2E-09 51.9 29.5 468 1-512 1-533 (533)
124 protein:vir:3743 Length: 345 # 98.2 2.8E-06 1.7E-09 51.1 25.1 335 1-433 1-345 (345)
125 protein:vir:2013 Length: 344 # 98.2 3.3E-06 2E-09 50.7 22.8 327 1-432 1-344 (344)
126 protein:vir:3780 Length: 345 # 98.1 5.6E-06 3.4E-09 49.4 24.3 335 1-433 1-345 (345)
127 protein:vir:96738 Length: 505 98.0 6.1E-06 3.8E-09 49.2 28.4 450 1-509 1-505 (505)
128 protein:vir:6382 Length: 553 # 97.9 1E-05 6.2E-09 48.0 29.4 474 1-514 1-553 (553)
129 protein:vir:389 Length: 530 # 97.9 1.3E-05 8.4E-09 47.3 34.2 463 1-511 1-530 (530)
130 protein:vir:78227 Length: 480 97.7 2.7E-05 1.6E-08 45.7 23.3 443 1-513 1-480 (480)
131 protein:vir:94956 Length: 452 97.6 4.1E-05 2.5E-08 44.6 26.9 406 1-503 1-452 (452)
132 protein:vir:78537 Length: 480 97.5 4.7E-05 2.9E-08 44.3 25.5 440 1-513 1-480 (480)
133 protein:vir:105782 Length: 449 97.5 6.3E-05 3.9E-08 43.6 25.9 415 1-506 1-449 (449)
134 protein:vir:95014 Length: 491 97.2 0.00015 9.2E-08 41.6 25.1 432 1-502 1-491 (491)
135 protein:vir:79538 Length: 502 97.1 0.00017 1E-07 41.3 34.7 445 1-512 11-502 (502)
136 protein:vir:99916 Length: 504 97.0 0.00021 1.3E-07 40.8 29.0 452 1-512 1-504 (504)
137 protein:vir:5839 Length: 533 # 97.0 0.00023 1.5E-07 40.5 20.5 434 1-516 20-524 (533)
138 protein:vir:80165 Length: 651 96.9 0.00025 1.5E-07 40.3 27.3 462 1-516 1-626 (651)
139 protein:vir:98444 Length: 434 96.9 0.00027 1.7E-07 40.1 27.9 413 20-506 1-434 (434)
140 protein:vir:99563 Length: 862 96.9 0.00028 1.7E-07 40.1 35.7 459 1-516 39-596 (862)
141 protein:vir:102239 Length: 527 96.9 0.00029 1.8E-07 39.9 26.1 453 1-516 1-523 (527)
142 protein:vir:101494 Length: 527 96.9 0.0003 1.8E-07 39.9 26.0 453 1-516 1-523 (527)
143 protein:vir:8184 Length: 474 # 96.8 0.00031 1.9E-07 39.8 24.5 427 10-502 1-474 (474)
144 protein:vir:5961 Length: 503 # 96.8 0.00033 2E-07 39.7 29.4 433 1-516 38-503 (503)
145 protein:vir:78393 Length: 489 96.7 0.00037 2.3E-07 39.4 24.8 432 1-509 1-489 (489)
146 protein:vir:10321 Length: 495 96.6 0.00046 2.8E-07 38.9 31.3 445 1-509 1-495 (495)
147 protein:vir:93747 Length: 472 96.3 0.00075 4.7E-07 37.7 27.9 418 1-512 5-472 (472)
148 protein:vir:4898 Length: 502 # 96.2 0.00086 5.3E-07 37.4 27.1 444 1-512 17-502 (502)
149 protein:vir:98853 Length: 219 95.8 0.0014 8.9E-07 36.2 17.9 212 170-432 1-219 (219)
150 protein:vir:80680 Length: 441 95.8 0.0015 9.2E-07 36.1 25.2 410 24-503 1-441 (441)
151 protein:vir:98883 Length: 517 95.8 0.0015 9.3E-07 36.1 28.0 447 1-500 3-517 (517)
152 protein:vir:80453 Length: 535 95.5 0.0019 1.2E-06 35.5 34.2 456 1-516 1-534 (535)
153 protein:vir:105889 Length: 474 95.5 0.0019 1.2E-06 35.5 30.6 418 10-509 1-474 (474)
154 protein:vir:94101 Length: 474 95.5 0.0019 1.2E-06 35.5 30.6 418 10-509 1-474 (474)
155 protein:vir:95806 Length: 440 95.5 0.002 1.2E-06 35.4 25.6 409 1-502 6-440 (440)
156 protein:vir:78907 Length: 518 95.3 0.0024 1.5E-06 35.0 32.4 426 26-500 1-518 (518)
157 protein:vir:95149 Length: 501 95.1 0.0027 1.7E-06 34.7 32.4 426 1-516 1-499 (501)
158 protein:vir:105819 Length: 456 95.1 0.0029 1.8E-06 34.5 26.6 425 1-513 1-456 (456)
159 protein:vir:102602 Length: 456 95.1 0.0029 1.8E-06 34.5 26.6 425 1-513 1-456 (456)
160 protein:vir:2341 Length: 488 # 95.0 0.003 1.9E-06 34.4 28.0 435 18-506 1-488 (488)
161 protein:vir:94742 Length: 409 95.0 0.003 1.9E-06 34.4 24.9 371 24-467 1-409 (409)
162 protein:vir:97265 Length: 513 94.6 0.0041 2.5E-06 33.7 28.9 458 1-516 1-511 (513)
163 protein:vir:105292 Length: 478 94.5 0.0041 2.6E-06 33.7 27.9 427 1-512 1-478 (478)
164 protein:vir:1236 Length: 483 # 94.4 0.0045 2.8E-06 33.5 29.5 427 1-502 1-483 (483)
165 protein:vir:733 Length: 453 # 94.4 0.0045 2.8E-06 33.4 24.6 418 10-514 1-453 (453)
166 protein:vir:103177 Length: 533 94.1 0.0054 3.4E-06 33.0 21.0 460 1-516 1-532 (533)
167 protein:vir:94805 Length: 492 94.0 0.0056 3.5E-06 32.9 26.1 422 1-516 20-491 (492)
168 protein:vir:99072 Length: 479 93.9 0.006 3.7E-06 32.8 24.2 434 15-511 1-479 (479)
169 protein:vir:7768 Length: 484 # 93.8 0.0064 4E-06 32.6 25.0 443 1-512 1-484 (484)
170 protein:vir:105154 Length: 525 93.7 0.0066 4.1E-06 32.5 22.9 465 2-515 1-525 (525)
171 protein:vir:96494 Length: 501 93.6 0.007 4.4E-06 32.4 27.1 447 1-511 16-501 (501)
172 protein:vir:9751 Length: 422 # 93.5 0.0073 4.5E-06 32.3 24.5 384 24-483 1-422 (422)
173 protein:vir:7987 Length: 456 # 93.3 0.0082 5.1E-06 32.0 25.6 423 21-507 1-456 (456)
174 protein:vir:2732 Length: 501 # 93.0 0.0092 5.7E-06 31.7 29.4 450 1-507 16-501 (501)
175 protein:vir:5665 Length: 511 # 92.9 0.0095 5.9E-06 31.7 17.2 443 1-490 5-511 (511)
176 protein:vir:9871 Length: 429 # 92.6 0.011 6.8E-06 31.3 27.3 413 1-502 1-429 (429)
177 protein:vir:97336 Length: 492 92.4 0.011 7.1E-06 31.2 29.9 413 1-511 56-492 (492)
178 protein:vir:95542 Length: 548 92.4 0.012 7.2E-06 31.2 36.2 453 1-509 11-548 (548)
179 protein:vir:96366 Length: 511 92.2 0.012 7.6E-06 31.1 24.4 437 23-514 1-511 (511)
180 protein:vir:78805 Length: 511 92.2 0.012 7.6E-06 31.1 24.4 437 23-514 1-511 (511)
181 protein:vir:99781 Length: 511 92.0 0.013 8.2E-06 30.9 25.8 426 1-511 53-511 (511)
182 protein:vir:9306 Length: 511 # 91.6 0.015 9.4E-06 30.6 25.2 439 23-514 1-511 (511)
183 protein:vir:94599 Length: 641 90.9 0.018 1.1E-05 30.1 22.5 472 1-516 1-615 (641)
184 protein:vir:2427 Length: 485 # 90.1 0.022 1.4E-05 29.6 27.2 433 1-510 1-485 (485)
185 protein:vir:104082 Length: 485 89.6 0.025 1.6E-05 29.3 28.7 449 1-510 1-485 (485)
186 protein:vir:3609 Length: 452 # 89.3 0.027 1.7E-05 29.2 27.9 415 1-502 1-452 (452)
187 protein:vir:9568 Length: 410 # 89.1 0.028 1.8E-05 29.1 24.6 387 19-482 1-410 (410)
188 protein:vir:3964 Length: 453 # 89.0 0.029 1.8E-05 29.0 25.8 417 1-502 1-453 (453)
189 protein:vir:1587 Length: 508 # 88.9 0.03 1.8E-05 29.0 29.1 438 1-498 3-508 (508)
190 protein:vir:9815 Length: 500 # 88.6 0.031 1.9E-05 28.8 30.2 443 1-498 3-500 (500)
191 protein:vir:3028 Length: 500 # 88.6 0.031 1.9E-05 28.8 30.2 443 1-498 3-500 (500)
192 protein:vir:94498 Length: 474 88.5 0.032 2E-05 28.8 30.2 412 1-509 39-474 (474)
193 protein:vir:97447 Length: 474 88.5 0.032 2E-05 28.8 30.2 412 1-509 39-474 (474)
194 protein:vir:4782 Length: 522 # 88.5 0.032 2E-05 28.8 28.3 448 1-511 14-522 (522)
195 protein:vir:101806 Length: 516 87.6 0.038 2.4E-05 28.4 16.4 441 1-498 3-516 (516)
196 protein:vir:101189 Length: 516 87.6 0.038 2.4E-05 28.4 16.4 441 1-498 3-516 (516)
197 protein:vir:78083 Length: 537 86.9 0.043 2.6E-05 28.1 33.3 425 17-510 1-537 (537)
198 protein:vir:106999 Length: 564 86.6 0.045 2.8E-05 28.0 19.8 473 1-514 1-564 (564)
199 protein:vir:95113 Length: 474 86.5 0.045 2.8E-05 28.0 30.1 421 1-502 1-474 (474)
200 protein:vir:96839 Length: 474 85.9 0.049 3E-05 27.8 27.1 428 1-510 1-474 (474)
201 protein:vir:107112 Length: 478 85.8 0.05 3.1E-05 27.7 28.9 425 1-512 1-478 (478)
202 protein:vir:102950 Length: 471 84.3 0.062 3.8E-05 27.2 25.4 416 1-504 1-471 (471)
203 protein:vir:97171 Length: 512 84.0 0.064 4E-05 27.1 24.2 451 1-514 1-512 (512)
204 protein:vir:79703 Length: 505 83.0 0.072 4.5E-05 26.8 28.4 437 1-489 3-505 (505)
205 protein:vir:95899 Length: 474 82.6 0.075 4.7E-05 26.7 28.3 419 1-505 1-474 (474)
206 protein:vir:96266 Length: 474 82.6 0.075 4.7E-05 26.7 28.3 419 1-505 1-474 (474)
207 protein:vir:106571 Length: 499 82.6 0.076 4.7E-05 26.7 25.8 442 1-516 1-493 (499)
208 protein:vir:96240 Length: 511 81.7 0.084 5.2E-05 26.5 25.9 445 1-514 17-511 (511)
209 protein:vir:1634 Length: 409 # 77.3 0.13 7.8E-05 25.5 26.6 375 24-467 1-409 (409)
210 protein:vir:4223 Length: 486 # 77.3 0.13 7.8E-05 25.5 29.4 444 1-509 1-486 (486)
211 protein:vir:79043 Length: 479 76.9 0.13 8.1E-05 25.4 28.1 409 1-501 32-479 (479)
212 protein:vir:80959 Length: 499 75.6 0.14 9E-05 25.2 31.8 440 1-500 1-499 (499)
213 protein:vir:108049 Length: 524 75.4 0.15 9.1E-05 25.2 20.3 443 1-490 15-524 (524)
214 protein:vir:99522 Length: 470 74.1 0.16 0.0001 24.9 29.9 429 1-509 1-470 (470)
215 protein:vir:38 Length: 496 # N 72.6 0.18 0.00011 24.7 32.5 445 1-502 1-496 (496)
216 protein:vir:105461 Length: 470 70.9 0.2 0.00013 24.4 29.2 406 24-502 1-470 (470)
217 protein:vir:106639 Length: 481 70.7 0.21 0.00013 24.4 27.7 429 1-499 9-481 (481)
218 protein:vir:104500 Length: 537 70.3 0.21 0.00013 24.3 24.6 448 1-509 1-537 (537)
219 protein:vir:102330 Length: 451 68.3 0.24 0.00015 24.0 25.9 401 24-500 1-451 (451)
220 protein:vir:2500 Length: 501 # 67.8 0.25 0.00015 23.9 26.8 434 1-516 1-498 (501)
221 protein:vir:101541 Length: 694 67.8 0.25 0.00015 23.9 33.9 445 1-516 1-557 (694)
222 protein:vir:103458 Length: 524 67.2 0.26 0.00016 23.8 17.6 444 1-490 13-524 (524)
223 protein:vir:103951 Length: 511 66.6 0.27 0.00016 23.7 26.4 441 1-514 17-511 (511)
224 protein:vir:100598 Length: 516 64.6 0.3 0.00018 23.5 16.9 445 1-498 3-516 (516)
225 protein:vir:7208 Length: 524 # 64.3 0.3 0.00019 23.4 17.6 444 1-490 13-524 (524)
226 protein:vir:95449 Length: 584 63.7 0.31 0.00019 23.3 21.2 443 1-501 1-584 (584)
227 protein:vir:7430 Length: 563 # 57.7 0.43 0.00027 22.6 28.1 464 1-516 1-548 (563)
228 protein:vir:96179 Length: 468 56.3 0.46 0.00029 22.4 26.3 422 1-502 1-468 (468)
229 protein:vir:98265 Length: 524 55.9 0.47 0.00029 22.4 18.1 443 1-490 17-524 (524)
230 protein:vir:103219 Length: 201 55.0 0.49 0.0003 22.3 13.5 192 249-499 1-201 (201)
231 protein:vir:94546 Length: 506 46.6 0.73 0.00045 21.3 29.2 447 1-512 22-506 (506)
232 protein:vir:96783 Length: 488 45.4 0.77 0.00048 21.2 24.4 417 1-491 7-488 (488)
233 protein:vir:78589 Length: 695 42.7 0.88 0.00054 20.9 33.4 439 1-516 46-558 (695)
234 protein:vir:3648 Length: 695 # 39.2 1 0.00064 20.5 33.6 444 1-516 33-558 (695)
235 protein:vir:106282 Length: 521 34.3 1.3 0.00081 20.0 22.1 438 1-490 5-521 (521)
236 protein:vir:9922 Length: 489 # 31.6 1.5 0.00092 19.6 25.3 443 1-504 9-489 (489)
237 protein:vir:6896 Length: 523 # 31.1 1.5 0.00094 19.6 20.1 444 1-490 5-523 (523)
238 protein:vir:106491 Length: 646 29.1 1.7 0.001 19.3 21.8 440 1-516 13-510 (646)

No 1
>protein:vir:95254 Length: 488 # NCBI annotation: Phage conserved protein # Family: family:all:2372 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944885;genbank:gi:158267601;genbank:GeneID:2744039
Probab=100.00 E-value=8.5e-130 Score=728.21 Aligned_cols=479 Identities=27% Similarity=0.405 Sum_probs=410.0

Q ss_pred ccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhhChHHHHHHHHHHHHHhcCCceeeeCCC
Q lcl|NC_016071. 11 VVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQDHTVSTALDTKYVFVTKAFNDFKVLYN 90 (516)
Q Consensus 11 ~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~~v~~~~w~i~~~~~ 90 (516)
+++.....+.++|.|+++||+.+.+.+.+...++..|+|||++.+++|++|++|+||+++|++||++|++++|+|++..+
T Consensus 1 ~~~~~~~~~gl~p~rl~~i~~~~~~~~~~~~~~~~~~~Lr~~~~~~ly~~m~~D~hi~s~l~~Rk~av~~~~w~v~p~~~ 80 (488)
T protein:vir:95 1 MADITETQESLPPFRMGEVGSLGLKVKNGRIYEEPRQALRFPESIKTFQLMMRDPAVAASVNIIKMFVRKVNWRFVPPKG 80 (488)
T ss_pred CCCccccCCCCCHHHHHHHHHHhhccccchhhccchhhhcccchHHHHHHHhhChHHHHHHHHHHHHHhcCCceEecCCC
Confidence 55555566789999999999999999989888999999999999999999999999999999999999999999987654

Q ss_pred C-CChhhHHHHHHHHHHHhhccCcCCHHHHHHHHHHHHhhcceeeeEEEeeccccc------ccccceeeccccccCchh
Q lcl|NC_016071. 91 R-DSKASKDAAEFVEYALKNLANQQTLRDIARSAATFNEYGFSIFEKVYRTESAPS------KYAGYITIDKIAFRPQSS 163 (516)
Q Consensus 91 ~-d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~lda~~~G~S~~Eivw~~~~~~~------~~~g~~~~~~l~~r~q~t 163 (516)
+ ++..++++|++|+++++++. .+|+++|++||||++|||||+|++|+++.+.. ..+|++.+++|++|||.+
T Consensus 81 ~~~d~~~~~~a~~v~~~l~~~~--~~~~~~i~~~lda~~~G~s~~Eivw~~~~~~~~~~~~~~~dg~~~~~~i~~Rpq~~ 158 (488)
T protein:vir:95 81 KEQDPKMLERADFFNSLMDDME--HDWADFINSVMSFCTYGFCVNEKVYKKRQGKKGKYQSKFDDGLIGWAKLPIRNQST 158 (488)
T ss_pred CchhHHHHHHHHHHHHHHhccC--ccHHHHHHHHHHhhcccceeeeeeeeccccccccccccccCCeeeeeeeeecCccc
Confidence 3 34456789999999999885 35999999999999999999999999875432 348999999999999988

Q ss_pred cccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCccccchhHHH
Q lcl|NC_016071. 164 LSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPAGVSPLVG 243 (516)
Q Consensus 164 i~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~G~gLlr~ 243 (516)
|+ ||.|+.|++++++++|+......... .........++.||+.|||+|+|+++++||||.||||.
T Consensus 159 ~~---~f~~d~d~~l~~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~lP~~kfi~~~~~~~~g~p~g~gLlr~ 224 (488)
T protein:vir:95 159 LD---KWYFDEDFRRVTGVRQNLRNVSHIAG-----------AINLGERPLTRKLPRAKFMLFKYDDEYGNPEGRSPLLN 224 (488)
T ss_pred cc---ceeeccCCCceeeccccccccccccc-----------ccccccccccccccccceEEEeecCCCCccchhhHHHH
Confidence 85 79999999999999988654322211 11123456788999999999999999999999999999

Q ss_pred HHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEeccCcccccc
Q lcl|NC_016071. 244 CYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILPSDMNAQGG 323 (516)
Q Consensus 244 ~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~ 323 (516)
|||+|+||++++++|++|+||||+|||++++|+.+... .++.+....++.+.+++.+++++.++|+|||.||+++++
T Consensus 225 ~~w~~~fK~~~~~~w~~f~Er~g~g~p~~~~p~~~~~~---~~~~e~~~l~~a~~~i~~~~~~~~~ag~iiP~g~~~~~k 301 (488)
T protein:vir:95 225 AYVPWKYKVQIEEYEAVGVSRDLVGMPKIGLPPDYLDE---NAEPEKKAFVQYCKTVVNDMIANDRAGLIWPRYIDPDTK 301 (488)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhcccceeEeeccCCCCC---cccHHHHHHHHHHHHHHHHhhccchhheeeccccccccc
Confidence 99999999999999999999999999999999865332 223334456677888888999999999999999999997

Q ss_pred c-ccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q lcl|NC_016071. 324 E-QYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQSIHGHFVQRDIDIIVEAFNKNLI 402 (516)
Q Consensus 324 e-~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~ev~~~~~~aDa~~i~~~ln~~li 402 (516)
+ ..++++++++|+ +.++|.+||+|||++|||+|||||||++++++||+|+|+||++|+++++++|+++|+++||+|||
T Consensus 302 ~~~~e~~l~~~~~~-~~~~~~~li~~~d~~Isk~iLGqtLT~~~~~~Gs~Al~~vh~ev~~~i~~aDa~~i~~tln~~li 380 (488)
T protein:vir:95 302 EDIFEFSLVSRQGA-KAYDTGSIIDRYSKQIMMAFMSDVLAMGQSKYGSFSLADSKTSLLAMSVDILLKQIKNVINRDLV 380 (488)
T ss_pred hhhhhhhccccccC-CchhHHHHHHHHHHHHHHHHhccccccccCcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 5 367888888764 55679999999999999999999999988888999999999999999999999999999999999

Q ss_pred HHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccccCcccccCCC
Q lcl|NC_016071. 403 PQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMSTDELLKLLGQ 482 (516)
Q Consensus 403 ~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~~~~~~~~~~ 482 (516)
+|||++|| ++..++|+|+|+.++++|++++++++++|+++|++++++.+++|++++||||++.++++.. .+..++
T Consensus 381 ~~l~~~Nf--g~~~~~P~~~~~~~e~~Dl~~~ae~~~~L~~~G~~i~~~~~~~~i~e~~gip~~~~~e~~~---~~~~~~
 455 (488)
T protein:vir:95 381 AQTYALNM--WDDEEHVQITYDDIETPDLEAIGSYIQKTVAVGALEVDKELSNKLREHIGLPPADESQPVS---EKLSPN 455 (488)
T ss_pred HHHHHhcC--CCCCCccEEEecCcChhhHHHHHHHHHHHHhCCCccccHHHHHHHHHHhCCCCCCCCcccc---ccCCCC
Confidence 99999994 5678899999999999999999999999999999999999999999999999875544332 222233

Q ss_pred CCCcccccccccCCCCCcccccccccchhhhhcC
Q lcl|NC_016071. 483 DTSRSGDGMTAGSNGNGTGKISSTRDNSVSNMDN 516 (516)
Q Consensus 483 ~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~ 516 (516)
..++++++ ..+++++++++++++|++.||++|
T Consensus 456 ~~~~~~~~--~~~~~~~~~~~~~~~~~~~a~~~~ 487 (488)
T protein:vir:95 456 SQSRSGDG--YKTAGEGTAKTPSAKDPSTANKAN 487 (488)
T ss_pred CCCCCCcc--cCCCcccCCcccccccchhhhhcc
Confidence 33333333 235678899999999999999999

No 2
>protein:vir:108215 Length: 469 # NCBI annotation: gp6 # Family: family:all:2372 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552335;genbank:gi:160700655;genbank:GeneID:5758935
Probab=100.00 E-value=2e-115 Score=649.44 Aligned_cols=453 Identities=20% Similarity=0.205 Sum_probs=362.2

Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHHHH-HHHhhcccccCCcccHHHHHHHh-hChHHHHHHHHHHHHH
Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRA-ESEVMKVEELRWPCFLATVEAMK-QDHTVSTALDTKYVFV 78 (516)
Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~-~~~~~~~~~lr~~~~~~~y~~m~-~D~~v~s~l~~Rk~~v 78 (516)
|++++ .|++|+.+.+|+|+.|+..... +...++.|+||+++.+++|++|+ +|+||+|+|++||++|
T Consensus 1 ~~~~~------------~~~~p~~~~g~~~~~~~~~~~~~~~~~e~~~~lr~~~~~~ly~~m~e~D~~i~s~l~~rk~av 68 (469)
T protein:vir:10 1 MTERV------------KTAAPVSEAGYVFGSGVVDGWTVWDPFEQTPELQWPQSVAVYSRMDNEDSRVTSLLEAISLPI 68 (469)
T ss_pred CCCcc------------cCCCCccchhhhhhcccccchhhccccccccccccccchHHHHHHHhhChHHHHHHHHHHHHH
Confidence 44433 3778888999999988753322 33457889999999999999997 5999999999999999

Q ss_pred hcCCceeeeCCCCCChhhHHHHHHHHHHHhhcc-------------CcCCHHHHHHHHHH-HHhhcceeeeEEEeecccc
Q lcl|NC_016071. 79 TKAFNDFKVLYNRDSKASKDAAEFVEYALKNLA-------------NQQTLRDIARSAAT-FNEYGFSIFEKVYRTESAP 144 (516)
Q Consensus 79 ~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~-------------~~~~~~~~l~~~ld-a~~~G~S~~Eivw~~~~~~ 144 (516)
++++|+|++..+ +.+++++++++|.... .+.+|.++|.++|+ |++|||||+|+||++++.
T Consensus 69 ~~~~w~v~p~~~-----~~e~~~~~~~~L~~~~~~~~~~~~~~~~~~~~~w~~~l~~~l~~a~~~G~s~~Eivw~~~~~- 142 (469)
T protein:vir:10 69 RSTPWRIRANGA-----SDEVTEFVSRNLMVPIDGEDDVRNPGRSRGRFSWAEHLEEVTSPTLQFGHAVFEQVYRPRNQ- 142 (469)
T ss_pred hcCCceEecCCC-----CHHHHHHHHHHHHhhhhhhhhhhhhhhhhccccHHHHHHHHHHHhhhhCceeeeeeeecccc-
Confidence 999999975432 3467788888776431 24569999988776 899999999999998753

Q ss_pred cccccceeeccccccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEE
Q lcl|NC_016071. 145 SKYAGYITIDKIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLM 224 (516)
Q Consensus 145 ~~~~g~~~~~~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i 224 (516)
.++|++.+++|++|||.+|. +|.|++|++ ++.++|..+..... ......+.++++||+.|||
T Consensus 143 -~~dG~~~~~~l~~rp~~~i~---~~~~~~~~~-l~~~~~~~~~~~~~-------------~~~~~~~~~~~~lp~~k~i 204 (469)
T protein:vir:10 143 -SPDGRFWLRKLAPRPQWTIS---KFNVAPDGG-LESIEQIAPPARTR-------------GSLYVANIAPPEIPVNRLV 204 (469)
T ss_pred -cCCCceeeeeeeecCcccce---eeeeccCCc-eeeeeecCcccccc-------------cccccCCCCccccccCcEE
Confidence 24899999999999999886 488888875 44555543321111 1112234568899999999

Q ss_pred EEeecCcCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHh
Q lcl|NC_016071. 225 VMSLGGTESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANA 304 (516)
Q Consensus 225 ~~~~~~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~ 304 (516)
+|+|++++|||||.||||.|||+|+||++++++|+.|+||||+|++++++|+ ++++++++. |.+++.++
T Consensus 205 ~~~~~~~~g~p~g~gLlr~~~~~~~fK~~~~~~w~~f~EryG~P~~vgky~~--------~a~~~ek~~---l~~a~~~~ 273 (469)
T protein:vir:10 205 VYTRNKRPGQWQGKSILRSAYKHWLLKDKLLRIEAATAERNGMGIPVGTASS--------ATDEDEVRK---MAALARSV 273 (469)
T ss_pred EEEecCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCcceEEecCC--------CCCHHHHHH---HHHHHHHH
Confidence 9999999999999999999999999999999999999999999999887764 455566544 55666777

Q ss_pred hcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHHHHHHH
Q lcl|NC_016071. 305 HAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQSIHGH 384 (516)
Q Consensus 305 ~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~ev~~~ 384 (516)
++|+++++|||.||+ |||++++|++ .+|.++|+|||++|||+|||||||++++ +||||+|+||++|+++
T Consensus 274 ~~g~~a~~iip~~~~--------ie~~ea~g~~--~~~~~li~~~d~~Isk~iLG~tlTs~~~-gGS~a~~~vh~ev~~d 342 (469)
T protein:vir:10 274 RGGINAGVGLAQGQI--------LELLGVSGNL--PDIRRAIEGHDRSIALSGLAHFLNLDGK-GGSYALASVLEDPFTQ 342 (469)
T ss_pred hcCCceEEEccCCce--------EEEeecCCCc--hHHHHHHHHHHHHHHHHHhcccccccCc-cchhhHHHHHHHHHHH
Confidence 789999999999985 6666666544 4799999999999999999999999754 5999999999999999

Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCC
Q lcl|NC_016071. 385 FVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFD 464 (516)
Q Consensus 385 ~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp 464 (516)
++++|+++|+++||+|||++|+++|| ++...+|+|+|++.++. .+.+++++++|+++|+++.++..++|++++||||
T Consensus 343 ~~~sDa~~i~~tln~~li~~l~~lN~--g~~~~~P~~~~~~~e~~-~~~~a~~i~~l~~~G~~~~~~~~~~~~~e~~gip 419 (469)
T protein:vir:10 343 AVHAYATSICRIANQHIIEDLVDINF--GVDTPAPVLTFDPIGSR-QDLTAAAVKLLYDAGVFDDDPAVKRAIRQRFNLP 419 (469)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcC--CCCCCccEEEecCCCCc-HHHHHHHHHHHHhcCCccCccccHHHHHHHhCCC
Confidence 99999999999999999999999994 66788999999988754 4788999999999999998888899999999999

Q ss_pred CCCCcccccCcccccCCCCCCcccccccccCCCCCcccccccccchhhhhcC
Q lcl|NC_016071. 465 EEIPEDMSTDELLKLLGQDTSRSGDGMTAGSNGNGTGKISSTRDNSVSNMDN 516 (516)
Q Consensus 465 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~ 516 (516)
++.+++++....++..++..++++++. ..++.+++..+++++..++..+=
T Consensus 420 ~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~l~da 469 (469)
T protein:vir:10 420 SELNDTPSAEPEEPAAVPNQSAAPART--RSSGNADARARAPKADQGVLFDA 469 (469)
T ss_pred CCCCCcccccchhcccCCCCCcccccc--CCCCCcccccccCCChHHhhccC
Confidence 998887766554444444444444433 33455666666666666555544

No 3
>protein:vir:79233 Length: 526 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469155;genbank:gi:157834998;genbank:GeneID:5648814
Probab=100.00 E-value=1.5e-111 Score=628.13 Aligned_cols=432 Identities=12% Similarity=0.092 Sum_probs=326.6

Q ss_pred CCccccCcccccc-----------------hhhhccc--CCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHH
Q lcl|NC_016071. 1 MSTRFAQPSEVVK-----------------AGNENLA--VSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAM 61 (516)
Q Consensus 1 ~~~r~~~~~~~~~-----------------~~~~~p~--~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m 61 (516)
|+.=..+-++..+ ....+|+ ++|.|+. +.+..++. ++++ ..++||++|
T Consensus 1 ~~~~~d~~g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~----------~il~~a~~-gd~~--~~~~L~edm 67 (526)
T protein:vir:79 1 MAQIVDVYGNPIRPQQLREPQTSRLAGLAKEFAQHPAKGLTPAKLA----------RILVEAEQ-GNLQ--AQAELFMDM 67 (526)
T ss_pred CCeeeCCCCCccCccccchhhhhhhhhhhhhcccCCCCCcCHHHHH----------HHHHHhhC-CCHH--HHHHHHHHH
Confidence 5444444444333 3323332 2221111 11223332 2332 357899999

Q ss_pred h-hChHHHHHHHHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHHHHHhhcceeeeEEEee
Q lcl|NC_016071. 62 K-QDHTVSTALDTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAATFNEYGFSIFEKVYRT 140 (516)
Q Consensus 62 ~-~D~~v~s~l~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~lda~~~G~S~~Eivw~~ 140 (516)
+ +|+||+|+|++||++|++++|+|++.. .+++.++++|++|+++|+++. +|+++|++||+|++|||||+|++|+.
T Consensus 68 ~e~D~~i~s~l~~Rk~av~~~~w~I~p~~-~~~~~~~~~a~~v~~~l~~~~---~~~~~i~~~ldA~~~G~s~~Ei~w~~ 143 (526)
T protein:vir:79 68 EERDAHLFAEMSKRKRAILGLDWAVEPPR-NASAAEKADADYLHELLLDLE---GLEDLLLDALDGIGHGYSCIELEWAL 143 (526)
T ss_pred HhhChHHHHHHHHHHHHHhCCCceEecCC-CCChHHHHHHHHHHHHHhccc---CHHHHHHHHHhhhhhcceeEEEEEee
Confidence 8 699999999999999999999997643 356788999999999998764 49999999999999999999999998

Q ss_pred cccccccccceeeccccccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCcccccc
Q lcl|NC_016071. 141 ESAPSKYAGYITIDKIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPI 220 (516)
Q Consensus 141 ~~~~~~~~g~~~~~~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~ 220 (516)
+ +|.+.+.++.+||| +||.|++++++.+.+++. ..+|+++|+
T Consensus 144 ~------~g~~~~~~l~~r~~------~~F~~~~~~~~~l~~~~~--------------------------~~~g~~l~~ 185 (526)
T protein:vir:79 144 Q------GREWMPLAFHHRPQ------SWFQLNPEDQNELRLRDN--------------------------SPAGEALQP 185 (526)
T ss_pred c------CCceeEEEeeeecc------cceEeccCCCcEEEecCC--------------------------CCCceeecC

Q ss_pred ccEEEEeecCcCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHH
Q lcl|NC_016071.
221 NKLMVMSLGGTESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMAD 300 (516) Q Consensus 221 ~k~i~~~~~~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~ 300 (516) .|||+|+|++++|||||.||||+|||+|+||++++++|+.|+||||+|++++++|+ +++++|++.+ .++ T Consensus 186 ~k~iv~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~F~E~yG~P~~igky~~--------~a~~~ek~~L---~~a 254 (526) T protein:vir:79 186 FGWIIHRPRARSGYVARSGLFRVLAWPYLFRHYATSDLAEMLEIYGLPIRLGKYPP--------GTADEEKATL---LRA 254 (526) T ss_pred CceEEEeecCCcCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCceEEEecCC--------CCCHHHHHHH---HHH Confidence 99999999999999999999999999999999999999999999999998877654 3555665544 344 Q ss_pred HHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccC--CccchhhHHHHH Q lcl|NC_016071. 301 AANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGN--DGQGSYNLSESK 378 (516) Q Consensus 301 ~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~--~~~GS~Al~~vh 378 (516) +.++ |+++++|||.||+ |||++++++ +...|.+||+|||++|||+||||||||++ +++||+|+|+|| T Consensus 255 v~~i--~~da~~iiP~~~~--------ie~~ea~~~-~~~~f~~li~~~d~~Isk~iLGqtlTs~~~~g~~gS~a~g~vh 323 (526) T protein:vir:79 255 VTGL--GHAAAGIIPETMA--------IDFQQAAQG-SSEPFLAMMRQSEDAISKAVLGGTLTSTTSQSGGGAFALGQVH 323 (526) T ss_pred HHHH--hcCcEEEecCCce--------eEEeecCCC-CHHHHHHHHHHHHHHHHHHHhhhhhccccccCcchhhhhHHHH Confidence 4444 6789999999985 556666544 34579999999999999999999999963 356999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcC-CccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHH Q lcl|NC_016071. 379 QSIHGHFVQRDIDIIVEAFNKNLIPQLLALNDIRL-SDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKI 457 (516) Q Consensus 379 ~ev~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~-~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i 457 (516) ++|+++++++||++|++|||+|||++|+++||+.. 
+..++|+|+|+..+++|++++++++++|+++|+.++ .+|+ T Consensus 324 ~~v~~di~~aDa~~i~~tln~~Li~~l~~~N~~~~~~~~~~p~~~~~~~e~eDl~~~a~~~~~L~~~G~~i~----~~~i 399 (526) T protein:vir:79 324 NEVRHDILASDARQLAATLSRDLLWPLLVLNRPGSPDVRRAPRLVFDLREQADITSMAQSIPALVNVGLEIP----SAWV 399 (526) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcCCccccceEEeCCCCcccHHHHHHHHHHHHhCCCcCC----HHHH Confidence 99999999999999999999999999999997543 346789999999999999999999999999999765 5899 Q ss_pred HHHcCCCCCCCcccccCcccccCCCCCCcccccccccCCCCCcccccccccc---hhhhhcC Q lcl|NC_016071. 458 LEVGGFDEEIPEDMSTDELLKLLGQDTSRSGDGMTAGSNGNGTGKISSTRDN---SVSNMDN 516 (516) Q Consensus 458 ~e~~Glp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~---~~~~~~~ 516 (516) +++||||.+.++++......++.+....++. ...+............|. ..+..++ T Consensus 400 ~e~~gip~~~~~e~~l~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~d~~l~~~~~ 458 (526) T protein:vir:79 400 YDKLGIPQPAKNEPVLRPAAQPAILSRQHGQ---RVAALATIVGPRYGDQQALDKALADLPA 458 (526) T ss_pred HHHhCCCCCCCchhhccccCCcccccccccc---ccccccccccccCchhhHHHHHHHHHHH Confidence 9999999988776655443333332222211 111111111111111111 1111111 No 4 >protein:vir:99232 Length: 526 # NCBI annotation: putative portal protein # Family: family:all:313 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950451;genbank:gi:119953652;genbank:GeneID:4643092 Probab=100.00 E-value=1.1e-109 Score=618.00 Aligned_cols=432 Identities=13% Similarity=0.091 Sum_probs=325.7 Q ss_pred CCccccCccccc-----------------chhhhccc--CCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHH Q lcl|NC_016071. 1 MSTRFAQPSEVV-----------------KAGNENLA--VSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAM 61 (516) Q Consensus 1 ~~~r~~~~~~~~-----------------~~~~~~p~--~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m 61 (516) |+.=..+-++.. +....+|+ ++|.|+. +.+..++. 
+++ ...++||++| T Consensus 1 ~~~~~d~~g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~----------~iLr~a~~-gd~--~~~~~L~e~m 67 (526) T protein:vir:99 1 MAQIVDVYGNPIRTQQLREPQTSRLAGLAKEFAQHPAKGLTPAKLA----------RILVEAEQ-GNL--QAQAELFMDM 67 (526) T ss_pred CCeeECCCCCccccccccchhhhhhhhhhhhhcccCcCCCCHHHHH----------HHHHhhhC-CCH--HHHHHHHHHH Confidence 444443333322 22222221 2221111 11222232 222 2367899999 Q ss_pred h-hChHHHHHHHHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHHHHHhhcceeeeEEEee Q lcl|NC_016071. 62 K-QDHTVSTALDTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAATFNEYGFSIFEKVYRT 140 (516) Q Consensus 62 ~-~D~~v~s~l~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~lda~~~G~S~~Eivw~~ 140 (516) + +|+||+|+|++||++|++++|+|++.. .+++.++++|++|+++|+++. +|+++|++||+|++|||||+|++|+. T Consensus 68 ~e~D~~i~s~l~~Rk~av~~~~w~I~p~~-~~~~~~~~~a~~v~~~l~~~~---~~~~~i~~~lda~~~G~s~~Eivw~~ 143 (526) T protein:vir:99 68 EERDAHLFAEMSKRKRAILGLDWAVEPPR-NASAAEKADADYLHELLLDLE---GLEDLLLDALDGIGHGYSCIELEWAL 143 (526) T ss_pred HhhChHHHHHHHHHHHHHhCCCceEecCC-CCCHHHHHHHHHHHHHHhccc---CHHHHHHHHHHhhhhcceeEEEEEee Confidence 8 599999999999999999999987643 356788999999999998764 49999999999999999999999998 Q ss_pred cccccccccceeeccccccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCcccccc Q lcl|NC_016071. 141 ESAPSKYAGYITIDKIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPI 220 (516) Q Consensus 141 ~~~~~~~~g~~~~~~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~ 220 (516) + +|.+.+.++.+||| +||.|++++++.+.+++. ..+|+++|+ T Consensus 144 ~------~g~~~~~~l~~r~~------~~f~~~~~~~~~l~~~~~--------------------------~~~g~~l~~ 185 (526) T protein:vir:99 144 Q------GREWMPLAFHHRPQ------SWFQLNPEDQNELRLRDN--------------------------SPAGEALQP 185 (526) T ss_pred c------CCceeEEEeeeecc------cceeeccCCCcEEEecCC--------------------------CCCceeecC Confidence 6 46777889999987 699999999877665542 356889999 Q ss_pred ccEEEEeecCcCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHH Q lcl|NC_016071. 
221 NKLMVMSLGGTESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMAD 300 (516) Q Consensus 221 ~k~i~~~~~~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~ 300 (516) .|||+|+|++++|||||.||||+|||+|+||++++++|+.|+||||+|++++++|+ +++++|++.+ .++ T Consensus 186 ~k~i~~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~--------~a~~~ek~~L---~~a 254 (526) T protein:vir:99 186 FGWIIHRPRARSGYVARSGLFRVLAWPYLFRHYATSDLAEMLEIYGLPIRLGKYPP--------GTADEEKATL---LRA 254 (526) T ss_pred CCeEEEeecCCcCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCceEEEecCC--------CCCHHHHHHH---HHH Confidence 99999999999999999999999999999999999999999999999998877654 3555665544 344 Q ss_pred HHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccC--CccchhhHHHHH Q lcl|NC_016071. 301 AANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGN--DGQGSYNLSESK 378 (516) Q Consensus 301 ~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~--~~~GS~Al~~vh 378 (516) +.++ |+++++|||.||+ |||++++++ +...|.+||+|||++|||+||||||||++ +++||+|+|+|| T Consensus 255 v~~i--~~d~~~iiP~~~~--------ie~~ea~~~-~~~~f~~li~~~d~~Isk~iLGqtlTs~~~~g~~gS~a~g~vh 323 (526) T protein:vir:99 255 VTGL--GHAAAGIIPETMA--------IDFQQAAQG-SSEPFLAMMRQSEDAISKAVLGGTLTSTTSQSGGGAFALGQVH 323 (526) T ss_pred HHHH--hhCcEEEecCCce--------eEEeecCCC-CHHHHHHHHHHHHHHHHHHHhhhhhccccccCcchhhhHHHHH Confidence 4444 6789999999985 556666544 34569999999999999999999999974 356999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcC-CccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHH Q lcl|NC_016071. 379 QSIHGHFVQRDIDIIVEAFNKNLIPQLLALNDIRL-SDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKI 457 (516) Q Consensus 379 ~ev~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~-~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i 457 (516) ++|+++++++|+++|++|||+|||++|+++||+.. 
+...+|+|+|+..+++|++++++++++|+++|+.++ .+|+ T Consensus 324 ~~v~~di~~aDa~~i~~tln~~Li~~l~~~N~~~~~~~~~~p~~~~~~~e~eDl~~~a~~~~~L~~~G~~i~----~~~i 399 (526) T protein:vir:99 324 NEVRHDLLASDARQLAATLSRDLLWPLLVLNRPGSPDVRRAPRLVFDLREQADITSMAQSIPALVNVGLEIP----SAWV 399 (526) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcCCccccceEEeCCCCcccHHHHHHHHHHHHhCCCccC----HHHH Confidence 99999999999999999999999999999997543 336789999999999999999999999999999765 5899 Q ss_pred HHHcCCCCCCCcccccCcccccCCCCCCcccccccccCCCCCcccccccccchhhhhcC Q lcl|NC_016071. 458 LEVGGFDEEIPEDMSTDELLKLLGQDTSRSGDGMTAGSNGNGTGKISSTRDNSVSNMDN 516 (516) Q Consensus 458 ~e~~Glp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~ 516 (516) +++||||.+.++++......++.++...+ +.................+.....++. T Consensus 400 ~e~~Gip~~~~~e~~l~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~d~~l~~ 455 (526) T protein:vir:99 400 YDKLGIPQPAKNEPVLRSAAQPAILSRQH---GQRVAALATIVGPRYGDQQALDKALAD 455 (526) T ss_pred HHHhCCCCCCCcccccCCCCCCccccccc---ccccccccccccccCcchhhHHHHHHH Confidence 99999999988777655443333222221 111111111111111111111111111 No 5 >protein:vir:103860 Length: 528 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938234;genbank:gi:38229139;genbank:GeneID:2648175 Probab=100.00 E-value=3.9e-109 Score=614.94 Aligned_cols=434 Identities=13% Similarity=0.095 Sum_probs=324.0 Q ss_pred CCccccCcc-----------------cccchhhhccc--CCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHH Q lcl|NC_016071. 1 MSTRFAQPS-----------------EVVKAGNENLA--VSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAM 61 (516) Q Consensus 1 ~~~r~~~~~-----------------~~~~~~~~~p~--~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m 61 (516) |+.=..+-+ .+.+....+|+ ++|.|+. 
+.+..++ .+++ ...+++|++| T Consensus 1 ~~~~~d~~g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~----------~il~~a~-~gd~--~~~~~L~~~m 67 (528) T protein:vir:10 1 MAAIVDIYGNPLRTQQLRKQQTAHLAGLAKEFANHPAKGLTPAKLA----------HILIEAE-QGHL--QAQAELFMDM 67 (528) T ss_pred CCeeECCCCCccccccccchhhhhhhhhhhhhcccCCCCCCHHHHH----------HHHHhhh-CCCH--HHHHHHHHHH Confidence 433222222 22223333332 2221111 1122222 2222 2467899999 Q ss_pred h-hChHHHHHHHHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHHHHHhhcceeeeEEEee Q lcl|NC_016071. 62 K-QDHTVSTALDTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAATFNEYGFSIFEKVYRT 140 (516) Q Consensus 62 ~-~D~~v~s~l~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~lda~~~G~S~~Eivw~~ 140 (516) + +|+||+|+|++||++|++++|+|++.. .++++++++|++|+++|+++. +|+++|.+||+|++|||||+|++|+. T Consensus 68 ~e~D~~i~s~l~~Rk~av~~~~w~I~p~~-~~~~~~~~~a~~v~~~l~~~~---~f~~~i~~~lda~~~G~s~~Ei~w~~ 143 (528) T protein:vir:10 68 EERDAHLFAEMSKRKRAVLGLDWTIEPPR-NASAAEKADAEYLHELLLDLE---GIEDLMLDCMDGVGHGYSAIELDWSL 143 (528) T ss_pred HhhChHHHHHHHHHHHHHhcCCceEecCC-CCCHHHHHHHHHHHHHHhCCc---cHHHHHHHHHhhhhhcceeEEEEEee Confidence 8 699999999999999999999987643 346778999999999998764 49999999999999999999999998 Q ss_pred cccccccccceeeccccccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCcccccc Q lcl|NC_016071. 
141 ESAPSKYAGYITIDKIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPI 220 (516) Q Consensus 141 ~~~~~~~~g~~~~~~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~ 220 (516) + +|.+.++++.+||| +||.|++++++.+.++++ ..+|++||+ T Consensus 144 ~------~g~~~~~~~~~r~~------~~f~~~~~~~~~l~~~~~--------------------------~~~g~~l~~ 185 (528) T protein:vir:10 144 Q------GREWLPQAFDHRPQ------SWFQLNPDDQDELRLRDN--------------------------SIAGEVLQP 185 (528) T ss_pred c------CCceeEEEeeeecc------cceeeccCCCcEEeccCC--------------------------CCCceeecC Confidence 6 46778889999997 699999998877665542 346789999 Q ss_pred ccEEEEeecCcCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHH Q lcl|NC_016071. 221 NKLMVMSLGGTESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMAD 300 (516) Q Consensus 221 ~k~i~~~~~~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~ 300 (516) .|||+|+|++++|||||.||||.|||+|+||++++++|+.|+||||+|++++++|+ +++++|++.+ .++ T Consensus 186 ~k~iv~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~--------~a~~~ek~~L---~~a 254 (528) T protein:vir:10 186 FGWIMHKPRSRSGYVARSGLFRVLAWPYLFKHYSTADLAEMLEIYGLPIRLGKYPP--------GTPDEEKVTL---LRA 254 (528) T ss_pred CCeEEEeecCCCCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCCeEEEecCC--------CCCHHHHHHH---HHH Confidence 99999999999999999999999999999999999999999999999998877654 4555665544 344 Q ss_pred HHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccC-C-ccchhhHHHHH Q lcl|NC_016071. 
301 AANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGN-D-GQGSYNLSESK 378 (516) Q Consensus 301 ~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~-~-~~GS~Al~~vh 378 (516) +.++ |+++++|||.||+ |||+++++++ ...|.+||+|||++|||+||||||||++ + ++||+|+|+|| T Consensus 255 l~~i--~~~~~~iiP~~~~--------ie~~ea~~~~-~~~f~~li~~~d~~Isk~iLGqtlTs~~~~g~~gS~Alg~vh 323 (528) T protein:vir:10 255 VTGL--GHAAAGIIPESMS--------IDFQEASKGS-AEPFMAMMRWCDDSMSKAILGGTLTSQTSESGGGAYALGQVH 323 (528) T ss_pred HHHH--hhCcEEEecCCce--------eEEeecCCCC-hhHHHHHHHHHHHHHHHHHhhhhhhccccccccchhhhHHHH Confidence 4443 6789999999985 6666665544 4579999999999999999999999964 3 46899999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcC-CccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHH Q lcl|NC_016071. 379 QSIHGHFVQRDIDIIVEAFNKNLIPQLLALNDIRL-SDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKI 457 (516) Q Consensus 379 ~ev~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~-~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i 457 (516) ++|+++++++|+++|++|||+|||++||++||+.. +...+|+|+|+..+++|++++++++++|+++|+.++ ++|+ T Consensus 324 ~~v~~di~~aDa~~i~~tln~~li~~l~~~N~~~~~~~~~~p~~~~~~~e~eDl~~~a~~~~~L~~~G~~i~----~~~i 399 (528) T protein:vir:10 324 NEVRHDLLAADARQLAATLSRDLLWPLLVLNRSGNLDARRAPRLVFDLKDRADLAAMATSLPPLVKLGVQVP----VNWV 399 (528) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCccccceEEecCCCcccHHHHHHHHHHHHhCCCCCC----HHHH Confidence 99999999999999999999999999999996432 346789999999999999999999999999999665 6899 Q ss_pred HHHcCCCCCCCcccccCcccccCCCCCCcccccccccCCCCCccccc---ccccchhhhhcC Q lcl|NC_016071. 458 LEVGGFDEEIPEDMSTDELLKLLGQDTSRSGDGMTAGSNGNGTGKIS---STRDNSVSNMDN 516 (516) Q Consensus 458 ~e~~Glp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~d~~~~~~~~ 516 (516) +++||||.|.++++....... .++....+.++....+......... .+-|...+..+. 
T Consensus 400 ~e~~gip~p~~~e~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~ 460 (528) T protein:vir:10 400 QEQLGIPLPANGEAVLGDQAG-AGIAQLSRRPGPRIAALAQVIGPRYRDQEALDQVLASLPA 460 (528) T ss_pred HHHhCCCCCCCCcccccCCCc-ccccccCcccccccccccccccccccccchHHHHHHHHHH Confidence 999999998877665533222 2222222222222221111111111 111121111111 No 6 >protein:vir:1986 Length: 512 # NCBI annotation: Hypothetical protein # Family: family:all:313 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050633;genbank:gi:9633520;genbank:GeneID:2636304 Probab=100.00 E-value=4.6e-107 Score=603.55 Aligned_cols=428 Identities=14% Similarity=0.099 Sum_probs=321.7 Q ss_pred CC-----------------ccccCcccccchhhhccc--CCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHH Q lcl|NC_016071. 1 MS-----------------TRFAQPSEVVKAGNENLA--VSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAM 61 (516) Q Consensus 1 ~~-----------------~r~~~~~~~~~~~~~~p~--~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m 61 (516) |+ +...+.+.+.+....+|+ ++|.|+. +.+..++.. +++ ...++|++| T Consensus 1 m~~~~d~~g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~----------~iL~~a~~g-d~~--~~~~L~~dm 67 (512) T protein:vir:19 1 MGRILDISGQPFDFDDEMQSRSDELAMVMKRTQEHPSSGVTPNRAA----------QMLRDAERG-DLT--AQADLAFDM 67 (512) T ss_pred CcceeCCCCCccccccccccccchhcccchhhccccccCCCHHHHH----------HHHHHhhCC-CHH--HHHHHHHHH Confidence 33 333333334444444443 2222211 112223332 222 245778888 Q ss_pred h-hChHHHHHHHHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHHHHHhhcceeeeEEEee Q lcl|NC_016071. 62 K-QDHTVSTALDTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAATFNEYGFSIFEKVYRT 140 (516) Q Consensus 62 ~-~D~~v~s~l~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~lda~~~G~S~~Eivw~~ 140 (516) + +|+||+|+|++||++|++++|+|++.. .+++.++++|++|+++|+++. +|+++|++||||++|||||+|++|+. 
T Consensus 68 ~~~D~hi~s~l~~Rk~av~~~~w~I~p~~-~~~~~~~~~a~~v~~~l~~~~---~f~~~~~~lldA~~~G~s~~Ei~w~~ 143 (512) T protein:vir:19 68 EEKDTHLFSELSKRRLAIQALEWRIAPAR-DASAQEKKDADMLNEYLHDAA---WFEDALFDAGDAILKGYSMQEIEWGW 143 (512) T ss_pred HhhChHHHHHHHHHHHHHhCCCceEecCC-CCCHHHHHHHHHHHHHHhcCC---CHHHHHHHHHhhhhhcceeeeeEeee Confidence 6 699999999999999999999987543 346788999999999998764 49999999999999999999999988 Q ss_pred cccccccccceeeccccccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCcccccc Q lcl|NC_016071. 141 ESAPSKYAGYITIDKIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPI 220 (516) Q Consensus 141 ~~~~~~~~g~~~~~~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~ 220 (516) + +|.+.++++.+||| +||.|+.++++.+.+++. ..+|++||+ T Consensus 144 ~------~g~~~~~~~~~r~~------~~f~~~~~~~~~lr~~~~--------------------------~~~G~~l~~ 185 (512) T protein:vir:19 144 L------GKMRVPVALHHRDP------ALFCANPDNLNELRLRDA--------------------------SYHGLELQP 185 (512) T ss_pred e------CCceeeeeeeeecc------ccceeccCCCcEEEecCC--------------------------CCCceeecC Confidence 6 46778889999987 699999998776655432 346788999 Q ss_pred ccEEEEeecCcCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHH Q lcl|NC_016071. 
221 NKLMVMSLGGTESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMAD 300 (516) Q Consensus 221 ~k~i~~~~~~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~ 300 (516) .|||+|+|++++|||||.||||.|||+|+||++++++|+.|+||||+|++++++|+ ++++++++.+ .++ T Consensus 186 ~k~i~~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~--------~a~~~ek~~L---~~a 254 (512) T protein:vir:19 186 FGWFMHRAKSRTGYVGTNGLVRTLIWPFIFKNYSVRDFAEFLEIYGLPMRVGKYPT--------GSTNREKATL---MQA 254 (512) T ss_pred CceEEEeccCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHcCCCeeEEecCC--------CCCHHHHHHH---HHH Confidence 99999999999999999999999999999999999999999999999988776554 4556666543 444 Q ss_pred HHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHHH Q lcl|NC_016071. 301 AANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQS 380 (516) Q Consensus 301 ~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~e 380 (516) +.++ |+++++|||.||+ |||+++++ ++...|.+||+|||++|||+||||||||+++++||+|+|+||++ T Consensus 255 l~~~--~~~a~~iiP~~~~--------ie~~ea~~-~~~~~y~~li~~~d~~Isk~iLGqtlTs~~g~~Gs~a~~~vh~e 323 (512) T protein:vir:19 255 VMDI--GRRAGGIIPMGMT--------LDFQSAAD-GQSDPFMAMIGWAEKAISKAILGGTLTTEAGDKGARSLGEVHDE 323 (512) T ss_pred HHHH--hhCcEEEecCCce--------EEEeecCC-CCHHHHHHHHHHHHHHHHHHHhhhhhcccccccchhhHHHHHHH Confidence 4443 7789999999985 55666654 34467999999999999999999999999878899999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcC-CccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHH Q lcl|NC_016071. 381 IHGHFVQRDIDIIVEAFNKNLIPQLLALNDIRL-SDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILE 459 (516) Q Consensus 381 v~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~-~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e 459 (516) |+++++++|+++|++|||+|||++||++||+.. 
+...+|+|+|+..+++|++++++++++|+ +|+.++ ++|+++ T Consensus 324 v~~di~~aDa~~i~~tln~~li~~l~~~N~~~~~~~~~~p~~~f~~~e~eDl~~~a~~~~~l~-~G~~i~----~~~i~e 398 (512) T protein:vir:19 324 VRREIRNADVGQLARSINRDLIYPLLALNSDSTIDINRLPGIVFDTSEAGDITALSDAIPKLA-AGMRIP----VSWIQE 398 (512) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCccccceEEecCCChhhHHHHHHHHHHHh-cCCCCC----HHHHHH Confidence 999999999999999999999999999997543 33568999999999999999999999996 899665 689999 Q ss_pred HcCCCCCCCcccccCcccccCCCCCCcccccccccCCCCCcccccccccchhhhhcC Q lcl|NC_016071. 460 VGGFDEEIPEDMSTDELLKLLGQDTSRSGDGMTAGSNGNGTGKISSTRDNSVSNMDN 516 (516) Q Consensus 460 ~~Glp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~ 516 (516) +||||.+.+++.......+. ++ ++....++.+....+.. ..-|..+..+.+ T Consensus 399 ~~Gip~~~~~e~~~~~~~~~-~~----~~~~~~~~~~~~~~~~~-~~~d~~~~~~~~ 449 (512) T protein:vir:19 399 KLHIPQPVGDEAVFTIQPVV-PD----NGSQKEAALSAEDIPQE-DDIDRMGVSPED 449 (512) T ss_pred HhCCCCCCCccccccCCCcc-cc----ccccccccccccCCCch-hhHhHHhhhHHH Confidence 99999988766554322111 11 11111111111111111 111111111222 No 7 >protein:vir:79511 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1870 # MgeName: P74-26 # Cross-refs: genbank:acc:YP_001468055;genbank:gi:157265497;genbank:GeneID:5600628 Probab=100.00 E-value=6.1e-106 Score=597.42 Aligned_cols=440 Identities=16% Similarity=0.097 Sum_probs=334.9 Q ss_pred CCccccCcccccchh-hhccc----CCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhhChHHHHHHHHHH Q lcl|NC_016071. 1 MSTRFAQPSEVVKAG-NENLA----VSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQDHTVSTALDTKY 75 (516) Q Consensus 1 ~~~r~~~~~~~~~~~-~~~p~----~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk 75 (516) |.+|..-+.+..... 
...|+ +...+..-++ ..+.|...++..+.||++.++++|++|++|+||+|+|++|| T Consensus 1 m~k~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~g~~~~~~~~iLr~~~~~~ly~~m~~D~hi~s~l~~Rk 76 (448) T protein:vir:79 1 MAKRGRKPKELVPGPGSIDPSDVPKLEGASVPVMS----TSYDVVVDREFDELLQGKDGLLVYHKMLSDGTVKNALNYIF 76 (448) T ss_pred CCCCCCCCccccCcccccccccchhhhhhhhhhcc----cccccccccchhHhhccccchHHHHHHhhChHHHHHHHHHH Confidence 988887766543221 11111 1111111111 12345555666778999999999999999999999999999 Q ss_pred HHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhc---cCcCCHHHHHHHHHHHHhhcceeeeEEEeeccccccccccee Q lcl|NC_016071. 76 VFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNL---ANQQTLRDIARSAATFNEYGFSIFEKVYRTESAPSKYAGYIT 152 (516) Q Consensus 76 ~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~---~~~~~~~~~l~~~lda~~~G~S~~Eivw~~~~~~~~~~g~~~ 152 (516) ++|++++|+|++ +++++.++++||+|+++|... ....+|+++|.+||||++|||||+|++|+... +|++. T Consensus 77 ~av~~~~w~v~p--~~~~~~~~~~ae~v~~~l~~~~~~~~~~~f~~~~~~~lda~~~G~s~~Eivw~~~~-----~g~~~ 149 (448) T protein:vir:79 77 GRIRSAKWYVEP--ASTDPEDIAIAAFIHAQLGIDDASVGKYPFGRLFAIYENAYIYGMAAGEIVLTLGA-----DGKLI 149 (448) T ss_pred HHHhcCCceEec--CCCCHHHHHHHHHHHHHhhhhhhhhccCCHHHHHHHHHHhhhhcceeEEEEeeecC-----CCcee Confidence 999999999864 566788999999999999753 23457999999999999999999999998642 68899 Q ss_pred eccccccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcC Q lcl|NC_016071. 153 IDKIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTE 232 (516) Q Consensus 153 ~~~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~ 232 (516) +++|.+|||.+|+ ||.|+.|+++.+..+.+.... .....++++||..||++|. 
.+++ T Consensus 150 ~~~l~~r~~~~~~---~f~~~~d~~l~~~~~~~~~~~-------------------~~~~~~~~~lP~~~~i~~~-~~~~ 206 (448) T protein:vir:79 150 LDKIVPIHPFNID---EVLYDEEGGPKALKLSGEVKG-------------------GSQFVSGLEIPIWKTVVFL-HNDD 206 (448) T ss_pred cccccccCCcccc---ceeeecCCceEEeecCCcccc-------------------cccCCCccccccceEEEEe-cCcc Confidence 9999999998775 799999998877665443211 1123467889999999875 5799 Q ss_pred CccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEE Q lcl|NC_016071. 233 SNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYF 312 (516) Q Consensus 233 g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~ 312 (516) |||||.||||.|||+|+||++++++|+.|+||||+|++++++|++ ++++++ ....|.++++++++|+++++ T Consensus 207 g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vgky~~g--------a~~~~~-~~~~l~~av~~i~~g~~a~~ 277 (448) T protein:vir:79 207 GSFTGQSALRAAVPHWLAKRALILLINHGLERFMIGVPTLTIPKS--------VRQGTK-QWEAAKEIVKNFVQKPRHGI 277 (448) T ss_pred CCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCceEEEecCCC--------CCcCHH-HHHHHHHHHHHHhcCCceEE Confidence 999999999999999999999999999999999999999888764 222222 23456778888899999999 Q ss_pred EeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_016071. 
313 ILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQSIHGHFVQRDIDI 392 (516) Q Consensus 313 iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~ev~~~~~~aDa~~ 392 (516) |||.||+ |||++++|++ .+|.++|+|||++|||+|||||||+++++|++.++..+|.+++++++++|+++ T Consensus 278 iiP~~~~--------ie~~ea~~~~--~~~~~~i~~~d~~Isk~iLGqtlTs~~~~g~~~~~~~~~~~v~~~~~~aDa~~ 347 (448) T protein:vir:79 278 ILPDDWK--------FDTVDLKSAM--PDAIPYLTYHDAGIARALGIDFNTVQLNMGVQAINIGEFVSLTQQTIISLQRE 347 (448) T ss_pred EecCCce--------EEEEecCCCc--ccHHHHHHHHHHHHHHHHhhhhhccccccchhhhhhhhHHHHHHHHHHHHHHH Confidence 9999985 6666666544 45778999999999999999999998766444455557999999999999999 Q ss_pred HHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccc Q lcl|NC_016071. 393 IVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMS 472 (516) Q Consensus 393 i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~ 472 (516) |++|||+|||++||++|| ++..++|+|+|+.++++|++++|+++++|++++ +..++|+++++|+|++.++++. T Consensus 348 i~~tln~~li~~l~~lNf--g~~~~~P~~~f~~~e~~Dl~~~a~~~~~l~~~~-----~~~~~~~~~~~~~p~~~~~~~~ 420 (448) T protein:vir:79 348 FASAVNLYLIPKLVLPNW--PSATRFPRLTFEMEERNDFSAAANLMGMLINAV-----KDSEDIPTELKALIDALPSKMR 420 (448) T ss_pred HHHHHHHHHHHHHHHhcC--CCcCCCcEEEecCCChHHHHHHHHHhhhhhccc-----hhhHHHHHHhhcCCCCCCCccc Confidence 999999999999999994 677889999999999999999999999999875 3347899999999988766543 Q ss_pred cCcccccCCCCCCcccccccccCCCCCccccccccc Q lcl|NC_016071. 473 TDELLKLLGQDTSRSGDGMTAGSNGNGTGKISSTRD 508 (516) Q Consensus 473 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d 508 (516) .... ..+ +.++ ....+++.--.-+.-|. 
T Consensus 421 ~a~~---~~~---~~~~--~~~~~~~~~~~~~~~~~ 448 (448) T protein:vir:79 421 RALG---VVD---EVRE--AVRQPADSRYLYTRRRR 448 (448) T ss_pred cccC---CCC---cccc--cccCCccccchhhcccC Confidence 2111 111 1111 00011111111111111 No 8 >protein:vir:77981 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1843 # MgeName: P23-45 # Cross-refs: genbank:acc:YP_001467939;genbank:gi:157265380;genbank:GeneID:5600471 Probab=100.00 E-value=1.4e-105 Score=595.40 Aligned_cols=440 Identities=16% Similarity=0.108 Sum_probs=326.6 Q ss_pred CCccccCccccc-chhhhccc----CCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhhChHHHHHHHHHH Q lcl|NC_016071. 1 MSTRFAQPSEVV-KAGNENLA----VSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQDHTVSTALDTKY 75 (516) Q Consensus 1 ~~~r~~~~~~~~-~~~~~~p~----~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk 75 (516) |+++...++... .++...|. +...|..-++ ..+.|.......+.||++..+++|++|++|+||+|+|++|| T Consensus 1 m~kk~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~g~~~~~~~~iLr~~~~~~ly~~m~~D~hi~s~l~~Rk 76 (448) T protein:vir:77 1 MAKRGRKPKELVPGPGSIDPSDVPKLEGASVPVMS----TSYDVVVDREFDELLQGKDGLLVYHKMLSDGTVKNALNYIF 76 (448) T ss_pred CCCCCCCCcccCCcccccchhhhhhhccchhhhcc----cccccccccchhHhhccccchHHHHHHhhChHHHHHHHHHH Confidence 998876654332 22222221 1122211111 12344555555677999999999999999999999999999 Q ss_pred HHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhcc---CcCCHHHHHHHHHHHHhhcceeeeEEEeeccccccccccee Q lcl|NC_016071. 76 VFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLA---NQQTLRDIARSAATFNEYGFSIFEKVYRTESAPSKYAGYIT 152 (516) Q Consensus 76 ~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~---~~~~~~~~l~~~lda~~~G~S~~Eivw~~~~~~~~~~g~~~ 152 (516) ++|++++|+|++ +++++.++++|++|+++|.+.. ...+|+++|.+||||++|||||+|++|++.. +|++. 
T Consensus 77 ~av~~~~w~v~p--~~~~~~d~~~ae~v~~~l~~~~~~~~~~~f~~~i~~~lda~~~G~s~~Eivw~~~~-----dg~~~ 149 (448) T protein:vir:77 77 GRIRSAKWYVEP--ASTDPEDIAIAAFIHAQLGIDDASVGKYPFGRLFAIYENAYIYGMAAGEIVLTLGA-----DGKLI 149 (448) T ss_pred HHHhcCCceEec--CCCCHHHHHHHHHHHHHhhchhhhhccCCHHHHHHHHHHhhhhcceeEEEEEeecC-----CCcee Confidence 999999999864 5677889999999999997532 2457999999999999999999999998642 68899 Q ss_pred eccccccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcC Q lcl|NC_016071. 153 IDKIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTE 232 (516) Q Consensus 153 ~~~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~ 232 (516) +.+|.+|||.+++ ||.|+.++++.+..+.+...+ .....++++||..||++| |.+++ T Consensus 150 ~~~l~~r~~~~~~---~f~~~~~~~l~~~~~~~~~~~-------------------~~~~~~~~~lP~~~~i~~-~~~~~ 206 (448) T protein:vir:77 150 LDKIVPIHPFNID---EVLYDEEGGPKALKLSGEVKG-------------------GSQFVNGLEIPIWKTVVF-LHNDD 206 (448) T ss_pred eccccccCCCccc---eeeeecCCceEEEecCCcccc-------------------cccCCCccccccceEEEE-ecCCc Confidence 9999999998764 899999998877665432211 122346788999999877 45789 Q ss_pred CccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEE Q lcl|NC_016071. 
233 SNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYF 312 (516) Q Consensus 233 g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~ 312 (516) |||+|.||||.|||+|+||++++++|+.|+||||+|++++++|++ +++++ +....|.+++.++++|+++|+ T Consensus 207 g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vgky~~g--------a~~~~-~~~~~l~~av~~i~~g~~a~~ 277 (448) T protein:vir:77 207 GSFTGQSALRAAVPHWLAKRALILLINHGLERFMIGVPTLTIPKS--------VRQGT-KQWEAAKEIVKNFVQKPRHGI 277 (448) T ss_pred CCcccchHHHHHHHHHHHHHhhHHHHHHHHHHcCCceeEEecCCC--------CCCCH-HHHHHHHHHHHHHhcCCceEE Confidence 999999999999999999999999999999999999999887764 22222 223456778888889999999 Q ss_pred EeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_016071. 313 ILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQSIHGHFVQRDIDI 392 (516) Q Consensus 313 iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~ev~~~~~~aDa~~ 392 (516) |||.||+ |||++++|++ .+|.++|+|||++|||+||||||||+++++++.+....|.+++.+++++|+++ T Consensus 278 iiP~g~~--------ie~~ea~~~~--~~~~~~i~~~d~~Isk~iLGqtlTs~~~~g~~~~~~~~~~~v~~~~~~aDa~~ 347 (448) T protein:vir:77 278 ILPDDWK--------FDTVDLKSAM--PDAIPYLTYHDAGIARALGIDFNTVQLNMGVQAVNIGEFVSLTQQTIISLQRE 347 (448) T ss_pred EecCCce--------EEEEecCCCc--cCHHHHHHHHHHHHHHHHhccccccccccchhhhhhhhHHHHHHHHHHHHHHH Confidence 9999985 6666666544 35888999999999999999999998765333333345678999999999999 Q ss_pred HHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccc Q lcl|NC_016071. 393 IVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMS 472 (516) Q Consensus 393 i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~ 472 (516) |++|||+|||+|||++|| ++.+++|+|+|+..+++|++++|+++++|+ +++++++|||++.++... 
T Consensus 348 i~~tln~~Li~~l~~lNf--g~~~~~P~~~f~~~e~eDl~~~a~~~~~l~------------~~~~~~~~ip~~~~~~~~ 413 (448) T protein:vir:77 348 FASAVNLYLIPKLVLPNW--PGATRFPRLTFEMEERNDFSAAANLMGMLI------------NAVKDSEDIPTELKALID 413 (448) T ss_pred HHHHHHHHHHHHHHHhcC--CCCCCCCEEEecCCChhhHHHHHHHhHHHH------------HHHHHHhcCCccCCcCCC Confidence 999999999999999994 677889999999999999999999999886 368999999987655433 Q ss_pred cCcccccCCCCCCcccccccccCCCCCcccccccccch Q lcl|NC_016071. 473 TDELLKLLGQDTSRSGDGMTAGSNGNGTGKISSTRDNS 510 (516) Q Consensus 473 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 510 (516) ....++..++... +....+.....+.+....|... T Consensus 414 ~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~r~~~ 448 (448) T protein:vir:77 414 ALPSKMRRALGVV---DEVREAVRQPADSRYLYTRRRR 448 (448) T ss_pred CCchhcccccCCC---CCCCchhhcchhhHHHHhhhcC Confidence 2111110110000 0000000011111111111111 No 9 >protein:vir:98816 Length: 446 # NCBI annotation: hypothetical protein # Family: family:all:32558 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851097;genbank:gi:117530254;genbank:GeneID:4484480 Probab=100.00 E-value=2.5e-104 Score=588.56 Aligned_cols=417 Identities=14% Similarity=0.166 Sum_probs=314.5 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCC-c----ccHHHHHHHhh-ChHHHHHHHHH Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRW-P----CFLATVEAMKQ-DHTVSTALDTK 74 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~-~----~~~~~y~~m~~-D~~v~s~l~~R 74 (516) |--|.+... ....+.. ++. .++....++.+. -|-||. 
+ +.+++|++|++ |+||+|+|++| T Consensus 3 ~~~~~~p~~---~~~~~~~--~~~-------~~~~~~~g~~~~--D~~lr~~gg~~~~~~~l~~~m~e~D~~v~s~l~~R 68 (446) T protein:vir:98 3 MEVRNAPTP---AIRRRTI--YAM-------EHLGLATSYLSE--DGGYKRAGKPTYQQLSAWDEAAQTEPIIAQGLDSI 68 (446) T ss_pred ccccCCCch---hhhhhhh--hcc-------ccchhhcccCCc--chHhhhcCCChHHHHHHHHHHHhcchHHHHHHHHH Confidence 322222111 1111111 000 112223344321 122432 2 25699999985 99999999999 Q ss_pred HHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHHHHHhhcceeeeEEEeecccccccc----cc Q lcl|NC_016071. 75 YVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAATFNEYGFSIFEKVYRTESAPSKYA----GY 150 (516) Q Consensus 75 k~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~lda~~~G~S~~Eivw~~~~~~~~~~----g~ 150 (516) |++|++++|+|++ +++++|+||+++|+++. |+.++.+||||++|||||+|++|++.++.+.|. +. T Consensus 69 k~av~~~~w~V~p-------~~~~~a~~v~~~l~~~~----~~~~~~~~ldai~~G~s~~Eivw~~~~g~~~p~~~~d~~ 137 (446) T protein:vir:98 69 ALSVLNKVGPYQH-------GDKRIKKFIDDQLRNRA----KTWISHCVKSIMTYGFSLSEQIYAHGARDNMPATVLDDI 137 (446) T ss_pred HHHhhcCCceecC-------ccHHHHHHHHHHHhhcC----chhHHHHHHHHHhhCceeeeEEEeecccccccchhhccc Confidence 9999999999874 24689999999999874 677888899999999999999999988877653 33 Q ss_pred eeeccccccCchhcccccceeecCCCceeeeccccccccccccccccccccccc---cccccccCCCccccccccEEEEe Q lcl|NC_016071. 151 ITIDKIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMS---LVTNLTSSADEVFIPINKLMVMS 227 (516) Q Consensus 151 ~~~~~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~iP~~k~i~~~ 227 (516) ++++++. .+|.|+.+++++.+..+... .+..+.+...+++. 
..+.....++++.||..|||+|+ T Consensus 138 ~~~~~~~----------~r~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~iP~~kfi~~~ 204 (446) T protein:vir:98 138 VNYHPLQ----------VMLIANDNGRIVDGDTVTAS---QYKSGYWVPLPPYRIGDPPKKVDVVGSHVRLPSHKRLFIN 204 (446) T ss_pred ccccccc----------ceeeeccCCccccccccchh---hcccccccCcccchhhhhhhhcccCcccccccccceEEEE Confidence 4443322 24888988887766555432 22223333333222 22344566788999999999999 Q ss_pred ecCcCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHH--HHHHHHHHHHhh Q lcl|NC_016071. 228 LGGTESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEM--VQGLMADAANAH 305 (516) Q Consensus 228 ~~~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~--l~~l~~~~~~~~ 305 (516) |+++++||||.||||.|||+|+||++++++|+.|+||||+|++++++||+...+...+++..+.++ ...|...+.++ T Consensus 205 ~~~~~~~p~G~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vGkyp~ga~~~~~~~~~~~~~~~~~~~~L~~av~~~- 283 (446) T protein:vir:98 205 YNTKGNNPWGTSCLTSVLDYSIFKRAFRDMMLIALDRYGTPLIYVIVPPGNTGVVEEAPDGTEITTTIAEQAEDALRRL- 283 (446) T ss_pred ecCCCCCccccchHHHHHHHHHHHHhhHHHHHHHHhHcCCceeEEeecCCCCcccccchhHHHHHHHHHHHHHHHHHhc- Confidence 999999999999999999999999999999999999999999999999988776665555444332 23355555554 Q ss_pred cccceEEEe-----ccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhccccccc--CCccchhhHHHHH Q lcl|NC_016071. 
306 AGEQAYFIL-----PSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLG--NDGQGSYNLSESK 378 (516) Q Consensus 306 ~g~~a~~ii-----P~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~--~~~~GS~Al~~vh 378 (516) ++++++|| |+|| +|||+++++++ ..+|+++|+|||++|||+|||||||++ ++++||+|+|+|| T Consensus 284 -~~da~~ii~~~~~P~g~--------eie~~ea~~~~-~~~~~~~i~~~d~~IskaiLg~~Ltl~~~~~~~GS~ala~vh 353 (446) T protein:vir:98 284 -STDSGLVLTQLSKEQPV--------QVGALTTGNNF-SDSFERAISLCDNNMLMGMGIPNLLVQNRETTFGTGRASEIQ 353 (446) T ss_pred -cccceeeeecccCCCCc--------eEEeeccccCC-hhhHHHHHHHHHHHHHHHHhcccccccccccccchhhhHHHH Confidence 67889998 7776 47777776654 457999999999999999999999865 3456999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCcccc-----ceEEecCcCchhHHHHHHHHHHHHhCCcccccHHH Q lcl|NC_016071. 379 QSIHGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDM-----PKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTV 453 (516) Q Consensus 379 ~ev~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~-----P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~ 453 (516) ++||.+++++|+++||+|||+|||+|||++||+ +...+ |+++|+..+++|++++|+++++|+++|+++++ . T Consensus 354 ~~V~~d~~~aDa~~i~~tln~~Li~~l~~lNf~--~~~~~~~~~~~~~~~~~~e~eDl~~~a~~~~~L~~~G~~~p~--~ 429 (446) T protein:vir:98 354 LELFDGKINSIFDTVIHAFTEQVIGNLIRLNFD--PALYPLASNTGYITRLPGRATDLAALVEAIKQMHDMGFLVDG--D 429 (446) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC--ccccccccccccceeccCChhhHHHHHHHHHHHHhCCccccc--c Confidence 999999999999999999999999999999964 33333 34578888999999999999999999998764 3 Q ss_pred HHHHHHHcCCCCCCCcc Q lcl|NC_016071. 454 INKILEVGGFDEEIPED 470 (516) Q Consensus 454 ~~~i~e~~Glp~~~~~~ 470 (516) ++|++++||||++.++. 
T Consensus 430 ~~~ire~~giP~~~~~~ 446 (446) T protein:vir:98 430 KDHIRSITGLPDAISST 446 (446) T ss_pred HHHHHHHhCcCCCCCCC Confidence 78999999999876654 No 10 >protein:vir:99853 Length: 488 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164068;genbank:gi:56692600;genbank:GeneID:3192581 Probab=100.00 E-value=7.9e-104 Score=585.82 Aligned_cols=413 Identities=15% Similarity=0.128 Sum_probs=312.9 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccc---c--CCcccHHHHHHHhhChHHHHHHHHHH Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEE---L--RWPCFLATVEAMKQDHTVSTALDTKY 75 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~---l--r~~~~~~~y~~m~~D~~v~s~l~~Rk 75 (516) ++.-.++...+.+....+ .....+|+ | +.+.++++|++|++|+||+++|++|| T Consensus 6 l~~e~at~~~~~d~~~~~----------------------~~~l~~~~~~il~~a~~g~~~~y~~l~~D~~i~s~l~~rk 63 (488) T protein:vir:99 6 LGREIATSGDGRDITRPF----------------------ISGLQVPNDSILQRRGGNDLRVYEEILSDAQVKTVWGQRQ 63 (488) T ss_pred hhHHHHHHHhhhhhhccc----------------------cCCCCCCChHHHHhhccCCHHHHHHHhhChHHHHHHHHHH Confidence 121222222222222211 11122222 1 24557899999999999999999999 Q ss_pred HHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHHHHHhhcceeeeEEEeecccccccccceeecc Q lcl|NC_016071. 
76 VFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAATFNEYGFSIFEKVYRTESAPSKYAGYITIDK 155 (516) Q Consensus 76 ~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~lda~~~G~S~~Eivw~~~~~~~~~~g~~~~~~ 155 (516) ++|++++|+|++ +.+++.++++|++|+++|+++ +|+++|++||+|++|||||+|++|+++ +|++.+++ T Consensus 64 ~av~~~~w~i~p--~~~~~~~~~~ae~v~~~l~~~----~~~~~l~~~lda~~~G~s~~Ei~w~~~------~g~~~~~~ 131 (488) T protein:vir:99 64 LAVVSREWKVEA--GGDRPIDQAAAEHLEQQLQRV----GWDRVTSKMLFGVFYGYAVSELIYGRD------DRYITLEA 131 (488) T ss_pred HHHhcCCceEEc--CCCChHHHHHHHHHHHHHhCC----CHHHHHHHHHhhhhhcceeEEEEEeec------CCeeeEee Confidence 999999999875 456788999999999999864 599999999999999999999999876 57888999 Q ss_pred ccccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccc-cccEEEEeecCcCCc Q lcl|NC_016071. 156 IAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIP-INKLMVMSLGGTESN 234 (516) Q Consensus 156 l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP-~~k~i~~~~~~~~g~ 234 (516) |.+||| +||.|++++++++..+++ ..+|+++| +.||++|+|++++|| T Consensus 132 l~~r~~------~~f~~d~~~~l~~~~~~~--------------------------~~~g~~lp~~~~~i~~~~~~~~g~ 179 (488) T protein:vir:99 132 IKVRNR------RRFRYDQDGGLRLLTPNN--------------------------MFEGEPCPAPYFWHFSTGADNDDE 179 (488) T ss_pred eeeecc------cceeecCCCceEEeccCC--------------------------CCCccccccCceEEEEeecCCCCC Confidence 999998 589999999877655432 23577886 568999999999999 Q ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEe Q lcl|NC_016071. 
235 PAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFIL 314 (516) Q Consensus 235 p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~ii 314 (516) |||.||||.|||+|+||++++++|+.|+||||+|++++++|| .++++++++.+ .+++.++ |+++++|| T Consensus 180 p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~-------~~a~~~ek~~l---~~av~~~--~~~~~~vi 247 (488) T protein:vir:99 180 PYGLGLAHWLYWPVFFKRNGIKFWLIFLDKFGMPTAVGRYDD-------KTATPEDKAKL---LAALHAI--QTDSAIIM 247 (488) T ss_pred cccchHHHHHHHHHHHHHhhHHHHHHHHHHcCCceeeeecCC-------CCCCHHHHHHH---HHHHHHH--hcCcEEEe Confidence 999999999999999999999999999999999998887654 23455565544 3444443 67899999 Q ss_pred ccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016071. 315 PSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQSIHGHFVQRDIDIIV 394 (516) Q Consensus 315 P~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~ev~~~~~~aDa~~i~ 394 (516) |.||+ |||+++++++ ...|.++|+|||++|||+|||||||++++ +||+|+|+||++|+++++++|+++|+ T Consensus 248 P~~~~--------ie~~ea~~~~-~~~~~~li~~~d~~Isk~iLGqtlts~~~-~Gs~a~~~vh~~v~~d~~~aDa~~i~ 317 (488) T protein:vir:99 248 PAGMQ--------AELLEAGRSG-TADYKTLHDTMDATIAKVGLGQVASTQGT-PGRLGNDDLQADVRLDLVKADADLIC 317 (488) T ss_pred cCCce--------eEEeecCCCC-hHHHHHHHHHHHHHHHHHHhhhhhccccc-ccchhhHHHHHHHHHHHHHHHHHHHH Confidence 99985 6666665544 45799999999999999999999999754 58999999999999999999999999 Q ss_pred HHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhC-CcccccHHHHHHHHHHcCCCCCCCccccc Q lcl|NC_016071. 395 EAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAV-GYLPKTPTVINKILEVGGFDEEIPEDMST 473 (516) Q Consensus 395 ~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~-G~~~~~~~~~~~i~e~~Glp~~~~~~~~~ 473 (516) ++||+|||++|+++|| ++..+|+|+|+..+++|++++++++++|+++ |+.++ ++|++++||||.+.++++.. 
T Consensus 318 ~tln~~li~~l~~~N~---~~~~~p~~~~~~~e~edl~~~a~~~~~l~~~~G~~i~----~~~i~e~~Gip~~~~~~~~~ 390 (488) T protein:vir:99 318 ESFNLGPARWLTEWNF---PGAQPPRVYRVIEEPEDITAKAERDEKVFRMSGFRPT----RGYVQETYGVEVESTQAEAT 390 (488) T ss_pred HHHHHHHHHHHHHhCc---CCcCCceeEecCCCcccHHHHHHHHHHHHhhcCCCCC----HHHHHHHcCCCCcccccccc Confidence 9999999999999994 4567899999999999999999999999997 87554 68999999999876655432 Q ss_pred CcccccCCCCCCcccccccccCCCCCcccccccccchhhhhcC Q lcl|NC_016071. 474 DELLKLLGQDTSRSGDGMTAGSNGNGTGKISSTRDNSVSNMDN 516 (516) Q Consensus 474 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~ 516 (516) . +.+. ....++.+. ....+..+.+...+.+.-++. T Consensus 391 ~----~~~~--~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~ 425 (488) T protein:vir:99 391 A----PTPS--TEFAEGDQP--SDPAAAMAPQLAEAMQPVVGN 425 (488) T ss_pred c----CCCc--ccCCCCCCC--CCchHHHHHHHHHHHHHHHHH Confidence 1 1111 111111110 011111111111111111111 No 11 >protein:vir:79063 Length: 491 # NCBI annotation: gp3 # Family: family:all:313 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111203;genbank:gi:134288841;genbank:GeneID:4960737 Probab=100.00 E-value=1.4e-101 Score=573.51 Aligned_cols=422 Identities=15% Similarity=0.109 Sum_probs=308.1 Q ss_pred CCccccCc-ccccchhhhcccCCCCcccccchHHHHHHHH----HHHhhcccccC-CcccHHHHHHHhhChHHHHHHHHH Q lcl|NC_016071. 1 MSTRFAQP-SEVVKAGNENLAVSRLRTGELGSGALSQLRA----ESEVMKVEELR-WPCFLATVEAMKQDHTVSTALDTK 74 (516) Q Consensus 1 ~~~r~~~~-~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~----~~~~~~~~~lr-~~~~~~~y~~m~~D~~v~s~l~~R 74 (516) |+..+--+ ++..+......++ ..+|.+.. +.+.. 
-......+.|| .+.++++|++|++|+||+|+|++| T Consensus 1 ~~~~i~~~~g~~~~~~~~~~~~----~~~ia~~~-~~~~~~~~~~~~p~~~~il~~~~~~~~~y~~m~~D~~i~s~l~~R 75 (491) T protein:vir:79 1 MSKGLWVSPTEFVKFGEPDKSL----SSQIATRA-RSIDFFALGMYLPNPDPVLKALGKDIRVYRELRADAHVGGCVRRR 75 (491) T ss_pred CCCeeeCCCCCcccccccchhH----HHHHhhhc-cccccccccccCcchhHHHhhccCCHHHHHHHhhChHHHHHHHHH Confidence 66554221 2222111111111 11222110 00100 00111122233 456799999999999999999999 Q ss_pred HHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHHHHHhhcceeeeEEEeecccccccccceeec Q lcl|NC_016071. 75 YVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAATFNEYGFSIFEKVYRTESAPSKYAGYITID 154 (516) Q Consensus 75 k~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~lda~~~G~S~~Eivw~~~~~~~~~~g~~~~~ 154 (516) |++|++++|+|++..+ +.+++++|+++|+++ +|+++|++||+|++|||||+|++|+.+ +|++.++ T Consensus 76 k~av~~~~w~i~~~~~-----~~~~a~~i~e~l~~~----~~~~~i~~~lda~~~G~s~~Ei~w~~~------~g~~~~~ 140 (491) T protein:vir:79 76 KAAVKALEWGLDRGKA-----KSRVAKSIADVFADL----DLSRIATEMLDAVLYGYQPMEITWGKV------GNYIVPI 140 (491) T ss_pred HHHHhCCCcEEecCCC-----CHHHHHHHHHHHhcC----CHHHHHHHHHHhhhhcceeEEEEEeec------CCeeeEE Confidence 9999999999876432 246789999999865 599999999999999999999999886 5788889 Q ss_pred cccccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCc Q lcl|NC_016071. 
155 KIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESN 234 (516) Q Consensus 155 ~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~ 234 (516) +|.+||| +||.|+.++++++..++ +..+|+++|++|||+|+|++++|| T Consensus 141 ~l~~r~~------~~f~~d~~~~l~l~~~~--------------------------~~~~g~~lp~~k~i~~~~~~~~g~ 188 (491) T protein:vir:79 141 DVVGKPA------DWFVYDPENQLRFRSKE--------------------------HWVQGEELPARKFLVPRQEATYLN 188 (491) T ss_pred eeeeecc------cceeeccCCceEEeecC--------------------------CCCCceeecCCCeEEEEecCCCCC Confidence 9999997 68999999987765443 235688999999999999999999 Q ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEe Q lcl|NC_016071. 235 PAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFIL 314 (516) Q Consensus 235 p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~ii 314 (516) |||.||||.|||+|+||++++++|+.|+||||+|++++++|+ +++++|++.+ .+++.++ |+++++|| T Consensus 189 p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~~G~P~~igky~~--------~a~~~ek~~l---~~al~~~--~~~a~~vi 255 (491) T protein:vir:79 189 PYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVGKHPR--------SASDAETNLL---LDRLEDM--VQDAVAVI 255 (491) T ss_pred cccchhHHHHHHHHHHHHhhHHHHHHHHHHcCCCeEEEecCC--------CCCHHHHHHH---HHHHHHH--hcCeEEEe Confidence 999999999999999999999999999999999998877653 4566666554 3344443 77899999 Q ss_pred ccCcccccccccceeeeeccc-cCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016071. 
315 PSDMNAQGGEQYKMSLKGIDG-AGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQSIHGHFVQRDIDII 393 (516) Q Consensus 315 P~g~~i~~~e~~~iel~~~~g-~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~ev~~~~~~aDa~~i 393 (516) |.||+ |||+++++ +|+...|.+||+|||++|||+||||||||+ ++||+|+|+||++|+++++++|+++| T Consensus 256 P~~~~--------ie~~ea~~~~g~~~~y~~li~~~d~~Isk~iLGqtlTt~--~~gs~a~~~vh~~v~~~i~~~D~~~i 325 (491) T protein:vir:79 256 PDDSS--------IEIKEAAGKSGSADVYERLLHFCRGEVSIALLGQNQTTE--ATSTRASAQAGLEVTDDIRDGDKAIV 325 (491) T ss_pred cCCce--------eEEEeccCCCCChhHHHHHHHHHHHHHHHHHhhhhhccC--cccchhhHHHHHHHHHHHHHHHHHHH Confidence 99986 55565554 344456999999999999999999999996 46999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCccccc Q lcl|NC_016071. 394 VEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMST 473 (516) Q Consensus 394 ~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~ 473 (516) +++|| +||++|+.+|| ++...|+|.+.+.++.+ +.+|+++++|+++|+.++ ++|++++||||.+..+++.. T Consensus 326 ~~tln-~li~~l~~~N~---~~~~~p~f~~~e~ee~~-~~~a~~~~~L~~~G~~i~----~~~~~e~~Gip~~~~~e~~~ 396 (491) T protein:vir:79 326 VEAMN-MLIRWICDLNF---DGAARPVFDMWEQEQVD-EIQAGRDEKLTRAGARFT----PAYFKRAYNLQDGDLDERPL 396 (491) T ss_pred HHHHH-HHHHHHHHhcC---CCCCcceEeecCcCchh-HHHHHHHHHHHhCCCccC----HHHHHHHhCCCCCCCCcccc Confidence 99999 59999999995 34567888887766554 678999999999999665 68999999999887666544 Q ss_pred CcccccCCCCCCcccccccccCCCCCcccccccccchhhhhcC Q lcl|NC_016071. 474 DELLKLLGQDTSRSGDGMTAGSNGNGTGKISSTRDNSVSNMDN 516 (516) Q Consensus 474 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~ 516 (516) +...+..++..+. ....+++ ..+.|...+..+. 
T Consensus 397 ~~~~~~~~~~~~~----~~~~~~~------~~~~d~~~~~~~~ 429 (491) T protein:vir:79 397 PVSAVDAVGAASF----AEFEAPD------QDALDAALNALSA 429 (491) T ss_pred CcCcccccccccc----cccCCCC------CcchHHHHHHHHH Confidence 3322211111111 1101111 1111222111111 No 12 >protein:vir:107880 Length: 491 # NCBI annotation: gp29 # Family: family:all:313 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024702;genbank:gi:48696939;genbank:GeneID:2845968 Probab=100.00 E-value=9.4e-102 Score=574.45 Aligned_cols=423 Identities=14% Similarity=0.096 Sum_probs=306.5 Q ss_pred CCcccc-CcccccchhhhcccCCCCcccccchHH--HHHHH-HHHHhhcccccC-CcccHHHHHHHhhChHHHHHHHHHH Q lcl|NC_016071. 1 MSTRFA-QPSEVVKAGNENLAVSRLRTGELGSGA--LSQLR-AESEVMKVEELR-WPCFLATVEAMKQDHTVSTALDTKY 75 (516) Q Consensus 1 ~~~r~~-~~~~~~~~~~~~p~~~~~~~~e~g~~~--~~~~~-~~~~~~~~~~lr-~~~~~~~y~~m~~D~~v~s~l~~Rk 75 (516) |+..+- +.++..+.....+.+ ..+|++.. .+.++ +.......+-|| .+.++++|++|++|+||+|+|++|| T Consensus 1 m~~~i~~~~g~p~~~~~~~~~~----~~~ia~~~~~~~~~~~~~~~~~~~~iLr~~~~~~~~y~~m~~D~~i~s~l~~Rk 76 (491) T protein:vir:10 1 MSKGLWVSPTEFVTFGEPDKSL----SSQIATRARSIDFFALGMYLPNPDPVLKALGKDIRVYRELRADAHVGGCVRRRK 76 (491) T ss_pred CCCceeCCCCCccCcccCChHH----HHHHHhhhcccccccccCCccchHHHHHhcCCCHHHHHHHhhChHHHHHHHHHH Confidence 655432 122222211111111 01221100 00000 000000001122 2457899999999999999999999 Q ss_pred HHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHHHHHhhcceeeeEEEeecccccccccceeecc Q lcl|NC_016071. 
76 VFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAATFNEYGFSIFEKVYRTESAPSKYAGYITIDK 155 (516) Q Consensus 76 ~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~lda~~~G~S~~Eivw~~~~~~~~~~g~~~~~~ 155 (516) ++|++++|+|++..+ +.+++++|+++|+++ +|+++|++||+|++|||||+|++|+++ +|++.+++ T Consensus 77 ~av~~~~w~i~~~~~-----~~~~~e~v~e~l~~~----~~~~~l~~~lda~~~G~s~~Ei~w~~~------~g~~~~~~ 141 (491) T protein:vir:10 77 AAVKALEWGLDRGKA-----KSRVAKSIADVFADL----DLSRIVTEMLDAVLYGYQPMEITWGKV------GNYIVPID 141 (491) T ss_pred HHHhCCCcEEecCCC-----CHHHHHHHHHHHhcC----CHHHHHHHHHHhhhhcceeEEEEEeec------CCeeEEEE Confidence 999999999875422 246789999999864 599999999999999999999999976 46788999 Q ss_pred ccccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCcc Q lcl|NC_016071. 156 IAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNP 235 (516) Q Consensus 156 l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p 235 (516) +.+||| +||.|+.++++.+..++ +..+|+++|+.|||+|+|+++++|| T Consensus 142 l~~r~~------~~f~~d~~~~l~~~~~~--------------------------~~~~g~~l~~~k~i~~~~~~~~~~p 189 (491) T protein:vir:10 142 VVGKPA------DWFVYDPENQLRFRSKD--------------------------HWMQGEELPARKFLVPRQEATYLNP 189 (491) T ss_pred eeeecc------cceeeccCCceEEecCC--------------------------CCCCcceecCCCEEEEEecCCCCCc Confidence 999998 68999999987765443 2356889999999999999999999 Q ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEec Q lcl|NC_016071. 
236 AGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILP 315 (516) Q Consensus 236 ~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP 315 (516) ||.||+|.|||+|+||++++++|+.|+||||+|++++++|+ +++++|++.+ .+++.++ |+++++||| T Consensus 190 ~g~gLl~~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~--------~a~~~ek~~l---~~al~~~--~~~a~~viP 256 (491) T protein:vir:10 190 YGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVGKHPR--------SASDGEKNLL---LDCLEDM--VQDAVAVVP 256 (491) T ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEecCC--------CCCHHHHHHH---HHHHHHH--hcCcEEEec Confidence 99999999999999999999999999999999998877654 4556666544 3444443 678999999 Q ss_pred cCcccccccccceeeeeccccC-cchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016071. 316 SDMNAQGGEQYKMSLKGIDGAG-KQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQSIHGHFVQRDIDIIV 394 (516) Q Consensus 316 ~g~~i~~~e~~~iel~~~~g~g-~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~ev~~~~~~aDa~~i~ 394 (516) .||+ |||+++++++ +...|.+||+|||++|||+||||||||+ ++||+|+|+||++|+++++++|+++|+ T Consensus 257 ~~~~--------ie~~ea~~~~g~~~~y~~li~~~d~~Isk~iLGqtlTt~--~~gs~a~~~vh~~v~~di~~~D~~~i~ 326 (491) T protein:vir:10 257 DDSS--------IEIKEAAGKTGSADVYERLLHFCRGEVSIALLGQNQTTE--ATSTRASAQAGLEVTDDIRDGDKAVVS 326 (491) T ss_pred CCce--------eEEEecCCCCCChhHHHHHHHHHHHHHHHHHhhhhcccC--cccchhHHHHHHHHHHHHHHHHHHHHH Confidence 9985 5566666544 3456999999999999999999999996 469999999999999999999999999 Q ss_pred HHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccccC Q lcl|NC_016071. 
395 EAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMSTD 474 (516) Q Consensus 395 ~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~~ 474 (516) ++|| +||++||++||+ +..+|+|+|++.++.+ +.+|+++++|+++|+.++ ++|++++||||.+.+++++.+ T Consensus 327 ~tln-~li~~l~~~N~~---~~~~p~f~~~~~~e~~-~~~a~~~~~L~~~G~~i~----~~~i~e~~Gip~~~~~~~~~~ 397 (491) T protein:vir:10 327 EAMN-MLIRWICDLNFD---GADRPVFDMWEQEQVD-EIQAGRDQKLTQAGARFT----PAYFKRAYNLQDGDLDERPLP 397 (491) T ss_pred HHHH-HHHHHHHHhcCC---CCCcceEEecCcCchh-HHHHHHHHHHHhCCCcCC----HHHHHHHhCCCCCCcCccccc Confidence 9999 599999999953 4568999999876555 788999999999999665 689999999998876655433 Q ss_pred cccccCCCCCCcccccccccCCCCCcccccccccchhhhh-----cC Q lcl|NC_016071. 475 ELLKLLGQDTSRSGDGMTAGSNGNGTGKISSTRDNSVSNM-----DN 516 (516) Q Consensus 475 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~-----~~ 516 (516) ....... ++.......++ +. ...|.....+ +. T Consensus 398 ~~~~~~~----~~~~~~~~~~~--~~----~~~d~~~~~~~~~~~~~ 434 (491) T protein:vir:10 398 VSAVDTV----GAASFAEFEAP--DQ----DALDAALNTLSARDLNA 434 (491) T ss_pred cCCCCCc----ccccccccCCC--CC----CchHHHHHHHHHHHHHH Confidence 2211111 11110110111 00 1111111111 11 No 13 >protein:vir:78161 Length: 355 # NCBI annotation: hypothetical protein # Family: family:all:2372 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294798;genbank:gi:149882819;genbank:GeneID:5309189 Probab=100.00 E-value=9.7e-88 Score=497.62 Aligned_cols=336 Identities=20% Similarity=0.239 Sum_probs=265.7 Q ss_pred eeeEEEeecccccccccceeeccccccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccC Q lcl|NC_016071. 133 IFEKVYRTESAPSKYAGYITIDKIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSS 212 (516) Q Consensus 133 ~~Eivw~~~~~~~~~~g~~~~~~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 212 (516) |+||||+++ +|++.+++|.+|||.++. ||.|+++++++ .++|.. 
..+ T Consensus 1 v~Eivw~~~------~g~~~~~~l~~r~~~~~~---~f~~~~~~~l~-~~~~~~-----------------------~~g 47 (355) T protein:vir:78 1 MFEQVYRIE------NGRARLGKLAWRPPRTIS---RFDVAPDGGLV-AIEQWG-----------------------VFG 47 (355) T ss_pred CeEEEEEee------CCeEEEeeeeecCcccee---eeeeccCCcee-EEEecC-----------------------CCC Confidence 999999986 578889999999998875 68899988744 444422 223 Q ss_pred CCccccccccEEEEeecCcCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCC--HHH Q lcl|NC_016071. 213 ADEVFIPINKLMVMSLGGTESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPK--SPE 290 (516) Q Consensus 213 ~~~~~iP~~k~i~~~~~~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~--~~~ 290 (516) .++++||+.|||+|+|+++++||||.||||.|||+|+||++++++|+.|+||||+|||++++|++....+...+. ... T Consensus 48 ~~~~~lp~~kfi~~~~~~~~g~p~G~gLlr~~~w~~~fK~~~~~~w~~f~Er~g~g~p~~~~~~~~~~~~~d~~~~~~~~ 127 (355) T protein:vir:78 48 KATVRIPVDRLVVFVNEREGANWLGQSLLRQAYKNWLLKDRFLRIQALVGERNGLGVPIYQGAPLPEAIARDTARAEQWL 127 (355) T ss_pred CCcceeccCCEEEEEeCCCCCCccchhhHHHHHHHHHHHHhhHHHHHHHHHHcCCCceEEEecCCCCcccchhhhHHHHH Confidence 467899999999999999999999999999999999999999999999999999999999998764322111000 001 Q ss_pred HHHHHHHHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCC-cc Q lcl|NC_016071. 
291 SEMVQGLMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGND-GQ 369 (516) Q Consensus 291 ~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~-~~ 369 (516) ......+..+++++++|+++++|||.||+ |||++++|+ ..+|.++|+|||++|||+||||||||+++ ++ T Consensus 128 ~~~~~~l~~~~~~i~~g~~a~~iip~g~~--------ie~~ea~g~--~~~~~~~i~~~d~~Isk~iLGqtlTs~~~~~g 197 (355) T protein:vir:78 128 NDQKEEGLQLAKEFRAGEAAGGYIPHGAN--------FTLTGVQGK--LPEMDGPIRYHDEQIARAVLAHFLTLGGDKST 197 (355) T ss_pred HHHHHHHHHHHHHhhCCcceeEeecCCce--------EEEeecCCC--cccHHHHHHHHHHHHHHHHhhhhhccccCCcc Confidence 11223466777888889999999999985 666666553 44688999999999999999999999864 56 Q ss_pred chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccc Q lcl|NC_016071. 370 GSYNLSESKQSIHGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPK 449 (516) Q Consensus 370 GS~Al~~vh~ev~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~ 449 (516) ||+|+|+||++|+++++++|+++|+++||+|||++||++|| ++..++|+|+|+.+++.| +++++++++|+++|++++ T Consensus 198 GS~Alg~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~lN~--~~~~~~P~~~~~~~~~~~-~~~a~~~~~l~~~G~~~~ 274 (355) T protein:vir:78 198 GSYALGDTFASFFTGSLNAVMKHIADVTQQHVVEDLVDQNW--GPEEPAPRLVPAQLGKEQ-PVTAEAIRALVECGAFTA 274 (355) T ss_pred chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC--CCCCCCCEEEecCcChhH-HHHHHHHHHHHhCCCccc Confidence 99999999999999999999999999999999999999994 667889999999877655 678999999999999999 Q ss_pred cHHHHHHHHHHcCCCCCCCcccccCcc-cccCCCCCCcccccccccCCCCCcccccccccchhhhhc------------- Q lcl|NC_016071. 450 TPTVINKILEVGGFDEEIPEDMSTDEL-LKLLGQDTSRSGDGMTAGSNGNGTGKISSTRDNSVSNMD------------- 515 (516) Q Consensus 450 ~~~~~~~i~e~~Glp~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~------------- 515 (516) ++.+++|++++||||+|.++++..... 
.+..+....+++++...+ +.++++.++++|...-.=+ T Consensus 275 ~~~~~~~~~e~~gip~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~a~~~~a~~~~~~~~~~~~~~~~~~~~~~ 352 (355) T protein:vir:78 275 DPELEKDLRARYGLPAPAERDDGADAAAAKAAGRRRAKRLPGQRQG--AALPSRSPRADPPRRRGPLRRRPRHPAHRRCA 352 (355) T ss_pred cHHHHHHHHHHhCCCCCCCCCcccCCccccccccccccccCCcccc--ccccccCCCCCChhhhHHHHHHhhccccCCCC Confidence 988899999999999887766554433 333344445556554443 4555555555554432211 Q ss_pred C Q lcl|NC_016071. 516 N 516 (516) Q Consensus 516 ~ 516 (516) | T Consensus 353 ~ 353 (355) T protein:vir:78 353 P 353 (355) T ss_pred C Confidence 1 No 14 >protein:vir:102727 Length: 945 # NCBI annotation: portal protein # Family: family:all:2446 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874016;genbank:gi:118197623;genbank:GeneID:4495919 Probab=99.82 E-value=3.5e-19 Score=121.79 Aligned_cols=431 Identities=11% Similarity=0.036 Sum_probs=219.6 Q ss_pred CCccccCcccccch--hhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcc---cHHHHHHHh-hChHHHHHHHHH Q lcl|NC_016071. 1 MSTRFAQPSEVVKA--GNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPC---FLATVEAMK-QDHTVSTALDTK 74 (516) Q Consensus 1 ~~~r~~~~~~~~~~--~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~---~~~~y~~m~-~D~~v~s~l~~R 74 (516) |.++++........ ....|.- .-..+.+..|. ...++..+. ..+.|.+|++.+ T Consensus 76 i~~pfkkk~~~~~~d~f~~s~es---------------------~s~vtsls~pdaf~~vnVs~~~AlknsaV~scI~~I 134 (945) T protein:vir:10 76 IIVPYNHQEPPFKFNLFEYSPES---------------------LMYLPSISDPDAFFLINLFRKYRFNNDSKLIKVSEI 134 (945) T ss_pred ccccccccccchhhhhhhccCcc---------------------ceecccccCccceeeehhhhhhhhccHHHHHHHHHH Confidence 33333321111100 0000000 00011111111 234555554 589999999999 Q ss_pred HHHHhcCCceeee--CCCCCChhhHHH--HHHHHHHHhhcc----CcCCHHHHHHHHH-HHHhhcceeeeEEEeeccccc Q lcl|NC_016071. 
75 YVFVTKAFNDFKV--LYNRDSKASKDA--AEFVEYALKNLA----NQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPS 145 (516) Q Consensus 75 k~~v~~~~w~i~~--~~~~d~~~~~~~--a~~v~~~l~~~~----~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~ 145 (516) ...|.++++++.- ..+..+...+++ ..-+...|++-+ ....|..+++.++ +.+.+|-++.++++...+ T Consensus 135 A~sIAsLPlklYrr~edG~~~~~~kk~~~~hpL~~LL~rPNp~mT~~eFwqsFl~~Lv~dLLL~GNAYieIiRd~~G--- 211 (945) T protein:vir:10 135 PKKLTSKELEIYKHIEDKHVNYYLKRIRDARNILEFLERPDPYFSEVNSWEYLLGMVLDDILTIDRGAIVKIRDEQG--- 211 (945) T ss_pred HhhhccCceEEEEecccCcccccccccccchHHHHHHhCCCcccChhHHHHHHHHHHHHHHhhcCCeEEEEEECCCC--- Confidence 9999999987632 111111111111 112223343211 1223556776654 788899999999876433 Q ss_pred ccccceeeccccccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEE Q lcl|NC_016071. 146 KYAGYITIDKIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMV 225 (516) Q Consensus 146 ~~~g~~~~~~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~ 225 (516) . +..|.+.++.+++ ...++||+.....++. .++.....++..-.|+ T Consensus 212 ----~--ii~L~pLdPs~Vt----i~~ddDG~~~y~Yv~~------------------------idG~~~~~v~a~DvIl 257 (945) T protein:vir:10 212 ----N--LVAITPVDGTTIK----PILSEDTGIVVGYVQE------------------------VDGAIVAHFDKRDVVL 257 (945) T ss_pred ----c--EEEEEEECCcceE----EEEcCCCcEEEEEEEe------------------------cCCceEEEecCCceEE Confidence 2 2244555555443 3455666544322211 1112233566777788 Q ss_pred EeecCcCC---ccccchhHHHHHHHHHHHHHHHHHHHHHHhhccc-cceeeeeccc--ccccccCCCCHHHHHHHHHHHH Q lcl|NC_016071. 226 MSLGGTES---NPAGVSPLVGCYRAFREKILIENLETIGASKDLG-GIIELKIPSQ--ILNKAAIDPKSPESEMVQGLMA 299 (516) Q Consensus 226 ~~~~~~~g---~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~-~~~v~~~pp~--~~~k~~~~~~~~~~~~l~~l~~ 299 (516) |.+....+ .++|.|.+..+....-.-....++-+.+..++|+ |--++..+.. 
...+.+..-+ .+..+++++ T Consensus 258 hirn~s~DG~~~GyGlSPIeaa~~aI~~alAaek~aar~FskNGa~PsGILsvkg~~~~d~k~~~~Ls---eEq~erlKe 334 (945) T protein:vir:10 258 FRQNLTPDVYMYGYSLPPIEILYKVILSDIFIDKGNLDYYRKGGSIPEGILAIEPPSYKEGDIYPQLS---REQLESIQR 334 (945) T ss_pred EeccCCCCcccccCCchHHHHHHHHHHHHHHHHHHHHHHHHhCCCccceEEEecCccccccccccccC---HHHHHHHHH Confidence 87665433 3468888998888776666666665666555553 2112222111 1111111122 233445565 Q ss_pred HHHHhhcccceE--EEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHH Q lcl|NC_016071. 300 DAANAHAGEQAY--FILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSES 377 (516) Q Consensus 300 ~~~~~~~g~~a~--~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~v 377 (516) .......|.+++ ++++.|++.. ..+.+.....+.+..++.-++|++++.-..--.+...+++++-.+. T Consensus 335 ~wee~~sG~NnG~piVLdeGmef~----------pLs~s~~DaQfLEsrkfs~eeIArAFGVPP~lLG~~e~st~SNiEq 404 (945) T protein:vir:10 335 QLQAIMMGDYTQVPILSGGKFTWI----------DFKGKRRDMQFKELAEFVARKICAVYQVSPQDVGILEGSNKATAEV 404 (945) T ss_pred HHHHHhCCcccccceecCCCceEE----------EccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcccCCCCCcchHHH Confidence 556555565555 3567776422 2222333445667778888899999766543333233345555565 Q ss_pred HHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHH Q lcl|NC_016071. 378 KQSIH-GHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINK 456 (516) Q Consensus 378 h~ev~-~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~ 456 (516) +...+ ...++.-++.|++.||+.|++.. . ...-+|.|+.....|.+..+++++++++.|++.+ +. 
T Consensus 405 q~~~Fv~~tL~Pil~~IEqeLNrkLl~~~--------e-g~~i~fdFd~ldl~D~ksraEal~kli~sGiLTi-----NE 470 (945) T protein:vir:10 405 MASLTKAKGLEPLMATISKGFDEVVSEFR--------N-EKDIKLWFKEDDLEKERDWWNIIQGQLNTGFRSI-----NE 470 (945) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccccc--------c-CceeEEEecchhccCHHHHHHHHHHHHhCCCcCH-----HH Confidence 55554 56688999999999998654321 1 1223677877777888899999999999999876 57 Q ss_pred HHHHcCCCCCCCcccccCcccccCCCCC-Ccc-------------c--ccccccCCCCCcccccccccchhhhhcC Q lcl|NC_016071. 457 ILEVGGFDEEIPEDMSTDELLKLLGQDT-SRS-------------G--DGMTAGSNGNGTGKISSTRDNSVSNMDN 516 (516) Q Consensus 457 i~e~~Glp~~~~~~~~~~~~~~~~~~~~-~~~-------------~--~~~~~~~~~~~~~~~~~~~d~~~~~~~~ 516 (516) +|+.+|+|+-..+|+.........|.+. ..+ + +....+...+.+......++.+++-+.. T Consensus 471 vRe~lGLpPIeGGD~lli~~nn~~P~d~~~ka~~ga~p~q~aq~~~dqp~~kGGe~dEns~~psE~kda~~e~~~~ 546 (945) T protein:vir:10 471 ARMEKGLEPVPWGDVPFSGLRNWKPEDEQAKAQQGAMPPQLAQAMADQPSQQGGGVDENSSVPSEQKNAGLEVLRN 546 (945) T ss_pred HHHHhCCCCCCCcceeeeccccccccccccccccCCCCcccccCCCCCCCCCCCCCCCCCCCCCcccchHHHHHHH Confidence 9999999976555554321110000000 000 0 0000000011111111222222222211 No 15 >protein:vir:93610 Length: 454 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449295;genbank:gi:157166043;interpro:IPR006427;interpro:IPR006944;uniprot:Q6H9U6;genbank:GeneID:5580432 Probab=99.78 E-value=2.2e-17 Score=111.94 Aligned_cols=432 Identities=13% Similarity=0.053 Sum_probs=224.9 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhhChHHHHHHHHHHHHHhc Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQDHTVSTALDTKYVFVTK 80 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~~v~~ 80 (516) |=..+.+..+..++... ++...... 
..+...+....-.+.+..+ ..+..++-+.|.+|+..+-..|.+ T Consensus 1 ~~~~~~~~~~~~~~~~~------~~~~~~~~-----~~~~~~~~~~g~~~~g~~v-~~~~al~~~~V~~~v~~Ia~~iA~ 68 (454) T protein:vir:93 1 MWNLLRRTRKNQKSGRD------VREAGWTS-----LFQAVAEPFAGAWQQGVKA-DPEAVLSFHAVFACISLISQDIAK 68 (454) T ss_pred CCCccccCccccccccc------ccchhhhh-----hhhhhhhhhcchhhcCccc-ChHHhhccHHHHHHHHHHHHhhcc Confidence 55555544443332221 11110100 0011111111111111111 124455678899999999999999 Q ss_pred CCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceeecccccc Q lcl|NC_016071. 81 AFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITIDKIAFR 159 (516) Q Consensus 81 ~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l~~r 159 (516) ++|.+.-..+..... +.....+..++.+-+...++.+++..++ +.+.+|-+++++++... |. +..|.+. T Consensus 69 lp~~~~~~~~~g~~~-~~~~~~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~-------G~--~~~L~~i 138 (454) T protein:vir:93 69 MRLRLMQTDAQGIRR-ETRRGDIARLCRRPNAQQNRIQFFELWLNAKLRHGNTVVLKIRNAR-------GQ--IKELRIL 138 (454) T ss_pred CceEEEEeccCCccc-hhhhHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEECCC-------Cc--EEEEEEE Confidence 999875322211111 1111122333444444556778888877 57889999999998643 22 2344455 Q ss_pred CchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCccccch Q lcl|NC_016071. 160 PQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPAGVS 239 (516) Q Consensus 160 ~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~G~g 239 (516) ++.+++ ...+++|++......... 
........+|.+-+|++++....+.++|.| T Consensus 139 ~~~~v~----v~~~~~g~~~y~~~~~~~----------------------~~~~~~~~~~~~eViH~k~~~~~~~~~G~s 192 (454) T protein:vir:93 139 DWNRVE----PLVADDGEVFYRITPDRN----------------------CGITEAVTVPAREVIHDRFNCFFHPLIGLP 192 (454) T ss_pred cCcceE----EEEcCCCcEEEEEEeccc----------------------cccceeEEecCcceEEeccCCCCCCceecc Confidence 554443 234556654332221100 011223456777766666666777789999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceE--EEeccC Q lcl|NC_016071. 240 PLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAY--FILPSD 317 (516) Q Consensus 240 Llr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~--~iiP~g 317 (516) .+..+....-.-....++...+...-+.+--+++. +..-++++ .+++++.......|..++ ++++.| T Consensus 193 p~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~--------~~~l~~e~---~~~~~~~~~~~~~g~n~g~~~vl~~g 261 (454) T protein:vir:93 193 PVYAAGLAATQGHHIQENSTSFFRNGGRPSGVIEI--------PGSITEEN---AKKLKSNWDSGYTGENAGKTAILSNG 261 (454) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEec--------CCCCCHHH---HHHHHHHHHHHhcccccCCceeccCC Confidence 99999887777666666666666543333222222 22223333 344555555555565554 567777 Q ss_pred cccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHH-HHHHHHHHHHHHHHHHH Q lcl|NC_016071. 318 MNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQ-SIHGHFVQRDIDIIVEA 396 (516) Q Consensus 318 ~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~-ev~~~~~~aDa~~i~~~ 396 (516) ++.+. ++. +.....|.+..++...+|++++.-...-.+..++++++-.+.+. ......+.-.++.|+.. 
T Consensus 262 ~~~~~--------l~~--~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~f~~~~l~P~~~~ie~~ 331 (454) T protein:vir:93 262 AKYNP--------TTF--SPVDSQTVEQLKMTAEIVCSVFRVPAYKIGVGQPPSSDNVEALEQQYYSQCLQTLIESIELL 331 (454) T ss_pred ceEEE--------ccc--ChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHH Confidence 64322 111 22333466666788889999977655444333335555544443 34556677888888888 Q ss_pred HHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccccCc- Q lcl|NC_016071. 397 FNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMSTDE- 475 (516) Q Consensus 397 ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~~~- 475 (516) ||+.|+.. ...+-+|.++..-..|++..++++.++++.|++.+ +.+|+.+|+|+-..+|+..-. T Consensus 332 ln~~L~~~----------~~~~~~f~~~~ll~~D~~~r~~~~~~~~~~G~~T~-----NE~R~~~gl~pi~ggD~~~~~~ 396 (454) T protein:vir:93 332 LDEALETG----------ENESTEFDVTTLLRMDSERRMKTLGDAVKNTLLTP-----NEARKRENLPPLAGGDALYLQQ 396 (454) T ss_pred HHHhhcCC----------CCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCCeeeecc Confidence 98866431 11233444445556888999999999999999876 679999999976555543111 Q ss_pred -------ccccCCCCCCcccccccccCCCC---Cccc---ccccccchhhhhcC Q lcl|NC_016071. 476 -------LLKLLGQDTSRSGDGMTAGSNGN---GTGK---ISSTRDNSVSNMDN 516 (516) Q Consensus 476 -------~~~~~~~~~~~~~~~~~~~~~~~---~~~~---~~~~~d~~~~~~~~ 516 (516) ..++.....+....+.....+.+ .++. 
.-...|.+.+-..+ T Consensus 397 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~e~~~d~~~~~~~~ 450 (454) T protein:vir:93 397 QNYSLEALSRRDAREDPFASSGKTASVPQAVAASDGNKAITETEHDAVKAMFRG 450 (454) T ss_pred CccchHhhhccCcccCCCCCCccCCCCCCCCCCCCCCCCccCCccchhhhhhhh Confidence 11111111111111111111100 1111 11122333333333 No 16 >protein:vir:79772 Length: 648 # NCBI annotation: portal protein # Family: family:all:3222 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429612;genbank:gi:156564103;genbank:GeneID:5525537 Probab=99.78 E-value=5.4e-17 Score=109.79 Aligned_cols=452 Identities=12% Similarity=0.084 Sum_probs=209.9 Q ss_pred CCcc---------ccCcc----------------cccchhhhcc--------cCCCCcccccchHHHHHHHHHHHhhccc Q lcl|NC_016071. 1 MSTR---------FAQPS----------------EVVKAGNENL--------AVSRLRTGELGSGALSQLRAESEVMKVE 47 (516) Q Consensus 1 ~~~r---------~~~~~----------------~~~~~~~~~p--------~~~~~~~~e~g~~~~~~~~~~~~~~~~~ 47 (516) |... +--.. +...+-...| .....++..+-+.|+....+.-. .. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~d~~~~~~~r~g~~~~~~~~g---~~ 77 (648) T protein:vir:79 1 MARKVWGRGFWSRISLMWRDEDDDKEPLVLEESMQLGEAPGAMPKGGGGGGSAKRDPKMSLVKRIGLAIMDGGGG---GR 77 (648) T ss_pred CccchhcchhhhhhhhhccCccccccccccccccccCCCccccCCCCcccccccccchhHHHHHhHHHHHhhcCC---cc Confidence 1100 00000 0000000001 11112222222222111111100 00 Q ss_pred cc-CCcccHHHHHHHh-hChHHHHHHHHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHH- Q lcl|NC_016071. 48 EL-RWPCFLATVEAMK-QDHTVSTALDTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAA- 124 (516) Q Consensus 48 ~l-r~~~~~~~y~~m~-~D~~v~s~l~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~l- 124 (516) ++ .-|-+++.+.+.. .+++|.+|+..+...|.+++|.|....+. ...... 
.+..+.+-+...+..+++..++ T Consensus 78 ~~~epp~d~~~l~~l~~~np~V~~aI~iia~~ia~l~~~i~~~~~~---~~~~~~--~~~ll~rPn~~~t~~~f~~~l~~ 152 (648) T protein:vir:79 78 DFEEPEFDFNEITSAYNTEGYVRQAVDKYIEMMFKADWDFVSKNPN---AVEYIR--MRFTLMAEATQIPTNQLFIEIAE 152 (648) T ss_pred ccccCCcCHHHHHHHHhcChHHHHHHHHHHHHHhhCcceEEecCCc---cchhhH--HHHHhhccCCCCCHHHHHHHHHH Confidence 11 1233555554444 49999999999999999999998764322 111111 1112222223345566776655 Q ss_pred HHHhhcceeeeEEEeecccccccccce--------eeccccccCchhcccccceeecCCCceeeeccccccccccccccc Q lcl|NC_016071. 125 TFNEYGFSIFEKVYRTESAPSKYAGYI--------TIDKIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGL 196 (516) Q Consensus 125 da~~~G~S~~Eivw~~~~~~~~~~g~~--------~~~~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~ 196 (516) +.+.||.+.+|++....+... .+.. .+..+.+-++.+++ ...+++|..+... T Consensus 153 ~lll~GNAYveiiRd~~G~~~--~~l~~~~~~~~~~v~~l~pl~p~~v~----v~~d~~g~~~~Y~-------------- 212 (648) T protein:vir:79 153 DLVKYCNVVIAKSRAKDALPF--QGMNVMGVGDSMPVAGYFPLNLASMK----VKRDKFGMIKGWQ-------------- 212 (648) T ss_pred HHHhcCCeEEEEEecCCCccc--hhhhhhhhccccceeeeEeecCceeE----EEEcCCCceeeeE-------------- Confidence 467899999999987655321 1000 01112222222111 2223333221100 Q ss_pred cccccccccccccccCCCccccccccEEEEeecCcCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecc Q lcl|NC_016071. 
197 TQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPS 276 (516) Q Consensus 197 ~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp 276 (516) ....+....+.++++.+|++++....+.+||.|.+..|.-..-.-....++...|....+.|.-+++.++ T Consensus 213 ----------y~~~g~~~~~~~~~~dIIHik~~~~~d~~~GlSpi~~a~~aI~l~~aa~~~~~~fF~NGa~P~gil~~~~ 282 (648) T protein:vir:79 213 ----------QEQEGQDKPQKFKPEDIVHIYYKREKGRAFGTPWLLPALDDIRALRQVEENVLRLVYRNLHPLWHVKVGL 282 (648) T ss_pred ----------EEecCCceeEEecCccEEEEccCCCCCCceeccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCC Confidence 0011222344567777666666667788999999999998887777777888888876555544444321 Q ss_pred cccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHH Q lcl|NC_016071. 277 QILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDR 356 (516) Q Consensus 277 ~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~ 356 (516) .....+..++.++.+.....+.. ...+.+.++.+. ++. .++....+|.+..++..++|+.+ T Consensus 283 ------~~~~~e~~k~~~e~~~~~~~~~~--i~gg~v~~~~~~--------i~~---~~s~~dlqfle~rk~~~~eIa~a 343 (648) T protein:vir:79 283 ------EQEGFGAEEGEVDLVRGEVENMD--VEGGMVTTERVN--------ISS---IASNQIIDAKEYLKHFEQRAFTV 343 (648) T ss_pred ------CccchHHHHHHHHHHHHhccccc--ccccccccceee--------ccc---cCCHHHHHHHHHHHHHHHHHHHH Confidence 11111222333333333222110 001122222221 111 11112224666667778899999 Q ss_pred HhcccccccCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---hcCCcCCccccceEEecCcCchhHHH Q lcl|NC_016071. 357 FGAGFINLGNDGQGSYNLSESKQSIHGHFVQRDIDIIVEAFNKNLIPQLLA---LNDIRLSDEDMPKLKPGLIQEVDMEG 433 (516) Q Consensus 357 iLGqtLts~~~~~GS~Al~~vh~ev~~~~~~aDa~~i~~~ln~~li~~lv~---lN~~~~~~~~~P~~~~~~~~~~dl~~ 433 (516) +--.-.-.+..+.++++-++.....+...+..-...++..++..+++.+.. ++-. .......+|.|......|.+. 
T Consensus 344 FgVPP~lLG~~~~ss~stae~~~~~~~~~i~~l~~~i~~~le~~~~~~ll~e~~l~~~-l~~d~~ieF~~~~Llr~D~~~ 422 (648) T protein:vir:79 344 LGVSELMMGRGGTASRSTGDNLSSDFKDRIKALQKVMATFINEFMVKEILMEGGFDPV-LNPDDKVEFRFNEIDMDSKIK 422 (648) T ss_pred hCCCHhHcccCCCccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccc-ccccceEEEeecccchhhHHH Confidence 877654444334456666666666666666666666666665554443321 2211 111233567777777788888 Q ss_pred HHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccccCc-----c----cccCCCCCCcccccccccCCCCCcccc- Q lcl|NC_016071. 434 FSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMSTDE-----L----LKLLGQDTSRSGDGMTAGSNGNGTGKI- 503 (516) Q Consensus 434 ~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~~~-----~----~~~~~~~~~~~~~~~~~~~~~~~~~~~- 503 (516) .++.+.+++..|++.+ +.+|+..|+|+-.+++..... . +.+.+....+.+.+.. .+.+++.... T Consensus 423 ~a~~~~~l~~~GilT~-----NEaR~~lGlpPi~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~-~a~~eg~~~e~ 496 (648) T protein:vir:79 423 LENQAVFLYEHNAISE-----DEMRELIGRDPVDDGEGRAKMHLQMVTIAQATALAALAPTPAGGSSA-SASGDKKKKAT 496 (648) T ss_pred HHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCCCccccccccccchhccccccCCCCCCCCCCC-Ccccccccccc Confidence 8899999999999875 579999999854332211000 0 0000000011111100 0111111100 Q ss_pred -ccccc-------chhhhhcC Q lcl|NC_016071. 504 -SSTRD-------NSVSNMDN 516 (516) Q Consensus 504 -~~~~d-------~~~~~~~~ 516 (516) ...++ .+-..-.| T Consensus 497 ~~~~~~~~~~g~~~~~~~~~~ 517 (648) T protein:vir:79 497 DNKTKPTNQHGTKTSPKKQTN 517 (648) T ss_pred CCCCCCCCCCCcCCCCccccc Confidence 00011 11111111 No 17 >protein:vir:105064 Length: 421 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006584;genbank:gi:46402090;genbank:GeneID:2777930 Probab=99.76 E-value=1.8e-17 Score=112.39 Aligned_cols=412 Identities=10% Similarity=-0.008 Sum_probs=209.2 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhhChHHHHHHHHHHHHHhc Q lcl|NC_016071. 
1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQDHTVSTALDTKYVFVTK 80 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~~v~~ 80 (516) .+.-+++..+.. + +. ..|..+...........+..+ .-+..++-+.|.+|+..+-..|.+ T Consensus 3 ~~~~~~~~~~~~---------s-------~~---~~w~~~~~~~~~~~~~~g~~v-t~~~al~~~~v~~~i~~Ia~~iA~ 62 (421) T protein:vir:10 3 IPQMFEGKKRSV---------S-------GG---GFWEAMLGGVRSSHSKAGVMI-TPETALALSAVRACVTLLAESVAQ 62 (421) T ss_pred Ccchhccccccc---------C-------cc---hhhHHHhhhhccCcccCCcee-chHHhhccHHHHHHHHHHHHhhcc Confidence 111111111000 0 00 012222221111111111122 124456789999999999999999 Q ss_pred CCceeeeCCCCCC-hhhHHHHHHHHHHHh-hccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceeecccc Q lcl|NC_016071. 81 AFNDFKVLYNRDS-KASKDAAEFVEYALK-NLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITIDKIA 157 (516) Q Consensus 81 ~~w~i~~~~~~d~-~~~~~~a~~v~~~l~-~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l~ 157 (516) ++|++.-...... ....+ .-+...|. +-+...++.++++.+. +.+.+|-+++++++...+ ++ ..|. T Consensus 63 lp~~~~~~~~~g~~~~~~~--~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G---~~------~~L~ 131 (421) T protein:vir:10 63 LPVELYRRDKNGGRQRATD--HPIYDLIHSQPNKKDTSFEYFEQQQGLLGLEGNCYSIIDRDGKG---YP------KELI 131 (421) T ss_pred CceEEEEEcCCCceeeccc--chHHHHHhhcccCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCC---cE------EEEE Confidence 9998742211111 11011 11223333 3344556778887755 677899999999876432 22 2333 Q ss_pred ccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCcccc Q lcl|NC_016071. 158 FRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPAG 237 (516) Q Consensus 158 ~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~G 237 (516) +.++..+. ...+.+|.....+ ...+..+|.+-+++.++.. 
.+.++| T Consensus 132 ~l~~~~v~----v~~~~~g~~~y~~-----------------------------~~~g~~~~~~eiih~~~~~-~d~~~G 177 (421) T protein:vir:10 132 PINPKKVI----VLKGPDGMPYYEI-----------------------------PEIGETLPMRMMHHVKVFS-LDGYIG 177 (421) T ss_pred EecCceEE----EEECCCceEEEEE-----------------------------cCCCcEEchhhEEEecCcC-CCCccc Confidence 44443222 1223344322111 0112346666555555444 455899 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhccc---ceEEEe Q lcl|NC_016071. 238 VSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGE---QAYFIL 314 (516) Q Consensus 238 ~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~---~a~~ii 314 (516) .|.+..+....-.-....++...+...-+.+=-+++.+.. .+....++..+++++...+...|. ...+++ T Consensus 178 ~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~-------~~~~~~~e~~~~~~~~~~~~~~g~~n~~~~~vl 250 (421) T protein:vir:10 178 SSPIQTNADVLGLNLAVEEHASAVFRRGATMSGVIERPKE-------APAIKSQEKIDQLLAKWTDRYSGINNMFSVALL 250 (421) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEEecCc-------cCccCCHHHHHHHHHHHHHHhcCccccCcceec Confidence 9999999877766666667767776654443333333211 111112233444555544444442 234677 Q ss_pred ccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHH-HHHHHHHHHHHHHH Q lcl|NC_016071. 315 PSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQ-SIHGHFVQRDIDII 393 (516) Q Consensus 315 P~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~-ev~~~~~~aDa~~i 393 (516) |.|++++ ..+.+....+|.+..++...+|++++.-..--.+..+.++++-.+-+. 
.....-+.-.++.| T Consensus 251 ~~g~~~~----------~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~f~~~tl~P~~~~i 320 (421) T protein:vir:10 251 QEGMSYK----------QMSQDNEKAQLLQSRQWGVEEVCRLYKIPPHMVQMLAKATNNNIEHQGLQFVMYTLLAWLKRH 320 (421) T ss_pred CCCceEE----------ecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCcCCccccHHHHHHHHHHHHHHHHHHHH Confidence 8886432 222233444577777888899999988765444333345565444443 44455677888888 Q ss_pred HHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCccccc Q lcl|NC_016071. 394 VEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMST 473 (516) Q Consensus 394 ~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~ 473 (516) +..||+.|+.+- .....+.+|..+.....|++..+++++++++.|++.+ +.+|+.+|+|+-+.+|+.. T Consensus 321 e~~ln~kL~~~~-------~~~~~~v~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~-----NE~R~~~gl~p~~ggD~~~ 388 (421) T protein:vir:10 321 EGALQRDLLLPS-------ERRDLYIEFNVSGLLRGDQKSRYESYALGRQWGWLSV-----NDIRRMENLPPIAGGDKYL 388 (421) T ss_pred HHHHhhhccCcc-------ccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCcceee Confidence 888888665431 1111222344444456788999999999999998876 6799999999765555543 Q ss_pred CcccccCCCCCCcccccccccCCCCCcccccccccc Q lcl|NC_016071. 474 DELLKLLGQDTSRSGDGMTAGSNGNGTGKISSTRDN 509 (516) Q Consensus 474 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 509 (516) ............+......+...++.+... 
.|+ T Consensus 389 ~~~n~~~~~~~~~~~~~~~~~~~~e~d~~~---~~~ 421 (421) T protein:vir:10 389 TPLNMVDSAQIIPGDKKPTAQQMAEIDTIL---SRT 421 (421) T ss_pred eccccccccccccCCCCcccccCccccccc---ccC Confidence 211110011111111101111111111111 111 No 18 >protein:vir:3153 Length: 467 # NCBI annotation: capsid protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665924;genbank:gi:22091110;genbank:GeneID:951257 Probab=99.76 E-value=3.5e-17 Score=110.83 Aligned_cols=415 Identities=11% Similarity=0.068 Sum_probs=202.8 Q ss_pred HHHHhh-ChHHHHHHHHHHHHHhcCCceeeeCCCCCCh-hhHHHHHHHHHHHhhccC----------cCCHHHHHHHH-H Q lcl|NC_016071. 58 VEAMKQ-DHTVSTALDTKYVFVTKAFNDFKVLYNRDSK-ASKDAAEFVEYALKNLAN----------QQTLRDIARSA-A 124 (516) Q Consensus 58 y~~m~~-D~~v~s~l~~Rk~~v~~~~w~i~~~~~~d~~-~~~~~a~~v~~~l~~~~~----------~~~~~~~l~~~-l 124 (516) .++|.+ .+.|.+|+..+...|.+++|.+....+.... ...+..+-+...|..... ..++.+++..+ . T Consensus 1 l~~l~~~n~~v~~ci~~ia~~ia~~p~~i~~~~~~~~~~~~~~~~~~~~~~l~~~~pn~~~~~~~~~~~t~~~~~~~~~~ 80 (467) T protein:vir:31 1 MAELLEHNETHAKCVHAKSRYVAGFGINIIPHPEAEDPDRDGEQYERVWDFWFGDDSNWQVGPMESERATATNVLQTAWT 80 (467) T ss_pred ChhhhhcCHHHHHHHHHHHHhhhcCCeEEEEccCcccccchhhhhhhHHHHhhccCCCccccchhhHhhHHHHHHHHHHH Confidence 345555 7999999999999999999998765433221 112222333333332211 12456777654 4 Q ss_pred HHHhhcceeeeEEEeecccccccccceeeccccccCchhccccc---ceeecCCCceeeecccccccccccccccccc-- Q lcl|NC_016071. 125 TFNEYGFSIFEKVYRTESAPSKYAGYITIDKIAFRPQSSLSRSK---PWVFDEDGRTLKGIYQSKMAFANFQNGLTQI-- 199 (516) Q Consensus 125 da~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l~~r~q~ti~~~~---~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~-- 199 (516) +.+.+|.+++|+++...+. + ..|.+.++.+++..+ ++..-.++...... .+...+... 
T Consensus 81 ~l~l~Gn~~i~~~r~~~G~---~------~~l~~l~~~~v~~~~d~~~~~~~~~~~~~~~~--------~~~~~~~~~~~ 143 (467) T protein:vir:31 81 DYEAIGWLTIEILTQTDGT---P------TGLAYVPGHTIRKRMDERGFVQLLEEKEKYFG--------VAGDRYQTNGN 143 (467) T ss_pred HHHhcCCeEEEEEECCCCc---E------EEEEEeCCceeEeeeecceeEeecCCceeeEE--------eccccceeecc Confidence 6788899999999865432 2 233344444443111 11111111110000 000000000 Q ss_pred --ccccccccccccCCCccccccccEEEEeecCcCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccc Q lcl|NC_016071. 200 --SSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQ 277 (516) Q Consensus 200 --~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~ 277 (516) .............+..+.+|...+|.++.....+..+|.+.+..+......-....++-..|....+.+--+++.+ T Consensus 144 ~~~~~~~~~~~~~~~~~~~~~~~~diih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-- 221 (467) T protein:vir:31 144 GDLDPVFVDADDGSTGTSVSNPANELIFKRNHSPLYPHYGAPDIIPAVKTIRGDSAAQDYNIDFFENDGVPRIAIIVK-- 221 (467) T ss_pred cceeeeeeeeccccccceeEeccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEec-- Confidence 0000000111123445667888877776655567789999999888765544444444444443222221122211 Q ss_pred ccccccCCCCHHHHHHHHHHHHHHHHhhc--------------ccceEEEeccCcccccccccceeeeeccc-cCcchhH Q lcl|NC_016071. 278 ILNKAAIDPKSPESEMVQGLMADAANAHA--------------GEQAYFILPSDMNAQGGEQYKMSLKGIDG-AGKQYST 342 (516) Q Consensus 278 ~~~k~~~~~~~~~~~~l~~l~~~~~~~~~--------------g~~a~~iiP~g~~i~~~e~~~iel~~~~g-~g~~~~~ 342 (516) ...-+.++. +.+++...+... ......+++.|++.... 
.+++...+- +.....| T Consensus 222 -----~~~l~~e~~---~~~~~~~~~~~~~~~~~~~~~~~g~~n~~~~~~l~~g~~~~~~---~~~~~~ls~~~~~d~qf 290 (467) T protein:vir:31 222 -----GAELTEKGR---EEMRNLIEDNNEDNHRTAFIETEKIVQNEDYLNLADGADRSDV---EIRLEPLTVGIDEEASF 290 (467) T ss_pred -----CcCCCHHHH---HHHHHHHHhhhcchhhhhhhhhcccccccccccccCCCccccc---ceeEEeccccChhhHHH Confidence 111222222 334433332211 11223566777654432 244444321 2233457 Q ss_pred HHHHHHHHHHHHHHHhcccccccCCccchhh-HHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccce Q lcl|NC_016071. 343 QELVNSRKKAILDRFGAGFINLGNDGQGSYN-LSESK-QSIHGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPK 420 (516) Q Consensus 343 ~~li~~~d~~Isk~iLGqtLts~~~~~GS~A-l~~vh-~ev~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~ 420 (516) .+..++.-++|++++.-..--++...+++++ -.+-. .......+.-.++.|++.||+.|++.....+ ..+.+ T Consensus 291 ~e~~~~~~~~Ia~~fgVpp~~lG~~~~~~~~s~~e~~~~~f~~~~l~P~~~~ie~~ln~~l~~~~~~~~------~~~i~ 364 (467) T protein:vir:31 291 LEFRGRNEHDILKVHDVPPVIAGVVESGAFSTDAEEQRKEFAEETIQPKQHDFGELLYELVHKQGLDAP------DWTIE 364 (467) T ss_pred HHHHHHHHHHHHHHhCCCHHHcccCCCCCcccCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhccC------CceEE Confidence 7788888899999866543223222223442 22222 2334555788899999999998876543222 12335 Q ss_pred EEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccccCccc---ccCCCCCCcccccccccCCC Q lcl|NC_016071. 421 LKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMSTDELL---KLLGQDTSRSGDGMTAGSNG 497 (516) Q Consensus 421 ~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~ 497 (516) |.+......|.+..+++++.++..|++.+ +.+|+.+|+|+- ++++..+... ...+++.+....+....... 
T Consensus 365 f~~~~l~~~d~~~~~~~~~~~~~~G~~T~-----NE~R~~~Gl~pi-~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 438 (467) T protein:vir:31 365 FELAKPDTKLQDVEIASQRVQAMQGLLTV-----NELRDEFGFEPF-PEEHVYGGETLVAEVTGGSGPGGGIGDQIEQLV 438 (467) T ss_pred EecchhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCC-CcccccCCcccccccccccCCCCcccCcCCCCC Confidence 66667777899999999999999999876 579999999854 3333221111 11111111100000000000 Q ss_pred CCcccccccccchhhhhcC Q lcl|NC_016071. 498 NGTGKISSTRDNSVSNMDN 516 (516) Q Consensus 498 ~~~~~~~~~~d~~~~~~~~ 516 (516) +.+....-|...+..+. T Consensus 439 --~~~~~~~~~~~~~~~~~ 455 (467) T protein:vir:31 439 --EDRADEIIDSYQADLET 455 (467) T ss_pred --CCcccchHhhhhhcccc Confidence 00000011111111111 No 19 >protein:vir:99452 Length: 651 # NCBI annotation: hypothetical protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919077;genbank:gi:119757035;genbank:GeneID:4606105 Probab=99.75 E-value=5.4e-17 Score=109.77 Aligned_cols=482 Identities=11% Similarity=0.050 Sum_probs=231.3 Q ss_pred CCccc-cCcccccchhhhcccCCCCcccccchH-HHHHHHHHHHhhcccccCCcccHHHHHHHhh-ChHHHHHHHHHHHH Q lcl|NC_016071. 1 MSTRF-AQPSEVVKAGNENLAVSRLRTGELGSG-ALSQLRAESEVMKVEELRWPCFLATVEAMKQ-DHTVSTALDTKYVF 77 (516) Q Consensus 1 ~~~r~-~~~~~~~~~~~~~p~~~~~~~~e~g~~-~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~-D~~v~s~l~~Rk~~ 77 (516) ||..+ ...++|.+.....-.. +.+..+-+.. +-..+. ...+-++.|-..+-...|.+ -+.+.+|++..+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~-----~~~~~~~p~~~~~~L~~~~e~~~~~~~~i~~~~~~ 74 (651) T protein:vir:99 1 MTDTTGETQETKVHVEGLGGEA-DLAKSPNSTQIPDHRIQ-----SHNVGVNPPYNPDRLAAFLELNETLATGIRKKSRY 74 (651) T ss_pred CCCccceeeeeEEEeecccccc-cccccccccccchhhhc-----ccCCCCCCCCCHHHHHHHHhcChHHHHHHHHHhhh Confidence 87665 3334444433211011 1111111110 001111 11223444545666677776 89999999999999 Q ss_pred HhcCCceeeeCCCCC-ChhhHHHHHHHHHHHhh-----------ccCcCCHHHHHHHH-HHHHhhcceeeeEEEeecccc Q lcl|NC_016071. 
78 VTKAFNDFKVLYNRD-SKASKDAAEFVEYALKN-----------LANQQTLRDIARSA-ATFNEYGFSIFEKVYRTESAP 144 (516) Q Consensus 78 v~~~~w~i~~~~~~d-~~~~~~~a~~v~~~l~~-----------~~~~~~~~~~l~~~-lda~~~G~S~~Eivw~~~~~~ 144 (516) |.++.|++++....+ +..+++.-+.++.+|+. ++...++..++..+ .|-..+||+++|++=... T Consensus 75 iag~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~n~~~t~~~i~~~~~~Dle~tGna~ieiIrn~~--- 151 (651) T protein:vir:99 75 EVGFGFDLVPAQGVDGDDASDAQREVARNFWRGRSSRWQTGPNQAKTPATPERVKELARQDYHGVGWLALEMLTDIE--- 151 (651) T ss_pred hhccCceeeecccCCCCccchHHHHHHHHHhhccchhhcccccccCCCCCHHHHHHHHHHHHHHHhhHhhhhhhcCc--- Confidence 999999998754433 23344444455555543 12234566777654 467788999999853221 Q ss_pred cccccceeecc-------------------ccccCchhcc-------------ccccee-ec-CCCceeeecccc--ccc Q lcl|NC_016071. 145 SKYAGYITIDK-------------------IAFRPQSSLS-------------RSKPWV-FD-EDGRTLKGIYQS--KMA 188 (516) Q Consensus 145 ~~~~g~~~~~~-------------------l~~r~q~ti~-------------~~~~f~-~~-~dg~~l~~~~q~--~~~ 188 (516) ..+.+.+.+.. +..+|..... ...||. +. ..++........ ... T Consensus 152 g~pv~L~~lp~~~~Rv~~~~~~~~~~~~~ll~~~pn~~~~~~~~~~~~q~~~~~~~~~~~~g~~~~~~~~~~~~~~~~v~ 231 (651) T protein:vir:99 152 GRPVGLAYVPARTVRVRRPQNRFDQPRHPEEGRYVDGDVADIASRGYVQIRNGNRRYFGEAGDRYRGQEVVIDESGDEPT 231 (651) T ss_pred cchhhhhhcChhheeeecccccccchhhhhhhcccccccchhHHHHHHHHHhcCcceEEEeeccccceeeeeccCCccee Confidence 11111111100 0011100000 000110 00 000000000000 000 Q ss_pred cccccccccccc-----cccccccccccCCCccccccccEEEEeecCcCCccccchhHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_016071. 189 FANFQNGLTQIS-----SAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPAGVSPLVGCYRAFREKILIENLETIGAS 263 (516) Q Consensus 189 ~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~e 263 (516) ............ ...+.+ ..........+|.+.+|.+++....+.++|.|.+..+......-....++...|.. 
T Consensus 232 ~~~~~d~~~~~~~~~~~~~~g~~-~~~~~~~~~~~~~~eViHir~~~~~~g~~G~spl~~a~~~i~~a~~a~~~~~~~f~ 310 (651) T protein:vir:99 232 IRYREDEESEREPIFVDRETGDV-TTGDANGLENRPANELIFIPNPSILEDDYGVPDWVSAIRTISADEAAKDYNRDFFD 310 (651) T ss_pred EEeccCcceeeeeecccceeeeE-EEcCCCceeEecccceEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHh Confidence 000000000000 000000 01112233456777777666655567789999999999888777777777777776 Q ss_pred hccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEeccCcccc-cccccceeeeeccccC-cchh Q lcl|NC_016071. 264 KDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILPSDMNAQ-GGEQYKMSLKGIDGAG-KQYS 341 (516) Q Consensus 264 r~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~-~~e~~~iel~~~~g~g-~~~~ 341 (516) ..+.+--+++.|. ...++++. +++++..++...|..-.++|+.+.... ......+++...+-+. .... T Consensus 311 NG~~p~gil~~~~-------~~ls~e~~---~~lr~~~~~~~~nagk~~vL~~~~~~~~~~~~~g~~~~pls~~~~~D~q 380 (651) T protein:vir:99 311 NDTIPRMVIKVTG-------GELSEESK---RDLRQMLNGLREESHRAVVLEVEKFQSQLDEDVEIELEPMGQGISEEMD 380 (651) T ss_pred ccCCCceEEEecC-------CCCCHHHH---HHHHHHHHHHhccCCceEEeecccccccccccCCceEEEcCcCchhhHH Confidence 5444433443321 11233333 334444444444444455666422111 0111234444443322 2345 Q ss_pred HHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccce Q lcl|NC_016071. 342 TQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQSIH-GHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPK 420 (516) Q Consensus 342 ~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~ev~-~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~ 420 (516) |.+..++...+|++++.-...-++..+.+++|..+.+...+ .+.++-.++.|++.||+.|++...... +. 
.-+ T Consensus 381 fle~r~~~~~eIa~afgVPp~~lG~~~~~~~sn~E~~~~~f~~~tL~P~~~~ie~eln~kLl~~~e~~~-----~~-~i~ 454 (651) T protein:vir:99 381 FRQFREKNEHEIAKVLEVPPVKIGVTDSANRSNSDQQDKDFALEVIQPEQHTFAEWLYQIIHQQALGVT-----DW-TIE 454 (651) T ss_pred HHHHHHHHHHHHHHHhCCCHHHhccCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccccc-----Cc-eEE Confidence 77788888999999988876544434446777777666544 567888999999999998877543322 10 113 Q ss_pred EEecC--cCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCC--cccccCcccccCCCCCCcccccccccCC Q lcl|NC_016071. 421 LKPGL--IQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIP--EDMSTDELLKLLGQDTSRSGDGMTAGSN 496 (516) Q Consensus 421 ~~~~~--~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 496 (516) |.|+. ....|.+..+++++.+++.|++.+ +.+|+.+|+|+-.+ .++................++......+ T Consensus 455 ~ef~~~~llr~D~~~~~e~~~~~i~~G~~T~-----NE~R~~lglppi~~~~gd~~l~~~~~~~~g~~~~gge~~~~~~~ 529 (651) T protein:vir:99 455 YELRGADQPKQEAQLAEQRVRAMRLAGVGLV-----DEAREELGLDPLGEPYGEMTLSEFEAEVAGDVAGGGETEAVHEP 529 (651) T ss_pred EEeccchhhhccHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCccccccccccccccccccccCCCCcccccC Confidence 44543 455788999999999999999876 57999999985432 1221111000000011111111000000 Q ss_pred CCCcccccccccchhhh---hcC Q lcl|NC_016071. 497 GNGTGKISSTRDNSVSN---MDN 516 (516) Q Consensus 497 ~~~~~~~~~~~d~~~~~---~~~ 516 (516) ..+...+..+..+. |.. T Consensus 530 ---~~~~~~~~~e~~~~~~~~~~ 549 (651) T protein:vir:99 530 ---PEENKIGEREWDTVKSELTT 549 (651) T ss_pred ---ccccccccchhhhhhhhhcc Confidence 00011111111111 111 No 20 >protein:vir:1266 Length: 416 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690758;genbank:gi:22854998;genbank:GeneID:955213 Probab=99.75 E-value=3.8e-17 Score=110.62 Aligned_cols=406 Identities=10% Similarity=-0.001 Sum_probs=216.4 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhhChHHHHHHHHHHHHHhc Q lcl|NC_016071. 
1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQDHTVSTALDTKYVFVTK 80 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~~v~~ 80 (516) .++-+...+....... ...+ .....+.+. + ...+..+ ..+..++-+.|.+|+..+...|.+ T Consensus 3 ~~~~f~~~~~~~~~~~---~~~~--------~~~~~~~~~------~-~~~~~~v-~~~~al~~~~v~~~i~~Ia~~ia~ 63 (416) T protein:vir:12 3 LERMFEKRSGSSDHED---GFNN--------ILLNMFGGR------K-TASGERV-SESNSLVQPDIFACVNVLSDDIAK 63 (416) T ss_pred cchhcccccCccccCc---cchh--------HHHHhhcCc------c-cccCcee-chhhhhccHHHHHHHHHHHHhhhh Confidence 3332322221111000 0000 000011110 0 0111111 134556788999999999999999 Q ss_pred CCceeeeCCCCCChhhHHHHHHHHHHH-hhccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceeeccccc Q lcl|NC_016071. 81 AFNDFKVLYNRDSKASKDAAEFVEYAL-KNLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITIDKIAF 158 (516) Q Consensus 81 ~~w~i~~~~~~d~~~~~~~a~~v~~~l-~~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l~~ 158 (516) +++++...........++ .-+...| .+-+...++.++++.++ +.+.+|-+++++++...+ . +..|.+ T Consensus 64 l~~~~~~~~~~~~~~~~~--~~l~~~l~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~G-------~--~~~L~~ 132 (416) T protein:vir:12 64 LPIHTYKRTDGGIERKPE--HKSAHAVYARPNPYMTAFTWKKLMMTHVLTWGNAYSYIQFGSHG-------Y--PEALFP 132 (416) T ss_pred CceEEEEecCCccccccc--cHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCC-------c--EEEEEE Confidence 999875433222111111 0122223 23344466778888876 467799999999875432 1 233444 Q ss_pred cCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCccccc Q lcl|NC_016071. 159 RPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPAGV 238 (516) Q Consensus 159 r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~G~ 238 (516) .++.++. ...+.+++.+.. .....+..+.+|...++.+++.. .+.++|. 
T Consensus 133 l~~~~v~----v~~~~~~~~~~~--------------------------~~~~~g~~~~~~~~eiih~~~~~-~~~~~G~ 181 (416) T protein:vir:12 133 LRPDYTN----AYVHPTTGMLWY--------------------------QTVLNGKAIELYDYEVLHFKGLS-TDGIHGK 181 (416) T ss_pred ECCcceE----EEEeCCCcEEEE--------------------------EEecCCeEEEecCccEEEecCcC-CCCcccc Confidence 4444332 223344332210 01112334567888776666554 4558999 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEeccCc Q lcl|NC_016071. 239 SPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILPSDM 318 (516) Q Consensus 239 gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~ 318 (516) |.+..++...-.-....++...+.+.-+.+=-+++.+ ...++++. +++++..+.+.. ....+++|.|+ T Consensus 182 s~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~--------~~~~~e~~---~~~~~~~~~~~~-~~~~~vl~~g~ 249 (416) T protein:vir:12 182 SPIGVVREHIGAQAAATKYNAKLYKNEATPRGILKVP--------AFLDEKPK---ENVRKEWKRVNK-VENIAIIDYGL 249 (416) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHhcCCCCceEEecC--------CCCCHHHH---HHHHHHHHHHhc-CCCeeecCCCc Confidence 9999999887777767777777776544433333322 22333333 334444443332 23467788887 Q ss_pred ccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHHH-HHHHHHHHHHHHHHHHH Q lcl|NC_016071. 319 NAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQS-IHGHFVQRDIDIIVEAF 397 (516) Q Consensus 319 ~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~e-v~~~~~~aDa~~i~~~l 397 (516) +++. + +-+....+|.+..++..++|++++--..-..+....++++-.+.+.. 
....-+.-.+++|++.| T Consensus 250 ~~~~--------l--~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~f~~~~l~P~~~~ie~~l 319 (416) T protein:vir:12 250 EYQS--------I--SMPLQEAQFVESMKFNKAQISMIYKVPLHKLNELDKATFSNIEHQSIEYVRNTLQPWIVNFEQEL 319 (416) T ss_pred eEEE--------c--cCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCccCCCcccHHHHHHHHHHHHHHHHHHHHHHHH Confidence 5332 2 22334445777888899999998887654444334466765554443 44667888999999999 Q ss_pred HHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccccCccc Q lcl|NC_016071. 398 NKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMSTDELL 477 (516) Q Consensus 398 n~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~~~~~ 477 (516) |+.|+...-... ..+-+|.++..-..|.+..+++++++++.|++.+ +.+|+.+|+|+-+.+|+...... T Consensus 320 ~~~l~~~~~~~~------g~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~-----NE~R~~~gl~Pi~ggd~~~~~~n 388 (416) T protein:vir:12 320 NVKLFLDHDQKS------GHYVKFNIDSELRGDSKTQAEYLKTLHETGVLNK-----DEIRELLERNPIENGDKYISSLN 388 (416) T ss_pred HHhhcCchhhcC------CceEEeechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCcceeeeccc Confidence 988775432111 1122333344456788999999999999999876 57999999997655554321111 Q ss_pred ccCCC-----CCCcccccccccCCCCCcc Q lcl|NC_016071. 478 KLLGQ-----DTSRSGDGMTAGSNGNGTG 501 (516) Q Consensus 478 ~~~~~-----~~~~~~~~~~~~~~~~~~~ 501 (516) -...+ ..+.++...+.|.. 
+..+ T Consensus 389 ~~~~~~~~~~~~~~~~~~~~gge~-~~~g 416 (416) T protein:vir:12 389 YVFLDFLEEYQRLKAGGAMKGGDN-KNEG 416 (416) T ss_pred cccccccchhhccccccccCCCCC-cCCC Confidence 00000 01111111111110 0111 No 21 >protein:vir:3843 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050149;swissprot:trembl:q9t1f8;genbank:gi:9633041;uniprot:Q9T1F8;genbank:GeneID:1262206 Probab=99.72 E-value=2.8e-16 Score=105.85 Aligned_cols=391 Identities=11% Similarity=0.045 Sum_probs=203.6 Q ss_pred CCcc--ccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhhChHHHHHHHHHHHHH Q lcl|NC_016071. 1 MSTR--FAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQDHTVSTALDTKYVFV 78 (516) Q Consensus 1 ~~~r--~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~~v 78 (516) |.== .++.++... + ..+ .|..+ .... ..+..+ ..+..++-+.|.+|+..+-..| T Consensus 1 M~~f~~~~~~~~~~~-------~--------~~~---~~~~~---~~~~--~~~~~v-~~~~al~~~~V~~~v~~ia~~i 56 (397) T protein:vir:38 1 MPLLKLNKSHSQGFS-------L--------NDP---DWVNF---LTGG--EAQKYV-SADTALKNSDIFSLIMQLSGDL 56 (397) T ss_pred CcchhhhhcccCccc-------C--------Cch---hhhhh---hcCC--cCCcee-chHHhhccHHHHHHHHHHHHHH Confidence 3321 111100000 0 000 01101 0000 001111 1244567889999999999999 Q ss_pred hcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceeecccc Q lcl|NC_016071. 79 TKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITIDKIA 157 (516) Q Consensus 79 ~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l~ 157 (516) .+++|+++ ++ . +...+.+-+...++.++++.+. +.+.+|.+++++++...+ . +..|. 
T Consensus 57 a~~p~~~~------~~---~----~~~l~~~PN~~~s~~~f~~~~~~~lll~Gna~~~i~r~~~g-------~--~~~l~ 114 (397) T protein:vir:38 57 AMVRYTSE------SD---R----SQSIISNPSVTANGYSFWQGMFAQLLLDGNCYAYRHKNTNG-------V--DLSWE 114 (397) T ss_pred hhCccccc------cc---H----HHHHHhcCCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCC-------c--EEEEE Confidence 99988642 11 1 2223334444567888888877 567799999999986432 1 23444 Q ss_pred ccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCcccc Q lcl|NC_016071. 158 FRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPAG 237 (516) Q Consensus 158 ~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~G 237 (516) +.++.+++ ...+.+|+.+..... . .....+....+|..-+|++++....+.++| T Consensus 115 ~l~~~~v~----i~~~~~~~~~~y~~~-------~---------------~~~~~~~~~~~~~~eiih~~~~~~~~~~~G 168 (397) T protein:vir:38 115 YLRPSQVQ----PMLLQDGSGLIYNIN-------F---------------DEPAIGYMENVPAADVIHIRLLSKNGGKTG 168 (397) T ss_pred EEcCceeE----EEEcCCCceEEEEEE-------e---------------ccccccceeEecCccEEEecCCCCCCcccc Confidence 55554332 234455543221000 0 001122234577777777777777777899 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceE--EEec Q lcl|NC_016071. 
238 VSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAY--FILP 315 (516) Q Consensus 238 ~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~--~iiP 315 (516) .|.+..+....-.-....++...+....+.+--+++.+ .+...++.+.+ ++.......+..++ ++++ T Consensus 169 ~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~--------~~~~~e~~~~~---~~~~~~~~~~~n~~~~~vl~ 237 (397) T protein:vir:38 169 ISPLSALINEQQIKDASNELTLKALKQSVTASAVLTIQ--------KGGLLDAETRI---ARSKEISKQIHNSDGPVVID 237 (397) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeC--------CCCCHHHHHHH---HHHHHHHhcccccCCceecC Confidence 99999999888887777777777777655554444433 22233333333 22223333344333 5566 Q ss_pred cCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016071. 316 SDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQSIHGHFVQRDIDIIVE 395 (516) Q Consensus 316 ~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~ev~~~~~~aDa~~i~~ 395 (516) .|++ +...+.+....+|.+..++.-.+|++++.-..--.+... ++++..+-........++-.++.|++ T Consensus 238 ~g~~----------~~~l~~~~~d~~~~e~~~~~~~~Ia~afgVp~~~lg~~~-~~~~~~e~~~~~~~~~l~P~~~~ie~ 306 (397) T protein:vir:38 238 ALED----------YKPLEVKGNIASLLNQVDWTRDQIAKVYGVPDSYLNGQG-DQQSSITQISGQYAKSLNRYVQAIVG 306 (397) T ss_pred CCce----------EEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCC-CcccHHHHHHHHHHHHHHHHHHHHHH Confidence 6653 222333334455778888899999998766443332222 22222221223334567778888999 Q ss_pred HHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccccCc Q lcl|NC_016071. 396 AFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMSTDE 475 (516) Q Consensus 396 ~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~~~ 475 (516) .||+.|++.+ .++ +.+ .-..|.+..++++++|++.|++.+ +.+|+.+|+|+-.++|..... 
T Consensus 307 ~ln~~l~~~~-~~~-----------~~~--~~~~d~~~~~~~~~~~~~~G~~t~-----nE~R~~lg~~p~~~~d~~~~~ 367 (397) T protein:vir:38 307 ELNDKLHANI-SAN-----------IRF--AIDAMGDQYASTISSSVKGGTIAG-----NQARFILQNSGYLAKDLPDPE 367 (397) T ss_pred HHHHhccChh-ccc-----------ccc--cccCCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCcccccc Confidence 9998776542 121 112 123577888999999999998875 579999999865444422211 Q ss_pred ccc-cCCCCCCcccccccccCCCCCcccccc Q lcl|NC_016071. 476 LLK-LLGQDTSRSGDGMTAGSNGNGTGKISS 505 (516) Q Consensus 476 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~ 505 (516) ... ...+.... ..+...+..++..+..+. T Consensus 368 ~~~~~~~~~~~~-~~g~~~~~~~~e~~~~~~ 397 (397) T protein:vir:38 368 KEPQQAIQLIQQ-EGGENDGNNSDERGSDPE 397 (397) T ss_pred cccccccccccc-ccCCCCCCCCCCCCCCCC Confidence 111 11111111 111111111111111111 No 22 >protein:vir:1380 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612832;genbank:gi:20065966;genbank:GeneID:935782 Probab=99.72 E-value=1.8e-16 Score=106.84 Aligned_cols=394 Identities=11% Similarity=0.028 Sum_probs=212.4 Q ss_pred cchHHHHHHHHHHHh-------------hcccccCCcc---------cHHHH-HHHhhChHHHHHHHHHHHHHhcCCcee Q lcl|NC_016071. 29 LGSGALSQLRAESEV-------------MKVEELRWPC---------FLATV-EAMKQDHTVSTALDTKYVFVTKAFNDF 85 (516) Q Consensus 29 ~g~~~~~~~~~~~~~-------------~~~~~lr~~~---------~~~~y-~~m~~D~~v~s~l~~Rk~~v~~~~w~i 85 (516) +|= +.++... +....+..+. ...+. 
+..++-+.|.+|+..+-..|.++++.+ T Consensus 1 MG~-----f~~lf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~v~~~~al~~~~v~~ci~~ia~~iA~lp~~~ 75 (422) T protein:vir:13 1 MGF-----LRGLFNKKNNNDEKRSNYDEDIGIDISDSNFWEKFGIKLNFSVRGKRALKENTVYVCTKIRAESIGKLSLKI 75 (422) T ss_pred Cch-----hhhhhhccCCccchhhhhhhccccccCcchhhhhccccCCcccchhhhhccHHHHHHHHHHHHhhhhCceEE Confidence 221 1111100 0000011111 11122 233456789999999999999999887 Q ss_pred eeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHHH-HHhhcceeeeEEEeecccccccccceeeccccccCchhc Q lcl|NC_016071. 86 KVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAAT-FNEYGFSIFEKVYRTESAPSKYAGYITIDKIAFRPQSSL 164 (516) Q Consensus 86 ~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~ld-a~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l~~r~q~ti 164 (516) .-. .+...+..++.++.. +-+...++.++++.++. .+.+|-+.+++++...+ . +..|.+.++.++ T Consensus 76 ~~~--~~~~~~~~~~~lL~~---~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G-------~--~~~L~~i~~~~v 141 (422) T protein:vir:13 76 YKD--KEEYKEHELYYLLRY---KPNPLMSSINFWKCLETQRTLKGNAYAYIERDRKG-------K--IIGLYPINSDNV 141 (422) T ss_pred Eec--CcccccchHHHHHhh---hcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCC-------c--EEEEEEECCcce Confidence 432 222222233333321 23344567788887664 67799999999876532 2 334455555544 Q ss_pred ccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCccccchhHHHH Q lcl|NC_016071. 165 SRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPAGVSPLVGC 244 (516) Q Consensus 165 ~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~G~gLlr~~ 244 (516) . ...++||.... ... 
-++......+....++++..|++++....+.++|.|.+..+ T Consensus 142 ~----~~~~~~~~~~~-~~~-------------------~~y~~~~~~g~~~~~~~~eiih~~~~~~~~~~~G~s~~~~~ 197 (422) T protein:vir:13 142 T----KIIDDDNFLSS-LSK-------------------VWYVVTDKNGKEHKLLPDEMLHFIGDITLDGLIGIKPLDYL 197 (422) T ss_pred E----EEEcCCcceec-cce-------------------EEEEEEeCCCeEEEEcccceEEEcCCCCCCCcccccHHHHH Confidence 3 23344442110 000 00111122334456788888877766666778999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcc---cceEEEeccCcccc Q lcl|NC_016071. 245 YRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAG---EQAYFILPSDMNAQ 321 (516) Q Consensus 245 ~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g---~~a~~iiP~g~~i~ 321 (516) ....-.-....++...+...-+.+--+++.+ ...+++.. +.+++.......| ....++++.|++.+ T Consensus 198 ~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~--------~~l~~e~~---~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~ 266 (422) T protein:vir:13 198 RCTIENGRATQEFINKFFKNGLSIKGIVQYV--------GDLDEKAK---KIFKKEFESMSNGLENAHSISLLPFGYQFQ 266 (422) T ss_pred HHHHHHHHHHHHHHHHHHhccCCccEEEEeC--------CCCCHHHH---HHHHHHHHHHhcCccccCCceecCCCceee Confidence 8876666666666666666533333333322 22233332 3344444433333 22346778887533 Q ss_pred cccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHH-HHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016071. 322 GGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQ-SIHGHFVQRDIDIIVEAFNKN 400 (516) Q Consensus 322 ~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~-ev~~~~~~aDa~~i~~~ln~~ 400 (516) - ++. +....++.+..++...+|++++--..-..+..+.++++-.+-+. ..-..-+.-.++.|++.||+. 
T Consensus 267 ~--------l~~--~~~d~q~le~~~~~~~~Ia~~fgVpp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~l~~~ 336 (422) T protein:vir:13 267 P--------ISL--SMADAQFLENSKLTKRELAATFGMKSYHLNDLERATFNNLTEQQKDFYVTTLQSSLTVYEQEIQDK 336 (422) T ss_pred e--------ccC--ChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 2 111 22334566777888899999888766444333345565444333 444566888899999999998 Q ss_pred HHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccccCcccccC Q lcl|NC_016071. 401 LIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMSTDELLKLL 480 (516) Q Consensus 401 li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~~~~~~~~ 480 (516) |++..-... + .+-+|.++.....|++..+++++++++.|++.+ +.+|+.+|+|+-+.+|+......-.+ T Consensus 337 Ll~~~~~~~-----g-~~i~fd~~~l~r~d~~~~~~~~~~~~~~G~~T~-----NE~R~~~gl~p~~ggD~~~~~~n~~~ 405 (422) T protein:vir:13 337 LFSQYETLQ-----D-VKAEFNVDTILRSDIKTRYEAYRIGIQGGFIEA-----NEARRRENLPPVEGGDRLLVNGNMIP 405 (422) T ss_pred hCChhhhcC-----C-ceEEeechhhhcCCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCcCeeeeccCccc Confidence 877542211 1 122344445555688999999999999999876 57999999997655554321111000 Q ss_pred CCCCCcccccccccCCCCCccc Q lcl|NC_016071. 481 GQDTSRSGDGMTAGSNGNGTGK 502 (516) Q Consensus 481 ~~~~~~~~~~~~~~~~~~~~~~ 502 (516) -+ ..++..+.+ |+.-++ T Consensus 406 l~---~~~~~~~~~--g~~~g~ 422 (422) T protein:vir:13 406 IE---MAGEQYKKG--GEKGGK 422 (422) T ss_pred hh---hcccccccC--CCcCCC Confidence 00 001100000 111111 No 23 >protein:vir:7853 Length: 518 # NCBI annotation: gp10 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817460;genbank:gi:29565889;genbank:GeneID:1259085 Probab=99.71 E-value=7.4e-16 Score=103.54 Aligned_cols=427 Identities=12% Similarity=0.034 Sum_probs=207.5 Q ss_pred ccchhhhcccCCCCcccccchHHHHHHHH-HHHhhcccccCCcccHHHHHH-HhhChHHHHHHHHHHHHHhcCCceeeeC Q lcl|NC_016071. 
11 VVKAGNENLAVSRLRTGELGSGALSQLRA-ESEVMKVEELRWPCFLATVEA-MKQDHTVSTALDTKYVFVTKAFNDFKVL 88 (516) Q Consensus 11 ~~~~~~~~p~~~~~~~~e~g~~~~~~~~~-~~~~~~~~~lr~~~~~~~y~~-m~~D~~v~s~l~~Rk~~v~~~~w~i~~~ 88 (516) +.-+.-+.++.|. ..|+...-...+.+ ...... -.....++-. .++.+.|.+|+..+-..|.+++|.+.-. T Consensus 1 ~~~~~~~~~~~p~--~~~~~~~~~~~~~~~~~~g~~-----~~~~~~~~~~~~~~~~~V~acV~~IA~~iA~lp~~l~~~ 73 (518) T protein:vir:78 1 MLLANGQTLSAPA--MAELSPQMQDSYYYAPAVGMQ-----LERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFT 73 (518) T ss_pred CcccCceeeccch--hhhhhhhhhhcccccceecee-----cccccchhhHHhhhhHHHHHHHHHHHHhhccCceEEEEE Confidence 1111111111111 11111110000000 000000 0011222222 2358999999999999999999988543 Q ss_pred CCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceeeccccccCchhcccc Q lcl|NC_016071. 89 YNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITIDKIAFRPQSSLSRS 167 (516) Q Consensus 89 ~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l~~r~q~ti~~~ 167 (516) .+ +... ++.-..+...+.+-+...+..+++..++ +.+.+|-+++++++...+ ++. .|.+.++..++ T Consensus 74 ~~-~~~~-~~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~G---~~~------~L~~l~p~~Vt-- 140 (518) T protein:vir:78 74 SG-DTET-EEHDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSG---TPE------KLMPMHPSRVA-- 140 (518) T ss_pred cC-Cccc-cccchHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCC---cEE------EEEEECCCceE-- Confidence 32 2111 1111122334444444556777887766 466789999999875432 122 33333333221 Q ss_pred cceeecCC-CceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCccccchhHHHHHH Q lcl|NC_016071. 168 KPWVFDED-GRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPAGVSPLVGCYR 246 (516) Q Consensus 168 ~~f~~~~d-g~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~G~gLlr~~~~ 246 (516) ...+.+ +......+.. .......+.+|...+|++++....+..+|.|.+..+.. 
T Consensus 141 --v~~~~~~~~~~y~~~~~-----------------------~~~~~~~~~~~~~eIiHir~~~~dg~~~G~Spi~~~~~ 195 (518) T protein:vir:78 141 --IKRNSRTGRYEYYFQAG-----------------------AGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKS 195 (518) T ss_pred --EEEcCCCCEEEEEEEec-----------------------CCccceeEEecCCcEEEecCCCCCcccccccHHHHHHH Confidence 112222 2111111000 00111233567777666665555566789999999988 Q ss_pred HHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcc-cce--EEEeccCcccccc Q lcl|NC_016071. 247 AFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAG-EQA--YFILPSDMNAQGG 323 (516) Q Consensus 247 ~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g-~~a--~~iiP~g~~i~~~ 323 (516) ..-.-....++...|....+.|--+++.+ ..-+.++. +++++.......| .++ .++++.|++.+ T Consensus 196 ~i~~~~aa~~~~~~~f~Ng~~p~gvl~~~--------~~ls~e~~---~~~k~~~~~~~~G~~nag~~~vL~~G~~~~-- 262 (518) T protein:vir:78 196 TIFSEDSSRNATAAMWKNAGRPNLVLRHE--------KRLSPEAQ---QRLREQFDRAHAGSSNTGKTMVVEEGMEPI-- 262 (518) T ss_pred HHHHHHHHHHHHHHHHhcCCCccEEEecC--------CCCCHHHH---HHHHHHHHHHhcCcccCCceeEcCCCceEE-- Confidence 77766667777777776544443333322 22223332 3344444433334 233 46667776422 Q ss_pred cccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016071. 324 EQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQS-IHGHFVQRDIDIIVEAFNKNLI 402 (516) Q Consensus 324 e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~e-v~~~~~~aDa~~i~~~ln~~li 402 (516) ..+-+.....|.+..++...+|++++--..-..+..+.++++-.+.+.. 
.....+.-.++.|++.||+.|+ T Consensus 263 --------~l~~~~~d~q~le~r~~~~~eIa~afgVPp~~lg~~~~st~sn~e~~~~~f~~~tL~P~~~~ie~eln~~L~ 334 (518) T protein:vir:78 263 --------PLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTMAIPIARIQSAMDKYVG 334 (518) T ss_pred --------eccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 2211223344667777888999998877654343333456655554443 3455688889999999998775 Q ss_pred HHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCC--Cccccc-Cccccc Q lcl|NC_016071. 403 PQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEI--PEDMST-DELLKL 479 (516) Q Consensus 403 ~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~--~~~~~~-~~~~~~ 479 (516) +.+ .. ..+-+|..+..-..|.+..++++.++++.|++.+ +.+|+.+|+|+-. ..|+.. .....+ T Consensus 335 ~~~-------~~-~~~~~fd~~~Llr~D~~~r~~~~~~~~~~G~lT~-----NE~R~~~gl~pie~~~gD~~~v~~n~~p 401 (518) T protein:vir:78 335 QYW-------VR-KNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATP-----NEGREIMGLPRSDDPKADELYANSALQP 401 (518) T ss_pred ccc-------cC-cceEEeechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCCCceeeeccccee Confidence 432 11 1222333345556889999999999999999876 5799999998543 222211 000000 Q ss_pred -------CCCCCCcccccccccCCC----C-----CcccccccccchhhhhcC Q lcl|NC_016071. 480 -------LGQDTSRSGDGMTAGSNG----N-----GTGKISSTRDNSVSNMDN 516 (516) Q Consensus 480 -------~~~~~~~~~~~~~~~~~~----~-----~~~~~~~~~d~~~~~~~~ 516 (516) ......+..+...+.++. . 
.....+...+.......+ T Consensus 402 l~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 454 (518) T protein:vir:78 402 LGATPDGAVEGEEAPAPKRPASTPVASLDQSPPASVPGLSPTNSDRSTDSGKT 454 (518) T ss_pred cccccccccCCCCCCCCCCCCcccccccccCccccCCCCCccccccccccccc Confidence 000000000000000000 0 000001111122222222 No 24 >protein:vir:101648 Length: 518 # NCBI annotation: gp11 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654766;genbank:gi:109302764;genbank:GeneID:4156082 Probab=99.71 E-value=9.2e-16 Score=103.01 Aligned_cols=427 Identities=11% Similarity=0.035 Sum_probs=210.5 Q ss_pred ccchhhhcccCCCCcccccchHHHHHHHH-HHHhhcccccCCcccHHHHHHH-hhChHHHHHHHHHHHHHhcCCceeeeC Q lcl|NC_016071. 11 VVKAGNENLAVSRLRTGELGSGALSQLRA-ESEVMKVEELRWPCFLATVEAM-KQDHTVSTALDTKYVFVTKAFNDFKVL 88 (516) Q Consensus 11 ~~~~~~~~p~~~~~~~~e~g~~~~~~~~~-~~~~~~~~~lr~~~~~~~y~~m-~~D~~v~s~l~~Rk~~v~~~~w~i~~~ 88 (516) +.-++-+.++.|. ..|+...-...+.. ....... .....+|-.+ ++.+.|.+|+..+-..|.++++.+.-. T Consensus 1 ~~~~~~~~~~~p~--~~e~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~a~~~~~V~acV~~IA~~iA~lpl~l~~~ 73 (518) T protein:vir:10 1 MLLANGQTLSAPA--MAELSPQMQDSYYYAPAVGMQL-----ERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFT 73 (518) T ss_pred CcccCceeecCch--hhhhhhhhhcccccccccceec-----ccccchhhHHHhhhHHHHHHHHHHHHhhccCceEEEEE Confidence 1112222222221 01111110000000 0000111 1122333333 357899999999999999999987443 Q ss_pred CCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceeeccccccCchhcccc Q lcl|NC_016071. 89 YNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITIDKIAFRPQSSLSRS 167 (516) Q Consensus 89 ~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l~~r~q~ti~~~ 167 (516) .+ +..... .-..+...+.+-+...+..++++.++ +.+.+|-+++++++...+ ++. 
.|.+.++..++ T Consensus 74 ~~-~~~~~~-~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~G---~~~------~L~~l~p~~v~-- 140 (518) T protein:vir:10 74 SG-DTETEE-SDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSG---TPE------KLMPMHPSRVA-- 140 (518) T ss_pred cC-CCceec-cchHHHHHHcCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCC---cEE------EEEEECCCceE-- Confidence 22 211111 11122334444444566778887776 567899999999875432 122 23333332221 Q ss_pred cceeecCC-CceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCccccchhHHHHHH Q lcl|NC_016071. 168 KPWVFDED-GRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPAGVSPLVGCYR 246 (516) Q Consensus 168 ~~f~~~~d-g~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~G~gLlr~~~~ 246 (516) ...+.+ ++......... .....-+.+|...+|++++....+.++|.|.+..+.. T Consensus 141 --v~~~~~~~~~~y~~~~~~-----------------------~~~~~~~~~~~~eViHir~~s~dg~~~G~spi~~a~~ 195 (518) T protein:vir:10 141 --IKRNSRTGRYEYYFQAGA-----------------------GVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKS 195 (518) T ss_pred --EEEcCCCCEEEEEEEecC-----------------------CccceEEEecCCcEEEecCCCCCcccccccHHHHHHH Confidence 112222 22211111000 0011223466666666655555566789999999988 Q ss_pred HHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcc-cce--EEEeccCcccccc Q lcl|NC_016071. 247 AFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAG-EQA--YFILPSDMNAQGG 323 (516) Q Consensus 247 ~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g-~~a--~~iiP~g~~i~~~ 323 (516) ....-....++...|....+.+--++..+ ..-+++.. +++++.......| .++ .++++.|++.. 
T Consensus 196 ~i~~~~a~~~~~~~~f~ng~~p~gil~~~--------~~ls~e~~---~~~k~~~~~~~~G~~nag~v~vL~~G~~~~-- 262 (518) T protein:vir:10 196 TIFSEDSSRNATAAMWKNAGRPNLVLRHE--------KRLSEAAQ---QRLREQFDRAHSGSSNTGKTMVVEEGMEPI-- 262 (518) T ss_pred HHHHHHHHHHHHHHHHhcCCCccEEEecC--------CCCCHHHH---HHHHHHHHHHhcCccccCcceEcCCCceEE-- Confidence 77777777777777776544433333322 22233333 3344444443334 233 46677776432 Q ss_pred cccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016071. 324 EQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQ-SIHGHFVQRDIDIIVEAFNKNLI 402 (516) Q Consensus 324 e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~-ev~~~~~~aDa~~i~~~ln~~li 402 (516) ..+-+.....|.+..++...+|++++--..-..+..+.++++-.+.+. ......+.-.++.|++.||+.|+ T Consensus 263 --------~l~~s~~D~q~le~r~~~~~eIa~afgVPp~~lg~~~~~t~sn~eq~~~~f~~~tL~P~l~~ie~~ln~~L~ 334 (518) T protein:vir:10 263 --------PLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTMAIPIARIQSAMDKYVG 334 (518) T ss_pred --------EccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 222222333467777888899999887754444333345565555443 33445678889999999998765 Q ss_pred HHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCC--Cccccc-C----c Q lcl|NC_016071. 403 PQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEI--PEDMST-D----E 475 (516) Q Consensus 403 ~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~--~~~~~~-~----~ 475 (516) +.+ . ...+-+|.++..-..|++..++++.+++..|++.+ +.+|+.+|+|+-. ..|+.. . . 
T Consensus 335 ~~~-------~-~~~~~~fd~~~llr~D~~~r~~~~~~~~~~G~lT~-----NE~R~~~Gl~pie~~~gD~~~~~~n~~p 401 (518) T protein:vir:10 335 QYW-------V-RKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATP-----NEGREIMGLPRSDDPKADELYANSALQP 401 (518) T ss_pred ccc-------c-CCceEEEechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCCCCeeeeccccee Confidence 432 1 11222333344556899999999999999999876 5799999998543 223211 0 0 Q ss_pred cc---ccCCCCCCcccccccccCC---------CCCcccccccccchhhhhcC Q lcl|NC_016071. 476 LL---KLLGQDTSRSGDGMTAGSN---------GNGTGKISSTRDNSVSNMDN 516 (516) Q Consensus 476 ~~---~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~d~~~~~~~~ 516 (516) .. ........+..+...+.++ +......+...+.......| T Consensus 402 l~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 454 (518) T protein:vir:10 402 LGATPDGAVEGEEAPAPKRPASTPVASLDQSPPTSVPGLSPTNSDRSTDSGKT 454 (518) T ss_pred cccccccccCCCCCCCCCCCCccccccccccccccCCCCCccccccccccccc Confidence 00 0000000000000000000 00000011122233333333 No 25 >protein:vir:80333 Length: 419 # NCBI annotation: gp4, phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111083;genbank:gi:134288632;genbank:GeneID:4960580 Probab=99.70 E-value=3.7e-16 Score=105.19 Aligned_cols=411 Identities=10% Similarity=-0.004 Sum_probs=212.4 Q ss_pred CC--ccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhhChHHHHHHHHHHHHH Q lcl|NC_016071. 1 MS--TRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQDHTVSTALDTKYVFV 78 (516) Q Consensus 1 ~~--~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~~v 78 (516) |- .+.++......+... + .+ ..+.. 
......+..+ .-+..++-+.|.+|+..+-..| T Consensus 1 m~~~~~~~~~~~~~~~~~~-----~----~~--------~~~~g---~~~s~~~~~v-~~~~al~~~~v~~cv~~ia~~i 59 (419) T protein:vir:80 1 MFFSRQLLSNLGQTQPGSG-----G----WV--------SALLG---SARSEAGQVV-TPASALSLTVLQNCVTLLAESI 59 (419) T ss_pred CCcccccccccCcCCCCcc-----h----hh--------HHhhc---ccccccCccc-ChHHhhccHHHHHHHHHHHHhh Confidence 33 333332221111110 0 00 00000 0000112222 1244556788999999999999 Q ss_pred hcCCceeeeCCCCCChhhHHHHHHHHHHHh-hccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceeeccc Q lcl|NC_016071. 79 TKAFNDFKVLYNRDSKASKDAAEFVEYALK-NLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITIDKI 156 (516) Q Consensus 79 ~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~-~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l 156 (516) .++++++.-..+......++ .-+...|. +-+...+..++++.+. +.+.+|-+++++++...+ . +..| T Consensus 60 a~lp~~~~~~~~~~~~~~~~--~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G-------~--~~~L 128 (419) T protein:vir:80 60 AQLPVELYERSGDDRKPATD--HPLYSILKYEPNPWQTPFEYQEQSQVAVGLRGNSYSFIDRDQDG-------V--IQGL 128 (419) T ss_pred ccCceEEEEecCCCcccccc--cHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCC-------c--EEEE Confidence 99999875332221111111 11233343 2333456778887766 568899999999876432 2 2334 Q ss_pred cccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCccc Q lcl|NC_016071. 157 AFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPA 236 (516) Q Consensus 157 ~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~ 236 (516) .+.++.+++ ...+.+|+.+... .....+|.+-+++.++.+. 
+.++ T Consensus 129 ~~i~~~~v~----i~~~~~~~~~y~~------------------------------~~~~~~~~~~i~h~~~~~~-d~~~ 173 (419) T protein:vir:80 129 YPLDNEAVT----VMKGPDLKPMYRV------------------------------AGADPLPQRLVHHVRWMSI-NGYT 173 (419) T ss_pred EEecCceEE----EEECCCceEEEEE------------------------------cCccccchhheEEecCCCC-CCcc Confidence 444444332 2233444322110 0112355554444445444 4589 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhccc-ce--EEE Q lcl|NC_016071. 237 GVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGE-QA--YFI 313 (516) Q Consensus 237 G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~-~a--~~i 313 (516) |.|.+..++...-.-....++...+...-+.+--+++.|- ......+++..+++++.......|. ++ .++ T Consensus 174 G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~-------~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~v 246 (419) T protein:vir:80 174 GLSPVLLHANAIGHAQAIQQYAGKSFMNGTALSGVIERPT-------DAPALKDQASVDRITDGWNAKFGGSGNAKKVAL 246 (419) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEEecC-------CCCcccCHHHHHHHHHHHHHHhcCccccCCcee Confidence 9999999988776666666777777765444433333221 1111112334555666655554453 22 466 Q ss_pred eccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHH-HHHHHHHHHHHHH Q lcl|NC_016071. 314 LPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQ-SIHGHFVQRDIDI 392 (516) Q Consensus 314 iP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~-ev~~~~~~aDa~~ 392 (516) ++.|+++. ..+-+.....+.+..++..++|++++.-..--.+..+.++++-.+.+. .....-+.-.++. 
T Consensus 247 l~~g~~~~----------~l~~s~~d~q~~e~~~~~~~~Ia~~fgVPp~llg~~~~~t~~n~e~~~~~f~~~~l~P~~~~ 316 (419) T protein:vir:80 247 LQEGMKFK----------PLSMTNVDAALIDALRLSALDIARIYKIPAHMVNELERATFSNIEHQSLQFVIYTLLPWVKR 316 (419) T ss_pred cCCCceEE----------eccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCcccHHHHHHHHHHHHHHHHHHH Confidence 77775422 222223334567777888899999988765444433445665555443 3345557888899 Q ss_pred HHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccc Q lcl|NC_016071. 393 IVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMS 472 (516) Q Consensus 393 i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~ 472 (516) |++.||+.|+.+-- .. ..+.+|.++.....|+++.++++.++++.|++.+ +.+|+.+|+|+-+.+|+. T Consensus 317 ie~~l~~kll~~~~------~~-~~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~-----NE~R~~~g~~p~~gGD~~ 384 (419) T protein:vir:80 317 HEQAKTRDLLLPSE------RK-QYFIEYNLAGLLRGDQSSRYAAYAVGRQWGWLSI-----NDIRRLENMPPVKGGDIY 384 (419) T ss_pred HHHHHhhhccCccc------cC-CeEEEEechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCccee Confidence 99999987764321 11 1223344445556788999999999999999886 579999999976555544 Q ss_pred cCcccccCCCCCCcccccccccCCCCCcccccccccchhhhhc Q lcl|NC_016071. 473 TDELLKLLGQDTSRSGDGMTAGSNGNGTGKISSTRDNSVSNMD 515 (516) Q Consensus 473 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~ 515 (516) ...... .....+. + ...+. 
.+ + ..++-|+...=++ T Consensus 385 ~~~~n~--~~~~~~~-~-~~~~~-~~-~--~~~~~~~~~~~l~ 419 (419) T protein:vir:80 385 LSPMNM--VDASKPQ-P-IPMGK-TE-P--TKAALDEIGRILS 419 (419) T ss_pred eecccc--ccccccc-c-ccCCC-CC-c--hhhhHHHHHhhcC Confidence 321110 0000000 0 00010 00 0 0111122222222 No 26 >protein:vir:63755 Length: 547 # NCBI annotation: gp14 # Family: family:all:2446 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547619;genbank:GeneID:3783506 Probab=99.69 E-value=8.2e-16 Score=103.29 Aligned_cols=438 Identities=12% Similarity=0.081 Sum_probs=203.0 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHH-HHHhhChHHHHHHHHHHHHHh Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATV-EAMKQDHTVSTALDTKYVFVT 79 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y-~~m~~D~~v~s~l~~Rk~~v~ 79 (516) |.++... ...|..+..+ .....+.+|... .-.++. ..|..+.+..+.-. +....-+.|.+|+..++..|. T Consensus 31 ~~~~~~~--~~~k~~~~~~--~~~~~~~~~~~~--~~~g~~---~~~~~~~~~~l~~l~~~~~~npiv~~~I~~~a~~ia 101 (547) T protein:vir:63 31 IQQREQE--QISKAMNNKE--VAYSQPVIGSMS--ANPGFK---TKPSIRNNQDLHGVLKKFGGNIILNAIINTRSNQVS 101 (547) T ss_pred hhhhhHH--HHHHhhcccc--hhhhchhhheee--cccccc---cCCccCChhHHHHHHHHhhcCHHHHHHHHHHHHHHh Confidence 2222111 1111111100 001111121110 000110 11111222222111 111235899999999998887 Q ss_pred c-----------CCceeeeCC--CCCChhhHHHHHHHHHHHhhcc-----CcCCHHHHHHHHH-HHHhhcceeeeEEEee Q lcl|NC_016071. 80 K-----------AFNDFKVLY--NRDSKASKDAAEFVEYALKNLA-----NQQTLRDIARSAA-TFNEYGFSIFEKVYRT 140 (516) Q Consensus 80 ~-----------~~w~i~~~~--~~d~~~~~~~a~~v~~~l~~~~-----~~~~~~~~l~~~l-da~~~G~S~~Eivw~~ 140 (516) + ..|.|.+.. ......++...+.++.+|.+.. .+.+|.+++..++ +.+.+|.+++|+++.. 
T Consensus 102 ~~~~~~~~~~~~~~~~ir~k~~~~~~~~~~~~~~~~l~~~l~~pn~~~~p~~~s~~~f~~~lv~d~ll~Gn~~~~i~rd~ 181 (547) T protein:vir:63 102 MYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIKRIESFIEKTGVDNDINRDSFSSFVKKIVRDTYMYDQVNFEKVFNR 181 (547) T ss_pred hhhhhhhhhccCCCceeEecccccccChhhHHHHHHHHHHHHhhCCCCCCccchHHHHHHHHHHHHHhhCCEEEEEEECC Confidence 4 335554432 1222333434445555555432 2246778887766 5788999999999865 Q ss_pred cccccccccceeeccccccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCcccccc Q lcl|NC_016071. 141 ESAPSKYAGYITIDKIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPI 220 (516) Q Consensus 141 ~~~~~~~~g~~~~~~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~ 220 (516) .+ . +..|.+.++.+++ +..+.+|..... .. .+.....+.....++. T Consensus 182 ~G-------~--~~~L~~l~p~~V~----~~~~~~g~~~~~----~~-----------------~y~~~~~~~~~~~~~~ 227 (547) T protein:vir:63 182 NQ-------S--MVRFVAKDPTTIF----FATTADGKIPDN----GN-----------------RFVQVIDQKIVATFNA 227 (547) T ss_pred CC-------c--EEEEEEecCceeE----EEECCccccccC----ce-----------------EEEEEcCCcEEEEecc Confidence 43 2 2234444554443 334444432110 00 0001111222345677 Q ss_pred ccEEEEeecCcCC---ccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHH Q lcl|NC_016071. 
221 NKLMVMSLGGTES---NPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGL 297 (516) Q Consensus 221 ~k~i~~~~~~~~g---~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l 297 (516) ..+|++++....+ .+||.|.+..+......-....++-..|....+.|--++..+ .....+.+ ..+++ T Consensus 228 ~eiih~r~n~~~~~~~~~~G~Spi~~~~~~i~~~~~a~~~~~~~f~Ng~~p~giL~~~------~~~~ls~e---~~~~l 298 (547) T protein:vir:63 228 REMAFAVRNPRSDIYATGYGYPELEIALKQFIAHENTEAFNDRFFSHGGTTRGILQIK------AAQQQSQH---ALEIF 298 (547) T ss_pred ccEEEecccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHcCCCcceEEEec------CCCCCCHH---HHHHH Confidence 7776666655443 578999999999888777777777777776433222122211 11112222 33445 Q ss_pred HHHHHHhhcc-cceEE--Eec-cCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhccccccc-------- Q lcl|NC_016071. 298 MADAANAHAG-EQAYF--ILP-SDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLG-------- 365 (516) Q Consensus 298 ~~~~~~~~~g-~~a~~--iiP-~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~-------- 365 (516) ++.......| ..++. +++ .|+ ++...+.+.....|.+..++..++|++++.-...-.+ T Consensus 299 k~~~~~~~~G~~nagk~~vl~~~g~----------~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPP~~lG~~~~~~~~ 368 (547) T protein:vir:63 299 KREWKNSLSGINGSWQIPVVSAEDV----------KFVNMTPSARDMEFEKWLNYLINVISALYGIDPAEINIPNNGGAT 368 (547) T ss_pred HHHHHHHhcCcccccccccccCCCc----------eEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCcccccccc Confidence 5544443334 34442 332 343 3334433444445777778888999999865432221 Q ss_pred CCccch--hhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHH Q lcl|NC_016071. 366 NDGQGS--YNLSESKQ-SIHGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIG 442 (516) Q Consensus 366 ~~~~GS--~Al~~vh~-ev~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~ 442 (516) +..++| ++-.+... ......+.-.++.|+..||+.|++.+ +. . 
-+|.|+.....+....++ +.+++ T Consensus 369 ~~~~~s~t~sn~e~~~~~~~~~tL~P~~~~ie~~ln~~L~~~~-------~~--~-~~~~f~~~~~~~~~~~~~-~~~~~ 437 (547) T protein:vir:63 369 GSKGGSLNEGNSAEKNQASKNKGLQPLLGFIEDFINKHIVAEF-------GD--K-YTFQFVGGDIKSELESVK-ILAEK 437 (547) T ss_pred cccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc-------CC--c-eEEEeeccccccHHHHHH-HHHHH Confidence 111222 23333232 34556688889999999998776531 11 1 257777777777666554 44577 Q ss_pred hCCcccccHHHHHHHHHHcCCCCC-CCcccccCcc----------------cccCCCCCCccc-cc-----ccccCCCCC Q lcl|NC_016071. 443 AVGYLPKTPTVINKILEVGGFDEE-IPEDMSTDEL----------------LKLLGQDTSRSG-DG-----MTAGSNGNG 499 (516) Q Consensus 443 ~~G~~~~~~~~~~~i~e~~Glp~~-~~~~~~~~~~----------------~~~~~~~~~~~~-~~-----~~~~~~~~~ 499 (516) ..|++.+ +.+|+.+|+|+. ..+|+..... +++......+.. .+ .....+... T Consensus 438 ~~g~lT~-----NE~R~~~gl~P~~egGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 512 (547) T protein:vir:63 438 AKVAMTV-----NEVRKELNLPGDVIGGDIPLNGVIVQRIGQLMQQEQFEHEKQQSNLQMLQEQTGNRVSTDVEDIPDGK 512 (547) T ss_pred hCCCcCH-----HHHHHHhCCCCCCCCCceeecccccccccccccccCCccccchhhccccccccCCCCCCCCCCCCCCc Confidence 7888754 689999999763 2333322100 000000000000 00 000000000 Q ss_pred cccccccccchhhhhcC Q lcl|NC_016071. 
500 TGKISSTRDNSVSNMDN 516 (516) Q Consensus 500 ~~~~~~~~d~~~~~~~~ 516 (516) +.......|.......| T Consensus 513 ~~~~~~~~d~~~~~~~~ 529 (547) T protein:vir:63 513 DTTGDIGKDGQRKDKDN 529 (547) T ss_pred ccCCCcCccccccCccc Confidence 11111112222222222 No 27 >protein:vir:6240 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813694;swissprot:trembl:q859c3;genbank:gi:29366754;interpro:IPR006427;interpro:IPR006944;uniprot:Q859C3;genbank:GeneID:1258894 Probab=99.69 E-value=3.2e-16 Score=105.56 Aligned_cols=422 Identities=9% Similarity=-0.019 Sum_probs=209.1 Q ss_pred CC------ccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhhChHHHHHHHHH Q lcl|NC_016071. 1 MS------TRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQDHTVSTALDTK 74 (516) Q Consensus 1 ~~------~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~R 74 (516) |. .|...++.....++ +..+.... + + ......+.+..+. -+..++-+.|.+|+..+ T Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~---~~~~~~~~------~-----~---~~~~~~~~g~~v~-~~~al~~~~v~~~i~~i 62 (457) T protein:vir:62 1 MGFWSALFGRGHSPALDAAEGR---AWEPYDPS------I-----Y---NLGATASSGERVT-PHDALQVSAVFASVRLL 62 (457) T ss_pred Cchhhhhhcccccccccccccc---ccccchhh------h-----h---hccccccCCceec-hHHhhccHHHHHHHHHH Confidence 32 22211111110000 01110000 0 0 0000001111111 24556678899999999 Q ss_pred HHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceee Q lcl|NC_016071. 75 YVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITI 153 (516) Q Consensus 75 k~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~ 153 (516) -..|.++++++.-..+..... .+ ...+..++..-+...++.++++.++ +.+.+|.+++++.+.. + .+. 
T Consensus 63 a~~iA~lp~~~~~~~~~~~~~-~~-~~~~~~ll~~pn~~~t~~~f~~~~~~~l~l~Gna~~~i~~~~--g--~~~----- 131 (457) T protein:vir:62 63 SETIATLPLSTYSKRGGTRKE-ID-TPEWLDFPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWAG--P--NIA----- 131 (457) T ss_pred HHhHhhCceEEEEecCCcccc-cc-chHHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEeCC--C--cEE----- Confidence 999999999876443322111 11 1112233334444567788888766 4788999999987652 1 111 Q ss_pred ccccccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCC---ccccccccEEEEeecC Q lcl|NC_016071. 154 DKIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSAD---EVFIPINKLMVMSLGG 230 (516) Q Consensus 154 ~~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~iP~~k~i~~~~~~ 230 (516) .|.+.++..++. .....++-.... +..+ .....+. -..+|.+.+|++++.. T Consensus 132 -~l~~l~p~~v~v---~~~~~~~~~~~~-------~~~y---------------~~~~~g~~~~~~~~~~~eiih~r~~~ 185 (457) T protein:vir:62 132 -GLDVLDPTKIHV---HMVMVDGLRRKV-------FEAY---------------DIDADGNEVLLGWFTPRDVLHIPGMM 185 (457) T ss_pred -EEEEEcCcceEE---EEeccCCcccee-------EEEE---------------EEccCCceeEEEeeCccceEEecCCC Confidence 222333322211 000111100000 0000 0000000 1234666666666656 Q ss_pred cCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhccc-c Q lcl|NC_016071. 231 TESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGE-Q 309 (516) Q Consensus 231 ~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~-~ 309 (516) ..+..+|.|.+..+....-.-....++...+....+.+=-+++.+ ..-+.+ ..+++++.......|. . 
T Consensus 186 ~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~--------~~ls~e---~~~~~~~~~~~~~~G~~n 254 (457) T protein:vir:62 186 LPGDFVGCSPISYARESIGLALAAQKYGAHFFRNGAMPGAVVEVP--------GTMSEE---GLARAREAWRAANSGVDN 254 (457) T ss_pred CCCceecccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEcC--------CCCCHH---HHHHHHHHHHHHhcCccc Confidence 666689999999998877777777777777776544443333332 222322 2445555555554453 2 Q ss_pred e--EEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHH---HHHHHHHHH Q lcl|NC_016071. 310 A--YFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLS---ESKQSIHGH 384 (516) Q Consensus 310 a--~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~---~vh~ev~~~ 384 (516) + .++++.|++.+- .+-+.....|.+..++...+|++++.-...-.+..+.+++..+ +........ T Consensus 255 ag~~~vl~~g~~~~~----------l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~eq~~~~f~~~ 324 (457) T protein:vir:62 255 AHRVALLTEGAKFSK----------VAMSPDEAQFLQTRQFQVPEIARIFGVPPHLISDATNSTSWGSGLAEQNIAFTMF 324 (457) T ss_pred cCcceecCCCceEEE----------ccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcccccchHHHHHHHHHHH Confidence 3 367788864322 1112233346677778888999988765433332222333222 222334455 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCC Q lcl|NC_016071. 385 FVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFD 464 (516) Q Consensus 385 ~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp 464 (516) -+.--++.|+..||+.|+... 
.....+-+|.++.....|++..+++++++++.|++.+ +.+|+.+|+| T Consensus 325 ~l~P~~~~ie~~ln~~L~~~~-------~~~~~~i~fd~~~l~~~d~~~r~~~~~~~~~~G~~T~-----NE~R~~~gl~ 392 (457) T protein:vir:62 325 SLRPWLERIEAGFNRLLFAET-------ADRFRFVKFNLDEIKRGAPKERMELWSLGLQNGIYSI-----DEVRAAEDMT 392 (457) T ss_pred HHHHHHHHHHHHHHhhhcCcc-------ccCceEEEeechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCC Confidence 567788888888888776542 1111122344445556799999999999999999876 6899999998 Q ss_pred CCCCc--ccccCc-------------cc----ccCCCCCCcccccccccCCCCCcccccccccch Q lcl|NC_016071. 465 EEIPE--DMSTDE-------------LL----KLLGQDTSRSGDGMTAGSNGNGTGKISSTRDNS 510 (516) Q Consensus 465 ~~~~~--~~~~~~-------------~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 510 (516) +-.++ |+.... .. ...++...++.+..+.+..+..+.+..+.+|.+ T Consensus 393 pi~~g~~D~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~ 457 (457) T protein:vir:62 393 PLPDGLGEKYRVPLNLGEIGEEPEPEPAPAPPAIDPPAEEPADDEEPDNAEGDPDEGETEDDDDA 457 (457) T ss_pred CCCCCCcceeeeccccccccccccccccCCCccCCCCccCCCCCCCCCCCCCCCccccccccccC Confidence 65433 221111 00 000101111111112233333444445555555 No 28 >protein:vir:102855 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338135;genbank:gi:77020228;genbank:GeneID:3703764 Probab=99.69 E-value=7.6e-16 Score=103.48 Aligned_cols=408 Identities=12% Similarity=0.041 Sum_probs=210.8 Q ss_pred ccccchHHHHHHHHHHHhhcccc-----cC------------CcccHHHH-HHHhhChHHHHHHHHHHHHHhcCCceeee Q lcl|NC_016071. 26 TGELGSGALSQLRAESEVMKVEE-----LR------------WPCFLATV-EAMKQDHTVSTALDTKYVFVTKAFNDFKV 87 (516) Q Consensus 26 ~~e~g~~~~~~~~~~~~~~~~~~-----lr------------~~~~~~~y-~~m~~D~~v~s~l~~Rk~~v~~~~w~i~~ 87 (516) |+ = .+.+..+...+.... +. .+..+.+- +..++.+.|.+|+..+-..|.++++.+.. 
T Consensus 1 M~---~--~~r~~~~~~~~~r~~~~~~~~~~~~~~~~~~~g~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~ 75 (432) T protein:vir:10 1 MK---I--VDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPSTISVKGKNALKVATVFACIKILSESVSKLPLKIYQ 75 (432) T ss_pred CC---h--HHHHHHhcCccccCcccccccCCchHHHHHHhCCCcCccccchhhhhccHHHHHHHHHHHHhhccCceEEEE Confidence 22 1 122222211111100 00 01111111 23456889999999999999999998753 Q ss_pred CCCCCChhhHHHHHHHHHHHh-hccCcCCHHHHHHHHHH-HHhhcceeeeEEEeecccccccccceeeccccccCchhcc Q lcl|NC_016071. 88 LYNRDSKASKDAAEFVEYALK-NLANQQTLRDIARSAAT-FNEYGFSIFEKVYRTESAPSKYAGYITIDKIAFRPQSSLS 165 (516) Q Consensus 88 ~~~~d~~~~~~~a~~v~~~l~-~~~~~~~~~~~l~~~ld-a~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l~~r~q~ti~ 165 (516) ..+.... +..-.-+...|+ +-+...++.++++.++. .+.+|-+++++++...+ . +..|.+.++.+++ T Consensus 76 ~~~~~~~--~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G-------~--~~~L~~i~~~~v~ 144 (432) T protein:vir:10 76 EDEYGIQ--RGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKG-------K--VQALWPIDASKVT 144 (432) T ss_pred ecCCcee--eccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCC-------c--EEEEEEEcCceeE Confidence 3221111 111111333343 23344567888887664 67899999999886533 2 2234444444332 Q ss_pred cccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCccccchhHHHHH Q lcl|NC_016071. 166 RSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPAGVSPLVGCY 245 (516) Q Consensus 166 ~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~G~gLlr~~~ 245 (516) ...++++...... .........+....+|...+|.+++....+..+|.|.+..+. 
T Consensus 145 ----v~~d~~~~~~~~~---------------------~~~y~~~~~g~~~~~~~~eiih~r~~~~~~~~~G~s~~~~~~ 199 (432) T protein:vir:10 145 ----VYIDDVGLLNSKT---------------------KMWYVVNTGGQQRVLKPEEILHFKNGITLDGLVGVPTMEYLK 199 (432) T ss_pred ----EEEcCcccccccc---------------------eEEEEEecCCeEEEEccccEEEecCCCCCCCcccccHHHHHH Confidence 1223322110000 000011122334557777777666655566688999999998 Q ss_pred HHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcc---cceEEEeccCccccc Q lcl|NC_016071. 246 RAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAG---EQAYFILPSDMNAQG 322 (516) Q Consensus 246 ~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g---~~a~~iiP~g~~i~~ 322 (516) ...-.-....++-..+...-+.+--+++.+ ..-+.+. .+++.+.......| ....+++|.|++++. T Consensus 200 ~~i~~~~~~~~~~~~~~~ng~~p~gil~~~--------~~l~~e~---~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~ 268 (432) T protein:vir:10 200 STLENSASADKFINNFYKQGLQVKGLVQYV--------GDLNEDA---KKVFRENFESMSSGLQNSHRIALMPVGYQFQP 268 (432) T ss_pred HHHHHHHHHHHHHHHHHhccCCccEEEEcC--------CCCCHHH---HHHHHHHHHHHhcccccCCcceecCCCceEEE Confidence 877666666666666666433333233321 1222222 23344444433333 223467788864322 Q ss_pred ccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016071. 323 GEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQ-SIHGHFVQRDIDIIVEAFNKNL 401 (516) Q Consensus 323 ~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~-ev~~~~~~aDa~~i~~~ln~~l 401 (516) ++ -+.....+.+..++..++|++++.-..-..+..+.|+++-.+-+. 
.....-++-.++.|++.||+.| T Consensus 269 --------l~--~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~~~~~~~~~~l~P~~~~ie~~ln~kL 338 (432) T protein:vir:10 269 --------IS--LNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQFYTDTLQATLTMYEQEMTYKL 338 (432) T ss_pred --------cc--CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 22 222334566677888899999988765444433446665544443 4445668889999999999887 Q ss_pred HHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccccCcccccCC Q lcl|NC_016071. 402 IPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMSTDELLKLLG 481 (516) Q Consensus 402 i~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~~~~~~~~~ 481 (516) +..--. ....+.+|.++.....|++..+++++++++.|++.+ +.+|+.+|+|+-+.+|+......- .+ T Consensus 339 l~~~~~------~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~-----NE~R~~~g~~pi~ggD~~~~~~n~-~~ 406 (432) T protein:vir:10 339 FLDSEL------DKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKP-----NEARSKEDLPPEAGGDRLLVNGNM-LP 406 (432) T ss_pred cChhhc------CCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCCeEeecccc-cc Confidence 754211 111223444555667789999999999999999876 579999999865444433211110 00 Q ss_pred CCCCcccccccccCCCCCcccccccccch Q lcl|NC_016071. 482 QDTSRSGDGMTAGSNGNGTGKISSTRDNS 510 (516) Q Consensus 482 ~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 510 (516) -+... ..-.+.+. .+++. 
...+.+++ T Consensus 407 ~~~~~-~~~~k~~~-~~~~~-~~~~~~~~ 432 (432) T protein:vir:10 407 IDMAG-QAYLKGGD-TNGEV-SKEGNEGN 432 (432) T ss_pred hhhcc-ccccCCCC-CCCCC-CCCCCCCC Confidence 00000 00001110 01110 01111111 No 29 >protein:vir:107605 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338186;genbank:gi:77020175;genbank:GeneID:3703736 Probab=99.69 E-value=7.6e-16 Score=103.48 Aligned_cols=408 Identities=12% Similarity=0.041 Sum_probs=210.8 Q ss_pred ccccchHHHHHHHHHHHhhcccc-----cC------------CcccHHHH-HHHhhChHHHHHHHHHHHHHhcCCceeee Q lcl|NC_016071. 26 TGELGSGALSQLRAESEVMKVEE-----LR------------WPCFLATV-EAMKQDHTVSTALDTKYVFVTKAFNDFKV 87 (516) Q Consensus 26 ~~e~g~~~~~~~~~~~~~~~~~~-----lr------------~~~~~~~y-~~m~~D~~v~s~l~~Rk~~v~~~~w~i~~ 87 (516) |+ = .+.+..+...+.... +. .+..+.+- +..++.+.|.+|+..+-..|.++++.+.. T Consensus 1 M~---~--~~r~~~~~~~~~r~~~~~~~~~~~~~~~~~~~g~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~ 75 (432) T protein:vir:10 1 MK---I--VDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPSTISVKGKNALKVATVFACIKILSESVSKLPLKIYQ 75 (432) T ss_pred CC---h--HHHHHHhcCccccCcccccccCCchHHHHHHhCCCcCccccchhhhhccHHHHHHHHHHHHhhccCceEEEE Confidence 22 1 122222211111100 00 01111111 23456889999999999999999998753 Q ss_pred CCCCCChhhHHHHHHHHHHHh-hccCcCCHHHHHHHHHH-HHhhcceeeeEEEeecccccccccceeeccccccCchhcc Q lcl|NC_016071. 88 LYNRDSKASKDAAEFVEYALK-NLANQQTLRDIARSAAT-FNEYGFSIFEKVYRTESAPSKYAGYITIDKIAFRPQSSLS 165 (516) Q Consensus 88 ~~~~d~~~~~~~a~~v~~~l~-~~~~~~~~~~~l~~~ld-a~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l~~r~q~ti~ 165 (516) ..+.... +..-.-+...|+ +-+...++.++++.++. .+.+|-+++++++...+ . 
+..|.+.++.+++ T Consensus 76 ~~~~~~~--~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G-------~--~~~L~~i~~~~v~ 144 (432) T protein:vir:10 76 EDEYGIQ--RGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKG-------K--VQALWPIDASKVT 144 (432) T ss_pred ecCCcee--eccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCC-------c--EEEEEEEcCceeE Confidence 3221111 111111333343 23344567888887664 67899999999886533 2 2234444444332 Q ss_pred cccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCccccchhHHHHH Q lcl|NC_016071. 166 RSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPAGVSPLVGCY 245 (516) Q Consensus 166 ~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~G~gLlr~~~ 245 (516) ...++++...... .........+....+|...+|.+++....+..+|.|.+..+. T Consensus 145 ----v~~d~~~~~~~~~---------------------~~~y~~~~~g~~~~~~~~eiih~r~~~~~~~~~G~s~~~~~~ 199 (432) T protein:vir:10 145 ----VYIDDVGLLNSKT---------------------KMWYVVNTGGQQRVLKPEEILHFKNGITLDGLVGVPTMEYLK 199 (432) T ss_pred ----EEEcCcccccccc---------------------eEEEEEecCCeEEEEccccEEEecCCCCCCCcccccHHHHHH Confidence 1223322110000 000011122334557777777666655566688999999998 Q ss_pred HHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcc---cceEEEeccCccccc Q lcl|NC_016071. 246 RAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAG---EQAYFILPSDMNAQG 322 (516) Q Consensus 246 ~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g---~~a~~iiP~g~~i~~ 322 (516) ...-.-....++-..+...-+.+--+++.+ ..-+.+. .+++.+.......| ....+++|.|++++. 
T Consensus 200 ~~i~~~~~~~~~~~~~~~ng~~p~gil~~~--------~~l~~e~---~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~ 268 (432) T protein:vir:10 200 STLENSASADKFINNFYKQGLQVKGLVQYV--------GDLNEDA---KKVFRENFESMSSGLQNSHRIALMPVGYQFQP 268 (432) T ss_pred HHHHHHHHHHHHHHHHHhccCCccEEEEcC--------CCCCHHH---HHHHHHHHHHHhcccccCCcceecCCCceEEE Confidence 877666666666666666433333233321 1222222 23344444433333 223467788864322 Q ss_pred ccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016071. 323 GEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQ-SIHGHFVQRDIDIIVEAFNKNL 401 (516) Q Consensus 323 ~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~-ev~~~~~~aDa~~i~~~ln~~l 401 (516) ++ -+.....+.+..++..++|++++.-..-..+..+.|+++-.+-+. .....-++-.++.|++.||+.| T Consensus 269 --------l~--~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~~~~~~~~~~l~P~~~~ie~~ln~kL 338 (432) T protein:vir:10 269 --------IS--LNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQFYTDTLQATLTMYEQEMTYKL 338 (432) T ss_pred --------cc--CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 22 222334566677888899999988765444433446665544443 4445668889999999999887 Q ss_pred HHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccccCcccccCC Q lcl|NC_016071. 402 IPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMSTDELLKLLG 481 (516) Q Consensus 402 i~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~~~~~~~~~ 481 (516) +..--. ....+.+|.++.....|++..+++++++++.|++.+ +.+|+.+|+|+-+.+|+......- .+ T Consensus 339 l~~~~~------~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~-----NE~R~~~g~~pi~ggD~~~~~~n~-~~ 406 (432) T protein:vir:10 339 FLDSEL------DKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKP-----NEARSKEDLPPEAGGDRLLVNGNM-LP 406 (432) T ss_pred cChhhc------CCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCCeEeecccc-cc Confidence 754211 111223444555667789999999999999999876 579999999865444433211110 00 Q ss_pred CCCCcccccccccCCCCCcccccccccch Q lcl|NC_016071. 
482 QDTSRSGDGMTAGSNGNGTGKISSTRDNS 510 (516) Q Consensus 482 ~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 510 (516) -+... ..-.+.+. .+++. ...+.+++ T Consensus 407 ~~~~~-~~~~k~~~-~~~~~-~~~~~~~~ 432 (432) T protein:vir:10 407 IDMAG-QAYLKGGD-TNGEV-SKEGNEGN 432 (432) T ss_pred hhhcc-ccccCCCC-CCCCC-CCCCCCCC Confidence 00000 00001110 01110 01111111 No 30 >protein:vir:105002 Length: 432 # NCBI annotation: putative phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459967;genbank:gi:85701382;genbank:GeneID:3882143 Probab=99.69 E-value=7.6e-16 Score=103.48 Aligned_cols=408 Identities=12% Similarity=0.041 Sum_probs=210.8 Q ss_pred ccccchHHHHHHHHHHHhhcccc-----cC------------CcccHHHH-HHHhhChHHHHHHHHHHHHHhcCCceeee Q lcl|NC_016071. 26 TGELGSGALSQLRAESEVMKVEE-----LR------------WPCFLATV-EAMKQDHTVSTALDTKYVFVTKAFNDFKV 87 (516) Q Consensus 26 ~~e~g~~~~~~~~~~~~~~~~~~-----lr------------~~~~~~~y-~~m~~D~~v~s~l~~Rk~~v~~~~w~i~~ 87 (516) |+ = .+.+..+...+.... +. .+..+.+- +..++.+.|.+|+..+-..|.++++.+.. T Consensus 1 M~---~--~~r~~~~~~~~~r~~~~~~~~~~~~~~~~~~~g~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~ 75 (432) T protein:vir:10 1 MK---I--VDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPSTISVKGKNALKVATVFACIKILSESVSKLPLKIYQ 75 (432) T ss_pred CC---h--HHHHHHhcCccccCcccccccCCchHHHHHHhCCCcCccccchhhhhccHHHHHHHHHHHHhhccCceEEEE Confidence 22 1 122222211111100 00 01111111 23456889999999999999999998753 Q ss_pred CCCCCChhhHHHHHHHHHHHh-hccCcCCHHHHHHHHHH-HHhhcceeeeEEEeecccccccccceeeccccccCchhcc Q lcl|NC_016071. 88 LYNRDSKASKDAAEFVEYALK-NLANQQTLRDIARSAAT-FNEYGFSIFEKVYRTESAPSKYAGYITIDKIAFRPQSSLS 165 (516) Q Consensus 88 ~~~~d~~~~~~~a~~v~~~l~-~~~~~~~~~~~l~~~ld-a~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l~~r~q~ti~ 165 (516) ..+.... +..-.-+...|+ +-+...++.++++.++. .+.+|-+++++++...+ . 
+..|.+.++.+++ T Consensus 76 ~~~~~~~--~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G-------~--~~~L~~i~~~~v~ 144 (432) T protein:vir:10 76 EDEYGIQ--RGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKG-------K--VQALWPIDASKVT 144 (432) T ss_pred ecCCcee--eccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCC-------c--EEEEEEEcCceeE Confidence 3221111 111111333343 23344567888887664 67899999999886533 2 2234444444332 Q ss_pred cccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCccccchhHHHHH Q lcl|NC_016071. 166 RSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPAGVSPLVGCY 245 (516) Q Consensus 166 ~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~G~gLlr~~~ 245 (516) ...++++...... .........+....+|...+|.+++....+..+|.|.+..+. T Consensus 145 ----v~~d~~~~~~~~~---------------------~~~y~~~~~g~~~~~~~~eiih~r~~~~~~~~~G~s~~~~~~ 199 (432) T protein:vir:10 145 ----VYIDDVGLLNSKT---------------------KMWYVVNTGGQQRVLKPEEILHFKNGITLDGLVGVPTMEYLK 199 (432) T ss_pred ----EEEcCcccccccc---------------------eEEEEEecCCeEEEEccccEEEecCCCCCCCcccccHHHHHH Confidence 1223322110000 000011122334557777777666655566688999999998 Q ss_pred HHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcc---cceEEEeccCccccc Q lcl|NC_016071. 246 RAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAG---EQAYFILPSDMNAQG 322 (516) Q Consensus 246 ~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g---~~a~~iiP~g~~i~~ 322 (516) ...-.-....++-..+...-+.+--+++.+ ..-+.+. .+++.+.......| ....+++|.|++++. 
T Consensus 200 ~~i~~~~~~~~~~~~~~~ng~~p~gil~~~--------~~l~~e~---~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~ 268 (432) T protein:vir:10 200 STLENSASADKFINNFYKQGLQVKGLVQYV--------GDLNEDA---KKVFRENFESMSSGLQNSHRIALMPVGYQFQP 268 (432) T ss_pred HHHHHHHHHHHHHHHHHhccCCccEEEEcC--------CCCCHHH---HHHHHHHHHHHhcccccCCcceecCCCceEEE Confidence 877666666666666666433333233321 1222222 23344444433333 223467788864322 Q ss_pred ccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016071. 323 GEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQ-SIHGHFVQRDIDIIVEAFNKNL 401 (516) Q Consensus 323 ~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~-ev~~~~~~aDa~~i~~~ln~~l 401 (516) ++ -+.....+.+..++..++|++++.-..-..+..+.|+++-.+-+. .....-++-.++.|++.||+.| T Consensus 269 --------l~--~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~~~~~~~~~~l~P~~~~ie~~ln~kL 338 (432) T protein:vir:10 269 --------IS--LNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQFYTDTLQATLTMYEQEMTYKL 338 (432) T ss_pred --------cc--CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 22 222334566677888899999988765444433446665544443 4445668889999999999887 Q ss_pred HHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccccCcccccCC Q lcl|NC_016071. 402 IPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMSTDELLKLLG 481 (516) Q Consensus 402 i~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~~~~~~~~~ 481 (516) +..--. ....+.+|.++.....|++..+++++++++.|++.+ +.+|+.+|+|+-+.+|+......- .+ T Consensus 339 l~~~~~------~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~-----NE~R~~~g~~pi~ggD~~~~~~n~-~~ 406 (432) T protein:vir:10 339 FLDSEL------DKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKP-----NEARSKEDLPPEAGGDRLLVNGNM-LP 406 (432) T ss_pred cChhhc------CCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCCeEeecccc-cc Confidence 754211 111223444555667789999999999999999876 579999999865444433211110 00 Q ss_pred CCCCcccccccccCCCCCcccccccccch Q lcl|NC_016071. 
482 QDTSRSGDGMTAGSNGNGTGKISSTRDNS 510 (516) Q Consensus 482 ~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 510 (516) -+... ..-.+.+. .+++. ...+.+++ T Consensus 407 ~~~~~-~~~~k~~~-~~~~~-~~~~~~~~ 432 (432) T protein:vir:10 407 IDMAG-QAYLKGGD-TNGEV-SKEGNEGN 432 (432) T ss_pred hhhcc-ccccCCCC-CCCCC-CCCCCCCC Confidence 00000 00001110 01110 01111111 No 31 >protein:vir:189 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037699;genbank:gi:9634156;genbank:GeneID:1262529 Probab=99.69 E-value=2.3e-16 Score=106.28 Aligned_cols=415 Identities=11% Similarity=0.025 Sum_probs=207.1 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHH-HHHHHHhhcccccCCcccHHHHHHHhhChHHHHHHHHHHHHHh Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQ-LRAESEVMKVEELRWPCFLATVEAMKQDHTVSTALDTKYVFVT 79 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~-~~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~~v~ 79 (516) |.+...+--..++.+--.+-..-++...+-.+.... ...+. ....-.+..+ .=+..++-+.|.+|+..+-..|. T Consensus 1 ~~~~~~~~~~~~~~g~~~~~~~~f~~~~~~~~~~~~~~~~~~----~~~~~~~~~v-~~~~al~~~~v~~cv~~Ia~~iA 75 (424) T protein:vir:18 1 MEEPKYTIDLRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVS----AHGYLGDSSI-NDERILQISTVWRCVSLISTLTA 75 (424) T ss_pred CCCCccccccCCCCchHHHHHhhccccccccccchhhccccc----cccccccccc-cHHHhhccHHHHHHHHHHHHhhc Confidence 222211111111111100000000000000000000 00000 0000001111 11345567889999999999999 Q ss_pred cCCceeeeCCCCCChhhHHHHHHHHHHHhh-ccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceeecccc Q lcl|NC_016071. 80 KAFNDFKVLYNRDSKASKDAAEFVEYALKN-LANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITIDKIA 157 (516) Q Consensus 80 ~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~-~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l~ 157 (516) ++++.+.-....+.....+.-.-+...|.. -+...+..+++..++ +.+.+|-+++++++...+ . +..|. 
T Consensus 76 ~lp~~vy~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G-------~--~~~L~ 146 (424) T protein:vir:18 76 CLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAG-------D--VISLL 146 (424) T ss_pred cCceEEEEeccCCceeeeccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCC-------c--EEEEE Confidence 999987422111111000011112333432 233456667777665 678899999999875432 2 22333 Q ss_pred ccCchhcccccceeecCC-CceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCccc Q lcl|NC_016071. 158 FRPQSSLSRSKPWVFDED-GRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPA 236 (516) Q Consensus 158 ~r~q~ti~~~~~f~~~~d-g~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~ 236 (516) +.++.++. ...+ +++.... ...+....+|.+.+|++++.. .+..+ T Consensus 147 ~l~~~~v~------v~~~~~~~~y~~---------------------------~~~g~~~~~~~~eVihir~~~-~dg~~ 192 (424) T protein:vir:18 147 PLQSANMD------VKLVGKKVVYRY---------------------------QRDSEYADFSQKEIFHLKGFG-FTGLV 192 (424) T ss_pred EecCcceE------EEEcCCeEEEEE---------------------------EeCCeEEEeccccEEEecCcC-CCCcc Confidence 44443322 1222 2221110 112233457777766666544 45589 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceE--EEe Q lcl|NC_016071. 237 GVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAY--FIL 314 (516) Q Consensus 237 G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~--~ii 314 (516) |.|.+..+....-.-....++...+...-+.+--+++.|.. --+++. 
.+++++..+....|..++ +++ T Consensus 193 G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~-------~l~~e~---~~~~~~~~~~~~~~~nag~~~vl 262 (424) T protein:vir:18 193 GLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEK-------VLTEQQ---RSQVEENFKEIAGGPVKKRLWIL 262 (424) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCc-------CCCHHH---HHHHHHHHHHHhCCcccCCceec Confidence 99999998877666666666666776654444333333211 112222 334455555555555554 677 Q ss_pred ccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchh--h-HHHHHHHHHHHHHHHHHH Q lcl|NC_016071. 315 PSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSY--N-LSESKQSIHGHFVQRDID 391 (516) Q Consensus 315 P~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~--A-l~~vh~ev~~~~~~aDa~ 391 (516) +.|++++- .+-+....+|.+..++...+|++++.-..--.+..+++++ + ..+........-+.-.++ T Consensus 263 ~~g~~~~~----------l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~eq~~~~f~~~tl~P~~~ 332 (424) T protein:vir:18 263 EAGFSTSA----------IGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYIS 332 (424) T ss_pred cCCceEEe----------cCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCCcccccccHHHHHHHHHHHHHHHHHH Confidence 88864322 2222334457777788889999998876544443333433 2 222333455667788888 Q ss_pred HHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCccc Q lcl|NC_016071. 
392 IIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDM 471 (516) Q Consensus 392 ~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~ 471 (516) .|++.||+.|++.- .....+-+|.++..-..|.++.++++.++++.|++.+ +.+|+.+|+|+-..+|+ T Consensus 333 ~ie~~ln~~L~~~~-------~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~-----NE~R~~~gl~pi~ggD~ 400 (424) T protein:vir:18 333 RWENSIQRWLIPSK-------DVGRLHAEHNLDGLLRGDSASRAAFMKAMGESGLRTI-----NEMRRTDNMPPLPGGDV 400 (424) T ss_pred HHHHHHHhhcCCcc-------ccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCcCe Confidence 99999998776531 1112233444455566788999999999999999886 57999999996544444 Q ss_pred ccCcccccCCCCCCcccccccccCCCCCcccccccccchh Q lcl|NC_016071. 472 STDELLKLLGQDTSRSGDGMTAGSNGNGTGKISSTRDNSV 511 (516) Q Consensus 472 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~ 511 (516) ......-.+... .+. ....+++.| T Consensus 401 ~~~~~n~~~l~~---~~~-------------~~~~~~n~a 424 (424) T protein:vir:18 401 AMRQAQYVPITD---LGT-------------NKEPRNNGA 424 (424) T ss_pred eeeccCccchhh---hhc-------------cCCccccCC Confidence 321111000000 000 011112222 No 32 >protein:vir:96579 Length: 576 # NCBI annotation: ORF012 # Family: family:all:2446 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238542;genbank:gi:66391267;genbank:GeneID:5130361 Probab=99.69 E-value=5.1e-15 Score=98.93 Aligned_cols=439 Identities=12% Similarity=0.092 Sum_probs=205.9 Q ss_pred CCccccCc----ccccchhhh--cccCCCCcccccchHHHHHHHHHHHhhcccccCCcccH-HHHHHHhhChHHHHHHHH Q lcl|NC_016071. 1 MSTRFAQP----SEVVKAGNE--NLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFL-ATVEAMKQDHTVSTALDT 73 (516) Q Consensus 1 ~~~r~~~~----~~~~~~~~~--~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~-~~y~~m~~D~~v~s~l~~ 73 (516) +..|+.+- ....+..+. .+..-|+ +|. +..-+++ ...+..+...+.+ .+...+..-+.|.+|+.. 
T Consensus 31 ~~~~~~~~~~~~~~~~~~~~~~~~a~~~p~----~~~--~~~~~~~--~~~p~~~~~~~~~~~~l~~~~~npiv~~~I~~ 102 (576) T protein:vir:96 31 LQANIRNIEEKSKELNKSLYGKQQAYAEPF----LEV--MDTNPEF--RTKRSYMKNSDNLHDVLKQFGNNPILNAIILT 102 (576) T ss_pred hhHHHHHhhhhhhhhccccCCccchhhcce----eee--eecCCCc--cccCcchhhhhhhHHHHHHhhcCHHHHHHHHH Confidence 11111111 111111100 0001110 000 0000001 1111111111111 112222345789999999 Q ss_pred HHHHHhc-----------CCceeeeCCCCCChhhHHHHH--HHHHHHhhc-----cCcCCHHHHHHHHH-HHHhhcceee Q lcl|NC_016071. 74 KYVFVTK-----------AFNDFKVLYNRDSKASKDAAE--FVEYALKNL-----ANQQTLRDIARSAA-TFNEYGFSIF 134 (516) Q Consensus 74 Rk~~v~~-----------~~w~i~~~~~~d~~~~~~~a~--~v~~~l~~~-----~~~~~~~~~l~~~l-da~~~G~S~~ 134 (516) |...|.. ..|.|......-...+.+.++ .++..|..+ +.+.+|.+++..++ +.+.+|.+.+ T Consensus 103 ia~~vA~~~~~~~~~~~~~~~~i~lk~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~dlll~Gna~~ 182 (576) T protein:vir:96 103 RSNQVAMYCQPSRYNERGLGFEVRMRDLDAEPGKKEKEEIKRIENFILNTGRDKDIDRDSFQSFCRKIVRDTYTYDQVNF 182 (576) T ss_pred HHHHHHhhhhhhhhccccccceeEEecCcCccchhhhHhhhhHHhhHhhccCCCCCccccHHHHHHHHHHHHHhcCCeEE Confidence 9988875 456665443322222222222 233333322 12246778887766 5788999999 Q ss_pred eEEEeecccccccccceeeccccccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCC Q lcl|NC_016071. 135 EKVYRTESAPSKYAGYITIDKIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSAD 214 (516) Q Consensus 135 Eivw~~~~~~~~~~g~~~~~~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 214 (516) |++|.+.+. | .+..|.+.++.+++ +..+.+|..+..... +........ T Consensus 183 ~i~~~rd~~-----g--~~~~L~pl~p~~V~----v~~~~dg~~~~~~~~---------------------~~~~~~~~~ 230 (576) T protein:vir:96 183 EKVFNKKNA-----T--TMDKFIAVDPSTIF----YATDKNGKIIKGGKR---------------------FVQVINKKV 230 (576) T ss_pred EEEEecCCC-----C--ceEEEEEeCCceeE----EEECCCCceeeeeeE---------------------EEEecCCce Confidence 999976532 1 12334444554443 344555543321110 000111223 Q ss_pred ccccccccEEEEeecCcCC---ccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHH Q lcl|NC_016071. 
215 EVFIPINKLMVMSLGGTES---NPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPES 291 (516) Q Consensus 215 ~~~iP~~k~i~~~~~~~~g---~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~ 291 (516) ...+|....|+|++....+ .+||.|.+..+....-.-....++-..|....+.+--++..+. ....+++ T Consensus 231 ~~~~~~~dii~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~Ng~~p~giL~~~~------~~~ls~e-- 302 (576) T protein:vir:96 231 VASFTSREMAMGIRNPRTELSSSGYGLSEVEIAMKQFIAYNNTETFNDRFFSHGGTTRGILQIKS------EQQQSQR-- 302 (576) T ss_pred EEEecccceEEEeecCCCCcccCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCC------CCCCCHH-- Confidence 3456777888888776554 6789999999988887777777777777764333322222221 0111222 Q ss_pred HHHHHHHHHHHHhhccc-ceE---EEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccC- Q lcl|NC_016071. 292 EMVQGLMADAANAHAGE-QAY---FILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGN- 366 (516) Q Consensus 292 ~~l~~l~~~~~~~~~g~-~a~---~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~- 366 (516) ..+++++.......|. .++ ++++.|++ +...+-+.....|.+..++..++|++++.-...-.+. T Consensus 303 -~~~~lr~~~~~~~~G~~nag~~p~vl~~G~~----------~~~ls~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~ 371 (576) T protein:vir:96 303 -ALENFKREWKSSFSGINGSWQVPVVMADDIK----------FVNMTPTANDMQFEKWLTYLINIISALYGIDPAEIGFP 371 (576) T ss_pred -HHHHHHHHHHHHhccccccccceeecCCCce----------EEeccCChhhHHHHHHHHHhHHHHHHHhCCCHHHcccc Confidence 2344555555444443 332 56687764 2233333344567788888999999998654322211 Q ss_pred --------Cccc--hhhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHH Q lcl|NC_016071. 367 --------DGQG--SYNLSE-SKQSIHGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFS 435 (516) Q Consensus 367 --------~~~G--S~Al~~-vh~ev~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a 435 (516) .++| +|+-.+ .........++-.++.|+..||+.|++.+ +. . -.|.|... 
|++..+ T Consensus 372 ~~~~~~g~~~~~s~t~sn~e~~~~~f~~~tL~P~~~~ie~~ln~~Ll~~~-------~~--~-~~~~f~r~---d~~~~~ 438 (576) T protein:vir:96 372 NRGGATGGKGGNTLNEADPGKKQQQSQNKGLQPLLRFIEDLINTHIISEY-------SD--K-YVFQFVGG---DTKSEL 438 (576) T ss_pred ccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhc-------cC--c-eEEEeccC---CHHHHH Confidence 1112 333333 23344555688888999999998887532 11 1 14556544 444455 Q ss_pred HHHH--HHHhCCcccccHHHHHHHHHHcCCCCCCCcccccCccc-----ccCC---CCCCccc-------ccccccCCCC Q lcl|NC_016071. 436 KFVQ--RIGAVGYLPKTPTVINKILEVGGFDEEIPEDMSTDELL-----KLLG---QDTSRSG-------DGMTAGSNGN 498 (516) Q Consensus 436 ~~~~--~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~~~~~-----~~~~---~~~~~~~-------~~~~~~~~~~ 498 (516) +.+. +++..|++.+ +.+|+.+|+|+-..+|....... .+.. ....... +..+...+.. T Consensus 439 e~~~~~~~~~~G~lT~-----NE~R~~~gl~piegGD~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~ 513 (576) T protein:vir:96 439 DKIKILQEEVKTYKTV-----NEARKEKGLKPIEGGDVLLDGSFIQSMSLNTQKEQYEDTKQKERFDMIQQFLNSPDDEE 513 (576) T ss_pred HHHHHHHHHhcCccCH-----HHHHHHhCCCCCCCcceeccccccccccccccCCCCCCccccccccccccccCCCCCCC Confidence 5443 3455688765 67999999986544454321110 0000 0000000 0001111111 Q ss_pred CcccccccccchhhhhcC Q lcl|NC_016071. 
499 GTGKISSTRDNSVSNMDN 516 (516) Q Consensus 499 ~~~~~~~~~d~~~~~~~~ 516 (516) +..+++...+++..-.+| T Consensus 514 ~~~~s~~~~~~g~~~~~~ 531 (576) T protein:vir:96 514 PQQESTEDKVDGRESNDP 531 (576) T ss_pred CCCCCCCCcccccccccC Confidence 111122222222222222 No 33 >protein:vir:5737 Length: 419 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892048;genbank:gi:33770511;goa:Q7Y412;interpro:IPR006427;interpro:IPR006944;uniprot:Q7Y412;genbank:GeneID:1732929;interpro:IPR010994 Probab=99.69 E-value=1.6e-15 Score=101.70 Aligned_cols=410 Identities=10% Similarity=-0.040 Sum_probs=207.0 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhhChHHHHHHHHHHHHHhc Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQDHTVSTALDTKYVFVTK 80 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~~v~~ 80 (516) .....+... .+.......++. .........+..+ ..+..++-+.|.+|+..+-..|.+ T Consensus 3 ~~~~~~~~~--~~~~~~~~~~~~-------------------~~~~~~~~~g~~v-~~~~al~~~~v~~~i~~ia~~ia~ 60 (419) T protein:vir:57 3 IPQFWKGRP--SENRVNWQVVPG-------------------GMRSSSSQAGVII-TPETALALSAVRACVTLLAESVAQ 60 (419) T ss_pred chhhhccCC--cccccccccccc-------------------ccccccccCCcee-chHHhhccHHHHHHHHHHHHhhcc Confidence 122211110 000000000000 0000000111112 123445678899999999999999 Q ss_pred CCceeeeCCCCCC-hhhHHHHHHHHHHHh-hccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceeecccc Q lcl|NC_016071. 81 AFNDFKVLYNRDS-KASKDAAEFVEYALK-NLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITIDKIA 157 (516) Q Consensus 81 ~~w~i~~~~~~d~-~~~~~~a~~v~~~l~-~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l~ 157 (516) ++|.+.-...... +... -.-+...|. +-+...++.++++.+. +.+.+|-+++++++...| . +..|. 
T Consensus 61 lp~~~~~~~~~g~~~~~~--~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G-------~--~~~L~ 129 (419) T protein:vir:57 61 LPCVLYRRTENGGREIAF--DHPLHDLIRYQPNRKDTAFEYHEQTQGVLGLEGNSYSLIDRNGRG-------D--ITELI 129 (419) T ss_pred CceEEEEEcCCCceeccc--cchHHHHHhhccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCC-------c--EEEEE Confidence 9998732211111 1111 111333443 2334456778887766 577899999999876432 2 22344 Q ss_pred ccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCcccc Q lcl|NC_016071. 158 FRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPAG 237 (516) Q Consensus 158 ~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~G 237 (516) +.++.++. ...+.+|.....+ ...+..+|.+-++..++.. .+.++| T Consensus 130 pl~~~~v~----v~~~~~g~~~y~~-----------------------------~~~~~~~~~~~vih~r~~~-~d~~~G 175 (419) T protein:vir:57 130 PINPHKVI----VLKGPDGMPYYDI-----------------------------PSIGEILPMRMVHHIKSFS-LDGYIG 175 (419) T ss_pred EEcCcceE----EEECCCceEEEEE-----------------------------cCCceEEchhhEEEecCcC-CCCccc Confidence 44443322 1223333221100 1123346666555555443 455899 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhccc-c--eEEEe Q lcl|NC_016071. 238 VSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGE-Q--AYFIL 314 (516) Q Consensus 238 ~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~-~--a~~ii 314 (516) .|.+..++...-.-....++...+...-+.+=-+++.|.. .....+ ++.++++++.......|. . 
..+++ T Consensus 176 ~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~----~~~~~~---~e~~~~~~~~~~~~~~g~~nag~~~vl 248 (419) T protein:vir:57 176 TSPIQTNPDVLGLGIAVEQHAAQVFARGTTMSGVIERPFE----AKAIAS---QAAVDAILAKWTERYGGVRNAFSVGML 248 (419) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHccCCccEEEEecCc----CCcccC---HHHHHHHHHHHHHHhccccccccceec Confidence 9999999887666666666666666654444333333211 111112 233444554444443442 2 34567 Q ss_pred ccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHHHHH-HHHHHHHHHHH Q lcl|NC_016071. 315 PSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQSIH-GHFVQRDIDII 393 (516) Q Consensus 315 P~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~ev~-~~~~~aDa~~i 393 (516) +.|++++ ..+-+....+|.+..++..++|++++--..--.+..+.++++-.+-+...+ ...++-.++.| T Consensus 249 ~~g~~~~----------~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~f~~~~l~P~~~~i 318 (419) T protein:vir:57 249 QEGMTYK----------QLSQDNEKAQLLQSRQYTVNEVCRLYKVPPHMIQDLQKSTNNNIEHQGLQYVIYTMLAILKRH 318 (419) T ss_pred CCCceEE----------EcCCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCccccHHHHHHHHHHHHHHHHHHHH Confidence 8776432 222223344577777888899999988765444433445565555444333 55577788888 Q ss_pred HHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCccccc Q lcl|NC_016071. 394 VEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMST 473 (516) Q Consensus 394 ~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~ 473 (516) ++.||+.|+.+- .....+.+|.++.....|++..+++++++++.|++.+ +.+|+.+|+|+-..+|+.. 
T Consensus 319 e~~l~~~ll~~~-------~~~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~-----NE~R~~~gl~p~~ggD~~~ 386 (419) T protein:vir:57 319 ESAMMRDLLLPS-------ERRDFYIEFNVSSLLRGDQKSRYESYALGRQWGWLSV-----NDIRRMENLTPIPGGDKYL 386 (419) T ss_pred HHHHHhhccCcc-------ccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCcCeee Confidence 888888766431 0111223344445556788999999999999999886 5799999998655555543 Q ss_pred CcccccCCCCCCcccccccccCCCCCcccccccccchhhh Q lcl|NC_016071. 474 DELLKLLGQDTSRSGDGMTAGSNGNGTGKISSTRDNSVSN 513 (516) Q Consensus 474 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~ 513 (516) ..... .+..... ++.++......+. -..++.-| T Consensus 387 ~~~n~--~~~~~~~-~~~~~~~~~~~~~----~~~~~~~~ 419 (419) T protein:vir:57 387 TPLNM--VDSKALT-GIGKATPQQLKDI----EAILCTRN 419 (419) T ss_pred ecccc--ccccccc-cccCCCcccCcch----hhhhhccC Confidence 22111 1111111 1001000011111 11122222 No 34 >protein:vir:8418 Length: 409 # NCBI annotation: gp13 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818314;genbank:gi:29566750;genbank:GeneID:1260067 Probab=99.69 E-value=1.5e-15 Score=101.86 Aligned_cols=401 Identities=10% Similarity=-0.053 Sum_probs=202.6 Q ss_pred CC--ccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhhChHHHHHHHHHHHHH Q lcl|NC_016071. 1 MS--TRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQDHTVSTALDTKYVFV 78 (516) Q Consensus 1 ~~--~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~~v 78 (516) |+ +|+-......+.......++ ..+ ..+ .. ++.. 
-..+..++-+.|.+|+..+-..| T Consensus 1 Mgl~~~~f~~~~~~~~~~~~~~~~-----~~~-------~~~-------~~-~g~~-v~~~~al~~~~v~~~v~~ia~~i 59 (409) T protein:vir:84 1 MSLFTRIFSGPSEERTLTKISGIP-----SPA-------EDW-------AM-HGDR-PGANSAMTLGAFYACVTLLADTV 59 (409) T ss_pred CchhhhhhcCCCcccccccccccc-----ccc-------chh-------hc-cCcc-cchhhhhccHHHHHHHHHHHHhh Confidence 22 11111100000000000000 000 000 00 1111 12345566788999999999999 Q ss_pred hcCCceeeeCCCCCChhhHHHHHHHHHHHh-hccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceeeccc Q lcl|NC_016071. 79 TKAFNDFKVLYNRDSKASKDAAEFVEYALK-NLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITIDKI 156 (516) Q Consensus 79 ~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~-~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l 156 (516) .+++|.+.-..+........+ ...|. +-+...++.++++.++ +.+.+|-++.++.++..++ ++.++ T Consensus 60 A~lp~~~~~~~~~~~~~~~~l----~~lL~~~PN~~~t~~~f~~~l~~~l~l~Gn~~~~i~~~~~~g--~~~~L------ 127 (409) T protein:vir:84 60 ASLSIDAYRKKDNVRIPVSPA----PKLLESTPYPGLTWFDWLWMLMESLAVTGNAFGYISARDEAN--RPTAI------ 127 (409) T ss_pred hhCceEEEEecCCcccccchH----HHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEECCCC--ceEEE------ Confidence 999998754322211111122 23343 3344567888988877 6778899998888764432 22222 Q ss_pred cccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCccc Q lcl|NC_016071. 157 AFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPA 236 (516) Q Consensus 157 ~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~ 236 (516) .+.++..++ .....|+..... 
.......+..+|.+.+|++++....+.++ T Consensus 128 ~~l~p~~v~----v~~~~~~~~~~~--------------------------~~~~~~~g~~~~~~dvih~~~~~~~~~~~ 177 (409) T protein:vir:84 128 MPIHPDCIH----VTDAKDEDGDWI--------------------------EPVYRIDGKVVPNHRIMHIKRYPVAGCAL 177 (409) T ss_pred EEEcCceeE----EEEcCCCcceEE--------------------------EEEecCCceEEchhhEEEecCCCCCcccc Confidence 233332221 111122211110 00011234457777777777777777789 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEecc Q lcl|NC_016071. 237 GVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILPS 316 (516) Q Consensus 237 G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~ 316 (516) |.|.+..+....-.-....++...+...-+.+--+++.+ ..-+.+..+ .+++.......+....+++|. T Consensus 178 G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~--------~~l~~e~~~---~~~~~~~~~~~n~g~~~vl~~ 246 (409) T protein:vir:84 178 GMSPIEKAASAIGLGLAAERYGLRWFRDSANPSGILSSD--------ADLTPDQVK---QTQKQWIQSHHNRRLPAVMSA 246 (409) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecC--------CCCCHHHHH---HHHHHHHHHhccCCCeeecCC Confidence 999999988877776667777777776545444344332 122223322 233322222234444677888 Q ss_pred CcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchh--hHH-HHHHHHHHHHHHHHHHHH Q lcl|NC_016071. 317 DMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSY--NLS-ESKQSIHGHFVQRDIDII 393 (516) Q Consensus 317 g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~--Al~-~vh~ev~~~~~~aDa~~i 393 (516) |++++- .+-+....+|.+..++.-++|++++--..--.+..++++. +-. 
+........-+.--++.| T Consensus 247 g~~~~~----------~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~e~~~~~f~~~~l~P~~~~i 316 (409) T protein:vir:84 247 GIKWQS----------VSITPNESQFLETRSFQRSEIAMWFRIPPHMIGDVEKSTSWGTGIEEQGINFVRHTLLPWLRCI 316 (409) T ss_pred CceEEE----------ccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccchHHHHHHHHHHHHHHHHHHHH Confidence 875332 2222233456666677778999877665433332222322 211 222233455567778888 Q ss_pred HHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCccccc Q lcl|NC_016071. 394 VEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMST 473 (516) Q Consensus 394 ~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~ 473 (516) +..||+.|.+ ..+-+|.++.....|++..++++.++++.|++.+ +.+|+.+|+|+-..+|+.. T Consensus 317 e~~l~~~L~~------------g~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~-----NE~R~~~g~~p~~ggD~~~ 379 (409) T protein:vir:84 317 EQALDTFLPR------------GQFVKFNVDGLMRGDVTARFTAYQMGLQNGIWSV-----NEVRAWEDAPPIPEGDIHL 379 (409) T ss_pred HHHHHHhccC------------CCeEEEechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCcceee Confidence 8888875411 1223455556666899999999999999999876 5799999999654445432 Q ss_pred CcccccCCCCCCcccccccccCCCCCcccccccccchhhhhcC Q lcl|NC_016071. 474 DELLKLLGQDTSRSGDGMTAGSNGNGTGKISSTRDNSVSNMDN 516 (516) Q Consensus 474 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~ 516 (516) ....-...+. .+..++..... 
...+.-.| T Consensus 380 ~~~n~~~~~~-~~~~~~~~~~~-------------~~~~~~gn 408 (409) T protein:vir:84 380 QPMNFVPLGY-VPPEEPAQEPQ-------------PNSATEGN 408 (409) T ss_pred eccccccccc-CCccccCcCCC-------------CCCccCCC Confidence 2111111111 11111000000 00000011 No 35 >protein:vir:4337 Length: 434 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061500;genbank:gi:9635589;genbank:GeneID:1262858 Probab=99.68 E-value=9.9e-16 Score=102.85 Aligned_cols=421 Identities=12% Similarity=0.009 Sum_probs=214.5 Q ss_pred CCccccCcccccchhhhcc-c-CCCCcccccch-HHHHHHHHHHHhhcccccCCcccHHHHHHHhhChHHHHHHHHHHHH Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENL-A-VSRLRTGELGS-GALSQLRAESEVMKVEELRWPCFLATVEAMKQDHTVSTALDTKYVF 77 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p-~-~~~~~~~e~g~-~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~~ 77 (516) |+....+..- +....| + +.+.. +...+ ..-..|..+ . ......+..+ ..+..++-+.|.+|+..+-.. T Consensus 1 ~~~~l~~~~~---~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~---~-g~~~~~g~~v-~~~~al~~~~V~~~i~~ia~~ 71 (434) T protein:vir:43 1 MSKSLGKVLS---SATSAPRSSLFGWG-GKTIRLTDGAFWSQF---L-GRESSSGKKV-TVDKAMKLSAVWACVRLISTS 71 (434) T ss_pred Cccchhhhhh---hcccccchhhhccc-ccccccCchHHHHHH---h-cCCccCCcee-chhhhhccHHHHHHHHHHHHh Confidence 8887754222 111111 1 11100 00000 000011111 1 1111111122 235667788999999999999 Q ss_pred HhcCCceeeeCCCCCChhhHHHHHHHHHHHhh-ccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceeecc Q lcl|NC_016071. 78 VTKAFNDFKVLYNRDSKASKDAAEFVEYALKN-LANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITIDK 155 (516) Q Consensus 78 v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~-~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~~~ 155 (516) |.+++|.+.-..+ +....+....-+...|.. -+...+..++++.++ +.+.+|-++.++.+. +| ++. . 
T Consensus 72 ia~lp~~~~~~~~-~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~~~--~G--~~~------~ 140 (434) T protein:vir:43 72 VAGLPLGVYERKA-DGSRVDARSFPLYDVVHNSPNDDMTAFQFWQAMVASMLLWGNAYAEIRRA--AG--RPA------A 140 (434) T ss_pred hhhCceEEEEEcC-CCccccccccHHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeC--CC--cEE------E Confidence 9999998732211 111111111123334432 333456777887766 568899998887653 21 222 3 Q ss_pred ccccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCcc Q lcl|NC_016071. 156 IAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNP 235 (516) Q Consensus 156 l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p 235 (516) |.+.++..+. +..+.+|+...... ...+....+|.+.+|++++.+ .+.. T Consensus 141 L~~l~p~~v~----~~~~~~g~~~y~~~--------------------------~~~g~~~~~~~~eVih~~~~~-~dg~ 189 (434) T protein:vir:43 141 LDFLLPSRVD----LECDENGRLKYFYT--------------------------TKKGARREIERTNMLHIPAFT-LDGR 189 (434) T ss_pred EEEEcCcceE----EEEcCCCeEEEEEE--------------------------ecCceEEEEccccEEEecCcC-CCCc Confidence 3344443332 23455554332111 112234567877777666654 4457 Q ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceE--EE Q lcl|NC_016071. 236 AGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAY--FI 313 (516) Q Consensus 236 ~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~--~i 313 (516) +|.|.+..+....-.-....++-..+...-+.+--+++.+ ..-+.+.. 
+++++..+.+..+..++ ++ T Consensus 190 ~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~--------~~l~~e~~---~~~r~~~~~~~g~~nag~~~v 258 (434) T protein:vir:43 190 IGLSAIRYGVDVFGSVMSAEDAANGTFKNGLLPTVAFKVD--------RILQPAQR---EEFREYVKSVSGAMNSGRSPV 258 (434) T ss_pred cccCHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEecC--------CCCCHHHH---HHHHHHHHHhcCccccCCccc Confidence 8999999998877776667677777765433343333332 22233333 33455555544444443 56 Q ss_pred eccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccch--hhH-HHHHHHHHHHHHHHHH Q lcl|NC_016071. 314 LPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGS--YNL-SESKQSIHGHFVQRDI 390 (516) Q Consensus 314 iP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS--~Al-~~vh~ev~~~~~~aDa 390 (516) +|.|++++ ..+.+....+|.+..++..++|++++.-..--.+...++| ++- .+.....-..-+.-.+ T Consensus 259 l~~g~~~~----------~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~s~~e~~~~~f~~~~L~P~~ 328 (434) T protein:vir:43 259 LEQGITPE----------TIGINPVDAQLLETREHGVIEICRWFGVPPWMIGQTDKGSNWGTGLEQQMLAFLTFSISSIT 328 (434) T ss_pred cCCCceEE----------EccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCCccccchHHHHHHHHHHHHHHHHH Confidence 78776422 2222334445777888889999999887654443333333 222 2223334455677888 Q ss_pred HHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcc Q lcl|NC_016071. 
391 DIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPED 470 (516) Q Consensus 391 ~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~ 470 (516) ++|+..||+.|+..--..+ .+.+|.++..-..|.+..++++.+++..|++.+ +.+|+.+|+|+-..+| T Consensus 329 ~~ie~~ln~kL~~~~~~~~-------~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~-----NE~R~~~gl~p~~ggD 396 (434) T protein:vir:43 329 NQIQQCVNKRLLTAPERIR-------YYAEFSLEGFLKADSAGRAAWYSTMAQNGFMTR-----NEGRRKENLPELPGGD 396 (434) T ss_pred HHHHHHHHhhcCChhhhcC-------ceEEEechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCC Confidence 8888999887654311111 122333334445788999999999999999876 5799999999754444 Q ss_pred cccCcccc-c--CCCCCCcccccccccCCCCCcccccc Q lcl|NC_016071. 471 MSTDELLK-L--LGQDTSRSGDGMTAGSNGNGTGKISS 505 (516) Q Consensus 471 ~~~~~~~~-~--~~~~~~~~~~~~~~~~~~~~~~~~~~ 505 (516) +..-...- + .........+...+..+.++.++..+ T Consensus 397 ~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 434 (434) T protein:vir:43 397 ILTVQSNLVPIDQLGQSNKSQAVRAALMNWFSQPEPQE 434 (434) T ss_pred eEeeccCccchhhhhccCCCcchhhhhhccCCCCCCCC Confidence 43211100 0 00001111110111111122221111 No 36 >protein:vir:1431 Length: 419 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536360;genbank:gi:17975165;genbank:GeneID:929165 Probab=99.68 E-value=1.5e-15 Score=101.84 Aligned_cols=409 Identities=9% Similarity=-0.033 Sum_probs=209.0 Q ss_pred CCcc-ccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhhChHHHHHHHHHHHHHh Q lcl|NC_016071. 1 MSTR-FAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQDHTVSTALDTKYVFVT 79 (516) Q Consensus 1 ~~~r-~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~~v~ 79 (516) .-+| ..+...-+ .++-.. . +..+.. . .....+..+ .-+..++-+.|.+|+..+-..|. 
T Consensus 2 ~~~r~~~~~~~~~-----~~~~~~----~--------~~~~~g-~--~~s~~~~~v-t~~~al~~~~v~~~v~~ia~~iA 60 (419) T protein:vir:14 2 FFSRQLLSNLGQT-----QMSAGG----W--------VSALLG-S--SRSDSGQVV-TPASALALTVLQNCVTLLAESIA 60 (419) T ss_pred ccccccccccccc-----ccCcch----h--------hHHhhc-C--CCccCCccc-chHHhhccHHHHHHHHHHHHhhc Confidence 1111 11111000 000000 0 000000 0 000011111 12445677889999999999999 Q ss_pred cCCceeeeCCCCCChhhHHHHHHHHHHHh-hccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceeecccc Q lcl|NC_016071. 80 KAFNDFKVLYNRDSKASKDAAEFVEYALK-NLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITIDKIA 157 (516) Q Consensus 80 ~~~w~i~~~~~~d~~~~~~~a~~v~~~l~-~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l~ 157 (516) +++|.+.-..+.+..... -.-+...|. +-+...++.++++.++ +.+.+|-+++++++...+ . +..|. T Consensus 61 ~lp~~~~~~~~~~~~~~~--~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G-------~--~~~l~ 129 (419) T protein:vir:14 61 QLPIELYERSGEDRKPAT--DHPLYSILKYEPNSWQTPFEYQEQSQVAVGLRGNSYSFIDRDSDG-------V--IQGLY 129 (419) T ss_pred cCceEEEEecCCcccccc--ccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCC-------c--EEEEE Confidence 999987544332211111 111233333 2333456778887744 578899999999876432 2 22344 Q ss_pred ccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCcccc Q lcl|NC_016071. 158 FRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPAG 237 (516) Q Consensus 158 ~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~G 237 (516) +.++..++ ...+.+|+.+..+. . ...+|.+- |+|......+..+| T Consensus 130 pl~~~~v~----v~~~~~~~~~y~~~----------------------------~--~~~~~~~~-i~h~~~~~~dg~~G 174 (419) T protein:vir:14 130 PLDNEAVT----VMRGSDLKPVYRVR----------------------------G--SDPMPQRL-VHHVRWMSINGYTG 174 (419) T ss_pred EecCceEE----EEECCCceEEEEEc----------------------------c--Ccccchhh-eeEecCcCCCCccc Confidence 44444332 12344444322111 0 01134443 44544333445899 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhccc-ce--EEEe Q lcl|NC_016071. 
238 VSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGE-QA--YFIL 314 (516) Q Consensus 238 ~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~-~a--~~ii 314 (516) .|.+..+....-.-....++...+...-+.+=-+++.+- ......+++..+++++.......|. ++ -+++ T Consensus 175 ~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~-------~~~~~~~~~~~~~~~~~~~~~~~g~~nag~~~vl 247 (419) T protein:vir:14 175 LSPVLLHANAIGHAQAIQQYAGKSFMNGTALSGVIERPK-------DAPALKDQASVDRITDGWNAKFGGSGNAKKVALL 247 (419) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEecC-------CCCcccCHHHHHHHHHHHHHHhcCccccCCceec Confidence 999999988776666666777777665443322333221 1111112333445555555444443 33 3666 Q ss_pred ccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHHHH-HHHHHHHHHHHH Q lcl|NC_016071. 315 PSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQSI-HGHFVQRDIDII 393 (516) Q Consensus 315 P~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~ev-~~~~~~aDa~~i 393 (516) +.|+++. ..+-+.....+.+..++...+|++++.-..--.+....++++-.+-+... -..-+.-.++.| T Consensus 248 ~~g~~~~----------~l~~~~~d~q~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~t~s~~E~~~~~f~~~~L~P~~~~i 317 (419) T protein:vir:14 248 QEGMTFR----------PLSMTNVDAALIDALRLSALDIARIYKIPAHMVNELERATFSNIEHQSLQFVIYTLLPWVKRH 317 (419) T ss_pred CCCceEE----------EccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCcccHHHHHHHHHHHHHHHHHHHH Confidence 7776422 22222233346667788889999998886644443444666555544433 345677788888 Q ss_pred HHHHHHHHHHHHHHhcCCcCCccccceEEec--CcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCccc Q lcl|NC_016071. 394 VEAFNKNLIPQLLALNDIRLSDEDMPKLKPG--LIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDM 471 (516) Q Consensus 394 ~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~--~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~ 471 (516) ++.||+.|+.+-- . ... +++|+ .....|++..++++++|++.|++.+ +.+|+.+|+|+-..+|. 
T Consensus 318 e~~l~~kll~~~~------~-~~~--~i~fd~~~l~r~d~~~~~~~~~~~~~~G~~T~-----NE~R~~~gl~p~~gGD~ 383 (419) T protein:vir:14 318 EQAKTRDLLLPSE------R-KQY--FIEYNLAGLLRGDQSSRYAAYAVGRQWGWLSI-----NDIRRLENMPPVKGGDI 383 (419) T ss_pred HHHHhhhccCccc------c-CCe--EEEEechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCcCe Confidence 8989887654311 1 111 34554 4446788999999999999999876 57999999997655554 Q ss_pred ccCcccccCCCCCCcccccccccCCCCCcccccccccchhhhhc Q lcl|NC_016071. 472 STDELLKLLGQDTSRSGDGMTAGSNGNGTGKISSTRDNSVSNMD 515 (516) Q Consensus 472 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~ 515 (516) ......- .+....... ++..+...++.-+++..-++ T Consensus 384 ~~~~~n~-~~~~~~~~~-------~~~~~~~~~~~~~e~~~~l~ 419 (419) T protein:vir:14 384 YLSPMNM-VDASKPQQL-------PVGKSEPTKAAIDEIGRILS 419 (419) T ss_pred eeecccc-ccccccccc-------cCCCCCCccccccchhcccC Confidence 3321110 000000000 01111222334444444444 No 37 >protein:vir:1884 Length: 424 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037664;genbank:gi:9634122;genbank:GeneID:1262519 Probab=99.68 E-value=7e-16 Score=103.67 Aligned_cols=417 Identities=11% Similarity=0.030 Sum_probs=203.8 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhhChHHHHHHHHHHHHHhc Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQDHTVSTALDTKYVFVTK 80 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~~v~~ 80 (516) |.+.+-+----++.+-=.+-...++..++..+......+-.... 
..+ .+..+ .-+..++.+.|.+|+..+-..|.+ T Consensus 1 ~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~-~~~~v-~~~~al~~~~v~~cv~~Ia~~iA~ 76 (424) T protein:vir:18 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAH--GHL-GDSSI-NDERILQISTVWRCVSLISTLTAC 76 (424) T ss_pred CCCCcceEeecCCCchHHHHHhhhcccccccccccccccccccc--ccc-ccccc-cHHHhhccHHHHHHHHHHHHhhcc Confidence 21111000000000000000000000000000000000000000 000 01111 113446788999999999999999 Q ss_pred CCceeeeCCCCCChhhHHHHHHHHHHHh-hccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceeeccccc Q lcl|NC_016071. 81 AFNDFKVLYNRDSKASKDAAEFVEYALK-NLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITIDKIAF 158 (516) Q Consensus 81 ~~w~i~~~~~~d~~~~~~~a~~v~~~l~-~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l~~ 158 (516) +++.+.-....+.......-.-+...|. +-+...+..+++..++ +.+.+|-++.++++...+ . +..|.+ T Consensus 77 lp~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G-------~--~~~L~p 147 (424) T protein:vir:18 77 LPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAG-------D--VISLLP 147 (424) T ss_pred CceEEEEeecCCceeeeccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCC-------c--EEEEEE Confidence 9998742211111100000111233343 2233456677777655 678899999999886542 2 223344 Q ss_pred cCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCccccc Q lcl|NC_016071. 159 RPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPAGV 238 (516) Q Consensus 159 r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~G~ 238 (516) .++.++. ...++..+.. .....+....+|.+.+|.+++.. .+.++|. 
T Consensus 148 l~~~~V~------v~~~~~~~~y--------------------------~~~~~g~~~~~~~~eIih~r~~~-~dg~~G~ 194 (424) T protein:vir:18 148 LQSANMD------VKLVGKKVVY--------------------------RYQRDSEYADFSQKEIFHLKGFG-FTGLVGL 194 (424) T ss_pred ecCcceE------EEEcCCeEEE--------------------------EEEeCCeEEEeccccEEEecCcC-CCCcccc Confidence 4443332 1222211110 00112233457777766555543 4558999 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceE--EEecc Q lcl|NC_016071. 239 SPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAY--FILPS 316 (516) Q Consensus 239 gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~--~iiP~ 316 (516) |.+..+....-.-....++-..+...-+.+--+++.|.. --++++ .+++++..+.+..|..++ ++++. T Consensus 195 spi~~~~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~-------~l~~e~---~~~~~~~~~~~~~g~nag~~~vl~~ 264 (424) T protein:vir:18 195 SPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEK-------VLTEQQ---RSQVEENFKEIAGGPVKKRLWILEA 264 (424) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEEeCCc-------CCCHHH---HHHHHHHHHHHhCCcccCCceeccC Confidence 999998877666566666666666654444333333211 112222 333445555555555554 67788 Q ss_pred CcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchh--hHHH-HHHHHHHHHHHHHHHHH Q lcl|NC_016071. 317 DMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSY--NLSE-SKQSIHGHFVQRDIDII 393 (516) Q Consensus 317 g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~--Al~~-vh~ev~~~~~~aDa~~i 393 (516) |++++. 
++ -+....+|.+..++..++|++++.-..--.+...++++ +-.+ .....-..-+.-.++.| T Consensus 265 g~~~~~--------l~--~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~eq~~~~f~~~tl~P~~~~i 334 (424) T protein:vir:18 265 GFSTSA--------IG--VTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRW 334 (424) T ss_pred CceEEe--------cC--CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccccHHHHHHHHHHHHHHHHHHHH Confidence 875332 21 12233456777788889999998876544432333333 2222 22334466678888999 Q ss_pred HHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCccccc Q lcl|NC_016071. 394 VEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMST 473 (516) Q Consensus 394 ~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~ 473 (516) ++.||+.|++.. .....+-+|.++..-..|.++.++++.++++.|++.+ +.+|+.+|+|+-..+|+.. T Consensus 335 e~~l~~~L~~~~-------~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~-----NE~R~~~gl~pi~gGD~~~ 402 (424) T protein:vir:18 335 ENSIQRWLIPAK-------DVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTI-----NEMRRTDNLPPLPGGDVAM 402 (424) T ss_pred HHHHHhhcCCcc-------ccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCcCeee Confidence 999998776542 1112233344444456788999999999999999886 5799999999754444432 Q ss_pred CcccccCCCCCCcccccccccCCCCCcccccccccchh Q lcl|NC_016071. 474 DELLKLLGQDTSRSGDGMTAGSNGNGTGKISSTRDNSV 511 (516) Q Consensus 474 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~ 511 (516) -...-.+.... 
+ +..+..|+.| T Consensus 403 ~~~n~~~l~~~---~-------------~~~~p~~~ga 424 (424) T protein:vir:18 403 RQSQYVPITDL---G-------------TNKEPRNNGA 424 (424) T ss_pred eccCccchHhh---h-------------ccCCCccCCC Confidence 11110000000 0 0111222222 No 38 >protein:vir:100150 Length: 437 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945033;genbank:gi:38707893;genbank:GeneID:2744197 Probab=99.67 E-value=2.7e-15 Score=100.48 Aligned_cols=421 Identities=11% Similarity=0.018 Sum_probs=207.4 Q ss_pred CCccccCc-ccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhhChHHHHHHHHHHHHHh Q lcl|NC_016071. 1 MSTRFAQP-SEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQDHTVSTALDTKYVFVT 79 (516) Q Consensus 1 ~~~r~~~~-~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~~v~ 79 (516) |.+-+.+. +++..+....-.. ++...+ +.. .+.+.+.. ...+..+ ..+..++-+.|.+|+..+-..|. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~g~-~~s~~~-~~~-~~~~~~~~-------~~~g~~v-~~~~al~~~~v~~ci~~Ia~~ia 69 (437) T protein:vir:10 1 MKQGKQRALGRIKSSFLKWLGV-PISLTD-GSF-WSAWGGMG-------SSSGETV-TADSALQLSAVWSCVRLIAETIA 69 (437) T ss_pred CCcchhhhhhhhHHhhhhhcCC-cccCCc-hhH-HHhhcccc-------cCCCcee-chHhhhccHHHHHHHHHHHHHHh Confidence 65322221 1111111110001 111111 111 11111110 0111111 13455678899999999999999 Q ss_pred cCCceeeeCCCCCChhhHHHHHHHHHHHh-hccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceeecccc Q lcl|NC_016071. 80 KAFNDFKVLYNRDSKASKDAAEFVEYALK-NLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITIDKIA 157 (516) Q Consensus 80 ~~~w~i~~~~~~d~~~~~~~a~~v~~~l~-~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l~ 157 (516) +++|.+.-... +.........-+...|. +-+...++.++++.++ +.+.+|-+++++++. .+ ++. .|. 
T Consensus 70 ~lp~~~~~~~~-~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~-~g---~~~------~L~ 138 (437) T protein:vir:10 70 TLPLNLYQTKP-DGTRVLAKQHRLYTVIHSQPNAENTAAEFWEVIVASMLLWGNGYARKLRS-AG---VLI------GLE 138 (437) T ss_pred hCceeEEEEcC-CCceeeccccHHHHHhhccCCcCCCHHHHHHHHHHHHhhcCCeEEEEEec-CC---cEE------EEE Confidence 99998643211 11000000111223333 3344456778887766 467899999998875 22 222 233 Q ss_pred ccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCcccc Q lcl|NC_016071. 158 FRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPAG 237 (516) Q Consensus 158 ~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~G 237 (516) +.++..+. ...+.+|....... ...+....+|.+.++++++.. .+.++| T Consensus 139 ~l~p~~v~----i~~~~~g~~~y~~~--------------------------~~~g~~~~~~~~dIih~r~~~-~d~~~G 187 (437) T protein:vir:10 139 LMLPQRTT----VKRLTSGALQYTYR--------------------------NVDGTVSTLAEDDVFHVRGFS-LDGLMG 187 (437) T ss_pred EEcCcceE----EEECCCCeEEEEEE--------------------------ecCceEEEEccccEEEecCcC-CCCccc Confidence 34433221 12233443221110 111223456777766555443 455899 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcc-cce--EEEe Q lcl|NC_016071. 238 VSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAG-EQA--YFIL 314 (516) Q Consensus 238 ~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g-~~a--~~ii 314 (516) .|.+..++...-.-....++-..+....+.|=-+++.+ ..-+.+. 
.+++++.......| ..+ .+++ T Consensus 188 ~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~--------~~l~~e~---~~~~~~~~~~~~~g~~nag~~~vl 256 (437) T protein:vir:10 188 LTPIQYAREVLGNSTAANKTSASVFRNGLRPSGVLSTD--------QILQKEK---RAEIRTDLAEQFGGAMQAGKTMVL 256 (437) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcC--------CCCCHHH---HHHHHHHHHHHhcCccccCcceec Confidence 99999998777666666677777766544343333322 2222222 23334343333223 223 4577 Q ss_pred ccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchh--hH-HHHHHHHHHHHHHHHHH Q lcl|NC_016071. 315 PSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSY--NL-SESKQSIHGHFVQRDID 391 (516) Q Consensus 315 P~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~--Al-~~vh~ev~~~~~~aDa~ 391 (516) +.|++.. ..+-+....+|.+..++..++|++++--..--.+...++++ +- .+........-+.-.++ T Consensus 257 ~~g~~~~----------~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~e~~~~~f~~~tl~P~~~ 326 (437) T protein:vir:10 257 EAGMKYQ----------AITMNPGDVQLLETRAFNIEEICRWYRVPPFMVGHSEKSTSWGTGIEQQTLGFLTFTLRPWLT 326 (437) T ss_pred cCCceEE----------eccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccchHHHHHHHHHHHHHHHHHH Confidence 8876422 22222334456777788889999998876644443333333 22 22223345556777888 Q ss_pred HHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCccc Q lcl|NC_016071. 
392 IIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDM 471 (516) Q Consensus 392 ~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~ 471 (516) .|++.||+.|++.- .....+.+|.++..-..|.++.+++++++++.|++.+ +.+|+.+|+|+-.++++ T Consensus 327 ~ie~~l~~kll~~~-------e~~~~~~~fd~~~ll~~d~~~r~~~~~~~~~~G~~T~-----NE~R~~~gl~pi~gg~~ 394 (437) T protein:vir:10 327 RIEQAARRSLLRPG-------ERDQFYAEFSVEGLLRADSAGRAAFYSTMTQNGLMTR-----DECRAKENLPPMGGNAA 394 (437) T ss_pred HHHHHHHhhccCcc-------ccCceEEEEechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCcc Confidence 88888888775431 0111223333444456788999999999999999886 57999999986544443 Q ss_pred ccCcccccC-----CCCCCc--ccccccccCCCCCcccccccc Q lcl|NC_016071. 472 STDELLKLL-----GQDTSR--SGDGMTAGSNGNGTGKISSTR 507 (516) Q Consensus 472 ~~~~~~~~~-----~~~~~~--~~~~~~~~~~~~~~~~~~~~~ 507 (516) ......... .+...+ ..++.+.+..++...+..+-| T Consensus 395 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~ 437 (437) T protein:vir:10 395 VLTVQSALLPIDKLGEHTTATAAQDALKAWLYQEEKTRATQER 437 (437) T ss_pred eEeecCcccchhhccCcCCCcchhccccccCCCCCCCCccccC Confidence 221111111 111111 111111121222222222222 No 39 >protein:vir:483 Length: 413 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543090;swissprot:trembl:q8w629;genbank:gi:18249902;uniprot:Q8W629;genbank:GeneID:929685 Probab=99.67 E-value=5.9e-16 Score=104.08 Aligned_cols=405 Identities=11% Similarity=0.016 Sum_probs=205.0 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhhChHHHHHHHHHHHHHhc Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQDHTVSTALDTKYVFVTK 80 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~~v~~ 80 (516) .+.-+.+.+.. +..++..+.+. +.. ..+. ..+..+. 
-+..++-+.|.+|+..+-..|.+ T Consensus 3 f~~~f~r~~~~-------~~~~~~~~~~~-------~~~-----~~~~-~~g~~v~-~~~~l~~~~v~~~i~~Ia~~iA~ 61 (413) T protein:vir:48 3 FSGLFQRKSDA-------PVTTPAELAEA-------IGL-----SYDT-YTGKRIS-SQRAMRLTAVYSCVRVLAESVGM 61 (413) T ss_pred cchhhccCccC-------CccchHHHHHh-------hhc-----Cccc-ccCceec-hhhhhccHHHHHHHHHHHHhhhh Confidence 11111111000 00111000000 000 0000 0011110 13345678899999999999999 Q ss_pred CCceeeeCCCCCChhhHHHHHHHHHHHh-hccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceeeccccc Q lcl|NC_016071. 81 AFNDFKVLYNRDSKASKDAAEFVEYALK-NLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITIDKIAF 158 (516) Q Consensus 81 ~~w~i~~~~~~d~~~~~~~a~~v~~~l~-~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l~~ 158 (516) +++.+....+..... ..-.-+...|. +-+...++.+++..++ +.+.+|-+++++++.. +++. .|.+ T Consensus 62 ~p~~~~~~~~~~~~~--~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~~----g~~~------~L~~ 129 (413) T protein:vir:48 62 LPCSLYKISGTLKTR--VVDERLHKLVSAKPNGYMTPQEFWELVIVCLCLRGNFYAYKVKAL----GEVV------ELLP 129 (413) T ss_pred CceEEEEecCCccee--ecccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCceEEEEEeCC----CcEE------EEEE Confidence 999875433221111 00111233343 2333456777887766 5778899998887642 1222 3334 Q ss_pred cCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCccccc Q lcl|NC_016071. 159 RPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPAGV 238 (516) Q Consensus 159 r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~G~ 238 (516) .++.+++ ...+.+++.+.... ...+....+|...++++++.. .+.++|. 
T Consensus 130 l~~~~v~----~~~~~~~~~~y~~~--------------------------~~~g~~~~~~~~evih~~~~~-~d~~~G~ 178 (413) T protein:vir:48 130 IDPGCVE----PKLNSQWQPVYQVT--------------------------FPDGSVDVLTQDEIWHVRTLT-LDGLVGL 178 (413) T ss_pred EcCceEE----EEEcCCceEEEEEE--------------------------ecCceEEEEccccEEEecCcC-CCCcccc Confidence 4443332 23344443321111 011223346776666665554 4558999 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcc-cce--EEEec Q lcl|NC_016071. 239 SPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAG-EQA--YFILP 315 (516) Q Consensus 239 gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g-~~a--~~iiP 315 (516) |.+..|+...-.-....++...+...-+.|=-+++.+ ...+.++ .+++++.......| .++ -++++ T Consensus 179 s~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~--------~~~~~e~---~~~~~~~~~~~~~g~~n~g~~~vl~ 247 (413) T protein:vir:48 179 NPIAYAREAISLAAATEEHGARLFGNGAVTSGVLRTE--------QKLTPDA---YERLKKDFEERHTGLGNAHRPMILE 247 (413) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeC--------CCCCHHH---HHHHHHHHHHHhcCccccCcceecC Confidence 9999999876665666666666665434332233322 2223333 23344443333333 223 36778 Q ss_pred cCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHH-HHHHHHHHHHHHHHH Q lcl|NC_016071. 316 SDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQ-SIHGHFVQRDIDIIV 394 (516) Q Consensus 316 ~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~-ev~~~~~~aDa~~i~ 394 (516) .|++++ ..+-+.....+.+..++...+|+.++.-..-..+..+.++++-.+-+. 
..-...+.-.++.|+ T Consensus 248 ~g~~~~----------~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n~e~~~~~f~~~~i~P~~~~ie 317 (413) T protein:vir:48 248 MGLDWK----------SMALNAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDRATFNNIEELGLGFINYSLVPYLTRIE 317 (413) T ss_pred CCceEE----------eccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCCCcccHHHHHHHHHHHHHHHHHHHHH Confidence 886432 222222334566777888899999988865444434446666555443 444556788889999 Q ss_pred HHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccccC Q lcl|NC_016071. 395 EAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMSTD 474 (516) Q Consensus 395 ~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~~ 474 (516) +.||+.|+.+.-. ...+-+|.++.....|+++.+++++++++.|++.+ +.+|+.+|+|+-+.+|+... T Consensus 318 ~~l~~~L~~~~~~-------~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~-----NE~R~~~g~~p~~ggD~~~~ 385 (413) T protein:vir:48 318 QRINTGLVRESKQ-------GKFYAKFNAGALLRGDMKSRFEAYATGINWGIYSP-----NDCRDLEDMNPRPGGDVYLT 385 (413) T ss_pred HHHHhhccCcccc-------CCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCcceeec Confidence 9999877754210 11222333445555788999999999999999886 57999999987655554332 Q ss_pred cccccCCCCCCcccccccccCCCCCcccccccccchhh Q lcl|NC_016071. 475 ELLKLLGQDTSRSGDGMTAGSNGNGTGKISSTRDNSVS 512 (516) Q Consensus 475 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 512 (516) .......+ ....+...... 
++ ..|++++ T Consensus 386 ~~n~~~~~--~~~~~~~~~~~--~~------~~~~~~~ 413 (413) T protein:vir:48 386 PMNMTTSP--SAGDDNGKKKE--SG------DADKTAS 413 (413) T ss_pred cccccccc--cccccCCCCCC--CC------CccccCC Confidence 21111110 01111000011 11 1111111 No 40 >protein:vir:4454 Length: 414 # NCBI annotation: Portal Protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700377;genbank:gi:23505449;genbank:GeneID:955656 Probab=99.66 E-value=3.4e-15 Score=99.93 Aligned_cols=405 Identities=12% Similarity=0.040 Sum_probs=205.8 Q ss_pred CC--ccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhhChHHHHHHHHHHHHH Q lcl|NC_016071. 1 MS--TRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQDHTVSTALDTKYVFV 78 (516) Q Consensus 1 ~~--~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~~v 78 (516) |. +|.-... ++.+..++..+.+. + + ...+.. .+..+. .+..++-+.|.+|+..+-..| T Consensus 1 Mg~f~~lf~r~------~~~~~~~~~~~~~~-------~-~----~~~~~~-~g~~v~-~~~al~~~~v~~~i~~Ia~~i 60 (414) T protein:vir:44 1 MVFFSGLFQRK------SDAPVTTPAELADA-------I-G----LSYDTY-TGKQIS-SQRAMRLTAVFSCVRVLAESV 60 (414) T ss_pred CchhhhhhccC------ccCcccchhhHhHh-------h-c----cCcccc-CCceec-hhhhhccHHHHHHHHHHHHHh Confidence 22 1111100 01111111100000 0 0 000111 111111 134456889999999999999 Q ss_pred hcCCceeeeCCCCCChhhHHHHHHHHHHHh-hccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceeeccc Q lcl|NC_016071. 79 TKAFNDFKVLYNRDSKASKDAAEFVEYALK-NLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITIDKI 156 (516) Q Consensus 79 ~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~-~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l 156 (516) .++++.+....+.... .....-+...|. +-+...++.++++.+. +.+.+|-++++++.. . .++. 
.| T Consensus 61 a~~p~~~~~~~~~~~~--~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gna~~~i~~~-~---g~~~------~L 128 (414) T protein:vir:44 61 GMLPCNLYHLNGSLKQ--RATGERLHKLISTHPNGYMTPQEFWELVVTCLCLRGNFYAYKVKA-F---GEVA------EL 128 (414) T ss_pred ccCceEEEEecCCcee--ecccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEeC-C---CcEE------EE Confidence 9999987543221111 111111233333 2333456777887766 467789999887543 2 1222 33 Q ss_pred cccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCccc Q lcl|NC_016071. 157 AFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPA 236 (516) Q Consensus 157 ~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~ 236 (516) .+.++..+. ..++.+|+.+.... ...+....+|.+.+|++++.. .+.++ T Consensus 129 ~~l~~~~v~----~~~~~~~~~~y~~~--------------------------~~~g~~~~~~~~evih~~~~~-~d~~~ 177 (414) T protein:vir:44 129 LPVDPGCVV----PKLNSSWEPVYQVT--------------------------FPDGSTDVLSQEDIWHVRTLT-LDGLV 177 (414) T ss_pred EEEcCceEE----EEECCCCcEEEEEE--------------------------ecCceEEEEccccEEEecCCC-CCCcc Confidence 344443221 12344444332111 112233456777766666553 45589 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcc-cce--EEE Q lcl|NC_016071. 237 GVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAG-EQA--YFI 313 (516) Q Consensus 237 G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g-~~a--~~i 313 (516) |.|.+..++...-.-....++...+...-+.|--+++.+ ..-+++. 
.+++++.......| .++ -++ T Consensus 178 G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~--------~~l~~e~---~~~~~~~~~~~~~g~~n~~~~~v 246 (414) T protein:vir:44 178 GLNPIAYAREAISLAAATEEHGARLFSNGAVTSGVLRTE--------QTLSDQA---YERLKKDFEERHTGLGNAHRPMI 246 (414) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeC--------CCCCHHH---HHHHHHHHHHHhcCccccCccee Confidence 999999998766555555566666665544443333332 1222222 23343333332233 223 457 Q ss_pred eccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHHH-HHHHHHHHHHHH Q lcl|NC_016071. 314 LPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQS-IHGHFVQRDIDI 392 (516) Q Consensus 314 iP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~e-v~~~~~~aDa~~ 392 (516) +|.|++.+ ..+-+....+|.+..++...+|++++--..--.+..+.++++-.+.+.. .....++-.++. T Consensus 247 l~~g~~~~----------~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~l~~~~~~t~~n~e~~~~~~~~~~l~P~~~~ 316 (414) T protein:vir:44 247 LEMGLDWK----------SMALNAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDRATFNNIEELGLGFINYSLVPYLTR 316 (414) T ss_pred cCCCceEE----------EccCChHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHH Confidence 78886432 2222233345777778888999998877654444344466666555543 345577888889 Q ss_pred HHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccc Q lcl|NC_016071. 393 IVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMS 472 (516) Q Consensus 393 i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~ 472 (516) |++.||+.|++.- .....+.+|.++.....|+++.+++++++++.|++.+ +.+|+.+|+|+-..+|+. 
T Consensus 317 ie~~ln~~L~~~~-------~~~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~-----NE~R~~~gl~p~~ggD~~ 384 (414) T protein:vir:44 317 IEQRINTGLVRKS-------KQGVFYAKFNAGALLRGDMKSRFEAYATGINWGIYSP-----NDCRDLEDMNPRPGGDVY 384 (414) T ss_pred HHHHHHhhcCCcc-------ccCceEEEEechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCccee Confidence 9999998776531 1111223344445556788999999999999999886 579999999965445543 Q ss_pred cCcc-cccCCCCCCcccccccccCCCCCcccccccccchhh Q lcl|NC_016071. 473 TDEL-LKLLGQDTSRSGDGMTAGSNGNGTGKISSTRDNSVS 512 (516) Q Consensus 473 ~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 512 (516) .... ....+......+ ..+.+++. |.+++ T Consensus 385 ~~~~n~~~~~~~~~~~~---~~~~~~~~--------d~~~~ 414 (414) T protein:vir:44 385 LTPMNMTTKPSDGSKAG---KQKDNANA--------DETTS 414 (414) T ss_pred cccccccccCCccccCC---CCCCCCCC--------CCCCC Confidence 2111 111111111111 11111111 11111 No 41 >protein:vir:81152 Length: 411 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285809;genbank:gi:148747730;genbank:GeneID:5247195 Probab=99.66 E-value=1.4e-15 Score=102.08 Aligned_cols=403 Identities=13% Similarity=0.035 Sum_probs=211.4 Q ss_pred CCc--cccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhhChHHHHHHHHHHHHH Q lcl|NC_016071. 1 MST--RFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQDHTVSTALDTKYVFV 78 (516) Q Consensus 1 ~~~--r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~~v 78 (516) |.= |+....+ +.. + .. +.-... +..+. +. 
..-+-+..++.+.|.+|+..+-..| T Consensus 1 MG~~~~~~~~~~--~~~---~---~~---~~~~~~---~~~~~---------g~-~~~~~~~al~~~~V~~~v~~Ia~~i 56 (411) T protein:vir:81 1 MGWWSRLTRFFR--PRN---E---TV---DMTNPL---LLQWL---------GV-DPDTPRNQLSEATYFACLKILSESL 56 (411) T ss_pred CchHHHHHhhcc--Ccc---c---cc---ccchHH---HHHHh---------cC-cccChhhhhccHHHHHHHHHHHHhH Confidence 332 2211110 000 0 00 000011 11111 00 1111244557889999999999999 Q ss_pred hcCCceeeeCCCCCChhhHHHHHHHHHHHh-hccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceeeccc Q lcl|NC_016071. 79 TKAFNDFKVLYNRDSKASKDAAEFVEYALK-NLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITIDKI 156 (516) Q Consensus 79 ~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~-~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l 156 (516) .++++++....+ +... +..-.-+...|. +-+...++.++++.++ +.+.+|-+.+++++. ++ ++. .| T Consensus 57 A~lp~~~~~~~~-~~~~-~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~--~g--~~~------~l 124 (411) T protein:vir:81 57 GKLPLKMYQKTE-RGIV-KSDREELYNLLKLRPNPYMTSSVFWSTVEMNRNHYGNAYVWCQYS--GP--QLQ------AL 124 (411) T ss_pred hhCceeEEEecC-Ccee-eecccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEec--CC--ceE------EE Confidence 999998853322 1110 111111233343 2334457788888876 478899999998875 21 222 23 Q ss_pred cccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCccc Q lcl|NC_016071. 157 AFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPA 236 (516) Q Consensus 157 ~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~ 236 (516) .+.++.++. ...++++.. ...... 
+ + .......+....+|.+..|++++....+..+ T Consensus 125 ~~l~~~~v~----~~~~~~~~~--~~~~~~--~--~-------------~~~~~~~g~~~~~~~~eiih~k~~~~~~~~~ 181 (411) T protein:vir:81 125 WILPSQYVT----IVVDDRGLL--GEKNAI--W--Y-------------RYNDPYDGKMYVFRNDEILHFKTSVTFDGIT 181 (411) T ss_pred EEECCceEE----EEEcCcccc--cccceE--E--E-------------EEEecCCceEEEEccccEEEEcCCCCCCCcc Confidence 344443332 233333321 100000 0 0 0001112344567888877776666667789 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhccc-ce--EEE Q lcl|NC_016071. 237 GVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGE-QA--YFI 313 (516) Q Consensus 237 G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~-~a--~~i 313 (516) |.|.+..+....-.-....++...+....+.|--+++.+ ..-+++.. +++++.......|. .+ .++ T Consensus 182 G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~--------~~l~~e~~---~~~~~~~~~~~~g~~n~g~~~v 250 (411) T protein:vir:81 182 GLSVRDVLKHTVDGALESQKFMNNLYKTGLTGKAVLEYT--------GDLNQEAR---DRLVKGFEQFANGSKNAGKIIP 250 (411) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeC--------CCCCHHHH---HHHHHHHHHHhcCccccCCcee Confidence 999999999877777777777777776544443333332 22233332 33333433333332 22 466 Q ss_pred eccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHH-HHHHHHHHHHHHH Q lcl|NC_016071. 314 LPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQ-SIHGHFVQRDIDI 392 (516) Q Consensus 314 iP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~-ev~~~~~~aDa~~ 392 (516) ++.|++++. .+-+.....+.+..++..++|++++--..-..+..+.++++-++.+. .....-+.-.++. 
T Consensus 251 l~~g~~~~~----------l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n~e~~~~~f~~~~l~P~~~~ 320 (411) T protein:vir:81 251 VPLGMKLVP----------LDIKLTDSQFFELKKYTALQIAAAFGIKPNQINDYEKSSYASAEAQNLAFYVDTLLYVLKQ 320 (411) T ss_pred cCCCceEEE----------ccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCchhHHHHHHHHHHHHHHHHHHH Confidence 778864322 21222334566777888899999988876544434456776666554 3345557788888 Q ss_pred HHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccc Q lcl|NC_016071. 393 IVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMS 472 (516) Q Consensus 393 i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~ 472 (516) |++.||+.|+..-.. ....+-+|.++.....|.+..++++++++..|++.+ +.+|+.+|+|+-+.+|+. T Consensus 321 ie~~l~~~ll~~~~~------~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~g~~t~-----NE~R~~~gl~p~~ggD~~ 389 (411) T protein:vir:81 321 YEEEITYKILSNDLI------SQGHYFKFNVNVILRADIKTQMDSLSTAVQNGIMTP-----NEARDYLDMPADDYGNNL 389 (411) T ss_pred HHHHHHhhcCChhhc------CCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCCee Confidence 999999877654211 111222333444456788999999999999999886 578999999864444433 Q ss_pred cCcccccCCCCCCcccccccccCCCCCcccccccccc Q lcl|NC_016071. 473 TDELLKLLGQDTSRSGDGMTAGSNGNGTGKISSTRDN 509 (516) Q Consensus 473 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 509 (516) .....- .|-.. 
.++ ...++.|+ T Consensus 390 ~~~~n~-~pl~~--~~~------------~~~kgGd~ 411 (411) T protein:vir:81 390 MANGNY-IPLSM--LGA------------NYGKGGDS 411 (411) T ss_pred eeccCc-cchhh--hhh------------hhccCCCC Confidence 211110 00000 000 00112222 No 42 >protein:vir:1326 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047925;swissprot:trembl:q9zxb2;genbank:gi:9631143;uniprot:Q9ZXB2;genbank:GeneID:2715872 Probab=99.66 E-value=3e-15 Score=100.21 Aligned_cols=425 Identities=9% Similarity=-0.021 Sum_probs=203.2 Q ss_pred CCc--cccCcc-cccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhhChHHHHHHHHHHHH Q lcl|NC_016071. 1 MST--RFAQPS-EVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQDHTVSTALDTKYVF 77 (516) Q Consensus 1 ~~~--r~~~~~-~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~~ 77 (516) |.- |+.... .......+.-...+. .+.. .........+..+. .+..++-+.|.+|+..+-.. T Consensus 1 Mg~~~~l~~r~~~~~~~~~~~~~~~~~------~~~~--------~~~~~~~~~g~~V~-~~~al~~~~V~~~v~~Ia~~ 65 (457) T protein:vir:13 1 MGFWSALFGRGHSPALDGIEARAWEPY------DPSI--------YNLGAVAASGETVT-PHDALQVSAVFASVRLLSET 65 (457) T ss_pred Cchhhhhhccccccccccccccccccc------chHH--------HhhcccccCCceec-hHHhhccHHHHHHHHHHHHh Confidence 332 111110 000000000000000 0000 00000000111111 23445678899999999999 Q ss_pred HhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceeeccc Q lcl|NC_016071. 78 VTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITIDKI 156 (516) Q Consensus 78 v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l 156 (516) |.++++.+.-..+..... .....+...++.-++..++.++++.++ +.+.+|.+++++++.. 
+ .+.+ | T Consensus 66 iA~lp~~~~~~~~~~~~~--~~~~~l~~~ln~~~n~~t~~~f~~~~~~~lll~Gna~~~i~~~~--g--~~~~------l 133 (457) T protein:vir:13 66 IATLPLSTYSKRGGSRKE--IVTPEWLDYPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWQG--P--NIVG------L 133 (457) T ss_pred hccCceEEEEecCCcccc--cccchHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEecC--C--cEEE------E Confidence 999999886543322111 112223333433333356677887766 5788999999998752 1 1222 2 Q ss_pred cccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCC---ccccccccEEEEeecCcCC Q lcl|NC_016071. 157 AFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSAD---EVFIPINKLMVMSLGGTES 233 (516) Q Consensus 157 ~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~iP~~k~i~~~~~~~~g 233 (516) .+-++..++. .....++..... +..+. ....+. ...+|...+|.+++....+ T Consensus 134 ~~l~p~~v~v---~~~~~~~~~~~~-------~~~y~---------------~~~~~~~~~~~~~~~~diih~~~~~~~~ 188 (457) T protein:vir:13 134 DVLDPTKIHV---HMVMVDGLRRKV-------FEAYD---------------IDADGNEVLLGWFTPRDVLHIPGMMLPG 188 (457) T ss_pred EEEccCceEE---EEecCCCcccee-------EEEEE---------------EecCCceeeEEeeCccceEEecCCCCCC Confidence 2222222210 011111100000 00000 000000 1134566666555555556 Q ss_pred ccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccc---e Q lcl|NC_016071. 234 NPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQ---A 310 (516) Q Consensus 234 ~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~---a 310 (516) ..+|.|.+..+....-.-....++...+....+.|--+++.+ ..-+.+. .+++++.......|.+ . 
T Consensus 189 ~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~--------~~ls~e~---~~~~~~~~~~~~~g~~nag~ 257 (457) T protein:vir:13 189 DFVGCSPISYARESIGLALAAQKYGSKFFANGAMPGAVVEVP--------GTMSEEG---LARAREAWRAANSGVDNAHR 257 (457) T ss_pred ccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEcC--------CCCCHHH---HHHHHHHHHHHhcCccccCc Confidence 689999999998877777777777777776544443333322 2223333 4445555554444432 2 Q ss_pred EEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhh---HHHHHHHHHHHHHH Q lcl|NC_016071. 311 YFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYN---LSESKQSIHGHFVQ 387 (516) Q Consensus 311 ~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~A---l~~vh~ev~~~~~~ 387 (516) .+++|.|++.+- .+-+.....|.+..++.-.+|++++--..--.+..+++++. ..+........-+. T Consensus 258 ~~vl~~g~~~~~----------l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~eq~~~~f~~~tl~ 327 (457) T protein:vir:13 258 VALLTEGAKFSK----------VAMSPDEAQFLQTRQFQVPEIARIFGVPPHLISDATNSTSWGSGLAEQNIAFTMFSLR 327 (457) T ss_pred ceecCCCceEEE----------ccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcccccchHHHHHHHHHHHHHH Confidence 467788864332 11122333466666788889999887755333322223321 22223334455567 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCC Q lcl|NC_016071. 388 RDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEI 467 (516) Q Consensus 388 aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~ 467 (516) -.++.|+..||+.|+... .....+-+|.++.....|++..++++.++++.|++.+ +.+|+.+|+|+-. 
T Consensus 328 P~~~~ie~~ln~~L~~~~-------~~~~~~i~fd~~~l~~~D~~~r~~~~~~~~~~G~~T~-----NE~R~~~gl~Pi~ 395 (457) T protein:vir:13 328 PWLERIEAGFNRLLFAET-------ADRFRFVKFNLDEIKRGAPKERMELWSLGLQNGIYSI-----DEVRAAEDMTPLP 395 (457) T ss_pred HHHHHHHHHHHHhhcCcc-------ccCceeEEeechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCC Confidence 788888888888776542 1111223344445566799999999999999999886 5799999998543 Q ss_pred Cc--ccccCcc------cc--c----CCCC-----CCcccccccccCCCCCcccccccccch Q lcl|NC_016071. 468 PE--DMSTDEL------LK--L----LGQD-----TSRSGDGMTAGSNGNGTGKISSTRDNS 510 (516) Q Consensus 468 ~~--~~~~~~~------~~--~----~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~d~~ 510 (516) ++ |+..... .. + .+++ ..+..+..+.+.+...++......|.| T Consensus 396 ~g~~d~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~d~~~~~~~~~~~~~ 457 (457) T protein:vir:13 396 DGLGEKYRVPLNLGEVGEEPEPEPAPAPPAIEPPAEEPDEEPEPEGKPDDEGATEEDDEDDA 457 (457) T ss_pred CCcccceeeccccccccccccccccCCCCCCCCCccccCCCCCCCCCCccccCCCCcccccC Confidence 32 2211100 00 0 0000 000011111222222222223333333 No 43 >protein:vir:80644 Length: 551 # NCBI annotation: gp23 # Family: family:all:2446 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468463;genbank:gi:157325038;genbank:GeneID:5601615 Probab=99.65 E-value=7.1e-15 Score=98.17 Aligned_cols=441 Identities=11% Similarity=0.086 Sum_probs=201.9 Q ss_pred CCccccCccccc---------chhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccH-HHHHHHhhChHHHHH Q lcl|NC_016071. 1 MSTRFAQPSEVV---------KAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFL-ATVEAMKQDHTVSTA 70 (516) Q Consensus 1 ~~~r~~~~~~~~---------~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~-~~y~~m~~D~~v~s~ 70 (516) +-.|+.....+. +++.-.. ........+|.. ..-.++. 
..|..+-+..+ .+.+.+.+-+.|.+| T Consensus 23 ~~~~~~~~~~~~~~~~~~~~~~k~~~~~-~~a~~~~~~~~~--~~~~~~~---~r~~~~~~~~l~~~~~~~~~npiv~~~ 96 (551) T protein:vir:80 23 KHIEVDDNYSIAIQQREQEQISKAMNNK-EVAYSQPVIGSM--SANPGFK---TKPSIRNNQDLHGVLKKFGGNIILNAI 96 (551) T ss_pred cccccccceeeecccccHHHHHHhhccC-cceeecccccce--ecCcccc---cCccccChhHHHHHHHHhhcCHHHHHH Confidence 222221110000 0000000 000000111110 0000110 01111111112 122223346899999 Q ss_pred HHHHHHHHhc-----------CCceeeeCCCC--CChhhHHHHHHHHHHHhhccC-----cCCHHHHHHHHH-HHHhhcc Q lcl|NC_016071. 71 LDTKYVFVTK-----------AFNDFKVLYNR--DSKASKDAAEFVEYALKNLAN-----QQTLRDIARSAA-TFNEYGF 131 (516) Q Consensus 71 l~~Rk~~v~~-----------~~w~i~~~~~~--d~~~~~~~a~~v~~~l~~~~~-----~~~~~~~l~~~l-da~~~G~ 131 (516) ++.|...|.+ ..|.+.+.... ....+.+..+.++.+|.+.+. +.+|.+++..++ +.+.+|. T Consensus 97 I~~ia~~IA~~~~~~~~~~~g~~~~i~~kd~~~~~~~~~~~~~~~i~~~l~~pn~~~~p~~~s~~~f~~~lv~dlll~Gn 176 (551) T protein:vir:80 97 INTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIKRIESFIEKTGVDNDINRDSFSSFVKKIVRDTYMYDQ 176 (551) T ss_pred HHHHHHHHhhhhhhhhhhcCCCCceEEecccCcccChhHHHHHHHHHHHHHhcCCCCCCccchHHHHHHHHHHHHHhcCC Confidence 9999999976 45666554321 122333334445555554332 246788888766 5788999 Q ss_pred eeeeEEEeecccccccccceeeccccccCchhcccccceeecCCCceeeecccccccccccccccccccccccccccccc Q lcl|NC_016071. 132 SIFEKVYRTESAPSKYAGYITIDKIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTS 211 (516) Q Consensus 132 S~~Eivw~~~~~~~~~~g~~~~~~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~ 211 (516) +.+|+++...+ ++ ..|.+.++.+++ ...+.+|...... .++..... 
T Consensus 177 ay~~i~rd~~G---~~------~~L~~l~p~~V~----v~~~~~g~~~~~~---------------------~~y~~~~~ 222 (551) T protein:vir:80 177 VNFEKVFNRNQ---SM------VRFVAKDPTTIF----FATTADGKIPDNG---------------------NRFVQVID 222 (551) T ss_pred EEEEEEECCCC---cE------EEEEEeCCceeE----EEECCccccccCc---------------------eEEEEEeC Confidence 99999986543 22 234444444442 2334444321100 00001111 Q ss_pred CCCccccccccEEEEeecCcC---CccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCH Q lcl|NC_016071. 212 SADEVFIPINKLMVMSLGGTE---SNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKS 288 (516) Q Consensus 212 ~~~~~~iP~~k~i~~~~~~~~---g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~ 288 (516) +...+.+|.+.+|++++.... ..+||.|.+..+......-....++-..|...-+.|--++..+ .....+. T Consensus 223 g~~~~~~~~~eiiH~~~n~~~~~~~~~~G~spi~~a~~~i~~~~a~~~~~~~~f~Ng~~p~giL~~~------~~~~lt~ 296 (551) T protein:vir:80 223 QKIVATFNAREMAFAVRNPRSDIYATGYGYPELEIALKQFIAHENTEAFNDRFFSHGGTTRGILQIK------AAQQQSQ 296 (551) T ss_pred CcEEEEEcccceEEecccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHcCCCcceEEEEc------CCCCCCH Confidence 222345677776666655433 3578999999888877777767776666666433222222211 1111222 Q ss_pred HHHHHHHHHHHHHHHhhcc-cceEE--Ee-ccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccc Q lcl|NC_016071. 289 PESEMVQGLMADAANAHAG-EQAYF--IL-PSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINL 364 (516) Q Consensus 289 ~~~~~l~~l~~~~~~~~~g-~~a~~--ii-P~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts 364 (516) + ..+++++.......| ..++. ++ +.|+ ++...+.+.....|.+..++..++|++++.-.-.-. 
T Consensus 297 e---~~~~lk~~~~~~~~G~~nag~~~vl~~~g~----------~~~~l~~~~~D~qfle~~~~~~~~Ia~aFgVPp~~l 363 (551) T protein:vir:80 297 H---ALEIFKREWKNSLSGINGSWQIPVVSAEDV----------KFVNMTPSARDMEFEKWLNYLINVISALYGIDPAEI 363 (551) T ss_pred H---HHHHHHHHHHHHhcCccccCccccccCCCc----------eEEEccCChhHHHHHHHHHHHHHHHHHHhcCCHHHc Confidence 2 234455544443334 24432 33 3454 333333333444577777888899999875433211 Q ss_pred c--------CCccch--hhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHH Q lcl|NC_016071. 365 G--------NDGQGS--YNLSESKQ-SIHGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEG 433 (516) Q Consensus 365 ~--------~~~~GS--~Al~~vh~-ev~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~ 433 (516) + +..++| ++-.+... ......+.--++.|+..||+.|++.+ + .. -+|.|+.....+... T Consensus 364 G~~~~~~~~~~~~~s~t~sn~e~~~~~f~~~tL~P~~~~ie~~ln~~L~~~~-------~--~~-~~f~f~~~~~~~~~~ 433 (551) T protein:vir:80 364 NIPNNGGATGSKGGSLNEGNSAEKNQASKNKGLQPLLGFIEDFINKHIVAEF-------G--DK-YTFQFVGGDIKSELE 433 (551) T ss_pred CcccccccccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc-------C--Cc-eEEEeeccChhhHHH Confidence 1 111222 23333333 34556688899999999998776531 1 12 267787777777666 Q ss_pred HHHHHHHHHhCCcccccHHHHHHHHHHcCCCCC-CCcccccCccc-cc-------------------------CCCCCCc Q lcl|NC_016071. 434 FSKFVQRIGAVGYLPKTPTVINKILEVGGFDEE-IPEDMSTDELL-KL-------------------------LGQDTSR 486 (516) Q Consensus 434 ~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~-~~~~~~~~~~~-~~-------------------------~~~~~~~ 486 (516) .+++. +++..|++.+ +.+|+.+|+|+. ..+|....... .+ .+....+ T Consensus 434 ~~~~~-~~~~~g~lT~-----NE~R~~~gl~P~~egGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 507 (551) T protein:vir:80 434 SVKIL-AEKAKVAMTV-----NEVRKELNLPGDVIGGDIPLNGVIVQRIGQLMQQEQFEHEKQQSNLQMLQEQTGNRVST 507 (551) T ss_pred HHHHH-HHHhcCCcCH-----HHHHHHhCCCCCCCCCceeecccccccccccccccCcchhhhhhccccccCcCCCCCCC Confidence 65544 5666788764 689999999763 33343221100 00 0000000 Q ss_pred ccccccccCC---CCCcccccccccchhh--hhcC Q lcl|NC_016071. 
487 SGDGMTAGSN---GNGTGKISSTRDNSVS--NMDN 516 (516) Q Consensus 487 ~~~~~~~~~~---~~~~~~~~~~~d~~~~--~~~~ 516 (516) ..+....++. +.++.......+++.+ .+-+ T Consensus 508 ~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 542 (551) T protein:vir:80 508 DVEDIPDGKDTTGDIGKDGQRKDKDNANAGKQGMK 542 (551) T ss_pred CCCCCCCccccCCCccccccccCccccchhhhhcC Confidence 0000000000 0111111111111111 1111 No 44 >protein:vir:100249 Length: 431 # NCBI annotation: gp78 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355414;genbank:gi:77864704;genbank:GeneID:3725971 Probab=99.65 E-value=5.4e-15 Score=98.79 Aligned_cols=411 Identities=13% Similarity=0.053 Sum_probs=206.0 Q ss_pred CC-----ccccCcccccchhhhcccCCC--CcccccchHHHHHHHH-----HHHhhcccccCCcccHHHHHHHhhChHHH Q lcl|NC_016071. 1 MS-----TRFAQPSEVVKAGNENLAVSR--LRTGELGSGALSQLRA-----ESEVMKVEELRWPCFLATVEAMKQDHTVS 68 (516) Q Consensus 1 ~~-----~r~~~~~~~~~~~~~~p~~~~--~~~~e~g~~~~~~~~~-----~~~~~~~~~lr~~~~~~~y~~m~~D~~v~ 68 (516) |. .|.+++....+.-.+ |+.+. .....-|.. +.+ +........+. +..+. .+..++-+.|. T Consensus 1 Mgl~d~~r~~~~~~~~~~~~~~-~~~~~~~~~~~~~g~~----~~~~~~~~~~~~~~~~~~~-g~~v~-~~~al~~~~V~ 73 (431) T protein:vir:10 1 MGLFDFIRREKQPEAQARPHVE-PSFQASTPTTSIPGET----FEGLDDPRLKEYIRRGELN-GGTGR-ETRALRNMAVL 73 (431) T ss_pred CcchhhhhcCcccccccccccc-cccccccccccccccc----cccccchHHHHhhccCccC-cceec-hhhhhccHHHH Confidence 33 222222111111111 11111 000111110 111 00111111111 11121 24556788999 Q ss_pred HHHHHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhh-ccCcCCHHHHHHHH-HHHHhhcceeeeEEEeecccccc Q lcl|NC_016071. 69 TALDTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKN-LANQQTLRDIARSA-ATFNEYGFSIFEKVYRTESAPSK 146 (516) Q Consensus 69 s~l~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~-~~~~~~~~~~l~~~-lda~~~G~S~~Eivw~~~~~~~~ 146 (516) +|+..+-..|.++++.+.-..+... +..-.-+...|.. -+...+..+++..+ .+.+.+|-+++++++.. + . 
T Consensus 74 ~ci~~Ia~~iA~lp~~v~~~~~~~~---~~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~-g---~ 146 (431) T protein:vir:10 74 RCVTLISGTIGMLPMNLISSDDSKQ---VLTDDPAHRLLKYKPNDWQTPMEFKSLMQLRALLDGESMARIVWSG-N---R 146 (431) T ss_pred HHHHHHHHhhccCceEEEEecCcee---eeccchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcC-C---c Confidence 9999999999999998743221111 1111223334432 23345667777664 46778999999998852 1 1 Q ss_pred cccceeeccccccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEE Q lcl|NC_016071. 147 YAGYITIDKIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVM 226 (516) Q Consensus 147 ~~g~~~~~~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~ 226 (516) + ..|.+.++.++. ...+.+|+...... ...+..+.+|...++.+ T Consensus 147 ~------~~L~pl~~~~v~----~~~~~~~~~~y~~~--------------------------~~~g~~~~~~~~dViHi 190 (431) T protein:vir:10 147 P------IRLIPMDRGSAK----GRLTSTWQIVYDYT--------------------------TPTGDKIELPAREVFHL 190 (431) T ss_pred e------EEEEEEcCceeE----EEEcCCCeEEEEEE--------------------------eCCceEEEEchhhEEEe Confidence 1 233344443332 23344544321111 11233455777776655 Q ss_pred eecCcCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhc Q lcl|NC_016071. 227 SLGGTESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHA 306 (516) Q Consensus 227 ~~~~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~ 306 (516) ++.. .+.++|.|.+..+....-.-....++...+...-+.+--++..+ ..-++++ .+++++...+... 
T Consensus 191 r~~~-~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~--------~~ls~e~---~~~~~~~~~~~~~ 258 (431) T protein:vir:10 191 RDLS-IDGVSGVSRVKLSGNALELAEQAERAASRTFRTGVMAGGAIEVP--------KELSDNA---YGRMKASVQENHT 258 (431) T ss_pred cCcC-CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEecC--------CCCCHHH---HHHHHHHHHHHhc Confidence 5543 45589999999998777666666676666666444333333322 2223332 3445555554444 Q ss_pred c-cce--EEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHHH-HH Q lcl|NC_016071. 307 G-EQA--YFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQS-IH 382 (516) Q Consensus 307 g-~~a--~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~e-v~ 382 (516) | .++ .+++|.|++.+ ..+-+.....+.+.-++...+|++++.-..--.+..++++++-.+-+.. .- T Consensus 259 g~~n~g~~~vl~~g~~~~----------~l~~~~~d~q~le~r~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~eq~~~~f~ 328 (431) T protein:vir:10 259 GSENAGSWMLLEEGATAK----------QFSNTAASAQQIENRNHQIEEVARMYGVPRPLLMMDDTSWGSGIEQLAIFFI 328 (431) T ss_pred CccccCCceecCCCceEE----------EccCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCCCccccHHHHHHHHH Confidence 4 233 36778886432 2222233344666667778899988877654444333455554443433 33 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcC Q lcl|NC_016071. 383 GHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGG 462 (516) Q Consensus 383 ~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~G 462 (516) ..-+.--++.|++.||+.|++.-- .. ..+.+|.++..-..|+++.++++++++..|+... 
-...+.+|+.+| T Consensus 329 ~~tL~P~~~~ie~~ln~~Ll~~~~------~~-~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~~g-~lT~NE~R~~~g 400 (431) T protein:vir:10 329 QYGLSHWFVSWEQAAARAFLPEKM------LG-QRQFKFNEGALLRGTLNDQAAFFSKALGAGGQSP-WMKQNEVREMLD 400 (431) T ss_pred HHHHHHHHHHHHHHHHhhccChhh------cC-CceEEEechhhhccCHHHHHHHHHHHHhcccccC-ccCHHHHHHHhC Confidence 445777888888889887765321 11 1233444444456789999999999999886211 001368999999 Q ss_pred CCCCCC--cccccCcccccCCCCCCcccccccccCCCCCccccc Q lcl|NC_016071. 463 FDEEIP--EDMSTDELLKLLGQDTSRSGDGMTAGSNGNGTGKIS 504 (516) Q Consensus 463 lp~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 504 (516) +|+-.. .|+....... .+.+++ +..+.++ T Consensus 401 l~p~~~~~gD~~~~p~n~------~~~~~~-------~~~p~~~ 431 (431) T protein:vir:10 401 LPRADDPVADQLRNPMTQ------KQKGSG-------DEPPATT 431 (431) T ss_pred CCCCCCccccceeccccc------ccCCCC-------CCCCCCC Confidence 996533 3332211110 111110 0011000 No 45 >protein:vir:102080 Length: 429 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512313;genbank:gi:89152482;genbank:GeneID:3953073 Probab=99.65 E-value=5.5e-15 Score=98.77 Aligned_cols=419 Identities=12% Similarity=0.050 Sum_probs=213.3 Q ss_pred CC---ccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHH-HHHhhChHHHHHHHHHHH Q lcl|NC_016071. 1 MS---TRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATV-EAMKQDHTVSTALDTKYV 76 (516) Q Consensus 1 ~~---~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y-~~m~~D~~v~s~l~~Rk~ 76 (516) |+ .=+.-.++.. .+.. .....+. .+..+.+. ....+.+. +..++.+.|.+|+..+-. 
T Consensus 1 M~~~~~~f~~~~r~~-----~~~~---~~~~~~~-~~~~~~g~----------~~~~~~v~~~~al~~~~v~~~i~~ia~ 61 (429) T protein:vir:10 1 MDSVKKFFNFEKRQT-----SQVI---ELNKDDE-KLLEWLGI----------SPSTISVKGKNALKVATVFACIKILSE 61 (429) T ss_pred CchhhhhhcccccCc-----cccc---ccCCChH-HHHHHhcC----------CCCcceechhhhhccHHHHHHHHHHHH Confidence 33 2222111111 0110 0010110 11111111 01111122 234568899999999999 Q ss_pred HHhcCCceeeeCCCCCChhhHHHHHHHHHHHh-hccCcCCHHHHHHHHHH-HHhhcceeeeEEEeecccccccccceeec Q lcl|NC_016071. 77 FVTKAFNDFKVLYNRDSKASKDAAEFVEYALK-NLANQQTLRDIARSAAT-FNEYGFSIFEKVYRTESAPSKYAGYITID 154 (516) Q Consensus 77 ~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~-~~~~~~~~~~~l~~~ld-a~~~G~S~~Eivw~~~~~~~~~~g~~~~~ 154 (516) .|.+++|.+.-..+.... +..-.-+...|. +-+...++.++++.++. .+.+|-+++++++...+ . +. T Consensus 62 ~ia~l~~~~~~~~~~~~~--~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G-------~--~~ 130 (429) T protein:vir:10 62 SVSKLPLKIYQEDEYGIQ--RGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKG-------K--VQ 130 (429) T ss_pred hhccCceEEEEecCCcee--eccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCC-------c--EE Confidence 999999987543221111 111112333343 22334567778877664 67899999999875432 2 22 Q ss_pred cccccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCc Q lcl|NC_016071. 155 KIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESN 234 (516) Q Consensus 155 ~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~ 234 (516) .|.+.++.+++ ...++++...... .........+....+|...+|.+++....+. 
T Consensus 131 ~L~~i~~~~v~----v~~~~~~~~~~~~---------------------~~~~~~~~~g~~~~~~~~evih~~~~~~~~~ 185 (429) T protein:vir:10 131 ALWPIDASKVT----VYIDDVGLLNSKT---------------------KMWYVVNTGGQQRVLKPEEILHFKNGITLDG 185 (429) T ss_pred EEEEEcCceeE----EEEcCcccccccc---------------------eEEEEEccCCeEEEEccccEEEecCCCCCCC Confidence 34444444332 2223332111000 0001112233345678888777766666677 Q ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcc---cceE Q lcl|NC_016071. 235 PAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAG---EQAY 311 (516) Q Consensus 235 p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g---~~a~ 311 (516) ++|.|.+..++...-.-....++...+...-+.+--+++.+ ..-+.+.. +++++..+....| .... T Consensus 186 ~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~--------~~l~~e~~---~~~~~~~~~~~~g~~n~~~~ 254 (429) T protein:vir:10 186 LVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYV--------GDLNEDAK---KVFRENFESMSSGLQNSHRI 254 (429) T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcC--------CCCCHHHH---HHHHHHHHHHhccccccCce Confidence 88999999998877776667677777766533332233321 12222222 3344444433333 2245 Q ss_pred EEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHH-HHHHHHHHHHH Q lcl|NC_016071. 312 FILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQ-SIHGHFVQRDI 390 (516) Q Consensus 312 ~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~-ev~~~~~~aDa 390 (516) +++|.|++++. ++. +....++.+..++.-++|++++--..-..+....|+++-.+-+. 
......+.-.+ T Consensus 255 ~vl~~g~~~~~--------l~~--~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~ 324 (429) T protein:vir:10 255 ALMPVGYQFQP--------ISL--NMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQFYTDTLQATL 324 (429) T ss_pred eecCCCceEEE--------ccC--ChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHH Confidence 77888874332 221 22333466666788889999888776444433445665544443 34566688899 Q ss_pred HHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcc Q lcl|NC_016071. 391 DIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPED 470 (516) Q Consensus 391 ~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~ 470 (516) +.|++.||+.|+..--.- ...+.+|.++.....|+++.++++++|++.|++.+ +.+|+.+|+|+-...| T Consensus 325 ~~ie~~ln~kl~~~~~~~------~g~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~-----NE~R~~~gl~p~~ggD 393 (429) T protein:vir:10 325 TMYEQEMTYKLFLDSELD------KGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKP-----NEARSKEDLPPEAGGD 393 (429) T ss_pred HHHHHHHHHhhcChhhcC------CCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCcC Confidence 999999998776532111 11223444445566789999999999999999876 5789999999654444 Q ss_pred cccCcccccCCCCCCcccccccccCCCCCcccccccccchh Q lcl|NC_016071. 471 MSTDELLKLLGQDTSRSGDGMTAGSNGNGTGKISSTRDNSV 511 (516) Q Consensus 471 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~ 511 (516) +......- .+-+.... .-.+.+. .+++ ..+..++.. 
T Consensus 394 ~~~~~~n~-~~~d~~~~-~~~k~g~-~~~~--~~~~~~e~~ 429 (429) T protein:vir:10 394 RLLVNGNM-LPIDMAGQ-AYLKGGD-TNGE--VSKEGNEGN 429 (429) T ss_pred eeeecccc-cchhhccc-cccCCCC-CCCC--CCCCCCCCC Confidence 33221110 01010000 0001111 0111 111111111 No 46 >protein:vir:102118 Length: 409 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699943;genbank:gi:110804051;genbank:GeneID:4206661 Probab=99.65 E-value=8.4e-15 Score=97.76 Aligned_cols=401 Identities=11% Similarity=0.037 Sum_probs=209.0 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhhChHHHHHHHHHHHHHhc Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQDHTVSTALDTKYVFVTK 80 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~~v~~ 80 (516) -.+++...+... +.+. .....+.+.. + .+..+ ..+..++-+.|.+|+..+-..|.+ T Consensus 3 f~~~~~~~~~~~-------~~~~--------~~~~~~~g~~-----~---~~~~v-~~~~al~~~~v~~~i~~ia~~ia~ 58 (409) T protein:vir:10 3 FRKGFKNQSQEI-------SIDD--------KKILEWLGIN-----P---SETYV-NGKSCLKQATVFGCIRILSDNISK 58 (409) T ss_pred ccccccCcCCCC-------CCCh--------HHHHHHhcCC-----c---Cccee-chhhhhccHHHHHHHHHHHHhhhh Confidence 111111111100 0111 0000111110 0 00011 124456788899999999999999 Q ss_pred CCceeeeCCCCCChhhHHHHHHHHHHHh-hccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceeeccccc Q lcl|NC_016071. 81 AFNDFKVLYNRDSKASKDAAEFVEYALK-NLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITIDKIAF 158 (516) Q Consensus 81 ~~w~i~~~~~~d~~~~~~~a~~v~~~l~-~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l~~ 158 (516) ++|++.-..+..... ...-+...|. +-+...++.++++.++ +.+.+|-+++++++...+ . 
+..|.+ T Consensus 59 lp~~~~~~~~~~~~~---~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G-------~--~~~L~~ 126 (409) T protein:vir:10 59 LPIKIYQKKDGIKRV---PDHYLEYLLKLRPNPYMSSSDFWKCIEVQRNIYGNAYVALDFKKNG-------E--IKGLYP 126 (409) T ss_pred CceEEEEecCCeeec---cCchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCC-------c--EEEEEE Confidence 999874322211111 0111233343 3344456778887766 478899999999886543 2 224445 Q ss_pred cCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCccccc Q lcl|NC_016071. 159 RPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPAGV 238 (516) Q Consensus 159 r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~G~ 238 (516) .++.++. ...+++|.... ... -.+......+....+|.+.+|++++. ..+.++|. T Consensus 127 i~~~~V~----v~~~~~~~~~~-~~~-------------------~~y~~~~~~g~~~~~~~~evih~r~~-~~d~~~G~ 181 (409) T protein:vir:10 127 LKSDGMK----IFVDDTGLLNS-ENN-------------------VWYLYTDDLGQRHKFMSDEILHFKGL-TADGLAGL 181 (409) T ss_pred EcCCceE----EEEcCCccccc-cce-------------------EEEEEEeCCceeEEeccccEEEecCc-CCCCcccc Confidence 5554332 23333332110 000 00011122234456787776666554 34568999 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhccc-ce--EEEec Q lcl|NC_016071. 239 SPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGE-QA--YFILP 315 (516) Q Consensus 239 gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~-~a--~~iiP 315 (516) |.+..|+...-.-....++...+....+.+--+++.+ ..-+.+. .+++++.......|. 
++ .++++ T Consensus 182 s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~--------~~l~~e~---~~~~~~~~~~~~~g~~n~~~~~vl~ 250 (409) T protein:vir:10 182 SVIELLNHLIENGKSSETYLNNFFKNGLQVKGLVQYA--------GDLNPEA---EEVFKENFERMSSGLKNAHRIAMLP 250 (409) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcC--------CCCCHHH---HHHHHHHHHHHhccccccCCceecC Confidence 9999999877776666777777766544333233322 2222222 334444444444442 22 46677 Q ss_pred cCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHH-HHHHHHHHHHHHHHHH Q lcl|NC_016071. 316 SDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESK-QSIHGHFVQRDIDIIV 394 (516) Q Consensus 316 ~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh-~ev~~~~~~aDa~~i~ 394 (516) .|++++ ..+-+....++.+..++.-++|++++.-..--.+..+.++++-.+.. ......-++-.++.|+ T Consensus 251 ~g~~~~----------~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~e~~~~~f~~~~l~P~~~~ie 320 (409) T protein:vir:10 251 IGYKFE----------PISQKLVDAQFLENSQLTIRQIASVFGVKMHQLNDLDRATHSNITEQNREFYIDTLQSILNMYE 320 (409) T ss_pred CCceEE----------EccCChhhHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCccccHHHHHHHHHHHHHHHHHHHHH Confidence 776433 22222334457777788889999998876544433334566555433 3445556778888899 Q ss_pred HHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccccC Q lcl|NC_016071. 395 EAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMSTD 474 (516) Q Consensus 395 ~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~~ 474 (516) +.||+.|+..-- + ....+-+|.++.....|++..++++.++++.|++.+ +.+|+.+|+|+-+.+|+... 
T Consensus 321 ~~ln~kL~~~~~-~-----~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~-----NE~R~~lgl~p~~ggD~~~~ 389 (409) T protein:vir:10 321 LEINYKLFLISE-I-----KNGFYSKFNVDTILRADIKTRYESYKEAIQNGFKTP-----NEIRELEEDEPLEGGDVLLI 389 (409) T ss_pred HHHHHhhcCchh-c-----cCCcEEEEechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCcCeeee Confidence 999887653211 0 111122333445556788999999999999999886 57899999997655554321 Q ss_pred cccccCCCCCCcccccccccCCCCC Q lcl|NC_016071. 475 ELLKLLGQDTSRSGDGMTAGSNGNG 499 (516) Q Consensus 475 ~~~~~~~~~~~~~~~~~~~~~~~~~ 499 (516) ...- .+-+.. ++... ..|+- T Consensus 390 ~~n~-~~~~~~--~~~~~--kgGe~ 409 (409) T protein:vir:10 390 NGNM-IPVKMA--GEQYS--KGGEK 409 (409) T ss_pred ccCc-cchhhc--ccccc--ccCCC Confidence 1110 000000 00000 00111 No 47 >protein:vir:99312 Length: 563 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024471;genbank:gi:48696430;genbank:GeneID:2948040 Probab=99.64 E-value=1.6e-14 Score=96.16 Aligned_cols=439 Identities=12% Similarity=0.095 Sum_probs=200.6 Q ss_pred CCccccCcccccchh---------hhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccH-HHHHHHhhChHHHHH Q lcl|NC_016071. 1 MSTRFAQPSEVVKAG---------NENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFL-ATVEAMKQDHTVSTA 70 (516) Q Consensus 1 ~~~r~~~~~~~~~~~---------~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~-~~y~~m~~D~~v~s~ 70 (516) .+-|+.......... ++-+...++.. ...+.+++ ..+...++.+..+ .+.+.+..-+-+.+| T Consensus 29 ~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~------~~~~~~~~--~~~~~~~~~~~~l~~~l~~~~~n~i~~~~ 100 (563) T protein:vir:99 29 LQANIKKIEQDNKEYQDLTKSLYGQQQAYAEPFIE------MMDTNPEF--RDKRSYMKNEHNLHDVLKKFGNNPILNAI 100 (563) T ss_pred hhhhHhhhhccchhHHHHHhhhccCCCcchhhhHh------hhcccccc--cccccCCCCcccHHHHHHHhhcchHHHHH Confidence 111111111100000 00000111000 00000111 1111123333333 223344446788899 Q ss_pred HHHHHHHHhc-----------CCceeeeCCCCCChhhHHHH--HHHHHHHhhcc-----CcCCHHHHHHHHH-HHHhhcc Q lcl|NC_016071. 
71 LDTKYVFVTK-----------AFNDFKVLYNRDSKASKDAA--EFVEYALKNLA-----NQQTLRDIARSAA-TFNEYGF 131 (516) Q Consensus 71 l~~Rk~~v~~-----------~~w~i~~~~~~d~~~~~~~a--~~v~~~l~~~~-----~~~~~~~~l~~~l-da~~~G~ 131 (516) +.+|...|.. +.|.+..........+++.+ ..++..|.... .+.+|.+++..++ +.+.+|. T Consensus 101 I~t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~~lll~Gn 180 (563) T protein:vir:99 101 ILTRSNQVAMYCQPARYSEKGLGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTGKDKDVDRDSFQTFCKKIVRDTYIYDQ 180 (563) T ss_pred HHHHHHHHHHHhhhhhhhcccccceeEEeecCCCcchhhhhhhHHHHHHhhhcCCCCCCCcchHHHHHHHHHHHHHhcCC Confidence 9999887774 33455433221111222222 23344443221 2236778887766 5789999 Q ss_pred eeeeEEEeecccccccccceeeccccccCchhcccccceeecCCCceeeecccccccccccccccccccccccccccccc Q lcl|NC_016071. 132 SIFEKVYRTESAPSKYAGYITIDKIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTS 211 (516) Q Consensus 132 S~~Eivw~~~~~~~~~~g~~~~~~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~ 211 (516) +.+|+++.+.+. ++ +..|.+.++.+++ ...+.+|....... . +..... T Consensus 181 ~~~~~~~~rd~~-G~------~~~L~pl~p~~V~----v~~~~~g~~~~~~~----~-----------------y~~~~~ 228 (563) T protein:vir:99 181 VNFEKVFNKNNK-TK------LEKFIAVDPSTIF----YATDKKGKIIKGGK----R-----------------FVQVVD 228 (563) T ss_pred eEEEEEEEecCC-Cc------eEEEEEeCCceeE----EEECCCCceeccce----e-----------------EEEEeC Confidence 999999876532 22 2334444454443 23344443221100 0 000111 Q ss_pred CCCccccccccEEEEeecCcCC---ccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCH Q lcl|NC_016071. 212 SADEVFIPINKLMVMSLGGTES---NPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKS 288 (516) Q Consensus 212 ~~~~~~iP~~k~i~~~~~~~~g---~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~ 288 (516) +.....++..-.|+|+.....+ .+||.|.+..|......-....++-..|....+.+--++..+. ....+. 
T Consensus 229 g~~~~~~~~~evI~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~ng~~p~giL~~~~------~~~ls~ 302 (563) T protein:vir:99 229 KRVVASFTSRELAMGIRNPRTELSSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRS------DQQQSQ 302 (563) T ss_pred CceeEEecCcceEEEeccCCCCcccCcccchHHHHHHHHHHHHHHHHHHHHHHHHccCCCceEEEeCC------CCCCCH Confidence 1222346677778888765544 6789999999998888777777777777765444332232221 011122 Q ss_pred HHHHHHHHHHHHHHHhhccc-ceE---EEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccc Q lcl|NC_016071. 289 PESEMVQGLMADAANAHAGE-QAY---FILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINL 364 (516) Q Consensus 289 ~~~~~l~~l~~~~~~~~~g~-~a~---~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts 364 (516) + ..+++++.......|. .++ ++++.|++.. ..+-+.....|.+..++.-++|++++.-..--. T Consensus 303 e---~~~~~~~~~~~~~~G~~nagk~~~vl~~G~~~~----------~l~~~~~d~qfle~~~~~~~~Ia~afgVPp~~l 369 (563) T protein:vir:99 303 H---ALENFKREWKSSLSGINGSWQIPVVMADDIKFV----------NMTPTANDMQFEKWLNYLINIISALYGIDPAEI 369 (563) T ss_pred H---HHHHHHHHHHHHhccccccccceEEcCCCceEE----------eccCChhHHHHHHHHHHHHHHHHHHhCCCHHHc Confidence 2 3444555555544442 343 5678886422 222233344577777888899998876543222 Q ss_pred cC---------Cccchh--hHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHH Q lcl|NC_016071. 365 GN---------DGQGSY--NLSESK-QSIHGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDME 432 (516) Q Consensus 365 ~~---------~~~GS~--Al~~vh-~ev~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~ 432 (516) +- ..++|. +-.+.. .......++--++.|+..||+.|++.+ +. . -+|.|... 
|++ T Consensus 370 G~~~~~~~~~~~~~ss~~~sn~e~~~~~f~~~tL~P~l~~ie~~ln~~L~~~~-------~~--~-~~~~f~r~---D~~ 436 (563) T protein:vir:99 370 GFPNRGGATGSKGGSTLNEADPGKKQQQSQNKGLQPLLRFIEDLVNRHIISEY-------GD--K-YTFQFVGG---DTK 436 (563) T ss_pred cccccccccccccccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhc-------cc--c-cEEEeccC---CHH Confidence 11 111222 222222 234455677788899999998887642 11 1 14555544 333 Q ss_pred HHHHH--HHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccccCccc-----ccCCC------------------CCCcc Q lcl|NC_016071. 433 GFSKF--VQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMSTDELL-----KLLGQ------------------DTSRS 487 (516) Q Consensus 433 ~~a~~--~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~~~~~-----~~~~~------------------~~~~~ 487 (516) ..++. +.+++..|++.+ +.+|+.+|+|+-..+|....... ..... ..+++ T Consensus 437 ~~~e~~~~~~~~~~G~lT~-----NE~R~~~gl~Pi~gGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (563) T protein:vir:99 437 SATDKLNILKLETQIFKTV-----NEAREEQGKKPIEGGDIILDASFLQGTAQLQQDKQYNDGKQKERLQMMMSLLEGDN 511 (563) T ss_pred HHHHHHHHHHHhcCCccCH-----HHHHHHhCCCCCCCcceeecccccccccccccccCCCccccchhhhhcccccCCCC Confidence 34443 345788898775 67999999986655554321100 00000 00000 Q ss_pred ccccc--ccCCCCCcccc-----cccccchhhh--------hc--C Q lcl|NC_016071. 488 GDGMT--AGSNGNGTGKI-----SSTRDNSVSN--------MD--N 516 (516) Q Consensus 488 ~~~~~--~~~~~~~~~~~-----~~~~d~~~~~--------~~--~ 516 (516) ++... ...+.+.++++ +.+.++.-+. |. 
+ T Consensus 512 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 557 (563) T protein:vir:99 512 DDSEEGQSTDSSNDDKEIGTDAQIKGDDNVYRTQTSNKGQGRKGEK 557 (563) T ss_pred CCCCCCCCCCCCCCccccccccccccccccccccCccccccccCcC Confidence 00000 00001111111 0111111111 00 0 No 48 >protein:vir:95599 Length: 563 # NCBI annotation: ORF014 # Family: family:all:2446 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240900;genbank:gi:66394963;genbank:GeneID:5132540 Probab=99.64 E-value=1.6e-14 Score=96.16 Aligned_cols=439 Identities=12% Similarity=0.095 Sum_probs=200.6 Q ss_pred CCccccCcccccchh---------hhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccH-HHHHHHhhChHHHHH Q lcl|NC_016071. 1 MSTRFAQPSEVVKAG---------NENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFL-ATVEAMKQDHTVSTA 70 (516) Q Consensus 1 ~~~r~~~~~~~~~~~---------~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~-~~y~~m~~D~~v~s~ 70 (516) .+-|+.......... ++-+...++.. ...+.+++ ..+...++.+..+ .+.+.+..-+-+.+| T Consensus 29 ~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~------~~~~~~~~--~~~~~~~~~~~~l~~~l~~~~~n~i~~~~ 100 (563) T protein:vir:95 29 LQANIKKIEQDNKEYQDLTKSLYGQQQAYAEPFIE------MMDTNPEF--RDKRSYMKNEHNLHDVLKKFGNNPILNAI 100 (563) T ss_pred hhhhHhhhhccchhHHHHHhhhccCCCcchhhhHh------hhcccccc--cccccCCCCcccHHHHHHHhhcchHHHHH Confidence 111111111100000 00000111000 00000111 1111123333333 223344446788899 Q ss_pred HHHHHHHHhc-----------CCceeeeCCCCCChhhHHHH--HHHHHHHhhcc-----CcCCHHHHHHHHH-HHHhhcc Q lcl|NC_016071. 71 LDTKYVFVTK-----------AFNDFKVLYNRDSKASKDAA--EFVEYALKNLA-----NQQTLRDIARSAA-TFNEYGF 131 (516) Q Consensus 71 l~~Rk~~v~~-----------~~w~i~~~~~~d~~~~~~~a--~~v~~~l~~~~-----~~~~~~~~l~~~l-da~~~G~ 131 (516) +.+|...|.. +.|.+..........+++.+ ..++..|.... .+.+|.+++..++ +.+.+|. 
T Consensus 101 I~t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~~lll~Gn 180 (563) T protein:vir:95 101 ILTRSNQVAMYCQPARYSEKGLGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTGKDKDVDRDSFQTFCKKIVRDTYIYDQ 180 (563) T ss_pred HHHHHHHHHHHhhhhhhhcccccceeEEeecCCCcchhhhhhhHHHHHHhhhcCCCCCCCcchHHHHHHHHHHHHHhcCC Confidence 9999887774 33455433221111222222 23344443221 2236778887766 5789999 Q ss_pred eeeeEEEeecccccccccceeeccccccCchhcccccceeecCCCceeeecccccccccccccccccccccccccccccc Q lcl|NC_016071. 132 SIFEKVYRTESAPSKYAGYITIDKIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTS 211 (516) Q Consensus 132 S~~Eivw~~~~~~~~~~g~~~~~~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~ 211 (516) +.+|+++.+.+. ++ +..|.+.++.+++ ...+.+|....... . +..... T Consensus 181 ~~~~~~~~rd~~-G~------~~~L~pl~p~~V~----v~~~~~g~~~~~~~----~-----------------y~~~~~ 228 (563) T protein:vir:95 181 VNFEKVFNKNNK-TK------LEKFIAVDPSTIF----YATDKKGKIIKGGK----R-----------------FVQVVD 228 (563) T ss_pred eEEEEEEEecCC-Cc------eEEEEEeCCceeE----EEECCCCceeccce----e-----------------EEEEeC Confidence 999999876532 22 2334444454443 23344443221100 0 000111 Q ss_pred CCCccccccccEEEEeecCcCC---ccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCH Q lcl|NC_016071. 212 SADEVFIPINKLMVMSLGGTES---NPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKS 288 (516) Q Consensus 212 ~~~~~~iP~~k~i~~~~~~~~g---~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~ 288 (516) +.....++..-.|+|+.....+ .+||.|.+..|......-....++-..|....+.+--++..+. ....+. 
T Consensus 229 g~~~~~~~~~evI~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~ng~~p~giL~~~~------~~~ls~ 302 (563) T protein:vir:95 229 KRVVASFTSRELAMGIRNPRTELSSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRS------DQQQSQ 302 (563) T ss_pred CceeEEecCcceEEEeccCCCCcccCcccchHHHHHHHHHHHHHHHHHHHHHHHHccCCCceEEEeCC------CCCCCH Confidence 1222346677778888765544 6789999999998888777777777777765444332232221 011122 Q ss_pred HHHHHHHHHHHHHHHhhccc-ceE---EEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccc Q lcl|NC_016071. 289 PESEMVQGLMADAANAHAGE-QAY---FILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINL 364 (516) Q Consensus 289 ~~~~~l~~l~~~~~~~~~g~-~a~---~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts 364 (516) + ..+++++.......|. .++ ++++.|++.. ..+-+.....|.+..++.-++|++++.-..--. T Consensus 303 e---~~~~~~~~~~~~~~G~~nagk~~~vl~~G~~~~----------~l~~~~~d~qfle~~~~~~~~Ia~afgVPp~~l 369 (563) T protein:vir:95 303 H---ALENFKREWKSSLSGINGSWQIPVVMADDIKFV----------NMTPTANDMQFEKWLNYLINIISALYGIDPAEI 369 (563) T ss_pred H---HHHHHHHHHHHHhccccccccceEEcCCCceEE----------eccCChhHHHHHHHHHHHHHHHHHHhCCCHHHc Confidence 2 3444555555544442 343 5678886422 222233344577777888899998876543222 Q ss_pred cC---------Cccchh--hHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHH Q lcl|NC_016071. 365 GN---------DGQGSY--NLSESK-QSIHGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDME 432 (516) Q Consensus 365 ~~---------~~~GS~--Al~~vh-~ev~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~ 432 (516) +- ..++|. +-.+.. .......++--++.|+..||+.|++.+ +. . -+|.|... 
|++ T Consensus 370 G~~~~~~~~~~~~~ss~~~sn~e~~~~~f~~~tL~P~l~~ie~~ln~~L~~~~-------~~--~-~~~~f~r~---D~~ 436 (563) T protein:vir:95 370 GFPNRGGATGSKGGSTLNEADPGKKQQQSQNKGLQPLLRFIEDLVNRHIISEY-------GD--K-YTFQFVGG---DTK 436 (563) T ss_pred cccccccccccccccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhc-------cc--c-cEEEeccC---CHH Confidence 11 111222 222222 234455677788899999998887642 11 1 14555544 333 Q ss_pred HHHHH--HHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccccCccc-----ccCCC------------------CCCcc Q lcl|NC_016071. 433 GFSKF--VQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMSTDELL-----KLLGQ------------------DTSRS 487 (516) Q Consensus 433 ~~a~~--~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~~~~~-----~~~~~------------------~~~~~ 487 (516) ..++. +.+++..|++.+ +.+|+.+|+|+-..+|....... ..... ..+++ T Consensus 437 ~~~e~~~~~~~~~~G~lT~-----NE~R~~~gl~Pi~gGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (563) T protein:vir:95 437 SATDKLNILKLETQIFKTV-----NEAREEQGKKPIEGGDIILDASFLQGTAQLQQDKQYNDGKQKERLQMMMSLLEGDN 511 (563) T ss_pred HHHHHHHHHHHhcCCccCH-----HHHHHHhCCCCCCCcceeecccccccccccccccCCCccccchhhhhcccccCCCC Confidence 34443 345788898775 67999999986655554321100 00000 00000 Q ss_pred ccccc--ccCCCCCcccc-----cccccchhhh--------hc--C Q lcl|NC_016071. 488 GDGMT--AGSNGNGTGKI-----SSTRDNSVSN--------MD--N 516 (516) Q Consensus 488 ~~~~~--~~~~~~~~~~~-----~~~~d~~~~~--------~~--~ 516 (516) ++... ...+.+.++++ +.+.++.-+. |. 
+ T Consensus 512 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 557 (563) T protein:vir:95 512 DDSEEGQSTDSSNDDKEIGTDAQIKGDDNVYRTQTSNKGQGRKGEK 557 (563) T ss_pred CCCCCCCCCCCCCCccccccccccccccccccccCccccccccCcC Confidence 00000 00001111111 0111111111 00 0 No 49 >protein:vir:10362 Length: 432 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858954;genbank:gi:32128419;genbank:GeneID:2648396 Probab=99.64 E-value=1.4e-14 Score=96.50 Aligned_cols=414 Identities=11% Similarity=0.093 Sum_probs=200.6 Q ss_pred CCcc----ccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhhChHHHHHHHHHHH Q lcl|NC_016071. 1 MSTR----FAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQDHTVSTALDTKYV 76 (516) Q Consensus 1 ~~~r----~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~ 76 (516) |=.| +..+.++.-.+. .+..+.. ..+ ...... . -..+..+. -+..++-+.|.+|+..+-. T Consensus 9 ~~~~~~~~~~~~~~~~~~~~--~~~~~~~--~~~----~~~~~~-----~--s~~g~~v~-~~~al~~~~V~~~i~~Ia~ 72 (432) T protein:vir:10 9 LLGQLKAMFVPPDPVDIGGG--QTFTPVN--ATA----RDLGII-----I--SDTGAAVN-ADAIMRLDAVAACVKLVSQ 72 (432) T ss_pred hhhhhHhhcCCccccccccc--cccccCc--chh----hhhccc-----c--cccCcccc-hhhhhcchHHHHHHHHHHH Confidence 1111 111111111010 0111100 000 000000 0 00011111 1345578999999999999 Q ss_pred HHhcCCceeeeCCCCCChhhHHHHHHHHHHHh-hccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceeec Q lcl|NC_016071. 77 FVTKAFNDFKVLYNRDSKASKDAAEFVEYALK-NLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITID 154 (516) Q Consensus 77 ~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~-~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~~ 154 (516) .|.+++|.+.-... +. ..+..-.-+...|. +-+...++.+++..++ +.+.+|.+++++++. ++ ++ . 
T Consensus 73 ~ia~lp~~~y~~~~-~g-~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~-~g---~~------~ 140 (432) T protein:vir:10 73 AIAAMPLTMYMRTP-DG-RKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT-DG---RI------E 140 (432) T ss_pred hhhhCceeEEEecC-CC-cccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec-CC---cE------E Confidence 99999998743321 11 11111111233333 2333456777877765 678899999999874 21 12 2 Q ss_pred cccccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCc Q lcl|NC_016071. 155 KIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESN 234 (516) Q Consensus 155 ~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~ 234 (516) .|.+.++..++ ...+.+|+...... ...+..+.+|.+.++.+++.+.. . T Consensus 141 ~L~~l~~~~v~----v~~~~~g~~~y~~~--------------------------~~~g~~~~~~~~~iih~~~~~~d-g 189 (432) T protein:vir:10 141 SLQYLANDRLT----ITTDTKGNTAYRYR--------------------------RTDGQMIDIPKQQIWKIMGYSLD-G 189 (432) T ss_pred EEEEEcCCceE----EEEcCCCcEEEEEE--------------------------ecCceEEEEcCccEEEecCCCCC-C Confidence 33344443332 23345554322111 11223456777777666655444 4 Q ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEe Q lcl|NC_016071. 235 PAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFIL 314 (516) Q Consensus 235 p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~ii 314 (516) .+|.|.+..+....-.-....++-..|...-+.+--+++. +...+.+..+ ++++..+... 
+....+++ T Consensus 190 ~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~--------~~~l~~e~~~---~~~~~~~~~~-nag~~~vl 257 (432) T protein:vir:10 190 ENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQI--------DRFLTDDQYD---SFAKKVSGSV-EAGRAPLL 257 (432) T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEec--------CCCCCHHHHH---HHHHHHhhhh-hCCCceec Confidence 7899999999887666556666666666543333222222 2223333333 3333333221 11234677 Q ss_pred ccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHH----HHHHHHHHHHH Q lcl|NC_016071. 315 PSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQ----SIHGHFVQRDI 390 (516) Q Consensus 315 P~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~----ev~~~~~~aDa 390 (516) |.|++++. .+-+....+|.+..++...+|++++.-..--.+....|+++.++..+ ..-..-+.-.+ T Consensus 258 ~~g~~~~~----------l~~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~~~~~t~~~~sn~e~~~~~f~~~tl~P~~ 327 (432) T protein:vir:10 258 EGGMDVKS----------LGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQLGFLSMTLSPWL 327 (432) T ss_pred CCCceEEE----------ccCChHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCcccccchHHHHHHHHHHHHHHHHH Confidence 88875332 11122334566777889999999887655444333334443332222 23334566777 Q ss_pred HHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcc Q lcl|NC_016071. 391 DIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPED 470 (516) Q Consensus 391 ~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~ 470 (516) +.|+..||+.|+.+-- . 
..-+.+|..+..-..|.++.++++.++++.|++.+ +.+|+.+|+|+-.+++ T Consensus 328 ~~ie~~ln~kL~~~~~------~-~~~~~~fd~~~ll~~d~~~r~~~~~~~~~~G~~T~-----NE~R~~~glppi~g~~ 395 (432) T protein:vir:10 328 RRIEQSIALNLLSPAE------R-RRYFADFDTSALLRADSAARSSYYSQLVNNGLMTR-----DEAREIEGLPKLGGNA 395 (432) T ss_pred HHHHHHHHhhhcCccc------c-CceEEEeechhhhccCHHHHHHHHHHHHhCCCCCH-----HHHHHHhCCCCCCCCc Confidence 8888888876655321 1 11222333334445788999999999999998876 6899999999765443 Q ss_pred cccCcccccCCCCCCcccccccccCCCCCcccccccccchhhh Q lcl|NC_016071. 471 MSTDELLKLLGQDTSRSGDGMTAGSNGNGTGKISSTRDNSVSN 513 (516) Q Consensus 471 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~ 513 (516) ..........+-.... ... ...++++++.. ..++.+. T Consensus 396 ~~~~~~~~~~pl~~~~-~~~--~~~~~~~~~~~---~~~~~~~ 432 (432) T protein:vir:10 396 AVLTVQSAMVPLDSIG-LQA--SPEPASGLGNQ---QQDKVSK 432 (432) T ss_pred ceEeecCcccchhhhc-ccC--CCCCCCCCCCc---ccccccC Confidence 3322111111100000 000 00001111110 0111111 No 50 >protein:vir:80796 Length: 574 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504121;genbank:gi:158079308;genbank:GeneID:5666445 Probab=99.64 E-value=1.4e-14 Score=96.47 Aligned_cols=439 Identities=12% Similarity=0.093 Sum_probs=197.6 Q ss_pred CCccccCcccccchhhhcccCCCCcccccc--hHH-----HHHHHHHHHhhc----ccccCCcccH-HHHHHHhhChHHH Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELG--SGA-----LSQLRAESEVMK----VEELRWPCFL-ATVEAMKQDHTVS 68 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g--~~~-----~~~~~~~~~~~~----~~~lr~~~~~-~~y~~m~~D~~v~ 68 (516) |.++..+..-+.... .+...+- --+ .+...+..+.-. .|..+.+..+ .+.......+.|. 
T Consensus 27 ~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~iv~ 98 (574) T protein:vir:80 27 MHLREIDTNVVNNEP--------YSMESIEKGMNGKTTAYMQPIIGEMSVNPGYKTKPSIRNSQDLHKTLKKFGNNIILN 98 (574) T ss_pred cccchhhhhhhhccC--------CCHHHHHHhHhhhcccccchhhhhccccccccCcCccCCcccHHHHHHhhccChhHH Confidence 666655443332100 0000000 000 000000000000 1112222222 1222222356677 Q ss_pred HHHHHHHHHHh-----------cCCceeeeCCCCCChhhHHHH--HHHHHHHhhcc-----CcCCHHHHHHHHH-HHHhh Q lcl|NC_016071. 69 TALDTKYVFVT-----------KAFNDFKVLYNRDSKASKDAA--EFVEYALKNLA-----NQQTLRDIARSAA-TFNEY 129 (516) Q Consensus 69 s~l~~Rk~~v~-----------~~~w~i~~~~~~d~~~~~~~a--~~v~~~l~~~~-----~~~~~~~~l~~~l-da~~~ 129 (516) .|+..|+..|. +++|.|...........++.+ ..+...|++.. ...+|.+++..++ +.+.+ T Consensus 99 ~~i~~~~~~V~~~~~~i~~~ia~lp~~i~~kd~~~~~~~~~~~~~~~l~~ll~~~~~~~nP~~~s~~ef~~~lv~~lll~ 178 (574) T protein:vir:80 99 AIINTRSNQVSMYCKPARNSETGVGYEIRLKDIEAEPTSHDIANIKRIESFLENTAQFRDPNRDNFTTFCKKLVRATYMY 178 (574) T ss_pred HHHHHHHHHHHHHHHHHHhhhccCceEEEEeccCCCccchhhhhhhHHHHHHhccCCCCCCccccHHHHHHHHHHHHHhc Confidence 77777776554 688887644322111112222 22334444321 2246778887766 46789 Q ss_pred cceeeeEEEeecccccccccceeeccccccCchhcccccceeecCCCceeeecccccccccccccccccccccccccccc Q lcl|NC_016071. 130 GFSIFEKVYRTESAPSKYAGYITIDKIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNL 209 (516) Q Consensus 130 G~S~~Eivw~~~~~~~~~~g~~~~~~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~ 209 (516) |.+++|+++...+ ++. .|.+.++.++. ...+.++... ..... +... T Consensus 179 Gnayi~i~r~~~G---~~~------~L~pl~p~~V~----v~~d~~~~~~----~~~~~-----------------y~~~ 224 (574) T protein:vir:80 179 DQVNFEKVFDKDG---NFI------KFDTVDPTTIF----LATNGEGKLI----KNGER-----------------FVQV 224 (574) T ss_pred CCeEEEEEECCCC---cEE------EEEEEcCceeE----EEEcCccccc----cCceE-----------------EEEE Confidence 9999999986543 222 33444444332 1112221100 00000 0011 Q ss_pred ccCCCccccccccEEEEeecCcCC---ccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCC Q lcl|NC_016071. 
210 TSSADEVFIPINKLMVMSLGGTES---NPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDP 286 (516) Q Consensus 210 ~~~~~~~~iP~~k~i~~~~~~~~g---~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~ 286 (516) ..+.....+|...+|++++...++ .+||.|.+..+....-.-....++-..|...-+.+=-++..+ ..... T Consensus 225 ~~g~~~~~~~~~eiih~~~~~~~~~~~~~~G~spi~~a~~~i~~~~~a~~~~~~~f~ng~~p~gil~~~------~~~~l 298 (574) T protein:vir:80 225 IDNRIVAKFNERELAFAVRNPRADIEVGQYGYPELEIALKQFIAHENTEVFNDRFFSHGGTTRGILHVK------TGQQQ 298 (574) T ss_pred eCCceEEEEccccEEEEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeC------CCCCC Confidence 112223456777777777665543 468999999988877777777777777766433322222211 11111 Q ss_pred CHHHHHHHHHHHHHHHHhhcc-cceE---EEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccc Q lcl|NC_016071. 287 KSPESEMVQGLMADAANAHAG-EQAY---FILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFI 362 (516) Q Consensus 287 ~~~~~~~l~~l~~~~~~~~~g-~~a~---~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtL 362 (516) +.+ .++++++.......| ..++ ++++.|++ +...+.+.....|.+..++..++|+.++.-..- T Consensus 299 s~e---~~~~lk~~~~~~~~G~~n~g~~~vl~~~G~~----------~~~l~~s~~D~qfle~~~~~~~~Ia~afgVPp~ 365 (574) T protein:vir:80 299 SQQ---ALDIFRREWRSSLAGINGSWQIPVVSAEDVK----------FVNMTPSANDMQFEKWLNYLINVISALYGIDPA 365 (574) T ss_pred CHH---HHHHHHHHHHHHhccccccccceeecCCCce----------EEEccCChhHHHHHHHHHHHHHHHHHHhCCCHH Confidence 222 334455555444334 3443 23356643 333333334445677778888999998865432 Q ss_pred ccc--------CCcc--chhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhH Q lcl|NC_016071. 363 NLG--------NDGQ--GSYNLSESKQS-IHGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDM 431 (516) Q Consensus 363 ts~--------~~~~--GS~Al~~vh~e-v~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl 431 (516) -.+ ..++ .++|-.+.... .....++-.++.|+..||+.|++.+ .. .+ +|.|+..+..+. 
T Consensus 366 ~lG~~~~~t~~gs~~~~~n~sn~E~~~~~f~~~tL~P~~~~ie~~ln~~Ll~~~-------~~--~~-~~~f~~~d~~~~ 435 (574) T protein:vir:80 366 EINFPNNGGATGSKGGSLNEGNSKEKMQASQNKGLQPLLRFIEDTVNTYIVAEF-------GE--KY-QFQFRGGDLSAQ 435 (574) T ss_pred HhcccccccccccccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhc-------CC--ce-EEEecccchhhH Confidence 211 1111 12455554443 4445688899999999999887632 11 11 456665444433 Q ss_pred HHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccccCcc-----cccCCCCC-------Cccccccc-cc-CCC Q lcl|NC_016071. 432 EGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMSTDEL-----LKLLGQDT-------SRSGDGMT-AG-SNG 497 (516) Q Consensus 432 ~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~~~~-----~~~~~~~~-------~~~~~~~~-~~-~~~ 497 (516) .... .+.+++..|++.+ +.+|+.+|+|+-..+|+..... ..+..... ....+... .+ .+. T Consensus 436 ~~~~-~~~~~~~~G~lT~-----NE~R~~lgl~Pi~gGD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 509 (574) T protein:vir:80 436 LDKL-KIIEQEGKVFRTV-----NEIRHDKGLEPIKGGDVILNGVHIQAIGQALQEEQLEYQRSQDRLNRLLELSGGDVE 509 (574) T ss_pred HHHH-HHHHHHhCCccCH-----HHHHHHhCCCCCCCCCEeeeccceeecccccccccCCccchhccccccccccCCCCC Confidence 3333 2345677898775 6899999999765555442110 00000000 00000000 00 000 Q ss_pred CCcccccc--cccchh--hhhc---C Q lcl|NC_016071. 498 NGTGKISS--TRDNSV--SNMD---N 516 (516) Q Consensus 498 ~~~~~~~~--~~d~~~--~~~~---~ 516 (516) ......++ .-|.+. .+.- + T Consensus 510 ~~~~~~p~~~~~d~~~~~~~~~~~~~ 535 (574) T protein:vir:80 510 QPEPEEPKDSQNDTDVSFQDEQQGLN 535 (574) T ss_pred CCCCCCCCCccccccchhhhhhhhhc Confidence 00000000 000000 0000 0 No 51 >protein:vir:9359 Length: 348 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803337;genbank:gi:29028648;genbank:GeneID:1258089 Probab=99.64 E-value=2.7e-15 Score=100.48 Aligned_cols=342 Identities=11% Similarity=0.028 Sum_probs=184.9 Q ss_pred HhcCCceeeeCCCCCChhhHHHHHHHHHHHh-hccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceeecc Q lcl|NC_016071. 
78 VTKAFNDFKVLYNRDSKASKDAAEFVEYALK-NLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITIDK 155 (516) Q Consensus 78 v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~-~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~~~ 155 (516) |.++++.+.-. +...+..+ ...|. +-+...++.++++.++ +.+.+|-+++.+++...+ . +.. T Consensus 1 ia~lp~~~~~~---~~~~~~~l----~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G-------~--~~~ 64 (348) T protein:vir:93 1 MASLPLKMYED---YKVVNTEV----SDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYH-------Q--PSK 64 (348) T ss_pred CcccceEeEec---CcCcccHH----HHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCC-------c--EEE Confidence 88899887422 12222223 33444 3344566778887766 678899999998875432 2 234 Q ss_pred ccccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCcc Q lcl|NC_016071. 156 IAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNP 235 (516) Q Consensus 156 l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p 235 (516) |.+.++..++ ...+.+++.+.. + .....+..+.+|.+.++.+++....+.. T Consensus 65 L~~l~~~~v~----~~~~~~~~~~~y-~------------------------~~~~~g~~~~~~~~eiih~r~~~~~~~~ 115 (348) T protein:vir:93 65 LFLLNPDVVE----MLIENQSRELYY-S------------------------IHAATGNKLIVHNMDMLHFKHIVASNMV 115 (348) T ss_pred EEEEcCCceE----EEEeCCCcEEEE-E------------------------EEcCCCeEEEEccccEEEecCCCCCCce Confidence 4455544332 233444432211 0 0011223445777776666665555678 Q ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEec Q lcl|NC_016071. 236 AGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILP 315 (516) Q Consensus 236 ~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP 315 (516) +|.|.+..+....-.-... .-|. +..++. ++..+.+....-+.++.+. 
+++.......+....+++| T Consensus 116 ~G~s~~~~~~~~i~~~~~~-~~~~--~~~~~~-------~~~~i~~~~~~l~~e~~~~---~~~~~~~~~~n~~~~~vl~ 182 (348) T protein:vir:93 116 QGISPIDVLKNTTDFDNAV-RTFN--LTEMQK-------PDSFMLKYGSNVSTEKRQQ---VLEDFKQYYEENGGILFQE 182 (348) T ss_pred eeccHHHHHHHHHHHHHHH-HHHH--HHhcCC-------CceeEEecCCCCCHHHHHH---HHHHHHHHhhcCCCeeecC Confidence 8999988876544333222 2222 112222 2222223333333333333 3333333333444456778 Q ss_pred cCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHHH-HHHHHHHHHHHHHH Q lcl|NC_016071. 316 SDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQS-IHGHFVQRDIDIIV 394 (516) Q Consensus 316 ~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~e-v~~~~~~aDa~~i~ 394 (516) .|++++ ..+-+....+|.+..++...+|++++.-...-.+..+.++++-.+-+.. ....-+.-.++.|+ T Consensus 183 ~g~~~~----------~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~~~~e~~~~~~~~~~l~P~~~~ie 252 (348) T protein:vir:93 183 PGVEIE----------PLPKKYVSEDIVASENLTRERVANVFQLPSIFLNARSNTNFAKNEELNRFYLQHTLLPIVKQYE 252 (348) T ss_pred CCceEE----------EcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHH Confidence 886433 2222334446778888899999999888765555455567776665543 34556788888999 Q ss_pred HHHHHHHHHHHHHhcCCcCCccccceEEec--CcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccc Q lcl|NC_016071. 395 EAFNKNLIPQLLALNDIRLSDEDMPKLKPG--LIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMS 472 (516) Q Consensus 395 ~~ln~~li~~lv~lN~~~~~~~~~P~~~~~--~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~ 472 (516) +.||+.|++.. +.. .--+|+|+ .....|.++.++++.+|++.|++.+ +.+|+.+|+|+-..+|+. 
T Consensus 253 ~~l~~~l~~~~-~~~-------~g~~i~fd~~~l~~~d~~~~a~~~~~~~~~G~~T~-----NE~R~~~g~~p~~ggD~~ 319 (348) T protein:vir:93 253 EEFNRKLLTKT-DRE-------KNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTI-----NDIREWEDLPPVEGGDKP 319 (348) T ss_pred HHHHHhhCCcc-ccc-------CcceEEeechhhhccCHHHHHHHHHHHHhCCCCCH-----HHHHHHhCCCCCCCcCeE Confidence 99998776542 111 11235554 4445688999999999999999876 679999999865445443 Q ss_pred cCcccccCCCCCCccccc-ccccCCCCCcc Q lcl|NC_016071. 473 TDELLKLLGQDTSRSGDG-MTAGSNGNGTG 501 (516) Q Consensus 473 ~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~ 501 (516) .-... -.+-+....... .+.|...+.++ T Consensus 320 ~~~~n-~~~~~~~~~~~~~~~gg~~n~~~~ 348 (348) T protein:vir:93 320 LISGD-LYPIDTPLELRKSLKGGDKNVNES 348 (348) T ss_pred eeccc-ccccccchhhcccccCCCCCcCCC Confidence 21111 111111111111 11111111111 No 52 >protein:vir:2683 Length: 412 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075502;genbank:gi:12719431;genbank:GeneID:920150 Probab=99.63 E-value=8.2e-15 Score=97.80 Aligned_cols=406 Identities=11% Similarity=0.047 Sum_probs=203.9 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhhChHHHHHHHHHHHHHhc Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQDHTVSTALDTKYVFVTK 80 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~~v~~ 80 (516) |+-=.+ .+.+++..+ +...+.....+ . ++..+..+.. 
+....+ ..+..++-+.|.+|+..+-..|.+ T Consensus 1 m~~~~~-~~~~~~~~~--~~~~~~~~~~~-~-~~~~~~~~~~-------~~~~~v-~~~~a~~~~~v~~~i~~ia~~iA~ 67 (412) T protein:vir:26 1 MNVIAK-ENIVTRIKK--KLIDNWIDQST-S-KLYDFSPWKN-------RSFWGV-INNTLETNETIFSAITKLSNSMAS 67 (412) T ss_pred Cccchh-hhhhhhhhh--hHhhhhhcccc-c-ccccccccCC-------cccccc-chhhhhccHHHHHHHHHHHHhHhh Confidence 442211 000110000 00000000000 0 0111111110 000111 124455788999999999999999 Q ss_pred CCceeeeCCCCCChhhHHHHHHHHHHHh-hccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceeeccccc Q lcl|NC_016071. 81 AFNDFKVLYNRDSKASKDAAEFVEYALK-NLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITIDKIAF 158 (516) Q Consensus 81 ~~w~i~~~~~~d~~~~~~~a~~v~~~l~-~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l~~ 158 (516) ++|.+.-.. +..+..+++ .|. +-+...++.++++.++ +.+.+|-++.+++.... |. +..|.+ T Consensus 68 lp~~~~~~~---~~~~~~~~~----lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~-------G~--~~~L~~ 131 (412) T protein:vir:26 68 LPLKMYEDY---KVVNTEVSD----LLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIY-------HQ--PSKLFL 131 (412) T ss_pred CceeEeecc---ccccchHHH----HHHhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCC-------Cc--EEEEEE Confidence 998764221 122223333 333 2333456777777655 57889999998876543 22 223444 Q ss_pred cCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCccccc Q lcl|NC_016071. 159 RPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPAGV 238 (516) Q Consensus 159 r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~G~ 238 (516) -++..++ ...+.+++.+...- ....+..+.+|.+..+++++....+..+|. 
T Consensus 132 l~~~~v~----v~~~~~~~~~~y~~-------------------------~~~~g~~~~~~~~evih~~~~~~~~~~~G~ 182 (412) T protein:vir:26 132 LNPDVVE----MLIENQSRELYYSI-------------------------HAATGNKLIVHNMDMLHFKHIVASNMVQGI 182 (412) T ss_pred EcCceeE----EEEeCCCcEEEEEE-------------------------EcCCceEEEEccccEEEeCCCCCCCCcccc Confidence 4443332 23344443222100 011223345677776666665556778899 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEeccCc Q lcl|NC_016071. 239 SPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILPSDM 318 (516) Q Consensus 239 gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~ 318 (516) |.+..+....-......+ |. +..++. ++..+-+.....+++.. +++++...+...+....++++.|+ T Consensus 183 s~i~~~~~~i~~~~a~~~-~~--~~~~~~-------~~~~i~~~~~~l~~e~~---~~~~~~~~~~~~~~g~~~vl~~g~ 249 (412) T protein:vir:26 183 SPIDVLKNTTDFDNAVRT-FN--LTEMQK-------PDSFMLKYGSNVGKEKR---QQVLEDFKQYYEENGGILFQEPGV 249 (412) T ss_pred cHHHHHHHHHHHHHHHHH-HH--HHhcCC-------CCceEEecCCCCCHHHH---HHHHHHHHHHhhcCCCeeecCCCc Confidence 998887654444433322 22 122222 12222223333333333 333333333333444466778886 Q ss_pred ccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHHHHH-HHHHHHHHHHHHHHH Q lcl|NC_016071. 
319 NAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQSIH-GHFVQRDIDIIVEAF 397 (516) Q Consensus 319 ~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~ev~-~~~~~aDa~~i~~~l 397 (516) +++ ..+-+....+|.+..++...+|++++.-...-.+..+.++++-.+.+...+ ..-+.--++.|++.| T Consensus 250 ~~~----------~l~~~~~d~q~~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~l 319 (412) T protein:vir:26 250 EIE----------PLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNRFYLQHTLLPIVKQYEEEF 319 (412) T ss_pred eEE----------EcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHH Confidence 432 222223344566777788899999988876555545556777777665444 455888889999999 Q ss_pred HHHHHHHHHHhcCCcCCccccceEEec--CcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccccCc Q lcl|NC_016071. 398 NKNLIPQLLALNDIRLSDEDMPKLKPG--LIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMSTDE 475 (516) Q Consensus 398 n~~li~~lv~lN~~~~~~~~~P~~~~~--~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~~~ 475 (516) |+.|+... . ..... +|+|+ .....|.++.+++++++++.|++.+ +.+|+.+|+|+-+.+|+..-. T Consensus 320 n~kLl~~~---~---~~~~~--~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~-----NE~R~~~gl~p~~ggD~~~~~ 386 (412) T protein:vir:26 320 NRKLLTKT---D---REKNR--YFKFNVKSYLRADSATQAEVYFKAVRSGYYTI-----NDIREWEDLPPVEGGDKPLIS 386 (412) T ss_pred HhhcCCcc---c---ccCcc--eEEeechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCcCeeeec Confidence 98776532 0 01112 35554 4456789999999999999999876 679999999976555543311 Q ss_pred ccccCCCCCCcccc-cccccCCCCCcccc Q lcl|NC_016071. 476 LLKLLGQDTSRSGD-GMTAGSNGNGTGKI 503 (516) Q Consensus 476 ~~~~~~~~~~~~~~-~~~~~~~~~~~~~~ 503 (516) .. -.+-+...... ..+.|. +...+. 
T Consensus 387 ~n-~~~~~~~~~~~~~~~gG~--~n~~e~ 412 (412) T protein:vir:26 387 GD-LYPIDTPLELRKSLKGGD--KNVNES 412 (412) T ss_pred cc-ccccccchhhcccccCCC--CCcCCC Confidence 11 01111111100 011110 011111 No 53 >protein:vir:98396 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918929;genbank:gi:119443691;genbank:GeneID:4594558 Probab=99.63 E-value=2.5e-14 Score=95.11 Aligned_cols=420 Identities=12% Similarity=0.019 Sum_probs=204.4 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHH--HHHhhChHHHHHHHHHHHHH Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATV--EAMKQDHTVSTALDTKYVFV 78 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y--~~m~~D~~v~s~l~~Rk~~v 78 (516) -+.+..+.......-...+. -|..+....+.. .+.... +..+ ...+..| +..++-+.|.+|+..+-..| T Consensus 14 ~~~~~~~~~~~~~~~f~~~e---~r~~~~~~~~~~---~~~~~~--~~~~-~~~~~~~~~~~al~~~~V~acv~~Ia~~i 84 (441) T protein:vir:98 14 KSRKQSRKELVVVGIFYKNE---KRDLQYNEDDLQ---MMVQTL--PGFQ-GTKLRQYKDIEAIRHSDIFTAVMMIASDL 84 (441) T ss_pred ccccchhhhhhccccccccc---cccccCCCcchH---HHHHHh--hccc-ccCccccchhhhhccHHHHHHHHHHHHhh Confidence 22222222222110000000 000010011111 111111 1111 1112222 23456888999999999999 Q ss_pred hcCCceeeeCCCCCChhhHHHHHHHHHHHh-hccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceeeccc Q lcl|NC_016071. 79 TKAFNDFKVLYNRDSKASKDAAEFVEYALK-NLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITIDKI 156 (516) Q Consensus 79 ~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~-~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l 156 (516) .++++++.- +.....+..+. ..|. +-+...+..+++..+. +.+.+|.+++++++...+ . 
+..| T Consensus 85 A~lpl~~~~--~~~~~~~~~~~----~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~G-------~--~~~L 149 (441) T protein:vir:98 85 ARMPIRVTV--NGQINYSDRIV----NLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTG-------E--PMNL 149 (441) T ss_pred ccCceEEec--CCcccccchHH----HHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCC-------c--EEEE Confidence 999988752 22211222222 2333 2233345667776655 578899999999886432 2 2344 Q ss_pred cccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCccc Q lcl|NC_016071. 157 AFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPA 236 (516) Q Consensus 157 ~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~ 236 (516) .+.++.+++ ...+.+|+......... .........+|...+|.+++.+.. ..+ T Consensus 150 ~~i~~~~v~----v~~~~~g~~~~~~~~~~----------------------~~~~~~~~~~~~~dviHir~~~~d-g~~ 202 (441) T protein:vir:98 150 TFRKTSEIE----LKLDARGRLYYFHQRID----------------------SNGNNIERNVKFEDMLDIKFYSLD-GIN 202 (441) T ss_pred EEEcCceeE----EEECCCCcEEEEEEEec----------------------cCcceeeEEEccccEEEeccCCCC-Ccc Confidence 455554432 34566665433211100 001112234677777766665444 478 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcc-cce--EEE Q lcl|NC_016071. 
237 GVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAG-EQA--YFI 313 (516) Q Consensus 237 G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g-~~a--~~i 313 (516) |.|.+..+....-.-....++...+...-+.+=-+++.+ ..-.+++ ..+++++.......| ..+ .++ T Consensus 203 G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~-------~~~~~~e---~~~~~~~~~~~~~~G~~nag~~~v 272 (441) T protein:vir:98 203 GLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMK-------GVLDNKK---ARDRAREEFHKSFSGTKQAGKVVV 272 (441) T ss_pred ccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeC-------CCCCCHH---HHHHHHHHHHHHhcCccccCccee Confidence 999999888777666666677677766544333333322 1111122 223333333333334 233 467 Q ss_pred eccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016071. 314 LPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQSIHGHFVQRDIDII 393 (516) Q Consensus 314 iP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~ev~~~~~~aDa~~i 393 (516) ++.|++.+. .+-+.....+.+..++..++|++++.-..--.+....+ ++..+. +..+..-+.-.++.| T Consensus 273 l~~g~~~~~----------l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~~~-~s~~q~-~~~y~~tl~P~~~~i 340 (441) T protein:vir:98 273 LDESMTFDQ----------LEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETAN-MSITDA-NLDYLSTLKPYITCV 340 (441) T ss_pred cCCCceEEE----------ccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCCC-ccHHHH-HHHHHHHHHHHHHHH Confidence 788874332 11122333466777888899999988765434322222 222221 112334566777888 Q ss_pred HHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCccccc Q lcl|NC_016071. 394 VEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMST 473 (516) Q Consensus 394 ~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~ 473 (516) +..||+.|++. .. ..+-+|..+.....|.+..+++++++++.|++.+ +.+|+.+|+|+-..++... 
T Consensus 341 e~~ln~~L~~~--------~~-~~~~~fd~~~llr~d~~~~~~~~~~~~~~G~~T~-----NE~R~~~gl~pi~gGd~~~ 406 (441) T protein:vir:98 341 CAELNFKFNDE--------YV-NREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNI-----DEIRQRDGLAPIPGGNGSI 406 (441) T ss_pred HHHHHhhcccc--------cc-CceEEEechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCCcce Confidence 88888765432 11 1222444445566888999999999999999886 6899999999655544322 Q ss_pred C-cccccCCCCCCcccccccccCCCCCcccccccccchh Q lcl|NC_016071. 474 D-ELLKLLGQDTSRSGDGMTAGSNGNGTGKISSTRDNSV 511 (516) Q Consensus 474 ~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~ 511 (516) - ......+-+.. +......+.. ..+..++.|++. T Consensus 407 ~~~~~n~~~~~~~---~~~q~~~~~~-~~~~~kgGe~ne 441 (441) T protein:vir:98 407 HRVDLNHVNIELV---DEYQMNKSRA-TDKKLKGGEENE 441 (441) T ss_pred Eeecccccccccc---cccccccccc-cccccCCCCCCC Confidence 1 11111111000 0011111111 111122233222 No 54 >protein:vir:93943 Length: 409 # NCBI annotation: ORF010 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239936;genbank:gi:66395598;genbank:GeneID:5131009 Probab=99.61 E-value=1.1e-14 Score=97.02 Aligned_cols=401 Identities=10% Similarity=0.020 Sum_probs=201.2 Q ss_pred CCccccCccc---ccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhhChHHHHHHHHHHHH Q lcl|NC_016071. 1 MSTRFAQPSE---VVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQDHTVSTALDTKYVF 77 (516) Q Consensus 1 ~~~r~~~~~~---~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~~ 77 (516) |++-.-.... +.......+.. ++..+..+.. +....+ ..+..++-+.|.+|+..+-.. 
T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~-------~~~~~v-~~~~~~~~~~V~~ci~~Ia~~ 61 (409) T protein:vir:93 1 MAKENIVTRIKKKLIDNWIDQSTS-----------KLYDFSPWKN-------RSFWGV-INNTLETNETIFSAITKLSNS 61 (409) T ss_pred CCccchhhhhhhhhhhhhhccccc-----------cccccccccC-------cccccc-chhhhhccHHHHHHHHHHHHh Confidence 5442111100 00000111100 0000111100 000011 123445678899999999999 Q ss_pred HhcCCceeeeCCCCCChhhHHHHHHHHHHHh-hccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceeecc Q lcl|NC_016071. 78 VTKAFNDFKVLYNRDSKASKDAAEFVEYALK-NLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITIDK 155 (516) Q Consensus 78 v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~-~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~~~ 155 (516) |.++++.+.-.. +..+..++. .|. +-+...+..++++.++ +.+.+|-++.++++... |. +.. T Consensus 62 ia~lp~~~~~~~---~~~~~~~~~----lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~-------G~--~~~ 125 (409) T protein:vir:93 62 MASLPLKMYEDY---KVVNTEVSD----LLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIY-------HQ--PSK 125 (409) T ss_pred hhhCceeEeecc---ccccchHHH----HHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCC-------Cc--EEE Confidence 999998874321 222223333 333 2334556777877765 56889999999887543 22 223 Q ss_pred ccccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCcc Q lcl|NC_016071. 156 IAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNP 235 (516) Q Consensus 156 l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p 235 (516) |.+.++.++. ...+.+++.+...- ....+..+.+|.+.+|.+++....+.. T Consensus 126 L~~l~~~~v~----~~~~~~~~~~~y~~-------------------------~~~~g~~~~~~~~eVih~r~~~~~~~~ 176 (409) T protein:vir:93 126 LFLLNPDVVE----MLIENQSRELYYSI-------------------------HAATGNKLIVHNMDMLHFKHIVASNMV 176 (409) T ss_pred EEEEcCceeE----EEEeCCCcEEEEEE-------------------------EcCCceEEEEccccEEEeCCCCCCCcc Confidence 4444443332 23344443221100 011223345777776666655455667 Q ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEec Q lcl|NC_016071. 
236 AGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILP 315 (516) Q Consensus 236 ~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP 315 (516) +|.|.+..+....-......+ +. +..++.+ +..+-+.....++++. +++++.......+....++++ T Consensus 177 ~G~s~i~~~~~~i~~~~~~~~-~~--~~~~~~~-------~~~i~~~~~~l~~e~~---~~~~~~~~~~~~~~g~~~vl~ 243 (409) T protein:vir:93 177 QGISPIDVLKNTTDFDNAVRT-FN--LTEMQKP-------DSFMLKYGSNVGKEKR---QQVLEDFKQYYEENGGILFQE 243 (409) T ss_pred ccccHHHHHHHHHHHHHHHHH-HH--HHhcCCC-------CceEEecCCCCCHHHH---HHHHHHHHHHhhcCCCeeecC Confidence 899998877664444333322 22 2222222 2222222233333333 333444333333334456777 Q ss_pred cCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHHH-HHHHHHHHHHHHHH Q lcl|NC_016071. 316 SDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQS-IHGHFVQRDIDIIV 394 (516) Q Consensus 316 ~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~e-v~~~~~~aDa~~i~ 394 (516) .|++++ ..+-+....+|.+..++...+|++++.-..--.+..+.++++-.+.+.. ....-+.--+++|+ T Consensus 244 ~g~~~~----------~l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie 313 (409) T protein:vir:93 244 PGVEIE----------PLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNRFYLQHTLLPIVKQYE 313 (409) T ss_pred CCceEE----------EcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHH Confidence 776433 2222223335667777888899999887765554444456666655543 44556888888999 Q ss_pred HHHHHHHHHHHHHhcCCcCCccccceEEec--CcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccc Q lcl|NC_016071. 395 EAFNKNLIPQLLALNDIRLSDEDMPKLKPG--LIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMS 472 (516) Q Consensus 395 ~~ln~~li~~lv~lN~~~~~~~~~P~~~~~--~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~ 472 (516) +.||+.|++..- ..... +|.|+ .....|+++.+++++++++.|++.+ +.+|+.+|+|+-+.+|+. 
T Consensus 314 ~~l~~~Ll~~~~------~~~~~--~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~-----NE~R~~~g~~p~~ggD~~ 380 (409) T protein:vir:93 314 EEFNRKLLTKTD------REKNR--YFKFNVKSYLRADSATQAEVYFKAVRSGYYTI-----NDIREWEDLPPVEGGDKP 380 (409) T ss_pred HHHHhhcCCccc------ccCcc--eEEeechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCcCee Confidence 999987775421 01112 35554 4445788999999999999999876 679999999976545543 Q ss_pred cCcccccCCCCCCcccccccccCCCCCcccc Q lcl|NC_016071. 473 TDELLKLLGQDTSRSGDGMTAGSNGNGTGKI 503 (516) Q Consensus 473 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 503 (516) .-...-.+.+.........+.|. +.+.+. T Consensus 381 ~~~~n~~~~~~~~~~~~~~~gG~--~n~~e~ 409 (409) T protein:vir:93 381 LISGDLYPIDTPLELRKSLKGGD--KNVNES 409 (409) T ss_pred eecccccccccchhhcccccCCC--CCcCCC Confidence 32111100100000000011110 111111 No 55 >protein:vir:97060 Length: 432 # NCBI annotation: putative head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453563;genbank:gi:84662598;genbank:GeneID:5142475 Probab=99.61 E-value=2.8e-14 Score=94.85 Aligned_cols=414 Identities=11% Similarity=0.084 Sum_probs=200.0 Q ss_pred CCccc----cCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhhChHHHHHHHHHHH Q lcl|NC_016071. 1 MSTRF----AQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQDHTVSTALDTKYV 76 (516) Q Consensus 1 ~~~r~----~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~ 76 (516) |=.|. ..+.++.-.+. .+..+.. ..........-..+..+. -+..++-+.|.+|+..+-. 
T Consensus 9 ~~~~~~~~~~~~~~~~~~~~--~~~~~~~-------------~~~~~~~~~~~~~g~~v~-~~~a~~~~aV~~~v~~Ia~ 72 (432) T protein:vir:97 9 LLGQLKAMFVPPDPVDIGGG--QTFTPVN-------------ATARDLGIIISDTGAAVN-ADAIMRLDAVAACVKLVSQ 72 (432) T ss_pred hhhhhHhhcCCccccccccc--cccccCc-------------hhhhhhcccccccCcccc-hHhhhcchHHHHHHHHHHH Confidence 11121 11111111010 0111100 000000000000111111 1345568999999999999 Q ss_pred HHhcCCceeeeCCCCCChhhHHHHHHHHHHHh-hccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceeec Q lcl|NC_016071. 77 FVTKAFNDFKVLYNRDSKASKDAAEFVEYALK-NLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITID 154 (516) Q Consensus 77 ~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~-~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~~ 154 (516) .|.++++.+.-... +.. .+..-.-+...|. +-+...++.++++.++ +.+.+|.+++++++. ++ ++ . T Consensus 73 ~ia~lp~~~y~~~~-~g~-~~~~~~pl~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~-~g---~~------~ 140 (432) T protein:vir:97 73 AVAAMPLMMYMRTP-DGR-KEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT-DG---RI------E 140 (432) T ss_pred hhccCceEEEEecC-CCc-ccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec-CC---cE------E Confidence 99999998753221 111 0111111223333 2333456777887766 678899999999874 21 22 2 Q ss_pred cccccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCc Q lcl|NC_016071. 155 KIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESN 234 (516) Q Consensus 155 ~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~ 234 (516) .|.+.++..++ ...+.+|+...... ...+..+.+|.+.++..++.+..+ T Consensus 141 ~L~~l~p~~v~----v~~~~~g~~~y~~~--------------------------~~~g~~~~~~~~~iih~r~~~~dg- 189 (432) T protein:vir:97 141 SLQYLANDRLT----ITTDTKGNTAYRYR--------------------------RTDGQMIDIPRQQIWKIMGYSLDG- 189 (432) T ss_pred EEEEEcCcceE----EEEcCCCcEEEEEE--------------------------ecCceEEEEccccEEEecCcCCCC- Confidence 33334443222 23345554332111 112234467777776666554444 Q ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEe Q lcl|NC_016071. 
235 PAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFIL 314 (516) Q Consensus 235 p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~ii 314 (516) .+|.|.+..+....-.-....++-..+...-+.+--+++.+ ...+++..+ ++.+...... .....+++ T Consensus 190 ~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~--------~~l~~e~~~---~~~~~~~~~~-nag~~~vl 257 (432) T protein:vir:97 190 ENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQID--------RFLTDDQYD---SFSKKVSGSV-EAGRAPLL 257 (432) T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEecC--------CCCCHHHHH---HHHHHHhhhh-cCCCceec Confidence 79999999998776665555566666665433332233322 222333333 3333333221 12235677 Q ss_pred ccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHH----HHHHHHHHHHH Q lcl|NC_016071. 315 PSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQ----SIHGHFVQRDI 390 (516) Q Consensus 315 P~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~----ev~~~~~~aDa 390 (516) +.|++++. .+-+....++.+..++...+|++++--..--.+....|+++.+...+ .....-+.-.+ T Consensus 258 ~~g~~~~~----------l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~s~~e~~~~~f~~~tl~P~~ 327 (432) T protein:vir:97 258 EGGMDVKS----------LGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQLGFLTMTLSPWL 327 (432) T ss_pred CCCceEEE----------ccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCcccccchhHHHHHHHHHHHHHHHHH Confidence 88875332 22223344567778888999999877654333323334443322222 23334566677 Q ss_pred HHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcc Q lcl|NC_016071. 391 DIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPED 470 (516) Q Consensus 391 ~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~ 470 (516) +.|+..||+.|+.+-- . 
...+-+|.++..-..|.++.++++.+++..|++.+ +.+|+.+|+|+-.+++ T Consensus 328 ~~ie~~ln~kLl~~~e------~-~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~-----NE~R~~~glpp~~g~~ 395 (432) T protein:vir:97 328 RRIEQSIALNLLTPAE------R-RRYFADFDTSALLRADSAARSSYYSQLVNNGLMTR-----DEAREIEGLPKLGGNA 395 (432) T ss_pred HHHHHHHhhhccCccc------c-CceEEEeechhhhccCHHHHHHHHHHHHhCCCCCH-----HHHHHHhCCCCCCCCc Confidence 7888888876654310 0 11122333334456788999999999999999876 6799999998765443 Q ss_pred cccCcccccCCCCCCcccccccccCCCCCcccccccccchhhh Q lcl|NC_016071. 471 MSTDELLKLLGQDTSRSGDGMTAGSNGNGTGKISSTRDNSVSN 513 (516) Q Consensus 471 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~ 513 (516) ..........+-.... ..+. ..+++++.. ...++.+. T Consensus 396 ~~~~~~~~~~pl~~~~-~~~~--~~~~~~~~~---~~~~~~~~ 432 (432) T protein:vir:97 396 AVLTVQSAMVPLDSIG-LQAS--PEPASGLGN---QQQDKVSK 432 (432) T ss_pred ceEeecccccchhhhc-ccCC--CCCCCCCCC---cccccccC Confidence 3322211111100000 0000 001111111 11111111 No 56 >protein:vir:4194 Length: 540 # NCBI annotation: putative portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071819;genbank:gi:11863102;genbank:GeneID:1257604 Probab=99.61 E-value=7.3e-14 Score=92.61 Aligned_cols=440 Identities=12% Similarity=0.078 Sum_probs=210.3 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHh-hChHHHHHHHHHHHHHh Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMK-QDHTVSTALDTKYVFVT 79 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~-~D~~v~s~l~~Rk~~v~ 79 (516) ||.+.-....+.+...+.+++.--+..+. . + | |-.+..+.++. ..+++.+|+..+...|. 
T Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------~--~--p----p~~~~~La~~~~~n~~v~scI~~ia~~ia 66 (540) T protein:vir:41 6 LSIKSLEKYRAIKGDTDSQALKEDRFEEY-----------V--E--P----KVHPLVLLSLLQVNPYHASACSIKANDIL 66 (540) T ss_pred cChhhccchhhhhccccccccccCCCCcc-----------c--c--C----CCCHHHHHHHHHhcHHHHHHHHHHHHHHh Confidence 88887666666665554443322111111 0 1 1 22344444555 48999999999999999 Q ss_pred cCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceeeccccc Q lcl|NC_016071. 80 KAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITIDKIAF 158 (516) Q Consensus 80 ~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l~~ 158 (516) +++|.++...+ ...++ +. +...++.+++..++ +.+.+|.+++|+++...+ . +..|.+ T Consensus 67 ~~~~~i~~~~~-------~~~~~----lp--N~~~t~~~f~~~~v~dlll~Gnayv~i~r~~~G-------~--~~~L~~ 124 (540) T protein:vir:41 67 RTGYLIDGDDG-------GVEEL----LR--ACRPSFEFILLQALEDLQVFNYCTLEVVRDDQG-------E--PVRLDY 124 (540) T ss_pred cCCceEecCcc-------chhhh----cc--CCCCCHHHHHHHHHHHHHhcCCeEEEEEECCCC-------c--EEEEEE Confidence 99998864322 12222 22 23456788888877 578899999999986532 1 223444 Q ss_pred cCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCccccc Q lcl|NC_016071. 159 RPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPAGV 238 (516) Q Consensus 159 r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~G~ 238 (516) .++.+++. ..++.....+............++. -............+|...+|.++.....+.+||. 
T Consensus 125 i~~~~V~v------~~~~~~~~~~~d~~~~~~~~~~~~~-------~~~~~~~g~~~~~~~~~eViHir~~~~~~~~~G~ 191 (540) T protein:vir:41 125 IPAHTVRV------HRDGSRYMQTWDGIHVTYFKDYRYE-------GEVNPDNGEDQDGVGANEIIFIHLPSPICSYYGV 191 (540) T ss_pred eCCcceEE------eEcCceeEeeecCceeeeeeccccc-------ceeeccccccceeecccceEEecCCCCCCCcccc Confidence 44444431 1122111111110000000000000 0011112223345677666655555556778999 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcc--cceE--EEe Q lcl|NC_016071. 239 SPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAG--EQAY--FIL 314 (516) Q Consensus 239 gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g--~~a~--~ii 314 (516) |.+..+......-....++-..|....+.+--+++.+-. +...............+.+.+...+...| .+++ +++ T Consensus 192 Spi~~~~~~i~~~~~~~~~~~~~f~Ng~~p~giL~~~g~-l~~e~~~~~~~~~~~~~~~~~~~~~~~~g~~~nag~~~vL 270 (540) T protein:vir:41 192 PRYLSAAPSILAMQKIDEYNYAFFDNYTIPSYVITVTGE-FEDEMELGSDGEPTGRTVLQGLIEDNFKYLKEAPHTPLVF 270 (540) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCcc-cCchhccchHHHHHHHHHHHHHHHHHhccccccccceEEE Confidence 999999988887777777777776543333333433311 11111111112222233343333322222 2222 233 Q ss_pred ccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCc--cchhhHHHHH-HHHHHHHHHHHHH Q lcl|NC_016071. 315 PSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDG--QGSYNLSESK-QSIHGHFVQRDID 391 (516) Q Consensus 315 P~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~--~GS~Al~~vh-~ev~~~~~~aDa~ 391 (516) ... ......+++...+-+.....|.+..++...+|++++.-.---.+... ++++|-.+.. 
.....+.+.-.++ T Consensus 271 e~~----~~~~~g~~~~pl~~~~~d~qfle~~~~~~~eIa~afgVPp~~lG~~~~~~~n~sn~eq~~~~f~~~tL~P~~~ 346 (540) T protein:vir:41 271 SIP----GGDTVEVTFTPLNTSQKELSFREYAAEKKHDIAAAHMIDPYRLGITDVGPLGGNFAEVARRTYYESVVRPQQE 346 (540) T ss_pred ecC----CCcccceeEEecccchhHHHHHHHHHHHHHHHHHHhCCCHHHcCcccCCCCCcccHHHHHHHHHHHHHHHHHH Confidence 210 00111244444443444556778888889999998876554333221 2223434433 3345666888999 Q ss_pred HHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHc-CCCCCCCcc Q lcl|NC_016071. 392 IIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVG-GFDEEIPED 470 (516) Q Consensus 392 ~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~-Glp~~~~~~ 470 (516) .|++.||+.|++. ++ .+ -+|+|+...-.+ ...++.+.++++.|++.+ +.+|+.+ |+|+ .++. T Consensus 347 ~ie~~ln~~L~~~---~~----~~---~~i~f~~~~ll~-~D~~~~~~~lv~~G~lT~-----NE~Re~L~g~e~-gdd~ 409 (540) T protein:vir:41 347 IVSSVLTDFIQLK---LD----PG---ARFVFNEEILME-SEFVHNYALLVQCGVLTP-----SEVREKLFGLDG-GPDM 409 (540) T ss_pred HHHHHHHHhhhhc---cC----Cc---eEEEecchhhcc-hHHHHHHHHHHhCCCCCH-----HHHHHHhCcCcC-CCcc Confidence 9999999876542 11 11 146666544433 234667888999999876 4578754 6653 1111 Q ss_pred cccCccc---ccCCCC----CCcccccccccC---C-CCCcccccccccchhhhhcC Q lcl|NC_016071. 471 MSTDELL---KLLGQD----TSRSGDGMTAGS---N-GNGTGKISSTRDNSVSNMDN 516 (516) Q Consensus 471 ~~~~~~~---~~~~~~----~~~~~~~~~~~~---~-~~~~~~~~~~~d~~~~~~~~ 516 (516) -..+... ....+. .....+..+..+ + -+++-......+..-..+.. 
T Consensus 410 ~l~p~n~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 466 (540) T protein:vir:41 410 FMVPSSIGKSAMKRQKRNYEKNQINEIKRTYAKYKPRIQEIISSESPLEDKKKKIDE 466 (540) T ss_pred cccccccccccccccccccCCCCccccccccchhcccccCccccccccccccccccc Confidence 1111000 000000 000001111000 0 00000000001111111111 No 57 >protein:vir:79984 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430000;genbank:gi:156604055;genbank:GeneID:5525444 Probab=99.61 E-value=5.6e-14 Score=93.25 Aligned_cols=412 Identities=12% Similarity=0.029 Sum_probs=203.4 Q ss_pred CCccccCccccc---------chhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHH--HHHhhChHHHH Q lcl|NC_016071. 1 MSTRFAQPSEVV---------KAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATV--EAMKQDHTVST 69 (516) Q Consensus 1 ~~~r~~~~~~~~---------~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y--~~m~~D~~v~s 69 (516) -..|+.-.+... ++..+.|..+ . +.+.... +... +..+..| +.-++.+.|.+ T Consensus 13 ~~~~~~~~~~~~~~~lf~~~e~R~~~~~~~~-----------~---~~~~~~~--~~~~-~~~~~~~~~~~al~~~~V~~ 75 (441) T protein:vir:79 13 FKSRKQSRKELVVVGIFYKNEKRDLQYNEDD-----------L---QMMVQTL--PGFQ-GTKLRQYKDIEAIRHSDIFT 75 (441) T ss_pred ccccccchhhhhccccccccccccccCCCcc-----------h---HHHHHHh--cccC-cccccccchhhhhccHHHHH Confidence 222222222111 0111111110 0 0011111 0001 1112223 33456888999 Q ss_pred HHHHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhh-ccCcCCHHHHHHHHH-HHHhhcceeeeEEEeeccccccc Q lcl|NC_016071. 70 ALDTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKN-LANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKY 147 (516) Q Consensus 70 ~l~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~-~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~ 147 (516) |+..+-..|.++++++.- +.....+..++ ..|.. -+...+..+++..+. +.+.+|-+++++++... 
T Consensus 76 cv~~Ia~~iA~lp~~~~~--~~~~~~~~~~~----~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~------ 143 (441) T protein:vir:79 76 AVMMIASDLARMPIRVTV--NGQINYSDRIV----NLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKT------ 143 (441) T ss_pred HHHHHHHhhccCceeeec--CccccccchHH----HHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECCC------ Confidence 999999999999988752 22111112222 33332 233345667776655 47889999999988643 Q ss_pred ccceeeccccccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEe Q lcl|NC_016071. 148 AGYITIDKIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMS 227 (516) Q Consensus 148 ~g~~~~~~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~ 227 (516) |. +..|.+.++.+++ ...+.+|+......... .........+|...+|.++ T Consensus 144 -G~--~~~L~~i~~~~v~----v~~d~~g~~~~~~~~~~----------------------~~~~~~~~~~~~~dvih~k 194 (441) T protein:vir:79 144 -GE--PMNLTFRKTSEIE----LKSDARGRLYYFHQRID----------------------SNGNNIERNVKFEDMLDIK 194 (441) T ss_pred -Cc--EEEEEEEcCceeE----EEECCCccEEEEEEEec----------------------cCCceeEEEEccccEEEec Confidence 22 2334455554443 23455554322111000 0001112346777777666 Q ss_pred ecCcCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcc Q lcl|NC_016071. 228 LGGTESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAG 307 (516) Q Consensus 228 ~~~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g 307 (516) +.+.. 
..+|.|++..+....-.-....++...+...-+.+--+++.+ ..-.+++ ..+++++.......| T Consensus 195 ~~~~d-g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-------~~~~~~e---~~e~~r~~~~~~~~G 263 (441) T protein:vir:79 195 FYSLD-GINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMK-------GVLDNKK---ARDRAREEFHKSFSG 263 (441) T ss_pred cCCCC-CccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcC-------CCCCCHH---HHHHHHHHHHHHhcC Confidence 65444 479999999988776666666666666666544433333322 1111122 222334333333333 Q ss_pred -cce--EEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHHHHHHH Q lcl|NC_016071. 308 -EQA--YFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQSIHGH 384 (516) Q Consensus 308 -~~a--~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~ev~~~ 384 (516) ..+ -+++|.|++.+- .+-+.....|.+..++..++|++++.-...-.+..+.+ ++..+. +..+.. T Consensus 264 ~~nag~~~vl~~G~~~~~----------l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~-~s~~q~-~~~~~~ 331 (441) T protein:vir:79 264 TKQAGKVVVLDESMTFDQ----------LEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETAN-MSITDA-NLDYLS 331 (441) T ss_pred ccccCcceecCCCceEEE----------ccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCCC-ccHHHH-HHHHHH Confidence 233 367788874322 22223334577777888899999987754333322212 222221 222334 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCC Q lcl|NC_016071. 385 FVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFD 464 (516) Q Consensus 385 ~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp 464 (516) -+.-.++.|+..||+.|++.. 
.+ .+-+|.++.....|.+..+++++++++.|++.+ +.+|+.+|+| T Consensus 332 tl~P~~~~ie~eln~kl~~~~--------~~-~~~~fd~~~llr~D~~~~~~~~~~~i~~G~~T~-----NE~R~~~gl~ 397 (441) T protein:vir:79 332 TLKPYITCVCAELNFKFNDEY--------VN-REFKFDTTEIRVVDEKTQAEIDKINIDSGKMNI-----DEIRQRDGLA 397 (441) T ss_pred HHHHHHHHHHHHHhhhccccc--------cC-ceEEeechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCC Confidence 577788888888887654321 11 222444455566788999999999999999876 5799999999 Q ss_pred CCCCcccccC-cccccCCCCCCcccccccccCCCCCcccccccccchh Q lcl|NC_016071. 465 EEIPEDMSTD-ELLKLLGQDTSRSGDGMTAGSNGNGTGKISSTRDNSV 511 (516) Q Consensus 465 ~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~ 511 (516) +-..++...- ......+-+... ..+...+.. ..+..++.|++. T Consensus 398 Pi~ggd~~~~~~~~n~~~~~~~~---~~~~~~~~~-~~~~~kgGe~~e 441 (441) T protein:vir:79 398 PIPGGNGSIHRVDLNHVNIELVD---EYQMNKSRA-TDKKLKGGEENE 441 (441) T ss_pred CCCCCCcceEeeccccccccccc---ccccccccc-cccccCCCCCCC Confidence 6555554221 111111111000 011111111 111222333333 No 58 >protein:vir:9408 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803386;genbank:gi:29028698;genbank:GeneID:1258164 Probab=99.61 E-value=5.6e-14 Score=93.25 Aligned_cols=412 Identities=12% Similarity=0.029 Sum_probs=203.4 Q ss_pred CCccccCccccc---------chhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHH--HHHhhChHHHH Q lcl|NC_016071. 1 MSTRFAQPSEVV---------KAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATV--EAMKQDHTVST 69 (516) Q Consensus 1 ~~~r~~~~~~~~---------~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y--~~m~~D~~v~s 69 (516) -..|+.-.+... ++..+.|..+ . +.+.... +... 
+..+..| +.-++.+.|.+ T Consensus 13 ~~~~~~~~~~~~~~~lf~~~e~R~~~~~~~~-----------~---~~~~~~~--~~~~-~~~~~~~~~~~al~~~~V~~ 75 (441) T protein:vir:94 13 FKSRKQSRKELVVVGIFYKNEKRDLQYNEDD-----------L---QMMVQTL--PGFQ-GTKLRQYKDIEAIRHSDIFT 75 (441) T ss_pred ccccccchhhhhccccccccccccccCCCcc-----------h---HHHHHHh--cccC-cccccccchhhhhccHHHHH Confidence 222222222111 0111111110 0 0011111 0001 1112223 33456888999 Q ss_pred HHHHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhh-ccCcCCHHHHHHHHH-HHHhhcceeeeEEEeeccccccc Q lcl|NC_016071. 70 ALDTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKN-LANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKY 147 (516) Q Consensus 70 ~l~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~-~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~ 147 (516) |+..+-..|.++++++.- +.....+..++ ..|.. -+...+..+++..+. +.+.+|-+++++++... T Consensus 76 cv~~Ia~~iA~lp~~~~~--~~~~~~~~~~~----~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~------ 143 (441) T protein:vir:94 76 AVMMIASDLARMPIRVTV--NGQINYSDRIV----NLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKT------ 143 (441) T ss_pred HHHHHHHhhccCceeeec--CccccccchHH----HHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECCC------ Confidence 999999999999988752 22111112222 33332 233345667776655 47889999999988643 Q ss_pred ccceeeccccccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEe Q lcl|NC_016071. 148 AGYITIDKIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMS 227 (516) Q Consensus 148 ~g~~~~~~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~ 227 (516) |. +..|.+.++.+++ ...+.+|+......... 
.........+|...+|.++ T Consensus 144 -G~--~~~L~~i~~~~v~----v~~d~~g~~~~~~~~~~----------------------~~~~~~~~~~~~~dvih~k 194 (441) T protein:vir:94 144 -GE--PMNLTFRKTSEIE----LKSDARGRLYYFHQRID----------------------SNGNNIERNVKFEDMLDIK 194 (441) T ss_pred -Cc--EEEEEEEcCceeE----EEECCCccEEEEEEEec----------------------cCCceeEEEEccccEEEec Confidence 22 2334455554443 23455554322111000 0001112346777777666 Q ss_pred ecCcCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcc Q lcl|NC_016071. 228 LGGTESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAG 307 (516) Q Consensus 228 ~~~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g 307 (516) +.+.. ..+|.|++..+....-.-....++...+...-+.+--+++.+ ..-.+++ ..+++++.......| T Consensus 195 ~~~~d-g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-------~~~~~~e---~~e~~r~~~~~~~~G 263 (441) T protein:vir:94 195 FYSLD-GINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMK-------GVLDNKK---ARDRAREEFHKSFSG 263 (441) T ss_pred cCCCC-CccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcC-------CCCCCHH---HHHHHHHHHHHHhcC Confidence 65444 479999999988776666666666666666544433333322 1111122 222334333333333 Q ss_pred -cce--EEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHHHHHHH Q lcl|NC_016071. 308 -EQA--YFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQSIHGH 384 (516) Q Consensus 308 -~~a--~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~ev~~~ 384 (516) ..+ -+++|.|++.+- .+-+.....|.+..++..++|++++.-...-.+..+.+ ++..+. +..+.. 
T Consensus 264 ~~nag~~~vl~~G~~~~~----------l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~-~s~~q~-~~~~~~ 331 (441) T protein:vir:94 264 TKQAGKVVVLDESMTFDQ----------LEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETAN-MSITDA-NLDYLS 331 (441) T ss_pred ccccCcceecCCCceEEE----------ccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCCC-ccHHHH-HHHHHH Confidence 233 367788874322 22223334577777888899999987754333322212 222221 222334 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCC Q lcl|NC_016071. 385 FVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFD 464 (516) Q Consensus 385 ~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp 464 (516) -+.-.++.|+..||+.|++.. .+ .+-+|.++.....|.+..+++++++++.|++.+ +.+|+.+|+| T Consensus 332 tl~P~~~~ie~eln~kl~~~~--------~~-~~~~fd~~~llr~D~~~~~~~~~~~i~~G~~T~-----NE~R~~~gl~ 397 (441) T protein:vir:94 332 TLKPYITCVCAELNFKFNDEY--------VN-REFKFDTTEIRVVDEKTQAEIDKINIDSGKMNI-----DEIRQRDGLA 397 (441) T ss_pred HHHHHHHHHHHHHhhhccccc--------cC-ceEEeechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCC Confidence 577788888888887654321 11 222444455566788999999999999999876 5799999999 Q ss_pred CCCCcccccC-cccccCCCCCCcccccccccCCCCCcccccccccchh Q lcl|NC_016071. 465 EEIPEDMSTD-ELLKLLGQDTSRSGDGMTAGSNGNGTGKISSTRDNSV 511 (516) Q Consensus 465 ~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~ 511 (516) +-..++...- ......+-+... ..+...+.. ..+..++.|++. 
T Consensus 398 Pi~ggd~~~~~~~~n~~~~~~~~---~~~~~~~~~-~~~~~kgGe~~e 441 (441) T protein:vir:94 398 PIPGGNGSIHRVDLNHVNIELVD---EYQMNKSRA-TDKKLKGGEENE 441 (441) T ss_pred CCCCCCcceEeeccccccccccc---ccccccccc-cccccCCCCCCC Confidence 6555554221 111111111000 011111111 111222333333 No 59 >protein:vir:4598 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058443;genbank:gi:9635169;genbank:GeneID:1262702 Probab=99.61 E-value=4.4e-14 Score=93.82 Aligned_cols=408 Identities=11% Similarity=-0.023 Sum_probs=200.4 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhhChHHHHHHHHHHHHHhc Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQDHTVSTALDTKYVFVTK 80 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~~v~~ 80 (516) -.+++++......-+ +..+...........+..+. -+..++.+.|.+|+..+-..|.+ T Consensus 4 f~~~~~r~~~~~~~~---------------------~~~~~~~~~~~~~~~~~~~~-~~~al~~~~v~~cv~~Ia~~iA~ 61 (416) T protein:vir:45 4 FYKNEKRDLQYNEDD---------------------LQMMVQTLPGFQGTKLRQYK-DIEAIRHSDIFTAVMMIASDLAR 61 (416) T ss_pred ccccccccccCCCcc---------------------hhHHHHHhccccccCccccc-hhhhhcchHHHHHHHHHHHhhcc Confidence 222222111100000 00011100000000011111 12335678899999999999999 Q ss_pred CCceeeeCCCCCChhhHHHHHHHHHHHh-hccCcCCHHHHHHHHHH-HHhhcceeeeEEEeecccccccccceeeccccc Q lcl|NC_016071. 81 AFNDFKVLYNRDSKASKDAAEFVEYALK-NLANQQTLRDIARSAAT-FNEYGFSIFEKVYRTESAPSKYAGYITIDKIAF 158 (516) Q Consensus 81 ~~w~i~~~~~~d~~~~~~~a~~v~~~l~-~~~~~~~~~~~l~~~ld-a~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l~~ 158 (516) ++|++.. .+ .... .+-+-..|. +-+...+..+++..+.. .+.+|.+++++++... |. 
+..|.+ T Consensus 62 ~p~~~~~-~~-~~~~----~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~-------G~--~~~L~~ 126 (416) T protein:vir:45 62 MPIRVTV-NG-QINY----SDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKT-------GE--PMNLTF 126 (416) T ss_pred CceEEec-Cc-cccc----cchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCC-------Cc--EEEEEE Confidence 9998753 12 1111 122333443 23334556777777664 6789999999887543 22 223444 Q ss_pred cCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCccccc Q lcl|NC_016071. 159 RPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPAGV 238 (516) Q Consensus 159 r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~G~ 238 (516) .++.++. ...+.+|+......... .........+|...+|.+++.+ .+.++|. T Consensus 127 i~~~~v~----v~~~~~g~~~~~~~~~~----------------------~~~~~~~~~~~~~evihir~~~-~d~~~G~ 179 (416) T protein:vir:45 127 RKTSEIE----LKSDARGRLYYFHQRID----------------------SNGNNIERNVKFEDMLDIKFYS-LDGINGL 179 (416) T ss_pred EcCceeE----EEECCCccEEEEEEEec----------------------CCCceeEEEEccccEEEeccCC-CCCcccc Confidence 4444332 23345554322111000 0001112346777777666554 4458999 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcc-cce--EEEec Q lcl|NC_016071. 
239 SPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAG-EQA--YFILP 315 (516) Q Consensus 239 gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g-~~a--~~iiP 315 (516) |++..+....-.-....++...+....+.+--+++.+ ..-.++ +..+++++.......| ..+ -++++ T Consensus 180 s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~--------~~~~~~--~~~~~~~~~~~~~~~g~~nag~~~vl~ 249 (416) T protein:vir:45 180 SLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMK--------GVLDNK--KARDRAREEFHKSFSGTKQAGKVVVLD 249 (416) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeC--------CCCCCH--HHHHHHHHHHHHHhcCccccCceeecC Confidence 9999998877776666677777766544443333332 111111 1223333333333333 223 36777 Q ss_pred cCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016071. 316 SDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQSIHGHFVQRDIDIIVE 395 (516) Q Consensus 316 ~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~ev~~~~~~aDa~~i~~ 395 (516) .|++.+- .+-+....+|.+..++..++|++++.-..--.+..+.+ ++..+. +-.+..-+.-.++.|+. T Consensus 250 ~g~~~~~----------l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~-~~~~~~-~~~~~~~l~P~~~~ie~ 317 (416) T protein:vir:45 250 ESMTFDQ----------LEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETAN-MSITDA-NLDYLSTLKPYITCVCA 317 (416) T ss_pred CCceeEe----------ccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCC-ccHHHH-HHHHHHHHHHHHHHHHH Confidence 7764321 11222334567777888899999988754223322222 222221 11233456778888888 Q ss_pred HHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccccCc Q lcl|NC_016071. 396 AFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMSTDE 475 (516) Q Consensus 396 ~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~~~ 475 (516) .||+.|.+. ..+ .+-+|.++.....|.+..+++++++++.|++.+ +.+|+.+|+|+-..++...-. 
T Consensus 318 ~ln~~l~~~--------~~~-~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~~T~-----NE~R~~~gl~p~~~gd~~~~~ 383 (416) T protein:vir:45 318 ELNFKFNDE--------YVN-REFKFDTTEIRVVDEKTQAEIDKINIDSGKMNI-----DEIRQRDGLAPIPGGNGSIHR 383 (416) T ss_pred HHhhhcccc--------ccC-ceEEEechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCCcceEe Confidence 888765432 111 222444455566788999999999999999886 579999999865444432211 Q ss_pred ccccCCCCCCcccccccccCCCCCcccccccccchh Q lcl|NC_016071. 476 LLKLLGQDTSRSGDGMTAGSNGNGTGKISSTRDNSV 511 (516) Q Consensus 476 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~ 511 (516) ...-..+.... +......+. .+.+..++.|++. T Consensus 384 ~~~n~~~~~~~--~~~~~~~~~-~~~~~~kgGe~n~ 416 (416) T protein:vir:45 384 VDLNHVNIELV--DEYQMNKSR-ATDKKLKGGEENE 416 (416) T ss_pred ecccccccccc--cccCccccc-ccccccCCCCCCC Confidence 11100000000 000111100 0111122233222 No 60 >protein:vir:81095 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429872;genbank:gi:156603925;genbank:GeneID:5525315 Probab=99.61 E-value=4.4e-14 Score=93.82 Aligned_cols=408 Identities=11% Similarity=-0.023 Sum_probs=200.4 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhhChHHHHHHHHHHHHHhc Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQDHTVSTALDTKYVFVTK 80 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~~v~~ 80 (516) -.+++++......-+ +..+...........+..+. 
-+..++.+.|.+|+..+-..|.+ T Consensus 4 f~~~~~r~~~~~~~~---------------------~~~~~~~~~~~~~~~~~~~~-~~~al~~~~v~~cv~~Ia~~iA~ 61 (416) T protein:vir:81 4 FYKNEKRDLQYNEDD---------------------LQMMVQTLPGFQGTKLRQYK-DIEAIRHSDIFTAVMMIASDLAR 61 (416) T ss_pred ccccccccccCCCcc---------------------hhHHHHHhccccccCccccc-hhhhhcchHHHHHHHHHHHhhcc Confidence 222222111100000 00011100000000011111 12335678899999999999999 Q ss_pred CCceeeeCCCCCChhhHHHHHHHHHHHh-hccCcCCHHHHHHHHHH-HHhhcceeeeEEEeecccccccccceeeccccc Q lcl|NC_016071. 81 AFNDFKVLYNRDSKASKDAAEFVEYALK-NLANQQTLRDIARSAAT-FNEYGFSIFEKVYRTESAPSKYAGYITIDKIAF 158 (516) Q Consensus 81 ~~w~i~~~~~~d~~~~~~~a~~v~~~l~-~~~~~~~~~~~l~~~ld-a~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l~~ 158 (516) ++|++.. .+ .... .+-+-..|. +-+...+..+++..+.. .+.+|.+++++++... |. +..|.+ T Consensus 62 ~p~~~~~-~~-~~~~----~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~-------G~--~~~L~~ 126 (416) T protein:vir:81 62 MPIRVTV-NG-QINY----SDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKT-------GE--PMNLTF 126 (416) T ss_pred CceEEec-Cc-cccc----cchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCC-------Cc--EEEEEE Confidence 9998753 12 1111 122333443 23334556777777664 6789999999887543 22 223444 Q ss_pred cCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCccccc Q lcl|NC_016071. 159 RPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPAGV 238 (516) Q Consensus 159 r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~G~ 238 (516) .++.++. ...+.+|+......... .........+|...+|.+++.+ .+.++|. 
T Consensus 127 i~~~~v~----v~~~~~g~~~~~~~~~~----------------------~~~~~~~~~~~~~evihir~~~-~d~~~G~ 179 (416) T protein:vir:81 127 RKTSEIE----LKSDARGRLYYFHQRID----------------------SNGNNIERNVKFEDMLDIKFYS-LDGINGL 179 (416) T ss_pred EcCceeE----EEECCCccEEEEEEEec----------------------CCCceeEEEEccccEEEeccCC-CCCcccc Confidence 4444332 23345554322111000 0001112346777777666554 4458999 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcc-cce--EEEec Q lcl|NC_016071. 239 SPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAG-EQA--YFILP 315 (516) Q Consensus 239 gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g-~~a--~~iiP 315 (516) |++..+....-.-....++...+....+.+--+++.+ ..-.++ +..+++++.......| ..+ -++++ T Consensus 180 s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~--------~~~~~~--~~~~~~~~~~~~~~~g~~nag~~~vl~ 249 (416) T protein:vir:81 180 SLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMK--------GVLDNK--KARDRAREEFHKSFSGTKQAGKVVVLD 249 (416) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeC--------CCCCCH--HHHHHHHHHHHHHhcCccccCceeecC Confidence 9999998877776666677777766544443333332 111111 1223333333333333 223 36777 Q ss_pred cCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016071. 316 SDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQSIHGHFVQRDIDIIVE 395 (516) Q Consensus 316 ~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~ev~~~~~~aDa~~i~~ 395 (516) .|++.+- .+-+....+|.+..++..++|++++.-..--.+..+.+ ++..+. +-.+..-+.-.++.|+. 
T Consensus 250 ~g~~~~~----------l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~-~~~~~~-~~~~~~~l~P~~~~ie~ 317 (416) T protein:vir:81 250 ESMTFDQ----------LEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETAN-MSITDA-NLDYLSTLKPYITCVCA 317 (416) T ss_pred CCceeEe----------ccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCC-ccHHHH-HHHHHHHHHHHHHHHHH Confidence 7764321 11222334567777888899999988754223322222 222221 11233456778888888 Q ss_pred HHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccccCc Q lcl|NC_016071. 396 AFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMSTDE 475 (516) Q Consensus 396 ~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~~~ 475 (516) .||+.|.+. ..+ .+-+|.++.....|.+..+++++++++.|++.+ +.+|+.+|+|+-..++...-. T Consensus 318 ~ln~~l~~~--------~~~-~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~~T~-----NE~R~~~gl~p~~~gd~~~~~ 383 (416) T protein:vir:81 318 ELNFKFNDE--------YVN-REFKFDTTEIRVVDEKTQAEIDKINIDSGKMNI-----DEIRQRDGLAPIPGGNGSIHR 383 (416) T ss_pred HHhhhcccc--------ccC-ceEEEechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCCcceEe Confidence 888765432 111 222444455566788999999999999999886 579999999865444432211 Q ss_pred ccccCCCCCCcccccccccCCCCCcccccccccchh Q lcl|NC_016071. 476 LLKLLGQDTSRSGDGMTAGSNGNGTGKISSTRDNSV 511 (516) Q Consensus 476 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~ 511 (516) ...-..+.... +......+. .+.+..++.|++. 
T Consensus 384 ~~~n~~~~~~~--~~~~~~~~~-~~~~~~kgGe~n~ 416 (416) T protein:vir:81 384 VDLNHVNIELV--DEYQMNKSR-ATDKKLKGGEENE 416 (416) T ss_pred ecccccccccc--cccCccccc-ccccccCCCCCCC Confidence 11100000000 000111100 0111122233222 No 61 >protein:vir:96980 Length: 409 # NCBI annotation: ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239857;genbank:gi:66395516;genbank:GeneID:5133013 Probab=99.61 E-value=4.4e-14 Score=93.78 Aligned_cols=398 Identities=11% Similarity=0.043 Sum_probs=198.6 Q ss_pred CCc-----cccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhhChHHHHHHHHHH Q lcl|NC_016071. 1 MST-----RFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQDHTVSTALDTKY 75 (516) Q Consensus 1 ~~~-----r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk 75 (516) |+| |++.. +.......+ .+++- .+..+. . +....+ ..+..++-+.|.+|+..+- T Consensus 1 ~~~~~~~~~~k~~--~~~~~~~~~-~~~~~----------~~~~~~----~---~~~~~v-~~~~a~~~~~V~~ci~~ia 59 (409) T protein:vir:96 1 MAKENIVTRIKKK--LIDNWIDQS-ASKLY----------DFSPWK----N---KSFWGV-INNTLETNETIFSAITKLS 59 (409) T ss_pred CccccchhhhhhH--Hhhhhhccc-ccccc----------cccccc----C---cccccc-chhhHhhhHHHHHHHHHHH Confidence 443 21111 111111111 11100 000000 0 000011 1133446788999999999 Q ss_pred HHHhcCCceeeeCCCCCChhhHHHHHHHHHHHh-hccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceee Q lcl|NC_016071. 76 VFVTKAFNDFKVLYNRDSKASKDAAEFVEYALK-NLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITI 153 (516) Q Consensus 76 ~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~-~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~ 153 (516) ..|.+++|.+.-.. ...+..+.+ .|. +-+...+..++++.++ +.+.+|-++.++++...+ . 
+ T Consensus 60 ~~ia~lp~~~~~~~---~~~~~~l~~----lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G-------~--~ 123 (409) T protein:vir:96 60 NSMASLPLKMYEDY---KVVNTEVSD----LLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYH-------Q--P 123 (409) T ss_pred HhhhhCceEEeecc---cccchhHHH----HHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCC-------c--E Confidence 99999999874221 122222333 333 2233455667776655 578899999999876432 2 2 Q ss_pred ccccccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCC Q lcl|NC_016071. 154 DKIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTES 233 (516) Q Consensus 154 ~~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g 233 (516) ..|.+-++..+. ...+.+++.+... .....+....+|...+|.+++....+ T Consensus 124 ~~L~~l~~~~v~----v~~~~~~~~~~y~-------------------------~~~~~g~~~~~~~~evih~r~~~~~~ 174 (409) T protein:vir:96 124 SKLFLLNPDVVE----MLIENQSRELYYS-------------------------IHAATGNKLIVHNMDMLHFKHIVASN 174 (409) T ss_pred EEEEEEcCceeE----EEEeCCCcEEEEE-------------------------EEcCCceEEEEccccEEEeCCCCCCC Confidence 234444443332 2233443322110 00112234456777766666555556 Q ss_pred ccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEE Q lcl|NC_016071. 
234 NPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFI 313 (516) Q Consensus 234 ~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~i 313 (516) ..+|.|.+..+....-.-....+++ ...++.+ ++.+-+....-++++.+ ++++...+...+....++ T Consensus 175 ~~~G~s~l~~~~~~i~~~~~~~~~~---~~~~~~~-------~~~i~~~~~~l~~e~~~---~~~~~~~~~~~n~g~~~v 241 (409) T protein:vir:96 175 MVQGISPIDVLKNTTDFDNAVRTFN---LTEMQKP-------DSFMLKYGSNVSTEKRQ---QVLEDFKQYYEENGGILF 241 (409) T ss_pred ccccccHHHHHHHHHHHHHHHHHHH---HHhcCCC-------ceeEEecCCCCCHHHHH---HHHHHHHHHhhcCCCeee Confidence 6789999987765443333333332 1222222 11122222233333333 333333332334444667 Q ss_pred eccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHHH-HHHHHHHHHHHH Q lcl|NC_016071. 314 LPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQS-IHGHFVQRDIDI 392 (516) Q Consensus 314 iP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~e-v~~~~~~aDa~~ 392 (516) ++.|++++- .+-+....++.+..++..++|++++--..--.+..+.++++-.+-+.. ....-+.--++. T Consensus 242 l~~g~~~~~----------l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~s~~e~~~~~f~~~~l~P~~~~ 311 (409) T protein:vir:96 242 QEPGVEIEP----------LPKKYVSEDIVASENLTRERVANVFQLPSIFLNARSNTNFAKNEELNRFYLQHTLLPIVKQ 311 (409) T ss_pred cCCCceEEE----------cCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHH Confidence 788875332 222223345677778888999999887654444344456666665543 345557888889 Q ss_pred HHHHHHHHHHHHHHHhcCCcCCccccceEEec--CcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcc Q lcl|NC_016071. 393 IVEAFNKNLIPQLLALNDIRLSDEDMPKLKPG--LIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPED 470 (516) Q Consensus 393 i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~--~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~ 470 (516) |++.||+.|++..- ..... 
+|+|+ ..-..|++..+++++++++.|++.+ +.+|+.+|+|+-+.+| T Consensus 312 ie~~l~~~Ll~~~~------~~~g~--~i~fd~~~ll~~d~~~~~e~~~~~~~~G~~T~-----NE~R~~~g~~pi~ggD 378 (409) T protein:vir:96 312 YEEEFNRKLLTKTD------REKNR--YFKFNVKSYLRADSATQAEVYFKAVRSGYYTI-----NDIREWEDLPPVEGGD 378 (409) T ss_pred HHHHHHhhcCCccc------ccCcc--eEEeechhhhccCHHHHHHHHHHHHhCCCCCH-----HHHHHHhCCCCCCCcc Confidence 99999987765321 01112 35554 4456788999999999999999876 6799999999765555 Q ss_pred cccCcccccCCCCCCccc-ccccccCCCCCccccc Q lcl|NC_016071. 471 MSTDELLKLLGQDTSRSG-DGMTAGSNGNGTGKIS 504 (516) Q Consensus 471 ~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~ 504 (516) +..-...- .+-+..... ...+.| +...+.. T Consensus 379 ~~~~~~n~-~~~~~~~~~~~~~~gG---~~n~~e~ 409 (409) T protein:vir:96 379 KPLISGDL-YPIDTPLELRKSLKGG---DKNVNES 409 (409) T ss_pred eeeecccc-cccccchhhcccccCC---CCCcCCC Confidence 43321111 010000000 001111 1000000 No 62 >protein:vir:101647 Length: 460 # NCBI annotation: phage portal protein # Family: family:all:26542 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112492;genbank:gi:53793592;uniprot:Q5ZGG1;genbank:GeneID:3101755 Probab=99.60 E-value=8.6e-14 Score=92.21 Aligned_cols=416 Identities=12% Similarity=0.074 Sum_probs=205.3 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhccccc-CCcccHHHH--HHHhhChHHHHHHHHHHHH Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEEL-RWPCFLATV--EAMKQDHTVSTALDTKYVF 77 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~l-r~~~~~~~y--~~m~~D~~v~s~l~~Rk~~ 77 (516) |..++.+.-+-.. +.+......|..+. .+.. ..+.....+ +..++-+.|.+|+..+-.. 
T Consensus 1 ~~~~~~~~~~~~~--------------~~~~~~~~~~~~~~----g~~~~~~~~~~~~~~~~~a~~~~~v~~~v~~ia~~ 62 (460) T protein:vir:10 1 MANRIIRALRELT--------------GLDNKFNDAFIKYI----GQTFTKYDNNGKTYLEQGYNINPDVYSCISQMAAK 62 (460) T ss_pred CchhHHHHHhhhh--------------ccCCCchHHHHHhh----ccccCCCccchhhhhHHHHhcchHHHHHHHHHHHh Confidence 6555544432111 11111111222111 1111 122333333 2345689999999999999 Q ss_pred HhcCCceeeeCCCCCChhhH---------H------------------HHHHHHHHHhhccCcCCHHHHHHHHH-HHHhh Q lcl|NC_016071. 78 VTKAFNDFKVLYNRDSKASK---------D------------------AAEFVEYALKNLANQQTLRDIARSAA-TFNEY 129 (516) Q Consensus 78 v~~~~w~i~~~~~~d~~~~~---------~------------------~a~~v~~~l~~~~~~~~~~~~l~~~l-da~~~ 129 (516) |.+++|.+.-.... ....+ . ........+.+-+...++.+++..++ +.+.+ T Consensus 63 iA~lp~~v~~~~~~-g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~~~~~lll~ 141 (460) T protein:vir:10 63 TVAVPYTIKVVKDT-KAYQQLNNLNISTKGLYSFTQSLQKNRLDTKAFSETEKAFPLESPNPTQTWADIYSLYKTYMRLN 141 (460) T ss_pred hhhCceEEEeccCC-ccchhhhhhhhhhhhhHHHHHHhhcchhhhcccchhHHHHHHhCCCCCCCHHHHHHHHHHHHhhc Confidence 99999988643221 11000 0 00111223333344557788888877 67889 Q ss_pred cceeeeEEEeecccccccccceeeccccccCchhcccccceeecCCCceeeecccccccccccccccccccccccccccc Q lcl|NC_016071. 130 GFSIFEKVYRTESAPSKYAGYITIDKIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNL 209 (516) Q Consensus 130 G~S~~Eivw~~~~~~~~~~g~~~~~~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~ 209 (516) |-+..++++...+. ..|. +..|.+.++.+++ ...+.++..+... ..... ... T Consensus 142 Gnay~~i~r~~~~~---~~G~--~~~L~~l~~~~v~----v~~~~~~~~~~~~--~~~~~-----------------~~~ 193 (460) T protein:vir:10 142 GNCYFYLMSPDDGI---NAGV--PSQMYVLPAHLIK----IVLKDDINLLSTD--SPIKS-----------------YML 193 (460) T ss_pred CCeEEEEEecCCCc---cCce--eEEEEEEcCceEE----EEEcCCCceeeee--eeeeE-----------------EEE Confidence 99999988754321 1222 2345555554332 2223333222110 00000 001 Q ss_pred ccCCCccccccccEEEEeecCcC-----CccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccC Q lcl|NC_016071. 
210 TSSADEVFIPINKLMVMSLGGTE-----SNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAI 284 (516) Q Consensus 210 ~~~~~~~~iP~~k~i~~~~~~~~-----g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~ 284 (516) ..++....+|.+..|++++.... +..+|.|.+..+....-.-....++-..+... |+. |+.+-+... T Consensus 194 ~~~g~~~~~~~~evih~r~~~~~~~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~n-g~~-------~~~i~~~~~ 265 (460) T protein:vir:10 194 IQGDQFIEFNEDEVIHTKYANPNFDLQGSHLYGMSPIRAILRNINSQNSTIDNNVKTMQN-GGV-------FGFIHGGST 265 (460) T ss_pred ecCceeEEecccceEEEecCCCCcccccCccccccHHHHHHHHHHHHHHHHHHHHHHHhc-CCC-------cceeeecCC Confidence 12234456788887777654332 45789999999987777777777776666654 432 222222223 Q ss_pred CCCHHHHHHHHHHHHHHHHhhccc-ce--EEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhccc Q lcl|NC_016071. 285 DPKSPESEMVQGLMADAANAHAGE-QA--YFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGF 361 (516) Q Consensus 285 ~~~~~~~~~l~~l~~~~~~~~~g~-~a--~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqt 361 (516) ..++++. +++++.......|. .+ .++++.|++. ...+-+.....+.+..++...+|++++--.. T Consensus 266 ~l~~e~~---~~~~~~~~~~~~g~~n~g~~~vl~~g~~~----------~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp 332 (460) T protein:vir:10 266 GLTQPQA---DSLKQRLTEMDKSPDRLSQIAGASGEIAF----------TKISLNTDELKPFDYLKYDQKAICNALGWSD 332 (460) T ss_pred CCCHHHH---HHHHHHHHHHhcCccccCCceecCCCceE----------EEccCChhHHHHHHHHHHHHHHHHHHhCCCH Confidence 3333333 34444444433332 23 3566766532 2222233344566777888899999886644 Q ss_pred ccccCCc--cchhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHH Q lcl|NC_016071. 362 INLGNDG--QGSYNLSESKQ-SIHGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFV 438 (516) Q Consensus 362 Lts~~~~--~GS~Al~~vh~-ev~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~ 438 (516) --.+... +.+++-.+.+. ......+.--++.|++.||+.|++..-. ..-.+|.|+..+-..+..-.++. 
T Consensus 333 ~~lg~~~~~t~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~kl~~~~~~--------~~~~~i~~d~~~l~~l~~d~~~~ 404 (460) T protein:vir:10 333 KLLNNNEGGGLNTGNLEEERKRVVTDNIQPDLVILKQAFDKKFIKRFKG--------YENAVIEWDISELPEMQTDMVAM 404 (460) T ss_pred HHhCCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccc--------cCCceEEeecchhhhHHHHHHHH Confidence 3332222 22355444443 4445568888999999999888764311 11124556544332222223344 Q ss_pred HHHHhCCcccccHHHHHHHHHHcCCCCCC-C-cccccCcccccCCCCCCcccccccccCCCCCccc Q lcl|NC_016071. 439 QRIGAVGYLPKTPTVINKILEVGGFDEEI-P-EDMSTDELLKLLGQDTSRSGDGMTAGSNGNGTGK 502 (516) Q Consensus 439 ~~L~~~G~~~~~~~~~~~i~e~~Glp~~~-~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 502 (516) ..+++.|++.+ +.+|+.+|+|+-. + +|+......-.+.+.. .+... ..++...+ T Consensus 405 ~~~~~~g~~T~-----NE~R~~~g~~pi~~~~gD~~~~~~n~~~~~~~---~~~~~--~~~~nq~~ 460 (460) T protein:vir:10 405 ASWLNTIPVTP-----NEIRIAMKYETLNQDGMDIVFMPSNKVRIDDV---SNNLI--DSAFNQNQ 460 (460) T ss_pred HHHHhCCCCCH-----HHHHHHhCCCCCCCCCCCeeeecccccchhhc---ccccC--CCcccCCC Confidence 45678898875 6799999999632 1 2222111100000000 00000 00000000 No 63 >protein:vir:94426 Length: 409 # NCBI annotation: ORF009 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240003;genbank:gi:66395665;genbank:GeneID:5133086 Probab=99.60 E-value=2.9e-14 Score=94.79 Aligned_cols=396 Identities=11% Similarity=0.047 Sum_probs=199.3 Q ss_pred CCc-----cccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHH--HHHHHhhChHHHHHHHH Q lcl|NC_016071. 1 MST-----RFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLA--TVEAMKQDHTVSTALDT 73 (516) Q Consensus 1 ~~~-----r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~--~y~~m~~D~~v~s~l~~ 73 (516) |++ |++... ...... .+.+++ ..+..+ .++.+. ..+..++-+.|.+|+.. 
T Consensus 1 ~~~~~~~~~~k~~~--~~~~~~-~~~~~~----------~~~~~~----------~~~~~~~v~~~~a~~~~~v~~~i~~ 57 (409) T protein:vir:94 1 MAKENIVTRIKKKL--IDNWID-QSASKL----------YDFSPW----------KNKSFWGVINNTLETNETIFSAITK 57 (409) T ss_pred CcccccchhhhhHH--hhhhhc-CCcccc----------cccccc----------cCccccccchhhhhccHHHHHHHHH Confidence 543 222110 000100 001100 000000 001110 12334467889999999 Q ss_pred HHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHh-hccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccce Q lcl|NC_016071. 74 KYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALK-NLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYI 151 (516) Q Consensus 74 Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~-~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~ 151 (516) +-..|.++++.+.-.. +..+..+.+ .|. +-+...+..+++..++ +.+.+|-+..++++...+ . T Consensus 58 Ia~~ia~lp~~~~~~~---~~~~~~~~~----lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G-------~- 122 (409) T protein:vir:94 58 LSNSMASLPLKMYEDY---KVVNTEVSD----LLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYH-------Q- 122 (409) T ss_pred HHHhhhhCceeEeecc---cccchhHHH----HHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCC-------c- Confidence 9999999999874221 122223333 343 2344556778887755 578899999988775432 2 Q ss_pred eeccccccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCc Q lcl|NC_016071. 152 TIDKIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGT 231 (516) Q Consensus 152 ~~~~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~ 231 (516) +..|.+-++..+. ...+.+++.+...-+ ...+..+.+|...++.+++... T Consensus 123 -~~~L~~l~~~~v~----v~~~~~~~~~~y~~~-------------------------~~~g~~~~~~~~dvih~r~~~~ 172 (409) T protein:vir:94 123 -PSKLFLLNPDVVE----MLIENQSRELYYSIH-------------------------AATGNKLIVHNMDMLHFKHIVA 172 (409) T ss_pred -EEEEEEEcCceeE----EEEeCCCcEEEEEEE-------------------------cCCceEEEEccccEEEecCCCC Confidence 2233344443222 223444433221100 1112334567777666665545 Q ss_pred CCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceE Q lcl|NC_016071. 
232 ESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAY 311 (516) Q Consensus 232 ~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~ 311 (516) .+..+|.|.+..+....-....... |.. ..++.+ +..+-+.....+.++. +++++...+...+.... T Consensus 173 ~~~~~G~s~l~~~~~~i~~~~~~~~-~~~--~~~~~~-------~~~i~~~~~~l~~e~~---~~~~~~~~~~~~~~g~~ 239 (409) T protein:vir:94 173 SNMVQGISPIDVLKNTTDFDNAVRT-FNL--TEMQKP-------DSFMLKYGSNVGKEKR---QQVLEDFKQYYEENGGI 239 (409) T ss_pred CCccccccHHHHHHHHHHHHHHHHH-HHH--HhcCCC-------CeeEEecCCCCCHHHH---HHHHHHHHHHhhcCCCe Confidence 5667899998887665544433322 221 122221 1112222223333333 33344433333333445 Q ss_pred EEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHHH-HHHHHHHHHH Q lcl|NC_016071. 312 FILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQS-IHGHFVQRDI 390 (516) Q Consensus 312 ~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~e-v~~~~~~aDa 390 (516) ++++.|++++ ..+-+....++.+..++...+|++++--..--.+..+.++++-.+-+.. ....-+.--+ T Consensus 240 ~vl~~g~~~~----------~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~ 309 (409) T protein:vir:94 240 LFQEPGVEIE----------PLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNRFYLQHTLLPIV 309 (409) T ss_pred eecCCCceEE----------EcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHH Confidence 6777776432 2222333445677777888999999887765554444456655554443 3345577788 Q ss_pred HHHHHHHHHHHHHHHHHhcCCcCCccccceEEec--CcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCC Q lcl|NC_016071. 391 DIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPG--LIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIP 468 (516) Q Consensus 391 ~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~--~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~ 468 (516) +.|++.||+.|++.. . .... .+|+|+ ..-..|+++.+++++++++.|++.+ +.+|+.+|+|+-+. 
T Consensus 310 ~~ie~~ln~~Ll~~~---~---~~~~--~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~-----NE~R~~~g~~p~~g 376 (409) T protein:vir:94 310 KQYEEEFNRKLLTKT---D---REKN--RYFKFNVKSYLRADSATQAEVYFKAVRSGYYTI-----NDIREWEDLPPVEG 376 (409) T ss_pred HHHHHHHHHhhCCcc---c---ccCc--ceEEeechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCC Confidence 888888988776532 0 0111 235554 4446788999999999999999876 67999999996654 Q ss_pred cccccCcccccCCCCCCcccc-cccccCCCCCcccc Q lcl|NC_016071. 469 EDMSTDELLKLLGQDTSRSGD-GMTAGSNGNGTGKI 503 (516) Q Consensus 469 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~ 503 (516) +|+..-...- .+-+...... ..+.|. +.+.+. T Consensus 377 gD~~~~~~n~-~~~~~~~~~~~~~kGG~--~n~~e~ 409 (409) T protein:vir:94 377 GDKPLISGDL-YPIDTPLELRKSLKGGD--KNVNES 409 (409) T ss_pred cCeEeecccc-cccccchhhcccccCCC--CCcCCC Confidence 5543321110 0111111000 011110 001000 No 64 >protein:vir:4509 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599035;genbank:gi:19548993;genbank:GeneID:935206 Probab=99.59 E-value=9.5e-14 Score=91.97 Aligned_cols=400 Identities=9% Similarity=0.028 Sum_probs=202.6 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhhChHHHHHHHHHHHHHhc Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQDHTVSTALDTKYVFVTK 80 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~~v~~ 80 (516) +=+.+-.. +..+.|+.|... +. .. ..+.. 
..+..+ .-+..++-+.|.+|+..+-..|.+ T Consensus 18 ~~~~lf~~-----~~~~~~~~~~~~--~~----~~-~~~~~--------~~~~~v-s~~~al~~~~v~~cv~~Ia~~iA~ 76 (424) T protein:vir:45 18 LLDALFRS-----KSLENPSTPITG--DA----VD-TDGLF--------RADVYV-SPETAMKLAAVYSCIYVLSSSLAQ 76 (424) T ss_pred HHHhhccc-----cCCCCCccccch--hh----hh-hhccc--------cCCcee-chHHhhccHHHHHHHHHHHHHHhh Confidence 11111111 111123322210 00 00 00000 011111 124456678899999999999999 Q ss_pred CCceeeeCCCCCChhhHHHHHHHHHHHhh-ccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceeeccccc Q lcl|NC_016071. 81 AFNDFKVLYNRDSKASKDAAEFVEYALKN-LANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITIDKIAF 158 (516) Q Consensus 81 ~~w~i~~~~~~d~~~~~~~a~~v~~~l~~-~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l~~ 158 (516) +++++.-....+..... ..-+.+.|.. -+...+..++++.++ +.+.+|-++.++++...+. + ..|.+ T Consensus 77 lp~~v~~~~~~~~~~~~--~~~l~~lL~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~G~---~------~~L~~ 145 (424) T protein:vir:45 77 MPLHVMRRHKGKVEPAR--DHPAFYLVHDEPNTWQTSYKWRELKQRHILGWGNGYTWVKRNRRGE---V------ISLDC 145 (424) T ss_pred CceEEEEecCCceeecc--cchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCc---E------EEEEE Confidence 99987543222211111 1123334432 233455667777655 6788999999998865431 1 12333 Q ss_pred cCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCccccc Q lcl|NC_016071. 159 RPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPAGV 238 (516) Q Consensus 159 r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~G~ 238 (516) .++..+. ....+++....+. .......+|++.+|.+++. ..+.++|. 
T Consensus 146 l~~~~v~-----i~~~~~~~~y~~~---------------------------~~~~~~~~~~~eVih~r~~-~~d~~~G~ 192 (424) T protein:vir:45 146 CMPWETT-----LMNTGGRYTYGLY---------------------------NEYGAFAISPDDMIHIRAL-GNNQKMGL 192 (424) T ss_pred ecCceEE-----EEEcCCeEEEEEE---------------------------ecCceEEECcccEEEecCc-CCCCcccc Confidence 3332221 1122232221111 1112334677666555554 44568999 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcc--cce--EEEe Q lcl|NC_016071. 239 SPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAG--EQA--YFIL 314 (516) Q Consensus 239 gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g--~~a--~~ii 314 (516) |.+..++...-.-....++...+...-+.+--+++.+ ..-+++.. +.+++..+....| .++ .+++ T Consensus 193 spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~--------~~l~~e~~---~~~~~~~~~~~~g~~~n~g~~~vl 261 (424) T protein:vir:45 193 SPIMQHAETIGMGMSGQKYTESFFSGNARPAGIVSVK--------SGLNKESW---GWLKDQWQKASQALRRQENKTMLL 261 (424) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeC--------CCCCHHHH---HHHHHHHHHHhccccccCCceeEc Confidence 9999888776666666666666665433333333322 22233333 3333333333333 233 4677 Q ss_pred ccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHH-HHHHHHHHHHHHHH Q lcl|NC_016071. 315 PSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQ-SIHGHFVQRDIDII 393 (516) Q Consensus 315 P~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~-ev~~~~~~aDa~~i 393 (516) +.|++.+- .+-+.....|.+..++-..+|++++.-..--.+..+.++++-.+-+. 
..-..-+.-.++.| T Consensus 262 ~~g~~~~~----------l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~eq~~~~f~~~tL~P~~~~i 331 (424) T protein:vir:45 262 PADLDYKA----------LTVSPVDAQIIDMMKLNRSMIAGIFNIPAHMINDLEKATFSNISAQAIQFVRYTMMPWVTNW 331 (424) T ss_pred CCCceEEE----------ccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHH Confidence 88864332 11122333466677788899999988766544433445565444443 33455578888899 Q ss_pred HHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCccccc Q lcl|NC_016071. 394 VEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMST 473 (516) Q Consensus 394 ~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~ 473 (516) ++.||+.|+..--... + .+-+|..+..-..|+++.+++++++++.|++.+ +.+|+.+|+|+-+.+|+.. T Consensus 332 e~~ln~kLl~~~e~~~-----g-~~i~fd~~~llr~d~~~r~~~~~~~~~~g~~T~-----NE~R~~~gl~pi~ggD~~~ 400 (424) T protein:vir:45 332 EQELNRRLFTRAELAA-----G-YYVRFNLTGLLRGTPQERAQFYHFAITDGWMSR-----NEARAFEDMNPVEGLDEML 400 (424) T ss_pred HHHHHHhcCChhhhcC-----C-cEEEeechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCcceee Confidence 9999987765421111 1 112333344445788999999999999998875 5799999999655555443 Q ss_pred CcccccCCCCCCcccccccccCCCCCcccc Q lcl|NC_016071. 474 DELLKLLGQDTSRSGDGMTAGSNGNGTGKI 503 (516) Q Consensus 474 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 503 (516) ..... .++.++...... +.++... 
T Consensus 401 ~~~n~-----~~~~~~~~~~~~-~~~~~~~ 424 (424) T protein:vir:45 401 VSVNA-----ANPAGDFKPPKN-DEGKTNE 424 (424) T ss_pred ecccc-----cccccccCCCCC-CCCCCCC Confidence 22111 111111100000 0000000 No 65 >protein:vir:81218 Length: 423 # NCBI annotation: gp3, phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456733;genbank:gi:157168376;interpro:IPR006427;interpro:IPR006944;uniprot:Q9MBK2;genbank:GeneID:5580341 Probab=99.59 E-value=4.4e-14 Score=93.83 Aligned_cols=411 Identities=13% Similarity=0.059 Sum_probs=199.7 Q ss_pred CC--ccccCcccccchhhhcccCCCCccc-ccchHHHHHHHHHHHhhcccccCCcccHHHHHHH-hhChHHHHHHHHHHH Q lcl|NC_016071. 1 MS--TRFAQPSEVVKAGNENLAVSRLRTG-ELGSGALSQLRAESEVMKVEELRWPCFLATVEAM-KQDHTVSTALDTKYV 76 (516) Q Consensus 1 ~~--~r~~~~~~~~~~~~~~p~~~~~~~~-e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m-~~D~~v~s~l~~Rk~ 76 (516) |. .++..+.... ...+ .+.+. .++. +...-...+ +.+.+ ++.+.|.+|+..+-. T Consensus 1 Mg~~~~~~~~~~~~---~~~~---~~~~~~~~~~---------------~~~~~~~~~-~~~~~~~~~~~v~~~i~~ia~ 58 (423) T protein:vir:81 1 MGFLQKLGLAPSVV---ATPE---PIELVGPIFE---------------SLKLSTKNM-TVEQIWEDQPHLRTVTTFIAR 58 (423) T ss_pred CchhHhhccccccc---cCcc---cccccccccc---------------ccccccchh-hHHHHHHhhhHHHHHHHHHHH Confidence 33 2232111110 1111 11111 0000 000001112 23343 468999999999999 Q ss_pred HHhcCCceeeeC-CCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceeec Q lcl|NC_016071. 77 FVTKAFNDFKVL-YNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITID 154 (516) Q Consensus 77 ~v~~~~w~i~~~-~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~~ 154 (516) .|.++++.+.-. .+.+.+..+. .-+...|++-+...++.++++.++ +.+.+|-++..++-.. ++. 
...+ T Consensus 59 ~ia~lp~~~~~~~~dg~~~~~~~--~~~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~rd~-~~~---~~~~--- 129 (423) T protein:vir:81 59 NVASLQLQAFERVEDGGRERVRE--GHLARVCKLANSDMTMYDLLERTMFDLCLYDEFFWLLPGDL-GVD---TPTL--- 129 (423) T ss_pred hHhhCceEEEEEecCCceeeecc--chHHHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecC-CcC---cceE--- Confidence 999999876321 1111111111 112233444334456778887765 5678898777665332 211 1112 Q ss_pred cccccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCc Q lcl|NC_016071. 155 KIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESN 234 (516) Q Consensus 155 ~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~ 234 (516) .|.+.+...+.... ..+..+++...+ .......+..+.+|.+.+|+++.....+. T Consensus 130 ~l~p~~~~~v~~~~--~~~~~~~~~Y~~-----------------------~~~~~~~g~~~~~~~~evih~r~~~~~~~ 184 (423) T protein:vir:81 130 DIRPIPVSWVQRRA--YKDGWGSLDYII-----------------------IESGDNDGRSVKVPGERVIHRHGYNPKTM 184 (423) T ss_pred EEeecccceeeeee--ccCCCcceEEEE-----------------------EEecCCCceEEEEcccceEEecCCCCCCc Confidence 23333332222100 001112111000 00011123345677776555543444555 Q ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhh-cc-cce-- Q lcl|NC_016071. 235 PAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAH-AG-EQA-- 310 (516) Q Consensus 235 p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~-~g-~~a-- 310 (516) .+|.|.+..++...-.-....++-..+...-+.+--+++.++... +..-+++ ..+++++..+... 
.| .++ T Consensus 185 ~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gvi~~~~~~~---~~~l~~e---~~~~~~~~~~~~~~~~~~n~g~ 258 (423) T protein:vir:81 185 KRGKSPVQSLRDILGEQIEAAIFRAQMWRNGPRPGMVIMRDPESK---AGKWDAE---SRTRFMANLRASFSPKSSDVGG 258 (423) T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCccc---CccCCHH---HHHHHHHHHHHHhccccccCCc Confidence 689999999998776666677777777654333333443332110 0111222 2233333333221 12 222 Q ss_pred EEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHH-HHHHHHHHHH Q lcl|NC_016071. 311 YFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQ-SIHGHFVQRD 389 (516) Q Consensus 311 ~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~-ev~~~~~~aD 389 (516) -++++.|+++.- .+-+.....+.+..++...+|++++.-..--.+..++++|+-.+-+. .....-+.-. T Consensus 259 ~~vl~~g~~~~~----------l~~s~~d~q~~e~~~~~~~eIa~~fgVPp~~lg~~~~~t~sn~e~~~~~f~~~~L~P~ 328 (423) T protein:vir:81 259 TLLLEDGMKAEN----------FHTTSKDEQTVETTKLSLQTVAQVYGINPTMVGQLDNANYSNVREFRKALYGDNLGSW 328 (423) T ss_pred ceecCCCceEEe----------ccCChhhHHHHHHHHhhHHHHHHHhCCCHHHhcCCCCCCcccHHHHHHHHHHHHHHHH Confidence 457788864332 11122333466667788899999887654333333345565444443 3444467888 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHH-hCCcccccHHHHHHHHHHcCCCCCCC Q lcl|NC_016071. 390 IDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIG-AVGYLPKTPTVINKILEVGGFDEEIP 468 (516) Q Consensus 390 a~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~-~~G~~~~~~~~~~~i~e~~Glp~~~~ 468 (516) ++.|++.||+.|+++.-.-+ ...+-+|.++.....|+++.++++++++ +.|++.+ +.+|+.+|+|+-.. 
T Consensus 329 ~~~ie~~l~~~L~~~~~~~~-----~~~~~~fd~~~llr~d~~~r~~~~~~~l~~~G~~T~-----NE~R~~~gl~p~~g 398 (423) T protein:vir:81 329 IRIIQDVMNLFLLPRVGIDN-----EKFYFEFNLEEKLRASFEEAAEIKRAAVGNVAWMTI-----NEVRAMDNLPSIDG 398 (423) T ss_pred HHHHHHHHhhhhcCcccccc-----CccEEEecchhhhccCHHHHHHHHHHHHhCCCCcCH-----HHHHHHhCCCCCCC Confidence 89999999998876532111 1122234344556678899999998866 6788775 57999999997655 Q ss_pred cccccCcccccCCCCCCcccccccccCCCCCccc Q lcl|NC_016071. 469 EDMSTDELLKLLGQDTSRSGDGMTAGSNGNGTGK 502 (516) Q Consensus 469 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 502 (516) +|+......-...+...+.++. ..+ T Consensus 399 GD~~~~p~n~~~~~~~~~~~~~---------~~t 423 (423) T protein:vir:81 399 GDDLARPLNTEFGDSEDAPGEE---------VET 423 (423) T ss_pred cceeecccccccCccCCCCCCC---------CCC Confidence 5554322111111111111110 000 No 66 >protein:vir:81072 Length: 432 # NCBI annotation: p07 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285677;genbank:gi:148727185;genbank:GeneID:5247117 Probab=99.59 E-value=5.1e-14 Score=93.46 Aligned_cols=414 Identities=11% Similarity=0.088 Sum_probs=199.5 Q ss_pred CCccccC----cccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhhChHHHHHHHHHHH Q lcl|NC_016071. 1 MSTRFAQ----PSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQDHTVSTALDTKYV 76 (516) Q Consensus 1 ~~~r~~~----~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~ 76 (516) |=+|.+. ++.+...+.. +..+.. .....-+.. . ...+..+ ..+..++-+.|.+|+..+-. 
T Consensus 9 ~f~r~~~~~~~~~~~~~~~~~--~~~~~~-------~~~~~~~~~----~--~~~g~~v-~~~~al~~~~V~~~i~~Ia~ 72 (432) T protein:vir:81 9 LFGQLKAMFVPPDPVDIGGGQ--TFTPVN-------ATARDLGII----I--SDTGAAV-NADAIMRLDAVAACVKLVSQ 72 (432) T ss_pred hhhhhhhhccccccccccccc--ccccCc-------cchhhhccc----c--cccCccc-chHhhhccHHHHHHHHHHHH Confidence 2222211 1111110000 000000 000000000 0 0011111 12445578999999999999 Q ss_pred HHhcCCceeeeCCCCCChhhHHHHHHHHHHHh-hccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceeec Q lcl|NC_016071. 77 FVTKAFNDFKVLYNRDSKASKDAAEFVEYALK-NLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITID 154 (516) Q Consensus 77 ~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~-~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~~ 154 (516) .|.++++.+..... +... +..-.-+...|. +-+...+..++++.+. +.+.+|-+..++++. ++ ++ . T Consensus 73 ~ia~lp~~~y~~~~-~g~~-~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnayv~i~~~-~g---~~------~ 140 (432) T protein:vir:81 73 AIAAMPLTMYMRTP-DGRK-EAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT-DG---RI------E 140 (432) T ss_pred hhhhCceeeEEecC-Ccce-ecccchHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec-CC---cE------E Confidence 99999988643221 1110 000111223333 2233455677777766 578899999988774 21 22 2 Q ss_pred cccccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCc Q lcl|NC_016071. 155 KIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESN 234 (516) Q Consensus 155 ~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~ 234 (516) .|.+.++..+. ...+.+|+...... ...+....+|.+.++.+++.+..+ T Consensus 141 ~L~~l~~~~v~----v~~~~~g~~~y~~~--------------------------~~~g~~~~~~~~~iih~r~~~~dg- 189 (432) T protein:vir:81 141 SLQYLANDRLT----ITTDPKGNTAYRYR--------------------------RTDGQMIDIPKQQIWKIMGYSLDG- 189 (432) T ss_pred EEEEEcCCceE----EEECCCCcEEEEEE--------------------------ecCceEEEEccccEEEecCCCCCC- Confidence 33333333222 23345554332111 112234467777777666655555 Q ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEe Q lcl|NC_016071. 
235 PAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFIL 314 (516) Q Consensus 235 p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~ii 314 (516) .+|.|.+..+....-.-....++-..+...-+.+--+++. +...+++..+ ++++..+... +....+++ T Consensus 190 ~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~--------~~~l~~e~~~---~~~~~~~~~~-nag~~~vl 257 (432) T protein:vir:81 190 ENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQI--------DRFLTDDQYD---SFAKKVSGSV-EAGRAPLL 257 (432) T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEec--------CCCCCHHHHH---HHHHHHhhhh-cCCCceec Confidence 6899999998876666665666666665432222222222 2223333333 3333333221 11235677 Q ss_pred ccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHH----HHHHHHHHHHH Q lcl|NC_016071. 315 PSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQ----SIHGHFVQRDI 390 (516) Q Consensus 315 P~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~----ev~~~~~~aDa 390 (516) |.|++++. .+-+....++.+..++..++|++++--..--.+..+.|+++.+...+ ..-..-+.--+ T Consensus 258 ~~g~~~~~----------l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~sn~eq~~~~f~~~tl~P~~ 327 (432) T protein:vir:81 258 EGGMDVKS----------LGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQLGFLTMTLSPWL 327 (432) T ss_pred CCCceEEE----------ccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCccccccchHHHHHHHHHHHHHHHHH Confidence 88875322 11222334566777888899999887765444434344444332222 22334566677 Q ss_pred HHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcc Q lcl|NC_016071. 
391 DIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPED 470 (516) Q Consensus 391 ~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~ 470 (516) +.|+..||+.|+.+-- ....+.+|..+..-..|.++.++++.++++.|++.+ +.+|+.+|+|+-.+++ T Consensus 328 ~~ie~~l~~kLl~~~~-------~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~t~-----NE~R~~~glpp~~g~~ 395 (432) T protein:vir:81 328 RRIEQSIALNLLSPAE-------RRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTR-----DEAREIEGLPKLGGNA 395 (432) T ss_pred HHHHHHHHhhccCccc-------cCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCH-----HHHHHHhCCCCCCCCc Confidence 7788888876665311 111122333334456788999999999999998876 6799999999764443 Q ss_pred cccCcccccCCCCCCcccccccccCCCCCcccccccccchhhh Q lcl|NC_016071. 471 MSTDELLKLLGQDTSRSGDGMTAGSNGNGTGKISSTRDNSVSN 513 (516) Q Consensus 471 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~ 513 (516) ..........+-. .....+. ..++.+++.. .++..+. T Consensus 396 ~~~~~~~~~~pl~-~~~~~~~--~~~~~~~~n~---~~~~~~~ 432 (432) T protein:vir:81 396 AVLTVQSAMVPLD-SIGLQAS--PEPASGLGNQ---QQDKVSK 432 (432) T ss_pred ceEeecCcccchh-hhccCCC--CCCCCCCCCc---ccccccC Confidence 3222111111100 0000000 0001111110 0111111 No 67 >protein:vir:960 Length: 413 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076614;genbank:gi:13095722;genbank:GeneID:920279 Probab=99.56 E-value=3.7e-13 Score=88.77 Aligned_cols=392 Identities=12% Similarity=0.029 Sum_probs=201.2 Q ss_pred CCccccCcc-----------cccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhhChHHHH Q lcl|NC_016071. 1 MSTRFAQPS-----------EVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQDHTVST 69 (516) Q Consensus 1 ~~~r~~~~~-----------~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s 69 (516) ||.-.+..- .-.+...+.++.|...+. ...+... 
+ ....|..+++.+.|.+ T Consensus 4 ~~~~~~~~~m~~F~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~-----~----~~~~~~~~~~~~~v~~ 65 (413) T protein:vir:96 4 VSEIRKDKNLKFFNNKRSPTEESKAKDEIPKAPQVVMT---------LPNFFKE-----L----ISDGYTKLSDSPEVRM 65 (413) T ss_pred cchhhhhhcCCccccCCCcchhhhhhcccccccccccc---------chhhHhh-----h----ccchhHHHhhchHHHH Confidence 333222211 111111112222211110 0001000 0 1122455667899999 Q ss_pred HHHHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHh-hccCcCCHHHHHHHHH-HHHhhcceeeeEEEeeccccccc Q lcl|NC_016071. 70 ALDTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALK-NLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKY 147 (516) Q Consensus 70 ~l~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~-~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~ 147 (516) |+..+...|.+++|.+....+ +. .+++..-+...|. +-+...++.+++..+. +.+.+|.++++++....++ . T Consensus 66 cI~~ia~~ia~~~~~~~~~~~-~~--~~~~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~r~~~g~--~- 139 (413) T protein:vir:96 66 AVDCIADLVSNMTIQLMQNGE-TG--DKRIKNDLSRVVDIEPNKYLSRKTFIQWLVRSMLLEGNGNAVVKPQVSGD--K- 139 (413) T ss_pred HHHHHHHhhccCceEEEEecC-CC--ccccccHHHHHHHhccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCC--c- Confidence 999999999999998753322 11 1122222333343 3334456778887766 4678999999998765432 1 Q ss_pred ccceeeccccccCchhcccccceeecCC-CceeeeccccccccccccccccccccccccccccccCCCccccccccEEEE Q lcl|NC_016071. 148 AGYITIDKIAFRPQSSLSRSKPWVFDED-GRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVM 226 (516) Q Consensus 148 ~g~~~~~~l~~r~q~ti~~~~~f~~~~d-g~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~ 226 (516) +..|.+.++.++. +..+ +...... ...+..++.+.+|++ T Consensus 140 -----~~~L~~l~~~~v~------~~~~~~~~~y~~-----------------------------~~~~~~~~~~evih~ 179 (413) T protein:vir:96 140 -----IIGLTPISPYKVT------FNVSDDDLDYSI-----------------------------TFDNKEYDPSTLLHF 179 (413) T ss_pred -----eEEEEEecCceeE------EEEcCCeEEEEE-----------------------------eecCcEEchhhEEEE Confidence 1223333333222 2211 1111110 011224566666666 Q ss_pred eecCcC-CccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhh Q lcl|NC_016071. 
227 SLGGTE-SNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAH 305 (516) Q Consensus 227 ~~~~~~-g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~ 305 (516) ++.... +-.+|.|++..+....-.-....++...+....+.|--+++.+ .+.+++. .+++++..+... T Consensus 180 k~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~--------~~l~~e~---~~~~~~~~~~~~ 248 (413) T protein:vir:96 180 VLNPSIERPFIGTGYKVALKDIVGNLKQASVTKKGFMASEYMPNLIVSVD--------SDSDELS---DEEGRENFEEMY 248 (413) T ss_pred eccCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeC--------CCCCHHH---HHHHHHHHHHHh Confidence 765444 3457999999998877777777777777777655444444432 2222222 233444444433 Q ss_pred cc-cceE--EEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHHHHH Q lcl|NC_016071. 306 AG-EQAY--FILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQSIH 382 (516) Q Consensus 306 ~g-~~a~--~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~ev~ 382 (516) .| ..++ +++|.|+.- .. ++... +.....+.+..++.-++|++++--..--.+. +.++.++ ..... T Consensus 249 ~g~~n~g~~~vl~~~~~~-~~-----~~~~~--~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~-~~~~~~~---~~~~~ 316 (413) T protein:vir:96 249 LKRKEAGKPWIIPEGMVN-VQ-----QIKPL--TLNDLAINDAVTLDKKTVAGIFGVPAFLLGV-GTYNKDE---FNNFI 316 (413) T ss_pred cCccccCceeeecCCccc-cc-----ccccC--ChhHHHHHHHHHHHHHHHHHHhCCCHHHcCC-CcchHHH---HHHHH Confidence 33 2333 566777531 10 11111 1122345566678888999988776533321 1133332 23344 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcC Q lcl|NC_016071. 
383 GHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGG 462 (516) Q Consensus 383 ~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~G 462 (516) ..-+.-.++.|++.||+.|++ ...+-+|.++.....|+++.++++++++..|++.+ +.+|+.+| T Consensus 317 ~~~l~P~~~~ie~~ln~~ll~-----------~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~-----NE~R~~~g 380 (413) T protein:vir:96 317 NTKIMSIAQVIQQTYNKLIVE-----------EDMYFSLNPRSLYNYSLTEMVSAGAQMTQLNALRR-----NEFRNWVG 380 (413) T ss_pred HHHHHHHHHHHHHHHHHhhCC-----------CCcEEEEechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhC Confidence 556777888888888876643 12233455555567789999999999999999876 57899999 Q ss_pred CCCCCCcccccCcccccCCCCCCcccccccccCCCCCc Q lcl|NC_016071. 463 FDEEIPEDMSTDELLKLLGQDTSRSGDGMTAGSNGNGT 500 (516) Q Consensus 463 lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 500 (516) +|+-..+|+...... -.+-........ ..+..| T Consensus 381 ~~p~~~gd~~~~~~n-~~~~~~~~~~~~----~~~~dt 413 (413) T protein:vir:96 381 MPPDAEMDDLLVLEN-YLQQKDLVNQKK----LIQDET 413 (413) T ss_pred CCCCCCcceeeeccc-ccchhhcccccC----CCCCCC Confidence 986544443321111 011000000000 001111 No 68 >protein:vir:3868 Length: 417 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680485;swissprot:trembl:q8ltc2;genbank:gi:22296525;interpro:IPR006427;interpro:IPR006944;uniprot:Q8LTC2;genbank:GeneID:951699 Probab=99.53 E-value=4e-13 Score=88.58 Aligned_cols=403 Identities=13% Similarity=0.018 Sum_probs=191.9 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhhChHHHHHHHHHHHHHhc Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQDHTVSTALDTKYVFVTK 80 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~~v~~ 80 (516) |+=...........+... .......|..++. -+. 
..-++-+.|.+|+..+...|.+ T Consensus 1 m~~~~~~~~~~~~~~~~~---------------------~~~~~~~~~~~g~-~~~--~~Al~~~~V~~cv~~ia~~iA~ 56 (417) T protein:vir:38 1 MKLFRGLATEVDPHWADH---------------------LLDSGVIPSFRGG-YLG--ISALRNSDVLTAVSIVSGDVSR 56 (417) T ss_pred CccccccccCCCccchhh---------------------hcccccccccCCc-eec--hhhcccHHHHHHHHHHHHhhcc Confidence 443211111111111000 0001111111211 011 1223567899999999999999 Q ss_pred CCceeeeCCCCCChhhHHHHHHHHHHHh-hccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceeeccccc Q lcl|NC_016071. 81 AFNDFKVLYNRDSKASKDAAEFVEYALK-NLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITIDKIAF 158 (516) Q Consensus 81 ~~w~i~~~~~~d~~~~~~~a~~v~~~l~-~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l~~ 158 (516) +++.+.-..+ +...+. .-+...|. +-+...++.+++..++ +.+.+|.++.+++....++ . +..|.+ T Consensus 57 lp~~~~~~~~-~~~~~~---~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~y~~i~r~~~g~--~------~~~l~~ 124 (417) T protein:vir:38 57 FPLVITDSST-DEVIDL---ANIEYLMNTKVNKRLSAYQWKFPMMVNAILTGNAYSRIVRDPITN--E------PAMFEF 124 (417) T ss_pred CeeEEEEcCC-cceecc---chHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCC--E------EEEEEE Confidence 9998754322 221111 11223343 3344456777877755 4788999999998754332 1 122333 Q ss_pred cCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCccccc Q lcl|NC_016071. 159 RPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPAGV 238 (516) Q Consensus 159 r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~G~ 238 (516) .++.++. ...+++|+...... .........+|...+|.+++.+. +..+|. 
T Consensus 125 l~p~~v~----v~~~~~~~~~y~~~-------------------------~~~~~~~~~~~~~dviH~r~~~~-d~~~G~ 174 (417) T protein:vir:38 125 YAPSQTQ----VDTSDPDNIIYRFT-------------------------PYNSSMQKVCGFEDVIHWKFFSY-DTIMGR 174 (417) T ss_pred eCCceEE----EEEcCCCeEEEEEE-------------------------EcCCcEEEEecCcceEEecCCCC-CCcccc Confidence 3333221 12233443322111 01111223456666666666543 447899 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccce--EEEecc Q lcl|NC_016071. 239 SPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQA--YFILPS 316 (516) Q Consensus 239 gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a--~~iiP~ 316 (516) |.+..+....-.-....++...|...-+.|--+++ .+..-++++. +++++.......|..+ -++++. T Consensus 175 s~l~~~~~~i~~~~~~~~~~~~~f~ng~~p~~il~--------~~~~l~~e~~---~~~~~~~~~~~~g~n~g~~~vl~~ 243 (417) T protein:vir:38 175 SPLLSLGDEIGLQESGVSTLQKFFKSGLKGSIIKA--------KESRLSAEAR---QKIREDFERAQAGADAGSPIIVDA 243 (417) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEE--------eCCCCCHHHH---HHHHHHHHHHhcccccCCceeccC Confidence 99999887776666666776776654333322222 2222233333 3344444444444444 345688 Q ss_pred CcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHH-HHHHHHHHHHHHHHHHHH Q lcl|NC_016071. 317 DMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSES-KQSIHGHFVQRDIDIIVE 395 (516) Q Consensus 317 g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~v-h~ev~~~~~~aDa~~i~~ 395 (516) |++.+- ++. +.....|.+..++...+|++++.-..-..+.. 
++++-.+- -......-++-.++.|++ T Consensus 244 g~~~~~--------l~~--~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~--~~~s~~e~~~~~~~~~tl~P~~~~ie~ 311 (417) T protein:vir:38 244 TMDYQP--------LEV--DTNVLNLINSNNYSTAQIAKALRVPAYRLAQN--SPNQSVKQLADDYIRNDLPFYFEPITS 311 (417) T ss_pred CceEEE--------ccC--CHHHHHHHHHHHhhHHHHHHHhCCCHHHhCCC--CcchhHHHHHHHHHHHHHHHHHHHHHH Confidence 864221 111 22333466677788899998776554333322 23332222 233444567788888999 Q ss_pred HHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCccc--cc Q lcl|NC_016071. 396 AFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDM--ST 473 (516) Q Consensus 396 ~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~--~~ 473 (516) .||+.|+.+.-. . . -.|.|+... .+ ....+.++++++.|++.+ +.+|+.+|+|+-..++. .. T Consensus 312 ~l~~~Ll~~~~~------~--~-~~~~fd~~~-l~-~~~~~~~~~~~~~G~~T~-----NE~R~~~gl~pi~~g~~d~~~ 375 (417) T protein:vir:38 312 EFELKLLDDAQR------H--Q-YCIGFDTKS-VN-GLPIADVNTAVNGGLWTG-----NEGRAELGKKPLKDPNMDRIQ 375 (417) T ss_pred HHHhhhcChhhc------c--c-ceEEechhh-hh-HHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCCCCeee Confidence 999877654311 1 1 146776432 22 222456788999999876 57999999985433321 11 Q ss_pred Cccc-ccCC---CC-CCcccccccccC-CCCCcccccccccchhhhhcC Q lcl|NC_016071. 474 DELL-KLLG---QD-TSRSGDGMTAGS-NGNGTGKISSTRDNSVSNMDN 516 (516) Q Consensus 474 ~~~~-~~~~---~~-~~~~~~~~~~~~-~~~~~~~~~~~~d~~~~~~~~ 516 (516) .... .+.. +. .+.... .+.+. ++.+. 
.+....-+| T Consensus 376 ~~~n~~~~d~~~~~~~~~~~~-~kgg~~~~~~~-------~~~~~~~~~ 416 (417) T protein:vir:38 376 STLNTVFLDQKEAYQAEHAAE-LKGGDTNAKGN-------QNGSGTNAN 416 (417) T ss_pred ecccccccccccccccccccc-cCCCCCCCCCC-------CcCCCCcCC Confidence 1000 0000 00 000000 00000 00111 011111111 No 69 >protein:vir:4156 Length: 542 # NCBI annotation: portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046965;genbank:gi:9630535;genbank:GeneID:1261709 Probab=99.51 E-value=1.4e-12 Score=85.58 Aligned_cols=438 Identities=13% Similarity=0.089 Sum_probs=202.1 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHh-hChHHHHHHHHHHHHHh Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMK-QDHTVSTALDTKYVFVT 79 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~-~D~~v~s~l~~Rk~~v~ 79 (516) ||=|....-.-.+ ++..+... ++.. .+.++ .+-|-++.-+.+|. ..+.|.+|++.+...|. T Consensus 6 ~~i~s~~~~~~i~--~~~~~s~~-----~~~~---~~~~~--------~~pp~~~~~la~l~~~n~~v~scI~~ia~~IA 67 (542) T protein:vir:41 6 LSIRSLEKYKAIK--REEVESQA-----LGET---RFEEY--------VEPKVNPLVLLSLLQVNPYHASACSIKANDII 67 (542) T ss_pred ccccccccchhhh--hccccccc-----cccc---cCCcc--------ccCCCCHHHHHHHHhhcHHHHHHHHHHHHHHh Confidence 3333322111111 11111111 1100 01111 11123444455555 48999999999999999 Q ss_pred cCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceeeccccc Q lcl|NC_016071. 80 KAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITIDKIAF 158 (516) Q Consensus 80 ~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l~~ 158 (516) +++|.+... +. ..+...+. 
+...++.+++..++ +.+.+|.+.+|+++...+ .+.+ |.+ T Consensus 68 ~l~~~~~~~---~~-------~~l~~~lp--N~~~s~~~f~~~~v~~lll~Gnayi~i~rd~~G---~~~~------L~~ 126 (542) T protein:vir:41 68 RTGYILEGD---DE-------GVVDEFIR--ACKPSFEYVLLRALEDLQVFNYCTLEVVRDDRG---DPIR------FEY 126 (542) T ss_pred hCceeeecc---cc-------hhhhhhcC--CCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCC---cEEE------EEE Confidence 999987522 11 11223332 23456778887777 678899999999886543 2222 333 Q ss_pred cCchhcccccceeecCCCceeeeccccccc--cccccccccccccccccccccccCCCccccccccEEEEeecCcCCccc Q lcl|NC_016071. 159 RPQSSLSRSKPWVFDEDGRTLKGIYQSKMA--FANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPA 236 (516) Q Consensus 159 r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~ 236 (516) .++.+++ +..|+.......+.... +..+.. ..............+|..-.|.+++....+.+| T Consensus 127 l~~~~v~------v~~d~~~~~~~~~~~~~~~~~~y~~---------~~~~~~~~g~~~~~~~~~eIiHir~~~~~~~~~ 191 (542) T protein:vir:41 127 IPSHTIR------VHKDGSRYRQTWDGVNITHFKDYRY---------EGEINPETGEDQDSVGANELVFIHIPSPVCSYY 191 (542) T ss_pred EcCcceE------EEEcCCeeEeeecCCcceeEEeecc---------cccccccccccccccCcccEEEecCCCCCCCcc Confidence 3443332 22233222221111110 000100 001111222334456776666666556577789 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHH-hhcc-cce--EE Q lcl|NC_016071. 237 GVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAAN-AHAG-EQA--YF 312 (516) Q Consensus 237 G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~-~~~g-~~a--~~ 312 (516) |.|.+..+......-....++-..+...-+.|--+++.+-....+...+.. ...+..+.+++.... 
+..+ ..+ .+ T Consensus 192 Glspi~~~~~~i~~~~~~~~~~~~~f~Ng~~p~gIL~~~~~l~de~~~~~~-~~~e~~~~lk~~~~~~~~g~~~n~gk~~ 270 (542) T protein:vir:41 192 GVPRYVSAAPAILAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDELEEDPD-GNPTGRTVIQALIEDNFKHLKEAPHTPL 270 (542) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCccccccccccc-cCHHHHHHHHHHHHHHHhhhhcccCcee Confidence 999999998887777777777777766544454455544333222221111 111223334433322 2211 122 33 Q ss_pred Eec--cCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchh--hHHHHHH-HHHHHHHH Q lcl|NC_016071. 313 ILP--SDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSY--NLSESKQ-SIHGHFVQ 387 (516) Q Consensus 313 iiP--~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~--Al~~vh~-ev~~~~~~ 387 (516) +++ .+. ...+++...+-+.....|.+..++...+|++++.-..--.+..+++|+ +-.+.+. ......+. T Consensus 271 vL~~~~~~------~~g~~~~pl~~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~t~n~sn~Eq~~~~f~~~tL~ 344 (542) T protein:vir:41 271 VFSIPGGD------TVKVTFTPLNTSQKELSFREYAAEKKYDIAAAHMIDPYRLGIADTGPLGGNFAEVTRRTYYESVVR 344 (542) T ss_pred EeeccCCc------ccceeEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCcCCCcccccccHHHHHHHHHHHHHH Confidence 443 111 112444444444444557777788889999998665433332222322 3344333 34466678 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHc-CCCCC Q lcl|NC_016071. 388 RDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVG-GFDEE 466 (516) Q Consensus 388 aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~-Glp~~ 466 (516) -.++.|+..||+.|++.. ..+ . +|.|+...-.. 
....+.++.+++.|++.+ +.+|+.+ |+++ T Consensus 345 P~~~~ie~~ln~~L~~~~-------~~~-~--~~~f~~~~ll~-~d~~~~~~~~v~~GilT~-----NE~Re~L~g~~p- 407 (542) T protein:vir:41 345 PQQNIISSILTDFFQVKF-------NPK-T--RFKFNDETLLE-SDSVRNCALLVQSGVLTP-----AEARERLFGLDG- 407 (542) T ss_pred HHHHHHHHHHHhhccccc-------CCc-e--EEEecchhhcc-hHHHHHHHHHHhCCCCCH-----HHHHHhhCCCCC- Confidence 889999999998665431 111 1 35554332222 123456788999999886 4678754 7763 Q ss_pred CCcccccCccc--cc-CCCCCCccc-------ccccccCCCCCcccccccccchhhhhcC Q lcl|NC_016071. 467 IPEDMSTDELL--KL-LGQDTSRSG-------DGMTAGSNGNGTGKISSTRDNSVSNMDN 516 (516) Q Consensus 467 ~~~~~~~~~~~--~~-~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~d~~~~~~~~ 516 (516) .++.-..+... .. ..+..+... .....+.++......++...++.....+ T Consensus 408 gdd~~l~p~~~~~~~~~~~~~n~~~~~~~~~~k~~~k~~~~~~~~~~~~~~~~~~~~~~~ 467 (542) T protein:vir:41 408 GPDIFMVPSKGAAKSVKRQERNYEKNQIREIRKIYAKYRPRFNEIISSKLSAEEKKKKID 467 (542) T ss_pred CCccccccccccccccccCCcCCCCCchhhhhhcccccCccccccccccccchhhccccc Confidence 22211111000 00 000000000 0000011111111111112222222222 No 70 >protein:vir:95378 Length: 406 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764474;genbank:gi:115334628;genbank:GeneID:5179265 Probab=99.51 E-value=8.3e-13 Score=86.83 Aligned_cols=395 Identities=11% Similarity=0.059 Sum_probs=191.0 Q ss_pred CCc--cccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhhChHHHHHHHHHHHHH Q lcl|NC_016071. 1 MST--RFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQDHTVSTALDTKYVFV 78 (516) Q Consensus 1 ~~~--r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~~v 78 (516) |.= ++....+...... .+. ....+.. ....... 
.-.+...++.+.|.+|+..+-..| T Consensus 1 Mg~f~~~~~~~~~~~~~~----~~~---------~~~~~~~------~~~~~~~--~~~~~~~~~~~~v~~~i~~ia~~i 59 (406) T protein:vir:95 1 MGLFDRWRRTKRKSKIRA----DTG---------YVGLFMS------GEDVSFL--VPGYVRLSDNPEVRMAVHKIADLI 59 (406) T ss_pred Ccchhhhccccccccccc----cch---------hhhhhcc------CcccCcc--ccCHHHHhhcHHHHHHHHHHHHhh Confidence 332 2221111110000 000 0000100 0000000 011455667899999999999999 Q ss_pred hcCCceeeeCCCCCChhhHHHHHHHHHHH-hhccCcCCHHHHHHHHHHH-Hhh--cceeeeEEEeecccccccccceeec Q lcl|NC_016071. 79 TKAFNDFKVLYNRDSKASKDAAEFVEYAL-KNLANQQTLRDIARSAATF-NEY--GFSIFEKVYRTESAPSKYAGYITID 154 (516) Q Consensus 79 ~~~~w~i~~~~~~d~~~~~~~a~~v~~~l-~~~~~~~~~~~~l~~~lda-~~~--G~S~~Eivw~~~~~~~~~~g~~~~~ 154 (516) .+++|.+.-..+... +++-.-+...| .+-+...++.+++..++.. +.+ ||++.++++...+ .+. T Consensus 60 a~~~~~~~~~~~~~~---~~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~~ll~g~g~a~~~~~~~~~g---~~~------ 127 (406) T protein:vir:95 60 SSMTIYLMQNTEDGD---IRIRNELSRKIDITPYSLMTRKSWMYNIVYTMLLDGEGNSVVFPKYTADG---LID------ 127 (406) T ss_pred ccCceEEEEecCCcc---eeecchHHHHHhhccCCCCCHHHHHHHHHHHHHhcCCceEEEEEEECCCC---cEE------ Confidence 999998743222111 11111122222 3333445788888887753 444 7778778775432 222 Q ss_pred cccccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcC-C Q lcl|NC_016071. 
155 KIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTE-S 233 (516) Q Consensus 155 ~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~-g 233 (516) .|.+.++.+++ ...+.+|-.+ ...+..+|.+.+|++++...+ + T Consensus 128 ~l~~i~~~~v~----~~~~~~~~~~--------------------------------~~~~~~~~~~evih~~~~~~~~~ 171 (406) T protein:vir:95 128 ELVPLTPSKVN----FLDTPDGYQV--------------------------------LYGGQTFNYDEVLHFIYNPDPER 171 (406) T ss_pred EEEEEcCceeE----EEEcCCeEEE--------------------------------EeccEEEchhHEEEeeccCCCCC Confidence 33333443332 1112222100 001234677777766765444 3 Q ss_pred ccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHH-HHHHHHHHHHHHhhcccce-- Q lcl|NC_016071. 234 NPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPES-EMVQGLMADAANAHAGEQA-- 310 (516) Q Consensus 234 ~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~-~~l~~l~~~~~~~~~g~~a-- 310 (516) ..+|.|.+..+....-.-....++...+...-+.+--+++.+ ..-++++. +..+.+.+... ....+ T Consensus 172 ~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~--------~~l~~e~~~~~~~~~~~~~~---g~~n~~~ 240 (406) T protein:vir:95 172 PYIGRGYRVVLKDIADNLKQATATKKSFMSGKYMPSLIVKVD--------AATAELSSEEGRNAVFKKYL---QATEAGQ 240 (406) T ss_pred CccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeC--------CCCCHHHHHHHHHHHHHHhc---cccccCC Confidence 457999999998877777777777777775533332222222 12222222 22233333222 11222 Q ss_pred EEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHHHHHHHHHHHHH Q lcl|NC_016071. 311 YFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQSIHGHFVQRDI 390 (516) Q Consensus 311 ~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~ev~~~~~~aDa 390 (516) .++++.+... +. ++... +....++.+..++.-++|++++.-..--.+. ++.. 
.+........-+.--+ T Consensus 241 ~~v~~~~~~~-~~-----~~~~~--~~~d~q~~e~~~~~~~~Ia~~fgVp~~~lg~---~~~~-~~~~~~~~~~~l~P~~ 308 (406) T protein:vir:95 241 PWIIPAELLE-VE-----QVKPL--SLKDIAINEAVELDKRTVAGMFGVPAFLLGI---GEFN-RDEYNNFINSTILPIA 308 (406) T ss_pred ceeecCCCcc-cc-----ccccC--ChhHHHHHHHHHHHHHHHHHHhCCCHHHcCC---CCch-HHHHHHHHHHHHHHHH Confidence 3456665421 10 11111 1223346677788889999988776533321 1111 2233445555677777 Q ss_pred HHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcc Q lcl|NC_016071. 391 DIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPED 470 (516) Q Consensus 391 ~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~ 470 (516) +.|++.||+.|+. +...+-+|.++.....|.+..++.+.+|++.|++.+ +.+|+.+|+|+-..+| T Consensus 309 ~~ie~~l~~~l~~----------~~~~~~~fd~~~l~~~d~~~~~~~~~~l~~~G~~t~-----NE~R~~~gl~p~~~gd 373 (406) T protein:vir:95 309 KGIEQELTRKLLI----------SPDLYFKFNPRSLYAYDLKELAEVGSNMYVRGIMEG-----NEVRDWLGLSPKEGLS 373 (406) T ss_pred HHHHHHHHHhcCC----------CCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCcc Confidence 7888888765542 122233455555556788999999999999999876 5799999999654444 Q ss_pred cccCcccccCCCCCCcccccccccC-CCCCcccc Q lcl|NC_016071. 471 MSTDELLKLLGQDTSRSGDGMTAGS-NGNGTGKI 503 (516) Q Consensus 471 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~ 503 (516) .......-.+.+.... ....+.+. .+++..+. 
T Consensus 374 ~~~~~~n~~~~~~~~~-~~~~k~g~~~~~~~~~~ 406 (406) T protein:vir:95 374 ELVILENYIPLDKIGD-QSKLKGGDNSGADGQTD 406 (406) T ss_pred eeeeccCccchhhccc-ccccCCCCCCCCCCCCC Confidence 3321111100000000 00001100 01111000 No 71 >protein:vir:100691 Length: 535 # NCBI annotation: hypothetical protein # Family: family:all:2446 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164747;genbank:gi:56693160;genbank:GeneID:3197324 Probab=99.49 E-value=2.9e-12 Score=83.85 Aligned_cols=443 Identities=11% Similarity=-0.001 Sum_probs=187.9 Q ss_pred CCccccCccc-ccchhhh------cccCCCCcccccchHHH-HHHHHHHHhhcccccCCcccHHHHH------------- Q lcl|NC_016071. 1 MSTRFAQPSE-VVKAGNE------NLAVSRLRTGELGSGAL-SQLRAESEVMKVEELRWPCFLATVE------------- 59 (516) Q Consensus 1 ~~~r~~~~~~-~~~~~~~------~p~~~~~~~~e~g~~~~-~~~~~~~~~~~~~~lr~~~~~~~y~------------- 59 (516) |+-.+-.... +....+. .|.-...|....|-... ....++. +. + .++....|+ T Consensus 13 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~--~~-~---~~~~~~~~~~l~~~~~~~~~~~ 86 (535) T protein:vir:10 13 LSNKKSTSYIELGDYDKDIVNKAIRPGRASARDTVDGIDIADGNVAGQY--SV-A---SISDVLSTKKLLKAYADNDIVQ 86 (535) T ss_pred hhhhhhhhhHHHhhhhHHHHHhhhhhhhhhhhccccccccccCCccccc--cc-C---ccccccCHHHHHHHhccChhHH Confidence 1111000000 0000000 00000000000000000 0000000 00 0 111111122 Q ss_pred ----HHhhChHHHHHHHHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCc-----CCHHHHHHHHH-HHHhh Q lcl|NC_016071. 60 ----AMKQDHTVSTALDTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQ-----QTLRDIARSAA-TFNEY 129 (516) Q Consensus 60 ----~m~~D~~v~s~l~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~-----~~~~~~l~~~l-da~~~ 129 (516) -+..+..+.+|+......+.+++.++.-.........+.-...+...|...++. 
..|..++..++ +.+.+ T Consensus 87 ~~i~t~~~~va~~~~i~~~s~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~~~~~~~~~~~~~lv~d~l~~ 166 (535) T protein:vir:10 87 AIIRTRTNQVLTYSNPSRYNRNGVGFKVELKDATKVMSKAQIKRAHEIEDFIYNTGSEYYEWRDTFPRLLTKIINDMYVQ 166 (535) T ss_pred HHHHHHHHHHHHHHHHHHHhcccCcceeEEEeccCCCcchhhhhhhHHHHHHHhCCCCCCChhHHHHHHHHHHHHHHHhh Confidence 222333344444444333434443332111111111111122344555543322 23556777665 34556 Q ss_pred c-ceeeeEEEeecccccccccceeeccccccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccc Q lcl|NC_016071. 130 G-FSIFEKVYRTESAPSKYAGYITIDKIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTN 208 (516) Q Consensus 130 G-~S~~Eivw~~~~~~~~~~g~~~~~~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~ 208 (516) | .++.++++...+ ++. .|.+.++.+++ +..+.++... .. .+.. T Consensus 167 ~g~ay~~i~r~~~G---~~~------~L~~l~p~~V~----v~~d~~~~~~------~~-----------------~~~~ 210 (535) T protein:vir:10 167 DQINIERIFKNDSN---ELD------HFNAVDASKVV----ISYSPRSKDQ------PR-----------------KFEQ 210 (535) T ss_pred CCceEEEEEECCCC---cEE------EEEEeCCceeE----EEEcCccccC------ce-----------------EEEE Confidence 5 566666554322 222 34444444332 2223222100 00 0001 Q ss_pred cccCCCccccccccEEEEeecCcC---CccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCC Q lcl|NC_016071. 209 LTSSADEVFIPINKLMVMSLGGTE---SNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAID 285 (516) Q Consensus 209 ~~~~~~~~~iP~~k~i~~~~~~~~---g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~ 285 (516) .........+|.+.+|++++.... +.+||.|.+..+....-.-....++-..|...-+.|--+++.|.. 
.+ T Consensus 211 ~~~~~~~~~~~~~eiih~~~~~~~~~~~~~~G~Spi~~~~~~i~~~~aa~~~~~~~f~ng~~p~giL~~~~~------~~ 284 (535) T protein:vir:10 211 FVSETKSVKFSERNLTFINYWNLSDTDRRGYGYSPVEASIPLIRAIYDTEQFNARFFSQGGTTRGILVIDQD------GD 284 (535) T ss_pred EecCceeEEECcccEEEEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEecCC------CC Confidence 112233456778887777765433 357899999999887777777777777777654444334443211 01 Q ss_pred CCHHHHHHHHHHHHHHHHhhccc-ceEE-EeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhccccc Q lcl|NC_016071. 286 PKSPESEMVQGLMADAANAHAGE-QAYF-ILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFIN 363 (516) Q Consensus 286 ~~~~~~~~l~~l~~~~~~~~~g~-~a~~-iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLt 363 (516) + ...++.++++++...+...|. .++. .|+.+.. +++.....+.....|.+..++..++|++++.-...- T Consensus 285 ~-~ls~e~~e~lk~~~~~~~~G~~nag~~~vl~~~g--------~~~~~l~~~~~D~qfle~~~~~~~eIa~afgVPp~~ 355 (535) T protein:vir:10 285 A-QANQMMLAGIRRQWTSQGSGLGGAWKIPILAAKD--------AKFVNMTQNSRDMEFDKFLNFMIYDTAAIFQMQPEE 355 (535) T ss_pred c-ccCHHHHHHHHHHHHHHhcCcccccccccccCCC--------ceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHH Confidence 1 112233455666655554453 3332 2333221 233333333344457777788889999998765433 Q ss_pred ccCCccchhh------------HHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchh Q lcl|NC_016071. 364 LGNDGQGSYN------------LSESKQSI-HGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVD 430 (516) Q Consensus 364 s~~~~~GS~A------------l~~vh~ev-~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~d 430 (516) .+-.+.++|+ ..+..... ....+.--++.|+..||+.|++.. . .. 
-+|.|+.....| T Consensus 356 lG~~~~at~sn~~~~~~~~~~s~~E~~~~~~~~~~L~P~l~~ie~~ln~~Ll~~~-------~--~~-~~f~f~~l~~~d 425 (535) T protein:vir:10 356 INFPNNGGSTGKSGTKSVNEGSTAKAKLESSKDKGLTPLLSFIEQVINDKIMRYV-------D--TD-YRFSFTLGDAQD 425 (535) T ss_pred hccccCcccccchhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhccccc-------C--Ce-EEEEeccccccC Confidence 3222222222 12222222 233577788889999998776531 1 11 267888888888 Q ss_pred HHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccccCc--------------ccccCCCCC----------Cc Q lcl|NC_016071. 431 MEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMSTDE--------------LLKLLGQDT----------SR 486 (516) Q Consensus 431 l~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~~~--------------~~~~~~~~~----------~~ 486 (516) .+..+++.+... .|.+.+ +.+|+.+|+|+-+.+|..... ...+.+... .. T Consensus 426 ~~~r~~~~~~~~-~g~lT~-----NE~R~~~gl~piegGD~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~q~ 499 (535) T protein:vir:10 426 KLQEEQVWKLKL-ANGYFI-----NEYRKDHGLKTVDGLDVPGFIGSAENFINATGFGQPNVPDSSDDSGSTLGERERQE 499 (535) T ss_pred HHHHHHHHHHHH-cCCCCH-----HHHHHHhCCCCCCCccccccccchhhcccccccccccCCCCCCCccccCCccccCc Confidence 887777776544 565543 689999999976555542210 000000000 00 Q ss_pred ccccccccCCCCCcccccccc---cchhhhhcC Q lcl|NC_016071. 
487 SGDGMTAGSNGNGTGKISSTR---DNSVSNMDN 516 (516) Q Consensus 487 ~~~~~~~~~~~~~~~~~~~~~---d~~~~~~~~ 516 (516) ..........|..+++.+..+ ..+.+..+| T Consensus 500 ~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~ 532 (535) T protein:vir:10 500 RIQHSKDYEKGKDDPKSPLPKPSESDDVSNNED 532 (535) T ss_pred ccccccccccCCCCCCCCCCcCCCCCccccccc Confidence 000011111122233333222 112222223 No 72 >protein:vir:7407 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839924;genbank:gi:30089894;genbank:GeneID:1260681 Probab=99.49 E-value=4.1e-13 Score=88.47 Aligned_cols=380 Identities=10% Similarity=0.013 Sum_probs=188.7 Q ss_pred CC----ccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHH-HHHhhChHHHHHHHHHH Q lcl|NC_016071. 1 MS----TRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATV-EAMKQDHTVSTALDTKY 75 (516) Q Consensus 1 ~~----~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y-~~m~~D~~v~s~l~~Rk 75 (516) |. ..+.+....... ..++ ... ...... ..+..+ .......+. +..++-+.|.+|+..+- T Consensus 1 m~m~~~~~~~~~~~~~~~--~~~~--~~~-~~~~~~--~~~~~~---------~~~~g~~v~~~~al~~~~v~~~v~~ia 64 (392) T protein:vir:74 1 MILPILNFINQTNDPPEA--GSVQ--SYF-PDGNDA--QIMESL---------LGDNNEWVSARAALRNSDLFSIILQLS 64 (392) T ss_pred CcchhhhhhhcccCcccc--cccc--ccc-ccCchh--hhhhhc---------cCCCCcccchhhhhcchHHHHHHHHHH Confidence 11 112111111100 0000 000 000000 000000 000111222 23456889999999999 Q ss_pred HHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceeec Q lcl|NC_016071. 76 VFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITID 154 (516) Q Consensus 76 ~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~~ 154 (516) ..|.++++++.-. . ... .+++-+...++.++++.++ +.+.+|-+++++++...+ . +. 
T Consensus 65 ~~ia~lp~~~~~~------~---~~~----l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G-------~--~~ 122 (392) T protein:vir:74 65 SDLAIVKINAEKK------K---NQG----IIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANG-------A--DM 122 (392) T ss_pred HhhccCceeeccc------h---hhh----hhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCC-------c--EE Confidence 9999998876411 0 011 2333344456778888766 678899999999986532 2 23 Q ss_pred cccccCchhcccccceeecCCCceee-eccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCC Q lcl|NC_016071. 155 KIAFRPQSSLSRSKPWVFDEDGRTLK-GIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTES 233 (516) Q Consensus 155 ~l~~r~q~ti~~~~~f~~~~dg~~l~-~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g 233 (516) .|.+.++.++. ...+.+|+.+. .+.... .....-..+|.+.+|++++....+ T Consensus 123 ~L~~i~~~~v~----v~~~~~~~~~~y~~~~~~-----------------------~~~~~~~~~~~~evih~~~~~~~~ 175 (392) T protein:vir:74 123 KWEYLRPSQVN----TYYFEYENGMYYNITFDD-----------------------PKIEPILQAPQSDLIHMKLLSIDG 175 (392) T ss_pred EEEEEcCceeE----EEEcCCCceEEEEEEecC-----------------------CccceeEEEcCccEEEecCCCCCC Confidence 44455554443 23344443221 111000 001112346677766666556666 Q ss_pred ccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceE-- Q lcl|NC_016071. 234 NPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAY-- 311 (516) Q Consensus 234 ~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~-- 311 (516) ..+|.|.+..+....-.-....++...+....+.|--+++.|. +...+++.+ +...+... 
....++ T Consensus 176 ~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~~il~~~~--------~~~~~~~~~-~~~~~~~~---~~~n~g~~ 243 (392) T protein:vir:74 176 GKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKG--------GGLLSDKDK-ASRSRSFM---KRSRSGGP 243 (392) T ss_pred ccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCC--------CCCchHHHH-HHHHHHHh---ccccCCCe Confidence 7899999999998887777777888888776655554444432 222222222 22222222 222333 Q ss_pred EEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHHHHHHHHHHHHHH Q lcl|NC_016071. 312 FILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQSIHGHFVQRDID 391 (516) Q Consensus 312 ~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~ev~~~~~~aDa~ 391 (516) +++|.|++.+ -++. +....++.+..++..++|++++.-..-..+..+ .+++..+.-.......+.--++ T Consensus 244 ~vl~~g~~~~--------~l~~--~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~-~~~~~~e~~~~~~~~~l~p~~~ 312 (392) T protein:vir:74 244 VVLDDLEEFT--------ALEI--KSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQG-DQQSSIQQISGMYASALNRYLR 312 (392) T ss_pred eecCCCceEE--------EccC--ChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCC-CcccHHHHHHHHHHHHHHHHHH Confidence 6778887433 2222 233445777788888999999876543332211 2222222223344455666778 Q ss_pred HHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHH---cCCCCCCC Q lcl|NC_016071. 392 IIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEV---GGFDEEIP 468 (516) Q Consensus 392 ~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~---~Glp~~~~ 468 (516) .|++.||+.|++.+ .+| +...-..|...+++.+.+|+..|++.+. .+|+. .|+. + T Consensus 313 ~ie~~l~~~l~~~~-~~~-------------~~~~~~~d~~~~~~~~~~l~~~g~~t~n-----ear~~~~~~g~~-p-- 370 (392) T protein:vir:74 313 PAISELEYKLSDHI-SVN-------------MRPAIDPLGDNYLSTISTATRWGALAEN-----QATFVLQEAGYI-P-- 370 (392) T ss_pred HHHHHHHHhccchh-ccc-------------chhhhcCCHHHHHHHHHHHHhCCCcCHH-----HHHHHHHhCCCC-c-- Confidence 88888888776542 222 1111234557778899999999988763 34443 4664 2 Q ss_pred cccccCcccccCCCCCCcccccccccCCCCCccccccc Q lcl|NC_016071. 
469 EDMSTDELLKLLGQDTSRSGDGMTAGSNGNGTGKISST 506 (516) Q Consensus 469 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 506 (516) .+ ....+. .++-++ |. +..|.+ T Consensus 371 ne-~r~~en-------l~~~~~---Gd-----~~~p~p 392 (392) T protein:vir:74 371 KD-LPAPEN-------TNKKTT---GQ-----SNEPVP 392 (392) T ss_pred cc-cchhcC-------CCCCCC---CC-----CCCCCC Confidence 11 110000 000000 00 011111 No 73 >protein:vir:9702 Length: 406 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795464;genbank:gi:28876227;genbank:GeneID:1257772 Probab=99.49 E-value=1.5e-12 Score=85.41 Aligned_cols=391 Identities=12% Similarity=-0.011 Sum_probs=192.2 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhhChHHHHHHHHHHHHHhc Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQDHTVSTALDTKYVFVTK 80 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~~v~~ 80 (516) ..+|......... .+..+...... .....+ .-++-+.|.+|+..+-..|.+ T Consensus 4 f~~~~~~~~~~~~----------------------~~~~~~~~~~~-----~~~~~~--~Al~~~~V~~~i~~Ia~~iA~ 54 (406) T protein:vir:97 4 FQPLGTSKVSYDD----------------------YISSVLAGDVS-----QKYLGV--SALKNSDILTATSIIAGDIAR 54 (406) T ss_pred ccccCCCCCCcch----------------------HHHHHhcCCCC-----cccccc--hhhccHHHHHHHHHHHHhhhh Confidence 2333222111000 01111110000 001111 123567899999999999999 Q ss_pred CCceeeeCCCCCChhhHHHHHHHHHHHh-hccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceeeccccc Q lcl|NC_016071. 81 AFNDFKVLYNRDSKASKDAAEFVEYALK-NLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITIDKIAF 158 (516) Q Consensus 81 ~~w~i~~~~~~d~~~~~~~a~~v~~~l~-~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l~~ 158 (516) +++.+.-. +++...+ .-+...|. +-+...++.+++..++ +.+.+|-++++++....++ . 
+..|.+ T Consensus 55 lp~~~~~~-~g~~~~~----~~~~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gnay~~i~r~~~~g--~------~~~L~~ 121 (406) T protein:vir:97 55 FPLVKKDV-NGDIIHD----EDINYLLNVKSTSNASARTWKFAMAVNAILTGNSFSRILRDPKTN--Q------ALQFQF 121 (406) T ss_pred CeeEEEec-Ccccccc----chHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCCC--e------EEEEEE Confidence 99865422 2221111 12334443 2334456778887655 5678899999988643222 1 223445 Q ss_pred cCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCccccc Q lcl|NC_016071. 159 RPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPAGV 238 (516) Q Consensus 159 r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~G~ 238 (516) .++..+. ...+++|++.....+ ...+....+|...+|++++.+. +..+|. T Consensus 122 i~p~~v~----v~~~~~~~~~y~~~~-------------------------~~~~~~~~~~~~evih~r~~~~-dg~~G~ 171 (406) T protein:vir:97 122 YRPSETT----VEETDNHEIVYTFTD-------------------------MLTAKQVKCFAHDVIHWKFFSH-DTILGR 171 (406) T ss_pred ECCCeeE----EEEcCCceEEEEEEe-------------------------cCCceEEEEccccEEEecCCCC-CCcccc Confidence 5554332 223444433221110 1122344577777776666543 336799 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceE--EEecc Q lcl|NC_016071. 239 SPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAY--FILPS 316 (516) Q Consensus 239 gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~--~iiP~ 316 (516) |.+..+....-.-....++.+.|.. +|+.- +.+-.....-++++.+ ++++.......|..++ ++++. 
T Consensus 172 spi~~~~~~i~~~~a~~~~~~~~f~-ng~~~-------~~i~~~~~~l~~e~~~---~~~~~~~~~~~g~n~g~~~vl~~ 240 (406) T protein:vir:97 172 SPLLSLGDEIDLQTGGINTLIKFFK-DGFSS-------GILTMKGAQLSGDARQ---RARQEFEKMREGSVGGSPLVFDS 240 (406) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHh-ccCCC-------ceEEecCCCCCHHHHH---HHHHHHHHHhcccccCceeecCC Confidence 9998887766555666666666664 45432 2222222223333333 3444444445555544 45688 Q ss_pred CcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016071. 317 DMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQSIHGHFVQRDIDIIVEA 396 (516) Q Consensus 317 g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~ev~~~~~~aDa~~i~~~ 396 (516) |++... ++. +.....+.+..++..++|++++--..-..+....+| ...+.-......-++-.++.|++. T Consensus 241 g~~~~~--------l~~--~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~~~~~~-~~e~~~~~f~~~~l~P~~~~ie~~ 309 (406) T protein:vir:97 241 TMEYTP--------LEI--DTNVLQLITSNNFSTAQIAKALRVPSYKLGVNSPNQ-SVAQLMEDYVTNDLPFYFDAITSE 309 (406) T ss_pred CceEEE--------ccC--CHHHHHHHHHHHhhHHHHHHHhCCCHHHcCCCCCcc-hHHHHHHHHHHHHHHHHHHHHHHH Confidence 864332 111 122223555667778899988755443332222222 223333344455577778888888 Q ss_pred HHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCC--cccccC Q lcl|NC_016071. 397 FNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIP--EDMSTD 474 (516) Q Consensus 397 ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~--~~~~~~ 474 (516) ||+.|+..-- ..--+++|+. 
..+++..++++.++++.|++.+ +.+|+.+|+|+-.+ .|+..- T Consensus 310 l~~kll~~~~---------~~~~~i~fd~--~~~~~~~~~~~~~~~~~g~~T~-----NE~R~~~g~~p~~~~~gD~~~~ 373 (406) T protein:vir:97 310 LGLKTLNDKD---------RRLYHIEFDT--RSVTGRNVDEIVKLVNNQILTP-----NQGLVELGKQKSTDPNMDRYQS 373 (406) T ss_pred HhhhhcChhh---------ccceeEEEec--CccchhhHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCCCCeEee Confidence 8877654311 0112455654 3456677788889999998876 57899999986433 232211 Q ss_pred ccc-cc-----CCCCCCcccccccccCCCCCcccccccccch Q lcl|NC_016071. 475 ELL-KL-----LGQDTSRSGDGMTAGSNGNGTGKISSTRDNS 510 (516) Q Consensus 475 ~~~-~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 510 (516) ... .+ .+++.... ..+ +...++ -.|+| T Consensus 374 ~~n~~~~~~~~~~~~~~~~--~~~-gg~~~~------~~~~~ 406 (406) T protein:vir:97 374 SLNYVFLDKKEEYQDKVGI--KGK-GGEVNA------EEDKS 406 (406) T ss_pred ccCccchhccccccccccc--ccC-CCCCCC------CCCCC Confidence 110 00 01110000 001 111111 11222 No 74 >protein:vir:1023 Length: 392 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076677;genbank:gi:13095786;genbank:GeneID:920364 Probab=99.48 E-value=6.7e-13 Score=87.34 Aligned_cols=380 Identities=10% Similarity=0.023 Sum_probs=189.4 Q ss_pred CC--ccccCcccccchh-hhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhhChHHHHHHHHHHHH Q lcl|NC_016071. 1 MS--TRFAQPSEVVKAG-NENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQDHTVSTALDTKYVF 77 (516) Q Consensus 1 ~~--~r~~~~~~~~~~~-~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~~ 77 (516) |. ..+.+.....+.. .+.+ .+. |. ...+.... . . ..+..+. -+..++.+.|.+|+..+-.. 
T Consensus 3 m~~f~~~~~~~~~~~~~~~~~~-~~~------~~--~~~~~~~~---~-~--~~~~~v~-~~~al~~~~v~~~i~~ia~~ 66 (392) T protein:vir:10 3 LPILNFINQTNDPPEVGSVQSY-FPD------GN--DAQIMESL---L-G--DNNEWVS-ARAALRNSDLFSIILQLSSD 66 (392) T ss_pred chhhhhhhcccccccccccccc-ccc------Cc--hhhhhhhh---c-C--CCCceec-hHHhhccHHHHHHHHHHHHh Confidence 11 1222111111100 0000 000 00 00010000 0 0 0112221 13345689999999999999 Q ss_pred HhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceeeccc Q lcl|NC_016071. 78 VTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITIDKI 156 (516) Q Consensus 78 v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l 156 (516) |.++++++.-. .. ...+++-+...+..++++.+. +.+.+|.+++++++...+ . +..| T Consensus 67 ia~lp~~~~~~------~~-------~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g-------~--~~~L 124 (392) T protein:vir:10 67 LAIVKINAEKK------KN-------QGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANG-------A--DMKW 124 (392) T ss_pred hccCceeeccc------hh-------hhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCC-------c--EEEE Confidence 99998876411 11 112334444566788888776 678899999999876432 2 2344 Q ss_pred cccCchhcccccceeecCCCceee-eccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCcc Q lcl|NC_016071. 157 AFRPQSSLSRSKPWVFDEDGRTLK-GIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNP 235 (516) Q Consensus 157 ~~r~q~ti~~~~~f~~~~dg~~l~-~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p 235 (516) .+.++.++. ...+.+|+.+. ..... ......-..+|.+.+|++++....+.. T Consensus 125 ~~l~~~~v~----~~~~~~~~~~~y~~~~~-----------------------~~~~~~~~~~~~~eiih~~~~~~~~~~ 177 (392) T protein:vir:10 125 EYLRPSQVN----TYYFEYENGMYYNITFD-----------------------DPKIEPILQAPQSDLIHMKLLSIDGGK 177 (392) T ss_pred EEEcCceeE----EEEcCCCceEEEEEEec-----------------------CcccceeEEEccccEEEecCCCCCCcc Confidence 455554332 23344443221 10000 000111234667776666666666778 Q ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccce--EEE Q lcl|NC_016071. 
236 AGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQA--YFI 313 (516) Q Consensus 236 ~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a--~~i 313 (516) +|.|.+..+....-.-....++...+....+.+--+++.+ .+...+++.. +...+... ....+ .++ T Consensus 178 ~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~--------~~~~~~~~~~-~~~~~~~~---~~~~~g~~~v 245 (392) T protein:vir:10 178 TGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVK--------GGGLLSDKDK-ASRSRSFM---KRSRSGGPVV 245 (392) T ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeC--------CCCCchHHHH-HHHHHHHh---ccccCCCeee Confidence 9999999999888777777777777777655544344332 1222222222 22222222 22233 366 Q ss_pred eccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016071. 314 LPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQSIHGHFVQRDIDII 393 (516) Q Consensus 314 iP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~ev~~~~~~aDa~~i 393 (516) +|.|++++ ..+.+....++.+..++..++|++++.-..-..+..+ .+++..+-.......-++-.++.| T Consensus 246 l~~g~~~~----------~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~-~~~~~~~~~~~f~~~~l~P~~~~i 314 (392) T protein:vir:10 246 LDDLEEFT----------ALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQG-DQQSSIQQISGMYASALNRYLRPA 314 (392) T ss_pred cCCCceEE----------EccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCC-CcccHHHHHHHHHHHHHHHHHHHH Confidence 78886432 2222233445777788888999999876554443211 222223323344555677788888 Q ss_pred HHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHc---CCCCCCCcc Q lcl|NC_016071. 394 VEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVG---GFDEEIPED 470 (516) Q Consensus 394 ~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~---Glp~~~~~~ 470 (516) ++.||+.|++.+ .+| ....-..|...+++.+.+|+..|++.++ .+|+.+ |+.+ . 
T Consensus 315 e~~l~~~L~~~~-~~d-------------~~~~~~~d~~~~~~~~~~l~~~g~~t~n-----E~r~~l~~~g~~p---~- 371 (392) T protein:vir:10 315 ISELEYKLSDHI-SVN-------------MRPAIDPLGDNYLSTISTATRWGALAEN-----QATFVLQEAGYIP---K- 371 (392) T ss_pred HHHHHHhccccc-ccc-------------chhhhccCHHHHHHHHHHHHhCCCcCHH-----HHHHHHHhcCCCc---c- Confidence 888888775542 222 1111234567778899999999987763 344444 6652 1 Q ss_pred cccCcccccCCCCCCcccccccccCCCCCcc Q lcl|NC_016071. 471 MSTDELLKLLGQDTSRSGDGMTAGSNGNGTG 501 (516) Q Consensus 471 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 501 (516) +....+.. ++-++ |...+..+ T Consensus 372 e~r~~e~l-------~~~~~---Gd~~~p~p 392 (392) T protein:vir:10 372 DLPAPENT-------NKKTT---GQSNEPVP 392 (392) T ss_pred ccchhcCC-------CCCCC---CCCCCCCC Confidence 11100000 00000 00011111 No 75 >protein:vir:3989 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116497;genbank:gi:14251130;genbank:GeneID:921299 Probab=99.48 E-value=6.7e-13 Score=87.34 Aligned_cols=380 Identities=10% Similarity=0.023 Sum_probs=189.4 Q ss_pred CC--ccccCcccccchh-hhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhhChHHHHHHHHHHHH Q lcl|NC_016071. 1 MS--TRFAQPSEVVKAG-NENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQDHTVSTALDTKYVF 77 (516) Q Consensus 1 ~~--~r~~~~~~~~~~~-~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~~ 77 (516) |. ..+.+.....+.. .+.+ .+. |. ...+.... . . ..+..+. -+..++.+.|.+|+..+-.. 
T Consensus 3 m~~f~~~~~~~~~~~~~~~~~~-~~~------~~--~~~~~~~~---~-~--~~~~~v~-~~~al~~~~v~~~i~~ia~~ 66 (392) T protein:vir:39 3 LPILNFINQTNDPPEVGSVQSY-FPD------GN--DAQIMESL---L-G--DNNEWVS-ARAALRNSDLFSIILQLSSD 66 (392) T ss_pred chhhhhhhcccccccccccccc-ccc------Cc--hhhhhhhh---c-C--CCCceec-hHHhhccHHHHHHHHHHHHh Confidence 11 1222111111100 0000 000 00 00010000 0 0 0112221 13345689999999999999 Q ss_pred HhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceeeccc Q lcl|NC_016071. 78 VTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITIDKI 156 (516) Q Consensus 78 v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l 156 (516) |.++++++.-. .. ...+++-+...+..++++.+. +.+.+|.+++++++...+ . +..| T Consensus 67 ia~lp~~~~~~------~~-------~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g-------~--~~~L 124 (392) T protein:vir:39 67 LAIVKINAEKK------KN-------QGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANG-------A--DMKW 124 (392) T ss_pred hccCceeeccc------hh-------hhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCC-------c--EEEE Confidence 99998876411 11 112334444566788888776 678899999999876432 2 2344 Q ss_pred cccCchhcccccceeecCCCceee-eccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCcc Q lcl|NC_016071. 157 AFRPQSSLSRSKPWVFDEDGRTLK-GIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNP 235 (516) Q Consensus 157 ~~r~q~ti~~~~~f~~~~dg~~l~-~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p 235 (516) .+.++.++. ...+.+|+.+. ..... ......-..+|.+.+|++++....+.. T Consensus 125 ~~l~~~~v~----~~~~~~~~~~~y~~~~~-----------------------~~~~~~~~~~~~~eiih~~~~~~~~~~ 177 (392) T protein:vir:39 125 EYLRPSQVN----TYYFEYENGMYYNITFD-----------------------DPKIEPILQAPQSDLIHMKLLSIDGGK 177 (392) T ss_pred EEEcCceeE----EEEcCCCceEEEEEEec-----------------------CcccceeEEEccccEEEecCCCCCCcc Confidence 455554332 23344443221 10000 000111234667776666666666778 Q ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccce--EEE Q lcl|NC_016071. 
236 AGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQA--YFI 313 (516) Q Consensus 236 ~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a--~~i 313 (516) +|.|.+..+....-.-....++...+....+.+--+++.+ .+...+++.. +...+... ....+ .++ T Consensus 178 ~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~--------~~~~~~~~~~-~~~~~~~~---~~~~~g~~~v 245 (392) T protein:vir:39 178 TGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVK--------GGGLLSDKDK-ASRSRSFM---KRSRSGGPVV 245 (392) T ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeC--------CCCCchHHHH-HHHHHHHh---ccccCCCeee Confidence 9999999999888777777777777777655544344332 1222222222 22222222 22233 366 Q ss_pred eccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016071. 314 LPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQSIHGHFVQRDIDII 393 (516) Q Consensus 314 iP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~ev~~~~~~aDa~~i 393 (516) +|.|++++ ..+.+....++.+..++..++|++++.-..-..+..+ .+++..+-.......-++-.++.| T Consensus 246 l~~g~~~~----------~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~-~~~~~~~~~~~f~~~~l~P~~~~i 314 (392) T protein:vir:39 246 LDDLEEFT----------ALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQG-DQQSSIQQISGMYASALNRYLRPA 314 (392) T ss_pred cCCCceEE----------EccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCC-CcccHHHHHHHHHHHHHHHHHHHH Confidence 78886432 2222233445777788888999999876554443211 222223323344555677788888 Q ss_pred HHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHc---CCCCCCCcc Q lcl|NC_016071. 394 VEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVG---GFDEEIPED 470 (516) Q Consensus 394 ~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~---Glp~~~~~~ 470 (516) ++.||+.|++.+ .+| ....-..|...+++.+.+|+..|++.++ .+|+.+ |+.+ . 
T Consensus 315 e~~l~~~L~~~~-~~d-------------~~~~~~~d~~~~~~~~~~l~~~g~~t~n-----E~r~~l~~~g~~p---~- 371 (392) T protein:vir:39 315 ISELEYKLSDHI-SVN-------------MRPAIDPLGDNYLSTISTATRWGALAEN-----QATFVLQEAGYIP---K- 371 (392) T ss_pred HHHHHHhccccc-ccc-------------chhhhccCHHHHHHHHHHHHhCCCcCHH-----HHHHHHHhcCCCc---c- Confidence 888888775542 222 1111234567778899999999987763 344444 6652 1 Q ss_pred cccCcccccCCCCCCcccccccccCCCCCcc Q lcl|NC_016071. 471 MSTDELLKLLGQDTSRSGDGMTAGSNGNGTG 501 (516) Q Consensus 471 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 501 (516) +....+.. ++-++ |...+..+ T Consensus 372 e~r~~e~l-------~~~~~---Gd~~~p~p 392 (392) T protein:vir:39 372 DLPAPENT-------NKKTT---GQSNEPVP 392 (392) T ss_pred ccchhcCC-------CCCCC---CCCCCCCC Confidence 11100000 00000 00011111 No 76 >protein:vir:6210 Length: 394 # NCBI annotation: Portal protein # Family: family:all:10882 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852590;genbank:gi:31415850;genbank:GeneID:1489208 Probab=99.45 E-value=7e-12 Score=81.73 Aligned_cols=384 Identities=13% Similarity=0.026 Sum_probs=189.7 Q ss_pred CC--ccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhh-cccccCCcccHHHHHHHhhChHHHHHHHHHHHH Q lcl|NC_016071. 1 MS--TRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVM-KVEELRWPCFLATVEAMKQDHTVSTALDTKYVF 77 (516) Q Consensus 1 ~~--~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~-~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~~ 77 (516) |. .|+.... .+ .+ +.- .+.... .......+..+ .-+..++-+.|.+|+..+... T Consensus 1 MGl~~~~~~~~--~~----~~--------~~~--------~~~~~~~~~~~~~~~~~v-t~~~al~~~~v~~~i~~Ia~~ 57 (394) T protein:vir:62 1 MGLRDRFSNYL--FK----KA--------EKR--------GYLDNVLGKSIRYSGVYV-TDSNILQSSDVYELLQDISNQ 57 (394) T ss_pred Cchhhhhhhhc--cC----CC--------Cch--------hhhhhhhhcccccCcccc-ChhhhhccHHHHHHHHHHHHh Confidence 22 2322110 00 00 000 000000 00000011111 123345678899999999999 Q ss_pred HhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceeeccc Q lcl|NC_016071. 
78 VTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITIDKI 156 (516) Q Consensus 78 v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l 156 (516) |.+++|++.-..+ +. .. .+.+...+.+-+...++.+++..+. +.+.+|-+++.+.- +....+..+ T Consensus 58 iA~lp~~v~~~~g-~~-~~---~~~~~~Ll~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~---------~~~~~~~~~ 123 (394) T protein:vir:62 58 MVLADIVVEDEFG-NE-IK---DDIALQILRNPNNYLTQSEFIKLMTNTYLLEGETFPILNG---------AQIHLASNV 123 (394) T ss_pred hcccceEEEcCCC-cc-cc---hhhHHHHhccCCCCCCHHHHHHHHHHHHHhcCCeEEEEec---------ceeeccccc Confidence 9999998864332 11 11 1222233444344456677776544 67888999886531 111111111 Q ss_pred cccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCccc Q lcl|NC_016071. 157 AFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPA 236 (516) Q Consensus 157 ~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~ 236 (516) . ...++++... . ...+..+|.+.++..++.. .+..+ T Consensus 124 ~------------~~~~~~~~~~--~-----------------------------~~~~~~~~~~eiih~r~~~-~d~~~ 159 (394) T protein:vir:62 124 F------------TELDDNLVEH--F-----------------------------NIGGHEIPPCMIRHVKNIG-ADHLR 159 (394) T ss_pred e------------EEECCceEEE--E-----------------------------eeCCEEechhheEEecCcC-CCCcc Confidence 1 1122222110 0 0123456777766555544 44578 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcc-cce--EEE Q lcl|NC_016071. 237 GVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAG-EQA--YFI 313 (516) Q Consensus 237 G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g-~~a--~~i 313 (516) |.|++..++...-.-....++...+....+.+=-+++.+.. 
..+++..+ +++++.......| ..+ .+| T Consensus 160 G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~------~~~~~~~~---~~~~~~~~~~~~g~~n~g~~~v 230 (394) T protein:vir:62 160 GKGILDLGRDTLEGVMSAEKTLTDKYKKGGLLTFLLNLDAH------INPQNGAQ---SKLINAILDQLESIDEARSVKM 230 (394) T ss_pred ccChHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEEeCCC------CCcCHHHH---HHHHHHHHHHhccccccCceeE Confidence 99999999876665566666666666654444333333211 11222222 2233333222223 223 357 Q ss_pred eccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHH-HHHHHHHHHHHHHHHH Q lcl|NC_016071. 314 LPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSE-SKQSIHGHFVQRDIDI 392 (516) Q Consensus 314 iP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~-vh~ev~~~~~~aDa~~ 392 (516) +|.|++.++. ..+.+.....+.+..++...+|++++--.....+. ++++-.+ .-......-+.-.++. T Consensus 231 l~~g~~~~~~--------~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~---~~~sn~e~~~~~~~~~~l~P~~~~ 299 (394) T protein:vir:62 231 IPLGKGYSID--------TLKSPLDDEKTLAYLNVYKKDLGKFLGINVDTYTE---LIKEDIEKAMMYIHNKAVRPIMKN 299 (394) T ss_pred eeCCCceeEE--------ecCCCcchHHHHHHHHHHHHHHHHHhCCCHHHcCC---CCCcCHHHHHHHHHHHHHHHHHHH Confidence 8888754332 22222233346666678889999998776644432 1222222 2233445567778888 Q ss_pred HHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCc--c Q lcl|NC_016071. 393 IVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPE--D 470 (516) Q Consensus 393 i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~--~ 470 (516) |++.||+.|+.+- .. 
..-+|.|+...-.+.+..++++.++++.|++.+ +.+|+.+|+|+-.++ + T Consensus 300 ie~~l~~kll~~~--------~~-~~~~~~fd~~~~~~~~~~~~~~~~~~~~g~~T~-----NE~R~~~gl~p~~~~~gd 365 (394) T protein:vir:62 300 FEDHLSLLFYAQN--------SG-KRIKFKINILDFVTYSNKTNIGYNLVRTAITSP-----DNVADMLGFPKQNTKESQ 365 (394) T ss_pred HHHHHhhhhcCcc--------cc-CceEEEechhhhcCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCCCC Confidence 8888887665421 11 123678887777777788899999999998876 579999999964222 1 Q ss_pred cccCcccccCCCCCCcccccccccCCCCCcccccccccchhhhhcC Q lcl|NC_016071. 471 MSTDELLKLLGQDTSRSGDGMTAGSNGNGTGKISSTRDNSVSNMDN 516 (516) Q Consensus 471 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~ 516 (516) ... . .+...+-... +......++.+++. | T Consensus 366 ~~~-~-~~n~~~~~~~-----------~~~~~~~kgge~~e----n 394 (394) T protein:vir:62 366 AIY-I-SNDVTEIGKK-----------EATDGSLGGGEENE----N 394 (394) T ss_pred eee-c-cccccccccc-----------ccccccCCCCCCCC----C Confidence 111 0 0000000000 00001111111110 1 No 77 >protein:vir:100187 Length: 385 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025029;genbank:gi:48697262;genbank:GeneID:2948285 Probab=99.44 E-value=9.6e-13 Score=86.46 Aligned_cols=372 Identities=11% Similarity=0.073 Sum_probs=190.6 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHH-HHHhhChHHHHHHHHHHHHHh Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATV-EAMKQDHTVSTALDTKYVFVT 79 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y-~~m~~D~~v~s~l~~Rk~~v~ 79 (516) |+.|.-..... ..+..+..+ .. +.........+ .+. +..++.+.|.+|+..+-..|. 
T Consensus 4 ~~~~~~~~~~~---~~~~~~~~~----~~-------~~~~~~~~~~~--------~v~~~~al~~~~v~~~i~~ia~~ia 61 (385) T protein:vir:10 4 LTPRNFNKRKA---KNMVYPSNP----AF-------FTTTVGGMQLS--------YVSALSALQNTNVYSVINRIASDVA 61 (385) T ss_pred ccchhcccccc---cccccccch----hh-------hhhhccccCcc--------ccCHHHhhccHHHHHHHHHHHHHHh Confidence 33332111100 000000000 00 00000000000 011 234568899999999999999 Q ss_pred cCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceeeccccc Q lcl|NC_016071. 80 KAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITIDKIAF 158 (516) Q Consensus 80 ~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l~~ 158 (516) ++++++.-. . ....|++-+...++.+++..+. +.+.+|-++++++... .+++ | T Consensus 62 ~~p~~v~~~-----~--------~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~r~~-------~~~~------p 115 (385) T protein:vir:10 62 SAHFKTENT-----A--------TLNRLESPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQN-------LEHI------P 115 (385) T ss_pred hCceeeecc-----c--------hhhhhhcCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcCc-------eeEe------e Confidence 999887421 1 1223444444567788888766 4667999999987531 1222 2 Q ss_pred cCchhcccccceeecCCCc-eeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCc--CCcc Q lcl|NC_016071. 159 RPQSSLSRSKPWVFDEDGR-TLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGT--ESNP 235 (516) Q Consensus 159 r~q~ti~~~~~f~~~~dg~-~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~--~g~p 235 (516) -++.++ ....++. ..... ..........+|.+-.|++++... .+.. T Consensus 116 ~~~~~v------~~~~~~~~~~~~~-------------------------~~~~~~~~~~~~~~eiihik~~~~~~~~~~ 164 (385) T protein:vir:10 116 NSDVQI------NYLPGNMGIVYTV-------------------------LESNDRPQMVLRQDQMLHFRLMPDPQYRYL 164 (385) T ss_pred cCCceE------EEEEcCCceEEEE-------------------------EEcCCceEEEEccccEEEeccCCCCccccc Confidence 222222 1222221 11110 011122344577777666664332 3456 Q ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceE--EE Q lcl|NC_016071. 
236 AGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAY--FI 313 (516) Q Consensus 236 ~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~--~i 313 (516) +|.|.+..|....-......++-..+...-+.+--+++.+ ..-.+.+ ..+.+++.......|..++ ++ T Consensus 165 ~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~gil~~~-------~~~~~~e---~~~~~~~~~~~~~~~~n~~~~~v 234 (385) T protein:vir:10 165 IGRSPLESLQNALNLDDKASKSNMSAMENQINPAGKLTIS-------NYLSDGK---DLESAREEFEKANTGDNSGRLMV 234 (385) T ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeC-------CCCCCHH---HHHHHHHHHHHHhCccccCCccc Confidence 8999999999877777777777777766433333233321 1111122 2344555555555565554 66 Q ss_pred eccCcccccccccceeeeeccccCcchhH-HHHHHHHHHHHHHHHhcccccccCCc--cchhhHHHHHHHHHHHHHHHHH Q lcl|NC_016071. 314 LPSDMNAQGGEQYKMSLKGIDGAGKQYST-QELVNSRKKAILDRFGAGFINLGNDG--QGSYNLSESKQSIHGHFVQRDI 390 (516) Q Consensus 314 iP~g~~i~~~e~~~iel~~~~g~g~~~~~-~~li~~~d~~Isk~iLGqtLts~~~~--~GS~Al~~vh~ev~~~~~~aDa 390 (516) +|.|++++. ++. +.....+ .+..++..++|++++--..--.+... ..+++-.+.+...+.+-+.-.+ T Consensus 235 l~~g~~~~~--------l~~--~~~d~~~l~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~~sn~eq~~~~~~~~l~P~~ 304 (385) T protein:vir:10 235 LPDGFDYTQ--------LEM--KTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQHSNIDQIKATYLANLNSYV 304 (385) T ss_pred cCCCceEEe--------cCC--ChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCcccccHHHHHHHHHHHHHHHH Confidence 777764332 222 2222233 35567778899998877553333221 1233444444445555677888 Q ss_pred HHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcc Q lcl|NC_016071. 391 DIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPED 470 (516) Q Consensus 391 ~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~ 470 (516) +.|++.||+.|+. 
+ .-+|.++..-..|.++.+++++++++.|++.+ +.+|+.+|+++-.+++ T Consensus 305 ~~ie~~l~~~l~~----------~---~~~f~~~~ll~~d~~~~~~~~~~~~~~G~~T~-----NE~R~~~g~~p~p~~~ 366 (385) T protein:vir:10 305 NPIVDELRLKMNA----------P---DLELDIKDMLDVDDSALINQVSNLAKSGVLGA-----EQAQFILTRSGFLPDN 366 (385) T ss_pred HHHHHHHHHhhCC----------c---eEEeechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCccCCCC Confidence 8888888876521 1 12344445556788999999999999999876 5789999886432322 Q ss_pred cccCcccccCCCCCCcccccccccCCCCC Q lcl|NC_016071. 471 MSTDELLKLLGQDTSRSGDGMTAGSNGNG 499 (516) Q Consensus 471 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 499 (516) .+... ++....+.|+ .++. T Consensus 367 ~~~~~----~~~~~~~~g~------~~dn 385 (385) T protein:vir:10 367 LPEFK----PLTTQVKGGD------EGDN 385 (385) T ss_pred Ccccc----CcccccCCCC------CCCC Confidence 21111 1111111111 1111 No 78 >protein:vir:101289 Length: 395 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908829;genbank:gi:118725093;genbank:GeneID:4555860 Probab=99.42 E-value=3.6e-12 Score=83.35 Aligned_cols=387 Identities=11% Similarity=0.042 Sum_probs=176.4 Q ss_pred CCc--cccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHH-HHHhhChHHHHHHHHHHHH Q lcl|NC_016071. 1 MST--RFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATV-EAMKQDHTVSTALDTKYVF 77 (516) Q Consensus 1 ~~~--r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y-~~m~~D~~v~s~l~~Rk~~ 77 (516) |+= ++-... +.+ ..-+.+.-...++ +..++-+.|.+|+..+-.. 
T Consensus 1 Mg~f~~lf~~~-------~~~--------------------------~~~~~~~~~~~v~~~~~~~~~~v~~~i~~Ia~~ 47 (395) T protein:vir:10 1 MSILEKIFKTR-------KDI--------------------------TYMLDLDMIEDLSQQAYVKRLAIDSCIEFVARA 47 (395) T ss_pred CchhhhhhccC-------ccc--------------------------cccccchhccccchhhhhhhHHHHHHHHHHHHh Confidence 221 110000 000 0000011111122 2234578999999999999 Q ss_pred HhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHHHHH-hhcceeeeEEEeecccccccccceeeccc Q lcl|NC_016071. 78 VTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAATFN-EYGFSIFEKVYRTESAPSKYAGYITIDKI 156 (516) Q Consensus 78 v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~lda~-~~G~S~~Eivw~~~~~~~~~~g~~~~~~l 156 (516) |.+++|.+.-. +...+...+.++. .+-+...++.++++.++..+ ..|-++. ++. ++ .+.+.+... T Consensus 48 iA~~p~~~~~~---~~~~~~~~~~ll~---~~PN~~~t~~~f~~~~~~~lll~g~~~~-~~~--~~-----~~~~~~~~~ 113 (395) T protein:vir:10 48 VAQSHFKVLEG---NRIQKNDVYYKLN---IKPNTDLSSDSFWQQVIYKLIYDNEVLI-VVS--DS-----KELLIADSF 113 (395) T ss_pred hccceeEeccC---CccccchHHHHHH---hccCcCCCHHHHHHHHHHHHhhCCceEE-EEe--cC-----CCeEecCCc Confidence 99999876521 1222223333322 12233456677777766544 4454443 222 21 122222211 Q ss_pred cccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCccc Q lcl|NC_016071. 157 AFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPA 236 (516) Q Consensus 157 ~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~ 236 (516) ...+. . .++. .... + ......-...+|...+|.+++....+..+ T Consensus 114 ~~~~~-~-------~~~~---~~~~----------~---------------~~~~~~~~~~~~~~evih~~~~~~~~~~~ 157 (395) T protein:vir:10 114 YREEY-A-------LYDD---IFKD----------V---------------TVKDYTYQRTFTMQEVIYLKYNNNKVTHF 157 (395) T ss_pred cceeE-e-------ecCc---ceeE----------E---------------EEcCceeeeeeccccEEEEccCCCCcccc Confidence 11110 0 0000 0000 0 00001112346778877777777888899 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHH-HHHHhhcccceEEEec Q lcl|NC_016071. 
237 GVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMA-DAANAHAGEQAYFILP 315 (516) Q Consensus 237 G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~-~~~~~~~g~~a~~iiP 315 (516) |.|++..+.... .....+..+.+. +++++.......+++.++.+++..+ ......++..+.++++ T Consensus 158 G~spi~~~~~~~-------~~~~~~~~~~~~-------~~gii~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~v~~l~ 223 (395) T protein:vir:10 158 VESLFEDYGKIF-------GRMIGAQLKNYQ-------IRGILKSASSAYDEKNIEKLQAFTNKLFNTFNKNQLAIAPLI 223 (395) T ss_pred cchHHHHHHHHH-------HHHHHHHHhcCC-------CceEEEeCCCCCCHHHHHHHHHHHHHHhccccccCcceEEcC Confidence 999998776432 112223333333 2333322222223344433333222 2211122222333457 Q ss_pred cCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhh-HHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016071. 316 SDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYN-LSESKQSIHGHFVQRDIDIIV 394 (516) Q Consensus 316 ~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~A-l~~vh~ev~~~~~~aDa~~i~ 394 (516) .|++.+... +......-...+|.+..++..++|++++--..--.+ |+++ ..+.........+.--+++|+ T Consensus 224 ~g~~~~~l~-----~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~----~~~sn~e~~~~~~~~~~l~P~~~~ie 294 (395) T protein:vir:10 224 EGFDYEELS-----NGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIY----GETADLEKNTLVFEKFCLTPLLKKIQ 294 (395) T ss_pred CCceeeecc-----ccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhc----CcccCHHHHHHHHHHHHHHHHHHHHH Confidence 776533221 110100111123555666778899998876543332 3332 344445555667888889999 Q ss_pred HHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCc--ccc Q lcl|NC_016071. 395 EAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPE--DMS 472 (516) Q Consensus 395 ~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~--~~~ 472 (516) ..||+.|+.+--... .-+|.++.....|.++.+++++++++.|++.+ +.+|+.+|+|+-.++ |+. 
T Consensus 295 ~~l~~kL~~~~~~~~--------~~~f~~~~l~~~D~~~~~~~~~~~~~~G~lt~-----NE~R~~~g~~p~~~g~~d~~ 361 (395) T protein:vir:10 295 NELNAKLITQSMYLK--------DTRIEIVGVNKKDPLQYAEAIDKLVSSGSFTR-----NEVRIMLGEEPSDNPELDEY 361 (395) T ss_pred HHHHHhhcChhhhcc--------cceecchhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCCCcee Confidence 999988765421111 11455556667888999999999999999876 578999999965433 222 Q ss_pred cCcccccCCCCCCcccccccccCCCCCcccccccccchhhhhcC Q lcl|NC_016071. 473 TDELLKLLGQDTSRSGDGMTAGSNGNGTGKISSTRDNSVSNMDN 516 (516) Q Consensus 473 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~ 516 (516) .-.. ...+-+.....+.. +....+++.|+.... | T Consensus 362 ~~~~-n~~~~~~~~~~~~~-------~~~~~~kgg~~~~~g--~ 395 (395) T protein:vir:10 362 LITK-NYEKANSGENDEKE-------KDENTLKGGDEDESG--D 395 (395) T ss_pred eecc-ccccccccccccCc-------ccccccCCCCCCCCC--C Confidence 1110 00000000000000 000011111111000 0 No 79 >protein:vir:9507 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835554;genbank:gi:30043953;genbank:GeneID:1260535 Probab=99.42 E-value=3.6e-12 Score=83.35 Aligned_cols=387 Identities=11% Similarity=0.042 Sum_probs=176.4 Q ss_pred CCc--cccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHH-HHHhhChHHHHHHHHHHHH Q lcl|NC_016071. 1 MST--RFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATV-EAMKQDHTVSTALDTKYVF 77 (516) Q Consensus 1 ~~~--r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y-~~m~~D~~v~s~l~~Rk~~ 77 (516) |+= ++-... +.+ ..-+.+.-...++ +..++-+.|.+|+..+-.. 
T Consensus 1 Mg~f~~lf~~~-------~~~--------------------------~~~~~~~~~~~v~~~~~~~~~~v~~~i~~Ia~~ 47 (395) T protein:vir:95 1 MSILEKIFKTR-------KDI--------------------------TYMLDLDMIEDLSQQAYVKRLAIDSCIEFVARA 47 (395) T ss_pred CchhhhhhccC-------ccc--------------------------cccccchhccccchhhhhhhHHHHHHHHHHHHh Confidence 221 110000 000 0000011111122 2234578999999999999 Q ss_pred HhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHHHHH-hhcceeeeEEEeecccccccccceeeccc Q lcl|NC_016071. 78 VTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAATFN-EYGFSIFEKVYRTESAPSKYAGYITIDKI 156 (516) Q Consensus 78 v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~lda~-~~G~S~~Eivw~~~~~~~~~~g~~~~~~l 156 (516) |.+++|.+.-. +...+...+.++. .+-+...++.++++.++..+ ..|-++. ++. ++ .+.+.+... T Consensus 48 iA~~p~~~~~~---~~~~~~~~~~ll~---~~PN~~~t~~~f~~~~~~~lll~g~~~~-~~~--~~-----~~~~~~~~~ 113 (395) T protein:vir:95 48 VAQSHFKVLEG---NRIQKNDVYYKLN---IKPNTDLSSDSFWQQVIYKLIYDNEVLI-VVS--DS-----KELLIADSF 113 (395) T ss_pred hccceeEeccC---CccccchHHHHHH---hccCcCCCHHHHHHHHHHHHhhCCceEE-EEe--cC-----CCeEecCCc Confidence 99999876521 1222223333322 12233456677777766544 4454443 222 21 122222211 Q ss_pred cccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCccc Q lcl|NC_016071. 157 AFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPA 236 (516) Q Consensus 157 ~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~ 236 (516) ...+. . .++. .... + ......-...+|...+|.+++....+..+ T Consensus 114 ~~~~~-~-------~~~~---~~~~----------~---------------~~~~~~~~~~~~~~evih~~~~~~~~~~~ 157 (395) T protein:vir:95 114 YREEY-A-------LYDD---IFKD----------V---------------TVKDYTYQRTFTMQEVIYLKYNNNKVTHF 157 (395) T ss_pred cceeE-e-------ecCc---ceeE----------E---------------EEcCceeeeeeccccEEEEccCCCCcccc Confidence 11110 0 0000 0000 0 00001112346778877777777888899 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHH-HHHHhhcccceEEEec Q lcl|NC_016071. 
237 GVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMA-DAANAHAGEQAYFILP 315 (516) Q Consensus 237 G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~-~~~~~~~g~~a~~iiP 315 (516) |.|++..+.... .....+..+.+. +++++.......+++.++.+++..+ ......++..+.++++ T Consensus 158 G~spi~~~~~~~-------~~~~~~~~~~~~-------~~gii~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~v~~l~ 223 (395) T protein:vir:95 158 VESLFEDYGKIF-------GRMIGAQLKNYQ-------IRGILKSASSAYDEKNIEKLQAFTNKLFNTFNKNQLAIAPLI 223 (395) T ss_pred cchHHHHHHHHH-------HHHHHHHHhcCC-------CceEEEeCCCCCCHHHHHHHHHHHHHHhccccccCcceEEcC Confidence 999998776432 112223333333 2333322222223344433333222 2211122222333457 Q ss_pred cCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhh-HHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016071. 316 SDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYN-LSESKQSIHGHFVQRDIDIIV 394 (516) Q Consensus 316 ~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~A-l~~vh~ev~~~~~~aDa~~i~ 394 (516) .|++.+... +......-...+|.+..++..++|++++--..--.+ |+++ ..+.........+.--+++|+ T Consensus 224 ~g~~~~~l~-----~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~----~~~sn~e~~~~~~~~~~l~P~~~~ie 294 (395) T protein:vir:95 224 EGFDYEELS-----NGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIY----GETADLEKNTLVFEKFCLTPLLKKIQ 294 (395) T ss_pred CCceeeecc-----ccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhc----CcccCHHHHHHHHHHHHHHHHHHHHH Confidence 776533221 110100111123555666778899998876543332 3332 344445555667888889999 Q ss_pred HHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCc--ccc Q lcl|NC_016071. 395 EAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPE--DMS 472 (516) Q Consensus 395 ~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~--~~~ 472 (516) ..||+.|+.+--... .-+|.++.....|.++.+++++++++.|++.+ +.+|+.+|+|+-.++ |+. 
T Consensus 295 ~~l~~kL~~~~~~~~--------~~~f~~~~l~~~D~~~~~~~~~~~~~~G~lt~-----NE~R~~~g~~p~~~g~~d~~ 361 (395) T protein:vir:95 295 NELNAKLITQSMYLK--------DTRIEIVGVNKKDPLQYAEAIDKLVSSGSFTR-----NEVRIMLGEEPSDNPELDEY 361 (395) T ss_pred HHHHHhhcChhhhcc--------cceecchhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCCCcee Confidence 999988765421111 11455556667888999999999999999876 578999999965433 222 Q ss_pred cCcccccCCCCCCcccccccccCCCCCcccccccccchhhhhcC Q lcl|NC_016071. 473 TDELLKLLGQDTSRSGDGMTAGSNGNGTGKISSTRDNSVSNMDN 516 (516) Q Consensus 473 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~ 516 (516) .-.. ...+-+.....+.. +....+++.|+.... | T Consensus 362 ~~~~-n~~~~~~~~~~~~~-------~~~~~~kgg~~~~~g--~ 395 (395) T protein:vir:95 362 LITK-NYEKANSGENDEKE-------KDENTLKGGDEDESG--D 395 (395) T ss_pred eecc-ccccccccccccCc-------ccccccCCCCCCCCC--C Confidence 1110 00000000000000 000011111111000 0 No 80 >protein:vir:100650 Length: 395 # NCBI annotation: 77ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958604;genbank:gi:41189523;genbank:GeneID:2743796 Probab=99.42 E-value=3.6e-12 Score=83.35 Aligned_cols=387 Identities=11% Similarity=0.042 Sum_probs=176.4 Q ss_pred CCc--cccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHH-HHHhhChHHHHHHHHHHHH Q lcl|NC_016071. 1 MST--RFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATV-EAMKQDHTVSTALDTKYVF 77 (516) Q Consensus 1 ~~~--r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y-~~m~~D~~v~s~l~~Rk~~ 77 (516) |+= ++-... +.+ ..-+.+.-...++ +..++-+.|.+|+..+-.. 
T Consensus 1 Mg~f~~lf~~~-------~~~--------------------------~~~~~~~~~~~v~~~~~~~~~~v~~~i~~Ia~~ 47 (395) T protein:vir:10 1 MSILEKIFKTR-------KDI--------------------------TYMLDLDMIEDLSQQAYVKRLAIDSCIEFVARA 47 (395) T ss_pred CchhhhhhccC-------ccc--------------------------cccccchhccccchhhhhhhHHHHHHHHHHHHh Confidence 221 110000 000 0000011111122 2234578999999999999 Q ss_pred HhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHHHHH-hhcceeeeEEEeecccccccccceeeccc Q lcl|NC_016071. 78 VTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAATFN-EYGFSIFEKVYRTESAPSKYAGYITIDKI 156 (516) Q Consensus 78 v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~lda~-~~G~S~~Eivw~~~~~~~~~~g~~~~~~l 156 (516) |.+++|.+.-. +...+...+.++. .+-+...++.++++.++..+ ..|-++. ++. ++ .+.+.+... T Consensus 48 iA~~p~~~~~~---~~~~~~~~~~ll~---~~PN~~~t~~~f~~~~~~~lll~g~~~~-~~~--~~-----~~~~~~~~~ 113 (395) T protein:vir:10 48 VAQSHFKVLEG---NRIQKNDVYYKLN---IKPNTDLSSDSFWQQVIYKLIYDNEVLI-VVS--DS-----KELLIADSF 113 (395) T ss_pred hccceeEeccC---CccccchHHHHHH---hccCcCCCHHHHHHHHHHHHhhCCceEE-EEe--cC-----CCeEecCCc Confidence 99999876521 1222223333322 12233456677777766544 4454443 222 21 122222211 Q ss_pred cccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCccc Q lcl|NC_016071. 157 AFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPA 236 (516) Q Consensus 157 ~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~ 236 (516) ...+. . .++. .... + ......-...+|...+|.+++....+..+ T Consensus 114 ~~~~~-~-------~~~~---~~~~----------~---------------~~~~~~~~~~~~~~evih~~~~~~~~~~~ 157 (395) T protein:vir:10 114 YREEY-A-------LYDD---IFKD----------V---------------TVKDYTYQRTFTMQEVIYLKYNNNKVTHF 157 (395) T ss_pred cceeE-e-------ecCc---ceeE----------E---------------EEcCceeeeeeccccEEEEccCCCCcccc Confidence 11110 0 0000 0000 0 00001112346778877777777888899 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHH-HHHHhhcccceEEEec Q lcl|NC_016071. 
237 GVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMA-DAANAHAGEQAYFILP 315 (516) Q Consensus 237 G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~-~~~~~~~g~~a~~iiP 315 (516) |.|++..+.... .....+..+.+. +++++.......+++.++.+++..+ ......++..+.++++ T Consensus 158 G~spi~~~~~~~-------~~~~~~~~~~~~-------~~gii~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~v~~l~ 223 (395) T protein:vir:10 158 VESLFEDYGKIF-------GRMIGAQLKNYQ-------IRGILKSASSAYDEKNIEKLQAFTNKLFNTFNKNQLAIAPLI 223 (395) T ss_pred cchHHHHHHHHH-------HHHHHHHHhcCC-------CceEEEeCCCCCCHHHHHHHHHHHHHHhccccccCcceEEcC Confidence 999998776432 112223333333 2333322222223344433333222 2211122222333457 Q ss_pred cCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhh-HHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016071. 316 SDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYN-LSESKQSIHGHFVQRDIDIIV 394 (516) Q Consensus 316 ~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~A-l~~vh~ev~~~~~~aDa~~i~ 394 (516) .|++.+... +......-...+|.+..++..++|++++--..--.+ |+++ ..+.........+.--+++|+ T Consensus 224 ~g~~~~~l~-----~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~----~~~sn~e~~~~~~~~~~l~P~~~~ie 294 (395) T protein:vir:10 224 EGFDYEELS-----NGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIY----GETADLEKNTLVFEKFCLTPLLKKIQ 294 (395) T ss_pred CCceeeecc-----ccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhc----CcccCHHHHHHHHHHHHHHHHHHHHH Confidence 776533221 110100111123555666778899998876543332 3332 344445555667888889999 Q ss_pred HHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCc--ccc Q lcl|NC_016071. 395 EAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPE--DMS 472 (516) Q Consensus 395 ~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~--~~~ 472 (516) ..||+.|+.+--... .-+|.++.....|.++.+++++++++.|++.+ +.+|+.+|+|+-.++ |+. 
T Consensus 295 ~~l~~kL~~~~~~~~--------~~~f~~~~l~~~D~~~~~~~~~~~~~~G~lt~-----NE~R~~~g~~p~~~g~~d~~ 361 (395) T protein:vir:10 295 NELNAKLITQSMYLK--------DTRIEIVGVNKKDPLQYAEAIDKLVSSGSFTR-----NEVRIMLGEEPSDNPELDEY 361 (395) T ss_pred HHHHHhhcChhhhcc--------cceecchhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCCCcee Confidence 999988765421111 11455556667888999999999999999876 578999999965433 222 Q ss_pred cCcccccCCCCCCcccccccccCCCCCcccccccccchhhhhcC Q lcl|NC_016071. 473 TDELLKLLGQDTSRSGDGMTAGSNGNGTGKISSTRDNSVSNMDN 516 (516) Q Consensus 473 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~ 516 (516) .-.. ...+-+.....+.. +....+++.|+.... | T Consensus 362 ~~~~-n~~~~~~~~~~~~~-------~~~~~~kgg~~~~~g--~ 395 (395) T protein:vir:10 362 LITK-NYEKANSGENDEKE-------KDENTLKGGDEDESG--D 395 (395) T ss_pred eecc-ccccccccccccCc-------ccccccCCCCCCCCC--C Confidence 1110 00000000000000 000011111111000 0 No 81 >protein:vir:80134 Length: 403 # NCBI annotation: Phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425602;genbank:gi:155042935;genbank:GeneID:5469563 Probab=99.42 E-value=9.8e-12 Score=80.95 Aligned_cols=393 Identities=11% Similarity=0.083 Sum_probs=186.0 Q ss_pred CC--ccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhhChHHHHHHHHHHHHH Q lcl|NC_016071. 1 MS--TRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQDHTVSTALDTKYVFV 78 (516) Q Consensus 1 ~~--~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~~v 78 (516) |. ..+.+..+. 
.|..+ +..+........+ ....|..+...+.|.+|+..+-..| T Consensus 1 Mg~~~~f~~k~~~------~~~~~--------------~~~~~~~~~~~~~----~~~~~~~~~~~~~V~~~I~~ia~~i 56 (403) T protein:vir:80 1 MGLFNFFRRKTRS------EPTNA--------------ISWFLTQEAYDTL----AIPGYTRLSDNPEVRMAVHKIAELI 56 (403) T ss_pred Ccccccccccccc------cccch--------------hhhhccccccccc----ccchhhhhhhhHHHHHHHHHHHHhh Confidence 33 334332211 11000 0000000000001 1122445656789999999999999 Q ss_pred hcCCceeeeCCCCCChhhHHHHHHHHHHHh-hccCcCCHHHHHHHHHH-HHh--hcceeeeEEEeecccccccccceeec Q lcl|NC_016071. 79 TKAFNDFKVLYNRDSKASKDAAEFVEYALK-NLANQQTLRDIARSAAT-FNE--YGFSIFEKVYRTESAPSKYAGYITID 154 (516) Q Consensus 79 ~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~-~~~~~~~~~~~l~~~ld-a~~--~G~S~~Eivw~~~~~~~~~~g~~~~~ 154 (516) .++++++.-... +. .+++..-+...|. +-+...+..+++..++. .+. +|+++++++|...+ ++ . T Consensus 57 A~~p~~~~~~~~-~g--~~~~~~~~~~lL~~~PN~~~t~~~f~~~~v~~~ll~~~Gna~i~~~~~~~g---~~------~ 124 (403) T protein:vir:80 57 SSMTIHLMQNTD-NG--DIRIKNELSRKIDINPYSLMTRKAWMYNIVYTMLLDGEGNSVVFPKYTTSG---LI------D 124 (403) T ss_pred hhCceEEEEecC-Cc--eeecCChHHHHHhccCCcCCCHHHHHHHHHHHHhhcCCccEEEEEEEcCCC---cE------E Confidence 999988643221 11 1111222333343 33344566777777653 443 68899999885432 12 2 Q ss_pred cccccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCc Q lcl|NC_016071. 155 KIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESN 234 (516) Q Consensus 155 ~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~ 234 (516) .|.+.++.++. +..+.+|..+. 
+ .+..+|.+.++.++....+.+ T Consensus 125 ~L~~l~p~~v~----~~~~~~g~~~~-----------y---------------------~~~~~~~~eiih~~~~~~~~~ 168 (403) T protein:vir:80 125 ELIPLAPSKVS----FVDTDTGYQIW-----------Y---------------------QGKAYNYDEVLHFIVNPDPEK 168 (403) T ss_pred EEEEEcCCeeE----EEEcCCceEEE-----------E---------------------eecccchhhEEEEeccCCCcC Confidence 33333343332 22333332110 0 112355666666665444444 Q ss_pred -cccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHH-HHHHHHHHHHHHHhhcccceEE Q lcl|NC_016071. 235 -PAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPE-SEMVQGLMADAANAHAGEQAYF 312 (516) Q Consensus 235 -p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~-~~~l~~l~~~~~~~~~g~~a~~ 312 (516) .+|.|.+..+....-.-....++...+...-+.|--+++.+ ...+... ++..+...+...... .....+ T Consensus 169 ~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~--------~~~~~~~~~~~~~~~~~~~~~~~-~~g~~~ 239 (403) T protein:vir:80 169 PYMGRGYRVVLKDIVNNLKQATTTKKSFMSGKYMPSLIVKVD--------AATAELSSEEGRNAVFKKYLEAS-EAGQPW 239 (403) T ss_pred ccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeC--------CCCChHHHHHHHHHHHHHHhhhh-hcCCee Confidence 46999888777666555555566666665433333333322 1112222 222222222221111 112235 Q ss_pred EeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_016071. 313 ILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQSIHGHFVQRDIDI 392 (516) Q Consensus 313 iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~ev~~~~~~aDa~~ 392 (516) ++|.+.. +. .++... +.....+.+..++...+|++++--..--.+.....+ +.........+.--++. 
T Consensus 240 ~~~~~~~-~~-----~~~~~l--~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~----~~~~~f~~~~l~P~~~~ 307 (403) T protein:vir:80 240 IIPAELL-DV-----EQVKPL--SLKDLAIHETVELDKRTVAGIFGVPAFLLGVGKYDK----DEYNNFINSTILPIAKG 307 (403) T ss_pred eeccccc-cc-----ceeccC--CHHHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCccH----HHHHHHHHHHHHHHHHH Confidence 6676642 11 112111 112234667778888999998877652222111111 11223445556677778 Q ss_pred HHHHHHHHHHHHHHHhcCCcCCccccceEEec--CcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcc Q lcl|NC_016071. 393 IVEAFNKNLIPQLLALNDIRLSDEDMPKLKPG--LIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPED 470 (516) Q Consensus 393 i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~--~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~ 470 (516) |++.||+.|+. +.+ + +|+|+ ..-..|.++.++++.++++.|++.+ +.+|+.+|+|+-..+| T Consensus 308 ie~~l~~kll~----------~~~-~-~~~f~~~~ll~~d~~~~~~~~~~~~~~Gi~t~-----NE~R~~~gl~p~~ggd 370 (403) T protein:vir:80 308 IEQELTRKLLI----------SPD-L-YFKFNPRSLYAYDLKELAEVGSNMYVRGLMEG-----NEVRDWLGLSPKEGLS 370 (403) T ss_pred HHHHHHHhccC----------CCC-c-EEEeechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCC Confidence 88888775542 111 2 35554 3445688999999999999999876 5799999999654444 Q ss_pred cccCcccccCCCCCCcccccccccCCCCCccccc Q lcl|NC_016071. 471 MSTDELLKLLGQDTSRSGDGMTAGSNGNGTGKIS 504 (516) Q Consensus 471 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 504 (516) +..... 
.-.|-+........+.+.......+.- T Consensus 371 ~~~~~~-n~~pl~~~~~~~~~k~ge~~~~~~~~~ 403 (403) T protein:vir:80 371 ELVILE-NYIPLDKIGDQNKLKGGEKGGADGQTD 403 (403) T ss_pred eEeecc-cccchhhccchhhccCCCCCCCCCCCC Confidence 322111 100000000011111111111111110 No 82 >protein:vir:94666 Length: 723 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579205;genbank:gi:93007441;genbank:GeneID:5076785 Probab=99.41 E-value=1.1e-11 Score=80.57 Aligned_cols=418 Identities=9% Similarity=0.032 Sum_probs=192.0 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhhChHHHHHHHHHHHHHhc Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQDHTVSTALDTKYVFVTK 80 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~~v~~ 80 (516) ||+ .|+-.+ +...|..... .+-.- +..++-+.|.+|+..+-..|.+ T Consensus 1 ~~~--------------~~~~~g---------~~~~~~~~~~--------~~~~~---~~~~~~~~V~acV~~Ia~~iA~ 46 (723) T protein:vir:94 1 MTT--------------FPSGAG---------GWNAWSADSV--------FGNGA---KGWSNSAVAYRCISMLANNAAS 46 (723) T ss_pred Ccc--------------cccCCC---------cccccccccc--------ccccH---HHHhhhHHHHHHHHHHHHhhcc Confidence 221 111111 1111111000 00001 2234678999999999999999 Q ss_pred CCceeeeCCCCCChhhHHHHHHHHHHHh-hccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceeeccccc Q lcl|NC_016071. 81 AFNDFKVLYNRDSKASKDAAEFVEYALK-NLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITIDKIAF 158 (516) Q Consensus 81 ~~w~i~~~~~~d~~~~~~~a~~v~~~l~-~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l~~ 158 (516) ++|.+.-..+ +...+ .-+-..|. +-+...+..++...++ +.+.+|-+++++++.-+.....+. 
.|.+ T Consensus 47 lpl~l~~~~~-~~~~~----~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~r~~~g~p~------~l~~ 115 (723) T protein:vir:94 47 VDLVVRGPDG-ELDEL----HPLSQLWNVMPNRAMPAQVLKALSMTRLQLDGQCHLWLNYNGRTPAGVPD------EIWY 115 (723) T ss_pred ceeEEEcCCC-ccchh----hHHHHHHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCcccccee------EEEE Confidence 9998753221 11111 12333343 2344456677877766 577899999999875332222222 2222 Q ss_pred cCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCccccc Q lcl|NC_016071. 159 RPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPAGV 238 (516) Q Consensus 159 r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~G~ 238 (516) .++.... .....++.... +.... + +......+..+.+|....|.+++....+..+|. T Consensus 116 l~~~~~~----v~~~~~~~~~~---~~~~~------~----------y~~~~~~G~~~~~~~~dIiHir~~~~~dg~~G~ 172 (723) T protein:vir:94 116 VYDRVTT----IVATRAADAVP---QAQII------G----------YVIERTDGVRVPVLADEMLWLRFSDPYDPLAVM 172 (723) T ss_pred ecCcceE----EeecCCCccce---eeeee------E----------EEEEecCceeEEecccceEEecCCCCCCCcccc Confidence 2221100 01111111000 00000 0 000011223345677776666655545667999 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcc-cce--EEEec Q lcl|NC_016071. 239 SPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAG-EQA--YFILP 315 (516) Q Consensus 239 gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g-~~a--~~iiP 315 (516) |.+..+....-.-....++...|... |+ -|.+++.. + .-+++. 
.+++++.......| .++ .++++ T Consensus 173 Spi~~a~~~i~~~~aa~~~~~~~f~N-G~------~p~giL~~-~-~l~~e~---~~~~~~~~~~~~~G~~Nagk~~vL~ 240 (723) T protein:vir:94 173 APWKAARAAVDADFYAATWQRQSFKN-GA------RPGGVVNL-G-DMDEQT---FTKTVAAFRSQVEGVQNAGRHLLIA 240 (723) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHhc-CC------CcceEEEc-C-CCCHHH---HHHHHHHHHHHhhchhhcCcceeec Confidence 99998887776666666666666653 32 12233321 1 112222 23333333322222 222 23443 Q ss_pred cCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhccc-ccccCCccchhhHHHHH-HHHHHHHHHHHHHHH Q lcl|NC_016071. 316 SDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGF-INLGNDGQGSYNLSESK-QSIHGHFVQRDIDII 393 (516) Q Consensus 316 ~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqt-Lts~~~~~GS~Al~~vh-~ev~~~~~~aDa~~i 393 (516) ..-........-+++...+-+.....|.+.-++..++|++++--.. +..+ +++++-.+.. ...-..-+.-.++.| T Consensus 241 g~~~~~~vl~~G~~~~~l~~s~~D~q~le~r~~~~~eIa~afgVPp~~i~~---~st~sN~e~~~~~f~~~tL~P~~~~i 317 (723) T protein:vir:94 241 GQGSDGGAAGKGATFTSLSMSPAEMDYINSRMHSAEEVMLAFGIRKDALLG---GSTYENQAEAKAAVWTETLIPQMEVM 317 (723) T ss_pred ccccccccccCCceEEEccCCHHHHHHHHHHHHhHHHHHHHhCCChhHcCC---CCCcccHHHHHHHHHHHHHHHHHHHH Confidence 2100000000012232322232334466777888899999888774 3322 1223222222 233456678888999 Q ss_pred HHHHHHHHHHHHHHhcCCcCCccccceEEecCcC--chhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCccc Q lcl|NC_016071. 394 VEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQ--EVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDM 471 (516) Q Consensus 394 ~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~--~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~ 471 (516) ++.||+.|++.. . .. -+|.|+... ..|.+..+++++++++.|++.+ +.+|+.+|+|+-..++. 
T Consensus 318 e~~ln~~Ll~~~---g----~~---~~~~f~~~~lLr~D~~~r~~~~~~~v~~G~~T~-----NE~R~~lglpPi~gGd~ 382 (723) T protein:vir:94 318 ASITDLQLLPDI---G----WT---VEWDFNSVPALQEDLEAQAGRNQGYLVNDVLMV-----DEVRATIGLDPLPGGIG 382 (723) T ss_pred HHHHhHhhcccc---c----Cc---eEEeecchhhhhcCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCcc Confidence 999998887531 1 11 145565433 4788999999999999999886 57999999986544442 Q ss_pred cc--Ccc-cc--cCCCCCCcccccccc---------------cCCCCCcc-----cccccccchhhhh----cC Q lcl|NC_016071. 472 ST--DEL-LK--LLGQDTSRSGDGMTA---------------GSNGNGTG-----KISSTRDNSVSNM----DN 516 (516) Q Consensus 472 ~~--~~~-~~--~~~~~~~~~~~~~~~---------------~~~~~~~~-----~~~~~~d~~~~~~----~~ 516 (516) .. .+. .. +.+...+...++... ..+..+++ ..+..-++..+.. += T Consensus 383 ~~~~~p~~~~~a~~~~~~p~~~e~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 456 (723) T protein:vir:94 383 QMTLTPYRAQFAPAPAPAPAVEEGAARMLALLERVAADRPLPELPVRATTVLHHDPGPDPQQTLYERLEALLQP 456 (723) T ss_pred cceeccccccccCCCCCCccchhhhHhhhhhccccccccCcCCCCCCCCCCCCCCcccCCchhHHHHHHHHHhh Confidence 21 110 00 111101111111100 00000000 0000000011111 00 No 83 >protein:vir:4952 Length: 386 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049928;genbank:gi:9632899;genbank:GeneID:1262075 Probab=99.40 E-value=2.1e-11 Score=79.16 Aligned_cols=380 Identities=13% Similarity=0.051 Sum_probs=187.8 Q ss_pred CC---ccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhhChHHHHHHHHHHHH Q lcl|NC_016071. 1 MS---TRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQDHTVSTALDTKYVF 77 (516) Q Consensus 1 ~~---~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~~ 77 (516) |. ..++........ .+ .+..............+..+. -+..++-+.|.+|+..+-.. 
T Consensus 1 M~~f~~~~~~~~~~~~~---~~----------------~~~~~~~~~~~~~~~~~~~v~-~~~al~~~~v~~~i~~ia~~ 60 (386) T protein:vir:49 1 MPIFNITNLATESPPIN---QE----------------SFFDIADSDFLASLNSSEWVS-AENALKNSDLFSIISQLSND 60 (386) T ss_pred CchhhhhccCCCCcccc---hh----------------hhhhhhhccccccccCCceec-hhhhhccHHHHHHHHHHHHH Confidence 22 211111110000 00 000000000001111111111 12334678999999999999 Q ss_pred HhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceeeccc Q lcl|NC_016071. 78 VTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITIDKI 156 (516) Q Consensus 78 v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l 156 (516) |.++++.+.-. . ++ ..+.+-+...++.++++.++ +.+.+|-+++++++...+ ++. .| T Consensus 61 ia~~p~~~~~~-----~-----~~---~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g---~~~------~l 118 (386) T protein:vir:49 61 LATAKITTSRK-----Q-----LQ---GIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNDNG---RDM------KW 118 (386) T ss_pred hhhCceeeccc-----h-----hh---hhhhccCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCC---cEE------EE Confidence 99999876411 1 11 12233334456788888877 466789999999986542 122 33 Q ss_pred cccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCccc Q lcl|NC_016071. 157 AFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPA 236 (516) Q Consensus 157 ~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~ 236 (516) -+.++.+++ ...+.++..+... +. ......+....+|.+.+|++++....+..+ T Consensus 119 ~~i~~~~v~----v~~~~~~~~~~y~---------~~-------------~~~~~~~~~~~~~~~evih~~~~~~~~~~~ 172 (386) T protein:vir:49 119 EYLRPSQVS----FNRLDNQNGLYYN---------IT-------------FDDPHIAPKQHVPQNDILHFRLLSVDGGLT 172 (386) T ss_pred EEecCceeE----EEEcCCCceEEEE---------EE-------------EcCccccceeEEccccEEEecCCCCCCccc Confidence 333433222 1223333222100 00 000112234467777766666655666689 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEecc Q lcl|NC_016071. 
237 GVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILPS 316 (516) Q Consensus 237 G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~ 316 (516) |.|.+..|....-.-....++...+...-+.+--+++.+ .....++.+.+..... ....+....+++|. T Consensus 173 G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~--------~~~~~~~~~~~~~~~~---~~~~n~g~~~vl~~ 241 (386) T protein:vir:49 173 SVSPLMALGREFNIQKASDKLTISALKNALNANGILKIK--------GGGLLDFKTKVSRSRQ---AMKQMQGGPLVLDD 241 (386) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHccCCccEEEEeC--------CCCChHHHHHHHHHHH---HhccCCCCceecCC Confidence 999999999877776777777777766544444444432 2233333333333222 22223334567777 Q ss_pred CcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016071. 317 DMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQSIHGHFVQRDIDIIVEA 396 (516) Q Consensus 317 g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~ev~~~~~~aDa~~i~~~ 396 (516) |++++ ..+-+....++.+..++...+|++++.-..--.+.++ .+++.++.........++--++.|+.. T Consensus 242 g~~~~----------~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~-~~~~~~~~~~~~~~~~i~~~l~~i~~~ 310 (386) T protein:vir:49 242 LEDFT----------PLEIKSNVAQLLSQADWTTGQFAKVYGIPESIVGGDG-DQQSSLEMIYNIYFKSVSRYLRPFVSE 310 (386) T ss_pred CceEE----------EccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCC-CccchHHHHHHHHHHHHHHHHHHHHHH Confidence 76422 2222233345677778888999998776553333222 334444433444445566666666666 Q ss_pred HHHHHHHHHHHhcCCcCCccccceEEec--CcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccccC Q lcl|NC_016071. 397 FNKNLIPQLLALNDIRLSDEDMPKLKPG--LIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMSTD 474 (516) Q Consensus 397 ln~~li~~lv~lN~~~~~~~~~P~~~~~--~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~~ 474 (516) ||+.|... +.|+ .....|...++..+.+|+..|++.+ +.+|+.++-..-.+.+... 
T Consensus 311 ~~~~l~~~----------------~~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~-----nE~r~~l~~~~~~~~~~~~- 368 (386) T protein:vir:49 311 MSKKLSCE----------------VDVDISPAVDPTGSNYISLINSMVKSGTLAQ-----NQGLYILQQAEILPKELPD- 368 (386) T ss_pred HHHHhcch----------------hcccchhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHHhhCCCCCCcCcc- Confidence 66654322 2232 2334566778889999999999875 4678776532211111110 Q ss_pred cccccCCCCCCcccccccccCCCCCcccccccccchhhh Q lcl|NC_016071. 475 ELLKLLGQDTSRSGDGMTAGSNGNGTGKISSTRDNSVSN 513 (516) Q Consensus 475 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~ 513 (516) .+.... . ..++.|+++.| T Consensus 369 ------~~~~~~--~-------------~~~gGd~~~~~ 386 (386) T protein:vir:49 369 ------GKNPNR--T-------------SLKGGEINEQD 386 (386) T ss_pred ------hhccCC--C-------------CCCCCCCCCCC Confidence 000000 0 00111222222 No 84 >protein:vir:104259 Length: 403 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006980;genbank:gi:46401881;genbank:GeneID:2777676 Probab=99.40 E-value=2.5e-11 Score=78.68 Aligned_cols=386 Identities=10% Similarity=0.022 Sum_probs=179.4 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhhChHHHHHHHHHHHHHhc Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQDHTVSTALDTKYVFVTK 80 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~~v~~ 80 (516) +.+....... +|...+. 
..+.......-....+-+...+-+.|.+|+..+...|.+ T Consensus 3 ~~~~~~~~~~---------------------~~~~~~~---~~~~~~~~~~~~~~~t~~~~~~~~~v~~cv~~Ia~~ia~ 58 (403) T protein:vir:10 3 FKSWITEKLN---------------------PGQRIIR---DMEPVSHRTNRKPFTTGQAYSKIEILNRTANMVIDSAAE 58 (403) T ss_pred chhhhhhccc---------------------hhhhhhh---cccccccccCCcccccHHHHHHHHHHHHHHHHHHHHHhh Confidence 1111100000 0000000 000000000000001123445678899999999999999 Q ss_pred CCceeeeCCCCCChhhHHHHHHHHHHHhh-ccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceeeccccc Q lcl|NC_016071. 81 AFNDFKVLYNRDSKASKDAAEFVEYALKN-LANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITIDKIAF 158 (516) Q Consensus 81 ~~w~i~~~~~~d~~~~~~~a~~v~~~l~~-~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l~~ 158 (516) ++|.+.-.............+-+...|.. -+...+..++...+. +.+.+|-+++++. +. .++ + T Consensus 59 ~p~~v~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gnayi~~~----~~------~l~-----~ 123 (403) T protein:vir:10 59 CSYTVGDKYNIVTYANGVKTKTLDTLLNVRPNPFMDISTFRRLVVTDLLFEGCAYIYWD----GT------SLY-----H 123 (403) T ss_pred CceeEeecccccccccccccchHHHHHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEe----Cc------eeE-----e Confidence 99987533221111111111223334442 334456777887755 5778898875431 11 111 1 Q ss_pred cCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecC----cCCc Q lcl|NC_016071. 159 RPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGG----TESN 234 (516) Q Consensus 159 r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~----~~g~ 234 (516) .|+. ++.+..+...+.... . ...++.++.+.++.++... ..+. 
T Consensus 124 l~~~------~~~v~~~~~~~~~~~--------~-------------------~~~~~~~~~~eiih~~~~~~~~~~~~~ 170 (403) T protein:vir:10 124 VPAA------LMQVEADANKFIKKF--------I-------------------FNNQINYRVDEIIFIKDNSYVCGTNSQ 170 (403) T ss_pred ecCc------ceEEEEcCCceEEEE--------E-------------------ecCceeecccceEEecccccccCCCCC Confidence 1211 122221111111000 0 0012233444444433221 2366 Q ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhccc-ce--E Q lcl|NC_016071. 235 PAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGE-QA--Y 311 (516) Q Consensus 235 p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~-~a--~ 311 (516) ++|.+.+..+....-.-....++-..+...-+.+--+++ .+..-+++.. +++++.......|. .+ . T Consensus 171 ~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~--------~~~~l~~e~~---~~~~~~~~~~~~g~~n~g~~ 239 (403) T protein:vir:10 171 ISGQSRVATVIDSLEKRSKMLNFKEKFLDNGTVIGLILE--------TDEILNKKLR---ERKQEELQLDYNPSTGQSSV 239 (403) T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEE--------eCCCCCHHHH---HHHHHHHHHHhCCcccCcce Confidence 889999998887776665565555555543222222222 2222333333 34444444333342 23 4 Q ss_pred EEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHHHHHHHHHHHHHH Q lcl|NC_016071. 312 FILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQSIHGHFVQRDID 391 (516) Q Consensus 312 ~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~ev~~~~~~aDa~ 391 (516) ++++.|++... ++...+.....|.+..++..++|++++--...-.+....++. 
.+........-+.-.++ T Consensus 240 ~vl~~g~~~~~--------~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~sn~--e~~~~~f~~~tl~P~~~ 309 (403) T protein:vir:10 240 LILDGGMKAKP--------YSQISSFKDLDFKEDIEGFNKSICLAFGVPQVLLDGGNNANI--RPNIELFYYMTIIPMLN 309 (403) T ss_pred eecCCCceeEE--------ecccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcCH--HHHHHHHHHHHHHHHHH Confidence 67888875332 221112223346777888899999987766544432221211 12223344555667778 Q ss_pred HHHHHHHHHHHHHHHHhcCCcCCccccceEEecCc----CchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCC Q lcl|NC_016071. 392 IIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLI----QEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEI 467 (516) Q Consensus 392 ~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~----~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~ 467 (516) .|++.||+.|. ++|.|+.. -..|.+..+++++++++.|++.+ +.+|+.+|+|+-. T Consensus 310 ~ie~~l~~~L~----------------~~~~~d~~~~~~l~~D~~~~~~~~~~~~~~G~lT~-----NE~R~~~gl~pi~ 368 (403) T protein:vir:10 310 KLTSSLTFFFG----------------YKITPNTKEVAALTPDKEAEAKHLTSLVNNGIITG-----NEARSELNLEPLD 368 (403) T ss_pred HHHHHHHHhcC----------------ceeeeccchhhhcccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCC Confidence 88888886541 13344332 23577888999999999999876 6899999999532 Q ss_pred Cccc--ccCcccccCCCCCCcccccccccCCCCCc Q lcl|NC_016071. 468 PEDM--STDELLKLLGQDTSRSGDGMTAGSNGNGT 500 (516) Q Consensus 468 ~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 500 (516) ++.. 
..-...-...+.....+++....++++|+ T Consensus 369 ~~~~d~~~~p~n~~~~~~~~~~~e~~~~~~~~~g~ 403 (403) T protein:vir:10 369 DEQMNKIRIPANVAGSATGVSGQEGGRPKGSTEGD 403 (403) T ss_pred cccccccccccccccccccCCCCcCCCCCCCcCCC Confidence 2211 11000000011111112222222222222 No 85 >protein:vir:95965 Length: 385 # NCBI annotation: ORF011 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239800;genbank:gi:66395461;genbank:GeneID:5132882 Probab=99.36 E-value=1.8e-11 Score=79.47 Aligned_cols=372 Identities=13% Similarity=0.037 Sum_probs=178.8 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHH-HhhChHHHHHHHHHHHHHh Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEA-MKQDHTVSTALDTKYVFVT 79 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~-m~~D~~v~s~l~~Rk~~v~ 79 (516) |+=...--++..+ ++.. .++. ..-.++.. -++.+.|.+|+..+-..|. T Consensus 1 Mg~f~~~f~~~~~-----~~~~------------------------~~~~--~~~~~~~~~a~~~~~v~~~i~~ia~~ia 49 (385) T protein:vir:95 1 MGLFDSVFKRHSE-----LSWM------------------------YDLE--FLQDKSKKAYLKQIALNTVVEMVARTIS 49 (385) T ss_pred CchhhhhhccCcc-----cccc------------------------cchh--hhhccchhhhhhhHHHHHHHHHHHHHHc Confidence 3321111111000 0000 0000 00011111 1357889999999999999 Q ss_pred cCCceeeeCCCCCChhhHHHHHHHHHHHh-hccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceeecccc Q lcl|NC_016071. 80 KAFNDFKVLYNRDSKASKDAAEFVEYALK-NLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITIDKIA 157 (516) Q Consensus 80 ~~~w~i~~~~~~d~~~~~~~a~~v~~~l~-~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l~ 157 (516) +++|++.-. + ...... +...|. +-+...++.++++.++ +.+.+|.+++.+.. .+ .+++.... 
T Consensus 50 ~~p~~~~~~-~--~~~~~~----l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~i~~~~--~~-------~~~~~~~~ 113 (385) T protein:vir:95 50 QSEFRVMKN-N--TKEKGT----LYYLLNVRPNRNQNAVDFWQKFIFKLIMDNEVLVVKND--EG-------HFFVADDF 113 (385) T ss_pred ccceeeeec-C--ccccch----HHHHHhcccCcCCCHHHHHHHHHHHHhhcCceEEEEec--CC-------Ceeecccc Confidence 999987532 1 112222 233343 2334456778887755 56678998864432 11 11111111 Q ss_pred ccCch-hcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCccc Q lcl|NC_016071. 158 FRPQS-SLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPA 236 (516) Q Consensus 158 ~r~q~-ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~ 236 (516) .++.. .+.+..+.... .........+|...+|.+++....+..+ T Consensus 114 ~~~~~~~~~~~~~~~~~-----------------------------------~~~~~~~~~~~~~eiih~~~~~~~~~~~ 158 (385) T protein:vir:95 114 EKEDELGLYSHRFTNVL-----------------------------------VNDFEFKRVFTMDDVIYLKYNNQKLDAF 158 (385) T ss_pred ccccccccccccceeee-----------------------------------ecccceeeeeccccEEEecCCCCCcccc Confidence 11110 00000000000 0001112346777767666766677789 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccc-cCCCCHHHHHHHHHHHHHHHHhhc----ccceE Q lcl|NC_016071. 237 GVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKA-AIDPKSPESEMVQGLMADAANAHA----GEQAY 311 (516) Q Consensus 237 G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~-~~~~~~~~~~~l~~l~~~~~~~~~----g~~a~ 311 (516) |.|++..+.-..-. .+.. .+++.++ ++.+.-. ....+++. .+.+++....... +.... T Consensus 159 G~s~~~~~~~~i~~------~~~~--~~~~~~~------~g~l~~~~~~~~~~e~---~~~~~~~~~~~~~g~~~~~~~i 221 (385) T protein:vir:95 159 SLGLFEDYGEIFGR------MIDL--QMLNNQI------RGILKVDATKFYNKEK---QKELQAYIDTLFDAFQNNTIAV 221 (385) T ss_pred cchHHHHHHHHHHH------HHHH--HHhcCCC------ceEEEeCCccCCCHHH---HHHHHHHHHHHhhhhhhcCCce Confidence 99999887653321 1111 1222221 2212111 11112222 2233333332222 22334 Q ss_pred EEeccCcccccccccceeeeecc-ccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHH-HHHHHHHHHHHH Q lcl|NC_016071. 
312 FILPSDMNAQGGEQYKMSLKGID-GAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSES-KQSIHGHFVQRD 389 (516) Q Consensus 312 ~iiP~g~~i~~~e~~~iel~~~~-g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~v-h~ev~~~~~~aD 389 (516) ++++.|++.... +..... .+-...+|.+..++...+|++++.-..-..+ |+++-.+. -......-+.-. T Consensus 222 ~~l~~g~~~~~l-----~~~~~~~~s~~d~~~~e~~~~~~~~Ia~~fgVpp~~l~----~~~sn~e~~~~~~~~~~l~P~ 292 (385) T protein:vir:95 222 VPLTEGLAYEEH-----SNRGAAQSAQQFSELNELKKTVLTDVARMIGVPPSLVL----GEMADLEKTIESYLQFCINPL 292 (385) T ss_pred EEcCCCceeEee-----cccccccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhc----CCCcCHHHHHHHHHHHHHHHH Confidence 557888754321 111111 1112335778888899999999887542221 34443333 345555567888 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCC-- Q lcl|NC_016071. 390 IDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEI-- 467 (516) Q Consensus 390 a~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~-- 467 (516) ++.|++.||+.|+++--..+ .+.+|.++..-..|.++.+++++++++.|++.+ +.+|+.+|+|+-. T Consensus 293 ~~~ie~~l~~~L~~~~~~~~-------~~~~fd~~~l~~~D~~~~~~~~~~~~~~g~lt~-----NE~R~~~g~~p~~~~ 360 (385) T protein:vir:95 293 LRKIEAELNSKFFYQDEYLN-------DDMHIKVVGIDKRDPLKLSEAIDKLVASGTFTR-----NQVRIMTGEEPADDP 360 (385) T ss_pred HHHHHHHHHhhcCChhhccc-------ceEEEechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCC Confidence 89999999988877532221 123444556667788999999999999999886 5799999998532 Q ss_pred CcccccCcccccCCCCCCcccccccccCCCCCccc Q lcl|NC_016071. 468 PEDMSTDELLKLLGQDTSRSGDGMTAGSNGNGTGK 502 (516) Q Consensus 468 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 502 (516) .+|+..-.. ...+.+. 
.+ + |+...+ T Consensus 361 ~gd~~~~~~------n~~~~~~-~k-g--ge~~~e 385 (385) T protein:vir:95 361 ELDKFIITK------NLQSADA-FK-G--GESNEE 385 (385) T ss_pred CCceeeecc------cceeccc-cc-C--CCCCCC Confidence 122221110 0011110 11 1 111111 No 86 >protein:vir:100882 Length: 383 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358762;genbank:gi:78000027;genbank:GeneID:3726153 Probab=99.35 E-value=7.7e-12 Score=81.50 Aligned_cols=370 Identities=11% Similarity=0.068 Sum_probs=188.8 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHH-hhcccccCCcccHHHHHHHhhChHHHHHHHHHHHHHh Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESE-VMKVEELRWPCFLATVEAMKQDHTVSTALDTKYVFVT 79 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~-~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~~v~ 79 (516) ++.+........ ...++- .+ .+..... ..... .+ .-+..++-+.|.+|+..+-..|. T Consensus 4 ~~~~~~~k~~~~------~~~~~~--~~-------~~~~~~~~~~~~~------~v-~~~~~l~~~~v~~~i~~ia~~ia 61 (383) T protein:vir:10 4 LTPKNFSKRNAK------NMVYPS--NP-------AFFTTTVGGMQLS------YV-SALSALQNTNVYSVINRIASDVS 61 (383) T ss_pred cccccccccccc------cccccc--ch-------hhhhhhccCcccc------cc-chhHhhcchHHHHHHHHHHHhhc Confidence 332211100000 000000 00 0000000 00000 00 01233456889999999999999 Q ss_pred cCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceeeccccc Q lcl|NC_016071. 80 KAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITIDKIAF 158 (516) Q Consensus 80 ~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l~~ 158 (516) ++++++.- .. ....|++-+...++.++++.++ +.+.+|-++++++-. +.+++. 
T Consensus 62 ~~~~~~~~-----~~--------~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~~~-------~~~~~p------ 115 (383) T protein:vir:10 62 SAHFKTEN-----TA--------TLNRLESPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQ-------NLEHIP------ 115 (383) T ss_pred cCceeecc-----cc--------hhhhhhCCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcC-------ceeEee------ Confidence 99987641 11 1123444445567788887766 466789999887521 112211 Q ss_pred cCchhcccccceeecCCCceee-eccccccccccccccccccccccccccccccCCCccccccccEEEEeecC-c-CCcc Q lcl|NC_016071. 159 RPQSSLSRSKPWVFDEDGRTLK-GIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGG-T-ESNP 235 (516) Q Consensus 159 r~q~ti~~~~~f~~~~dg~~l~-~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~-~-~g~p 235 (516) -++.+ +.+..++..+. .+. ....+....+|...++++++.. . .+.. T Consensus 116 ~~~~~------v~~~~~~~~~~~~~~-------------------------~~~~~~~~~~~~~evih~r~~~~~~~~~~ 164 (383) T protein:vir:10 116 NSDVQ------INYLPGNMGIVYTVL-------------------------ESNDRPKMVLRQDQMLHFRLMPDPQYRYL 164 (383) T ss_pred cCcce------EEEEEcCCceEEEEE-------------------------EcCCceEEEEcccceEEeccCCCCccccc Confidence 11111 22222221111 110 0112234456777766665432 2 2346 Q ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceE--EE Q lcl|NC_016071. 236 AGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAY--FI 313 (516) Q Consensus 236 ~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~--~i 313 (516) +|.|.+..|....-.-....++...+...-+.+=-++..+ .+..+ ++..+++++..+....|..++ ++ T Consensus 165 ~G~s~l~~~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~--------~~~~~--~e~~~~~~~~~~~~~~~~n~~~~~v 234 (383) T protein:vir:10 165 IGRSPLESLQNALNLDDKASKSNMSAMENQINPAGKLTIS--------NYLSD--GKDLESAREEFEKANTGDNSGRLMV 234 (383) T ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeC--------CCCCC--HHHHHHHHHHHHHHhCccccCCccc Confidence 8999999998877777777777777776544433333322 11111 122344555555555555544 66 Q ss_pred eccCcccccccccceeeeeccccCcchhH-HHHHHHHHHHHHHHHhcccccccCCc--cchhhHHHHHHHHHHHHHHHHH Q lcl|NC_016071. 
314 LPSDMNAQGGEQYKMSLKGIDGAGKQYST-QELVNSRKKAILDRFGAGFINLGNDG--QGSYNLSESKQSIHGHFVQRDI 390 (516) Q Consensus 314 iP~g~~i~~~e~~~iel~~~~g~g~~~~~-~~li~~~d~~Isk~iLGqtLts~~~~--~GS~Al~~vh~ev~~~~~~aDa 390 (516) ++.|++++. ++. +....++ .++.++..++|++++.-..--.+... ..+++-.+.+...+..-+.--+ T Consensus 235 l~~g~~~~~--------l~~--~~~d~~~l~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~~~sn~eq~~~~~~~~l~P~~ 304 (383) T protein:vir:10 235 LPDGFDYTQ--------LEM--KTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQHSNIDQIKATYLANLNSYV 304 (383) T ss_pred cCCCceEEe--------cCC--ChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCCccccHHHHHHHHHHHHHHHH Confidence 677764332 222 2222233 45667778999998877553332111 1223334444445555677788 Q ss_pred HHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcc Q lcl|NC_016071. 391 DIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPED 470 (516) Q Consensus 391 ~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~ 470 (516) +.|++.||+.|+ ++ .-+|.++.....|.+..++++.++++.|++.+ +.+|+.+|+|+-.+++ T Consensus 305 ~~ie~~l~~~l~----------~~---~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~-----nE~R~~lg~~p~~~~d 366 (383) T protein:vir:10 305 NPIVDELRLKMN----------AP---DLELDIKDMLDVDDSILINQVSNLAKSGVLGA-----EQAQFILTRSGFLPDN 366 (383) T ss_pred HHHHHHHHHhhC----------Cc---eEEeechhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCcccCCc Confidence 888888887542 11 12344455556888999999999999999876 5799999998654444 Q ss_pred cccCcccccCCCCCCcccccccccCCCCCccc Q lcl|NC_016071. 471 MSTDELLKLLGQDTSRSGDGMTAGSNGNGTGK 502 (516) Q Consensus 471 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 502 (516) .+.... 
+....+.|+ .+ T Consensus 367 ~~~~~~----~~~~~~gGd-----------~e 383 (383) T protein:vir:10 367 LPEFKP----LTNETKGGD-----------DK 383 (383) T ss_pred ccccCC----CcccCCCCC-----------CC Confidence 322110 000011111 11 No 87 >protein:vir:4854 Length: 386 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049394;genbank:gi:9632422;genbank:GeneID:1258515 Probab=99.34 E-value=8.4e-11 Score=75.82 Aligned_cols=381 Identities=11% Similarity=0.036 Sum_probs=189.8 Q ss_pred CC--ccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhhChHHHHHHHHHHHHH Q lcl|NC_016071. 1 MS--TRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQDHTVSTALDTKYVFV 78 (516) Q Consensus 1 ~~--~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~~v 78 (516) |. .+.+...+. |..+..+. +.. ........++.+..+ ..+..++-+.|.+|+..+-..| T Consensus 1 M~~f~~~~~~~~~----------~~~~~~~~--~~~------~~~~~~~~~~~~~~v-~~~~~~~~~~v~~~i~~ia~~i 61 (386) T protein:vir:48 1 MPIFNITNLATES----------PPISQGGF--FDI------TDPDFLSTLNGSEWV-SAESALRNSDLFSIINQLSNDL 61 (386) T ss_pred Ccccccccccccc----------cccccccc--ccc------ccchhcccccCCcee-chhhhhcchHHHHHHHHHHHhh Confidence 32 121111111 11100000 000 000000111122211 1233457899999999999999 Q ss_pred hcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceeecccc Q lcl|NC_016071. 79 TKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITIDKIA 157 (516) Q Consensus 79 ~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l~ 157 (516) .++++++.- .. + ...+.+-+...++.++++.++ +.+.+|-+++++++...+ ++ ..|. 
T Consensus 62 a~~p~~~~~-----~~-----~---~~l~~~pN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g---~~------~~L~ 119 (386) T protein:vir:48 62 ATVKLTASR-----KQ-----L---QGIIDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENG---RD------MKWE 119 (386) T ss_pred ccCceeecc-----ch-----h---HHHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCC---cE------EEEE Confidence 999987641 11 1 123344455567888888866 578899999999886432 22 2333 Q ss_pred ccCchhcccccceeecCCCceeee-ccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCccc Q lcl|NC_016071. 158 FRPQSSLSRSKPWVFDEDGRTLKG-IYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPA 236 (516) Q Consensus 158 ~r~q~ti~~~~~f~~~~dg~~l~~-~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~ 236 (516) +.++..++ ...+.+|+.+.. +... ....+....+|.+.++++++....+.++ T Consensus 120 ~l~~~~v~----v~~~~~~~~~~y~~~~~-----------------------~~~~~~~~~~~~~evih~~~~~~~~~~~ 172 (386) T protein:vir:48 120 YLRPSQVS----FNRLDNKDGIYYNITFD-----------------------DPRIPPKQHVPQGDVLHFKLLSVDGGLT 172 (386) T ss_pred EecCceeE----EEEcCCCceEEEEEEec-----------------------CccccceeEecCccEEEecCCCCCCcee Confidence 33333222 122333332210 0000 0011123356777777666666667789 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEecc Q lcl|NC_016071. 237 GVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILPS 316 (516) Q Consensus 237 G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~ 316 (516) |.|.+..+....-.-....++...+...-+.+--+++. +...+.++.+.+..... ....+....++++. 
T Consensus 173 G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~ii~~--------~~~~~~e~~~~~~~~~~---~~~~n~g~~~vl~~ 241 (386) T protein:vir:48 173 SVSPLMALSRELNIQKASDKLTLNSLKNALNANGILKI--------KGGGLLDFKTKLSRSRQ---AMKQMQGGPLVLDD 241 (386) T ss_pred eccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEe--------CCCCCHHHHHHHHHHHH---HhhcCCCCceecCC Confidence 99999999876666666667767776654443333322 22333334333332221 11222233466777 Q ss_pred CcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHH-HHHHHHHHHHHHHHHHHHH Q lcl|NC_016071. 317 DMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSE-SKQSIHGHFVQRDIDIIVE 395 (516) Q Consensus 317 g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~-vh~ev~~~~~~aDa~~i~~ 395 (516) |++++ ..+-+....+|.+..++..++|++++.-...-.+.. ++++-.+ .........+.--++.|++ T Consensus 242 g~~~~----------~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~--~~~~~~e~~~~~~~~~~l~P~~~~ie~ 309 (386) T protein:vir:48 242 LEEFT----------PLEIKSNVSQLLKQADWTTGQFAKVYGIPENVVGGQ--GDQQSSLEMSLDLYNKAVSRYLRPFLS 309 (386) T ss_pred CceEE----------EcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCC--CCcccHHHHHHHHHHHHHHHHHHHHHH Confidence 76422 222223334567777888899999877655444322 2232222 2234445556667888888 Q ss_pred HHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccccCc Q lcl|NC_016071. 396 AFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMSTDE 475 (516) Q Consensus 396 ~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~~~ 475 (516) .||+.|++.+ .++. ......|...++..+.+|+..|++.+ +.+|+.+|.+.-.+++... T Consensus 310 ~l~~~l~~~~-~~~~-------------~~~~~~d~~~~~~~~~~l~~~g~~t~-----nE~r~~lg~~~~~~~~~~~-- 368 (386) T protein:vir:48 310 ELSQKLSCDV-DADI-------------LPAVDPTGSNSVSRINSMVKSGTLAQ-----NQGLYILQQAEILPKELPE-- 368 (386) T ss_pred HHHHhhcchh-hcch-------------hhhhccChHHHHHHHHHHHhCCCcCH-----HHHHHHhhcCCCCCccchh-- Confidence 8888776542 2221 01112344566778889999998775 5689999876433322110 Q ss_pred ccccCCCCCCcccccccccCCCCCcccc Q lcl|NC_016071. 
476 LLKLLGQDTSRSGDGMTAGSNGNGTGKI 503 (516) Q Consensus 476 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 503 (516) ... .+ .++ .+.|.+ +++- T Consensus 369 ~~~--~~-~~~----~~gGd~---~~~~ 386 (386) T protein:vir:48 369 GEN--PN-KTT----LKGGEI---NGED 386 (386) T ss_pred hcC--CC-CCc----cCCCCC---CCCC Confidence 000 00 000 011111 1000 No 88 >protein:vir:4089 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510984;swissprot:trembl:q8w606;genbank:gi:17488506;uniprot:Q8W606;genbank:GeneID:1260314 Probab=99.33 E-value=1.1e-10 Score=75.27 Aligned_cols=379 Identities=11% Similarity=-0.008 Sum_probs=171.5 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcc--cc---cCCcccHHHHHH-HhhChHHHHHHHHH Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKV--EE---LRWPCFLATVEA-MKQDHTVSTALDTK 74 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~--~~---lr~~~~~~~y~~-m~~D~~v~s~l~~R 74 (516) |.= ++.+.++...... .. .-|.-.-.++.+ -++.+.|.+|+..+ T Consensus 1 Mg~------------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~v~~~v~~I 50 (395) T protein:vir:40 1 MGF------------------------------KSWVSGFFNEEQRTLNLTDTVWCSIPSEKLKELSIKKWAIDSCANKI 50 (395) T ss_pred Cch------------------------------HHHHHhhhcccccccccccchhhccccccchhhhhhhHHHHHHHHHH Confidence 111 1111222110000 00 001111112222 24578899999999 Q ss_pred HHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHh-hccCcCCHHHHHHHHH-HHHhhcceeeeEEEeeccccccccccee Q lcl|NC_016071. 75 YVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALK-NLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYIT 152 (516) Q Consensus 75 k~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~-~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~ 152 (516) -..|.+++|.+.-. +.+.... +...|. +-+...+..++++.+. +.+.+|.+.+.+... 
..+.+.+ T Consensus 51 a~~ia~~p~~~~~~---~~~~~~~----~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~~~~~---~~~~~~~--- 117 (395) T protein:vir:40 51 ANTLSCAEVLTYEK---GEEVRKK----NWYMFNVEANQNQNATEFWKKAIYKLVYDNEALIFMQDE---YIYVADS--- 117 (395) T ss_pred HHHHhhCceeeccC---Cccccch----HHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEecC---ceeecCC--- Confidence 99999999987421 1222222 223343 3344456677777644 577789988654321 1111111 Q ss_pred eccccccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcC Q lcl|NC_016071. 153 IDKIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTE 232 (516) Q Consensus 153 ~~~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~ 232 (516) +..... .+....+..+. ..+......+|...++++++.... T Consensus 118 ---~~~~~~-~~~~~~~~~v~-----------------------------------~~~~~~~~~~~~~evih~r~~~~~ 158 (395) T protein:vir:40 118 ---FTKNDK-SLYENTYTEVT-----------------------------------LKDLTLKKEFKESEVLHLTLNNES 158 (395) T ss_pred ---cccccc-ccccceeeeee-----------------------------------ecCceeeeeeccccEEEeecCCCC Confidence 100000 00000000000 000011224677777777777777 Q ss_pred CccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHH-HHHHHHHHHhhcccceE Q lcl|NC_016071. 233 SNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMV-QGLMADAANAHAGEQAY 311 (516) Q Consensus 233 g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l-~~l~~~~~~~~~g~~a~ 311 (516) +.+++.++...+.- +.... +. .+ .+.+.+-+.+. .+.....+.+..+.+ +.+.+..+....+.... 
T Consensus 159 ~~~~~~~l~~~~~~--~~~~~-~~---~~-~~~~~~~~~l~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 225 (395) T protein:vir:40 159 IKSIIDGFYLLYGD--LLTAA-VN---KY-KKLNSRKIIVK------LKAMFGQTPEAEEKLRLMLSERMKKFLAEGDSA 225 (395) T ss_pred ccccchhHHHHHHH--HHHHH-HH---HH-HhcCCCCceEE------EecccCCCHHHHHHHHHHHHHHHHHhhccCCce Confidence 78888777654432 11111 11 11 12222222222 122222333322222 23333333322233345 Q ss_pred EEeccCcccccccccceeeeeccccCcchhHHHH---HHHHHHHHHHHHhcccccccCCccchhhHHH-HHHHHHHHHHH Q lcl|NC_016071. 312 FILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQEL---VNSRKKAILDRFGAGFINLGNDGQGSYNLSE-SKQSIHGHFVQ 387 (516) Q Consensus 312 ~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~l---i~~~d~~Isk~iLGqtLts~~~~~GS~Al~~-vh~ev~~~~~~ 387 (516) ++++.|++++.. + -+.....+.++ -+.+-++|++++.-..--.+ |+++-.+ ........-+. T Consensus 226 ~vl~~g~~~~~l--------~--~~~~d~q~~e~~~~~~~~~~~Ia~~fgVPp~~l~----~~~sn~e~~~~~f~~~~L~ 291 (395) T protein:vir:40 226 LPVEDGMEIDEL--------A--GDSKIAESRDIKKMIDDVFEMVANSFNIPLGLAK----GDTVGLSEQVNSFLMFSIN 291 (395) T ss_pred eecCCCceEEec--------c--CChhhhhHHHHHHHHHHHHHHHHHHhCCCHHHhc----CCCcCHHHHHHHHHHHHHH Confidence 667888753321 1 12222234333 33334789998876543232 3343222 22344445677 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCC Q lcl|NC_016071. 388 RDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEI 467 (516) Q Consensus 388 aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~ 467 (516) -.++.|++.||+.|++.--... ..+-+|.++..-..|.++.++++.++++.|++.+ +.+|+.+|+|+-. 
T Consensus 292 P~~~~ie~~l~~kLl~~~~~~~------g~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~-----NE~R~~~g~~pi~ 360 (395) T protein:vir:40 292 PIAEMFTDEGNRKFYGRDSVLE------RTYMKLDTTRIKVQDIQEIASSMDVLFHIGVNTI-----DDNLRMIGREPVM 360 (395) T ss_pred HHHHHHHHHHHHhcCChhhhcC------CceEEEechhhhccCHHHHHHHHHHHHhCCCCCH-----HHHHHHhCCCCCC Confidence 7888888989887766422111 1223455556667889999999999999998876 5799999998643 Q ss_pred C--cccccCcc-cccCCCCCCcccccccccCCCCCcccccccccc Q lcl|NC_016071. 468 P--EDMSTDEL-LKLLGQDTSRSGDGMTAGSNGNGTGKISSTRDN 509 (516) Q Consensus 468 ~--~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 509 (516) . .|...-.. ..+.... .+..+.|. .++.. .|+ T Consensus 361 ~~~gD~~~~~~n~~~~~~~----~~~~kgge-~~~~~-----~~~ 395 (395) T protein:vir:40 361 SPETQERFVTKNYAPLGEN----EEDLKGGD-INENK-----GDS 395 (395) T ss_pred CCCCceeeecccccccccc----ccccCCCC-CCCCc-----CCC Confidence 2 22211111 0111100 11111110 00000 111 No 89 >protein:vir:94002 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764318;genbank:gi:115315632;genbank:GeneID:5176589 Probab=99.31 E-value=2.3e-11 Score=78.88 Aligned_cols=366 Identities=10% Similarity=-0.022 Sum_probs=174.1 Q ss_pred cchHHH-HHHH-HHHHhhcccccCCcccHHHHHHHhhChHHHHHHHHHHHHHhcCCceeeeCCCCCChhhH---HHHHHH Q lcl|NC_016071. 29 LGSGAL-SQLR-AESEVMKVEELRWPCFLATVEAMKQDHTVSTALDTKYVFVTKAFNDFKVLYNRDSKASK---DAAEFV 103 (516) Q Consensus 29 ~g~~~~-~~~~-~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~~v~~~~w~i~~~~~~d~~~~~---~~a~~v 103 (516) +|-.+- ..+. .........-..+.. 
-.++ ..-+.|.+|+..+-..|.++++.+.-....+...++ ..-.-+ T Consensus 1 Mg~f~~~~~~~~~~~~~~~~~~~~~~~-~~~~---~~~~~v~~~v~~IA~~iA~lp~~~~~~~~~~~~~~~~~~~~~~~l 76 (378) T protein:vir:94 1 MNLFGKVVSFSRGKLNNDTQRVTAWQN-EAVE---YTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLISMAGSDL 76 (378) T ss_pred CCccccchhcccccccCCcceeeeecc-chhH---HHHHHHHHHHHHHHhhhhhCceeeEEEcccCcccccccccccchH Confidence 111100 0000 000000000000110 0111 123569999999999999999875332222211111 011223 Q ss_pred HHHHhh-ccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceeeccccccCchhcccccceeecCCCceeee Q lcl|NC_016071. 104 EYALKN-LANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITIDKIAFRPQSSLSRSKPWVFDEDGRTLKG 181 (516) Q Consensus 104 ~~~l~~-~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l~~r~q~ti~~~~~f~~~~dg~~l~~ 181 (516) .+.|+. -+...+..+++..++ +.+.+|.+.+.++|+-.. |.++. +.++.+ T Consensus 77 ~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~------g~~~~----------------l~p~~~------ 128 (378) T protein:vir:94 77 DEVLNWSPKGERNSMDFWRKVIKKLLSAPYVDLYAVFDDNT------GELLD----------------LLFADD------ 128 (378) T ss_pred HHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeeCCC------ceEEE----------------EEecCC------ Confidence 445543 233455667776655 578889999888775432 22210 111111 Q ss_pred ccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCccccchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016071. 182 IYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPAGVSPLVGCYRAFREKILIENLETIG 261 (516) Q Consensus 182 ~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~ 261 (516) +..+|++.. +|...+-++ -.|.|++..+.-..- . + T Consensus 129 ---------------------------------~~~~~~~di-iH~~~~~~~-~~g~s~l~~~~~~i~------~----~ 163 (378) T protein:vir:94 129 ---------------------------------KKEYKPEEL-VRLTSPFYI-NEDTSILDNALASIQ------T----K 163 (378) T ss_pred ---------------------------------eeEeeeeee-EEecCcCCc-cchhHHHHHHHHHHH------H----H Confidence 122344443 344333332 246777776654221 0 1 Q ss_pred HhhccccceeeeecccccccccCC-CCHHHHHHHHHHHHHHHHhhcccceE--EEeccCcccccccccceeeeeccccCc Q lcl|NC_016071. 
262 ASKDLGGIIELKIPSQILNKAAID-PKSPESEMVQGLMADAANAHAGEQAY--FILPSDMNAQGGEQYKMSLKGIDGAGK 338 (516) Q Consensus 262 ~er~g~~~~v~~~pp~~~~k~~~~-~~~~~~~~l~~l~~~~~~~~~g~~a~--~iiP~g~~i~~~e~~~iel~~~~g~g~ 338 (516) + +.+. |.+++.. +.. ..+..++..+++.+.......|..++ ++++.|+++.- ++. +.. T Consensus 164 ~-~~~~-------~~gil~~-~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~--------l~~--~~~ 224 (378) T protein:vir:94 164 L-EQGK-------LRGLLKI-NAFLDIDNTQEYREKALTTIKNMQEGSSYNGLTPVDNKTEIVE--------LKK--DYS 224 (378) T ss_pred H-hccc-------ccceeee-CCcCCHHHHHHHHHHHHHHHHHhhcccccccceecCCCceEEE--------ccC--Chh Confidence 1 1121 1222211 111 11223334456666666655566665 56677764321 122 222 Q ss_pred chhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCcccc Q lcl|NC_016071. 339 QYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQSIHGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDM 418 (516) Q Consensus 339 ~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~ev~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~ 418 (516) ..++. -.++..++|++++.-..-... |+++- +-.......-+.--++.|+..||+.|+++--.-.+.+.....- T Consensus 225 ~~~~~-~~~~~~~~Ia~~fgVP~~~l~----~~~se-~~~~~f~~~tL~P~~~~ie~~l~~~Ll~~~er~~g~~~~~~~~ 298 (378) T protein:vir:94 225 VLNKD-EIDLIKSELLTGYFMNENILL----GTASQ-EQQIYFYNSTIIPLLIQLEKELTYKLISTNRRRVVKGNLYYER 298 (378) T ss_pred hhhHH-HHHHHHHHHHHHhCCCHHHhc----CChHH-HHHHHHHHHHHHHHHHHHHHHHHhhcCChhHhhhhhhcccccc Confidence 22343 347778899998877542231 33332 2334555666788888999999988876532211100000112 Q ss_pred ceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccccCcccccCCCCCCcccccccc-c-CC Q lcl|NC_016071. 419 PKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMSTDELLKLLGQDTSRSGDGMTA-G-SN 496 (516) Q Consensus 419 P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~-~~ 496 (516) +.|.++.....|+++.+++++++++.|++.+ +.+|+.+|+|+-..+|+..-...- .+-......+..+. . .. 
T Consensus 299 ~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~-----NE~R~~~gl~p~~gGD~~~~~~n~-~~~~~~~~~~~~~~~~~~~ 372 (378) T protein:vir:94 299 IIVDNQLFKFATLKELIDLYHENINGPIFTQ-----NQLLVKMGEQPIEGGDVYIANLNA-VAVKNLSDLQGSRKDVTST 372 (378) T ss_pred eeecchhhhhcCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCCeeeecccc-cccccchhhcCCcCCCCCC Confidence 4566667777899999999999999999876 579999999976555543221111 01000000000000 0 01 Q ss_pred CCCccc Q lcl|NC_016071. 497 GNGTGK 502 (516) Q Consensus 497 ~~~~~~ 502 (516) +++..+ T Consensus 373 ~e~~n~ 378 (378) T protein:vir:94 373 DETNNQ 378 (378) T ss_pred CCCCCC Confidence 111211 No 90 >protein:vir:98643 Length: 395 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039921;genbank:gi:126011096;genbank:GeneID:4818479 Probab=99.28 E-value=1.6e-10 Score=74.29 Aligned_cols=384 Identities=8% Similarity=-0.007 Sum_probs=175.1 Q ss_pred CC--ccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccH-HHHHHH-hhChHHHHHHHHHHH Q lcl|NC_016071. 1 MS--TRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFL-ATVEAM-KQDHTVSTALDTKYV 76 (516) Q Consensus 1 ~~--~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~-~~y~~m-~~D~~v~s~l~~Rk~ 76 (516) |. .++..... ...+. .-.+..+ .++.+. ++-+.|.+|+..+-. T Consensus 1 MGlf~~~~~~~~--------~~~~~-------------------------~~~~~~~~~~~~~~~~~~~~v~~~I~~ia~ 47 (395) T protein:vir:98 1 MGILDFFSFKKS--------GTLSD-------------------------DDSGSTTSEKLTNVVLKEDALYKCVNYLAR 47 (395) T ss_pred CcchhhhcCCCc--------ccccc-------------------------cccchhhhhhcchhhhhhHHHHHHHHHHHH Confidence 11 11110000 00000 0001111 111221 356789999999999 Q ss_pred HHhcCCceeeeCCCCCChhhHHHHHHHHHHHhh-ccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceeec Q lcl|NC_016071. 
77 FVTKAFNDFKVLYNRDSKASKDAAEFVEYALKN-LANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITID 154 (516) Q Consensus 77 ~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~-~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~~ 154 (516) .|.++++++.-. +.+...+..+ ...|.. -+...+..+++..+. +.+.+|.+.+.++.... .+.++.+. T Consensus 48 ~iA~lp~~~~~~-~~~~~~~~~~----~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnayi~~~~~~~--~~~~~~~~--- 117 (395) T protein:vir:98 48 IISKSTFRLKTP-EKLTENQKDW----LYWINTKANPNQSASQFWVEVIQKLLVDGETLIFVIPGKG--IYVADSFT--- 117 (395) T ss_pred HHhhCceeEEec-CCcccccchH----HHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeCCc--eecCCccc--- Confidence 999999987532 2222222222 333432 223345566666644 46678999887765321 01111100 Q ss_pred cccccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCc Q lcl|NC_016071. 155 KIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESN 234 (516) Q Consensus 155 ~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~ 234 (516) +.. .+.+ ..+.. . ......-...+|...++.+++....+. T Consensus 118 ----~~~-~~~~-~~~~~-------------------~---------------~~~~~~~~~~~~~~evih~k~~~~~~~ 157 (395) T protein:vir:98 118 ----QDK-KISG-SQFKV-------------------S---------------RVQGQTYEKTFTFDQVIYLKNDNSDLM 157 (395) T ss_pred ----ccc-cccC-cccce-------------------e---------------eecCceeeeEecCccEEEecCCCCCcc Confidence 000 0000 00000 0 000000122356666676677666777 Q ss_pred cccchhHHHHHHHHH--HHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEE Q lcl|NC_016071. 
235 PAGVSPLVGCYRAFR--EKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYF 312 (516) Q Consensus 235 p~G~gLlr~~~~~~~--fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~ 312 (516) +++.|+........- ........-..+...+..+.....++ .+ .......+...+...+.......+....+ T Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~v~ 231 (395) T protein:vir:98 158 SKVESLWEEYGELLGHVINNQKIANQIRFTMIPPKDKVRERAQ----EN--SDGGRQSKSDKDFFKRTVEKIRTESVVGI 231 (395) T ss_pred ccccchhhhHHHHHHHHHHHHHHHHHHHHhhcccccccccccc----cc--CCcHHHHHHHHHHHHHHHhhhhcCCccee Confidence 777777654332110 11111111111222222221111111 11 11111222222223333333233444455 Q ss_pred EeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHH-HHHHHHHHHHHHH Q lcl|NC_016071. 313 ILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESK-QSIHGHFVQRDID 391 (516) Q Consensus 313 iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh-~ev~~~~~~aDa~ 391 (516) +++.|++..-......+.. +-...++.++.++.-++|++++.-..--.+ |+++-.+-+ ......-+.-.++ T Consensus 232 ~l~~g~~~~~l~~~~~~~~----~~~~~q~~e~~~~~~~~Ia~~fgVP~~~l~----~~~sn~e~~~~~f~~~tl~P~~~ 303 (395) T protein:vir:98 232 PVTANTNYEEYGSKNTGAV----KSYVDDIKKLKDQYMAEFAEMLGIPISLLH----GDIADNQKNYELLLEGPIESLIT 303 (395) T ss_pred ecCCCceeEeccccccccc----ChhHHHHHHHHHHHHHHHHHHhCCCHHHhc----CCcccHHHHHHHHHHHHHHHHHH Confidence 5778875332211110000 111224666777888899998877553332 344433322 3445666888889 Q ss_pred HHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCC--c Q lcl|NC_016071. 392 IIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIP--E 469 (516) Q Consensus 392 ~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~--~ 469 (516) .|++.||+.|+++--. . ..-+|.++.....|+++.+++++++++.|++.+ +.+|+.+|+|+-.+ . 
T Consensus 304 ~ie~~l~~kll~~~~~-~-------~g~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~-----NE~R~~~g~~Pi~~~~g 370 (395) T protein:vir:98 304 NIVDGLEYAIFDKSET-L-------QGSFIKVTGLKNYDLFSISNQADKLISSGFVFI-----DEVREEIGLPELPDGLG 370 (395) T ss_pred HHHHHHHHhcCChhhh-c-------CcceeeehhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCCC Confidence 9999999888764211 1 112477777788899999999999999998876 68999999986433 2 Q ss_pred ccccCcccccCCCCCCcccccccccCCCC Q lcl|NC_016071. 470 DMSTDELLKLLGQDTSRSGDGMTAGSNGN 498 (516) Q Consensus 470 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 498 (516) |+..-...- .+-....|+ .....+| T Consensus 371 D~~~~~~n~--~~~~~~gge--~~~~~~~ 395 (395) T protein:vir:98 371 KVLYMTKNY--ESVLERGGE--VDEEVET 395 (395) T ss_pred ceeeecccc--eecccccCC--CCCCCCC Confidence 322211100 001111111 0011111 No 91 >protein:vir:8317 Length: 409 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817885;genbank:gi:29566318;genbank:GeneID:1259513 Probab=99.27 E-value=1.1e-10 Score=75.15 Aligned_cols=383 Identities=10% Similarity=0.006 Sum_probs=189.6 Q ss_pred CCcccc-CcccccchhhhcccCCCCc-cccc-----c-hHHHHHHHHHHHhhcccccCCcccHHHHHHHhhChHHHHHHH Q lcl|NC_016071. 1 MSTRFA-QPSEVVKAGNENLAVSRLR-TGEL-----G-SGALSQLRAESEVMKVEELRWPCFLATVEAMKQDHTVSTALD 72 (516) Q Consensus 1 ~~~r~~-~~~~~~~~~~~~p~~~~~~-~~e~-----g-~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~ 72 (516) --++.+ ..+.| +..-..|.+-.++ ..+. + ......+.++...... +....+. =+.+++-+.|.+|+. 
T Consensus 12 ~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~---~~~~~~t-~~~~~~~~~v~acV~ 86 (409) T protein:vir:83 12 SIPDLPNDNGPV-DYNPGDPDMVEFRGPEEEPEARALPWIRPTAWSGYPESWAT---PSWGSAQ-DKLRTLIDVAWACID 86 (409) T ss_pred cCCCcccccccc-cccCCCCceeeccCCCcchhhhhcccccccccccccccccc---cCccccc-hhhHhhhHHHHHHHH Confidence 000100 00000 0000000000000 0000 0 0000111111110000 0111111 144556789999999 Q ss_pred HHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhc-cCcCCHHHHHHHHHHHHhhcceeeeEEEeecccccccccce Q lcl|NC_016071. 73 TKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNL-ANQQTLRDIARSAATFNEYGFSIFEKVYRTESAPSKYAGYI 151 (516) Q Consensus 73 ~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~-~~~~~~~~~l~~~lda~~~G~S~~Eivw~~~~~~~~~~g~~ 151 (516) .+-..|.++++.+.-. + ... .. +...++.. +...++.+++..++..+..|-+..+++-+.. +|. T Consensus 87 ~Ia~~iA~lpl~~~~~-~--~~~-~~----~~~ll~~~PN~~~t~~~f~~~l~~~lllGnay~~~i~r~~------~G~- 151 (409) T protein:vir:83 87 LNASVLSSMPIYRMRN-G--RII-DS----VAWMSNPDPEVYTSWQEFAKQLFWDFQLGEAFVLPMAHGS------DGY- 151 (409) T ss_pred HHHHhhccCceEEeeC-C--ccc-cc----hhhhcccCCCCCCCHHHHHHHHHHHHhhCCcEEEEEEECC------CCc- Confidence 9999999999876522 1 111 11 12223322 2235677788777766666888776543221 222 Q ss_pred eeccccccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEE-eecC Q lcl|NC_016071. 152 TIDKIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVM-SLGG 230 (516) Q Consensus 152 ~~~~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~-~~~~ 230 (516) +..|.+.++..+. +..+++|+....+. +...+ +. |+| ++.. T Consensus 152 -~~~L~pl~p~~v~----v~~~~~g~~~y~~~-------------------------------~~~~~-~e-iiHir~~~ 193 (409) T protein:vir:83 152 -PIRFRVVPPWLVN----VELKKGARREYRIG-------------------------------GLNVT-DE-ILHIRYQG 193 (409) T ss_pred -EEEEEEECCcceE----EEEcCCceEEEEEc-------------------------------cccCc-cc-eEEeCCCC Confidence 2234444554332 34455554321110 01112 22 455 4445 Q ss_pred cCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccc- Q lcl|NC_016071. 
231 TESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQ- 309 (516) Q Consensus 231 ~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~- 309 (516) ..+..+|.|.+..+....-......++-..|...-+.|- +++. ++..-+.++. +++++.......|.. T Consensus 194 ~~~~~~G~spi~~~~~~i~~~~a~~~~~~~~f~nga~p~-------gil~-~~~~ls~e~~---~~~~~~~~~~~~~nag 262 (409) T protein:vir:83 194 NTADAHGHGPLESAAPRQVVIGLLQKYVQNLAETGGVPL-------YWLG-VERRLSETEA---VDLMDRWIESRSKYAG 262 (409) T ss_pred CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc-------eEee-cCCCCCHHHH---HHHHHHHHHhhCCccC Confidence 567789999999998887777777666666665433332 2222 2222333333 333433333333322 Q ss_pred eEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCcc---chhhHHHHHH-HHHHHH Q lcl|NC_016071. 310 AYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQ---GSYNLSESKQ-SIHGHF 385 (516) Q Consensus 310 a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~---GS~Al~~vh~-ev~~~~ 385 (516) ..+++..|++.. +.++. +....+|.+..++..++|++++.-...-.+..+. .+|+-.+-+. .....- T Consensus 263 ~~~il~~g~~~~-------~~~~~--s~~d~q~le~r~~~~~eIa~~fgVPp~llg~~~~~~~~tysn~eq~~~~f~~~t 333 (409) T protein:vir:83 263 HPALVTGGATLN-------QAKSM--SAQDLSLMELTQFNEARIAILLGVPPFLVGLPGATGSLTYSNIEQLFSFHDRSS 333 (409) T ss_pred ccceecCCcccc-------cccCC--CHHHHHHHHHHHhhHHHHHHHhCCCHHHccCCCCccccccccHHHHHHHHHHHH Confidence 125666666421 11111 1222346666678889999999886644442221 1244333333 333446 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCC Q lcl|NC_016071. 386 VQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDE 465 (516) Q Consensus 386 ~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~ 465 (516) +.-.++.|++.||+.|++. 
..+-+|.++.....|+++.+++++++++.|++.+ +.+|+..|+|+ T Consensus 334 L~P~~~~ie~~l~~~Ll~~-----------~~~~~f~~~~llr~d~~~r~~~~~~~~~~G~lT~-----NE~R~~~glpp 397 (409) T protein:vir:83 334 LRPKATAVMAALDRWALPS-----------PQHLELNRDDYTRPSLVERATAYKIMIEAGVMEP-----NEARAMERLHS 397 (409) T ss_pred HHHHHHHHHHHHHHhhCCC-----------CcEEEeehhhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCC Confidence 7778888899998866531 1233444455556888999999999999999876 57999999996 Q ss_pred CCCcccccCcccccCCCCC Q lcl|NC_016071. 466 EIPEDMSTDELLKLLGQDT 484 (516) Q Consensus 466 ~~~~~~~~~~~~~~~~~~~ 484 (516) ....|+.... +. T Consensus 398 ~~ggd~l~~~-------gv 409 (409) T protein:vir:83 398 EAAAVRLSGG-------GV 409 (409) T ss_pred CCCCcccCCC-------CC Confidence 5444433110 01 No 92 >protein:vir:9641 Length: 395 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795403;genbank:gi:28876176;genbank:GeneID:1257709 Probab=99.27 E-value=1.3e-10 Score=74.87 Aligned_cols=373 Identities=10% Similarity=0.031 Sum_probs=170.1 Q ss_pred CC--ccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccC---CcccH-HHHHHH-hhChHHHHHHHH Q lcl|NC_016071. 1 MS--TRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELR---WPCFL-ATVEAM-KQDHTVSTALDT 73 (516) Q Consensus 1 ~~--~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr---~~~~~-~~y~~m-~~D~~v~s~l~~ 73 (516) |+ .++... ...... +...+ .++.++ ++-+.|.+|+.. T Consensus 1 Mgl~d~~~~~------------------------------------~~~~~~~~~~~~~~~~~~~~~~l~~~~v~~~i~~ 44 (395) T protein:vir:96 1 MGILDFFSFK------------------------------------KSGTLSDDDSGSTTSEKLTNVVLKEDALYKCVNY 44 (395) T ss_pred CcchhhhcCC------------------------------------CCccccccccccchhhhcchhhhhhHHHHHHHHH Confidence 11 111110 001100 11111 112222 356789999999 Q ss_pred HHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhh-ccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccce Q lcl|NC_016071. 
74 KYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKN-LANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYI 151 (516) Q Consensus 74 Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~-~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~ 151 (516) +-..|.+++|++.-. +.+...+.. +...|+. -+...+..++++.++ +.+.+|.+.+.+++.. .. T Consensus 45 Ia~~ia~lp~~v~~~-~~~~~~~~~----~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~~~~~~---------~~ 110 (395) T protein:vir:96 45 LARIISKSTFRIKAP-EKLTENQKD----WLYWINTKANPNQSASQFWVEVVQKLLVDGETLIFVIPGK---------GI 110 (395) T ss_pred HHHhhccceeEEEeC-Cccccccch----HHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEcCC---------ce Confidence 999999999988632 322222222 3334432 223345666676644 4666899887665432 11 Q ss_pred eeccccccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCc Q lcl|NC_016071. 152 TIDKIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGT 231 (516) Q Consensus 152 ~~~~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~ 231 (516) ++....++... + + +.....+ ......-...+|...++.+++... T Consensus 111 ~~~~~~~~~~~-~-------~---~~~~~~v-------------------------~~~~~~~~~~~~~~dvih~k~~~~ 154 (395) T protein:vir:96 111 YVADAFTQDKK-L-------S---GNKFKVS-------------------------RVQGQTYEKIFTFDQVIYLKNDNS 154 (395) T ss_pred ecCCccccccc-c-------c---cceeeee-------------------------eeccceeeeEeccCceEEecccCC Confidence 11111111000 0 0 0000000 000000122356667666676666 Q ss_pred CCccccchhHHHHHH------HHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhh Q lcl|NC_016071. 232 ESNPAGVSPLVGCYR------AFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAH 305 (516) Q Consensus 232 ~g~p~G~gLlr~~~~------~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~ 305 (516) .+.+++.|+...... ....+.+..++...+.... ..+...+ +.. ....++...+...+...... 
T Consensus 155 ~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~-------~~~~~~~-~~~--~~~~~~~~~~~~~~~~~~~~ 224 (395) T protein:vir:96 155 DLMLKVESLWEEYGELLGHVINNQKIANQIRFTMTPPKDK-------VRERAQE-NSD--GGRQPKSDKDFFKRTIEKIR 224 (395) T ss_pred ccccccccccchHHHHHHHHHHHHHHHHHHHHHhhhcccc-------cccceee-ccC--chhhHHHHHHHHHHHHHHhh Confidence 666666665443221 1111122222222222111 1111111 111 11112222222333333333 Q ss_pred cccceEEEeccCcccccccccceeeeeccccCcc----hhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHH-HHHH Q lcl|NC_016071. 306 AGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQ----YSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSE-SKQS 380 (516) Q Consensus 306 ~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~----~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~-vh~e 380 (516) .|..+.++++.|++.+.. +.+....+ -.+.++..++-++|++++.-..--.+ |+++-.+ .... T Consensus 225 ~~~~~v~~l~~g~~~~~l--------~~~~~d~q~~e~~~~~~~~~~~~~eIa~~fgVPp~~l~----~~~sn~e~~~~~ 292 (395) T protein:vir:96 225 TESVVGIPVTANTNYEEY--------GSKNTGSVKSYVDDIKKLKDQYMAEFAEMLGIPISLLH----GDIADNQKNYEL 292 (395) T ss_pred cCCcceEEccCCceeEec--------ccChhhhhhhhHHHHHHHHHHHHHHHHHHhCCCHHHhc----CCCccHHHHHHH Confidence 455555667888753321 11111111 12334445566889999887653332 3343233 2334 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHH Q lcl|NC_016071. 381 IHGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEV 460 (516) Q Consensus 381 v~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~ 460 (516) ....-+.-.++.|++.||+.|++.--.. .+ -+|.++.....|+++.+++++++++.|++.+ +.+|+. T Consensus 293 f~~~~L~P~~~~ie~~l~~~Ll~~~e~~-----~~---~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~-----NE~R~~ 359 (395) T protein:vir:96 293 LLEGPIESLITNIVDGLEYAIFDKSETL-----EG---SFIKVTGLKNYDLFSISSQADKLISSGFVFI-----DEVREE 359 (395) T ss_pred HHHHHHHHHHHHHHHHHHhhcCChhhhc-----Cc---eeEeecchhccCHHHHHHHHHHHHhCCCcCH-----HHHHHH Confidence 4555677788888888988776542111 11 1467777788899999999999999998876 579999 Q ss_pred cCCCCCCC--cccccCcccccCCCCCCcccccccccCCCC Q lcl|NC_016071. 
461 GGFDEEIP--EDMSTDELLKLLGQDTSRSGDGMTAGSNGN 498 (516) Q Consensus 461 ~Glp~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 498 (516) +|+|+-.+ +|+..-... ..+.....|+.. ...++ T Consensus 360 ~gl~pi~~~~gD~~~~~~N--~~~~~~~gge~~--~~~~~ 395 (395) T protein:vir:96 360 IGLPELPDGLGKVLYMTKN--YESVLERGGEVD--EEVET 395 (395) T ss_pred hCCCCCCCCCCceeeeccc--ceechhccCCCC--CCCCC Confidence 99996433 232221100 000001111100 00011 No 93 >protein:vir:4995 Length: 384 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049969;genbank:gi:9632941;genbank:GeneID:1262104 Probab=99.25 E-value=1.2e-10 Score=74.92 Aligned_cols=378 Identities=11% Similarity=-0.018 Sum_probs=186.7 Q ss_pred CCc---cccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhhChHHHHHHHHHHHH Q lcl|NC_016071. 1 MST---RFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQDHTVSTALDTKYVF 77 (516) Q Consensus 1 ~~~---r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~~ 77 (516) |.= +.+.... | +.....+.. ............+..+ .-+..++-+.|.+|+..+-.. T Consensus 1 Mglf~~~~~~~~~--------~---~~~~~~~~~--------~~~~~~~~~~~~~~~v-~~~~al~~~~V~~~i~~Ia~~ 60 (384) T protein:vir:49 1 MPIFNITNLATES--------P---PSNQDSFFD--------ITDPEFLDALNGSEWV-SAETALKNSDLFSIISQLSND 60 (384) T ss_pred CccccccccCccc--------c---cccchhhcc--------ccchhhcccccCCcee-chhhhhccHHHHHHHHHHHHH Confidence 432 2211111 1 100000000 0000000001111111 112345678899999999999 Q ss_pred HhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceeeccc Q lcl|NC_016071. 78 VTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITIDKI 156 (516) Q Consensus 78 v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l 156 (516) |.++++.+.- .. . ...+.+-+...++.+++..++ +.+.+|-+++++++...+ . 
+..| T Consensus 61 ia~l~~~~~~-----~~-----~---~~l~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~g-------~--~~~L 118 (384) T protein:vir:49 61 LATAKITTSR-----KQ-----L---QGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENG-------R--DMKW 118 (384) T ss_pred HhhCceeeec-----ch-----h---hhhhhccCCCCCHHHHHHHHHHHhhhcCCeEEEEEECCCC-------c--EEEE Confidence 9999987641 10 0 112333444567788888776 567799999999986532 1 2233 Q ss_pred cccCchhcccccceeecCCCceee-eccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCcc Q lcl|NC_016071. 157 AFRPQSSLSRSKPWVFDEDGRTLK-GIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNP 235 (516) Q Consensus 157 ~~r~q~ti~~~~~f~~~~dg~~l~-~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p 235 (516) .+.++.+++ ...+.|+..+. .+... -...+....+|...+|++++....+.. T Consensus 119 ~~l~~~~v~----v~~~~~~~~~~y~~~~~-----------------------~~~~~~~~~~~~~eVih~~~~~~~~~~ 171 (384) T protein:vir:49 119 EYLRPSQVS----FNRLDNQNGLYYNITFD-----------------------DPRIPPKQHVPQGDILHFRLLSVDGGL 171 (384) T ss_pred EEEcCceeE----EEEcCCCceEEEEEEec-----------------------CccccceeEecCccEEEecCCCCCCce Confidence 444443332 12233332211 10000 001122345777777766665666778 Q ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEec Q lcl|NC_016071. 236 AGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILP 315 (516) Q Consensus 236 ~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP 315 (516) +|.|.+..++...-.-....++...+...-+.+--+++.+ .....++.. +...+..... 
.+....++++ T Consensus 172 ~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~--------~~~~~~~~~--~~~~~~~~~~-~n~~~~~vl~ 240 (384) T protein:vir:49 172 TSVSPLMALGRELNIQKASDKLTLNALKNALNANGILKIK--------GGGLLDFKT--KQSRSRQAMK-QMQGGPLVLD 240 (384) T ss_pred eeccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeC--------CCCChHHHH--HHHHHHHhcc-cCCccceecC Confidence 9999999998877766666677777766544433333332 122222221 1222222211 1223355667 Q ss_pred cCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016071. 316 SDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQSIHGHFVQRDIDIIVE 395 (516) Q Consensus 316 ~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~ev~~~~~~aDa~~i~~ 395 (516) .|++.+ ..+-+....++.+..++..++|++++.-..--.+..+.+ .+..+.-++.....++.-++-|.+ T Consensus 241 ~g~~~~----------~l~~~~~d~q~~e~~~~~~~~Ia~~fgVp~~~lg~~~~~-~~~~~~~~~~~~~~i~~~l~pi~~ 309 (384) T protein:vir:49 241 DLEDFT----------PLEIKSNVAQLLSQADWTTGQFAKVYGIPESVVGGEGDK-QSSLEMIYNIYFKAVSRFLRPFVS 309 (384) T ss_pred CCceEE----------EccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCc-cccHHHHHHHHHHHHHHHHHHHHH Confidence 776422 222233444567777888899999887655333222111 222232344455555556666666 Q ss_pred HHHHHHHHHHHH-hcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccccC Q lcl|NC_016071. 396 AFNKNLIPQLLA-LNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMSTD 474 (516) Q Consensus 396 ~ln~~li~~lv~-lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~~ 474 (516) .|++.|-+.|.. +.....++..+.++.++..-..|+....++.+.|...|+.+ +.+|+..|+|+-+.+|+.-. T Consensus 310 ~i~~~l~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~t~~e~~~~l~~~g~~~------ne~r~~~~~~p~~gGd~~~~ 383 (384) T protein:vir:49 310 ELSKKLSCEVDADILPAVDPTGSNYIGLINSMVKTGTLAQNQGLYVLQQAEILP------KDLPEGETDSTLKGGETNEQ 383 (384) T ss_pred HHHHHhchhhhhhhhhhhhccchHHHHHHHHHhhcCcccHHHHHHHHhhCCCCC------hhHHHHcCCCCCCCCCCCCC Confidence 666655444311 11001111122233333333455667778888888888753 24788888876443332211 Q ss_pred c Q lcl|NC_016071. 
475 E 475 (516) Q Consensus 475 ~ 475 (516) = T Consensus 384 ~ 384 (384) T protein:vir:49 384 Y 384 (384) T ss_pred C Confidence 1 No 94 >protein:vir:78310 Length: 376 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468642;genbank:gi:157325220;genbank:GeneID:5601655 Probab=99.24 E-value=2.1e-10 Score=73.69 Aligned_cols=365 Identities=10% Similarity=0.010 Sum_probs=171.2 Q ss_pred CCc--cccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhhChHHHHHHHHHHHHH Q lcl|NC_016071. 1 MST--RFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQDHTVSTALDTKYVFV 78 (516) Q Consensus 1 ~~~--r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~~v 78 (516) |.= |.-. +.. . + +. ..+......+ ..+.-++-+.|.+|+..+-..+ T Consensus 1 Mg~f~~l~~--~~~----~-~--------~~----------------~~~~~~~~~~-~~~~~l~~~~v~~~i~~Ia~~i 48 (376) T protein:vir:78 1 MGFFSELFK--RNK----E-I--------EW----------------MWDLDFLEDK-TTKVYLKKMALNTCVKHIARTI 48 (376) T ss_pred Cchhhhhhc--cCC----c-c--------cc----------------ccchhhcccc-chhhhhhhHHHHHHHHHHHHhh Confidence 321 1100 000 0 0 00 0000000000 0122235678999999999999 Q ss_pred hcCCceeeeCCCCCChhhHHHHHHHHHHHh-hccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceeeccc Q lcl|NC_016071. 79 TKAFNDFKVLYNRDSKASKDAAEFVEYALK-NLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITIDKI 156 (516) Q Consensus 79 ~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~-~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l 156 (516) .++++.+.-. +.+.+..++ ..|. +-+...++.++++.++ +.+.+|.+...+++... 
..+..+ T Consensus 49 a~~p~~~~~~---~~~~~~~l~----~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~~~r~~~---------~~~~~~ 112 (376) T protein:vir:78 49 AKSDFRLKNG---ETSVRDKLY----YKLNIRPNTDMSSSSFWEKVIYKLIYDNECLIVLSDTDD---------FLIADS 112 (376) T ss_pred cccceeeccc---cccccchHH----HHHhhccccCCCHHHHHHHHHHHHhHcCcEEEEEEeCCC---------eeeccc Confidence 9999976421 222222222 2333 2234456777777755 46668999876654321 122222 Q ss_pred cccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCccc Q lcl|NC_016071. 157 AFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPA 236 (516) Q Consensus 157 ~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~ 236 (516) .++.+..+....+..+. .........+|.+.++.+++....+.++ T Consensus 113 ~~~~~~~~~~~~~~~~~-----------------------------------~~~~~~~~~~~~~evih~~~~~~~~~~~ 157 (376) T protein:vir:78 113 YVRKEFAFFPDVFEGVT-----------------------------------VKDYRYNRNFSMDDVIFLEYGNERLSAF 157 (376) T ss_pred eeecccceeeeeeeeee-----------------------------------eecceeeeeeccccEEEeccCCCCchhh Confidence 33332222110000000 0000011235666777777777777776 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcc--cce--EE Q lcl|NC_016071. 237 GVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAG--EQA--YF 312 (516) Q Consensus 237 G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g--~~a--~~ 312 (516) +.++...+.- +.+. .+..+ +++.+.. +...-+.+...+++.. +++++..+....| ..+ .+ T Consensus 158 ~~~~~~~~~~--~~~~----~~~~~--~~~~~~~-----~~~~~~~~~~~~~e~~---~~~~~~~~~~~~g~~~~~~~v~ 221 (376) T protein:vir:78 158 TDGMFEDYGE--LFGK----MIRAQ--MRNFQIR-----GAVNFKMAGVADKDKQ---TKLQEYIDKVYASFNNNEIAIV 221 (376) T ss_pred hhHHHHHHHH--HHHH----HHHHH--HhcCCCc-----eeEEEccCCCCCHHHH---HHHHHHHHHHhccccccCcceE Confidence 6666544321 1111 11112 1222221 1111112222223332 3344444433333 223 34 Q ss_pred EeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHH-HHHHHHHHHHHHHHH Q lcl|NC_016071. 
313 ILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSE-SKQSIHGHFVQRDID 391 (516) Q Consensus 313 iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~-vh~ev~~~~~~aDa~ 391 (516) +++.|++...... .....+-...++.+..++...+|++++.-..--.+ |+++-.+ .-......-+.--++ T Consensus 222 ~l~~g~~~~~l~~-----~~~~~~~~~~q~~e~~~~~~~~Ia~~fgVPp~~l~----~~~s~~e~~~~~f~~~~l~P~~~ 292 (376) T protein:vir:78 222 PQLEGFNYEEFGT-----TSVNNSQSFDEVKKLRKEMIDYVASILGIPSSLLH----GDMADLSNNMKAYMEYCIDPLTK 292 (376) T ss_pred EcCCCceEEeecc-----CccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhC----CCCCCHHHHHHHHHHHHHHHHHH Confidence 4688875332211 10101111225677778888999999887553332 2333222 223445555777888 Q ss_pred HHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCc-- Q lcl|NC_016071. 392 IIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPE-- 469 (516) Q Consensus 392 ~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~-- 469 (516) .|++.||+.|+++- ...-++.++..-..|+++.+++++++++.|++.+ +.+|+.+|+|+-.++ T Consensus 293 ~ie~~l~~kll~~~----------~~~~~~~~~~ll~~d~~~~~~~~~~~~~~G~~t~-----NE~R~~lg~~p~~~g~~ 357 (376) T protein:vir:78 293 KLEDELNAKLFTFS----------EFLAGEHIKIIHKKDIIENAEAVDKLVASGSFNR-----NEVRELLGAERVDNPEL 357 (376) T ss_pred HHHHHHHhhhCCcc----------cceecccchhhcccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCCC Confidence 88999988775431 1122233444456788999999999999998876 579999999964333 Q ss_pred ccccCcccccCCCCCCccccccccc Q lcl|NC_016071. 470 DMSTDELLKLLGQDTSRSGDGMTAG 494 (516) Q Consensus 470 ~~~~~~~~~~~~~~~~~~~~~~~~~ 494 (516) |+.... 
..-.+.+++...| T Consensus 358 d~~~~~------~n~~~~~~~~e~g 376 (376) T protein:vir:78 358 DKYLIT------KNYQSADEGGEDG 376 (376) T ss_pred ceeeec------cCceehhccccCC Confidence 222111 1111112211111 No 95 >protein:vir:93867 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764264;genbank:gi:115315577;genbank:GeneID:5141561 Probab=99.24 E-value=9.5e-11 Score=75.53 Aligned_cols=364 Identities=10% Similarity=-0.009 Sum_probs=172.7 Q ss_pred cchHHHHHHHHHHHhhcccc----cCCcccHHHHHHHhhChHHHHHHHHHHHHHhcCCceeeeCCCCCChhhH---HHHH Q lcl|NC_016071. 29 LGSGALSQLRAESEVMKVEE----LRWPCFLATVEAMKQDHTVSTALDTKYVFVTKAFNDFKVLYNRDSKASK---DAAE 101 (516) Q Consensus 29 ~g~~~~~~~~~~~~~~~~~~----lr~~~~~~~y~~m~~D~~v~s~l~~Rk~~v~~~~w~i~~~~~~d~~~~~---~~a~ 101 (516) +|=.+ .+..+.....+-+ +-|.. -.++ +.-+.|.+|+..+-..|.+++|.+.-....+...+. ..-. T Consensus 1 Mg~f~--~~~~f~~~~~~~~~~~~~~~~~-~~~~---~~~~~v~~~i~~Ia~~iA~lp~~~~~~~~~~~~~~~~~~~~~~ 74 (378) T protein:vir:93 1 MNLFG--KVVSFSRGKLNNDTQRVTAWQN-EAVE---YTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLISMAGS 74 (378) T ss_pred Cccch--hhhhhhccccCCCcceeeeccc-chhH---HHHHHHHHHHHHHHhhhhhCceeeEEEcccccccccccccccc Confidence 11110 0000100000000 01111 1111 133579999999999999999976433222111111 0112 Q ss_pred HHHHHHhh-ccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceeeccccccCchhcccccceeecCCCcee Q lcl|NC_016071. 102 FVEYALKN-LANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITIDKIAFRPQSSLSRSKPWVFDEDGRTL 179 (516) Q Consensus 102 ~v~~~l~~-~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l~~r~q~ti~~~~~f~~~~dg~~l 179 (516) -+...|+. -+...+..+++..++ +.+.+|-+.+.+++.-.. |.++. ..++. 
T Consensus 75 ~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~i~~~~~~~~------g~~~~----------------l~~~~----- 127 (378) T protein:vir:93 75 DLDEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDDNT------GELLD----------------LLFAD----- 127 (378) T ss_pred hHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeecCC------ceEEE----------------EEecC----- Confidence 23444542 233455667776644 577789998877664221 22110 00111 Q ss_pred eeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCccccchhHHHHHHHHHHHHHHHHHHH Q lcl|NC_016071. 180 KGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPAGVSPLVGCYRAFREKILIENLET 259 (516) Q Consensus 180 ~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~ 259 (516) .++.+|.+..+ |...+-++ -.|.|++..+.-..- T Consensus 128 ----------------------------------~~~~~~~~dii-h~r~~~~~-~~~~s~l~~~~~~i~---------- 161 (378) T protein:vir:93 128 ----------------------------------DKKEYKTEELV-RLTSPFYI-NEDTSILDNALASIQ---------- 161 (378) T ss_pred ----------------------------------CeeEeccceeE-EecCcccc-chhhHHHHHHHHHHH---------- Confidence 12234555544 43333222 236677776643221 Q ss_pred HHHhhccccceeeeecccccccccCC-CCHHHHHHHHHHHHHHHHhhcccceE--EEeccCcccccccccceeeeecccc Q lcl|NC_016071. 260 IGASKDLGGIIELKIPSQILNKAAID-PKSPESEMVQGLMADAANAHAGEQAY--FILPSDMNAQGGEQYKMSLKGIDGA 336 (516) Q Consensus 260 ~~~er~g~~~~v~~~pp~~~~k~~~~-~~~~~~~~l~~l~~~~~~~~~g~~a~--~iiP~g~~i~~~e~~~iel~~~~g~ 336 (516) .+.. .+. |++++.. +.. .....++..+.+.+..++...|..++ ++++.|+++. ..+-+ T Consensus 162 ~~~~-~~~-------~~g~l~~-~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~g~~~~----------~l~~~ 222 (378) T protein:vir:93 162 TKLE-QGK-------LRGLLKI-NAFLDIDNTQEYREKALTTIKNMQEGSSYNGLTPVDNKTEIV----------ELKKD 222 (378) T ss_pred HHHh-cCc-------ccceeee-CCcCCHHHHHHHHHHHHHHHHHhhcccccccceEcCCCceEE----------EccCC Confidence 1111 222 2222221 111 11223334556666666666666665 4456665422 22212 Q ss_pred CcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCcc Q lcl|NC_016071. 
337 GKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQSIHGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDE 416 (516) Q Consensus 337 g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~ev~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~ 416 (516) ....++ ...++..++|++++.-..-.. .|+++- +........-+.--++.|++.||+.|+..--.--+...... T Consensus 223 ~~~~~~-~~~~~~~~~Ia~~fgVPp~~l----~g~~~e-~~~~~f~~~tl~P~~~~ie~~l~~kLl~~~er~~~~~~~~~ 296 (378) T protein:vir:93 223 YSVLNK-DEIDLIKSELLTGYFMNENIL----LGTATQ-EQQIYFYNSTIIPLLIQLEKELTYKLISTNRRRVVKGNLYY 296 (378) T ss_pred hhhhhH-HHHHHHHHHHHHHhCCCHHHh----cCCcHH-HHHHHHHHHHHHHHHHHHHHHHHhhcCChhHhhhhhhcccc Confidence 222234 344778899999887754222 133331 22334445667888999999999988754211000000001 Q ss_pred ccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccccCcccccCCCCCCcccccccc-cC Q lcl|NC_016071. 417 DMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMSTDELLKLLGQDTSRSGDGMTA-GS 495 (516) Q Consensus 417 ~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~ 495 (516) .-.+|.++.....|+++.+++++++++.|++.+ +.+|+.+|+|+-+.+|+....... .+-.......+.+. .. T Consensus 297 ~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~-----NE~R~~~gl~p~~ggD~~~~~~n~-~~~~~~~~~~~~~~~~~ 370 (378) T protein:vir:93 297 ERIIVDNQLFKFATLKELIDLYHENINGPIFTQ-----NQLLVKMGEQPIEGGDVYIANLNA-VAVKNLSDLQGSRKDVT 370 (378) T ss_pred cceeeccchhhhcCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCCeeeecccc-ccccchhhhcCccCCCC Confidence 124566677788899999999999999999876 578999999976555543221111 11110000010010 00 Q ss_pred -CCCCccc Q lcl|NC_016071. 
496 -NGNGTGK 502 (516) Q Consensus 496 -~~~~~~~ 502 (516) .+++..+ T Consensus 371 ~~~e~~n~ 378 (378) T protein:vir:93 371 STDETNNQ 378 (378) T ss_pred CCCCCCCC Confidence 0111111 No 96 >protein:vir:1661 Length: 378 # NCBI annotation: unknown # Family: family:all:2379 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044950;genbank:gi:9629657;genbank:GeneID:1261302 Probab=99.21 E-value=1.7e-10 Score=74.14 Aligned_cols=364 Identities=10% Similarity=-0.012 Sum_probs=171.7 Q ss_pred cchHHHHHHHHHHHhhcccc----cCCc-ccHHHHHHHhhChHHHHHHHHHHHHHhcCCceeeeCCCCCChhh---HHHH Q lcl|NC_016071. 29 LGSGALSQLRAESEVMKVEE----LRWP-CFLATVEAMKQDHTVSTALDTKYVFVTKAFNDFKVLYNRDSKAS---KDAA 100 (516) Q Consensus 29 ~g~~~~~~~~~~~~~~~~~~----lr~~-~~~~~y~~m~~D~~v~s~l~~Rk~~v~~~~w~i~~~~~~d~~~~---~~~a 100 (516) +|-.+- +..+.....+-+ .-|. +.+. ..-+.|.+|+..+-..|.+++|.+.-....+...+ ...- T Consensus 1 Mg~f~~--~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~v~~~i~~Ia~~iA~l~~~~~~~~~~~~~~~~~~~~~~ 73 (378) T protein:vir:16 1 MNLFGK--VVSFSRGKLNNDTQRVTAWQNEAVE-----YTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLISMAG 73 (378) T ss_pred Cccchh--hhhhhcccccCCcceeeecccchhh-----HHHHHHHHHHHHHHhhhhhCceeEEEEccccccccccccccc Confidence 221110 000100000000 0111 1111 13457999999999999999997632211111000 0111 Q ss_pred HHHHHHHhh-ccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceeeccccccCchhcccccceeecCCCce Q lcl|NC_016071. 101 EFVEYALKN-LANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITIDKIAFRPQSSLSRSKPWVFDEDGRT 178 (516) Q Consensus 101 ~~v~~~l~~-~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l~~r~q~ti~~~~~f~~~~dg~~ 178 (516) .-+.+.|+. -+...+..+++..++ +.+.+|-+.+.++|.-.. |.++. ..++. 
T Consensus 74 ~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~d~~~------g~~~~----------------l~~~~---- 127 (378) T protein:vir:16 74 SDLDEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDDNT------GELLD----------------LLFAD---- 127 (378) T ss_pred chHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeecCC------ceEEE----------------EEecC---- Confidence 224444542 334455667776655 567789999988875321 12110 01110 Q ss_pred eeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCccccchhHHHHHHHHHHHHHHHHHH Q lcl|NC_016071. 179 LKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPAGVSPLVGCYRAFREKILIENLE 258 (516) Q Consensus 179 l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~G~gLlr~~~~~~~fK~~~~~~w 258 (516) ....+|.+.. +|...+-++ -.|.|++..+.-..- T Consensus 128 -----------------------------------~~~~~~~~di-ih~r~~~~~-~~~~s~l~~~~~~i~--------- 161 (378) T protein:vir:16 128 -----------------------------------DKKEYKPEEL-VRLTSPFYI-NEDTSILDNALASIQ--------- 161 (378) T ss_pred -----------------------------------CeeEecccce-EEecCccCc-cchhHHHHHHHHHHH--------- Confidence 0112333443 343222222 235666665543211 Q ss_pred HHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceE--EEeccCcccccccccceeeeecccc Q lcl|NC_016071. 259 TIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAY--FILPSDMNAQGGEQYKMSLKGIDGA 336 (516) Q Consensus 259 ~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~--~iiP~g~~i~~~e~~~iel~~~~g~ 336 (516) .+.. .+. +.+++.....=..+..++..+.+.+..++...|..++ ++++.|++++ ..+-+ T Consensus 162 -~~~~-~~~-------~~g~l~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~----------~l~~~ 222 (378) T protein:vir:16 162 -TKLE-QGK-------LRGLLKINAFLDIDNTQEYREKALTTIKNMQEGSSYNGLTPVDNKTEIV----------ELKKD 222 (378) T ss_pred -HHHh-cCc-------cceeeEeCCcCCHHHHHHHHHHHHHHHHHhhcccccccceEcCCCceEE----------EccCC Confidence 1111 121 1222211111011223344566677766666666666 4556665422 22222 Q ss_pred CcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCcc Q lcl|NC_016071. 
337 GKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQSIHGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDE 416 (516) Q Consensus 337 g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~ev~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~ 416 (516) ....++. ..++..++|++++.-..-... |+++- +-.......-+.-.++.|++.||+.|+++--...+...... T Consensus 223 ~~~~~~~-~~~~~~~~Ia~~fgVPp~~l~----g~~~e-~~~~~f~~~tl~P~~~~ie~~l~~kLl~~~e~~~~~~~~~~ 296 (378) T protein:vir:16 223 YSVLNKD-EIDLIKSELLTGYFMNENILL----GTASQ-EQQIYFYNSTIIPLLIQLEKELTYKLISTNRRRVVKGNLYY 296 (378) T ss_pred hhhhhHH-HHHHHHHHHHHHhCCCHHHhc----CCchH-HHHHHHHHHHHHHHHHHHHHHHHhhcCChhhhhhhhhcccc Confidence 2222343 347788999998877653331 33332 22233445567888889999999888764321111000001 Q ss_pred ccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccccCcccccCCCCCCccccccc-ccC Q lcl|NC_016071. 417 DMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMSTDELLKLLGQDTSRSGDGMT-AGS 495 (516) Q Consensus 417 ~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~ 495 (516) .-.+|.++.....|+++.++++.++++.|++.+ +.+|+.+|+|+-+.+|+..-...- .+-.......+.+ ... T Consensus 297 ~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~-----NE~R~~~g~~p~~ggD~~~~~~n~-~~~~~~~~~~~~~~~~~ 370 (378) T protein:vir:16 297 ERIIVDNQLFKFATLKELIDLYHENINGPIFTQ-----NQLLVKMGEQPIEGGDVYIANLNA-VAVKNLSDLQGSRKDVT 370 (378) T ss_pred cceeeccchhhhcCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCCeEeecccc-ccccchhhhcCccCCCC Confidence 123566677788899999999999999999876 579999999976555543211111 0000000001000 000 Q ss_pred -CCCCccc Q lcl|NC_016071. 
496 -NGNGTGK 502 (516) Q Consensus 496 -~~~~~~~ 502 (516) .+++..+ T Consensus 371 ~~~e~~ne 378 (378) T protein:vir:16 371 STDETNNQ 378 (378) T ss_pred CCCCCCCC Confidence 1111111 No 97 >protein:vir:8100 Length: 466 # NCBI annotation: gp4 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817681;genbank:gi:29566112;genbank:GeneID:1259306 Probab=99.17 E-value=9.3e-10 Score=70.11 Aligned_cols=434 Identities=11% Similarity=0.013 Sum_probs=194.3 Q ss_pred CCcccc-----Ccccccchhhh--ccc-CCCCccccc-chHHHHHHHHHHHhhcccccC-CcccHH-H-HHHHhhChHHH Q lcl|NC_016071. 1 MSTRFA-----QPSEVVKAGNE--NLA-VSRLRTGEL-GSGALSQLRAESEVMKVEELR-WPCFLA-T-VEAMKQDHTVS 68 (516) Q Consensus 1 ~~~r~~-----~~~~~~~~~~~--~p~-~~~~~~~e~-g~~~~~~~~~~~~~~~~~~lr-~~~~~~-~-y~~m~~D~~v~ 68 (516) |=+|.. .++........ +++ .+......- +.+.+..+ ....... .+.... + -+..++-+.|. T Consensus 3 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~g~~~~~~~~~g~~v~~~~a~~~~~v~ 76 (466) T protein:vir:81 3 LIDRLLSTRGAAPRMSIDDYAQMLNEFAFNGIGYGFGGGVPRIQQT------LAGPSTELAPDTFVGLATQAYQANGPVF 76 (466) T ss_pred hhHHHhhccCcccccchhhhhhhhhhhhccccccccccccHHHHHh------hccccccccCccccccchhhhhccHHHH Confidence 222222 11111111111 111 111111100 11111111 1111111 111111 1 23445789999 Q ss_pred HHHHHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHH-HHHhhcceeeeEEEeeccccccc Q lcl|NC_016071. 69 TALDTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKY 147 (516) Q Consensus 69 s~l~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~ 147 (516) +|+..+-..|.+++|.+....+.... 
+.....+...+.+-+...++.+++..++ +.+.+|.+++++++...+ ...+ T Consensus 77 ~~i~~Ia~~ia~lp~~~~~~~~~~~~--~~~~~~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~g-~l~~ 153 (466) T protein:vir:81 77 ACMLVRQLVFSSVRFRWQRLRDGKPS--DTFGSRDLQILETPWKGGTTQDMLSRMIQDADLAGNSYWTIVDGEFV-RMRP 153 (466) T ss_pred HHHHHHHHhhccCceEEEEecCCcee--eccccHHHHHhhCCCCCCCHHHHHHHHHHHHHhcCCeEEEEEecCcc-cccc Confidence 99999999999999988644322111 1111223334444344566778887766 678899999999875322 2222 Q ss_pred ccceeeccccccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEe Q lcl|NC_016071. 148 AGYITIDKIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMS 227 (516) Q Consensus 148 ~g~~~~~~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~ 227 (516) +-.-.+..|.+.++..+. ...+.++......+. ..+. .........+|...+|.++ T Consensus 154 ~~~g~~~~l~~l~~~~v~----~~~~~~~~~~~~y~~--------~~~~------------~~~~~~~~~~~~~dviHir 209 (466) T protein:vir:81 154 DWVDVVVEERMVRGGRGE----LGGGQLGWRKVGYLY--------TEGG------------RQSGNESVGFLAEDVVHFA 209 (466) T ss_pred ccCcceeEEEEecCcceE----EEEcCCCceEEEEEE--------EecC------------cccccceeeeccccEEEEc Confidence 222223445555554332 223333322211110 0000 0011234457777776666 Q ss_pred ecCc-CCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhc Q lcl|NC_016071. 228 LGGT-ESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHA 306 (516) Q Consensus 228 ~~~~-~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~ 306 (516) +... .+..+|.|.+..+....-.-....++-..+....+.+=-+++. +..-++++ .+++++..++... 
T Consensus 210 ~~~~~~d~~~G~s~i~~~~~~i~~~~a~~~~~~~~f~ng~~p~gil~~--------~~~l~~e~---~~~~~~~~~~~~~ 278 (466) T protein:vir:81 210 PIPDPLASYRGMSWLTPILREIRADQAMSKHQAKFFDNGATVNLVIKH--------NPMADPAA---VKKWADEVNSKHA 278 (466) T ss_pred CCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEec--------CCCCCHHH---HHHHHHHHHHHhc Confidence 5432 3556899999999887766666666666666654433222222 22223333 3344444444333 Q ss_pred cc-ce--EEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhccccccc---CCccchhhHHHHHH- Q lcl|NC_016071. 307 GE-QA--YFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLG---NDGQGSYNLSESKQ- 379 (516) Q Consensus 307 g~-~a--~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~---~~~~GS~Al~~vh~- 379 (516) |. .+ .++++.|++.+ ..+.+....+|.+..++...+|++++--..--.+ ..+.++|+-.+-+. T Consensus 279 g~~n~g~~~vl~~g~~~~----------~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lG~~~~~~~st~sn~eq~~~ 348 (466) T protein:vir:81 279 GVDNAWKNLNLYPGADAD----------VVGSNLQEIDFKNVRGGGETRIAAAAGVPPVIVGLSEGLAAATYSNYGQARR 348 (466) T ss_pred CccccccceEcCCCceEE----------EccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcccccCCCccccccHHHHHH Confidence 32 23 36778776433 2222334445777888999999999755432222 11224444444333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecC--cCchhHHHHHH-------HHHHHHhCCccccc Q lcl|NC_016071. 380 SIHGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGL--IQEVDMEGFSK-------FVQRIGAVGYLPKT 450 (516) Q Consensus 380 ev~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~--~~~~dl~~~a~-------~~~~L~~~G~~~~~ 450 (516) .....-+.-.++.|++.||+.|+.. .+... -+|.|+. 
.-..|.+..++ .++.+++.|+ .+ T Consensus 349 ~f~~~tl~P~~~~ie~~l~~~L~~~--------~~~~~-~~~~f~~~~llr~d~~~r~~~~~~~~~~~~~~~~~g~-t~- 417 (466) T protein:vir:81 349 RLADGTAHPLWQNLSGCIGHVMPDM--------GPDVR-LWYDADDVPFLREDEKDAADIQKVRAETINTLITAGY-EP- 417 (466) T ss_pred HHHHHHHHHHHHHHHHHHHhhcCCc--------ccCcc-eEEEecchhhhccCHHHHHHHHHHHHHHHHHHHHcCC-Ch- Confidence 3445567788889999999866542 11111 1344443 33345555443 3777888885 33 Q ss_pred HHHHHHHHHHcCCCCCCCcccccCcccccCCCCCCcccccccccCCCCCcccccccccchhhhhcC Q lcl|NC_016071. 451 PTVINKILEVGGFDEEIPEDMSTDELLKLLGQDTSRSGDGMTAGSNGNGTGKISSTRDNSVSNMDN 516 (516) Q Consensus 451 ~~~~~~i~e~~Glp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~ 516 (516) +.+|+.... .+....+.... .+....+.+. . ...+++....++.| +..| T Consensus 418 ----nE~r~~~~~----gd~~~~~~~~~-~~~~~~~~~~--~--~~~~~~~~~~~Gg~----~ngn 466 (466) T protein:vir:81 418 ----ESVVAAVNS----GDLRLLKHTGL-TSVQLLPPGV--S--ASASSDTPTSGGAD----DNGN 466 (466) T ss_pred ----hhccccccC----CccccccCCCc-chhhhccccc--c--cccCCCCcccCCCC----cCCC Confidence 344432211 11110000000 0000000000 0 00000100111111 1112 No 98 >protein:vir:80040 Length: 461 # NCBI annotation: gp3 # Family: family:all:297 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468707;genbank:gi:157325287;genbank:GeneID:5601731 Probab=99.14 E-value=1.3e-09 Score=69.26 Aligned_cols=432 Identities=11% Similarity=-0.033 Sum_probs=201.6 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHH-hhChHHHHHHHHHHHHHh Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAM-KQDHTVSTALDTKYVFVT 79 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m-~~D~~v~s~l~~Rk~~v~ 79 (516) |++|-.+........... ...+. .-.|....+....+. .-..+.. 
-.+..++.| ++++-+..++++--.-.+ T Consensus 1 ~~~~~~a~~~~~~~~a~~--~~~~~-~~~g~~~~~d~~~~~-~~~~~~~---~~~~~l~~lY~~~~l~r~iVd~~a~d~~ 73 (461) T protein:vir:80 1 MYSIDKAKQAKIDSKIVN--RNDFM-VGHGKANSRDKLTRQ-TPGNGQK---LDLKACENLYASNSIAMNIVDIISEDMV 73 (461) T ss_pred Cccchhhhhhhhhhhhhh--hhHHH-hhcCCcchhhhhhcc-ccCcccc---cCHHHHHHHHHhCCccchhhccchHHhh Confidence 998877665554322210 11110 111111011111110 0001110 133333334 358888888888888777 Q ss_pred cCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHHHHHhhcceeeeEEEeeccccccc--------ccce Q lcl|NC_016071. 80 KAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAATFNEYGFSIFEKVYRTESAPSKY--------AGYI 151 (516) Q Consensus 80 ~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~lda~~~G~S~~Eivw~~~~~~~~~--------~g~~ 151 (516) +..|+|+. ++ .+..+.+++.++++.. |..+...+-.+..||.+.+=+.-. ++..+.+ .... T Consensus 74 r~g~~i~~----~~---~~~~~~~~~~~~~l~~---~~~l~~~~~~~rl~G~a~i~i~v~-d~~~~~~~~~~pl~~~~~~ 142 (461) T protein:vir:80 74 RAGWSLKT----DN---KEMKKNIESKWRKLKT---KDRFQKLYADKRLYGDGFLSIGVV-SSNREQADLSTAIDPKTIK 142 (461) T ss_pred cCCeeeec----CC---HHHHHHHHHHHHHhhH---HHHHHHHHHhhcccccEEEEEEee-cCCccccCccCCccccccc Confidence 77777653 22 2445667777777653 666666677899999987644321 1111111 1111 Q ss_pred eeccccccCchhcccccceeecCCCceeeeccccccccccccccccccccccc---cccccccCCCccccccccEEEEee Q lcl|NC_016071. 152 TIDKIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMS---LVTNLTSSADEVFIPINKLMVMSL 228 (516) Q Consensus 152 ~~~~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~iP~~k~i~~~~ 228 (516) .+..|.+-.+..|.. ....+++....++.+.++.+...-. .............|-+.+++.+.. 
T Consensus 143 ~~~~l~~~~~~~i~~-------------~~~~~dp~sp~fg~P~~y~i~~~~~~~~~~~~~~~~~~~~~iH~SRii~~~~ 209 (461) T protein:vir:80 143 SIPYINTFNTQKVTQ-------------LYLNQDMFSEHFGEVEFFEVNRVSQLGEEILSGTTASTSEQIHRSRIIHEQG 209 (461) T ss_pred ceeEEEeccccccch-------------hhhcccCcCcccccceEEEEeccccccccccccccCccceEEccccEEEecC Confidence 111222111111100 0011111111112211111110000 000001122335677888888887 Q ss_pred cCcCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhccc Q lcl|NC_016071. 229 GGTESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGE 308 (516) Q Consensus 229 ~~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~ 308 (516) ..-.+..+|.|++..+|....--.....-=++.+.++.. .+++..-.. .-.+. +...+.+ .+...+.+ T Consensus 210 ~~~~~~~~G~S~le~~~~~l~~~~~~~~~~~~l~~~~~~--~v~k~~~l~-----~~~~~-~~~~~~~---~~~~~~~~- 277 (461) T protein:vir:80 210 LRFEGETKGRSIFESLYDIITVMDTSLWSVGQILYDFAF--KVYKTDDID-----ALNKD-DKANLTA---MLDFMFRT- 277 (461) T ss_pred CCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHHHhCC--CceecchHH-----hhhch-HHHHHHH---HHHHhcCC- Confidence 777788899999999998665444444434445555443 444432111 11111 2222222 22222323 Q ss_pred ceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHHHHHHHHHHH Q lcl|NC_016071. 
309 QAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQSIHGHFVQR 388 (516) Q Consensus 309 ~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~ev~~~~~~a 388 (516) ...+++..+ .+++.++.+-+| ...+++..-.+||-+.--..--+-.+..|..|-|+-.....-+.+++ T Consensus 278 ~g~~~~d~~--------e~~e~~~~~lsg----l~~~l~~~~~~iaa~s~iP~t~L~G~s~g~~asge~D~~~yyd~i~~ 345 (461) T protein:vir:80 278 EALAIIKGD--------EQLTKESTNVSG----MKDLLDYGWDYLAGAVRMPKTVLKGQEAGTLTGAQYDVMNYYARVSS 345 (461) T ss_pred ceEEEEcCC--------cceEEEecCcCC----HHHHHHHHHHHHhhhhcCCeeeeecccCCccccchHHHHHHHHHHHH Confidence 334455443 245555554333 45677777777776654443222222235567677667778888888 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCcCCcccc----ceEEecCcCchhH-------HHHHHHHHHHHhCCcccccHHHHHHH Q lcl|NC_016071. 389 DIDIIVEAFNKNLIPQLLALNDIRLSDEDM----PKLKPGLIQEVDM-------EGFSKFVQRIGAVGYLPKTPTVINKI 457 (516) Q Consensus 389 Da~~i~~~ln~~li~~lv~lN~~~~~~~~~----P~~~~~~~~~~dl-------~~~a~~~~~L~~~G~~~~~~~~~~~i 457 (516) ..+....-+.+.|++.|+.--+.+++...+ -.|.|...-..+- +..|++++++++.|++.+ .+..+.+ T Consensus 346 ~qe~~l~p~le~l~~~i~~s~~~~~~~~~p~~~~~~i~f~~L~~~s~kekAe~~~~~a~a~~~~~~~g~is~-~e~r~~l 424 (461) T protein:vir:80 346 IQENRLRPQLEYLTRLLMWASDDCGPSIDPDSFEWAIEFNPLWNLDSKTDAEVRKLTAEADQIYIVNGVLDP-DEVKETR 424 (461) T ss_pred HHHHHHHHHHHHHHHHHHHHhcccccccCccccceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCH-HHHHHHH Confidence 887655555556777776533333322111 2356654433332 344667899999998877 3445667 Q ss_pred HHHcCCCCCC--Ccc--c--ccCcccccCCCCCCccccc Q lcl|NC_016071. 458 LEVGGFDEEI--PED--M--STDELLKLLGQDTSRSGDG 490 (516) Q Consensus 458 ~e~~Glp~~~--~~~--~--~~~~~~~~~~~~~~~~~~~ 490 (516) +.++++.+.. 
+.+ + ....+....++ ...++| T Consensus 425 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~e~~~g 461 (461) T protein:vir:80 425 FGRFGLENSSKFSGDSAEIDKLAKLVYDAYA--KKNADG 461 (461) T ss_pred HHhcCCCCCccCCCCCchhhhhhhhcccccc--ccCCCC Confidence 7788875432 111 1 10000010111 111111 No 99 >protein:vir:5249 Length: 437 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852754;genbank:gi:31544029;interpro:IPR006445;uniprot:Q7Y5U6;genbank:GeneID:2753529 Probab=99.11 E-value=1.9e-09 Score=68.44 Aligned_cols=423 Identities=10% Similarity=0.060 Sum_probs=189.3 Q ss_pred ccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhhChHHHHHHHHHHHHHhcCCceeeeC Q lcl|NC_016071. 9 SEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQDHTVSTALDTKYVFVTKAFNDFKVL 88 (516) Q Consensus 9 ~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~~v~~~~w~i~~~ 88 (516) .+.++.. .++.. + +|+..-..+. .-..+..+ ....++...++.+-+..++++.-.-.++..|+|+. T Consensus 1 ~~~~D~~-~~~~~-~-----~g~~~~~~~~----~~~~~~~~--~~~~l~a~Y~~~~l~~~~vd~~a~d~~r~~~~i~~- 66 (437) T protein:vir:52 1 MKFFDGI-KSLAL-K-----LGSKQEQTYY----SPSLSLTD--DLVQLEALWRDNWIANKVCIKRPEDMVRNWREIYS- 66 (437) T ss_pred Cchhhhh-HhHHh-c-----CCCcccccee----ecCccccc--cHHHHHHHHHhCchhhHHhhcchHHhhcCCceEec- Confidence 1111111 11110 0 1110000000 00011111 12233333346899999999887777777777753 Q ss_pred CCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHHHHHhhcceeeeEEEeecccc-ccc-ccceeeccccccCchhccc Q lcl|NC_016071. 89 YNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAATFNEYGFSIFEKVYRTESAP-SKY-AGYITIDKIAFRPQSSLSR 166 (516) Q Consensus 89 ~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~lda~~~G~S~~Eivw~~~~~~-~~~-~g~~~~~~l~~r~q~ti~~ 166 (516) ++ .+.+..+.+++.++++.. |..+...+-.+..||=+++=+ ..++.. 
..| +..-.++.|.+.++..+ T Consensus 67 ---~d-~~~~~~~~~~~~~~~l~~---~~~l~~a~~~~rl~G~a~i~i--~~d~~~~~~pl~~~~~~~~~~v~~~~~v-- 135 (437) T protein:vir:52 67 ---ND-LNSKQLDLFTKFERSLKL---RETLTKALQWSSLYGSVGLLV--VTDSQNTSAPLKPTERLKRLIILPKWKI-- 135 (437) T ss_pred ---CC-CCHHHHHHHHHHHHhhcH---HHHHHHHHHhcccccceEEEE--EecCCCcccccccCCceeEEEEechhhc-- Confidence 22 122334667788887753 555555555688999776533 333321 111 00011223333332211 Q ss_pred ccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeec---CcCCccccchhHHH Q lcl|NC_016071. 167 SKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLG---GTESNPAGVSPLVG 243 (516) Q Consensus 167 ~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~---~~~g~p~G~gLlr~ 243 (516) ...+ ...+++....++ .|..+ .+.+......|.+.+++.+... ...++.+|.|+|.. T Consensus 136 ------~~~~----~~~~dp~s~~fg--------~p~~y--~v~~~~~~~~iH~SRii~~~~~~~~~~~~~~~G~s~le~ 195 (437) T protein:vir:52 136 ------SPTG----TKDDDVLSPNFG--------RYSEY--SILGGSQSITVHHSRLIILNANDAPLSDNDIWGVSDLEK 195 (437) T ss_pred ------cccc----cccccccccccC--------cceEE--EEecCCcceeEccceeEEecCccCCCccccccCCchHHH Confidence 1111 001111111111 12122 1233344567888898887542 24467789999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEeccCcccccc Q lcl|NC_016071. 244 CYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILPSDMNAQGG 323 (516) Q Consensus 244 ~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~ 323 (516) +|....--.....-=+..+.++. +.+++.. ........+ . 
.+.+....+++...+ +....+++..+- T Consensus 196 ~~~~i~~~~~~~~~~~~l~~~~~--~~v~k~~-~l~~~l~~~---~-~~~~~~~~~~~~~~~-~~~~~~~~d~~~----- 262 (437) T protein:vir:52 196 IIDVLKRFDSASVNVGDLIFESK--IDIFKIA-GLSDKIAAG---M-ENEVASVISAVQEIK-SATNSLLLDAEN----- 262 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHcC--CCceecc-hHHHHhcCC---c-HHHHHHHHHHHHHhc-CCCceEEEcCCc----- Confidence 99766544333333344455543 3444432 110111111 1 122333334443333 334455665442 Q ss_pred cccceeeeeccccCcchhHHHHHHHHHHHHHHHHhccccc-ccCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016071. 324 EQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFIN-LGNDGQGSYNLSESKQSIHGHFVQRDIDIIVEAFNKNLI 402 (516) Q Consensus 324 e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLt-s~~~~~GS~Al~~vh~ev~~~~~~aDa~~i~~~ln~~li 402 (516) +++.++.+=+ +...+++..-.+||.+.--..-- .+.+. |..|-|+-....+-+.+++.......-+.+.|+ T Consensus 263 ---~~e~~~~~~s----gl~~~l~~~~~~iaaa~~iP~t~L~G~s~-~Glasge~D~~~yyd~i~~~Qe~~l~p~le~l~ 334 (437) T protein:vir:52 263 ---EYDRKELTFT----GLKDLLTEFRNAVAGAADMPVTILFGQSV-SGLASGDEDIQNYHEAIRRLQETRLRPIFEIID 334 (437) T ss_pred ---ceEEEecCcC----CHHHHHHHHHHHHHHHhcCchhhhcCcCc-ccccccHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3455544322 34567777777888665444322 22232 334666766777778888877654444445677 Q ss_pred HHHHHhcCCcCCccccceEEecCcCchh-------HHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccccCc Q lcl|NC_016071. 403 PQLLALNDIRLSDEDMPKLKPGLIQEVD-------MEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMSTDE 475 (516) Q Consensus 403 ~~lv~lN~~~~~~~~~P~~~~~~~~~~d-------l~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~~~ 475 (516) +.|+.--++..++. -.|.|...-..+ .+..+++++++++.|++.+ .+..+.+++.-.++.-.+++..... 
T Consensus 335 ~~i~~~~~g~~~~~--~~~~f~pL~~~s~kekae~~~~~a~a~~~~~~~g~i~~-~e~r~~L~~~g~~~~i~~~~~~~~~ 411 (437) T protein:vir:52 335 PLICNELFGGLPAD--WWFEFVPLTTVKQEQQINMLNTFATAANTLIQNGVLNE-YQIANELRESGLFANISAEHIEELK 411 (437) T ss_pred HHHHHHhcCCCCCc--ceEEeCCcCCcCHHHHHHHHHHHHHHHHHHHhcCCCCH-HHHHHHHHhcCCCCCCCcccccccc Confidence 76665443221111 134554332222 2456778999999998877 3445556554222311111111001 Q ss_pred cccc-CCCCCCcccccccccCCCCCccc Q lcl|NC_016071. 476 LLKL-LGQDTSRSGDGMTAGSNGNGTGK 502 (516) Q Consensus 476 ~~~~-~~~~~~~~~~~~~~~~~~~~~~~ 502 (516) ...+ ..+..++ +......++.+.++ T Consensus 412 ~~~~~~~~~~~~--~~~~~~~~~~~~~~ 437 (437) T protein:vir:52 412 NADEFAGNFEEP--EKMEGAQVQNSEDQ 437 (437) T ss_pred CCCCCCCccCCC--CCCCCCCCCCCCCC Confidence 1111 1111111 11111111111111 No 100 >protein:vir:1082 Length: 359 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076736;genbank:gi:13095846;genbank:GeneID:920394 Probab=99.08 E-value=2.8e-09 Score=67.47 Aligned_cols=348 Identities=11% Similarity=0.057 Sum_probs=173.1 Q ss_pred CC--ccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhhChHHHHHHHHHHHHH Q lcl|NC_016071. 1 MS--TRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQDHTVSTALDTKYVFV 78 (516) Q Consensus 1 ~~--~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~~v 78 (516) |+ .++.+++..... .-.+. ... ..+ ...+..+. -+..++-+.|.+|+..+-..| T Consensus 1 M~~~~~f~~r~~~~~~-~~~~~-------------------~~~--~~~-~~~~~~v~-~~~al~~~av~~cv~~ia~~i 56 (359) T protein:vir:10 1 MSILNPFERRSSITPN-NYYPF-------------------MVQ--NGS-IVPNSLVD-ATEALKNSDLYAVTSLISSDI 56 (359) T ss_pred CcccchhhccccCCCC-cchhh-------------------hhc--ccc-ccCCcccC-HHHhhcchHHHHHHHHHHHhh Confidence 33 334332221100 00000 000 000 00111111 133456788999999999999 Q ss_pred hcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceeecccc Q lcl|NC_016071. 
79 TKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITIDKIA 157 (516) Q Consensus 79 ~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l~ 157 (516) .++++. +++ ... ..+.+-+...+..+++..+. +.+.+|-++.++++...+ . +..|. T Consensus 57 a~~p~~-------~~~---~~~----~L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~g-------~--~~~l~ 113 (359) T protein:vir:10 57 AGTRFI-------GNQ---VFT----SVLNNPSHLTNAFSFWQTAILNLLLNGNVFLAILKGDNS-------L--MKELR 113 (359) T ss_pred hcCccc-------cch---HHH----HHhhcccccCCHHHHHHHHHHhccccCceEEEEEECCCC-------e--EEEEE Confidence 998762 121 122 22333333456667777766 567789999998875432 1 22333 Q ss_pred ccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcC----C Q lcl|NC_016071. 158 FRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTE----S 233 (516) Q Consensus 158 ~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~----g 233 (516) +.++.++. ...+ +|.+...+. ....+....+|.+..+.+++.... + T Consensus 114 ~l~~~~v~----i~~~-~~~~~y~~~-------------------------~~~~~~~~~~~~~evih~~~~~~~~~~~d 163 (359) T protein:vir:10 114 LIPSNAIT----IDLT-DDTLTYEVN-------------------------QFDDYPSAKYNASEMIHVKIMAYGVDTLH 163 (359) T ss_pred EeCCceEE----EEEc-CCeEEEEEE-------------------------ecCCceEEEEcccceEEeccCCCCCCccC Confidence 33433221 1112 222211110 011223455677776666544322 3 Q ss_pred ccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceE-- Q lcl|NC_016071. 
234 NPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAY-- 311 (516) Q Consensus 234 ~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~-- 311 (516) ..+|.|.+..+....-......++...+...-+.+--+++.| ....++++ .+.+++.......|..+| T Consensus 164 g~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~-------~~~l~~e~---~~~~~~~~~~~~~~~n~g~~ 233 (359) T protein:vir:10 164 NLVGHSPLESLTSEIGQQKEANRLSLSTLKGALNPTSVVKVP-------QGTLSSEA---KDSIRKEFEKANGGNNSGRV 233 (359) T ss_pred ccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeC-------CCCCCHHH---HHHHHHHHHHHhCccccCCc Confidence 457999999888877777777777777665322232233222 11223333 334455555555555554 Q ss_pred EEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCC--ccchhhHHHHHHHHHHHHHHHH Q lcl|NC_016071. 312 FILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGND--GQGSYNLSESKQSIHGHFVQRD 389 (516) Q Consensus 312 ~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~--~~GS~Al~~vh~ev~~~~~~aD 389 (516) ++++.|++... ++. +.....+.+..++..++|++++.-..--.+.. ...+++.. ++.....+.-- T Consensus 234 ~vl~~g~~~~~--------l~~--~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~---e~~~~~~l~~~ 300 (359) T protein:vir:10 234 MVLDQSADFST--------VSI--NADVANYLNSMNWGRTQIAKAFGVSDSYLNGTGDQQSSLDQI---KDLYVNALNRF 300 (359) T ss_pred eecCCCcceee--------ecC--CHHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCcccccHHHH---HHHHHHHHHHH Confidence 67788875332 111 22333466777888899999986655333211 11233222 22222223333 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCC Q lcl|NC_016071. 390 IDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEI 467 (516) Q Consensus 390 a~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~ 467 (516) +..+++-|+..|.+.+ .++. .. 
.+.++ .+.+...+.++++.|++.+ +.+|+.+|+|+=- T Consensus 301 l~p~~~~l~~~l~~~~-~~~~-----~~--~~~~d------~~~~~~~~~~~~~~G~~t~-----NE~R~~l~~~pv~ 359 (359) T protein:vir:10 301 IEPLISELRIKCDSSI-GVDM-----SP--ITDYS------NSVFKADILNWVKEGIIEP-----TEAKTLLESKGII 359 (359) T ss_pred HHHHHHHHHHHhhhhh-cccc-----hh--hhhcC------HHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCC Confidence 3444445554443322 2221 00 12222 2445567888999999876 5799999998543 No 101 >protein:vir:4828 Length: 382 # NCBI annotation: ORF24 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038325;genbank:gi:9634651;genbank:GeneID:1262630 Probab=99.01 E-value=6.1e-09 Score=65.63 Aligned_cols=379 Identities=11% Similarity=0.028 Sum_probs=185.3 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHH-HHHhhChHHHHHHHHHHHHHh Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATV-EAMKQDHTVSTALDTKYVFVT 79 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y-~~m~~D~~v~s~l~~Rk~~v~ 79 (516) |+=-..-.++..+.......+ ++ .....-...+. .+. +..++-+.|.+|+..+-..|. T Consensus 1 Mg~f~~~~~~~~~~~~~~~~~-------~~------------~~~~~~~~~~~--~v~~~~~l~~~~v~~~i~~ia~~ia 59 (382) T protein:vir:48 1 MPIFNLATESPPDNQGGFFDV-------VD------------SDFLASLKGNE--WVSAETALRNSDLFSIINQLSNDLA 59 (382) T ss_pred CccccccccCCcccccccccc-------hh------------hhccccccCCc--ccchHhhhccHHHHHHHHHHHHhhc Confidence 332111111101100000000 00 00000111111 122 233567899999999999999 Q ss_pred cCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceeeccccc Q lcl|NC_016071. 80 KAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITIDKIAF 158 (516) Q Consensus 80 ~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l~~ 158 (516) +++|.+.-. . ++. .+.+-+...++.++++.+. +.+.+|-++++++.... |. 
+..|.+ T Consensus 60 ~~~~~~~~~----~------~~~---L~~~PN~~~t~~~f~~~l~~~l~l~Gna~~~i~rd~~-------G~--~~~l~~ 117 (382) T protein:vir:48 60 TVKLITSRK----K------LQG---IVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNEN-------GR--DMKWEY 117 (382) T ss_pred cCceeeecc----h------hhh---hhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCC-------Cc--EEEEEE Confidence 999876411 1 111 2233344467888888877 57889999999886432 22 223334 Q ss_pred cCchhcccccceeecCCCceee-eccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCcccc Q lcl|NC_016071. 159 RPQSSLSRSKPWVFDEDGRTLK-GIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPAG 237 (516) Q Consensus 159 r~q~ti~~~~~f~~~~dg~~l~-~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~G 237 (516) .++..++ ...+.+|..+. .+... ....+....+|...++++++....+.++| T Consensus 118 i~~~~v~----v~~~~~~~~~~y~~~~~-----------------------~~~~~~~~~~~~~evih~~~~~~~~~~~G 170 (382) T protein:vir:48 118 LRPSQVS----FNRLDNKDGIYYNITFD-----------------------DPRIPPKQHVPQNDVLHFRLLSVDGGMTS 170 (382) T ss_pred EcCceeE----EEEcCCCCeEEEEEEec-----------------------CccccceeEEcCccEEEecCCCCCCcccc Confidence 4443222 12233332221 00000 00112234567777777776666778999 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEeccC Q lcl|NC_016071. 238 VSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILPSD 317 (516) Q Consensus 238 ~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g 317 (516) .|.+..+....-.-....++...+...-+.|--+++.+ ..-..++.+.+.. 
.......+....++++.| T Consensus 171 ~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~--------~~~~~e~~~~~~~---~~~~~~~n~g~~~vl~~g 239 (382) T protein:vir:48 171 VSPLMALSRELDIQKASGNLTINSLKNALNANGILKIK--------GGGLLDFKTKLSR---SRQAMKQMQGGPLVLDDL 239 (382) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeC--------CCCChHHHHHHHH---HHHhhccCCCCeeEcCCC Confidence 99999998877766667777777776555544344432 1222233332222 222222222334667877 Q ss_pred cccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016071. 318 MNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQSIHGHFVQRDIDIIVEAF 397 (516) Q Consensus 318 ~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~ev~~~~~~aDa~~i~~~l 397 (516) ++++ ..+-+....++.+..++..++|++++.-.....+..+.++ ...+.........++--++.|++.| T Consensus 240 ~~~~----------~l~~~~~d~q~~e~~~~~~~~Ia~afgVp~~~lg~~~~~~-~~~~~~~~~~~~~l~p~~~~i~~~l 308 (382) T protein:vir:48 240 EDFT----------PLEIKSNVSQLLKQADWTTGQFAKVYGIPDNVVGGQGDQQ-SSLEMSSDLYSKAVSRYLRPFLSEL 308 (382) T ss_pred ceEE----------EccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcc-cHHHHHHHHHHHHHHHHHHHHHHHH Confidence 6432 2222233445777778888999999877654443222222 2333334455566677788888888 Q ss_pred HHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccccCccc Q lcl|NC_016071. 398 NKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMSTDELL 477 (516) Q Consensus 398 n~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~~~~~ 477 (516) |+.|.+++ .++ ..+.+. .|-..+...+.+|...|++.+ +.+|+.++-..-.+++. . ... 
T Consensus 309 ~~~l~~~~-~~~-------~~~~~~------~~~~~~~~~~~~l~~~g~~t~-----~e~r~~l~~~g~~~~~~-~-~~~ 367 (382) T protein:vir:48 309 SQKLSCDV-DAD-------IFPAVD------PTGSNYISRINSLVKTGTLAQ-----NQGLYILQQAEILPKEL-P-NGE 367 (382) T ss_pred HHHhcChh-hhh-------hhhhhc------cchhHHHHHHHHHhhcCccCH-----HHHHHHHhhCCCCCcch-h-hhh Confidence 88765543 111 111111 122344556778888998765 45677764211111110 0 000 Q ss_pred ccCCCCCCcccccccccCCCCCcccccccccchhhh Q lcl|NC_016071. 478 KLLGQDTSRSGDGMTAGSNGNGTGKISSTRDNSVSN 513 (516) Q Consensus 478 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~ 513 (516) .+.+ + .++.|++..| T Consensus 368 ~~~~--~-------------------~~GGd~~~~~ 382 (382) T protein:vir:48 368 NPNS--T-------------------LKGGEEDGQD 382 (382) T ss_pred cCCC--C-------------------CCCCCCCCCC Confidence 0000 0 0111221111 No 102 >protein:vir:94869 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762515;genbank:gi:115304214;genbank:GeneID:5141182 Probab=98.97 E-value=8.9e-09 Score=64.73 Aligned_cols=362 Identities=11% Similarity=0.000 Sum_probs=164.0 Q ss_pred ccccchHHHHHHHHHH-HhhcccccC----CcccHHHHHHHhhChHHHHHHHHHHHHHhcCCceeeeCCCCC---ChhhH Q lcl|NC_016071. 26 TGELGSGALSQLRAES-EVMKVEELR----WPCFLATVEAMKQDHTVSTALDTKYVFVTKAFNDFKVLYNRD---SKASK 97 (516) Q Consensus 26 ~~e~g~~~~~~~~~~~-~~~~~~~lr----~~~~~~~y~~m~~D~~v~s~l~~Rk~~v~~~~w~i~~~~~~d---~~~~~ 97 (516) |+-.+. +..+. ....+...+ +++.+. +.-+.|.+|+..+-..|.++++.+.-....+ +.... T Consensus 1 M~if~~-----~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~v~~~v~~Ia~~iA~lp~~~~~~~~~~~~~~~~~~ 70 (378) T protein:vir:94 1 MNLFGK-----VVSFSRGKLNNDTQRVTAWQNEAVE-----YTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLIS 70 (378) T ss_pred CchhHH-----hHhhhhcccccCcceeeeeecchhh-----hhhHHHHHHHHHHHHhHhhCceeeeeecccccccccccc Confidence 221111 11110 111111110 122221 1235799999999999999998653211111 10001 Q ss_pred HHHHHHHHHHhh-ccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceeeccccccCchhcccccceeecCC Q lcl|NC_016071. 
98 DAAEFVEYALKN-LANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITIDKIAFRPQSSLSRSKPWVFDED 175 (516) Q Consensus 98 ~~a~~v~~~l~~-~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l~~r~q~ti~~~~~f~~~~d 175 (516) ..-.-+.++|+. -+...+..++...++ +.+.+|.+.+-++|.-.. |.++- +.+..+ T Consensus 71 ~~~~~l~~lLn~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~~~~------g~~~~----------------~~~~~~ 128 (378) T protein:vir:94 71 MAGSDLDEVLNWSSKGERNSMEFWQKVIKKLLTTRYIDLYPIFDSET------GELLD----------------LLFAND 128 (378) T ss_pred cccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeeCCC------CcEEE----------------EEEecC Confidence 111223445543 233345566666544 466789888766665332 22210 011111 Q ss_pred CceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCccccchhHHHHHHHHHHHHHHH Q lcl|NC_016071. 176 GRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPAGVSPLVGCYRAFREKILIE 255 (516) Q Consensus 176 g~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~G~gLlr~~~~~~~fK~~~~ 255 (516) +..+|.+..+ |....-+.+. +.+++..+.-..-- T Consensus 129 ---------------------------------------~~~~~~~dvi-h~~~~~~~~~-~~~~~~~~~~~~~~----- 162 (378) T protein:vir:94 129 ---------------------------------------KKEYKPEELV-RLTSPFYINE-DTSILDNALASIQT----- 162 (378) T ss_pred ---------------------------------------cEEechhcee-eecCcCCccc-chhHHHHHHHHHHH----- Confidence 1123333333 3322222222 34566655432111 Q ss_pred HHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceE--EEeccCcccccccccceeeeec Q lcl|NC_016071. 256 NLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAY--FILPSDMNAQGGEQYKMSLKGI 333 (516) Q Consensus 256 ~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~--~iiP~g~~i~~~e~~~iel~~~ 333 (516) +. +.+.+=-+++.| ..+ ..+..++..+.+.+..++...|..++ ++++.|++.+ .. 
T Consensus 163 -----~~-~~~~~~g~l~~~-~~l------~~~~~~~~~e~~~~~~~~~~~~~n~~~~~vl~~g~~~~----------~l 219 (378) T protein:vir:94 163 -----KL-EQGKLRGLLKIN-AFL------DIDNTQEYREKALATIKNMQEGSSYNGLTPVDNKTEIV----------EL 219 (378) T ss_pred -----HH-hhCCcccceeeC-CcC------CHHHHHHHHHHHHHHHHHhhcccccccceeccCCceEE----------Ec Confidence 11 112111112221 111 11222334456666666666666665 4556665422 22 Q ss_pred cccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCc Q lcl|NC_016071. 334 DGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESK-QSIHGHFVQRDIDIIVEAFNKNLIPQLLALNDIR 412 (516) Q Consensus 334 ~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh-~ev~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~ 412 (516) +.+....+. +..++..++|++++.-..-.. . |+++ +.+ ......-+.-.++.|++.||+.|+..--..-+.. T Consensus 220 ~~~~~~~~~-~~~~~~~~~Ia~~fgvPp~~l--~--g~~~--e~~~~~f~~~tl~P~~~~ie~~l~~~Ll~~~e~~~g~~ 292 (378) T protein:vir:94 220 KKDYSVLNK-DEIDLIKSELLTGYFMNENIL--L--GTAT--QEQQIYFYNSTIIPLLIQLEKELTYKLISTNRRRVVKG 292 (378) T ss_pred cCChHHhhH-HHHHHHHHHHHHHhCCCHHHh--c--CCch--HHHHHHHHHHHHHHHHHHHHHHHHhhcCChhHhhhhhh Confidence 222222334 345778889999887753222 1 3333 212 2233445667788888888887765421111100 Q ss_pred CCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccccCcccc-cC--CCCCCcccc Q lcl|NC_016071. 413 LSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMSTDELLK-LL--GQDTSRSGD 489 (516) Q Consensus 413 ~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~~~~~~-~~--~~~~~~~~~ 489 (516) .....-..|.++.....|+++.++++.++++.|++.+ +.+|+.+|+|+-+.+|+..-...- +. .....+... T Consensus 293 ~~~~~~~~f~~~~l~~~d~~~~~e~~~~~~~~G~~t~-----NE~R~~~g~~p~~ggd~~~~~~n~~~~~~~~~~~~~~~ 367 (378) T protein:vir:94 293 NLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQ-----NQLLVKMGEQPIEGGDVYIANLNAVAVKNLSDLQGNRK 367 (378) T ss_pred hcccceeEeecchhhhcCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCCeeeecccccchhcchhcccccC Confidence 0001113455566667899999999999999999876 579999999876555543211110 00 000011111 Q ss_pred cccccCCCCCccc Q lcl|NC_016071. 
490 GMTAGSNGNGTGK 502 (516) Q Consensus 490 ~~~~~~~~~~~~~ 502 (516) +.+ ..+++..+ T Consensus 368 ~~~--~~~e~~n~ 378 (378) T protein:vir:94 368 DVT--STDETNNQ 378 (378) T ss_pred CCC--CCCCCCCC Confidence 111 01111111 No 103 >protein:vir:78641 Length: 278 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429941;genbank:gi:156603995;genbank:GeneID:5525387 Probab=98.93 E-value=3.6e-09 Score=66.88 Aligned_cols=275 Identities=10% Similarity=0.020 Sum_probs=148.4 Q ss_pred HhcCCceeeeCCCCCChhhHHHHHHHHHHHh-hccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceeecc Q lcl|NC_016071. 78 VTKAFNDFKVLYNRDSKASKDAAEFVEYALK-NLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITIDK 155 (516) Q Consensus 78 v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~-~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~~~ 155 (516) |.++++.+.-.. ...+...+ ..|. +-+...++.+++..++ +.+.+|-++++++.... |. +.. T Consensus 1 ia~l~~~~~~~~---~~~~~~l~----~lL~~~PN~~~t~~~f~~~~~~~ll~~Gna~~~i~r~~~-------G~--~~~ 64 (278) T protein:vir:78 1 MASLPLKMYEDY---KVVNTEVS----DLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIY-------HQ--PSK 64 (278) T ss_pred CccceeEEEecC---cccccHHH----HHHHhcCCCCCCHHHHHHHHHHHHhhcCCEEEEEEECCC-------Cc--EEE Confidence 889998775322 11222232 3333 3344566788888877 67889999999987533 22 223 Q ss_pred ccccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCcc Q lcl|NC_016071. 
156 IAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNP 235 (516) Q Consensus 156 l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p 235 (516) |-+.++.+++ ...+.+|..+...- ....+....+|.+-++.+++....+.+ T Consensus 65 l~~l~~~~v~----v~~~~~~~~~~y~~-------------------------~~~~g~~~~~~~~evih~~~~~~~~~~ 115 (278) T protein:vir:78 65 LFLLNPDVVE----MLIENQSRELYYSI-------------------------HAATGNKLIVHNMDMLHFKHIVASNMV 115 (278) T ss_pred EEEECCceeE----EEEcCCCceEEEEE-------------------------EcCCceEEEEccccEEEECCCCCCCCe Confidence 4444444332 23344443321110 011223345677766655555456678 Q ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEec Q lcl|NC_016071. 236 AGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILP 315 (516) Q Consensus 236 ~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP 315 (516) +|.|.+..|....-......+ |.. .+++.+ |.++ -+....-++++. +++++..+....+....+++| T Consensus 116 ~G~s~~~~~~~~i~~~~~~~~-~~~--~~~~~~------~~~i-~~~~~~l~~e~~---~~~~~~~~~~~~~~g~~~vl~ 182 (278) T protein:vir:78 116 QGISPIDVLKNTTDFDNAVRT-FNL--TEMQKP------DSFM-LKYGSNVGKEKR---QQVLEDFKQYYEENGGILFQE 182 (278) T ss_pred eeccHHHHHHHHHHHHHHHHH-HHH--HHhcCC------CcEE-EEeCCCCCHHHH---HHHHHHHHHHhccCCCceecC Confidence 999999998776655443333 322 233332 1111 122222233332 233333333333334456777 Q ss_pred cCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHH-HHHHHHHHHHHHHHH Q lcl|NC_016071. 316 SDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQ-SIHGHFVQRDIDIIV 394 (516) Q Consensus 316 ~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~-ev~~~~~~aDa~~i~ 394 (516) .|++++ ..+-+....++.+..++..++|++++.-...-.+..++++++-.+.+. 
......++-.++.|+ T Consensus 183 ~g~~~~----------~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~sn~~~~~~~~~~~~l~P~~~~i~ 252 (278) T protein:vir:78 183 PGVEIE----------PLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNRFYLQHTLLPIVKQYE 252 (278) T ss_pred CCceEE----------EccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHH Confidence 776432 222233445677888899999999988876555444446666655554 555667889999999 Q ss_pred HHHHHHHHHHHHHhcCCcCCccccceEEecCcCc Q lcl|NC_016071. 395 EAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQE 428 (516) Q Consensus 395 ~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~ 428 (516) +.||+.|+++--... .. +|+|+...= T Consensus 253 ~~ln~~L~~~~e~~~------g~--~~~f~~~~l 278 (278) T protein:vir:78 253 EEFNRKLLTKTDREK------IG--ILNLTLNLI 278 (278) T ss_pred HHHHhhcCChhHhcC------Cc--eEEEecccC Confidence 999998876421111 01 344542111 No 104 >protein:vir:858 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047117;genbank:gi:9630570;genbank:GeneID:1261758 Probab=98.90 E-value=1.7e-08 Score=63.24 Aligned_cols=364 Identities=12% Similarity=0.025 Sum_probs=159.6 Q ss_pred ccccchHHHHHHHHHHHhhcccccC----CcccHHHHHHHhhChHHHHHHHHHHHHHhcCCceeeeCCCCCChhh--HH- Q lcl|NC_016071. 26 TGELGSGALSQLRAESEVMKVEELR----WPCFLATVEAMKQDHTVSTALDTKYVFVTKAFNDFKVLYNRDSKAS--KD- 98 (516) Q Consensus 26 ~~e~g~~~~~~~~~~~~~~~~~~lr----~~~~~~~y~~m~~D~~v~s~l~~Rk~~v~~~~w~i~~~~~~d~~~~--~~- 98 (516) |+-.+.. ..+ ..........+ ..+.+. 
+.-+.|.+|+..+-..|.++++.+.-....+...+ .+ T Consensus 1 M~~f~k~--~~~--~~~~~~~~~~~~~~~~~~~~~-----~~~~~v~~~v~~ia~~iA~lp~~~~~~~~~~~~~~~~~~~ 71 (378) T protein:vir:85 1 MNLFGKV--VSF--SRGKLNNDTQRVTAWQNEAVE-----YTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLISM 71 (378) T ss_pred Cchhhhh--hhh--hhcccccCCcceeeeeccchh-----hhhHHHHHHHHHHHHhHhhCceeEEEEecccccccccccc Confidence 2211110 000 00000000000 111221 13456999999999999999987643222111111 11 Q ss_pred HHHHHHHHHhhc-cCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceeeccccccCchhcccccceeecCCC Q lcl|NC_016071. 99 AAEFVEYALKNL-ANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITIDKIAFRPQSSLSRSKPWVFDEDG 176 (516) Q Consensus 99 ~a~~v~~~l~~~-~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l~~r~q~ti~~~~~f~~~~dg 176 (516) .-.-+...|... +...+..++...+. +.+.+|-+.+.+++.... |.+.. ..+..++ T Consensus 72 ~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~~~~------g~~~~----------------~~~~~~~ 129 (378) T protein:vir:85 72 AGSDLDEVLNWSYKGEHNSMEFWQKVIKKLLCTRYVDLYPIFDSET------GELLD----------------LLFANDK 129 (378) T ss_pred ccchHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEeecCCC------ceEEE----------------EEecCCC Confidence 112233444422 23345556666544 567789999877775432 22210 1111111 Q ss_pred ceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCccccchhHHHHHHHHHHHHHHHH Q lcl|NC_016071. 177 RTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPAGVSPLVGCYRAFREKILIEN 256 (516) Q Consensus 177 ~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~G~gLlr~~~~~~~fK~~~~~ 256 (516) ....++..|.|+ .+-+.+- +.+.+..+.-. +. T Consensus 130 ---------------------------------------~~~~~~dvih~~-~~~~~~~-~~~~~~~a~~~------~~- 161 (378) T protein:vir:85 130 ---------------------------------------KEYKPEELVRLV-SPFYINE-DTSILDNALAS------IQ- 161 (378) T ss_pred ---------------------------------------EEEcccceEEEe-cCcCccc-hhhHHHHHHHH------HH- Confidence 111223334333 2212111 12333333211 11 Q ss_pred HHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceE--EEeccCcccccccccceeeeecc Q lcl|NC_016071. 
257 LETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAY--FILPSDMNAQGGEQYKMSLKGID 334 (516) Q Consensus 257 ~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~--~iiP~g~~i~~~e~~~iel~~~~ 334 (516) . +. +.+.+=-+++. +..+ ++ +..++..+.+.+...+...|..++ ++++.|+++. -++. T Consensus 162 --~-~~-~~~~~~g~l~~-~~~l-----~~-~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~--------~l~~- 221 (378) T protein:vir:85 162 --T-KL-EQGKLRGLLKI-NAFL-----DI-DNTQEYREKALATIKNMQEGSSYNGLTPVDNKTEIV--------ELKK- 221 (378) T ss_pred --H-HH-hcCCcceEEEe-CCcC-----CH-HHHHHHHHHHHHHHHHhhcccccccceecCCCceEE--------eccC- Confidence 1 11 22221111121 1111 11 122333455555555555555554 4556665422 1221 Q ss_pred ccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHhcCC Q lcl|NC_016071. 335 GAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQSIHGHFVQRDIDIIVEAFNKNLIPQL---LALNDI 411 (516) Q Consensus 335 g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~ev~~~~~~aDa~~i~~~ln~~li~~l---v~lN~~ 411 (516) +....++ +.+++..++|++++.-..-.. .||++..+ .......-+.-.++.|+..||+.|+.+- ..+-.. T Consensus 222 -~~~~~~~-~~~~~~~~~Ia~~fgVPp~~l----~~s~~e~~-~~~f~~~tL~P~~~~ie~~l~~kLl~~~er~~~~~~~ 294 (378) T protein:vir:85 222 -DYSVLNK-DEIELIKSELLTGYFMNENIL----LGTATQEQ-QIYFYNSTIIPLLIQLEKELTYKLISTNRRRVVKGNL 294 (378) T ss_pred -ChhhhhH-HHHHHHHHHHHHHhCCCHHHh----cCCchHHH-HHHHHHHHHHHHHHHHHHHHHhhcCChhhhhhhhhcc Confidence 2222344 345778889999887764222 13443211 2234445566777788888888776431 111000 Q ss_pred cCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccccCcccccCCCCCCcccccc Q lcl|NC_016071. 412 RLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMSTDELLKLLGQDTSRSGDGM 491 (516) Q Consensus 412 ~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (516) + ..-..|..+.....|+++.++++.++++.|++.+ +.+|+.+|+|+-+.+|...-.... .+-.......+. 
T Consensus 295 ~---~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~-----NE~R~~lgl~p~~gGD~~~~~~N~-~~~~~~~~~~~~ 365 (378) T protein:vir:85 295 Y---YERIIVDNQLFKFATLKELIDLYHENINGPIFTQ-----NQLLVKMGEQPIEGGDIYIANLNA-VAVKNLSDLQGS 365 (378) T ss_pred c---cceeeecchhhhhcCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCCeEeecccc-cccccchhhcCc Confidence 0 0112344556667899999999999999999886 579999999876555543211111 010000101000 Q ss_pred cccCCCCCcccccccccchh Q lcl|NC_016071. 492 TAGSNGNGTGKISSTRDNSV 511 (516) Q Consensus 492 ~~~~~~~~~~~~~~~~d~~~ 511 (516) +.+ ..+...+++. T Consensus 366 ~~~-------~~~~~e~~n~ 378 (378) T protein:vir:85 366 RKD-------VASTDETNNQ 378 (378) T ss_pred cCC-------CCCCCCCCCC Confidence 000 0011111111 No 105 >protein:vir:107742 Length: 537 # NCBI annotation: gp28 # Family: family:all:297 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024875;genbank:gi:48697517;genbank:GeneID:2948359 Probab=98.89 E-value=1.8e-08 Score=63.01 Aligned_cols=451 Identities=8% Similarity=-0.074 Sum_probs=188.4 Q ss_pred CCccccCcccccchhhhcccCCCCccccc-chHHHHHHHHHHHh--hcccccCCcccHHHHHHHhhChHHHHHHHHHHHH Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGEL-GSGALSQLRAESEV--MKVEELRWPCFLATVEAMKQDHTVSTALDTKYVF 77 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~-g~~~~~~~~~~~~~--~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~~ 77 (516) |+-|.+.+..+.......|++..-.+... .+.+...-....+. ..-...++. ++.++...++.+-+..++++.-.- T Consensus 48 ~~~~~~~~~~~~~~~~~~~~~a~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~l~a~Y~~~~l~r~iVd~~A~d 126 (537) T protein:vir:10 48 MAIRDHAIAMMPKVDGSHPDMAMDGLDVEGGTFSAYANPNLSEGLVLWYAQQAFI-GHQMCALIATHWLVNKACSQMPRD 126 (537) T ss_pred CCCCCccCcccccccccccchhccccccchhhhhhhccccccchhhhhccccCCc-cHHHHHHHHhCchhhhhhhhhhHH Confidence 33333222222221111121111111100 00000000000000 000000121 355555445689999999999777 Q ss_pred HhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHHHHHhhcceeeeEEEeecc-ccc-cc---c--cc Q lcl|NC_016071. 
78 VTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAATFNEYGFSIFEKVYRTES-APS-KY---A--GY 150 (516) Q Consensus 78 v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~lda~~~G~S~~Eivw~~~~-~~~-~~---~--g~ 150 (516) .++..|.|+...+ ++.+.+..+.++..++++.. |..+...+-.+..||-+++=+.=.... ..+ .| + +. T Consensus 127 ~~r~~~~i~~~~~--~~~~~~~~~~l~~~~~~l~~---~~~l~~a~~~~rlyG~~~i~i~v~~~D~~~~~~Pl~~~~i~k 201 (537) T protein:vir:10 127 AMRKGYKIISDDG--NELDPKDAKFIDRYDRAFNI---KKHAIQFVRKGRIFGIRIALFKVDSPDPYYYEKPFNIDGVMP 201 (537) T ss_pred hhcCCceeecCCc--ccccHHHHHHHHHHHHHhhH---HHHHHHHHHhcccccceEEEEeecCcCCcccccccccccccc Confidence 7788888875432 23334556778888887753 445555555678899876522211111 100 01 0 00 Q ss_pred eeeccccccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeec- Q lcl|NC_016071. 151 ITIDKIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLG- 229 (516) Q Consensus 151 ~~~~~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~- 229 (516) ..++.|.+..+ +|.... ....+.+++....++. |..+.+ .+..|=+.+++.+... T Consensus 202 g~~k~l~vidp-------~~~~~~---~~~~~~~dp~sp~fg~--------P~~y~v------~g~~iH~SRli~f~g~~ 257 (537) T protein:vir:10 202 GAYKGIVQIDP-------YWCAPL---LDAQASSNPVSMHFYE--------PTYWLI------NGKKYHRSHLAIYINDE 257 (537) T ss_pred cceeEEEEech-------hhcccc---cchhhhccCCccccCC--------ceeeee------cCeEecceeEEEecCCC Confidence 01111111111 111100 0000011111111111 111111 1234445666655422 Q ss_pred -----CcCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHh Q lcl|NC_016071. 230 -----GTESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANA 304 (516) Q Consensus 230 -----~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~ 304 (516) ....+.+|.|+|..||-...--.....-=+..+.++.. .+++..... .- .+ +....+++. ++... 
T Consensus 258 ~p~~~~~~~~~~G~Svlq~~~~~l~~~~~t~~~~~~l~~~~~~--~v~k~~~~~---~l--~~--~~~~~~r~~-~~~~~ 327 (537) T protein:vir:10 258 VVDFLKPSYIYGGVPLPQQIMERVYAAERTANEGPMLAMTKRQ--TVLKVDAAQ---VL--AN--KQQFDETMS-WWTAT 327 (537) T ss_pred CchhhhcccCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCC--ceeeechHH---hh--cC--HHHHHHHHH-HHHhh Confidence 12345679999999987654333333323334444433 343321110 00 11 111222222 22222 Q ss_pred hcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccc-cccCCccchhhHHHHHHHHHH Q lcl|NC_016071. 305 HAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFI-NLGNDGQGSYNLSESKQSIHG 383 (516) Q Consensus 305 ~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtL-ts~~~~~GS~Al~~vh~ev~~ 383 (516) + +....+++..+. .+++.++.+-+| ...+++..-++||-+.--..- ..+.+.+|..|.|+-....+- T Consensus 328 r-~n~g~~~id~e~-------e~~e~~~~~lsg----l~~~l~~~~~~iAa~~~IP~t~L~G~sp~GlnatGe~D~~~yy 395 (537) T protein:vir:10 328 R-DNYQVRVVDKDN-------EDVVQIDTTLND----LDKVIMNQYQLVCAIARTPAPKMLGTVPTGFNSTGDYEEASYH 395 (537) T ss_pred c-CCcceeEecCCC-------ceeEEEeccCCC----HHHHHHHHHHHHHhhhCCCceeeccCCccccccchhHHHHHHH Confidence 2 223334454432 245555544443 345777777777766433221 133343455566776777888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHH-------HHHHHHHHHhCCcccccHHHHHH Q lcl|NC_016071. 384 HFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEG-------FSKFVQRIGAVGYLPKTPTVINK 456 (516) Q Consensus 384 ~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~-------~a~~~~~L~~~G~~~~~~~~~~~ 456 (516) +.+++-...|...+++ |++.|+..-+ ++. .--.|.|......+-++ .+++++++++.|++.+ +. 
T Consensus 396 d~I~~~Qe~l~p~l~~-l~~ll~~~~~--~~~-~~~~i~f~pL~~~s~kEkAei~~~~a~a~~~~~~~G~i~~-----~E 466 (537) T protein:vir:10 396 EECESTQDDMRPLIDR-HHQLVCRSHL--RKR-IRVKVEFPPMDAPKESERADTFLKKMQAAKLAFEMGAVDG-----VD 466 (537) T ss_pred HHHHHHHHHHHHHHHH-HHHHHHHhcC--CCC-cceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCH-----HH Confidence 8888877777777754 6776665543 222 12245565444443333 4567899999998776 34 Q ss_pred HHHHcC---------CCCCCC-cc-cc--cCcccccCCCCCCcccccccccCCCCCcccccccccchhhhhcC Q lcl|NC_016071. 457 ILEVGG---------FDEEIP-ED-MS--TDELLKLLGQDTSRSGDGMTAGSNGNGTGKISSTRDNSVSNMDN 516 (516) Q Consensus 457 i~e~~G---------lp~~~~-~~-~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~ 516 (516) +|+.++ |....+ ++ +. .+...++......+..++..++ .... ..+..|+.++..+| T Consensus 467 vr~~L~~~~~~g~~~l~~~~~~ed~e~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~--~~~~~~~~~~~a~~ 535 (537) T protein:vir:10 467 VNEYLRMDPTLGFTSITPAMRPTDAEDIDVDDEGKPVRIIEDQPAPSEMFG--ATSS--GESANDPRDSGAAF 535 (537) T ss_pred HHHHHhccCccccccccCCCChhhhhcccCCccCCcCCCCCCCCCccccCC--CCcc--ccccCCCccCcccc Confidence 555543 321111 11 11 1111111110000111110111 1111 12344444555555 No 106 >protein:vir:100328 Length: 346 # NCBI annotation: capsid portal protein Q # Family: family:all:196 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655469;genbank:gi:109289937;genbank:GeneID:4157371 Probab=98.77 E-value=2.7e-08 Score=62.09 Aligned_cols=335 Identities=10% Similarity=0.038 Sum_probs=147.1 Q ss_pred CCccccCcccccchhhhcccCCCCcccc--cchHHHHHHHHHHHh-hccccc-CCcccHHHHHHHhh-ChHHHHHHHHHH Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGE--LGSGALSQLRAESEV-MKVEEL-RWPCFLATVEAMKQ-DHTVSTALDTKY 75 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e--~g~~~~~~~~~~~~~-~~~~~l-r~~~~~~~y~~m~~-D~~v~s~l~~Rk 75 (516) ||+|..+.++.............+..++ .--.+. .+..+... ..+... 
.-|-+.+-..++.+ .+|.+++|..++ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~-~~~~~~~~~~~~~~~~~pp~~~~~la~l~~~~~~h~~~i~~k~ 79 (346) T protein:vir:10 1 MKKQLRKNLTQNDRLQPQAQTEIFSFGDPIPVLDRA-DILNYLECSAMYEKWYNPPMSFDGLAKSLRSSTHHESAIITKA 79 (346) T ss_pred CCcccCCCCCcccccccccCeEEEecCCcceecCch-hHHHHHHHhhcCCceEecCCCHHHHHHHHHhhhhcchhhhhhh Confidence 9999887766544333221122222221 110000 01111111 011111 11223333344443 788888887777 Q ss_pred HHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHHHHHhhcceeeeEEEeecccccccccceeecc Q lcl|NC_016071. 76 VFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAATFNEYGFSIFEKVYRTESAPSKYAGYITIDK 155 (516) Q Consensus 76 ~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~lda~~~G~S~~Eivw~~~~~~~~~~g~~~~~~ 155 (516) ..+..+ +. .|++ ..+..++..-+++-+.+|.+++|+++...+. +.. T Consensus 80 n~l~~l---~~-~Pn~---------------------~~t~~~f~~~~~d~ll~Gnay~~i~r~~~G~---------~~~ 125 (346) T protein:vir:10 80 NILLST---CE-VDSR---------------------YLSRRDLSSFVKDYLVFGNAYFEVVRNRLGQ---------VQR 125 (346) T ss_pred hhHHHH---Hh-CCCC---------------------CCCHHHHHHHHHHHHhcCCeEEEEEEcCCCc---------EEE Confidence 666543 11 1121 1122334444556677999999998765332 224 Q ss_pred ccccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCcc Q lcl|NC_016071. 156 IAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNP 235 (516) Q Consensus 156 l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p 235 (516) |.+.++.+++. .-+.++ ... .. ...++....+|.+.++.++...-.+.. T Consensus 126 L~pl~~~~v~~----~~~~~~-~~~-~~-------------------------~~~~g~~~~~~~~dIih~r~~~~~~~~ 174 (346) T protein:vir:10 126 IESPLAKYVRK----GLEAGQ-FYY-VP-------------------------QRFDHQEHEFAKGSIYHLLEPDINQDI 174 (346) T ss_pred EEEecCCceEE----EEcCCe-EEE-EE-------------------------EccCCeEEEEecccEEEecCCCCCCCe Confidence 44555544431 111111 110 00 001123345666665544444334667 Q ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHhhccc-cceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceE--E Q lcl|NC_016071. 
236 AGVSPLVGCYRAFREKILIENLETIGASKDLG-GIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAY--F 312 (516) Q Consensus 236 ~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~-~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~--~ 312 (516) ||.+.+..+......-.....+-..+.. +|+ +=-+++.+ . ..-++++.+ ++++..+....+..++ + T Consensus 175 ~G~~~~~~a~~si~l~~~a~~~~~~~~~-NG~~~~~il~~~------d-~~l~~e~~~---~i~~~~~~~~g~~n~~~~~ 243 (346) T protein:vir:10 175 YGLPQYLSALQSAWLNESATLFRRKYFL-NGAHAGFVFYMS------D-ASQKQEDVE---NIRQQLKQSKGVGNFKNLF 243 (346) T ss_pred eeccHHHHHHHHHHHHHHHHHHHHHHHh-ccCCCceEEEeC------C-CCCCHHHHH---HHHHHHHHhcCccccCcee Confidence 9999998888777766666666666654 332 21122211 1 112333333 3444444333333333 2 Q ss_pred EeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccC--CccchhhHHHHHHHHHH-HHHHHH Q lcl|NC_016071. 313 ILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGN--DGQGSYNLSESKQSIHG-HFVQRD 389 (516) Q Consensus 313 iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~--~~~GS~Al~~vh~ev~~-~~~~aD 389 (516) +++.|.+. ..+++...+-+....+|.+.-++-..+|+.+.--.---.+. +++|+++-.+....++. .-+.-- T Consensus 244 vl~~~~~~-----~gi~~~pis~~~~d~qf~e~k~~~~~~I~~af~VPp~llG~~~~~~~~~s~~e~~~~~f~~~~l~P~ 318 (346) T protein:vir:10 244 VHAPNGKK-----DGIQIIPIADVSAKDEFFNIKNVSRDDVLAAHRVPPQLMGIIPNNTGGFGNVADAAEVFFITEIEPL 318 (346) T ss_pred EecCCCCc-----cceeEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHHHHHHHHHHHHHHH Confidence 33333321 22445544444444557777777788899888765433321 22344443433333322 113333 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHH Q lcl|NC_016071. 390 IDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQ 439 (516) Q Consensus 390 a~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~ 439 (516) ++.|++ +|+.|... 
.+.|...+-.+ +.+ T Consensus 319 ~~~iee-~n~~L~~e---------------~i~F~~~~ll~------~~~ 346 (346) T protein:vir:10 319 QERLKE-FNQWLGQE---------------VIKFKPSKLLQ------RTQ 346 (346) T ss_pred HHHHHH-HHhhcccc---------------eeeechhhhcc------cCC Confidence 334432 33222211 23333211111 111 No 107 >protein:vir:5691 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839850;genbank:gi:30065705;genbank:GeneID:1260599 Probab=98.65 E-value=1e-07 Score=58.91 Aligned_cols=330 Identities=11% Similarity=0.050 Sum_probs=144.1 Q ss_pred CCccccCcccccchhhh--cccCCCCccccc--chHHHHHHHHHHHhhcccc-cCCcccHHHHHHHhh-ChHHHHHHHHH Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNE--NLAVSRLRTGEL--GSGALSQLRAESEVMKVEE-LRWPCFLATVEAMKQ-DHTVSTALDTK 74 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~--~p~~~~~~~~e~--g~~~~~~~~~~~~~~~~~~-lr~~~~~~~y~~m~~-D~~v~s~l~~R 74 (516) ||+|.+........... .+....+..++. --.+ ..+..+.....+.+ ..-|-+..-..++.+ .+|.+|+|..+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~~-~~~~~~~~~~~~~~~~~pp~~~~~la~~~~a~~~h~s~i~~k 79 (344) T protein:vir:56 1 MSKKKGKTPQPAAKTMTASAPKMEAFTFGEPVPVLDR-RDILDYVECISNGRWYEPPVSFTGLAKSLRAAVHHSSPIYVK 79 (344) T ss_pred CCCCCCCCCchhhHHhhcCCCceEEEEcCCceeecCc-chhhhHHHhhhcCccccCCCCHHHHHHHHhhhhhhCccceeh Confidence 99998754332222211 111222222211 0000 00111111111211 011223333345544 88999999887 Q ss_pred HHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHHHHHhhcceeeeEEEeecccccccccceeec Q lcl|NC_016071. 75 YVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAATFNEYGFSIFEKVYRTESAPSKYAGYITID 154 (516) Q Consensus 75 k~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~lda~~~G~S~~Eivw~~~~~~~~~~g~~~~~ 154 (516) +..+.+. +.|++.-. ..++-.-+++-+.+|.+.+|++....+ . +. 
T Consensus 80 ~n~l~~~---~~Pnp~~t-----------------------~~~f~~~~~d~ll~Gnay~~~~rn~~G-------~--~~ 124 (344) T protein:vir:56 80 RNILAST---FIPHPWLS-----------------------QQDFSRFVLDFLVFGNAFLEKRYSTTG-------K--VI 124 (344) T ss_pred hhhHHhh---cCCCCCCC-----------------------HHHHHHHHHHHHhcCCeEEEEEECCCC-------c--EE Confidence 7766552 33332211 112222244556689999999875433 2 23 Q ss_pred cccccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCc-CC Q lcl|NC_016071. 155 KIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGT-ES 233 (516) Q Consensus 155 ~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~-~g 233 (516) .|.+.|+..++ ...++..... +...++.+..+.+.+ +|..... .+ T Consensus 125 ~L~pl~~~~v~------~~~~~~~~~~---------------------------~~~~g~~~~~~~~dI-iHir~~~~~~ 170 (344) T protein:vir:56 125 RLETSPAKYTR------RGVEEDVYWW---------------------------VPSFNEPTAFAPGSV-FHLLEPDINQ 170 (344) T ss_pred EEEEeCCceeE------EeecCCEEEE---------------------------EecCCeEEEEcCccE-EEECCCCCCC Confidence 44455543332 2223321111 111223345566654 4544443 45 Q ss_pred ccccchhHHHHHHHHHHHHHHHHHHHHHHhhccc-cceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcc-cceE Q lcl|NC_016071. 234 NPAGVSPLVGCYRAFREKILIENLETIGASKDLG-GIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAG-EQAY 311 (516) Q Consensus 234 ~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~-~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g-~~a~ 311 (516) ..||.+.+..+......-.....+-..|.. +|+ +=-+++.+ . ..-++++. +++++..+....+ ..-. T Consensus 171 ~~~Gls~~~~a~~si~l~~~a~~~~~~~f~-NGa~pg~Il~~~------d-~~ls~e~~---~~lk~~~~~~~g~~~~r~ 239 (344) T protein:vir:56 171 ELYGLPEYLSALNSAWLNESATLFRRKYYE-NGAHAGYIMYVT------D-AVQDRNDI---EMLRENMVKSKGRNNFKN 239 (344) T ss_pred CcccccHHHHHHHHHHHHHHHHHHHHHHHh-ccCCCceEEEec------C-CCCCHHHH---HHHHHHHHHhcCCCCccc Confidence 578999998888777765555555555554 332 22122211 1 11233333 3344444433321 2122 Q ss_pred EEe--ccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhccccccc--CCccchhhHHHHHHHHHH-HHH Q lcl|NC_016071. 
312 FIL--PSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLG--NDGQGSYNLSESKQSIHG-HFV 386 (516) Q Consensus 312 ~ii--P~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~--~~~~GS~Al~~vh~ev~~-~~~ 386 (516) ++| |.|- ...++++..+-+....+|.+.-++-..+|+.+.--.---++ .+++|+++-.+....++. .-+ T Consensus 240 l~l~~p~g~------~~G~~~~pis~~~~d~qf~e~k~~s~~eIa~afrVPp~llGi~~~~t~~~~n~eq~~~~f~~~tL 313 (344) T protein:vir:56 240 LFLYAPQGK------ADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQLMGGKPENVGSLGDIEKVAKVFVRNEL 313 (344) T ss_pred eEEecCCCC------ccceeEEEcCCChHHHHHHHHHHhhHHHHHHHhCCCHHHhccCCCCCCccccHHHHHHHHHHHHH Confidence 333 4331 01244444444444555777777778889988766543332 122344444433333322 222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhH Q lcl|NC_016071. 387 QRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDM 431 (516) Q Consensus 387 ~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl 431 (516) .--++.|++ +|+.|...++.++ -...+..|- T Consensus 314 ~Pl~~~ie~-~n~~l~~~~~~F~-------------~y~l~~~~~ 344 (344) T protein:vir:56 314 IPLQDRIRE-INGWIGQEVIRFK-------------NYSLDTDNG 344 (344) T ss_pred HHHHHHHHH-HHhhhccccccCC-------------CccccccCC Confidence 333444433 4443433333322 112222221 No 108 >protein:vir:98567 Length: 340 # NCBI annotation: gp1 # Family: family:all:196 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958056;genbank:gi:41057353;genbank:GeneID:2744238 Probab=98.60 E-value=2.1e-07 Score=57.15 Aligned_cols=329 Identities=11% Similarity=0.047 Sum_probs=141.4 Q ss_pred CCccccCcccccchhhhcccCCCCcccc--cchHHHHHHHHHHHhhcccc-cCCcccHHHHHHHhh-ChHHHHHHHHHHH Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGE--LGSGALSQLRAESEVMKVEE-LRWPCFLATVEAMKQ-DHTVSTALDTKYV 76 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e--~g~~~~~~~~~~~~~~~~~~-lr~~~~~~~y~~m~~-D~~v~s~l~~Rk~ 76 (516) ||+|+++.+.........+ ...+..++ .-..+ ..+..+.....+.+ ..-|-+..-..++.+ .+|.+|+|..++. 
T Consensus 1 m~~~~~~~~~~~~~~~~~~-~~~~~~~~p~~~~~~-~~~~~~~~~~~~~~~~~pp~~~~~la~l~~a~~~h~s~i~~k~n 78 (340) T protein:vir:98 1 MSKRKPRKAVAMTASAPQK-MEAFTFGEPVPVLDK-RDILDYVECISNGKWYEPPVSFSGLAKSLRSAVHHSSPIYVKRN 78 (340) T ss_pred CCCCCCCccccccccCccc-eeEEEcCCceeecCc-chhhhhhhhhhcCceecCCCCHHHHHHHHHhccccchhhhhhhh Confidence 9999877655443222211 11121111 10000 00111111111111 112333444455554 8999999998877 Q ss_pred HHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHHHHHhhcceeeeEEEeecccccccccceeeccc Q lcl|NC_016071. 77 FVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAATFNEYGFSIFEKVYRTESAPSKYAGYITIDKI 156 (516) Q Consensus 77 ~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~lda~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l 156 (516) .+.+. +.|.+.- +..++-.-+++-+.+|-+.+|+++...+. +..| T Consensus 79 ~l~~~---~~Pn~~l-----------------------t~~~f~~~~~d~ll~Gnay~~~~rn~~G~---------~~~L 123 (340) T protein:vir:98 79 VLAST---YIPHPLL-----------------------SRQDFSRFALDYLVFGNAFLEQRHSVTGQ---------LIKL 123 (340) T ss_pred HHhhc---cCCCCCC-----------------------CHHHHHHHHHHHHhcCCeEEEEEECCCCc---------EEEE Confidence 77653 3333221 11122223345566899999999765332 2234 Q ss_pred cccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCccc Q lcl|NC_016071. 157 AFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPA 236 (516) Q Consensus 157 ~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~ 236 (516) .+.+...+ ....+++....+ ...++.+.++.+.++.++.-.-.+..| T Consensus 124 ~pl~~~~v------r~~~~~~~~~~~---------------------------~~~~~~~~~~~~eViHir~~~~~~~~~ 170 (340) T protein:vir:98 124 LTSPAKYT------RRGVDDSVFWFV---------------------------ENFTQPHEFAPDTVFHLLEPDINQEIY 170 (340) T ss_pred EEeCCceE------EEcccCcEEEEE---------------------------ecCCeEEEEccccEEEEcCCCCCCCcc Confidence 44444322 223333322111 112233456666654444323245679 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHhhccc-cceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccce--EEE Q lcl|NC_016071. 
237 GVSPLVGCYRAFREKILIENLETIGASKDLG-GIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQA--YFI 313 (516) Q Consensus 237 G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~-~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a--~~i 313 (516) |.+.+..+......-.....+-..|.. +|+ +=-+++.+ ...-++++.+ ++++..++......+ -++ T Consensus 171 Gls~~~~a~~si~l~~aa~~~~~~~f~-NGa~pg~il~~~-------~~~ls~e~~~---~lk~~~~~~~G~~n~~~~~v 239 (340) T protein:vir:98 171 GLPEYLSALNSAWLNESATLFRRKYYQ-NGAHAGYIMYVT-------DPAQSATDVE---SLRDAMRNSKGLGNFKNLFF 239 (340) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHh-ccCCCceEEEec-------CCCCCHHHHH---HHHHHHHHhcCccccCceeE Confidence 999988887776665555554445543 442 21122221 1112333333 344444443221111 122 Q ss_pred -eccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccC--CccchhhHHHHHHHHHHH-HHHHH Q lcl|NC_016071. 314 -LPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGN--DGQGSYNLSESKQSIHGH-FVQRD 389 (516) Q Consensus 314 -iP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~--~~~GS~Al~~vh~ev~~~-~~~aD 389 (516) .|.|- ...++++..+-+....+|.+.-++--.+|+.+.--.---.+- +++|+++-.+....++.. -+.-- T Consensus 240 l~~~g~------~~g~~~~pls~~~~d~qf~e~k~~~~~eIa~a~~VPp~llGi~~~~t~~~sn~e~~~~~f~~~~l~Pl 313 (340) T protein:vir:98 240 YSPNGK------PDGIKIVPLSEVATKDDFFNIKKASAADLMDAHRVPFQLMGGKPENIGSLGDVEKVAKVFVRNELSPL 313 (340) T ss_pred ecCCCC------ccceEEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHHHhcccCCCCCccccHHHHHHHHHHHHHHHH Confidence 23321 112445544444445567777777778898887655433321 223444433333322221 12222 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHH Q lcl|NC_016071. 390 IDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDME 432 (516) Q Consensus 390 a~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~ 432 (516) ++.|++ +|+.|. ... 
++|++.+-.+.+ T Consensus 314 ~~~iee-~n~~L~-------------~e~--~rF~~~~l~~~d 340 (340) T protein:vir:98 314 QDRFRE-VNDWLG-------------MEV--IRFKEYTLDNPE 340 (340) T ss_pred HHHHHH-HHhccc-------------ccc--cccCccccccCC Confidence 223322 232211 122 223222222211 No 109 >protein:vir:1150 Length: 350 # NCBI annotation: predicted capsid packaging protein # Family: family:all:196 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490599;genbank:gi:17313219;genbank:GeneID:927315 Probab=98.54 E-value=3.2e-07 Score=56.22 Aligned_cols=330 Identities=9% Similarity=-0.004 Sum_probs=139.8 Q ss_pred CCccccCcccccchh------hhcc----cCCCCcccccchH-HHHHHHHHHHhhcccccCC-cccHHHHHHHh-hChHH Q lcl|NC_016071. 1 MSTRFAQPSEVVKAG------NENL----AVSRLRTGELGSG-ALSQLRAESEVMKVEELRW-PCFLATVEAMK-QDHTV 67 (516) Q Consensus 1 ~~~r~~~~~~~~~~~------~~~p----~~~~~~~~e~g~~-~~~~~~~~~~~~~~~~lr~-~~~~~~y~~m~-~D~~v 67 (516) ||+|.+...+..-.. +..+ ....+..++.... .-+.+..+.....+.++-. |-+..-.-++. ..+|. T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~y~~~~~~~~~~~pp~~~~~la~~~~~~~~h 80 (350) T protein:vir:11 1 MSKRRSHRRQQPVTVQSAQEGEFIPRQGGRAEAFTFGDPMPVLDGRGILDYLECWPNGRWYEPPLSMEGLAKSVGSSVYL 80 (350) T ss_pred CCccccCCCcCccccCCcchhhhccccccceEEEEeCCceeecCcchhhHHHHHhhcCccccCCCCHHHHHHHHhhhhhh Confidence 999987654432111 1111 0111121111000 0001111111112222111 12222223443 37888 Q ss_pred HHHHHHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHHHHHhhcceeeeEEEeeccccccc Q lcl|NC_016071. 68 STALDTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAATFNEYGFSIFEKVYRTESAPSKY 147 (516) Q Consensus 68 ~s~l~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~lda~~~G~S~~Eivw~~~~~~~~~ 147 (516) +++|..++..+.+. +.|++.- +..++-..+++.+.+|-+.+|++....|. 
T Consensus 81 ~~~l~~k~n~l~~~---~~Pn~~~-----------------------t~~~f~~~v~d~ll~Gnay~~~~rn~~G~---- 130 (350) T protein:vir:11 81 QSGLKFKRNMLAKT---FIPHRLL-----------------------SRATFEQFSLDWLTFGSAYLEQPRSRLGT---- 130 (350) T ss_pred ccchhhhhhhhhhc---ccCCCCC-----------------------CHHHHHHHHHHHHhcCCeEEEEEEcCCCC---- Confidence 88888776655542 2333221 11122223345567899999998654321 Q ss_pred ccceeeccccccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEe Q lcl|NC_016071. 148 AGYITIDKIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMS 227 (516) Q Consensus 148 ~g~~~~~~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~ 227 (516) +..|.+.++..++ ...+++....+ ...+....+|.+.+|.++ T Consensus 131 -----~~~L~~l~~~~vr------~~~~~~~~~~~---------------------------~~~~~~~~~~~~eVihir 172 (350) T protein:vir:11 131 -----RMPLQAPLAKYMR------RGTDLETFYQV---------------------------RSWKDEHEFEKGSVIQLR 172 (350) T ss_pred -----EEEEEEeCCceeE------eeecCCeEEEE---------------------------eeCCeEEEECcccEEEeC Confidence 2234444443332 22333221111 112233456666655444 Q ss_pred ecCcCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccc-cceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhc Q lcl|NC_016071. 228 LGGTESNPAGVSPLVGCYRAFREKILIENLETIGASKDLG-GIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHA 306 (516) Q Consensus 228 ~~~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~-~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~ 306 (516) .-.-.+..||.+.+..+......-.....+-..|.. +|+ +=-+++.+ ...-++++.+. +++..+.... T Consensus 173 ~~~~~~~~yGls~~~~a~~si~l~~~a~~~~~~~f~-NGa~~~gil~~~-------~~~ls~e~~~~---l~~~~~~~~G 241 (350) T protein:vir:11 173 EADINQEIYGVPEWFCALQSALLNESATLFRRKYYN-NGSHAGFILYMT-------DAAQNEEDIDA---LRTALKTAKG 241 (350) T ss_pred CCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHh-ccCCCceEEEec-------CCCCCHHHHHH---HHHHHHHhcC Confidence 333345678999988887777665554444444443 343 11122211 11223334333 4444443332 Q ss_pred ccceE---EEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhccccccc--CCccchhhHHHHHHHH Q lcl|NC_016071. 
307 GEQAY---FILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLG--NDGQGSYNLSESKQSI 381 (516) Q Consensus 307 g~~a~---~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~--~~~~GS~Al~~vh~ev 381 (516) +.+++ +..|.|-+ ..++++..+-+....+|.+.-++-..+|+.+.--.---.+ .+++|+++-.+....+ T Consensus 242 ~~N~~~~~v~~~~g~~------~g~~~~pl~~~~~d~qf~e~k~~~~~eIa~a~~VPp~llGi~~~~t~~~sn~e~~~~~ 315 (350) T protein:vir:11 242 PGNFRNLFVYAPNGKK------EGIQLIPVSEVAAKDEFGSIKNISRDDQLAGLRVYPQLMGVVPQNAGGFGSISDAAAV 315 (350) T ss_pred ccccCceeeecCCCCc------cceEEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcCCHHHHHHH Confidence 22222 22233211 1244444444444556777778888899988775443222 1223444433333332 Q ss_pred H-HHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhH Q lcl|NC_016071. 382 H-GHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDM 431 (516) Q Consensus 382 ~-~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl 431 (516) + ..-+.--++.|++ +|+.|.+.. +.|++-.-.+| T Consensus 316 f~~~~L~P~~~~ie~-ln~~l~~~~---------------~~F~~~~~~~l 350 (350) T protein:vir:11 316 WASLELAPMQTRLQQ-VNEMIGEEV---------------VRFAQFDAPGL 350 (350) T ss_pred HHHHHHHHHHHHHHH-HHhhcCccc---------------cccCcccccCC Confidence 2 2223333444432 443332222 22332222222 No 110 >protein:vir:79647 Length: 435 # NCBI annotation: PorT # Family: family:all:297 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285520;genbank:gi:148734503;genbank:GeneID:5220005 Probab=98.54 E-value=3.3e-07 Score=56.08 Aligned_cols=408 Identities=10% Similarity=-0.004 Sum_probs=173.9 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHH-hhChHHHHHHHHHHHHHh Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAM-KQDHTVSTALDTKYVFVT 79 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m-~~D~~v~s~l~~Rk~~v~ 79 (516) |++. +.+..+.+..... +. +-.|+.. 
- ...+ .....+..+..| ++.+-+..++++.-.-.+ T Consensus 5 m~~~-~~~~~~~D~~~~~--~~----~~~g~~~---~------~~~~--~~~~~~~~l~~~Y~~~~l~~~~Vd~~aed~~ 66 (435) T protein:vir:79 5 MSDK-VKAITKEDGYNEI--FG----SKDGTFR---P------NAFY--MQRAAFKALSQFYEEDGMARRIVDVIPEEMV 66 (435) T ss_pred cccc-cccchhhcchhhh--hc----ccccccc---c------Cccc--CCcCCHHHHHHHHhcCchhhhhhccchHHhh Confidence 7766 4444455543321 11 0111100 0 0000 001123333444 458888888888777666 Q ss_pred cCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHHHHHhhcceeeeEEEeecccc-cc---cccceeecc Q lcl|NC_016071. 80 KAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAATFNEYGFSIFEKVYRTESAP-SK---YAGYITIDK 155 (516) Q Consensus 80 ~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~lda~~~G~S~~Eivw~~~~~~-~~---~~g~~~~~~ 155 (516) +.-|+|+- ++ ++ +.++..++++.. |..+...+-.+..||++.+=+.=. ++.. .. +.|. ++. T Consensus 67 r~g~~i~g----~~--~~---~~~~~~~~~l~~---~~~l~~a~~~~rl~G~~~i~i~~~-d~~~~~~Pl~~~g~--i~~ 131 (435) T protein:vir:79 67 TPGFKVDG----VK--NE---KSFKSRWDELRL---NAKIIDALSWSRLFGGSAILAVVA-DNKMLKSPVKPGAQ--LED 131 (435) T ss_pred cCCceecC----CC--hH---HHHHHHHHHhhH---HHHHHHHHHhhhccccEEEEEEec-CCCCcccccccCCc--eee Confidence 66676641 11 11 235556666542 455555566789999987533221 2211 11 1221 112 Q ss_pred ccccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEee------c Q lcl|NC_016071. 156 IAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSL------G 229 (516) Q Consensus 156 l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~------~ 229 (516) |.+..+..+. .. .+.+++....++ .|..+.+...+...+..|-+.+++.+.. . 
T Consensus 132 i~v~d~~~i~--------~~-----~~~~dp~sp~fg--------~P~~y~v~~~~~~~~~~iH~SRli~~~g~~~p~~~ 190 (435) T protein:vir:79 132 IRVYDRYQIT--------IH-----ERETNARSVRYG--------EPKLYKISPGGDIPEFFVHYSRICIIDGERVSNEK 190 (435) T ss_pred EEeechhhcc--------ch-----hhccCCcccccC--------cceEEEEecCCCCCceEEcceeEEEecCCcchhhh Confidence 2222211111 00 001111111111 2222222222233455677778777642 2 Q ss_pred CcCCccccchhH-HHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhccc Q lcl|NC_016071. 230 GTESNPAGVSPL-VGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGE 308 (516) Q Consensus 230 ~~~g~p~G~gLl-r~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~ 308 (516) ...++++|.|.| +.+|....--.....-=+..+.|+.. .+++.+-.. ....+ ...+.+...++..+. ..+... T Consensus 191 ~~~~~~~G~S~l~e~~~~~l~~~~~~~~~~~~l~~~~~~--~v~~~~~l~--~~~~~-~~~~~~~~~r~~~~~-~~~~~~ 264 (435) T protein:vir:79 191 RRQNDGWGASILNKRLIEAIVDYNYCQELATQLLRRKQQ--AVWKARDLA--LMCDD-EEGRYAARLRLAQVD-DESGVG 264 (435) T ss_pred ccccCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHhcC--ccccchhHH--HhhcC-ccchHHHHHHHHHHH-HhcCCC Confidence 456789999965 78887554433333333444445433 334432111 11111 111222222332222 222222 Q ss_pred ceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhccccc-ccCCccchhhHHHHHHHHHHHHHH Q lcl|NC_016071. 
309 QAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFIN-LGNDGQGSYNLSESKQSIHGHFVQ 387 (516) Q Consensus 309 ~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLt-s~~~~~GS~Al~~vh~ev~~~~~~ 387 (516) .+.+++..+ .+++.++.+=+ +...+++..-.+||.+.--..-- .+.+.+|-.|.|+--...+-+.++ T Consensus 265 ~~~~i~~~~--------e~~e~~~~~ls----gl~~~~~~~~~~iaaa~~IP~t~L~G~s~~glnstgd~d~~~yyd~i~ 332 (435) T protein:vir:79 265 KAIGIDATD--------EEYEVLNSDVS----GVPEFLQEKIDRIVALTGIHEIIIKNKNTGGVSASQNTALETFYKLID 332 (435) T ss_pred CceeEecCC--------cceEEEecccC----CHHHHHHHHHHHHHhhhCCCeeeeccCCccccccchhHHHHHHHHHHH Confidence 233333322 23555544333 34567777777888766554422 232333322445555666777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecC------cCchh-HHHHHHHHHHHHhCCcccccHHHHHHHHH- Q lcl|NC_016071. 388 RDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGL------IQEVD-MEGFSKFVQRIGAVGYLPKTPTVINKILE- 459 (516) Q Consensus 388 aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~------~~~~d-l~~~a~~~~~L~~~G~~~~~~~~~~~i~e- 459 (516) +-......-+.+.|++.++ ++ ++ -.|.|.. .+..+ .+..|++++++++.|++.++ +..+.++. T Consensus 333 ~~Qe~~l~p~l~~l~~li~-~s----~d---~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~g~i~~~-e~r~~L~~~ 403 (435) T protein:vir:79 333 RKRVEDYKPILEFLLPFMI-SE----TE---WSIEFEPLSVPSDKDKAEIMAKNVESVVKLKAEQAINLK-ETRDTLRSI 403 (435) T ss_pred HHHHHHHHHHHHHHHHHhh-cC----CC---CeEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHH-HHHHHHHHh Confidence 7654433333333444332 11 11 1234432 22222 25567889999999998763 33444433 Q ss_pred --HcCCCCCCCcccccCcccccCCCCCCcccccc Q lcl|NC_016071. 460 --VGGFDEEIPEDMSTDELLKLLGQDTSRSGDGM 491 (516) Q Consensus 460 --~~Glp~~~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (516) .+|+......+ .+....-.++...+.|++. 
T Consensus 404 ~~~~~~~~~~~~~--~~~~~d~~~~~~~e~g~~~ 435 (435) T protein:vir:79 404 CPDLKIMDNDNIE--LPEPEDLDPEPGQEGGLNK 435 (435) T ss_pred ccccCCCCccccc--CCccccCCCCCCCCCCCCC Confidence 33433211110 1111111111112222211 No 111 >protein:vir:78749 Length: 337 # NCBI annotation: putative portal protein # Family: family:all:196 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285643;genbank:gi:148727149;genbank:GeneID:5220095 Probab=98.53 E-value=3.5e-07 Score=55.95 Aligned_cols=323 Identities=11% Similarity=0.036 Sum_probs=146.7 Q ss_pred CCccccCcccccchhhhcccCCCCcccc--cchHHHHHHHHHHHhhcccccC---CcccHHHHHHHhh-ChHHHHHHHHH Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGE--LGSGALSQLRAESEVMKVEELR---WPCFLATVEAMKQ-DHTVSTALDTK 74 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e--~g~~~~~~~~~~~~~~~~~~lr---~~~~~~~y~~m~~-D~~v~s~l~~R 74 (516) ||+|..+......... ...+..++ .--.+ ..+..+.....+..-+ -|-++.-..++.+ .+|..++|..| T Consensus 1 m~~~~~~~~~~~~~~~----~~~~~~~~p~~~~~~-~~~~~~~~~~~~~~~~~~~pP~~~~~La~l~~~~~~h~~~L~~k 75 (337) T protein:vir:78 1 MTKRQQQPAQAAASSP----RPSVVFSMPEAIDPT-AWMTDYTGVFYNPYGEYYQPPIDRKGLAKVARANAHHGAILMAR 75 (337) T ss_pred CCCcccCcccccccCc----eeEEEecCcccccCc-chhHhhhhhhhccCcceecCCCCHHHHHHHhhcchhhhhHHHhh Confidence 9999887654332211 11111111 10000 0011111111111111 1223333345544 88999999888 Q ss_pred HHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHHHHHhhcceeeeEEEeecccccccccceeec Q lcl|NC_016071. 75 YVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAATFNEYGFSIFEKVYRTESAPSKYAGYITID 154 (516) Q Consensus 75 k~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~lda~~~G~S~~Eivw~~~~~~~~~~g~~~~~ 154 (516) ...+.+. +.+ . .++ |+.++ ++-+.+|.+.+|+++...|. +. 
T Consensus 76 ~N~~~~~---f~~--~---------~~~-------------~~~~~---~d~ll~GNay~~~~rn~~G~---------~~ 116 (337) T protein:vir:78 76 RNMVAGR---FTN--Q---------RAT-------------ITAFV---HNYLQFGDGGLLKLRNSFGQ---------VV 116 (337) T ss_pred hcccccc---CcC--c---------HHH-------------HHHHH---HHHHhhCCeEEEEEECCCCc---------EE Confidence 7765542 111 0 011 23333 34556899999999865332 33 Q ss_pred cccccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCc-CC Q lcl|NC_016071. 155 KIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGT-ES 233 (516) Q Consensus 155 ~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~-~g 233 (516) .|.+.|+.+++ ...||+... .....+...+|.+.++ |..... .+ T Consensus 117 ~L~pl~~~~v~------~~~d~~~~~----------------------------~~~~~~~~~~~~~eIi-Hik~~~~~~ 161 (337) T protein:vir:78 117 GLHPLSSVYLR------RREDGCFVY----------------------------LQQGKPNLIYRPDDVI-WLAQYDPEQ 161 (337) T ss_pred EEEEeCCceeE------eeeCCeEEE----------------------------EEcCCceEEECCccEE-EECCCCCCC Confidence 45555554442 233443211 1112234456666654 544444 45 Q ss_pred ccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceE-- Q lcl|NC_016071. 234 NPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAY-- 311 (516) Q Consensus 234 ~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~-- 311 (516) ..||.+.+..+......-.....+-..|.. +|+- |.+++......-++++. +++++..++...+..++ T Consensus 162 ~~~Gls~~~~a~~si~l~~aa~~~~~~~f~-NGa~------p~~il~~~~~~l~~e~~---~~lk~~~~~~~G~~n~~~~ 231 (337) T protein:vir:78 162 QVYGMPDYLGGLQSALLNQDATLFRRRYFL-NGAH------MGFIFYATDPNMDDDTE---EEMKEMIANSKGVGNFRSM 231 (337) T ss_pred CcccccHHHHHHHHHHHHHHHHHHHHHHHh-ccCC------CceeEEcCCCCCCHHHH---HHHHHHHHHhcCcccccce Confidence 678999888887776665555544444443 3331 11122111111223333 33444444433222222 Q ss_pred -EEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhccccccc---CCccchhhHHHHHHHH-HHHHH Q lcl|NC_016071. 
312 -FILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLG---NDGQGSYNLSESKQSI-HGHFV 386 (516) Q Consensus 312 -~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~---~~~~GS~Al~~vh~ev-~~~~~ 386 (516) +.+|.|.+ ..++++..+-+....+|.+.-++-..+|+.+.--.---.+ ...+|+++-.+....+ ..+-+ T Consensus 232 ~v~~~~g~~------~Gi~~~pis~~~~d~qfle~k~~s~~eIa~a~~VPp~llGi~~~~~~~~~~n~e~~~~~f~~~~L 305 (337) T protein:vir:78 232 FVNIPDGKP------DGIKLIPVGDIATKDEFAAIKGITAQDVLTAHRYPPALAGIIPTNGGGGLGDPEKYDATYARNEV 305 (337) T ss_pred EEEcCCCCc------cceeEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHHcccccCCCcCccccHHHHHHHHHHHHH Confidence 33344421 1244555544445555777667777888888765432221 1112444434444333 33446 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhH Q lcl|NC_016071. 387 QRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDM 431 (516) Q Consensus 387 ~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl 431 (516) .--++.|++.+|+.+++.-. ++.|+.....-+ T Consensus 306 ~P~~~~ie~~~n~~ll~~~~-------------~~~f~~~~~~~~ 337 (337) T protein:vir:78 306 LPLCELVQDAINSAGLPRAL-------------WVTFRETIGAAV 337 (337) T ss_pred HHHHHHHHHHHhhhcCChhh-------------ceeccccccccC Confidence 66677777777765443221 122332222211 No 112 >protein:vir:6058 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878199;genbank:gi:33438898;genbank:GeneID:1457733 Probab=98.50 E-value=4.3e-07 Score=55.48 Aligned_cols=330 Identities=11% Similarity=0.061 Sum_probs=141.6 Q ss_pred CCccccCcccccchhhh--cccCCCCcccccchHHHHH--HHHHHHhhccccc-CCcccHHHHHHHhh-ChHHHHHHHHH Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNE--NLAVSRLRTGELGSGALSQ--LRAESEVMKVEEL-RWPCFLATVEAMKQ-DHTVSTALDTK 74 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~--~p~~~~~~~~e~g~~~~~~--~~~~~~~~~~~~l-r~~~~~~~y~~m~~-D~~v~s~l~~R 74 (516) ||+|.+.........+. ......+..++.- +-++. +..+.....+.+. 
.-|-+..-..++.+ .+|.+|+|..+ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~f~~p~-~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~~a~~~h~~~i~~k 79 (344) T protein:vir:60 1 MSKKKGKTLQPAAKKMTASAPKMEAFTFGEPV-PVLDRRDILDYVECISNGRWYEPPISFTGLAKSLRAAVHHSSPIYVK 79 (344) T ss_pred CCcccCCCCCchHHhhcCCcCcEEEEEcCCce-eecCCcchhHHHHhhhcCccccCCCCHHHHHHHHHhhhhhccchhhh Confidence 99998765433222111 1112222222210 00000 1111111111110 01112222234444 88889998887 Q ss_pred HHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHHHHHhhcceeeeEEEeecccccccccceeec Q lcl|NC_016071. 75 YVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAATFNEYGFSIFEKVYRTESAPSKYAGYITID 154 (516) Q Consensus 75 k~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~lda~~~G~S~~Eivw~~~~~~~~~~g~~~~~ 154 (516) +..+.+. +.|++.-. . .+ |+.+ +++-+.+|-+.+|++....+ . +. T Consensus 80 ~n~l~~~---~~Pn~~~t----~--~~--------------f~~~---~~d~ll~Gnay~~i~rn~~G-------~--~~ 124 (344) T protein:vir:60 80 RNILAST---FIPHPWLS----Q--QD--------------FSRF---VLDFLVFGNAFLEKRYSTTG-------K--VI 124 (344) T ss_pred hhHHHhh---ccCCCCCC----H--HH--------------HHHH---HHHHHhcCCeEEEEEECCCC-------c--EE Confidence 7776552 33432211 0 11 3223 33455689999999876432 2 23 Q ss_pred cccccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCc-CC Q lcl|NC_016071. 155 KIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGT-ES 233 (516) Q Consensus 155 ~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~-~g 233 (516) .|.+.|+.+++ +..+++.... +...++...+|.+.++ |..... .+ T Consensus 125 ~L~~l~~~~vr------~~~~~~~~~~---------------------------v~~~~~~~~~~~~eIi-Hir~~~~~~ 170 (344) T protein:vir:60 125 RLETSPAKYTR------RGVEEDVYWW---------------------------VPSFNEPTAFAPGSVF-HLLEPDINQ 170 (344) T ss_pred EEEEcCcceEE------EeecCCeEEE---------------------------EccCCeEEEEcCccEE-EEcCCCCCC Confidence 44455544332 2222221111 1112233456666654 444433 45 Q ss_pred ccccchhHHHHHHHHHHHHHHHHHHHHHHhhccc-cceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcc-cceE Q lcl|NC_016071. 
234 NPAGVSPLVGCYRAFREKILIENLETIGASKDLG-GIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAG-EQAY 311 (516) Q Consensus 234 ~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~-~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g-~~a~ 311 (516) ..||.+.+..+.-....-.....+-..|.. +|+ +=-+++.+ ...-++++. +++++..+...++ ..-. T Consensus 171 ~~yGlsp~~~a~~si~l~~~a~~~~~~~f~-NG~~pg~il~~~-------~~~ls~e~~---~~ik~~~~~~~g~~~~r~ 239 (344) T protein:vir:60 171 ELYGLPEYLSALNSAWLNESATLFRRKYYE-NGAHAGYIMYVT-------DAVQDRNDI---EMLRENMVKSKGRNNFKN 239 (344) T ss_pred CcccccHHHHHHHHHHHHHHHHHHHHHHHh-ccCCCceEEEec-------CcCCCHHHH---HHHHHHHHHhcCCCCCcc Confidence 679999988887776665554444444443 332 11122211 111233333 3344444433322 1112 Q ss_pred EEe--ccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhccccccc--CCccchhhHHHHHHHHHH-HHH Q lcl|NC_016071. 312 FIL--PSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLG--NDGQGSYNLSESKQSIHG-HFV 386 (516) Q Consensus 312 ~ii--P~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~--~~~~GS~Al~~vh~ev~~-~~~ 386 (516) ++| |.|-. ..++++..+-+....+|.+.-++-..+|+.+.--.---.+ .+++|+++-.+-...++. .-+ T Consensus 240 ~~l~~p~g~~------~g~~~~pis~~~~d~qf~e~k~~~~~eIa~af~VPp~llGi~~~~t~~~~n~e~~~~~f~~~~L 313 (344) T protein:vir:60 240 LFLYAPQGKA------DGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQLMGGKPENVGSLGDIEKVAKVFVRNEL 313 (344) T ss_pred eEEecCCCCc------cceeEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHHHhcccCCCCCccccHHHHHHHHHHHHH Confidence 333 43311 1244444444444455777777888899998866543332 122344444433333221 112 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhH Q lcl|NC_016071. 
387 QRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDM 431 (516) Q Consensus 387 ~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl 431 (516) .--++.|++ || .|| +...-+|.....+..|- T Consensus 314 ~Pl~~~~e~-ln----~~l---------g~~~i~F~~~~l~~~d~ 344 (344) T protein:vir:60 314 IPLQDRIRE-IN----GWL---------GQEVIRFKNYSLDTDNG 344 (344) T ss_pred HHHHHHHHH-HH----Hhc---------CCcccccCccccCCCCC Confidence 222222221 22 222 12333455555555552 No 113 >protein:vir:94049 Length: 532 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453629;genbank:gi:84662665;genbank:GeneID:5142559 Probab=98.48 E-value=5e-07 Score=55.12 Aligned_cols=455 Identities=10% Similarity=0.011 Sum_probs=185.5 Q ss_pred CCccccCccc--ccchhhhccc--CCCCcccccchHHHHH---HHHHHH----hhcccccCCcccHHHHHHHhhChHHHH Q lcl|NC_016071. 1 MSTRFAQPSE--VVKAGNENLA--VSRLRTGELGSGALSQ---LRAESE----VMKVEELRWPCFLATVEAMKQDHTVST 69 (516) Q Consensus 1 ~~~r~~~~~~--~~~~~~~~p~--~~~~~~~e~g~~~~~~---~~~~~~----~~~~~~lr~~~~~~~y~~m~~D~~v~s 69 (516) |-.+.++.+. ..+.-...|+ .|+.+.+ ..-.++. ..+-.+ ........+ -++.++....+.+-+.. T Consensus 23 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~a~~~g~~~~~~~~~~~~~~~~~~~-~~~~l~a~Y~~~~l~r~ 99 (532) T protein:vir:94 23 VDAKRATHTSLGLATAHEIDPTAYSPYERNA--AQNAMAMDYGLQTGRNGRNALSFVEATSW-PGFPTLALLAQLPEYRT 99 (532) T ss_pred hhhhhhhhhhhhhhhhhhhcccccccccccc--cccccccccccCccccccccccccccccc-chHHHHHHHHcCchhhh Confidence 2222221111 1111111121 1211111 1000000 000000 000001111 23444444446889999 Q ss_pred HHHHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHHHHHhhcceeeeEEEeecccccccc- Q lcl|NC_016071. 70 ALDTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAATFNEYGFSIFEKVYRTESAPSKYA- 148 (516) Q Consensus 70 ~l~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~lda~~~G~S~~Eivw~~~~~~~~~~- 148 (516) ++++.-.-.++.-|+|... .+++...+..+.++..++++.. 
|..+...+-.+..||.+++=+.=+-.+.....+ T Consensus 100 ~Vd~~aed~~r~~~~i~~~--~~~~~~~~~~~~i~~~~~~l~v---~~~l~~a~~~~rlyG~a~i~i~v~~~~~~~~~~~ 174 (532) T protein:vir:94 100 MHETPADECVRAWGKITCS--SKDELAADKATRITQKLEQYNV---RTLVRTVVIHDQAYGGAHVFPHLKMDGDSVPADA 174 (532) T ss_pred hhccchHHHhhCCceEeeC--CccccchHHHHHHHHHHHhhhH---HHHHHHHHHhhhcccceEEEEEeccCCccccccc Confidence 9999988888888887643 2233345666778888887743 555555566788999987432211111100000 Q ss_pred ---------cceeeccccccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccc Q lcl|NC_016071. 149 ---------GYITIDKIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIP 219 (516) Q Consensus 149 ---------g~~~~~~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP 219 (516) +.-.++.|.+..+ ||.. .+.. -.+++....++ .|..+.+ ..+..|. T Consensus 175 p~~l~~~~I~~g~~~~l~vld~-------~~v~-p~~~----~~~dp~sp~fg--------~P~~y~v-----~~g~~iH 229 (532) T protein:vir:94 175 PLLLSPSFVQRGCLIGFATIEP-------MWLS-PNAY----NATDPTLPSFY--------KPDSWIA-----TSGKKIH 229 (532) T ss_pred cccccccccccceeeEEEeech-------heec-cccc----ccccccccccC--------CceeEEE-----ccCeeec Confidence 0001112222211 1211 1100 00111111111 1111111 1234677 Q ss_pred cccEEEEeecC------cCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHH Q lcl|NC_016071. 220 INKLMVMSLGG------TESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEM 293 (516) Q Consensus 220 ~~k~i~~~~~~------~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~ 293 (516) +.+++.|.... ...+.+|.|++..+|-...--.....-=+..+.++. +.+++.- +.... ........ 
T Consensus 230 ~SRli~f~g~~~p~~~~~~~~~~G~Svlq~~~~~l~~~~~t~~~~~~l~~~~~--~~v~k~~---~a~~l--s~~~~~~~ 302 (532) T protein:vir:94 230 SSRIHTVVGRPVGDMLKAAYSFRGVSISQLAMPYVDNWLRTRQSVSDTVKQFS--MTNLATD---MAQLL--APGGAQSL 302 (532) T ss_pred cceEEEecCCCchhhhccccccccccHHHHHHHHHHHHHHHHHHHHHHHHhcC--Cceeeec---hHHhh--cchhHHHH Confidence 77877765332 234557999999998755433233222233344433 3344320 00111 11122222 Q ss_pred HHHHHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhccccc--ccCCccch Q lcl|NC_016071. 294 VQGLMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFIN--LGNDGQGS 371 (516) Q Consensus 294 l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLt--s~~~~~GS 371 (516) .+++. ++...+ +....++++.+. .+++.++.+-+| ...+++..-++||-+.-- .+| .+.+.+|- T Consensus 303 ~~r~~-~~~~~~-~n~g~~~id~~~-------e~~e~~~~~lsg----l~~~l~~~~~~iAaa~~I-P~t~LfG~sp~Gl 368 (532) T protein:vir:94 303 DARLQ-LFNLYR-DNRNIGALDKGT-------EEIQQTNTPLSG----LDSLQAQSQEQMAAVSHI-PLVKLLGITPNGL 368 (532) T ss_pred HHHHH-HHHhhc-CCccceEEcCCC-------ceeEEEecccCC----HHHHHHHHHHHHHhHhCC-CeeeeecCCcccc Confidence 23332 222221 334456666543 235555443332 456777777788754433 223 23333343 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhH-------HHHHHHHHHHHhC Q lcl|NC_016071. 372 YNLSESKQSIHGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDM-------EGFSKFVQRIGAV 444 (516) Q Consensus 372 ~Al~~vh~ev~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl-------~~~a~~~~~L~~~ 444 (516) -|-|+--...+-+.+++-......-+.+.|++.|+..-++..++ . -.|+|...-..+- +..+++++++++. 
T Consensus 369 nstGe~D~~~yyd~I~s~Qe~~l~p~le~l~~~l~~s~~g~~~~-d-~~~~f~pL~~~s~kEkAei~~~~a~a~~~~~~~ 446 (532) T protein:vir:94 369 NASSDGEIRVWYDFIAGYQATNLTPLMEWIIDLIQLSEYGQIDP-G-LAWEWSPLMELDDKELAEVRQLNASTDSTLMEL 446 (532) T ss_pred cccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCC-C-ceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhc Confidence 35566566778888887765554444455666665443322221 1 2455653333322 3446678899999 Q ss_pred CcccccHHHHHHHHHHcCCCCCC----------C--cccccCccccc---CCCCCCcccc-cccccCCCCCccccccccc Q lcl|NC_016071. 445 GYLPKTPTVINKILEVGGFDEEI----------P--EDMSTDELLKL---LGQDTSRSGD-GMTAGSNGNGTGKISSTRD 508 (516) Q Consensus 445 G~~~~~~~~~~~i~e~~Glp~~~----------~--~~~~~~~~~~~---~~~~~~~~~~-~~~~~~~~~~~~~~~~~~d 508 (516) |++.+ +.+|+.++..... + +.+......+. .++.+.+.++ +...+.+...+.+..+..| T Consensus 447 Gvi~~-----~Evr~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~ 521 (532) T protein:vir:94 447 GVIDA-----KMVQQRLAADPTSGYAGALGERDELDDVEEIAKQLMAAALNPPATAPQTPNPQPDSEDDQTDNQPDAQAD 521 (532) T ss_pred CCCCH-----HHHHHHHhcCCccccccccccccccccccchhhhhcccccCCCCCCCCCCCCCCCCCCCCCCCccCCCcc Confidence 98765 4677777753211 0 00000000000 0000001000 0000000001111111111 Q ss_pred chhhh--hcC Q lcl|NC_016071. 509 NSVSN--MDN 516 (516) Q Consensus 509 ~~~~~--~~~ 516 (516) ..... .-| T Consensus 522 ~~~~~~~~~~ 531 (532) T protein:vir:94 522 PAQNDQPVGN 531 (532) T ss_pred ccccCCCcCC Confidence 11111 112 No 114 >protein:vir:79150 Length: 368 # NCBI annotation: bacteriophage gpQ # Family: family:all:196 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165254;genbank:gi:145708079;genbank:GeneID:5247161 Probab=98.46 E-value=2e-07 Score=57.30 Aligned_cols=341 Identities=13% Similarity=0.037 Sum_probs=136.2 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCC------------------cccHHHHHHHh Q lcl|NC_016071. 
1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRW------------------PCFLATVEAMK 62 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~------------------~~~~~~y~~m~ 62 (516) ||+|.+...+-..+....++..+... +...........+.+.+..|-|.. |-.+....++. T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~fg~p~~~~~~~~~~~~~~~~~~~~~~~~pi~~~~la~~~ 79 (368) T protein:vir:79 1 MSRNKTRRAARAASAHVRTANTDAPT-EHHTDRAAQAEVFSFGDPVEVLDRRELLDYVECMRMGQWYEPPMPWDGLARSF 79 (368) T ss_pred CCccccccchhccCcccccccccCcc-hhhccccCceEEEEcCCceeecchhhHHHHHHHHhccchhccCcCHHHHHHHH Confidence 99999777654433222111111000 000000000111111222222211 11222122222 Q ss_pred h-ChHHHHHHHHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHHHHHhhcceeeeEEEeec Q lcl|NC_016071. 63 Q-DHTVSTALDTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAATFNEYGFSIFEKVYRTE 141 (516) Q Consensus 63 ~-D~~v~s~l~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~lda~~~G~S~~Eivw~~~ 141 (516) + .+|-.+++..+...+. +. +.|+ + ..+..++-.-+++-+.+|.+++|++.... T Consensus 80 ~~~~~h~~~~~~~~n~l~-l~--~~Pn--~---------------------~~t~~~f~~l~~d~ll~Gnay~~~~r~~~ 133 (368) T protein:vir:79 80 RAAAHHSSAVYVKRNILV-ST--FIPH--P---------------------LLSRATFERLVLDWQVFGNAYLERRENVL 133 (368) T ss_pred hhccccchhhhhhcchhh-hh--cCCC--c---------------------CCCHHHHHHHHHHHhhcCCeEEEEEEcCC Confidence 2 4444444443332221 11 1111 1 12233443445566779999999987643 Q ss_pred ccccccccceeeccccccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccc Q lcl|NC_016071. 142 SAPSKYAGYITIDKIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPIN 221 (516) Q Consensus 142 ~~~~~~~g~~~~~~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~ 221 (516) +. 
+..|.+.++.+++ ...|++....+ ...++...+|.+ T Consensus 134 G~---------~~~L~~l~~~~v~------~~~~~~~~~~~---------------------------~~~~~~~~~~~~ 171 (368) T protein:vir:79 134 GG---------TIRLDTPLAKYVR------RGLDLNTYFFV---------------------------QNWQQPYTFAAG 171 (368) T ss_pred CC---------EEEEEEeCcccce------eeccCCEEEEE---------------------------ecCCeEEEEccc Confidence 32 2234444443332 23333322111 112334456666 Q ss_pred cEEEEeecCcCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHH Q lcl|NC_016071. 222 KLMVMSLGGTESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADA 301 (516) Q Consensus 222 k~i~~~~~~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~ 301 (516) .++..+.-.-.+..||.+.+..+......-.....+-..+.. +|+- |.+++......-++++. +++++.. T Consensus 172 dIihir~~~~~~~~yGlsp~~~a~~si~l~~aa~~~~~~~~~-NGa~------~~gil~~~~~~l~~e~~---~~lk~~~ 241 (368) T protein:vir:79 172 SVFHLQEPDINQEVYGLPEYLSALNATWLNESATLFRRRYYK-NGSH------AGFILYMTDAAQKQEDV---DTLREAM 241 (368) T ss_pred cEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHh-ccCC------CceEEEeCCCCCCHHHH---HHHHHHH Confidence 654444333345679999998888776665555554445543 3431 12222111112233333 3444444 Q ss_pred HHhhcccceE--EEe-ccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccC--CccchhhHHH Q lcl|NC_016071. 302 ANAHAGEQAY--FIL-PSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGN--DGQGSYNLSE 376 (516) Q Consensus 302 ~~~~~g~~a~--~ii-P~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~--~~~GS~Al~~ 376 (516) +......+++ +++ |.|.+ ..++++..+-+....+|.+.-++-.++|+.+..-.-.-++. 
+++|+++-.+ T Consensus 242 ~~~~G~~N~g~~~vl~~~g~~------~g~~~~pls~~~~d~qf~e~k~~~~~eIa~af~VPp~llGi~~~~t~~~sn~e 315 (368) T protein:vir:79 242 KSAKGPGNFRNLFMYAPNGKK------DGIQLLPVSEVAAKDEFWNIKNVTRDDQLAAHRVPPQLMGIIPNNTGGFGDVE 315 (368) T ss_pred HHhcCCcccCceeEecCCCCc------cceeEEEcCCCHHHHHHHHHHHHhHHHHHHHhCCCHHHccccCCCCCccccHH Confidence 4433333333 222 33321 12444444444444567777788889999998655433321 1223333333 Q ss_pred HHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecC--cCchhHHHHHHHHHHHHhC Q lcl|NC_016071. 377 SKQSIHG-HFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGL--IQEVDMEGFSKFVQRIGAV 444 (516) Q Consensus 377 vh~ev~~-~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~--~~~~dl~~~a~~~~~L~~~ 444 (516) ....++. .-+.--++.| ..+|-.-++ . .+.|+. ....|.+..|..-++ ++ T Consensus 316 ~~~~~f~~~~l~Pl~~~i------------e~ln~~l~~--e--~~rF~~~~l~~~D~~a~a~~~~r--sa 368 (368) T protein:vir:79 316 KAAMVFARNEVKPLQDRL------------LAINDWIGD--E--VVRFAPYALGGHDQPAAAPGGQR--SA 368 (368) T ss_pred HHHHHHHHHHHHHHHHHH------------HHHHhccCc--c--eeeechhHhhcccccccCCcccc--cC Confidence 2222221 1122222233 233311111 1 233432 112222222211111 00 No 115 >protein:vir:107662 Length: 427 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003893;genbank:gi:45686310;genbank:GeneID:2773002 Probab=98.45 E-value=6.2e-07 Score=54.61 Aligned_cols=413 Identities=10% Similarity=-0.013 Sum_probs=167.1 Q ss_pred CcccccchhhhcccCCCCcccccchHHHHHHHH-HHHhhcccccCCcccHHHHHHHhhChHHHHHHHHHHHHHhcCCcee Q lcl|NC_016071. 7 QPSEVVKAGNENLAVSRLRTGELGSGALSQLRA-ESEVMKVEELRWPCFLATVEAMKQDHTVSTALDTKYVFVTKAFNDF 85 (516) Q Consensus 7 ~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~-~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~~v~~~~w~i 85 (516) -+...++..+. ...+ ......+. 
.+.-..+.++...++.+-+..++++.-.-.++.-|+| T Consensus 1 ~~~~~~d~~~~------------------~~~~~~~~~~~~~-~~~~~~~~l~a~Y~~~~l~~~~Vd~~aed~~r~g~~i 61 (427) T protein:vir:10 1 MKIVKHDGYND------------------IFNGGADGSPKPF-FMSDASYHVGSFYNDNATAKRIVDVIPEEMVTAGFKM 61 (427) T ss_pred CCccccchHHH------------------HhhcCCCCcccCc-cccCchHHHHHHHHcCchhhhhhccchHHhhcCCccc Confidence 00001111110 0000 00011111 2222245655555568888888888877666766766 Q ss_pred eeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHHHHHhhcceeeeEEEeecccc-cc--cccceeeccccccCch Q lcl|NC_016071. 86 KVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAATFNEYGFSIFEKVYRTESAP-SK--YAGYITIDKIAFRPQS 162 (516) Q Consensus 86 ~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~lda~~~G~S~~Eivw~~~~~~-~~--~~g~~~~~~l~~r~q~ 162 (516) +. ++. + +.++..|+++.. |..+...+-.+..||.+++=+.= ++.. +. .++.-.++.|.+.++. T Consensus 62 ~g----~~~--~---~~~~~~~~~l~~---~~~l~~a~~~~rl~G~a~i~i~v--~d~~~l~~p~~~~g~l~~l~v~d~~ 127 (427) T protein:vir:10 62 SG----VKD--E---KEFKSLWDSYKL---DSSLVDLLCWARLYGGAAMVAII--KDNRMLTSQAKPGAKLEGVRVYDRF 127 (427) T ss_pred cC----ccH--H---HHHHHHHHHhhH---HHHHHHHHHhccccceeEEEEEe--cCCCccccccCCCcceeEEEEechh Confidence 42 211 1 235566666643 45555555569999999964432 2211 00 0111122233333222 Q ss_pred hcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeec------CcCCccc Q lcl|NC_016071. 163 SLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLG------GTESNPA 236 (516) Q Consensus 163 ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~------~~~g~p~ 236 (516) .+.... + .+++....+ ..|..+.+...+...+..|-+.+++.+... 
....+++ T Consensus 128 ~~~~~~-~------------~~dp~s~~f--------g~P~~y~v~~~~~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~ 186 (427) T protein:vir:10 128 AITVEK-R------------VTNARSPRY--------GEPEIYKVSPGDNMQPYLIHHSRVFIADGERVAQQARKQNQGW 186 (427) T ss_pred cccccc-c------------ccCcccccc--------CcceEEEEecCCCCcceEEccccEEEecCCCchhhhcccCCcc Confidence 221100 0 011111111 122222222222333456777787776422 3467789 Q ss_pred cchhHH-HHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEec Q lcl|NC_016071. 237 GVSPLV-GCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILP 315 (516) Q Consensus 237 G~gLlr-~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP 315 (516) |.|+|. .+|....--.....-=+..+.|+.. .+++..-. .....+. ..+.+.+.++..+... +....+.+++. T Consensus 187 G~S~l~~~~~~~i~~~~~~~~~~~~l~~k~~~--~v~k~~~l--~~~~~~~-~~~~~~~~r~~~~~~~-~~~~~~~~l~~ 260 (427) T protein:vir:10 187 GASVLNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGL--AEMCDDD-DAQYAARLRLAQVDDN-SGVGRAIGIDA 260 (427) T ss_pred cchhhhHHHHHHHHHHHHHHHHHHHHHHHhcc--ccccchhH--HHHhcCc-cchHHHHHHHHHHHHh-cCcccceeeec Confidence 999774 5555333222222222333444433 33333211 1111111 1122223333333222 11122223332 Q ss_pred cCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccc-cCCccchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016071. 316 SDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINL-GNDGQGSYNLSESKQSIHGHFVQRDIDIIV 394 (516) Q Consensus 316 ~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts-~~~~~GS~Al~~vh~ev~~~~~~aDa~~i~ 394 (516) .+ .+++.++.+=+| ...+++..-.+||-+.--..--+ +.+.+|--|.|+--....-+.+++-..... 
T Consensus 261 ~~--------e~~e~~~~~lsg----l~~~~~~~~~~iaaa~~IP~t~L~G~sp~Glnstgd~D~~nyyd~i~~~Qe~~l 328 (427) T protein:vir:10 261 ET--------EEYDVLNSDISG----VPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDY 328 (427) T ss_pred CC--------CceeEEecccCC----hHHHHHHHHHHHHhhhCCCeeeeccCCccccccchhHHHHHHHHHHHHHHHHHH Confidence 22 234444433232 45577777778887654443212 222233224455556667777777664433 Q ss_pred HHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhH-HHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccc- Q lcl|NC_016071. 395 EAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDM-EGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMS- 472 (516) Q Consensus 395 ~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl-~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~- 472 (516) .-+-+.|++.++ ++-. ..-+-.|-...+..+..++ +..|++++++++.|++.++ +..+.++...+.....+..+. T Consensus 329 ~p~l~~l~~~i~-~s~~-~~~~f~pL~~~s~kEkaei~~~~a~a~~~~~~~gvi~~~-e~r~~L~~~~~~~~~~~~~~~~ 405 (427) T protein:vir:10 329 RPLLEFLLPFIV-DEEE-WSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLE-EARDTLRSIAPEFKLKDGNNIN 405 (427) T ss_pred HHHHHHHHHHhh-cCCC-cEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHH-HHHHHHHhhhccccCCCCcccc Confidence 333233555433 2200 0000011222222222222 4568899999999998874 445566544333221111111 Q ss_pred cCcccccCCCCCCcccccccccCCCCCcccccccccchhhh Q lcl|NC_016071. 
473 TDELLKLLGQDTSRSGDGMTAGSNGNGTGKISSTRDNSVSN 513 (516) Q Consensus 473 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~ 513 (516) .+..++ ..+..++.+++ +..+| T Consensus 406 ~e~~~~-~~e~~p~~~e~------------------~~d~~ 427 (427) T protein:vir:10 406 IREPEE-TTEPEPGLGEK------------------LEDEN 427 (427) T ss_pred ccccch-hcCCCCCCCCC------------------CCCCC Confidence 000000 00111111111 11111 No 116 >protein:vir:96068 Length: 765 # NCBI annotation: conserved hypothetical protein ORF017 # Family: family:all:297 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294434;genbank:gi:149408331;genbank:GeneID:5237187 Probab=98.42 E-value=7.1e-07 Score=54.28 Aligned_cols=451 Identities=11% Similarity=0.023 Sum_probs=180.0 Q ss_pred CCccccCcc--cccchhhhcccCCC--Cccccc----chHHHHHHHHHHHhhcccccC-------CcccHHHHHHHhhCh Q lcl|NC_016071. 1 MSTRFAQPS--EVVKAGNENLAVSR--LRTGEL----GSGALSQLRAESEVMKVEELR-------WPCFLATVEAMKQDH 65 (516) Q Consensus 1 ~~~r~~~~~--~~~~~~~~~p~~~~--~~~~e~----g~~~~~~~~~~~~~~~~~~lr-------~~~~~~~y~~m~~D~ 65 (516) |-...++.. ++.+.....+ +|. +.+.-+ ...++....+......++.+. .--++.++...++.. T Consensus 43 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~gyql~alY~~~~ 121 (765) T protein:vir:96 43 IRGWNVEPEKAPVIRSVKDFL-EPGLSVAMDSAYGDGPTPAAKAAAGGQNPYVVPTMLQDWYNSQGFIGYQACAIISQHW 121 (765) T ss_pred HhhcccccccCCCCCCCCccc-CcccceeccccccccccchHHHhhhccCccchhhHHHhhhcccCCccHHHHHHHHhCc Confidence 111111111 1111111111 111 111000 000111111100011111111 112355555455689 Q ss_pred HHHHHHHHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHHHHHhhcceeeeEEEeecccc- Q lcl|NC_016071. 66 TVSTALDTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAATFNEYGFSIFEKVYRTESAP- 144 (516) Q Consensus 66 ~v~s~l~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~lda~~~G~S~~Eivw~~~~~~- 144 (516) -+..++++.-.-.++..|+|+.. +++...+..++++..++++.. |..+...+-.+..||-+++ +...++.. 
T Consensus 122 l~rkiVd~pAeDa~R~g~~I~~~---~~e~~~~~~~~l~~~~~rl~v---~~~l~ea~~~~RlyGga~i--~i~i~~~D~ 193 (765) T protein:vir:96 122 LVDKACSMSGEDAARNGWELKSD---GRKLSDEQSALIARRDMEFRV---KDNLVELNRFKNVFGVRIA--LFVVESDDP 193 (765) T ss_pred hhhhhhhcchHHhhcCCceeecC---ccccCHHHHHHHHHHHHHhhH---HHHHHHHHHHhhhceeeEE--EEEecccCc Confidence 99999998866666677777642 233344556678888887753 4455555556899986653 22222110 Q ss_pred --c-cc---c--cceeeccccccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCcc Q lcl|NC_016071. 145 --S-KY---A--GYITIDKIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEV 216 (516) Q Consensus 145 --~-~~---~--g~~~~~~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 216 (516) + .| + +...++.|.... ++|.... ......+++....++. |..+.+ .+. T Consensus 194 ~~l~~PL~~~~I~kg~~kgl~vld-------p~~~~~~---~v~e~~~Dp~sp~fg~--------P~~y~i------~g~ 249 (765) T protein:vir:96 194 DYYEKPFNPDGIAPGSYKGISQID-------PYWAMPQ---LTAESTADPSAEHFYE--------PDFWII------SGK 249 (765) T ss_pred chhhccccccccccceeeEEEEec-------hhhcccc---cchhccccccccccCc--------ceeeee------cCc Confidence 0 11 0 000111111111 1111110 0000111111111111 111111 122 Q ss_pred ccccccEEEEeec------CcCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHH Q lcl|NC_016071. 217 FIPINKLMVMSLG------GTESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPE 290 (516) Q Consensus 217 ~iP~~k~i~~~~~------~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~ 290 (516) .|=+.++|.+... ....+.+|.|+|..||-...--.....-=+..+.|+.. .+++.-.. 
..-..+ T Consensus 250 ~IH~SRli~~~g~~lpd~lk~~~~~~G~Svlq~~yd~I~~~~~t~~~~a~Ll~k~~~--~v~k~~~~-------~~l~~~ 320 (765) T protein:vir:96 250 KYHRSHLVVVRGPQPPDILKPTYIFGGIPLTQRIYERVYAAERTANEAPLLAMSKRT--STIHVDVE-------KAIANE 320 (765) T ss_pred eeccceEEEecCCCchhhhccccCccCccHHHHHHHHHHHHHHHHHHHHHHHHHhcc--ceeeechH-------hhhccH Confidence 3445566655322 24455679999999987655433333333444445443 33332111 111122 Q ss_pred HHHHHHHHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhccc-ccccCCcc Q lcl|NC_016071. 291 SEMVQGLMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGF-INLGNDGQ 369 (516) Q Consensus 291 ~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqt-Lts~~~~~ 369 (516) .+..+++..+.+. + +....++|-.+ .+++.++.+=+| ...+++..-++||-+.--.. ...+.+-. T Consensus 321 ~~l~~r~~~~~~~-r-~n~g~~~id~e--------e~~e~~s~~lsg----l~d~l~~~~~~iAaas~IP~t~LfGqsp~ 386 (765) T protein:vir:96 321 DAFNARLAFWIAN-R-DNHGVKVIGID--------ETMEQFDTNLSD----FDSVIMNQYQLVAAIAKTPATKLLGTSPK 386 (765) T ss_pred HHHHHHHHHHHHh-c-CCceeEEecCC--------cceeEEecccCC----HHHHHHHHHHHHHhhhCCCeeeeccCCcc Confidence 2333334333322 2 33444554433 245555543332 45577777677776644432 12233334 Q ss_pred chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhH-------HHHHHHHHHHH Q lcl|NC_016071. 370 GSYNLSESKQSIHGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDM-------EGFSKFVQRIG 442 (516) Q Consensus 370 GS~Al~~vh~ev~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl-------~~~a~~~~~L~ 442 (516) |-.|-|+--...+-+.+++.......-+.+.|++.|+... ..+.. 
-.+.|......+- +..|+++++++ T Consensus 387 GlnATGe~D~~nYyD~I~s~Qe~~l~p~le~L~~li~~s~--~i~~d--~~i~FnpL~~~sekEkAei~~k~Aea~~~~~ 462 (765) T protein:vir:96 387 GFNATGEHETISYHEELESIQEHIFDPLLERHYLLLAKSE--SIDVQ--LEIVWNPVDSTTSQQQAELNNKKAATDEIYI 462 (765) T ss_pred cccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc--CCCCc--ceEEeCCCCCCCHHHHHHHHHHHHHHHHHHH Confidence 5556666667777888888776554444455777766532 12211 2445543332222 34567799999 Q ss_pred hCCcccccHHHHHHHHHHcC------CCCCCCcc---------cccCcccccCCCCCCccccccc----c-cCCCCCccc Q lcl|NC_016071. 443 AVGYLPKTPTVINKILEVGG------FDEEIPED---------MSTDELLKLLGQDTSRSGDGMT----A-GSNGNGTGK 502 (516) Q Consensus 443 ~~G~~~~~~~~~~~i~e~~G------lp~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~----~-~~~~~~~~~ 502 (516) +.|++.+ +.+|+++. +..-.+++ +......++..+.....+++.. . ...+.+++. T Consensus 463 ~~Gvis~-----dEvR~~L~~~~~~g~~~l~d~~~e~~~~~~pe~~~~~~~~~~~~~~~~~e~~~~~a~p~~~eg~~~~~ 537 (765) T protein:vir:96 463 NSGVVSP-----DEVRERLRDDPRSGYNRLTDDQAETEPGMSPENLAELEKAGAQSAKAKGEAERAEAQAGAVEGAGDPV 537 (765) T ss_pred hcCCCCH-----HHHHHHHhccccCCCCCCCccccccccCCCccccccccCCCcccccccCccccccCCCCccCCCCccc Confidence 9998775 34566553 21111110 0000001110000000000000 0 000000000 Q ss_pred c---------cccccchhhhhcC Q lcl|NC_016071. 503 I---------SSTRDNSVSNMDN 516 (516) Q Consensus 503 ~---------~~~~d~~~~~~~~ 516 (516) + +......+-..+| T Consensus 538 ~~~p~~~~p~~~~~~~~~g~~~~ 560 (765) T protein:vir:96 538 PAAPRGTKPLAKAAEEGAGEAAT 560 (765) T ss_pred ccCCcccCCccccccccCccccC Confidence 0 1111111111222 No 117 >protein:vir:104338 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398967;genbank:gi:81343951;genbank:GeneID:3778870 Probab=98.38 E-value=9.3e-07 Score=53.66 Aligned_cols=405 Identities=12% Similarity=0.024 Sum_probs=165.7 Q ss_pred ccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhhChHHHHHHHHHHHHHhcCCceeeeCCC Q lcl|NC_016071. 
11 VVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQDHTVSTALDTKYVFVTKAFNDFKVLYN 90 (516) Q Consensus 11 ~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~~v~~~~w~i~~~~~ 90 (516) +++...=...+-. |+.+- -..........+.++...++.+-+..++++.-.-.++.-|+|+ T Consensus 1 ~~~~D~~~n~~~g------g~~~~---------~~~~~~~~~~~~~l~a~Y~~~~l~~~~Vd~~aed~~r~g~~i~---- 61 (422) T protein:vir:10 1 MVKTDSYANIFLG------GSDGS---------EIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHID---- 61 (422) T ss_pred CccchhhHHHHcC------CCCCc---------cccCcccccCHHHHHHHHHhChhhHHHHhhhhHHHhcCCcccc---- Confidence 1111110000000 00000 0000111111233333334688899999988877777777764 Q ss_pred CCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHHHHHhhcceeeeEEEeecccc-cc---cccceeeccccccCchhccc Q lcl|NC_016071. 91 RDSKASKDAAEFVEYALKNLANQQTLRDIARSAATFNEYGFSIFEKVYRTESAP-SK---YAGYITIDKIAFRPQSSLSR 166 (516) Q Consensus 91 ~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~lda~~~G~S~~Eivw~~~~~~-~~---~~g~~~~~~l~~r~q~ti~~ 166 (516) +++.. +.+++.|+++.. |..+...+-.+..||++++=+.=+ ++.. .. +.|. ++.|.+.++..+. T Consensus 62 ~~~~~-----~~~~~~~~~l~~---~~~l~~a~~~~rl~G~a~i~i~v~-d~~~~~~Pl~~~g~--~~~l~v~d~~~i~- 129 (422) T protein:vir:10 62 GIDDE-----PAFWSRWDDLEM---TQNINDAWSWARLFGGAAIVAIVK-DNRALTSPVREGAE--LETVRVYDRTQVK- 129 (422) T ss_pred CCCHH-----HHHHHHHHHhhH---HHHHHHHHHhhccccceEEEEEec-CCCCccccccccCc--eeeEEeecccccc- Confidence 22221 124556666643 555555566799999998643321 2211 11 1221 2222222221111 Q ss_pred ccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeec------CcCCccccchh Q lcl|NC_016071. 167 SKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLG------GTESNPAGVSP 240 (516) Q Consensus 167 ~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~------~~~g~p~G~gL 240 (516) .. ...+++....+ ..|..+.+...+...+..|=+.+++.+... 
....+++|.|+ T Consensus 130 -------~~-----~~~~dp~s~~f--------g~P~~y~v~~~~~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~ 189 (422) T protein:vir:10 130 -------VQ-----TREENPRNARF--------GEPLTYRITTNESDMFYDVHYSRIHIIDGERIPNVMRRQNDGWGRSV 189 (422) T ss_pred -------ch-----hcccCcccccc--------CcceEEEEecCCCCcceeeccceeEEeCCCCchhhhcccCCcccchh Confidence 10 00111111111 122222222222223345556676666322 35677789997 Q ss_pred HH-HHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEeccCcc Q lcl|NC_016071. 241 LV-GCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILPSDMN 319 (516) Q Consensus 241 lr-~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~ 319 (516) |. .||....--.....-=+..+.|+. +.+++.... .... +....+.+.+.++..+... +....+.+++..+ T Consensus 190 l~~~~~~~i~~~~~~~~~~~~l~~~~~--~~v~~~~~l--~~~~-~~~~~~~~~~~r~~~~~~~-~~~~~~~~l~~~~-- 261 (422) T protein:vir:10 190 LSSDILDSIKDYTNCERLATQLLKRKQ--QAVWKAKGL--AELC-DDSEGFGAARLRLAQVDNN-SGVGQAIGIDAES-- 261 (422) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhc--cccccchhH--HHhc-CCccchHHHHHHHHHHHHh-cCCccceeEecCC-- Confidence 75 477644433333333344444543 334443211 1111 1122222233333333222 2222233333333 Q ss_pred cccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccc-cCCccchhhHHHHHHHHHHHHHHHHHHHH-HHHH Q lcl|NC_016071. 320 AQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINL-GNDGQGSYNLSESKQSIHGHFVQRDIDII-VEAF 397 (516) Q Consensus 320 i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts-~~~~~GS~Al~~vh~ev~~~~~~aDa~~i-~~~l 397 (516) .+++.++.+-+| ...+++..-.+||-+.--..--+ +.+.+|--|.|+--...+-+.+++-.... 
...| T Consensus 262 ------e~~e~~~~~lsg----l~~~~~~~~~~iaaa~~IP~t~L~G~s~~Glnatgd~d~~~yyd~i~~~Qe~~l~p~l 331 (422) T protein:vir:10 262 ------EEYSVLNSDIGG----IDAFLDKKFDRIVALSGIHEIILKNKNVGGVSSSQNTALETFHKLVDRKRNAELLPIL 331 (422) T ss_pred ------cceEEEecccCC----hHHHHHHHHHHHHhhhCCCeeeeccCCcccccccchHHHHHHHHHHHHHHHHHHHHHH Confidence 245555554443 45677777778886544333212 22222322445555667777777766543 3334 Q ss_pred HHHHHHHHHHhcCCcCCccccceEEecCcCchhH-HHHHHHHHHHHhCCcccccHHHHHHHHHH---cCCCCCCCccccc Q lcl|NC_016071. 398 NKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDM-EGFSKFVQRIGAVGYLPKTPTVINKILEV---GGFDEEIPEDMST 473 (516) Q Consensus 398 n~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl-~~~a~~~~~L~~~G~~~~~~~~~~~i~e~---~Glp~~~~~~~~~ 473 (516) +.|++.|+ ++-. +.-+-.|-...+..+..++ +..|++++++++.|++.++ +..+.+++. .|+.....+++ . T Consensus 332 -~~l~~~i~-~s~~-~~~~f~pL~~~sekekaei~~~~a~a~~~~~~~g~i~~~-e~r~~L~~~~~~~~~~~~~~~~~-~ 406 (422) T protein:vir:10 332 -EFLIPFIV-NAEE-WSVEFNPLAQESSKDKAEILEKNVNSIAALIAAGAMDID-EARDTLRTIAPEVKINDGSVETE-V 406 (422) T ss_pred -HHHHHHhc-ccCC-cEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHH-HHHHHhhhhcccccCCCCCCccc-c Confidence 33555443 2210 0000012222222222232 5567889999999988763 233344332 22221111111 1 Q ss_pred CcccccCCCCCCcccc Q lcl|NC_016071. 474 DELLKLLGQDTSRSGD 489 (516) Q Consensus 474 ~~~~~~~~~~~~~~~~ 489 (516) +.......+..-+..+ T Consensus 407 ~~~~~~~~~~~~~~~d 422 (422) T protein:vir:10 407 TISETSNDPLEVPTDD 422 (422) T ss_pred chhhcCCCCCCCCCCC Confidence 1111000111111111 No 118 >protein:vir:103971 Length: 376 # NCBI annotation: pbsx family phage portal protein # Family: family:all:196 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293752;genbank:gi:72537722;genbank:GeneID:3608098 Probab=98.37 E-value=9.9e-07 Score=53.50 Aligned_cols=334 Identities=12% Similarity=0.078 Sum_probs=138.7 Q ss_pred CCccccCcccc-cchhhh-cccCCCCcccccchHH-----HH--HHHHHHHhhcccc-cCCcccHHHHHHHhh-ChHHHH Q lcl|NC_016071. 
1 MSTRFAQPSEV-VKAGNE-NLAVSRLRTGELGSGA-----LS--QLRAESEVMKVEE-LRWPCFLATVEAMKQ-DHTVST 69 (516) Q Consensus 1 ~~~r~~~~~~~-~~~~~~-~p~~~~~~~~e~g~~~-----~~--~~~~~~~~~~~~~-lr~~~~~~~y~~m~~-D~~v~s 69 (516) ||+|..+.... +...+. .-+..+. ..+..+.| ++ .+..+.....+.+ -.-|-+..-.-++.+ .+|..+ T Consensus 26 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~f~fg~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~La~~~~~~~~h~s 104 (376) T protein:vir:10 26 MSKRRSRAPRTFAAAPNPSAGSAAPA-RAEVFTFDDPTPVMNRAEILDYVECWSNGEWFEPPVSFAGLAKSFRASTHHSS 104 (376) T ss_pred chhccCCCcccchhhhhHhhhccCcc-eeEEEEcCCceeccCcchhhhhhhhhhcCceecCCCCHHHHHHHHhhhHHhhh Confidence 99987663321 111110 0000000 01111111 00 0011111111111 011222222335544 899999 Q ss_pred HHHHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHHHHHhhcceeeeEEEeeccccccccc Q lcl|NC_016071. 70 ALDTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAATFNEYGFSIFEKVYRTESAPSKYAG 149 (516) Q Consensus 70 ~l~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~lda~~~G~S~~Eivw~~~~~~~~~~g 149 (516) +|..++.-+.+. +.|++.- +..++-.-+++.+.+|.+.+|++....|. T Consensus 105 ~l~~k~n~l~~~---~~Pnp~l-----------------------T~~~f~~~v~d~ll~Gnay~~~~rn~~G~------ 152 (376) T protein:vir:10 105 ALFFKANVLAST---FRPHRWL-----------------------SRHAFERWALDFLTFGNGYLERRRNMVGG------ 152 (376) T ss_pred hHHHHhHHHHhc---cCCCCCC-----------------------CHHHHHHHHHHHHhcCCeEEEEEECCCCC------ Confidence 999887766552 3333221 12223333445667899999998765331 Q ss_pred ceeeccccccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeec Q lcl|NC_016071. 150 YITIDKIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLG 229 (516) Q Consensus 150 ~~~~~~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~ 229 (516) +..|.+.++..++. 
..|+.....+ ....+...++.+.++.++.- T Consensus 153 ---~~~L~pl~~~~vr~------~~d~~~~~~~---------------------------~~~~~~~~~~~~eViHir~~ 196 (376) T protein:vir:10 153 ---TLRLEPALAKYVRR------KADFNGFVYV---------------------------NGWQERHEFEPDSVFQLVRP 196 (376) T ss_pred ---EEEEEEeCCcceEE------EeeCCeEEEE---------------------------EcCCeEEEEccccEEEecCC Confidence 23444555543321 1122111110 11123334566665544443 Q ss_pred CcCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccc Q lcl|NC_016071. 230 GTESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQ 309 (516) Q Consensus 230 ~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~ 309 (516) .-.+..||.+.+..+......-.....+-..|.+ +|+- |.+++......-++++. +.+++..++.....+ T Consensus 197 ~~~~~~yGls~~~~a~~si~l~~aa~~f~~~~f~-NGa~------pggIl~~~d~~l~~e~~---~~lr~~~~~~~G~~N 266 (376) T protein:vir:10 197 DINQEVYGLPEYLSSLHSAWLNESSTLFRRKYYE-NGSH------AGFILYMTDAAQKQDDV---DNMRDALKNAKGPGN 266 (376) T ss_pred CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHh-ccCC------CceEEEecCCCCCHHHH---HHHHHHHHHhcCccc Confidence 3356778998888877766654444444444443 4431 11111111111233333 344444444332122 Q ss_pred e---EEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccC--CccchhhHHHHHHHHHH- Q lcl|NC_016071. 310 A---YFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGN--DGQGSYNLSESKQSIHG- 383 (516) Q Consensus 310 a---~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~--~~~GS~Al~~vh~ev~~- 383 (516) + .+..|.|-+ ..++++..+-+....+|.+.-++-..+|+.+.--.---.+. +++|+++-.+....++. 
T Consensus 267 ~~~~~vl~~~g~~------~Gi~~~pls~~~~d~qf~e~k~~~~~eIa~af~VPp~llGi~~~~t~~~sn~eq~~~~f~~ 340 (376) T protein:vir:10 267 FRNVFMYAPGGKK------DGIQLIPVSEVAAKDEFFNIKNVTRDDLLAAHRVPPQLLGIVPSNSGGFGTPDTAARVFGR 340 (376) T ss_pred cCceeEecCCCCc------cceEEEEccCCHHHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHHHHHHHHH Confidence 1 222333311 12444444444455567777778888899887665433321 22334444433333222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHH Q lcl|NC_016071. 384 HFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEG 433 (516) Q Consensus 384 ~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~ 433 (516) .-+.--++.|++ +|..|. ..+.+|........|.++ T Consensus 341 ~~L~Pl~~~iee-ln~~L~-------------~~~~~F~~~~Llr~d~ka 376 (376) T protein:vir:10 341 NEIRPLQARFAE-LNDWLG-------------EEVVRFDDYEIPPAPVAA 376 (376) T ss_pred HHHHHHHHHHHH-HHhhcc-------------ccccccChhHhhcccccC Confidence 222223333332 332221 122222222222222221 No 119 >protein:vir:79207 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111036;genbank:gi:134288763;genbank:GeneID:4960726 Probab=98.37 E-value=1e-06 Score=53.41 Aligned_cols=332 Identities=12% Similarity=0.076 Sum_probs=138.0 Q ss_pred CCccccCcccccchh-----hh-ccc-CCCCcccc--cchHHHHHHHHHHHhhcccc-cCCcccHHHHHHHhh-ChHHHH Q lcl|NC_016071. 1 MSTRFAQPSEVVKAG-----NE-NLA-VSRLRTGE--LGSGALSQLRAESEVMKVEE-LRWPCFLATVEAMKQ-DHTVST 69 (516) Q Consensus 1 ~~~r~~~~~~~~~~~-----~~-~p~-~~~~~~~e--~g~~~~~~~~~~~~~~~~~~-lr~~~~~~~y~~m~~-D~~v~s 69 (516) ||+|.++........ +. 
.|+ ...+..++ .--.+ ..+..+.....+.+ -.-|-+..-.-++.+ .+|.++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~~-~~~~~~~~~~~~~~~~~pp~~~~~la~~~~~~~~h~~ 79 (351) T protein:vir:79 1 MSKRRSRAPRTFAAAPNPSAGSAAPARAEVFTFDDPTPVMNR-AEILDYVECWSNGEWFEPPVSFAGLAKSFRASTHHSS 79 (351) T ss_pred CCCCCCCCCCCCCCCCchhhhhcccceeEEEEcCCceeecCc-chhhhhhhhhhcCceecCCCCHHHHHHHHhhhHhhhh Confidence 999987643222111 00 111 11111111 00000 00111111111111 011223333344444 999999 Q ss_pred HHHHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHHHHHhhcceeeeEEEeeccccccccc Q lcl|NC_016071. 70 ALDTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAATFNEYGFSIFEKVYRTESAPSKYAG 149 (516) Q Consensus 70 ~l~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~lda~~~G~S~~Eivw~~~~~~~~~~g 149 (516) +|..++..+.+. +.|++.- +..++-.-+++.+.+|-+.+|++....| T Consensus 80 ~l~~k~n~l~~~---~~Pnp~~-----------------------t~~~f~~~v~d~ll~Gnay~~~~r~~~G------- 126 (351) T protein:vir:79 80 ALFFKANVLAST---FRPHRWL-----------------------SRHAFERWALDFLTFGNGYLERRRNMVG------- 126 (351) T ss_pred hhhhhhhHHhhc---ccCCCCC-----------------------CHHHHHHHHHHHHhcCCeEEEEEECCCC------- Confidence 999887777552 3333221 1112222344566789999999876433 Q ss_pred ceeeccccccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeec Q lcl|NC_016071. 150 YITIDKIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLG 229 (516) Q Consensus 150 ~~~~~~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~ 229 (516) . +..|.+.++.+++. ..++..... ....+....++.+..| |... 
T Consensus 127 ~--~~~L~~l~~~~v~~------~~~~~~~~~---------------------------~~~~g~~~~~~~~eIi-hir~ 170 (351) T protein:vir:79 127 G--TLRLEPALAKYVRR------KADFSGFVY---------------------------VNGWQERHEFEPDSVF-QLVR 170 (351) T ss_pred C--EEEEEEeCCcceee------eecCCeEEE---------------------------EecCceEEEEcCccEE-EeCC Confidence 1 23444555543331 111111110 0111223345555544 4444 Q ss_pred Cc-CCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhccc Q lcl|NC_016071. 230 GT-ESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGE 308 (516) Q Consensus 230 ~~-~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~ 308 (516) .. .+..||.+.+..+......-.....+-..|.. +|+- |.+++......-++++.+ .+++..++..... T Consensus 171 ~~~~~~~yGl~~~~~a~~si~l~~~a~~~~~~~f~-NGa~------pg~il~~~~~~ls~e~~~---~lk~~~~~~~G~~ 240 (351) T protein:vir:79 171 PDINQEVYGLPEYLSSLHSAWLNESSTLFRRKYYE-NGSH------AGFILYMTDAAQKQDDVD---NMRDALKNAKGPG 240 (351) T ss_pred CCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHh-ccCC------CceEEEecCCCCCHHHHH---HHHHHHHHhcCcc Confidence 43 46788999888888776665544444444443 4431 111111111112333333 3444444332222 Q ss_pred ce---EEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccC--CccchhhHHHHHHHHHH Q lcl|NC_016071. 309 QA---YFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGN--DGQGSYNLSESKQSIHG 383 (516) Q Consensus 309 ~a---~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~--~~~GS~Al~~vh~ev~~ 383 (516) .+ .+..|.|-. ..++++...-+....+|.+.-++-..+|+.+..-.---.+. +++|+++-.+....++. 
T Consensus 241 N~~~~~v~~~~g~~------~gi~~~pl~~~~~d~ef~e~k~~s~~eI~~a~~VPp~llGi~~~~t~~~~n~e~~~~~f~ 314 (351) T protein:vir:79 241 NFRNVFMYAPGGKK------DGIQLIPVSEVAAKDEFFNIKNVTRDDLLAAHRVPPQLLGIVPSNSGGFGTPDTAARVFG 314 (351) T ss_pred ccCceeEecCCCCc------cceEEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHHHHHHHH Confidence 22 222343321 12445444444445557777788888999887654432221 22233433333333222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcC--chhHHH Q lcl|NC_016071. 384 HFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQ--EVDMEG 433 (516) Q Consensus 384 ~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~--~~dl~~ 433 (516) ..+|. -++..|-.+|...+ ..+ ++|+..+ ..|.++ T Consensus 315 ----------~~~l~-Pl~~~ie~ln~~lg--~~~--~~F~~~~llr~d~~a 351 (351) T protein:vir:79 315 ----------RNEIR-PLQARFAELNDWLG--DEV--VTFDDYEIPPAPVAA 351 (351) T ss_pred ----------HHHHH-HHHHHHHHHHhhcC--cce--eeeChhhhccccccC Confidence 22221 12222223442111 222 3443221 222211 No 120 >protein:vir:267 Length: 348 # NCBI annotation: putative capsid portal protein # Family: family:all:196 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536647;genbank:gi:17975125;genbank:GeneID:929081 Probab=98.30 E-value=1.5e-06 Score=52.48 Aligned_cols=330 Identities=10% Similarity=0.023 Sum_probs=146.2 Q ss_pred CCccccCcccccchhhhc-cc-----CCCCcccccchHHHHHHHHHHHhhccccc---CCcccHHHHHHHhh-ChHHHHH Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNEN-LA-----VSRLRTGELGSGALSQLRAESEVMKVEEL---RWPCFLATVEAMKQ-DHTVSTA 70 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~-p~-----~~~~~~~e~g~~~~~~~~~~~~~~~~~~l---r~~~~~~~y~~m~~-D~~v~s~ 70 (516) |++-.........+.+.. =+ .|-+...++ ++-+.-+. 
+..- .-|-+..-..++.+ .+|.+++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~----~~~~~~~~----~~~~~~~epp~~~~~La~l~~~n~~h~~~ 72 (348) T protein:vir:26 1 MTEQLIHSHTTDGTESKSVYSFDPNPEPVDTNSWM----TRYCELFY----NDFDDYWEPPISLKGLAEIANANGYHGSL 72 (348) T ss_pred CCccccchhhccccCCceEEEecCCCeeecCcchH----HHHHHHHh----cCCCccccCCCCHHHHHHHHhhhhhhhhh Confidence 776554433333221110 00 111111111 11112221 1111 11333333455554 8999999 Q ss_pred HHHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHHHHHhhcceeeeEEEeecccccccccc Q lcl|NC_016071. 71 LDTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAATFNEYGFSIFEKVYRTESAPSKYAGY 150 (516) Q Consensus 71 l~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~lda~~~G~S~~Eivw~~~~~~~~~~g~ 150 (516) |..++.-+.+. +.|.+.- +..++-.-+++-+.+|-+.+|++....+. T Consensus 73 i~~k~N~l~~~---~~Pn~~~-----------------------t~~~f~~~~~d~ll~Gnay~~~~rn~~G~------- 119 (348) T protein:vir:26 73 LKARANYVAGR---FMNGGGL-----------------------PMYKMNSACWDYFGLGMSAFVKIRSYLKN------- 119 (348) T ss_pred HhhhhhHHhhc---ccCCCCC-----------------------CHHHHHHHHHHHHhcCCeEEEEEEcCCCc------- Confidence 99888877652 3333221 11223233445667899999998654331 Q ss_pred eeeccccccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecC Q lcl|NC_016071. 151 ITIDKIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGG 230 (516) Q Consensus 151 ~~~~~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~ 230 (516) +..|.+.|+.+++ ...||+... ....++...++.+.++.++ .. T Consensus 120 --~~~L~~l~~~~v~------~~~d~~~~~----------------------------~~~~g~~~~f~~~dIiHir-~~ 162 (348) T protein:vir:26 120 --VIALEPLPMVHMR------KRKNGDFVQ----------------------------LLRNNEQKVFKAKDVIFIP-QY 162 (348) T ss_pred --EEEEEEecCceeE------eeecCcEEE----------------------------EEecCeEEEEcCccEEEEc-CC Confidence 2234444443332 122332110 0112233456666655444 43 Q ss_pred c-CCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccc-cceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhccc Q lcl|NC_016071. 
231 T-ESNPAGVSPLVGCYRAFREKILIENLETIGASKDLG-GIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGE 308 (516) Q Consensus 231 ~-~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~-~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~ 308 (516) . .+..||.+.+..+......-.....+-..|. ++|+ +=-+++.+ ...-++++. +++++..+....+. T Consensus 163 ~~~~~~~Gls~~~~a~~si~l~~~a~~~~~~~f-~NGa~pg~Il~~~-------~~~ls~e~~---~~lk~~~~~~~G~~ 231 (348) T protein:vir:26 163 DPQQQIYGLPDYLGSIQSSLLNRDATLFRRRYY-LNGAHMGFIFYAT-------DPNLSEADE---KALKEKIASSKGIG 231 (348) T ss_pred CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHH-hccCCCceEEEec-------CCCCCHHHH---HHHHHHHHHhcCcc Confidence 3 4567899988888776665544444444444 3443 11122211 111233333 34444444433222 Q ss_pred ce---EEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhccccccc--CCccchhhHHHHHHHHH- Q lcl|NC_016071. 309 QA---YFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLG--NDGQGSYNLSESKQSIH- 382 (516) Q Consensus 309 ~a---~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~--~~~~GS~Al~~vh~ev~- 382 (516) .+ .+.+|.|-+ ..+++...+-+....+|.+.-++-..+|+.+.--.---.+ .+++|+++-.+....++ T Consensus 232 n~~~~~vl~~~g~~------~Gi~~~pis~~~~d~qf~e~k~~t~~dIa~af~VPp~llGi~~~~~~~~sn~e~~~~~f~ 305 (348) T protein:vir:26 232 NFRSMFVNIPNGKE------KGIQLIPVGDIATKDEFERIKNITAQDIFVGHRFPAGMGGMLPQQGANVPDPLKVSQVYD 305 (348) T ss_pred cccceeEEcCCCCc------cceeEEEccCChhHHHHHHHHHhhHHHHHHHhCCCHHHccccCCCCCccccHHHHHHHHH Confidence 22 223343321 1244555444444445666666667788887765442222 12234444333333332 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHH Q lcl|NC_016071. 383 GHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFV 438 (516) Q Consensus 383 ~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~ 438 (516) ..-+.--++.|++.||+.+. .+... +|.|+..-..+... 
+.++ T Consensus 306 ~~~l~P~~~~ie~~ln~~l~----------~~~~~--~~~fdl~~~~e~~~-~~a~ 348 (348) T protein:vir:26 306 FYEVIPVCKRFMDAVNNDPE----------IPDNL--KLKFNLNPGVESAN-GSAV 348 (348) T ss_pred HHHHHHHHHHHHHHHhhhhC----------CCCcc--EEEEecCcccccch-hhcC Confidence 23355566666666765321 12222 45554332222222 2233 No 121 >protein:vir:78191 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111155;genbank:gi:134288732;genbank:GeneID:4960651 Probab=98.29 E-value=1.6e-06 Score=52.36 Aligned_cols=331 Identities=12% Similarity=0.070 Sum_probs=137.3 Q ss_pred CCccccCcccccchh-----hh-ccc-CCCCcccc--cchHHHHHHHHHHHhhcccc-cCCcccHHHHHHHh-hChHHHH Q lcl|NC_016071. 1 MSTRFAQPSEVVKAG-----NE-NLA-VSRLRTGE--LGSGALSQLRAESEVMKVEE-LRWPCFLATVEAMK-QDHTVST 69 (516) Q Consensus 1 ~~~r~~~~~~~~~~~-----~~-~p~-~~~~~~~e--~g~~~~~~~~~~~~~~~~~~-lr~~~~~~~y~~m~-~D~~v~s 69 (516) ||+|.++........ +. .|+ ...+..++ .--.+ ..+..+.....+.+ -.-|-+..-.-++. ..+|.++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~~-~~~~~~~~~~~~~~~~~pp~~~~~la~~~~~~~~h~~ 79 (351) T protein:vir:78 1 MSKRRSRAPRTFAAAPNPSAGSAAPARAEVFTFDDPTPVMNR-AEILDYVECWSNGEWFEPPVSFAGLAKSFRASTHHSS 79 (351) T ss_pred CCCCCCCCCCCCCCCCchhhhhcccceeEEEEcCCceeecCc-chhhhhhhhhccCceecCCCCHHHHHHHHhhhHhhhh Confidence 999987643222111 00 111 11111111 00000 00111111111111 01122333334444 4899999 Q ss_pred HHHHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHHHHHhhcceeeeEEEeeccccccccc Q lcl|NC_016071. 70 ALDTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAATFNEYGFSIFEKVYRTESAPSKYAG 149 (516) Q Consensus 70 ~l~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~lda~~~G~S~~Eivw~~~~~~~~~~g 149 (516) +|..++..+.+. +.|.+.- +..++-.-+++.+.+|-+.+|++-...| . 
T Consensus 80 ~l~~k~n~l~~~---~~Pn~~~-----------------------t~~~f~~~~~d~ll~Gnay~~~~rn~~G---~--- 127 (351) T protein:vir:78 80 ALFFKANVLAST---FRPHRWL-----------------------SRHAFERWALDFLTFGNGYLERRRNMVG---G--- 127 (351) T ss_pred hhhhhhhHHhhc---ccCCCCC-----------------------CHHHHHHHHHHHHhcCCeEEEEEECCCC---C--- Confidence 998877776552 3333221 1122333344666789999999865432 1 Q ss_pred ceeeccccccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeec Q lcl|NC_016071. 150 YITIDKIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLG 229 (516) Q Consensus 150 ~~~~~~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~ 229 (516) +..|.+.++..++. ..+.++ .. .....+....+|.+.++ |... T Consensus 128 ---~~~L~pl~~~~v~~----~~~~~~--~~---------------------------~~~~~~~~~~~~~~eVi-hir~ 170 (351) T protein:vir:78 128 ---TLRLEPALAKYVRR----KADFSG--FV---------------------------YVNGWQERHEFAPDSVF-QLVR 170 (351) T ss_pred ---EEEEEEecCcceEE----eeeCCe--EE---------------------------EEecCCeEEEEccccEE-EEcC Confidence 22344444433321 111111 10 01112233445666655 4444 Q ss_pred Cc-CCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhccc Q lcl|NC_016071. 230 GT-ESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGE 308 (516) Q Consensus 230 ~~-~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~ 308 (516) .. .+..||.+.+..+......-.....+-..|. ++|+- |.+++......-++++. +.+++..++..... T Consensus 171 ~~~~~~~yGl~~~~~a~~si~l~~~a~~~~~~~f-~NGa~------pggIl~~~~~~ls~e~~---~~lr~~~~~~~G~~ 240 (351) T protein:vir:78 171 PDINQEVYGLPEYLSSLHSAWLNESSTLFRRKYY-ENGSH------AGFILYMTDAAQKQDDV---DNMRDALKNAKGPG 240 (351) T ss_pred CCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHH-hccCC------CceEEEecCCCCCHHHH---HHHHHHHHHhcCcc Confidence 44 4678999988888876665444444433443 34431 11122111111233333 33444444433322 Q ss_pred ceE---EEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccC--CccchhhHHHHHHHHHH Q lcl|NC_016071. 
309 QAY---FILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGN--DGQGSYNLSESKQSIHG 383 (516) Q Consensus 309 ~a~---~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~--~~~GS~Al~~vh~ev~~ 383 (516) .++ +..|.|.+ ..++++..+-+....+|.+.-++-..+|+.+.--.---.+- +++|+++-.+....++. T Consensus 241 N~~~~~v~~~~g~~------~g~k~~pls~~~~d~qf~e~k~~~~~eIa~a~~VPp~llGi~~~~t~~~sn~e~~~~~f~ 314 (351) T protein:vir:78 241 NFRNVFMYAPGGKK------DGIQLIPVSEVAAKDEFFNIKNVTRDDLLAAHRVPPQLLGIVPSNSGGFGTPDTAARVFG 314 (351) T ss_pred cccceeeecCCCCc------cceeEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHHHHHHHH Confidence 322 22243321 12444444434444457777777788899887665433321 22233433332222222 Q ss_pred -HHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcC--chhHHH Q lcl|NC_016071. 384 -HFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQ--EVDMEG 433 (516) Q Consensus 384 -~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~--~~dl~~ 433 (516) .-+.--++.|+ .+|...+ ..+ |+|+..+ ..|.++ T Consensus 315 ~~~l~P~~~~ie------------e~n~~l~--~~~--~~F~~~~Llr~d~ka 351 (351) T protein:vir:78 315 RNEIRPLQARFA------------ELNDWLG--DEV--VRFDDYEIPPAPVAA 351 (351) T ss_pred HHHHHHHHHHHH------------HHHhhcC--ccc--eecChhhhccccccC Confidence 11222233332 2331111 122 4443322 222221 No 122 >protein:vir:4698 Length: 251 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061630;genbank:gi:9635717;genbank:GeneID:1262980 Probab=98.26 E-value=8.3e-07 Score=53.93 Aligned_cols=245 Identities=9% Similarity=-0.074 Sum_probs=128.5 Q ss_pred CC---ccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCC--cccHHHHHHHhhChHHHHHHHHHH Q lcl|NC_016071. 1 MS---TRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRW--PCFLATVEAMKQDHTVSTALDTKY 75 (516) Q Consensus 1 ~~---~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~--~~~~~~y~~m~~D~~v~s~l~~Rk 75 (516) |. +++++.......+. .. +.. ..+.+.+ ...+ .-+..++-+.|.+|++.+. 
T Consensus 1 MglF~~~~~r~~~~~~~~~-------------~~--------~~~--~~~~~~~~~~~~v-~~~~al~~~~v~~~i~~ia 56 (251) T protein:vir:46 1 MGIFYKNEKRDLQYNEDDL-------------QM--------MVQ--TLPSFQGTKLRQY-KDIEAIRHSDIFTAVMMIA 56 (251) T ss_pred CCccccccccccCCCccch-------------hh--------hhh--hhccccCcCccee-chhhhhccHHHHHHHHHHH Confidence 32 22222111100000 00 000 0011111 1111 1234456788999999999 Q ss_pred HHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceeec Q lcl|NC_016071. 76 VFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITID 154 (516) Q Consensus 76 ~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~~ 154 (516) ..|.+++|++.-. .+...+..+..++. .+-+...++.+++..+. +.+.+|-+..+++.... |. +. T Consensus 57 ~~iA~lp~~~~~~--~~~~~~~~~~~ll~---~~Pn~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~-------G~--~~ 122 (251) T protein:vir:46 57 SDLARMPIRVTVN--GQINYSDRIVNLLN---TRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKT-------GE--PM 122 (251) T ss_pred HhHhhCceEEeeC--ccccccchHHHHHh---ccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCC-------Cc--EE Confidence 9999999987532 22222222333332 23344566778887766 57889999999987543 22 33 Q ss_pred cccccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCc Q lcl|NC_016071. 155 KIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESN 234 (516) Q Consensus 155 ~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~ 234 (516) .|.+.++..++ ...+++|+........ .....+....+|.+.+|.+++.+. +. T Consensus 123 ~L~~i~~~~v~----v~~~~~g~~~~~~~~~----------------------~~~~~g~~~~~~~~diiH~r~~~~-dg 175 (251) T protein:vir:46 123 NLTFRKTSEIE----LKSDARGRLYYFHQRI----------------------DSNGNNIERNVKFEDMLDIKFYSL-DG 175 (251) T ss_pred EEEEECCceEE----EEECCCCcEEEEEEEe----------------------ccCCcceeEEECCccEEEecCcCC-CC Confidence 45555554443 2345555443211110 001122345678888777776544 45 Q ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEe Q lcl|NC_016071. 
235 PAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFIL 314 (516) Q Consensus 235 p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~ii 314 (516) .+|.|++..+....-......++-..+...-+.+--+++.| .+-.++ +..+.+++.......|.+-++.+ T Consensus 176 ~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~--------~~l~~~--e~~~~~~~~~~~~~~g~~n~g~~ 245 (251) T protein:vir:46 176 INGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMK--------GVLDNK--KARDRAREEFPKVLVELNKLGKL 245 (251) T ss_pred eeecCHHHHHHHHHHHHHHHHHHHHHHHHccCCCcEEEEeC--------CCCCCH--HHHHHHHHHHHHHhcCccccccc Confidence 79999999998877777777777777776544443333332 111111 22334455455555665555668 Q ss_pred ccCccc Q lcl|NC_016071. 315 PSDMNA 320 (516) Q Consensus 315 P~g~~i 320 (516) +.||+- T Consensus 246 ~~gm~~ 251 (251) T protein:vir:46 246 SYSMNQ 251 (251) T ss_pred ccccCC Confidence 888862 No 123 >protein:vir:3420 Length: 533 # NCBI annotation: capsid component # Family: family:all:47 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040583;genbank:gi:9626247;genbank:GeneID:2703526 Probab=98.26 E-value=1.9e-06 Score=51.91 Aligned_cols=468 Identities=12% Similarity=0.058 Sum_probs=183.8 Q ss_pred CCccccCcccccchhhhcccCCCCc---ccccc-hHHHHHHHHH------HHhhcccccCCcccHHHHHHHhh-ChHHHH Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLR---TGELG-SGALSQLRAE------SEVMKVEELRWPCFLATVEAMKQ-DHTVST 69 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~---~~e~g-~~~~~~~~~~------~~~~~~~~lr~~~~~~~y~~m~~-D~~v~s 69 (516) |.+. +.....+-.. 
..+.+ ..+-| +..-....+| .+.+-.++ +.....--++|.+ ++++.+ T Consensus 1 ~~~p----~~~~~~~~~~--~~~~~~~~~y~~~a~~~~~~~~~w~p~~~s~~~~~~~~--~~~lr~RaRdl~rNn~~a~~ 72 (533) T protein:vir:34 1 MKTP----TIPTLLGPDG--MTSLREYAGYHGGGSGFGGQLRSWNPPSESVDAALLPN--FTRGNARADDLVRNNGYAAN 72 (533) T ss_pred CCCc----hhhhhhcccc--cchHHHHHhhhhccCCCCCcccccccCCCCHHHHHHHH--HHHHHHHHHHHHhcChHHHH Confidence 3322 1111111000 00000 00000 0000000011 00010111 0011112245544 899999 Q ss_pred HHHHHHHHHhcCCceeeeCCC-----CCChhhHHHHHHHHHHHhh----------ccCcCCHHHHHHHHHHH-Hhhccee Q lcl|NC_016071. 70 ALDTKYVFVTKAFNDFKVLYN-----RDSKASKDAAEFVEYALKN----------LANQQTLRDIARSAATF-NEYGFSI 133 (516) Q Consensus 70 ~l~~Rk~~v~~~~w~i~~~~~-----~d~~~~~~~a~~v~~~l~~----------~~~~~~~~~~l~~~lda-~~~G~S~ 133 (516) +++.....|-+.-+.+.+.+. -+++.+++..+.|+..|.. .....+|..+...++.+ +.-|=++ T Consensus 73 av~~~~~nvVG~Gi~~~~~p~~~~lg~~~~~~~~~~~~ie~~w~~w~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~f 152 (533) T protein:vir:34 73 AIQLHQDHIVGSFFRLSHRPSWRYLGIGEEEARAFSREVEAAWKEFAEDDCCCIDVERKRTFTMMIREGVAMHAFNGELF 152 (533) T ss_pred HHHHHHHHhhCCCceeeeccchhhcCCChhHHHHHHHHHHHHHHHhhcCccceeccccccCHHHHHHHHHHHHHhCCceE Confidence 999999999998777665432 2345667777778777753 34456788888888865 5668888 Q ss_pred eeEEEeecccccccccceeeccccccCchhcccccceeecCCCceeeecccccccccccccccccc--cccccccccccc Q lcl|NC_016071. 134 FEKVYRTESAPSKYAGYITIDKIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQI--SSAMSLVTNLTS 211 (516) Q Consensus 134 ~Eivw~~~~~~~~~~g~~~~~~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~--~~~~~~~~~~~~ 211 (516) +-+.|....+.. + .-+|....+.-|.-+ ..-.+.++++-+++-+... .-.+|+.. +++-........ 
T Consensus 153 ~~~~~~~~~g~~-----~-~~~lq~ie~d~l~~~--~~~~~~~~i~~GIe~d~~G---r~~aY~i~~~~~~~~~~~~~~~ 221 (533) T protein:vir:34 153 VQATWDTSSSRL-----F-RTQFRMVSPKRISNP--NNTGDSRNCRAGVQINDSG---AALGYYVSEDGYPGWMPQKWTW 221 (533) T ss_pred EEeeeccCCCCc-----c-ceEEEEechhhcCCC--CCCCCCCceEeeeEECCCC---CeEEEEEeecCCCCccccccce Confidence 888888754321 1 112233333333211 1111122222222211100 00001100 001000000000 Q ss_pred CCCccccccccEEEEeecC-cCCccccchhHHHHHHHHHHHHHHHHHHHHHH--hhccccceeeeec-----cccccccc Q lcl|NC_016071. 212 SADEVFIPINKLMVMSLGG-TESNPAGVSPLVGCYRAFREKILIENLETIGA--SKDLGGIIELKIP-----SQILNKAA 283 (516) Q Consensus 212 ~~~~~~iP~~k~i~~~~~~-~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~--er~g~~~~v~~~p-----p~~~~k~~ 283 (516) ......+|.. -|+|.+.. +.|..-|.+.|.++.....-........++.. .--.+.|..-..+ ....+... T Consensus 222 ~~~~~~v~a~-~VlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~ 300 (533) T protein:vir:34 222 IPRELPGGRA-SFIHVFEPVEDGQTRGANVFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSAMDFILGANS 300 (533) T ss_pred eeeeeccChh-HeeeeccccCCCcccCCchHHHHHHHHHHHHHHHHHHHHHHHHhhhheeeeecCCCcccccccccCCCc Confidence 1111223333 46777664 58889999999988775544333333222211 1111111110000 00000000 Q ss_pred CCCCHHHHHHHHHHHHHHH------HhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHH Q lcl|NC_016071. 284 IDPKSPESEMVQGLMADAA------NAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRF 357 (516) Q Consensus 284 ~~~~~~~~~~l~~l~~~~~------~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~i 357 (516) . +....+........ .+..+.-....++.|. 
+|++.+.+..+ .+|..|.+..-+.|+.++ T Consensus 301 ~----~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe--------~i~~~~~~~p~--~~~~~f~~~~lr~iAagl 366 (533) T protein:vir:34 301 Q----EQRERLTGWIGEIAAYYAAAPVRLGGAKVPHLMPGD--------SLNLQTAQDTD--NGYSVFEQSLLRYIAAGL 366 (533) T ss_pred c----cccccccccchhhhhccCcceeeccCceeeecCCCC--------eeeecCCCCCC--CCHHHHHHHHHHHHHhhc Confidence 0 00000100000000 0011111223345554 56665554433 357888888889998887 Q ss_pred hc--ccccccCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---hcCCc-CCcc---c-------c--c Q lcl|NC_016071. 358 GA--GFINLGNDGQGSYNLSESKQSIHGHFVQRDIDIIVEAFNKNLIPQLLA---LNDIR-LSDE---D-------M--P 419 (516) Q Consensus 358 LG--qtLts~~~~~GS~Al~~vh~ev~~~~~~aDa~~i~~~ln~~li~~lv~---lN~~~-~~~~---~-------~--P 419 (516) .- +.||.+-+ ..||+.+..-..-+....+.....+...+-+-+...+++ +|+.- .|.. . + . T Consensus 367 Gi~ye~lt~D~s-~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~ail~G~i~~p~~~~~~~~~~~~~~~~~ 445 (533) T protein:vir:34 367 GVSYEQLSRNYA-QMSYSTARASANESWAYFMGRRKFVASRQASQMFLCWLEEAIVRRVVTLPSKARFSFQEARSAWGNC 445 (533) T ss_pred CCCHHHHhhhcc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCCCccCCCchhhHHhhhce Confidence 43 22554322 356665544433333333434443433333333333332 34211 1110 0 0 1 Q ss_pred eEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccccCc---ccc-cCCCCCCcccccccccC Q lcl|NC_016071. 420 KLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMSTDE---LLK-LLGQDTSRSGDGMTAGS 495 (516) Q Consensus 420 ~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~~~---~~~-~~~~~~~~~~~~~~~~~ 495 (516) .+....-...|..+-+++....+++|+... ++.+++ .|.....--++-... ..+ -.+.+..+... .. 
T Consensus 446 ~w~~p~~~~iDP~Ke~~a~~~~i~~G~~s~----~~~~a~-~G~D~~ev~~q~a~e~~~~~~~gl~~~~~~~~~----~~ 516 (533) T protein:vir:34 446 DWIGSGRMAIDGLKEVQEAVMLIEAGLSTY----EKECAK-RGDDYQEIFAQQVRETMERRAAGLKPPAWAAAA----FE 516 (533) T ss_pred eeccCCccccChHHHHHHHHHHHHcCCCCH----HHHHHH-cCCCHHHHHHHHHHHHHHHHhcCCCCCCCCCcC----cc Confidence 222233444566666788888999998664 233333 355421110000000 000 00101111000 00 Q ss_pred CCCCcccccccccchhh Q lcl|NC_016071. 496 NGNGTGKISSTRDNSVS 512 (516) Q Consensus 496 ~~~~~~~~~~~~d~~~~ 512 (516) ++........+.|+.+| T Consensus 517 s~~~~~~~~~~~~~~~~ 533 (533) T protein:vir:34 517 SGLRQSTEEEKSDSRAA 533 (533) T ss_pred CCCCCCCCCCcccCCCC Confidence 01111111111222222 No 124 >protein:vir:3743 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043484;genbank:gi:9628619;genbank:GeneID:1261113 Probab=98.20 E-value=2.8e-06 Score=51.06 Aligned_cols=335 Identities=11% Similarity=0.019 Sum_probs=150.1 Q ss_pred CCccccCcccccchhhhcccCCCCcccccc-hHHHHHHHHHHHhhccccc-CCcccHHHHHHHh-hChHHHHHHHHHHHH Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELG-SGALSQLRAESEVMKVEEL-RWPCFLATVEAMK-QDHTVSTALDTKYVF 77 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g-~~~~~~~~~~~~~~~~~~l-r~~~~~~~y~~m~-~D~~v~s~l~~Rk~~ 77 (516) |+++.++..+.....+.. ..-.+..+|.- +..+ .+.+.- ...+.+. .-|-+..-..++. ..+|-+|+|..++.- T Consensus 1 ~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~-~y~~~~-~~~~~~~~epp~~~~~la~~~~~~~~h~~~i~~k~n~ 77 (345) T protein:vir:37 1 MKTNVKTDNKKGIVIAPI-NDRTFSLSEITASPAL-DYVGIG-FDENYNCYLPPVNRHALAKLPHQNAQHGGILHSRANM 77 (345) T ss_pred CCccccccchhhhcCCCc-eEEEeecCCcccchhh-ccccee-eecCCccccCCCCHHHHHHHhhcchhhcchhhhhhhH Confidence 999998887655443321 11112222111 1111 111110 0011110 1122222233444 489999999888887 Q ss_pred HhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHHHHHhhcceeeeEEEeecccccccccceeecccc Q lcl|NC_016071. 
78 VTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAATFNEYGFSIFEKVYRTESAPSKYAGYITIDKIA 157 (516) Q Consensus 78 v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~lda~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l~ 157 (516) +.+. +.|++.- +..++..-+++-+.+|-+.+|++....+. +..|. T Consensus 78 l~~~---~~Pn~~~-----------------------t~~~f~~~v~d~ll~Gnay~~i~rn~~G~---------~~~L~ 122 (345) T protein:vir:37 78 VSAT---YEGGKAL-----------------------SKMEMRALCLNLIQFGDVGLLKVRNGFGQ---------VVRLV 122 (345) T ss_pred Hhhc---cCCCCCC-----------------------CHHHHHHHHHHHHhcCCeEEEEEECCCCC---------EEEEE Confidence 7652 3443321 12223233445566899999999765332 22444 Q ss_pred ccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCcccc Q lcl|NC_016071. 158 FRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPAG 237 (516) Q Consensus 158 ~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~G 237 (516) +.++..++ ...|+......+.. .....++...+|.+.+|.++.-.-.+..|| T Consensus 123 pl~~~~vr------~~~d~~~~~~~~~~----------------------~~~~~g~~~~~~~~eViHir~~~~~~~~~G 174 (345) T protein:vir:37 123 PLSSLYLR------VHKDGGYSYLMKKS----------------------LYDTAQEIYRYDAKDIIFIKLYDPMQQVYG 174 (345) T ss_pred EecCceeE------EeecCCeeEEEeee----------------------eeccCceEEEEccccEEEEcCCCCCCCccc Confidence 44544332 23333322211110 001112334566666555543333456789 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHhhccc-cceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccc---eEEE Q lcl|NC_016071. 238 VSPLVGCYRAFREKILIENLETIGASKDLG-GIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQ---AYFI 313 (516) Q Consensus 238 ~gLlr~~~~~~~fK~~~~~~w~~~~er~g~-~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~---a~~i 313 (516) .+.+..+......-.....+-..|.. +|+ +=-+++.+ . ..-++++. +++++..++...|.. ..+. 
T Consensus 175 l~~~~~a~~si~l~~~a~~~~~~~f~-NGa~~~~Il~~t------~-~~l~~e~~---~~lk~~~~~~~g~~n~~~~~i~ 243 (345) T protein:vir:37 175 SPDYVGGIQSALLNSDATVFRRRYFS-NGAHMGFILYST------D-PDLTEEME---EEIARKISESKGVGNFRSMFVN 243 (345) T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHh-ccCCcceEEEeC------C-CCCCHHHH---HHHHHHHHHhcCccccCceeEe Confidence 98877776665554444444444443 443 11112211 1 11223333 334444444433322 2233 Q ss_pred eccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhccccccc--CCccchhhHHHHHHHHHH-HHHHHHH Q lcl|NC_016071. 314 LPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLG--NDGQGSYNLSESKQSIHG-HFVQRDI 390 (516) Q Consensus 314 iP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~--~~~~GS~Al~~vh~ev~~-~~~~aDa 390 (516) +|.|-. ..++++..+-+....+|.+.-++-..+|+.+.--.---.+ .+++|+++-.+-...++. .-+.--+ T Consensus 244 ~~~g~~------~G~~~~pl~~~~~d~qf~e~k~~~~~dI~~a~~VPp~liGi~~~~t~~~s~~e~~~~~f~~~~l~P~~ 317 (345) T protein:vir:37 244 IAGGHP------DGLKVIPIGDTGTKDEFANIKNISAQDVLTAHRFPAGLSGIIPTNTGGLGDPLKYREVYHYDEVMPLQ 317 (345) T ss_pred cCCCCc------cceeEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHhccccCCCCCcccHHHHHHHHHHHHHHHHH Confidence 344411 1244444444444555777777777889888765442222 122344544443443332 2244556 Q ss_pred HHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHH Q lcl|NC_016071. 391 DIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEG 433 (516) Q Consensus 391 ~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~ 433 (516) +.|++.+|+. .. .++ -..+.|+. .++.. 
T Consensus 318 ~~ie~~ln~~-----~e-----~~~--~~~i~F~~---~~l~k 345 (345) T protein:vir:37 318 EIIAETINQD-----PE-----IKN--LLKIKFRE---QNFAK 345 (345) T ss_pred HHHHHHhhhh-----hc-----cCC--cceEEECc---hhhcC Confidence 6666666641 11 111 12466653 22222 No 125 >protein:vir:2013 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046757;genbank:gi:9630328;genbank:GeneID:1261529 Probab=98.17 E-value=3.3e-06 Score=50.66 Aligned_cols=327 Identities=11% Similarity=0.071 Sum_probs=139.5 Q ss_pred CCccccCcccccchhhh--cccCCCCcccc--c---chHHHHHHHHHHHhhcccc-cCCcccHHHHHHHhh-ChHHHHHH Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNE--NLAVSRLRTGE--L---GSGALSQLRAESEVMKVEE-LRWPCFLATVEAMKQ-DHTVSTAL 71 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~--~p~~~~~~~~e--~---g~~~~~~~~~~~~~~~~~~-lr~~~~~~~y~~m~~-D~~v~s~l 71 (516) ||+|.+........... .+.+..+..++ . ++.-++-+.-+ .+.+ -.-|-+..-..++.+ .+|.+|+| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~p~~v~~~~~~~~~~~~~----~~~~~~~pp~~~~~la~~~~a~~~h~~~i 76 (344) T protein:vir:20 1 MSKKKGKTPQPAAKTMTASGPKMEAFTFGEPVPVLDRRDILDYVECI----SNGRWYEPPVSFTGLAKSLRAAVHHSSPI 76 (344) T ss_pred CCcccCCCCcchhhhhhccCCceEEEEcCCceEecCcchhhhhhhhh----hcCceecCCCCHHHHHHHHhhhhhhCccc Confidence 99998765332222211 11122222221 1 01001111111 1111 011222333344444 88999998 Q ss_pred HHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHHHHHhhcceeeeEEEeecccccccccce Q lcl|NC_016071. 72 DTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAATFNEYGFSIFEKVYRTESAPSKYAGYI 151 (516) Q Consensus 72 ~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~lda~~~G~S~~Eivw~~~~~~~~~~g~~ 151 (516) ..++..+.+. +.|.+.-. . .+ |..+ +++-+.+|-+.+|++....+ . 
T Consensus 77 ~~k~n~l~~~---~~Pn~~lt----~--~~--------------f~~~---~~d~ll~Gnay~~i~rn~~G-------~- 122 (344) T protein:vir:20 77 YVKRNILAST---FIPHPWLS----Q--QD--------------FSRF---VLDFLVFGNAFLEKRYSTTG-------K- 122 (344) T ss_pred eehhhhHHHh---ccCCCCCC----H--HH--------------HHHH---HHHHHhcCCeEEEEEECCCC-------c- Confidence 8777666552 33332211 0 11 2223 34556689999999875432 2 Q ss_pred eeccccccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCc Q lcl|NC_016071. 152 TIDKIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGT 231 (516) Q Consensus 152 ~~~~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~ 231 (516) +..|.+.+..+++ +..+++....+ ...+..+.+|.+.++..+.-.- T Consensus 123 -~~~L~pl~~~~vr------~~~~~~~~~~~---------------------------~~~~~~~~~~~~eIiHir~~~~ 168 (344) T protein:vir:20 123 -VIRLETSPAKYTR------RGVEEDVYWWV---------------------------PSFNEPTAFAPGSVFHLLEPDI 168 (344) T ss_pred -EEEEEEcCCceeE------eeecCCEEEEE---------------------------ccCCeEEEEcCccEEEeCCCCC Confidence 3345555544332 22222221111 1122334456666544443333 Q ss_pred CCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccc-cceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccce Q lcl|NC_016071. 232 ESNPAGVSPLVGCYRAFREKILIENLETIGASKDLG-GIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQA 310 (516) Q Consensus 232 ~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~-~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a 310 (516) .+..||.+.+..+......-.....+-..|.. +|+ +=-+++.+ ...-++++. +++++..+... |..+ T Consensus 169 ~~~~yGls~~~~a~~si~l~~~a~~~~~~~f~-NGa~p~~Il~~~-------d~~l~~e~~---~~ik~~~~~~~-g~~n 236 (344) T protein:vir:20 169 NQELYGLPEYLSALNSAWLNESATLFRRKYYE-NGAHAGYIMYVT-------DAVQDRNDI---EMLRENMVKSK-GRNN 236 (344) T ss_pred CCCcccccHHHHHHHHHHHHHHHHHHHHHHHh-ccCCCceEEEec-------CcCCCHHHH---HHHHHHHHHhc-CCCC Confidence 46679999888887766665555555455543 332 11122211 111233333 34444444433 2222 Q ss_pred --EEEe--ccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhccccccc--CCccchhhHHHHHHHHHH- Q lcl|NC_016071. 
311 --YFIL--PSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLG--NDGQGSYNLSESKQSIHG- 383 (516) Q Consensus 311 --~~ii--P~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~--~~~~GS~Al~~vh~ev~~- 383 (516) .++| |.|- ...++++..+-+....+|.+.-++-..+|+.+.--.---.+ .+++|+++-.+....++. T Consensus 237 ~r~l~l~~p~g~------~~gi~~~pis~~~~d~qf~e~k~~s~~eIa~af~VPp~llGi~~~~t~~~~n~e~~~~~f~~ 310 (344) T protein:vir:20 237 FKNLFLYAPQGK------ADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQLMGGKPENVGSLGDIEKVAKVFVR 310 (344) T ss_pred ccceEEecCCCC------ccceeEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHHHhccCCCCCCccccHHHHHHHHHH Confidence 2333 4321 11244444444444455777777888899998865443322 122344444443333322 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHH Q lcl|NC_016071. 384 HFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDME 432 (516) Q Consensus 384 ~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~ 432 (516) .-+.--++.+++ ||. || +. ..-+|.+...+..| + T Consensus 311 ~~l~P~~~~~e~-in~----~l-------g~--~~i~F~~~~l~~~d-~ 344 (344) T protein:vir:20 311 NELIPLQDRIRE-ING----WL-------GQ--EVIRFKNYSLDTDN-D 344 (344) T ss_pred HHHHHHHHHHHH-HHH----hc-------CC--cccccCccccccCC-C Confidence 112222223321 222 21 11 11223333333333 1 No 126 >protein:vir:3780 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536820;genbank:gi:17981829;genbank:GeneID:929208 Probab=98.06 E-value=5.6e-06 Score=49.40 Aligned_cols=335 Identities=10% Similarity=0.001 Sum_probs=148.8 Q ss_pred CCccccCcccccchhhhccc-CCCCcccccchHHHHHHHHHHHhhcccc-cCCcccHHHHHHHhh-ChHHHHHHHHHHHH Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLA-VSRLRTGELGSGALSQLRAESEVMKVEE-LRWPCFLATVEAMKQ-DHTVSTALDTKYVF 77 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~-~~~~~~~e~g~~~~~~~~~~~~~~~~~~-lr~~~~~~~y~~m~~-D~~v~s~l~~Rk~~ 77 (516) |+|+.++...-+... .|. .-.+..+|.--..+..+.+..+ +.+.+ -.-|-+..-..++.+ .+|-+++|..++.. 
T Consensus 1 ~~~~~~~~~~~~~~~--~~~~~~~f~~~~~~~~~~~~y~~~~~-~~~~~~~epp~~~~~la~l~~~~~~h~~~i~~k~n~ 77 (345) T protein:vir:37 1 MKTNVKTDNKKGIVI--APINDRTFSLNEISASPALDYVGIGF-DENYNCYLPPVNRHALAKLPHQNAQHGGILHSRANM 77 (345) T ss_pred CCCCccccchhhccc--CcceeEEeecCCcccccchhhhhhhh-cCCccccCCCCCHHHHHHHhhcccccccceeeechH Confidence 999887765433211 111 1112222221100111211111 01110 001112223334544 89999999877766 Q ss_pred HhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHHHHHhhcceeeeEEEeecccccccccceeecccc Q lcl|NC_016071. 78 VTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAATFNEYGFSIFEKVYRTESAPSKYAGYITIDKIA 157 (516) Q Consensus 78 v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~lda~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l~ 157 (516) +.+. +.|.+.- +..++...+++.+.+|.+.+|++....|. +..|. T Consensus 78 l~~~---~~Pn~~l-----------------------t~~~f~~~~~d~ll~Gnay~~~~rn~~G~---------~~~L~ 122 (345) T protein:vir:37 78 VSSL---YEGGKAL-----------------------SRMDMRALCLNLIQFGDVGLLKVRNGFGQ---------VVRLV 122 (345) T ss_pred HHhh---ccCCCCC-----------------------CHHHHHHHHHHHHhcCCeEEEEEEcCCCc---------EEEEE Confidence 6542 3333221 12233333456667899999998764331 22344 Q ss_pred ccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCcccc Q lcl|NC_016071. 158 FRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPAG 237 (516) Q Consensus 158 ~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~G 237 (516) +.|+..++ ...|++....++.. .....+....+|.+.++..+.-.-.+..|| T Consensus 123 pl~~~~vr------~~~d~~~~~~~~~~----------------------~~~~~g~~~~~~~~dVihir~~~~~~~~~G 174 (345) T protein:vir:37 123 PLSSLYLR------VRKDGGYSYLMKKS----------------------LYDTAQEIYRYDAKDIIFIKLYDPMQQVYG 174 (345) T ss_pred EEcCceeE------EEEeCCeeEEEEEe----------------------EecCCceEEEEccccEEEecCCCCCCCccc Confidence 44444332 12233222111110 001122344566666554443333456789 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHhhccc-cceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhccc---ceEEE Q lcl|NC_016071. 
238 VSPLVGCYRAFREKILIENLETIGASKDLG-GIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGE---QAYFI 313 (516) Q Consensus 238 ~gLlr~~~~~~~fK~~~~~~w~~~~er~g~-~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~---~a~~i 313 (516) .+.+..+......-.....+-..|.+ +|+ +=-+++.+ . ..-++++.+. +++..+....+. ...+. T Consensus 175 ls~~~~a~~si~l~~~a~~~~~~~f~-NG~~p~~Il~~~------d-~~l~~e~~~~---lk~~~~~~~g~~n~~~~~i~ 243 (345) T protein:vir:37 175 SPDYVGGIQSALLNSDATVFRRRYFS-NGAHMGFILYST------D-PDLTEEMEEE---IARKISESKGVGNFRSMFVN 243 (345) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHh-ccCCcceEEEec------C-CCCCHHHHHH---HHHHHHHhcCcccccceEEE Confidence 99988888776665555554444443 443 21122221 1 1123333333 333333322221 12233 Q ss_pred eccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhccccccc--CCccchhhHHHHHHHHH-HHHHHHHH Q lcl|NC_016071. 314 LPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLG--NDGQGSYNLSESKQSIH-GHFVQRDI 390 (516) Q Consensus 314 iP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~--~~~~GS~Al~~vh~ev~-~~~~~aDa 390 (516) .|.|-+ ..++++..+-+....+|.+.-++...+|+.+.--.---.+ .+++|+++-.+....++ ..-+.--+ T Consensus 244 ~p~g~~------~G~~~~pls~~~~d~qf~e~k~~~~~dIa~a~~VPp~llGi~~~~~~~~~~~e~~~~~f~~~~l~P~~ 317 (345) T protein:vir:37 244 IANGHP------DGLKVIPIGDTGTKDEFANIKNISAQDVLTAHRFPAGLSGIIPTNTGGLGDPLKYREVYHYDEVMPLQ 317 (345) T ss_pred cCCCcc------cceEEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCccCCCCCCcccHHHHHHHHHHHHHHHHH Confidence 344321 1234444433444455777777888899988765443222 12234444444443333 33345566 Q ss_pred HHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHH Q lcl|NC_016071. 391 DIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEG 433 (516) Q Consensus 391 ~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~ 433 (516) +.|++.+|+. + . .++. ..+.|+. 
.++.+ T Consensus 318 ~~ie~~ln~~-~----~-----~~~~--~~i~F~~---~~L~~ 345 (345) T protein:vir:37 318 EIIAETINQD-P----E-----IKNL--LKIKFRE---QNFAK 345 (345) T ss_pred HHHHHHhhhh-c----c-----CCCc--ceEEecc---hhhcC Confidence 6677777641 1 1 1111 2455642 33322 No 127 >protein:vir:96738 Length: 505 # NCBI annotation: putative phage-related protein # Family: family:all:47 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039817;genbank:gi:126010916;genbank:GeneID:5076248 Probab=98.04 E-value=6.1e-06 Score=49.17 Aligned_cols=450 Identities=13% Similarity=0.041 Sum_probs=181.1 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHHHHHH-----H-hhcccccCCcc---------cHHHHHHHhh-C Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAES-----E-VMKVEELRWPC---------FLATVEAMKQ-D 64 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~-----~-~~~~~~lr~~~---------~~~~y~~m~~-D 64 (516) |..=.+.+.-+-+.- .|..++.+..- ......+.+.. . .-..|.++.+. ...--+++.+ + T Consensus 1 ~~r~~~~~~~~dr~i--~~~~~~~~~~~--~~~~~~y~aa~~~r~~~~w~~~~~~~s~~~~i~~~~~~lr~RaRdL~rNn 76 (505) T protein:vir:96 1 MKRAEKKPSLAQRMV--NWAWYRYVEPQ--KNAARAFEAARRDRLGKAWLRRASRLSADEEIYADLASLVQRAREQSINN 76 (505) T ss_pred CCCCccccchhhccc--chhhhhhHHHH--HHhhhhcccccCCCccccccCCCCCCChHHHHHHHHHHHHHHHHHHHhcC Confidence 433222222111110 01111110000 00000000000 0 00011111111 1111244444 8 Q ss_pred hHHHHHHHHHHHHHhc-CCceeeeCCCC-CChhhHHHHHHHHHHHhhc--------cCcCCHHHHHHHHHHH-Hhhccee Q lcl|NC_016071. 65 HTVSTALDTKYVFVTK-AFNDFKVLYNR-DSKASKDAAEFVEYALKNL--------ANQQTLRDIARSAATF-NEYGFSI 133 (516) Q Consensus 65 ~~v~s~l~~Rk~~v~~-~~w~i~~~~~~-d~~~~~~~a~~v~~~l~~~--------~~~~~~~~~l~~~lda-~~~G~S~ 133 (516) +++++++++....|.+ .-+.+.+.+.. +...+++.++.|+..|+.. 
....+|..+.+.++.+ +.-|=++ T Consensus 77 ~~a~~av~~~~~nvVG~~Gi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~~~D~~g~~~f~~lq~l~~r~~~~dGE~f 156 (505) T protein:vir:96 77 PYAKRFYQLLKNNVIGPKGMTFQSRVKRRNGKPDDRANTLIEGNWQQWIKKGNCDVTGRYHFVTLLHLWMETLARDGEVL 156 (505) T ss_pred hHHHHHHHHHHHHhcCCCcceeeecCCcccccccHHHHHHHHHHHHHhcCCcCcceeccCCHHHHHHHHHHHHhhCCceE Confidence 9999999999998887 46666554322 2234667888888777654 3345688888877764 4457666 Q ss_pred eeEEEeecccccccccceeeccccccCchhcccc------------cceeecCCCceeeecccccccccccccccccccc Q lcl|NC_016071. 134 FEKVYRTESAPSKYAGYITIDKIAFRPQSSLSRS------------KPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISS 201 (516) Q Consensus 134 ~Eivw~~~~~~~~~~g~~~~~~l~~r~q~ti~~~------------~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~ 201 (516) +-++|.... .+.+ +|...++.-|.-+ .=..||.+|+.+-.+-. . .+| T Consensus 157 ~~~~~~~~~-------~~~~-~lqliepd~l~~~~n~~~~~~~~i~~GIe~d~~Gr~~aY~i~-----------~--~hP 215 (505) T protein:vir:96 157 VREHRGYPN-------KWGY-ALQILECDRLDLNYNADLQNGNRIRMSIELDAWERPVAYHLL-----------V--NHP 215 (505) T ss_pred EEEeecCCC-------Ccce-EEEEechhhcCCCCCcccCCcCeEEeceEECCCCceEEEEEe-----------e--cCC Confidence 544443221 1111 1222233323211 01234444443321110 0 011 Q ss_pred ccccccccccCCCccccccccEEEEeec-CcCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccc Q lcl|NC_016071. 202 AMSLVTNLTSSADEVFIPINKLMVMSLG-GTESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILN 280 (516) Q Consensus 202 ~~~~~~~~~~~~~~~~iP~~k~i~~~~~-~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~ 280 (516) ................||... |+|.+. 
.+.+..-|.+.|.++.....-........++.+.-...=..+++..+.-.+ T Consensus 216 gd~~~~~~~~~~~~~rvpa~~-vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~ 294 (505) T protein:vir:96 216 GDNSYCYHYAGQTYERVPADE-IIHTFVPWRPHQNRGIPWTHASMVELHHIGEYRKSEMIAAELGAKKVGFYEQDPEAYD 294 (505) T ss_pred CccccccccccccccccCHhH-hhhhhcccCCccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCccCC Confidence 111111111122344577654 566665 567888899999988776554444444333333221111112232222111 Q ss_pred cccCCCCHHHHHHHHHHHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhc- Q lcl|NC_016071. 281 KAAIDPKSPESEMVQGLMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGA- 359 (516) Q Consensus 281 k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLG- 359 (516) ....+. +.... ..+..| ....++.|. +|++.+.+.. ...|..|.+..-++|+..+.- T Consensus 295 ~~~~~~---~~~~~-------~~l~pG--~i~~L~pGe--------~i~~~~~~~p--~~~~~~f~~~~lr~iaaglgi~ 352 (505) T protein:vir:96 295 QPPEDD---QGEIV-------EEVEAG--TYQLLPYGI--------RFKEHKIDHP--HTNFGAFVKSSLRGVAAGMGPA 352 (505) T ss_pred Cccccc---cCccc-------cccCCc--eeeecCCCC--------eeeeeCCCCC--CCCHHHHHHHHHHHHHhhcCCC Confidence 111111 11000 111112 234445564 4555555433 345788888888999887743 Q ss_pred -ccccccCCccchhhHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHhcCC-cCCc---cccceE--EecCcCc Q lcl|NC_016071. 360 -GFINLGNDGQGSYNLSESKQSIHGHFVQRDID----IIVEAFNKNLIPQLLALNDI-RLSD---EDMPKL--KPGLIQE 428 (516) Q Consensus 360 -qtLts~~~~~GS~Al~~vh~ev~~~~~~aDa~----~i~~~ln~~li~~lv~lN~~-~~~~---~~~P~~--~~~~~~~ 428 (516) +.||.+-+ +.||+.+..-..-+....+.... .+|.-+=+-++...+ +++. ..|+ ..+-.. ....-.. 
T Consensus 353 ye~lt~D~s-~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~-l~G~i~~p~~~~~~~~~~~w~~p~~~~ 430 (505) T protein:vir:96 353 YNRLAHDLE-GVNFSSLRSGELDERDLYKLLQFFVVTELLERVAGNLISMSL-LTQALPLNMVDIDRLSQYAFQPRGWDW 430 (505) T ss_pred HHHHhcccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HcCCcCCCCccchhhceeeeccCCccc Confidence 23554322 24565444332222222222322 333333233333322 2321 1111 112122 2223333 Q ss_pred hhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccccCc---ccccCCCCCCcccccccccCCCCCcccccc Q lcl|NC_016071. 429 VDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMSTDE---LLKLLGQDTSRSGDGMTAGSNGNGTGKISS 505 (516) Q Consensus 429 ~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 505 (516) .|..+-+++....+..|+... ++.+++ .|.....--++-... ..+..-....+. ..+.+ +....+... T Consensus 431 iDP~Ke~~a~~~~i~~G~~t~----~~~~a~-~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~---~~~~~-~~~~~~~~~ 501 (505) T protein:vir:96 431 VDPAKDSKAHSESIKNRTRSR----SSIIRA-AGDDPEDVFDEIAWEEQLMRDKGVNPTPPE---QESKD-ATTDEEDDS 501 (505) T ss_pred cChHHHHHHHHHHHHcCCCCH----HHHHHH-cCCCHHHHHHHHHHHHHHHHHcCCCCCCCC---CCCCC-CCCCCCCCC Confidence 566666788889999998665 334444 466431110000000 000000000000 00001 111111122 Q ss_pred cccc Q lcl|NC_016071. 506 TRDN 509 (516) Q Consensus 506 ~~d~ 509 (516) +.|| T Consensus 502 ~~d~ 505 (505) T protein:vir:96 502 ASDD 505 (505) T ss_pred CCCC Confidence 3333 No 128 >protein:vir:6382 Length: 553 # NCBI annotation: portal protein Lambda B # Family: family:all:47 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918995;genbank:gi:34610170;genbank:GeneID:2559575 Probab=97.94 E-value=1e-05 Score=47.99 Aligned_cols=474 Identities=11% Similarity=0.037 Sum_probs=183.1 Q ss_pred CCccccCcccccchhhhcccCCCCcccc--cchHHHHH----HHHH------HHhhcccccCCcccHHHHHHHhh-ChHH Q lcl|NC_016071. 
1 MSTRFAQPSEVVKAGNENLAVSRLRTGE--LGSGALSQ----LRAE------SEVMKVEELRWPCFLATVEAMKQ-DHTV 67 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e--~g~~~~~~----~~~~------~~~~~~~~lr~~~~~~~y~~m~~-D~~v 67 (516) |-....++.....++ .+..+..- .+..|.+. ..+| .+.+-.++++ ..-.--+++.+ +++. T Consensus 1 m~~~~~r~~~~~a~~-----~~~~~~~~~~~~y~gA~~~~r~~~~w~~~~~s~~~~~~~~~~--~lr~RaRdL~rNn~~a 73 (553) T protein:vir:63 1 MTKVTVRKLSEVTSG-----RPEQSASLGGGGLEGASRLSRETVSWNPSLRSPDALINPLKR--IADARGRDMADNDGFT 73 (553) T ss_pred Ccchhhhhhcccccc-----cchhhhhhhcccccccccCCCcccccccCCCChHHHHHHHHH--HHHHHHHHHHhcChHH Confidence 333222221111111 11111000 00011000 0011 0111111100 01112244444 8999 Q ss_pred HHHHHHHHHHHhcCCceeeeCC------CCCChhhHHHHHHHHHHHhh----------ccCcCCHHHHHHHHHH-HHhhc Q lcl|NC_016071. 68 STALDTKYVFVTKAFNDFKVLY------NRDSKASKDAAEFVEYALKN----------LANQQTLRDIARSAAT-FNEYG 130 (516) Q Consensus 68 ~s~l~~Rk~~v~~~~w~i~~~~------~~d~~~~~~~a~~v~~~l~~----------~~~~~~~~~~l~~~ld-a~~~G 130 (516) +++++.....|-+.-+.+.+.+ +-+.+.+++..+.|+..|+. .....+|+.+...++. .+.-| T Consensus 74 ~~av~~~~~nvVG~Gi~~~~~~~~~~l~g~~~~~~~~~~~~ie~~w~~wa~~~~~~~D~~g~~~f~~~q~l~~r~~~~dG 153 (553) T protein:vir:63 74 NGAVGYQRDSIVGAQYRLNSMPDINVIPGATEEWAEEYQTIVEAKFELYAESLACYIDNAAISTFTGLIRLGVVGYVKTG 153 (553) T ss_pred HHHHHHHHHhhccCCceeeeccchhhhcCCCHHHHHHHHHHHHHHHHHhcCCccceeeccccCCHHHHHHHHHHHHHhCC Confidence 9999999999999877766543 33445666777777766653 3345678888888885 46668 Q ss_pred ceeeeEEEeecccccccccceeeccccccCchhcccc----------cceeecCCCceeeeccccccccccccccccccc Q lcl|NC_016071. 131 FSIFEKVYRTESAPSKYAGYITIDKIAFRPQSSLSRS----------KPWVFDEDGRTLKGIYQSKMAFANFQNGLTQIS 200 (516) Q Consensus 131 ~S~~Eivw~~~~~~~~~~g~~~~~~l~~r~q~ti~~~----------~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~ 200 (516) =+++-+.|....+... .-+|...++.-|.-+ .=..||.+|+.+-.+-...+.+..+. 
T Consensus 154 E~~~~~~~~~~~~~~~------~~~lq~ie~drl~~~~~~~~~~~i~~GVE~d~~Gr~vaY~i~~~hPgd~~~------- 220 (553) T protein:vir:63 154 EVLATAEWDRAANRPY------ATCFQMVSTDRLSNPYQQLDTPTLRRGVQYDKRGRPQGYWIQVAHPGDLYQ------- 220 (553) T ss_pred ceEEEeeeccCCCCcc------cceEEEechhhcCCCCCCCCCCeeEeeeEECCCCceEEEEeeccCCCcccc------- Confidence 8888888876543111 112222233323211 11234444443322110000000000 Q ss_pred cccccccccccCCCccccccccEEEEeec-CcCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccc Q lcl|NC_016071. 201 SAMSLVTNLTSSADEVFIPINKLMVMSLG-GTESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQIL 279 (516) Q Consensus 201 ~~~~~~~~~~~~~~~~~iP~~k~i~~~~~-~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~ 279 (516) ................+|... |+|.+. .+.|..-|.++|.++.....-........++... -.+-|..+. T Consensus 221 -~~~~~~~~~r~~~~~~v~a~~-vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~daeL~~a~-i~A~~a~fi------ 291 (553) T protein:vir:63 221 -MAPDMYKWKFVQQSKPWGRRQ-VIHILEPREPDQSRGIADIVSGLKDMRMAKRFKEMSLQNAV-INASYAAAI------ 291 (553) T ss_pred -ccccccceeeeccccccChhH-heecccccCCCcccCCchHHHHHHHHHHHhHHHHHHHHHHH-Hhhhheeee------ Confidence 000000011111223466554 566665 4678889999999887765544433333332221 122221111 Q ss_pred ccccCCCCHHHHHHHHHHH-------------HHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHH Q lcl|NC_016071. 280 NKAAIDPKSPESEMVQGLM-------------ADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELV 346 (516) Q Consensus 280 ~k~~~~~~~~~~~~l~~l~-------------~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li 346 (516) +...++ ....+.+.... ........|.. 
-.-|-.|+-.......+|++...+..+ .+|..|+ T Consensus 292 -~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~l~pG~i~~L~pGe~i~~~~p~~p~--~~~~~F~ 366 (553) T protein:vir:63 292 -ESELPP-EFIHSQMSGGSPNADMVGIFGKYMDALKAYVGGAN-NIQIDGAKIPHLFPGTKLNLKPMGTPG--GVGSEFE 366 (553) T ss_pred -ecCCCh-hhhhhhccccccccccccccccccccccccccccc-ceeecCceeeecCCCCeeeecCCCCCC--CCHHHHH Confidence 000000 00000000000 00000000000 001112222222333456666554333 3578888 Q ss_pred HHHHHHHHHHHhc--ccccccCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---hcCC-cCCccc--- Q lcl|NC_016071. 347 NSRKKAILDRFGA--GFINLGNDGQGSYNLSESKQSIHGHFVQRDIDIIVEAFNKNLIPQLLA---LNDI-RLSDED--- 417 (516) Q Consensus 347 ~~~d~~Isk~iLG--qtLts~~~~~GS~Al~~vh~ev~~~~~~aDa~~i~~~ln~~li~~lv~---lN~~-~~~~~~--- 417 (516) +..-+.|+..+.- +.||.+-++ .||+.+-.-..-+....+.....+...+-+-+..++++ +++. ..|+.. T Consensus 367 ~~~lr~iaaglGi~Ye~lt~D~s~-~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~a~l~G~i~~p~~~~~~ 445 (553) T protein:vir:63 367 ASLNRHLASAFGMSYEEFTRDFSK-ANYSSIQAGIAMTRRFLEGRKKMCADRLATEFFTLWLEEAIAAGEVPMPPGQTRD 445 (553) T ss_pred HHHHHHHHhhcCCCHHHHhhhccc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCcccch Confidence 8888999887743 335554332 45655544433333333344444434333333333322 3321 111100 Q ss_pred ----------c--ceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccccCc----ccccCC Q lcl|NC_016071. 418 ----------M--PKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMSTDE----LLKLLG 481 (516) Q Consensus 418 ----------~--P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~~~----~~~~~~ 481 (516) + ..+....-...|..+-+++....+..|+... ++.+++. |.....--++-... 
...-.+ T Consensus 446 ~~~~p~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~t~----~~~~a~~-G~D~~~v~~q~a~e~~~~~~~Gl~ 520 (553) T protein:vir:63 446 LFYQPLMKEALSKCEWIGASQGQIDQLKETQAAVMRIDAGLSTY----EREIARL-GGDFRKSFAQRAREDALLKKYGLT 520 (553) T ss_pred hhcchhhhhhhhceeeecCCccccChHHHHHHHHHHHHcCCCCH----HHHHHHh-CCCHHHHHHHHHHHHHHHHHcCCC Confidence 0 0122233334565555778888889998654 2333333 44321000000000 000000 Q ss_pred CCCCcccccccccCCCCCcccccccccchhhhh Q lcl|NC_016071. 482 QDTSRSGDGMTAGSNGNGTGKISSTRDNSVSNM 514 (516) Q Consensus 482 ~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~ 514 (516) -+..+.........+..+++..+++.+++...- T Consensus 521 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 553 (553) T protein:vir:63 521 FNLSAKRSLGDGRDAATGIAEDPAAAQTSQQGE 553 (553) T ss_pred CCCCCccccCCCcccCCCCCCCCCCCCcccccC Confidence 000000000000000000111111111111111 No 129 >protein:vir:389 Length: 530 # NCBI annotation: gp4 # Family: family:all:47 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046899;genbank:gi:9630468;genbank:GeneID:1261643 Probab=97.87 E-value=1.3e-05 Score=47.29 Aligned_cols=463 Identities=10% Similarity=0.044 Sum_probs=186.3 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHH-HHHHHHH------HHhhcccccCCcccHHHHHHHhh-ChHHHHHHH Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGA-LSQLRAE------SEVMKVEELRWPCFLATVEAMKQ-DHTVSTALD 72 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~-~~~~~~~------~~~~~~~~lr~~~~~~~y~~m~~-D~~v~s~l~ 72 (516) |+++-.-+..-..+ ...++-..-|..+ -....+| .+.+-.++ +.....--+++.+ ++++.++++ T Consensus 1 ~~~~~~~~~~~~~~------~~~~~~~~~~a~~~~~~~~~w~~~~~s~~~~i~~~--~~~lr~RaRdl~rNn~~a~~av~ 72 (530) T protein:vir:38 1 MKIPSLVGPDGKTS------LREYAGYHGGGGGFGGQLRGWNPPSESADAALLPN--YSRGNARADDLVRNNGYAANAVQ 72 (530) T ss_pred CccceeecCccccc------hHHHhhhhcccCCCCCcccccccCCCCHHHHHHHH--HHHHHHHHHHHHhcChHHHHHHH Confidence 77766554331111 0000000000000 0000111 01111111 0111122245544 899999999 Q ss_pred HHHHHHhcCCceeeeCC-----CCCChhhHHHHHHHHHHHhh----------ccCcCCHHHHHHHHHHH-HhhcceeeeE Q lcl|NC_016071. 
73 TKYVFVTKAFNDFKVLY-----NRDSKASKDAAEFVEYALKN----------LANQQTLRDIARSAATF-NEYGFSIFEK 136 (516) Q Consensus 73 ~Rk~~v~~~~w~i~~~~-----~~d~~~~~~~a~~v~~~l~~----------~~~~~~~~~~l~~~lda-~~~G~S~~Ei 136 (516) .....|-+.-+.+.+.+ +-+.+.+++..+.|+..|.. .....+|..+.+.++.+ +.-|=.++-+ T Consensus 73 ~~~~nvVG~Gi~~~~~p~~~~l~~~~~~~~~~~~~ie~~w~~W~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~~~ 152 (530) T protein:vir:38 73 LHQDHIVGSFFRLSYRPSWRYLGINEEDSRAFSRDVEAAWNEYAEDDFCGIDAERKRTFTMMIREGVAMHAFNGELCVQA 152 (530) T ss_pred HHHHHhhCCCceeeeccchhhcCCCHhHHHHHHHHHHHHHHHhhcCCCcEEeeeccCCHHHHHHHHHHHHhhCCceEEEe Confidence 99999999877666543 23345677777888887763 33456788888888764 5568888888 Q ss_pred EEeecccccccccceeeccccccCchhccccc----------ceeecCCCceeeeccccccccccccccccccccccccc Q lcl|NC_016071. 137 VYRTESAPSKYAGYITIDKIAFRPQSSLSRSK----------PWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLV 206 (516) Q Consensus 137 vw~~~~~~~~~~g~~~~~~l~~r~q~ti~~~~----------~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~ 206 (516) .|..+.+.. +.-+|-..++.-|.-+. =..||..|+.+-.+-. ..+++-... T Consensus 153 ~~~~~~g~~------~~~~lq~ie~d~l~~~~~~~~~~~i~~GIe~d~~Gr~~aY~i~-------------~~~~~~~~~ 213 (530) T protein:vir:38 153 TWDSDSTRL------FRTQFKMVSPKRVSNPNNIGDTRNCRAGVKINDSGAALGYYVS-------------DDGYPGWMA 213 (530) T ss_pred eeccCCCCc------cceEEEEechhhcCCCCCCCCCCeeEeeeEECCCCceEEEEEe-------------eccCCCccc Confidence 887654321 11123333333332110 0233343332211100 000000000 Q ss_pred cccccCCCccccccccEEEEeecC-cCCccccchhHHHHHHHHHHHHHHHHHHHHHH--hhccccceeeeeccccccccc Q lcl|NC_016071. 207 TNLTSSADEVFIPINKLMVMSLGG-TESNPAGVSPLVGCYRAFREKILIENLETIGA--SKDLGGIIELKIPSQILNKAA 283 (516) Q Consensus 207 ~~~~~~~~~~~iP~~k~i~~~~~~-~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~--er~g~~~~v~~~pp~~~~k~~ 283 (516) ...........+|.. -|+|.+.. +.+..-|.+.|.++.....--.......++.. .-..+.|..-..+..-.+... 
T Consensus 214 ~~~~~~~~~~~v~a~-~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~ 292 (530) T protein:vir:38 214 QNWTYIPRELPGGRP-SFIHVFEPMEDGQTRGANAFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSAMDFI 292 (530) T ss_pred cccceeeeeeccChh-HeEeeccccCCCcccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeeccCCcccccccc Confidence 000000111223333 46677664 57889999999988775554433333332221 111111111000000000000 Q ss_pred CCC-CHHHHHHHHHHHHHHH------HhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHH Q lcl|NC_016071. 284 IDP-KSPESEMVQGLMADAA------NAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDR 356 (516) Q Consensus 284 ~~~-~~~~~~~l~~l~~~~~------~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~ 356 (516) .++ ..++...+........ .+..+.-....++.| .+|++.+....+ ..|..|.+..-+.|+.+ T Consensus 293 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pG--------e~i~~~~p~~p~--~~~~~f~~~~lr~iaag 362 (530) T protein:vir:38 293 LGADNKEQQSKLTGWLGEMAAYYSAAPVRLGGARVPHLLPG--------DSLNLQSAQDTD--NGYSTFEQSLLRYIAAG 362 (530) T ss_pred ccCCcccccccccccchhhhhcccccceeccCceeeecCCC--------CeeeeeCCCCCC--CCHHHHHHHHHHHHHhh Confidence 000 0000000000000000 001111122334445 456666554433 35778888888898887 Q ss_pred Hhc--ccccccCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---hcCCc-CCcc---c-------c-- Q lcl|NC_016071. 357 FGA--GFINLGNDGQGSYNLSESKQSIHGHFVQRDIDIIVEAFNKNLIPQLLA---LNDIR-LSDE---D-------M-- 418 (516) Q Consensus 357 iLG--qtLts~~~~~GS~Al~~vh~ev~~~~~~aDa~~i~~~ln~~li~~lv~---lN~~~-~~~~---~-------~-- 418 (516) +.- +.||.+-+ +.||+.+..-..-+....+.....+...+-+-+...+++ +++.- .|.. . 
+ T Consensus 363 lGi~ye~lt~D~s-~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~av~~G~i~~p~~~~~~~~~~~~a~~~ 441 (530) T protein:vir:38 363 LGVSYEQLSRNYS-QMSYSTARASANESWAYFMGRRKFVASRQACQMFLCWLEEAIVRRVVTLPSKARFSFQEARTAWGN 441 (530) T ss_pred cCCCHHHHhcccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHcCCccCCCCCCCCchhhHHhhhc Confidence 743 22444322 246766554444444444444444444333333333332 23211 1110 0 1 Q ss_pred ceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccccCc---c-cc-cCCCCCCcccccccc Q lcl|NC_016071. 419 PKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMSTDE---L-LK-LLGQDTSRSGDGMTA 493 (516) Q Consensus 419 P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~~~---~-~~-~~~~~~~~~~~~~~~ 493 (516) ..+....-...|..+-+++....+.+|+... ++.+++ .|.....--++-... . .. ...+..++... T Consensus 442 ~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~s~----~~~~a~-~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~---- 512 (530) T protein:vir:38 442 ANWIGSGRMAIDGLKEVQEAVMLIEAGLSTY----EKECAK-RGDDYQEIFAQQVRESMERRAAGLNPPAWAAAAF---- 512 (530) T ss_pred eeeecCCccccChHHHHHHHHHHHHcCCCCH----HHHHHH-cCCCHHHHHHHHHHHHHHHHHcCCCCCCCccccc---- Confidence 1223344444566666778888899998654 233333 354321100000000 0 00 00000000000 Q ss_pred cCCCCCcccccccccchh Q lcl|NC_016071. 494 GSNGNGTGKISSTRDNSV 511 (516) Q Consensus 494 ~~~~~~~~~~~~~~d~~~ 511 (516) .+....+.+.++..++.+ T Consensus 513 ~~~~~~~~~~~~d~~~~a 530 (530) T protein:vir:38 513 EAGVKKSNEEEQDGARAA 530 (530) T ss_pred CCCCCCCCCCCCCCCCCC Confidence 000000111111111111 No 130 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=97.71 E-value=2.7e-05 Score=45.67 Aligned_cols=443 Identities=11% Similarity=0.002 Sum_probs=159.0 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhh----ChHHHHHHHHHHH Q lcl|NC_016071. 
1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQ----DHTVSTALDTKYV 76 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~----D~~v~s~l~~Rk~ 76 (516) |+|-........+.-... +++ +.....+-... ..++. ....+.+++.. .....-++++.-. T Consensus 1 ~~t~~~~i~~L~~~~~~~--~~r----------~~~l~~Yy~G~--~~i~~-~~~~~~~~~~~~~~~~n~~~~ivd~~~~ 65 (480) T protein:vir:78 1 MTTYHEHVERLQGLLARD--LPN----------LLEAEAYRNGT--RRLKT-IGIGAPPELAYLDVQPGWVATYLRTLSD 65 (480) T ss_pred CCCHHHHHHHHHHHHHHH--HHH----------HHHHHHHHhcc--ccccc-cccccchhHhhhhhhcchHHHHHHHHHh Confidence 665544333222211110 000 01111111111 11211 00111122210 1111111111111 Q ss_pred HHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHH-HHHHhhcceeeeEEEeecccccccccceeecc Q lcl|NC_016071. 77 FVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSA-ATFNEYGFSIFEKVYRTESAPSKYAGYITIDK 155 (516) Q Consensus 77 ~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~-lda~~~G~S~~Eivw~~~~~~~~~~g~~~~~~ 155 (516) .+.-.. |.+. + +.+..+.+.+.|+. ..|..++..+ .+|..||.| +++||.-.....-.+|...+.. T Consensus 66 ~l~~~g--~~~~---~---d~~~~~~l~~i~~~----N~~d~~~~~~~~~a~~~G~a-y~~v~~~~~~~~d~~g~~~i~~ 132 (480) T protein:vir:78 66 RLDIEG--FRIS---E---DSEGLEELWNWWQA----NDLDEESVLGHDDSLTFGRS-YITVSHPDVESGDPAGIPLIRV 132 (480) T ss_pred hhccCc--eecC---C---CchhHHHHHHHHHh----cCHHHHHHHHHHHHhhcCce-EEEEecCccccCCCCCeeEEEE Confidence 111111 2221 1 22334445555543 2366666664 579999986 5788864322222234444333 Q ss_pred ccccCchhcccccceeecC--CCceeeecccc--------ccccccccccccccc-----cccccccccccCCCcccccc Q lcl|NC_016071. 156 IAFRPQSSLSRSKPWVFDE--DGRTLKGIYQS--------KMAFANFQNGLTQIS-----SAMSLVTNLTSSADEVFIPI 220 (516) Q Consensus 156 l~~r~q~ti~~~~~f~~~~--dg~~l~~~~q~--------~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~iP~ 220 (516) +.+. .+. ..||+ .++.+..++-. ......|..+....- ....+.. ........++. 
T Consensus 133 ~~p~---~~~----~~~D~~~~~~~~~~i~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~g~ 203 (480) T protein:vir:78 133 ESPL---YMY----AELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVV--DGDVIKHGLGV 203 (480) T ss_pred Eccc---ceE----EEEcCCCccceEEEEEEEEeecCCCceEEEEEEeCCeEEEEEecCCCcccccc--ccccccCCCCC Confidence 2221 111 12221 22222222100 000000111000000 0000000 00111122445 Q ss_pred ccEEEEeecCcCCccccchhHHHHHHHHHHH-HHHHHHHHHHHhhccccceeeeecccccccccCCCCHH-HHHHHHHHH Q lcl|NC_016071. 221 NKLMVMSLGGTESNPAGVSPLVGCYRAFREK-ILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSP-ESEMVQGLM 298 (516) Q Consensus 221 ~k~i~~~~~~~~g~p~G~gLlr~~~~~~~fK-~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~-~~~~l~~l~ 298 (516) .-++.|++..+.+.|+|.|-+..-..+.+-- +..+..++..++.+..|..++.|.. ......+ ....+ . T Consensus 204 vPvv~f~n~~~~~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~------~~~~~~~~~~~~~---~ 274 (480) T protein:vir:78 204 VPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVT------TDELTNDGENTTL---D 274 (480) T ss_pred cceEEeecccccCCccCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhhcCC------ccccccccccchh---h Confidence 5667788888899999998876422222211 3344556677777777776665421 1110000 11111 1 Q ss_pred HHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccC--Cccchh-hHH Q lcl|NC_016071. 299 ADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGN--DGQGSY-NLS 375 (516) Q Consensus 299 ~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~--~~~GS~-Al~ 375 (516) .+. | .-..+ .|-+ .++...+.. ....|...++.+-.+|+...--..-..+. 
.+.+|- |+- T Consensus 275 ~~~-----~--~~~~~-~~~~--------~~~~~~~~~-~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~n~~Sg~Alk 337 (480) T protein:vir:78 275 IYY-----G--RILTL-ASEA--------AKISEFKAA-ELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAII 337 (480) T ss_pred hhh-----h--hhccC-CCCC--------ceEEecCcc-CHHHHHHHHHHHHHHHhcccCCChHHhccccCcchHHHHHH Confidence 110 1 00111 1212 233333322 12234444444444444221100001110 011121 221 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccc--eEEecCcCchhHHHHHHHHHHHHhCCcccccHHH Q lcl|NC_016071. 376 ESKQSIHGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMP--KLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTV 453 (516) Q Consensus 376 ~vh~ev~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P--~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~ 453 (516) ....-....++.-.+.+...|. ++++.++.+.+..... .+. .+.|......++.+.++++.+|+.+|..+.. T Consensus 338 -~~~~~l~~ka~~~~~~f~~~l~-~~~~l~~~~~g~~~~~-~~~~i~v~f~~~~~~s~~~~ad~~~kl~~~g~~~~s--- 411 (480) T protein:vir:78 338 -ATDSRIVKMAERKGRIFGGAWE-RAMRIAMQIMGREVTE-EYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIP--- 411 (480) T ss_pred -HHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHcCCCccc-cceeeeEEecCCCCCCHHHHHHHHHHHHHhccccCC--- Confidence 1122222223333344445564 4667777776432222 222 4577777788889999999999988853322 Q ss_pred HHHHHHHcCCCCCCCcccc---cCcc-------cccCCCCCCcccccccccCCCCCcccccccccchhhh Q lcl|NC_016071. 454 INKILEVGGFDEEIPEDMS---TDEL-------LKLLGQDTSRSGDGMTAGSNGNGTGKISSTRDNSVSN 513 (516) Q Consensus 454 ~~~i~e~~Glp~~~~~~~~---~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~ 513 (516) .+.+++.+|+.+...++.. .... 
.....+.......+ ..+.....+.+++.+.-.+-.+ T Consensus 412 ~et~~~~lg~~~d~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 480 (480) T protein:vir:78 412 KEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKP-TVTETKTETQTSPSGFNRTKTR 480 (480) T ss_pred HHHHHhcCCCCHhHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCC-CCCCCCCccccccCCCCcccCC Confidence 4667888888643211100 0000 00000110000000 0000000111111111111111 No 131 >protein:vir:94956 Length: 452 # NCBI annotation: putative phage structural protein # Family: family:all:584 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239276;genbank:gi:66392058;genbank:GeneID:5076601 Probab=97.59 E-value=4.1e-05 Score=44.63 Aligned_cols=406 Identities=12% Similarity=0.038 Sum_probs=175.2 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHh------------hcccccCCcccHHHHHHHhh----C Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEV------------MKVEELRWPCFLATVEAMKQ----D 64 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~------------~~~~~lr~~~~~~~y~~m~~----D 64 (516) |. |..+ +|. ... -...|....+. ...|... .+.-+-|+.-+. - T Consensus 1 m~--------V~~~---hp~--------y~a-~~~~W~~~rd~~~G~~~~r~~g~~YLpk~~-~E~~~~Y~~rl~rA~~~ 59 (452) T protein:vir:94 1 MP--------IETK---HPE--------YLA-YENDWIDCRVASLGQREVKKKGVRFLPKLS-GQTDDMYNAYKQRALFY 59 (452) T ss_pred CC--------CCCc---CHH--------HHH-HHHHHHHHHHHhcChHHHHcCCcccCCCCC-CCCHHHHHHHHhhccCC Confidence 21 1110 110 000 01122222111 0111111 223344544332 5 Q ss_pred hHHHHHHHHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHH-HHHhhcceeeeEEEeeccc Q lcl|NC_016071. 65 HTVSTALDTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESA 143 (516) Q Consensus 65 ~~v~s~l~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~ 143 (516) +++...++.-...|.+.++.+++++ .+.++. .+.. 
..+++.+++.++ .++.||.+.+=+-|-..++ T Consensus 60 n~~~~t~~~~~G~vf~k~p~~~~p~--------~l~~~~----~D~~-G~~L~~~~~~~~~~~l~~G~~~ilVD~p~~g~ 126 (452) T protein:vir:94 60 SITSKTLSALSGMVLDQPPVITHPD--------AMSKYF----EDQS-GIQFYEVFTRAVEETLLMGRVGVFIDRPLTGG 126 (452) T ss_pred chHHHHHHHHhchhhcCCceecccH--------HHHHHH----hccc-CCCHHHHHHHHHHHHHhcCeEEEEEeeccCCC Confidence 7888888888888888887765421 222221 1333 356889998877 6999999888777755432 Q ss_pred ccccccceeeccccccCchhcccccceeecCCCceee-eccccc----ccccccccccccc------ccccccccccccC Q lcl|NC_016071. 144 PSKYAGYITIDKIAFRPQSSLSRSKPWVFDEDGRTLK-GIYQSK----MAFANFQNGLTQI------SSAMSLVTNLTSS 212 (516) Q Consensus 144 ~~~~~g~~~~~~l~~r~q~ti~~~~~f~~~~dg~~l~-~~~q~~----~~~~~~~~~~~~~------~~~~~~~~~~~~~ 212 (516) +| .+...++..|-. |.++.+|++.+ .++... ....+......++ ...+....+.... T Consensus 127 --rP-------y~~~~~~~~Ii~---W~~~~~g~l~~v~lre~~~~~d~~d~f~~~~~~~yRvL~l~~g~~~v~~~~~~~ 194 (452) T protein:vir:94 127 --DP-------YISVYTTENILN---WEEDEDGRLLMVVLREFYTVRDTADRYVQNIRVRYRCLELVDGLLQITVHETQD 194 (452) T ss_pred --ce-------EEEEechhhhcC---ccccccCCeeEEEEEEEEEEecCCCcccceeEEEEEEEEEeCCeEEEEEEEccC Confidence 12 122233344432 67777775321 111110 0000000000000 0000100111011 Q ss_pred C-------------CccccccccEEEEeecCcCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccc Q lcl|NC_016071. 213 A-------------DEVFIPINKLMVMSLGGTESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQIL 279 (516) Q Consensus 213 ~-------------~~~~iP~~k~i~~~~~~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~ 279 (516) . .+..++.=-|+++ +....+--.+.+.|..++..-+---....+.-.-+..-+.|++++++. 
T Consensus 195 ~~~~~~~~~~~~~~~~~~l~~IP~v~~-~~~~~~~~~~~pPLl~LA~ln~~hy~~~sd~~~~l~~~~~P~l~~~g~---- 269 (452) T protein:vir:94 195 GKVWELAKTSTIQNVGVTMDYIPFFCI-TPSGLSMTPAKPPMIDIVDINYSHYRTSADLEHGRHFTGLPTPWITGA---- 269 (452) T ss_pred CceeeeccceeecCCCcccceeEEEEE-cCCCCCCCCCccchHHHHHHHHHHhcchhHHHHHHHHcccceeEeecC---- Confidence 1 1111111123322 222222233555455554432211111122222333345666666541 Q ss_pred ccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEecc-CcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHh Q lcl|NC_016071. 280 NKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILPS-DMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFG 358 (516) Q Consensus 280 ~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~-g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iL 358 (516) +.. + .++.|+.++..+|+ |. +..|++.+|++ ....++.++...++| ..+ T Consensus 270 -----~~~-~-------------~i~iG~~~~~~lpe~~~--------~~~yie~~g~~-i~~~~~~l~~le~~m--~~~ 319 (452) T protein:vir:94 270 -----ESQ-S-------------TMHIGSTKAWVIPEVAA--------KVGFLEFTGQG-LQSLEKALSEKQAQL--ASL 319 (452) T ss_pred -----cCC-C-------------ceEecccccccCCCCCC--------cceEEccCchh-HHHHHHHHHHHHHHH--HHH Confidence 110 0 13448888889995 64 45667766654 223455666666666 233 Q ss_pred c-ccccccCCccchhhHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEe--cCcCchhHHHH Q lcl|NC_016071. 359 A-GFINLGNDGQGSYNLS-ESKQSIHGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKP--GLIQEVDMEGF 434 (516) Q Consensus 359 G-qtLts~~~~~GS~Al~-~vh~ev~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~--~~~~~~dl~~~ 434 (516) | ..|.....+. ..+.+ .....-...++.+-+..+++.++ ++++++..+-+. +.. .+|.. +.......... 
T Consensus 320 Ga~ll~~~~~~~-~s~ea~~~~~~~~~s~L~~~a~~~e~al~-~~l~~~a~w~g~---~~~-~~v~~n~dF~~~~~~~~~ 393 (452) T protein:vir:94 320 SARLIDNSTRGS-EATETVKLRYMSETASLKSVTRAVEALLN-KAYSCIMDMESM---GGT-LNIKLNSAFLDSKLTAAE 393 (452) T ss_pred HHHhhccCCCcc-hHHHHHHHHHHHhhHHHHHHHHHHHHHHH-HHHHHHHHHcCC---CCc-eEEEeccccccccCCHHH Confidence 3 3333322111 11112 22333335677888888899996 588998887632 222 23432 22222211344 Q ss_pred HHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccccCcccccCCCCCCcccccccccCCCCCcccc Q lcl|NC_016071. 435 SKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMSTDELLKLLGQDTSRSGDGMTAGSNGNGTGKI 503 (516) Q Consensus 435 a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 503 (516) .+++-++...|.+.. ....+++ ++.|++.++.+++.+..+ .+.+ ..++. +.|+++.+++ T Consensus 394 ~~al~~~~~~G~is~-~t~~~~L-~~~gvl~~~~e~~~i~~E-~~~~-~~~~~------~~~~~~~~~~ 452 (452) T protein:vir:94 394 LKAWVEAYLSGGISK-EIYIHAL-KVGKVLPPPGESMGVIPD-PPAP-EPSPS------NTPPNPSSKA 452 (452) T ss_pred HHHHHHHHhcCCCcH-HHHHHHH-HhCCCCCCccCHHHHHHH-hhcc-CcccC------CCCCCCccCC Confidence 567778889997654 2333444 346887665444332222 1111 11111 1222222222 No 132 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=97.55 E-value=4.7e-05 Score=44.29 Aligned_cols=440 Identities=11% Similarity=-0.006 Sum_probs=159.7 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccC-CcccHHHHHHHh----hChHHHHHHHHHH Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELR-WPCFLATVEAMK----QDHTVSTALDTKY 75 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr-~~~~~~~y~~m~----~D~~v~s~l~~Rk 75 (516) |+|-........+.-... .++ +.....+-... ..++ .+. .+.+++. 
......-++.+.- T Consensus 1 ~~t~~d~i~~L~~~~~~~--~~r----------~~~~~~Yy~G~--~~i~~~~~--~~~~~~~~~~~~~n~~~~ivd~~~ 64 (480) T protein:vir:78 1 MTTYHEHVERLQGLLARD--LPN----------LLEAEAYRNGT--RRLKTIGI--GAPPELAYLDVQPGWVATYLRTLS 64 (480) T ss_pred CCCHHHHHHHHHHHHHHH--HHH----------HHHHHHHHhcc--ccchhccc--ccchhhhhhhhhcchHHHHHHHHH Confidence 655544333333322111 000 11111111111 1111 010 0111111 0111111222211 Q ss_pred HHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHH-HHHHhhcceeeeEEEeecccccccccceeec Q lcl|NC_016071. 76 VFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSA-ATFNEYGFSIFEKVYRTESAPSKYAGYITID 154 (516) Q Consensus 76 ~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~-lda~~~G~S~~Eivw~~~~~~~~~~g~~~~~ 154 (516) ..+.--. |.+. .+ .+..+.+.+.|+. ..|..++.++ .++.-||.| +++||.-.....-.+|...+. T Consensus 65 ~~l~~~g--~~~~--~d----~~~~~~l~~i~~~----N~~~~~~~~~~~~a~~~G~a-y~~v~~~~~~~~d~~~~~~i~ 131 (480) T protein:vir:78 65 DRLDIEG--FRIS--ED----SEGLEELWNWWQA----NDLDEESVLGHDDSLTFGRA-YITVSHPDVESGDPAGIPLIR 131 (480) T ss_pred hhhccCc--eecC--CC----chhHHHHHHHHHh----cCHHHHHHHHHHHHhhcCce-EEEeecCccccCCCCCeeEEE Confidence 1111111 2221 12 2233445555543 1366677665 579999997 578886332222224444433 Q ss_pred cccccCchhcccccceeecC--CCceeeecccccc--------cccccccccccc----c-cccccccccccCCCccccc Q lcl|NC_016071. 155 KIAFRPQSSLSRSKPWVFDE--DGRTLKGIYQSKM--------AFANFQNGLTQI----S-SAMSLVTNLTSSADEVFIP 219 (516) Q Consensus 155 ~l~~r~q~ti~~~~~f~~~~--dg~~l~~~~q~~~--------~~~~~~~~~~~~----~-~~~~~~~~~~~~~~~~~iP 219 (516) .+.|+. +. ..||+ +++.+..++-... ....|..+.... . ....+.. 
........++ T Consensus 132 ~~~p~~---~~----~i~D~~~~~~~~~~i~~~~~~d~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~g 202 (480) T protein:vir:78 132 VESPLY---MY----AELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVV--DGDVIKHGLG 202 (480) T ss_pred EEcccc---eE----EEEcCCCccceEEEEEEEEeecCCcceEEEEEEeCCeEEEEEecCCCcccccc--cccccccCCC Confidence 332221 11 11221 1222222211000 000000000000 0 0000000 0001111233 Q ss_pred cccEEEEeecCcCCccccchhHHHHHHHHHHH-HHHHHHHHHHHhhccccceeeeecccccccccCCC-CHHHHHHHHHH Q lcl|NC_016071. 220 INKLMVMSLGGTESNPAGVSPLVGCYRAFREK-ILIENLETIGASKDLGGIIELKIPSQILNKAAIDP-KSPESEMVQGL 297 (516) Q Consensus 220 ~~k~i~~~~~~~~g~p~G~gLlr~~~~~~~fK-~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~-~~~~~~~l~~l 297 (516) ..-++.|.++.+.+.|+|.|-+..-..+.+-- +..+...+..++.+..|..+++|.. .... .+.+...+ T Consensus 203 ~vPvv~f~n~~~~~~~~G~sdi~~~i~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~------~~~~~~~~~~~~~--- 273 (480) T protein:vir:78 203 VVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVT------TDELTNDGENTTL--- 273 (480) T ss_pred CcceEEeecccccCCccCccchhHHHHHHHHHHHHHHHHHHHHHHhhcchhhhhhCCC------ccccccccccchh--- Confidence 44567778888889999998775422222211 1233334556676777766655421 1110 01111111 Q ss_pred HHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCc-----cchh Q lcl|NC_016071. 298 MADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDG-----QGSY 372 (516) Q Consensus 298 ~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~-----~GS~ 372 (516) .... | ....++ |-+ .++...+.. 
+...++++++.-|.+..-.-.+....=+ .+|- T Consensus 274 ~~~~-----~--~~~~~~-~~~--------~~~~~~~~~----~~~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~Sg 333 (480) T protein:vir:78 274 DIYY-----G--RILTLA-SEA--------AKISEFKAA----ELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASA 333 (480) T ss_pred hhhh-----h--hhccCC-CCC--------ceEEecCcc----CHHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHH Confidence 1110 1 111222 212 333333322 1233444444444333211111111111 1122 Q ss_pred -hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCcc-ccceEEecCcCchhHHHHHHHHHHHHhCCccccc Q lcl|NC_016071. 373 -NLSESKQSIHGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDE-DMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKT 450 (516) Q Consensus 373 -Al~~vh~ev~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~-~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~ 450 (516) |+ .....-....++.-.+.+...|. ++++.++.+++...... .--.+.|......++.+.++++.+|+.+|..+.. T Consensus 334 ~Al-~~~~~~l~~k~~~~~~~f~~~l~-~~~rl~~~~~~~~~~~~~~~i~v~w~~~~~~s~~~~ad~~~kl~~~g~~~~s 411 (480) T protein:vir:78 334 EAI-IATDSRIVKMAERKGRIFGGAWE-RAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIP 411 (480) T ss_pred HHH-HHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHcCCCccccceeeeEEecCCCCCCHHHHHHHHHHHHHhcccCCC Confidence 22 11222223333444445555564 36677777764322221 1246778778888999999999999998864433 Q ss_pred HHHHHHHHHHcCCCCCCCcccc-c--Cc-------ccccCCCCCCcccccccccCCCCCcccccccccchhhh Q lcl|NC_016071. 451 PTVINKILEVGGFDEEIPEDMS-T--DE-------LLKLLGQDTSRSGDGMTAGSNGNGTGKISSTRDNSVSN 513 (516) Q Consensus 451 ~~~~~~i~e~~Glp~~~~~~~~-~--~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~ 513 (516) .+.+++.+|+.+...++.. . .. ...+. 
+..+.+.+....+...+.+.+++++.-....+ T Consensus 412 ---~et~~~~lg~~~d~~~e~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 480 (480) T protein:vir:78 412 ---KEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTT-KAQADATPKPTVTETKTETQTSPSGFNRTKTR 480 (480) T ss_pred ---HHHHHhcCCCCHhHHHHHHHHHHHHHHHHHHHhhccc-cCCCccccCCCCCCCCCccCCCcccCCCcCCC Confidence 4678888998643211100 0 00 00000 00000111111111111111111111111111 No 133 >protein:vir:105782 Length: 449 # NCBI annotation: gp5 # Family: family:all:6783 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224143;genbank:gi:62362218;genbank:GeneID:3342535 Probab=97.46 E-value=6.3e-05 Score=43.63 Aligned_cols=415 Identities=12% Similarity=0.050 Sum_probs=146.9 Q ss_pred CCccccCcccccchhhhcccCCCCccc------ccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhhChHHHHHHHHH Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTG------ELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQDHTVSTALDTK 74 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~------e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~R 74 (516) ||++..-+++-.- ..+.+...|.+ -.|+---+.+..+ -....+...+.+..|+ ++.-...++++- T Consensus 1 ~~~~~~~~~~~~~---~~~~~~~~rd~l~~~~~glg~~r~~~~~~~---g~~~~~~~~~l~~~Yr---~~~ia~~iVd~~ 71 (449) T protein:vir:10 1 MTDKLTLAVNHAL---NDARMARARMGLMVPTMGLDNKRHSAWCEY---GFPELVTYENLYSLYR---RGGIAHGAVEKL 71 (449) T ss_pred CchhhHHHHhhhc---chhHHHHHHHHHHHHHhcCCcccchhhhhc---CCcccCCHHHHHHHHh---cCchhHHHHHhh Confidence 8887443221110 00011111110 1111111112111 1111122222222222 355666666654 Q ss_pred HHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHHHHHhhcceeeeEEEeecccccc-c-cccee Q lcl|NC_016071. 75 YVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAATFNEYGFSIFEKVYRTESAPSK-Y-AGYIT 152 (516) Q Consensus 75 k~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~lda~~~G~S~~Eivw~~~~~~~~-~-~g~~~ 152 (516) -... ...|...++ +-+.+..+.. ..++..++.+....-|..+....-.+.+||++++=+.- .++..+. | +.--. 
T Consensus 72 ~d~~-~~~~~~i~~-g~~~~~~~~~-~~~e~~~~~l~~~~~~~~l~ea~~~~rl~Gga~i~i~v-~d~~~l~~Pl~~~~~ 147 (449) T protein:vir:10 72 VGKC-WQTNPEIIE-GDDADDSEDE-TSWEKKSKQVFTNRLWRSFAEADRRRLVGRYAGILLHI-RDEKDWNLPATKGRG 147 (449) T ss_pred hhhh-hhcCccccc-Cccccchhhh-HHHHHHHHHHHHHHHHHHHHHHHHhhhccCcEEEEEEe-cCCCCCCcccccCcc Confidence 3322 234433222 2222221111 12222222221111144444344456688999863322 1222211 1 00012 Q ss_pred eccccccCchhcccccceeecCCCceeeecccccccccccccccccccccccccccc---ccCCCccccccccEEEEeec Q lcl|NC_016071. 153 IDKIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNL---TSSADEVFIPINKLMVMSLG 229 (516) Q Consensus 153 ~~~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~iP~~k~i~~~~~ 229 (516) +.+|.+.....|.... ..+++....+ +.|..+.+.. .....+..|=+.+++.+... T Consensus 148 i~~i~v~~~~~i~~~~-------------~~~dp~sp~y--------g~P~~y~v~~~~~g~~~~~~~iH~SRl~~~~~~ 206 (449) T protein:vir:10 148 LQKVSVSWAGSLKVAE-------------WDTGINSKTY--------GQPKLWKYTERLPNGSSRRVDIHPDRVFILGDY 206 (449) T ss_pred eeeEEeeccccCChhh-------------hhcCCCCCCC--------CCceEEEEeeeccCCCccceeeccceeEeecCC Confidence 2223322211111000 0111111111 2222222111 11122233444555544322 Q ss_pred CcCCccccchhHHHHHHHHH-H-HH--HHHHHHHH--------HHhh--ccccceeeeecccccccccCCCCHHHHHHHH Q lcl|NC_016071. 230 GTESNPAGVSPLVGCYRAFR-E-KI--LIENLETI--------GASK--DLGGIIELKIPSQILNKAAIDPKSPESEMVQ 295 (516) Q Consensus 230 ~~~g~p~G~gLlr~~~~~~~-f-K~--~~~~~w~~--------~~er--~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~ 295 (516) +. .|.++|+++|-..+ + |. ..-.-|+. -.+| +..++ ...+ +... 
.+..+ T Consensus 207 ~~----~g~~~L~~~yn~l~~~~~~~~~~a~~~l~~~~rq~~~~~~~~~~~~~l------~~~~-----~~~~--e~~~~ 269 (449) T protein:vir:10 207 SE----DAIGFLEPAYNAFVSLEKVEGGSGESFLKNAARQLNVNFEKEIDFTNL------ASLY-----GVSI--DELQD 269 (449) T ss_pred CC----CChhHHHHHHHHhhhHHHhhhhHHHHHHHHHHHHHhhhhhhhhhhhhh------hHHh-----hCCc--hHHHH Confidence 22 27789999985321 0 10 00001111 1111 11111 1111 1111 11122 Q ss_pred HHHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhccccc--ccCCccchhh Q lcl|NC_016071. 296 GLMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFIN--LGNDGQGSYN 373 (516) Q Consensus 296 ~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLt--s~~~~~GS~A 373 (516) .+.+.++.+..|.+..++...+ +++.++.+=+| ...+++..=.++|-++ +-.+| .+.+- |..+ T Consensus 270 ~~~~~~~~~~~~~~~~~i~~~~---------d~~~~~~~~sg----l~d~l~~~~q~iaaa~-~IP~t~L~Gqsp-~gln 334 (449) T protein:vir:10 270 KFNEVAGEINRGNDVLMTTQGA---------TVTPLVTSVAD----PTATYNVNLQTAAAGV-DIPTRILIGNQQ-AERS 334 (449) T ss_pred HHHHHHHHHhccchheeecCCc---------ceEEEecccCC----hhHHHHHHHHHHHHHh-CCCeeeeeccCc-cccc Confidence 2333343344455544443221 24444443333 3345554445565544 33322 22222 2233 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhH-------HHHHHHHHHHHhCCc Q lcl|NC_016071. 374 LSESKQSIHGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDM-------EGFSKFVQRIGAVGY 446 (516) Q Consensus 374 l~~vh~ev~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl-------~~~a~~~~~L~~~G~ 446 (516) ..+ -...+.+.+.+-...+.-.|. .|+..|+...+ +....--.|.|......+- +..|++++++++.|. 
T Consensus 335 st~-D~~nyyd~i~~~Q~~l~p~le-~l~~~l~~s~~--g~~~~d~~i~f~pL~~~t~kEkAei~k~~A~a~~~~~~ag~ 410 (449) T protein:vir:10 335 STE-DQKYFNARCQSRRVDLSFEIE-DFCDKLIELKI--IDAVAKKAVIWDDLNEQTGTEKLTNAKTMGEINQTMLGSGD 410 (449) T ss_pred cch-hHHHHHHHHHHHHHhhhHHHH-HHHHHHHHhhc--CCCCCceeEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHccc Confidence 222 234566666665555556664 46776666543 2211112445543333332 334667888888884 Q ss_pred ccccHHHHHHHHHHcCCCCCCCcccccCcccccCCCCCCcccccccccCCCCCccccccc Q lcl|NC_016071. 447 LPKTPTVINKILEVGGFDEEIPEDMSTDELLKLLGQDTSRSGDGMTAGSNGNGTGKISST 506 (516) Q Consensus 447 ~~~~~~~~~~i~e~~Glp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 506 (516) ..+ .+.+.+|+..|...+..++...+. .+. ++.+..+++ T Consensus 411 ~~~--~~~~EiR~~~~~~~~~~~~~~~e~---------~de----------~~~~~d~~a 449 (449) T protein:vir:10 411 NPA--FSREEIRTAAGYDNDDEEPLGEED---------GDE----------EDKATDSAA 449 (449) T ss_pred cCC--cCHHHHHHHhcccCCCCCCCCCCC---------Ccc----------ccccCCcCC Confidence 322 124679999998754332211100 000 000111111 No 134 >protein:vir:95014 Length: 491 # NCBI annotation: structural protein # Family: family:all:584 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224035;genbank:gi:62327322;genbank:GeneID:5176842 Probab=97.16 E-value=0.00015 Score=41.58 Aligned_cols=432 Identities=13% Similarity=0.044 Sum_probs=174.1 Q ss_pred CCccccCcccccchhhhccc-CCCCcccccchHHHHHHHHH-HHhhcccccCC-c--ccHHHHHHHh-h---ChHHHHHH Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLA-VSRLRTGELGSGALSQLRAE-SEVMKVEELRW-P--CFLATVEAMK-Q---DHTVSTAL 71 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~-~~~~~~~e~g~~~~~~~~~~-~~~~~~~~lr~-~--~~~~~y~~m~-~---D~~v~s~l 71 (516) |-|---+.+.|..+--++-. .|.+.+- .....|. .-......|.. 
+ ..=.-|+.-+ + =+++...+ T Consensus 1 ~~~~~~~~~~V~~~hp~y~a~~~~W~~i------rd~~~G~~~~~~r~~yl~~~~~~~~e~~Y~~rl~rA~~~n~~~~tl 74 (491) T protein:vir:95 1 MLTANGQGSGVKTKHREWLHYAPKWQKV------RHALAGDLVGYLRNVGLNEPDKAYGEARQAEYEAGGIVYNFTRRTL 74 (491) T ss_pred CcccCCccCCCCccCHHHHHHHHHHHHH------HHHhcCcchhhcccCCCcCCCCCCCHHHHHHHHhcccCCChHHHHH Confidence 65555444444443332211 1111100 0111110 00011111221 1 1112254433 2 47778888 Q ss_pred HHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccC-cCCHHHHHHHHH-HHHhhcceeeeEEEeecccc----- Q lcl|NC_016071. 72 DTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLAN-QQTLRDIARSAA-TFNEYGFSIFEKVYRTESAP----- 144 (516) Q Consensus 72 ~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~-~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~----- 144 (516) +.--..|.+.++.++++ .. ++.++++... ..+++.+++.++ .++.||.+.+=+-+-..++. T Consensus 75 ~~l~G~vfrk~p~~~~p--------~~----l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~T~Ade 142 (491) T protein:vir:95 75 SGMVGSVMRKEPEINIP--------KE----LEYLLKNADGSGVGLIQHAQDTLMEIDSVGRGGLLVDAPETAAATAAEQ 142 (491) T ss_pred HHHhchhhcCCceeecc--------HH----HHHHHhccCCCCCCHHHHHHHHHHHHHHcCeEEEEEecCCCcccCHHHH Confidence 87777787777766432 12 2333443322 246888888866 58889998876665333210 Q ss_pred ----cccccceeeccccccCchhcccccceeecCCCc----eeeeccccc----ccccccccccccc------------- Q lcl|NC_016071. 145 ----SKYAGYITIDKIAFRPQSSLSRSKPWVFDEDGR----TLKGIYQSK----MAFANFQNGLTQI------------- 199 (516) Q Consensus 145 ----~~~~g~~~~~~l~~r~q~ti~~~~~f~~~~dg~----~l~~~~q~~----~~~~~~~~~~~~~------------- 199 (516) .+| + +....+..|-. |.++..|. ..+.++... 
....+......++ T Consensus 143 ~~~~~rP--y-----~~~~~~~~Iin---W~~~~v~g~~~L~~v~l~E~~~~~d~~~~f~~~~~~qyRvL~l~~~g~~~~ 212 (491) T protein:vir:95 143 NAGLLNP--T-----IAFYTTENIVN---WRLTRVGSVNRVTMVVLRETWEYHEPGNEFETKYGEQYRVLDIDTDGNYRQ 212 (491) T ss_pred HHhcCCc--E-----EEEechhhhcC---ceeeeeCCceeeeEEEEEEeEEeecCCCCcccceEEEEEEEeecCCCceEE Confidence 011 1 11122222221 44443321 111111110 0000000000000 Q ss_pred --------ccccccccccccCCCccccccccEEEEeecCcCCccccchhHHHHHH----HHHHHHHHHHHHHHHHhhccc Q lcl|NC_016071. 200 --------SSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPAGVSPLVGCYR----AFREKILIENLETIGASKDLG 267 (516) Q Consensus 200 --------~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~G~gLlr~~~~----~~~fK~~~~~~w~~~~er~g~ 267 (516) +..............+..++.=-|+.+- ....+-..+...|..++. ||... .--.+..+ .-+. T Consensus 213 ~v~r~~~~g~~~~~~~~~~~~~g~~~l~~IPfv~~~-~~~~~~~~~~pPLl~LA~lni~Hy~~s--sd~~~~l~--~~~~ 287 (491) T protein:vir:95 213 RLFRFDAEGGAQEEVVEIYPDLGESLRGVIPFTFIG-ATNNDATIDDAPLLPLAELNIGHYRNS--ADNEESSF--VVGQ 287 (491) T ss_pred EEEEEcCCCcceeeeeeeeecCCCcccCeeEEEEEe-cCCCCCCCCcCchHHHHHHHHHHhhhh--hHHHHHHH--Hccc Confidence 0000000000000111122222233332 223344445555555554 33222 11222222 2345 Q ss_pred cceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHH Q lcl|NC_016071. 268 GIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVN 347 (516) Q Consensus 268 ~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~ 347 (516) |++++++... .+ ++ .+.... 
...++-|..++..+|.+.+ ..++++++++- .++.++ T Consensus 288 P~l~~~G~d~--------~~-~~--~~~~~~--~~~i~~g~~~~~~lP~~~~--------~~~ie~~~~~~---~~~~l~ 343 (491) T protein:vir:95 288 PTLFIYPGDN--------LT-PQ--SFKEAN--PNGIKFGSRCGHNLGYGGS--------AQLIQAGENNL---ARQNML 343 (491) T ss_pred ceeeeecCcc--------cC-cc--hhhccC--cceeEecCcCCcCCCCCCc--------cceeecCcchH---HHHHHH Confidence 6665554210 00 00 011011 1123457788888888753 45556655432 233334 Q ss_pred HHHHHHHHHHhcccccc-cCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceE--Eec Q lcl|NC_016071. 348 SRKKAILDRFGAGFINL-GNDGQGSYNLSESKQSIHGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKL--KPG 424 (516) Q Consensus 348 ~~d~~Isk~iLGqtLts-~~~~~GS~Al~~vh~ev~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~--~~~ 424 (516) -...+| +.+|-.|.. +...+++. ..........++.+-+..+++.++ +++++++.+-+.. .+ .-+.| ..+ T Consensus 344 ~~e~qm--~~~Ga~l~~~~~~~Ta~~--~~~~~~~~~S~L~~~a~~~e~al~-~~l~~~a~w~G~~-~~-~~v~i~~n~d 416 (491) T protein:vir:95 344 DKEQQA--IQIGAQLITPSQQITAES--ARIQRGADTSVMATIARNVSQAYT-DALRWVAMMLGKP-ED-SEVEFQLNMD 416 (491) T ss_pred HHHHHH--HHHHHHhccCCcchhHHH--HHHHHHHhhHHHHHHHHHHHHHHH-HHHHHHHHHcCCC-CC-CceEEEeecc Confidence 333333 334433332 21112222 222334446678888899999996 5889999986321 11 11222 222 Q ss_pred C-cCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccccCcccc--cCCCCCCcccccccccCCCCCcc Q lcl|NC_016071. 425 L-IQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMSTDELLK--LLGQDTSRSGDGMTAGSNGNGTG 501 (516) Q Consensus 425 ~-~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~ 501 (516) . ....| ....+++-++...|.+.. .....++ ++.||+.+..+++....... +.+..++.+++...++.... 
T Consensus 417 F~~~~~~-~~~~~all~~~~~G~is~-~t~~~~L-~~~~vl~~~~e~~~~~ie~~~~~~~~~~~~~~~~~~~~~~~~--- 490 (491) T protein:vir:95 417 FFLQPMT-AQDRAAWMADINAGLLPA-TAYYAAL-RKAGVTDWTDEDILNAIEDAPLPSGAVTQVAGEIPQAAQQQQ--- 490 (491) T ss_pred cccccCC-HHHHHHHHHHHhcCCCCH-HHHHHHH-HhCCCCCccHHHHHHHHHhcCCCCCccccccccchhhhhhcc--- Confidence 2 12223 234567777788897654 3344444 55688865443332221111 12222233333333222111 Q ss_pred c Q lcl|NC_016071. 502 K 502 (516) Q Consensus 502 ~ 502 (516) + T Consensus 491 ~ 491 (491) T protein:vir:95 491 E 491 (491) T ss_pred C Confidence 1 No 135 >protein:vir:79538 Length: 502 # NCBI annotation: putative portal protein # Family: family:all:47 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272517;genbank:gi:148609386;genbank:GeneID:5204374 Probab=97.11 E-value=0.00017 Score=41.28 Aligned_cols=445 Identities=11% Similarity=-0.010 Sum_probs=181.8 Q ss_pred CCc----cccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhh-ChHHHHHHHHHH Q lcl|NC_016071. 1 MST----RFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQ-DHTVSTALDTKY 75 (516) Q Consensus 1 ~~~----r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~-D~~v~s~l~~Rk 75 (516) +|= |-..+-.... .....+.++.. + +.... +-.+.+-.+.+ .....--+++.+ ++++.+++++.. T Consensus 11 ~sP~~~~~R~~ar~~~~-~y~aa~~~r~~-~--~~~~~----~s~~~~~~~~~--~~lr~RaRdl~rNn~~a~~av~~~~ 80 (502) T protein:vir:79 11 FSPGWKAARLRSRAVIQ-AYEAVKTTRTH-K--ARREN----RTADQLSQYGA--VSLREQARYLDNNHDLVIGVFDKLE 80 (502) T ss_pred cChHHHHHHHhhHHHHh-hccccCccccc-C--CCCCC----CChHHHHHHHH--HHHHHHHHHHHhcChHHHHHHHHHH Confidence 110 0000000000 11111111100 0 00000 00011111110 001111244444 999999999999 Q ss_pred HHHhcC-CceeeeCCCCCC-hhhHHHHHHHHHHHhhc------cCcCCHHHHHHHHHHH-HhhcceeeeEEEeecccccc Q lcl|NC_016071. 76 VFVTKA-FNDFKVLYNRDS-KASKDAAEFVEYALKNL------ANQQTLRDIARSAATF-NEYGFSIFEKVYRTESAPSK 146 (516) Q Consensus 76 ~~v~~~-~w~i~~~~~~d~-~~~~~~a~~v~~~l~~~------~~~~~~~~~l~~~lda-~~~G~S~~Eivw~~~~~~~~ 146 (516) ..|-+. 
-+.+.+.+...+ ..+++.++.|+..|+.. ....+|+.+...++.+ +.-|=.++-++|.+.... T Consensus 81 ~nvVG~ggi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~~~~~~~~-- 158 (502) T protein:vir:79 81 ERVVGKNGIIVEPHPVLRNGAIARDLAAEIRTRWSEWSVSPEVTGQFTRPMLERLMLRTWLRDGEVFAQMVSGRINSL-- 158 (502) T ss_pred HhhccCCceeeeeccCCCChhHHHHHHHHHHHHHHHhhcCcCccccCCHHHHHHHHHHHHHhCCceEEEEeecccCcc-- Confidence 999876 455555444333 34566777777777643 3356788888887754 556988988888654311 Q ss_pred cccceeeccccccCchhccccc--------ceeecCCCceeeeccccccccccccccccccccccccccccccCCCcccc Q lcl|NC_016071. 147 YAGYITIDKIAFRPQSSLSRSK--------PWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFI 218 (516) Q Consensus 147 ~~g~~~~~~l~~r~q~ti~~~~--------~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i 218 (516) .+|..+.-+|...++.-|.-+. =..||++|+.+-.+-. .. +|.. ....+-+.| T Consensus 159 ~~g~~~~l~lq~iepd~l~~~~~~~~~i~~GVe~d~~Gr~~aY~i~-----------~~--hPgd------~~~~~~~rv 219 (502) T protein:vir:79 159 TPSAGVHFWLEALEPDFIPMTSDESNRLNQGVFVDDWGRPEKYLVY-----------KS--RPVS------GRQMETKEV 219 (502) T ss_pred CCCcccceEEEEecchhcCCCCCCCCeeEeeeEECCCCceEEEEEe-----------ec--CCCC------CcccceeEe Confidence 1222222233333333332110 0234444443221100 00 0000 011233567 Q ss_pred ccccEEEEeec-CcCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceee-e--ecccccccccCCCCHHHHHHH Q lcl|NC_016071. 219 PINKLMVMSLG-GTESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIEL-K--IPSQILNKAAIDPKSPESEMV 294 (516) Q Consensus 219 P~~k~i~~~~~-~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~-~--~pp~~~~k~~~~~~~~~~~~l 294 (516) |... |+|.+. .+.+..-|.+.|.++.....-........++... -.+-|..+ + .+.........++..... 
T Consensus 220 pA~~-vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~~~dael~~a~-i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~--- 294 (502) T protein:vir:79 220 DAER-MLHLKFVRRLHQMRGTSLLSGVLIRLSALKEYEDSELTAAR-IAAALGMYIRKGDGQSYEPDGNGSKENERE--- 294 (502) T ss_pred chhh-eEEeecccCCccccCCchHHHHHHHHHHHhHHHHHHHHHHH-HhhhheeeeecCCCcccccccCCCCCcccc--- Confidence 7765 566665 5688889999999988765544444333333221 12222111 1 111111000000000000 Q ss_pred HHHHHHHHHhhcccceEEE---eccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhc--ccccccCCcc Q lcl|NC_016071. 295 QGLMADAANAHAGEQAYFI---LPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGA--GFINLGNDGQ 369 (516) Q Consensus 295 ~~l~~~~~~~~~g~~a~~i---iP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLG--qtLts~~~~~ 369 (516) ..+ ..|.+ ++.|. +|++.+.+.. ..+|..|++..-++|+.++.- +.||.+- + T Consensus 295 -------~~l----~pG~i~~~L~pGe--------~i~~~~p~~p--~~~~~~f~~~~lr~iaaglGi~ye~lt~D~-s- 351 (502) T protein:vir:79 295 -------LTI----QPGIIYDDLKPGE--------EIGMVKSDRP--NPNLETFRNGQLRAVAAGSRLSFSSTARNY-N- 351 (502) T ss_pred -------ccc----cCCccccccCCCc--------eeeeeCCCCC--CCCHHHHHHHHHHHHHhhcCCCHHHHhccc-c- Confidence 001 11222 34454 5666555433 335888999999999888643 3456543 2 Q ss_pred chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---hcCCc-C----CccccceEE--ecCcCchhHHHHHHHHH Q lcl|NC_016071. 370 GSYNLSESKQSIHGHFVQRDIDIIVEAFNKNLIPQLLA---LNDIR-L----SDEDMPKLK--PGLIQEVDMEGFSKFVQ 439 (516) Q Consensus 370 GS~Al~~vh~ev~~~~~~aDa~~i~~~ln~~li~~lv~---lN~~~-~----~~~~~P~~~--~~~~~~~dl~~~a~~~~ 439 (516) +||+.+..-..-+....+.....+...+-+-+...+++ +++.- . ....+.... -..-...|..+-+++.. 
T Consensus 352 ~nySs~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~~W~~p~~~~iDP~Ke~~a~~ 431 (502) T protein:vir:79 352 GTYSAQRQELVESTDGYLILQDWFIGAVTRPMYRAWLKQAVASGVIRLPRDLDRSSLYTAVYSGPVMPWIDPVKEAEAWK 431 (502) T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCCCCchhhcceeeecCCccccChHHHHHHHH Confidence 46765554443344444444444444333333333222 33210 1 111121222 23333456666677888 Q ss_pred HHHhCCcccccHHHHHHHHHHcCCCCCCCcccccCc----ccccCCCCC-CcccccccccCCCCCcccccccc-cchhh Q lcl|NC_016071. 440 RIGAVGYLPKTPTVINKILEVGGFDEEIPEDMSTDE----LLKLLGQDT-SRSGDGMTAGSNGNGTGKISSTR-DNSVS 512 (516) Q Consensus 440 ~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~~~----~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~-d~~~~ 512 (516) ..++.|+.... +.+++ .|.....--++-... ...-.+-+. +....+... ..++.+.++.. +++.. T Consensus 432 ~~i~~Gl~t~~----~~~a~-~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~~~~---~~~~~~e~~~~~~~~e~ 502 (502) T protein:vir:79 432 IQIRGGAATES----DWVRA-GGRNPDDVKRRRKAEIDENRKLDLVFDTDPASDKGGSS---AATKRQEPQHTDDQSEE 502 (502) T ss_pred HHHHcCCCCHH----HHHHH-cCCCHHHHHHHHHHHHHHHHHcCCCCCCCCCCCCCCCC---CCCCCCCCCCCCCCCCC Confidence 88899986642 23333 355321100000000 000000000 000000000 00000001111 11111 No 136 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=97.02 E-value=0.00021 Score=40.78 Aligned_cols=452 Identities=10% Similarity=-0.031 Sum_probs=153.8 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhh--cccccCCcccHHHHHHHhhChH-----------H Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVM--KVEELRWPCFLATVEAMKQDHT-----------V 67 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~--~~~~lr~~~~~~~y~~m~~D~~-----------v 67 (516) |+.-.+..+..+ .. ++.+...|. ..+....... 
+.+.+ -++.+.|+--..-.+ + T Consensus 1 ~~~~~~~~~~~~---~~---~~~l~~~e~-----~~i~~L~~~~~~~~~r~--~~l~~YY~G~~~i~~~~~~~p~~~~~~ 67 (504) T protein:vir:99 1 MTEETTSASKFT---FR---IPELNDDVV-----DKVNGLYQQLVDRTPRN--LLRASFYDGKYAIRQIGNLIPPEYLRT 67 (504) T ss_pred CCccCCcccccc---cc---cCCCCHHHH-----HHHHHHHHHHHHHhHHH--HHHHHHHhccccchhccccccHHHHHH Confidence 443322222111 11 223332331 1111111111 11111 111122211000000 0 Q ss_pred HHHHHHHHHHHhcCC----c-eeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHH-HHHHHhhcceeeeEEEeec Q lcl|NC_016071. 68 STALDTKYVFVTKAF----N-DFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARS-AATFNEYGFSIFEKVYRTE 141 (516) Q Consensus 68 ~s~l~~Rk~~v~~~~----w-~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~-~lda~~~G~S~~Eivw~~~ 141 (516) .+++.==+..|..+. . -|.++. ++.. .+.+.++|+. ..|.....+ ..+|+-||.|+ ++||.-. T Consensus 68 ~~v~n~~~~iVd~~a~rl~~~Gf~~~d--~~~~----~~~l~~i~~~----N~ld~~~~~~~~~a~iyG~af-~~v~~~~ 136 (504) T protein:vir:99 68 ATVLGWSAKAVDTLARRCNLESFVWPD--GDYG----SIGGPDVWDE----NFFATKANNAMVSSLIHGPAF-LINTEGG 136 (504) T ss_pred hhccCcHHHHHHHHHhhhccceeeCCC--CChh----hHHHHHHHHh----cChhhHHHHHHHHHHhhCcee-EEEecCC Confidence 111111112222210 0 222221 1111 1223444432 225555544 44788999976 6888654 Q ss_pred ccccc-------c-ccceeeccccccCchhcccccceeecCCCceeee---ccccccccccccccccccccccccccccc Q lcl|NC_016071. 142 SAPSK-------Y-AGYITIDKIAFRPQSSLSRSKPWVFDEDGRTLKG---IYQSKMAFANFQNGLTQISSAMSLVTNLT 210 (516) Q Consensus 142 ~~~~~-------~-~g~~~~~~l~~r~q~ti~~~~~f~~~~dg~~l~~---~~q~~~~~~~~~~~~~~~~~~~~~~~~~~ 210 (516) .+... | +.+..++....++..-+ +++..+.+|..... .......+..... ..+..... 
T Consensus 137 d~~~~~~I~~~sP~~~~~iyD~~~~~~~~a~---~~~~~d~~g~~~~~~~y~~~~~~~~~~~~~--------~~~~~~~~ 205 (504) T protein:vir:99 137 AGEPDSLIHVKSAMQATGEWNSRRNAMDSLL---SITSRDAEGHPTGIALYEDGVTVTADMDDD--------GDWHADVR 205 (504) T ss_pred CCCceeEEEEeccceeEEEEeCCCCceeEEE---EEEEecCCCeEEEEEEEcCCcEEEEEEcCC--------ceeeeccc Confidence 33211 1 11122222222222211 12333444432211 1111111100000 01111112 Q ss_pred cCCCccccccccEEEEeecCcCCccccchhHH-HHHHHHHHHHH--HHHHHHHHHhhccccceeeeecccccccccCCCC Q lcl|NC_016071. 211 SSADEVFIPINKLMVMSLGGTESNPAGVSPLV-GCYRAFREKIL--IENLETIGASKDLGGIIELKIPSQILNKAAIDPK 287 (516) Q Consensus 211 ~~~~~~~iP~~k~i~~~~~~~~g~p~G~gLlr-~~~~~~~fK~~--~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~ 287 (516) ....| +| ++.|.++.+.+.|+|.+-+- .+-. +.... .+..-+...+-+..|-.++.|. ...+.. T Consensus 206 ~~~~g--vP---vV~~~n~~~~~~~~G~sei~~~v~~--l~Da~~~~~~~~~~~~e~~a~p~r~i~G~------~~~~~~ 272 (504) T protein:vir:99 206 THKLG--VP---VEVLPYKPREDRPLGSSRITRPVMS--LQQRALKGCIRMDGHADVYSFPQLILLGA------DAKNFR 272 (504) T ss_pred cCCCC--cc---eEEecccccCccccCcccchhhHHH--HHHHHHHHHHHHHHHHHHhcchhhhhccC------Cccccc Confidence 22223 45 57788888889999988542 2211 11111 1111223344445554444431 111110 Q ss_pred HHHHHHHHHHHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcc--ccc-c Q lcl|NC_016071. 288 SPESEMVQGLMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAG--FIN-L 364 (516) Q Consensus 288 ~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGq--tLt-s 364 (516) ..+..........+. .-..+|.+.+..+.+..+.++...+.+ +...|...++.+-.+||..---. 
.|- + T Consensus 273 ~~d~~~~~~~~~~~~-------~i~~~~~~~~~~~~~~~~~~~~q~~~~-~l~~~~~~l~~~i~~~a~~t~~P~~~lG~~ 344 (504) T protein:vir:99 273 NKDGSMKPAWQIALA-------RVFALPDDEDEPDAARARADVKQFPAS-SPQPHIEMLEQIAMMFSGETSIPVESLGFS 344 (504) T ss_pred cccccccchhhhhhh-------hhhcCCCccccccccCccceeeecCCC-ChHHHHHHHHHHHHHHHhhhCCCHHHhccc Confidence 111110111111111 112344433322222223333333222 22234444444444443211110 110 1 Q ss_pred cCCccchh-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh--cCCcCCcc-ccceEEecCcCchhHHHHHHHHHH Q lcl|NC_016071. 365 GNDGQGSY-NLSESKQSIHGHFVQRDIDIIVEAFNKNLIPQLLAL--NDIRLSDE-DMPKLKPGLIQEVDMEGFSKFVQR 440 (516) Q Consensus 365 ~~~~~GS~-Al~~vh~ev~~~~~~aDa~~i~~~ln~~li~~lv~l--N~~~~~~~-~~P~~~~~~~~~~dl~~~a~~~~~ 440 (516) +..+.+|- |+. ....-....++.-.+.+...+. ++++..+.+ |....+.. .-..++|......++.+.|+++.| T Consensus 345 ~~~n~sSa~Ai~-~~~~~L~~ka~~k~~~f~~~l~-~~~rla~~~~~~~~~~~~~~~~~~v~w~d~~~~s~a~~aDa~~K 422 (504) T protein:vir:99 345 NRANPTSADAYI-ASREDLIAEAEGATDDWSPAFR-RSMIRALAIKNGLDRIPPEWKTIDSKFRSPLYLSKAAQADAGAK 422 (504) T ss_pred ccccccHHHHHH-HHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhcCCCccccccccceeEecCCCccCHHHHHHHHHH Confidence 11122333 332 2223333334444455666664 455554444 32111221 124667888889999999999999 Q ss_pred HHhCCcccccHHHHHHHHHHcCCCCCCCc----cccc----CcccccCCCCCCcccccccccCC----CCCccccccccc Q lcl|NC_016071. 441 IGAVGYLPKTPTVINKILEVGGFDEEIPE----DMST----DELLKLLGQDTSRSGDGMTAGSN----GNGTGKISSTRD 508 (516) Q Consensus 441 L~~~G~~~~~~~~~~~i~e~~Glp~~~~~----~~~~----~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~d 508 (516) |+..|.....+ .+.+.+.+|+++.+-+ +... ....+.....+.+.+++.....+ ...++.++.++. T Consensus 423 l~~ag~~l~~~--~~~l~~~lg~~~~ei~r~~~e~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~e~a~~~~~~~~~~p 500 (504) T protein:vir:99 423 MLGAGPEWLKE--TEVGLELLGLTPQQAKRALAERRRASSVSIIEALNRRQQEAATAGEDQDQGAGEPPANEPPAALGRP 500 (504) T ss_pred HHhhccccccc--hHHHHhhcCCCHHHHHHHHHHHHHHhhHHHHHHHhcccCCCCCCCCCCCcCCCCCCCCCCCccCCCc Confidence 99998532111 3457788899743211 0000 00011111111111111110000 000111111111 Q ss_pred chhh Q lcl|NC_016071. 
509 NSVS 512 (516) Q Consensus 509 ~~~~ 512 (516) +-.- T Consensus 501 ~~~~ 504 (504) T protein:vir:99 501 TLVG 504 (504) T ss_pred ccCC Confidence 1111 No 137 >protein:vir:5839 Length: 533 # NCBI annotation: similar to portal vertex protein of head # Family: family:all:1036 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835625;genbank:gi:30044028 Probab=96.96 E-value=0.00023 Score=40.49 Aligned_cols=434 Identities=12% Similarity=0.042 Sum_probs=165.2 Q ss_pred CCccccCcccccchhh-hcccCCCCcccccchHHHHHH-HHHHHhhcccccC-CcccHHHHHHHh-hChHHHHHHHHHHH Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGN-ENLAVSRLRTGELGSGALSQL-RAESEVMKVEELR-WPCFLATVEAMK-QDHTVSTALDTKYV 76 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~-~~p~~~~~~~~e~g~~~~~~~-~~~~~~~~~~~lr-~~~~~~~y~~m~-~D~~v~s~l~~Rk~ 76 (516) ||.=.--+.+-...|+ +.| . .+- +..+...+ +++. ....+ ..++|+.|++|. .++.|.++++-+-. T Consensus 20 ~~~~~~~~~p~~~dG~s~i~---~-~~~--~~~~~~~~~~~~~----gg~~~n~~eLI~~YR~ma~~~pEVd~AideIvn 89 (533) T protein:vir:58 20 LSPMYGMGAPHGAGGSSMIP---I-NMY--HPFATAGYASRFY----GGIEFNRFFLYDMYDRMDYTDPLISTVLDIIAD 89 (533) T ss_pred hchhhcccCccCCCCCcccc---C-CCC--cchhhhhhhhhhh----ccccccHHHHHHHHHHhhccCcchhhHHHhhhc Confidence 2222211111111121 111 1 001 11111111 1111 11222 234799999995 69999999997755 Q ss_pred HHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHHHHHhhcceeeeEEEeecccccccccceeeccc Q lcl|NC_016071. 77 FVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAATFNEYGFSIFEKVYRTESAPSKYAGYITIDKI 156 (516) Q Consensus 77 ~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~lda~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l 156 (516) .+. ++. ..+++.. 
-.|++.......-+-|..+|+.--+||..+= .|+-||++..+++ T Consensus 90 eai------v~d-~~~~pV~--------v~l~~~e~s~~iK~kI~~lldf~~~~~~~fR--------~WYVDGriy~Hki 146 (533) T protein:vir:58 90 ECT------IPN-ENGNIVD--------VVTKDIELAKAILSYLDYVINIEKNAYPIIR--------NMIKYGDMFLHIL 146 (533) T ss_pred eee------Eec-CCCceeE--------eecccccccHHHHHHHHHHhcchhhhhHHHH--------hhhhcceeEEEec Confidence 332 221 1122110 0011111111122334445555555555441 2444556555554 Q ss_pred cccCchhccc-----cccee--ecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeec Q lcl|NC_016071. 157 AFRPQSSLSR-----SKPWV--FDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLG 229 (516) Q Consensus 157 ~~r~q~ti~~-----~~~f~--~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~ 229 (516) ...|...|.+ ++.+. ++.... ..+..+...+ ........++.||.+..+++.++ T Consensus 147 ik~~k~GI~elr~lDPr~i~~vr~~~t~---------~eyyvy~~~~----------~~~~s~~~~~kI~~daI~y~~SG 207 (533) T protein:vir:58 147 EKGSDGTIEKFQVVSPYIFSKRYNPETD---------TWYYVITDVY----------RNVVSGYFNEDIPEEDVIHFSHK 207 (533) T ss_pred cCCcccchhhheecCCeeeEEEEeeccc---------eEEEeecccc----------cccccCccccccchhheeeeeec Confidence 4333333321 11111 111000 0111111111 11122345688998776655555 Q ss_pred -CcCCccccchhHHHHHHHHHHHHHHHHHHHHH-----HhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHH Q lcl|NC_016071. 230 -GTESNPAGVSPLVGCYRAFREKILIENLETIG-----ASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAAN 303 (516) Q Consensus 230 -~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~-----~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~ 303 (516) .....+++.|.|+++..++=--+....--+++ -||--+.+-|+-.| .....+. +..++.. 
T Consensus 208 l~d~~~~~iisyLhkAiKp~NQLkmiEDAlVIYRisRAPeRRvFYIDVGNlp-----------k~KAeqY---l~~im~k 273 (533) T protein:vir:58 208 IDTNFFPYGRSYLESARAIWNQLRLMEDALMLYRVVRSVDRRVFYVDVGNVP-----------PDKINEY---LTNIAMQ 273 (533) T ss_pred cccCCCCceehhhhHHHHHHHHHHHHHHHHHHHhhcCChhheEEEEeecCCC-----------ccCHHHH---HHHHHHh Confidence 45577999999999988876544433322222 12222222222222 1112222 2222222 Q ss_pred hhc----ccceEEEe--ccCc-----cc-------ccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhccccccc Q lcl|NC_016071. 304 AHA----GEQAYFIL--PSDM-----NA-------QGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLG 365 (516) Q Consensus 304 ~~~----g~~a~~ii--P~g~-----~i-------~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~ 365 (516) ++. .+..|-|- -+-| .- +.....+|+.+. | |.. .-.+-|+|+.+++-+++-...--.+ T Consensus 274 ~kNklvYDa~TGev~ddrk~m~~~sMlEDyWLpRReGgrgTEI~TLp--G-g~l-gemeDV~YF~kkLy~ALnVP~sRl~ 349 (533) T protein:vir:58 274 YKRDYWVRNNQNQFLGIDNYFSIESILKDYFIPRRGDRRAVEIDILQ--G-SKV-DLAEDVEYMLNRLISALKVPKAFIG 349 (533) T ss_pred cccceEEeccCCeEeeccchhhhhhhHhhhcccccCCCccceeeecC--C-CCC-CcHHHHHHHHHHHHHHhCCCeeecC Confidence 211 11112110 0000 00 001123455553 3 223 3346789999999999988765454 Q ss_pred CCccchhhHHHH-HHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEec----CcCchhHHHHHHHHH Q lcl|NC_016071. 366 NDGQGSYNLSES-KQSI-HGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPG----LIQEVDMEGFSKFVQ 439 (516) Q Consensus 366 ~~~~GS~Al~~v-h~ev-~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~----~~~~~dl~~~a~~~~ 439 (516) .++ |+...+++ ..|+ |...++.-...+.+.|.+||| ||+..-+. .. ++.|- ..|-.|.+.+.+++. 
T Consensus 350 ~e~-~fgr~~eItRDEiKF~KFI~rLR~rF~~ll~~qLi-----lk~iit~e-ew-~~~f~~Dn~f~ElKe~Eil~~Ri~ 421 (533) T protein:vir:58 350 YEG-DVNAKNTLATQDIKFNNTIKRIQGFFVEELERMVR-----MNKEFADQ-DF-RLVMNRSNSIVEGERFAVIEQRIG 421 (533) T ss_pred CCC-CCccchhhhHHHHHHHHHHHHHHHHHHHHHhcccc-----cccCcchh-he-eeeeeccchHHHHHHHHHHHHHHH Confidence 443 22233343 2333 444555555566666766553 56543332 22 23332 222334444445555 Q ss_pred HHHhCCcccccHHHHHHHHHHc-CCCCCCCcccccCcccccCC--------CCCCcccccccccCC----------CCCc Q lcl|NC_016071. 440 RIGAVGYLPKTPTVINKILEVG-GFDEEIPEDMSTDELLKLLG--------QDTSRSGDGMTAGSN----------GNGT 500 (516) Q Consensus 440 ~L~~~G~~~~~~~~~~~i~e~~-Glp~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~----------~~~~ 500 (516) .|..+- | .+..+||++.+ .++....+.+.+-+.+...+ ...+++......++| +.++ T Consensus 422 ~l~~~d---p-yvgk~yi~k~ILr~tdei~~q~e~ie~E~~~~~~~~~~~~~e~~~~~~~~~~~~p~~~~~~~~~~~~~~ 497 (533) T protein:vir:58 422 IAERLK---G-WVREDWIYSNILQIPYDLKPQEEVAEAAGGGGLFDTGGFGEETTPADFLGERGSPIESPRGRTEFDFGT 497 (533) T ss_pred HHHHhc---c-hhhHHHHHHHHhcCChhhhHHHHHHHHhhcCCCCCCCCcccccCCcccCccccCcccCCCChhhHhccc Confidence 554332 1 33456775543 55431111000000000000 000000000000000 0000 Q ss_pred c--cc--cccccchhhhh-------cC Q lcl|NC_016071. 501 G--KI--SSTRDNSVSNM-------DN 516 (516) Q Consensus 501 ~--~~--~~~~d~~~~~~-------~~ 516 (516) . +- .+.-+-++.++ ++ T Consensus 498 ~~~~~~~~~~~~~~a~~~~~~~~g~~~ 524 (533) T protein:vir:58 498 EGGEELGGELNLGGAFEEFEEETGGGE 524 (533) T ss_pred CCcccccccccccccchhhhhhcCCcc Confidence 0 00 00001111111 11 No 138 >protein:vir:80165 Length: 651 # NCBI annotation: portal protein # Family: family:all:1548 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285799;genbank:gi:148747833;genbank:GeneID:5220441 Probab=96.94 E-value=0.00025 Score=40.34 Aligned_cols=462 Identities=12% Similarity=0.069 Sum_probs=189.9 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHH----H--------------- Q lcl|NC_016071. 
1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEA----M--------------- 61 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~----m--------------- 61 (516) |+.-+ .+.++-.+. -.-.-+|+..-...+....+.=.+-+..|-+.+++|.. | T Consensus 1 ~~~~~----~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~r~~~~~~w~~~~~~~~~~~~~~~y~~~~~~~~~~~~~ 72 (651) T protein:vir:80 1 MKLAT----TTTDKNRQT----YDETHDVSSYVKKEYKRFCDARQVCEETWLEAWGMYLSTPEAQDYLRDQVLRSVGDVN 72 (651) T ss_pred Ccccc----cccchhhhh----hhhhHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHhhcccHHHHHhhccccccccCCCC Confidence 22222 222221111 01112344444444444443222111223222222211 0 Q ss_pred ------hhChHHHHHHHHHHHHHhc-----CCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHH-HHHHhh Q lcl|NC_016071. 62 ------KQDHTVSTALDTKYVFVTK-----AFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSA-ATFNEY 129 (516) Q Consensus 62 ------~~D~~v~s~l~~Rk~~v~~-----~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~-lda~~~ 129 (516) .-++.|+.+++.+...+.. -.| |.+.+..+..+.++.++.|...+.+--.+..|...+..+ +|++-+ T Consensus 73 ~~~rs~~~~~~v~~~ve~~~~~l~~~~~~~~~~-~~~~p~~~~d~a~~~~~~~~~~~~~~l~~~~~~~~~~~~~~d~l~~ 151 (651) T protein:vir:80 73 ADWRHKITTGKAFEAIETIHAYLMSATFPNKNW-FDVVPAKPGQDNLLVSRLIKRYVQDKLTEGKFRAAYANFLRQLLIT 151 (651) T ss_pred CCCCccccChhHHHHHHHHHHHHHHhhcCCCce-eEeccCCchhHHHHHHHHHHHHHHHHhhccCcHHHHHHHHHhhccc Confidence 0135677777777766655 334 555566566666777888888876532345688777664 689999 Q ss_pred cceeeeEEEeeccccc---------------ccc---------cceeeccccccCchhcccccceeecCC------Ccee Q lcl|NC_016071. 130 GFSIFEKVYRTESAPS---------------KYA---------GYITIDKIAFRPQSSLSRSKPWVFDED------GRTL 179 (516) Q Consensus 130 G~S~~Eivw~~~~~~~---------------~~~---------g~~~~~~l~~r~q~ti~~~~~f~~~~d------g~~l 179 (516) |.+++=+.|++..... ... |.+.+..+.+ .-|.+|.. 
..-+ T Consensus 152 G~~i~kv~we~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~i~~v~p---------~~~~~dp~a~~~~d~~~v 222 (651) T protein:vir:80 152 GNSVLALPWRVETAEVKKKVQVRTPLFEDEPTFEVVSEEREVKSSPDFEVLDM---------FDCFYDPNVTDPNRGAFI 222 (651) T ss_pred CceEEEEeecceeeeeehheeccccccccccceeeeccceeeeceeEEEEecH---------HHeeecCCCcCcccccee Confidence 9999887886431100 000 1111111110 01111110 0000 Q ss_pred eec------------------------ccc------------cccccccc-ccc--------------cccccccccccc Q lcl|NC_016071. 180 KGI------------------------YQS------------KMAFANFQ-NGL--------------TQISSAMSLVTN 208 (516) Q Consensus 180 ~~~------------------------~q~------------~~~~~~~~-~~~--------------~~~~~~~~~~~~ 208 (516) ... ... ...+.... .++ ......-.+..+ T Consensus 223 ~~~~~t~~~l~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~v~v~E~~~~~d~e~~~~~~~~ 302 (651) T protein:vir:80 223 RKLTKTKADILNLLSEGYYYGVDPLDVVEHKCKDTSDTKQDMLSTFQGVTTSLWSPHQNVELLEYWGDIHLENKTYHDVV 302 (651) T ss_pred eeeeeeHHHHHHHHhcccccchhhHHHHhhhccccccCCccccccccCCCccccccccceEEEEEEEEeeccCCceEEEE Confidence 000 000 00000000 000 000000000001 Q ss_pred cccCCCcc----ccc---cccEEEEeecCcCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccc Q lcl|NC_016071. 209 LTSSADEV----FIP---INKLMVMSLGGTESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNK 281 (516) Q Consensus 209 ~~~~~~~~----~iP---~~k~i~~~~~~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k 281 (516) +...++.+ ..| ..-|++++.....+..||.|....+...-...+...+..+..+.+...|.- ..++.- T Consensus 303 v~~~g~~il~~~~~~~~~~~Pf~~~~~~~~~~~~yG~g~~~~~~~~q~~ln~l~~~~ld~~~~~~~~~~--~v~~d~--- 377 (651) T protein:vir:80 303 VTIMGNEVLRFEQNPYWCGRPFVIGTYIPTARQPYAMGALQPNLGMLHELNIITNQRLDNLELAIDQMY--TLRSDG--- 377 (651) T ss_pred EEEcCcEEecccccCCCCCCCeeeecceecCccccCCChHHHHhHHHHHHHHHHHHHHHHHHHHhCCcE--EecCCc--- Confidence 11111101 122 236899999999999999999999999888888888888777777555542 222210 Q ss_pred ccCCCCHHHHHHHHHHHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhccc Q lcl|NC_016071. 
282 AAIDPKSPESEMVQGLMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGF 361 (516) Q Consensus 282 ~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqt 361 (516) -.+.+ . +. + ..|+++-.+... .+..+.. ++........++++++..|..+.+-.. T Consensus 378 ----~~~~~--~---l~----~-----~pg~vi~~~~~~------~~~~l~~-~~~~~~~~~~~l~~l~~~~~~~~gv~~ 432 (651) T protein:vir:80 378 ----LLQPE--D---VY----T-----EPGKVFLVSDHG------DLQPLAN-QSSNFSITYQESSFLESTIDKNFGTGN 432 (651) T ss_pred ----cccHH--H---hh----c-----CCCceEEecCCC------Cceeecc-CcccchhHHHHHHHHHHHHHHHhcCCh Confidence 11111 0 10 1 112222222111 1211211 111111234689999999988877665 Q ss_pred ccccCCc--cchhhHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEe--------cCcCch Q lcl|NC_016071. 362 INLGNDG--QGSYNLSESKQ--SIHGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKP--------GLIQEV 429 (516) Q Consensus 362 Lts~~~~--~GS~Al~~vh~--ev~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~--------~~~~~~ 429 (516) +..+... .+..-+++|+. +.....+..-.+.+..++-+.|+..++.++..++.....|++.- ..+... T Consensus 433 ~~~g~~~~~~~~~TAteI~~~~~~~~~~l~~v~~~l~~e~l~pl~~r~l~l~~~~~~~~~~~ri~~~~~~~~~~~~i~~~ 512 (651) T protein:vir:80 433 YVGANAARSGERVTAAEVAAVREAGGNRLSGIHKHIEETSLLVLLEKVMHLVQQFTDQPGMVRVAGDEAGAYEYYELDVE 512 (651) T ss_pred HHhCCCccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcccceeecccccccccccccCcc Confidence 5543221 12222244443 33445566667777777667788888888766665555554311 001111 Q ss_pred h------------------HHHHHHHHHHHHhCCccccc-------HHHHHHHHHHcCCCCCCCcccccCcccccCCCCC Q lcl|NC_016071. 430 D------------------MEGFSKFVQRIGAVGYLPKT-------PTVINKILEVGGFDEEIPEDMSTDELLKLLGQDT 484 (516) Q Consensus 430 d------------------l~~~a~~~~~L~~~G~~~~~-------~~~~~~i~e~~Glp~~~~~~~~~~~~~~~~~~~~ 484 (516) | ...+++..+ +..++...|. ......+.+..|++.+.+- ..++...++++.. 
T Consensus 513 dl~~~~~iv~~g~~~~~~r~~~~~~l~~-~~q~~~~~p~~~~~~~~~~~~~~l~~~~g~~~~~~~--l~~~~q~~~~~~~ 589 (651) T protein:vir:80 513 DLQKEVRLVPIGSDHVIERKQYIEDRLT-FIQAVAQVPEMGQLVDYKRILVDLLQHWGFEEPEAY--LKQQDQQAPANPQ 589 (651) T ss_pred ceeeeeeeeeccHHHHHHHHHHHHHHHH-HHHhhccCCccchhhhHHHHHHHHHHHcCCCCcHHh--cCCCccchhhhhh Confidence 1 112222222 2322222221 1123446778898754321 1110000000000 Q ss_pred CcccccccccCCCCCcccccccccc----hh----hhhcC Q lcl|NC_016071. 485 SRSGDGMTAGSNGNGTGKISSTRDN----SV----SNMDN 516 (516) Q Consensus 485 ~~~~~~~~~~~~~~~~~~~~~~~d~----~~----~~~~~ 516 (516) .+. . .++...+ ..+....+..- +. +-+.. T Consensus 590 ~~~-~-~q~~~~~-~~a~~~~~~~~~~~~~~~~~~~~~~~ 626 (651) T protein:vir:80 590 EAL-L-SQAKDVG-GQAMSNMLQNQLQADGGTQMMSEMYG 626 (651) T ss_pred HHH-H-hhHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 000 0 0000000 00000000000 00 00000 No 139 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=96.89 E-value=0.00027 Score=40.11 Aligned_cols=413 Identities=10% Similarity=-0.006 Sum_probs=149.0 Q ss_pred cCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhhChHHHHHHHHHHHHHhcCCceeeeCCCCCChhhHHH Q lcl|NC_016071. 20 AVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQDHTVSTALDTKYVFVTKAFNDFKVLYNRDSKASKDA 99 (516) Q Consensus 20 ~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~~v~~~~w~i~~~~~~d~~~~~~~ 99 (516) =+|+ +. +..+....++..-.+..-++.+....+.-- .|+. +|...+.. T Consensus 1 ~l~~------------------------~~--~~~~~~~~~~~v~n~~~~ivd~~~~~l~~~--gf~~---~d~~~~~~- 48 (434) T protein:vir:98 1 MLPK------------------------NA--EQAFLDFQRKARTNFCGLIANASVHRLLAL--GVTG---PDGEPDTR- 48 (434) T ss_pred CCCC------------------------Cc--cHHHHHhhhhhhccchHHHHHHHHhhhccC--ceec---CCCchHHH- Confidence 0111 00 001111111111112222222211111111 1232 22223323 Q ss_pred HHHHHHHHhhccCcCCHHHHHHHH-HHHHhhcceeeeEEEeecccccc------------cc-cceeeccccccCchhcc Q lcl|NC_016071. 
100 AEFVEYALKNLANQQTLRDIARSA-ATFNEYGFSIFEKVYRTESAPSK------------YA-GYITIDKIAFRPQSSLS 165 (516) Q Consensus 100 a~~v~~~l~~~~~~~~~~~~l~~~-lda~~~G~S~~Eivw~~~~~~~~------------~~-g~~~~~~l~~r~q~ti~ 165 (516) +.+.|++ ..|.....++ .+|+-||.|. +++|....+... |. ....++....++...| T Consensus 49 ---~~~i~~~----N~~d~~~~~~~~~a~i~G~ay-~~v~~~~~~~~~~~~~~~~I~~~~p~~~~~i~D~~~~~~~~ai- 119 (434) T protein:vir:98 49 ---ASRWWQA----NRLDSRQKLVWRMAMAQSAGY-MLVGAHPTRTEDNGRPSPLITMEHPSECIVEYDPETGEPLVGL- 119 (434) T ss_pred ---HHHHHHh----cChhHHHHHHHHHHhhcCceE-EEEecCCCcccccCCceeEEEEeccceeEEEEeCCCCceEEEE- Confidence 3444442 2366666664 5799999775 578864332211 00 0111122222222222 Q ss_pred cccceeecCCCceeeecc--ccccccccccccccccc-cccccccc-cccCCCccccccccEEEEeecCcCCccccchhH Q lcl|NC_016071. 166 RSKPWVFDEDGRTLKGIY--QSKMAFANFQNGLTQIS-SAMSLVTN-LTSSADEVFIPINKLMVMSLGGTESNPAGVSPL 241 (516) Q Consensus 166 ~~~~f~~~~dg~~l~~~~--q~~~~~~~~~~~~~~~~-~~~~~~~~-~~~~~~~~~iP~~k~i~~~~~~~~g~p~G~gLl 241 (516) +.|..+.++....... ................. ....+... .........+...-++.|.+++..+. .|.|-+ T Consensus 120 --~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~N~~~~~~-~g~sd~ 196 (434) T protein:vir:98 120 --KVWHNDIDGFGYARVFFDDTSFPYRTRERTGARLPWGPDSWVYTGTADSGDVHDLGGMQLVEFARMPDLGE-DPEPEF 196 (434) T ss_pred --EEEEeccCCceEEEEEEeCcEEEEEEeeccccccccccccceecccccccccCCCCccceEEeccCCCcCc-CCcchh Confidence 1233333332111100 00000000000000000 00000000 00000011222333455666665544 488888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEeccCcccc Q lcl|NC_016071. 242 VGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILPSDMNAQ 321 (516) Q Consensus 242 r~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~ 321 (516) ..+-...=-=+..+...+...+-+..|..+++|... ....+..... . ...+.+.++.. 
++.+..+-+.+ T Consensus 197 e~vi~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~---~~~~~~~~~~---~----~~~~~~~~~~~-~i~~~~~~~~~ 265 (434) T protein:vir:98 197 AGVLDIQDRVNLGILNRMAASRFSGFRQKWIKGHKF---AKRTDPATGM---T----VVDQPFVPSPS-AVWASEGENTQ 265 (434) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCc---cccccccccc---c----hhhhhhhcccc-ccccCCCCCce Confidence 765332222233444556666767777666654211 0001110000 0 00111111212 22233332222 Q ss_pred cccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhccccccc-CCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016071. 322 GGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLG-NDGQGSYNLSESKQSIHGHFVQRDIDIIVEAFNKN 400 (516) Q Consensus 322 ~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~-~~~~GS~Al~~vh~ev~~~~~~aDa~~i~~~ln~~ 400 (516) +.+.+.+ ....|...++.|=.+|+...--..-..+ ..+..|...-+....-....++.-.+.+...|. + T Consensus 266 --------~~q~~~~-~~~~~~~~l~~~i~~~~~~~~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~-~ 335 (434) T protein:vir:98 266 --------FGQLDAT-DLSGFLKEHASDVRDMLTISQTPTYLYATDLVNISADTIGALDILHVAKVREHIASFSEGLE-S 335 (434) T ss_pred --------EEEecCc-chHHHHHHHHHHHHHHhcccCCCHHHhccccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHH-H Confidence 2222222 1223444444444444433211100000 011123322223333334444444455666674 4 Q ss_pred HHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccccC--cccc Q lcl|NC_016071. 401 LIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMSTD--ELLK 478 (516) Q Consensus 401 li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~~--~~~~ 478 (516) +++.++.+++... +..-..+.|....+.++.+.|+++.+|+..|+ + .+.+++.+|+++.+-+..... .... 
T Consensus 336 ~~rl~~~~~g~~~-~~~~~~v~w~~~~~~s~~~~ada~~kl~~~g~--~----~e~~~~~lg~~~~e~~r~~~e~~~~~~ 408 (434) T protein:vir:98 336 VLALAAAQAGVPE-DYTEAEVRWANPAHVTMAVKADAATKLKSIGY--P----LDVIAEELDESPARVRRIVAGAASQAL 408 (434) T ss_pred HHHHHHHhcCCCh-hheeeeEEecCCCCCCHHHHHHHHHHHHhcCC--c----HHHHHHhCCCCHHHHHHHHHHHHHHHH Confidence 6777777775332 22235788888999999999999999999885 2 356888998864221100000 0000 Q ss_pred cCCCCCCcccccccccCCCCCccccccc Q lcl|NC_016071. 479 LLGQDTSRSGDGMTAGSNGNGTGKISST 506 (516) Q Consensus 479 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 506 (516) .........++..+...+..+.+.. + T Consensus 409 ~~~~~~~~~~~~~~g~~~~~~~~~d--g 434 (434) T protein:vir:98 409 LAASLLPAPGAPSAGNVPDSGGAVD--G 434 (434) T ss_pred HHHhhhccCCCCCCCCCCcccCCCC--C Confidence 0000000001100001111111111 1 No 140 >protein:vir:99563 Length: 862 # NCBI annotation: minor head protein-like protein # Family: family:all:297 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039808;genbank:gi:126011058;genbank:GeneID:4818258 Probab=96.89 E-value=0.00028 Score=40.09 Aligned_cols=459 Identities=11% Similarity=0.030 Sum_probs=174.5 Q ss_pred CCcccc-------Ccccccchhhhccc------CCCCcccc---------------cchHHHHHHHHHHHhh-----ccc Q lcl|NC_016071. 1 MSTRFA-------QPSEVVKAGNENLA------VSRLRTGE---------------LGSGALSQLRAESEVM-----KVE 47 (516) Q Consensus 1 ~~~r~~-------~~~~~~~~~~~~p~------~~~~~~~e---------------~g~~~~~~~~~~~~~~-----~~~ 47 (516) |.+|.+ ++.++.+..+.+|. .++.++.. ++......+..+...+ ..+ T Consensus 39 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~a~~~~~~~~~~~~~Dgl~n~~~~lG~~~~~s~ 118 (862) T protein:vir:99 39 LARTRQNWPVQKEKPNPIIRSVKDFPFVEISDSVNAKSVSGKNFAMDSAVRSAIKAITGFAMDDGGGAPVPIGAEGKQSS 118 (862) T ss_pred HHhhcccCCcccccCCCCCCcccccccccccccccchhhhhhhhcchhhcchhhhhhhhhhhhcchhhhhhccccccccc Confidence 332221 11222222233331 11110000 0000000000000000 000 Q ss_pred c------cCC-----cccHHHHHHHhhChHHHHHHHHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCH Q lcl|NC_016071. 
48 E------LRW-----PCFLATVEAMKQDHTVSTALDTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTL 116 (516) Q Consensus 48 ~------lr~-----~~~~~~y~~m~~D~~v~s~l~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~ 116 (516) . ..| --.+.++...++.+-+..++++.-.-.++.-|+|...... +..+.+..+.++..++++.. | T Consensus 119 y~~~~~~~~~~~~~~f~gyql~alY~~~~larkiVd~pAeDatR~g~~I~~~~d~-~e~~~e~~~~ie~~~~rL~v---~ 194 (862) T protein:vir:99 119 YAVPEALQDWYLSQGFIGHQACALIAQHWLVDKACSLAGEDAIRNGWHLKSLGEG-EEIDEESLEKFKAIDVEFKV---K 194 (862) T ss_pred cccchhccccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCceEeecCcc-cccCHHHHHHHHHHHHHhhH---H Confidence 0 000 0124555545569999999999999888888888754322 22334556778888877642 4 Q ss_pred HHHHHHHHHHHhhcceee-eEEEeeccccc-cc---cc--ceeeccccccCchhcccccceeecCCCceeeecccccccc Q lcl|NC_016071. 117 RDIARSAATFNEYGFSIF-EKVYRTESAPS-KY---AG--YITIDKIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAF 189 (516) Q Consensus 117 ~~~l~~~lda~~~G~S~~-Eivw~~~~~~~-~~---~g--~~~~~~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~ 189 (516) ..+...+..+..||-++. -++=..++..+ .| ++ .-.++.|....+ +|.... ......+++... T Consensus 195 ~~l~eair~~RLyGga~ililv~~~D~~~LsqPLn~e~I~kG~lkgl~vlDp-------~w~~p~---~v~~~~~Dp~sp 264 (862) T protein:vir:99 195 ENLIEFNRFKNVFGIRVAIFVVDSEDPDYYEKPFNPDGITPGSYRGISQIDP-------YWMMPM---LTAESTADPSSQ 264 (862) T ss_pred HHHHHHHHhcccccceEEEEEecCcCchhhhcCcCcccccccceeEEEEech-------hhhccc---cccccccccccc Confidence 444444445888985543 22201111100 11 00 000011111111 111100 000011121111 Q ss_pred ccccccccccccccccccccccCCCccccccccEEEEeecC------cCCccccchhHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_016071. 190 ANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGG------TESNPAGVSPLVGCYRAFREKILIENLETIGAS 263 (516) Q Consensus 190 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~------~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~e 263 (516) .++.+.++.+ .+..|=+.+++.+.... ...+++|.|++..||-...--......=+..+. 
T Consensus 265 ~yGkP~~y~I--------------~g~~IH~SRliif~g~~vpd~lk~ay~f~G~SvLe~iyd~L~~~d~t~~saa~Ll~ 330 (862) T protein:vir:99 265 FFYEPEFWII--------------SGQKYHRSHLIIARGPQPADILKPTYIFGGIPLVQRIYERVYAAERTANEAPLLAM 330 (862) T ss_pred ccCCceeeee--------------cCeeeccceeEEecCCCchhhhhccCCccCccHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1111111111 12234445555543322 345578999999998755433333322333444 Q ss_pred hccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHH Q lcl|NC_016071. 264 KDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQ 343 (516) Q Consensus 264 r~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~ 343 (516) ++. +.+++.... ..-..+....+++.. +...+ +....++|-.+ .+++.++.+=+| .. T Consensus 331 ka~--l~v~ktd~l-------~~l~~ed~l~~r~~~-~~~~r-dN~Gi~liD~e--------Ee~e~ls~slSG----L~ 387 (862) T protein:vir:99 331 NKR--TTAIHTDTA-------KAIANEDKFIQRLMF-WVRYR-DNHAVKVLGTD--------ETMEQFDTSLAD----FD 387 (862) T ss_pred Hhc--cceeechhH-------hhhccHHHHHHHHHH-HHhcc-CcceeEEecCC--------CceeEEecccCC----hH Confidence 433 334443211 111112222333322 22221 22334444333 245555554333 44 Q ss_pred HHHHHHHHHHHHHHhccccc-ccCCccchhhHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhcCCcCCccccceE Q lcl|NC_016071. 344 ELVNSRKKAILDRFGAGFIN-LGNDGQGSYNLSESKQSIHGHFVQRDID-IIVEAFNKNLIPQLLALNDIRLSDEDMPKL 421 (516) Q Consensus 344 ~li~~~d~~Isk~iLGqtLt-s~~~~~GS~Al~~vh~ev~~~~~~aDa~-~i~~~ln~~li~~lv~lN~~~~~~~~~P~~ 421 (516) .+++..-.+||-+.--..-- .+.+-.|-.|-|+--..++-+.+++-.. .|...|++ |+. |+.+-. +-... 
-.| T Consensus 388 dll~~~~q~IAaas~IP~tiLfGqspaGlnATGE~D~~nYyD~I~s~QE~~L~P~Ler-L~~-li~~~l--g~~~d-~~i 462 (862) T protein:vir:99 388 AVIMGQYQLVASIAKTPATKLLGTAPKGFNSTGEFETISYHEELESIQEHVYMPFLQR-HYL-ISRLSL--GIQHE-IDV 462 (862) T ss_pred HHHHHHHHHHHhhhCCCceeecccCcccccCchHHHHHHHHHHHHHHHHHHHHHHHHH-HHH-HHHHhc--CCCCc-ceE Confidence 56676667787764333211 2222234345555455666666666543 34445543 333 222211 11111 134 Q ss_pred EecCcCchhH-------HHHHHHHHHHHhCCcccccHHHHHHHHH--HcCCCC---CCCcccc------cCccccc-CCC Q lcl|NC_016071. 422 KPGLIQEVDM-------EGFSKFVQRIGAVGYLPKTPTVINKILE--VGGFDE---EIPEDMS------TDELLKL-LGQ 482 (516) Q Consensus 422 ~~~~~~~~dl-------~~~a~~~~~L~~~G~~~~~~~~~~~i~e--~~Glp~---~~~~~~~------~~~~~~~-~~~ 482 (516) +|......+- +..|++++++++.|++.++ +..+.+++ .+|++. ...+++. ....+++ .+. T Consensus 463 eFnpL~~~sekEkAEi~kk~Aea~~~lv~sGvispd-EvR~~L~~~~~~g~~~l~ded~E~d~~~~~e~~~~~e~~g~a~ 541 (862) T protein:vir:99 463 VMEPVASMTAQQQADLNKTKAEGGKVLIDGGVISPD-EERNRIRDDKRSGYNRLTKEDAEETPGASPENLAAYQKAGAAQ 541 (862) T ss_pred EeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHH-HHHHHHHhcCCcCCCCCCcccccccCCCCcccccccccCCccc Confidence 4533322222 3446778999999988763 22233321 224321 1111000 0000000 000 Q ss_pred CCCcc---------------------cccccccCCCCCcccccccccchhhhhcC Q lcl|NC_016071. 483 DTSRS---------------------GDGMTAGSNGNGTGKISSTRDNSVSNMDN 516 (516) Q Consensus 483 ~~~~~---------------------~~~~~~~~~~~~~~~~~~~~d~~~~~~~~ 516 (516) .+++. 
.+..+.++....+....+..++.++..++ T Consensus 542 ~~ap~de~~aga~~~~~e~d~~~~p~~~~~~~g~~~~~t~~~~a~~p~~~~~~~~ 596 (862) T protein:vir:99 542 ETASAKETQAGAAVTTAEGDQPNVQMVPSMKPGQMVGPEVGITAPMPEDDAPVAG 596 (862) T ss_pred ccccccccccccCCccccCCcccccccCCCCCCCccccccccccCCCccccccCc Confidence 00000 00111111111122223334444444444 No 141 >protein:vir:102239 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655205;genbank:gi:109522785;genbank:GeneID:4157478 Probab=96.86 E-value=0.00029 Score=39.95 Aligned_cols=453 Identities=14% Similarity=0.031 Sum_probs=191.6 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhh--ChHHHHHHHHHH--- Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQ--DHTVSTALDTKY--- 75 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~--D~~v~s~l~~Rk--- 75 (516) |---. ..+-+..|+|-++-|-+-+ ....+ -++-..|++|++|-. -.++.++++-+. T Consensus 1 ~~~~~----------~~~~~~~~~~~g~~~~p~~-----v~~~d----~~Rl~aY~l~~~~y~n~~~~~~~~lrg~~~~~ 61 (527) T protein:vir:10 1 MGQDK----------RQYGSTQQLRAGEANFPNA-----VTDFD----KARLASYRLYEDMYLTNTSDYQVILRGGDEGD 61 (527) T ss_pred CCccc----------cccCCCcCcCCccccCccc-----CCHHH----HHHHHHHHHHHHHhcCchhheeeecCCccccc Confidence 22111 1122344555555442211 00000 122235666666653 235555554433 Q ss_pred ---------HHHhcCCceeeeCCCC--CChhhHHHHHHHHHHHhhccCcCCHHHHHHH-HHHHHhhcceeeeEEEeeccc Q lcl|NC_016071. 76 ---------VFVTKAFNDFKVLYNR--DSKASKDAAEFVEYALKNLANQQTLRDIARS-AATFNEYGFSIFEKVYRTESA 143 (516) Q Consensus 76 ---------~~v~~~~w~i~~~~~~--d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~-~lda~~~G~S~~Eivw~~~~~ 143 (516) ..|....-+|.++... .+..+.+ |++.|+......-|+....+ --++..-|=.|+=+.|..... 
T Consensus 62 ~r~~~~ps~~~~~~~~~~~~~~g~~~~~~~~~e~----v~~~lr~~~~~e~l~~~~~~~~r~~~vlGDg~f~l~wD~~k~ 137 (527) T protein:vir:10 62 QRPIYVPNGEKLIEAKMRFLGQGLKWEFSKKDAK----VDDAIRVLFDRENWEQKFESLKRWTEIRGDYVLLLIGDDEKD 137 (527) T ss_pred cceeeehhhHHhhCCcceeeccCccccccchhHH----HHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccCCC Confidence 3455666777766544 2223334 44444433323335544433 347888899999999985331 Q ss_pred ccccccceeecccc------------------------ccCchhcc------cccceee--cCCCcee-----eeccccc Q lcl|NC_016071. 144 PSKYAGYITIDKIA------------------------FRPQSSLS------RSKPWVF--DEDGRTL-----KGIYQSK 186 (516) Q Consensus 144 ~~~~~g~~~~~~l~------------------------~r~q~ti~------~~~~f~~--~~dg~~l-----~~~~q~~ 186 (516) . .+++.++.+- ++.|..-. +.++|.| +++|..+ ..+ ... T Consensus 138 ~---~~R~~v~~~DP~~~f~~ed~d~~~~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt-~~~ 213 (527) T protein:vir:10 138 E---GSRLSLHEVDPSTYFPYEDPRYPGQVLGVYLVDEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYT-EEL 213 (527) T ss_pred c---CCCceEeecCcceeeeeecCCCCCceeeEEEeeeccCCccccccceehhhhhhhhhcCcccccccCcceeee-ece Confidence 0 0122222111 11111100 0111221 1122100 000 000 Q ss_pred ccccccccccccccccccccccccc---CCCccccccccEEEEeecCcCCccccchhHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_016071. 187 MAFANFQNGLTQISSAMSLVTNLTS---SADEVFIPINKLMVMSLGGTESNPAGVSPLVGCYRAFREKILIENLETIGAS 263 (516) Q Consensus 187 ~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~iP~~k~i~~~~~~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~e 263 (516) ....+|+.--.++...-.+-+.... 
.....+|..=-++.|...+..+..+|.|-|..+--..---+..+.+....++ T Consensus 214 w~lg~w~d~~e~p~~~~~~~~~~~~~~l~~lp~pi~fiPvV~~~t~p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~ 293 (527) T protein:vir:10 214 YEPGKWDDRPESPLEPDDIKKLSTLTEEEPLPEQITTLPVFHFRGHPIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMV 293 (527) T ss_pred eeccccccccccccchhhhhhhcCceeeecccCCCCccceEeecCCCccccccChhhHhHHHHHHHHHhhhhhHHHHHHH Confidence 0000000000000000000000000 0001112222345566778899999999988877666555566666666677 Q ss_pred hccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHH Q lcl|NC_016071. 264 KDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQ 343 (516) Q Consensus 264 r~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~ 343 (516) -.|.||.++.+.+.+-. -|...-..|..|+.++..+..++.+++. ......|. T Consensus 294 ~sG~Pi~~~tg~~~vd~-------------------------~G~~~~~~VgPG~iweL~e~ak~~~v~~--~~~la~~~ 346 (527) T protein:vir:10 294 FGGLGFYATDSAPPRDS-------------------------RGNMVPWTISPLGMVEHGQNNKIYRVNG--VASLEPSQ 346 (527) T ss_pred HhCCceeeecccccccc-------------------------cCCcCccccCCceeEecCCCcceeeccc--hhhhHHHH Confidence 67788877665432100 0222334455666666666667776654 33344588 Q ss_pred HHHHHHHHHHHHHHhccccccc---CCccchhhHHHHHHHHHHHHHHHHHHHHHHHH---HHHH-HHHHHHhcCCcCCcc Q lcl|NC_016071. 344 ELVNSRKKAILDRFGAGFINLG---NDGQGSYNLSESKQSIHGHFVQRDIDIIVEAF---NKNL-IPQLLALNDIRLSDE 416 (516) Q Consensus 344 ~li~~~d~~Isk~iLGqtLts~---~~~~GS~Al~~vh~ev~~~~~~aDa~~i~~~l---n~~l-i~~lv~lN~~~~~~~ 416 (516) ..++++.++|+-.---.-..++ .++.-|-..=+....-....++.....+..+. .+++ +.||-.+-.-.+.+. 
T Consensus 347 ~h~~~L~~~l~~vA~~PavA~G~vD~s~~~SG~ALeL~L~PLlar~~rk~L~~~~Vqrq~~~~~~~~~L~aye~v~~~d~ 426 (527) T protein:vir:10 347 THMNKAEEAMQQTKGIPDIAVGVVDAAVAESGIALDLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDDA 426 (527) T ss_pred HHHHHHHHHHHHhhcCCeeeeccccCCcCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCC Confidence 8889999887765444333332 12211221112222222222222322222222 1112 244333221111121 Q ss_pred ---ccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccccCcccccCCCCCCcccc--cc Q lcl|NC_016071. 417 ---DMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMSTDELLKLLGQDTSRSGD--GM 491 (516) Q Consensus 417 ---~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~~~~~~~~~~~~~~~~~--~~ 491 (516) ..-++.|...-+.|.++..+.+.+|+..|++.. .-..+.+.+.-|+..++.+-+......+.++-+...+.. |+ T Consensus 427 ~~~~~v~ivf~p~lP~D~~avie~v~tL~~aGiiS~-etAv~~L~~~~g~eD~E~E~~~I~~era~~a~a~a~a~~~~~a 505 (527) T protein:vir:10 427 DKKLTVTITFRDPKPVNNEKRFAQLLELWEAGLIPA-KKLTEELSKIMGFELTEEDFRQATEDKKTQGIAQAEAADPFGA 505 (527) T ss_pred ccccceEEEecccCCCCHHHHHHHHHHHHHcCchhH-HHHHHHHHhccCCCchHHHHHHHHHHHHHHhHHhhhhcCchhh Confidence 123789999999999999999999999998764 222223333336544332211111111111111111111 11 Q ss_pred cccCC-CCCcccccccccchhhhhcC Q lcl|NC_016071. 492 TAGSN-GNGTGKISSTRDNSVSNMDN 516 (516) Q Consensus 492 ~~~~~-~~~~~~~~~~~d~~~~~~~~ 516 (516) .++.. |.++. ..-..-| T Consensus 506 ~~~~~~g~~~~--------~~d~~~~ 523 (527) T protein:vir:10 506 QMAAEQGIPDE--------EDDQALN 523 (527) T ss_pred hhccccCCCCC--------CcccccC Confidence 11110 11111 1111122 No 142 >protein:vir:101494 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655388;genbank:gi:109522576;genbank:GeneID:4157566 Probab=96.85 E-value=0.0003 Score=39.93 Aligned_cols=453 Identities=14% Similarity=0.031 Sum_probs=191.6 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhh--ChHHHHHHHHHH--- Q lcl|NC_016071. 
1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQ--DHTVSTALDTKY--- 75 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~--D~~v~s~l~~Rk--- 75 (516) |---. ..+-+..|+|-++-|-+-+ ....+ -++-..|++|++|-. -.++.++++-+. T Consensus 1 ~~~~~----------~~~~~~~~~~~g~~~~p~~-----v~~~d----~~Rl~aY~l~~~~y~n~~~~~~~~lrg~~~~~ 61 (527) T protein:vir:10 1 MGQDK----------RQYGSTQQLRAGEANFPNA-----VTDFD----KARLASYRLYEDMYLTNTSDYQVILRGGDEGD 61 (527) T ss_pred CCccc----------cccCCCcCcCCccccCccc-----CCHHH----HHHHHHHHHHHHHhcCchhheeeecCCccccc Confidence 22111 1122344555555442211 00000 122235666666653 235555554433 Q ss_pred ---------HHHhcCCceeeeCCCC--CChhhHHHHHHHHHHHhhccCcCCHHHHHHH-HHHHHhhcceeeeEEEeeccc Q lcl|NC_016071. 76 ---------VFVTKAFNDFKVLYNR--DSKASKDAAEFVEYALKNLANQQTLRDIARS-AATFNEYGFSIFEKVYRTESA 143 (516) Q Consensus 76 ---------~~v~~~~w~i~~~~~~--d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~-~lda~~~G~S~~Eivw~~~~~ 143 (516) ..|....-+|.++... .+..+.+ |++.|+......-|+....+ --++..-|=.|+=+.|..... T Consensus 62 ~r~~~~ps~~~~~~~~~~~~~~g~~~~~~~~~e~----v~~~lr~~~~~e~l~~~~~~~~r~~~vlGDg~f~l~wD~~k~ 137 (527) T protein:vir:10 62 QRPIYVPNGEKLIEAKMRFLGQGLKWEFSKKDAK----VDDAIKVLFDRENWEQKFESLKRWTEIRGDYVLLLIGDDEKD 137 (527) T ss_pred cceeeehhhHHhhCCcceeeccCccccccchhHH----HHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccCCC Confidence 3455666777766544 2223334 44444443333335544433 347888899999999985331 Q ss_pred ccccccceeecccc------------------------ccCchhcc------cccceee--cCCCcee-----eeccccc Q lcl|NC_016071. 144 PSKYAGYITIDKIA------------------------FRPQSSLS------RSKPWVF--DEDGRTL-----KGIYQSK 186 (516) Q Consensus 144 ~~~~~g~~~~~~l~------------------------~r~q~ti~------~~~~f~~--~~dg~~l-----~~~~q~~ 186 (516) . .+++.++.+- ++.|..-. +.++|.| +++|..+ ..+ ... 
T Consensus 138 ~---~~R~~v~~~DP~~~f~~ed~d~~~~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt-~~~ 213 (527) T protein:vir:10 138 E---GSRLSLHEVDPSTYFPYEDPRYPGQVLGVYLVDEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYT-EEL 213 (527) T ss_pred c---CCCceEeecCcceeeeeecCCCCCceeeEEEeeeccCCccccccceehhhhhhhhhcCcccccccCcceeee-ece Confidence 0 0122222111 11111100 0111221 1122100 000 000 Q ss_pred ccccccccccccccccccccccccc---CCCccccccccEEEEeecCcCCccccchhHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_016071. 187 MAFANFQNGLTQISSAMSLVTNLTS---SADEVFIPINKLMVMSLGGTESNPAGVSPLVGCYRAFREKILIENLETIGAS 263 (516) Q Consensus 187 ~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~iP~~k~i~~~~~~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~e 263 (516) ....+|+.--.++...-.+-+.... .....+|..=-++.|...+..+..+|.|-|..+--..---+..+.+....++ T Consensus 214 w~lg~w~d~~e~p~~~~~~~~~~~~~~l~~lp~pi~fiPvV~~~t~p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~ 293 (527) T protein:vir:10 214 YEPGKWDDRPESPLEPDDIKKLSTLTEEEPLPEQITTLPVFHFRGHPIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMV 293 (527) T ss_pred eeccccccccccccchhhhhhhcCceeeecccCCCCccceEeecCCCccccccChhhHhHHHHHHHHHhhhhhHHHHHHH Confidence 0000000000000000000000000 0001112222345566778899999999988877666555566666666677 Q ss_pred hccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHH Q lcl|NC_016071. 264 KDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQ 343 (516) Q Consensus 264 r~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~ 343 (516) -.|.||.++.+.+.+-. -|...-..|..|+.++..+..++.+++. ......|. 
T Consensus 294 ~sG~Pi~~~tg~~~vd~-------------------------~G~~~~~~VgPG~iweL~e~ak~~~v~~--~~~la~~~ 346 (527) T protein:vir:10 294 FGGLGFYATDSAPPRDS-------------------------RGNMVPWTISPLGMVEHGQNNKIYRVNG--VASLEPSQ 346 (527) T ss_pred HhCCceeeecccccccc-------------------------cCCcCccccCCceeEecCCCcceeeccc--hhhhHHHH Confidence 67788877665432100 0222334455666666666667776654 33344588 Q ss_pred HHHHHHHHHHHHHHhccccccc---CCccchhhHHHHHHHHHHHHHHHHHHHHHHHH---HHHH-HHHHHHhcCCcCCcc Q lcl|NC_016071. 344 ELVNSRKKAILDRFGAGFINLG---NDGQGSYNLSESKQSIHGHFVQRDIDIIVEAF---NKNL-IPQLLALNDIRLSDE 416 (516) Q Consensus 344 ~li~~~d~~Isk~iLGqtLts~---~~~~GS~Al~~vh~ev~~~~~~aDa~~i~~~l---n~~l-i~~lv~lN~~~~~~~ 416 (516) ..++++.++|+-.---.-..++ .++.-|-..=+....-....++.....+..+. .+++ +.||-.+-.-.+.+. T Consensus 347 ~h~~~L~~~l~~vA~~PavA~G~vD~s~~~SG~ALeL~L~PLlar~~rk~L~~~~vqrq~~~~~~~~~L~aye~v~~~d~ 426 (527) T protein:vir:10 347 THMTKAEEAMQQTKGIPDIAVGVVDAAVAESGIALDLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDDA 426 (527) T ss_pred HHHHHHHHHHHHhhcCCeeeeccccCCcCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCC Confidence 8889999887765444333332 12211221112222222222222322222222 1112 244333221111121 Q ss_pred ---ccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccccCcccccCCCCCCcccc--cc Q lcl|NC_016071. 417 ---DMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMSTDELLKLLGQDTSRSGD--GM 491 (516) Q Consensus 417 ---~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~~~~~~~~~~~~~~~~~--~~ 491 (516) ..-++.|...-+.|.++..+.+.+|+..|++.. .-..+.+.+.-|+..++.+-+......+.++-+...+.. 
|+ T Consensus 427 ~~~~~v~ivf~p~lP~D~~avie~v~tL~~aGi~S~-~tAv~~L~~~~g~eD~E~E~~~I~~era~~a~a~a~A~~~~~a 505 (527) T protein:vir:10 427 DKKLTVTITFRDPKPVNSEKRFNQLLQLWEAGLIPA-KKLTEELSKIMGFELTEEDFKQATEDKKTQGIAQAEAADPFGA 505 (527) T ss_pred ccccceEEEecccCCCCHHHHHHHHHHHHHcCchhH-HHHHHHHHhccCCCChHHHHHHHHHHHHHHhHHhhhhcCchhh Confidence 123789999999999999999999999998764 222223333336544332211111111111111111111 11 Q ss_pred cccCC-CCCcccccccccchhhhhcC Q lcl|NC_016071. 492 TAGSN-GNGTGKISSTRDNSVSNMDN 516 (516) Q Consensus 492 ~~~~~-~~~~~~~~~~~d~~~~~~~~ 516 (516) .++.. |.++. ..-..-| T Consensus 506 ~~~~~~g~~~~--------~~d~~~~ 523 (527) T protein:vir:10 506 QMAAEQGIPDE--------EDDQALN 523 (527) T ss_pred hhccccCCCCC--------CcccccC Confidence 11110 11111 1111122 No 143 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=96.83 E-value=0.00031 Score=39.80 Aligned_cols=427 Identities=12% Similarity=-0.036 Sum_probs=154.2 Q ss_pred cccchhhhcccCCCCcccccchHH--HHHHHHHHHhhcccccCCcccHHHHHHHhh-ChHH-----------HHHHHHHH Q lcl|NC_016071. 10 EVVKAGNENLAVSRLRTGELGSGA--LSQLRAESEVMKVEELRWPCFLATVEAMKQ-DHTV-----------STALDTKY 75 (516) Q Consensus 10 ~~~~~~~~~p~~~~~~~~e~g~~~--~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~-D~~v-----------~s~l~~Rk 75 (516) .+...+. .+|++...|.-... ++.+... .+.++ ...+.|+ ... ..++ ..++.==+ T Consensus 1 ~~~~~~~---~~~gl~~~~~~~~~~L~~~~~~~-----~~~~~--~~~~Yy~-G~~~~~~~~~~~p~~~r~~~~v~nw~~ 69 (474) T protein:vir:81 1 MIQQQTV---RIPSLSNDENALINGLLAQIENL-----RWKNL--LRTSYYE-NKRTIQYVGTLIPPQYFNLGLVLGWTG 69 (474) T ss_pred CcCCCcC---cCCCCChhHHHHHHHHHHHHHHH-----hhHHH--HHHHHhc-cCCChhhccccccHHHHHHHhhcChHH Confidence 1211222 23333333321100 1111111 01110 0011111 110 0011 11111112 Q ss_pred HHHhcCC-----ceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHH-HHHHHhhcceeeeEEEeeccccccc-- Q lcl|NC_016071. 
76 VFVTKAF-----NDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARS-AATFNEYGFSIFEKVYRTESAPSKY-- 147 (516) Q Consensus 76 ~~v~~~~-----w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~-~lda~~~G~S~~Eivw~~~~~~~~~-- 147 (516) +.|..+. =.|.++ +. +..+ ..+.+.|+. ..|...... ..+|+-||.|+ ..||.-+.+...+ T Consensus 70 ~~Vd~~a~rl~~~Gf~~~-d~-~~~~----~~l~~iw~~----N~ld~~~~~~~~~al~~G~sf-~~V~~~~d~~~~~~i 138 (474) T protein:vir:81 70 KAVDALARRCNLEGFVWP-DG-DLDS----LGGTEVVDD----NHLLSEIDSAIVAAMQHGPAF-LINTVGEDDEPEALI 138 (474) T ss_pred HHHHHHHhhhcccceECC-CC-Cccc----hHHHHHHHh----cChhHHHHHHHHHHHhhCcee-EEEecCCCCCceeEE Confidence 2222210 012222 11 1111 123444432 125554544 55799999996 6788644331111 Q ss_pred ------ccceeeccccccCchhcccccceeecCCCceeeec---cccccccccccccccccccccccccccccCCCcccc Q lcl|NC_016071. 148 ------AGYITIDKIAFRPQSSLSRSKPWVFDEDGRTLKGI---YQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFI 218 (516) Q Consensus 148 ------~g~~~~~~l~~r~q~ti~~~~~f~~~~dg~~l~~~---~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i 218 (516) +....++....++..-+ ..+..+.+|...... ........ .......|.......+-| + T Consensus 139 ~~~sp~~~~~~~D~~~~~~~~al---~~~~~~~~g~~~~~~ly~~~~~~~~~-------~~~~~~~w~~~~~~~~~g--v 206 (474) T protein:vir:81 139 HVKDASEATGEWNRRRRGLNNLL---SIIDKDKEGKVLSLALYLDNETVTAQ-------RDKATLKWQVDRDEHVYG--V 206 (474) T ss_pred EEeccceEEEEEeCCCCcceeee---EEEEEcCCCcEEEEEEEeCCcEEEEE-------EcCccceeeeccCCCCCC--c Confidence 11222232222222211 124455555422111 11111000 000111111111122222 4 Q ss_pred ccccEEEEeecCcCCccccchhH-HHHHH--HHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHH Q lcl|NC_016071. 219 PINKLMVMSLGGTESNPAGVSPL-VGCYR--AFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQ 295 (516) Q Consensus 219 P~~k~i~~~~~~~~g~p~G~gLl-r~~~~--~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~ 295 (516) | ++.|.++++-+.|+|.|-+ +.+-. --+.| .+-.-+...|=+.+|-.++.|. ...+....+..... 
T Consensus 207 P---vV~~~n~~~~~~~~G~s~i~e~v~~l~da~~r--~~~~~~~~~e~~a~pqr~i~G~------~~~~~~d~d~~~~~ 275 (474) T protein:vir:81 207 P---AQVLPYKPAPKRPFGQSRITKPMMGLQDAGVR--ELARREGHMDVFSYPEFWLLGA------DESALKNADGTIKS 275 (474) T ss_pred c---eEEecccccccCcCCccccchhHHHHHHHHHH--HHHHHHHHHHHhcchhheeecC------Chhhcccccccccc Confidence 4 6788889889999998743 33311 11111 1111223344445554444431 11110000101111 Q ss_pred HHHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccc----c---CCc Q lcl|NC_016071. 296 GLMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINL----G---NDG 368 (516) Q Consensus 296 ~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts----~---~~~ 368 (516) .....+. .-..+|.+-+.++.+....++-..+. .+...+++.++.-|. .+.+.|-+. + -++ T Consensus 276 ~~~~~~~-------~i~~~~~d~d~~~~~~~~~~~~q~~~----a~l~~~~~~l~~~~~-~~a~~t~iP~~~lG~~~~~n 343 (474) T protein:vir:81 276 VWEARLG-------RIKGLPDDADADIPQLARADVKQFPA----ASPDAHWSDINGLAK-LFAREASLPDTAVAISGLSN 343 (474) T ss_pred hhhhhHH-------HHhcCCCcccccccccccccccccCC----CChhHHHHHHHHHHH-HHHhhhCCCHHHhccccccc Confidence 1111111 11234444333222221222222221 123344555543332 233333221 1 122 Q ss_pred cchh-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCcccc-----ceEEecCcCchhHHHHHHHHHHHH Q lcl|NC_016071. 369 QGSY-NLSESKQSIHGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDM-----PKLKPGLIQEVDMEGFSKFVQRIG 442 (516) Q Consensus 369 ~GS~-Al~~vh~ev~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~-----P~~~~~~~~~~dl~~~a~~~~~L~ 442 (516) .+|- |+...+... ....+.-.+.+...+. ++++..+.+.+.+..+... 
-.++|...+...+.+.|+++.||+ T Consensus 344 p~SaeAi~a~~~~l-~~kae~k~~~fg~~l~-~~~rla~~i~~~~~~~~~~~~~~~~~v~W~d~~~~s~a~~aDa~~Kl~ 421 (474) T protein:vir:81 344 PTSAESYDASQYEL-IAEAEGAVDDFTPALR-KAFIRALAMKNKVAIDEIPDEWKSIDAKWRDPRYLSKSAQADAGMKQL 421 (474) T ss_pred ccHHHHHHHHHHHH-HHHHHHHHHHHHHHHH-HHHHHHHHHhCCCCccccchhhccceeEecCCCccCHHHHHHHHHHHH Confidence 2333 444333333 2233444456666774 5777777775432212111 245677778888899999999999 Q ss_pred hCCcccccHHHHHHHHHHcCCCCCCCcccccCcccccCCCCCCcccccccccCCCCCccc Q lcl|NC_016071. 443 AVGYLPKTPTVINKILEVGGFDEEIPEDMSTDELLKLLGQDTSRSGDGMTAGSNGNGTGK 502 (516) Q Consensus 443 ~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 502 (516) .+|..+++ .+-+++.+|+.+.+-+..-... ....+. ..-+.........++++ T Consensus 422 ~a~~~~~~---~~~~~~~lg~t~~~i~~~~~~~-~~~~~~---~~~~~l~~~~~~~~~aq 474 (474) T protein:vir:81 422 AAVPWLAE---TEVGLELIGLTPQQARRAMADK-RRVQGR---GTLQALIDRSNNGATAQ 474 (474) T ss_pred hcccCCCc---HHHHHhhcCCCHHHHHHHHHHH-HHHhHH---HHHHHHHhcCCCCCCCC Confidence 99965543 3456788899743211100000 000000 00011111111122222 No 144 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=96.80 E-value=0.00033 Score=39.68 Aligned_cols=433 Identities=9% Similarity=0.000 Sum_probs=158.3 Q ss_pred CCccccCcccccchhhh-cccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhhChHHHHHHHHHHHHHh Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNE-NLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQDHTVSTALDTKYVFVT 79 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~-~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~~v~ 79 (516) |.+... .+. +.... +...+.+..... ...............++.| + ..+...-++.+....+. 
T Consensus 38 i~~~~~--~~~-~~~~~YY~g~~~i~~~~~--~~~~~~~~~~~~~~~~~~r----------i-~~n~~~~ivd~~~~yl~ 101 (503) T protein:vir:59 38 IDEHNP--EPL-LKGVRYYMCENDIEKKRR--TYYDAAGQQLVDDTKTNNR----------T-SHAWHKLFVDQKTQYLV 101 (503) T ss_pred HHhhcH--HHH-HHHHHHhccccchhhccc--hhcccccccccccccccce----------e-ecchHHHHHHHHHhhhh Confidence 221100 011 11111 111110000000 0000000000000000000 0 12333344444444455 Q ss_pred cCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHH-HHHHHhhcceeeeEEEeecccccccccceeeccccc Q lcl|NC_016071. 80 KAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARS-AATFNEYGFSIFEKVYRTESAPSKYAGYITIDKIAF 158 (516) Q Consensus 80 ~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~-~lda~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l~~ 158 (516) +-+..+.+ + +++..++++.++++ .|.+.+.. ..++.-||.++ +.+|... +|.+.+.-+.| T Consensus 102 g~~~~~~~----~---d~~~~~~l~~~~~n-----~~~~~~~~~~~~~~~~G~~~-~~v~~d~------dg~~~i~~~~p 162 (503) T protein:vir:59 102 GEPVTFTS----D---NKTLLEYVNELADD-----DFDDILNETVKNMSNKGIEY-WHPFVDE------EGEFDYVIFPA 162 (503) T ss_pred cCCeeecc----C---cHHHHHHHHHHHhc-----CHHHHHHHHHHHHhhCCeEE-EEEeecC------CCceEEEEEcc Confidence 55544432 2 23556677776643 25555554 44688899986 4666433 34444333333 Q ss_pred cCchhcccccceeecC--CCceeeeccccccc---------ccccccc----------cccccccccccccc-ccCCCcc Q lcl|NC_016071. 159 RPQSSLSRSKPWVFDE--DGRTLKGIYQSKMA---------FANFQNG----------LTQISSAMSLVTNL-TSSADEV 216 (516) Q Consensus 159 r~q~ti~~~~~f~~~~--dg~~l~~~~q~~~~---------~~~~~~~----------~~~~~~~~~~~~~~-~~~~~~~ 216 (516) +.-.. .|++ +++.+..++-.... ...|..+ .+............ .....+. 
T Consensus 163 ~~~~~-------i~d~~~~~~~~~~ir~~~~~~~~~~~~~~~evy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 235 (503) T protein:vir:59 163 EEMIV-------VYKDNTRRDILFALRYYSYKGIMGEETQKAELYTDTHVYYYEKIDGVYQMDYSYGENNPRPHMTKGGQ 235 (503) T ss_pred ceeEE-------EEeCCCCCceEEEEEEEEEecCCCceEEEEEEEeCCcEEEEEEcCCcccccccccccccccceeecce Confidence 21100 1111 12222222110000 0000000 00000000000000 0000111 Q ss_pred cccccc--EEEEeecCcCCccccchhHHHHHHHHHHH-HHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHH Q lcl|NC_016071. 217 FIPINK--LMVMSLGGTESNPAGVSPLVGCYRAFREK-ILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEM 293 (516) Q Consensus 217 ~iP~~k--~i~~~~~~~~g~p~G~gLlr~~~~~~~fK-~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~ 293 (516) +.+..+ ++.| .+|+.|.|.+..+ ..++-- +..+...+..++.+..++.++++..+ .+ +.+. T Consensus 236 ~~~~~~vPiv~~-----~nn~~~~sd~~~~-~~liDa~d~~~s~~~~~~~~~~~~~~v~~g~~~------~~----~~~~ 299 (503) T protein:vir:59 236 AIGWGRVPIIPF-----KNNEEMVSDLKFY-KDLIDNYDSITSSTMDSFSDFQQIVYVLKNYDG------EN----PKEF 299 (503) T ss_pred eccCCccceEEe-----cCCCCCCcchhhh-HHHHHHHHHHHHHHHHHHHHhcCCeeEeecCCc------cc----cchh Confidence 111111 2222 2567899988874 344433 33455566667888888888775321 11 1111 Q ss_pred HHHHHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhh Q lcl|NC_016071. 294 VQGLMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYN 373 (516) Q Consensus 294 l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~A 373 (516) ...+ .....+.+|.+.+ ++++..+. 
....+...++.+.+.|.+.--+..++.+..++.+.+ T Consensus 300 ~~~~---------~~~~~~~~~~~~~--------~~~l~~~~--~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~~Sg 360 (503) T protein:vir:59 300 TANL---------RYHSVIKVSGDGG--------VDTLRAEI--PVDSAAKELERIQDELYKSAQAVDNSPETIGGGATG 360 (503) T ss_pred hhhh---------hcccceeccCCCc--------ceeEeccC--CHHHHHHHHHHHHHHHHHHhcccCCCcccccccccH Confidence 1111 1122344565543 34443332 233467788888888877765554444332221111 Q ss_pred HH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-c--CCcC-CccccceEEecCcCchhHHHHHHHHHHHHhCCccc Q lcl|NC_016071. 374 LS-ESKQSIHGHFVQRDIDIIVEAFNKNLIPQLLAL-N--DIRL-SDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLP 448 (516) Q Consensus 374 l~-~vh~ev~~~~~~aDa~~i~~~ln~~li~~lv~l-N--~~~~-~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~ 448 (516) .+ +....-....+..-.+.+...|. ++++.++.+ + ...- ....-..+.|...-+.|..+.++++.+|+.+|++. T Consensus 361 ~Ai~~~~~~l~~k~~~~~~~~~~~l~-~~~~~i~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~~~~~~~~kl~~~GiiS 439 (503) T protein:vir:59 361 PALENLYALLDLKANMAERKIRAGLR-LFFWFFAEYLRNTGKGDFNPDKELTMTFTRTRIQNDSEIVQSLVQGVTGGIMS 439 (503) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhccCcccccccceeEEeCCCCCCCHHHHHHHHHHHHhCCCCc Confidence 11 11111111122333334455553 355554443 2 1110 11122478899999999999999999999999754 Q ss_pred ccHHHHHHHHHHcCC-CCCCCcccccCcccccCCCCCCcccccccccCCCCCcccccccccchhhhhcC Q lcl|NC_016071. 449 KTPTVINKILEVGGF-DEEIPEDMSTDELLKLLGQDTSRSGDGMTAGSNGNGTGKISSTRDNSVSNMDN 516 (516) Q Consensus 449 ~~~~~~~~i~e~~Gl-p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~ 516 (516) .+.+.+.++. 
+.+..+-+-...+.....+......+.......++.....+....++.+..|+ T Consensus 440 -----~et~l~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 503 (503) T protein:vir:59 440 -----KETAVARNPFVQDPEEELARIEEEMNQYAEMQGNLLDDEGGDDDLEEDDPNAGAAESGGAGQVS 503 (503) T ss_pred -----hHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhhccccCccCCCCCCCcCCCCCCcccCCCCCCcC Confidence 3455566543 22211100000000000000000000000000000000011111111122222 No 145 >protein:vir:78393 Length: 489 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110831;genbank:gi:134288592;genbank:GeneID:5179656 Probab=96.75 E-value=0.00037 Score=39.42 Aligned_cols=432 Identities=12% Similarity=0.066 Sum_probs=167.8 Q ss_pred CCccccCcccccchhhhccc-CCCCcccccchHHHHHHHHHH-HhhcccccCC-cc--cHHHHHHHh-h---ChHHHHHH Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLA-VSRLRTGELGSGALSQLRAES-EVMKVEELRW-PC--FLATVEAMK-Q---DHTVSTAL 71 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~-~~~~~~~e~g~~~~~~~~~~~-~~~~~~~lr~-~~--~~~~y~~m~-~---D~~v~s~l 71 (516) |-|---+.+.|..+--++-. .|.+.+- .....|.. -.++...|.. ++ .-.-|+.-+ | =+++...+ T Consensus 1 ~~~~~~~~~~V~~~hp~y~a~~~~W~~i------rd~~~G~~~~~~r~~yl~~~~~~~~e~~Y~~rl~rA~~~n~~~~tl 74 (489) T protein:vir:78 1 MLTENGQGSGVKTKHREWLHYAPKWQKV------RHALAGELVSYLRNVGLNEPDKAYGEARQAEYEAGGIVYNFTRRTL 74 (489) T ss_pred CccCCCccCCCCccCHHHHHHHHHHHHH------HHHhcCcccccccCCCCCCCCCCCChHHHHHHHhccccCChHHHHH Confidence 66655555555543333221 1111100 01111210 0111111211 11 112254433 2 46666666 Q ss_pred HHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccC-cCCHHHHHHHHH-HHHhhcceeeeEEEeecccc----- Q lcl|NC_016071. 72 DTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLAN-QQTLRDIARSAA-TFNEYGFSIFEKVYRTESAP----- 144 (516) Q Consensus 72 ~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~-~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~----- 144 (516) +.--..|.+.+..++++ .. ++.++++... ..+++.+++.++ .++.||.+.+=+-+-..+.. 
T Consensus 75 ~~l~G~vfrk~p~~~~p--------~~----l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~T~ade 142 (489) T protein:vir:78 75 SGMVGSVMRKEPEINIP--------KE----LEYLLKNADGSGVGLIQHAQDTLMEIDSVGRGGLLVDAPETGAATAAEQ 142 (489) T ss_pred HHHhchhhcCCcceecc--------HH----HHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeeCCCCCcCHHHH Confidence 66666666665544321 12 2334443322 246888888866 58889999876665332210 Q ss_pred ----cccccceeeccccccCchhcccccceeecCCC-c---eeeeccccc------ccccccccccccc----------- Q lcl|NC_016071. 145 ----SKYAGYITIDKIAFRPQSSLSRSKPWVFDEDG-R---TLKGIYQSK------MAFANFQNGLTQI----------- 199 (516) Q Consensus 145 ----~~~~g~~~~~~l~~r~q~ti~~~~~f~~~~dg-~---~l~~~~q~~------~~~~~~~~~~~~~----------- 199 (516) .+| .+....+..|-. |.++..| + .++.++... ..|.........+ T Consensus 143 ~~~~~rP-------y~~~~~~~~Iin---W~~~~v~G~~~Lt~v~lrE~~~~~d~~~~f~~~~~~q~RvL~~~~~g~~~~ 212 (489) T protein:vir:78 143 NAGLLNP-------TIAFYTTENIVN---WRLTRVGSVNRVTMVVLRETWEYNEPGNEFETKYGEQYRVLDIDSDGNYRQ 212 (489) T ss_pred HHhcCCc-------EEEEechhhhcC---ceeeeeCCccceeEEEEEEeEEeecCCCCccceeEEEEEEEecCCCcceEE Confidence 011 111222223322 4454433 1 112222210 0000000000000 Q ss_pred --------ccccccccccccCCCccccccccEEEEeecCcCCccccchhHHHHHH----HHHHHHHHHHHHHHHHhhccc Q lcl|NC_016071. 200 --------SSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPAGVSPLVGCYR----AFREKILIENLETIGASKDLG 267 (516) Q Consensus 200 --------~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~G~gLlr~~~~----~~~fK~~~~~~w~~~~er~g~ 267 (516) +........+.....+..++.=-|+++-. ...+--.+...|..++. ||... .--.+..+ .-+. T Consensus 213 ~~~r~~~~g~~~~~~~~~~~~~g~~~l~~IPfv~~~~-~~~~~~~~~pPLl~LA~lni~Hy~~s--sd~~~~l~--~~~~ 287 (489) T protein:vir:78 213 RLFRFDAEGGAQEDVVEIYPDLGESLRGVIPFTFIGA-TNNDATIDDAPLLPLAELNIGHYRNS--ADNEESSF--VVGQ 287 (489) T ss_pred EEEEeecCCcccceeeEEeccCCCCccCeeeEEEEec-CCCCCCCCcCchHHHHHHHHHHhhhh--hHHHHHHH--Hccc Confidence 00000000000001122222222333322 22333334544555544 33322 11222222 3346 Q ss_pred cceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHH Q lcl|NC_016071. 
268 GIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVN 347 (516) Q Consensus 268 ~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~ 347 (516) |++++++... .+ ++ .+.... ...++.|..++..+|.+.+ ..+++.++++- .++.++ T Consensus 288 P~l~i~G~d~--------~~-~~--~~~~~~--~~~i~~g~~~~~~lp~~~~--------~~~ie~~~~~~---~r~~l~ 343 (489) T protein:vir:78 288 PTLFIYPGEN--------LT-PQ--AFKEAN--PNGIKFGSRRGHNLGYGGS--------AQLIQAGENNL---ARQNML 343 (489) T ss_pred ceeeeecCcc--------CC-cc--cccccC--ccceeeCCcccccCCCCCC--------cceeccCcchH---HHHHHH Confidence 6666554211 00 00 010000 1123457778888887753 44555554321 233333 Q ss_pred HHHHHHHHHHhc-ccccccCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceE--Eec Q lcl|NC_016071. 348 SRKKAILDRFGA-GFINLGNDGQGSYNLSESKQSIHGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKL--KPG 424 (516) Q Consensus 348 ~~d~~Isk~iLG-qtLts~~~~~GS~Al~~vh~ev~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~--~~~ 424 (516) -...+|. .+| ..++.+...+++. +.........++.+-+..+++.++ +++++++.+-+.. .+ .-+.| ..+ T Consensus 344 ~le~qm~--~lGa~l~~~~~~~Ta~~--~~~~~~~~~S~L~~~a~~~e~al~-~~l~~~a~w~G~~-~~-~~~~i~~n~d 416 (489) T protein:vir:78 344 DKEQQAI--QIGAQLITPTQQITAQS--ARIQRGADTSVMATIARNVSQAYT-DALRWVAVMLGKP-ED-TEVEFRLNMD 416 (489) T ss_pred HHHHHHH--HHhhhhccCCcchhHHH--HHHHHHHhhHHHHHHHHHHHHHHH-HHHHHHHHHcCCC-CC-CceEEEeecc Confidence 3334433 344 4443322122222 222334446678888899999996 5889999985321 11 11222 212 Q ss_pred C-cCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccccCcccccCCCCCCcccccccccCCCCCcccc Q lcl|NC_016071. 425 L-IQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMSTDELLKLLGQDTSRSGDGMTAGSNGNGTGKI 503 (516) Q Consensus 425 ~-~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 503 (516) . ....| ....+++-.+...|.+.. 
.....+++ +-|+..+..+++.......+.+-...-.++ -+.+ T Consensus 417 F~~~~~d-~~~~~al~~~~~~G~is~-~t~~~~L~-~~gv~d~~~e~~~~ei~~~~~~~~~~~~g~----------~~~~ 483 (489) T protein:vir:78 417 FFLEPMT-AQDRAAWMADINAGLLPA-TAYYAALR-KAGVTDWTDADIKDAVADQPLPVATEVQGE----------IPQS 483 (489) T ss_pred cCcccCC-HHHHHHHHHHHhcCCCCH-HHHHHHHH-hCCCCCccHHHHHHHHhhcCCCcccCCccc----------CCCC Confidence 1 22233 233556677788897654 33445553 457765433222211111111100000011 0111 Q ss_pred cccccc Q lcl|NC_016071. 504 SSTRDN 509 (516) Q Consensus 504 ~~~~d~ 509 (516) ++..+. T Consensus 484 ~q~~~~ 489 (489) T protein:vir:78 484 AQQQEK 489 (489) T ss_pred cccccC Confidence 111111 No 146 >protein:vir:10321 Length: 495 # NCBI annotation: ORF23 # Family: family:all:47 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758916;genbank:gi:27311190;genbank:GeneID:956137 Probab=96.63 E-value=0.00046 Score=38.89 Aligned_cols=445 Identities=13% Similarity=0.061 Sum_probs=169.9 Q ss_pred CCccccC--------cccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhh-ChHHHHHH Q lcl|NC_016071. 1 MSTRFAQ--------PSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQ-DHTVSTAL 71 (516) Q Consensus 1 ~~~r~~~--------~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~-D~~v~s~l 71 (516) |.=..+. ..+......+..+..+ +..-.+.. + .+.+-.++++ ....--+++.+ ++++++++ T Consensus 1 m~~~~~~~~a~~~~~~~~~~~~~y~aa~~~~-~~~~~~~~-----s--~d~~~~~~~~--~lr~RaRdl~rNn~~a~~av 70 (495) T protein:vir:10 1 MNMTPSGYQSLASGLLVPVGASAYEGASGGH-RWQDIGDY-----G--PDTAVASGIQ--TLRARSHHNVRNNPWATNAV 70 (495) T ss_pred CCcccccccccchhhhhHHHhhhhhccccCc-ccCCCCCC-----C--hhHHHHHHHH--HHHHHHHHHHhcChHHHHHH Confidence 2111110 0000000111100000 00000000 0 0001011100 01111244444 89999999 Q ss_pred HHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhc------cCcCCHHHHHHHHHHH-HhhcceeeeEEEeecccc Q lcl|NC_016071. 
72 DTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNL------ANQQTLRDIARSAATF-NEYGFSIFEKVYRTESAP 144 (516) Q Consensus 72 ~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~------~~~~~~~~~l~~~lda-~~~G~S~~Eivw~~~~~~ 144 (516) +.....|-+..+...+ ..+ +++..+.|+..|+.. ....+|+.+.+.++.+ +.-|=++.=+.|.... T Consensus 71 ~~~~~~vVG~Gi~p~~--~~~---~~~~~~~ie~~w~~wa~~~D~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~-- 143 (495) T protein:vir:10 71 ATWVAAAVGNGLTPRW--RMK---EQELRQELQELWGDWVNEADFDEVQSFYGLQALVVRTVINSGEAFVIKKPRPLS-- 143 (495) T ss_pred HHHHHhhcCCCccccc--CCc---hHHHHHHHHHHHHHhhcCcccccccCHHHHHHHHHHHHHhCCceEEEEeecccC-- Confidence 9999999888665443 332 345566666666543 3346788888877764 4557776656665432 Q ss_pred cccccceeeccccccCchhcccc-------------cceeecCCCceeeecccccccccccccccccccccccccccccc Q lcl|NC_016071. 145 SKYAGYITIDKIAFRPQSSLSRS-------------KPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTS 211 (516) Q Consensus 145 ~~~~g~~~~~~l~~r~q~ti~~~-------------~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~ 211 (516) +|.-+.-+|...++.-|.-+ .=..||.+|+.+-.+- . ..+|.... .... T Consensus 144 ---~g~~~~~~lqliepd~l~~~~~~~~~~~g~~i~~GIe~d~~Gr~vaY~i-----------~--~~hpgd~~--~~~~ 205 (495) T protein:vir:10 144 ---EGLSVPLQLQIIEPDMLASDIPDETLPSGGYVKGGIRFSNGGKRKAYCF-----------Y--RNHPAESS--LIGD 205 (495) T ss_pred ---CCCccceEEEEechhhcCCCCCCCCCCCCCEEEeceEECCCCceEEEEE-----------e--ecCCCccc--cccc Confidence 11111112223333333211 1123344443322110 0 00111000 0011 Q ss_pred CCCccccccccEEEEeecCcCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceee-e--ecccccccccCCCCH Q lcl|NC_016071. 212 SADEVFIPINKLMVMSLGGTESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIEL-K--IPSQILNKAAIDPKS 288 (516) Q Consensus 212 ~~~~~~iP~~k~i~~~~~~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~-~--~pp~~~~k~~~~~~~ 288 (516) ..+-+.||... |+|.+..+.+..-|.++|.++-..-.+..+ ....++.. |-.+-|..+ + .++...+....++.. 
T Consensus 206 ~~~~~rvpA~~-vlH~f~~r~gQ~RGis~la~i~~l~~l~~y-~dael~~a-~i~A~~~~fi~~~~~~~~~~~~~~~~~~ 282 (495) T protein:vir:10 206 PVDTVWIKAEH-VLHVTVLTVRSDAGAPWFQLLLRLNELDQY-EDAELVRK-KTAALFAAFIQEATADSTGGPTIGQPKR 282 (495) T ss_pred ccceeeechhh-eEeccccCCCcccCcchhHHHHHHHHhhHH-HHHHHHHH-HHhhhheeeeecCCCccccccccCcccc Confidence 12335578765 567787788888898888655432222221 11111111 111212111 1 111111111000110 Q ss_pred HHHHHHHHHHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhc--ccccccC Q lcl|NC_016071. 289 PESEMVQGLMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGA--GFINLGN 366 (516) Q Consensus 289 ~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLG--qtLts~~ 366 (516) +... .....+ +.-....++.|. +|++.+.+..+ .+|..|++..-+.|+..+.- +.||.+- T Consensus 283 ~~~~------~~~~~l--~pG~i~~L~pGe--------~i~~~~p~~p~--~~~~~f~~~~lr~iaaglGi~Ye~ltgD~ 344 (495) T protein:vir:10 283 SKGG------KRITGL--NPGTLQYLQPGQ--------EVKFSNPADVG--TTYEPWLRYQLLSIAKGYGITYEMLTGDL 344 (495) T ss_pred ccCc------ccceec--CCceeeecCCCC--------eeeeeCCCCCC--CCHHHHHHHHHHHHHhhcCCCHHHHhccc Confidence 0000 000111 111233445564 45565554333 35778888888898887643 3344433 Q ss_pred CccchhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHH---hcCC-cCCc-----cccc--eEEecCcCchhHHHH Q lcl|NC_016071. 367 DGQGSYNLSESKQSIHGHFVQRDI-DIIVEAFNKNLIPQLLA---LNDI-RLSD-----EDMP--KLKPGLIQEVDMEGF 434 (516) Q Consensus 367 ~~~GS~Al~~vh~ev~~~~~~aDa-~~i~~~ln~~li~~lv~---lN~~-~~~~-----~~~P--~~~~~~~~~~dl~~~ 434 (516) ++ .||+.+..-..-+....+... .++...+.+-+..++++ +++. 
..|+ ..+- .+....-...|..+- T Consensus 345 s~-~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~~~w~~p~~~~vDP~Ke 423 (495) T protein:vir:10 345 RG-VNYSSIRAGLLEFRRLCQQVQHHMIIHQFCRPVGRWFMDFAVASGAVVIPDYLQRRRYYNRVSWRTPRWEEVDPLKK 423 (495) T ss_pred cc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCchhhhHhhhccccccCCccccChHHH Confidence 22 456555433333333333332 23333333333333333 3321 1111 0011 122233344566666 Q ss_pred HHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccccCc---cc-ccCCCCCCcccccccccCCCCCcccccccccc Q lcl|NC_016071. 435 SKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMSTDE---LL-KLLGQDTSRSGDGMTAGSNGNGTGKISSTRDN 509 (516) Q Consensus 435 a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~~~---~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 509 (516) +++....++.|+... ++.+++ .|.....--++-... .. .-.+-+..+.... .+++ ...+.+.+...|+ T Consensus 424 ~~A~~~~i~~G~~s~----~~~~a~-~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~p~~~~-~~~~-~~~~~~~~~~~~e 495 (495) T protein:vir:10 424 HLADLGDVRAGFAPI----SDKQAE-RGYDMEELFDMISDANQLIDEYDLRLDSDPRYVN-GSGA-EQKSVMEAALNNE 495 (495) T ss_pred HHHHHHHHHcCCCCH----HHHHHH-cCCCHHHHHHHHHHHHHHHHHcCCCCCCCCCcCC-CccC-CCCCCCCCCCCCC Confidence 788889999998665 233443 355321100000000 00 0000000000000 0000 0011111111122 No 147 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=96.31 E-value=0.00075 Score=37.71 Aligned_cols=418 Identities=10% Similarity=0.016 Sum_probs=160.0 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHh--hcccccCCcccHHHHHH------------------ Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEV--MKVEELRWPCFLATVEA------------------ 60 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~--~~~~~lr~~~~~~~y~~------------------ 60 (516) |.|+....-...+..+....+.. -+..++.. 
.+.+ |.-+..+.|+- T Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~~------------~i~~~i~~~~~~~~--~~~~~~~YY~g~~~i~~~~~~~~~~~~~~ 70 (472) T protein:vir:93 5 QPTQTEIFDAIVRTNNKPETLEE------------MIVRYIKQHLEKLP--EISIGQEYYEQRPDIVKEPKPVDATGAVD 70 (472) T ss_pred CCcchhhhhceeeecCchhhHHH------------HHHHHHHHHHHHHH--HHHHHHHHhccccccccccchhhcccccc Confidence 43333222222211111100000 01111110 0000 00011111110 Q ss_pred -Hhh-----ChHHHHHHHHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHH-HHHHhhccee Q lcl|NC_016071. 61 -MKQ-----DHTVSTALDTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSA-ATFNEYGFSI 133 (516) Q Consensus 61 -m~~-----D~~v~s~l~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~-lda~~~G~S~ 133 (516) .+. .+...-++.+....+.+-+..+.+ .+.++.++++.++++ .+.+.+.++ .++.-||.+ T Consensus 71 ~~~~~~ri~~n~~~~ivd~~~~~l~g~~~~~~~-------~d~~~~~~l~~~~~n-----~~~~~~~~~~~~~~~~G~~- 137 (472) T protein:vir:93 71 PLKPDDRMITNFHANLVDQKVSYIVGKPIAFKH-------TDDEVVKRIDEVLGN-----RFDDKLHSVLTGASNKGIE- 137 (472) T ss_pred ccccccccccchHHHHHHHHhhhhcccCeeecc-------CChHHHHHHHHHHhc-----cHHHHHHHHHHHHhhcCeE- Confidence 000 233333444444444444433322 234566778877753 255666554 578889985 Q ss_pred eeEEEeecccccccccceeeccccccCchhcccccceeec--CCCceeeeccccccccc----ccccccc-ccccccccc Q lcl|NC_016071. 134 FEKVYRTESAPSKYAGYITIDKIAFRPQSSLSRSKPWVFD--EDGRTLKGIYQSKMAFA----NFQNGLT-QISSAMSLV 206 (516) Q Consensus 134 ~Eivw~~~~~~~~~~g~~~~~~l~~r~q~ti~~~~~f~~~--~dg~~l~~~~q~~~~~~----~~~~~~~-~~~~~~~~~ 206 (516) ++++|... +|.+.+..+.|+- +. ..|+ ..++++..++....... .+..... ......+.. 
T Consensus 138 ~~~v~~d~------d~~~~i~~~~p~~---~~----~i~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 204 (472) T protein:vir:93 138 WLHPYLDE------EGEFKLFRVPAEQ---GI----PIWTDKEHEELEAFIRMYKLENETKVEYWDKVTVNYYVYENGSL 204 (472) T ss_pred EEEEEECC------CCceEEEEEcccc---eE----EEEcCCCCCceEEEEEEEEeecceeEEEEecCeEEEEEEecCee Confidence 56787543 3444333332221 11 1122 12333333221110000 0000000 000000000 Q ss_pred cc-cc-------cCCCccccccccEEEEeecCcCCccccchhHHHHHHHHHHH-HHHHHHHHHHHhhccccceeeeeccc Q lcl|NC_016071. 207 TN-LT-------SSADEVFIPINKLMVMSLGGTESNPAGVSPLVGCYRAFREK-ILIENLETIGASKDLGGIIELKIPSQ 277 (516) Q Consensus 207 ~~-~~-------~~~~~~~iP~~k~i~~~~~~~~g~p~G~gLlr~~~~~~~fK-~~~~~~w~~~~er~g~~~~v~~~pp~ 277 (516) .. .. .......+..--++.|+ +|+.|.|.+.. ..+.+-- +..+...+..++-+..+..++++... T Consensus 205 ~~~~~~~~~~~~~~~~~~~~~~vPvv~~~-----nn~~g~s~~e~-v~~liDa~~~~~s~~~~~~~~~~~~~~~~~g~~~ 278 (472) T protein:vir:93 205 IPDYSNNLENSKTHFSTGSWGKIPFIPFK-----NNDLEISDIFM-YKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDD 278 (472) T ss_pred eecccccccccccccccCCCCCcceEEec-----CCCCCCCchhh-hHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCc Confidence 00 00 00000011111233332 36789999987 4444432 33555666667777888877765321 Q ss_pred ccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHH Q lcl|NC_016071. 278 ILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRF 357 (516) Q Consensus 278 ~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~i 357 (516) ....+ ... .+. ....+.+|.+.+ ++++.... 
....+..+++.+.+.|...- T Consensus 279 --------~~~~~--~~~----~~~-----~~~~~~~~~~~~--------~~~l~~~~--~~~~~~~~~~~l~~~i~~~s 329 (472) T protein:vir:93 279 --------QELPE--FKR----LLR-----YYGAIKVSDNGG--------VDTIQVEV--PVENSKKYLDELYQKIMLFG 329 (472) T ss_pred --------ccchh--hHH----HHh-----hccccccCCCCc--------ceeEeecC--CHHHHHHHHHHHHHHHHHHh Confidence 11111 111 111 111233465543 34443322 23347778888888887765 Q ss_pred hcccccccCCccchhhHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHH Q lcl|NC_016071. 358 GAGFINLGNDGQGSYNLSESKQ--SIHGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFS 435 (516) Q Consensus 358 LGqtLts~~~~~GS~Al~~vh~--ev~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a 435 (516) -...++.++.++.+.+.| .+. .-....+..-.+.+...| +++++.++.+.+.... ..-..+.|....+.|..+.+ T Consensus 330 ~~p~~~~~~~~~n~Sg~A-l~~~~~~l~~ka~~~~~~~~~~l-~~~~~li~~~~~~~~~-~~~i~v~f~~~~p~~~~~~~ 406 (472) T protein:vir:93 330 QAVDFSSDKFGSAPSGVA-LEFLYTNLNLKADKLARKAKVAI-QELLWFVFEHFDIKGE-HKDVDISFNYNKVANTELQV 406 (472) T ss_pred CCCCCCccccccCchHHH-HHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhCCCcc-cceeeEEeCCCCCCCHHHHH Confidence 544445443222111111 111 111222233334455555 4566777776532221 12235778888899999999 Q ss_pred HHHHHHHhCCcccccHHHHHHHHHHcC-CCCCCCcccccCc----ccccCCCCCCcccccccccCCCCCcccccccccch Q lcl|NC_016071. 436 KFVQRIGAVGYLPKTPTVINKILEVGG-FDEEIPEDMSTDE----LLKLLGQDTSRSGDGMTAGSNGNGTGKISSTRDNS 510 (516) Q Consensus 436 ~~~~~L~~~G~~~~~~~~~~~i~e~~G-lp~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 510 (516) +++.+|+ |++. ++.+.+.++ ++.+..+-+-... ..+.... -.+.+...+...+.. .+.. T Consensus 407 ~~~~k~~--giis-----~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~---~~~~~~d~~~~~~~~------~~~~ 470 (472) T protein:vir:93 407 QTAQQSM--GIVS-----HETVLENHPFVEDLQAELERIEQEQMEYNKQLPN---LDDGGADGAQQQERS------NNKE 470 (472) T ss_pred HHHHHHh--ccCc-----hHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhccC---cCcccCCCCCCCCCC------Cccc Confidence 9999984 6533 344555654 3322111000000 0111110 000000000000000 0000 Q ss_pred hh Q lcl|NC_016071. 
511 VS 512 (516) Q Consensus 511 ~~ 512 (516) .+ T Consensus 471 ~e 472 (472) T protein:vir:93 471 SE 472 (472) T ss_pred CC Confidence 00 No 148 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=96.22 E-value=0.00086 Score=37.38 Aligned_cols=444 Identities=8% Similarity=0.001 Sum_probs=171.3 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHh---hcccccCC------cccHHHHH-------HH--- Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEV---MKVEELRW------PCFLATVE-------AM--- 61 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~---~~~~~lr~------~~~~~~y~-------~m--- 61 (516) +.+|+-.-+++.-.......+.... .+.+..++.. ...+.++. ++...++. .. T Consensus 17 ~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~i~~~i~~h~~~~~~rl~~l~~yY~g~~~~i~~~~~~~~~~~~~~ 88 (502) T protein:vir:48 17 LNLRFHRESRIRYRADNLEELMVNN--------WELLKNFINHHKLRQAPRIQELLDYARGENHDVLKSGRRKDNEMADK 88 (502) T ss_pred hhcccChhHHhhhcccchhhhcccc--------HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccc Confidence 5556655555443322222121110 0111111110 00111000 00000000 00 Q ss_pred -hhChHHHHHHHHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHH-HHHhhcceeeeEEEe Q lcl|NC_016071. 62 -KQDHTVSTALDTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAA-TFNEYGFSIFEKVYR 139 (516) Q Consensus 62 -~~D~~v~s~l~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~ 139 (516) .....-.-++......+.+-+..+++. +++....+.+++..++.+- .|..++.+++ ++.-||.+ ++++|. 
T Consensus 89 ki~~n~~k~Ivd~~~~yl~g~p~~~~~~---d~~~~~~~~~~l~~~~~~N----~~~~~~~~~~~~~~~~G~a-~~~v~~ 160 (502) T protein:vir:48 89 RAVHNYGRMISKFKTGYLAGNPIRVEYD---DNEDNSQNDDAIKRIGRIN----DIDTHNRNLIRDLSQTGRA-YEVIYR 160 (502) T ss_pred eeecchHHHHHHHHhhhhcccCeeEecC---CccchhHHHHHHHHHHhhc----CHhHHHHHHHHHHhhcCeE-EEEEEe Confidence 012344455555555566666666543 2334456677777777642 3666766544 68889975 478886 Q ss_pred ecccccccccceeeccccccCchhcccccceeecC--CCceeeeccccccccccccccccccccccccccccccC----- Q lcl|NC_016071. 140 TESAPSKYAGYITIDKIAFRPQSSLSRSKPWVFDE--DGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSS----- 212 (516) Q Consensus 140 ~~~~~~~~~g~~~~~~l~~r~q~ti~~~~~f~~~~--dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~----- 212 (516) -. +|.+.+..+.|+.- ...|++ +++.+..++-...........+..+-.+...+...... T Consensus 161 de------dg~~~i~~~~p~~~-------~~vydd~~~~~~~~~ir~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~ 227 (502) T protein:vir:48 161 SE------YDETRIKRLSPLET-------FVIYDNSLEDNSIAAVRYYNRGTLQNAKDVVEIYTNQHIYTLDASDSFNEI 227 (502) T ss_pred CC------CCceEEEEEcccce-------EEEEcCCCCCceEEEEEEEEEeecCCcEEEEEEEeCCeEEEEEeCCceeec Confidence 43 34444433332211 012222 22333322211100000000000000000000000000 Q ss_pred -CCccccccccEEEEeecCcCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHH Q lcl|NC_016071. 213 -ADEVFIPINKLMVMSLGGTESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPES 291 (516) Q Consensus 213 -~~~~~iP~~k~i~~~~~~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~ 291 (516) .....+..--++.| .+|+.|.|.+..+...-=--...+..++..++.+..++.++++.... ...... 
T Consensus 228 ~~~~~~~g~vPvv~~-----~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~-------~~~~~~ 295 (502) T protein:vir:48 228 SVTPHAFGTVPITEF-----LNNADGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLAL-------PQGMQA 295 (502) T ss_pred cceecCCCccceEEe-----cCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccc-------ccccch Confidence 00000111112333 24778999998743322222445666778888888888888764211 011111 Q ss_pred HHHHHHHHHHHHhhcccceEEEe-ccCcccc-cccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCcc Q lcl|NC_016071. 292 EMVQGLMADAANAHAGEQAYFIL-PSDMNAQ-GGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQ 369 (516) Q Consensus 292 ~~l~~l~~~~~~~~~g~~a~~ii-P~g~~i~-~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~ 369 (516) ..+ . ..+++. +...... ..+...++++.... ....+...++.+.+.|.+.--...++.++.++ T Consensus 296 ~~~---~----------~~~~~~~~~~~~~~~~~~~~d~~~l~~~~--~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~~~ 360 (502) T protein:vir:48 296 SDM---K----------RTRLMQLKPPKSADGKEGTVKAEYLTKSY--DVSGAEAYKTRLNKDIHVFTNTPDMSDNHFSG 360 (502) T ss_pred hhh---h----------hcceeeccccccccccccCcceeEeeecC--CHHHHHHHHHHHHHHHHHHhCCCCcCcccccc Confidence 101 0 011111 1110000 11223455554432 22236678899999998765444444433221 Q ss_pred chhhHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCC----ccccceEEecCcCchhHHHHHHHHHHHHh Q lcl|NC_016071. 370 GSYNLSESKQSI--HGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLS----DEDMPKLKPGLIQEVDMEGFSKFVQRIGA 443 (516) Q Consensus 370 GS~Al~~vh~ev--~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~----~~~~P~~~~~~~~~~dl~~~a~~~~~L~~ 443 (516) .+ +.-..+... ....+..-.+.+...|. ++++.++.+-...+. +..-..+.|....+.|..+.++++.+|. 
T Consensus 361 n~-Sg~Alk~~~~~l~~k~~~~~~~~~~~l~-~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~e~a~~~~kl~- 437 (502) T protein:vir:48 361 NA-SGEALKYKLFGLDQDRVDTQSQFTQGLK-RRYRLAARIGSLVNEFKDFDESRLKITFTPNLPKSLYEQVSILNDLG- 437 (502) T ss_pred Cc-hHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhhcccccccccccceEEeCCCCCcCHHHHHHHHHHHh- Confidence 11 111111111 11122223344455553 344554443111111 1122578899999999999999999984 Q ss_pred CCcccccHHHHHHHHHHcCC-CCCCCcccccCcc--cccCCCCCCcccccccccCCCCCcc--cccccccchhh Q lcl|NC_016071. 444 VGYLPKTPTVINKILEVGGF-DEEIPEDMSTDEL--LKLLGQDTSRSGDGMTAGSNGNGTG--KISSTRDNSVS 512 (516) Q Consensus 444 ~G~~~~~~~~~~~i~e~~Gl-p~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~d~~~~ 512 (516) |.+ + ++.+.+.++. +.+. +| .... ++...+...-..........+.+.. ..+.-+.+... T Consensus 438 -g~i-S----~et~l~~l~~v~D~~--~E-~~ri~~E~~~~~~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~~ 502 (502) T protein:vir:48 438 -GQV-S----QETALSLSGLVENPT--EE-LDKINEESSKIDFKGYPSYFYDNVGKYTDEVKETHTDDFERVYE 502 (502) T ss_pred -ccC-c----HHHHHHhCCCCCCHH--HH-HHHHHHHHHhhhhhcccccccccccccCCCccCCCCcCcCCCCC Confidence 643 3 3556677765 2221 11 1111 0000000000000000000010000 00001111111 No 149 >protein:vir:98853 Length: 219 # NCBI annotation: hypothetical protein # Family: family:all:196 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654729;genbank:gi:109302914;genbank:GeneID:4156058 Probab=95.80 E-value=0.0014 Score=36.17 Aligned_cols=212 Identities=10% Similarity=-0.034 Sum_probs=94.2 Q ss_pred eeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCc-CCccccchhHHHHHHHH Q lcl|NC_016071. 170 WVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGT-ESNPAGVSPLVGCYRAF 248 (516) Q Consensus 170 f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~-~g~p~G~gLlr~~~~~~ 248 (516) .+...||+.....+.. .....+....++.+..+ |..... .+..+|.+.+..+.... 
T Consensus 1 ~r~~~dg~~~y~~~~~----------------------~~~~~g~~~~~~~~eil-H~r~~~~~~~~~Glspi~~a~~~i 57 (219) T protein:vir:98 1 MRVCKDGNYKYLMKKS----------------------LYDTKSEIYEYNKNDVI-FIKLYDPMQQVYGSPDYVGGITSA 57 (219) T ss_pred CceeecCeEEEEEecc----------------------eecCCceeEEeccccEE-EecCCCCCCCcceecHHHHHHHHH Confidence 0112223221111000 00112234456666654 444433 45568999988877655 Q ss_pred HHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccce-EEEe--ccCcccccccc Q lcl|NC_016071. 249 REKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQA-YFIL--PSDMNAQGGEQ 325 (516) Q Consensus 249 ~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a-~~ii--P~g~~i~~~e~ 325 (516) ..-....++-..|....+.|=-+++.| . ..-+++. .+++++..+..+.+..+ .+++ |.|.. T Consensus 58 ~~~~aa~~~~~~~f~Ng~~p~gil~~~------~-~~l~~e~---~~~~~~~~~~~~g~~n~~~~~l~~~gg~~------ 121 (219) T protein:vir:98 58 LLNSDATIFRRRYYSNGAHMGFILYST------D-PDMTEEM---EDEIAERIRDSKGVGNFRSMFVNIAGGHP------ 121 (219) T ss_pred HHHHHHHHHHHHHHhcCCCCceEEEeC------C-CCCCHHH---HHHHHHHHHHhcCcccccceeEecCCCCc------ Confidence 544444444444554333322222221 1 1112222 33444444443333222 1233 22210 Q ss_pred cceeeeeccccCcchhHHHHHHHHHHHHHHHHhccccccc-C-CccchhhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016071. 326 YKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLG-N-DGQGSYNLSE-SKQSIHGHFVQRDIDIIVEAFNKNLI 402 (516) Q Consensus 326 ~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~-~-~~~GS~Al~~-vh~ev~~~~~~aDa~~i~~~ln~~li 402 (516) .-+++...+-+.....|.+.-++-..+|+.+.--.--.++ . .++++++-.+ .....-.+-+.--++.|++.||+++ T Consensus 122 ~G~~~~~~~~~~~d~qfle~rk~~~~eIa~~fgVPp~~lG~~~~~~~~~sn~eq~~~~f~~~tL~P~~~~ie~~ln~~~- 200 (219) T protein:vir:98 122 DGLKVIPIGDTGQKDEFANIKNISAQDVLTSHRFPPGLSGIIPVNTAGLGDPLKIREAYQADEVLPLQEIIAESINSDY- 200 (219) T ss_pred cceeEEEccCCHHHHHHHHHHHhhHHHHHHHhCCCHHHcccccCCCCCccCHHHHHHHHHHHHHHHHHHHHHHHhhhhh- Confidence 0133333333333444666666777889988866443332 1 1223343333 2233334445566666777777542 Q ss_pred HHHHHhcCCcCCccccceEEecCcCchhHH Q lcl|NC_016071. 
403 PQLLALNDIRLSDEDMPKLKPGLIQEVDME 432 (516) Q Consensus 403 ~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~ 432 (516) + ++ . -.++.|+.....|+. T Consensus 201 --~--~~-----~--~~~~~F~~~~~~d~~ 219 (219) T protein:vir:98 201 --E--IK-----S--ALKVNFKQPEKRDKN 219 (219) T ss_pred --c--CC-----C--ccEEeecCcccccCC Confidence 1 11 1 136788887777765 No 150 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=95.77 E-value=0.0015 Score=36.10 Aligned_cols=410 Identities=12% Similarity=0.043 Sum_probs=148.0 Q ss_pred Ccccccch--HHHHHHHHHHH--------hhcccccC-CcccHHHHHHHhhChHHHHHHHHHHHHHhcCCceeeeCCCCC Q lcl|NC_016071. 24 LRTGELGS--GALSQLRAESE--------VMKVEELR-WPCFLATVEAMKQDHTVSTALDTKYVFVTKAFNDFKVLYNRD 92 (516) Q Consensus 24 ~~~~e~g~--~~~~~~~~~~~--------~~~~~~lr-~~~~~~~y~~m~~D~~v~s~l~~Rk~~v~~~~w~i~~~~~~d 92 (516) +...+.-. .-++.+..... -+-.+.++ .+. ..-+++ +|-.+ +.+-=+..|....=.+.+. +-. T Consensus 1 ~~~~~~~~i~~l~~~~~~~~~r~~~l~~Yy~G~~~i~~~~~--~~~~~~-~~~k~--~~n~~~~ivd~~~~~l~~~-g~~ 74 (441) T protein:vir:80 1 MNSDELALIEGMYDRIQRLSSWHCCIEGYYEGSNRVRDLGV--AIPPEL-QRVQT--VVSWPGIAVDALEERLDWL-GWT 74 (441) T ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcchhcCc--ccchhh-hhhhh--hcchHHHHHHHHHhhhccc-ccc Confidence 22121100 00111111100 00011111 000 000111 12211 1111122222110011111 101 Q ss_pred ChhhHHHHHHHHHHHhhccCcCCHHHHHHHH-HHHHhhcceeeeEEEeecccccccccceeeccccccCchhccccccee Q lcl|NC_016071. 93 SKASKDAAEFVEYALKNLANQQTLRDIARSA-ATFNEYGFSIFEKVYRTESAPSKYAGYITIDKIAFRPQSSLSRSKPWV 171 (516) Q Consensus 93 ~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~-lda~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l~~r~q~ti~~~~~f~ 171 (516) .+.+ +.+.+.++. ..|.+++.++ .++..||.| ++++|.-. +|...+..+.|+ .+. .. 
T Consensus 75 ~~d~----~~l~~i~~~----n~~~~~~~~~~~~~~~~G~a-~~~v~~d~------~g~~~i~~~~p~---~~~----~i 132 (441) T protein:vir:80 75 NGDG----YGLDGVYAA----NRLATASCDVHLDALIFGLS-FVAIIPHG------DGTVSVRPQSPK---NCT----GK 132 (441) T ss_pred CCCh----HHHHHHHHh----cCHHHHHHHHHHHHhhcCee-EEEEEeCC------CCceEEEEEccc---eEE----EE Confidence 1111 124444432 2367777665 578999997 56888643 233333322221 110 11 Q ss_pred ecCC-Cceeeecc-cc----c-ccccccccccccc--ccccccccccccCCCccccccccEEEEeecCcCCccccchhHH Q lcl|NC_016071. 172 FDED-GRTLKGIY-QS----K-MAFANFQNGLTQI--SSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPAGVSPLV 242 (516) Q Consensus 172 ~~~d-g~~l~~~~-q~----~-~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~G~gLlr 242 (516) ||++ ++....++ .. . ....-|..+.... ...-+.+.. .......+...-++.|.+..+.+.|+|.|-+- T Consensus 133 ~d~~~~~~~~~~~~~~~~~~~~~~~~vy~~~~~~~~~~~~~~~~~~--~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~l~ 210 (441) T protein:vir:80 133 FSADGSRLDAGLVVQQTCDPEVVEAELLLPDVIVQVERRGSREWVE--VDRIPNVLGAVPLVPIVNRRRTSRIDGRSEIT 210 (441) T ss_pred EeCCCCceeEEEEEEEEecCceEEEEEEecCeEEEEEEcCCcceee--ccccccCCCceeEEEeeccccCCccCCcccch Confidence 2221 11111100 00 0 0000010000000 000000000 01111122333356677788889999998654 Q ss_pred HHHHHHHHH-HHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEeccCcccc Q lcl|NC_016071. 243 GCYRAFREK-ILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILPSDMNAQ 321 (516) Q Consensus 243 ~~~~~~~fK-~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~ 321 (516) ....+.+-. +..+..++...+.+..|..+++|.. .+...... .. 
........+|.+.+.+ T Consensus 211 ~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~G~~---------~~~~~~~~-~~---------~~~~~i~~~~~~~~~~ 271 (441) T protein:vir:80 211 RSIRAYTDEAVRTLLGQSVNRDFYAYPQRWVTGVS---------ADEFSQPG-WV---------LSMASVWAVDKDDDGD 271 (441) T ss_pred hhHHHHHHHHHHHHHHHHHHHHhhcCceeeeecCC---------ccccccch-hh---------hcccccccCCCCCCCC Confidence 332222211 2344456667777888877776531 11111100 00 0112233445443322 Q ss_pred cccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCc-c----chh-hHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016071. 322 GGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDG-Q----GSY-NLSESKQSIHGHFVQRDIDIIVE 395 (516) Q Consensus 322 ~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~-~----GS~-Al~~vh~ev~~~~~~aDa~~i~~ 395 (516) . +++.+.+.+ +.+.++++++.-|-+..-...+....-+ . .|. |+- ....-....++.-.+.+.. T Consensus 272 ~-----~~~~~~~~~----~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~~~~Sg~Al~-~~~~~l~~k~~~~~~~f~~ 341 (441) T protein:vir:80 272 T-----PNVGSFPVN----SPTPYSDQMRLLAQLTAGEAAVPERYFGFITSNPPSGEALA-AEESRLVKRAERRQTSFGQ 341 (441) T ss_pred c-----ceeEecCcc----chHHHHHHHHHHHHHHhcccCCCHHHhccCCCcchHHHHHH-HHHHHHHHHHHHHHHHHHH Confidence 2 233332222 2344556665555443322222211111 0 122 221 1111122222222333444 Q ss_pred HHHHHHHHHHHHhcCCcCCc-c--ccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccc Q lcl|NC_016071. 396 AFNKNLIPQLLALNDIRLSD-E--DMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMS 472 (516) Q Consensus 396 ~ln~~li~~lv~lN~~~~~~-~--~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~ 472 (516) .|. ++++.++.+-+..... . .-..++|....+.++.+.++++.+|+..|....+ .+.+++.+|+++..-+... 
T Consensus 342 ~l~-~~~~l~~~~~~~~~~~~~~~~~i~~~f~~~~~~~~~e~ad~~~kl~~~g~~~~s---~~~~~~~l~~~~~e~~~~~ 417 (441) T protein:vir:80 342 GWL-SVGFLAAKALDSRVDEADFFGDVGLRWRDASTPTRAATADAVTKLVGAGILPAD---SRTVLEMLGLDDVQVEAVM 417 (441) T ss_pred HHH-HHHHHHHHHhcCCCcccccceeeeEEeCCCCCcCHHHHHHHHHHHHhcCccccc---HHHHHHhCCCCHHHHHHHH Confidence 443 3445455542111111 1 1236778889999999999999999999975443 3457788888642211110 Q ss_pred cCcccccCCCCCCcccccccccCCCCCcccc Q lcl|NC_016071. 473 TDELLKLLGQDTSRSGDGMTAGSNGNGTGKI 503 (516) Q Consensus 473 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 503 (516) .. +...++.-+...+....+ +.+- T Consensus 418 --~e-~~e~~~~~~~~~~~~~~~----~~~~ 441 (441) T protein:vir:80 418 --RH-RAESSDPLAVLAGAISRQ----TNEV 441 (441) T ss_pred --HH-HHHHHHHHHHHhhhhhcc----cccC Confidence 00 000000001111111111 1111 No 151 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=95.76 E-value=0.0015 Score=36.07 Aligned_cols=447 Identities=9% Similarity=0.066 Sum_probs=160.3 Q ss_pred CCccccCcc---------cccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhh---ChHHH Q lcl|NC_016071. 1 MSTRFAQPS---------EVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQ---DHTVS 68 (516) Q Consensus 1 ~~~r~~~~~---------~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~---D~~v~ 68 (516) |-+|++.-- +..+.-...|.+ .+...+..+ +..|..+ +.-..|.+..-..-. ....+ -.-+. T Consensus 3 ~~~~ik~~~~~~~~~~~~~~~~~~~~~~~i-~~~~~~~~~--I~~w~~~-Y~g~~~~~~~~~~~~--~~~~~~~~sl~~~ 76 (517) T protein:vir:98 3 VIQRIKNFFKRGGYALSGQTLKSINDHEKI-NIDPNELAR--IERNLRQ-YEGDYPQVEYINSQG--KIQERDYMTLNLR 76 (517) T ss_pred hHHHHHHHHHHHHHHhcccchhHhhcCCce-ecCHHHHHH--HHHHHHH-hcCCCcccccccccc--cccccceeecCcH Confidence 222221100 000000001110 000111212 2334433 222333332100000 00000 00112 Q ss_pred HH-HHHHHHHHhcCCceeeeCCCCC----ChhhHHHHHHHHHHHhhccCcCCHHHHH-HHHHHHHhhcceeeeEEEeecc Q lcl|NC_016071. 
69 TA-LDTKYVFVTKAFNDFKVLYNRD----SKASKDAAEFVEYALKNLANQQTLRDIA-RSAATFNEYGFSIFEKVYRTES 142 (516) Q Consensus 69 s~-l~~Rk~~v~~~~w~i~~~~~~d----~~~~~~~a~~v~~~l~~~~~~~~~~~~l-~~~lda~~~G~S~~Eivw~~~~ 142 (516) .. ..+.-..|..-.-.|.+..... +......++++++.+++-. |...+ ..+.+++..|=.++=+.|.... T Consensus 77 ~~i~~~~A~Ll~~e~~~i~v~d~~~~~~~~~~~~~~~e~l~~i~~~n~----f~~~~~~~~e~a~a~G~~a~k~~~d~~~ 152 (517) T protein:vir:98 77 KLSADVLSGLVFNEQCEVYVSDAKDEEKKDNSFKTAHEFIQHVFQHNK----FIKNLSDYLEPTFALGGLTVRPYVDNGE 152 (517) T ss_pred HHHHHHhhhhhcCCcceEEecccccccccccchhHHHHHHHHHHHhcc----HHHHHHHHHHHHhhhCCEEEEEEEeCCe Confidence 11 1222223333333455542211 1223456788888887543 44444 4456788889888877776432 Q ss_pred cccccccceeeccccccCchhcccccceeecCCCce------------------eeecccccccccc-----c------- Q lcl|NC_016071. 143 APSKYAGYITIDKIAFRPQSSLSRSKPWVFDEDGRT------------------LKGIYQSKMAFAN-----F------- 192 (516) Q Consensus 143 ~~~~~~g~~~~~~l~~r~q~ti~~~~~f~~~~dg~~------------------l~~~~q~~~~~~~-----~------- 192 (516) . .+. +.++..+- +..++.+|.. -+.++-|.+.... + T Consensus 153 ~--------~I~---~v~ad~~~---Pl~~~~~~v~~~ai~~~~~~~~~~~~~~Yt~lE~H~~~~~~~~~~~y~I~n~ly 218 (517) T protein:vir:98 153 I--------EFS---WALANAFY---PLRSNSNGISEGVMKSVTTKVIGNKTVYYTLLEFHEWEKTEEGESLYVITNELY 218 (517) T ss_pred e--------EEE---EEcCCeeE---EEEecCCCeEEEEEEEEEEEeecCCceEEEEEEEEecCceeccCCcEEEEEEEE Confidence 1 111 11111110 0122222211 1111111111100 0 Q ss_pred ccc-ccccccccccccccccCCCccccc---cccEEEEee----cCcCCccccchhHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_016071. 193 QNG-LTQISSAMSLVTNLTSSADEVFIP---INKLMVMSL----GGTESNPAGVSPLVGCYRAFREKILIENLETIGASK 264 (516) Q Consensus 193 ~~~-~~~~~~~~~~~~~~~~~~~~~~iP---~~k~i~~~~----~~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er 264 (516) ..+ ....+.++.+-....+-.+.+.++ .--|.+++. ....++|+|.|.+..|.-..-.-+..+.-|..-++. 
T Consensus 219 ~s~~~~~lG~~v~L~~~~e~l~~~~~~~g~~~Plf~y~~~p~~N~~~~~splG~S~~~~a~~~~d~lD~~~s~~~~e~~~ 298 (517) T protein:vir:98 219 KSDNEGEIGKRIPLEELYEGMQEKTYIQGLSRPLFNYLKPSGFNNINPHSPLGLGITDNSVSTLKKINDTYDQFWWEIKM 298 (517) T ss_pred ecCCCccccccccccccccCCCcceeECCCCcceEEEecCCcccccccCCCCCCchhhhhHHHHHHHHHHHHHHHHHHHh Confidence 000 000011111100011111112222 111223322 223478999999998875444334333333332221 Q ss_pred ccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHH Q lcl|NC_016071. 265 DLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQE 344 (516) Q Consensus 265 ~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~ 344 (516) .. .-+..|...+.......+....... .........+..+..-.+ ++. .+..--...|.+ T Consensus 299 --g~-~~i~vp~~~l~~~~~~~g~~~~~~~----------d~~~~~y~~~~~~~~~~~-----i~~--~~~~iR~e~~~~ 358 (517) T protein:vir:98 299 --GQ-RTVFVSDVMLRTVPDESGMPPPQVF----------DPDVNVYKSIRMGTDEEF-----VKD--VTHDIRTEQYKE 358 (517) T ss_pred --CC-cceecChhhhccccCCCCcccCCCC----------CcccceeeeccCCCCCCc-----eee--eccccchHHHHH Confidence 11 1234455444332221110000000 000000011110000000 110 000001112444 Q ss_pred HHHHHHHHH-HHHHhc-ccccccCCccchhhHHHHHHH-HH-HHHHHHHHHHHHHHHHHHHHHHHHHh-------cCCcC Q lcl|NC_016071. 345 LVNSRKKAI-LDRFGA-GFINLGNDGQGSYNLSESKQS-IH-GHFVQRDIDIIVEAFNKNLIPQLLAL-------NDIRL 413 (516) Q Consensus 345 li~~~d~~I-sk~iLG-qtLts~~~~~GS~Al~~vh~e-v~-~~~~~aDa~~i~~~ln~~li~~lv~l-------N~~~~ 413 (516) .++.+=++| .++-++ +|++.+. .|..-+.++-.+ -+ -.-+.+-.+.+..+| ++|++-++.+ |. .. 
T Consensus 359 ~~~~~L~~i~~~~Gls~~t~~~~~--~~~kTATEi~s~~~~~~~t~~~~~~~~~~aL-~~lv~~i~~l~~~~~~~~~-~~ 434 (517) T protein:vir:98 359 AINQALRTLEMELKLSVGTFSFDG--RSMKTATEIVSENDLTYRTRNDHVYEVEQFI-KGLVISVLELAKTYKLFGG-EI 434 (517) T ss_pred HHHHHHHHHHHHhCCCcccccccc--cccccHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhcCC-CC Confidence 455554555 334345 4455432 232212222221 11 112334445556666 4566665432 21 11 Q ss_pred CccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccccCcccccCCCCC-Cccccccc Q lcl|NC_016071. 414 SDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMSTDELLKLLGQDT-SRSGDGMT 492 (516) Q Consensus 414 ~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~~~~~~~~~~~~-~~~~~~~~ 492 (516) +....+.+.|+..-.+|.++.++.+.+++.+|++.+ +.++.+.||+.+.+-+++.. +.....++.. ........ T Consensus 435 ~~~~~v~v~f~D~i~~D~~~~~~~~~~~v~aG~ms~----~~~i~~~~g~~eeeA~~e~~-~i~~E~~~~~~~~~~~~~~ 509 (517) T protein:vir:98 435 PSAEHIGVDFDDGVFQDRSALLRFYGQAKTFGFIPT----VEAIQRIFKVPKKTAEQWLE-EIRKDQIELDPVTISQRAQ 509 (517) T ss_pred CCCcceEEEcCCCCCCCHHHHHHHHHHHHhcCCCCH----HHHHHHhCCCChHHHHHHHH-HHHHhccccCCCCcccccc Confidence 222336788998888998999999999999998664 67899999997533222221 1111111110 00011111 Q ss_pred ccCCCCCc Q lcl|NC_016071. 493 AGSNGNGT 500 (516) Q Consensus 493 ~~~~~~~~ 500 (516) ..++|.+. T Consensus 510 ~~~~gd~e 517 (517) T protein:vir:98 510 KRMFGDEE 517 (517) T ss_pred CCCCCCCC Confidence 12222222 No 152 >protein:vir:80453 Length: 535 # NCBI annotation: BcepGomrgp05 # Family: family:all:584 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210225;genbank:gi:146329917;genbank:GeneID:5123562 Probab=95.54 E-value=0.0019 Score=35.52 Aligned_cols=456 Identities=12% Similarity=0.047 Sum_probs=175.5 Q ss_pred CCccccCcccccchhhh-cccCCCCc-----ccccc--hHH----HHHHHHHHHhhccc---------ccCC-------c Q lcl|NC_016071. 
1 MSTRFAQPSEVVKAGNE-NLAVSRLR-----TGELG--SGA----LSQLRAESEVMKVE---------ELRW-------P 52 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~-~p~~~~~~-----~~e~g--~~~----~~~~~~~~~~~~~~---------~lr~-------~ 52 (516) |....++.-+-.+.... .|+.||.. ++.+- |+. ...|....+..... .|.. . T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~dV~~~hp~y~a~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~~~~~~ 80 (535) T protein:vir:80 1 MARKRTTIRRDVQSKVLIPPQAPPTSGLGPSLPNVGYQRVEFGEMLPKWRKIMDCLSGQEAIKAKREEYLPMPSVDSRDE 80 (535) T ss_pred CCcchhhhhhhhhhhcccCCCCcCCCCCCCCCCCCCcCCHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCcccCCc Confidence 65433333333333333 44444321 11221 111 22333332221111 0111 1 Q ss_pred ccHHHHHHHh-h---ChHHHHHHHHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccC-cCCHHHHHHHHHH-H Q lcl|NC_016071. 53 CFLATVEAMK-Q---DHTVSTALDTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLAN-QQTLRDIARSAAT-F 126 (516) Q Consensus 53 ~~~~~y~~m~-~---D~~v~s~l~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~-~~~~~~~l~~~ld-a 126 (516) +.-+-|+.-+ + -+++...++.--..|.+.+..++++ +.++.++++... ..+++.++++++. + T Consensus 81 E~~~~Y~~rl~rA~~~n~~~~tl~~l~G~vfrk~p~~~~p------------~~l~~l~~d~D~~G~~L~~f~~~~~~~~ 148 (535) T protein:vir:80 81 EQRRRYETYLQRAIFYNVTARTLDGMMGQVFSRDPIRQLP------------PALEAIVEDIDGEGVSLDQQAKKALGYT 148 (535) T ss_pred CCHHHHHHHHhhccCCChhHHHHHHHhchhhcCCcceecc------------HHHHHHHhccCCCCCCHHHHHHHHHHHH Confidence 2223354433 2 5677777777777777666544322 123344443322 2368889988764 7 Q ss_pred HhhcceeeeEEEeecccc----------cccccceeeccccccCchhcccccceeecCCCc---e-eeecccccc----c Q lcl|NC_016071. 127 NEYGFSIFEKVYRTESAP----------SKYAGYITIDKIAFRPQSSLSRSKPWVFDEDGR---T-LKGIYQSKM----A 188 (516) Q Consensus 127 ~~~G~S~~Eivw~~~~~~----------~~~~g~~~~~~l~~r~q~ti~~~~~f~~~~dg~---~-l~~~~q~~~----~ 188 (516) +.||.+.+=+-|-..++. .+| + +....+..|-. |.++..|. + ++.++.... . 
T Consensus 149 l~~G~~~iLVD~P~~~~~~t~ade~~~~~rP--y-----~~~y~ae~Iin---W~~~~v~G~~~Lt~v~lrE~~~~~dd~ 218 (535) T protein:vir:80 149 MGFGRAAIFTDYPNVGRPVTVLEQKLGLYRP--T-----ITLVHPTSIIN---WRTKLVGGKSVISLVVIQENVLAQDDG 218 (535) T ss_pred HhcCeEEEEEeecCCCCcccHHHHHhcCCCc--E-----EEEechhhccC---ccccccCCccceeEEEEEEEEEecCCC Confidence 789998775555433210 011 1 11111111211 33333221 0 011110000 0 Q ss_pred cc----------------ccccccc----ccccccccccccccCCCccccccccEEEEeecCcCCccccchhHHHHHHHH Q lcl|NC_016071. 189 FA----------------NFQNGLT----QISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPAGVSPLVGCYRAF 248 (516) Q Consensus 189 ~~----------------~~~~~~~----~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~G~gLlr~~~~~~ 248 (516) |. .|....+ .......+...+.....+..++.=-|+++. ....+--.+...|..++..- T Consensus 219 f~~~~~~q~RvL~~~~~G~y~v~~~~~~~~~~~~~~~~~~~~~~~g~~~l~~IPfv~~~-~~~~~~~~~~pPLl~LA~ln 297 (535) T protein:vir:80 219 FETTYVQQWRVLQLNAEGNYQVERWRRETQEEMYYSYSKHVPTDGNGNPFKEIPFQFIG-PLDNNADIDHPPLLDLCEVN 297 (535) T ss_pred cccceeEEEEEEEecCCceEEEEEEEeecCCccccccceeecccCCCcccCeeEEEEee-cCCCCCCCCccchHHHHHHH Confidence 00 0000000 000000000011111112223222344332 22333444555555555432 Q ss_pred HHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEeccCcccccccccce Q lcl|NC_016071. 249 REKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILPSDMNAQGGEQYKM 328 (516) Q Consensus 249 ~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~i 328 (516) +--=....+.-.-+..-+.|+++++++.. ...+. ... -.-+.-|..+++.+|++.+ . 
T Consensus 298 i~Hy~~ssd~~~il~~~~~P~l~i~G~~~---------~~~~~-----~~~-~~~i~iG~~~~~~lP~~~~--------~ 354 (535) T protein:vir:80 298 IGHYRNSADYEEMAFVAGQPTAFFTGLTK---------DWVED-----VFK-DFKVHLGSRAIIPLPQGAT--------A 354 (535) T ss_pred HHHhhchhHHHHHHHHhcCceeeeecCch---------hhhhc-----CCC-CcceEecCcccccCCCCCC--------c Confidence 21111111122222233567777665421 00000 000 0013348888888998865 3 Q ss_pred eeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCc-cchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016071. 329 SLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDG-QGSYNLSESKQSIHGHFVQRDIDIIVEAFNKNLIPQLLA 407 (516) Q Consensus 329 el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~-~GS~Al~~vh~ev~~~~~~aDa~~i~~~ln~~li~~lv~ 407 (516) .+++.++++-. .+.++-...+|.. +.+..+..+..+ +.+.| .........++.+-+..+++.++ ++++++.. T Consensus 355 ~~~e~~~~~~a---~~~l~~~e~qM~~-lGa~ll~~~~~~~Ta~~a--~~~~~~~~S~L~~~a~~le~al~-~aL~~~A~ 427 (535) T protein:vir:80 355 GILQITPNSVP---FEAMTHKESQMIA-MGANLLVKSGGNRTFGEA--QQEEASEQSILSACTKNVSMAFR-KALRWANQ 427 (535) T ss_pred ceeeeccchhH---HHHHHHHHHHHHH-HHHHhhccCcccccHHHH--HHHHHHHhHHHHHHHHHHHHHHH-HHHHHHHH Confidence 44455554432 2345555555544 222333322111 11112 12233335667788899999996 58899988 Q ss_pred hcCCcCCccccceEEe--cCc-CchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCC--cccccCcccccCCC Q lcl|NC_016071. 408 LNDIRLSDEDMPKLKP--GLI-QEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIP--EDMSTDELLKLLGQ 482 (516) Q Consensus 408 lN~~~~~~~~~P~~~~--~~~-~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~--~~~~~~~~~~~~~~ 482 (516) +-+....+ .-+.|.. +.. ..-|. ..++++-++...|.+.. ....+++ ++.|+..+.. +++. 
...+....+ T Consensus 428 w~G~~~~~-~~~~i~~n~dF~~~~ld~-~~~~all~~~~~G~Is~-et~~~~L-~r~gvl~~~~~~eee~-~ri~~E~~~ 502 (535) T protein:vir:80 428 FQTGIVND-ETVEYNLNTDFPAARLTP-NERAELILEWQQGAITF-KEMRAGL-RRAGVASEDDAKAETE-GKATVEFIA 502 (535) T ss_pred HcCCccCC-CceEEEeccccccccCCH-HHHHHHHHHHhcCCCCH-HHHHHHH-HhCCCCCcccchHHHH-HHHHhhhhh Confidence 76422122 2233432 222 22232 33556778888898654 3334454 5567754322 2211 111111111 Q ss_pred CCCcccccccccCCCCCcccccccccchhhhhcC Q lcl|NC_016071. 483 DTSRSGDGMTAGSNGNGTGKISSTRDNSVSNMDN 516 (516) Q Consensus 483 ~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~ 516 (516) ....++.....++. ++++.++.--+.-+|.|- T Consensus 503 ~~~~~g~~~d~~~~--g~~~~~~~~~~~~~~~~~ 534 (535) T protein:vir:80 503 KTAAAGKVGDAASG--GTNKAKLNNGNGGGNQAG 534 (535) T ss_pred ccccCCCCCCCCCC--CCCcCcccCCccccccCC Confidence 11111211111111 111111111122222222 No 153 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=95.50 E-value=0.0019 Score=35.45 Aligned_cols=418 Identities=11% Similarity=0.011 Sum_probs=163.3 Q ss_pred cccchhhhcccCCCCcccccchHHHHHHHHHHHhh--ccc-----------------ccCCcc--cHHHHHH------H- Q lcl|NC_016071. 10 EVVKAGNENLAVSRLRTGELGSGALSQLRAESEVM--KVE-----------------ELRWPC--FLATVEA------M- 61 (516) Q Consensus 10 ~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~--~~~-----------------~lr~~~--~~~~y~~------m- 61 (516) ...-..-+.-.-..+.. | -+..++..- ... .+..+. .+..|.. . T Consensus 1 ~~~~~~~~~~~~~~~~~-e-------~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 72 (474) T protein:vir:10 1 MTLYKLIDDIEAQGILP-K-------HIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLD 72 (474) T ss_pred CchHHHHhhccccCCCH-H-------HHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccc Confidence 11111111100000000 0 011111100 000 000000 0001100 0 Q ss_pred ------hhChHHHHHHHHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHH-HHHHHhhcceee Q lcl|NC_016071. 
62 ------KQDHTVSTALDTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARS-AATFNEYGFSIF 134 (516) Q Consensus 62 ------~~D~~v~s~l~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~-~lda~~~G~S~~ 134 (516) ...+...-++.+....+.+-+..+.+.. +...++++.+++.++++.- .|...+.. ..++.-||.+ + T Consensus 73 ~~~~~ki~~n~~~~ivd~~~~yl~g~pv~~~~~~--~~~~~e~~~~~l~~~~~~n----~~~~~~~~~~~~~~~~G~a-~ 145 (474) T protein:vir:10 73 VSVNNKLNNSFDSEIVDTRVGYLHGVPVTYDLDE--NAEKNEKLKKFITNFAIRN----SVDDEDSEIGKMAAICGYG-A 145 (474) T ss_pred cCcccccccchHHHHHHhHhhheeccceeEeeCC--CCcchHHHHHHHHHHHhhc----CHhHHHHHHHHHHhhcCeE-E Confidence 0133444444545555556665555432 3345567777888877643 25555555 4578889975 5 Q ss_pred eEEEeecccccccccceeeccccccCchhcccccceeecCCCceeeeccccccccccccc--cccccccccccccccccC Q lcl|NC_016071. 135 EKVYRTESAPSKYAGYITIDKIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQN--GLTQISSAMSLVTNLTSS 212 (516) Q Consensus 135 Eivw~~~~~~~~~~g~~~~~~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~--~~~~~~~~~~~~~~~~~~ 212 (516) +++|... +|.+.+..+.|+. +. ..|++.+..+..++-.......... ....+-.....+...... T Consensus 146 ~~~~~d~------~~~~~~~~i~p~~---~~----~v~d~~~~~~~~i~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~ 212 (474) T protein:vir:10 146 RLAYIDT------NGDIRIKNIDPYN---VI----FVGDNILEPTYSLRYFYEKDDDNGTDYVYAEFYDNAYYYVFRGEG 212 (474) T ss_pred EEEEeCC------CCeeEEEEEcccc---eE----EEEcCCCceEEEEEEEEEeeCCCceEEEEEEEEcCceEEEEeecC Confidence 6887543 3444443333321 10 1234444433333211110000000 000000011111111110 Q ss_pred C--------CccccccccEEEEeecCcCCccccchhHHHHHHHHHH-HHHHHHHHHHHHhhccccceeeeeccccccccc Q lcl|NC_016071. 213 A--------DEVFIPINKLMVMSLGGTESNPAGVSPLVGCYRAFRE-KILIENLETIGASKDLGGIIELKIPSQILNKAA 283 (516) Q Consensus 213 ~--------~~~~iP~~k~i~~~~~~~~g~p~G~gLlr~~~~~~~f-K~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~ 283 (516) . ....++.--++.|. 
+|+.|.|.+..+- +.+- =...+...+..++.+..|++++++.-+ T Consensus 213 ~~~~~~~~~~~~~~g~vPvv~~~-----n~~~g~sd~e~v~-~liDa~d~~~S~~~~~~~~~~~~~l~i~g~~~------ 280 (474) T protein:vir:10 213 IDALQEVGRYEHLFDYNPLFGVP-----NNKEMIGDAEKVI-HLIDAYDLTMSDASSEISQTRLAYLVLRGMGM------ 280 (474) T ss_pred CCcccccccccCCCCccceEEec-----CCCCCCCchHHHH-HHHHHHHHHHHHHHHHHHHhhcchhhhccCCC------ Confidence 0 00111111223332 4778999888743 3332 233455666677888888877775311 Q ss_pred CCCCHHHHHHHHHHHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhccccc Q lcl|NC_016071. 284 IDPKSPESEMVQGLMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFIN 363 (516) Q Consensus 284 ~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLt 363 (516) +. ++ ...+. . ..+..+.+.+. .++++..... ...+...++.+.+.|.+.--+..++ T Consensus 281 --~~-~~---~~~~~---~-----~~~i~~~~~~~--------~~~~l~~~~~--~~~~~~~~~~l~~~I~~~s~~p~~~ 336 (474) T protein:vir:10 281 --SE-EM---IQETQ---K-----SGAFELFDKDM--------DVKYLTKDVN--DTMIENHLDRIEKNIMRFAKSVNFN 336 (474) T ss_pred --Cc-hh---hhhhh---h-----cceeEecCCCC--------ceeEEeccCC--HHHHHHHHHHHHHHHHHHhCCcccc Confidence 11 11 11111 0 11222334443 3555544332 2346778899989987754433344 Q ss_pred ccCCccchhhHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHh-cC--CcCCc--cccceEEecCcCchhHHHH Q lcl|NC_016071. 364 LGNDGQGSYNLSESKQSIHGHF----VQRDIDIIVEAFNKNLIPQLLAL-ND--IRLSD--EDMPKLKPGLIQEVDMEGF 434 (516) Q Consensus 364 s~~~~~GS~Al~~vh~ev~~~~----~~aDa~~i~~~ln~~li~~lv~l-N~--~~~~~--~~~P~~~~~~~~~~dl~~~ 434 (516) .+.-+ | +++-+.-...... +..-.+.+...|. ++++.++.+ +. ....+ ..-..+.|...-+.|..+. 
T Consensus 337 ~~~~~-~--n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~-~~~~li~~~l~~~~~~~~~~~~~~i~~~f~~~~p~d~~e~ 412 (474) T protein:vir:10 337 SDEFN-G--NVPIIGMKLKLMALENKCMTFERKMTAMLR-YQFKVILSALKRKGYNLDDDSYLNLIFKFTRNIPVNKLEE 412 (474) T ss_pred ccccc-c--cchHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhhccCCCCccccccceEEeCCCCCCCHHHH Confidence 33221 1 1122222222222 2222334444442 455555543 21 11111 1225678888889999999 Q ss_pred HHHHHHHHhCCcccccHHHHHHHHHHcCC-CCCCCcccccCcccccCCCCCCcccccccccCCCCCcccccccccc Q lcl|NC_016071. 435 SKFVQRIGAVGYLPKTPTVINKILEVGGF-DEEIPEDMSTDELLKLLGQDTSRSGDGMTAGSNGNGTGKISSTRDN 509 (516) Q Consensus 435 a~~~~~L~~~G~~~~~~~~~~~i~e~~Gl-p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 509 (516) ++++.+|. |++. .+.+.+.+++ +.+..+-+-...+.....+.......+. .+.+..+...+ T Consensus 413 a~~~~kl~--g~iS-----~et~~~~l~~v~d~~~E~eri~~E~~e~~~~~~~~~~~~-------~~~~~~~~~s~ 474 (474) T protein:vir:10 413 SQVLINLK--GQVS-----ERTRLGQSQLVDDVDYELDEMEKESLEFNDKLPDIDEGD-------ANDKSQNNQSE 474 (474) T ss_pred HHHHHHHh--ccCc-----hHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhcccccCCC-------cCCCCccccCC Confidence 99999985 6532 3556666654 3221111111111110011000000000 00000000001 No 154 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=95.50 E-value=0.0019 Score=35.45 Aligned_cols=418 Identities=11% Similarity=0.011 Sum_probs=163.3 Q ss_pred cccchhhhcccCCCCcccccchHHHHHHHHHHHhh--ccc-----------------ccCCcc--cHHHHHH------H- Q lcl|NC_016071. 10 EVVKAGNENLAVSRLRTGELGSGALSQLRAESEVM--KVE-----------------ELRWPC--FLATVEA------M- 61 (516) Q Consensus 10 ~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~--~~~-----------------~lr~~~--~~~~y~~------m- 61 (516) ...-..-+.-.-..+.. | -+..++..- ... .+..+. .+..|.. . 
T Consensus 1 ~~~~~~~~~~~~~~~~~-e-------~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 72 (474) T protein:vir:94 1 MTLYKLIDDIEAQGILP-K-------HIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLD 72 (474) T ss_pred CchHHHHhhccccCCCH-H-------HHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccc Confidence 11111111100000000 0 011111100 000 000000 0001100 0 Q ss_pred ------hhChHHHHHHHHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHH-HHHHHhhcceee Q lcl|NC_016071. 62 ------KQDHTVSTALDTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARS-AATFNEYGFSIF 134 (516) Q Consensus 62 ------~~D~~v~s~l~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~-~lda~~~G~S~~ 134 (516) ...+...-++.+....+.+-+..+.+.. +...++++.+++.++++.- .|...+.. ..++.-||.+ + T Consensus 73 ~~~~~ki~~n~~~~ivd~~~~yl~g~pv~~~~~~--~~~~~e~~~~~l~~~~~~n----~~~~~~~~~~~~~~~~G~a-~ 145 (474) T protein:vir:94 73 VSVNNKLNNSFDSEIVDTRVGYLHGVPVTYDLDE--NAEKNEKLKKFITNFAIRN----SVDDEDSEIGKMAAICGYG-A 145 (474) T ss_pred cCcccccccchHHHHHHhHhhheeccceeEeeCC--CCcchHHHHHHHHHHHhhc----CHhHHHHHHHHHHhhcCeE-E Confidence 0133444444545555556665555432 3345567777888877643 25555555 4578889975 5 Q ss_pred eEEEeecccccccccceeeccccccCchhcccccceeecCCCceeeeccccccccccccc--cccccccccccccccccC Q lcl|NC_016071. 135 EKVYRTESAPSKYAGYITIDKIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQN--GLTQISSAMSLVTNLTSS 212 (516) Q Consensus 135 Eivw~~~~~~~~~~g~~~~~~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~--~~~~~~~~~~~~~~~~~~ 212 (516) +++|... +|.+.+..+.|+. +. ..|++.+..+..++-.......... ....+-.....+...... 
T Consensus 146 ~~~~~d~------~~~~~~~~i~p~~---~~----~v~d~~~~~~~~i~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~ 212 (474) T protein:vir:94 146 RLAYIDT------NGDIRIKNIDPYN---VI----FVGDNILEPTYSLRYFYEKDDDNGTDYVYAEFYDNAYYYVFRGEG 212 (474) T ss_pred EEEEeCC------CCeeEEEEEcccc---eE----EEEcCCCceEEEEEEEEEeeCCCceEEEEEEEEcCceEEEEeecC Confidence 6887543 3444443333321 10 1234444433333211110000000 000000011111111110 Q ss_pred C--------CccccccccEEEEeecCcCCccccchhHHHHHHHHHH-HHHHHHHHHHHHhhccccceeeeeccccccccc Q lcl|NC_016071. 213 A--------DEVFIPINKLMVMSLGGTESNPAGVSPLVGCYRAFRE-KILIENLETIGASKDLGGIIELKIPSQILNKAA 283 (516) Q Consensus 213 ~--------~~~~iP~~k~i~~~~~~~~g~p~G~gLlr~~~~~~~f-K~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~ 283 (516) . ....++.--++.|. +|+.|.|.+..+- +.+- =...+...+..++.+..|++++++.-+ T Consensus 213 ~~~~~~~~~~~~~~g~vPvv~~~-----n~~~g~sd~e~v~-~liDa~d~~~S~~~~~~~~~~~~~l~i~g~~~------ 280 (474) T protein:vir:94 213 IDALQEVGRYEHLFDYNPLFGVP-----NNKEMIGDAEKVI-HLIDAYDLTMSDASSEISQTRLAYLVLRGMGM------ 280 (474) T ss_pred CCcccccccccCCCCccceEEec-----CCCCCCCchHHHH-HHHHHHHHHHHHHHHHHHHhhcchhhhccCCC------ Confidence 0 00111111223332 4778999888743 3332 233455666677888888877775311 Q ss_pred CCCCHHHHHHHHHHHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhccccc Q lcl|NC_016071. 284 IDPKSPESEMVQGLMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFIN 363 (516) Q Consensus 284 ~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLt 363 (516) +. ++ ...+. . ..+..+.+.+. .++++..... 
...+...++.+.+.|.+.--+..++ T Consensus 281 --~~-~~---~~~~~---~-----~~~i~~~~~~~--------~~~~l~~~~~--~~~~~~~~~~l~~~I~~~s~~p~~~ 336 (474) T protein:vir:94 281 --SE-EM---IQETQ---K-----SGAFELFDKDM--------DVKYLTKDVN--DTMIENHLDRIEKNIMRFAKSVNFN 336 (474) T ss_pred --Cc-hh---hhhhh---h-----cceeEecCCCC--------ceeEEeccCC--HHHHHHHHHHHHHHHHHHhCCcccc Confidence 11 11 11111 0 11222334443 3555544332 2346778899989987754433344 Q ss_pred ccCCccchhhHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHh-cC--CcCCc--cccceEEecCcCchhHHHH Q lcl|NC_016071. 364 LGNDGQGSYNLSESKQSIHGHF----VQRDIDIIVEAFNKNLIPQLLAL-ND--IRLSD--EDMPKLKPGLIQEVDMEGF 434 (516) Q Consensus 364 s~~~~~GS~Al~~vh~ev~~~~----~~aDa~~i~~~ln~~li~~lv~l-N~--~~~~~--~~~P~~~~~~~~~~dl~~~ 434 (516) .+.-+ | +++-+.-...... +..-.+.+...|. ++++.++.+ +. ....+ ..-..+.|...-+.|..+. T Consensus 337 ~~~~~-~--n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~-~~~~li~~~l~~~~~~~~~~~~~~i~~~f~~~~p~d~~e~ 412 (474) T protein:vir:94 337 SDEFN-G--NVPIIGMKLKLMALENKCMTFERKMTAMLR-YQFKVILSALKRKGYNLDDDSYLNLIFKFTRNIPVNKLEE 412 (474) T ss_pred ccccc-c--cchHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhhccCCCCccccccceEEeCCCCCCCHHHH Confidence 33221 1 1122222222222 2222334444442 455555543 21 11111 1225678888889999999 Q ss_pred HHHHHHHHhCCcccccHHHHHHHHHHcCC-CCCCCcccccCcccccCCCCCCcccccccccCCCCCcccccccccc Q lcl|NC_016071. 435 SKFVQRIGAVGYLPKTPTVINKILEVGGF-DEEIPEDMSTDELLKLLGQDTSRSGDGMTAGSNGNGTGKISSTRDN 509 (516) Q Consensus 435 a~~~~~L~~~G~~~~~~~~~~~i~e~~Gl-p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 509 (516) ++++.+|. |++. .+.+.+.+++ +.+..+-+-...+.....+.......+. 
.+.+..+...+ T Consensus 413 a~~~~kl~--g~iS-----~et~~~~l~~v~d~~~E~eri~~E~~e~~~~~~~~~~~~-------~~~~~~~~~s~ 474 (474) T protein:vir:94 413 SQVLINLK--GQVS-----ERTRLGQSQLVDDVDYELDEMEKESLEFNDKLPDIDEGD-------ANDKSQNNQSE 474 (474) T ss_pred HHHHHHHh--ccCc-----hHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhcccccCCC-------cCCCCccccCC Confidence 99999985 6532 3556666654 3221111111111110011000000000 00000000001 No 155 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=95.49 E-value=0.002 Score=35.42 Aligned_cols=409 Identities=9% Similarity=-0.063 Sum_probs=162.2 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhccccc-CCcccHHHH---HHHhhChHHHHHHHHHHH Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEEL-RWPCFLATV---EAMKQDHTVSTALDTKYV 76 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~l-r~~~~~~~y---~~m~~D~~v~s~l~~Rk~ 76 (516) +++|.++-.+. ..+-. -.++.+ +......-. .+ .......-++.+... T Consensus 6 ~~~~~~r~~~l--------------------------~~yy~-g~~~~~~~~~~~~~~~~~~~k-i~~n~~~~ivd~~~~ 57 (440) T protein:vir:95 6 LGSQKQRLAIL--------------------------ASYAQ-GDNFSILSGHRRLDDEKADYR-VRHKWGGYISSFATG 57 (440) T ss_pred HHHHHHHHHHH--------------------------HHHhc-cCCcccccccccccccCCcce-eecchHHHHHHhhhh Confidence 11111111000 00100 000100 000000000 00 013444555565556 Q ss_pred HHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHH-HHHHHhhcceeeeEEEeecccccccccceeecc Q lcl|NC_016071. 77 FVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARS-AATFNEYGFSIFEKVYRTESAPSKYAGYITIDK 155 (516) Q Consensus 77 ~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~-~lda~~~G~S~~Eivw~~~~~~~~~~g~~~~~~ 155 (516) .+.+-+..+.+. +..+.+..+++.+.|.+- .+...... ..++.-||.+. +++|... +|.+.+.. 
T Consensus 58 ~l~g~~~~~~~~----~~~~~~~~~~l~~~~~~n----~~~~~~~~~~~~~~~~G~a~-~~~~~d~------~~~~~i~~ 122 (440) T protein:vir:95 58 YVIGNPVSIGVM----EGGSADQLSTIKDIEWQN----DINALNSDLAFDASVYGRAY-EYHFRDK------DKVDRVVL 122 (440) T ss_pred heeccCceEeeC----CCccHHHHHHHHHHHHhc----CHhHHHHHHHHHHhhcCeEE-EEEEecC------CCceEEEE Confidence 666666565543 334455667777776543 25555544 44688899975 5666533 23333332 Q ss_pred ccccCchhcccccceeecCC--Cceeeeccccccccccccccccccccccccccccc--------cCCCccccccc--cE Q lcl|NC_016071. 156 IAFRPQSSLSRSKPWVFDED--GRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLT--------SSADEVFIPIN--KL 223 (516) Q Consensus 156 l~~r~q~ti~~~~~f~~~~d--g~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~iP~~--k~ 223 (516) +.|+. +. ..|++. ++.+..++....... .+..+-.+.....+.. ...+..+-|.. -+ T Consensus 123 ~~p~~---~~----~~~d~~~~~~~~~~i~~~~~~~~----~~~~vyt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPv 191 (440) T protein:vir:95 123 ISPLE---MF----VIRDLTVEQNIIAAVHLPIYADK----VNMTVYTKDKVITYKPYSNNSVRLVVDDVKKHSYNDVPV 191 (440) T ss_pred Ecccc---eE----EEEcCCCCCceEEEEEEEEecCc----eEEEEEeCCeEEEEEEecCCccceeecceeeccCceeeE Confidence 22221 11 122222 222222221110000 0000001111110000 00011111222 22 Q ss_pred EEEeecCcCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHH Q lcl|NC_016071. 224 MVMSLGGTESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAAN 303 (516) Q Consensus 224 i~~~~~~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~ 303 (516) +.|+ +|..|.|.+..+-...=-=+..+..++..++.+..|.+++++.+.-..- ..++ ...+... 
T Consensus 192 v~~~-----n~~~g~sd~e~v~~lida~~~~~s~~~~~~~~~~~~~~v~~g~~~~~~~-----~~e~---~~~~~~~--- 255 (440) T protein:vir:95 192 VEWW-----NNRFRMGDYESEISLIDAYDAGQSDTANYMSDLNDAMLLVKGDLDGIKL-----SPED---AAKMKDA--- 255 (440) T ss_pred EEee-----CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcceeeeecccccCCC-----Cccc---hhhhhhc--- Confidence 3333 3667888888755433223445666788888889999998875432111 1111 1111110 Q ss_pred hhcccceEEEeccCccccc-ccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHHHHH Q lcl|NC_016071. 304 AHAGEQAYFILPSDMNAQG-GEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQSIH 382 (516) Q Consensus 304 ~~~g~~a~~iiP~g~~i~~-~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~ev~ 382 (516) ....++.+..... .+...++++..+. ....+...++.+.+.|...--...++.+.-++. ++-+.-+.. T Consensus 256 ------~~~~~~~~~~~~~~~~~~~~~~lt~~~--~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n---~Sg~Al~~~ 324 (440) T protein:vir:95 256 ------NMLFLKTGISTTGQQTTADASYIYKQY--DVNGTEAYKNRLANDIHRFSRIPNLDDDRFNST---SSGIALLYK 324 (440) T ss_pred ------cceecccccccccCCCCcceeEEeecC--CHHHHHHHHHHHHHHHHHHhCCccccccccccc---chHHHHHHH Confidence 0111121111110 1223455554432 223467788888888877655444444322211 122222222 Q ss_pred HHH----HHHHHHHHHHHHHHHHHHHHHHh-cCCcCC--ccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHH Q lcl|NC_016071. 383 GHF----VQRDIDIIVEAFNKNLIPQLLAL-NDIRLS--DEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVIN 455 (516) Q Consensus 383 ~~~----~~aDa~~i~~~ln~~li~~lv~l-N~~~~~--~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~ 455 (516) ... +..-.+.+.+.+. ++++.++.+ +...+. +..-..+.|....+.|..+.++++.+|. 
|++ + .+ T Consensus 325 ~~~l~~k~~~k~~~~~~~l~-~~~~li~~~~~~~~~~~~~~~~v~i~f~~~~p~~~~~~ad~~~kl~--g~i-S----~e 396 (440) T protein:vir:95 325 MIGLEQVRKDKETYFTKALR-RRYELISNIHKAINGPVIEANKLTFTFHPNIPQDVWTEIKAYIEAG--GEI-S----QE 396 (440) T ss_pred HHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhhcCCcccccccceEEeCCCCCCCHHHHHHHHHHHh--ccC-c----HH Confidence 222 2222233444443 344544432 111111 2233678899999999999999999984 553 3 24 Q ss_pred HHHHHcCCCCCCCcccccCcccc-cCCCCCCcccccccccCCCCCccc Q lcl|NC_016071. 456 KILEVGGFDEEIPEDMSTDELLK-LLGQDTSRSGDGMTAGSNGNGTGK 502 (516) Q Consensus 456 ~i~e~~Glp~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~ 502 (516) .+.+.++.-.+..+.+-...+.. ...+.....++ ...+..+++ T Consensus 397 t~~~~l~~~d~~~E~~ri~~E~~~~~~~~~~~~~~----~~~~~~~~e 440 (440) T protein:vir:95 397 TLMENASFTDYKTEHSRILKQGGSSDLEIGQIVGD----ADVGQADTE 440 (440) T ss_pred HHHHhCCCCCcHHHHHHHHHHHHHhhhhHHhhccC----CCCCCcCCC Confidence 45566654322111111111111 01111111111 111112222 No 156 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=95.28 E-value=0.0024 Score=34.97 Aligned_cols=426 Identities=9% Similarity=0.021 Sum_probs=154.3 Q ss_pred ccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhhCh---------------------------H--HHH-HHHHHH Q lcl|NC_016071. 26 TGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQDH---------------------------T--VST-ALDTKY 75 (516) Q Consensus 26 ~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D~---------------------------~--v~s-~l~~Rk 75 (516) |+ |++.--+.|..|-.-+... +-.+++.-|..+.+|. 
| +.+ +..+-- T Consensus 1 ~~-~~~~~~~~i~~w~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~~~~~~~~l~~~i~~~~A 77 (518) T protein:vir:78 1 MG-VWSVMTRFIKGWLNGKPNG--SEPELIPKYLPLVPDNQKEWSKDSYLTSLWAQGYVPTVHDKLMNSGTGNEIVVVAA 77 (518) T ss_pred Cc-chhhHHHHHHHhhcCCCCc--cchhccHHHhhhcccchhhhhhhhhhhhhcccCCCCccccccccCChHHHHHHHHH Confidence 11 1111112222222111110 0111111111111110 1 122 222223 Q ss_pred HHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHH-HHHHHHHHhhcceeeeEEEeecccccccccceeec Q lcl|NC_016071. 76 VFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDI-ARSAATFNEYGFSIFEKVYRTESAPSKYAGYITID 154 (516) Q Consensus 76 ~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~-l~~~lda~~~G~S~~Eivw~~~~~~~~~~g~~~~~ 154 (516) ..|.+-.-.|++. +.+...++...+++++.+++.+ |... ...+..+...|=.++=+.|.. |++.+. T Consensus 78 ~ll~~e~~~i~v~-~~~~~d~e~~~~~l~~il~~n~----f~~~~~~~~e~a~a~G~~~~k~~~d~--------~~~~i~ 144 (518) T protein:vir:78 78 EYISGKPLSIDVT-GVNGSKDENLTKQLKEALRIDN----FDSKSVKIVELAGGSGVSAVKINILN--------GRPSIS 144 (518) T ss_pred HhhcCCCceEEec-CccccCcHHHHHHHHHHHHhcc----HHHHHHHHHHHhhccCceEEEEEEEC--------CeeEEE Confidence 3344544556654 2222234455677887776543 4444 455567888888887666642 222221 Q ss_pred cccccCchhcccccceeecCCCcee-----------------eecccccccccc---------------ccc--cccccc Q lcl|NC_016071. 155 KIAFRPQSSLSRSKPWVFDEDGRTL-----------------KGIYQSKMAFAN---------------FQN--GLTQIS 200 (516) Q Consensus 155 ~l~~r~q~ti~~~~~f~~~~dg~~l-----------------~~~~q~~~~~~~---------------~~~--~~~~~~ 200 (516) .+ ++..+.. ...+|+++ +.+..|...... +.+ +..... T Consensus 145 ~v---~ad~~~P-----~~~~g~~~~~~f~~~~~~~~k~~~y~~lE~he~~~~~~~~~~~~~~~I~n~ly~~~~~~~v~~ 216 (518) T protein:vir:78 145 VH---SSSQFWI-----DFKNNEPFRFNFFEEIPTSNKADIYYLVESREIKQWDKEGKKLSGGFVTYSVIKIDGDKTTPI 216 (518) T ss_pred EE---cCCeeEE-----EeecCcEEEEEEEEEeecCCcceeEEEEEeeccccccceeecccceeEEEEEeeecCcccccc Confidence 11 1111110 00111110 111111000000 000 000000 Q ss_pred c------ccccccccccCCCccccc---cccEEEEeec-----CcCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|NC_016071. 
201 S------AMSLVTNLTSSADEVFIP---INKLMVMSLG-----GTESNPAGVSPLVGCYRAFREKILIENLETIGASKDL 266 (516) Q Consensus 201 ~------~~~~~~~~~~~~~~~~iP---~~k~i~~~~~-----~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g 266 (516) . .+.......+..+...++ +.-|++|... ...++|+|.|.+..|.-..-.=+..+.-|+.-++ . T Consensus 217 ~~~~~~~~l~~~~~~~~~~e~~~~~tg~~~~~~~~~~n~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~-~- 294 (518) T protein:vir:78 217 SAERLPEQITSYLHTNDIQLNHSVSIGLKSMGAYLINNSPSNTRYPHLNLGESDLSQCTNYLFAVDYFFTVYMREGE-K- 294 (518) T ss_pred cccccccccccccccccCccceeeccCCccceEEeeccccccccccCCCcCcchHhhhhHHHHHHHHHHHHHHHHHH-h- Confidence 0 000011111111121121 1224455433 2357899999999886443333333333443333 2 Q ss_pred ccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHH Q lcl|NC_016071. 267 GGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELV 346 (516) Q Consensus 267 ~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li 346 (516) +=+-+++|...+.+...+.+...... +..+.+....+...++....-...++.+... =....|.+.+ T Consensus 295 -g~~~i~v~~~~l~~~~~~~~~~~~~~----------fd~~~~~y~~i~~~~~~~~~~~~~i~~~~~~--Ir~e~~~~~~ 361 (518) T protein:vir:78 295 -TKTKIAASERMFRKKVNKSTDKEEWS----------MNVDEDYFMQFKGTLDAGAKLNDMIQFMQGD--FRDGSYRETM 361 (518) T ss_pred -CCceeeechhHhccCCCCCCCccccc----------cCCCCceEEEecCcCCCCCccccceeeeecc--cChHHHHHHH Confidence 22345666555543332222111000 0112222333322111000000011111110 0112344555 Q ss_pred HHHHHHHHHHH-hc-ccccccCCccchhhHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHh-cCCc------CC- Q lcl|NC_016071. 347 NSRKKAILDRF-GA-GFINLGNDGQGSYNLSESKQSIHG--HFVQRDIDIIVEAFNKNLIPQLLAL-NDIR------LS- 414 (516) Q Consensus 347 ~~~d~~Isk~i-LG-qtLts~~~~~GS~Al~~vh~ev~~--~~~~aDa~~i~~~ln~~li~~lv~l-N~~~------~~- 414 (516) +.+=++|...+ ++ +|+.. +++. ....++..+-.. 
.-+..-...+...| ++|+..++.+ +..+ .+ T Consensus 362 ~~~l~~~~~~~G~s~~tfg~--~~~~-~TATei~s~~~~~~~t~~~~~~~~e~al-~~l~~~i~~l~~~~~~~~~~~~~~ 437 (518) T protein:vir:78 362 EYFAQKAVSKSGYNPATFNL--GNRE-VKATEIWSLQDATVRKIEKKKRLIQNVY-EQMLWDFLYLLTGGTNNKEKAIMR 437 (518) T ss_pred HHHHHHHHHhhCCChhhcCc--cccc-ccHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhhcCccccccCC Confidence 55545554443 22 33322 2211 222233322211 12333334444445 4566665543 1100 11 Q ss_pred ccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHc-CCCCCCCcccccCcccccCCCCCCcccccccc Q lcl|NC_016071. 415 DEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVG-GFDEEIPEDMSTDELLKLLGQDTSRSGDGMTA 493 (516) Q Consensus 415 ~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~-Glp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 493 (516) +..-+.|.|+..-.+|.++.++.+++++.+|++.+ +.++++.+ +..+...+++.. +.....+...++..+... T Consensus 438 ~~~~v~i~f~D~i~~D~~~~~~~~~~~v~aGimS~----e~~i~~~~~~~~deea~~e~~-ri~~E~~~~~~~~p~~~~- 511 (518) T protein:vir:78 438 DEIRVIIEFPDPMSVNLNELSSTLNNMNSALAMSV----EEKVKLIHPKWEDEEIQAEVK-RIYLENAIGEVPDPEAIG- 511 (518) T ss_pred CceeEEEEeCCCCCCCHHHHHHHHHHHHhcCCCCH----HHHHHHhCCCCCHHHHHHHHH-HHHHHhcccCCCCCcccc- Confidence 12237788999899999999999999999998654 45677654 654322111111 100000101111111000 Q ss_pred cCCCCCc Q lcl|NC_016071. 494 GSNGNGT 500 (516) Q Consensus 494 ~~~~~~~ 500 (516) +-...+- T Consensus 512 g~~~~~g 518 (518) T protein:vir:78 512 GMETKGG 518 (518) T ss_pred CCCCCCC Confidence 0000000 No 157 >protein:vir:95149 Length: 501 # NCBI annotation: hypothetical protein ORF007 # Family: family:all:584 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293414;genbank:gi:148912835;genbank:GeneID:5228224 Probab=95.13 E-value=0.0027 Score=34.67 Aligned_cols=426 Identities=12% Similarity=0.024 Sum_probs=162.8 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHH------------hhcccccC----CcccHHHHHHHh-h Q lcl|NC_016071. 
1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESE------------VMKVEELR----WPCFLATVEAMK-Q 63 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~------------~~~~~~lr----~~~~~~~y~~m~-~ 63 (516) |+- | ...+|..-. -...|....+ ....|... ..+.-+-|+.-+ + T Consensus 1 m~~-------V---~~~hp~y~~---------~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~e~~~~e~~~~Y~~rl~r 61 (501) T protein:vir:95 1 MPN-------V---SFIRPELGK---------LLPLYYLIRDAIAGEPTVKGARTTYLPMPNAEDQSKENKARYEAYLKR 61 (501) T ss_pred CCC-------C---CCCCHHHHH---------HHHHHHHHHHHhcChHHHHhcccccCcCCCCCCCcccchHHHHHHhhc Confidence 321 0 011111000 0111211111 11222211 112223455443 2 Q ss_pred ---ChHHHHHHHHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccC-cCCHHHHHHHHHH-HHhhcceeeeEEE Q lcl|NC_016071. 64 ---DHTVSTALDTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLAN-QQTLRDIARSAAT-FNEYGFSIFEKVY 138 (516) Q Consensus 64 ---D~~v~s~l~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~-~~~~~~~l~~~ld-a~~~G~S~~Eivw 138 (516) -+++...++.--..|.+.+..++.+ .. ++.++++... ..+++.++++++. ++.||.+.+=+-| T Consensus 62 A~~~n~~~~t~~~l~G~vf~k~p~~~~p--------~~----l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~ 129 (501) T protein:vir:95 62 AVFYNVARRTLFGLVGQVFMRDPVVKVP--------AL----LNPLVANATGSGINLTQLAKRAVSLNLAYSRAGLLVDY 129 (501) T ss_pred cccCchHHHHHHHHhhhhhcCCcceeCc--------HH----HHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEee Confidence 4666666666666666555444321 12 3333443321 2368889988764 7789998775555 Q ss_pred eeccccc-cc-----ccceeeccccccCchhcccccceeecCCCc----eeeeccccc----ccccccc----------- Q lcl|NC_016071. 139 RTESAPS-KY-----AGYITIDKIAFRPQSSLSRSKPWVFDEDGR----TLKGIYQSK----MAFANFQ----------- 193 (516) Q Consensus 139 ~~~~~~~-~~-----~g~~~~~~l~~r~q~ti~~~~~f~~~~dg~----~l~~~~q~~----~~~~~~~----------- 193 (516) -..++.. .. .+.+. =.+....+..|-. |.++..|. .++.++... ..|.... 
T Consensus 130 P~~~~~~~~t~a~~~~~~~r-Py~~~~~~~~Iin---W~~~~v~g~~~l~~v~l~E~~~~~d~~f~~~~~~q~RvL~~~~ 205 (501) T protein:vir:95 130 PTTEAEGGASIADLEAGRIR-PTLYVYSPTEIIN---WRTTDRGAEEVLSLVVLFETWCAADDGFEMKTSGQFRVLRLDE 205 (501) T ss_pred cCCCCcccccHHHHHhccCC-cEEEEecHhhhcC---cceeccCCceeeeEEEEEEEEeecCCCcccceeEEEEEEeeCC Confidence 3221100 00 00000 0011111222211 33332221 000111000 0000000 Q ss_pred ccc-----c-------------ccccc---cccccccccCCCccccccccEEEEeecCcCCccccchhHHHHHHHHHHHH Q lcl|NC_016071. 194 NGL-----T-------------QISSA---MSLVTNLTSSADEVFIPINKLMVMSLGGTESNPAGVSPLVGCYRAFREKI 252 (516) Q Consensus 194 ~~~-----~-------------~~~~~---~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~G~gLlr~~~~~~~fK~ 252 (516) .|. + ..... ..+.....+...--.|| |+++..... +--.+...|..++.. -.++ T Consensus 206 ~g~~~~~v~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~l~~IP---fv~~~~~~~-~~~~~~pPLl~lA~l-ni~h 280 (501) T protein:vir:95 206 EGYYVHEIWREPQPTKADGSKIPKGNYQQYVVYKPTDAQGKRLTEIP---FMFIGSENN-DSNPDNPNFYDLASL-NMAH 280 (501) T ss_pred CceEEEEEEEecCCcccCcceecCCcccccceeeeeccCCCcCCeee---EEEEecCCC-CCCCCccchHHHHHH-HHHH Confidence 000 0 00000 00000001111111233 444322222 222233334344422 1122 Q ss_pred HHH-HHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEeccCcccccccccceeee Q lcl|NC_016071. 253 LIE-NLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLK 331 (516) Q Consensus 253 ~~~-~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~ 331 (516) +-. .+.-.-+-.-+.|+++++++-.- ... .... ..+.-|..++..+|.|.+ ..|+ T Consensus 281 y~~ssd~~~~l~~~~~P~l~i~G~~~~-----------~~~---~~~~--~~i~~G~~~~~~lP~~~~--------~~~i 336 (501) T protein:vir:95 281 YRNSADYEESCYIVGQPTPVLIGLTEE-----------WVT---NVLK--GSVNFGSRGGIPLPVGAD--------AKLL 336 (501) T ss_pred HhhhhHHHHHHHHcccceeeeeCCccc-----------ccc---cCCC--CceeecccccccCCCCCc--------eeEE Confidence 111 11222222345677766653210 000 0000 123448888899998864 5666 Q ss_pred eccccCcchhHHHHHHHHHHHHHHHHhcccccccCC-ccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_016071. 
332 GIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGND-GQGSYNLSESKQSIHGHFVQRDIDIIVEAFNKNLIPQLLALND 410 (516) Q Consensus 332 ~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~-~~GS~Al~~vh~ev~~~~~~aDa~~i~~~ln~~li~~lv~lN~ 410 (516) +.+|++ -.++.++....+|..+ .+..++...+ .+++. ......-...++.+-+..+++.++ +++++++.+-+ T Consensus 337 e~~~~~---i~~~~l~~l~~~m~~~-Ga~ll~~~~~~~Ta~~--~~~~~~~~~S~L~~~a~~le~al~-~~l~~~a~w~g 409 (501) T protein:vir:95 337 QASENT---MLKEAMDTKERQMVAL-GAKLVEQKEVQRTATE--AELEAASEGSTLSSATKNVSAAFE-WALKWAARWVG 409 (501) T ss_pred ecChhh---HHHHHHHHHHHHHHHH-HHhhccCCccchhHHH--HHHHHHHHhHHHHHHHHHHHHHHH-HHHHHHHHHcC Confidence 665543 2356677777777553 2343432211 12222 222334445568888889999996 58899988763 Q ss_pred CcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccccCcccc---cCCCCCCcc Q lcl|NC_016071. 411 IRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMSTDELLK---LLGQDTSRS 487 (516) Q Consensus 411 ~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~~~~~~---~~~~~~~~~ 487 (516) .. +...-.++.-+......-...++++.++...|.+.. .....++ ++.|++.+..+++....... ..+.+.... T Consensus 410 ~~-~~~~~v~i~~df~~~~~~~~~~~al~~~~~~G~is~-~t~~~~L-~~~~v~~~~~~~e~e~i~~~~~~~~~~~~~~~ 486 (501) T protein:vir:95 410 QA-DSGVKFELNTDFDIARMTPDERRSLVEEWQKGAITF-EEMRTGL-RKAGVATEDDSKAKEKIAKDTAEAMALATPAN 486 (501) T ss_pred CC-CCceEEEEecccccccCCHHHHHHHHHHHhCCCCcH-HHHHHHH-HhCCCCChhHHHHHHHHHhhhcCcccccccCC Confidence 21 122112233233332222445678888899998665 3344555 44688865433221110000 000000111 Q ss_pred cccccccCCCCCcccccccccchhhhhcC Q lcl|NC_016071. 488 GDGMTAGSNGNGTGKISSTRDNSVSNMDN 516 (516) Q Consensus 488 ~~~~~~~~~~~~~~~~~~~~d~~~~~~~~ 516 (516) +++... 
+.|| ..| T Consensus 487 ~~~~~~------------gg~~----~~~ 499 (501) T protein:vir:95 487 VPGDGS------------GGDN----VGN 499 (501) T ss_pred CCCCCc------------cccc----ccC Confidence 111111 1111 111 No 158 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=95.06 E-value=0.0029 Score=34.53 Aligned_cols=425 Identities=12% Similarity=0.033 Sum_probs=142.7 Q ss_pred CCccccCc--ccccchhhhcccCCCCcccccchHHHHHHHHHHHhhc-cccc--CCcccHHHHHHHhhChHHHHHHHHHH Q lcl|NC_016071. 1 MSTRFAQP--SEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMK-VEEL--RWPCFLATVEAMKQDHTVSTALDTKY 75 (516) Q Consensus 1 ~~~r~~~~--~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~-~~~l--r~~~~~~~y~~m~~D~~v~s~l~~Rk 75 (516) |+..+..- ......-... .+ -+.....+-...+ .+++ ..+..++-...........-++...- T Consensus 1 ~~~~t~~~~~~~l~~~~~~~--~~----------r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~ 68 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRIDDG--MS----------RVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVA 68 (456) T ss_pred CCCCCHHHHHHHHHHHHHHH--HH----------HHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHH Confidence 33322100 0000000000 00 0111111111111 0011 01111111111112233333444433 Q ss_pred HHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHH-HHHHhhcceeeeEEEeecccccccccceeec Q lcl|NC_016071. 76 VFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSA-ATFNEYGFSIFEKVYRTESAPSKYAGYITID 154 (516) Q Consensus 76 ~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~-lda~~~G~S~~Eivw~~~~~~~~~~g~~~~~ 154 (516) ..+..-.+.+ ....+...+.. +.+.|++ ..+..+..++ .++.-||.+ ++++|.-..+ ...+. 
T Consensus 69 ~~l~~~~~~~--~~~~d~~~~~~----~~~i~~~----N~~d~~~~~~~~~a~i~G~a-y~~v~~d~~g------~~~i~ 131 (456) T protein:vir:10 69 DRIIPNGITV--GGSADSDLALR----ARRIWRD----NRMDSVCKQWVKYGLDFGES-YLTCWRRDDG------TATIT 131 (456) T ss_pred hhhccCCeec--CCCCCcchHHH----HHHHHHh----cChhhHHHHHHHHHhhcCee-EEEEeeCCCC------ceEEE Confidence 3344444433 22222222223 3344432 1355666655 578889997 5799975533 22222 Q ss_pred cccccCchhccc---------ccceeecCCCceeeeccccccccccc-ccc--cccc-cccc----ccccccccCCCccc Q lcl|NC_016071. 155 KIAFRPQSSLSR---------SKPWVFDEDGRTLKGIYQSKMAFANF-QNG--LTQI-SSAM----SLVTNLTSSADEVF 217 (516) Q Consensus 155 ~l~~r~q~ti~~---------~~~f~~~~dg~~l~~~~q~~~~~~~~-~~~--~~~~-~~~~----~~~~~~~~~~~~~~ 217 (516) .+.|..-..+.+ ..++..+.|+.....+.........+ ... +... .... +............. T Consensus 132 ~~~p~~~~~i~d~~~~~~~~~~i~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 211 (456) T protein:vir:10 132 ADSPETMVVSVDPLQPWRIRAAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGS 211 (456) T ss_pred EEccceeEEEEcCCCCcceEEEEEEEEecCCceeEEEEEeccceeEEEEEEEEeecccceeeeecCCceeeccccCCCCC Confidence 222111000000 00011122222111110000000000 000 0000 0000 00000000000001 Q ss_pred cccccEEEEeecCcCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCC-HHHHHHHHH Q lcl|NC_016071. 218 IPINKLMVMSLGGTESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPK-SPESEMVQG 296 (516) Q Consensus 218 iP~~k~i~~~~~~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~-~~~~~~l~~ 296 (516) +|+ +++ ..|+.|.|.+..+--..=--+..+...+...+-+..+..++.|... ..+. +.+...+.. 
T Consensus 212 ~~p---vv~-----~~N~~g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~------~~~~~d~~g~~~~~ 277 (456) T protein:vir:10 212 PPP---VVV-----YQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSTEH------GLPNVDENGNAIDY 277 (456) T ss_pred cee---EEE-----ecCCCCCchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCc------ccccccccccccch Confidence 111 111 2578888888875431111122222233344444444444443211 1100 000011111 Q ss_pred HHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCC-ccchh-hH Q lcl|NC_016071. 297 LMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGND-GQGSY-NL 374 (516) Q Consensus 297 l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~-~~GS~-Al 374 (516) . ..+.++......+|.+.++. .+ +. .....|...++.+-.+|+..---.....+.. +..|. |+ T Consensus 278 ~----~~~~~~~~~~~~~~~~~~~~--------q~--~~-~~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~N~Sg~Ai 342 (456) T protein:vir:10 278 A----SIFEAAPGALWELPPGVDIW--------ES--QA-NDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGA 342 (456) T ss_pred h----hhhhhhccccccCCCCcceE--------Ee--cc-cChhHHHHHHHHHHHHHHhccCCChHHhcccccChHHHHH Confidence 1 01111222233356554321 11 11 1122355555555555554222111111111 11121 22 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHH Q lcl|NC_016071. 375 SESKQSIHGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVI 454 (516) Q Consensus 375 ~~vh~ev~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~ 454 (516) . ....-....++.-.+.+...|. ++++.++.+++. ....-.++.|....+.++.+.|+++.+|+.+|+.. . 
T Consensus 343 ~-~~~~~l~~k~~~~~~~f~~~l~-~~~rl~~~~~g~--~~~~~~~v~w~~~~~~~~~~~ada~~kl~~~gi~~-----~ 413 (456) T protein:vir:10 343 H-NIEKGFLFKCEDRLSIAKIGLE-AILVKALQIEGE--SVEDTVDVSFESPDRVTLGEKYSAASLAKAAGESW-----A 413 (456) T ss_pred H-HHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHhcCC--CcccceeEEecCCCCcCHHHHHHHHHHHHHcCCCh-----H Confidence 2 1222223333334445555664 466766677642 22223477888888899999999999999999733 2 Q ss_pred HHHHHHcCCCCCCCcccccCc---ccccC-CCCC-CcccccccccCCCCCcccccccccchhhh Q lcl|NC_016071. 455 NKILEVGGFDEEIPEDMSTDE---LLKLL-GQDT-SRSGDGMTAGSNGNGTGKISSTRDNSVSN 513 (516) Q Consensus 455 ~~i~e~~Glp~~~~~~~~~~~---~~~~~-~~~~-~~~~~~~~~~~~~~~~~~~~~~~d~~~~~ 513 (516) .-+++.+|+.+..-+.+..+. +.... ++.. .+..++. + T Consensus 414 ~~~~~~lg~~~~~i~~~e~er~~~e~~~~~~~~~~~~~~~~~---------------------~ 456 (456) T protein:vir:10 414 SIRRNILNYNADQIKQDDLDRAREQITLFAGNPVQRPQEDGS---------------------R 456 (456) T ss_pred HHHHhhCCCCHHHHHHHHHHHHHHHHHHHhhhhhhcCCCCCC---------------------C Confidence 345677888532111001110 00000 0000 0100100 0 No 159 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=95.06 E-value=0.0029 Score=34.53 Aligned_cols=425 Identities=12% Similarity=0.033 Sum_probs=142.7 Q ss_pred CCccccCc--ccccchhhhcccCCCCcccccchHHHHHHHHHHHhhc-cccc--CCcccHHHHHHHhhChHHHHHHHHHH Q lcl|NC_016071. 1 MSTRFAQP--SEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMK-VEEL--RWPCFLATVEAMKQDHTVSTALDTKY 75 (516) Q Consensus 1 ~~~r~~~~--~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~-~~~l--r~~~~~~~y~~m~~D~~v~s~l~~Rk 75 (516) |+..+..- ......-... 
.+ -+.....+-...+ .+++ ..+..++-...........-++...- T Consensus 1 ~~~~t~~~~~~~l~~~~~~~--~~----------r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~ 68 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRIDDG--MS----------RVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVA 68 (456) T ss_pred CCCCCHHHHHHHHHHHHHHH--HH----------HHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHH Confidence 33322100 0000000000 00 0111111111111 0011 01111111111112233333444433 Q ss_pred HHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHH-HHHHhhcceeeeEEEeecccccccccceeec Q lcl|NC_016071. 76 VFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSA-ATFNEYGFSIFEKVYRTESAPSKYAGYITID 154 (516) Q Consensus 76 ~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~-lda~~~G~S~~Eivw~~~~~~~~~~g~~~~~ 154 (516) ..+..-.+.+ ....+...+.. +.+.|++ ..+..+..++ .++.-||.+ ++++|.-..+ ...+. T Consensus 69 ~~l~~~~~~~--~~~~d~~~~~~----~~~i~~~----N~~d~~~~~~~~~a~i~G~a-y~~v~~d~~g------~~~i~ 131 (456) T protein:vir:10 69 DRIIPNGITV--GGSADSDLALR----ARRIWRD----NRMDSVCKQWVKYGLDFGES-YLTCWRRDDG------TATIT 131 (456) T ss_pred hhhccCCeec--CCCCCcchHHH----HHHHHHh----cChhhHHHHHHHHHhhcCee-EEEEeeCCCC------ceEEE Confidence 3344444433 22222222223 3344432 1355666655 578889997 5799975533 22222 Q ss_pred cccccCchhccc---------ccceeecCCCceeeeccccccccccc-ccc--cccc-cccc----ccccccccCCCccc Q lcl|NC_016071. 155 KIAFRPQSSLSR---------SKPWVFDEDGRTLKGIYQSKMAFANF-QNG--LTQI-SSAM----SLVTNLTSSADEVF 217 (516) Q Consensus 155 ~l~~r~q~ti~~---------~~~f~~~~dg~~l~~~~q~~~~~~~~-~~~--~~~~-~~~~----~~~~~~~~~~~~~~ 217 (516) .+.|..-..+.+ ..++..+.|+.....+.........+ ... +... .... +............. 
T Consensus 132 ~~~p~~~~~i~d~~~~~~~~~~i~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 211 (456) T protein:vir:10 132 ADSPETMVVSVDPLQPWRIRAAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGS 211 (456) T ss_pred EEccceeEEEEcCCCCcceEEEEEEEEecCCceeEEEEEeccceeEEEEEEEEeecccceeeeecCCceeeccccCCCCC Confidence 222111000000 00011122222111110000000000 000 0000 0000 00000000000001 Q ss_pred cccccEEEEeecCcCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCC-HHHHHHHHH Q lcl|NC_016071. 218 IPINKLMVMSLGGTESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPK-SPESEMVQG 296 (516) Q Consensus 218 iP~~k~i~~~~~~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~-~~~~~~l~~ 296 (516) +|+ +++ ..|+.|.|.+..+--..=--+..+...+...+-+..+..++.|... ..+. +.+...+.. T Consensus 212 ~~p---vv~-----~~N~~g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~------~~~~~d~~g~~~~~ 277 (456) T protein:vir:10 212 PPP---VVV-----YQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSTEH------GLPNVDENGNAIDY 277 (456) T ss_pred cee---EEE-----ecCCCCCchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCc------ccccccccccccch Confidence 111 111 2578888888875431111122222233344444444444443211 1100 000011111 Q ss_pred HHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCC-ccchh-hH Q lcl|NC_016071. 297 LMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGND-GQGSY-NL 374 (516) Q Consensus 297 l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~-~~GS~-Al 374 (516) . ..+.++......+|.+.++. .+ +. .....|...++.+-.+|+..---.....+.. +..|. 
|+ T Consensus 278 ~----~~~~~~~~~~~~~~~~~~~~--------q~--~~-~~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~N~Sg~Ai 342 (456) T protein:vir:10 278 A----SIFEAAPGALWELPPGVDIW--------ES--QA-NDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGA 342 (456) T ss_pred h----hhhhhhccccccCCCCcceE--------Ee--cc-cChhHHHHHHHHHHHHHHhccCCChHHhcccccChHHHHH Confidence 1 01111222233356554321 11 11 1122355555555555554222111111111 11121 22 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHH Q lcl|NC_016071. 375 SESKQSIHGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVI 454 (516) Q Consensus 375 ~~vh~ev~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~ 454 (516) . ....-....++.-.+.+...|. ++++.++.+++. ....-.++.|....+.++.+.|+++.+|+.+|+.. . T Consensus 343 ~-~~~~~l~~k~~~~~~~f~~~l~-~~~rl~~~~~g~--~~~~~~~v~w~~~~~~~~~~~ada~~kl~~~gi~~-----~ 413 (456) T protein:vir:10 343 H-NIEKGFLFKCEDRLSIAKIGLE-AILVKALQIEGE--SVEDTVDVSFESPDRVTLGEKYSAASLAKAAGESW-----A 413 (456) T ss_pred H-HHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHhcCC--CcccceeEEecCCCCcCHHHHHHHHHHHHHcCCCh-----H Confidence 2 1222223333334445555664 466766677642 22223477888888899999999999999999733 2 Q ss_pred HHHHHHcCCCCCCCcccccCc---ccccC-CCCC-CcccccccccCCCCCcccccccccchhhh Q lcl|NC_016071. 455 NKILEVGGFDEEIPEDMSTDE---LLKLL-GQDT-SRSGDGMTAGSNGNGTGKISSTRDNSVSN 513 (516) Q Consensus 455 ~~i~e~~Glp~~~~~~~~~~~---~~~~~-~~~~-~~~~~~~~~~~~~~~~~~~~~~~d~~~~~ 513 (516) .-+++.+|+.+..-+.+..+. +.... ++.. .+..++. 
+ T Consensus 414 ~~~~~~lg~~~~~i~~~e~er~~~e~~~~~~~~~~~~~~~~~---------------------~ 456 (456) T protein:vir:10 414 SIRRNILNYNADQIKQDDLDRAREQITLFAGNPVQRPQEDGS---------------------R 456 (456) T ss_pred HHHHhhCCCCHHHHHHHHHHHHHHHHHHHhhhhhhcCCCCCC---------------------C Confidence 345677888532111001110 00000 0000 0100100 0 No 160 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=94.99 E-value=0.003 Score=34.40 Aligned_cols=435 Identities=11% Similarity=-0.037 Sum_probs=155.1 Q ss_pred cccCCCCcccccchHH----------HHHHHHHHHhhcccccCCcccHHHHHHHh----hChHHHHHHHHHH--HHHhcC Q lcl|NC_016071. 18 NLAVSRLRTGELGSGA----------LSQLRAESEVMKVEELRWPCFLATVEAMK----QDHTVSTALDTKY--VFVTKA 81 (516) Q Consensus 18 ~p~~~~~~~~e~g~~~----------~~~~~~~~~~~~~~~lr~~~~~~~y~~m~----~D~~v~s~l~~Rk--~~v~~~ 81 (516) .+....+...++=..- +.....+-.. .++++. ....+.+++. ......-++.+.- +.+.+ T Consensus 1 ~~~~~~~d~~~~i~~L~~~~~~~~~r~~~~~~Yy~g--~~~i~~-~~~~~~~~~~~~~~~~n~~~~ivd~~a~~l~~~G- 76 (488) T protein:vir:23 1 MAETESIDPEKLRDQLLDAFENKQNELKSSKAYYDA--ERRPDA-IGLAVPLDMRKYLAHVGYPRTYVDAIAERQELEG- 76 (488) T ss_pred CCcccCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhc--ccchhh-cCcccchhhhhhhhhcchHHHHHHHHHHhhhccc- Confidence 1111111111100000 0111111110 011110 0011112221 1111111112111 11111 Q ss_pred CceeeeC--CCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHH-HHHHHhhcceeeeEEEeeccccccc--ccceeeccc Q lcl|NC_016071. 82 FNDFKVL--YNRDSKASKDAAEFVEYALKNLANQQTLRDIARS-AATFNEYGFSIFEKVYRTESAPSKY--AGYITIDKI 156 (516) Q Consensus 82 ~w~i~~~--~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~-~lda~~~G~S~~Eivw~~~~~~~~~--~g~~~~~~l 156 (516) +.+-.+ .......+.+..+.+.+.|..- .|.....+ ..+++-||.| +++||......... 
++...++ T Consensus 77 -f~~~~~~~~~~~~~~d~~~~~~l~~i~~~N----~~~~~~~~~~~~a~i~G~a-~~~v~~~~~~~~~~~~~~~~~i~-- 148 (488) T protein:vir:23 77 -FRIPSANGEEPESGGENDPASELWDWWQAN----NLDIEATLGHTDALIYGTA-YITISMPDPEVDFDVDPEVPLIR-- 148 (488) T ss_pred -eeccCCcccccccccchhHHHHHHHHHHhc----ChhHHHHHHHHHHhhcCce-EEEEecCCcccccCCCCCcceEE-- Confidence 211000 0011123345556666666532 36666665 4478889997 67888654322111 1111111 Q ss_pred cccCchhccc------------ccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEE Q lcl|NC_016071. 157 AFRPQSSLSR------------SKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLM 224 (516) Q Consensus 157 ~~r~q~ti~~------------~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i 224 (516) +.++..+.. .+.+ ++.++.....+.-....... .+...... +.. .......++..-++ T Consensus 149 -~~~p~~~~~~~d~~~~~~~~~~~~~-~~~~~~~~~~~~~y~~~~~~---~~~~~~~~--~~~---~~~~~h~~g~vPvv 218 (488) T protein:vir:23 149 -VEPPTALYAEVDPRTRKVLYAIRAI-YGADGNEIVSATLYLPDTTM---TWLRAEGE--WEA---PTSTPHGLEMVPVI 218 (488) T ss_pred -EeccceeEEEEecCCCceEEEEEEE-EecCCCcEEEEEEEecCcEE---EEEecCCc--eEe---ccccccCCCCcceE Confidence 222221110 0011 12222211111000000000 00000000 000 00112233334457 Q ss_pred EEeecCcCCccccchhHHHHHHHHHHH-HHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHH Q lcl|NC_016071. 225 VMSLGGTESNPAGVSPLVGCYRAFREK-ILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAAN 303 (516) Q Consensus 225 ~~~~~~~~g~p~G~gLlr~~~~~~~fK-~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~ 303 (516) .|+++.+.+.|+|.|-+.......+-. +..+...+..++-+..|..+++|.. ..+-...+... .... 
T Consensus 219 ~f~n~~~~~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~------~~~~~~~~~~~-~~~~----- 286 (488) T protein:vir:23 219 PISNRTRLSDLYGTSEISPELRSVTDAAAQILMNMQGTANLMAIPQRLIFGAK------PEELGINAETG-QRMF----- 286 (488) T ss_pred EeccccccCCcCCccchhhhHHHHHHHHHHHHHHHHHHHHHhhhHHHHHhCCC------ccccccccccc-chhh----- Confidence 788888889999998775433222211 2233445555666666666555421 00000000000 0011 Q ss_pred hhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhccccccc------CCccchh-hHHH Q lcl|NC_016071. 304 AHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLG------NDGQGSY-NLSE 376 (516) Q Consensus 304 ~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~------~~~~GS~-Al~~ 376 (516) .++..+..++|.|-+.. +...++++ .+.++++++.-|-+.. +.+-+.. ..+.+|- |+-. T Consensus 287 -~~~~~~v~~~~~g~~~~--------~~q~~~~~----~~~~~~~l~~~i~~~~-~~~~~p~~~~g~~~~n~~Sg~Al~~ 352 (488) T protein:vir:23 287 -DAYMARILAFEGGEGAH--------AEQFSAAE----LRNFVDALDALDRKAA-SYSGLPPQYLSSSSDNPASAEAIKA 352 (488) T ss_pred -hhhhhhhccCCCCCCce--------eEecCCCC----hHHHHHHHHHHHHHHh-cccCCCHHHhccccCcchHHHHHHH Confidence 11223344566664333 33333221 3345555554443322 1111111 0111222 2222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCc-CCcc-ccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHH Q lcl|NC_016071. 377 SKQSIHGHFVQRDIDIIVEAFNKNLIPQLLALNDIR-LSDE-DMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVI 454 (516) Q Consensus 377 vh~ev~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~-~~~~-~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~ 454 (516) ...-....++.-.+.+...|. ++++.++.+.... .+.. .--.+.|....+.++.+.++++.+|++.|..+.. . 
T Consensus 353 -~~~~l~~k~~~~~~~f~~~l~-~~~~l~~~~~~~~~~~~~~~~i~v~f~~~~~~s~~~~ada~~kl~~~g~~~~s---~ 427 (488) T protein:vir:23 353 -AESRLVKKVERKNKIFGGAWE-QAMRLAYKMVKGGDIPTEYYRMETVWRDPSTPTYAAKADAAAKLFANGAGLIP---R 427 (488) T ss_pred -HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhcCCCcchhhccceEEecCCCCCCHHHHHHHHHHHHhcccccCC---H Confidence 222222233334444555663 4666666553221 1111 1235678888888999999999999998842221 3 Q ss_pred HHHHHHcCCCCCCCccccc----C------cccccCCCCCCcccccccccCCCCCccccccc Q lcl|NC_016071. 455 NKILEVGGFDEEIPEDMST----D------ELLKLLGQDTSRSGDGMTAGSNGNGTGKISST 506 (516) Q Consensus 455 ~~i~e~~Glp~~~~~~~~~----~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 506 (516) +.+++.+|+-....++... . ...........+...+.. ...+..+++.++| T Consensus 428 et~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~e~~~a 488 (488) T protein:vir:23 428 ERGWVDMGYTIVEREQMRQWLEQDQKQGLGLIGSLYGASTPEGKPGEA-PVGEPPAPEPDAA 488 (488) T ss_pred HHHHHhCCCCchHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccCCCC-CCCCCCCCCCCCC Confidence 5688888874322111100 0 001111111112111111 1112222222222 No 161 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=94.98 E-value=0.003 Score=34.39 Aligned_cols=371 Identities=11% Similarity=-0.001 Sum_probs=134.9 Q ss_pred CcccccchHHHHHHHHHHHhhcccccCCcccHHHHH--------------HHhhChHHHHHHHHHHHHHhcCCceeeeCC Q lcl|NC_016071. 24 LRTGELGSGALSQLRAESEVMKVEELRWPCFLATVE--------------AMKQDHTVSTALDTKYVFVTKAFNDFKVLY 89 (516) Q Consensus 24 ~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~--------------~m~~D~~v~s~l~~Rk~~v~~~~w~i~~~~ 89 (516) |...-++. ..........|.-...+.|+ ++..+ +..++.==+..|..+.=++.+.. 
T Consensus 1 ~~~~~i~~--------L~~~~~~~~~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~--~~~v~nw~~~iVds~a~rl~~~G 70 (409) T protein:vir:94 1 MTEKGIGY--------LRFKLSVHKRRAEMRYDQYAMKYVDRFKGITIPQALSQQ--YRSILGWCAKGVDSLADRLVFRE 70 (409) T ss_pred CCHHHHHH--------HHHHHHHHhHHHHHHHHHhcccCchhhcChhhhHHHHHH--HhhhcchhHHHHHHhHhhcccCc Confidence 22222221 11111111001111111121 11111 11111111222222211222221 Q ss_pred CCCChhhHHHHHHHHHHHhhccCcCCHHHHHHH-HHHHHhhcceeeeEEEeecccccccccceeeccccccCchhccc-- Q lcl|NC_016071. 90 NRDSKASKDAAEFVEYALKNLANQQTLRDIARS-AATFNEYGFSIFEKVYRTESAPSKYAGYITIDKIAFRPQSSLSR-- 166 (516) Q Consensus 90 ~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~-~lda~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l~~r~q~ti~~-- 166 (516) - . .++.+ +.++|+. ..|.....+ ..+|+-||.|+ ..||.-.. |...+.-+.|+.-..|.+ T Consensus 71 f-~-~~d~~----l~~i~~~----N~ld~~~~~~~~~aliyG~sf-~~v~~~~d------g~~~i~~~sp~~~~~i~D~~ 133 (409) T protein:vir:94 71 F-E-NDDFT----VNEIFEE----NNPDIFFDSAVLSSLIASCSF-TYISKGEN------DAVRLQVIEAVNATGIIDPI 133 (409) T ss_pred c-c-CCchH----HHHHHHh----cChhHHHHHHHHHHHHhccee-EEEecCCC------CceEEEEeccceEEEEEecC Confidence 1 1 11111 3444432 235555544 44789999975 47886432 322222111111000000 Q ss_pred -------ccceeecCCCceeeecccccccccccccccccc-ccccccccccccCCCccccccccEEEEeecCcCCccccc Q lcl|NC_016071. 167 -------SKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQI-SSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPAGV 238 (516) Q Consensus 167 -------~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~G~ 238 (516) .+.+.-+.++..+... -+..+.+.. ....+...... + ..+..-++.|.++.+-+.|+|. 
T Consensus 134 ~~~~~~a~~~~~~d~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~-n----~~g~vPvV~f~n~~~~~~~~G~ 200 (409) T protein:vir:94 134 TGLLTEGYAVLERDENNNVVLEA--------HFLPDRTDYYYRDSRNNISIA-N----PTGHPLLVPIIHRPDAVRPFGR 200 (409) T ss_pred CCceeeeEEEEEecCCCceEEEE--------EEecCcEEEEEecCceeEeee-C----CCCCcceEEeccccccccccCc Confidence 0111112222111100 000000000 00000000111 1 1223346777888888899998 Q ss_pred hhH-HHHHH--HHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEec Q lcl|NC_016071. 239 SPL-VGCYR--AFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILP 315 (516) Q Consensus 239 gLl-r~~~~--~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP 315 (516) |-+ +.+-. --+-|. +-.-+...|=+..|-.++.|. .. +... .... ...+ ..-..+| T Consensus 201 s~I~e~v~~l~da~~r~--~~~~~~~~e~~a~pqr~i~G~------d~-d~~~--~~~~---~~~~-------~~i~~~~ 259 (409) T protein:vir:94 201 SRITRSGMYWQSNAKRT--LERADVTAEFYSFPQKYVTGL------SD-DAEP--METW---KATV-------SSMLQFT 259 (409) T ss_pred cccchhHHHHHHHHHHH--HHHHHHHHHHhcChhheeEec------CC-CCcc--cchh---hhhH-------HHhhcCC Confidence 855 32211 111221 122233444455665555542 11 1111 1111 1111 1123456 Q ss_pred cCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhccccc-cc--CCccchh-hHHHHHHHHHHHHHHHHHH Q lcl|NC_016071. 316 SDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFIN-LG--NDGQGSY-NLSESKQSIHGHFVQRDID 391 (516) Q Consensus 316 ~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLt-s~--~~~~GS~-Al~~vh~ev~~~~~~aDa~ 391 (516) ++.+-+. +++.+.+++ +...|...++.+-.++|-. .+=.+. .+ .++.+|- |+...+..+. 
...+.-.+ T Consensus 260 ~d~dg~~-----~~v~q~~~~-~l~~~~~~l~~~~~~~a~~-t~lP~~~lg~~~~NpsSa~Al~a~~~~L~-~~a~~k~~ 331 (409) T protein:vir:94 260 KDEDGDK-----PTLGQFTQP-SMSPFTEQLRTAAAGFAGE-TGLTLDDLGFVSDNPSSVEAIKASHENLR-LAGRKAQR 331 (409) T ss_pred CCCCCCC-----ceEEecCCC-ChhHHHHHHHHHHHHHhhh-cCCCHHHhccccCchhHHHHHHHHHHHHH-HHHHHHHH Confidence 5533221 222222222 2223433444333444421 110010 00 1111222 3333222222 22233344 Q ss_pred HHHHHHHHHHHHHHHHhcCCcC--Ccc-ccceEEecCc---CchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCC Q lcl|NC_016071. 392 IIVEAFNKNLIPQLLALNDIRL--SDE-DMPKLKPGLI---QEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDE 465 (516) Q Consensus 392 ~i~~~ln~~li~~lv~lN~~~~--~~~-~~P~~~~~~~---~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~ 465 (516) .+...+. ++++..+.+=.... +++ .-..++|... +...+.+.|+++.||+.+|..+.+ .+.+++.+|+.. T Consensus 332 ~fg~~~~-~~~rla~~i~~~~~~~~~~~~~~~v~W~p~~~~~~~~~a~~aDa~~Kl~~ag~~~~~---~~~~~~~lG~~~ 407 (409) T protein:vir:94 332 SLGAGLL-NVAYLAACLRDDAPYLREQFRKTKPKWEPLFEADASMLSLIGDGAIKLNQAIPEFIN---KDTIRDLTGIEG 407 (409) T ss_pred HHHHHHH-HHHHHHHHHhCCCCccccccccceEEeccCCCcchHHHHHHHHHHHHHHHhcccccc---hhHHHHHcCCCC Confidence 4556664 46665555421111 111 1135667633 344456778899999999964443 357899999986 Q ss_pred CC Q lcl|NC_016071. 466 EI 467 (516) Q Consensus 466 ~~ 467 (516) ++ T Consensus 408 ~d 409 (409) T protein:vir:94 408 GE 409 (409) T ss_pred CC Confidence 54 No 162 >protein:vir:97265 Length: 513 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:584 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294521;genbank:gi:149408242;genbank:GeneID:5237130 Probab=94.57 E-value=0.0041 Score=33.69 Aligned_cols=458 Identities=11% Similarity=0.007 Sum_probs=171.8 Q ss_pred CCccc-cCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHh-h---ChHHHHHHHHHH Q lcl|NC_016071. 1 MSTRF-AQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMK-Q---DHTVSTALDTKY 75 (516) Q Consensus 1 ~~~r~-~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~-~---D~~v~s~l~~Rk 75 (516) |+.|. 
+..++....+.++ .|...+-.--..|...++.. .....|... .+.-+-|+.=+ + -+++...++.-- T Consensus 1 m~~~~~~~v~~~h~~y~a~--~~~W~~ird~~~G~~~~r~~-g~~YLPk~~-~E~~~~Y~~rl~rA~~~n~~~~tl~~l~ 76 (513) T protein:vir:97 1 MADKDPKSPATTSGAYDQM--LPRWHVIETLLGGTEAMREA-GETYLPRHQ-EETDKGYQERLASAVLLNMVEQTLDTLS 76 (513) T ss_pred CCCCCCCCCCcCCHHHHHH--HHHHHHHHHHhcChHHHHhh-cccCCCCCC-CCCHHHHHHHHhcccCCChHHHHHHHHh Confidence 88884 3333333222221 11111110000111111100 011222211 23333454433 2 567777777777 Q ss_pred HHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccC-cCCHHHHHHHHHH-HHhhcceeeeEEEeeccccc--cc---- Q lcl|NC_016071. 76 VFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLAN-QQTLRDIARSAAT-FNEYGFSIFEKVYRTESAPS--KY---- 147 (516) Q Consensus 76 ~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~-~~~~~~~l~~~ld-a~~~G~S~~Eivw~~~~~~~--~~---- 147 (516) -.|.+.+..+. .+.+ ....+ .++++... ..+++.+++.++. ++.||.+.+=+-|-..++.. .+ T Consensus 77 G~vf~k~p~~~----~~~p--~~~~~---~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~~~~~~T~A 147 (513) T protein:vir:97 77 GKPFSEPIKLN----EDVP--KAIEE---TILPDVDLQGNNLDVFARQWFREGMAKALCHVLIDMPRPAPREDGQPRTLA 147 (513) T ss_pred hhhhhcCcccC----cCch--HHHHH---HHhhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEecCCCCCccchhHHhHH Confidence 77766654332 1111 12222 23333321 2468899988775 99999887655443221100 00 Q ss_pred ---ccceeeccccccCchhcccccceeecCCCc-e-e--eecccc---ccccc--------ccccccccccccc------ Q lcl|NC_016071. 148 ---AGYITIDKIAFRPQSSLSRSKPWVFDEDGR-T-L--KGIYQS---KMAFA--------NFQNGLTQISSAM------ 203 (516) Q Consensus 148 ---~g~~~~~~l~~r~q~ti~~~~~f~~~~dg~-~-l--~~~~q~---~~~~~--------~~~~~~~~~~~~~------ 203 (516) .+... -.+....+..|-. |.++..|. . + +.++.. ...|. .+..|.+.+-... 
T Consensus 148 de~~~~~r-Py~~~~~~e~Iin---W~~~~v~G~~~L~~v~l~E~~~~~Dgf~~~~~~q~rvL~~g~~~v~r~~~~~~~~ 223 (513) T protein:vir:97 148 DDRREGLR-PYWVMIKPECLLF---ARSEVINGVEVLQHVRIIEHYMEQDGFAEVCKRRIRVLEPGLVQLWEPVKKSNAQ 223 (513) T ss_pred HHHhhccC-ceEEEecHhhhcC---cceeccCcceeeeeEEEEEEEeecCCCcceEEEEEEEEeCceEEEEEeecCCCcc Confidence 00000 0011112222221 33333221 0 0 000000 00000 0111111110000 Q ss_pred --ccccccccCCCccccccccEEEEeecCcCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccc Q lcl|NC_016071. 204 --SLVTNLTSSADEVFIPINKLMVMSLGGTESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNK 281 (516) Q Consensus 204 --~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k 281 (516) .+.+. . ..+..|+.=-|+.+- ....+-..+...|..++..-+--=+...+.-.-+..-+.|++++++.. T Consensus 224 ~~e~~~~-~--~g~~~l~~IP~v~~~-~~~~~~~~~~pPLl~LA~ln~~hy~~~Sd~~~il~~~~~P~l~~~G~~----- 294 (513) T protein:vir:97 224 KEEWALA-D--EWATGLNYVPLVTFY-ADRQGFMMGKPPLLDLAHLNVAHWQSASDQRHILTVSRFPILACSGAS----- 294 (513) T ss_pred ccceEEe-c--CCCCcCCceeEEEEe-cCCCCCCCCccchHHHHHHHHHHHhhhhhHHHHHHhcccceeeeecCC----- Confidence 00000 0 011222222344333 223344445555555554222111122222222333456666666431 Q ss_pred ccCCCCHHHHHHHHHHHHHHHHhhcccceEEEecc-CcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcc Q lcl|NC_016071. 282 AAIDPKSPESEMVQGLMADAANAHAGEQAYFILPS-DMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAG 360 (516) Q Consensus 282 ~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~-g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGq 360 (516) .. +.. .+..|..+.+.+|. |. 
+..+++.+|++- ...+..++...++| ..+|- T Consensus 295 ----~~--~~~----------~i~iG~~~~~~lpe~~~--------~~~yie~~g~~i-~~~~~~l~~le~qm--~~~Ga 347 (513) T protein:vir:97 295 ----GE--DSD----------PVVVGPNKVLYNPDPAG--------RFYYVEHTGQAI-AAGRTDLKDLEEQM--AGYGA 347 (513) T ss_pred ----cC--CCC----------ceEeeccccccCCCCCC--------cceeeccCchhH-HHHHHHHHHHHHHH--HHHHH Confidence 10 000 12348888888885 54 456677766532 23455667677777 33333 Q ss_pred cccccCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecC-cCchhH-HHHHHHH Q lcl|NC_016071. 361 FINLGNDGQGSYNLSESKQSIHGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGL-IQEVDM-EGFSKFV 438 (516) Q Consensus 361 tLts~~~~~GS~Al~~vh~ev~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~-~~~~dl-~~~a~~~ 438 (516) .|.....+.-|--...........++.+-+..+.+.++ ++++++..+-+. +..-++|.+.. ....++ ...++++ T Consensus 348 ~ll~~~~~~~Ta~a~~~~~~~~~S~L~~~a~~le~al~-~~l~~~a~wlg~---~~~~~~v~in~dF~~~~~~~~~~~al 423 (513) T protein:vir:97 348 EFLKRKTGGQTATARALDSAEATSDLSAMTGLFEDALA-QALDITADWLRL---GPNGGTVELVKDYDLEEMDAPGLQAL 423 (513) T ss_pred HhhccCCccccHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhCC---CCCccEEEeccccCcccCCHHHHHHH Confidence 33222111111122234445555667778888888886 588999887632 21223443311 112222 3345677 Q ss_pred HHHHhCCcccccHHHHHHHHHHcCCCCCCCc----ccccCcccccCC---CCCCcccc--cccccCCCCCccc--ccccc Q lcl|NC_016071. 439 QRIGAVGYLPKTPTVINKILEVGGFDEEIPE----DMSTDELLKLLG---QDTSRSGD--GMTAGSNGNGTGK--ISSTR 507 (516) Q Consensus 439 ~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~----~~~~~~~~~~~~---~~~~~~~~--~~~~~~~~~~~~~--~~~~~ 507 (516) .++...|.+.. ....+++++.-=|++..+. ++..++..+..+ .+..++.. 
+.....++++..+ .-+.+ T Consensus 424 ~~a~~~G~is~-~t~~~~L~r~gvl~~d~d~~~~~e~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 502 (513) T protein:vir:97 424 QVAREKRDISR-KTYLNGLRLRGVLPEDFDEDEDWEELMEEISEAMGRAGLDLDPAQKNPPEGGEGEGEGEGEGGEGGEG 502 (513) T ss_pred HHHHhCCCCCH-HHHHHHHHhccCCCccCCHHHHHHHHHHhhhhccCCCCccccccCCCCCCCCCCCCCCCCCCCCCCCc Confidence 77888887654 2233444432223322121 111111111110 01111110 0001111111110 00111 Q ss_pred cchhhhhcC Q lcl|NC_016071. 508 DNSVSNMDN 516 (516) Q Consensus 508 d~~~~~~~~ 516 (516) -|.-+|--- T Consensus 503 ~~~~~~~~~ 511 (513) T protein:vir:97 503 GEGGGNPGG 511 (513) T ss_pred cccCCCCCC Confidence 111111000 No 163 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=94.55 E-value=0.0041 Score=33.66 Aligned_cols=427 Identities=12% Similarity=0.052 Sum_probs=158.8 Q ss_pred CCccccCcccccchhhh-cccCCCCcccccchHHHHHHHHHHHhhc--ccccCCcccHHHHH---HH-----------h- Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNE-NLAVSRLRTGELGSGALSQLRAESEVMK--VEELRWPCFLATVE---AM-----------K- 62 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~-~p~~~~~~~~e~g~~~~~~~~~~~~~~~--~~~lr~~~~~~~y~---~m-----------~- 62 (516) |..= .=+..++..+ . +-.++ ..+++--+-+..++.... .+.+ -+..+.|+ ++ . T Consensus 1 ~~~~---~~~~~~~~~~e~--~~~~~--~~~~~~~~~i~~~i~~~~~~~~~~--~~~~~yY~g~~~i~~~~~~~~~~~~~ 71 (478) T protein:vir:10 1 MISI---NWPWDKPYHEQV--VEQIK--PKYETQEEMILRLVREHKENIDNI--TMGERYYNHHPDILDAPPKRDVNGDY 71 (478) T ss_pred Cccc---cCCCCchhHHHH--HHHHh--hccCCcHHHHHHHHHHHHHHHHHH--HHHHHHhcCCCchhcccccccccccc Confidence 1110 0000000000 0 00000 000000001111111100 0000 00111111 00 0 Q ss_pred ---------hChHHHHHHHHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHH-HHHHhhcce Q lcl|NC_016071. 
63 ---------QDHTVSTALDTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSA-ATFNEYGFS 132 (516) Q Consensus 63 ---------~D~~v~s~l~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~-lda~~~G~S 132 (516) ......-++.+....+.+-+..+.+ + +.+..+++.+++++ .|.+.+..+ .++.-||.+ T Consensus 72 ~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~----~---~d~~~~~l~~~~~n-----~~~~~~~~~~~~~~~~G~~ 139 (478) T protein:vir:10 72 DETKPDWRMYTNYHQNLVDQKVAYAVANPVTFGV----D---NDKALKQIQHTLNH-----KWDDKLVDILTAASNKGIE 139 (478) T ss_pred ccccccceeccchHHHHHHHHHhhhccCCeeeec----C---ChHHHHHHHHHHhc-----CHHHHHHHHHHHHHhcCeE Confidence 0122223333333333444433332 1 22345566666642 255666554 468889987 Q ss_pred eeeEEEeecccccccccceeeccccccCchhcccccceeecCCCceeeecccccccccc----ccc----------cccc Q lcl|NC_016071. 133 IFEKVYRTESAPSKYAGYITIDKIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFAN----FQN----------GLTQ 198 (516) Q Consensus 133 ~~Eivw~~~~~~~~~~g~~~~~~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~----~~~----------~~~~ 198 (516) + +.+|.-. +|.+.+.-+.|+. +.. -|.....+..+..++........ |.. +... T Consensus 140 ~-~~~~~d~------~g~~~~~~~~p~~---~~~--i~d~~~~~~~~~~v~~~~~~~~~~~~~y~~~~i~~~~~~~~~~~ 207 (478) T protein:vir:10 140 W-VQPYVDE------EGEFKTFRVPAEQ---AVP--IWTNKERDELQAFIRVYELDGAERVEYWTKDDVTYYELKEGQLI 207 (478) T ss_pred E-EEEEecC------CCeeEEEEEcccc---eEE--EEcCCCCCceEEEEEEEEecCceEEEEEeCCeEEEEEEcCCeee Confidence 5 5676533 2333333222211 100 01111122222222211100000 000 0000 Q ss_pred cc---cccccccccccCCCccccccccEEEEeecCcCCccccchhHHHHHHHHHH-HHHHHHHHHHHHhhccccceeeee Q lcl|NC_016071. 199 IS---SAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPAGVSPLVGCYRAFRE-KILIENLETIGASKDLGGIIELKI 274 (516) Q Consensus 199 ~~---~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~G~gLlr~~~~~~~f-K~~~~~~w~~~~er~g~~~~v~~~ 274 (516) .. 
...+.............++.-.++.|+ +||.|.|.+..+ .+.+- -...+..++..++.+..|+.++++ T Consensus 208 ~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~-----n~~~g~sd~~~v-~~liDa~~~~~S~~~~~~~~~~~p~~~~~g 281 (478) T protein:vir:10 208 PDFYRSDDHIQPHYYQGNKLMSWGRVPFIPFK-----NNPQEVSDLFMY-KTIIDALDKRLSDTQNTFDESVELIYILKG 281 (478) T ss_pred ccccccccccccceecccccccCCccceEEec-----cCCCCCCcHHHH-HHHHHHHHHHHHHHHHHHHHhhCceeeeec Confidence 00 000000000000001111111233332 478899998874 33332 234566677778888888888776 Q ss_pred cccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHH Q lcl|NC_016071. 275 PSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAIL 354 (516) Q Consensus 275 pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Is 354 (516) ..+ .+.. .....+. . ...+.++.. +...++++.... ....+...++.+.+.|. T Consensus 282 ~~~------~~~~----~~~~~~~-------~--~~~~~~~~~------~~~~~~~l~~~~--~~~~~~~~~~~l~~~i~ 334 (478) T protein:vir:10 282 YEG------EDMK----DFMHNLK-------Y--YKAISVAGE------SGSGVDTIKVEV--PIDSVKEYTKMLRDYII 334 (478) T ss_pred CCc------cccc----hhhhhhh-------h--cceEEecCC------CCCcceEEeecC--ChHHHHHHHHHHHHHHH Confidence 421 1111 1111111 1 112223311 123345554332 23347778888888888 Q ss_pred HHHhcccccccCCccchhhHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchh Q lcl|NC_016071. 355 DRFGAGFINLGNDGQGSYNLSESKQSIHG----HFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVD 430 (516) Q Consensus 355 k~iLGqtLts~~~~~GS~Al~~vh~ev~~----~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~d 430 (516) +.--+..++.+..+ | +++-+.-+... ..+..-.+.+...|. ++++.++.+.+... 
+..-+.+.|....+.| T Consensus 335 ~~s~~p~~~~~~~~-~--n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~-~~~~li~~~~g~~~-~~~~i~i~f~~~~p~d 409 (478) T protein:vir:10 335 EFGQGVDFQQDKFG-N--SPSGIALKFMYSNLDLKANKLKNKTLTALQ-ELLQYIIDFYRLDV-KVQDIEITFNFNVMVN 409 (478) T ss_pred HHhCccccCccccc-c--ccHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhCCCc-ccccceEEecCCCCCC Confidence 87655555543322 1 11222222211 122333344555564 46677777654322 2233678899889999 Q ss_pred HHHHHHHHHHHHhCCcccccHHHHHHHHHHcCC-CCCCCcccccCcccccCCCCCCcccccccccCCCCCcccccccccc Q lcl|NC_016071. 431 MEGFSKFVQRIGAVGYLPKTPTVINKILEVGGF-DEEIPEDMSTDELLKLLGQDTSRSGDGMTAGSNGNGTGKISSTRDN 509 (516) Q Consensus 431 l~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gl-p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 509 (516) ..+.|+++.+| .|++ + .+.+.+.++. +.+..+-+-...+.....+.......+.. ..+.....++ T Consensus 410 ~~e~a~~~~kl--~g~i-S----~et~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~-------~~~~~~~~~~ 475 (478) T protein:vir:10 410 ELENSQIAMNS--TGLL-S----KETILSNHAWVEDPVAEMERIEQENIELNQQLPDIEEGLN-------GEQQRQSENN 475 (478) T ss_pred HHHHHHHHHHH--hCCC-C----hHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhccccccccC-------CCCCCCCCCC Confidence 99999999998 4543 3 3567777765 32211111111111111111111111110 1111111111 Q ss_pred hhh Q lcl|NC_016071. 510 SVS 512 (516) Q Consensus 510 ~~~ 512 (516) ... T Consensus 476 ~~~ 478 (478) T protein:vir:10 476 QPE 478 (478) T ss_pred CCC Confidence 111 No 164 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=94.42 E-value=0.0045 Score=33.46 Aligned_cols=427 Identities=10% Similarity=0.003 Sum_probs=165.9 Q ss_pred CCccccCcccccchhhhcccCCCCccc-cc----chHHHHHHHHHHHhhccccc-CCcccHHHHHH-------------- Q lcl|NC_016071. 
1 MSTRFAQPSEVVKAGNENLAVSRLRTG-EL----GSGALSQLRAESEVMKVEEL-RWPCFLATVEA-------------- 60 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~-e~----g~~~~~~~~~~~~~~~~~~l-r~~~~~~~y~~-------------- 60 (516) |..-.-.+++|..+-. |.++.+.-. .+ ...--.-+..++.... ..+ |.-+..+.|+- T Consensus 1 ~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~e~~~~~i~~~i~~~~-~~~~r~~~l~~YY~g~~~i~~~~~~~~~~ 77 (483) T protein:vir:12 1 MAQALIKGGNILYPSQ--PTQTEIFDAIVRTNNKPETLEEMIVRYIKQHL-EKLPEISIGQEYYEQRPDIVKEPKPVDAT 77 (483) T ss_pred CccchhcCCceeecCc--chhhhhhhcccccCCchhhHHHHHHHHHHHHH-HHHHHHHHHHHHhcccccccccccccccc Confidence 6555555555543322 222211000 00 0000011111211100 000 01111121210 Q ss_pred -----Hh-----hChHHHHHHHHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHH-HHHHHhh Q lcl|NC_016071. 61 -----MK-----QDHTVSTALDTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARS-AATFNEY 129 (516) Q Consensus 61 -----m~-----~D~~v~s~l~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~-~lda~~~ 129 (516) .+ -.....-++.+....+.+-+..+.+ .+.+..+++++++++ .+.+.+.+ ..++.-| T Consensus 78 ~~~~~~~~~~ki~~n~~k~Ivd~~~~~l~G~p~~~~~-------~d~~~~~~l~~~~~n-----~~~~~~~~~~~~~~~~ 145 (483) T protein:vir:12 78 GAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKH-------TDDEVVKRIDEVLGN-----RFDDKLHSVLTGASNK 145 (483) T ss_pred ccccccccccccccchHHHHHHHHhhhhcccCceecc-------CChHHHHHHHHHHhc-----cHHHHHHHHHHHHhhC Confidence 00 0222333334333444444433321 234566777777753 25556655 4578889 Q ss_pred cceeeeEEEeecccccccccceeeccccccCchhcccccceeec--CCCceeeeccccccccc----cccccccccc-cc Q lcl|NC_016071. 130 GFSIFEKVYRTESAPSKYAGYITIDKIAFRPQSSLSRSKPWVFD--EDGRTLKGIYQSKMAFA----NFQNGLTQIS-SA 202 (516) Q Consensus 130 G~S~~Eivw~~~~~~~~~~g~~~~~~l~~r~q~ti~~~~~f~~~--~dg~~l~~~~q~~~~~~----~~~~~~~~~~-~~ 202 (516) |.+ ++++|.-. +|.+.+..+.|+. +. -.|+ ..++.+..++....... .+.......- .. 
T Consensus 146 G~~-y~~v~~d~------d~~~~i~~~~p~~---~~----~v~d~~~~~~~~~~ir~~~~~~~~~~~~y~~~~v~~~~~~ 211 (483) T protein:vir:12 146 GIE-WLHPYLDE------EGEFKLFRVPAEQ---GI----PIWTDKEHEELEAFIRMYKLENETKVEYWDKVTVNYYVYE 211 (483) T ss_pred CeE-EEEEEEcC------CCceEEEEEcccc---eE----EEEcCCCCCceEEEEEEEEeecceEEEEEecCeEEEEEEe Confidence 986 45777533 3444444333322 10 1122 22333333322110000 0000000000 00 Q ss_pred cccc-cccc---c----CCCccccccccEEEEeecCcCCccccchhHHHHHHHHH-HHHHHHHHHHHHHhhccccceeee Q lcl|NC_016071. 203 MSLV-TNLT---S----SADEVFIPINKLMVMSLGGTESNPAGVSPLVGCYRAFR-EKILIENLETIGASKDLGGIIELK 273 (516) Q Consensus 203 ~~~~-~~~~---~----~~~~~~iP~~k~i~~~~~~~~g~p~G~gLlr~~~~~~~-fK~~~~~~w~~~~er~g~~~~v~~ 273 (516) .+.. .... . ......+..--++.|. +|+.|.|.+..+- +.+ --+..+..++..++-+..+..+++ T Consensus 212 ~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----nn~~g~sd~e~v~-~liDa~d~~~S~~~~~~~~~~~~~lv~~ 285 (483) T protein:vir:12 212 NGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFK-----NNDLEISDIFMYK-TLIDAYNRRLSDLSNTFKDSNELTYVLT 285 (483) T ss_pred CCeeeecccccccccccccccCCCCccceEEec-----CCCCCCCchhhHH-HHHHHHHHHHHHHHHHHHHhcCceeeee Confidence 0000 0000 0 0000001111123332 3678899988643 333 223456667777787888888776 Q ss_pred ecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHH Q lcl|NC_016071. 274 IPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAI 353 (516) Q Consensus 274 ~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~I 353 (516) +... ....+ ....+. ....+.++.+.+ ++++..+. 
....+..+++.+.+.| T Consensus 286 g~~~--------~~~~~--~~~~~~---------~~~~~~~~~~~~--------~~~l~~~~--~~~~~~~~~~~l~~~I 336 (483) T protein:vir:12 286 NYDD--------QELPE--FKRLLR---------YYGAIKVSDNGG--------VDTIQVEV--PVENSKKYLDELYQKI 336 (483) T ss_pred cCCc--------ccchh--HHHhhh---------hccccccCCCCc--------ceEEeecC--CHHHHHHHHHHHHHHH Confidence 5321 11111 111111 111233455543 44443332 2234677888888888 Q ss_pred HHHHhcccccccCCccchhhHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhH Q lcl|NC_016071. 354 LDRFGAGFINLGNDGQGSYNLSESKQ--SIHGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDM 431 (516) Q Consensus 354 sk~iLGqtLts~~~~~GS~Al~~vh~--ev~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl 431 (516) .+.--...++.++-++.+.+.| ... .-....+..-.+.+...| +++++.++.+..... +..-..+.|....+.|. T Consensus 337 ~~~s~~p~~~~~~~~~n~Sg~A-l~~~~~~l~~k~~~~~~~f~~~l-~~~~~li~~~~~~~~-~~~~i~v~f~~~~p~~~ 413 (483) T protein:vir:12 337 MLFGQAVDFSSDKFGSAPSGVA-LEFLYTNLNLKADKLARKAKVAI-QELLWFVFEHFDIKG-EHKDVDISFNYNKVANT 413 (483) T ss_pred HHHhCCCCCCccccccCcHHHH-HHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhcCCC-ccceeeEEeCCCCCCCH Confidence 7765544444443222111111 111 111112222333444445 346666666543222 22234678899999999 Q ss_pred HHHHHHHHHHHhCCcccccHHHHHHHHHHcCC-CCCCCcccccCcc----cccCCCCCCccccc--ccccCCCCCccc Q lcl|NC_016071. 432 EGFSKFVQRIGAVGYLPKTPTVINKILEVGGF-DEEIPEDMSTDEL----LKLLGQDTSRSGDG--MTAGSNGNGTGK 502 (516) Q Consensus 432 ~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gl-p~~~~~~~~~~~~----~~~~~~~~~~~~~~--~~~~~~~~~~~~ 502 (516) .+.++++.+|. |++. ++.+.+.++. +.+..+-+-...+ .+...+. ...+.. 
.....+++.+.+ T Consensus 414 ~~~a~~~~kl~--GiiS-----~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~-~~~~~d~~~~~~~~~~~e~e 483 (483) T protein:vir:12 414 ELQVQTAQQSM--GIVS-----HETVLENHPFVEDLQAELERIEQEQMEYNKQLPNL-DDGGADGAQQQERSNNKESE 483 (483) T ss_pred HHHHHHHHHHh--ccCc-----hHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhcccc-cccccCCcccCCCCCcccCC Confidence 99999999984 6532 3455666643 3322110100110 1111110 000000 001111121211 No 165 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=94.40 E-value=0.0045 Score=33.43 Aligned_cols=418 Identities=11% Similarity=-0.002 Sum_probs=155.3 Q ss_pred cccchhhhcccCCCCcccccchHHHHHHHHHHHh--hcccccCCcccHHHHH----------------HHh-hChHHHHH Q lcl|NC_016071. 10 EVVKAGNENLAVSRLRTGELGSGALSQLRAESEV--MKVEELRWPCFLATVE----------------AMK-QDHTVSTA 70 (516) Q Consensus 10 ~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~--~~~~~lr~~~~~~~y~----------------~m~-~D~~v~s~ 70 (516) ...++.+-. -.| +..++-. .-+..++.. .+.+.++ +..+.|+ ..+ ..+...-+ T Consensus 1 ~~~~~~~~~-~~~--~~~~~~~---~~i~~~i~~~~~~~~r~~--~~~~yy~g~~~i~~~~~~~~~~~~~ki~~n~~~~i 72 (453) T protein:vir:73 1 MNLKPIKLM-TYS--RDEEITD---KVVNDFMKKHQEEVERYE--YLGNMYKGIMEISSQKAKDSWKPDNRLTNNFAKYI 72 (453) T ss_pred Cccccceee-ecc--ccccCCH---HHHHHHHHHHHHHHHHHH--HHHHHhccccchhcCCCCCccCccceeecchHHHH Confidence 001000000 000 0011100 011111110 0001100 0011110 000 13344444 Q ss_pred HHHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHH-HHHHhhcceeeeEEEeeccccccccc Q lcl|NC_016071. 71 LDTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSA-ATFNEYGFSIFEKVYRTESAPSKYAG 149 (516) Q Consensus 71 l~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~-lda~~~G~S~~Eivw~~~~~~~~~~g 149 (516) +.+....+.+-+..+.+ + +.+..+++.++++.- .|...+..+ .++.-||.+ ++.+|.-. 
+| T Consensus 73 vd~~~~~l~g~~~~~~~----~---d~~~~~~l~~~~~~n----~~~~~~~~~~~~~~~~G~~-~~~v~~d~------~~ 134 (453) T protein:vir:73 73 VDTFVGYFNGIPIKKTH----D---DKSVLEAMQLFDNLN----DMEDEESELAKIACVYGRA-YELMYQNE------ST 134 (453) T ss_pred HHHhhhhhcccCceeec----C---ChHHHHHHHHHHHhc----ChhHHHHHHHHHHHhcCeE-EEEEEeCC------CC Confidence 44444445555444432 2 234456677776542 255555554 468889986 45777533 23 Q ss_pred ceeeccccccCchhcc-----c----ccceeecCCCceeeeccccccccccccccccccccccccccccccCCCcccccc Q lcl|NC_016071. 150 YITIDKIAFRPQSSLS-----R----SKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPI 220 (516) Q Consensus 150 ~~~~~~l~~r~q~ti~-----~----~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~ 220 (516) .+.+..+.|+.-..+. + ..+|.++.++.....+......... ..-...+.+........+ .|| T Consensus 135 ~~~i~~~~p~~~~~v~dd~~~~~~~~~i~~~~~~~~~~~~~vyt~~~i~~~-----~~~~~~~~~~~~~~~~~g--~vP- 206 (453) T protein:vir:73 135 ESEVIYCSPLNVFMVYDDSIKQKPLFAVYYGFDEEGNLSGTVYTLLETISI-----TGKAGEVKFGESTYNVYS--DLP- 206 (453) T ss_pred ceEEEEEcccceEEEEeCCCCceeEEEEEEEEecCceEEEEEEeCCeEEEE-----EecCCceEEccceeccCC--cee- Confidence 3333222222110000 0 0012233333321111100000000 000000000000111111 122 Q ss_pred ccEEEEeecCcCCccccchhHHHHHHHHH-HHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHH Q lcl|NC_016071. 221 NKLMVMSLGGTESNPAGVSPLVGCYRAFR-EKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMA 299 (516) Q Consensus 221 ~k~i~~~~~~~~g~p~G~gLlr~~~~~~~-fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~ 299 (516) ++.|+ +|+.|.|.+..+-. .+ -=...+..++..++.+..|..++++.- ....+...+.... 
T Consensus 207 --vv~~~-----n~~~g~s~~~~v~~-liDa~~~~~S~~~~~~~~~~~~~l~~~g~~---------~~~~~~~~~~~~~- 268 (453) T protein:vir:73 207 --IVEYN-----FNEERQSIFEPVHS-LINSYNKVTSEKANDVEYFSDQYLVFLGAE---------VDEEDAKNIKDNR- 268 (453) T ss_pred --EEEec-----CCCCCCcchhhHHH-HHHHHHHHHHHHHHHHHHhccceeeeecCC---------CCchhhhcccccc- Confidence 23333 46778888876432 33 223456677778888888888877531 1111111111100 Q ss_pred HHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchh-hHHHHH Q lcl|NC_016071. 300 DAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSY-NLSESK 378 (516) Q Consensus 300 ~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~-Al~~vh 378 (516) ++.......++.+..+.+ .+++++..+.. ...+...++.+.+.|...--+..++.+..+.-|. |+- .. T Consensus 269 ~~~~~~~~~~~~~~~~~~--------~d~~~l~~~~~--~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Al~-~~ 337 (453) T protein:vir:73 269 LINFFDKNSNGQGTNAAK--------VDVKFLDKPDS--DVQTENLLNRLERSIFQFTMAANISDENFGNSSGVALA-YK 337 (453) T ss_pred cccccccccccccccccC--------ceeEEeeecCC--HHHHHHHHHHHHHHHHHHhCCcccCcccccCccHHHHH-HH Confidence 000000001111122222 33555544332 2346778899999997755444444433221121 211 11 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcC-C-ccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHH Q lcl|NC_016071. 379 QSIHGHFVQRDIDIIVEAFNKNLIPQLLALNDIRL-S-DEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINK 456 (516) Q Consensus 379 ~ev~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~-~-~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~ 456 (516) ..-....+..-.+.+...|. ++++.++.+....+ . +..-..+.|....+.|..+.++++.+++ |++ + ++. 
T Consensus 338 ~~~l~~ka~~~~~~~~~~l~-~~~~li~~~~~~~~~~~~~~~i~v~f~~~~p~~~~~~a~~~~k~~--gii-s----~et 409 (453) T protein:vir:73 338 LQAMSNLALSFQRKFQSALN-RRYSLWSSLSTNASNKDAWKDIEYTFTRNEPKDIKEQAETANILK--GIT-S----EET 409 (453) T ss_pred HHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhccCCccccccceEEeCCCCCCCHHHHHHHHHHHh--ccC-c----HHH Confidence 11111122223334445553 45555555421111 1 1223578898889999999999999986 653 3 344 Q ss_pred HHHHcCC-CCCCCcccccCcccc-cCCCCCCcccccccccCCCCCcccccccccchhhhh Q lcl|NC_016071. 457 ILEVGGF-DEEIPEDMSTDELLK-LLGQDTSRSGDGMTAGSNGNGTGKISSTRDNSVSNM 514 (516) Q Consensus 457 i~e~~Gl-p~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~ 514 (516) +.+.++. +.+..+-+-...+.. ....... + ..+..|..-.+| T Consensus 410 ~~~~~~~~~d~~~E~~ri~~E~~~~~~~~~~-----------~-----~~~~~~~~~~~~ 453 (453) T protein:vir:73 410 ALSVISVIPDVQAEMEKIKKKKLLQLSLTRT-----------S-----NLVRMKQMRGNL 453 (453) T ss_pred HHHhCCCCCCHHHHHHHHHHHHHHHHHHHHh-----------c-----cCCcchhhhcCC Confidence 5555544 322111000000000 0000000 0 011111122222 No 166 >protein:vir:103177 Length: 533 # NCBI annotation: gp131 # Family: family:all:1036 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717798;genbank:gi:113200635;genbank:GeneID:4239186 Probab=94.09 E-value=0.0054 Score=33.00 Aligned_cols=460 Identities=13% Similarity=0.088 Sum_probs=168.9 Q ss_pred CCccccCcccccchhhhccc-CCCCcccccchHHHH--HH-HHHHHhhcccccC-CcccHHHHHHHhhChHHHHHHHHHH Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLA-VSRLRTGELGSGALS--QL-RAESEVMKVEELR-WPCFLATVEAMKQDHTVSTALDTKY 75 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~-~~~~~~~e~g~~~~~--~~-~~~~~~~~~~~lr-~~~~~~~y~~m~~D~~v~s~l~~Rk 75 (516) ||.=+-=.-+-.+....+|| +|+-. +=|+.-+. 
.+ +.+.+.+ +..+ ..+.++.|++|..++.|-++++-.- T Consensus 1 m~~lfg~~i~~~~~~~~~~s~~~~~~--~dg~~~i~~~~~~~~~~~~e--~~~~~~~eLI~~YR~ma~~pEvd~Av~eIV 76 (533) T protein:vir:10 1 MSQLFGFSLERAKKAPKGPSFVQKDN--LDGSQPVSGGGYYGYTVDFD--GQVRNEYQLISRYREMVLQPECDSAVDDIV 76 (533) T ss_pred CccccccccccccccccCCCCCCCCc--ccccceeecccccceeeecc--cccchHHHHHHHHHHHhhccchhhHHHHhh Confidence 55433222111122222333 12211 11111100 01 1121111 2233 3468999999999999999999876 Q ss_pred HHHhcCCce---eeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHHHHHhhcceeeeEEEeeccccccccccee Q lcl|NC_016071. 76 VFVTKAFND---FKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAATFNEYGFSIFEKVYRTESAPSKYAGYIT 152 (516) Q Consensus 76 ~~v~~~~w~---i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~lda~~~G~S~~Eivw~~~~~~~~~~g~~~ 152 (516) .-+.-.+-. +.++-. +.+.++.+-+.|.+. |+ .+..+|+.--+||..+= .|+-||++. T Consensus 77 neaiv~d~~~~pV~i~Ld-~~~~s~~iK~kI~eE---------F~-~Il~ll~F~~~~~e~fR--------~WYVDgRi~ 137 (533) T protein:vir:10 77 NETICGNFDDVPVSVELS-NLKVSDKIKKLIREE---------FG-EILRLLDFENRSYEIFR--------RWYVDGRLF 137 (533) T ss_pred cceeeecCCCceEEEEec-ccccchHHHHHHHHH---------HH-HHHHHhccchhhhHHHh--------hhhhcceEE Confidence 643221100 111111 111233333333332 22 23355555556665542 233344444 Q ss_pred eccccc-------------cCchhccccccee-ecCCCcee----eeccccccccccccccccccccccccccccccCCC Q lcl|NC_016071. 153 IDKIAF-------------RPQSSLSRSKPWV-FDEDGRTL----KGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSAD 214 (516) Q Consensus 153 ~~~l~~-------------r~q~ti~~~~~f~-~~~dg~~l----~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 214 (516) .+++.. .+|..|++.|... ...++-.- .........+..|.+. +. ..+... 
T Consensus 138 fHkiid~~~pk~GI~ELr~lDPr~i~~vr~i~~~~~~~~~~~~~~~~v~~~~~eyf~Ynp~--------g~---~~~~~~ 206 (533) T protein:vir:10 138 YHKVIDPDNPQGGLIELRYIDPRKIRKINETEQKRPEQLRGLPLNQQLSPKSAEYFLYDPK--------GL---KNSTTQ 206 (533) T ss_pred EEEEecCCCccccceeeeeccccceeeeeeeeccCCCccceeecchhhhccceeeeeeccc--------cc---cccCCC Confidence 443322 2222333222211 11121100 0000011111111111 11 112355 Q ss_pred ccccccccEEEEeecCc--CCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccC--CCCHHH Q lcl|NC_016071. 215 EVFIPINKLMVMSLGGT--ESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAI--DPKSPE 290 (516) Q Consensus 215 ~~~iP~~k~i~~~~~~~--~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~--~~~~~~ 290 (516) ++.||. ..|+|+|..- .++..=.|.|.++..++==-+.....-.++ .+-.+|.+|+=+-.- =|.... T Consensus 207 ~vkI~~-dAI~y~hSGl~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIY--------RitRAPeRRvFYIDVGnLPk~KA 277 (533) T protein:vir:10 207 GLKIAP-DSICYVHSGIMDLNKNMTLSHLHKAIKAVNQLRMIEDSLVIY--------RLSRAPERRIFYIDVGNLPKNKA 277 (533) T ss_pred ceecch-hheeeeeccceeCCCCceeccchHhHHHHHhhHHHHhhHHHH--------hhhccccceEEEEecCCCCchhH Confidence 788988 5688888532 222223478888888764333322222221 122222222211111 122222 Q ss_pred HHHHHHHHHHHHHhhc----ccceEE--------------EeccCcccccccccceeeeeccccCcchhHHHHHHHHHHH Q lcl|NC_016071. 291 SEMVQGLMADAANAHA----GEQAYF--------------ILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKA 352 (516) Q Consensus 291 ~~~l~~l~~~~~~~~~----g~~a~~--------------iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~ 352 (516) .+. +..++..++. ....|- -||.= +.....+|+-+ .|+... .-.+=|+|..+. 
T Consensus 278 eqY---lr~iM~k~KNklVYDa~TGev~ddrk~msMlEDyWLPRR---eGgrgTEItTL--pGgqnL-gem~DV~YF~kK 348 (533) T protein:vir:10 278 EQY---LREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRR---EGGRGTEITTL--PGGQNL-GELEDVKYFQKK 348 (533) T ss_pred HHH---HHHHHHhccceEEEeccCceecccchhhhhHhhhccccc---CCCCccceeec--cccCCc-ChHHHHHHHHHH Confidence 222 3333332221 011111 12210 00111233333 222222 233458999999 Q ss_pred HHHHHhcccccccCCccchhhH-HH-HHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCc---cccceEEecCc Q lcl|NC_016071. 353 ILDRFGAGFINLGNDGQGSYNL-SE-SKQSI-HGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSD---EDMPKLKPGLI 426 (516) Q Consensus 353 Isk~iLGqtLts~~~~~GS~Al-~~-vh~ev-~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~---~~~P~~~~~~~ 426 (516) +-+++--..--.+.+++-+... ++ +-.|+ |...+..-...+...|..-|-..|+.=+ .--+. .--..+.|+.. T Consensus 349 LY~aLnVP~SRl~~e~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLiLKg-iit~eeW~~i~~~I~~~f~ 427 (533) T protein:vir:10 349 LYKSLNVPGSRLETETTFNVGRAAEITRDEVKFQKFVARLRKRFSELFTDLLKTQLVLKG-VISIEEWDQMKEHIQYDYI 427 (533) T ss_pred HHHHhCCCccccCCCCcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcc-CCCHHHHHHHhhcceEeee Confidence 9988887764444443222211 22 22333 2333444444444444433333332211 11110 11123444333 Q ss_pred Cc------hhHHHHHHHHHHHHhCCcccccHHHHHHHHHHc-CCCCCCC--cccccCc---------ccccCCCCCCccc Q lcl|NC_016071. 427 QE------VDMEGFSKFVQRIGAVGYLPKTPTVINKILEVG-GFDEEIP--EDMSTDE---------LLKLLGQDTSRSG 488 (516) Q Consensus 427 ~~------~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~-Glp~~~~--~~~~~~~---------~~~~~~~~~~~~~ 488 (516) .+ .+.+-+.+++..|..+--.+-...+.+|+++.+ .+...+- ++...+. +...+-+.+.+ . T Consensus 428 ~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~ILr~tDeei~~~~kqI~~E~k~~~~~~p~~~~~~~~~~-~ 506 (533) T protein:vir:10 428 ADNYFAELKEIEIRNERMNQVATMDPFVGKYFSVEYMRRQVLKQTDVEMKEIDKQIESEMESGIIADPAAEMDPAMAA-G 506 (533) T ss_pred ecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHHHHHHHHHHHHHHHhCCCCCCCcchhhHHhcC-C Confidence 32 444445556666655422222244577876654 4331100 0000000 00000000011 1 Q ss_pred ccccccCCCCCcccccccccchhhhhcC Q lcl|NC_016071. 
489 DGMTAGSNGNGTGKISSTRDNSVSNMDN 516 (516) Q Consensus 489 ~~~~~~~~~~~~~~~~~~~d~~~~~~~~ 516 (516) +....+.++ +..+|+..|.+....|- T Consensus 507 ~~~~~~~~~--~~~~~~~~~~~~~~~~~ 532 (533) T protein:vir:10 507 DPDAGGAPA--EEVAPEGPDPSDERKAE 532 (533) T ss_pred CCCcCCccc--ccCCCCCCCcchhhccC Confidence 111111111 12234445555555555 No 167 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=94.04 E-value=0.0056 Score=32.94 Aligned_cols=422 Identities=10% Similarity=-0.012 Sum_probs=160.6 Q ss_pred CCccccCcccccchhhhc-ccCCCCcccccchHHHHHHHHHHHh--hcccccCCcccHHHH------------------- Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNEN-LAVSRLRTGELGSGALSQLRAESEV--MKVEELRWPCFLATV------------------- 58 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~-p~~~~~~~~e~g~~~~~~~~~~~~~--~~~~~lr~~~~~~~y------------------- 58 (516) +...+-...+.....-.. ...+ |+-. ..+..++.. ++.+.+ -++.+.| T Consensus 20 ~~~~~~~~~~~~~~~~~~~~~~~-----~~~~---~~i~~~i~~~~~~~~r~--~~l~~YY~g~~~I~~~~~~~~~~~~~ 89 (492) T protein:vir:94 20 ILYPSQPTQTEIFDAIVRTNNKP-----ETLE---EMIVRYIKQHLEKLPEI--SIGQEYYEQRPDIVKEPKPVDATGAV 89 (492) T ss_pred eeecCccchhhhhhcccccCCch-----hhHH---HHHHHHHHHHHHHHHHH--HHHHHHhccccccccccccccccccc Confidence 111110000000000000 0000 0000 000111000 000000 0001111 Q ss_pred HHHh-----hChHHHHHHHHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHH-HHHHHhhcce Q lcl|NC_016071. 
59 EAMK-----QDHTVSTALDTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARS-AATFNEYGFS 132 (516) Q Consensus 59 ~~m~-----~D~~v~s~l~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~-~lda~~~G~S 132 (516) +..+ ..+...-++.+....+.+-+..+++ .+.+..++++.++++ .+.+.+.+ ..++.-||.+ T Consensus 90 ~~~~~~~ri~~n~~k~Ivd~~~~yl~G~p~~~~~-------~d~~~~~~l~~~~~n-----~~~~~~~~~~~~a~~~G~a 157 (492) T protein:vir:94 90 DPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKH-------TDDEVVKRIDEVLGN-----RFDDKLHSVLTGASNKGIE 157 (492) T ss_pred cccccccccccchHHHHHHHHHhhhcccCceecc-------CchHHHHHHHHHHhc-----cHHHHHHHHHHHHhhCCeE Confidence 0000 1334444455555555555544432 223556777777653 25555554 4568889987 Q ss_pred eeeEEEeecccccccccceeeccccccCchhcccccceeec--CCCceeeeccccccccccccccccccccccccccccc Q lcl|NC_016071. 133 IFEKVYRTESAPSKYAGYITIDKIAFRPQSSLSRSKPWVFD--EDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLT 210 (516) Q Consensus 133 ~~Eivw~~~~~~~~~~g~~~~~~l~~r~q~ti~~~~~f~~~--~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~ 210 (516) . +++|.-. +|.+.+..+.|+ .+. -.|+ ..++.+..++........ ......+.....+.. T Consensus 158 ~-~~v~~d~------dg~~~~~~~~p~---~~~----~v~d~~~~~~~~a~ir~~~~~~~~----~~~~y~~~~v~~~~~ 219 (492) T protein:vir:94 158 W-LHPYLDE------EGEFKLFRVPAE---QGI----PIWTDKEHEELEAFIRMYKLENET----KVEYWDKVTVNYYVY 219 (492) T ss_pred E-EEEEecC------CCceEEEEEccc---ceE----EEEcCCCCCceEEEEEEEeeccce----eEEEEecCeEEEEEE Confidence 5 5777533 344444333332 111 1222 233444433321110000 000000000000000 Q ss_pred c----------CC-----Ccc--ccccccEEEEeecCcCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeee Q lcl|NC_016071. 211 S----------SA-----DEV--FIPINKLMVMSLGGTESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELK 273 (516) Q Consensus 211 ~----------~~-----~~~--~iP~~k~i~~~~~~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~ 273 (516) . .. ... 
.++.--++.| .+|+.|.|.+..+-...=--+..+...+..++.+..|..+++ T Consensus 220 ~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~-----~nn~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~ 294 (492) T protein:vir:94 220 ENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPF-----KNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLK 294 (492) T ss_pred ecCeeeeccccccccccccccccCCCccceEEe-----cCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeee Confidence 0 00 000 1111112333 246678898876433222223345666677788888887776 Q ss_pred ecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHH Q lcl|NC_016071. 274 IPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAI 353 (516) Q Consensus 274 ~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~I 353 (516) +... ....+ ....+. ......++.+.+ ++++..+. ....+...++++.+.| T Consensus 295 g~~~--------~~~~~------~~~~~~-----~~~~~~~~~~~~--------~~~l~~~~--~~~~~~~~~~~l~~~I 345 (492) T protein:vir:94 295 NYDD--------QELPE------FKRLLR-----YYGAIKVSDNGG--------VDTIQVEV--PVENSKKYLDELYQKI 345 (492) T ss_pred cCCc--------ccchh------hHHHHh-----hccceecCCCCc--------ceeEeccC--CHHHHHHHHHHHHHHH Confidence 5321 11111 111111 112334565543 34443322 2234677888888888 Q ss_pred HHHHhcccccccCCccchhhHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhH Q lcl|NC_016071. 354 LDRFGAGFINLGNDGQGSYNLSESKQ--SIHGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDM 431 (516) Q Consensus 354 sk~iLGqtLts~~~~~GS~Al~~vh~--ev~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl 431 (516) .+.--...++.+.-++.+.+.| ... .-....+..-.+.+...| +++++.++.+..... +..--.+.|....+.|. 
T Consensus 346 ~~~s~~p~~~~~~~~~n~Sg~A-l~~~~~~l~~k~~~k~~~f~~~l-~~~~~li~~~~~~~~-~~~~i~v~f~~~~p~~~ 422 (492) T protein:vir:94 346 MLFGQAVDFSSDKFGSAPSGVA-LEFLYTNLNLKADKLARKAKVAI-QELLWFVFEHFDIKG-EHKDVDISFNYNKVANT 422 (492) T ss_pred HHHhCCcCCCccccccCchHHH-HHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhcCCc-ccceeeEEecCCCCCCH Confidence 7776555555443222111111 111 112222333334455555 346666666653222 22224678888889999 Q ss_pred HHHHHHHHHHHhCCcccccHHHHHHHHHHcCC-CCCCCcccccCcccccCCCCCCcccccccccCCCCCcccccccccch Q lcl|NC_016071. 432 EGFSKFVQRIGAVGYLPKTPTVINKILEVGGF-DEEIPEDMSTDELLKLLGQDTSRSGDGMTAGSNGNGTGKISSTRDNS 510 (516) Q Consensus 432 ~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gl-p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 510 (516) .+.++++.+|+ |++ + .+.+.+.++. +.+..+-+-...+.....+.......+....... |+. T Consensus 423 ~e~~~~~~kl~--gii-S----~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~----------~~~ 485 (492) T protein:vir:94 423 ELQVQTAQQSM--GIV-S----HETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADSAQQ----------QER 485 (492) T ss_pred HHHHHHHHHHh--ccC-c----hHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhccccccccCCCCcc----------ccC Confidence 99999999985 653 3 3556677754 3222111111111000000000000000000000 111 Q ss_pred hhhhcC Q lcl|NC_016071. 511 VSNMDN 516 (516) Q Consensus 511 ~~~~~~ 516 (516) ..+..| T Consensus 486 ~~~~e~ 491 (492) T protein:vir:94 486 SNNKES 491 (492) T ss_pred CccccC Confidence 111111 No 168 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=93.90 E-value=0.006 Score=32.76 Aligned_cols=434 Identities=8% Similarity=-0.050 Sum_probs=143.8 Q ss_pred hhhcccCCCCcccccchHHH----HHHHH----H----HHhhccccc-CC-----cccHHHHHHHhhChHHHHHHHHHHH Q lcl|NC_016071. 
15 GNENLAVSRLRTGELGSGAL----SQLRA----E----SEVMKVEEL-RW-----PCFLATVEAMKQDHTVSTALDTKYV 76 (516) Q Consensus 15 ~~~~p~~~~~~~~e~g~~~~----~~~~~----~----~~~~~~~~l-r~-----~~~~~~y~~m~~D~~v~s~l~~Rk~ 76 (516) --+.|- ..+...++-.... ..+.. + .+-+-.+.+ .. ++..+.++.+..-...+-++++.-. T Consensus 1 ~~~~p~-~~l~~~~~~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~ 79 (479) T protein:vir:99 1 MIDLPD-EDLSSEGLAKYLETKVFPKMNTECERLDDFEAWTKNGQEVPDLATRHKNKEREVLQQLSRKPWMGLMVNSFAQ 79 (479) T ss_pred CccCCc-ccCChhHHHHHHHHHHHHHHHHHhHHHHHHHHHHhcCCcccccccccCChhHHHHHHHhhcCcHHHHHHHHHh Confidence 111221 1111111110000 00000 0 000001111 00 0001111111111122222222111 Q ss_pred HHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHH-HHHHhhcceeeeEEEeecccccccccceeecc Q lcl|NC_016071. 77 FVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSA-ATFNEYGFSIFEKVYRTESAPSKYAGYITIDK 155 (516) Q Consensus 77 ~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~-lda~~~G~S~~Eivw~~~~~~~~~~g~~~~~~ 155 (516) .+. .-.|++ .++.. .+.+.+.|+. ..|.+...++ .++.-||. .++++|..... ..-+|...+.- T Consensus 80 ~l~--~~gf~~---~d~~~----~~~~~~i~~~----N~~d~~~~~~~~~a~~~G~-af~~v~~~~~~-~d~~g~~~i~~ 144 (479) T protein:vir:99 80 QLI--VDGYRK---TGTNE----NAKGWDTWRL----NQMDKQQFWLNRAVLTFGY-AFIKVTSGISP-LDGTTVARIKC 144 (479) T ss_pred hcc--cccccC---CCchh----hHHHHHHHHh----cChhHHHHHHHHHHhhcCc-eEEEEecCCCC-cCCCCceEEEE Confidence 111 111222 12222 2334455543 1356666664 47889998 56788852110 00122222222 Q ss_pred ccccCchhcc-c-------ccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEe Q lcl|NC_016071. 156 IAFRPQSSLS-R-------SKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMS 227 (516) Q Consensus 156 l~~r~q~ti~-~-------~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~ 227 (516) +.|+.-..|. + ..+..++.++................ ...+.+. ......++..-++.|. 
T Consensus 145 ~~p~~~~~iydd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~-----~~~~h~~g~vPvv~f~ 212 (479) T protein:vir:99 145 IDPRDAFAIWEDPYWDEWPKYLLERQPNGQYWWWTEEDYSIFEFK-------QGKFIYR-----ETVSHDYGHIPFVRYV 212 (479) T ss_pred echhheEEEecCCcccceeeEEEeecCceeEEEEecceEEEEEec-------CCceeec-----cccccCCCCcceEEee Confidence 2211110010 0 00111111111111000000000000 0000000 0111123344467778 Q ss_pred ecCcCCccccchhHHHHHHHHHHH-HHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhc Q lcl|NC_016071. 228 LGGTESNPAGVSPLVGCYRAFREK-ILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHA 306 (516) Q Consensus 228 ~~~~~g~p~G~gLlr~~~~~~~fK-~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~ 306 (516) ++.+. .++|.|.+..+ ...+-- +..+......++.+..|..++.|-- ..+....+.... .+. T Consensus 213 n~~~~-~~~g~sd~e~v-~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~------~~~~~~~~~~~~--------~~~- 275 (479) T protein:vir:99 213 NVMDL-RGVCYGDVEPL-VTVAKAIDKTGLDILLVQHHQSFQIRWATGLM------LPEGANADQEKM--------RFA- 275 (479) T ss_pred cCCCc-CcCCcchhHHH-HHHHHHHHHHHHHHHHHHHHhhchhhhhcCCC------cccccccchhcc--------ccc- Confidence 87766 45799988764 333322 2234445566677777766655421 111111111000 000 Q ss_pred ccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchh-hHHHHHHHHHHHH Q lcl|NC_016071. 307 GEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSY-NLSESKQSIHGHF 385 (516) Q Consensus 307 g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~-Al~~vh~ev~~~~ 385 (516) ...++...|.+. ++...+.. ....|...++.+-.+|+...--..-..+..+..|. |+.. ...-.... 
T Consensus 276 --~~~i~~~~~~~~--------~~~q~~~~-~~~~~~~~l~~~i~~i~~~t~~p~~~~g~~~n~Sg~Al~~-~~~~l~~k 343 (479) T protein:vir:99 276 --QESMLISQNEKA--------SFGAIPAA-PLDGLLNAYKESLLEFLALAQLPPHIAGQIVNVAADALAA-GTRQTMQK 343 (479) T ss_pred --cccceeecCCCc--------eEEEeccc-chHHHHHHHHHHHHHHhccCCCCHHHcccccchHHHHHHH-HHHHHHHH Confidence 112233333332 23323221 12223334444444444221111011110111222 2222 22222233 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCcCCcccc-ceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHc-CC Q lcl|NC_016071. 386 VQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDM-PKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVG-GF 463 (516) Q Consensus 386 ~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~-P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~-Gl 463 (516) .+.-.+.+...|. ++++.++.+.+...+...+ -.+.|......++.+.++++.+|+.+|.+.. +.+.+.+ |+ T Consensus 344 a~~~~~~f~~al~-~~~~l~~~~~~~~~~~~~~~i~~~w~~~~~~s~~~~ad~~~kl~~ag~is~-----et~l~~l~gv 417 (479) T protein:vir:99 344 LFEKQATWKASHN-QTMRLVNKIEGRTEEATDLDFTITWQDVTIQSLAQFADAWAKMVESLKIPA-----EGVWDMIPNL 417 (479) T ss_pred HHHHHHHHHHHHH-HHHHHHHHHcCCCccccceeeeEEecCCCCCCHHHHHHHHHHHHhcCCCCH-----HHHHHhcCCC Confidence 3333445555664 3566666665432221111 3456666777888999999999999986443 3444554 78 Q ss_pred CCCCCccc----ccC----cccccC-----CCCCCcccccccc-cCCCCCcccccccccchh Q lcl|NC_016071. 464 DEEIPEDM----STD----ELLKLL-----GQDTSRSGDGMTA-GSNGNGTGKISSTRDNSV 511 (516) Q Consensus 464 p~~~~~~~----~~~----~~~~~~-----~~~~~~~~~~~~~-~~~~~~~~~~~~~~d~~~ 511 (516) ..+.-+.. ... ...... +..+..+.++... 
..+++.++..++...+.+ T Consensus 418 ~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 479 (479) T protein:vir:99 418 DQSTVNGWKEIYDREGDFGKYMRKLQNGPDPAEQRGGPNGATNMQQANNKTGEPASLNKSGA 479 (479) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCCCCCCCCCCCCCCCCcchhccCCCCC Confidence 64321110 000 000000 0000111111111 111122222111111111 No 169 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=93.78 E-value=0.0064 Score=32.61 Aligned_cols=443 Identities=10% Similarity=-0.046 Sum_probs=150.6 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHH----------HHHHHHHHHhhcccccCCcccHHHHHHHhhC----hH Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGA----------LSQLRAESEVMKVEELRWPCFLATVEAMKQD----HT 66 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~----------~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D----~~ 66 (516) ||+-....-.+.. .++-..- +.....+-... ..++. ....+-.++... .. T Consensus 1 ~~~~~~~~~~~~~-------------~~~~~~l~~~~~~~~~rl~~l~~Yy~G~--~~i~~-~~~~~~~~~~~~~~~~n~ 64 (484) T protein:vir:77 1 MTSPLQKQENVDP-------------EKAREEMLNLFTERTQDLGDNTAYYESE--RRPDA-VGVTVPQQMQKLLAHVGY 64 (484) T ss_pred CCCcccccCCCCH-------------HHHHHHHHHHHHHHHHHHHHHHHHHhcc--ccchh-cccccchhHHhhhhhcCc Confidence 6655443332221 1110000 11111111110 01110 001111222110 11 Q ss_pred HHHHHHHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHH-HHHHhhcceeeeEEEeeccccc Q lcl|NC_016071. 67 VSTALDTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSA-ATFNEYGFSIFEKVYRTESAPS 145 (516) Q Consensus 67 v~s~l~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~-lda~~~G~S~~Eivw~~~~~~~ 145 (516) ..-++++....+.-.. |.++ .++.. .+.+.+.+++ ..|..+..++ .+++-||.| +++||.-..+.. 
T Consensus 65 ~~~ivd~~~~~l~~~g--~~~~--~~~~~----~~~l~~i~~~----N~~d~~~~~~~~~a~~~G~a-~~~v~~~~~~~~ 131 (484) T protein:vir:77 65 PRLYIDAIAARQELEG--FRLG--GADKA----DEQLWDWWQA----NDLDIESTLGHTDSLVHGRS-YITISKPDPNID 131 (484) T ss_pred HHHHHHHHHhhhccCc--eecC--Ccchh----HHHHHHHHHh----cCHhHHHHHHHHHHhhcCce-EEEEecCCCCcc Confidence 1111121111111111 2221 12222 2334444432 2366666664 478899996 568886554322 Q ss_pred cccc--ceeeccccccC---------chhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCC Q lcl|NC_016071. 146 KYAG--YITIDKIAFRP---------QSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSAD 214 (516) Q Consensus 146 ~~~g--~~~~~~l~~r~---------q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 214 (516) ...+ ...+..+.++. ...+...+++. +++++......-...... + .+......+.+ ... . T Consensus 132 ~~~~~~~~~i~~~~p~~~~~~~D~~~~~~~~a~~~~~-~~~~~~~~~~~~y~~~~~-~--~~~~~~~~~~~---~~~--~ 202 (484) T protein:vir:77 132 PGVDPEVPIIRVEPPTNLYAQIDPRTRQVMRAIRAIE-DEEGNEVIGATLYLPNNT-V--IWNREDGQWVQ---VAN--V 202 (484) T ss_pred cccccccceEEEeccceeEEEecCCCCceEEEEEEEE-eecCCcEEEEEEEecCeE-E--EEEecCCceEe---ecc--c Confidence 1111 00111111110 00000111111 222111111000000000 0 00000000000 000 0 Q ss_pred ccccccccEEEEeecCcCCccccchhHHHHHHHHHHH-HHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHH Q lcl|NC_016071. 215 EVFIPINKLMVMSLGGTESNPAGVSPLVGCYRAFREK-ILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEM 293 (516) Q Consensus 215 ~~~iP~~k~i~~~~~~~~g~p~G~gLlr~~~~~~~fK-~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~ 293 (516) ...++..-++.|.++.+.+.|.|.|-+.......+-. +..+..++..++-+..|..++.|... .+-..++... 
T Consensus 203 ~~~~g~vPvv~f~N~~~~~~~~G~s~i~~~v~~L~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~------~~~~~~~~~~ 276 (484) T protein:vir:77 203 AHNLEMVPVIPIPNRTRLSDLYGTTEITPELRSVTDAAARTLMLMQATAELMGVPQRLLFGVKG------EELGVDPETG 276 (484) T ss_pred cCCCCCcceEEeccccccCccCCcccchHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHhCCCc------chhccccccc Confidence 1122233357788888999999998775433322211 22344455666666666655554210 0000000000 Q ss_pred HHHHHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhccccccc-----CCc Q lcl|NC_016071. 294 VQGLMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLG-----NDG 368 (516) Q Consensus 294 l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~-----~~~ 368 (516) ...+. .+...-..+|.+ + .++...+.+ +...++++++.-|-+..-.-.++-. ..+ T Consensus 277 ~~~~~-------~~~~~~~~~~~~-~--------~~~~q~~~~----~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n 336 (484) T protein:vir:77 277 QTLFD-------AYLARILAFEDH-E--------SKAQQFSAA----ELRNFVDALDALDRKAAAYTGLPPYYLSFSSEN 336 (484) T ss_pred chhhh-------hhhhhhcccCCC-C--------ceeEeecCC----ChHHHHHHHHHHHHHHhcccCCCHHHhccccCc Confidence 01111 111222333432 1 122222221 1334555555554433211111110 011 Q ss_pred cchh-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-cCCcCCcc-ccceEEecCcCchhHHHHHHHHHHHHhCC Q lcl|NC_016071. 369 QGSY-NLSESKQSIHGHFVQRDIDIIVEAFNKNLIPQLLAL-NDIRLSDE-DMPKLKPGLIQEVDMEGFSKFVQRIGAVG 445 (516) Q Consensus 369 ~GS~-Al~~vh~ev~~~~~~aDa~~i~~~ln~~li~~lv~l-N~~~~~~~-~~P~~~~~~~~~~dl~~~a~~~~~L~~~G 445 (516) .+|. |+.. ...-....++.-.+.+...|. ++++.++.+ |....+.. 
.--.+.|......++.+.++++.+|++.| T Consensus 337 ~~Sg~Al~~-~~~~l~~ka~~k~~~f~~~l~-~~~~l~~~~~~~~~~~~~~~~i~v~w~~~~~~s~~~~ad~~~kl~~~g 414 (484) T protein:vir:77 337 PASAEAIRS-SESRLVKTVERKNKIFGGAWE-QAMRVAYKVMNGGDIPPEYYRMESIWRDPSTPTYAAKADAATKLYNNG 414 (484) T ss_pred chHHHHHHH-HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhCCCCcccccccceEEecCCCCCCHHHHHHHHHHHHhcc Confidence 1222 2221 122222223333344555553 355555444 22111111 11356788888899999999999999988 Q ss_pred cccccHHHHHHHHHHcCCCCCCCcccc-cCcccc-----cCCCCCCcccccccccCCCCCcccccccccchhh Q lcl|NC_016071. 446 YLPKTPTVINKILEVGGFDEEIPEDMS-TDELLK-----LLGQDTSRSGDGMTAGSNGNGTGKISSTRDNSVS 512 (516) Q Consensus 446 ~~~~~~~~~~~i~e~~Glp~~~~~~~~-~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 512 (516) .-+.+ .+-+.+.+|+-+...++.. ....+. ...+.....+.....+...+.+...+.+..++++ T Consensus 415 ~gi~s---~et~~~~l~~~~~~~~e~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 484 (484) T protein:vir:77 415 QGVIP---KERARIDMGYSITEREEMRKWDEEEQAQGLGLMGTMFGTDPSGGGNPDNPETPEPQPNPAEEAAA 484 (484) T ss_pred CCCCC---HHHHHhcCCCChhHHHHHHHHHHHHHHHHHHHHhhhccccccCCCCCCCCCcccccCCCccccCC Confidence 52222 4567788888543222110 000000 0000000000100001111111112233333333 No 170 >protein:vir:105154 Length: 525 # NCBI annotation: conserved phage-related protein # Family: family:all:6660 # MgeID: mge:1466 # MgeName: C-St # Cross-refs: genbank:acc:YP_398597;genbank:gi:80159853;genbank:GeneID:3772992 Probab=93.71 E-value=0.0066 Score=32.53 Aligned_cols=465 Identities=15% Similarity=0.120 Sum_probs=205.6 Q ss_pred CccccCcccccchhhhcccCCCCcccc--------cchH--HH-HHHHHHHHhhccc--------c-c----CCc-ccHH Q lcl|NC_016071. 2 STRFAQPSEVVKAGNENLAVSRLRTGE--------LGSG--AL-SQLRAESEVMKVE--------E-L----RWP-CFLA 56 (516) Q Consensus 2 ~~r~~~~~~~~~~~~~~p~~~~~~~~e--------~g~~--~~-~~~~~~~~~~~~~--------~-l----r~~-~~~~ 56 (516) -||.+.+++-.- +-+.-++..-...| .-++ -. 
+-+.|++--+.+- + | ..| +.++ T Consensus 1 ~~~~~~~~~~~~-t~~k~~~~~e~~~~~~n~~~~~y~ty~~~~~~f~~gfv~~~~~ng~i~~v~~~~l~~~f~npd~~~~ 79 (525) T protein:vir:10 1 MTRTKGSKNKST-TIEKQSLQIEQLQEHINELERQYNTYDDVVDAFIDGFVMDLCNNGKIKTVNLDTLQLWFNNPDKYIN 79 (525) T ss_pred CCCCcCCccccc-chhhhhhhHHHHHHHHhhhhhhcchhhhHHHHHHHHHHHHhhcCCceeeeeHHHHHhhhcChHHHHH Confidence 344444332211 10000010000000 0000 00 0112222111100 0 0 011 1111 Q ss_pred HHHHHh-----hChHHHHHHHHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhh-ccCcCCHHHHHHHHHHHHhhc Q lcl|NC_016071. 57 TVEAMK-----QDHTVSTALDTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKN-LANQQTLRDIARSAATFNEYG 130 (516) Q Consensus 57 ~y~~m~-----~D~~v~s~l~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~-~~~~~~~~~~l~~~lda~~~G 130 (516) -.+.++ .|+.|....+-+ .++-.+++.|.+-. .+ ...++.--.+.-.|+. +. -..+.+++|--+-.. T Consensus 80 ~i~~l~~y~yi~~~~v~ql~~li-~~lp~l~y~i~~~~-~~-k~~~~~~s~~n~~l~k~i~----hk~ltrdll~q~a~~ 152 (525) T protein:vir:10 80 NIVNLLTYYYIIDGNVFQLYDLI-FSLPPLDYQIKVLK-RD-KDYKEDLSTINLYLEKKIQ----HKQLTRDLLVQLAHS 152 (525) T ss_pred HHHHHHHHhhhhcchHHHHHHHH-HhcCCcceeehhhh-hc-cchhhHHHHHHHHHHHhHH----HHHHHHHHHHHhhcc Confidence 112222 255555544433 34556677766432 11 2223333334433332 11 112333333222221 Q ss_pred ceeeeEEEeecccccccccceeeccccccCch--------hcccccce-eecCCCceeeecccccccccccccccccccc Q lcl|NC_016071. 131 FSIFEKVYRTESAPSKYAGYITIDKIAFRPQS--------SLSRSKPW-VFDEDGRTLKGIYQSKMAFANFQNGLTQISS 201 (516) Q Consensus 131 ~S~~Eivw~~~~~~~~~~g~~~~~~l~~r~q~--------ti~~~~~f-~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~ 201 (516) =.++ -.|--+...-++ .....|+..-|. ++-+..|| .+.++-|.++...-++.--.+- +. 
T Consensus 153 gtli-g~wlg~~~~py~---~vf~~~kyvfp~~r~~g~~v~vid~~~f~~~~~~~r~~~~~~lsp~i~~~~---y~---- 221 (525) T protein:vir:10 153 GTLI-GTWLGSKREPYF---NVFNNLKYVFPYGRAKGKMVAVIDLQWFDEMSELERKLTFENLSPLITENK---YK---- 221 (525) T ss_pred Ccee-EeeecCCCCcch---hhhhhhhhhccccccCCceEEEEehHHhhhhhHHHHHHHHHhhchhhhhhh---hh---- Confidence 1111 133222211111 111111111110 01112344 2333333322211111100000 00 Q ss_pred cccccccc---ccCCCccccccccEEEEeecCcCCccc-cchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccc Q lcl|NC_016071. 202 AMSLVTNL---TSSADEVFIPINKLMVMSLGGTESNPA-GVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQ 277 (516) Q Consensus 202 ~~~~~~~~---~~~~~~~~iP~~k~i~~~~~~~~g~p~-G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~ 277 (516) .|..+- .....-+++|.++.++.+...-+-||- |.++.-+.......|+..-......+.|-..+|.+++. T Consensus 222 --~~~~~~~~~~~~~r~i~LP~e~t~~lr~~tl~rnqrlG~s~vtp~l~dI~hk~klrd~EqsIA~kii~a~avLk~--- 296 (525) T protein:vir:10 222 --KWKEYNGENEDALRYIMLPISKTLVARIHTLSRNQRLGIPYGTQTLFDIQHKQKLRDLEQSIADKIIKAMAVLKF--- 296 (525) T ss_pred --HHhhcccccchhheeeecccceeEEeeecccccCcccCcchhhhHHHHHHHHHHHHHHHHHHHHHhhhhheeeee--- Confidence 000000 111234678999999998887666666 88888889899999998888888888898889888874 Q ss_pred ccccccCCCCHHH---HHHHHHHHHHHHH-hhc-ccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHH Q lcl|NC_016071. 278 ILNKAAIDPKSPE---SEMVQGLMADAAN-AHA-GEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKA 352 (516) Q Consensus 278 ~~~k~~~~~~~~~---~~~l~~l~~~~~~-~~~-g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~ 352 (516) .++...+..-.+ +..+.....++.- +.+ ..-+.+.||.=.+++|.+- ..+....|-. 
-.+..|+- T Consensus 297 -gg~~gn~mk~p~~~kqkil~gVk~aleK~~kdK~Gi~vi~~Pdfa~~efp~i--------k~~~~glDg~-K~d~I~~D 366 (525) T protein:vir:10 297 -RGKDDNDSKVKESAKRKVLAGVKRALEKGVKDKNGIACIAMPDFATFEFPEI--------KNGDKTLDPK-KYDSIDND 366 (525) T ss_pred -ccccCccccCchHHHHHHHHHHHHHHhcccccccCeEEEeccceeecccccc--------cCcccCCCch-hhhhhhhh Confidence 233333322222 3333333333321 222 0123344587776666542 2222222322 34667788 Q ss_pred HHHH-HhcccccccCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCcccc-ceEEecCcCchh Q lcl|NC_016071. 353 ILDR-FGAGFINLGNDGQGSYNLSESKQSIHGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDM-PKLKPGLIQEVD 430 (516) Q Consensus 353 Isk~-iLGqtLts~~~~~GS~Al~~vh~ev~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~-P~~~~~~~~~~d 430 (516) |.-+ -|.+.|++ +.||.||.+++..++|-..+---.+.|.++-| +|+.|++. .+... -.|.++...+.+ T Consensus 367 I~~A~GlS~sL~n--GdggNyAtaslnld~fykkigVm~e~Iee~y~-kL~d~Vl~------~~k~~nyifnydkd~pi~ 437 (525) T protein:vir:10 367 ITNATGISQVLTN--GTKGNYASAKLNLDVFYKKIGVMLEIIEEIYN-QLIDIILG------EEKGCNYIFQYNKDTPIE 437 (525) T ss_pred hhhhhccceeeec--CCCCceeeeeeeHHHHHHHHHHHHHHHHHHHH-HHHhhhcC------cccCcceEEecCCCchhh Confidence 7444 46688887 55689999999999998887777777776664 57766532 22211 245678888999 Q ss_pred HHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcc------cccCcccccCCCCCC---cccccccccCCCCCcc Q lcl|NC_016071. 431 MEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPED------MSTDELLKLLGQDTS---RSGDGMTAGSNGNGTG 501 (516) Q Consensus 431 l~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~------~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~ 501 (516) +++..+.+=+|.+.|+... ++-+..|+.....=+ |-..-.++..|+-++ ...++.-.|.|..++. T Consensus 438 ~kkk~d~LIkL~d~g~s~k------~vldl~gis~e~y~E~s~yEtE~lkl~EKi~pp~~~~v~SGk~~n~iG~P~~dd~ 511 (525) T protein:vir:10 438 REKKLDTLIKLEAQGYSAK------YVLDILGISSEEYFEESIYEIEKLKLREKIMPPLNTNVLSGKDGNDIGSPKLDDS 511 (525) T ss_pred hhhhhhhhhhhhccchhhh------hhhhhhccCcchHHHHHHHHHHHHHHhhhccccccceeeeccccccccCCccCCC Confidence 9988888889999998543 455556665322111 111112233333222 1122233344433332 Q ss_pred cccccccchhhhhc Q lcl|NC_016071. 
502 KISSTRDNSVSNMD 515 (516) Q Consensus 502 ~~~~~~d~~~~~~~ 515 (516) ..+.+.-+|-.+-- T Consensus 512 ~~~dati~s~~~~~ 525 (525) T protein:vir:10 512 DSSDATIESKERGV 525 (525) T ss_pred cchhhhhhhhhcCC Confidence 22211111111111 No 171 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=93.59 E-value=0.007 Score=32.39 Aligned_cols=447 Identities=8% Similarity=-0.010 Sum_probs=170.1 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHH--------HHHHHH--Hhhccccc-CCcccHHH-H-HHHhhChHH Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALS--------QLRAES--EVMKVEEL-RWPCFLAT-V-EAMKQDHTV 67 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~--------~~~~~~--~~~~~~~l-r~~~~~~~-y-~~m~~D~~v 67 (516) .++|+-.-++..-.......++.-....+-.. ++ .+.... +...++.+ ..+....- . ..-....+. T Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~-i~~~~~~~~~r~~~~~~yY~g~~~~i~~~~~~~~~~~~~~ri~~n~~ 94 (501) T protein:vir:96 16 LNLRFHRESRIRYRADNLEELMVNNWELLKNF-INHHKLRQAPRIQELLDYARGENHDVLKSGRRKDNEMADKRAVHNYG 94 (501) T ss_pred cccccchhHHhhhcccccccccCChHHHHHHH-HHHHHHHHHHHHHHHHHHhcCCCCcccCccccCccccccceeecchH Confidence 23333222222211111111111100001000 00 000000 00111111 00000000 0 000124455 Q ss_pred HHHHHHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccc Q lcl|NC_016071. 68 STALDTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSK 146 (516) Q Consensus 68 ~s~l~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~ 146 (516) .-++.+....+.+-+..+.+.. +...+++.+++.++|++- .|..++..+. ++.-||.+. +.+|.-. 
T Consensus 95 k~Ivd~~~~yl~g~p~~~~~~~---~~~~~~~~~~l~~~~~~n----~~~~~~~~~~~~~~~~G~a~-~~v~~de----- 161 (501) T protein:vir:96 95 RMISKFKTGYLAGNPIRVEYDD---NDDNSQNDDAIKRIGRIN----DLDSLNRTLIRDLSQTGRAY-EVIYRSE----- 161 (501) T ss_pred HHHHHHHhhhhcccCeeEeeCC---ccchhHHHHHHHHHHHhc----CHHHHHHHHHHHHhhcCeEE-EEEEEcC----- Confidence 5566666666666666665542 334456677777777642 3666665544 688899755 6777543 Q ss_pred cccceeeccccccCchhcccccceeecC--CCceeeeccccccccccccccccccccccccccccccC------CCcccc Q lcl|NC_016071. 147 YAGYITIDKIAFRPQSSLSRSKPWVFDE--DGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSS------ADEVFI 218 (516) Q Consensus 147 ~~g~~~~~~l~~r~q~ti~~~~~f~~~~--dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~i 218 (516) +|.+.+..+.|+. .. -.|++ .++.+..++-...........+..+-.+.......... .....+ T Consensus 162 -dg~~~i~~~~p~~---~~----~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~ 233 (501) T protein:vir:96 162 -YDETRIKRLSPLE---TF----VIYDNSLEDNSIAAVRYYNRGTLQSAKDVVEIYTDEHIYTLDASDDFNEISVTTHAF 233 (501) T ss_pred -CCceEEEEEccce---eE----EEEcCCCCCceEEEEEEEEeecCCCcEEEEEEEcCCcEEEEeeCCCceeccccccCC Confidence 3444443333221 11 12332 23344433321110000000000010111111010000 000111 Q ss_pred ccccEEEEeecCcCCccccchhHHHHHHHHHH-HHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHH Q lcl|NC_016071. 219 PINKLMVMSLGGTESNPAGVSPLVGCYRAFRE-KILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGL 297 (516) Q Consensus 219 P~~k~i~~~~~~~~g~p~G~gLlr~~~~~~~f-K~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l 297 (516) ...-++.| .+|+.|.|.+..+ .+.+- -...+..++..++.+..++.++++... .+ ..+....+... 
T Consensus 234 g~vPvv~~-----~nn~~g~sd~e~v-~~liDa~d~~~s~~~~~~~~~~~~~l~i~G~~~------~~-~~~~~~~~~~~ 300 (501) T protein:vir:96 234 GTVPITEY-----LNNIDGIGDYETE-LYLIDLYDSAESDTANHMSDMADAILAIYGDLA------LP-KGMQASDMKRT 300 (501) T ss_pred CccceEEe-----cCCccCCCchhhh-HHHHHHHHHHHHHHHHHHHHhcCceeeeecccc------cC-cccchhhhhhc Confidence 11112333 2578899999875 33432 234566677778888888888876421 11 11111001000 Q ss_pred HHHHHHhhcccceEEEecc-CcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHH Q lcl|NC_016071. 298 MADAANAHAGEQAYFILPS-DMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSE 376 (516) Q Consensus 298 ~~~~~~~~~g~~a~~iiP~-g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~ 376 (516) -...++. +.........+++++..+. ....+..+++.+.+.|...--...++.++.++.+ +. T Consensus 301 ------------~~~~~~~~~~~~~~~~~~~~~~l~~~~--~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~---Sg 363 (501) T protein:vir:96 301 ------------RLMQLKPPKSADGKEGTVKAEYLTKSY--DVSGAEAYKTRLNRDIHIFTNTPDMSDTNFSGNT---SG 363 (501) T ss_pred ------------CeeeecccccccccccCcceeeEeccC--CHHHHHHHHHHHHHHHHHHhCCcccCcccccccc---hH Confidence 0111111 0000011123455554433 2234677888888888776555444443322111 11 Q ss_pred HHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHh-c--CCc-CCccccceEEecCcCchhHHHHHHHHHHHHhCCccc Q lcl|NC_016071. 377 SKQSIH----GHFVQRDIDIIVEAFNKNLIPQLLAL-N--DIR-LSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLP 448 (516) Q Consensus 377 vh~ev~----~~~~~aDa~~i~~~ln~~li~~lv~l-N--~~~-~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~ 448 (516) +.-... ...+..-.+.+...|. ++++.++.+ + ... 
..+..-..+.|....+.|..+.++++.+|+ |++ T Consensus 364 ~Al~~~~~~l~~ka~~~~~~~~~~l~-~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~n~~e~ad~~~kl~--g~i- 439 (501) T protein:vir:96 364 EALKYKLFGLDQDRVDTQSQFTKGLK-RRYRLAARIGSLVNEFKDFDESLLKITFTPNLPKSLNEQVSILTGLG--GQV- 439 (501) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhcccccccccccceEEeCCCCCcCHHHHHHHHHHHh--ccC- Confidence 112111 1222223344455553 355554443 1 110 111122578899999999999999999996 643 Q ss_pred ccHHHHHHHHHHcCC-CCCCCcccccCcccc-----cC-CCCCCcccccccccCCCCCcccccccccchh Q lcl|NC_016071. 449 KTPTVINKILEVGGF-DEEIPEDMSTDELLK-----LL-GQDTSRSGDGMTAGSNGNGTGKISSTRDNSV 511 (516) Q Consensus 449 ~~~~~~~~i~e~~Gl-p~~~~~~~~~~~~~~-----~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~ 511 (516) + ++.+.+.++. +.+..+-+-...+.. +. .+.....++.. ......+.....|.-. T Consensus 440 S----~et~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~----~~~~e~~~d~~e~~~~ 501 (501) T protein:vir:96 440 S----QETALSLSGLVESPNEELDKINKEMSEIDFKGYSNDFNEHVGKYT----DEVKETHTDDFEREYE 501 (501) T ss_pred c----hHHHHHhCCCCCCHHHHHHHHHHHHHHhhccccccchhhcccccC----CcCCCCCCCccccccC Confidence 3 3445555543 322111000111100 00 00000001100 0001111111111111 No 172 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=93.52 E-value=0.0073 Score=32.31 Aligned_cols=384 Identities=9% Similarity=-0.071 Sum_probs=135.6 Q ss_pred CcccccchHHHHHHHHHHHhhcccccCCcccHHHH--------------HHHhhChHHHHHHHHHHHHHhcCCceeeeCC Q lcl|NC_016071. 24 LRTGELGSGALSQLRAESEVMKVEELRWPCFLATV--------------EAMKQDHTVSTALDTKYVFVTKAFNDFKVLY 89 (516) Q Consensus 24 ~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y--------------~~m~~D~~v~s~l~~Rk~~v~~~~w~i~~~~ 89 (516) |....+... +..+.... + |.-+..+.| +++..... +++.=-+..|..+.=++.+.. 
T Consensus 1 m~~~~i~~L-~~~~~~~~-----~--r~~~~~~yy~g~~~~~~~~~~~p~~~~~~~~--~v~nw~~~~Vd~~a~rl~~~G 70 (422) T protein:vir:97 1 MNYMGMGYL-RRKLALFK-----T--GVDKRYRYYAMDDRDDTRSIVMPNNVREMYR--SVLEWTAKGVDSLADRIIFRE 70 (422) T ss_pred CChHHHHHH-HHHHHHHH-----H--HHHHHHHHHhcCCChhhcCccccHHHHHHHH--hhcchhHHHHHHHHhccccce Confidence 221111110 11111100 0 000011111 11111111 111111122222111122211 Q ss_pred CCCChhhHHHHHHHHHHHhhccCcCCHHHHHHH-HHHHHhhcceeeeEEEeeccccccc--------ccceeeccccccC Q lcl|NC_016071. 90 NRDSKASKDAAEFVEYALKNLANQQTLRDIARS-AATFNEYGFSIFEKVYRTESAPSKY--------AGYITIDKIAFRP 160 (516) Q Consensus 90 ~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~-~lda~~~G~S~~Eivw~~~~~~~~~--------~g~~~~~~l~~r~ 160 (516) - ..++.+ +.+.|+. ..|...... ..+|+-||.|+. .||.-.+ ...| +....++....++ T Consensus 71 f--~~~d~~----l~~~w~~----N~ld~~~~~~~~~al~~G~sf~-~v~~~~~-~~~p~i~~~sp~~~~~i~D~~~~~~ 138 (422) T protein:vir:97 71 F--TNDDFN----AWEIFKA----NNPDIFFDTAIQSALIASCCFV-YIMPGAE-DGLPKMQVIEASKATGILDPTTFLL 138 (422) T ss_pred e--eCCchh----HHHHHHh----cChHHHHHHHHHHHHHhcceeE-EEeeCCC-CCeeEEEEechhhEEEEEeCCCCcc Confidence 0 111222 3344432 125555554 447899999765 6775321 1111 1111122222222 Q ss_pred chhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCccccchh Q lcl|NC_016071. 161 QSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPAGVSP 240 (516) Q Consensus 161 q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~G~gL 240 (516) ..-+ .++..+.+|.......-.......+..+ +..... 
.+ ..+.--++.|.++++-+.|+|.|- T Consensus 139 ~~a~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~-~~----~~g~vPvv~~~n~~~~~~~~G~s~ 202 (422) T protein:vir:97 139 TEGY---AILESDSNGNPTLEAYFTDKDIWYYPKK--------GKPYNI-KN----PTGHPLLVPIIHRPDAVRPFGRSR 202 (422) T ss_pred eeeE---EEEEecCCCcEEEEEEEcCceEEEEcCC--------Cccccc-cC----CCCCcceEEecccCCCccccCccc Confidence 2111 1333444444321111000000000000 000001 11 112223567778888899999885 Q ss_pred H-HHHHHHHHHH-HHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEeccCc Q lcl|NC_016071. 241 L-VGCYRAFREK-ILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILPSDM 318 (516) Q Consensus 241 l-r~~~~~~~fK-~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~ 318 (516) + +.+- +..-. +..+-.-+...|=+.+|-.++.|. ... ....+ .. ... -+.-..+|.+. T Consensus 203 I~e~v~-~l~da~~r~~~~~~~~~e~~a~pqr~i~G~------d~d-~~~~~--~~---~~~-------~~~i~~~~~de 262 (422) T protein:vir:97 203 ITKAGM-YHQKAAKRTLERAEVTAEFYSFPQKYVLGM------DPD-AKPME--KW---RAT-------VSTLLEISKDE 262 (422) T ss_pred cchhHH-HHHHHHHHHHHHHHHHHHHhcchhhhhccc------Ccc-cccCc--hh---hhh-------hhhhhccCCCC Confidence 5 3221 11100 111112233444455555444432 111 11111 11 111 11223466554 Q ss_pred ccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccC------Cccchh-hHHHHHHHHHHHHHHHHHH Q lcl|NC_016071. 319 NAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGN------DGQGSY-NLSESKQSIHGHFVQRDID 391 (516) Q Consensus 319 ~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~------~~~GS~-Al~~vh~ev~~~~~~aDa~ 391 (516) +-+. +++..-+++ + ...++++++. +...+-+.+=+... ++..|- |+. 
....-....++.-.+ T Consensus 263 ~~~~-----~~v~q~~~~-~---l~~~~~~l~~-~~~~~a~~s~lP~~~lg~~~~NpsSa~Ai~-a~~~~L~~ka~~k~~ 331 (422) T protein:vir:97 263 DGDK-----PTVGQFTTA-S---MAPFMEHLKM-YASLFAGGSGLTLDDLGFPSDNPSSVESIK-AAHENLRAAGRKAQR 331 (422) T ss_pred CCCc-----ceeeecCCC-C---hhHHHHHHHH-HHHHHhcccCCCHHHhccccCchhHHHHHH-HHHHHHHHHHHHHHH Confidence 3222 222222222 2 2234454432 22223332222110 111222 222 222333333444455 Q ss_pred HHHHHHHHHHHHHHHHhcCCcCCcc-cc--ceEEecCc---CchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCC Q lcl|NC_016071. 392 IIVEAFNKNLIPQLLALNDIRLSDE-DM--PKLKPGLI---QEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDE 465 (516) Q Consensus 392 ~i~~~ln~~li~~lv~lN~~~~~~~-~~--P~~~~~~~---~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~ 465 (516) .+...+. ++++.++.+........ .. ..+.|... +...+.+.|+++.||+.+|-...+ .+.+++.+|+.. T Consensus 332 ~fg~~l~-~~~rla~~~~~~~~~~~~~~~~~~~~w~p~~~~~~~s~a~~aDa~~Kl~~a~~~~~~---~~~~~~~lg~~~ 407 (422) T protein:vir:97 332 SFSSGFL-NVAYIAVCLRDEFPYLRNQFMDTVIKWEPLFEADANMLTLVGDGAIKLNQAIPGFMD---ADVIRDLTGVKG 407 (422) T ss_pred HHHHHHH-HHHHHHHHHhcCCcccchhhccceEEEccCCCCChHHHHHHHHHHHHHHhhcccccc---HHHHHHHcCCCc Confidence 6666674 46676666653221111 01 24566633 333367778899999998632222 357899999965 Q ss_pred CCCcccccCcccccCCCC Q lcl|NC_016071. 466 EIPEDMSTDELLKLLGQD 483 (516) Q Consensus 466 ~~~~~~~~~~~~~~~~~~ 483 (516) +..+-.-+ ++..++. T Consensus 408 ~~~~~~~~---~~~~~d~ 422 (422) T protein:vir:97 408 ADKPIPAI---TEVTTDG 422 (422) T ss_pred hhHHHHHH---HhhhccC Confidence 43221111 1111111 No 173 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=93.26 E-value=0.0082 Score=32.02 Aligned_cols=423 Identities=10% Similarity=-0.018 Sum_probs=141.1 Q ss_pred CCCCcccccchHHHHHHHHHH--------HhhcccccC-----CcccHHHHHHHhhChHHHHHHHHHHHHHhcCCceeee Q lcl|NC_016071. 
21 VSRLRTGELGSGALSQLRAES--------EVMKVEELR-----WPCFLATVEAMKQDHTVSTALDTKYVFVTKAFNDFKV 87 (516) Q Consensus 21 ~~~~~~~e~g~~~~~~~~~~~--------~~~~~~~lr-----~~~~~~~y~~m~~D~~v~s~l~~Rk~~v~~~~w~i~~ 87 (516) +++....++-..-+..+.... +-+-..+++ .+..++...........+-++.+.-..+..-.+.+ T Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~-- 78 (456) T protein:vir:79 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITV-- 78 (456) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccCChhhcCcccChhhchhhhhhhcchHHHHHHHHHhhhccCCeec-- Confidence 222222222111111111100 000001110 01111111111112233334444433333444432 Q ss_pred CCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceeeccccccCchhcc- Q lcl|NC_016071. 88 LYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITIDKIAFRPQSSLS- 165 (516) Q Consensus 88 ~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l~~r~q~ti~- 165 (516) ....|.... +.+.+.|++- .|..+..+++ ++.-||.+ ++++|.-..+ ...+..+.|..-..+. T Consensus 79 ~~~~d~~~~----~~~~~~~~~n----~~d~~~~~~~~~a~~~G~a-~~~~~~~edg------~~~i~~~~p~~~~~i~d 143 (456) T protein:vir:79 79 GGSADSDLA----LRARRIWRDN----RMDSVCKQWVKYGLDFGES-YLTCWRRDDG------TATITADSPETMVVSVD 143 (456) T ss_pred CCCCCccHH----HHHHHHHHhc----ChhHHHHHHHHHHhhcCee-EEEEeeCCCC------ceEEEEeccceeEEEEc Confidence 222222222 3344444431 3666666655 78889986 6789965433 2222222221100000 Q ss_pred ---------cccceeecCCCceee-ecc--ccccccccccccccccccccccccccccCCCccc---cc-cccEEEEeec Q lcl|NC_016071. 166 ---------RSKPWVFDEDGRTLK-GIY--QSKMAFANFQNGLTQISSAMSLVTNLTSSADEVF---IP-INKLMVMSLG 229 (516) Q Consensus 166 ---------~~~~f~~~~dg~~l~-~~~--q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---iP-~~k~i~~~~~ 229 (516) -.++| .+.|+.... .+. ....... ..+. ..............+.... .| ....+-+. 
T Consensus 144 ~~~~~~~~~~~~~~-~~~d~~~~~~~~~~~~~~~~~~--~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv-- 216 (456) T protein:vir:79 144 PLQPWRIRSAMRWW-RDLDAESDFAIVWSGDGWQKFA--RPCF--VQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVV-- 216 (456) T ss_pred CCCCCceEEEEEEE-EecCCceeEEEEEcCCceEEEE--EEEE--eeccccceeeeccCCceeecccccCCCCceeEE-- Confidence 00111 111111000 000 0000000 0000 0000000000000000000 00 00111111 Q ss_pred CcCCccccchhHHHHHHHHHHH-HHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhccc Q lcl|NC_016071. 230 GTESNPAGVSPLVGCYRAFREK-ILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGE 308 (516) Q Consensus 230 ~~~g~p~G~gLlr~~~~~~~fK-~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~ 308 (516) ...|+.|.|.+..+-. .+-. +..+..-+..++-+..+..++.+...- ....+.. -..+. ..+.+.++. T Consensus 217 -~~~N~~~~gd~e~v~~-liD~~~~~~s~~~~~~~~~a~~~~~~~G~~~~--~~~~d~~---g~~i~----~~~~~~~~~ 285 (456) T protein:vir:79 217 -VYQNPDGMGEVEPHID-IINRINRAELQLLSTMAIQAFRQRALKSSEHR--LPKVDEN---GNAID----YASIFEAAP 285 (456) T ss_pred -EecCCCCCchhhhhHH-HHHHHHHHHHHHHHHHHHHhhHHHHHhcCCcc--ccccccc---ccccc----hhhhhhhhc Confidence 1357888888887532 1111 111112223344444444444332110 0000000 00010 111111222 Q ss_pred ceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCC-ccchhhHHHHHHHHHHHHHH Q lcl|NC_016071. 309 QAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGND-GQGSYNLSESKQSIHGHFVQ 387 (516) Q Consensus 309 ~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~-~~GS~Al~~vh~ev~~~~~~ 387 (516) .+...+|.+.++- +|..+ ....|.+.++..-.+|+...--..-..+.. 
+..|...-+....-....++ T Consensus 286 ~~~~~~~~~~~~~-------q~~~~----~~~~~~~~l~~~i~~i~~~t~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~ 354 (456) T protein:vir:79 286 GALWELPPGVDIW-------ESQTN----DFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCE 354 (456) T ss_pred cccccCCCCccee-------eeccc----ChHHHHHHHHHHHHHHHhhcCCChhHhcccccCcHHHHHHHHHHHHHHHHH Confidence 2334456554321 11111 112355555555555554321111111111 11222111222222233334 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCC Q lcl|NC_016071. 388 RDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEI 467 (516) Q Consensus 388 aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~ 467 (516) .-.+.+...|. ++++.++.+.+. ....-.++.|......++.+.|+++.+|+.+|+.. ..-.++.+|+.+.. T Consensus 355 ~~~~~f~~~l~-~~~~l~~~~~g~--~~~~~i~v~w~~~~~~s~~~~ada~~kl~~~G~~~-----~~~~~~~lg~~~~~ 426 (456) T protein:vir:79 355 DRLSIAKIGLE-AILVKALQIEGE--SVEDTVDVSFESPDRVTLGEKYSAASLAKAAGESW-----ASIRRNILNYNADQ 426 (456) T ss_pred HHHHHHHHHHH-HHHHHHHHhcCC--CccccceEEeCCCCCcCHHHHHHHHHHHHhcCCCh-----HHHHHhcCCCCHHH Confidence 44456666674 477777777642 22223467787788888899999999999999743 23356777885421 Q ss_pred CcccccCcccccCCCCCCcccccccccCCCCCcccccccc Q lcl|NC_016071. 468 PEDMSTDELLKLLGQDTSRSGDGMTAGSNGNGTGKISSTR 507 (516) Q Consensus 468 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 507 (516) -+......... + .+.-..+. +.. 
++..++| T Consensus 427 i~~~e~~r~~~---e-~~~~~~~~-~~~-----~~~~~~~ 456 (456) T protein:vir:79 427 IKQDDLDRARE---Q-ITLFAGNP-VQR-----PQEDGSR 456 (456) T ss_pred HHHHHHHHHHH---H-HHHHhhhH-hhc-----CCCCCCC Confidence 11100000000 0 00000000 000 0001111 No 174 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=92.98 E-value=0.0092 Score=31.74 Aligned_cols=450 Identities=8% Similarity=-0.006 Sum_probs=170.8 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchH----H---HHHHHHH-HH-hhccccc-CCcccH-HHHHHH-hhChHHH Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSG----A---LSQLRAE-SE-VMKVEEL-RWPCFL-ATVEAM-KQDHTVS 68 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~----~---~~~~~~~-~~-~~~~~~l-r~~~~~-~~y~~m-~~D~~v~ 68 (516) .+.|+-.-+++.-.......++.....++-.. - ...+... .+ .-.++.+ ..+... ...... ....+.. T Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~l~~yY~g~~~~i~~~~~~~~~~~~~~ki~~n~~k 95 (501) T protein:vir:27 16 LNLRFHRESRIRYRADNLEELMVNNWELLKNFINHHKLRQAPRIQELLDYARGENHDVLQFGRRKDREMADKRAVHNYGR 95 (501) T ss_pred hhcccChhHHHhhccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccCccCccccccceeccchHH Confidence 33344444333322221111111100001000 0 0000000 00 0011111 110000 000000 0134445 Q ss_pred HHHHHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHH-HHHhhcceeeeEEEeeccccccc Q lcl|NC_016071. 69 TALDTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKY 147 (516) Q Consensus 69 s~l~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~ 147 (516) -++.+....+.+-+..++.. ++...+.+.+++..++..- .|..++.++. ++.-||.+ ++++|... 
T Consensus 96 ~Ivd~~~~yl~g~p~~~~~~---d~~~~~~~~~~l~~~~~~n----~~~~~~~~~~~~~~~~G~a-~~~vy~de------ 161 (501) T protein:vir:27 96 MISKFKTGYLAGNPIRVEYD---DNDNNSQNDDTIKRIGRIN----DIDSHNRTLIRDLSQTGRA-YEVIYRNE------ 161 (501) T ss_pred HHHHHHhhhhcccCeeEecC---CccchHHHHHHHHHHHHhc----ChhHHHHHHHHHHhhCCeE-EEEEEeCC------ Confidence 55555555566666555543 3334455667777776543 3666776654 68889986 56787643 Q ss_pred ccceeeccccccCchhcccccceeecC--CCceeeeccccccccccccccccccccccccccccccC----CCccccccc Q lcl|NC_016071. 148 AGYITIDKIAFRPQSSLSRSKPWVFDE--DGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSS----ADEVFIPIN 221 (516) Q Consensus 148 ~g~~~~~~l~~r~q~ti~~~~~f~~~~--dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~iP~~ 221 (516) +|.+.+..+.|+.-. -.|++ .++.+..++-...........+..+-.+-......... ....+-|.. T Consensus 162 d~~~~i~~~~p~~~~-------~v~d~~~~~~~~~~ir~~~~~~~~~~~~~~~vyt~~~v~~~~~~~~~~~~~~~~~~~g 234 (501) T protein:vir:27 162 YDETRIKRLNPLETF-------VIYDNSLEDNSIAAVRYYNRGTLQNAKDVVEIYTNEHIYTLDASDDFNEISVTTHAFG 234 (501) T ss_pred CCceEEEEEccceeE-------EEecCCCCCceEEEEEEEEeeecCCcEEEEEEEeCCeEEEEEeCCceeeccccccCCC Confidence 233433333222110 11222 12232222211100000000000000000000000000 000011111 Q ss_pred --cEEEEeecCcCCccccchhHHHHHHHHHH-HHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHH Q lcl|NC_016071. 222 --KLMVMSLGGTESNPAGVSPLVGCYRAFRE-KILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLM 298 (516) Q Consensus 222 --k~i~~~~~~~~g~p~G~gLlr~~~~~~~f-K~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~ 298 (516) -++.| .+|+.|.|.+..+- +.+- -...+..++..++-+..++.++++.... +..+....+... 
T Consensus 235 ~vPvv~~-----~nn~~g~sd~e~v~-~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~-------~~~~~~~~~~~~- 300 (501) T protein:vir:27 235 TVPITEF-----LNNVDGIGDYETEL-YLIDLYDSAESDTANHMSDMADAILAIYGDLAL-------PKGMQASDMKRT- 300 (501) T ss_pred cccEEEe-----cCCCCCCCchhhhH-HHHHHHHHHHHHHHHHHHHhcCceeeeecCccC-------Ccccchhhhhhc- Confidence 12333 24678999998743 3332 2345566677778888888888764221 111111111100 Q ss_pred HHHHHhhcccceEEEeccCcc-cccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHH Q lcl|NC_016071. 299 ADAANAHAGEQAYFILPSDMN-AQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSES 377 (516) Q Consensus 299 ~~~~~~~~g~~a~~iiP~g~~-i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~v 377 (516) ..+.+..+.. .......+++++..+- ....+..+++.+.+.|.+.--...++.++.++.+.+. .. T Consensus 301 -----------~~~~~~~~~~~~~~~~~~~~~~l~~~~--~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~-Al 366 (501) T protein:vir:27 301 -----------RLMQLKPPKSADGKEGTVKAEYLTKSY--DVSGAEAYKTRLNRDIHIFTNIPDMSDTNFSGNTSGE-AL 366 (501) T ss_pred -----------CceeecccccccCCCCCcceeeeeccC--CHHHHHHHHHHHHHHHHHHhCCcccCccccccCchHH-HH Confidence 0111111100 0011112345554332 2234677889998888877666555554332211111 11 Q ss_pred HHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHh---cCCcC-CccccceEEecCcCchhHHHHHHHHHHHHhCCcccccH Q lcl|NC_016071. 378 KQSI--HGHFVQRDIDIIVEAFNKNLIPQLLAL---NDIRL-SDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTP 451 (516) Q Consensus 378 h~ev--~~~~~~aDa~~i~~~ln~~li~~lv~l---N~~~~-~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~ 451 (516) +... ....+..-.+.+...|. ++++.++.+ +.... .+..-..+.|....+.|..+.++++.+|. |++. 
T Consensus 367 ~~~~~~l~~ka~~~~~~~~~~l~-~~~~li~~~~~~~~~~~~~d~~~i~v~f~~~~p~n~~e~ad~~~kl~--g~iS--- 440 (501) T protein:vir:27 367 KYKLFGLDQDRVDTQSQFTQGLK-RRYRLAARIGSLVNEFKDFDESLLKITFTPNLPKSLNEQVSILTGLG--GQVS--- 440 (501) T ss_pred HHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhhcccccccccccceEEeCCCCCcCHHHHHHHHHHHh--ccCc--- Confidence 1111 12222333344555553 355554443 21111 11122568899999999999999999985 6533 Q ss_pred HHHHHHHHHcC-CCCCCCcccccCcc------cccCCCCCCcccccccccCCCCCcccccccc Q lcl|NC_016071. 452 TVINKILEVGG-FDEEIPEDMSTDEL------LKLLGQDTSRSGDGMTAGSNGNGTGKISSTR 507 (516) Q Consensus 452 ~~~~~i~e~~G-lp~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 507 (516) .+.+.+.++ ++.+..+-+-...+ ....++.....+.+........++.+..+.. T Consensus 441 --~et~l~~l~~v~D~~~E~eri~~E~~e~~~~~~~~~~~~~~~~~~d~~~~~~~d~~e~~~~ 501 (501) T protein:vir:27 441 --QETALSLSGLVESPNEELDKINKEVSEIDFKGYSNDFNEHVGKYTDEVKETHTDDFERAYE 501 (501) T ss_pred --HHHHHHhCCCCCCHHHHHHHHHHHHHhhhHhhhcCccccccccccCCCCCCccccccccCC Confidence 344555653 33222110000000 0011111111111111111111111111111 No 175 >protein:vir:5665 Length: 511 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899604;genbank:gi:34419591;genbank:GeneID:2546036 Probab=92.92 E-value=0.0095 Score=31.68 Aligned_cols=443 Identities=13% Similarity=0.098 Sum_probs=167.5 Q ss_pred CCccccCcccccchhhhcccCCCCccc--cc--chHHHHH----HHHHHHhhcccccCCcccHHHHHHHhhChHHHHHHH Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTG--EL--GSGALSQ----LRAESEVMKVEELRWPCFLATVEAMKQDHTVSTALD 72 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~--e~--g~~~~~~----~~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~ 72 (516) +........+..+...+-|+.|..--+ || +..+... 
++.+.+.+ +..+..++++.|++|..+|.|-++++ T Consensus 5 ~~~de~~~~~~~~~~~~S~~~p~~~DGa~~i~~~~~~~~~~g~~~~~~~~~~--~~~~~~eLI~~YR~ma~~pEvd~Av~ 82 (511) T protein:vir:56 5 TKEEEQDIQKIEKNPVRSFSAPDNVDGAKEIHTNLLAPQLGHAIIPSDAQSE--GTIPVKELIKSYRALAEYHEVDDAIQ 82 (511) T ss_pred cchhhhhhhhhccCCcccccCCCCCCCceEEecccccceecceecccccccc--CccchHHHHHHHHHHhhccchhhHHH Confidence 222222222222211111111111111 11 1111000 11122222 11233478999999999999999999 Q ss_pred HHHHHHhcCCc-e--eeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHHHHHhhcceeeeEEEeeccccccccc Q lcl|NC_016071. 73 TKYVFVTKAFN-D--FKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAATFNEYGFSIFEKVYRTESAPSKYAG 149 (516) Q Consensus 73 ~Rk~~v~~~~w-~--i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~lda~~~G~S~~Eivw~~~~~~~~~~g 149 (516) -.-.-+.-.+- . |.+.-. +.+.++.+-+.|.+. |+ .+..+|+.--+||..+= .|+-|| T Consensus 83 eIvne~iv~d~~~~pV~l~ld-~~~~s~~iK~kI~ee---------F~-~Il~ll~F~~~~~~~fR--------~WYVDg 143 (511) T protein:vir:56 83 EIVDEAIVYENDKEVVWLNLD-NTDFSENIKAKINEE---------FD-RVVSLLQMRKHGYKWFR--------KWYVDS 143 (511) T ss_pred HhhcceeEecCCCceEEEEec-ccCcchHHHHHHHHH---------HH-HHHHHhccchhhhHHHh--------hhhhcc Confidence 87664321110 0 011100 111223333333332 22 23355555555555442 233344 Q ss_pred ceeecc----------ccccCchhcccccceeecC-CCceeeeccccccccccccccccccccccccccccccCCCcccc Q lcl|NC_016071. 150 YITIDK----------IAFRPQSSLSRSKPWVFDE-DGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFI 218 (516) Q Consensus 150 ~~~~~~----------l~~r~q~ti~~~~~f~~~~-dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i 218 (516) ++..++ |...+|..|++.|-...+. +|.-+. . 
....+.-|.+.....+ +++........++.| T Consensus 144 Ri~fHkiid~k~GI~eLr~lDPr~i~~vr~i~~~~~~~~~v~--~-~~~ey~~Y~~~~~~~~---~~~~~~~~~~~~vkI 217 (511) T protein:vir:56 144 RIYFHKILDKDNNIIELRPLNPMKMELVREIQKETIDGVEVV--K-GTLEYYVYKQSDYKMP---SWMSATNRAQTSFRI 217 (511) T ss_pred eEEEEEEeccccceeehhhcCcccchhhhhhhcccccccccc--c-ceeeeeEecCCCcccC---cccccccccccceee Confidence 444333 2222333333333333321 221111 1 1111111211111111 111111223456778 Q ss_pred ccccEEEEeecCc----CCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccccc--CCCCHHHHH Q lcl|NC_016071. 219 PINKLMVMSLGGT----ESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAA--IDPKSPESE 292 (516) Q Consensus 219 P~~k~i~~~~~~~----~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~--~~~~~~~~~ 292 (516) |.+- |+|+|..- .++++..|.|.++..|+==-+.....-.++ .+-.+|.+|+=+-. .=|....++ T Consensus 218 ~~da-I~y~hSGL~d~~~~~g~i~syLhkAiKp~NQLkm~EDAlVIY--------RitRAPeRRvFYIDVGnLPk~KAeq 288 (511) T protein:vir:56 218 PKDA-IVFAHSGLMRGCADDPYIIGYLDRAIKPANQLKMLEDALVIY--------RLARAPERRVFYVDVGNLPTQKAQQ 288 (511) T ss_pred chhh-eeeecccceeccCCCCeeeccchhhhHHHHhhHHHHhhHHHH--------hhhccccceEEEEecCCCCchhHHH Confidence 8764 77777653 577889999999998875433332222221 12222222221111 112222222 Q ss_pred HHHHHHHHHHHhhc----ccceEE--------------EeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHH Q lcl|NC_016071. 293 MVQGLMADAANAHA----GEQAYF--------------ILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAIL 354 (516) Q Consensus 293 ~l~~l~~~~~~~~~----g~~a~~--------------iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Is 354 (516) . +..+++.++. .+..|- -||.= +.....+|+-+ .|+... 
.-.+=|+|..+.+- T Consensus 289 Y---l~~iM~k~kNklVYDa~TGev~ddrk~msMlEDyWLpRR---eGgrgTEItTL--pGgqnl-gem~DV~YF~kKLy 359 (511) T protein:vir:56 289 Y---VNGIMQNVKNRVVYDTQTGQVKNTTNAMSMLEDYYLPRR---EGSKGTEVSTL--PGGQSL-GDIEDVLYFNRKLY 359 (511) T ss_pred H---HHHHHHhcCceEEEeccCceeccchhhhhhHhhhccccc---CCCCccceeec--cccCCc-ChHHHHHHHHHHHH Confidence 2 2333322211 111111 12210 00111233333 222222 23345899999999 Q ss_pred HHHhcccccccCCcc-chhhH---HH-HHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCc---cccceEEecC Q lcl|NC_016071. 355 DRFGAGFINLGNDGQ-GSYNL---SE-SKQSI-HGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSD---EDMPKLKPGL 425 (516) Q Consensus 355 k~iLGqtLts~~~~~-GS~Al---~~-vh~ev-~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~---~~~P~~~~~~ 425 (516) +++--..--.+.++. ++..+ ++ +-.|+ |...++.-...+...|..-|-..|+. .+.--+. .--+.+.|+. T Consensus 360 ~aLnVP~SRl~~e~q~~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLil-Kgiit~eeW~~i~~~I~~~f 438 (511) T protein:vir:56 360 KAMRIPTSRAASEDQTGGINFGQGAEITRDELKFTKFVKRLQTKFETVITDPLKHQLIV-NNIITEEEWDANHEKLYVVF 438 (511) T ss_pred HHhCCCcccccCCCCccccccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh-ccCCCHHHHHHHhhcceEEe Confidence 988877643432221 22222 22 22333 23334444444444454333333332 1111110 1113344433 Q ss_pred cCc------hhHHHHHHHHHHHHhCCcccccHHHHHHHHHHc-CCCCCCC--cccccCcccccCCCCCCccccc Q lcl|NC_016071. 426 IQE------VDMEGFSKFVQRIGAVGYLPKTPTVINKILEVG-GFDEEIP--EDMSTDELLKLLGQDTSRSGDG 490 (516) Q Consensus 426 ~~~------~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~-Glp~~~~--~~~~~~~~~~~~~~~~~~~~~~ 490 (516) ..+ .+.+-+.+++..|..+--.+-...+.+||++.+ .+...+- ++...+.+. ..+-...+..+. 
T Consensus 439 ~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~~yi~k~ILr~tDeei~~~~k~I~~E~-k~~~~~~~e~~f 511 (511) T protein:vir:56 439 NQDSYFEEAKELEILNSRMNAMRDIQDYAGKYYSHKYIQKNILRLSDDQITAMQSEIDEEE-TNPRFQQDDQGF 511 (511) T ss_pred eecchHHHHHHHHHHHHHHHHHHHhcchhccccchHHHHHHHhccCHHHHHHHHHHHHHhh-cCCCCCCcccCC Confidence 333 344444555655554432222244678887664 4442111 111111111 112222222111 No 176 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=92.55 E-value=0.011 Score=31.34 Aligned_cols=413 Identities=10% Similarity=0.011 Sum_probs=161.3 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHH--HHHHHhhChHHHHHHHHHHHHH Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLA--TVEAMKQDHTVSTALDTKYVFV 78 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~--~y~~m~~D~~v~s~l~~Rk~~v 78 (516) |+.+. +.+--+.+- .++- -+.....+-... .+-|..+.-.+ .-.++ ..+...-++......+ T Consensus 1 l~~~~-----l~~~i~~~~-------~~~~--r~~~l~~yy~g~-~~il~~~~~~~~~~~~ki-~~n~~~~ivd~~~~~l 64 (429) T protein:vir:98 1 MTKDL-----LSELIQKHR-------SFNL--SYSAYKQLYEGD-HAILQQKQKEQYKPDNRL-VVNFAKYIVDTFNGYF 64 (429) T ss_pred CCHHH-----HHHHHHHHH-------HHHH--HHHHHHHHhccc-cccccccccccCCCccee-ecchHHHHHHHHhhhh Confidence 22221 000000000 0000 000111111111 01010000000 00000 1344455555555555 Q ss_pred hcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHH-HHHHhhcceeeeEEEeecccccccccceeecccc Q lcl|NC_016071. 79 TKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSA-ATFNEYGFSIFEKVYRTESAPSKYAGYITIDKIA 157 (516) Q Consensus 79 ~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~-lda~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l~ 157 (516) .+-+..+++ ++ ++.-+++.+++++. .|...+..+ .++.-||.+ ++++|... +|.+.+.-+. 
T Consensus 65 ~g~~~~~~~----~~---~~~~~~l~~~~~~n----~~~~~~~~~~~~~~~~G~~-~~~v~~d~------~g~~~~~~~~ 126 (429) T protein:vir:98 65 IGVPVQTSH----EN---KQVSNYLELLDGYN----DQDDNNAELSKICSIYGHG-YELVFNDE------NAEAGITYLT 126 (429) T ss_pred cccCceeec----CC---hHHHHHHHHHHhhc----CHhHHHHHHHHHHhhcCeE-EEEEEecC------CCcEEEEEEc Confidence 565555443 22 23445666666542 255555544 468889975 56777533 3444443333 Q ss_pred ccCchhcccccceeecC--CCceeeeccccccccccccccccccccccccccccccC-----CCcccccccc--EEEEee Q lcl|NC_016071. 158 FRPQSSLSRSKPWVFDE--DGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSS-----ADEVFIPINK--LMVMSL 228 (516) Q Consensus 158 ~r~q~ti~~~~~f~~~~--dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~iP~~k--~i~~~~ 228 (516) |+. +. -.|++ ++..+..++-..........-++ .......+.... .+..+-|..+ ++.| T Consensus 127 p~~---~~----~v~dd~~~~~~~~~i~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~-- 194 (429) T protein:vir:98 127 PLE---AF----IVYDDSIRQKPLFAVRYFYNKGGVLEGSYS---DASNITYFKDGEKGIEIGESEPHPFDGVPMIEY-- 194 (429) T ss_pred ccc---eE----EEEeCCCCCceEEEEEEEEecCceEEEEEE---eCceEEEEEecCCceEecccccccCCccceEEe-- Confidence 221 11 11222 12233222211100000000000 000000000000 0111111112 2222 Q ss_pred cCcCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhccc Q lcl|NC_016071. 229 GGTESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGE 308 (516) Q Consensus 229 ~~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~ 308 (516) .+|+.|.|.+..+.-..=--...+..++...+.+..|+.++++-. ...+. +.. +.. 
T Consensus 195 ---~n~~~g~sd~e~v~~liD~~d~~~s~~~~~~~~~~~p~~~i~g~~---------~~~~~---~~~---~~~------ 250 (429) T protein:vir:98 195 ---VENEERQSLLASVVTLINAFNKAISEKANDVEYFADAYLKILGAE---------LDDET---LKS---LRD------ 250 (429) T ss_pred ---cCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCC---------CCcch---hhh---Hhh------ Confidence 346789999987544333334456667777888888888876521 11111 111 110 Q ss_pred ceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchh-hHHHHHHHHHHHHHH Q lcl|NC_016071. 309 QAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSY-NLSESKQSIHGHFVQ 387 (516) Q Consensus 309 ~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~-Al~~vh~ev~~~~~~ 387 (516) ...+.+|.+.. +...++++..+.. ...+...++.+.+.|.+.--+..++.++.+.-|. |+. ....-....+. T Consensus 251 ~~~~~~~~~~~----~~~~~~~l~~~~~--~~~~~~~~~~l~~~i~~~s~~p~~~~~~~gn~Sg~Al~-~~~~~l~~k~~ 323 (429) T protein:vir:98 251 TRIINLKDTDA----QQLTVEFLQKPDA--DATQEHLLDRLENLIFRTAMVANISDESFGTASGIALR-YRLQAMDNLAK 323 (429) T ss_pred CceeeccCCCC----CCcceeEEeecCC--HHHHHHHHHHHHHHHHHHhCccccCccccccchHHHHH-HHHHHHHHHHH Confidence 12233443321 1234555554332 2346677888888887776555555433322122 221 11111222233 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCcCCcccc--ceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCC-C Q lcl|NC_016071. 388 RDIDIIVEAFNKNLIPQLLALNDIRLSDEDM--PKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGF-D 464 (516) Q Consensus 388 aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~--P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gl-p 464 (516) .-.+.+...+. ++++.++.+-...+....+ ..+.|....+.|..+.++++.+|. |++ + .+.+.+.++. 
+ T Consensus 324 ~~~~~~~~~l~-~~~~li~~~~~~~~~~~d~~~i~v~f~~~~p~~~~~~a~~~~kl~--g~i-s----~et~~~~l~~v~ 395 (429) T protein:vir:98 324 TKERKFMSGMN-RRYKLIASYPTSKIGPKDWIGIKYKFTRNLPANLLEESQIAGNLA--GIV-S----EETQVGVLSIVE 395 (429) T ss_pred HHHHHHHHHHH-HHHHHHHHHhccCCCccccccceEEeCCCCCcCHHHHHHHHHHHh--ccC-c----hHHHHHhCCCCC Confidence 33345555553 4555555542111111111 367888999999999999999984 543 3 3567777764 3 Q ss_pred CCCCcccccCcccccCCCCCCcccccccccCCCCCccc Q lcl|NC_016071. 465 EEIPEDMSTDELLKLLGQDTSRSGDGMTAGSNGNGTGK 502 (516) Q Consensus 465 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 502 (516) .+..+=+-...+.....+..+.+. .+..++.+.. T Consensus 396 d~~~E~~ri~~E~~~~~~~~~~~~----~~~~~~~~~~ 429 (429) T protein:vir:98 396 NPQKEIERKNSDKSTLISRQAGGL----NGQNTTTILE 429 (429) T ss_pred CHHHHHHHHHHHHHHHHHHHHhhh----cCCCCCCCCC Confidence 221110000011110000000000 0001111111 No 177 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=92.43 E-value=0.011 Score=31.23 Aligned_cols=413 Identities=10% Similarity=-0.023 Sum_probs=155.6 Q ss_pred CCccccCcccccchhhhcccC-CCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhhChHHHHHHHHHHHHHh Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAV-SRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQDHTVSTALDTKYVFVT 79 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~-~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~~v~ 79 (516) +++|..+-....+.......+ ...+.. ..........++.| + -.....-++.+....+. 
T Consensus 56 ~~~~~~r~~~l~~YY~g~~~i~~~~~~~---------~~~~~~~~~~~~~r------i-----~~n~~k~Ivd~~~~yl~ 115 (492) T protein:vir:97 56 HLEKLPEISIGQEYYEQRPDIVKEPKPV---------DATGAVDPLKPDDR------M-----ITNFHANLVDQKVSYIV 115 (492) T ss_pred HHHHHHHHHHHHHHhcccCccccccccc---------cccccccccccccc------c-----ccchHHHHHHHHhhhhc Confidence 111111111111110000000 000000 00000000000000 0 02222333333333444 Q ss_pred cCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHH-HHHHHhhcceeeeEEEeecccccccccceeeccccc Q lcl|NC_016071. 80 KAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARS-AATFNEYGFSIFEKVYRTESAPSKYAGYITIDKIAF 158 (516) Q Consensus 80 ~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~-~lda~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l~~ 158 (516) +-+..+++ .+++..+++++++++ .+.+.+.+ ..++.-||. +++++|... +|.+.+.-+.| T Consensus 116 g~p~~~~~-------~d~~~~~~l~~~~~n-----~~~~~~~~~~~~~~~~G~-a~~~v~~d~------dg~~~~~~~~p 176 (492) T protein:vir:97 116 GKPIAFKH-------TDDEVVKRIDEVLGN-----RFDDKLHSVLTGASNKGI-EWLHPYLDE------EGEFKLFRVPA 176 (492) T ss_pred ccCceecc-------CchHHHHHHHHHHhc-----cHHHHHHHHHHHHhhcCe-EEEEEEecC------CCceEEEEEcc Confidence 44444432 223556777777653 25556655 456888998 456888643 34444443333 Q ss_pred cCchhcccccceeec--CCCceeeeccccccccc--------------cccccccccccccc-cccccccCCCccccccc Q lcl|NC_016071. 159 RPQSSLSRSKPWVFD--EDGRTLKGIYQSKMAFA--------------NFQNGLTQISSAMS-LVTNLTSSADEVFIPIN 221 (516) Q Consensus 159 r~q~ti~~~~~f~~~--~dg~~l~~~~q~~~~~~--------------~~~~~~~~~~~~~~-~~~~~~~~~~~~~iP~~ 221 (516) +. +. ..|+ ..++++..++....... .+..+......... -...... 
....++.- T Consensus 177 ~~---~~----~i~d~~~~~~~~~~vr~~~~~~~~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~g~v 247 (492) T protein:vir:97 177 EQ---GI----PIWTDKEHEELEAFIRMYKLENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHF--STGSWGKI 247 (492) T ss_pred cc---eE----EEEcCCCCCceEEEEEEEeeccceeEEEEecCeEEEEEEecCeeeeccccccccccccc--ccCCCCCc Confidence 21 10 1122 12333333321110000 00000000000000 0000000 00001111 Q ss_pred cEEEEeecCcCCccccchhHHHHHHHHHH-HHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHH Q lcl|NC_016071. 222 KLMVMSLGGTESNPAGVSPLVGCYRAFRE-KILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMAD 300 (516) Q Consensus 222 k~i~~~~~~~~g~p~G~gLlr~~~~~~~f-K~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~ 300 (516) -++.|. +|+.|.|.+..+- +.+- -+..+...+..++.+..+..++++... .+.+ + .. .. T Consensus 248 Pvv~~~-----nn~~g~sd~e~v~-~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~------~~~~--~--~~----~~ 307 (492) T protein:vir:97 248 PFIPFK-----NNDLEISDIFMYK-TLIDAYNRRLSDLSNTFKDSNELTYVLKNYDD------QELP--E--FK----RL 307 (492) T ss_pred ceEEec-----CCCCCCCchHhHH-HHHHHHHHHHHHHHHHHHHhccceeeeecCCc------ccch--h--HH----HH Confidence 123332 3677889987643 3332 233455666677778888877765321 1111 1 11 11 Q ss_pred HHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHH- Q lcl|NC_016071. 301 AANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQ- 379 (516) Q Consensus 301 ~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~- 379 (516) +. ....+.++.+.+ ++++..+. ....+..+++.+.+.|.+.--...++.+.-++.+.+.| ... 
T Consensus 308 ~~-----~~~~~~~~~~~~--------~~~l~~~~--~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~A-l~~~ 371 (492) T protein:vir:97 308 LR-----YYGAIKVSDNGG--------VDTIQVEV--PVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVA-LEFL 371 (492) T ss_pred Hh-----hccceecCCCCc--------ceeEeccC--CHHHHHHHHHHHHHHHHHHhCCCCCCccccccCcHHHH-HHHH Confidence 11 112334566543 44443332 23347778888888888775555555443222111111 111 Q ss_pred -HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHH Q lcl|NC_016071. 380 -SIHGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKIL 458 (516) Q Consensus 380 -ev~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~ 458 (516) .-....+..-.+.+...+. ++++.++.+........ --.+.|....+.|..+.++++.+|. |++. ++.+. T Consensus 372 ~~~l~~ka~~~~~~f~~~l~-~~~~li~~~~~~~~~~~-~i~v~f~~~~p~~~~e~a~~~~kl~--G~iS-----~et~l 442 (492) T protein:vir:97 372 YTNLNLKADKLARKAKVAIQ-ELLWFVFEHFDIKGEHK-DVDISFNYNKVANTELQVQTAQQSM--GIVS-----HETVL 442 (492) T ss_pred HHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhcCCcccc-eeeEEecCCCCCCHHHHHHHHHHHh--ccCc-----hHHHH Confidence 1111122333334445553 46666666543222222 2367888888999999999999984 6533 34566 Q ss_pred HHcCC-CCCCCcccccCcccccCCC-CCCcccccccccCCCCCcccccccccchh Q lcl|NC_016071. 459 EVGGF-DEEIPEDMSTDELLKLLGQ-DTSRSGDGMTAGSNGNGTGKISSTRDNSV 511 (516) Q Consensus 459 e~~Gl-p~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~d~~~ 511 (516) +.++. +.+..+-+-...+.....+ ...-.+.+...+..+++. ....+. 
T Consensus 443 ~~l~~v~d~~~Eleri~~E~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~e 492 (492) T protein:vir:97 443 ENHPFVEDLQAELERIEQEQTEYNKQLPNLDDGGADSAQQQERS-----NNKESE 492 (492) T ss_pred HhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCcccccc-----cccccC Confidence 66654 3222111111111000000 000000000001000000 000000 No 178 >protein:vir:95542 Length: 548 # NCBI annotation: Putative portal protein # Family: family:all:47 # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293348;genbank:gi:148912769;genbank:GeneID:5228194 Probab=92.39 E-value=0.012 Score=31.19 Aligned_cols=453 Identities=11% Similarity=0.046 Sum_probs=180.4 Q ss_pred CCccccCcccc----cchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhh-ChHHHHHHHHHH Q lcl|NC_016071. 1 MSTRFAQPSEV----VKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQ-DHTVSTALDTKY 75 (516) Q Consensus 1 ~~~r~~~~~~~----~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~-D~~v~s~l~~Rk 75 (516) +|-+.+ ..+. .....+..+..+... +.. ... -.+.+-.+.+ .....--++|-+ +++++++++... T Consensus 11 ~sP~~a-~~R~~ar~~~~~y~aa~~~r~~~---~~~---~~~-s~~~~i~~~~--~~lr~RaRdL~rNn~~a~~av~~~~ 80 (548) T protein:vir:95 11 LAPELV-ARRLAAREAIQAYEAARPGRTHK---AKR---QPL-GADTSLQKSA--VSMREQCRKLDEDHDLVTGLLDRLE 80 (548) T ss_pred cchHHH-HHHHHhHHHhccccccCcccccc---ccC---CCC-ChHHHHHHHH--HHHHHHHHHHHhcChHHHHHHHHHH Confidence 110000 0000 000111111110000 000 000 0011111111 011122244444 999999999988 Q ss_pred HHHhcC-CceeeeCCCC-CChhhHHHHHHHHHHHhhc------cCcCCHHHHHHHHHH-HHhhcceeeeEEEeecccccc Q lcl|NC_016071. 76 VFVTKA-FNDFKVLYNR-DSKASKDAAEFVEYALKNL------ANQQTLRDIARSAAT-FNEYGFSIFEKVYRTESAPSK 146 (516) Q Consensus 76 ~~v~~~-~w~i~~~~~~-d~~~~~~~a~~v~~~l~~~------~~~~~~~~~l~~~ld-a~~~G~S~~Eivw~~~~~~~~ 146 (516) ..|-+. -+.+.+.+-. +....++.++.|+..|+.. ....+|+.+.+.++. .+.-|=+++-+.|...... 
+ T Consensus 81 ~nvVG~~G~~i~p~~l~~d~~~a~~l~~~ie~~w~~Wa~~~D~~g~~~f~~lq~l~~R~~~~dGE~f~~~~~~~~~~~-~ 159 (548) T protein:vir:95 81 ERVVGGSGIGVEPLPLRLDGSVHAELAMEIRSAWAEWSLSPETSGELTRPQVERLMCRTWLRDGEGLAQKLMGRVPNY-T 159 (548) T ss_pred HhccCccccceeeeecCCCHHHHHHHHHHHHHHHHHhhcCccccccCCHHHHHHHHHHHHHhCCceEEEeeecccccc-c Confidence 888763 3455554333 3345566777777777644 334678888888775 4666988888888765421 0 Q ss_pred cccceeeccccccCchhcccccceeecC-CCceeeeccccccc-cccccccccccccccccccccccCCCccccccccEE Q lcl|NC_016071. 147 YAGYITIDKIAFRPQSSLSRSKPWVFDE-DGRTLKGIYQSKMA-FANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLM 224 (516) Q Consensus 147 ~~g~~~~~~l~~r~q~ti~~~~~f~~~~-dg~~l~~~~q~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i 224 (516) + |..+.-+|...++.-|.-+ ++. +++.+-+++-+... -.-|.....+ |. .... ......-+.||... | T Consensus 160 ~-g~~~~~~lqliepd~l~~~----~~~~~~~i~~GIE~D~~Grp~aY~i~~~h--Pg-d~~~-~~~~~~~~rvpA~~-V 229 (548) T protein:vir:95 160 F-ATSVPFALELLEPDYLPFS----YNNLSKGIVQGIERDTWRRKRAYHLLKDH--PG-NLQT-LGGSLAVKRVEAER-I 229 (548) T ss_pred C-CcccceEEEEechhhcCCC----CCCCCCceeeeeEECCCCceEEEEEeecC--CC-cccc-cccccceeeechhH-h Confidence 0 1111112223333333211 111 22222232211100 0000000000 00 0000 01122345577765 5 Q ss_pred EEeec-CcCCccccchhHHHHHHHHHHHHHHHHHHHHHHhh--ccccceeeeecccccccccCCCCHHHHHHHHHHHHHH Q lcl|NC_016071. 225 VMSLG-GTESNPAGVSPLVGCYRAFREKILIENLETIGASK--DLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADA 301 (516) Q Consensus 225 ~~~~~-~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er--~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~ 301 (516) +|.+. .+.|..-|.++|.++.....--.......++...- -.+.|..-..++..... ....+.. 
T Consensus 230 lHif~~~r~gQ~RGvs~lapvl~~l~~l~~y~dael~~aki~A~~a~fi~~~~~~~~~~~----~~~~~~~--------- 296 (548) T protein:vir:95 230 IHIAYRKRIGQNRGVPMLHAVLIRLADLKDYEESERVAARISAALAMYIKKGNPDSYTVE----PGKDRKN--------- 296 (548) T ss_pred eecccccCCccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCCccccCC----CCccccc--------- Confidence 56655 56788899999999887655444444333332221 11112111111111000 0000000 Q ss_pred HHhhcccceEEE---eccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhc--ccccccCCccchhhHHH Q lcl|NC_016071. 302 ANAHAGEQAYFI---LPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGA--GFINLGNDGQGSYNLSE 376 (516) Q Consensus 302 ~~~~~g~~a~~i---iP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLG--qtLts~~~~~GS~Al~~ 376 (516) ..+.. ..|.+ ++.|. +|++.+.+.. ...|..|.+..-+.|+..+.- +.||.+- + +|||.+- T Consensus 297 ~~~~~--~pG~iv~~L~pGe--------~i~~~~p~~p--~~~~~~f~~~~lr~IAaglGipYe~ltgD~-s-~nYSS~R 362 (548) T protein:vir:95 297 RTIPI--APGMVFDDLEPGE--------DVGMIESNRP--NPFLEGFRNGQLRMIGAGTRSTYSSVSRAY-D-GTYSAQR 362 (548) T ss_pred ccccc--cCCccccccCCCc--------eeeecCCCCC--CCCHHHHHHHHHHHHHhhcCCCHHHHhccc-c-hhHHHHH Confidence 00000 11222 34454 4666554433 335788888888999888633 3455543 2 4676554 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---hcCC-cCC----ccccceEEe--cCcCchhHHHHHHHHHHHHhCCc Q lcl|NC_016071. 377 SKQSIHGHFVQRDIDIIVEAFNKNLIPQLLA---LNDI-RLS----DEDMPKLKP--GLIQEVDMEGFSKFVQRIGAVGY 446 (516) Q Consensus 377 vh~ev~~~~~~aDa~~i~~~ln~~li~~lv~---lN~~-~~~----~~~~P~~~~--~~~~~~dl~~~a~~~~~L~~~G~ 446 (516) .-..-+....+.....+...+-+-+-.++++ +++. 
..| ...+-...+ ..-...|..+-+++...++..|+ T Consensus 363 ~~l~e~~r~~~~~q~~~i~~~~~Pi~~~wle~a~l~G~i~lP~~~~~~~~~~~~W~~P~~~~iDP~Kea~A~~~~i~~Gl 442 (548) T protein:vir:95 363 QELVEGWLGYDLLQHEFIDYWCRPVYRSWLQMYLLARKERLPADVDHRTLYAAVYQGPVMPWINPMHEANAWELLVKAGF 442 (548) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCCCCchhheeeeeecCCccccChHHHHHHHHHHHHcCC Confidence 4333333333333333333332222222222 3321 011 111111211 22233465556777788888887 Q ss_pred ccccHH----------------HHHHHHHHcCCCCCCCccc-ccCcccc------------------------------- Q lcl|NC_016071. 447 LPKTPT----------------VINKILEVGGFDEEIPEDM-STDELLK------------------------------- 478 (516) Q Consensus 447 ~~~~~~----------------~~~~i~e~~Glp~~~~~~~-~~~~~~~------------------------------- 478 (516) .....+ .+....+++||+-+.+... ....... T Consensus 443 ~T~~~~~a~~G~D~~ev~~q~a~E~~~~~~~GL~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 522 (548) T protein:vir:95 443 ADEAEVARARGRDPRELKKSRETEIKANRAAGLVFSSDAYHQLVKSGMDPVEAVQKVYLGVGKMLTADEARELVNRYGAG 522 (548) T ss_pred CCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCCcccccccccccCCCCchhhhccccccccccchhHHhhccCCCC Confidence 554211 1122355677753211100 0000000 Q ss_pred ---cCCCCCCcccccccccCCCCCcccccccccc Q lcl|NC_016071. 479 ---LLGQDTSRSGDGMTAGSNGNGTGKISSTRDN 509 (516) Q Consensus 479 ---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 509 (516) |.|+.+....-|.+.+++.| .|. T Consensus 523 ~~~~~~~~~~~~~~~~~~~~~~~--------~~~ 548 (548) T protein:vir:95 523 LPVPGPDFPNESNNGGADGQPSN--------PDP 548 (548) T ss_pred CcCCCCCCCcccccCCCCCCCCC--------CCC Confidence 11111111111222222222 222 No 179 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=92.23 E-value=0.012 Score=31.06 Aligned_cols=437 Identities=11% Similarity=0.002 Sum_probs=155.8 Q ss_pred CCcccccchHH-HHH-HHHHHHhhcccccCCcccH----HHHHHHhh--ChHHHHHH---HHHHHHHhcCCceeeeCCCC Q lcl|NC_016071. 
23 RLRTGELGSGA-LSQ-LRAESEVMKVEELRWPCFL----ATVEAMKQ--DHTVSTAL---DTKYVFVTKAFNDFKVLYNR 91 (516) Q Consensus 23 ~~~~~e~g~~~-~~~-~~~~~~~~~~~~lr~~~~~----~~y~~m~~--D~~v~s~l---~~Rk~~v~~~~w~i~~~~~~ 91 (516) -+...|+-+.. +++ +.-.++...+-...++..- ..++++.. +.|..... ++.+.-..+....+...... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~ 80 (511) T protein:vir:96 1 MLKVNEFETDTDLRGNINYLFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR 80 (511) T ss_pred CccccchhhhhhhhhhhhhhhhhhhCCcccccchhhhhhcCHHHHHHHHHHHHHhhhHHHHHHHHHhhccCccccccCcc Confidence 22222222110 111 1112222333222332111 11222211 22222211 11121221221111100000 Q ss_pred -C-------------------------------ChhhHHHHHHHHHHHhhccCcCCHHHHHHHHH-HHHhhcceeeeEEE Q lcl|NC_016071. 92 -D-------------------------------SKASKDAAEFVEYALKNLANQQTLRDIARSAA-TFNEYGFSIFEKVY 138 (516) Q Consensus 92 -d-------------------------------~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw 138 (516) + +..+++..+++..+++. ..+..+...+. ++.-||. +++++| T Consensus 81 ~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~d~~~~~~l~~~~~~----n~~~~~~~~~~~~~~~~G~-a~~~vy 155 (511) T protein:vir:96 81 KEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDDDKDVLEAIEAFNDL----NDVESHNRSLGLDLSIYGK-AYELMI 155 (511) T ss_pred cccccCcceeecchHHHHHHHHhhhhcccCceeecCchHHHHHHHHHHhh----cChhHHHHHHHHHHHhcCe-eEEEEE Confidence 0 00112233344444432 23555655544 6788997 457888 Q ss_pred eecccccccccceeeccccccCchhcccccceeecC--CCceeeecccccccccccc----ccccccccccccccccccC Q lcl|NC_016071. 139 RTESAPSKYAGYITIDKIAFRPQSSLSRSKPWVFDE--DGRTLKGIYQSKMAFANFQ----NGLTQISSAMSLVTNLTSS 212 (516) Q Consensus 139 ~~~~~~~~~~g~~~~~~l~~r~q~ti~~~~~f~~~~--dg~~l~~~~q~~~~~~~~~----~~~~~~~~~~~~~~~~~~~ 212 (516) .-. +|.+.+..+.|+.-. ..|++ .++.+..++-......... .....+-.+..+..+.... 
T Consensus 156 ~d~------dg~~~i~~~~p~~~~-------~v~dd~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~ 222 (511) T protein:vir:96 156 RNQ------DDETRLYKSDAMSTF-------IIYDNTVERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTNR 222 (511) T ss_pred eCC------CCceEEEEEcccceE-------EEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecC Confidence 533 355554444333210 12332 2333443332111000000 0000011111111111111 Q ss_pred CC------------ccccccccEEEEeecCcCCccccchhHHHHHHHHHH-HHHHHHHHHHHHhhccccceeeeeccccc Q lcl|NC_016071. 213 AD------------EVFIPINKLMVMSLGGTESNPAGVSPLVGCYRAFRE-KILIENLETIGASKDLGGIIELKIPSQIL 279 (516) Q Consensus 213 ~~------------~~~iP~~k~i~~~~~~~~g~p~G~gLlr~~~~~~~f-K~~~~~~w~~~~er~g~~~~v~~~pp~~~ 279 (516) +. ...+..--++.|. +++.|.|.+..+-. .+- -...+..++..++.+..+++++++.... T Consensus 223 ~~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~g~gd~e~v~~-liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~- 295 (511) T protein:vir:96 223 TNGLKLTPRENSFESHSFERMPITEFS-----NNERRKGDYEKVIT-LIDLYDNAESDTANYMSDLNDAMLLIKGNLNL- 295 (511) T ss_pred CCcccccccccccccCcCcccceEEec-----CCCCCCCchhhhHH-HHHHHHHHHHHHHHHHHHhhcchhheecCccC- Confidence 11 1111112233333 35678888887532 332 2335566777788888888888764321 Q ss_pred ccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhc Q lcl|NC_016071. 280 NKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGA 359 (516) Q Consensus 280 ~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLG 359 (516) +.. +........ ++ . ......+.+.+. .......++++..+-. ...+..+++.+.+.|...--. 
T Consensus 296 -----~~~--~~~~~~~~~-~~-~---~~~~~~~~~~~~--~~~~~~~~~~l~~~~~--~~~~e~~~~~L~~~I~~~s~~ 359 (511) T protein:vir:96 296 -----DPV--EVRKQKEAN-VL-F---LEPTVYVDAEGR--ETEGSVDGGYIYKQYD--VQGTEAYKDRLNSDIHMFTNT 359 (511) T ss_pred -----Cch--hhccccccc-ce-e---ccccceeccccc--cCCCCcceeEEeecCC--HHHHHHHHHHHHHHHHHHhCC Confidence 111 100000000 00 0 000000111111 1112234555544322 234677888888888876655 Q ss_pred ccccccCCccc-hh-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---cCC-cCCc-cccceEEecCcCchhHH Q lcl|NC_016071. 360 GFINLGNDGQG-SY-NLSESKQSIHGHFVQRDIDIIVEAFNKNLIPQLLAL---NDI-RLSD-EDMPKLKPGLIQEVDME 432 (516) Q Consensus 360 qtLts~~~~~G-S~-Al~~vh~ev~~~~~~aDa~~i~~~ln~~li~~lv~l---N~~-~~~~-~~~P~~~~~~~~~~dl~ 432 (516) ..++.++-++. |. |+.- ...-....+..-.+.+...|+ ++++.++.+ +.. ..+. ..-..+.|...-+.|.. T Consensus 360 P~~~~~~~~~n~Sg~Al~~-~~~~l~~ka~~~~~~f~~~l~-~~~~li~~~~~~~~~~~~~~~~~~i~~~f~~~~p~n~~ 437 (511) T protein:vir:96 360 PNMKDDNFSGTQSGEAMKY-KLFGLEQRTKTKEGLFTKGLR-RRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLI 437 (511) T ss_pred ccccccccccccHHHHHHH-HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhcCCCccccccccceEEeCCCCCcCHH Confidence 55555432211 11 2211 111111112222334455553 344544443 111 1111 11247889988899999 Q ss_pred HHHHHHHHHHhCCcccccHHHHHHHHHHcCC-CCCCCcccccCcccccC---CCCCCcccccccccCCCCCccccccccc Q lcl|NC_016071. 433 GFSKFVQRIGAVGYLPKTPTVINKILEVGGF-DEEIPEDMSTDELLKLL---GQDTSRSGDGMTAGSNGNGTGKISSTRD 508 (516) Q Consensus 433 ~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gl-p~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~d 508 (516) +.++++.+|+ |++. .+.+.+.++. +.+..+-+-...+.+.. ........++. .+.+....+... T Consensus 438 e~~d~~~kl~--G~iS-----~et~l~~l~~v~d~~~El~ri~~E~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~ 505 (511) T protein:vir:96 438 EELKAYIDSG--GKIS-----QTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRD-----INDDEQDDDTKD 505 (511) T ss_pred HHHHHHHHHh--ccCC-----hHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhccccCCCC-----CCCCCCCCCccC Confidence 9999999985 6533 3445566643 32211100011110000 00000000000 000111111111 Q ss_pred chhhhh Q lcl|NC_016071. 
509 NSVSNM 514 (516) Q Consensus 509 ~~~~~~ 514 (516) ++..-- T Consensus 506 ~~~e~~ 511 (511) T protein:vir:96 506 TVDKKE 511 (511) T ss_pred cccccC Confidence 111111 No 180 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=92.23 E-value=0.012 Score=31.06 Aligned_cols=437 Identities=11% Similarity=0.002 Sum_probs=155.8 Q ss_pred CCcccccchHH-HHH-HHHHHHhhcccccCCcccH----HHHHHHhh--ChHHHHHH---HHHHHHHhcCCceeeeCCCC Q lcl|NC_016071. 23 RLRTGELGSGA-LSQ-LRAESEVMKVEELRWPCFL----ATVEAMKQ--DHTVSTAL---DTKYVFVTKAFNDFKVLYNR 91 (516) Q Consensus 23 ~~~~~e~g~~~-~~~-~~~~~~~~~~~~lr~~~~~----~~y~~m~~--D~~v~s~l---~~Rk~~v~~~~w~i~~~~~~ 91 (516) -+...|+-+.. +++ +.-.++...+-...++..- ..++++.. +.|..... ++.+.-..+....+...... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~ 80 (511) T protein:vir:78 1 MLKVNEFETDTDLRGNINYLFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR 80 (511) T ss_pred CccccchhhhhhhhhhhhhhhhhhhCCcccccchhhhhhcCHHHHHHHHHHHHHhhhHHHHHHHHHhhccCccccccCcc Confidence 22222222110 111 1112222333222332111 11222211 22222211 11121221221111100000 Q ss_pred -C-------------------------------ChhhHHHHHHHHHHHhhccCcCCHHHHHHHHH-HHHhhcceeeeEEE Q lcl|NC_016071. 92 -D-------------------------------SKASKDAAEFVEYALKNLANQQTLRDIARSAA-TFNEYGFSIFEKVY 138 (516) Q Consensus 92 -d-------------------------------~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw 138 (516) + +..+++..+++..+++. ..+..+...+. ++.-||. 
+++++| T Consensus 81 ~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~d~~~~~~l~~~~~~----n~~~~~~~~~~~~~~~~G~-a~~~vy 155 (511) T protein:vir:78 81 KEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDDDKDVLEAIEAFNDL----NDVESHNRSLGLDLSIYGK-AYELMI 155 (511) T ss_pred cccccCcceeecchHHHHHHHHhhhhcccCceeecCchHHHHHHHHHHhh----cChhHHHHHHHHHHHhcCe-eEEEEE Confidence 0 00112233344444432 23555655544 6788997 457888 Q ss_pred eecccccccccceeeccccccCchhcccccceeecC--CCceeeecccccccccccc----ccccccccccccccccccC Q lcl|NC_016071. 139 RTESAPSKYAGYITIDKIAFRPQSSLSRSKPWVFDE--DGRTLKGIYQSKMAFANFQ----NGLTQISSAMSLVTNLTSS 212 (516) Q Consensus 139 ~~~~~~~~~~g~~~~~~l~~r~q~ti~~~~~f~~~~--dg~~l~~~~q~~~~~~~~~----~~~~~~~~~~~~~~~~~~~ 212 (516) .-. +|.+.+..+.|+.-. ..|++ .++.+..++-......... .....+-.+..+..+.... T Consensus 156 ~d~------dg~~~i~~~~p~~~~-------~v~dd~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~ 222 (511) T protein:vir:78 156 RNQ------DDETRLYKSDAMSTF-------IIYDNTVERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTNR 222 (511) T ss_pred eCC------CCceEEEEEcccceE-------EEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecC Confidence 533 355554444333210 12332 2333443332111000000 0000011111111111111 Q ss_pred CC------------ccccccccEEEEeecCcCCccccchhHHHHHHHHHH-HHHHHHHHHHHHhhccccceeeeeccccc Q lcl|NC_016071. 213 AD------------EVFIPINKLMVMSLGGTESNPAGVSPLVGCYRAFRE-KILIENLETIGASKDLGGIIELKIPSQIL 279 (516) Q Consensus 213 ~~------------~~~iP~~k~i~~~~~~~~g~p~G~gLlr~~~~~~~f-K~~~~~~w~~~~er~g~~~~v~~~pp~~~ 279 (516) +. ...+..--++.|. +++.|.|.+..+-. .+- -...+..++..++.+..+++++++.... 
T Consensus 223 ~~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~g~gd~e~v~~-liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~- 295 (511) T protein:vir:78 223 TNGLKLTPRENSFESHSFERMPITEFS-----NNERRKGDYEKVIT-LIDLYDNAESDTANYMSDLNDAMLLIKGNLNL- 295 (511) T ss_pred CCcccccccccccccCcCcccceEEec-----CCCCCCCchhhhHH-HHHHHHHHHHHHHHHHHHhhcchhheecCccC- Confidence 11 1111112233333 35678888887532 332 2335566777788888888888764321 Q ss_pred ccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhc Q lcl|NC_016071. 280 NKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGA 359 (516) Q Consensus 280 ~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLG 359 (516) +.. +........ ++ . ......+.+.+. .......++++..+-. ...+..+++.+.+.|...--. T Consensus 296 -----~~~--~~~~~~~~~-~~-~---~~~~~~~~~~~~--~~~~~~~~~~l~~~~~--~~~~e~~~~~L~~~I~~~s~~ 359 (511) T protein:vir:78 296 -----DPV--EVRKQKEAN-VL-F---LEPTVYVDAEGR--ETEGSVDGGYIYKQYD--VQGTEAYKDRLNSDIHMFTNT 359 (511) T ss_pred -----Cch--hhccccccc-ce-e---ccccceeccccc--cCCCCcceeEEeecCC--HHHHHHHHHHHHHHHHHHhCC Confidence 111 100000000 00 0 000000111111 1112234555544322 234677888888888876655 Q ss_pred ccccccCCccc-hh-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---cCC-cCCc-cccceEEecCcCchhHH Q lcl|NC_016071. 360 GFINLGNDGQG-SY-NLSESKQSIHGHFVQRDIDIIVEAFNKNLIPQLLAL---NDI-RLSD-EDMPKLKPGLIQEVDME 432 (516) Q Consensus 360 qtLts~~~~~G-S~-Al~~vh~ev~~~~~~aDa~~i~~~ln~~li~~lv~l---N~~-~~~~-~~~P~~~~~~~~~~dl~ 432 (516) ..++.++-++. |. |+.- ...-....+..-.+.+...|+ ++++.++.+ +.. ..+. ..-..+.|...-+.|.. 
T Consensus 360 P~~~~~~~~~n~Sg~Al~~-~~~~l~~ka~~~~~~f~~~l~-~~~~li~~~~~~~~~~~~~~~~~~i~~~f~~~~p~n~~ 437 (511) T protein:vir:78 360 PNMKDDNFSGTQSGEAMKY-KLFGLEQRTKTKEGLFTKGLR-RRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLI 437 (511) T ss_pred ccccccccccccHHHHHHH-HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhcCCCccccccccceEEeCCCCCcCHH Confidence 55555432211 11 2211 111111112222334455553 344544443 111 1111 11247889988899999 Q ss_pred HHHHHHHHHHhCCcccccHHHHHHHHHHcCC-CCCCCcccccCcccccC---CCCCCcccccccccCCCCCccccccccc Q lcl|NC_016071. 433 GFSKFVQRIGAVGYLPKTPTVINKILEVGGF-DEEIPEDMSTDELLKLL---GQDTSRSGDGMTAGSNGNGTGKISSTRD 508 (516) Q Consensus 433 ~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gl-p~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~d 508 (516) +.++++.+|+ |++. .+.+.+.++. +.+..+-+-...+.+.. ........++. .+.+....+... T Consensus 438 e~~d~~~kl~--G~iS-----~et~l~~l~~v~d~~~El~ri~~E~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~ 505 (511) T protein:vir:78 438 EELKAYIDSG--GKIS-----QTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRD-----INDDEQDDDTKD 505 (511) T ss_pred HHHHHHHHHh--ccCC-----hHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhccccCCCC-----CCCCCCCCCccC Confidence 9999999985 6533 3445566643 32211100011110000 00000000000 000111111111 Q ss_pred chhhhh Q lcl|NC_016071. 509 NSVSNM 514 (516) Q Consensus 509 ~~~~~~ 514 (516) ++..-- T Consensus 506 ~~~e~~ 511 (511) T protein:vir:78 506 TVDKKE 511 (511) T ss_pred cccccC Confidence 111111 No 181 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=92.03 E-value=0.013 Score=30.89 Aligned_cols=426 Identities=11% Similarity=-0.027 Sum_probs=159.3 Q ss_pred CCccccCcccccchhhh-cccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhhChHHHHHHHHHHHHHh Q lcl|NC_016071. 
1 MSTRFAQPSEVVKAGNE-NLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQDHTVSTALDTKYVFVT 79 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~-~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~~v~ 79 (516) ++.|..+-.+..+.... ++-+...+ . .. .+..++-| .......-++.+....+. T Consensus 53 ~~~~~~r~~~l~~Yy~g~~~i~~~~~-~------------~~-~~~~~~~k-----------i~~n~~k~Iv~~~~~yl~ 107 (511) T protein:vir:99 53 MDYQRPRLKVLSDYYEGKTKNLVELT-R------------RK-EEYMADNR-----------VAHDYASYISDFINGYFL 107 (511) T ss_pred HHhhHHHHHHHHHHhcccCccccccC-c------------cc-ccccCcce-----------eecchHHHHHHHHHhhhc Confidence 11111111111111000 00000000 0 00 00000000 012333444444445555 Q ss_pred cCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecccccccccceeeccccc Q lcl|NC_016071. 80 KAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPSKYAGYITIDKIAF 158 (516) Q Consensus 80 ~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l~~ 158 (516) +-+..+++ + +.++.+++..+++.- .|..+...+. ++.-||.+ ++++|... +|.+.+..+.| T Consensus 108 g~p~~~~~----~---d~~~~~~l~~~~~~n----~~~~~~~~~~~~~~i~G~a-~~~vy~de------d~~~~i~~~~p 169 (511) T protein:vir:99 108 GNPIQYQD----D---DKDVLEAIEAFNDLN----DVESHNRSLGLDLSIYGKA-YELMIRNQ------DDETRLYKSDA 169 (511) T ss_pred ccCceeec----C---chHHHHHHHHHHhhc----CHhHHHHHHHHHHHhcCee-EEEEEeCC------CCceEEEEEcc Confidence 55555542 1 224456677776542 2556665544 68889964 56787643 34454444333 Q ss_pred cCchhcccccceeecCC--Cceeeeccccccccccc----cccccccccccccccccccCCC----------cc--cccc Q lcl|NC_016071. 159 RPQSSLSRSKPWVFDED--GRTLKGIYQSKMAFANF----QNGLTQISSAMSLVTNLTSSAD----------EV--FIPI 220 (516) Q Consensus 159 r~q~ti~~~~~f~~~~d--g~~l~~~~q~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~----------~~--~iP~ 220 (516) +. + ...|++. ++.+..++......... ...+..+-.+-.+..+.....+ .. .+.. 
T Consensus 170 ~~---~----~~vyd~~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~ 242 (511) T protein:vir:99 170 MS---T----FVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFER 242 (511) T ss_pred ce---e----EEEEcCCCCCceEEEEEEEEeeecccCccceEEEEEEEeCCcEEEEEecCCccccccccccccccCCCCc Confidence 22 1 1123322 33333333211100000 0000001111111111111110 01 1111 Q ss_pred ccEEEEeecCcCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHH Q lcl|NC_016071. 221 NKLMVMSLGGTESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMAD 300 (516) Q Consensus 221 ~k~i~~~~~~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~ 300 (516) --++.|+ +|+.|.|.+..+-...=--...+..++..++.+..+++++++.... +.. +........ + T Consensus 243 vPvv~~~-----nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~------~~~--~~~~~~~~~-~ 308 (511) T protein:vir:99 243 MPITEFS-----NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNL------DPV--EVRKQKEAN-V 308 (511) T ss_pred cceEEec-----CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhchhhhhccCccc------Cch--hhccccccc-c Confidence 1233333 3677888888763322222345566777778888888887764221 111 000000000 0 Q ss_pred HHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHHH Q lcl|NC_016071. 301 AANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQS 380 (516) Q Consensus 301 ~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~e 380 (516) + . . .......+.........+++++..+.. ...+...++.+.+.|.+.--...++.++-++. ++.+.-. 
T Consensus 309 ~-~---~--~~~~~~~~~~~~~~~~~d~~~l~~~~~--~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~gn---~Sg~Alk 377 (511) T protein:vir:99 309 L-F---L--EPTVYADSEGRETEGSVDGGYIYKQYD--VQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGT---QSGEAMK 377 (511) T ss_pred e-e---c--ccccccccccccCCCCcceeEEeecCC--HHHHHHHHHHHHHHHHHHhCCccccccccccc---chHHHHH Confidence 0 0 0 000000111111112234566654332 33467788999888877665555555432211 1222222 Q ss_pred HHHH----HHHHHHHHHHHHHHHHHHHHHHHh---cCCc-CC-ccccceEEecCcCchhHHHHHHHHHHHHhCCcccccH Q lcl|NC_016071. 381 IHGH----FVQRDIDIIVEAFNKNLIPQLLAL---NDIR-LS-DEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTP 451 (516) Q Consensus 381 v~~~----~~~aDa~~i~~~ln~~li~~lv~l---N~~~-~~-~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~ 451 (516) .... .+..-.+.+...|+ ++++.++.+ +... .+ +..-..+.|....+.|..+.++++.+|. |++. T Consensus 378 ~~~~~l~~ka~~k~~~~~~~l~-~~~~li~~~~~~~~~~~~~~~~~~i~i~f~~~~p~n~~e~~~~~~kl~--GiiS--- 451 (511) T protein:vir:99 378 YKLFGLEQRTKTKEGLFTKGLR-RRAKLLETILKNTRSIDVSKDFNTVRYVYNRNLPKSLIEELKAYIDSG--GKIS--- 451 (511) T ss_pred HHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhcCCcccccccccceEEeCCCCCcCHHHHHHHHHHHh--ccCC--- Confidence 2222 22223344455554 355554443 2111 11 1112478888888999999999999985 6533 Q ss_pred HHHHHHHHHcCC-CCCCCcccccCccc---ccCCCCCCcccccccccCCCCCcccccccccchh Q lcl|NC_016071. 452 TVINKILEVGGF-DEEIPEDMSTDELL---KLLGQDTSRSGDGMTAGSNGNGTGKISSTRDNSV 511 (516) Q Consensus 452 ~~~~~i~e~~Gl-p~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~ 511 (516) .+.+.+.++. +.+..+-+-...+. ....+......++.......++..+. ..|... 
T Consensus 452 --~et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~d~~e 511 (511) T protein:vir:99 452 --QTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKNMYQDPRNINDDEQDDSTKD--SIDKKE 511 (511) T ss_pred --HHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCCCCCCCCcC--cccccC Confidence 3455566643 32211100000000 00000000000000000001111111 111111 No 182 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=91.57 E-value=0.015 Score=30.55 Aligned_cols=439 Identities=10% Similarity=-0.004 Sum_probs=158.4 Q ss_pred CCcccccchH--HHHHHHHHHHhhcccccCCcccHH----HHHHHhh--ChHHHHH---HHHHHHHHhcCCceee-eCCC Q lcl|NC_016071. 23 RLRTGELGSG--ALSQLRAESEVMKVEELRWPCFLA----TVEAMKQ--DHTVSTA---LDTKYVFVTKAFNDFK-VLYN 90 (516) Q Consensus 23 ~~~~~e~g~~--~~~~~~~~~~~~~~~~lr~~~~~~----~y~~m~~--D~~v~s~---l~~Rk~~v~~~~w~i~-~~~~ 90 (516) -+...|+-+. -.-.+....+...+-...++..-+ .++++.. ..|..-. +++.+.-..+....+. .... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~ 80 (511) T protein:vir:93 1 MLKVNEFETDTDLRGNINYLFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR 80 (511) T ss_pred CccccchhhhhhhhhhhhhhhhhhhCCcccccchhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcC Confidence 2222333221 111122223333333333332111 1122111 1222211 1111222222111110 0000 Q ss_pred CCC-------------------------------hhhHHHHHHHHHHHhhccCcCCHHHHHHHHH-HHHhhcceeeeEEE Q lcl|NC_016071. 91 RDS-------------------------------KASKDAAEFVEYALKNLANQQTLRDIARSAA-TFNEYGFSIFEKVY 138 (516) Q Consensus 91 ~d~-------------------------------~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw 138 (516) .+. ..+.++.+++..+++ ...|......+. 
++.-||.+ ++++| T Consensus 81 ~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~d~~~~~~l~~~~~----~n~~~~~~~~~~~~~~~~G~a-y~~vy 155 (511) T protein:vir:93 81 KEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDDDKDVLEVIEAFND----LNDVESHNRSLGLDLSIYGKA-YELMI 155 (511) T ss_pred cccccCcceeecchHHHHHHHHhhhhcccCeeeccCChHHHHHHHHHHh----hcCHhHHHHHHHHHHHhcCee-EEEEE Confidence 000 011222233333332 223666665554 68889965 56777 Q ss_pred eecccccccccceeeccccccCchhcccccceeecCC--Cceeeecccccccccccc----ccccccccccccccccccC Q lcl|NC_016071. 139 RTESAPSKYAGYITIDKIAFRPQSSLSRSKPWVFDED--GRTLKGIYQSKMAFANFQ----NGLTQISSAMSLVTNLTSS 212 (516) Q Consensus 139 ~~~~~~~~~~g~~~~~~l~~r~q~ti~~~~~f~~~~d--g~~l~~~~q~~~~~~~~~----~~~~~~~~~~~~~~~~~~~ 212 (516) ... +|.+.+..+.|+.- ...|++. ++.+..++.......... .....+-.+..+..+.... T Consensus 156 ~de------~~~~~i~~~~p~~~-------~~vydd~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~iyt~~~i~~~~~~~ 222 (511) T protein:vir:93 156 RNQ------DDETRLYKSDAMST-------FVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSR 222 (511) T ss_pred eCC------CCceEEEEEcccee-------EEEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecC Confidence 533 34444444333321 1123332 344444332111000000 0000000111111111111 Q ss_pred C------------CccccccccEEEEeecCcCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccc Q lcl|NC_016071. 213 A------------DEVFIPINKLMVMSLGGTESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILN 280 (516) Q Consensus 213 ~------------~~~~iP~~k~i~~~~~~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~ 280 (516) . ....+..--++.|+ .|+.|.|.+..+-..-=-=...+..++..++.+..+++++++.... 
T Consensus 223 ~~~~~~~~~~~~~~~~~~g~vPvv~~~-----nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~-- 295 (511) T protein:vir:93 223 TNGLKLTPRENGFESHSFERMPITEFS-----NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNL-- 295 (511) T ss_pred CCccccccccccccccCCCccceEEec-----CCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhhCcceeeecCccc-- Confidence 1 01111111233333 3567888888764322222335566777788888888888764221 Q ss_pred cccCCCCHHHHHHHHHHHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcc Q lcl|NC_016071. 281 KAAIDPKSPESEMVQGLMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAG 360 (516) Q Consensus 281 k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGq 360 (516) +. .+........ ++ . ... .....+.........+++++..+.. ...+..+++.+.+.|.+.--.. T Consensus 296 ----~~--~~~~~~~~~~-~~-~---~~~--~~~~~~~~~~~~~~~~~~~l~~~~~--~~~~~~~~~~L~~~I~~~s~~P 360 (511) T protein:vir:93 296 ----DP--VEVRKQKEAN-VL-F---LEP--TVYADSEGRETEGSVDGGYIYKQYD--VQGTEAYKDRLNSDIHMFTNTP 360 (511) T ss_pred ----Cc--hhhccccccc-ce-e---ccc--ccccccccccCCCCcceeEEeecCC--HHHHHHHHHHHHHHHHHHhCCc Confidence 11 1100000000 00 0 000 0000011111122345666654332 2346778888889988776666 Q ss_pred cccccCCccchhhHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHh---cCCc-CC-ccccceEEecCcCchhH Q lcl|NC_016071. 361 FINLGNDGQGSYNLSESKQSIHGHF----VQRDIDIIVEAFNKNLIPQLLAL---NDIR-LS-DEDMPKLKPGLIQEVDM 431 (516) Q Consensus 361 tLts~~~~~GS~Al~~vh~ev~~~~----~~aDa~~i~~~ln~~li~~lv~l---N~~~-~~-~~~~P~~~~~~~~~~dl 431 (516) .++.++.++.+ +.+.-...... +..-.+.+...|. ++++.++.+ ++.. .+ +..-..+.|....+.|. 
T Consensus 361 ~~~~~~~~~n~---Sg~Al~~~~~~l~~k~~~k~~~f~~~l~-~~~~li~~~l~~~~~~~~~~d~~~i~~~f~~~~p~n~ 436 (511) T protein:vir:93 361 NMKDDNFSGTQ---SGEAMKYKLFGLEQRTKTKEGLFTKGLR-RRAKLLETILKNTWSIDANKDFNTVRYVYNRNLPKSL 436 (511) T ss_pred ccccccccccc---hHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhccCcccccccccceEEeCCCCCCCH Confidence 56554332211 22222222222 2222334455553 345555543 2111 11 11124778888889999 Q ss_pred HHHHHHHHHHHhCCcccccHHHHHHHHHHcCC-CCCCCcccccCcccccCCCCCCcccccccccCCCCCcccccccccch Q lcl|NC_016071. 432 EGFSKFVQRIGAVGYLPKTPTVINKILEVGGF-DEEIPEDMSTDELLKLLGQDTSRSGDGMTAGSNGNGTGKISSTRDNS 510 (516) Q Consensus 432 ~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gl-p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 510 (516) .+.++++.+|. |++. .+.+.+.++. +.+..+-+-...+.+...+...... + ......+.+....+..+++ T Consensus 437 ~e~~~~~~kl~--g~iS-----~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~-~-~~~~~~~~~~~~~~~~~~~ 507 (511) T protein:vir:93 437 IEELKAYIDSG--GKIS-----QTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGI-Y-KDPRDINDDEQDDDTKDTV 507 (511) T ss_pred HHHHHHHHHHh--ccCc-----hHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhhc-c-cCCCCCCCCCCCCcccccc Confidence 99999999984 6533 3456666644 3221110001011000000000000 0 0000011111111122222 Q ss_pred hhhh Q lcl|NC_016071. 511 VSNM 514 (516) Q Consensus 511 ~~~~ 514 (516) ..-. T Consensus 508 ~~~~ 511 (511) T protein:vir:93 508 DKKE 511 (511) T ss_pred cccC Confidence 2111 No 183 >protein:vir:94599 Length: 641 # NCBI annotation: PfWMP4_39 # Family: family:all:1548 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762669;genbank:gi:115304377;genbank:GeneID:5142299 Probab=90.90 E-value=0.018 Score=30.09 Aligned_cols=472 Identities=13% Similarity=0.080 Sum_probs=172.3 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHH-------H------------ Q lcl|NC_016071. 
1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEA-------M------------ 61 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~-------m------------ 61 (516) |+ +..+.+|.+ .++ ++---+..-.+|+.-++.++...+.-.+.+..|.+.++.|.. + T Consensus 1 ~~--~~~~~~~~~-~~~-~~~~~~~~~~~~~~l~~~~~~~~~~R~~~e~~W~e~~~~~~~~~~~~~~~~~~~~~~~~~~~ 76 (641) T protein:vir:94 1 MT--IEMPTPIIE-DKE-SAKRKLSTDRIGGVVISKWQESRDKRNTVENNWDETYELYRASAIDRQNTRARNFQTTGADD 76 (641) T ss_pred Cc--cCCCccccc-CCc-chhhcCCchhHHHHHHHHHHHHHHhhcchHHHHHHHHHHhhcchhhhhhcccccccccccch Confidence 32 222333332 111 111122233556666677777777666666666544433210 0 Q ss_pred -----h-hChHHHHHHHHHHHHHhc-----CCc-eeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHH-HHHh Q lcl|NC_016071. 62 -----K-QDHTVSTALDTKYVFVTK-----AFN-DFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAA-TFNE 128 (516) Q Consensus 62 -----~-~D~~v~s~l~~Rk~~v~~-----~~w-~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~l-da~~ 128 (516) . -|+++...++.....+.+ .+| ++++ .++.+.+.|+++.+.|++.-.+..|.+.+...+ +++. T Consensus 77 ~~~r~ki~~~~~~~~~~~l~s~Lm~~~~p~~~wf~~~p----~~~ed~~~A~~~~~~~~~~l~~~~~~~~~~~~~~d~~~ 152 (641) T protein:vir:94 77 ADWRHRINTGHTFEVVETLVAYFKGATFPSDDWFDLKG----MVPELADAARVVKQLTKTKLEAASIRDIFETYVRNLVL 152 (641) T ss_pred hcccccccchhHHHHHHHHhhHHhhhhcCCCceEEEec----CCCChHHHHHHHHHHHHHHHhhcchHHHHHHHHHHHhh Confidence 0 144444444444333332 345 3332 233455677777777765433445666666654 7888 Q ss_pred hcceeeeEEEeec-----------ccccc-cc---------cc----------eeecc---------cccc-Cchhcccc Q lcl|NC_016071. 129 YGFSIFEKVYRTE-----------SAPSK-YA---------GY----------ITIDK---------IAFR-PQSSLSRS 167 (516) Q Consensus 129 ~G~S~~Eivw~~~-----------~~~~~-~~---------g~----------~~~~~---------l~~r-~q~ti~~~ 167 (516) +|..++.+-|... ++... .. .. +.++. 
...| -..++..- T Consensus 153 ~g~~iv~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~r~~~v~~~di~~dps~~~~~~~f~~~r~t~~t~~~l 232 (641) T protein:vir:94 153 YGVSTYRLGWDTSMERQFKRTFVETGDIFGGWEDVAVNRQRSELRIEPLSPYDVWLDTSGGKNTGTFVRLRHTREELHEL 232 (641) T ss_pred cCceEEEeehhhHHHHhhhhhcccchhhcccccccceecccceeeEEecchhheeecCCCCcccccceehhhhHHHHHHH Confidence 9988776666432 11000 00 00 00000 0000 00000000 Q ss_pred -cceeecCCCceeeeccccccccccc-------cccccc-----------cccccccccccccC----CCccccc-cccE Q lcl|NC_016071. 168 -KPWVFDEDGRTLKGIYQSKMAFANF-------QNGLTQ-----------ISSAMSLVTNLTSS----ADEVFIP-INKL 223 (516) Q Consensus 168 -~~f~~~~dg~~l~~~~q~~~~~~~~-------~~~~~~-----------~~~~~~~~~~~~~~----~~~~~iP-~~k~ 223 (516) .--.|+.|.-.......+.....+. ....+. ..+.+.+...+.+. .++.... ..-| T Consensus 233 ~~eg~~~~d~v~~~~~~~~~~~~~d~~~d~~~~~~~~~~~~e~~gd~~~d~~~~~~~~~~~~g~~il~~~~~~~~d~~Pf 312 (641) T protein:vir:94 233 VTSGYYDLDLTQVEQYVDYKFADPDTPKDVNGTDTSGWDIIEYYGPLLVEGVQFWCVHAVFYGKQLIRLSDSKYWCGSPF 312 (641) T ss_pred HhcCCCChhhcchhhcccccccccccccccccccccccceeeeeeeeccCCCceeeEEEEEeCCEEeecccccccCcCCe Confidence 0000000000000000000000000 000000 00001111111100 0011000 1258 Q ss_pred EEEeecCcCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHH Q lcl|NC_016071. 224 MVMSLGGTESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAAN 303 (516) Q Consensus 224 i~~~~~~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~ 303 (516) +++++....++.||.|....|..-..-.+...+.-+..++.-..|...+. +.+.+. + . 
+ T Consensus 313 ~~~r~~~~~~~~YG~gp~~~~l~dqk~ln~l~r~~ld~~~~~~~p~~~~~-~~~~~~-----~--~-------------~ 371 (641) T protein:vir:94 313 VTTTLLPDRDSVYGMSVLHPNLGALHVLNVLTNGRLDNLVLHINKMWTLV-EDGILK-----R--E-------------D 371 (641) T ss_pred EEecceecCCcccCCChHHHHHHHHHHHHHHHHHHHHHHHHHhCCeeeec-cccccc-----c--c-------------e Confidence 99999999999999999999998888887777777777666444332110 111100 0 0 0 Q ss_pred hhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCc-cchh-hHHHHHH-- Q lcl|NC_016071. 304 AHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDG-QGSY-NLSESKQ-- 379 (516) Q Consensus 304 ~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~-~GS~-Al~~vh~-- 379 (516) ++.+ .|+++..+... .+..+ ..++.........+++++..|.+++....+..+.+. .|.. -+.+|.. T Consensus 372 l~~~--PG~ii~~~~~~------~v~pl-~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~TAtEV~~~~ 442 (641) T protein:vir:94 372 VKAK--PGAVFKVAQHG------SLQPI-DMGRQDFVVTYQEAQVQESSVYRNTSTGPLIGNAAPRGGERVTAAEIQGVR 442 (641) T ss_pred eecc--CCcceeeCCCC------cceee-cCCccccchhHHHHHHHHHHHHHhhhhhhhhcccccccchhccHHHHHHHH Confidence 1111 12232222110 11111 112221112345789999899888876655443221 1211 1222321 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCcc-----------------ccc-eE--EecC--cCchhHHHHHHH Q lcl|NC_016071. 380 SIHGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDE-----------------DMP-KL--KPGL--IQEVDMEGFSKF 437 (516) Q Consensus 380 ev~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~-----------------~~P-~~--~~~~--~~~~dl~~~a~~ 437 (516) +.....+..-.+.+.+.+-..|+.+++.+|-.+.... ..| .+ .++. ........-+.. 
T Consensus 443 ~e~~~~l~~i~r~l~~e~l~pll~~~~~~~~~~~~~p~i~R~~~~~~~~~~~~~~~p~~L~~~~~iv~l~~~q~~~~~~~ 522 (641) T protein:vir:94 443 DAGGNRLSSVHTHIEDSSTLPLLNKVFSLLQQFYVTPETIRMYVPEEQMDGFFEVSPEYLHYPYKFLALGANYVVERERM 522 (641) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccchhhhhhhchhhhcccCCCCCccceeeeeeEeecchhHHHHHHHH Confidence 1112223334444554444555665555552221111 011 11 1111 111111111222 Q ss_pred HHHHH---hCCcccc-------cHHHHHHHHHHcCCCCCCCcccccCcccccCCCCCCcccccccccCCCCCcccc--cc Q lcl|NC_016071. 438 VQRIG---AVGYLPK-------TPTVINKILEVGGFDEEIPEDMSTDELLKLLGQDTSRSGDGMTAGSNGNGTGKI--SS 505 (516) Q Consensus 438 ~~~L~---~~G~~~~-------~~~~~~~i~e~~Glp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~ 505 (516) ++.|. +.....| .......+.+..|+|.|.. ........+.+.....+. ...... ..+++ .. T Consensus 523 i~~l~~~~~~~a~~P~v~d~~d~~~~~~~~~~~~g~~~p~~--~ir~~~~~~~~~~~~~~~--~q~~~~--~~a~~~~~~ 596 (641) T protein:vir:94 523 VTDLLQLLDISGRVPQIGQSLDYALILEDLLRQMRFTDPMR--YIKKAEAPPAAPPIAPAE--PGALPP--EMMNSVGGG 596 (641) T ss_pred HHHHHHHHHHhhcChhhhhcCCHHHHHHHHHHHhCCCCchh--hccCccCchhHHHHHHHH--HHHHHH--HHHHHHHhh Confidence 22221 1111112 1112355667778875432 111000000000000000 000000 00000 01 Q ss_pred cccchhhh--------hcC Q lcl|NC_016071. 506 TRDNSVSN--------MDN 516 (516) Q Consensus 506 ~~d~~~~~--------~~~ 516 (516) ..|.+.++ |.| T Consensus 597 ~~~~a~~~~~~~~~~~~~~ 615 (641) T protein:vir:94 597 LNDQAIAGMTPEDVSDLAS 615 (641) T ss_pred hHHHHHHHhhHHHHHHHHH Confidence 11122122 222 No 184 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=90.14 E-value=0.022 Score=29.63 Aligned_cols=433 Identities=12% Similarity=-0.040 Sum_probs=151.1 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHH-HHHHHHhhcccccCCcccHHHHH-------------HHhhCh- Q lcl|NC_016071. 
1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQ-LRAESEVMKVEELRWPCFLATVE-------------AMKQDH- 65 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~-~~~~~~~~~~~~lr~~~~~~~y~-------------~m~~D~- 65 (516) |+ . .+|+....+.=...+.. +..+. .+.+.++. .-+.|+ .-.++- T Consensus 1 ~~-------------~---~i~~~~~~~~~~~~~~~L~~~~~--~~~~r~~~--~~~YY~G~~~i~~~~~~~~~~~~~~~ 60 (485) T protein:vir:24 1 MT-------------A---PLPGQEEIADPAIARDEMVSAFE--DQNQNLRS--NTSYYEAERRPEAIGVTVPVQMQSLL 60 (485) T ss_pred CC-------------C---CCCCCCcccchHHHHHHHHHHHH--HHHHHHHH--HHHHHhccCchhhcCcccchhhhhhh Confidence 11 1 12222222221111111 11111 11111100 001110 000111 Q ss_pred ----HHHHHHHHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHH-HHHHhhcceeeeEEEee Q lcl|NC_016071. 66 ----TVSTALDTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSA-ATFNEYGFSIFEKVYRT 140 (516) Q Consensus 66 ----~v~s~l~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~-lda~~~G~S~~Eivw~~ 140 (516) ...-++.+.-..+..-. |.+. .++..++ .+.+.|+. ..|+.+..++ .++.-||.| +++||.- T Consensus 61 ~~~n~~~~ivd~~~~~l~~~g--~~~~--~~~~~~~----~l~~i~~~----N~~d~~~~~~~~~a~i~G~a-y~~v~~~ 127 (485) T protein:vir:24 61 AHVGYPRLYVDSIAERQAVEG--FRLG--DADEADE----ELWQWWQA----NNLDIEAPLGYTDAYVHGRS-YITISRP 127 (485) T ss_pred hccchHHHHHHHHhhhhccCc--eecC--CCchhHH----HHHHHHHh----cChhHHHHHHHHHHhhcCce-EEEEecC Confidence 11111111111111112 2221 1222222 23444432 2366666554 468899997 6788875 Q ss_pred cccccccc--cceeeccccccCchhccc---------ccceeecCCCceeeeccccccccccccccccc-ccc-cccccc Q lcl|NC_016071. 141 ESAPSKYA--GYITIDKIAFRPQSSLSR---------SKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQ-ISS-AMSLVT 207 (516) Q Consensus 141 ~~~~~~~~--g~~~~~~l~~r~q~ti~~---------~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~-~~~-~~~~~~ 207 (516) ..+....+ +...+..+.|+.-..+.+ .+++ +++++.......- |....+. ... .-.+. 
T Consensus 128 ~~~~~~~~~~~~~~i~~~~p~~~~~i~D~~~~~~~~~~~~~-~~~~~~~~~~~~~-------y~~~~~~~~~~~~~~~~- 198 (485) T protein:vir:24 128 DPQIDLGWDPNVPLIRVEPPTRMYAEIDPRIGRPAKAIRVA-YDAEGNEIQAATL-------YTPNETFGWFRAEGEWV- 198 (485) T ss_pred CcccccccCCCcceEEEeccceeEEEeeCCcCceeEEEEEE-EeecCCeEEEEEE-------EcCCcEEEEEecCCceE- Confidence 54322111 111111111110000000 0011 1222211111110 0000000 000 00000 Q ss_pred ccccCCCccccccccEEEEeecCcCCccccchhHHHHHHHHHHH-HHHHHHHHHHHhhccccceeeeecccccccccCCC Q lcl|NC_016071. 208 NLTSSADEVFIPINKLMVMSLGGTESNPAGVSPLVGCYRAFREK-ILIENLETIGASKDLGGIIELKIPSQILNKAAIDP 286 (516) Q Consensus 208 ~~~~~~~~~~iP~~k~i~~~~~~~~g~p~G~gLlr~~~~~~~fK-~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~ 286 (516) . .......++.--++.|.++.+.+.|+|.|-+.......+-. ...+...+...+-+..|..+++|... .+- T Consensus 199 ~--~~~~~h~~g~vPvv~f~n~~~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~------~~~ 270 (485) T protein:vir:24 199 E--WFSDPHGLGAVPVVPLPNRTRLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKP------EEI 270 (485) T ss_pred e--ecccccCCCcccEEEeccCcccCCcCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhccCCc------ccc Confidence 0 00111223344457788888888899998876433322211 22334455566767777766654211 000 Q ss_pred CHHHHHHHHHHHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccC Q lcl|NC_016071. 287 KSPESEMVQGLMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGN 366 (516) Q Consensus 287 ~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~ 366 (516) ...+..... . +.++..+-..+|.+ + .++.....+ +...++++++.-|.+.--.-.++... 
T Consensus 271 ~~~~~~~~~-~------~~~~~~~i~~~~~~-~--------~~~~q~~~~----~~e~~~~~l~~~i~~~s~~~~~p~~~ 330 (485) T protein:vir:24 271 GVDPETGQT-L------FDAYLARILAFEDA-E--------GKIQQFSAA----ELANFTNALDQIAKQVAAYTGLPPQY 330 (485) T ss_pred ccccccccc-h------hhhcccceeccCCC-C--------ceEEeeccc----chHHHHHHHHHHHHHHhcccCCCHHH Confidence 000000000 0 01111122233322 2 222222222 23445666665554443222222111 Q ss_pred -----Cccchh-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-cCCcCC-ccccceEEecCcCchhHHHHHHHH Q lcl|NC_016071. 367 -----DGQGSY-NLSESKQSIHGHFVQRDIDIIVEAFNKNLIPQLLAL-NDIRLS-DEDMPKLKPGLIQEVDMEGFSKFV 438 (516) Q Consensus 367 -----~~~GS~-Al~~vh~ev~~~~~~aDa~~i~~~ln~~li~~lv~l-N~~~~~-~~~~P~~~~~~~~~~dl~~~a~~~ 438 (516) .+..|. |+. ....-....++.-.+.+...|+ ++++.++.+ |....+ +..-..+.|....+.++.+.++++ T Consensus 331 fg~~~~n~~Sg~Al~-~~~~~l~~ka~~~~~~f~~~l~-~~~~l~~~~~~~~~~~~d~~~i~v~f~~~~~~s~~~~ad~~ 408 (485) T protein:vir:24 331 LSTAADNPASAEAIR-AAESRLIKKVERKNAIFGGAWE-EAMRLAYRLMKGGDVPPDMLRMETVWRDPSTPTYAAKADAA 408 (485) T ss_pred hccccCcchHHHHHH-HHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhcCCCCccccceeeEEecCCCCCCHHHHHHHH Confidence 111122 222 1222223333344445555664 355655555 321111 112246678888888999999999 Q ss_pred HHHHhCCc-ccccHHHHHHHHHHcCCCCCCCccccc-CcccccCCC-------CCCcccccccccCCCCCccc-cccccc Q lcl|NC_016071. 439 QRIGAVGY-LPKTPTVINKILEVGGFDEEIPEDMST-DELLKLLGQ-------DTSRSGDGMTAGSNGNGTGK-ISSTRD 508 (516) Q Consensus 439 ~~L~~~G~-~~~~~~~~~~i~e~~Glp~~~~~~~~~-~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~-~~~~~d 508 (516) .+|+..|. +++ .+-+++.+|+.....++... .......+. ...+..++..... ...+.+ .+.+.| T Consensus 409 ~kl~~~g~~~~s----~et~~~~l~~~~d~~~e~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~-e~~~~~~~~~~~~ 483 (485) T protein:vir:24 409 TKLYGNGQGVIP----RERARKDMGYSIAEREEMRRWDEEEAAMGLGLLGTMVDADPTVPGSPNPT-PAPKPQPAIEGGD 483 (485) T ss_pred HHHHhcccccCC----HHHHHhhCCCCHhHHHHHHHHHHHHhhhhhhHHHhhcccCCCCCCCCCCC-CCCCCccCCCCCC Confidence 99998874 232 35677888886432111110 000000000 0011111110000 111111 122233 Q ss_pred ch Q lcl|NC_016071. 
509 NS 510 (516) Q Consensus 509 ~~ 510 (516) ++ T Consensus 484 ~a 485 (485) T protein:vir:24 484 SA 485 (485) T ss_pred CC Confidence 33 No 185 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=89.61 E-value=0.025 Score=29.33 Aligned_cols=449 Identities=11% Similarity=-0.028 Sum_probs=145.7 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccC-CcccHH-HHHHHh-hChHHHHHHHHHHHH Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELR-WPCFLA-TVEAMK-QDHTVSTALDTKYVF 77 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr-~~~~~~-~y~~m~-~D~~v~s~l~~Rk~~ 77 (516) ||-|+.-....-........+-..-.... ..+.....+-... +.++ .+..+. -+..+. .-...+-++.+.-.. T Consensus 1 ~~~~i~~~~~~~~~~~~~~~l~~~~~~~~--~r~~~~~~Yy~G~--~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~ 76 (485) T protein:vir:10 1 MTAPLPGQEEIEDPAIARDEMVSAFEDST--QNLKTNTSYYEAE--RRPEAIGVTVPIQMQSLLAHVGYPRLYVDSIAER 76 (485) T ss_pred CCCCCCCCCCCCCHHHHHHHHHHHHHHHH--HHHHHHHHHHhcC--CcchhcCCCCChhhhhhhhhcCcHHHHHHHHHhh Confidence 44443333222111110000000000000 0000011111100 1111 000000 001111 011111111111111 Q ss_pred HhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHH-HHHHhhcceeeeEEEeecccccc--cccceeec Q lcl|NC_016071. 78 VTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSA-ATFNEYGFSIFEKVYRTESAPSK--YAGYITID 154 (516) Q Consensus 78 v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~-lda~~~G~S~~Eivw~~~~~~~~--~~g~~~~~ 154 (516) +.-.. |.+ +.++. ..+.+.+.|.+ ..|+.+...+ .+|+-||.| ++++|.-..+... .++...+. 
T Consensus 77 l~~~g--~~~--~~~~~----~~~~~~~i~~~----N~~d~~~~~~~~~a~i~G~a-y~~v~~~e~~~~~~~~~~~~~i~ 143 (485) T protein:vir:10 77 QAVEG--FRF--GDADE----ADEELWQWWQA----NNLDIEAPLGYTDAYVHGRS-YITISRPDPQIDLGWDPNTPIIR 143 (485) T ss_pred hcccc--eec--CCCch----hHHHHHHHHHh----cCHhHHHHHHHHHHhhcCce-EEEEeeCCcccccccCCCeeEEE Confidence 10111 222 12222 23344455542 2366666654 468899988 5688865433211 11222221 Q ss_pred cccccCchhcccccceeecC-CCceeeeccccc-------ccccccccccccc----ccccccccccccCCCcccccccc Q lcl|NC_016071. 155 KIAFRPQSSLSRSKPWVFDE-DGRTLKGIYQSK-------MAFANFQNGLTQI----SSAMSLVTNLTSSADEVFIPINK 222 (516) Q Consensus 155 ~l~~r~q~ti~~~~~f~~~~-dg~~l~~~~q~~-------~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~iP~~k 222 (516) -+ ++..+. -.||+ .++....++-.. .....|....+.. ...+.. .......++..- T Consensus 144 ~~---~p~~~~----~~~D~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~-----~~~~~~~~g~vP 211 (485) T protein:vir:10 144 VE---PPTRMY----AEIDPRIGRVSKAIRVAYDAEGNEIQAATLYTPNDIFGWYRVENEWQE-----WFNNPHGLGVVP 211 (485) T ss_pred EE---ccceeE----EEEcCCCCceeEEEEEEEeeCCCeEEEEEEEeCCeEEEEEEcCCceEE-----eccccCCCCccc Confidence 11 111110 11221 111111111000 0000011000000 000000 001112234445 Q ss_pred EEEEeecCcCCccccchhHHHHHHHHHHH-HHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHH Q lcl|NC_016071. 223 LMVMSLGGTESNPAGVSPLVGCYRAFREK-ILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADA 301 (516) Q Consensus 223 ~i~~~~~~~~g~p~G~gLlr~~~~~~~fK-~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~ 301 (516) ++.|.++.+.+.|+|.|-+..-....+-. +..+...+...+-+..|..++++... .+-...+..... .. 
T Consensus 212 vv~~~n~~~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~------~~~~~~~~~~~~-~~--- 281 (485) T protein:vir:10 212 VVPIPNRTRLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKP------EEIGVDPETGQT-LF--- 281 (485) T ss_pred EEEeccccccCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHhhcchHHHHhcCCc------ccccccccccch-hh--- Confidence 57788888888899988765322222111 22333444566667666665554210 000000000000 00 Q ss_pred HHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhccccccc-----CCccch-hhHH Q lcl|NC_016071. 302 ANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLG-----NDGQGS-YNLS 375 (516) Q Consensus 302 ~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~-----~~~~GS-~Al~ 375 (516) .....+-..+| +.+.. +...+.+ +...++++++.-|-+..-.-.++.. ..+..| -|+- T Consensus 282 ---~~~~~~i~~~~-~~d~k--------~~q~~~~----~~~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~Sg~Al~ 345 (485) T protein:vir:10 282 ---DAYLARILAFE-DAEGK--------IQQFSAA----ELANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIR 345 (485) T ss_pred ---hhcccceeccC-CCCce--------EEeeccc----chHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHH Confidence 11111222223 22222 2222222 1233455554444333211111110 011122 2332 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-CCcCCc-cccceEEecCcCchhHHHHHHHHHHHHhCCc-ccccHH Q lcl|NC_016071. 376 ESKQSIHGHFVQRDIDIIVEAFNKNLIPQLLALN-DIRLSD-EDMPKLKPGLIQEVDMEGFSKFVQRIGAVGY-LPKTPT 452 (516) Q Consensus 376 ~vh~ev~~~~~~aDa~~i~~~ln~~li~~lv~lN-~~~~~~-~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~-~~~~~~ 452 (516) -. ..-....++.-.+.+...|+ ++++.++.+. ....+. ..--.+.|....+.++.+.|+++.+|++.|. 
+++ T Consensus 346 ~~-~~~l~~k~~~k~~~f~~~l~-~~~~l~~~~~~~~~~~~~~~~i~v~w~~~~~~~~~~~ada~~kl~~ag~~~~s--- 420 (485) T protein:vir:10 346 AA-ESRLIKKVERKNSIFGGAWE-EAMRLAYRMMKGGDVPPDMLRMETVWRDPSTPTYAAKADAASKLYNGGTGVIP--- 420 (485) T ss_pred HH-HHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhCCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCC--- Confidence 21 22222223334444455564 3555555543 211111 1112567888889999999999999999883 232 Q ss_pred HHHHHHHHcCCCCCCCccccc-CcccccCC----CCCCcccccccccCCCCCcccc---cccccch Q lcl|NC_016071. 453 VINKILEVGGFDEEIPEDMST-DELLKLLG----QDTSRSGDGMTAGSNGNGTGKI---SSTRDNS 510 (516) Q Consensus 453 ~~~~i~e~~Glp~~~~~~~~~-~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~---~~~~d~~ 510 (516) .+-+++.+|+.+..-++... .......+ +.....+++...+......+.+ ..+.|-+ T Consensus 421 -~et~~~~lg~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 485 (485) T protein:vir:10 421 -RERARKDMGYSIAEREEMRRWDEEEAAMGLGLIGTMVDPNPTVPGSPSPAPAPKPAALESGGDAA 485 (485) T ss_pred -HHHHHHhCCCCHhHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCCCCccccccCcCCCCCCCCC Confidence 35667888886432111100 00000000 0000000000000000000000 0111111 No 186 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=89.32 E-value=0.027 Score=29.18 Aligned_cols=415 Identities=10% Similarity=0.013 Sum_probs=160.5 Q ss_pred CCccccCcccccchhhhcccCCCCc-ccccchHHHHHHHHHHHh--hcccccCCcccHHHHH----------------HH Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLR-TGELGSGALSQLRAESEV--MKVEELRWPCFLATVE----------------AM 61 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~-~~e~g~~~~~~~~~~~~~--~~~~~lr~~~~~~~y~----------------~m 61 (516) |.-. -|.+=-+. -.++- ...+..++.. .+.+.+ -+..+.|+ .. 
T Consensus 1 ~~~~-------------~~~~~~~~~~~~~~---~~~i~~~i~~~~~~~~r~--~~~~~Yy~g~~~i~~~~~~~~~~~~~ 62 (452) T protein:vir:36 1 MKYK-------------PPKLMTFSKDEPIT---VEVVTKFMEKHKLEVARY--EYLKNMYLGIMAIDDEPAKDSWKPDN 62 (452) T ss_pred Cccc-------------CceeEEcCCccCCC---HHHHHHHHHHHHHHHHHH--HHHHHHhccccccccCccccccCccc Confidence 2111 11100000 01110 0111111111 001110 00111111 00 Q ss_pred h-hChHHHHHHHHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHH-HHHHhhcceeeeEEEe Q lcl|NC_016071. 62 K-QDHTVSTALDTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSA-ATFNEYGFSIFEKVYR 139 (516) Q Consensus 62 ~-~D~~v~s~l~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~-lda~~~G~S~~Eivw~ 139 (516) + ..+...-++.+....+.+-+..+.+. +.+.-+++.+++++- .|...+..+ .++.-+|.+ ++.+|. T Consensus 63 ki~~n~~~~ivd~~~~~l~g~~~~~~~~-------d~~~~~~l~~~~~~n----~~~~~~~~~~~~~~~~G~~-~~~v~~ 130 (452) T protein:vir:36 63 RLAVNFTKYIVDTFTGYFNGIPVKKSHS-------DKEILTKLQEFDNLN----DMEDEESELAKMACIYGRA-FEFLYQ 130 (452) T ss_pred eeecchHHHHHHHHhhhhcccCceeecC-------ChhHHHHHHHHHhhc----ChhHHHHHHHHHHHhcCeE-EEEEEe Confidence 0 13445555555555555665555431 223445677766542 255555554 468889975 467775 Q ss_pred ecccccccccceeeccccccCchhcccccceeecCC--CceeeeccccccccccccccccccccccccccccccC----- Q lcl|NC_016071. 140 TESAPSKYAGYITIDKIAFRPQSSLSRSKPWVFDED--GRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSS----- 212 (516) Q Consensus 140 ~~~~~~~~~g~~~~~~l~~r~q~ti~~~~~f~~~~d--g~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~----- 212 (516) -. +|.+.+..+.++. +. -.|++. ...+..++-..... ...+..+-.+...+...... 
T Consensus 131 d~------~g~~~i~~~~p~~---~~----~v~d~~~~~~~~~~i~~~~~~~---~~~~~~vyt~~~i~~~~~~~~~~~~ 194 (452) T protein:vir:36 131 DE------DTQTNVVYNSPEN---MF----MVYDDTVKQEPLFAVRYGVDED---KKLQGEVYTLLETIKISGENDEISF 194 (452) T ss_pred cC------CCeeEEEEEcccc---eE----EEEcCCCCCceEEEEEEEEecC---ceEEEEEEecCeEEEEEEcCCceEE Confidence 33 3334333332221 10 122221 22222221100000 00000011111111110000 Q ss_pred CCccccccc--cEEEEeecCcCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHH Q lcl|NC_016071. 213 ADEVFIPIN--KLMVMSLGGTESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPE 290 (516) Q Consensus 213 ~~~~~iP~~--k~i~~~~~~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~ 290 (516) ..+.+-+.. -++.| .+|+.|.|.+..+--..=--...+..++..++.+..|..++++.. ...++ T Consensus 195 ~~~~~~~~g~iPvv~~-----~n~~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~p~~~~~g~~---------~~~~~ 260 (452) T protein:vir:36 195 GEGTYNPYPDLPVVEF-----YFNEERMSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFLGAA---------VEEED 260 (452) T ss_pred ecceeccCCcccEEEe-----cCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCC---------cCchh Confidence 011111222 23333 335678888876443222223455667888888888888876521 11111 Q ss_pred HHHHHHHHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccc Q lcl|NC_016071. 291 SEMVQGLMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQG 370 (516) Q Consensus 291 ~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~G 370 (516) ...+. . ...+.++.+.. .....++++..+. ....+...++.+.+.|...--+..++.+..+.. 
T Consensus 261 ---~~~~~-------~--~~~~~~~~~~~---~~~~~~~~l~~~~--~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~ 323 (452) T protein:vir:36 261 ---LKNIR-------S--NRVINYYADGE---GKNVDVKFLEKPD--SDSQTENLLDRLTKLIFQTTMVANISDESFGSS 323 (452) T ss_pred ---hhhhh-------h--cceEEecCCCC---ccCCcceeEeecC--CHHHHHHHHHHHHHHHHHHhCccccCcccccCC Confidence 11111 0 11234443321 1112355554432 233467788888888877655544444333222 Q ss_pred hh-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccc--cceEEecCcCchhHHHHHHHHHHHHhCCcc Q lcl|NC_016071. 371 SY-NLSESKQSIHGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDED--MPKLKPGLIQEVDMEGFSKFVQRIGAVGYL 447 (516) Q Consensus 371 S~-Al~~vh~ev~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~--~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~ 447 (516) |. |+.. ...-....+..-.+.+...|. ++++.++.+....+.... -..+.|....+.|..++++++.+++ |++ T Consensus 324 Sg~Al~~-~~~~l~~k~~~~~~~~~~~l~-~~~~li~~~~~~~~~~~~~~~i~i~f~~~~p~d~~~~a~~~~k~~--g~i 399 (452) T protein:vir:36 324 SGVSLAY-KLQAMSNLALSFQRKFQSSLN-SRYKLFCELSTNVSNKDSWKDIEYTFTRNEPKDIKEQAETANILM--GIT 399 (452) T ss_pred cHHHHHH-HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhccCCccccccceEEeCCCCCcCHHHHHHHHHHHh--ccC Confidence 21 2211 111111222223344455553 466665654322222212 2468888889999999999999984 543 Q ss_pred cccHHHHHHHHHHcCCC-CCCCcccccCcccc---cCCCCCCcccccccccCCCCCccc Q lcl|NC_016071. 448 PKTPTVINKILEVGGFD-EEIPEDMSTDELLK---LLGQDTSRSGDGMTAGSNGNGTGK 502 (516) Q Consensus 448 ~~~~~~~~~i~e~~Glp-~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~ 502 (516) . .+.+.+.++.- .+..+-+-...+.. ...+......++.. 
...+....+ T Consensus 400 S-----~et~~~~~~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~-~~~~~~~~e 452 (452) T protein:vir:36 400 S-----QETALSVISVIPDVQAEMEKIKKEEASTAIFDKDKQPSEKGTD-TVVSETNEE 452 (452) T ss_pred C-----hHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhccCCCCccc-ccCccccCC Confidence 2 35566777643 22111000000000 00001111111100 000111111 No 187 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=89.10 E-value=0.028 Score=29.07 Aligned_cols=387 Identities=11% Similarity=-0.034 Sum_probs=138.9 Q ss_pred ccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhhChHHHHHHHHHHHHHhcCCceeeeCCCCCChhhHH Q lcl|NC_016071. 19 LAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQDHTVSTALDTKYVFVTKAFNDFKVLYNRDSKASKD 98 (516) Q Consensus 19 p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~~v~~~~w~i~~~~~~d~~~~~~ 98 (516) .++...| +.....+-... +.++. -.+.+-+++..+- .+++.--++.|..+.=++.++.-. .++.+ T Consensus 1 l~~~~~r--------~~~~~~yY~g~--~~~~~-~~~~~p~~~~~~~--~~v~nw~~~~Vds~a~rl~~~Gf~--~~d~~ 65 (410) T protein:vir:95 1 MNLYQSR--------VNLRYKHYAMQ--HYEAP-TGITIPAHIRAKY--QAVLGWAAKGVDSLADRLIFRAFA--NDDFN 65 (410) T ss_pred CCcchhh--------HHHHHHHhcCC--CCccc-cchhccHHHHhHH--HhhcchhHHHHHHhHhhhcccccc--CCCch Confidence 1111111 11111121111 11110 1122223332111 122222233333322122222111 11112 Q ss_pred HHHHHHHHHhhccCcCCHHHHHHH-HHHHHhhcceeeeEEEeecccccc------cccceeeccccccCchhccccccee Q lcl|NC_016071. 99 AAEFVEYALKNLANQQTLRDIARS-AATFNEYGFSIFEKVYRTESAPSK------YAGYITIDKIAFRPQSSLSRSKPWV 171 (516) Q Consensus 99 ~a~~v~~~l~~~~~~~~~~~~l~~-~lda~~~G~S~~Eivw~~~~~~~~------~~g~~~~~~l~~r~q~ti~~~~~f~ 171 (516) +.++|+. ..|.....+ ..+|+-||.|+. .||.-..+.-. .+....++....++..-++ .+. 
T Consensus 66 ----l~~i~~~----N~ld~~~~~~~~~al~~G~sf~-~v~~~~d~~~~i~~~sP~~~~~i~Dp~~~~~~~al~---~~~ 133 (410) T protein:vir:95 66 ----VTEIFDR----NNPDIFFDSAILSALIGSCSFV-YISKGEDDEVRLQVIESSNATGVIDPITGLLVEGYA---VLA 133 (410) T ss_pred ----HHHHHhh----cChHHHHHHHHHHHHHhCceeE-EEecCCCCceEEEEEcccceEEEEeCCCCceEEEEE---EEE Confidence 3444432 236555555 447999999754 78864322100 0111112221111111110 111 Q ss_pred ecCCCceeeeccccccccccccccccccccccccccccccCC--CccccccccEEEEeecCcCCccccchhH-HHHHHH- Q lcl|NC_016071. 172 FDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSA--DEVFIPINKLMVMSLGGTESNPAGVSPL-VGCYRA- 247 (516) Q Consensus 172 ~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~iP~~k~i~~~~~~~~g~p~G~gLl-r~~~~~- 247 (516) -+++|.... . ..+..+.+ ......+.. ..-..+..-+|.|.++++-+.|+|.|-+ +.+-.. T Consensus 134 ~~~~~~~~~-~-------~~~~~~~~-------~~~~~~~~~~~~~~~~g~vPvV~f~n~~~l~~~~G~s~I~~~v~~l~ 198 (410) T protein:vir:95 134 RDDYNRPTL-E-------AYFEPNAT-------HFIPKDGEPYSVTNETGIPLLVPVIHRPDAVRPFGRSRITRAGMYYQ 198 (410) T ss_pred ecCCCeEEE-E-------EEEeCCcE-------EEEeeCCccccccCCCCCcceEEecccccCCccCCccccchhHHHHH Confidence 122221100 0 00000000 000000000 0112344456788888888899998843 433221 Q ss_pred -HHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEeccCccccccccc Q lcl|NC_016071. 248 -FREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILPSDMNAQGGEQY 326 (516) Q Consensus 248 -~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~ 326 (516) -+-|. +-.-+..+|=+.+|-.++.|. ...+ ... ... .. 
....-..+|++.+-+ T Consensus 199 da~~r~--~~~~~~~~e~~a~pqr~i~G~------d~d~-~~~--~~~---~~-------~~~~i~~~~~~~~~~----- 252 (410) T protein:vir:95 199 KYAKRT--LERADITAEFYSWPQKYILGL------DPDA-EPM--EKW---KA-------TVSSLLTISSSDKGV----- 252 (410) T ss_pred HHHHHH--HHHHHHHHHHhcchhheeecc------CCCC-CcC--chh---hh-------hhhhheeccCCCCCC----- Confidence 11121 112223344455555555432 1111 111 111 11 112234566653322 Q ss_pred ceeeeeccccCcchhHHHHHHHHHHHHHHHHhccc----ccccCCccchh-hHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016071. 327 KMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGF----INLGNDGQGSY-NLSESKQSIHGHFVQRDIDIIVEAFNKNL 401 (516) Q Consensus 327 ~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqt----Lts~~~~~GS~-Al~~vh~ev~~~~~~aDa~~i~~~ln~~l 401 (516) ..++.+.++. +...|...++-+-.+||-. .++ |....++.+|- |+...+..+. ..++.-.+.+...+. ++ T Consensus 253 ~~~v~q~~~~-~l~~~~~~l~~l~~~~a~~--s~lP~~~lg~~~~NpsSa~Al~a~~~~L~-~ka~~k~~~fg~~l~-~~ 327 (410) T protein:vir:95 253 KPSVGQFTTA-SMSPFTEQLRTAAAGFAGE--MGLTLDDLGFVSDNPSSVEAIKASHENLR-LAGRKAQRSLGAGLL-NV 327 (410) T ss_pred cceEEecCCC-ChHHHHHHHHHHHHHHhhh--cCCCHHHhccccCchhHHHHHHHHHHHHH-HHHHHHHHHHHHHHH-HH Confidence 1223222222 2223433333333333322 111 10001122332 4443333322 223333455566664 46 Q ss_pred HHHHHHhcCCcC--Cccc-cceEEec---CcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccccCc Q lcl|NC_016071. 402 IPQLLALNDIRL--SDED-MPKLKPG---LIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMSTDE 475 (516) Q Consensus 402 i~~lv~lN~~~~--~~~~-~P~~~~~---~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~~~ 475 (516) ++..+.+=.... +... -..+.+. ..+...+.+.|+++.||+.+|--+.. .+-+++.+|+.++.... .... T Consensus 328 ~rla~~i~~~~~~~~~~~~~~~v~W~p~~d~~~~s~a~~aDa~~Kl~~a~~g~~~---~~~~~~~lg~~~~~~~~-~~~~ 403 (410) T protein:vir:95 328 AYVAACLRDEFRYTRSQFVRTAVKWEPLFEADANTMTMIGDGVVKLNQALPGYIN---AETIRDLTGIAGDMSAK-PVVS 403 (410) T ss_pred HHHHHHHhcCCCCcccccceeeEEeeecCCcchhhHHHHHHHHHHHHHhccCCcc---HHHHHHhcCCChHHHHH-HHHH Confidence 666555522111 1111 1234454 45556678889999999998522221 35688999997432111 0000 Q ss_pred ccccCCC Q lcl|NC_016071. 
476 LLKLLGQ 482 (516) Q Consensus 476 ~~~~~~~ 482 (516) .....++ T Consensus 404 e~~~~g~ 410 (410) T protein:vir:95 404 EGGSNGE 410 (410) T ss_pred HHHhCCC Confidence 1111111 No 188 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=88.97 E-value=0.029 Score=29.01 Aligned_cols=417 Identities=10% Similarity=-0.006 Sum_probs=156.3 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHh--hcccccCCcccHHHHH------------HHh---- Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEV--MKVEELRWPCFLATVE------------AMK---- 62 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~--~~~~~lr~~~~~~~y~------------~m~---- 62 (516) |.-.. -.+ -..|.-..+... .+..++.. .+.+ |.-+..+.|+ ... T Consensus 1 ~~~~~---~~~----~~~p~d~~~~~~--------~l~~~i~~~~~~~~--r~~~~~~yy~g~~~i~~~~~~~~~~~~~k 63 (453) T protein:vir:39 1 MKYKP---PKL----MTFPKDEPITNE--------VVTKFMEKHRLEVA--RYEYLKNMYRGIMAIDAEPTKDLWKPDNR 63 (453) T ss_pred CeecC---Ccc----eEcCCCCCCCHH--------HHHHHHHHHHHHHH--HHHHHHHHhhccCchhcCCCccccCccce Confidence 11000 000 001110000000 00111000 0000 0000111111 000 Q ss_pred -hChHHHHHHHHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHH-HHHHHhhcceeeeEEEee Q lcl|NC_016071. 
63 -QDHTVSTALDTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARS-AATFNEYGFSIFEKVYRT 140 (516) Q Consensus 63 -~D~~v~s~l~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~-~lda~~~G~S~~Eivw~~ 140 (516) ..+...-++.+....+.+-+..+++ .+++..+.+.++|.+- .|...+.+ ..++.-||.+ ++.+|.- T Consensus 64 i~~n~~~~ivd~~~~~l~g~~~~~~~-------~d~~~~~~l~~i~~~N----~~~~~~~~~~~~~~~~G~~-~~~v~~d 131 (453) T protein:vir:39 64 LTVNFTKYIVDTFTGYFNGIPVKKSH-------SDKETLSKLQEFDNLN----DMEDEESELAKMACIYGRA-FELLYQN 131 (453) T ss_pred eecchHHHHHHHHhhhhcccCceecc-------CChHHHHHHHHHHHhc----ChhHHHHHHHHHHhhcCeE-EEEEEec Confidence 1233334444444444444443332 1234456677777642 25555554 4578889975 4677753 Q ss_pred cccccccccceeeccccccCchhcccccceeecCC--CceeeeccccccccccccccccccccccccccccccC-----C Q lcl|NC_016071. 141 ESAPSKYAGYITIDKIAFRPQSSLSRSKPWVFDED--GRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSS-----A 213 (516) Q Consensus 141 ~~~~~~~~g~~~~~~l~~r~q~ti~~~~~f~~~~d--g~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~ 213 (516) . +|.+.+..+.++. +. ..|++. ...+..++-....... .+..+-.+..+....... . T Consensus 132 ~------~g~~~i~~~~p~~---~~----~v~d~~~~~~~~~~ir~~~~~~~~---~~~~~yt~~~i~~~~~~~~~~~~~ 195 (453) T protein:vir:39 132 E------ETQTNVIYNTPEN---MF----MVYDDTIKQEPLFAVRYGYDDDYK---LYGEVYTKETTYALNGTMGFYNMT 195 (453) T ss_pred C------CCceEEEEEcccc---eE----EEecCCCCCeEEEEEEEEEeCCeE---EEEEEEeCCeEEEEEecCCceeee Confidence 3 2333333222221 10 112211 1111111110000000 000000000110000000 0 Q ss_pred Ccc--ccccccEEEEeecCcCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHH Q lcl|NC_016071. 214 DEV--FIPINKLMVMSLGGTESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPES 291 (516) Q Consensus 214 ~~~--~iP~~k~i~~~~~~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~ 291 (516) +.. .++..-++.|. +++.|.|.+..+--..=--+..+..++..++.+..|..++++... 
+ .++ T Consensus 196 ~~~~~~~g~vPvv~~~-----n~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~p~~~~~g~~~--------~-~~~- 260 (453) T protein:vir:39 196 EQAPNPFDDLPVVEFY-----FNEERMSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFLGAAV--------E-EED- 260 (453) T ss_pred cccccCCCceeEEEec-----CCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCC--------C-chh- Confidence 011 11111223332 367788988765433223344666777788888888887775311 1 111 Q ss_pred HHHHHHHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccch Q lcl|NC_016071. 292 EMVQGLMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGS 371 (516) Q Consensus 292 ~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS 371 (516) ... +..+ ....++.+.. -.+..+++++..+.. ...+...++.+.+.|...--...++.+..+..| T Consensus 261 --~~~---~~~~------~~~~~~~~~~--~~~~~~~~~lt~~~~--~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~S 325 (453) T protein:vir:39 261 --LKN---IRSN------RVINYYGESS--EAKNVDVKFLEKPDS--DSQTENLLDRLTKLIFQTTMVANISDESFGSSS 325 (453) T ss_pred --hhh---hhhc------ceeeecCCCC--CCCCCceeEEeecCC--HHHHHHHHHHHHHHHHHHhCCcccccccccCCh Confidence 111 1110 1122222211 112234566654332 334677888888888775444444443322222 Q ss_pred h-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCcc--ccceEEecCcCchhHHHHHHHHHHHHhCCccc Q lcl|NC_016071. 372 Y-NLSESKQSIHGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDE--DMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLP 448 (516) Q Consensus 372 ~-Al~~vh~ev~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~--~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~ 448 (516) . |+.. ...-....+..-.+.+...|. ++++.++.+....+... .-..+.|....+.|+.+.++++.+|. 
|++ T Consensus 326 g~Al~~-~~~~l~~ka~~~~~~~~~~l~-~~~~li~~~~~~~~~~~~~~~i~v~f~~~~p~~~~~~a~~~~kl~--g~i- 400 (453) T protein:vir:39 326 GVSLAY-KLQAMSNLALSFQRKFQSSLN-SRYKLYCELSTNVSNKEAWKDIEYTFTRNEPKDIKEQAETANILM--GIT- 400 (453) T ss_pred HHHHHH-HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhccCCccccccceEEeCCCCCcCHHHHHHHHHHHh--ccC- Confidence 1 2211 111112223333344455553 35565555432222111 12367888889999999999999984 543 Q ss_pred ccHHHHHHHHHHcCC-CCCCCcccccCcccc---cCCCCCCcccccccccCCCCCccc Q lcl|NC_016071. 449 KTPTVINKILEVGGF-DEEIPEDMSTDELLK---LLGQDTSRSGDGMTAGSNGNGTGK 502 (516) Q Consensus 449 ~~~~~~~~i~e~~Gl-p~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~ 502 (516) + .+.+.+.++. +.+..+-+-...+.. ...+......++....++.. ..+ T Consensus 401 s----~et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~-~~e 453 (453) T protein:vir:39 401 S----QETALSVISVIPDVQAEMEKIKKEEASTAIFDKDKQPSEKGTDTVVPET-NEE 453 (453) T ss_pred C----hHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhccCCCCCCCCCCCCc-CCC Confidence 3 3556677764 322111000000100 01111111112111111111 111 No 189 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=88.90 E-value=0.03 Score=28.97 Aligned_cols=438 Identities=10% Similarity=0.033 Sum_probs=156.0 Q ss_pred CCccccCcccccchhh----hcccC------CCCcccccchHHHHHHHHHHHhhcccccCCcccH-HH-HHHHhhChHHH Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGN----ENLAV------SRLRTGELGSGALSQLRAESEVMKVEELRWPCFL-AT-VEAMKQDHTVS 68 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~----~~p~~------~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~-~~-y~~m~~D~~v~ 68 (516) |-+|.+.. .+++- .+.++ +.+.+..-....+..+..+- ....+.+ +.+.. .. ..+.+.-.-+. 
T Consensus 3 ~~~~~k~~---~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~ri~~~~~~y-~g~~~~~-~~~~~~~~~~~~~~~sln~~ 77 (508) T protein:vir:15 3 LIQRIKDL---FWKGAAATGVTGSLSKITDDPRISIDPDEYVRIQTDLDYY-SDKLQYI-HYQASDGIKKKRLKNTINMA 77 (508) T ss_pred hHHHHHHH---HHHHHHHhccccchHHhhcccccccCHHHHHHHHHHHHHh-cCCCccc-ccccCCCCccccceeecchH Confidence 44444322 11110 00110 11111100000111222221 1111111 00000 00 00000000122 Q ss_pred HHHHH-HHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHH-HHHHHHHhhcceeeeEEEeecccccc Q lcl|NC_016071. 69 TALDT-KYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIA-RSAATFNEYGFSIFEKVYRTESAPSK 146 (516) Q Consensus 69 s~l~~-Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l-~~~lda~~~G~S~~Eivw~~~~~~~~ 146 (516) ..+-+ --..|.+-.-.|++.. + ....+++.+.+++- .|...+ ..+.++..+|=.++=+.|.... T Consensus 78 ~~i~~~~A~lv~~e~~~i~v~~---~---~~~~e~l~~il~~n----~f~~~~~~~~e~a~a~G~~~~k~~~d~~~---- 143 (508) T protein:vir:15 78 KTAARRIASVVFNEKAEIHVKD---N---NEADKFLNDVLEDN----DFKNKFEEALEKGVALGGFAMRPYIDGNH---- 143 (508) T ss_pred HHHHHHHHhhhhCCCceEEeCC---c---hHHHHHHHHHHHhc----cHHHHHHHHHHHHhhcCceEEEEEEeCCe---- Confidence 22222 2223333333555431 1 12335677777642 244444 4556799999988877775322 Q ss_pred cccceeeccccccCchhcccccceeecCCCc------------------eeeecccccc------ccc--cccc-ccccc Q lcl|NC_016071. 147 YAGYITIDKIAFRPQSSLSRSKPWVFDEDGR------------------TLKGIYQSKM------AFA--NFQN-GLTQI 199 (516) Q Consensus 147 ~~g~~~~~~l~~r~q~ti~~~~~f~~~~dg~------------------~l~~~~q~~~------~~~--~~~~-~~~~~ 199 (516) +.+..+ ++..+- +..++.++. ..+.++-|.. ... -+.. ..... T Consensus 144 ----~~i~~v---~ad~~~---P~~~d~~~~~~~af~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~l 213 (508) T protein:vir:15 144 ----IKIAWV---RADQFY---PLQSNTNDISEAAIASRTQRTESNQTKYYTLLEFHQWQDNGSYQITNELYKSDSPDIV 213 (508) T ss_pred ----eEEEEE---cCCeeE---EEEEcCCCeEEEEEEEEEEeecCCCceEEEEEEEEEEecCcceEEEEEEEecCCchhc Confidence 111111 111000 001111110 0000000000 000 0000 00000 Q ss_pred ccccccccc--cccCCCccc---cccccEEEEee----cCcCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccce Q lcl|NC_016071. 
200 SSAMSLVTN--LTSSADEVF---IPINKLMVMSL----GGTESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGII 270 (516) Q Consensus 200 ~~~~~~~~~--~~~~~~~~~---iP~~k~i~~~~----~~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~ 270 (516) +.++.+-.. ..+..+.+. ++.--|++++. ....++|+|.|.+..|.-..-.=+..+..|+.-+ |. +=+ T Consensus 214 G~~v~l~~~~e~~~l~~~~~~~g~~~p~f~y~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~-~~--~~~ 290 (508) T protein:vir:15 214 GNQVPLSTLPVYKELAPQVTISGLQRPLFAYFKTPGANNINIESPLGLGVVDNAKHVLDDINDTHDQFIWEI-RL--GQK 290 (508) T ss_pred CcccchhhcccccCCCcceEecCCCcceeEEecCCccccccCCCCcCCchHhhhHHHHHHHHHHHHHHHHHH-Hh--ccc Confidence 111111000 000001111 11112344432 2345789999999988743333333333343333 22 223 Q ss_pred eeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHH Q lcl|NC_016071. 271 ELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRK 350 (516) Q Consensus 271 v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d 350 (516) -+++|...+.....+.. . +..+......++.+.+ ....|+.+...= ....|.+.++.+- T Consensus 291 ~i~v~~~~l~~d~~~~~-----~----------~~~~~~~~~~~~~~~~----~~~~i~~~~~~i--r~e~~~~~~~~~l 349 (508) T protein:vir:15 291 HIAVQPGMLRFDDEHKP-----T----------FDTEQNVYVGVLSDDN----NGLGVKDMTTPI--RTVQYKDAIDHFI 349 (508) T ss_pred ceeechHHhcCCCCCcc-----c----------cCCCCeeEEeccCCCC----CCCceeEeeccc--ChHHHHHHHHHHH Confidence 44556555543322110 0 0112222333332211 011122222110 1112445555555 Q ss_pred HHHHHHH-hcccccccCCccchhhHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---cCCcCCc--------- Q lcl|NC_016071. 351 KAILDRF-GAGFINLGNDGQGSYNLSESK--QSIHGHFVQRDIDIIVEAFNKNLIPQLLAL---NDIRLSD--------- 415 (516) Q Consensus 351 ~~Isk~i-LGqtLts~~~~~GS~Al~~vh--~ev~~~~~~aDa~~i~~~ln~~li~~lv~l---N~~~~~~--------- 415 (516) +.|...+ ++. -|.+-++.|....-++. ..-...-+..-.+.+..+|. 
+|++-++.+ +.....+ T Consensus 350 ~~~~~~~gls~-~~f~~~~~~~~TAtei~s~~~~~~~t~~~~~~~~~~al~-~lv~~il~l~~~~~~~~~g~~~~~~~~~ 427 (508) T protein:vir:15 350 KEFEVQIGLST-GTFSYSNDGVKTATEVVSNNSMTYQTRSSYLTMVEKAID-ELCQSIFELANAGALFDDGKPLFTLDSA 427 (508) T ss_pred HHHHHHhCCCc-hhcccccCccccHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhccccccccccccccc Confidence 5555544 332 12222222221111221 11122223445556666664 566655432 2111111 Q ss_pred --cccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccccCcccccCCCCCCcc-ccccc Q lcl|NC_016071. 416 --EDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMSTDELLKLLGQDTSRS-GDGMT 492 (516) Q Consensus 416 --~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~~~~~~~~~~~~~~~-~~~~~ 492 (516) ..-+.|.|+..-.+|.++.++.+.+++.+|++.. +.++.+.||+++.+-+++. ++..+..+...... .-+.. T Consensus 428 ~~~~~v~v~f~D~i~~d~~~~~~~~~~~v~aGi~s~----e~~i~~~~g~~deea~~el-~ri~~E~~~~~~~~~~~~~~ 502 (508) T protein:vir:15 428 SQPLDIECHFDDGVFVNKDKQLEEDAKVLAIGALSK----QTFLQRNYGMTDEQAAEEL-AKIQSEAPTDTFEGGRSAIL 502 (508) T ss_pred cCCcceEEEeCCCCCCCHHHHHHHHHHHHhcCCCCH----HHHHHhcCCCChHHHHHHH-HHHHHhccccCccccccccC Confidence 1124678888888888888889999999998664 6788889998753222221 11111111111111 11222 Q ss_pred ccCCCC Q lcl|NC_016071. 493 AGSNGN 498 (516) Q Consensus 493 ~~~~~~ 498 (516) .+..|+ T Consensus 503 ~g~~ge 508 (508) T protein:vir:15 503 NGGDGE 508 (508) T ss_pred CCCCCC Confidence 233333 No 190 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=88.60 E-value=0.031 Score=28.83 Aligned_cols=443 Identities=9% Similarity=0.040 Sum_probs=161.5 Q ss_pred CCccccCccccc------chhhhcccCCCCccc--ccchHHHHHHHHHHHhhcccccCCcccHH-HH-HHHhhChHHHHH Q lcl|NC_016071. 
1 MSTRFAQPSEVV------KAGNENLAVSRLRTG--ELGSGALSQLRAESEVMKVEELRWPCFLA-TV-EAMKQDHTVSTA 70 (516) Q Consensus 1 ~~~r~~~~~~~~------~~~~~~p~~~~~~~~--e~g~~~~~~~~~~~~~~~~~~lr~~~~~~-~y-~~m~~D~~v~s~ 70 (516) |-.|++.-=+.. +.-+..-..+.+-.. ++.+ +..+..+- .-..+.++....-. +. +.+. ---+... T Consensus 3 ~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~--i~~~~~~Y-~g~~~~~~~~~~~~~~~~~~~~-slnl~~~ 78 (500) T protein:vir:98 3 VIQKIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDR--ITTNLKYY-KSDWDSVLYLNTDGETKKRDLN-HLPIART 78 (500) T ss_pred hHHHHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHH--HHHHHHHh-cCCCCCcccccCCCCcccCcee-ecchHHH Confidence 666666542211 111111001111111 1111 11222221 11122221100000 00 0000 0011122 Q ss_pred H-HHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHH-HHHHHhhcceeeeEEEeecccccccc Q lcl|NC_016071. 71 L-DTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARS-AATFNEYGFSIFEKVYRTESAPSKYA 148 (516) Q Consensus 71 l-~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~-~lda~~~G~S~~Eivw~~~~~~~~~~ 148 (516) + .+--..|.+-.-.|++. +.+..+++++++++-. |...+.. +..+..+|=.++=+.|.... T Consensus 79 i~~~~A~lv~~e~~~i~~~-------d~~~~~~l~~il~~n~----f~~~~~~~~e~a~a~G~~~~k~~~d~~~------ 141 (500) T protein:vir:98 79 AAKKIASLVFNEQAEIKVD-------DDAANEFISETLKNDR----FNKNFERYLESCLALGGLAMRPYVDGDK------ 141 (500) T ss_pred HHHHHhhhhcCCcceEecC-------ChHHHHHHHHHHhhcc----HHHHHHHHHHHHhhcCCEEEEEEEeCCc------ Confidence 2 22222343333344442 2456778888887532 5555544 45688899888877776321 Q ss_pred cceeeccccccCchhcccccceeecCCCcee------------------eecccccc------cccc--cccc-cccccc Q lcl|NC_016071. 149 GYITIDKIAFRPQSSLSRSKPWVFDEDGRTL------------------KGIYQSKM------AFAN--FQNG-LTQISS 201 (516) Q Consensus 149 g~~~~~~l~~r~q~ti~~~~~f~~~~dg~~l------------------~~~~q~~~------~~~~--~~~~-~~~~~~ 201 (516) +.+.. .++..+- ++.++.++... +.++-|.. ...+ +... ....+. 
T Consensus 142 --~~I~~---v~ad~~~---P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~ 213 (500) T protein:vir:98 142 --VRVAF---VQAPVFL---PLQSNTQDVSSAAVVIKSVKTINGKEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKVGS 213 (500) T ss_pred --eEEEE---EcCCeeE---EEEEcCCCeEEEEEEEEEeeeecCCceEEEEEEEEEEeCCceeEEEEEEEecccccccCc Confidence 11111 1111110 01112111111 00000000 0000 0000 000011 Q ss_pred ccccccccccCCCcc---ccccccEEEEe----ecCcCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeee Q lcl|NC_016071. 202 AMSLVTNLTSSADEV---FIPINKLMVMS----LGGTESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKI 274 (516) Q Consensus 202 ~~~~~~~~~~~~~~~---~iP~~k~i~~~----~~~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~ 274 (516) ++.+-....+..+.. .+|.--|.+++ .....++|+|.|.+..|.-..-.=+..+.-|+.-++. +=..+++ T Consensus 214 ~v~l~~~~~~l~~~~~~~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~---g~~~i~v 290 (500) T protein:vir:98 214 RVPLSEVYKDLKDEAKVTDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKM---GQRRVAV 290 (500) T ss_pred ccccccccCCcCcceEeccCCCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHh---Ccceeee Confidence 111100001111110 11111233432 2345688999999998875444334444444433331 2223455 Q ss_pred cccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHH Q lcl|NC_016071. 275 PSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAIL 354 (516) Q Consensus 275 pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Is 354 (516) |...+.....+.+.... .... +.........++.+-+ ....++.... 
.-....|.+.++.+=++|+ T Consensus 291 ~~~~l~~~~~~~~g~~~------~~~~--~d~~~~~~~~~~~~~~----~~~~i~~~~~--~ir~e~~~~~l~~~l~~i~ 356 (500) T protein:vir:98 291 PESLTALTVRTTDGDVV------PRPR--FESDQNVYIRMGGRDL----DSSAIQDLTT--PIRADDYIKAINEGLSLFE 356 (500) T ss_pred chHHhcccCCCCCcccc------CCcc--cCCCcceEEEcCCCCC----cCcceeEecc--ccChHHHHHHHHHHHHHHH Confidence 65554433322111000 0000 0000111112221100 0001221111 1111224445555555554 Q ss_pred HH-Hhc-ccccccCCccchhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHh----c--CCcCCccccceEEecC Q lcl|NC_016071. 355 DR-FGA-GFINLGNDGQGSYNLSESKQ-SIHGHFVQRDIDIIVEAFNKNLIPQLLAL----N--DIRLSDEDMPKLKPGL 425 (516) Q Consensus 355 k~-iLG-qtLts~~~~~GS~Al~~vh~-ev~~~~~~aDa~~i~~~ln~~li~~lv~l----N--~~~~~~~~~P~~~~~~ 425 (516) .. -++ +|+..+.++ -..|..-.-. .-...-+..-.+.+..+| ++|++-++.+ + ....+...-+.+.|+. T Consensus 357 ~~~gls~~~~~~~~~g-~~TAtei~s~~~~~~~t~~~~~~~~~~al-~~lv~~il~~~~~~~~~~~~~~~~~~v~v~f~d 434 (500) T protein:vir:98 357 MQIGVSAGLFSFDGKS-MKTATEIVSENSDTYQMRNSIVALVEQSL-KELVISIFEIAKAYDLYQSEVPSMDNISISLDD 434 (500) T ss_pred HHhCCCccccccCcCc-cccHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhcCCCCCCCcceEEEeCC Confidence 33 334 333332221 1123221111 111122334445566666 3576666543 1 1111222224678888 Q ss_pred cCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccccCcccccCCCCCCcccccccccCCCC Q lcl|NC_016071. 426 IQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMSTDELLKLLGQDTSRSGDGMTAGSNGN 498 (516) Q Consensus 426 ~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 498 (516) .-.+|.++.++.+.+++.+|++.. +.++.+.||+++.+-+++.....+..+++......+.. 
--|+ T Consensus 435 ~i~~d~~~~~~~~~~~v~aGi~s~----~~~i~~~~g~~eeea~~~l~~i~~E~~~~~~~~~~~~~---~~g~ 500 (500) T protein:vir:98 435 GVFTDRDAELDYWIKVVNAGFGTR----EMAIQKVLNVTEEKAQEIAAEINTGIVDEINQQRTDTH---LYGE 500 (500) T ss_pred CCCCCHHHHHHHHHHHHHcCCCCH----HHHHHhcCCCCHHHHHHHHHHHHHhccccCCCCCcccc---ccCC Confidence 778888888899999999998664 56888999987542222222111212222222211111 1122 No 191 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=88.60 E-value=0.031 Score=28.83 Aligned_cols=443 Identities=9% Similarity=0.040 Sum_probs=161.5 Q ss_pred CCccccCccccc------chhhhcccCCCCccc--ccchHHHHHHHHHHHhhcccccCCcccHH-HH-HHHhhChHHHHH Q lcl|NC_016071. 1 MSTRFAQPSEVV------KAGNENLAVSRLRTG--ELGSGALSQLRAESEVMKVEELRWPCFLA-TV-EAMKQDHTVSTA 70 (516) Q Consensus 1 ~~~r~~~~~~~~------~~~~~~p~~~~~~~~--e~g~~~~~~~~~~~~~~~~~~lr~~~~~~-~y-~~m~~D~~v~s~ 70 (516) |-.|++.-=+.. +.-+..-..+.+-.. ++.+ +..+..+- .-..+.++....-. +. +.+. ---+... T Consensus 3 ~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~--i~~~~~~Y-~g~~~~~~~~~~~~~~~~~~~~-slnl~~~ 78 (500) T protein:vir:30 3 VIQKIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDR--ITTNLKYY-KSDWDSVLYLNTDGETKKRDLN-HLPIART 78 (500) T ss_pred hHHHHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHH--HHHHHHHh-cCCCCCcccccCCCCcccCcee-ecchHHH Confidence 666666542211 111111001111111 1111 11222221 11122221100000 00 0000 0011122 Q ss_pred H-HHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHH-HHHHHhhcceeeeEEEeecccccccc Q lcl|NC_016071. 71 L-DTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARS-AATFNEYGFSIFEKVYRTESAPSKYA 148 (516) Q Consensus 71 l-~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~-~lda~~~G~S~~Eivw~~~~~~~~~~ 148 (516) + .+--..|.+-.-.|++. +.+..+++++++++-. |...+.. +..+..+|=.++=+.|.... 
T Consensus 79 i~~~~A~lv~~e~~~i~~~-------d~~~~~~l~~il~~n~----f~~~~~~~~e~a~a~G~~~~k~~~d~~~------ 141 (500) T protein:vir:30 79 AAKKIASLVFNEQAEIKVD-------DDAANEFISETLKNDR----FNKNFERYLESCLALGGLAMRPYVDGDK------ 141 (500) T ss_pred HHHHHhhhhcCCcceEecC-------ChHHHHHHHHHHhhcc----HHHHHHHHHHHHhhcCCEEEEEEEeCCc------ Confidence 2 22222343333344442 2456778888887532 5555544 45688899888877776321 Q ss_pred cceeeccccccCchhcccccceeecCCCcee------------------eecccccc------cccc--cccc-cccccc Q lcl|NC_016071. 149 GYITIDKIAFRPQSSLSRSKPWVFDEDGRTL------------------KGIYQSKM------AFAN--FQNG-LTQISS 201 (516) Q Consensus 149 g~~~~~~l~~r~q~ti~~~~~f~~~~dg~~l------------------~~~~q~~~------~~~~--~~~~-~~~~~~ 201 (516) +.+.. .++..+- ++.++.++... +.++-|.. ...+ +... ....+. T Consensus 142 --~~I~~---v~ad~~~---P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~ 213 (500) T protein:vir:30 142 --VRVAF---VQAPVFL---PLQSNTQDVSSAAVVIKSVKTINGKEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKVGS 213 (500) T ss_pred --eEEEE---EcCCeeE---EEEEcCCCeEEEEEEEEEeeeecCCceEEEEEEEEEEeCCceeEEEEEEEecccccccCc Confidence 11111 1111110 01112111111 00000000 0000 0000 000011 Q ss_pred ccccccccccCCCcc---ccccccEEEEe----ecCcCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeee Q lcl|NC_016071. 202 AMSLVTNLTSSADEV---FIPINKLMVMS----LGGTESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKI 274 (516) Q Consensus 202 ~~~~~~~~~~~~~~~---~iP~~k~i~~~----~~~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~ 274 (516) ++.+-....+..+.. .+|.--|.+++ .....++|+|.|.+..|.-..-.=+..+.-|+.-++. 
+=..+++ T Consensus 214 ~v~l~~~~~~l~~~~~~~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~---g~~~i~v 290 (500) T protein:vir:30 214 RVPLSEVYKDLKDEAKVTDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKM---GQRRVAV 290 (500) T ss_pred ccccccccCCcCcceEeccCCCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHh---Ccceeee Confidence 111100001111110 11111233432 2345688999999998875444334444444433331 2223455 Q ss_pred cccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHH Q lcl|NC_016071. 275 PSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAIL 354 (516) Q Consensus 275 pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Is 354 (516) |...+.....+.+.... .... +.........++.+-+ ....++.... .-....|.+.++.+=++|+ T Consensus 291 ~~~~l~~~~~~~~g~~~------~~~~--~d~~~~~~~~~~~~~~----~~~~i~~~~~--~ir~e~~~~~l~~~l~~i~ 356 (500) T protein:vir:30 291 PESLTALTVRTTDGDVV------PRPR--FESDQNVYIRMGGRDL----DSSAIQDLTT--PIRADDYIKAINEGLSLFE 356 (500) T ss_pred chHHhcccCCCCCcccc------CCcc--cCCCcceEEEcCCCCC----cCcceeEecc--ccChHHHHHHHHHHHHHHH Confidence 65554433322111000 0000 0000111112221100 0001221111 1111224445555555554 Q ss_pred HH-Hhc-ccccccCCccchhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHh----c--CCcCCccccceEEecC Q lcl|NC_016071. 355 DR-FGA-GFINLGNDGQGSYNLSESKQ-SIHGHFVQRDIDIIVEAFNKNLIPQLLAL----N--DIRLSDEDMPKLKPGL 425 (516) Q Consensus 355 k~-iLG-qtLts~~~~~GS~Al~~vh~-ev~~~~~~aDa~~i~~~ln~~li~~lv~l----N--~~~~~~~~~P~~~~~~ 425 (516) .. -++ +|+..+.++ -..|..-.-. .-...-+..-.+.+..+| ++|++-++.+ + ....+...-+.+.|+. 
T Consensus 357 ~~~gls~~~~~~~~~g-~~TAtei~s~~~~~~~t~~~~~~~~~~al-~~lv~~il~~~~~~~~~~~~~~~~~~v~v~f~d 434 (500) T protein:vir:30 357 MQIGVSAGLFSFDGKS-MKTATEIVSENSDTYQMRNSIVALVEQSL-KELVISIFEIAKAYDLYQSEVPSMDNISISLDD 434 (500) T ss_pred HHhCCCccccccCcCc-cccHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhcCCCCCCCcceEEEeCC Confidence 33 334 333332221 1123221111 111122334445566666 3576666543 1 1111222224678888 Q ss_pred cCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccccCcccccCCCCCCcccccccccCCCC Q lcl|NC_016071. 426 IQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMSTDELLKLLGQDTSRSGDGMTAGSNGN 498 (516) Q Consensus 426 ~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 498 (516) .-.+|.++.++.+.+++.+|++.. +.++.+.||+++.+-+++.....+..+++......+.. --|+ T Consensus 435 ~i~~d~~~~~~~~~~~v~aGi~s~----~~~i~~~~g~~eeea~~~l~~i~~E~~~~~~~~~~~~~---~~g~ 500 (500) T protein:vir:30 435 GVFTDRDAELDYWIKVVNAGFGTR----EMAIQKVLNVTEEKAQEIAAEINTGIVDEINQQRTDTH---LYGE 500 (500) T ss_pred CCCCCHHHHHHHHHHHHHcCCCCH----HHHHHhcCCCCHHHHHHHHHHHHHhccccCCCCCcccc---ccCC Confidence 778888888899999999998664 56888999987542222222111212222222211111 1122 No 192 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=88.50 E-value=0.032 Score=28.79 Aligned_cols=412 Identities=11% Similarity=0.061 Sum_probs=151.5 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhhChHHHHHHHHHHHHHhc Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQDHTVSTALDTKYVFVTK 80 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~~v~~ 80 (516) ..+|..+-.+.-+. +-.-.. | +.... 
........+.+.+ ..++ -.....-++.+....+.+ T Consensus 39 ~~~~~~~~~~~~~Y---Y~g~~~-----i----~~~~~-~~~~~~~~~~~~~-~~ki-----~~n~~k~Ivd~~~~~l~g 99 (474) T protein:vir:94 39 HRKQLDKITVGQRY---YDKDND-----I----VKQMK-KVDVHGNIDYDKP-DWRI-----TTNFHQNLVDQKVSYVAS 99 (474) T ss_pred HHHHHHHHHHHHHH---hccccc-----h----hcccc-hhccccccccccC-ccee-----ecchHHHHHHHHHhhhhc Confidence 11111111111000 000000 0 00000 0000000000000 0000 012223333333344445 Q ss_pred CCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHH-HHHHhhcceeeeEEEeecccccccccceeecccccc Q lcl|NC_016071. 81 AFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSA-ATFNEYGFSIFEKVYRTESAPSKYAGYITIDKIAFR 159 (516) Q Consensus 81 ~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~-lda~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l~~r 159 (516) -+..+.+ + +.+..++++.++++ .|.+.+.++ .++.-||.+ ++.+|... +|.+.+..+.|+ T Consensus 100 ~p~~~~~----~---d~~~~~~l~~~~~n-----~~~~~~~e~~~~~~~~G~~-~~~~~~d~------~~~~~i~~~~p~ 160 (474) T protein:vir:94 100 KPVTYSC----E---DENVLKVIHDVLDT-----RWDNKLIDILTATSNKGID-WLQVYINE------NGEMKLFRVPAE 160 (474) T ss_pred CCceecc----C---cHHHHHHHHHHHhc-----cHHHHHHHHHHHHhhcCce-EEEEEecC------CCeeEEEEEccc Confidence 5544432 2 23455677776642 255555544 568889974 57787543 333433333222 Q ss_pred CchhcccccceeecC--CCceeeecccccccccccccccccccccccccccccc-----------------CCCcccccc Q lcl|NC_016071. 160 PQSSLSRSKPWVFDE--DGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTS-----------------SADEVFIPI 220 (516) Q Consensus 160 ~q~ti~~~~~f~~~~--dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------------~~~~~~iP~ 220 (516) .+. -.|++ .++.+..++......... ..+-.+.....+... ......+.. 
T Consensus 161 ---~~~----~v~d~~~~~~~~~~ir~~~~~~~~~----~~~yt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 229 (474) T protein:vir:94 161 ---QAI----PIWVDKEREELKSFIRYYKFNNEEK----VEFWTDTTVTYYVLENGGLIPDYYYGANHVQSHFSNGNWGR 229 (474) T ss_pred ---ceE----EEEcCCCCCceEEEEEEEEecCeEE----EEEEeCCeEEEEEEcCCccccccccCcCcccccccccCCCc Confidence 111 11221 233333333211100000 000000000000000 000001111 Q ss_pred ccEEEEeecCcCCccccchhHHHHHHHHHHH-HHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHH Q lcl|NC_016071. 221 NKLMVMSLGGTESNPAGVSPLVGCYRAFREK-ILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMA 299 (516) Q Consensus 221 ~k~i~~~~~~~~g~p~G~gLlr~~~~~~~fK-~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~ 299 (516) .-++.| .+|+.|.|.+..+ .+.+-- +..+..++..++.+..|.+++++..+ .+ .......+. T Consensus 230 vPvv~~-----~nn~~g~sd~e~v-~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~~------~~----~~~~~~~~~- 292 (474) T protein:vir:94 230 VPFIAF-----KNNPEEVSDIWMY-KSIIDAIDKRLSDAQNMFDESVELIYILKGYEG------ED----LEEFMRGLK- 292 (474) T ss_pred cceEEe-----cCCcCCCCcHHHH-HHHHHHHHHHHHHHHHHHHHhcCceeeeecCCc------cc----chhhhhhhh- Confidence 123333 2478899999874 444433 44666777778888888888775321 11 111111111 Q ss_pred HHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHH Q lcl|NC_016071. 300 DAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQ 379 (516) Q Consensus 300 ~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~ 379 (516) ....+.++.|- .++++.... ....+...++.+.+.|...--+..++.++.++.+.+.| ... 
T Consensus 293 --------~~~~i~~~~~~--------~~~~l~~~~--~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~A-l~~ 353 (474) T protein:vir:94 293 --------YYKAINVDGDG--------GVETIQVEV--PVSSTKEYIDLMRVYIMEFGQGVDFQTDKFGSAPSGIA-LKF 353 (474) T ss_pred --------ccceeeccCCC--------ceeEEeecC--CHHHHHHHHHHHHHHHHHHhCccccCccccccccHHHH-HHH Confidence 11223344443 344544433 23346778888888887765444444433222111111 111 Q ss_pred HH--HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHH Q lcl|NC_016071. 380 SI--HGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKI 457 (516) Q Consensus 380 ev--~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i 457 (516) .. ....+..-.+.+...| +++++.++.+.+..... .--.+.|....+.|..+.|+.+.+ .|++. ++.+ T Consensus 354 ~~~~l~~k~~~k~~~~~~~l-~~~~~li~~~~~~~~d~-~~i~v~f~~~~p~~~~e~a~~~~~---~g~iS-----~et~ 423 (474) T protein:vir:94 354 LYGNLDLKANKLKNKATVAI-QELISFIIDFNNLKTDV-KDIEISFNFNRMMNDAEQSQIIAQ---SQYLS-----RETL 423 (474) T ss_pred HHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhCCCccc-ceeeEEeccCcccCHHHHHHHHHH---cCCCC-----HHHH Confidence 11 1111222223445555 34666677765322221 223567777778887776665544 57643 3456 Q ss_pred HHHcC-CCCCCCcccccCcccccCCCCCCcccccccccCCCCCcccccccccc Q lcl|NC_016071. 458 LEVGG-FDEEIPEDMSTDELLKLLGQDTSRSGDGMTAGSNGNGTGKISSTRDN 509 (516) Q Consensus 458 ~e~~G-lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 509 (516) .+.++ ++.+..+-+-...+.....+....-+.+..... 
..+.+..+.+-+ T Consensus 424 l~~l~~v~D~~~E~eri~~E~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~e 474 (474) T protein:vir:94 424 VKSSPLVDDYKAELERIEQEQMEYNKQLPNLDDGGADGA--QQQEGSNNKESE 474 (474) T ss_pred HHhCCCCCCHHHHHHHHHHHHHHHHhhccccCCCCCCCc--ccCCCCcccccC Confidence 66664 333221111111111100010001000000000 000000000000 No 193 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=88.50 E-value=0.032 Score=28.79 Aligned_cols=412 Identities=11% Similarity=0.061 Sum_probs=151.5 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhhChHHHHHHHHHHHHHhc Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQDHTVSTALDTKYVFVTK 80 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~~v~~ 80 (516) ..+|..+-.+.-+. +-.-.. | +.... ........+.+.+ ..++ -.....-++.+....+.+ T Consensus 39 ~~~~~~~~~~~~~Y---Y~g~~~-----i----~~~~~-~~~~~~~~~~~~~-~~ki-----~~n~~k~Ivd~~~~~l~g 99 (474) T protein:vir:97 39 HRKQLDKITVGQRY---YDKDND-----I----VKQMK-KVDVHGNIDYDKP-DWRI-----TTNFHQNLVDQKVSYVAS 99 (474) T ss_pred HHHHHHHHHHHHHH---hccccc-----h----hcccc-hhccccccccccC-ccee-----ecchHHHHHHHHHhhhhc Confidence 11111111111000 000000 0 00000 0000000000000 0000 012223333333344445 Q ss_pred CCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHH-HHHHhhcceeeeEEEeecccccccccceeecccccc Q lcl|NC_016071. 81 AFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSA-ATFNEYGFSIFEKVYRTESAPSKYAGYITIDKIAFR 159 (516) Q Consensus 81 ~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~-lda~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l~~r 159 (516) -+..+.+ + +.+..++++.++++ .|.+.+.++ .++.-||.+ ++.+|... 
+|.+.+..+.|+ T Consensus 100 ~p~~~~~----~---d~~~~~~l~~~~~n-----~~~~~~~e~~~~~~~~G~~-~~~~~~d~------~~~~~i~~~~p~ 160 (474) T protein:vir:97 100 KPVTYSC----E---DENVLKVIHDVLDT-----RWDNKLIDILTATSNKGID-WLQVYINE------NGEMKLFRVPAE 160 (474) T ss_pred CCceecc----C---cHHHHHHHHHHHhc-----cHHHHHHHHHHHHhhcCce-EEEEEecC------CCeeEEEEEccc Confidence 5544432 2 23455677776642 255555544 568889974 57787543 333433333222 Q ss_pred CchhcccccceeecC--CCceeeecccccccccccccccccccccccccccccc-----------------CCCcccccc Q lcl|NC_016071. 160 PQSSLSRSKPWVFDE--DGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTS-----------------SADEVFIPI 220 (516) Q Consensus 160 ~q~ti~~~~~f~~~~--dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------------~~~~~~iP~ 220 (516) .+. -.|++ .++.+..++......... ..+-.+.....+... ......+.. T Consensus 161 ---~~~----~v~d~~~~~~~~~~ir~~~~~~~~~----~~~yt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 229 (474) T protein:vir:97 161 ---QAI----PIWVDKEREELKSFIRYYKFNNEEK----VEFWTDTTVTYYVLENGGLIPDYYYGANHVQSHFSNGNWGR 229 (474) T ss_pred ---ceE----EEEcCCCCCceEEEEEEEEecCeEE----EEEEeCCeEEEEEEcCCccccccccCcCcccccccccCCCc Confidence 111 11221 233333333211100000 000000000000000 000001111 Q ss_pred ccEEEEeecCcCCccccchhHHHHHHHHHHH-HHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHH Q lcl|NC_016071. 221 NKLMVMSLGGTESNPAGVSPLVGCYRAFREK-ILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMA 299 (516) Q Consensus 221 ~k~i~~~~~~~~g~p~G~gLlr~~~~~~~fK-~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~ 299 (516) .-++.| .+|+.|.|.+..+ .+.+-- +..+..++..++.+..|.+++++..+ .+ .......+. 
T Consensus 230 vPvv~~-----~nn~~g~sd~e~v-~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~~------~~----~~~~~~~~~- 292 (474) T protein:vir:97 230 VPFIAF-----KNNPEEVSDIWMY-KSIIDAIDKRLSDAQNMFDESVELIYILKGYEG------ED----LEEFMRGLK- 292 (474) T ss_pred cceEEe-----cCCcCCCCcHHHH-HHHHHHHHHHHHHHHHHHHHhcCceeeeecCCc------cc----chhhhhhhh- Confidence 123333 2478899999874 444433 44666777778888888888775321 11 111111111 Q ss_pred HHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHH Q lcl|NC_016071. 300 DAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQ 379 (516) Q Consensus 300 ~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~ 379 (516) ....+.++.|- .++++.... ....+...++.+.+.|...--+..++.++.++.+.+.| ... T Consensus 293 --------~~~~i~~~~~~--------~~~~l~~~~--~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~A-l~~ 353 (474) T protein:vir:97 293 --------YYKAINVDGDG--------GVETIQVEV--PVSSTKEYIDLMRVYIMEFGQGVDFQTDKFGSAPSGIA-LKF 353 (474) T ss_pred --------ccceeeccCCC--------ceeEEeecC--CHHHHHHHHHHHHHHHHHHhCccccCccccccccHHHH-HHH Confidence 11223344443 344544433 23346778888888887765444444433222111111 111 Q ss_pred HH--HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHH Q lcl|NC_016071. 380 SI--HGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKI 457 (516) Q Consensus 380 ev--~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i 457 (516) .. ....+..-.+.+...| +++++.++.+.+..... .--.+.|....+.|..+.|+.+.+ .|++. 
++.+ T Consensus 354 ~~~~l~~k~~~k~~~~~~~l-~~~~~li~~~~~~~~d~-~~i~v~f~~~~p~~~~e~a~~~~~---~g~iS-----~et~ 423 (474) T protein:vir:97 354 LYGNLDLKANKLKNKATVAI-QELISFIIDFNNLKTDV-KDIEISFNFNRMMNDAEQSQIIAQ---SQYLS-----RETL 423 (474) T ss_pred HHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhCCCccc-ceeeEEeccCcccCHHHHHHHHHH---cCCCC-----HHHH Confidence 11 1111222223445555 34666677765322221 223567777778887776665544 57643 3456 Q ss_pred HHHcC-CCCCCCcccccCcccccCCCCCCcccccccccCCCCCcccccccccc Q lcl|NC_016071. 458 LEVGG-FDEEIPEDMSTDELLKLLGQDTSRSGDGMTAGSNGNGTGKISSTRDN 509 (516) Q Consensus 458 ~e~~G-lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 509 (516) .+.++ ++.+..+-+-...+.....+....-+.+..... ..+.+..+.+-+ T Consensus 424 l~~l~~v~D~~~E~eri~~E~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~e 474 (474) T protein:vir:97 424 VKSSPLVDDYKAELERIEQEQMEYNKQLPNLDDGGADGA--QQQEGSNNKESE 474 (474) T ss_pred HHhCCCCCCHHHHHHHHHHHHHHHHhhccccCCCCCCCc--ccCCCCcccccC Confidence 66664 333221111111111100010001000000000 000000000000 No 194 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=88.47 E-value=0.032 Score=28.77 Aligned_cols=448 Identities=9% Similarity=-0.009 Sum_probs=154.5 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHH-HHHHHhhChHHHHHHHHHH-HHH Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLA-TVEAMKQDHTVSTALDTKY-VFV 78 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~-~y~~m~~D~~v~s~l~~Rk-~~v 78 (516) |-.+.. .+...+.+ ..| -+.+...+..++ ..+..+ ++-..+++..-.... ...+-..-.-+...+-+.. 
..| T Consensus 14 ~~~~~~-~~~~~~i~-~~~-~i~~~~~~~~~i--~~~~~~-y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~~~~A~lv 87 (522) T protein:vir:47 14 GRYYMQ-TSNLNSIL-EHP-KIAVTQEEYDRI--KRNLVY-YQSKWDDVQYKNTDGDIKSRPMNHLPIARTASKKIASLV 87 (522) T ss_pred HHHHhh-cccchhcc-ccC-CCCCCHHHHHHH--HHHHHH-hcCCcccccccccCcchhcccceecchHHHHHHHHhhhh Confidence 222221 11111100 011 111111222111 122221 111222221100000 0000001112223222222 233 Q ss_pred hcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHH-HHHHHhhcceeeeEEEeecccccccccceeecccc Q lcl|NC_016071. 79 TKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARS-AATFNEYGFSIFEKVYRTESAPSKYAGYITIDKIA 157 (516) Q Consensus 79 ~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~-~lda~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l~ 157 (516) .+-.-.|++. +.+..+++.+.+++.. |...+.. +..+...|=.++=+.|..+. +.+..+ T Consensus 88 ~~e~~~i~v~-------d~~~~~~l~~~l~~n~----f~~~~~~~~e~a~a~G~~a~k~~~d~~~--------~~i~~v- 147 (522) T protein:vir:47 88 YNEQATITTK-------NEILQKFLDDMLTNDR----FNKNFERYLESCLALGGLAMRPYIDGDK--------VRVAFI- 147 (522) T ss_pred cCCcceeecC-------ChHHHHHHHHHHhhcc----hHHHHHHHHHHhhccCCEEEEEEEcCCc--------eEEEEE- Confidence 3333344431 2456778888886533 5554544 55688888888877775321 111110 Q ss_pred ccCchhcccccceeecCCCce------------------eeecccccc-----------------cccc--ccccc-ccc Q lcl|NC_016071. 158 FRPQSSLSRSKPWVFDEDGRT------------------LKGIYQSKM-----------------AFAN--FQNGL-TQI 199 (516) Q Consensus 158 ~r~q~ti~~~~~f~~~~dg~~------------------l~~~~q~~~-----------------~~~~--~~~~~-~~~ 199 (516) +...+.+ ..++.+|.. .+.++-|.. ...+ +.+.- ..+ T Consensus 148 --~ad~~~P---~~~~~~~~~e~a~~~~~~~~~~~~~~~yt~lE~he~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~l 222 (522) T protein:vir:47 148 --QAPVFFP---LESNTQDVSSAAILTKTIKSEGRKNVYYTLVEFHEWVTADGQETGSTNDKKYYRITNELYRSDVNDVL 222 (522) T ss_pred --cCCceEE---EEEcCCceEEEEEEEEEEeecccceeEEEEEEEeeecccccccccccccCCceEEEEEEeecCCCccc Confidence 1100000 011111110 000110000 0000 00000 000 Q ss_pred ccccccccc--cccCCCccccc---cccEEEEee----cCcCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccce Q lcl|NC_016071. 
200 SSAMSLVTN--LTSSADEVFIP---INKLMVMSL----GGTESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGII 270 (516) Q Consensus 200 ~~~~~~~~~--~~~~~~~~~iP---~~k~i~~~~----~~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~ 270 (516) +.++.+-.. .....+.+.++ .-=|++++. ....++|+|.|.+..|--..-.=+.. |.+++.-+=.+=. T Consensus 223 G~~v~l~~~~e~~~l~~~~~~~~~~~Plf~y~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~---~s~~~~e~~~g~~ 299 (522) T protein:vir:47 223 GQRVNLSELDKYKNLEPVTVFENLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRS---YDEFMWEVRMGQR 299 (522) T ss_pred CccccccccccccCCCCceEeCCCCcceEEEecCCcccccccCCCcCCchhhhhHHHHHHHHHH---HHHHHHHHHhccc Confidence 011000000 01111111111 111333332 23457899999999886433222222 3333321111112 Q ss_pred eeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHH Q lcl|NC_016071. 271 ELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRK 350 (516) Q Consensus 271 v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d 350 (516) -++.|...+............. ... +..+......+..+.. - ...++..... -....|.+.++.+- T Consensus 300 ~i~v~~~~l~~~~~~~~g~~~~----~~~----fd~~~~~f~~~~~~~~--~--~~~i~~~~~~--ir~e~~~~~~~~~l 365 (522) T protein:vir:47 300 RVIVPEHLTQRQYQRPDGTIDF----RPR----FDVEQNVYMQIGGSSM--D--AGGITDLTSP--IRANDYILAISEGL 365 (522) T ss_pred eeecchHHhccCCCCCCccccc----ccc----cCcccceEeecCCCCC--C--CCcceeeccc--cChHHHHHHHHHHH Confidence 2344444433321111100000 000 0001111111211110 0 0011111110 01123555555555 Q ss_pred HHHHH-HHhc-ccccccCCccchhhHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHh-------cCCcCCccccc Q lcl|NC_016071. 351 KAILD-RFGA-GFINLGNDGQGSYNLSESKQSI--HGHFVQRDIDIIVEAFNKNLIPQLLAL-------NDIRLSDEDMP 419 (516) Q Consensus 351 ~~Isk-~iLG-qtLts~~~~~GS~Al~~vh~ev--~~~~~~aDa~~i~~~ln~~li~~lv~l-------N~~~~~~~~~P 419 (516) +.|+. +.++ +|++.+ ++|-..+.++..+- ...-+..-.+.+..+| ++|+.-++.+ |. 
..+...-+ T Consensus 366 ~~i~~~~gls~~tf~~~--~~~~kTAtEi~s~~~~~~~t~~~~~~~~~~al-~~lv~~i~~l~~~~~~~~~-~~~~~~~i 441 (522) T protein:vir:47 366 KLFEMQIGVSSGMFTFD--GQGMKTATEIVSENSDTYQMRSSIVALVEQSI-KELCVSMCELGKAVGVYSG-EIPELDDI 441 (522) T ss_pred HHHHHHhCCCccccCcc--ccccccHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhhhhhccC-CCCCccee Confidence 55543 3344 334332 22221122332111 1112344455666666 4677777644 21 11223336 Q ss_pred eEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccccCcccccCCCCCCcccccccccCCCCC Q lcl|NC_016071. 420 KLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMSTDELLKLLGQDTSRSGDGMTAGSNGNG 499 (516) Q Consensus 420 ~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 499 (516) .+.|+..-.+|.++.++.+.+++.+|++.+ +.++.+.||+++.+-.++.. +...... ++.+......+++. T Consensus 442 ~v~f~D~i~~D~~~~~~~~~~~v~aG~~s~----e~~i~~~~g~~eeea~~el~-ri~~E~~----~~~~~~~~~~~~~~ 512 (522) T protein:vir:47 442 SVNLDDGVFTDRHAELDYWAKMVAAGFSTK----KRAIGKTLNISGVEAEKELN-AINSELL----PMNDAELAIYGMHD 512 (522) T ss_pred EEEcCCCCCCCHHHHHHHHHHHHhcCCCCH----HHHHHhcCCCChHHHHHHHH-HHHHhhc----cCCCCCCCCCCCCC Confidence 788888888888888899999999998664 67899999987532222211 1111111 10000000011111 Q ss_pred cccccccccchh Q lcl|NC_016071. 500 TGKISSTRDNSV 511 (516) Q Consensus 500 ~~~~~~~~d~~~ 511 (516) +. -+..|+.- T Consensus 513 ~~--~~~~d~~~ 522 (522) T protein:vir:47 513 QN--EEKADDKG 522 (522) T ss_pred cc--cccCCCCC Confidence 11 11222222 No 195 >protein:vir:101806 Length: 516 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238883;genbank:gi:66391958;genbank:GeneID:3416633 Probab=87.56 E-value=0.038 Score=28.37 Aligned_cols=441 Identities=13% Similarity=0.078 Sum_probs=161.4 Q ss_pred CCccccCcccccch------hhhccc-CCCCcccccchH-----HH-HHHHH----HHHhhcccccC-CcccHHHHHHHh Q lcl|NC_016071. 
1 MSTRFAQPSEVVKA------GNENLA-VSRLRTGELGSG-----AL-SQLRA----ESEVMKVEELR-WPCFLATVEAMK 62 (516) Q Consensus 1 ~~~r~~~~~~~~~~------~~~~p~-~~~~~~~e~g~~-----~~-~~~~~----~~~~~~~~~lr-~~~~~~~y~~m~ 62 (516) |++=++--.++-.. .+..+| +|| ..+=|+. .. ....| +.+. .++.+ ..++|+.|++|. T Consensus 3 ~~~lf~f~~~~d~~~~~~~~~~~~~s~~~p--~~~dGa~~i~~~~~~~~~~g~~~~~~~~--~~~~~~~~eLI~~YR~ma 78 (516) T protein:vir:10 3 FLDLFKFWDRVDQNEYDERLKLGHESIATP--KKDDGATEIETREGEATYNAVMQQFFGI--DNNISGTKDLINTYRQLI 78 (516) T ss_pred chHhcccccchhhhHHhhhhcCCcCcccCC--CCCCCceeeecCCCcccccceeeeeecc--ccccchHHHHHHHHHHHh Confidence 22211111111000 000111 111 1111110 00 01111 1211 12222 245799999999 Q ss_pred hChHHHHHHHHHHHHHhcCCc-e--eeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHHHHHhhcceeeeEEEe Q lcl|NC_016071. 63 QDHTVSTALDTKYVFVTKAFN-D--FKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAATFNEYGFSIFEKVYR 139 (516) Q Consensus 63 ~D~~v~s~l~~Rk~~v~~~~w-~--i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~lda~~~G~S~~Eivw~ 139 (516) .+|.|-++++-.-.-+.-.+- . |.+.- .+.+.++.+-+.|.+.+ +.+..+|+.--+||..+= T Consensus 79 ~~pEvd~Av~eIVneaiv~d~~~~pV~l~L-~~~~~s~~ik~kI~eeF----------~~Il~ll~F~~~~~~~fR---- 143 (516) T protein:vir:10 79 NNPEVERAVANIVNEAIVYERGHKVVSLDL-DDTDFGSNVKEKILEEF----------DEVCRLLDASRKLDTLFR---- 143 (516) T ss_pred hccchhhHHHHhhcceeEecCCCceEEEEe-cccCcchHHHHHHHHHH----------HHHHHHhccchhhhHHHh---- Confidence 999999999987654321110 0 00000 01112233333333332 234455555556665541 Q ss_pred ecccccccccceeeccccccCchhcc-----------cccceee-cCCCceeeecccccccccccccccccccccccccc Q lcl|NC_016071. 140 TESAPSKYAGYITIDKIAFRPQSSLS-----------RSKPWVF-DEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVT 207 (516) Q Consensus 140 ~~~~~~~~~g~~~~~~l~~r~q~ti~-----------~~~~f~~-~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~ 207 (516) .|+-||++.++++..+|..-|. ..|.... +.+|..+.. ....+.-|..+... 
+...- T Consensus 144 ----~WYVDgRi~fhKiid~~k~GI~Elr~lDPr~i~~vR~i~~~~~~~~~v~~---~~~e~~~Y~~~~~~----~~~~g 212 (516) T protein:vir:10 144 ----RWYVDSRIFFHKIMPNPKKGIAELRRLDPRFMEYYREIVTSDIGGTTIVK---GYREFFIYTTGNEG----YSYNG 212 (516) T ss_pred ----hhhhcceEEEEEEecCccccceeeeeeCCcceeeEeeecccccccchhhh---hhhheeeeccCccc----ccccc Confidence 2444566655555554443332 2221111 112211110 00111111111100 00000 Q ss_pred ccccCCCccccccccEEEEeecC---cCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccC Q lcl|NC_016071. 208 NLTSSADEVFIPINKLMVMSLGG---TESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAI 284 (516) Q Consensus 208 ~~~~~~~~~~iP~~k~i~~~~~~---~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~ 284 (516) ..-+....+.||.+ .|+|+|.. ..++.+ .|.|.++..|+==-+.....-.++ .+-.+|.+|+=+-.- T Consensus 213 ~~~~~~~~ikI~~d-AI~y~hSGL~d~~~~~i-~syLhkAiKp~NQLkm~EDAlVIY--------RitRAPeRRvFYIDv 282 (516) T protein:vir:10 213 RIFEPNTRIKIPRS-AVVYASSGLMDCSDRGI-IGYLHNAVKPANQLKLLEDAMVIY--------RITRAPERRVFYIDV 282 (516) T ss_pred ceeCCCcceeechh-heeeecccceeCCCCce-eeeehhhhHhHHhhHHHHhhHHHH--------hhhccccceEEEEec Confidence 00111234566665 48888853 234455 789999988775333322222211 122222222211111 Q ss_pred --CCCHHHHHHHHHHHHHHHHhhc----ccceEE--------------EeccCcccccccccceeeeeccccCcchhHHH Q lcl|NC_016071. 285 --DPKSPESEMVQGLMADAANAHA----GEQAYF--------------ILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQE 344 (516) Q Consensus 285 --~~~~~~~~~l~~l~~~~~~~~~----g~~a~~--------------iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~ 344 (516) =|....++. +..++..++. .+..|- -||.= +.....+|+-+ .|+... 
.-.+ T Consensus 283 GnlPk~KAeqY---l~~im~k~kNklvYDa~TGev~ddrk~msMlEDyWLpRR---eGgrgTEItTL--pGgqnl-gem~ 353 (516) T protein:vir:10 283 GNMNNRKATEY---VNGIMQSLKNRVVYDSNTGTVKNQKRNLSMTEDYWLMRR---DGKSVTEVSSL--PGAQTM-GDMD 353 (516) T ss_pred CCCCchhHHHH---HHHHHHhcCceeEEeCCCCeeccchhhhhhHhhhccccc---CCCCccceeec--cccCCc-ChHH Confidence 122222222 2333322211 001111 12210 00111233333 222222 2334 Q ss_pred HHHHHHHHHHHHHhcccccccCCccchh---hHHH-HHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCc---c Q lcl|NC_016071. 345 LVNSRKKAILDRFGAGFINLGNDGQGSY---NLSE-SKQSI-HGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSD---E 416 (516) Q Consensus 345 li~~~d~~Isk~iLGqtLts~~~~~GS~---Al~~-vh~ev-~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~---~ 416 (516) =|+|..+.+-+++--..--.+.+++++. ..++ +..|+ |...++.-...+...|..-|-..|+.=+ .--+. . T Consensus 354 DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKg-iit~eew~~ 432 (516) T protein:vir:10 354 DVRWFNKKLYEALRIPLSRIPRDDGGMVIGGQDTAITRDELDFRKFVVQLQHDFEEIFLDPLKTNLIYKR-IITEDEWDE 432 (516) T ss_pred HHHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcc-CCCHHHHHH Confidence 5899999999998887644544443332 2233 33334 3333444444455555433333333211 11110 1 Q ss_pred ccceEEecCcCc------hhHHHHHHHHHHHHhCCcccccHHHHHHHHHHc-CCCCCCCcc--cccCcccccCCCCCCcc Q lcl|NC_016071. 417 DMPKLKPGLIQE------VDMEGFSKFVQRIGAVGYLPKTPTVINKILEVG-GFDEEIPED--MSTDELLKLLGQDTSRS 487 (516) Q Consensus 417 ~~P~~~~~~~~~------~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~-Glp~~~~~~--~~~~~~~~~~~~~~~~~ 487 (516) --+.+.|+...+ .+.+-+.+++..|..+-=.+-...+.+||++.+ .++..+-.+ ...+.+.+ .+-...|. T Consensus 433 i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~e~k~I~~E~~-~~~~~~p~ 511 (516) T protein:vir:10 433 QINNIKVNFHQDSYYTELKDIETLRLRVDALSQIEPYVGKYVSHDYVMKNILQMTEEQIAQEEKQIEQEAG-IKRFQNPE 511 (516) T ss_pred HhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhcCCHhhHHHHHHHHHHhhh-CCCCCCCC Confidence 113344443333 333444445555443321122245578887654 555321111 11111111 11001111 Q ss_pred cccccccCCCC Q lcl|NC_016071. 
488 GDGMTAGSNGN 498 (516) Q Consensus 488 ~~~~~~~~~~~ 498 (516) .+. .+ T Consensus 512 ~~~------~f 516 (516) T protein:vir:10 512 NED------DF 516 (516) T ss_pred ccc------cC Confidence 000 00 No 196 >protein:vir:101189 Length: 516 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932511;genbank:gi:37651637;genbank:GeneID:2610682 Probab=87.56 E-value=0.038 Score=28.37 Aligned_cols=441 Identities=13% Similarity=0.078 Sum_probs=161.4 Q ss_pred CCccccCcccccch------hhhccc-CCCCcccccchH-----HH-HHHHH----HHHhhcccccC-CcccHHHHHHHh Q lcl|NC_016071. 1 MSTRFAQPSEVVKA------GNENLA-VSRLRTGELGSG-----AL-SQLRA----ESEVMKVEELR-WPCFLATVEAMK 62 (516) Q Consensus 1 ~~~r~~~~~~~~~~------~~~~p~-~~~~~~~e~g~~-----~~-~~~~~----~~~~~~~~~lr-~~~~~~~y~~m~ 62 (516) |++=++--.++-.. .+..+| +|| ..+=|+. .. ....| +.+. .++.+ ..++|+.|++|. T Consensus 3 ~~~lf~f~~~~d~~~~~~~~~~~~~s~~~p--~~~dGa~~i~~~~~~~~~~g~~~~~~~~--~~~~~~~~eLI~~YR~ma 78 (516) T protein:vir:10 3 FLDLFKFWDRVDQNEYDERLKLGHESIATP--KKDDGATEIETREGEATYNAVMQQFFGI--DNNISGTKDLINTYRQLI 78 (516) T ss_pred chHhcccccchhhhHHhhhhcCCcCcccCC--CCCCCceeeecCCCcccccceeeeeecc--ccccchHHHHHHHHHHHh Confidence 22211111111000 000111 111 1111110 00 01111 1211 12222 245799999999 Q ss_pred hChHHHHHHHHHHHHHhcCCc-e--eeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHHHHHhhcceeeeEEEe Q lcl|NC_016071. 63 QDHTVSTALDTKYVFVTKAFN-D--FKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAATFNEYGFSIFEKVYR 139 (516) Q Consensus 63 ~D~~v~s~l~~Rk~~v~~~~w-~--i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~lda~~~G~S~~Eivw~ 139 (516) .+|.|-++++-.-.-+.-.+- . 
|.+.- .+.+.++.+-+.|.+.+ +.+..+|+.--+||..+= T Consensus 79 ~~pEvd~Av~eIVneaiv~d~~~~pV~l~L-~~~~~s~~ik~kI~eeF----------~~Il~ll~F~~~~~~~fR---- 143 (516) T protein:vir:10 79 NNPEVERAVANIVNEAIVYERGHKVVSLDL-DDTDFGSNVKEKILEEF----------DEVCRLLDASRKLDTLFR---- 143 (516) T ss_pred hccchhhHHHHhhcceeEecCCCceEEEEe-cccCcchHHHHHHHHHH----------HHHHHHhccchhhhHHHh---- Confidence 999999999987654321110 0 00000 01112233333333332 234455555556665541 Q ss_pred ecccccccccceeeccccccCchhcc-----------cccceee-cCCCceeeecccccccccccccccccccccccccc Q lcl|NC_016071. 140 TESAPSKYAGYITIDKIAFRPQSSLS-----------RSKPWVF-DEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVT 207 (516) Q Consensus 140 ~~~~~~~~~g~~~~~~l~~r~q~ti~-----------~~~~f~~-~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~ 207 (516) .|+-||++.++++..+|..-|. ..|.... +.+|..+.. ....+.-|..+... +...- T Consensus 144 ----~WYVDgRi~fhKiid~~k~GI~Elr~lDPr~i~~vR~i~~~~~~~~~v~~---~~~e~~~Y~~~~~~----~~~~g 212 (516) T protein:vir:10 144 ----RWYVDSRIFFHKIMPNPKKGIAELRRLDPRFMEYYREIVTSDIGGTTIVK---GYREFFIYTTGNEG----YSYNG 212 (516) T ss_pred ----hhhhcceEEEEEEecCccccceeeeeeCCcceeeEeeecccccccchhhh---hhhheeeeccCccc----ccccc Confidence 2444566655555554443332 2221111 112211110 00111111111100 00000 Q ss_pred ccccCCCccccccccEEEEeecC---cCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccC Q lcl|NC_016071. 208 NLTSSADEVFIPINKLMVMSLGG---TESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAI 284 (516) Q Consensus 208 ~~~~~~~~~~iP~~k~i~~~~~~---~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~ 284 (516) ..-+....+.||.+ .|+|+|.. 
..++.+ .|.|.++..|+==-+.....-.++ .+-.+|.+|+=+-.- T Consensus 213 ~~~~~~~~ikI~~d-AI~y~hSGL~d~~~~~i-~syLhkAiKp~NQLkm~EDAlVIY--------RitRAPeRRvFYIDv 282 (516) T protein:vir:10 213 RIFEPNTRIKIPRS-AVVYASSGLMDCSDRGI-IGYLHNAVKPANQLKLLEDAMVIY--------RITRAPERRVFYIDV 282 (516) T ss_pred ceeCCCcceeechh-heeeecccceeCCCCce-eeeehhhhHhHHhhHHHHhhHHHH--------hhhccccceEEEEec Confidence 00111234566665 48888853 234455 789999988775333322222211 122222222211111 Q ss_pred --CCCHHHHHHHHHHHHHHHHhhc----ccceEE--------------EeccCcccccccccceeeeeccccCcchhHHH Q lcl|NC_016071. 285 --DPKSPESEMVQGLMADAANAHA----GEQAYF--------------ILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQE 344 (516) Q Consensus 285 --~~~~~~~~~l~~l~~~~~~~~~----g~~a~~--------------iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~ 344 (516) =|....++. +..++..++. .+..|- -||.= +.....+|+-+ .|+... .-.+ T Consensus 283 GnlPk~KAeqY---l~~im~k~kNklvYDa~TGev~ddrk~msMlEDyWLpRR---eGgrgTEItTL--pGgqnl-gem~ 353 (516) T protein:vir:10 283 GNMNNRKATEY---VNGIMQSLKNRVVYDSNTGTVKNQKRNLSMTEDYWLMRR---DGKSVTEVSSL--PGAQTM-GDMD 353 (516) T ss_pred CCCCchhHHHH---HHHHHHhcCceeEEeCCCCeeccchhhhhhHhhhccccc---CCCCccceeec--cccCCc-ChHH Confidence 122222222 2333322211 001111 12210 00111233333 222222 2334 Q ss_pred HHHHHHHHHHHHHhcccccccCCccchh---hHHH-HHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCc---c Q lcl|NC_016071. 345 LVNSRKKAILDRFGAGFINLGNDGQGSY---NLSE-SKQSI-HGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSD---E 416 (516) Q Consensus 345 li~~~d~~Isk~iLGqtLts~~~~~GS~---Al~~-vh~ev-~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~---~ 416 (516) =|+|..+.+-+++--..--.+.+++++. ..++ +..|+ |...++.-...+...|..-|-..|+.=+ .--+. . 
T Consensus 354 DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKg-iit~eew~~ 432 (516) T protein:vir:10 354 DVRWFNKKLYEALRIPLSRIPRDDGGMVIGGQDTAITRDELDFRKFVVQLQHDFEEIFLDPLKTNLIYKR-IITEDEWDE 432 (516) T ss_pred HHHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcc-CCCHHHHHH Confidence 5899999999998887644544443332 2233 33334 3333444444455555433333333211 11110 1 Q ss_pred ccceEEecCcCc------hhHHHHHHHHHHHHhCCcccccHHHHHHHHHHc-CCCCCCCcc--cccCcccccCCCCCCcc Q lcl|NC_016071. 417 DMPKLKPGLIQE------VDMEGFSKFVQRIGAVGYLPKTPTVINKILEVG-GFDEEIPED--MSTDELLKLLGQDTSRS 487 (516) Q Consensus 417 ~~P~~~~~~~~~------~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~-Glp~~~~~~--~~~~~~~~~~~~~~~~~ 487 (516) --+.+.|+...+ .+.+-+.+++..|..+-=.+-...+.+||++.+ .++..+-.+ ...+.+.+ .+-...|. T Consensus 433 i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~e~k~I~~E~~-~~~~~~p~ 511 (516) T protein:vir:10 433 QINNIKVNFHQDSYYTELKDIETLRLRVDALSQIEPYVGKYVSHDYVMKNILQMTEEQIAQEEKQIEQEAG-IKRFQNPE 511 (516) T ss_pred HhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhcCCHhhHHHHHHHHHHhhh-CCCCCCCC Confidence 113344443333 333444445555443321122245578887654 555321111 11111111 11001111 Q ss_pred cccccccCCCC Q lcl|NC_016071. 488 GDGMTAGSNGN 498 (516) Q Consensus 488 ~~~~~~~~~~~ 498 (516) .+. .+ T Consensus 512 ~~~------~f 516 (516) T protein:vir:10 512 NED------DF 516 (516) T ss_pred ccc------cC Confidence 000 00 No 197 >protein:vir:78083 Length: 537 # NCBI annotation: gp3 # Family: family:all:125 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468787;genbank:gi:157325368;genbank:GeneID:5601845 Probab=86.85 E-value=0.043 Score=28.09 Aligned_cols=425 Identities=9% Similarity=0.002 Sum_probs=161.7 Q ss_pred hcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHH---HHh------------------------hChHHHH Q lcl|NC_016071. 
17 ENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVE---AMK------------------------QDHTVST 69 (516) Q Consensus 17 ~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~---~m~------------------------~D~~v~s 69 (516) -.|.+--+...+++..-.+.+.......+.+.++. ..+.|+ +++ ......- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~--~~~YY~g~h~Il~r~~~~~~~~~~~~~d~~~~nnki~~nf~k~ 78 (537) T protein:vir:78 1 MTSPLLNKPIDQLGGLLNTEITTYMASNHIKWAHI--GENYYNQENDIEKSRIFYMNDKGQLREDNYASNVKISHGFFTE 78 (537) T ss_pred CCcccccccHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHhcccchhhhcccccccccccccccccccccccccchHHH Confidence 11111111112222222222222211112222110 111111 000 1233334 Q ss_pred HHHHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHH-HHHHHhhcceeeeEEEeecccccccc Q lcl|NC_016071. 70 ALDTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARS-AATFNEYGFSIFEKVYRTESAPSKYA 148 (516) Q Consensus 70 ~l~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~-~lda~~~G~S~~Eivw~~~~~~~~~~ 148 (516) .+.+...-+.+.+..+.+ ++..++++.+.+...+++ .|.+.+.+ ..++.-||.+ .|.+|... + T Consensus 79 Ivd~~~~yl~G~Pv~~~~----~d~~~~e~~~~l~~~~~~-----~~~~~~~el~~~~s~~G~a-y~~~y~de------~ 142 (537) T protein:vir:78 79 LVDQLAQYLLSNGVEVKV----KDEDNTQLDEILQEYFDE-----DFQATIDTLVTNASKKGFE-GIFARTTS------E 142 (537) T ss_pred HHHHHhhhhcccCceeec----CcchhHHHHHHHHHHhhc-----cHHHHHHHHHHHHhhcCee-EEEeeecC------C Confidence 455555566676666653 334455666777665532 25555444 5578889986 56777544 3 Q ss_pred cceeeccccccCchhcccccceeecCCCceeeeccccccccccccc------cccccccccccccccccC---------- Q lcl|NC_016071. 149 GYITIDKIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQN------GLTQISSAMSLVTNLTSS---------- 212 (516) Q Consensus 149 g~~~~~~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~------~~~~~~~~~~~~~~~~~~---------- 212 (516) |.+.+..+.+.. + .-.|++.+...-.++-.......... ....+-.+.....+.... 
T Consensus 143 ~~~~~~~i~p~~---~----~pv~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~i~~y~~~~~~~~~~~~~~ 215 (537) T protein:vir:78 143 GKLKFQTVDGLT---L----IPVFDDYGVLKMIIRWYSEIRYSTKQQSTETIWHADVWNEEAVCYYIQDDEGVSTTYKLD 215 (537) T ss_pred CceEEEEEccce---e----EEEEcCCCCceeEEEEEeeeeccccccCcceEEEEEEEcCCcEEEEEecCCccccccccc Confidence 444443333221 1 11344444433222211000000000 000000000000000000 Q ss_pred --CCcccc-------------------------cccc--EEEEeecCcCCccccchhHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_016071. 213 --ADEVFI-------------------------PINK--LMVMSLGGTESNPAGVSPLVGCYRAFREKILIENLETIGAS 263 (516) Q Consensus 213 --~~~~~i-------------------------P~~k--~i~~~~~~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~e 263 (516) ..+.+| |..+ ++.|+ +|..|.|.+..+-...=-=...+..-+-.++ T Consensus 216 ~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~f~-----nn~~~~sd~e~v~~LiDayd~~~S~~an~~~ 290 (537) T protein:vir:78 216 EAYNPNPAPHVLAIEESTDADFEDTDGYQVLGRSYSKFPFQLLY-----NNKDGMSDVKRVKSIIDDYDVMNCFLSNNLQ 290 (537) T ss_pred ccccccccceeeeccccccccccccccccccccCCcceeEEEec-----cCccCCCchhhhHHHHHHHHHHHHhhhhHHH Confidence 000000 1111 12222 3567788887654322222334455566778 Q ss_pred hccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEec-cCcccccccccceeeeeccccCcchhH Q lcl|NC_016071. 264 KDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILP-SDMNAQGGEQYKMSLKGIDGAGKQYST 342 (516) Q Consensus 264 r~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP-~g~~i~~~e~~~iel~~~~g~g~~~~~ 342 (516) .|..+++++++..+ .. ..+....+. +..+ +-++ .+ ..++++..... ...+ T Consensus 291 ~~~~~ilvi~g~~~--------~~--~~~~~~~l~--------~~~~-i~v~~d~--------~~v~~l~~~~~--~~~~ 341 (537) T protein:vir:78 291 DFSEAIYVVKGFSG--------DS--TDKLRQNIK--------AKKM-IGVNGDN--------AGMEIQTVSIP--YEAR 341 (537) T ss_pred HhcCceeeeecCCC--------cc--chhHHHHHh--------hcCc-eeecCCC--------CceeEEEecCC--HHHH Confidence 88888888876311 11 111111111 1111 1122 22 23555544432 2235 Q ss_pred HHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHh---cCCcCCc Q lcl|NC_016071. 
343 QELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQSIHGHFVQRDID----IIVEAFNKNLIPQLLAL---NDIRLSD 415 (516) Q Consensus 343 ~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~ev~~~~~~aDa~----~i~~~ln~~li~~lv~l---N~~~~~~ 415 (516) ..+++++.+.|-+. +++..++..+.| - ++.+.-..+...+...|. .+...|.+ +++.++.+ .+..-.+ T Consensus 342 e~~ld~L~~~I~~~--s~~~~~~~~~~g-n-~SGvAlk~~~~~l~~ka~~ke~~f~~~l~~-~~~~i~~~~~~~~~~~~d 416 (537) T protein:vir:78 342 KAKMDIDVENIYRS--GMGFNSTAVGDG-N-VTNVVIKSRYTLLAMKARKMETSLRKVLRW-CADMVVSDIALRGLGEYD 416 (537) T ss_pred HHHHHHHHHHHHHh--cCCCCCcccccc-C-CcHHHHHHHHhhHHHHHHHHHHHHHHHHHH-HHHHHHHHHhhcCCcccc Confidence 67888888888654 344333322222 2 233333333333333333 33444432 33333332 2111112 Q ss_pred cccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCC---------------------Cccccc- Q lcl|NC_016071. 416 EDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEI---------------------PEDMST- 473 (516) Q Consensus 416 ~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~---------------------~~~~~~- 473 (516) ..-..+.|...-+.|..+.++.+++|++.|.+. ++-+.+.++.-... .+.+.. T Consensus 417 ~~~i~i~f~~~~P~n~~e~a~~~~~l~~~giiS-----~eT~l~~~p~vdd~e~ek~~~ee~~~~~~~~~~~~~~~~~~~ 491 (537) T protein:vir:78 417 SNDICFEIEPHVLANELDIATTRKTEAETEALK-----IGNIMTVAPRIGDDETLKLIAEELDLDYNELKDALAEQDAQS 491 (537) T ss_pred cceeeEEeccCCCCCHHHHHHHHHHHHhcCcch-----HHHHHHhCCCCCCHHHHHHHHHHHHhhhhhhhhhhhhhcccc Confidence 234678899999999999999999999999654 22333333321100 000000 Q ss_pred -----C--ccccc-CCC-CCCcccccccccCCCCCcccccccccch Q lcl|NC_016071. 474 -----D--ELLKL-LGQ-DTSRSGDGMTAGSNGNGTGKISSTRDNS 510 (516) Q Consensus 474 -----~--~~~~~-~~~-~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 510 (516) . 
....+ ..+ .+.|..+.-+.|.|.++++..|.+.--- T Consensus 492 ~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~ 537 (537) T protein:vir:78 492 LDVSPDVQAMLDGLPVNANQPPVDPNQPVADPNVVPPTDPNAVPQT 537 (537) T ss_pred cCcCcchhhhcCCCCCCCCCCCCCccCCCCCCCCCCCCCCccCCCC Confidence 0 00000 000 0011111111122222222211111000 No 198 >protein:vir:106999 Length: 564 # NCBI annotation: portal vertex protein gp20 # Family: family:all:1036 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195138;genbank:gi:58532915;interpro:IPR010823;uniprot:Q5GQN4;genbank:GeneID:3260496 Probab=86.55 E-value=0.045 Score=27.98 Aligned_cols=473 Identities=13% Similarity=0.054 Sum_probs=161.0 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHH-HHHHHhhcccccC-CcccHHHHHHHhhChHHHHHHHHHHHHH Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQL-RAESEVMKVEELR-WPCFLATVEAMKQDHTVSTALDTKYVFV 78 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~-~~~~~~~~~~~lr-~~~~~~~y~~m~~D~~v~s~l~~Rk~~v 78 (516) ||.=+-=.-+-. .+...+|..|....+-++.....+ +.+.+..-.-+.+ ..++|+.|++|..++.|-++++-+-.-+ T Consensus 1 m~~lfgf~i~~~-~~~~~~S~vpp~~~~~~~~i~~g~~g~~v~~~g~~~~~n~~eLI~~YR~ma~~pEVd~Av~eIVnea 79 (564) T protein:vir:10 1 MSQLFGFLINEK-EGQKGQSPVPPNDEASVSTVAGGYFGTYVDTSGGQNSRNEYELIRRYRDMSLHPEVDSAIDEIVNEF 79 (564) T ss_pred Ccchhcceeeee-ccCCCCCcccCCcCCChhhhhccccceeeecccccchhhHHHHHHHHHHHhhccchhhHHHHhhcce Confidence 544321111100 011112211111111111100011 1111111101111 2368999999999999999999876532 Q ss_pred hcC-Cce--eeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHHHHHhhcceeeeEEEeecccccccccceeecc Q lcl|NC_016071. 79 TKA-FND--FKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAATFNEYGFSIFEKVYRTESAPSKYAGYITIDK 155 (516) Q Consensus 79 ~~~-~w~--i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~lda~~~G~S~~Eivw~~~~~~~~~~g~~~~~~ 155 (516) .-. +-. |++.- .+.+.++.+-+-|.+. 
|+ .+..+|+.--+||..+= .|+-||++..++ T Consensus 80 Iv~d~~~~pV~vdL-~~~~~s~siK~kI~eE---------F~-~Il~ll~F~~~~~e~fR--------~WYVDgRi~fHk 140 (564) T protein:vir:10 80 VVNDGDDKPVEVDL-QNLEIGSGVKKKIRDE---------FN-RILRMMNFNVNAHEIIR--------NWYVDGRSHYHK 140 (564) T ss_pred eEecCCCceEEEEe-cccCcchHHHHHHHHH---------HH-HHHHHhccchhhhHHHh--------hhhhcceEEEEE Confidence 111 000 01110 0112233333333332 22 23355555555655542 233344444443 Q ss_pred -------------ccccCchhcccccceeecCC--Cceeeeccc------cccccccccccccccccccccccccccCCC Q lcl|NC_016071. 156 -------------IAFRPQSSLSRSKPWVFDED--GRTLKGIYQ------SKMAFANFQNGLTQISSAMSLVTNLTSSAD 214 (516) Q Consensus 156 -------------l~~r~q~ti~~~~~f~~~~d--g~~l~~~~q------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 214 (516) |...+|..|++.+-...+.+ +..+..-.. ....+..+.+-......+............ T Consensus 141 iid~~~pk~GI~eLr~lDPr~i~~vr~i~~~~~~~~~~v~k~~~~~~~y~~~~Eyy~Ynp~~~~g~~~~~~~~~~~~~~~ 220 (564) T protein:vir:10 141 VIDLDNPKKGILELRYIDSLKIRKVRQKLKDVDPNRKEIEKGTALQYDYGDFIEYYIYNPKGFAGNIPMVTGSMDWSNQE 220 (564) T ss_pred EeeCCChhhhhhhhhhhcccceeeeeeeccccccccceeeeeeeeeccccccccceeeccccccCccccccccccccccc Confidence 33333444443332222221 111111000 000111111000000111111111122345 Q ss_pred ccccccccEEEEeecCc--CCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccC--CCCHHH Q lcl|NC_016071. 215 EVFIPINKLMVMSLGGT--ESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAI--DPKSPE 290 (516) Q Consensus 215 ~~~iP~~k~i~~~~~~~--~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~--~~~~~~ 290 (516) ++.||.+ .|+|+|..- .++..=.|.|+++..++==-+.....-.++ .+-.+|.+|+=+-.- =|.... 
T Consensus 221 ~ikI~~d-aI~y~hSGL~d~~~~~i~gyLhkAIKp~NQLkmlEDAlVIY--------RitRAPeRRvFYIDVGnLPk~KA 291 (564) T protein:vir:10 221 GIKIASD-AIAQSTSGLMDLNKKMTLSFLHKAIKSLNQLRMIEDSLVIY--------RLSRAPERRIFYIDVGNLPKVKA 291 (564) T ss_pred ceeechh-hcceecccceeCCCCceeccchhhhHhHHhhHHHHhhHHHH--------hhhccccceEEEEecCCCCchhH Confidence 6788875 567777532 233334578888888764333322222221 122222222211111 122222 Q ss_pred HHHHHHHHHHHHHhhc----ccceEE--------------EeccCcccccccccceeeeeccccCcchhHHHHHHHHHHH Q lcl|NC_016071. 291 SEMVQGLMADAANAHA----GEQAYF--------------ILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKA 352 (516) Q Consensus 291 ~~~l~~l~~~~~~~~~----g~~a~~--------------iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~ 352 (516) ++. +..++..++. ....|- -||.= +.....+|+-+ .|+.... -..=|+|..+. T Consensus 292 eqY---lr~iM~k~KNklVYDa~TGevrddrk~msMlEDyWLPRR---eGgrgTEItTL--pGgqnLg-em~DV~YF~kK 362 (564) T protein:vir:10 292 EQY---LRDVMSRYRNKLVYDGQTGEIRDDKKHMSMLEDFWLPRR---EGGRGTEITTL--PGGQNLG-ELKDVEYFKKK 362 (564) T ss_pred HHH---HHHHHHhcCceEEEeccCceecccchhhhhHhhhccccc---CCCcccceeec--cccCCcc-hHHHHHHHHHH Confidence 222 3333332221 011111 12210 00111233333 2222222 23457999999 Q ss_pred HHHHHhcccccccCCcc-chh-hHHHH-HHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCc---cccceEEecC Q lcl|NC_016071. 353 ILDRFGAGFINLGNDGQ-GSY-NLSES-KQSI-HGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSD---EDMPKLKPGL 425 (516) Q Consensus 353 Isk~iLGqtLts~~~~~-GS~-Al~~v-h~ev-~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~---~~~P~~~~~~ 425 (516) +-+++--..--.+.+++ -+. ..+++ -.|+ |...+..-...+...|..-|-..|+.=+ .--+. .--..+.|+. 
T Consensus 363 LY~aLnVP~SRl~~e~~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLiLKg-iit~eeW~~i~~~I~~~f 441 (564) T protein:vir:10 363 LYNSLNLPPSRLTDDNKAFNLGKSTEILRDELKFTKFIGRLRKRFAQLFHDILKTQLILKG-IITPEDWDDMEEHIQYDF 441 (564) T ss_pred HHHHhCCCcccccCCCceeecccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcc-CCCHHHHHHHhhcceEEe Confidence 99988877644443431 111 11232 2233 3333444444444445433333333211 11110 1112344433 Q ss_pred cCc------hhHHHHHHHHHHHHhCCcccccHHHHHHHHHHc-CCC---------------------CCCCcccccCccc Q lcl|NC_016071. 426 IQE------VDMEGFSKFVQRIGAVGYLPKTPTVINKILEVG-GFD---------------------EEIPEDMSTDELL 477 (516) Q Consensus 426 ~~~------~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~-Glp---------------------~~~~~~~~~~~~~ 477 (516) ..+ .+.+-+.+++..|..+--.+-...+.+||++.+ .+. +|.+.++.- .+ T Consensus 442 ~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~ILr~tDeei~~~~kqI~~E~k~~~~~~P~e~~~~~--~~ 519 (564) T protein:vir:10 442 LFDNHFNELKEQEMQLQRVNLATQMDPFVGKYFSTEYIRRKILMQTENEFKEIDKQMKSDIESGLAIDPIQVNMLD--DM 519 (564) T ss_pred eecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCchhhhcCC--Cc Confidence 332 344444556666555421222234567776543 321 111111110 00 Q ss_pred ccCCCCCCcccccccccCC-------CCCcccccccc-cchhhhh Q lcl|NC_016071. 478 KLLGQDTSRSGDGMTAGSN-------GNGTGKISSTR-DNSVSNM 514 (516) Q Consensus 478 ~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~-d~~~~~~ 514 (516) ...+.+-.|.-.++..-.+ .++.++.+... ..+..|. 
T Consensus 520 ~~~~~~~~p~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~ 564 (564) T protein:vir:10 520 EKQNQAFAPELQAAQDDLAAEREIKKLNSAPKPPPSQQSKSQSNK 564 (564) T ss_pred cCCCCcCCcchhhhccccccccChhhhccCCCCCCCCCCcCcCCC Confidence 1011110111111110000 00001111000 0011111 No 199 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=86.52 E-value=0.045 Score=27.97 Aligned_cols=421 Identities=10% Similarity=0.040 Sum_probs=161.3 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhh--cccccCCcccHHHHH------------------H Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVM--KVEELRWPCFLATVE------------------A 60 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~--~~~~lr~~~~~~~y~------------------~ 60 (516) |..-+..+........-...+- +.+.+--..+..+++.. +.+.++ +..+.|+ + T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~i~~~i~~~~~~~~~~~--~~~~Yy~g~~~i~~r~~~~~~~~~~~ 73 (474) T protein:vir:95 1 MFNIIRMPWDKPYGEEVVEQLK-----PQFETQEEMIIRLIDDHRKQLDKIT--VGQRYYDKDNDIVKQMKKVDVYGNID 73 (474) T ss_pred CcceeecCCCCchhhHHHHhhh-----hccCChHHHHHHHHHHHHHHHHHHH--HHHHHhcccCchhccccccccccccc Confidence 5554444433222111011110 11111111122221110 000000 0001110 0 Q ss_pred H-h-----hChHHHHHHHHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHH-HHHHHhhccee Q lcl|NC_016071. 61 M-K-----QDHTVSTALDTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARS-AATFNEYGFSI 133 (516) Q Consensus 61 m-~-----~D~~v~s~l~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~-~lda~~~G~S~ 133 (516) . + ......-++.+....+.+-+..+.+ .+++..++++.++++ .|.+.+.. 
..++.-||.+ T Consensus 74 ~~~~~~ki~~n~~~~Ivd~~~~~l~g~p~~~~~-------~d~~~~~~l~~~~~n-----~~~~~~~e~~~~~~~~G~~- 140 (474) T protein:vir:95 74 YDKPDWRITTNFHQNLVDQKVSYVASKPVTYSC-------EDESVLKIIHDVLDT-----RWDNKLIDILTATSNKGID- 140 (474) T ss_pred cccccceeccchHHHHHHHHHhhhccCCceecc-------CchHHHHHHHHHHhc-----cHHHHHHHHHHHHhhcCcE- Confidence 0 0 0222333344444445555555432 223455677777652 25555554 5578889985 Q ss_pred eeEEEeecccccccccceeeccccccCchhcccccceeecC--CCceeeeccccccc----cccccccc----ccccccc Q lcl|NC_016071. 134 FEKVYRTESAPSKYAGYITIDKIAFRPQSSLSRSKPWVFDE--DGRTLKGIYQSKMA----FANFQNGL----TQISSAM 203 (516) Q Consensus 134 ~Eivw~~~~~~~~~~g~~~~~~l~~r~q~ti~~~~~f~~~~--dg~~l~~~~q~~~~----~~~~~~~~----~~~~~~~ 203 (516) ++++|... +|.+.+.-+.|.. .. ..|++ .++.+..++..... ...|.... ....... T Consensus 141 ~~~v~~d~------~~~~~i~~~~p~~---~~----~v~d~~~~~~~~~~i~~~~~~~~~~~~~y~~~~~~~~~~~~~~~ 207 (474) T protein:vir:95 141 WLQVYINE------NGEMKLFRVPAEQ---AI----PIWVDKEREELKSFIRYYKFNNEEKVEFWTDTTVTYYVLENGGL 207 (474) T ss_pred EEEEEecC------CCceEEEEEcccc---eE----EEEcCCCCCceEEEEEEEEEcCeeEEEEEeCCeEEEEEEcCCcc Confidence 46777543 2334333322221 10 11221 12222222211000 00000000 0000000 Q ss_pred cccc-----ccccCCCccccccccEEEEeecCcCCccccchhHHHHHHHHHHH-HHHHHHHHHHHhhccccceeeeeccc Q lcl|NC_016071. 204 SLVT-----NLTSSADEVFIPINKLMVMSLGGTESNPAGVSPLVGCYRAFREK-ILIENLETIGASKDLGGIIELKIPSQ 277 (516) Q Consensus 204 ~~~~-----~~~~~~~~~~iP~~k~i~~~~~~~~g~p~G~gLlr~~~~~~~fK-~~~~~~w~~~~er~g~~~~v~~~pp~ 277 (516) .... ..........+..--++.|. 
+|+.|.|.+..+- +.+-- +..+..++..++.+..|..++++..+ T Consensus 208 ~~~~~~~~~~~~~~~~~~~~g~iPvv~~~-----nn~~g~sd~e~v~-~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~ 281 (474) T protein:vir:95 208 IPDYYYGANHIQSHFSNGNWGRVPFIAFK-----NNPEEVSDIWMYK-SLIDAIDKRLSDAQNMFDESVELIYILKGYEG 281 (474) T ss_pred ccccccCcccccccccccCCCccceEeec-----CCCCCCCcHHHHH-HHHHHHHHHHHHHHHHHHHhcCceeeeecCCc Confidence 0000 00000000011111123332 4788999988743 33332 44666777778888888888775321 Q ss_pred ccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHH Q lcl|NC_016071. 278 ILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRF 357 (516) Q Consensus 278 ~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~i 357 (516) .+.. ..... .. ....+.++.+.+ ++++.... ....+...++.+.+.|...- T Consensus 282 ------~~~~----~~~~~----~~-----~~~~i~~~~~~~--------~~~l~~~~--~~~~~~~~~~~l~~~i~~~s 332 (474) T protein:vir:95 282 ------QDLE----EFMRG----LK-----YYKAINVDGDGG--------VETIQVEV--PVSSTKEYIDLMRAYIMEFG 332 (474) T ss_pred ------ccch----hhhhh----hh-----ccceeeccCCCc--------eeEEeecC--CHHHHHHHHHHHHHHHHHHh Confidence 1111 01111 11 111233455543 44443332 23347778888888888765 Q ss_pred hcccccccCCccchhhHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHH Q lcl|NC_016071. 358 GAGFINLGNDGQGSYNLSESKQSIH----GHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEG 433 (516) Q Consensus 358 LGqtLts~~~~~GS~Al~~vh~ev~----~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~ 433 (516) -+..++.++.++. ++.+.-+.. ...+..-.+.+...| +++++.++.+.+... 
+..-..+.|....+.|..+ T Consensus 333 ~~p~~~~~~~~~n---~Sg~Alk~~~~~l~~k~~~k~~~~~~~l-~~~~~li~~~~g~~~-d~~~i~v~f~~~~p~d~~e 407 (474) T protein:vir:95 333 QGVDFQTDKFGSA---PSGIALKFLYGNLDLKANKLKNKATVAI-QELIGFIIDFNNLKM-DVKDIEISFNFNRMMNDAE 407 (474) T ss_pred CCccccccccccc---chHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhCCCc-ccceeeEEeccCCCcCHHH Confidence 5544554332211 122121111 112233334555556 457777777664322 2223467788888888776 Q ss_pred HHHHHHHHHhCCcccccHHHHHHHHHHcC-CCCCCCcccccCcccc---cCCCCCCcc-ccccc-ccCCCCCccc Q lcl|NC_016071. 434 FSKFVQRIGAVGYLPKTPTVINKILEVGG-FDEEIPEDMSTDELLK---LLGQDTSRS-GDGMT-AGSNGNGTGK 502 (516) Q Consensus 434 ~a~~~~~L~~~G~~~~~~~~~~~i~e~~G-lp~~~~~~~~~~~~~~---~~~~~~~~~-~~~~~-~~~~~~~~~~ 502 (516) .++++ +.+|++. .+.+.+.++ ++.+..+-+-...+.. ...+..... .++.. ...+++.+++ T Consensus 408 ~a~~~---~~~g~iS-----~et~i~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~ 474 (474) T protein:vir:95 408 QSQII---AQSQYLS-----RETLVKSSPLVDDYKAELERIEQEQMEYNKQLPNLDDGGADGAQQQERSNDKESE 474 (474) T ss_pred HHHHH---HhcCCCc-----hHHHHHhCCCCCCHHHHHHHHHHHHHHHHhcccccccccCCCCcCCCCCccCCCC Confidence 66654 4568643 344555664 3332211111111100 011111110 00000 0111111111 No 200 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=85.94 E-value=0.049 Score=27.76 Aligned_cols=428 Identities=10% Similarity=0.006 Sum_probs=155.7 Q ss_pred CCc--cccCcc---cccchhhhcccCCC-CcccccchH-----HHHHHHHHHHhhcccccCCccc----HHHHHHH---- Q lcl|NC_016071. 1 MST--RFAQPS---EVVKAGNENLAVSR-LRTGELGSG-----ALSQLRAESEVMKVEELRWPCF----LATVEAM---- 61 (516) Q Consensus 1 ~~~--r~~~~~---~~~~~~~~~p~~~~-~~~~e~g~~-----~~~~~~~~~~~~~~~~lr~~~~----~~~y~~m---- 61 (516) |.. +.-... .+.+.-...-..+. +-..-|=.- -+.....+-... .+-+..+.. ...+... 
T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~Yy~g~-~~i~~~~~~~~~~~~~~~~~~~~k 79 (474) T protein:vir:96 1 MIVIFWPNEKPYHERVVEQIKPKYETQEEMIIRLINDHKPKIDDITVGERYYNHD-PDVLRLAPKLDNKGEIDPLKPDWR 79 (474) T ss_pred CeeeccCCCchhhhhHHHHhhhccCChHHHHHHHHHHHHHHHHHHHHHHHHhccC-Ccchhccchhcccccccccccchh Confidence 110 000000 00000000000000 000000000 000000000000 000000000 0000000 Q ss_pred hhChHHHHHHHHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHH-HHHHHhhcceeeeEEEee Q lcl|NC_016071. 62 KQDHTVSTALDTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARS-AATFNEYGFSIFEKVYRT 140 (516) Q Consensus 62 ~~D~~v~s~l~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~-~lda~~~G~S~~Eivw~~ 140 (516) .-.+...-++.+....+.+-+..+.+. +.+..+.+.+++++ .|.+.+.. ..++.-+|.+. +++|.- T Consensus 80 i~~n~~~~Ivd~~~~~l~g~p~~~~~~-------d~~~~~~l~~~~~n-----~~~~~~~~~~~~~~~~G~~~-~~~y~d 146 (474) T protein:vir:96 80 MFTNYHQNLVDQKVAYAVANPVTFSSD-------DDKSLKTIQEVLNH-----KWDDKLVDILTAASNKGIEW-LQPYID 146 (474) T ss_pred cccchHHHHHHhhhhhhcccCceeecC-------chHHHHHHHHHHhc-----CHHHHHHHHHHHHHhcCeeE-EEEEec Confidence 002333344444445555555555431 23445666666653 24444544 45688899965 677754 Q ss_pred cccccccccceeeccccccCchhcccccceeecC--CCceeeecccccccccc----cc----------ccccccccccc Q lcl|NC_016071. 141 ESAPSKYAGYITIDKIAFRPQSSLSRSKPWVFDE--DGRTLKGIYQSKMAFAN----FQ----------NGLTQISSAMS 204 (516) Q Consensus 141 ~~~~~~~~g~~~~~~l~~r~q~ti~~~~~f~~~~--dg~~l~~~~q~~~~~~~----~~----------~~~~~~~~~~~ 204 (516) . +|.+.+..+.|+. +. -.|++ .+..+..++........ |. .+......... 
T Consensus 147 ~------~~~~~i~~~~p~~---~~----~v~d~~~~~~~~~~vr~~~~~~~~~~~~yt~~~v~~~~~~~~~~~~~~~~~ 213 (474) T protein:vir:96 147 E------NGEFKTFRVPAEQ---AI----PIWTNKERDTLKAFIRYYRLDGAERVEYWTDSDVTYYEYQDGILIPDYYHG 213 (474) T ss_pred C------CCceEEEEEcccc---eE----EEEcCCCCCceEEEEEEEeecCceEEEEEeCCeEEEEEecCCceeeccccc Confidence 3 3444443333321 10 12221 22232222211110000 00 00000000000 Q ss_pred cc-cccccCCCcccccccc--EEEEeecCcCCccccchhHHHHHHHHHHH-HHHHHHHHHHHhhccccceeeeecccccc Q lcl|NC_016071. 205 LV-TNLTSSADEVFIPINK--LMVMSLGGTESNPAGVSPLVGCYRAFREK-ILIENLETIGASKDLGGIIELKIPSQILN 280 (516) Q Consensus 205 ~~-~~~~~~~~~~~iP~~k--~i~~~~~~~~g~p~G~gLlr~~~~~~~fK-~~~~~~w~~~~er~g~~~~v~~~pp~~~~ 280 (516) .. ..........+-+..+ ++.|+ +|+.|.|.+..+ .+.+-- +..+..++..++.+..|+.++++.-+ T Consensus 214 ~~~~~~~~~~~~~~~~~g~iPvv~~~-----nn~~g~sd~e~v-~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~--- 284 (474) T protein:vir:96 214 EEHIQSHYYVGNKRVSWGRVPFIPFK-----NNPQEMSDLFMY-KTIIDAMDKRLSDTQNTFDESTELIYILKGYEG--- 284 (474) T ss_pred cccccccccccccccCCCceeEEEec-----cCCCCCCcHHHH-HHHHHHHHHHHHHHHHHHHHhccceeeeecCCc--- Confidence 00 0000000011112222 23332 467889988774 333322 33566788888899999888775311 Q ss_pred cccCCCCHHHHHHHHHHHHHHHHhhcccceEEEec-cCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhc Q lcl|NC_016071. 281 KAAIDPKSPESEMVQGLMADAANAHAGEQAYFILP-SDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGA 359 (516) Q Consensus 281 k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP-~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLG 359 (516) .+... .. .+. .....+.+| .|. +++++..+.. 
...+...++.+.+.|.+.--+ T Consensus 285 ---~~~~~----~~-------~~~--~~~~~i~~~~~~~--------~~~~l~~~~~--~~~~~~~~~~l~~~i~~~s~~ 338 (474) T protein:vir:96 285 ---QDLDE----FM-------RNL--KYYKAINVDGDGS--------GVDTIQIEVP--VQSSKEYLDMLRDYVIEFGQG 338 (474) T ss_pred ---ccccc----hh-------hhh--hcCceEEecCCCC--------ceeEEeecCC--hHHHHHHHHHHHHHHHHHhCC Confidence 11111 11 111 111223334 232 3555544332 223667888888888886655 Q ss_pred ccccccCCccchhhHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHH Q lcl|NC_016071. 360 GFINLGNDGQGSYNLSESKQSIHGH----FVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFS 435 (516) Q Consensus 360 qtLts~~~~~GS~Al~~vh~ev~~~----~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a 435 (516) ..++.+..+ +- ++.+..+.... .+..-.+.+...| +++++.++.+.+.... ..-..+.|....+.|..+++ T Consensus 339 p~~~~~~~~--~n-~Sg~Al~~~~~~l~~k~~~k~~~~~~~l-~~~~~~i~~~~~~~~~-~~~i~i~f~~~~p~~~~e~~ 413 (474) T protein:vir:96 339 VDFQQDKFG--NS-PSGIALKFMYSNLDLKANKLKNKTLTAL-QELLQYIIDFYKLNIK-VQDVEITFNFNVMVNELEQS 413 (474) T ss_pred ccccccccc--cc-cHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhCCCcc-cceeeEEeccCCCcCHHHHH Confidence 555543322 11 22222222222 2223334555566 3467777776532222 22246778888888877666 Q ss_pred HHHHHHHhCCcccccHHHHHHHHHHcC-CCCCCCcccccCcccccCCCCCCcccccccccCCCCCcccccccccch Q lcl|NC_016071. 436 KFVQRIGAVGYLPKTPTVINKILEVGG-FDEEIPEDMSTDELLKLLGQDTSRSGDGMTAGSNGNGTGKISSTRDNS 510 (516) Q Consensus 436 ~~~~~L~~~G~~~~~~~~~~~i~e~~G-lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 510 (516) +. ++.+|.+. ++.+.+.++ ++.+..+-+-...+.....+...+. 
.+...+ ...-..+.|+ T Consensus 414 ~~---~~~ag~iS-----~et~~~~~~~v~d~~~E~~ri~~E~~e~~~~~~~~-~~~~~~------~~~d~~~e~~ 474 (474) T protein:vir:96 414 QI---GVQSQYLS-----KETVVTNHPWVDDPVAELERIEQDNIDFNKQLPPL-EGDANG------RAQDNESETN 474 (474) T ss_pred HH---HHhcCCCc-----hHHHHHhCCCCCCHHHHHHHHHHHHHHHHhccccc-cccccc------ccCCCcccCC Confidence 54 55678643 344555654 3322211111111111111111111 100000 0000111111 No 201 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=85.77 E-value=0.05 Score=27.70 Aligned_cols=425 Identities=11% Similarity=0.040 Sum_probs=159.9 Q ss_pred CCc-cccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhh--cccccCCcccHHHHH--------------HHh- Q lcl|NC_016071. 1 MST-RFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVM--KVEELRWPCFLATVE--------------AMK- 62 (516) Q Consensus 1 ~~~-r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~--~~~~lr~~~~~~~y~--------------~m~- 62 (516) |.. +-.--.++.+.- +-.++ +-+.+--.-+..++... +.+.+ -+..+.|+ ... T Consensus 1 ~~~~~~~~~~~~~~~~-----~~~~~--~~~~~~~~~i~~~i~~~~~~~~r~--~~~~~Yy~g~~~i~~~~~~~~~~~~~ 71 (478) T protein:vir:10 1 MISINWPWDKPYHEQV-----VEQIK--PKYETQEEMILRLVREHKENIDNI--TMGERYYNHHPDILDAPFKRDVNGDY 71 (478) T ss_pred CccccccCCchhhhHH-----HHHhh--hccCChHHHHHHHHHHHHHHHHHH--HHHHHHhcccccccccchhhhccccc Confidence 211 111111111000 00000 00111000111111110 00000 00011110 000 Q ss_pred ---------hChHHHHHHHHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHH-HHHHHhhcce Q lcl|NC_016071. 63 ---------QDHTVSTALDTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARS-AATFNEYGFS 132 (516) Q Consensus 63 ---------~D~~v~s~l~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~-~lda~~~G~S 132 (516) ..+...-++.+....+.+-+..+.+ + +.++.+.+.+++++ .|.+.+.. 
+.++.-||.+ T Consensus 72 ~~~~~~~ki~~n~~k~ivd~~~~yl~g~p~~~~~----~---~~~~~~~l~~~~~n-----~~~~~~~~~~~~~~~~G~~ 139 (478) T protein:vir:10 72 DETKPDWRMYTNYHQNLVDQKVAYAVANPVTFGV----D---NDKALKQIQHTLNH-----KWDDKLVDILTAASNKGIE 139 (478) T ss_pred ccccccceeccchHHHHHHHHhhhhcccCceeec----C---ChHHHHHHHHHHhc-----cHHHHHHHHHHHHhhCCeE Confidence 0233344444444445455444432 1 23445566666642 36666655 4578889986 Q ss_pred eeeEEEeecccccccccceeeccccccCchhcccccceeecC--CCceeeecccccccccc----ccccc-ccccccccc Q lcl|NC_016071. 133 IFEKVYRTESAPSKYAGYITIDKIAFRPQSSLSRSKPWVFDE--DGRTLKGIYQSKMAFAN----FQNGL-TQISSAMSL 205 (516) Q Consensus 133 ~~Eivw~~~~~~~~~~g~~~~~~l~~r~q~ti~~~~~f~~~~--dg~~l~~~~q~~~~~~~----~~~~~-~~~~~~~~~ 205 (516) . +.+|.-. +|.+.+..+.|+. +. ..|++ .++++..++........ |.... ......-+. T Consensus 140 ~-~~v~~d~------~~~~~~~~~~p~~---~~----~v~d~~~~~~~~~~ir~~~~~~~~~~~~y~~~~i~~~~~~~~~ 205 (478) T protein:vir:10 140 W-VQPYVDE------EGEFKTFRVPAEQ---AV----PIWTNKERDELQAFIRVYELDGAERVEYWTKDDVTFYELKEGQ 205 (478) T ss_pred E-EEEEecC------CCceEEEEEcccc---eE----EEEcCCCCCceEEEEEEEeeeCceEEEEEeCCcEEEEEecCCe Confidence 5 6777543 3444433333221 10 12221 23333332211110000 00000 000000000 Q ss_pred -cc---cccc------CCCcccccccc--EEEEeecCcCCccccchhHHHHHHHHHHH-HHHHHHHHHHHhhccccceee Q lcl|NC_016071. 206 -VT---NLTS------SADEVFIPINK--LMVMSLGGTESNPAGVSPLVGCYRAFREK-ILIENLETIGASKDLGGIIEL 272 (516) Q Consensus 206 -~~---~~~~------~~~~~~iP~~k--~i~~~~~~~~g~p~G~gLlr~~~~~~~fK-~~~~~~w~~~~er~g~~~~v~ 272 (516) .. .... 
...+.+-+..+ ++.|+ +|+.|.|.+..+ .+.+-- ...+..++..++.+..+++++ T Consensus 206 ~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~g~sd~e~v-~~liDa~~~~~S~~~~~~~~~~~~~~~~ 279 (478) T protein:vir:10 206 LIPDFYRSEDHIQPHYYQGNKLMSWGRVPFIPFK-----NNPQEVSDLFMY-KTIIDALDKRLSDTQNTFDESVELIYIL 279 (478) T ss_pred eeccccccccccccceecccccccCCcceEEEec-----cCCCCCCcHHHH-HHHHHHHHHHHHHHHHHHHHhhCcceee Confidence 00 0000 00011112222 23333 367789988874 333332 335566677778888888887 Q ss_pred eecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHH Q lcl|NC_016071. 273 KIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKA 352 (516) Q Consensus 273 ~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~ 352 (516) ++..+ .+.. +....+. + ..++.++.+ +...++++..+. ....+...++.+.+. T Consensus 280 ~g~~~------~~~~----~~~~~~~--------~-~~~~~~~~~------~~~~~~~l~~~~--~~~~~~~~~~~l~~~ 332 (478) T protein:vir:10 280 KGYEG------EDMK----DFMHNLK--------Y-YKAISVAGE------SGSGVDTIKVEV--PIDSVKEYTKMLRDY 332 (478) T ss_pred ecCCc------cccc----chhhhhh--------h-CceeEecCC------CCCcceEEeecC--CHHHHHHHHHHHHHH Confidence 76321 1111 1111111 1 112333321 123455554433 233467788888888 Q ss_pred HHHHHhcccccccCCccchhhHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCc Q lcl|NC_016071. 353 ILDRFGAGFINLGNDGQGSYNLSESKQSI----HGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQE 428 (516) Q Consensus 353 Isk~iLGqtLts~~~~~GS~Al~~vh~ev----~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~ 428 (516) |.+.--...++.++.++. ++.+.-+. ....+..-.+.+...| +++++.++.+.+... 
+..-..+.|....+ T Consensus 333 I~~~s~~p~~~~~~~~~n---~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l-~~~~~li~~~~~~~~-d~~~i~i~f~~~~p 407 (478) T protein:vir:10 333 IIEFGQGVDFQQDKFGNS---PSGIALKFMYSNLDLKANKLKNKTLTAL-QELLQYIIDFYRLDV-RVQDIEITFNFNVM 407 (478) T ss_pred HHHHhCCcCcCccccccc---hHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhCCCc-ccccceEEeCCCCC Confidence 877654444444332211 12222211 2222333344555556 346666776653322 22235788888888 Q ss_pred hhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcC-CCCCCCcccccCcccccCCCCCCcccccccccCCCCCcccccccc Q lcl|NC_016071. 429 VDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGG-FDEEIPEDMSTDELLKLLGQDTSRSGDGMTAGSNGNGTGKISSTR 507 (516) Q Consensus 429 ~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~G-lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 507 (516) .|..+.++.+.++ .|++ + ++.+.+.++ +..+..+-+-...+.....+....-. .+.+ +...++.. T Consensus 408 ~~~~e~~~~~~~~--~g~i-S----~et~i~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~------~~~~-d~~~~~~~ 473 (478) T protein:vir:10 408 VNELENSQIAMNS--TGLL-S----KETILGNHSWVQDPVAEMERIEQENIELNQQLPDIE------EGLN-DEQQRQSE 473 (478) T ss_pred CCHHHHHHHHHHH--hCCC-C----hHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhccccC------CCCc-ccccccCc Confidence 9999999998887 4653 3 344556664 33221110001111100000000000 0011 11112222 Q ss_pred cchhh Q lcl|NC_016071. 508 DNSVS 512 (516) Q Consensus 508 d~~~~ 512 (516) |+... T Consensus 474 d~~~e 478 (478) T protein:vir:10 474 DNQSE 478 (478) T ss_pred CCCCC Confidence 22222 No 202 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=84.27 E-value=0.062 Score=27.21 Aligned_cols=416 Identities=10% Similarity=0.018 Sum_probs=155.8 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccc-cCCcccHHH-----HHH----------H-hh Q lcl|NC_016071. 
1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEE-LRWPCFLAT-----VEA----------M-KQ 63 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~-lr~~~~~~~-----y~~----------m-~~ 63 (516) |..+.. ..-+.+--..+. ..+-. +.....+ -.-.++ +..+..... +.. . .. T Consensus 1 ~~~e~~-~~~i~~~~~~~~-------~~~~~--~~~~~~Y--y~g~hdi~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~ 68 (471) T protein:vir:10 1 MEIEVI-KKIISSQMVKHG-------KFVSQ--AAEAEKY--YRNENDIKRKRKPADKKGAENEAKAEDNAFRNADNRIS 68 (471) T ss_pred CCHHHH-HHHHHHHHHHHH-------HHHHH--HHHHHHH--hccccccccccchhhhhcccccccccccccccccceec Confidence 222221 000000000000 00000 0000000 000011 100000000 000 0 01 Q ss_pred ChHHHHHHHHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHH-HHHhhcceeeeEEEeecc Q lcl|NC_016071. 64 DHTVSTALDTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTES 142 (516) Q Consensus 64 D~~v~s~l~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~ 142 (516) .+...-++.+....+.+-+..+.+ + +.+..++++.++++ .|.+....+. ++.-+|.+. +.+|.... T Consensus 69 ~n~~~~Ivd~~~~yl~G~p~~~~~----~---~~~~~~~l~~~~~n-----~~~~~~~~~~~~~~~~G~~~-~~v~~d~~ 135 (471) T protein:vir:10 69 HNWHQLLLDQKKAYALTYPPTFDV----D---DKKVNDMIVDVLGD-----DYERISKQLCVNAGNAGIAW-LHVWKDAS 135 (471) T ss_pred cchhHHHHHhhhhhhcccCceecc----C---ChHHHHHHHHHHhc-----CHHHHHHHHHHHHhhCCeEE-EEEEeeCC Confidence 223333444444445555544432 2 23445566666542 3666666644 688899665 56665321 Q ss_pred cccccccceeeccccccCchhcccccceeecC--CCceeeecccccccc----------ccccc-cccccc-------c- Q lcl|NC_016071. 143 APSKYAGYITIDKIAFRPQSSLSRSKPWVFDE--DGRTLKGIYQSKMAF----------ANFQN-GLTQIS-------S- 201 (516) Q Consensus 143 ~~~~~~g~~~~~~l~~r~q~ti~~~~~f~~~~--dg~~l~~~~q~~~~~----------~~~~~-~~~~~~-------~- 201 (516) +|.+.+.-+.|+--. -.|++ +++.+..++...... ..|.. +..... . 
T Consensus 136 -----~g~~~~~~~~p~~~~-------~i~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~vy~~~~~~~y~~~~~~~~~~ 203 (471) T protein:vir:10 136 -----DNSFRYACVDSKEVI-------PIYSKSLDKKSIGVLRVYSSIDETDGKNYTVYEYWNDKECSFYRHEKEKPLEE 203 (471) T ss_pred -----CCeeEEEEEcccceE-------EEEcCCCCCceEEEEEEEEeeccCCCceeEEEEEEeCCcEEEEEecCCccccc Confidence 344444433332110 12222 222222222110000 00000 000000 0 Q ss_pred -----ccccccccccC---CCc--cccccccEEEEeecCcCCccccchhHHHHHHHHHHH-HHHHHHHHHHHhhccccce Q lcl|NC_016071. 202 -----AMSLVTNLTSS---ADE--VFIPINKLMVMSLGGTESNPAGVSPLVGCYRAFREK-ILIENLETIGASKDLGGII 270 (516) Q Consensus 202 -----~~~~~~~~~~~---~~~--~~iP~~k~i~~~~~~~~g~p~G~gLlr~~~~~~~fK-~~~~~~w~~~~er~g~~~~ 270 (516) .........+. ... ..+..--++.|+ +|..|.|.+..+- +.+-- +..+..++..++.+..|+. T Consensus 204 ~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~-----n~~~~~sd~e~v~-~liDa~d~~~S~~~~~~~~~~~~~l 277 (471) T protein:vir:10 204 LETFQAISLIDTMNGDRSSDNSFKHDFGLVPFIPFK-----NNEIETNDLKPIK-DLVDVYDKVFSGFVNDTDDVQEVIF 277 (471) T ss_pred ccccccccccccccccccccccccCCCCceeEEEec-----cCCCCCCchHHHH-HHHHHHHHHHHHHHHHHHHhhCcee Confidence 00000000000 000 011111123332 3456778887643 33322 3355667788888889988 Q ss_pred eeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHH Q lcl|NC_016071. 271 ELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRK 350 (516) Q Consensus 271 v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d 350 (516) ++++..+ .+ ..+... .+. + ...+.++.... .....++++..... ...+...++.+. 
T Consensus 278 v~~g~~~------~~----~~~~~~----~~~----~-~~~i~~~~~~~---~~~~~~~~l~~~~~--~~~~~~~~~~l~ 333 (471) T protein:vir:10 278 VLTNYGG------QD----KQEFLE----DLK----R-YKMIKMDNDGM---GDQSGVTTIAIDIP--TEARNLILERTK 333 (471) T ss_pred eeecCCc------cc----cchhHH----Hhh----c-CCeEEecCCCC---ccCccceEEeecCC--hHHHHHHHHHHH Confidence 8876321 11 111111 111 0 11222332110 11234666655443 234677889998 Q ss_pred HHHHHHHhcccccccCCccchhhHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCc Q lcl|NC_016071. 351 KAILDRFGAGFINLGNDGQGSYNLSESKQSIHGH----FVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLI 426 (516) Q Consensus 351 ~~Isk~iLGqtLts~~~~~GS~Al~~vh~ev~~~----~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~ 426 (516) +.|...--+..++.+..+.-| | +.-+.... .+..-.+.+.+.| +++++.++.+... .+..-..+.|... T Consensus 334 ~~I~~~s~tp~~~~~~~gn~S---g-~Alk~~~~~l~~k~~~~~~~~~~~l-~~~~~li~~~~~~--~d~~~i~i~f~~~ 406 (471) T protein:vir:10 334 KQIFISGQGVNPETDKLGNSS---G-VALKFLYSLLELKAGNMETQFRSGY-ATLVKMILKHLGL--SDKLKIKQTWTRN 406 (471) T ss_pred HHHHHHhCCcCCCcccccCcc---H-HHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhcc--CCCceeEEEeCCC Confidence 888776544444433222111 1 12222222 2333334555556 4566777665532 2223347889999 Q ss_pred CchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCC-CCCCCcccccCcccccCCCCCCcccccccccCCCCCccccc Q lcl|NC_016071. 427 QEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGF-DEEIPEDMSTDELLKLLGQDTSRSGDGMTAGSNGNGTGKIS 504 (516) Q Consensus 427 ~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gl-p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 504 (516) .+.|.++.++.+++|. |+ ++ ++.+.+.++. ..+..+-+-...+.....+..+. 
..+ +..+++.- T Consensus 407 ~p~n~~e~~~~~~kl~--g~-iS----~et~~~~~p~v~D~~~E~eri~~E~~~~~~~~~~-----~~~--~~~~~e~~ 471 (471) T protein:vir:10 407 SINNDTEMAQVVSTLA--TI-TS----RENVAKSNPIVEDWQDELRLQKAEQEGRSEKLYD-----MEE--VEHESEVE 471 (471) T ss_pred CCCCHHHHHHHHHHHh--cc-Cc----hHHHHHhCCCCCCHHHHHHHHHHHHHHHHhcccc-----cCC--CCCccccC Confidence 9999999999999984 54 32 3556666643 22211100010010000000000 000 00111110 No 203 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=83.97 E-value=0.064 Score=27.12 Aligned_cols=451 Identities=12% Similarity=-0.025 Sum_probs=159.2 Q ss_pred CCc-c-----ccCcccccchhhhcccCCC--Cccc-ccchHHHHHHHHHHHh---hcccccCCcccHHHHH--------- Q lcl|NC_016071. 1 MST-R-----FAQPSEVVKAGNENLAVSR--LRTG-ELGSGALSQLRAESEV---MKVEELRWPCFLATVE--------- 59 (516) Q Consensus 1 ~~~-r-----~~~~~~~~~~~~~~p~~~~--~~~~-e~g~~~~~~~~~~~~~---~~~~~lr~~~~~~~y~--------- 59 (516) |-+ | ..-..+.+..........- ...- ++-. ....+..++.. .+.+.++ +..+.|+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~-~~~~i~~~i~~~~~~~~~r~~--~l~~YY~g~~~i~~~~ 77 (512) T protein:vir:97 1 MLKANEFETDTDLRENRNYLFNDEANVVYTYDGTESDLLQ-NINEVSKYIEHHMDYQRPRLK--VLSDYYEGKTKNLVEL 77 (512) T ss_pred CccceeccCceeeeeCceeeeccccccccccCchhhhhhh-hHHHHHHHHHHHHHhhHHHHH--HHHHHhcccCcccccc Confidence 211 1 1111111111000000000 0000 0000 00111111110 1111110 0111110 Q ss_pred -----HHhh-----ChHHHHHHHHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHH-HHHh Q lcl|NC_016071. 60 -----AMKQ-----DHTVSTALDTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAA-TFNE 128 (516) Q Consensus 60 -----~m~~-----D~~v~s~l~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~l-da~~ 128 (516) +... .....-++......+.+-+..+++ + +.++-+++..+++.- .|.....++. 
++.- T Consensus 78 ~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl~g~p~~~~~----~---d~~~~~~l~~~~~~n----~~~~~~~~~~~~~~i 146 (512) T protein:vir:97 78 TRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQD----D---DKDVLEAIEAFNDLN----DVESHNRSLGLDLSI 146 (512) T ss_pred CcccccccCcceeecchHHHHHHHHhhhhcccCceecc----C---ChHHHHHHHHHHhhc----CHHHHHHHHHHHHHh Confidence 0001 122233444444444555444432 1 223456677776542 3656665544 6888 Q ss_pred hcceeeeEEEeecccccccccceeeccccccCchhcccccceeecCC--Cceeeeccccccccccc----cccccccccc Q lcl|NC_016071. 129 YGFSIFEKVYRTESAPSKYAGYITIDKIAFRPQSSLSRSKPWVFDED--GRTLKGIYQSKMAFANF----QNGLTQISSA 202 (516) Q Consensus 129 ~G~S~~Eivw~~~~~~~~~~g~~~~~~l~~r~q~ti~~~~~f~~~~d--g~~l~~~~q~~~~~~~~----~~~~~~~~~~ 202 (516) ||.+ ++++|.-. +|.+.+..+.|+.-. -.|+++ ++.+..++-........ ......+-.+ T Consensus 147 ~G~a-y~~vy~de------d~~~~i~~~~p~~~~-------~iyd~~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~ 212 (512) T protein:vir:97 147 YGKA-YELMIRNQ------DDETRLYKSDAMSTF-------VIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTS 212 (512) T ss_pred cCeE-EEEEEeCC------CCceEEEEEcccceE-------EEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeC Confidence 9975 56777533 344544443333211 123321 23333332111000000 0000000011 Q ss_pred cccccccccCCC----------ccccccc--cEEEEeecCcCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccce Q lcl|NC_016071. 203 MSLVTNLTSSAD----------EVFIPIN--KLMVMSLGGTESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGII 270 (516) Q Consensus 203 ~~~~~~~~~~~~----------~~~iP~~--k~i~~~~~~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~ 270 (516) ..+..+...... ..+-|.. -++.|+ .|+.|.|.+..+-..-=--...+..++..++.+..++. 
T Consensus 213 ~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----nn~~~~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l 287 (512) T protein:vir:97 213 HGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFS-----NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAML 287 (512) T ss_pred CcEEEEEecCCCcccccccccccccccCcccceEeec-----CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCcee Confidence 111111111111 0111111 123332 35778888887543222223355667777888888988 Q ss_pred eeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHH Q lcl|NC_016071. 271 ELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRK 350 (516) Q Consensus 271 v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d 350 (516) ++++... .+...-....-........ ....+.. ..+......+++++..+- ....+..+++.+. T Consensus 288 v~~G~~~------~~~~~~~~~~~~~~~~~~~----~~~~~~~----~~~~~~~~~d~~~l~~~~--~~~~~e~~~~~L~ 351 (512) T protein:vir:97 288 LIKGNLN------LDPVEVRKQKEANVLFLEP----TVYENRD----TGIETEGSVDGGYIYKQY--DVQGTEAYKDRLN 351 (512) T ss_pred eeecCcc------CCchhhhhhhhcccccccc----cchhhcc----cccCCCCCcceEEEeecC--CHHHHHHHHHHHH Confidence 8876421 1111100000000000000 0000000 000111223455554432 2234677889888 Q ss_pred HHHHHHHhcccccccCCccc-hh-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-c--CC-cCC-ccccceEEe Q lcl|NC_016071. 351 KAILDRFGAGFINLGNDGQG-SY-NLSESKQSIHGHFVQRDIDIIVEAFNKNLIPQLLAL-N--DI-RLS-DEDMPKLKP 423 (516) Q Consensus 351 ~~Isk~iLGqtLts~~~~~G-S~-Al~~vh~ev~~~~~~aDa~~i~~~ln~~li~~lv~l-N--~~-~~~-~~~~P~~~~ 423 (516) +.|.+.--...++.++-++. |. |+.- ...-.......-.+.+...|+ ++++.++.+ + .. 
..+ +..-..+.| T Consensus 352 ~~I~~~s~~p~~~~~~~~gn~Sg~Al~~-~~~~l~~ka~~k~~~f~~~l~-~~~~li~~~~~~~~~~~~~~d~~~i~~~f 429 (512) T protein:vir:97 352 SDIHMFTNTPNMKDDNFSGTQSGEAMKY-KLFGLEQRTKTKEGLFTKGLR-RRAKLLETILKNTRSIDANKDFNTVRYVY 429 (512) T ss_pred HHHHHHhCCcccCcccccccchHHHHHH-HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhcCCcccccccccceEEe Confidence 99887765555555433221 11 2111 111111122223334445553 344544443 1 11 111 111246788 Q ss_pred cCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCC-CCCCCcccccCcccc---cCCCCCCcccccccccCCCCC Q lcl|NC_016071. 424 GLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGF-DEEIPEDMSTDELLK---LLGQDTSRSGDGMTAGSNGNG 499 (516) Q Consensus 424 ~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gl-p~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~ 499 (516) ...-+.|..+.++++.+|+ |++. .+.+.+.++. +.+..+-+-...+.+ ..........++. .++ T Consensus 430 ~~~~p~~~~e~~~~~~kl~--giiS-----~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~~~~~~~~-----~~~ 497 (512) T protein:vir:97 430 NRNLPKSLIEELKAYIDSG--GKIS-----QTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRD-----IND 497 (512) T ss_pred CCCCCcCHHHHHHHHHHHh--ccCc-----hHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhcccCCCCC-----CCC Confidence 8888999999999999985 6533 3456666654 322111000000000 0000000001100 001 Q ss_pred cccccccccchhhhh Q lcl|NC_016071. 500 TGKISSTRDNSVSNM 514 (516) Q Consensus 500 ~~~~~~~~d~~~~~~ 514 (516) +........++.+-- T Consensus 498 ~~~~~~~~~~~~~~~ 512 (512) T protein:vir:97 498 DEQDDDTKDTVDKKE 512 (512) T ss_pred CCCCCCccccccccC Confidence 111111111111111 No 204 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=82.97 E-value=0.072 Score=26.84 Aligned_cols=437 Identities=11% Similarity=0.031 Sum_probs=157.2 Q ss_pred CCccccCcccccch-hhhcccC------CCCccc--ccchHHHHHHHHHHHhhcccccCCcccHH-HHHHHhhChHHHHH Q lcl|NC_016071. 
1 MSTRFAQPSEVVKA-GNENLAV------SRLRTG--ELGSGALSQLRAESEVMKVEELRWPCFLA-TVEAMKQDHTVSTA 70 (516) Q Consensus 1 ~~~r~~~~~~~~~~-~~~~p~~------~~~~~~--e~g~~~~~~~~~~~~~~~~~~lr~~~~~~-~y~~m~~D~~v~s~ 70 (516) |-+|++.--+-..+ -+...++ |++-++ ++.+ +..+..+- .-..+.+.....-. ...+-..---+... T Consensus 3 ~~~~ik~~~~~~~~~~~~~~~~~~i~d~~~i~~~~~~~~~--i~~~~~~Y-~g~~~~l~~~~~~~~~~~~~~~slnl~~~ 79 (505) T protein:vir:79 3 FWDTLKNLFRKGSAAVGMTKSLGQIIDDPRINLPADEVER--IARDKRYY-MDDFKQVTHKNSYGDTQKHELQSVNVTKL 79 (505) T ss_pred hHHHHHHHHHHhhhhhcchhhhhhhhcccCCCCCHHHHHH--HHHHHHHh-cCCCccccccccCCCccccceeecchHHH Confidence 66666643221110 0011111 122111 1111 12222221 11122221000000 00000000012222 Q ss_pred H-HHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHH-HHHHHhhcceeeeEEEeecccccccc Q lcl|NC_016071. 71 L-DTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARS-AATFNEYGFSIFEKVYRTESAPSKYA 148 (516) Q Consensus 71 l-~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~-~lda~~~G~S~~Eivw~~~~~~~~~~ 148 (516) + .+--..|.+-.-.|++. +.+..+++++.+++- .|...+.. +..|..+|=.++=+.|... T Consensus 80 i~~~~A~ll~~e~~~i~~~-------d~~~~e~l~~i~~~n----~f~~~~~~~~e~a~a~G~~~~k~~~D~~------- 141 (505) T protein:vir:79 80 ASAKLASLIFNEQCQVTVS-------DETANDFLDDVFQQN----DFYTTFEEKLEEWIALGSGCVRPYVDSG------- 141 (505) T ss_pred HHHHHHhhhcCCCceeecC-------ChHHHHHHHHHHHhc----cHHHHHHHHHHHHhhcCCeEEEEEEeCC------- Confidence 2 22223344433344432 245678888888643 25555554 4578889988887777532 Q ss_pred cceeeccccccCchhcccccceeecCCCc------------------eeeeccccccccccccccc--------cccccc Q lcl|NC_016071. 149 GYITIDKIAFRPQSSLSRSKPWVFDEDGR------------------TLKGIYQSKMAFANFQNGL--------TQISSA 202 (516) Q Consensus 149 g~~~~~~l~~r~q~ti~~~~~f~~~~dg~------------------~l~~~~q~~~~~~~~~~~~--------~~~~~~ 202 (516) .+.+..+ ++..+-+ ..++.++. ..+.++-|......+.... 
...+.+ T Consensus 142 -~~~i~~v---~ad~~~P---~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~I~n~ly~~~~~~~lG~~ 214 (505) T protein:vir:79 142 -KIKLAWA---TADQVYP---LQADTNQVNELAIASRTTEVENHRTIYYTLLEFHQWDHGDYVITNELYRSEAAETVGIN 214 (505) T ss_pred -ceEEEEE---cCCeeEE---EEEcCCCeEEEEEEEEEEEecCCcceEEEEEEEEEecCceEEEEEEEEecCCCCccCcc Confidence 1211111 1111100 01121111 0001111100000000000 000000 Q ss_pred cc--cccccccCCCccc---cccccEEEEee----cCcCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeee Q lcl|NC_016071. 203 MS--LVTNLTSSADEVF---IPINKLMVMSL----GGTESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELK 273 (516) Q Consensus 203 ~~--~~~~~~~~~~~~~---iP~~k~i~~~~----~~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~ 273 (516) +. -+....+..+.+. ++..-|.+++. ....++|+|.|.+..|--..-.=+..+.-|+.-++ . +=.-++ T Consensus 215 v~l~~~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~-~--g~~~i~ 291 (505) T protein:vir:79 215 VPLNSLEQYEGLEPQVKITGLKHPLFAFYRNKGANNKNFTSPMGMSLIDNSYTVIDAINRTHDQFVDEVK-K--GQRRLI 291 (505) T ss_pred cchhhcccccccCcceeecCCCcceEEEecCCcccccccCCccCCchhhhhHHHHHHHHHHHHHHHHHHH-h--ccccee Confidence 00 0000001111111 22223444432 23557899999999886433222222222322222 1 112245 Q ss_pred ecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHH Q lcl|NC_016071. 274 IPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAI 353 (516) Q Consensus 274 ~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~I 353 (516) .|+..+.....+.......... + +......+..+..+.. + ..++... 
..-....|.+.++.+=++| T Consensus 292 v~~~~l~~~~~~~~~~~~~~~~-~------fd~~~~~y~~~~~~~~----~-~~i~~~~--~~ir~e~~~~~l~~~l~~i 357 (505) T protein:vir:79 292 VPAEWLKTGSSYGGQASETHPP-M------FDPDETVYQAMYGDAS----E-VGFHDAT--SPIRVADYQATMDFFLREF 357 (505) T ss_pred echHHhcccCCCCccccccccc-C------CCccceeeeeccCCCC----C-CceEEec--ccCCHHHHHHHHHHHHHHH Confidence 5555544333222111000000 0 0000011111110000 0 0121111 1111122444444444555 Q ss_pred HHHH-hc-ccccccCCccchhhHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHhc---CCcCC---------ccc Q lcl|NC_016071. 354 LDRF-GA-GFINLGNDGQGSYNLSESKQ--SIHGHFVQRDIDIIVEAFNKNLIPQLLALN---DIRLS---------DED 417 (516) Q Consensus 354 sk~i-LG-qtLts~~~~~GS~Al~~vh~--ev~~~~~~aDa~~i~~~ln~~li~~lv~lN---~~~~~---------~~~ 417 (516) +... ++ +|++. ++.|....-++.. .-...-+..-.+.+..+| ++|++-++.+- +.+.. ... T Consensus 358 ~~~~g~s~~~~~~--~~~~~~TAtei~s~~~~l~~t~~~~~~~~~~al-~~li~~i~~~~~~~~~~~~g~~~~~~~~~~~ 434 (505) T protein:vir:79 358 ENQTGLSQGTFTT--SPSGIQTATEVVTNNSQTYQTRSSYITQVEKTI-KALTYAILELASVPSFYADGQARWTGDVDSL 434 (505) T ss_pred HHHhCCChhhcCC--CccccchHHHHHHHHhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhcccccccccccCCCCce Confidence 4433 33 33333 2223221122221 112222334445566666 45777666432 11100 011 Q ss_pred cceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccccCc---ccccCCCCCCcccc Q lcl|NC_016071. 418 MPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMSTDE---LLKLLGQDTSRSGD 489 (516) Q Consensus 418 ~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~~~---~~~~~~~~~~~~~~ 489 (516) -+.|.|+..-..|.++.++.+.+++..|++.. +.++.+.+|+++.+-+++.... 
.+...|+.....|+ T Consensus 435 ~i~v~f~d~i~~d~~~~~~~~~~~v~~Gi~s~----e~~l~~~~~~~eeea~~el~ri~~E~~~~~p~~~~~gg~ 505 (505) T protein:vir:79 435 DITINFNDGVFVDQESKRAADLQAVQAQVMPK----KQFLMRNYGLDEEEADEWLAQIDAENSTAEPEFNQFGGD 505 (505) T ss_pred eEEEEeCCCCCCCHHHHHHHHHHHHHcCCCCH----HHHHHhcCCCChHHHHHHHHHHHHhccccCCCchhccCC Confidence 24577888778888888889999999998664 5788999999753222221111 11112222222222 No 205 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=82.61 E-value=0.075 Score=26.74 Aligned_cols=419 Identities=13% Similarity=0.057 Sum_probs=158.5 Q ss_pred CCccccCcccccchhhh-cccCCCCcccccchHHHHHHHHHHHhh--cccccCCcccHHHHH---HH------------- Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNE-NLAVSRLRTGELGSGALSQLRAESEVM--KVEELRWPCFLATVE---AM------------- 61 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~-~p~~~~~~~~e~g~~~~~~~~~~~~~~--~~~~lr~~~~~~~y~---~m------------- 61 (516) |..-...+ .++...+ .... +.....+--..+..++... +.+.+ -+..+.|+ ++ T Consensus 1 ~~~~~~~~--~~~~~~~~~~~~----~~~~~~~~~~~i~~~i~~~~~~~~~~--~~l~~Yy~g~~~i~~~~~~~~~~~~~ 72 (474) T protein:vir:95 1 MINIIRMP--WDKPYGEEVVEQ----MKPKVETQEEMIIRLINNHKQKLKDI--NVGQKYYDKDNDINYQAYKQDLHGNI 72 (474) T ss_pred CcccccCC--CCCCCCcchhhh----ccccccchHHHHHHHHHHHHHHHHHH--HHHHHHhcccCccccccchhhhcccc Confidence 44332221 2221111 1110 0111111111122222111 11110 01111111 00 Q ss_pred ---hhC-----hHHHHHHHHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHH-HHHHHhhcce Q lcl|NC_016071. 62 ---KQD-----HTVSTALDTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARS-AATFNEYGFS 132 (516) Q Consensus 62 ---~~D-----~~v~s~l~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~-~lda~~~G~S 132 (516) ..| ....-++......+.+-+..+.+ .+.++.+++.+++++ .|.+.+.. 
+.++.-||.+ T Consensus 73 ~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~-------~~~~~~~~l~~~~~n-----~~~~~~~~l~~~~~~~G~~ 140 (474) T protein:vir:95 73 DYTKPDWRITTNFHQNLVDQKVSYVAGKPVTYAH-------DDDKVLDVIHQVLDT-----RWDNKLIDILTAASNKGID 140 (474) T ss_pred cccccccccccchHHHHHHhhhhhhcccCceecc-------CChHHHHHHHHHHhc-----cHHHHHHHHHHHHhhCCeE Confidence 001 12223333333444454444432 223445666666642 25555554 4468889996 Q ss_pred eeeEEEeecccccccccceeeccccccCchhcccccceeecC--CCceeeeccccccccccccccccccccccccccccc Q lcl|NC_016071. 133 IFEKVYRTESAPSKYAGYITIDKIAFRPQSSLSRSKPWVFDE--DGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLT 210 (516) Q Consensus 133 ~~Eivw~~~~~~~~~~g~~~~~~l~~r~q~ti~~~~~f~~~~--dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~ 210 (516) ++++|... +|.+.+..+.|+. +. -.|++ .+..+..++........ ...+..+.....+.. T Consensus 141 -~~~~~~d~------~~~~~i~~~~p~~---~~----~v~d~~~~~~~~a~ir~~~~~~~~----~~~vy~~~~i~~~~~ 202 (474) T protein:vir:95 141 -WLQVYINE------DGELKLFRVPAEQ---AI----PIWTDKEREQLNAFIRIFTFNGET----KVEYWTAETVTYYVY 202 (474) T ss_pred -EEEeeeCC------CCceEEEEEcccc---eE----EEEcCCCCCceEEEEEEEeecCee----EEEEEeCCeEEEEEE Confidence 46777543 3444433333221 10 12221 22333332211110000 000000000000000 Q ss_pred cC---------------CCcccccccc--EEEEeecCcCCccccchhHHHHHHHHHHH-HHHHHHHHHHHhhccccceee Q lcl|NC_016071. 211 SS---------------ADEVFIPINK--LMVMSLGGTESNPAGVSPLVGCYRAFREK-ILIENLETIGASKDLGGIIEL 272 (516) Q Consensus 211 ~~---------------~~~~~iP~~k--~i~~~~~~~~g~p~G~gLlr~~~~~~~fK-~~~~~~w~~~~er~g~~~~v~ 272 (516) .. .....-+..+ ++.| .+|+.|.|.+.. 
+.+.+-- ...+..++..++.+..|++++ T Consensus 203 ~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~-----~nn~~~~~d~e~-v~~liDa~d~~~S~~~~~~~~~~~p~lv~ 276 (474) T protein:vir:95 203 ENGGLIPDFYYGDEHIQTHFSTGSWERVPFIAF-----KNNPEEVSDIWM-YKSFVDAIDKRLSDVQNMFDESVELIYIL 276 (474) T ss_pred cCCceeeccccccccccCcccccCCCccceEEe-----cCCCCCCCchHH-HHHHHHHHHHHHHHHHHHHHHhhcchhhh Confidence 00 0000011111 2222 246788998877 4444433 446677788888888888877 Q ss_pred eecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHH Q lcl|NC_016071. 273 KIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKA 352 (516) Q Consensus 273 ~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~ 352 (516) ++.-+ .+.+ .....+. ....+.++.+- .++++..+.. ...+..+++.+.+. T Consensus 277 ~g~~~------~~~~----~~~~~~~---------~~~~i~~~~~~--------~~~~l~~~~~--~~~~~~~~~~l~~~ 327 (474) T protein:vir:95 277 RGYEG------EDLS----EFMEGLK---------YYKAINVSSDG--------GVETIQVEVP--VASTKEYLDMMRAY 327 (474) T ss_pred cCCCc------cccc----chhhhhh---------ccceeeccCCC--------ceeEEeccCC--HHHHHHHHHHHHHH Confidence 75311 1111 0111111 11123345553 3445444332 23467788888888 Q ss_pred HHHHHhcccccccCCccchhhHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCc Q lcl|NC_016071. 353 ILDRFGAGFINLGNDGQGSYNLSESKQSIHGHFV----QRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQE 428 (516) Q Consensus 353 Isk~iLGqtLts~~~~~GS~Al~~vh~ev~~~~~----~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~ 428 (516) |...--...++.++.+ | . ++.+......... 
..-.+.+...| +++++.++.+.+..+ +..-..+.|...-+ T Consensus 328 I~~~s~~p~~~~~~~~-~-n-~Sg~Alk~~~~~l~~k~~~~~~~~~~~l-~~~~~~i~~~~g~~~-d~~~i~i~f~~~~p 402 (474) T protein:vir:95 328 IVEFGQGVDFQTDKFG-S-A-TSGIALKFLYTNLNLKANKLKNKANVAL-QELMQFILDFNKIKL-DAKEIEITFNFNVM 402 (474) T ss_pred HHHHhCCcCccccccc-c-c-cHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhCCCc-ccceeeEEecCCCc Confidence 8766655444443322 2 1 1222222211112 22223444555 346666776653222 22334678888888 Q ss_pred hhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCC-CCCCcccccCcccc-cCCCCCCcccccccc-cCCCCCcccccc Q lcl|NC_016071. 429 VDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFD-EEIPEDMSTDELLK-LLGQDTSRSGDGMTA-GSNGNGTGKISS 505 (516) Q Consensus 429 ~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp-~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~-~~~~~~~~~~~~ 505 (516) .|..+.++.+ +..|++. ++.+.+.++.- .+..+-+-...+.. ...+.....+.+... ...++++++.++ T Consensus 403 ~~~~e~a~~~---~~~giiS-----~et~~~~lp~v~D~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~ 474 (474) T protein:vir:95 403 VNDLEQSQIG---AQSQYLS-----KETLVRHHPWVDDPKAELERLDEEQLELNKQLPNLDDGGADGAQQQQQSENNQSK 474 (474) T ss_pred cCHHHHHHHH---HHcCCCC-----hHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhccccccccCCCCCCcCCCCccccC Confidence 8877766654 4467643 34566777543 22111000010100 000111111111110 011112222211 No 206 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=82.61 E-value=0.075 Score=26.74 Aligned_cols=419 Identities=13% Similarity=0.057 Sum_probs=158.5 Q ss_pred CCccccCcccccchhhh-cccCCCCcccccchHHHHHHHHHHHhh--cccccCCcccHHHHH---HH------------- Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNE-NLAVSRLRTGELGSGALSQLRAESEVM--KVEELRWPCFLATVE---AM------------- 61 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~-~p~~~~~~~~e~g~~~~~~~~~~~~~~--~~~~lr~~~~~~~y~---~m------------- 61 (516) |..-...+ .++...+ .... +.....+--..+..++... 
+.+.+ -+..+.|+ ++ T Consensus 1 ~~~~~~~~--~~~~~~~~~~~~----~~~~~~~~~~~i~~~i~~~~~~~~~~--~~l~~Yy~g~~~i~~~~~~~~~~~~~ 72 (474) T protein:vir:96 1 MINIIRMP--WDKPYGEEVVEQ----MKPKVETQEEMIIRLINNHKQKLKDI--NVGQKYYDKDNDINYQAYKQDLHGNI 72 (474) T ss_pred CcccccCC--CCCCCCcchhhh----ccccccchHHHHHHHHHHHHHHHHHH--HHHHHHhcccCccccccchhhhcccc Confidence 44332221 2221111 1110 0111111111122222111 11110 01111111 00 Q ss_pred ---hhC-----hHHHHHHHHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHH-HHHHHhhcce Q lcl|NC_016071. 62 ---KQD-----HTVSTALDTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARS-AATFNEYGFS 132 (516) Q Consensus 62 ---~~D-----~~v~s~l~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~-~lda~~~G~S 132 (516) ..| ....-++......+.+-+..+.+ .+.++.+++.+++++ .|.+.+.. +.++.-||.+ T Consensus 73 ~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~-------~~~~~~~~l~~~~~n-----~~~~~~~~l~~~~~~~G~~ 140 (474) T protein:vir:96 73 DYTKPDWRITTNFHQNLVDQKVSYVAGKPVTYAH-------DDDKVLDVIHQVLDT-----RWDNKLIDILTAASNKGID 140 (474) T ss_pred cccccccccccchHHHHHHhhhhhhcccCceecc-------CChHHHHHHHHHHhc-----cHHHHHHHHHHHHhhCCeE Confidence 001 12223333333444454444432 223445666666642 25555554 4468889996 Q ss_pred eeeEEEeecccccccccceeeccccccCchhcccccceeecC--CCceeeeccccccccccccccccccccccccccccc Q lcl|NC_016071. 133 IFEKVYRTESAPSKYAGYITIDKIAFRPQSSLSRSKPWVFDE--DGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLT 210 (516) Q Consensus 133 ~~Eivw~~~~~~~~~~g~~~~~~l~~r~q~ti~~~~~f~~~~--dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~ 210 (516) ++++|... +|.+.+..+.|+. +. -.|++ .+..+..++........ ...+..+.....+.. 
T Consensus 141 -~~~~~~d~------~~~~~i~~~~p~~---~~----~v~d~~~~~~~~a~ir~~~~~~~~----~~~vy~~~~i~~~~~ 202 (474) T protein:vir:96 141 -WLQVYINE------DGELKLFRVPAEQ---AI----PIWTDKEREQLNAFIRIFTFNGET----KVEYWTAETVTYYVY 202 (474) T ss_pred -EEEeeeCC------CCceEEEEEcccc---eE----EEEcCCCCCceEEEEEEEeecCee----EEEEEeCCeEEEEEE Confidence 46777543 3444433333221 10 12221 22333332211110000 000000000000000 Q ss_pred cC---------------CCcccccccc--EEEEeecCcCCccccchhHHHHHHHHHHH-HHHHHHHHHHHhhccccceee Q lcl|NC_016071. 211 SS---------------ADEVFIPINK--LMVMSLGGTESNPAGVSPLVGCYRAFREK-ILIENLETIGASKDLGGIIEL 272 (516) Q Consensus 211 ~~---------------~~~~~iP~~k--~i~~~~~~~~g~p~G~gLlr~~~~~~~fK-~~~~~~w~~~~er~g~~~~v~ 272 (516) .. .....-+..+ ++.| .+|+.|.|.+.. +.+.+-- ...+..++..++.+..|++++ T Consensus 203 ~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~-----~nn~~~~~d~e~-v~~liDa~d~~~S~~~~~~~~~~~p~lv~ 276 (474) T protein:vir:96 203 ENGGLIPDFYYGDEHIQTHFSTGSWERVPFIAF-----KNNPEEVSDIWM-YKSFVDAIDKRLSDVQNMFDESVELIYIL 276 (474) T ss_pred cCCceeeccccccccccCcccccCCCccceEEe-----cCCCCCCCchHH-HHHHHHHHHHHHHHHHHHHHHhhcchhhh Confidence 00 0000011111 2222 246788998877 4444433 446677788888888888877 Q ss_pred eecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHH Q lcl|NC_016071. 273 KIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKA 352 (516) Q Consensus 273 ~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~ 352 (516) ++.-+ .+.+ .....+. ....+.++.+- .++++..+.. ...+..+++.+.+. T Consensus 277 ~g~~~------~~~~----~~~~~~~---------~~~~i~~~~~~--------~~~~l~~~~~--~~~~~~~~~~l~~~ 327 (474) T protein:vir:96 277 RGYEG------EDLS----EFMEGLK---------YYKAINVSSDG--------GVETIQVEVP--VASTKEYLDMMRAY 327 (474) T ss_pred cCCCc------cccc----chhhhhh---------ccceeeccCCC--------ceeEEeccCC--HHHHHHHHHHHHHH Confidence 75311 1111 0111111 11123345553 3445444332 23467788888888 Q ss_pred HHHHHhcccccccCCccchhhHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCc Q lcl|NC_016071. 
353 ILDRFGAGFINLGNDGQGSYNLSESKQSIHGHFV----QRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQE 428 (516) Q Consensus 353 Isk~iLGqtLts~~~~~GS~Al~~vh~ev~~~~~----~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~ 428 (516) |...--...++.++.+ | . ++.+......... ..-.+.+...| +++++.++.+.+..+ +..-..+.|...-+ T Consensus 328 I~~~s~~p~~~~~~~~-~-n-~Sg~Alk~~~~~l~~k~~~~~~~~~~~l-~~~~~~i~~~~g~~~-d~~~i~i~f~~~~p 402 (474) T protein:vir:96 328 IVEFGQGVDFQTDKFG-S-A-TSGIALKFLYTNLNLKANKLKNKANVAL-QELMQFILDFNKIKL-DAKEIEITFNFNVM 402 (474) T ss_pred HHHHhCCcCccccccc-c-c-cHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhCCCc-ccceeeEEecCCCc Confidence 8766655444443322 2 1 1222222211112 22223444555 346666776653222 22334678888888 Q ss_pred hhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCC-CCCCcccccCcccc-cCCCCCCcccccccc-cCCCCCcccccc Q lcl|NC_016071. 429 VDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFD-EEIPEDMSTDELLK-LLGQDTSRSGDGMTA-GSNGNGTGKISS 505 (516) Q Consensus 429 ~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp-~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~-~~~~~~~~~~~~ 505 (516) .|..+.++.+ +..|++. ++.+.+.++.- .+..+-+-...+.. ...+.....+.+... ...++++++.++ T Consensus 403 ~~~~e~a~~~---~~~giiS-----~et~~~~lp~v~D~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~ 474 (474) T protein:vir:96 403 VNDLEQSQIG---AQSQYLS-----KETLVRHHPWVDDPKAELERLDEEQLELNKQLPNLDDGGADGAQQQQQSENNQSK 474 (474) T ss_pred cCHHHHHHHH---HHcCCCC-----hHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhccccccccCCCCCCcCCCCccccC Confidence 8877766654 4467643 34566777543 22111000010100 000111111111110 011112222211 No 207 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=82.58 E-value=0.076 Score=26.73 Aligned_cols=442 Identities=13% Similarity=0.063 Sum_probs=154.3 Q ss_pred CC--------ccccCc--ccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHh-hChHHHH Q lcl|NC_016071. 
1 MS--------TRFAQP--SEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMK-QDHTVST 69 (516) Q Consensus 1 ~~--------~r~~~~--~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~-~D~~v~s 69 (516) |. +.++.. .-+.+.-..+ ...+.+ +.....+-.. ..+-+..+..-+....-+ ...+..- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~i~~~i~~~-------~~~~~~--~~~l~~Yy~g-~~~i~~~~~~~~~~~~~ki~~n~~~~ 70 (499) T protein:vir:10 1 MAVVIDKDLLDDVNEPNIEAINYAIREL-------QNRKKR--LDKLSDYYNG-KQEIEKHEFDNATVEAANVMVNHAKY 70 (499) T ss_pred CccchhhhHHhhhhcCCHHHHHHHHHHH-------HHHHHH--HHHHHHHhcc-ccchhcCCcCcCCCCcceeecchHHH Confidence 10 000000 0000000000 000000 0000001000 000011110000000000 1345555 Q ss_pred HHHHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHH-HHHHHhhcceeeeEEEeeccccccc- Q lcl|NC_016071. 70 ALDTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARS-AATFNEYGFSIFEKVYRTESAPSKY- 147 (516) Q Consensus 70 ~l~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~-~lda~~~G~S~~Eivw~~~~~~~~~- 147 (516) ++.+....+.+-+..+.+. + .+..+.+.+++++. .|...+.+ ..++.-||.+ ++++|....+.... T Consensus 71 Iv~~~~~~l~g~p~~~~~~----~---~~~~~~l~~~~~~n----~~~~~~~~~~~~~~~~G~~-~~~v~~~~~g~~~~~ 138 (499) T protein:vir:10 71 ITDMNVGFMTGNPVKYVAE----K---GKNIDDILEVFNQI----DIHKHDIELEKDLSVFGYG-YELLYLKKTDPISVR 138 (499) T ss_pred HHHHHhhhhcccCceeecC----C---hhHHHHHHHHHhhc----CHhHHHHHHHHHHHhcCce-EEEEEeccccccccc Confidence 6666556666666555432 1 23344455555442 25554444 4578889975 56777655432110 Q ss_pred ----------ccceeeccccccCchhcccccceeecCCC--ceeeeccccccc--ccccccccccccccccccccccc-- Q lcl|NC_016071. 148 ----------AGYITIDKIAFRPQSSLSRSKPWVFDEDG--RTLKGIYQSKMA--FANFQNGLTQISSAMSLVTNLTS-- 211 (516) Q Consensus 148 ----------~g~~~~~~l~~r~q~ti~~~~~f~~~~dg--~~l~~~~q~~~~--~~~~~~~~~~~~~~~~~~~~~~~-- 211 (516) ...+.+..+.|+.-. ..|++.. ..+..++-.... .......+..+-.+..+..+... 
T Consensus 139 ~~~~~~~~~~~~~~~~~~v~p~~~~-------~v~~d~~~~~~~~~i~~~~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~ 211 (499) T protein:vir:10 139 DELGNEKLTPNTELKIEVIDPRATV-------VVCDDTVEHDPLFAVFTQEKKDLEGNTNGYSITVYMPQRIVEYRTKTT 211 (499) T ss_pred ccccccccccccceEEEEEcccceE-------EEecCCCCcceEEEEEEEEEeecCCCceEEEEEEEeCCeEEEEEecCC Confidence 001112222222110 1111111 111111100000 00000000000000000000000 Q ss_pred ----------CCCccccccccEEEEeecCcCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccc Q lcl|NC_016071. 212 ----------SADEVFIPINKLMVMSLGGTESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNK 281 (516) Q Consensus 212 ----------~~~~~~iP~~k~i~~~~~~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k 281 (516) ......+..--+|.|+ +|+.|.|.+..+-..-=-=...+..++..++.+..|..++++... T Consensus 212 ~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~~~~d~e~v~~liD~~~~~~S~~~~~~~~~~~~~lv~~G~~~---- 282 (499) T protein:vir:10 212 MEVSANDPIVYDGENLFGAVPIIEFR-----NNEERQGDFEQLISLIDAYNLLQTDRISDKEAFVDALLVTFGFGL---- 282 (499) T ss_pred ccccCcceecccccCCCCccceEEec-----CCCCCCCchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCcc---- Confidence 0000011111234343 356788888765432222233556677788888888888876321 Q ss_pred ccCCCCHHHHHHHHHHHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhccc Q lcl|NC_016071. 282 AAIDPKSPESEMVQGLMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGF 361 (516) Q Consensus 282 ~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqt 361 (516) +. +......+.. + ....++.+ +..+++++..... ...+...++.+.+.|...--... 
T Consensus 283 ----~~--~~~~~~~~~~-------~--~~~~~~~~------~~~d~~~l~~~~~--~~~~~~~~~~l~~~I~~~s~~p~ 339 (499) T protein:vir:10 283 ----GD--DKDDIQRLKR-------G--AIEAPPRE------EGADIEWLTKSFD--ETQVNLLSQSIENDIHKISYVPN 339 (499) T ss_pred ----cc--ccchhhhhhh-------c--ceeccCCC------CCCcceEEeccCC--HHHHHHHHHHHHHHHHHHhCccc Confidence 11 1111111110 1 11122111 1234555544332 23477889999999877543333 Q ss_pred ccccCCccchhhHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHhcCCcC--CccccceEEecCcCchhHHHHH Q lcl|NC_016071. 362 INLGNDGQGSYNLSESKQSIH----GHFVQRDIDIIVEAFNKNLIPQLLALNDIRL--SDEDMPKLKPGLIQEVDMEGFS 435 (516) Q Consensus 362 Lts~~~~~GS~Al~~vh~ev~----~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~--~~~~~P~~~~~~~~~~dl~~~a 435 (516) ++.+.-+ | .++.+.-+.. ...+..-.+.+...++ ++++.++.+-...+ .+..-..+.|...-+.|..+.+ T Consensus 340 ~~~~~~~-g--n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~-~~~~li~~~~~~~~~~~d~~~i~i~f~~~~p~n~~e~~ 415 (499) T protein:vir:10 340 MNDEKFM-G--NVSGEAMKFKLFGLENLLSIKQRYFFDGLR-RRLKLIQTIVNIKGANDDASGCKISLVANIPSNLSDVV 415 (499) T ss_pred CCchhhc-c--cchHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhccCCccccccceEEeCCCCCCCHHHHH Confidence 3433221 1 1122222221 2223333445555663 46666666421111 1122357889998999999999 Q ss_pred HHHHHHHhCCcccccHHHHHHHHHHcCC-CCCCCcccccCcc-cc--cCCCCCCcccccccccCCCCCcccccccccch- Q lcl|NC_016071. 436 KFVQRIGAVGYLPKTPTVINKILEVGGF-DEEIPEDMSTDEL-LK--LLGQDTSRSGDGMTAGSNGNGTGKISSTRDNS- 510 (516) Q Consensus 436 ~~~~~L~~~G~~~~~~~~~~~i~e~~Gl-p~~~~~~~~~~~~-~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~- 510 (516) +.+++| .|.+. .+.+.+.++. +.+..+-+-...+ +. ...+.......+.... . ++....++..+.+ T Consensus 416 ~~~~kl--~g~iS-----~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~-~-~~~~~~~~~~~~~~ 486 (499) T protein:vir:10 416 NNVKNA--DGIIP-----RKYTYSWLPDVDNPQDVIDEMNQQDAETIKKNQEALRGQDPDRLE-L-EDKQDDSSENDKEA 486 (499) T ss_pred HHHHHH--hccCC-----hHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCC-C-CCCCcccCCCCCCC Confidence 999998 45432 2445555543 2221110000000 00 0000000000000000 0 0000000000000 Q ss_pred -hhhhcC Q lcl|NC_016071. 
511 -VSNMDN 516 (516) Q Consensus 511 -~~~~~~ 516 (516) ++|+.= T Consensus 487 ~~~~~~~ 493 (499) T protein:vir:10 487 GSNHNQS 493 (499) T ss_pred ccccccC Confidence 111100 No 208 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=81.68 E-value=0.084 Score=26.49 Aligned_cols=445 Identities=11% Similarity=-0.028 Sum_probs=165.6 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHh---hcccccCCcccHHHHH--------------HHh- Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEV---MKVEELRWPCFLATVE--------------AMK- 62 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~---~~~~~lr~~~~~~~y~--------------~m~- 62 (516) ...|+..-++..-.....-....... ..+..++.. ...+.+ .+..+.|+ +.. T Consensus 17 ~~~~~~~~~n~~~~~~~~e~~~~~~~--------~~i~~~i~~~~~~~~~r~--~~l~~Yy~g~~~i~~~~~~~~~~~~~ 86 (511) T protein:vir:96 17 INYLFNDEANVVYTYDGTESDLLQNV--------NEVSKYIEHHMDYQRPRL--KVLSDYYEGKTKNLVELTRRKEEYMA 86 (511) T ss_pred hhhhhhhhhCCccccchhhhhhhccH--------HHHHHHHHHHHHhhHHHH--HHHHHHhcccCccccccCcCcccccC Confidence 33333333332221110000000000 011111110 001100 00111110 000 Q ss_pred ----hChHHHHHHHHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHH-HHHhhcceeeeEE Q lcl|NC_016071. 63 ----QDHTVSTALDTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAA-TFNEYGFSIFEKV 137 (516) Q Consensus 63 ----~D~~v~s~l~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~l-da~~~G~S~~Eiv 137 (516) ......-++.+....+.+-+..+++. +.+..+++..+++.- .|..+..++. ++.-||. 
+++.+ T Consensus 87 ~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~-------~~~~~~~l~~~~~~n----~~~~~~~~~~~~~~i~G~-a~~~v 154 (511) T protein:vir:96 87 DNRVAHDYASYISDFINGYFLGNPIQYQDD-------DKDVLEAIEAFNDLN----DVESHNRSLGLDLSIYGK-AYELM 154 (511) T ss_pred cceeecchHHHHHHHHHhhhccCCceeecC-------chHHHHHHHHHHhhc----CHHHHHHHHHHHHHhcCe-eEEEE Confidence 12333444555555555665555431 123456677777542 2556665544 6888997 56788 Q ss_pred EeecccccccccceeeccccccCchhcccccceeecCC--Cceeeeccccccccccccc----ccccccccccccccccc Q lcl|NC_016071. 138 YRTESAPSKYAGYITIDKIAFRPQSSLSRSKPWVFDED--GRTLKGIYQSKMAFANFQN----GLTQISSAMSLVTNLTS 211 (516) Q Consensus 138 w~~~~~~~~~~g~~~~~~l~~r~q~ti~~~~~f~~~~d--g~~l~~~~q~~~~~~~~~~----~~~~~~~~~~~~~~~~~ 211 (516) |.-. +|.+.+..+.|+. . ...|++. ++.+..++........... ....+-.+..+..+... T Consensus 155 y~de------d~~~~i~~~~p~~---~----~~vydd~~~~~~~~~vr~~~~~~~d~~~~~~~~~~~iyt~~~i~~~~~~ 221 (511) T protein:vir:96 155 IRNQ------DDETRLYKSDAMS---T----FVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTS 221 (511) T ss_pred EeCC------CCceEEEEEccce---e----EEEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEec Confidence 8643 3444444333221 1 1123322 3334333321110000000 00001111111111111 Q ss_pred CCC----------ccccccc--cEEEEeecCcCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccc Q lcl|NC_016071. 212 SAD----------EVFIPIN--KLMVMSLGGTESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQIL 279 (516) Q Consensus 212 ~~~----------~~~iP~~--k~i~~~~~~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~ 279 (516) ... ..+-|.. -++.|+ .|..|.|.+..+-..-=--...+..++..++.+..+++++++.... 
T Consensus 222 ~~~~~~~~~~~~~~~~~~~~~vPvv~~~-----nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~- 295 (511) T protein:vir:96 222 RTNGLKLTPRENGFESHSFERMPITEFS-----NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNL- 295 (511) T ss_pred CCCcccccccccccccccCCceeeEEec-----CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCccC- Confidence 111 1111111 133333 3567888888764433223445666777888888998888874321 Q ss_pred ccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhc Q lcl|NC_016071. 280 NKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGA 359 (516) Q Consensus 280 ~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLG 359 (516) +.....+..-..+..+.. ...+.+.+. ......+++++..... ...+...++.+.+.|.+.--. T Consensus 296 -----~~~~~~~~~~~~~~~~~~-------~~~~~~~~~--~~~~~~~~~~l~~~~~--~~~~e~~~~~L~~~I~~~s~~ 359 (511) T protein:vir:96 296 -----DPVEVRKQKEANVLFLEP-------TVYADSEGR--ETEGSVDGGYIYKQYD--VQGTEAYKDRLNSDIHMFTNT 359 (511) T ss_pred -----Cchhhcccccccceeccc-------ccccccccc--cCCCCcceeEEeecCC--HHHHHHHHHHHHHHHHHHhCC Confidence 111000000000000000 000000010 1112234555544322 234677888888888776666 Q ss_pred ccccccCCccc-hh-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---cCCc-CC-ccccceEEecCcCchhHH Q lcl|NC_016071. 360 GFINLGNDGQG-SY-NLSESKQSIHGHFVQRDIDIIVEAFNKNLIPQLLAL---NDIR-LS-DEDMPKLKPGLIQEVDME 432 (516) Q Consensus 360 qtLts~~~~~G-S~-Al~~vh~ev~~~~~~aDa~~i~~~ln~~li~~lv~l---N~~~-~~-~~~~P~~~~~~~~~~dl~ 432 (516) ..++.++-++. |. |+. ....-....+..-.+.+...|++ +++.++.+ +... .+ +..-..+.|...-+.|.. 
T Consensus 360 p~~~~~~~~~n~Sg~Al~-~~~~~l~~k~~~k~~~~~~~l~~-~~~li~~~~~~~~~~~~~~d~~~i~~~f~~~~p~n~~ 437 (511) T protein:vir:96 360 PNMKDDNFSGTQSGEAMK-YKLFGLEQRTKTKEGLFTKGLRR-RAKLLETILKNTWSIDANKDFNTVRYVYNRNLPKSLI 437 (511) T ss_pred cccccccccccchHHHHH-HHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhhcCcccccccccceEEeCCCCCCCHH Confidence 55555432211 21 211 11111112222233444555543 44444443 2111 11 112357889888899999 Q ss_pred HHHHHHHHHHhCCcccccHHHHHHHHHHcCC-CCCCCcccccCcccccC-CCCCCcccccccccCCCCCcccccccccch Q lcl|NC_016071. 433 GFSKFVQRIGAVGYLPKTPTVINKILEVGGF-DEEIPEDMSTDELLKLL-GQDTSRSGDGMTAGSNGNGTGKISSTRDNS 510 (516) Q Consensus 433 ~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gl-p~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 510 (516) +.++++.+| .|++. .+.+.+.++. +.+..+-+-...+.... ............ +.+.+.+..+...++ T Consensus 438 e~~~~~~kl--~G~iS-----~et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~ 507 (511) T protein:vir:96 438 EELKAYIDS--GGKIS-----QTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPR---DINDDEQDDDTKDTV 507 (511) T ss_pred HHHHHHHHH--hccCC-----hHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhccccCCC---CCCCCCCCCcccccc Confidence 999999988 46543 3455666654 32211100000110000 000000000000 000111111111111 Q ss_pred hhhh Q lcl|NC_016071. 511 VSNM 514 (516) Q Consensus 511 ~~~~ 514 (516) ...- T Consensus 508 ~~~~ 511 (511) T protein:vir:96 508 DKKE 511 (511) T ss_pred cccC Confidence 1111 No 209 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=77.33 E-value=0.13 Score=25.52 Aligned_cols=375 Identities=10% Similarity=-0.019 Sum_probs=136.7 Q ss_pred CcccccchHHHHHHHHHH--------HhhcccccCCcccHHHHHHHhhChHHHHHHHHHHHHHhcCCceeeeCCCCCChh Q lcl|NC_016071. 
24 LRTGELGSGALSQLRAES--------EVMKVEELRWPCFLATVEAMKQDHTVSTALDTKYVFVTKAFNDFKVLYNRDSKA 95 (516) Q Consensus 24 ~~~~e~g~~~~~~~~~~~--------~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~~v~~~~w~i~~~~~~d~~~ 95 (516) |...-|+.- ++.+.... +-+-.+.++. -.+.+-++|.. .+.+++.--+..|..+.=++.+..-. .+ T Consensus 1 ~~~~~i~~L-~~~~~~~~~r~~~~~~yY~g~~~~~~-~~~~~p~~~~~--~~~~v~nw~~~iVds~a~rl~~~Gf~--~~ 74 (409) T protein:vir:16 1 MTEKGIGYL-RFKLSVHKRRAEMRYEQYAMKHVDRF-KGITIPQALSQ--QYRSILGWCAKGVDSLADRLVFREFE--ND 74 (409) T ss_pred CCHHHHHHH-HHHHHHHhHHHHHHHHHHhccCchhh-cchhhhHHHHH--HHhhhcChhHHHHHHhHhhccccccc--Cc Confidence 222222211 11111100 0000000000 00111122211 11122222222222221122222111 11 Q ss_pred hHHHHHHHHHHHhhccCcCCHHHHHHH-HHHHHhhcceeeeEEEeecccccccccceeeccccccCchhcccc------- Q lcl|NC_016071. 96 SKDAAEFVEYALKNLANQQTLRDIARS-AATFNEYGFSIFEKVYRTESAPSKYAGYITIDKIAFRPQSSLSRS------- 167 (516) Q Consensus 96 ~~~~a~~v~~~l~~~~~~~~~~~~l~~-~lda~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l~~r~q~ti~~~------- 167 (516) +.+ +.++|+. ..|.....+ ..+|+-||.|+. .||.-.. |...+.-+.|+.-..|.++ T Consensus 75 d~~----l~~i~~~----N~ld~~~~~~~~~al~yG~sf~-~v~~~~d------g~~~i~~~sP~~~~~i~D~~~~~~~~ 139 (409) T protein:vir:16 75 DFT----VNEIFEE----NNPDIFFDSTVLSALIASCSFT-YISKGEN------DAVRLQVIEATNATGIIDPITGLLTE 139 (409) T ss_pred chH----HHHHHHh----cChhHHHHHHHHHHHHhCceeE-EEecCCC------CceEEEEEcccceEEEeeccccccee Confidence 111 3444432 235555555 447999999765 7886432 3222222222111111100 Q ss_pred --cceeecCCCceeee---ccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcCCccccchhH- Q lcl|NC_016071. 168 --KPWVFDEDGRTLKG---IYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTESNPAGVSPL- 241 (516) Q Consensus 168 --~~f~~~~dg~~l~~---~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~g~p~G~gLl- 241 (516) +.|.-+.+|..... +......+ +..+ +. +.. 
........-+|.|.++.+-+.|+|.|-+ T Consensus 140 a~~~~~~d~~~~~~~~~~~~~~~~~~~--~~~~--------~~-~~~----~~~~~g~vPvV~f~n~~~~~~~~G~seI~ 204 (409) T protein:vir:16 140 GYAVLERDENNNVVLEAHFLPDRTDYY--YRDS--------RN-NIS----IANPTGNPLLVPIIHRPDAVRPFGRSRIT 204 (409) T ss_pred eeEEEEecCCCceEEEEEEecCcEEEE--EecC--------cc-ccc----eecCCCCcceEEecccccccccCCccccc Confidence 11222222221110 00100000 0000 00 000 1112233446778888888899998744 Q ss_pred HHHHH--HHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEeccCcc Q lcl|NC_016071. 242 VGCYR--AFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILPSDMN 319 (516) Q Consensus 242 r~~~~--~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~ 319 (516) +.+-. --+.|.. -.-+...|=+.+|-.++.|. .. +....+ .. ...+ ..-..+|++.+ T Consensus 205 ~~v~~l~da~~r~~--~~~~~~~e~~a~pqr~i~G~------d~-d~~~~~--~~---~~~~-------~~i~~~~~d~~ 263 (409) T protein:vir:16 205 RSGMYWQSNAKRTL--ERADVTAEFYSFPQKYVTGL------SD-DAEPME--TW---KATV-------SSMLQFTKDED 263 (409) T ss_pred hhHHHHHHHHHHHH--HHHHHHHHHhcChhheeEec------CC-CCCccc--hh---hhhh-------hHhhccCCCCC Confidence 32211 1111211 11223344455565555542 11 111111 11 1111 12234565432 Q ss_pred cccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhccccc---ccCCccchh-hHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016071. 320 AQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFIN---LGNDGQGSY-NLSESKQSIHGHFVQRDIDIIVE 395 (516) Q Consensus 320 i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLt---s~~~~~GS~-Al~~vh~ev~~~~~~aDa~~i~~ 395 (516) -+ .+++.+.+++ +...|...++-+-.++|-. .+=.+. ...++.+|- |+...+..+. ...+.-.+.+.. 
T Consensus 264 g~-----~~~v~q~~~~-~l~~~~~~l~~~~~~~a~~-s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~-~ka~~k~~~fg~ 335 (409) T protein:vir:16 264 GD-----KPTLGQFTQP-SMSPFTEQLRTAAAGFAGE-TGLTLDDLGFVSDNPSSVEAIKASHENLR-LAGRKAQRSLGA 335 (409) T ss_pred CC-----CceEEecCCC-ChhHHHHHHHHHHHHHhhh-cCCCHHHcccccCchhHHHHHHHHHHHHH-HHHHHHHHHHHH Confidence 11 1222222222 2223444444444444432 110110 001111332 3333222222 222333444555 Q ss_pred HHHHHHHHHHHHhcCCcCC--ccc-cceEEecCcC---chhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCC Q lcl|NC_016071. 396 AFNKNLIPQLLALNDIRLS--DED-MPKLKPGLIQ---EVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEI 467 (516) Q Consensus 396 ~ln~~li~~lv~lN~~~~~--~~~-~P~~~~~~~~---~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~ 467 (516) .+ +++.+..+.+=..... ++. --.++|.... ...+.+.|+++.||+.+|....+ .+.+++.+|+..++ T Consensus 336 ~l-~~~~rla~~~~~~~~~~~~~~~~~~v~W~~~~~~~~~s~a~~aDa~~Kl~~a~~~~~~---~~v~~~~~g~~~~d 409 (409) T protein:vir:16 336 GL-LNVAYLAACLRDDVPYLREQFSKTKPKWEPLFEADASMLSLIGDGAIKLNQAIPEFIN---KDTIRDLTGIKGAE 409 (409) T ss_pred HH-HHHHHHHHHHhcCCCccchhhccceEEecCCCCcchhhHHHHHHHHHHHHhhcccccc---hhHHHHhccCCCCC Confidence 66 3566665555221111 110 1245676444 44467888999999999864432 35679999998654 No 210 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=77.30 E-value=0.13 Score=25.51 Aligned_cols=444 Identities=10% Similarity=-0.075 Sum_probs=149.2 Q ss_pred CCccccCcccccchhhhcccCCCCcccccch--HHHHHHHHHHHhhcccccC-CcccHHHHHHHhh----ChHHHHHHHH Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGS--GALSQLRAESEVMKVEELR-WPCFLATVEAMKQ----DHTVSTALDT 73 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~--~~~~~~~~~~~~~~~~~lr-~~~~~~~y~~m~~----D~~v~s~l~~ 73 (516) |++|+.-....-....-...+.. .+.. .-+.-...+-.. .+.+. .+.. +=+++.. .....-++.. 
T Consensus 1 ~~~~~~~~~e~~~~~~~~~~l~~----~~~~~~~r~~~l~~YY~G--~~~i~~~~~~--~~~~~~~~~~v~n~~~~iVd~ 72 (486) T protein:vir:42 1 MTAPLPGMEEIEDPAVVREEMIS----AFEDASKDLASNTSYYDA--ERRPEAIGVT--VPREMQQLLAHVGYPRLYVDS 72 (486) T ss_pred CCCCCCCCCCcccHHHHHHHHHH----HHHHHHHHHHHHHHHhcc--cCcchhcccc--cchhHhhhhhccchHHHHHHH Confidence 55555443332222111000000 0000 000000111000 01110 0000 0011110 1111111111 Q ss_pred HHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHH-HHHHHhhcceeeeEEEeeccccccc--ccc Q lcl|NC_016071. 74 KYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARS-AATFNEYGFSIFEKVYRTESAPSKY--AGY 150 (516) Q Consensus 74 Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~-~lda~~~G~S~~Eivw~~~~~~~~~--~g~ 150 (516) .-..+.-..+ .++ .++..+ +.+.+.|++ ..|.....+ +.++.-||.| +++||.-..+.... ++. T Consensus 73 ~~~~l~~~g~--~~~--~~~~~~----~~~~~i~~~----N~~d~~~~~~~~~a~~~G~a-y~~v~~~e~~~~~~~~~~~ 139 (486) T protein:vir:42 73 VAERQAVEGF--RLG--DADEAD----EELWQWWQA----NNLDIEAPLGYTDAYVHGRS-FITISKPDPQLDLGWDQNV 139 (486) T ss_pred HHhhhcccce--ecC--CCchhH----HHHHHHHHh----cChhHHHHHHHHHHhhcCce-EEEEecCCcccccccCCCe Confidence 1111111112 221 122222 224444442 125555555 4468889997 67898754432211 122 Q ss_pred eeeccccc------------cCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCcccc Q lcl|NC_016071. 151 ITIDKIAF------------RPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFI 218 (516) Q Consensus 151 ~~~~~l~~------------r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i 218 (516) ..+..+.+ ++..-+ +++ ++.++..+.... .|....+......+..+... 
......+ T Consensus 140 ~~i~~~~p~~~~~i~d~~~~~~~~~~---~~~-~~~~~~~~~~~~-------~y~~~~~~~~~~~~~~~~~~-~~~~h~~ 207 (486) T protein:vir:42 140 PIIRVEPPTRMHAEIDPRINRVSKAI---RVA-YDKEGNEIQAAT-------LYTPMETIGWFRADGEWAEW-FNVPHGL 207 (486) T ss_pred eEEEEecccceEEEEeCCCCCeEEEE---EEE-EecCCCeEEEEE-------EEcCCcEEEEEecCCcEEee-cceecCC Confidence 22211111 111111 112 222222111100 01111000000000000000 0011223 Q ss_pred ccccEEEEeecCcCCccccchhHHHHHHHHHHH-HHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHH Q lcl|NC_016071. 219 PINKLMVMSLGGTESNPAGVSPLVGCYRAFREK-ILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGL 297 (516) Q Consensus 219 P~~k~i~~~~~~~~g~p~G~gLlr~~~~~~~fK-~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l 297 (516) +..-++.|+++.+.+.|+|.|-+..-..+.+-. +..+...+...+-+..|..+++|... .+-..++....... T Consensus 208 g~vPvv~~~n~~~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~e~~a~p~~~i~G~~~------~~~~~~~~~~~~~~ 281 (486) T protein:vir:42 208 GVVPVVPLPNRTRLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKP------EEIGVDSETGQTLF 281 (486) T ss_pred CCceEEEeccccccCCCCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchHHHhhcCCc------cccccccccccchh Confidence 334457788888889999998776422222111 11223344455556666555554210 00000000000000 Q ss_pred HHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccC-----Cccchh Q lcl|NC_016071. 298 MADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGN-----DGQGSY 372 (516) Q Consensus 298 ~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~-----~~~GS~ 372 (516) . +...+-.++|.+ + .++...+.. +...++++++.-|.+.-..-.++... .+..|. 
T Consensus 282 ~-------~~~~~~~~~~~~-~--------~~~~q~~~~----~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg 341 (486) T protein:vir:42 282 D-------AYLARILAFEDA-E--------GKIQQFSAA----ELANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASA 341 (486) T ss_pred h-------hhhchhcccCCC-C--------ceEEeeccc----CHHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHH Confidence 0 011111233332 1 223332222 24456777776665543322222111 111122 Q ss_pred -hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-CCcCCcc-ccceEEecCcCchhHHHHHHHHHHHHhCCc-cc Q lcl|NC_016071. 373 -NLSESKQSIHGHFVQRDIDIIVEAFNKNLIPQLLALN-DIRLSDE-DMPKLKPGLIQEVDMEGFSKFVQRIGAVGY-LP 448 (516) Q Consensus 373 -Al~~vh~ev~~~~~~aDa~~i~~~ln~~li~~lv~lN-~~~~~~~-~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~-~~ 448 (516) |+.- ...-....++.-.+.+...|. ++++.++.+- ....+.. .--.+.|....+.++.+.|+++.+|++.|. ++ T Consensus 342 ~Al~~-~~~~l~~ka~~~~~~f~~~l~-~~~~l~~~~~~~~~~~~d~~~i~v~w~~~~~~s~~~~ad~~~kl~~~~~g~~ 419 (486) T protein:vir:42 342 EAIRA-AESRLIKKVERKNLMFGGAWE-EAMRIAYRIMKGGDVPPDMLRMETVWRDPSTPTYAAKADAATKLYGNGQGVI 419 (486) T ss_pred HHHHH-HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhcCCCccccceeeeEEecCCCCCCHHHHHHHHHHHHhcccCCC Confidence 2221 122222223333344455553 3455444442 2111111 113567888888999999999999998764 33 Q ss_pred ccHHHHHHHHHHcCCCCCCCccccc--Ccc---c----ccCCCCC-CcccccccccCCCCCcccccccccc Q lcl|NC_016071. 449 KTPTVINKILEVGGFDEEIPEDMST--DEL---L----KLLGQDT-SRSGDGMTAGSNGNGTGKISSTRDN 509 (516) Q Consensus 449 ~~~~~~~~i~e~~Glp~~~~~~~~~--~~~---~----~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~d~ 509 (516) + .+-+++.+|+-+...++... ++. . ....... 
...+.....+.+...++..+++.|- T Consensus 420 s----~et~~~~lg~~~d~~~e~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 486 (486) T protein:vir:42 420 P----RERARIDMGYSVKEREEMRRWDEEEAAMGLGLLGTMVDADPTVPGSPSPTAPPKPQPAIESSGGDA 486 (486) T ss_pred C----HHHHHhcCCCChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCCCCCCCCCCCCcccCCCCCCC Confidence 3 45677888875332211100 000 0 0000000 0000001001011111111111111 No 211 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=76.93 E-value=0.13 Score=25.44 Aligned_cols=409 Identities=10% Similarity=-0.060 Sum_probs=153.7 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhhChHHHHHHHHHHHHHhc Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQDHTVSTALDTKYVFVTK 80 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~~v~~ 80 (516) |.+... .+..+...=+..-+.+........+- .+.......++.| ...+...-++.+....+.+ T Consensus 32 ~~~~~~--~~~~~~~~yy~g~~~i~~~~~~~~~~---~~~~~~~~~~~~k-----------i~~~~~~~Ivd~~~~~l~g 95 (479) T protein:vir:79 32 ILKHRP--EKYKQGEEYYYGNTDVNNKRRYYLLD---GAKVDDFTKVNNK-----------AINNYHKLLVDQKVGYSVG 95 (479) T ss_pred HhhhhH--HHHHHHHHHhccCCcccccccccccc---cccccccccCcce-----------eecchHHHHHHHHHhhhhc Confidence 222110 01110000011111100000000000 0000000000000 0123333344444455555 Q ss_pred CCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHH-HHHHhhcceeeeEEEeecccccccccceeecccccc Q lcl|NC_016071. 81 AFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSA-ATFNEYGFSIFEKVYRTESAPSKYAGYITIDKIAFR 159 (516) Q Consensus 81 ~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~-lda~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l~~r 159 (516) -+..+.+ ++ .++.++++.++++ .|.+.+.++ .++..||.+. +.+|.... 
|.+.+.-+.|+ T Consensus 96 ~p~~~~~----~~---~~~~~~~~~~~~n-----~~~~~~~~~~~~~~~~G~~~-~~v~~d~~------~~~~i~~~~p~ 156 (479) T protein:vir:79 96 NPIVFNA----DD---DNLTKLLNDLLGE-----EFDDTITELYLNASNKGVEW-LHPYINRK------GEFKYVIIPAE 156 (479) T ss_pred CCceecc----CC---HHHHHHHHHHHhc-----CHHHHHHHHHHHHHhcCeEE-EEEEeCCC------CceEEEEEccc Confidence 5554432 22 2344556665542 366665554 4788899775 57775432 33333222221 Q ss_pred CchhcccccceeecC--CCceeeecccccc--------------------ccccccccccccccccccccccc------- Q lcl|NC_016071. 160 PQSSLSRSKPWVFDE--DGRTLKGIYQSKM--------------------AFANFQNGLTQISSAMSLVTNLT------- 210 (516) Q Consensus 160 ~q~ti~~~~~f~~~~--dg~~l~~~~q~~~--------------------~~~~~~~~~~~~~~~~~~~~~~~------- 210 (516) ++. ..|++ +++.+..++-... .+.....+ .............. T Consensus 157 ---~~~----~v~d~~~~~~~~~~ir~y~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~ 228 (479) T protein:vir:79 157 ---EAI----PIWDSKRQRELVAFIRFYYIEDIDGNKIKRVEYYTENDITYFIERGNS-FIQEFLYDEYGKMTDIQEGHF 228 (479) T ss_pred ---eeE----EEEeCCCCCceEEEEEEEEEeecCCceEEEEEEEeCCcEEEEEecCCc-ccccccccccccccccccccc Confidence 110 11121 1111211110000 00000000 00000000000000 Q ss_pred -cCCCccccccccEEEEeecCcCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHH Q lcl|NC_016071. 211 -SSADEVFIPINKLMVMSLGGTESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSP 289 (516) Q Consensus 211 -~~~~~~~iP~~k~i~~~~~~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~ 289 (516) .......++..-++.|+ +|++|.|.+..+--..=-=+..+..++..++.+..|+.++++.+.- + T Consensus 229 ~~~~~~~~~~~vPvv~~~-----nn~~g~sd~~~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~------~---- 293 (479) T protein:vir:79 229 RINNKEQGWGKVPFIPFK-----NNEKCVSDLTFYKSLIDIYDNNISTLADNLDEIQEVIYVLKEYPGT------S---- 293 (479) T ss_pred cccccccCCCcccEEEec-----CCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCcc------c---- Confidence 00000111111233332 4678889887654322222335567888889999998887763211 1 Q ss_pred HHHHHHHHHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCcc Q lcl|NC_016071. 
290 ESEMVQGLMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQ 369 (516) Q Consensus 290 ~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~ 369 (516) +.+....+. .+ ..+.++.+- +++++..+. ....+...++.+.+.|.+.--+..++.+.. T Consensus 294 ~~~~~~~~~-------~~--~~i~~~~~~--------~~~~l~~~~--~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-- 352 (479) T protein:vir:79 294 LQEFIDNIR-------YY--KSIKVDGGG--------GVDKLEINI--PVEAKKELLDRLEKNIIIFGQGVNPESQNT-- 352 (479) T ss_pred cccchhhhh-------hc--cceecCCCC--------cceEEeccC--CHHHHHHHHHHHHHHHHHHhCccccccccc-- Confidence 111111111 01 123345543 355554443 233477788888888887776666655432 Q ss_pred chhhHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHh-cCCcC--CccccceEEecCcCchhHHHHHHHHHHHH Q lcl|NC_016071. 370 GSYNLSESKQSIH----GHFVQRDIDIIVEAFNKNLIPQLLAL-NDIRL--SDEDMPKLKPGLIQEVDMEGFSKFVQRIG 442 (516) Q Consensus 370 GS~Al~~vh~ev~----~~~~~aDa~~i~~~ln~~li~~lv~l-N~~~~--~~~~~P~~~~~~~~~~dl~~~a~~~~~L~ 442 (516) |. ++.+.-+.. ...+..-.+.+.+.|. ++++.++.+ |...+ .+..-+.+.|...-+.|.++.++++.+|+ T Consensus 353 gn--~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~-~~~~li~~~~~~~~~~~~~~~~i~i~f~~~~p~~~~~~a~~~~kl~ 429 (479) T protein:vir:79 353 GD--KSGVALKFLYSLLDLKCSKTEKKFKKAIR-ELLWFVCEYLKISGNKSYDYKTVQITFNHSMIINEAEKIDMAAKST 429 (479) T ss_pred cc--hhHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhccCCCccccccceEEeCCCCCcCHHHHHHHHHHHh Confidence 21 111111111 1122222233444443 355555443 21111 12233678898888999999999999985 Q ss_pred hCCcccccHHHHHHHHHHcCC-CCCCCcccccCcccccCCCCCCcccccccccCCCCCcc Q lcl|NC_016071. 443 AVGYLPKTPTVINKILEVGGF-DEEIPEDMSTDELLKLLGQDTSRSGDGMTAGSNGNGTG 501 (516) Q Consensus 443 ~~G~~~~~~~~~~~i~e~~Gl-p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 501 (516) |++ + ++.+.+.++. +.+..+-+-...+.....+....... 
...+...++ T Consensus 430 --g~i-S----~et~l~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~---~~~~~~~e~ 479 (479) T protein:vir:79 430 --GIV-S----DETIVSNHPWVEDVNDELERLKKQEDTQKEYDDLIPN---NQDGVIDET 479 (479) T ss_pred --ccC-c----HHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhccCc---ccCCCcCcC Confidence 653 2 3455666654 32211111111111000000000000 000011111 No 212 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=75.64 E-value=0.14 Score=25.19 Aligned_cols=440 Identities=10% Similarity=0.014 Sum_probs=152.6 Q ss_pred CCccccCcccccc----------hhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcc----cHHHHHHHhhChH Q lcl|NC_016071. 1 MSTRFAQPSEVVK----------AGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPC----FLATVEAMKQDHT 66 (516) Q Consensus 1 ~~~r~~~~~~~~~----------~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~----~~~~y~~m~~D~~ 66 (516) |-.++++.-+-.- .-...+-++ +...++.++ ..+..+- .-..+.+.... .-..-+....-.. T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~-~~~~~~~~i--~~~~~~Y-~g~~~~~~~~~~~~~~~~~~~~~~s~n~ 76 (499) T protein:vir:80 1 MINQIIAGVKGVMRRMGLLKSLKDVTDHKKVN-ANDEDYKYI--DMWKRLY-QGNYAEWHNLNYEHNGNPVNRRQLSMNL 76 (499) T ss_pred ChhHHHHHHHHHHHHhccccchhhhhcCCCCc-CCHHHHHHH--HHHHHHh-cCCcchhhccccccCCCccccceeecch Confidence 3332222211100 000001000 111122221 1222221 11111110000 0000000000011 Q ss_pred HHHHHHHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHH-HHHhhcceeeeEEEeeccccc Q lcl|NC_016071. 67 VSTALDTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAA-TFNEYGFSIFEKVYRTESAPS 145 (516) Q Consensus 67 v~s~l~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~~~~~~~ 145 (516) -..+..+--..+.+-+-.|++. 
+++..+++.+++++- .|...+..++ .|..+|-+++=+.|..+ T Consensus 77 ~~~iv~~~a~~l~~ep~~i~~~-------d~~~~e~l~~~~~~n----~f~~~~~~~~~~a~~~G~~~~~~~~D~~---- 141 (499) T protein:vir:80 77 PKVTAKYMSKLLFNEKVKINID-------DETAEEFVLNVLKTN----GFTKNMERYIEYGEAMGGFVIKVYHDGN---- 141 (499) T ss_pred HHHHHHHHHHhhhCCcceEeeC-------CHHHHHHHHHHHhhc----cHHHHHHHHHHHHhhcCcEEEEEEECCC---- Confidence 1222222233444444455542 245677788777642 2666665554 69999999997777643 Q ss_pred ccccceeeccccccCchhcccccceeecCCCce--ee-------------eccccccccc---cccc--------ccccc Q lcl|NC_016071. 146 KYAGYITIDKIAFRPQSSLSRSKPWVFDEDGRT--LK-------------GIYQSKMAFA---NFQN--------GLTQI 199 (516) Q Consensus 146 ~~~g~~~~~~l~~r~q~ti~~~~~f~~~~dg~~--l~-------------~~~q~~~~~~---~~~~--------~~~~~ 199 (516) |.+.+..+. +..+-+ ..++ .|+. +. .++-|..... .+.. ..... T Consensus 142 ---~~~~i~~v~---a~~~~P---i~~d-~~~~~~~~f~~~~~~~~~~y~~lE~h~~~~~~~~~y~I~n~~~~~~~~~~l 211 (499) T protein:vir:80 142 ---KNVKVSFAT---ADCMYP---LSND-SENVDECLIANSFHKNNKYYKLLEWNEWKGEKEEVYTVTTELYQSDDPNEL 211 (499) T ss_pred ---CcEEEEEEc---CCceEE---EEec-CCCeEEEEEEEEEeecCeEEEEEEEEEecccceeeEEEEEEEEeccCcccc Confidence 333332211 111110 1111 2221 11 1110000000 0000 00000 Q ss_pred ccccccccccccCCCcccc---ccccEEEEeec----CcCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceee Q lcl|NC_016071. 200 SSAMSLVTNLTSSADEVFI---PINKLMVMSLG----GTESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIEL 272 (516) Q Consensus 200 ~~~~~~~~~~~~~~~~~~i---P~~k~i~~~~~----~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~ 272 (516) +.++.+-.-.........+ +.--|++|+.. ...++|+|.|.+..|--..--=+..+..|+.-++. + . 
..+ T Consensus 212 G~~v~l~~~~~~~~~~~~~~~~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~~-~-~-~~i 288 (499) T protein:vir:80 212 GGKVSLKLLFNDIEPVVPLPSLTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKL-G-K-KKV 288 (499) T ss_pred CcccchhhhccCcCCceeecCCCccceEeecCCccccccCCCccCCchHhhHHHHHHHHHHHHHHHHHHHHh-c-c-cce Confidence 1111110000111111111 11225555432 35678999999998854322222233333333331 1 1 122 Q ss_pred eecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHH Q lcl|NC_016071. 273 KIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKA 352 (516) Q Consensus 273 ~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~ 352 (516) +.|...+.......+..... ......-...++...+ +... .++.. +..-....|.+.++.+=++ T Consensus 289 ~v~~~~l~~~~~~~g~~~~~-----------~~~~~~~~~~~~~~~~-~~~~--~i~~~--~~~ir~e~~~~~l~~~l~~ 352 (499) T protein:vir:80 289 LVPSSFVKTAVNLDGSTTQY-----------FDSTDEAFFLYQGEQD-DNGK--AIKDI--SVEIRSTEFIESINAMLRI 352 (499) T ss_pred ecchhhhhccCCCCCCcccC-----------CCcccceeeEeeccCC-CCcC--ceeEe--cCcCChHHHHHHHHHHHHH Confidence 33333332211111100000 0000011111111100 0000 01111 0111111244444444455 Q ss_pred HHHHH-hcc-cccccCCccchhhHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHh-------cCCcCCccccceE Q lcl|NC_016071. 353 ILDRF-GAG-FINLGNDGQGSYNLSESKQSIHG--HFVQRDIDIIVEAFNKNLIPQLLAL-------NDIRLSDEDMPKL 421 (516) Q Consensus 353 Isk~i-LGq-tLts~~~~~GS~Al~~vh~ev~~--~~~~aDa~~i~~~ln~~li~~lv~l-------N~~~~~~~~~P~~ 421 (516) |+..+ +++ ++.. ++.|-...-++..+-.. .-+..-.+.+...|. +|++.++.+ ++. 
..+...+.+ T Consensus 353 i~~~~g~s~~~fg~--~~~g~~TAtei~s~~~~l~~~~~~~~~~~~~~l~-~l~~~il~~~~~~~~~~~~-~~~~~~v~v 428 (499) T protein:vir:80 353 YAMQVGLSAGTFTF--DENGLKTATEVVSEKSETYQTKNSHSQLIEQGIK-EMIVSILEVGKLIKAYDGD-TVELDTITV 428 (499) T ss_pred HHHhcCCChhhcCC--CcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhccccCC-CCCccceEE Confidence 55444 332 2222 22221111122111111 112233445555563 455555432 211 122334678 Q ss_pred EecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccccCcccccCCCCCCcccccccccCCCCCc Q lcl|NC_016071. 422 KPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMSTDELLKLLGQDTSRSGDGMTAGSNGNGT 500 (516) Q Consensus 422 ~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 500 (516) .|+..-..|.++.++.+.+++.+|++.. +.++.+.+|.++.+-+++. ++..+.... ..+.. ...+..|+.. T Consensus 429 ~f~d~i~~d~~~~~~~~~~~~~~Gi~S~----et~l~~~~~~~d~ea~~el-~~i~~E~~~-~~~~~--d~~g~~ge~e 499 (499) T protein:vir:80 429 DFDDSIAQDEDTTINRYTTAKNQGMIPL----KIALQRAWNITEAEADEWA-EMLAKEKQA-EIPNN--DMTGIFGEEE 499 (499) T ss_pred EeCCCCCCCHHHHHHHHHHHHHcCCCCH----HHHHhhcCCCChHHHHHHH-HHHHHHhhc-CCCCC--CccccCCCCC Confidence 8988888888888999999999998664 5688888998753211121 111111000 00111 1112222222 No 213 >protein:vir:108049 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595296;genbank:gi:161622602;genbank:GeneID:5783768 Probab=75.44 E-value=0.15 Score=25.15 Aligned_cols=443 Identities=12% Similarity=0.056 Sum_probs=161.0 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHH------HHHHHHHHH---hhcccccC-CcccHHHHHHHhhChHHHHH Q lcl|NC_016071. 
1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGA------LSQLRAESE---VMKVEELR-WPCFLATVEAMKQDHTVSTA 70 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~------~~~~~~~~~---~~~~~~lr-~~~~~~~y~~m~~D~~v~s~ 70 (516) +....+...+..+...+-|+.|- .+-|+.- -....++.+ ...-+..+ ..++|+.|++|..+|.|-++ T Consensus 15 ~~~de~~~~~~~~~~~~S~~~p~---~~dGa~~I~~~~~~~~~~~~~q~~y~~~e~~~~~~~eLI~~YR~ma~~pEvd~A 91 (524) T protein:vir:10 15 ANEDEKEYKQQINNNLESVTAPK---LDDGAREIETQEQNIPYNALMQQMFGSNEPEVKNTRELIDTYRNLMNNYEVDNA 91 (524) T ss_pred hcchhhhhhhhhccCCCccccCC---CCCCceeeccCcccccchhhhhhhhhcccchhhhHHHHHHHHHHHhhccchhhH Confidence 21111111111111111111111 1111100 001111111 00112222 34589999999999999999 Q ss_pred HHHHHHHHhcCCc-e--eeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHHHHHhhcceeeeEEEeeccccccc Q lcl|NC_016071. 71 LDTKYVFVTKAFN-D--FKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAATFNEYGFSIFEKVYRTESAPSKY 147 (516) Q Consensus 71 l~~Rk~~v~~~~w-~--i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~lda~~~G~S~~Eivw~~~~~~~~~ 147 (516) ++-.-.-+.-.+- . |.+.-. +.+.++.+-+.|.+. |+ .+..+|+.--+||..+= .|+- T Consensus 92 v~eIVneaiv~d~~~~pV~l~Ld-~~~~s~siK~kI~ee---------F~-~Il~ll~F~~~~~~~fR--------~WYV 152 (524) T protein:vir:10 92 VQEIVSDAIVYEDDKEVVALNLD-GTDFSQSIKDKILAE---------FS-EVLNLLNFQRKGTDHFQ--------RWYV 152 (524) T ss_pred HHHhhcceeEecCCCceEEEEec-ccCcchHHHHHHHHH---------HH-HHHHHhccchhhhHHHh--------hhee Confidence 9987654321110 0 011100 111233333334333 22 33356666666666552 2334 Q ss_pred ccceeecccc-------------ccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCC Q lcl|NC_016071. 148 AGYITIDKIA-------------FRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSAD 214 (516) Q Consensus 148 ~g~~~~~~l~-------------~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 214 (516) ||++..+++. ...|..|+..|......++.. .-+. ....+..|..+... +.-....-+... 
T Consensus 153 DgRi~fHkiid~~~pk~GI~Elr~lDPr~i~~vr~i~~~~~~~~-~vi~-~~~e~f~Y~~~~~~----~~~~~~~~~~~~ 226 (524) T protein:vir:10 153 DSRIFFHKIINPKKMKDGVQELRRLDPRQVQYIREIVTRMEDGV-KIVD-GYREFFVYDTGHES----YCADGRIYSAGT 226 (524) T ss_pred eceEEEEEEeeCCCccccceeeeeeCCccceeeeeecccCcccc-hhhc-chhhheeecCCCcc----cccCcceecCCc Confidence 5555544433 222333333333322322221 1111 11112222111110 000111123456 Q ss_pred ccccccccEEEEeecCcCC-c-cccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccccc--CCCCHHH Q lcl|NC_016071. 215 EVFIPINKLMVMSLGGTES-N-PAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAA--IDPKSPE 290 (516) Q Consensus 215 ~~~iP~~k~i~~~~~~~~g-~-p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~--~~~~~~~ 290 (516) ++.||.+- |+|+|..-.+ + -.=.|.|.++..|+==-+.....-.++ .+-.+|.+|+=+-. .=|.... T Consensus 227 ~ikI~~dA-Ivy~~SGL~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIY--------RitRAPeRRvFYIDVGnlPk~KA 297 (524) T protein:vir:10 227 KVKIPRAA-VVYAHSGLLDCCGKNIIGYLQRAIKPANQLKLMEDAMVIY--------RITRAPDRRVFYIDTGNMPSRKA 297 (524) T ss_pred ceecchhh-eeeeccCcccCCCCceeccchHhhHHHHhhHHHHhhHHHH--------hhhccccceEEEEecCCCCchhH Confidence 77888775 7777753211 1 122378888888764333322222211 12222222221111 1122222 Q ss_pred HHHHHHHHHHHHHhhc----ccceEE--------------EeccCcccccccccceeeeeccccCcchhHHHHHHHHHHH Q lcl|NC_016071. 291 SEMVQGLMADAANAHA----GEQAYF--------------ILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKA 352 (516) Q Consensus 291 ~~~l~~l~~~~~~~~~----g~~a~~--------------iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~ 352 (516) ++. +..++..++. ....|- -||.= +.....+|+-+ .|+... .-.+=|+|..+. 
T Consensus 298 eqY---l~~im~k~kNKlvYDa~TGev~ddrk~msMlEDyWLpRR---eGgrgTEItTL--pGgqnl-gem~DV~YF~kk 368 (524) T protein:vir:10 298 AAQ---MQHIMNTMKNRVVYDASTGKIKNQQHNMSMTEDYWLQRR---DGKAVTEVDTM--PGATGM-SDMDDVLYFRTA 368 (524) T ss_pred HHH---HHHHHHhcCceeEEeccCCeeccchhhhhhHhhhccccc---CCCCccceeec--cccCCc-ChHHHHHHHHHH Confidence 222 2333322111 001111 12210 00111233333 222222 233458999999 Q ss_pred HHHHHhcccccccCCccchh---hHHH-HHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCc---cccceEEec Q lcl|NC_016071. 353 ILDRFGAGFINLGNDGQGSY---NLSE-SKQSI-HGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSD---EDMPKLKPG 424 (516) Q Consensus 353 Isk~iLGqtLts~~~~~GS~---Al~~-vh~ev-~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~---~~~P~~~~~ 424 (516) +-+++--..--.+.++.|+. ..++ +-.|+ |...++.-...+...|..-|-..|+.=+ .--+. .--+.+.|+ T Consensus 369 Ly~aLnVP~sRl~~e~~~~f~~gr~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKg-iit~eew~~i~~~I~~~ 447 (524) T protein:vir:10 369 LYRALRIPESRIPSESNSGVMFDAGTAITRDELKFAKWIRQLQNKFEEIFLDPLKTNLILKK-IITEDEWEREINNIKVT 447 (524) T ss_pred HHHHhCCCchhccCCCCccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcc-CCCHHHHHHHhhcceEE Confidence 99988877644432322222 2223 23333 2333444444444455433333333211 11110 111334444 Q ss_pred CcCc------hhHHHHHHHHHHHHhCCcccccHHHHHHHHHHc-CCCCCCC--cccccCcccc--cCCCCCCccccc Q lcl|NC_016071. 425 LIQE------VDMEGFSKFVQRIGAVGYLPKTPTVINKILEVG-GFDEEIP--EDMSTDELLK--LLGQDTSRSGDG 490 (516) Q Consensus 425 ~~~~------~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~-Glp~~~~--~~~~~~~~~~--~~~~~~~~~~~~ 490 (516) ...+ .+.+-+.+++..|..+-=.+-...+.+||++.+ .+...+- ++...+.+.+ .-+++..+..+. 
T Consensus 448 f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~~~k~I~~E~k~~~~~~~~~~~~~f 524 (524) T protein:vir:10 448 FNRDSYFSEMKDAEIMERRINMLTMAEPFIGKYISHQTAMKDFLQMTDEEINQEAKQIEEESKEARFQNPDEEEEDF 524 (524) T ss_pred eeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCCChhhhcC Confidence 3333 333444445555544322222234578887654 4432111 1111111111 011111111111 No 214 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=74.09 E-value=0.16 Score=24.91 Aligned_cols=429 Identities=10% Similarity=-0.057 Sum_probs=157.6 Q ss_pred CCccccCcccccch-hhhcccCCCCcccccchHHHHHHHHHHHhhc---ccccCCcccHHHHHH---------H--h--- Q lcl|NC_016071. 1 MSTRFAQPSEVVKA-GNENLAVSRLRTGELGSGALSQLRAESEVMK---VEELRWPCFLATVEA---------M--K--- 62 (516) Q Consensus 1 ~~~r~~~~~~~~~~-~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~---~~~lr~~~~~~~y~~---------m--~--- 62 (516) |+-=-..+...... ....|.-.-+...+ +..++.... .+.+ -+..+.|+- . . T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------i~~~i~~~~~~~~~~~--~~l~~Yy~g~~~i~~~~~~~~~~~~ 70 (470) T protein:vir:99 1 MKDINYGRDKVTGNSSFIFPKGEKLTSNE--------LLGFIAYNETVLKPRY--RENMKLYLGKHKILTAPEKETGADN 70 (470) T ss_pred CccccCCcccccCCceEEeCCCCCcCHHH--------HHHHHHHHHHhhHHHH--HHHHHHhccccccccCcccccCCcc Confidence 43221111111100 00011100111111 111111110 0110 011111110 0 0 Q ss_pred --hChHHHHHHHHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHH-HHHhhcceeeeEEEe Q lcl|NC_016071. 63 --QDHTVSTALDTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAA-TFNEYGFSIFEKVYR 139 (516) Q Consensus 63 --~D~~v~s~l~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~l-da~~~G~S~~Eivw~ 139 (516) ......-++......+.+-+..+.+. .++ +..+.+.+.+.+- .|...+.++. ++.-||.+ ++++|. 
T Consensus 71 ki~~n~~~~Ivd~~~~~l~g~p~~~~~~--~d~----~~~~~l~~~~~~n----~~~~~~~~~~~~~~~~G~~-~~~v~~ 139 (470) T protein:vir:99 71 RIVVNSAKYVVDVYNGYFCGIEPKLALL--NDS----SKIDEIARWNRQE----NFFDTINEISKQCDIFGRS-IASIYQ 139 (470) T ss_pred eeecchHHHHHHHHhhhhccCCeeEeeC--Cch----hHHHHHHHHHHhc----CHhHHHHHHHHHHHhcCee-EEEEEe Confidence 12333444444444455555455432 122 2234455555432 3655555544 68889975 667775 Q ss_pred ecccccccccceeeccccccCchhcccccceeecCCCc--eeeecccccccccccccccccccccccccccccc------ Q lcl|NC_016071. 140 TESAPSKYAGYITIDKIAFRPQSSLSRSKPWVFDEDGR--TLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTS------ 211 (516) Q Consensus 140 ~~~~~~~~~g~~~~~~l~~r~q~ti~~~~~f~~~~dg~--~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~------ 211 (516) -. +|.+.+..+.|+. +. -.|++... .+..++-...........+..+..+...+..... T Consensus 140 d~------dg~~~i~~~~p~~---~~----~i~d~~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 206 (470) T protein:vir:99 140 GE------DARPHLMYSSPNH---AF----IIYDDTVQRQPLAFVHYQIDNSNNWTDAYGVIQYADKFYKFKGYDIEEDT 206 (470) T ss_pred CC------CCeEEEEEEccce---eE----EEEcCCCCcceEEEEEEEEEecCCeeEEEEEEEecCeEEEEEeccccccc Confidence 43 3444433333221 10 11222111 1111111000000000000000000000000000 Q ss_pred -CCCccccccc--cEEEEeecCcCCccccchhHHHHHHHHHHH-HHHHHHHHHHHhhccccceeeeecccccccccCCCC Q lcl|NC_016071. 212 -SADEVFIPIN--KLMVMSLGGTESNPAGVSPLVGCYRAFREK-ILIENLETIGASKDLGGIIELKIPSQILNKAAIDPK 287 (516) Q Consensus 212 -~~~~~~iP~~--k~i~~~~~~~~g~p~G~gLlr~~~~~~~fK-~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~ 287 (516) ..+..+-|.. .++.| .+++.|.|.+..+- +.+-- ...+..++..++.+..|..++++.- .+. 
T Consensus 207 ~~~~~~~~~~g~vPvv~~-----~n~~~g~sd~e~v~-~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~--------~~~ 272 (470) T protein:vir:99 207 NAAGYAINPYGLVPAVEF-----FENEERQGIFDSIK-TLINALDKVISQKANQVEYFDNAYMYMIGFK--------LPE 272 (470) T ss_pred ccccccccCCCccceEee-----cCCCCCCcchHhHH-HHHHHHHHHHHHHHHHHHHhcCceeeeecCC--------ccc Confidence 0001111111 12222 24678899988743 33322 3355667777788888888777531 111 Q ss_pred HHHHHHHHHHHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCC Q lcl|NC_016071. 288 SPESEMVQGLMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGND 367 (516) Q Consensus 288 ~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~ 367 (516) .++.+.+ ..+ .......+|... ..+...++++..+. ....+...++.+.+.|...-....++.+.. T Consensus 273 ~~~g~~~---~~~------~~~~~~~~~~~~---~~~~~~~~~l~~~~--~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~ 338 (470) T protein:vir:99 273 DDEGNPK---FDF------KNNRVLYVSQLD---PDTNPQIGFIAKPD--ADQMQENLIQHLTDFIFMMAMVPNIQDKNF 338 (470) T ss_pred ccccchh---hhh------hhcceeeecCCC---CCCCCcceEEeecC--ChHHHHHHHHHHHHHHHHHhCCcccccccc Confidence 1111111 111 111122333211 11223455554432 223466778888888877765554444332 Q ss_pred ccchhhHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHh-cC-CcCC-ccccceEEecCcCchhHHHHHHHHHHHH Q lcl|NC_016071. 368 GQGSYNLSESKQ--SIHGHFVQRDIDIIVEAFNKNLIPQLLAL-ND-IRLS-DEDMPKLKPGLIQEVDMEGFSKFVQRIG 442 (516) Q Consensus 368 ~~GS~Al~~vh~--ev~~~~~~aDa~~i~~~ln~~li~~lv~l-N~-~~~~-~~~~P~~~~~~~~~~dl~~~a~~~~~L~ 442 (516) ++.+.+.+ .+. .-....++.-.+.+...| +++++.++.+ +. .... 
...-..+.|...-+.|..++++++.+|+ T Consensus 339 ~~n~Sg~A-i~~~~~~l~~k~~~~~~~~~~~l-~~~~~li~~~~~~~~~~~~~~~~i~v~f~~~~p~~~~e~a~~~~kl~ 416 (470) T protein:vir:99 339 AGNSSGVA-LQYKLFAMKNKADSKERKFDKSL-MQLYRIVLATLFNNKQDQELWSELDFKFTRNLPEDMASAIDNAKNAE 416 (470) T ss_pred ccCchHHH-HHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhccCCcccccccceEEeCCCCCcCHHHHHHHHHHHh Confidence 21111111 111 111222333334445555 3455555543 11 1111 1223578899999999999999999986 Q ss_pred hCCcccccHHHHHHHHHHcCCCCCCCcccccCccccc---CCCCCCcccccccccCCCCCcccccccccc Q lcl|NC_016071. 443 AVGYLPKTPTVINKILEVGGFDEEIPEDMSTDELLKL---LGQDTSRSGDGMTAGSNGNGTGKISSTRDN 509 (516) Q Consensus 443 ~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 509 (516) |++ + .+.+.+.++.-.+..+=+-...+... ..+......+.. ...++++- . T Consensus 417 --gii-s----~et~l~~l~~vd~~~E~eri~~E~~~~~~~~~~~~~~~d~~--~~d~~~ee-------~ 470 (470) T protein:vir:99 417 --GIV-S----KKTQLGMIPDIEPDAEMKQIAKEKADAIKQTQQLSMPIDIL--KRDNNAEE-------E 470 (470) T ss_pred --ccC-C----HHHHHHhCCCCCHHHHHHHHHHHHHHHHHHHHhhcCCCCcC--CCCCCccC-------C Confidence 653 3 24455555443221110001111000 000111111111 11111111 1 No 215 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=72.59 E-value=0.18 Score=24.66 Aligned_cols=445 Identities=11% Similarity=0.048 Sum_probs=147.5 Q ss_pred CCccccCcccccch----------hhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhhC----hH Q lcl|NC_016071. 1 MSTRFAQPSEVVKA----------GNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQD----HT 66 (516) Q Consensus 1 ~~~r~~~~~~~~~~----------~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D----~~ 66 (516) |-.++++.-+-.-. -...+..+ +...++.++ ..+..+- .-+.+.+.... 
......-..+ -- T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~i--~~~~~yy-~g~~~~~~~~~-~~~~~~~~~~~~~~~n 75 (496) T protein:vir:38 1 MINQIIAGVKGVMRRMGLLKALKDVKDHKKVN-ANDEDYKYI--DMWKRLY-QGHYAEWHNLN-YEHNGNPVNRRQLSMN 75 (496) T ss_pred ChhHHHHHHHHHHHHhccchhhHHHHhcCCCc-CCHHHHHHH--HHHHHHh-cCCCchhhcch-hccCCCccccceeecc Confidence 54444433221100 00011100 111111111 1111111 11111110000 0000000000 01 Q ss_pred HHH-HHHHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHH-HHHHHhhcceeeeEEEeecccc Q lcl|NC_016071. 67 VST-ALDTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARS-AATFNEYGFSIFEKVYRTESAP 144 (516) Q Consensus 67 v~s-~l~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~-~lda~~~G~S~~Eivw~~~~~~ 144 (516) +.. +..+--..+.+-+-.|++ + +++..+++..++++- .|.+.+.. +..|..+|-+++=+.|..+ T Consensus 76 ~~k~i~~~~a~~l~~~p~~i~~----~---d~~~~e~l~~~~~~n----~f~~~~~~~~~~a~~~G~~~~~~~~D~~--- 141 (496) T protein:vir:38 76 LPKVTAKYMSKLLFNEKVKINI----D---DKAAEEFVLNVLKTN----GFTKNMERYIEYGEAMGGFVIKVYHDGN--- 141 (496) T ss_pred hHHHHHHHHhhhhhCCcceEee----C---ChHHHHHHHHHHhcc----CHHHHHHHHHHHHhhhCcEEEEEEEcCC--- Confidence 111 222222334444444443 1 245667777777642 36666655 4468899987775555433 Q ss_pred cccccceeeccccccCchhccc----------c-cceeecCCCceeeecccccccccc-------cccc-cccccccccc Q lcl|NC_016071. 145 SKYAGYITIDKIAFRPQSSLSR----------S-KPWVFDEDGRTLKGIYQSKMAFAN-------FQNG-LTQISSAMSL 205 (516) Q Consensus 145 ~~~~g~~~~~~l~~r~q~ti~~----------~-~~f~~~~dg~~l~~~~q~~~~~~~-------~~~~-~~~~~~~~~~ 205 (516) |.+.+..+. +..+-. . .+..+..+|.....++-|...... +... 
....+.++.+ T Consensus 142 ----~~~~i~~v~---~~~~~P~~~~~~~~~~~~f~~~~~~~~~~y~~le~h~~~~~~~~I~~~~y~~~~~~~~g~~v~~ 214 (496) T protein:vir:38 142 ----KNVKVSFAT---ADCMYPLSNDSENVDECVIANSFHKNNKYYTLLEWNEWQGDVYTVTTELYQSDDPNELGTKVSL 214 (496) T ss_pred ----CcEEEEEEc---ccceEEEEecCCcEEEEEEEEEEEeCCeEEEEEEEEEEeCceEEEEEEEEecCCccccCccccc Confidence 333322211 111110 0 000111122111111111000000 0000 0000111111 Q ss_pred ccccccCCCcccc---ccccEEEEe----ecCcCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccc Q lcl|NC_016071. 206 VTNLTSSADEVFI---PINKLMVMS----LGGTESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQI 278 (516) Q Consensus 206 ~~~~~~~~~~~~i---P~~k~i~~~----~~~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~ 278 (516) .....+......+ +.--|++++ .....++|+|.|.+..|--..-.=+..+..|+.-++ .+-..+++|... T Consensus 215 ~~~~~~~~~~~~~~~~~~~~f~~~~~~~~N~~~~~~p~G~Sd~~~~~~lid~ld~~~s~~~~~~~---~~~~~i~v~~~~ 291 (496) T protein:vir:38 215 TLLFDDIEPVVPLPDFTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFK---LGKKKVLVPSSF 291 (496) T ss_pred cccccccccceeecCCCcceEEEecCCcccccccCCcCCCchHhhHHHHHHHHHHHHHHHHHHHh---hcccceecchHH Confidence 0000000000001 111123332 234668899999999885422222222222332222 222233444333 Q ss_pred cccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHh Q lcl|NC_016071. 279 LNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFG 358 (516) Q Consensus 279 ~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iL 358 (516) +.......+..-.. .........++..+.. +-. ..++... ..-....|...++.+=++|+..+. 
T Consensus 292 l~~~~~~~g~~~~~-----------~~~~~~~~~~~~~~~~-~~~--~~i~~~~--~~i~~e~~~~~l~~~l~~i~~~~g 355 (496) T protein:vir:38 292 VKTAVNLDGSTTQY-----------FDSTDEAFFLYQGDQD-DNG--KAIKDIS--VEIRSTEFIESINAMLRIYAMQVG 355 (496) T ss_pred hhccCCCCCccccC-----------CCCccceEEEeecCCC-ccc--ccceeec--cccCHHHHHHHHHHHHHHHHHhhC Confidence 32211111100000 0000000111111000 000 0111111 111112244444444455554431 Q ss_pred cccccccCCccchhhHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHH-------hcCCcCCccccceEEecCcCch Q lcl|NC_016071. 359 AGFINLGNDGQGSYNLSESKQSIHGHF--VQRDIDIIVEAFNKNLIPQLLA-------LNDIRLSDEDMPKLKPGLIQEV 429 (516) Q Consensus 359 GqtLts~~~~~GS~Al~~vh~ev~~~~--~~aDa~~i~~~ln~~li~~lv~-------lN~~~~~~~~~P~~~~~~~~~~ 429 (516) -..-+.+.+++|...+.++........ +..-.+.+...|. ++++.++. +++...+ ..-+.+.|+..-+. T Consensus 356 ~~~~~f~~~~~g~~tAtei~~~~~~l~~~~~~~~~~~~~~l~-~l~~~il~~~~~~~~~~g~~~~-~~~i~v~f~d~i~~ 433 (496) T protein:vir:38 356 LSAGTFTFDENGLKTATEVVSEKSETYQTKNSHSQLIEQGIK-EMIVSILEVGKFIEAYSGEVVE-LDTITVDFDDSIAQ 433 (496) T ss_pred CChhhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhcCCCCC-ccceEEEeCCCCCC Confidence 111111112222111122222111111 2223344555553 45555543 2322222 23368889888889 Q ss_pred hHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccccCcccccCCCCCCcccccccccCCCCCccc Q lcl|NC_016071. 430 DMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMSTDELLKLLGQDTSRSGDGMTAGSNGNGTGK 502 (516) Q Consensus 430 dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 502 (516) |.++.++.+.+++.+|++.. +.++.+.+|+++..-+++ +++..+.... ..+ .+ ..+. 
-.++.+ T Consensus 434 d~~~~~~~~~~~~~~GiiS~----et~l~~~~~~~d~ea~~e-l~ri~~E~~~-~~~-~~--d~~~-~~~~~e 496 (496) T protein:vir:38 434 DEDTTINRYTNAKNQGMIPL----KIALQRAWNITEAEADEW-AEMLAKEKQA-EMP-NN--DMNG-IFGEEE 496 (496) T ss_pred CHHHHHHHHHHHHhcCCCCH----HHHHHhcCCCChHHHHHH-HHHHHHhhhc-cCc-cc--cccC-CCCCCC Confidence 98888999999999998653 567788888864321111 1111111110 001 00 0011 111211 No 216 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=70.90 E-value=0.2 Score=24.38 Aligned_cols=406 Identities=12% Similarity=0.023 Sum_probs=154.4 Q ss_pred CcccccchHHHHHHHHHHHhhcccc---c-CCcccHHHHHH---H--------------h------hChHH-----HHHH Q lcl|NC_016071. 24 LRTGELGSGALSQLRAESEVMKVEE---L-RWPCFLATVEA---M--------------K------QDHTV-----STAL 71 (516) Q Consensus 24 ~~~~e~g~~~~~~~~~~~~~~~~~~---l-r~~~~~~~y~~---m--------------~------~D~~v-----~s~l 71 (516) +-... +..+++...... + |.-+..+.|+- + . .+-.| .-.+ T Consensus 1 ~~~~~--------~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~I~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv 72 (470) T protein:vir:10 1 MELDA--------LKKLIQNTSTSRNDLINNYKQAVNYYENKTDITTRNNGKAKLNKEGKKDPLRSADNRIPSNFYQLLV 72 (470) T ss_pred CchHH--------HHHHHHHHHHHHHHHHHHHHHHHHHhccccchhccccchhcccccccccccccCCcccccchHHHHH Confidence 11111 111111110000 0 00000111100 0 0 01122 2333 Q ss_pred HHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHH-HHHHhhcceeeeEEEeecccccccccc Q lcl|NC_016071. 72 DTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSA-ATFNEYGFSIFEKVYRTESAPSKYAGY 150 (516) Q Consensus 72 ~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~-lda~~~G~S~~Eivw~~~~~~~~~~g~ 150 (516) .+...-+.+-+..+.+. + .+..+.+.+.+++ .|.+.+..+ .++.-+|.+.. .+|-... |. 
T Consensus 73 ~~~~~yl~G~p~~~~~~---d----~~~~~~l~~~~~~-----~~~~~~~~l~~~~~~~G~a~~-~~y~d~~------~~ 133 (470) T protein:vir:10 73 DQEAGYVASVFPDIDVG---K----DADNKKIIDVLGD-----DRALTLNGLLVDSSNAGRAWL-HYWIDED------GN 133 (470) T ss_pred HhhhhheeccceeeecC---c----hHHHHHHHHHHhh-----hHHHHHHHHHHHHhhcCeeEE-EEEecCC------Cc Confidence 33344444554444332 2 2334555555542 255555544 46778898875 5554332 33 Q ss_pred eeeccccccCchh----------cccccceee-cCCCceee-eccccccccc---cccccccccccccccccccc----- Q lcl|NC_016071. 151 ITIDKIAFRPQSS----------LSRSKPWVF-DEDGRTLK-GIYQSKMAFA---NFQNGLTQISSAMSLVTNLT----- 210 (516) Q Consensus 151 ~~~~~l~~r~q~t----------i~~~~~f~~-~~dg~~l~-~~~q~~~~~~---~~~~~~~~~~~~~~~~~~~~----- 210 (516) +.+.-+.|..-.. +.-.++|.. +.++.... ...-...... ....+......+........ T Consensus 134 ~~~~~~~p~~~~~v~d~~~~~~~~a~ir~y~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 213 (470) T protein:vir:10 134 FRYGIIQPDQITPIYATTLDNKLLGILRSYKQLDPDSGKYFTVHEYWTDKEAQFFRTNATDSTVIEPYNIITSYDLSAGY 213 (470) T ss_pred eEEEEEcccceEEEEcCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCcEEEEEeecCcceecccccccccccccccc Confidence 3322222211000 000112211 11221100 0000000000 00000000000000000000 Q ss_pred cC--CCc--cccccccEEEEeecCcCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCC Q lcl|NC_016071. 211 SS--ADE--VFIPINKLMVMSLGGTESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDP 286 (516) Q Consensus 211 ~~--~~~--~~iP~~k~i~~~~~~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~ 286 (516) .. .+. ..+..--++.|+ +|..|.|.+..+-...=-=...+..++..++.+..|+.++++..+ .+. 
T Consensus 214 ~~~~~~~~~~~~g~vPvv~~~-----nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lvl~g~~~------~~~ 282 (470) T protein:vir:10 214 ETGQSNTLKHNFGRVPFIEFS-----KNKYRLPELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGG------ADL 282 (470) T ss_pred ccccccccccCCCeeeEEEee-----cCCCCCCchhHHHHHHHHHHHHHHHHHHHHHHhcCcceeeecCCc------ccc Confidence 00 000 000111123333 366789999865432222245667788888999999988876321 111 Q ss_pred CHHHHHHHHHHHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccC Q lcl|NC_016071. 287 KSPESEMVQGLMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGN 366 (516) Q Consensus 287 ~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~ 366 (516) .+....+. ....+.++..-+ .....++++...... ..+...++.+.+.|-+.--+..++.+. T Consensus 283 ----~~~~~~~~---------~~~~i~~~~~~~---~~~~~~~~lt~~~~~--~~~~~~~~~L~~~I~~~s~~p~~~~~~ 344 (470) T protein:vir:10 283 ----HQFMNDLR---------KYKSIKINNTGN---GDNSGVDKLQIDIPV--EARDDALKITRKNIFLFGQGIDPANFE 344 (470) T ss_pred ----chhhhhhh---------hcCeEeccCCCC---CcCceeEEEeecCCh--HHHHHHHHHHHHHHHHHhCCCCCCccc Confidence 11111111 111233332110 112346666654432 346778899999998776666555433 Q ss_pred CccchhhHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHH-hcCCcCCccccceEEecCcCchhHHHHHHHHHHH Q lcl|NC_016071. 367 DGQGSYNLSESKQSIHGHFVQRDI----DIIVEAFNKNLIPQLLA-LNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRI 441 (516) Q Consensus 367 ~~~GS~Al~~vh~ev~~~~~~aDa----~~i~~~ln~~li~~lv~-lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L 441 (516) . | .++.+.-+.+...+...| +.+...|. ++++.++. +|.. 
..+..-..+.|...-+.|..+.++.++++ T Consensus 345 ~--g--n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~-~~~~~i~~~l~~~-~~d~~~i~i~f~~~~p~d~~e~~~~~~~~ 418 (470) T protein:vir:10 345 S--S--NASGVAIKMLYSHLELKAAKTQTYFEHAIN-ELVRAIMRYLNFS-DADKRHISQHWTRTKVEDSLTKAQIVSTV 418 (470) T ss_pred c--c--cchHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhccc-CcccceeeEEeccCCCCCHHHHHHHHHHH Confidence 2 2 223333333322222233 33444553 45555554 3322 12233457889999999999999999998 Q ss_pred HhCCcccccHHHHHHHHHHcCC-CCCCCcccccCcccc-cCCCCCCcccccccccCCCCCccc Q lcl|NC_016071. 442 GAVGYLPKTPTVINKILEVGGF-DEEIPEDMSTDELLK-LLGQDTSRSGDGMTAGSNGNGTGK 502 (516) Q Consensus 442 ~~~G~~~~~~~~~~~i~e~~Gl-p~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~ 502 (516) . |. ++ ++.+.+.++. ..+..+-+-...+.. ..+..+.. .+ ....+.++.+ T Consensus 419 ~--g~-iS----~et~l~~~p~v~D~~~E~eri~~E~~e~~~~~~~~-~~---~~~~~~dde~ 470 (470) T protein:vir:10 419 A--NY-SS----KEAVAKANPIVDDWQQELKDLAKDKEENDPYSNQA-DE---LNGKGVNDEQ 470 (470) T ss_pred h--cc-Cc----HHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhccc-cc---cCCCCCCCCC Confidence 4 54 32 3456666653 222111110111100 01111111 00 0111222222 No 217 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=70.69 E-value=0.21 Score=24.35 Aligned_cols=429 Identities=11% Similarity=-0.001 Sum_probs=156.1 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHH--------HHHHHH--Hhhccccc-CCcccHHHHH---HH-hhCh Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALS--------QLRAES--EVMKVEEL-RWPCFLATVE---AM-KQDH 65 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~--------~~~~~~--~~~~~~~l-r~~~~~~~y~---~m-~~D~ 65 (516) +..|+--..+-. . ..+...-.-..|.=.--++ .+.... +....+.+ .......-+. .. 
...+ T Consensus 9 ~~~~~~~~~~~~--~-~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~yY~g~~~~i~~~~~~~~~~~~~~~~ki~~n 85 (481) T protein:vir:10 9 INTKFSPLANDD--F-VVSDLAELLKEENLRNFISRHQTEQVPRLEMLESYYLNRNTDILAGERRLQKYGDKADHRAVHN 85 (481) T ss_pred hchhcccccCce--e-eeecchhhcCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCccccccccccccceeecc Confidence 111100000000 0 0000000000000000000 010000 00011111 0000000000 00 0234 Q ss_pred HHHHHHHHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHH-HHHHHhhcceeeeEEEeecccc Q lcl|NC_016071. 66 TVSTALDTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARS-AATFNEYGFSIFEKVYRTESAP 144 (516) Q Consensus 66 ~v~s~l~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~-~lda~~~G~S~~Eivw~~~~~~ 144 (516) +..-++.+....+.+-+..+.+. + .+..+.+.++|++. .|...+.. ..++.-+|.+. +++|... T Consensus 86 ~~~~ivd~~~~~l~g~~~~~~~~---d----~~~~~~l~~~~~~n----~~~~~~~~~~~~~~~~G~~~-~~~~~d~--- 150 (481) T protein:vir:10 86 YAKYVSRFIVGYLTGNPITITHQ---D----NQTNDKIIELNDLN----DADEVNSDLALNLSIYGRAY-EIVYRDF--- 150 (481) T ss_pred hHHHHHHHHHhhhccCCceEecC---C----hhHHHHHHHHHHhc----ChhHHHHHHHHHHHhcCeEE-EEEEeCC--- Confidence 44555555555566666555532 2 23445566666543 25555555 44688899665 4666533 Q ss_pred cccccceeeccccccCchhcccccceeecCC--Cceeeecccccc---------ccccccccccc----ccccccccccc Q lcl|NC_016071. 145 SKYAGYITIDKIAFRPQSSLSRSKPWVFDED--GRTLKGIYQSKM---------AFANFQNGLTQ----ISSAMSLVTNL 209 (516) Q Consensus 145 ~~~~g~~~~~~l~~r~q~ti~~~~~f~~~~d--g~~l~~~~q~~~---------~~~~~~~~~~~----~~~~~~~~~~~ 209 (516) +|.+.+..+.|+ .+. -.|++. ++.+..++.... ....|...... -...+.+.... 
T Consensus 151 ---dg~~~i~~~~p~---~~~----~v~d~~~~~~~~~~i~~~~~~~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~ 220 (481) T protein:vir:10 151 ---EDRDTFKVLDPK---STF----VVYDQTLDKKVVAGVRYFEKQDKDKVPVQHVEVYTTDKIYYIEIKGGTYHRVEEV 220 (481) T ss_pred ---CCeEEEEEEccc---ceE----EEEcCCCCCceEEEEEEEEEeeCCCceEEEEEEEecCeEEEEEecCCceeecccc Confidence 344433332222 111 112221 122222211000 00001000000 00000000000 Q ss_pred ccCCCccccccccEEEEeecCcCCccccchhHHHHHHHHHHH-HHHHHHHHHHHhhccccceeeeecccccccccCCCCH Q lcl|NC_016071. 210 TSSADEVFIPINKLMVMSLGGTESNPAGVSPLVGCYRAFREK-ILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKS 288 (516) Q Consensus 210 ~~~~~~~~iP~~k~i~~~~~~~~g~p~G~gLlr~~~~~~~fK-~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~ 288 (516) ....+ .|| ++.|. .++.|.|.+..+ .+.+-. ...+...+..++.+..|+.++++... . +.+ T Consensus 221 ~~~~g--~vP---vv~~~-----n~~~g~~~~~~v-~~lida~~~~~s~~~~~~~~~~~~~~~~~g~~~---~---~~~- 282 (481) T protein:vir:10 221 EHYYN--DVP---IIEYL-----NDQFKQGDFENV-IALIDLYDSAQSDTANYMTDLNDAMLAIIGNVD---L---DSE- 282 (481) T ss_pred cccCC--cee---EEEee-----cCCCCCCchhhH-HHHHHHHHHHHHHHHHHHHHhcCceeEeecCcC---C---Ccc- Confidence 11011 122 22222 356788888753 334322 22445566677778888877765321 1 111 Q ss_pred HHHHHHHHHHHHHHHhhcccceEEEeccCccccc-ccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCC Q lcl|NC_016071. 289 PESEMVQGLMADAANAHAGEQAYFILPSDMNAQG-GEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGND 367 (516) Q Consensus 289 ~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~-~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~ 367 (516) + ...... .....+|.+..... .+..+++++..+.. ...+...++.+.+.|...--...++.+.. 
T Consensus 283 -~---~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~l~~~~~--~~~~~~~~~~l~~~i~~~s~~p~~~~~~~ 347 (481) T protein:vir:10 283 -D---AKAFRD---------ANMIHLEPGTNANGSEGKAEVKYVYKQYD--VAGVEAYKKRLQNDIHKYTNTPDLNDEQF 347 (481) T ss_pred -c---hhhhhh---------ccceeccccccccCCCCCcceeEEeecCC--HHHHHHHHHHHHHHHHHHhCCcccccccc Confidence 1 001110 01122222221111 12234555554332 23466778888888876654444444322 Q ss_pred cc-chh-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-cCCcCCc--cccceEEecCcCchhHHHHHHHHHHHH Q lcl|NC_016071. 368 GQ-GSY-NLSESKQSIHGHFVQRDIDIIVEAFNKNLIPQLLAL-NDIRLSD--EDMPKLKPGLIQEVDMEGFSKFVQRIG 442 (516) Q Consensus 368 ~~-GS~-Al~~vh~ev~~~~~~aDa~~i~~~ln~~li~~lv~l-N~~~~~~--~~~P~~~~~~~~~~dl~~~a~~~~~L~ 442 (516) ++ .|- |+. ....-....+..-.+.+...+. ++++.++.+ |...... ..-..+.|...-+.|..+.++++.+|+ T Consensus 348 ~~n~Sg~Al~-~~~~~l~~k~~~~~~~~~~~l~-~~~~li~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~~a~~~~kl~ 425 (481) T protein:vir:10 348 SGVQSGESMK-YKLFGLEQVRAIKERLFKKGLM-KRYKLLLNNVNLTGLKQHNYAELTITFTPNLPKSMMESINAFNALS 425 (481) T ss_pred ccccHHHHHH-HHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhccCCCccccceeeEEeCCCCCcCHHHHHHHHHHHh Confidence 21 111 221 1111222223333345555664 456655554 2211111 122478888889999999999999985 Q ss_pred hCCcccccHHHHHHHHHHcCC-CCCCCcccccCcc---cccCCCC--CCcccccccccCCCCC Q lcl|NC_016071. 443 AVGYLPKTPTVINKILEVGGF-DEEIPEDMSTDEL---LKLLGQD--TSRSGDGMTAGSNGNG 499 (516) Q Consensus 443 ~~G~~~~~~~~~~~i~e~~Gl-p~~~~~~~~~~~~---~~~~~~~--~~~~~~~~~~~~~~~~ 499 (516) |++ + ++.+.+.++. +.+..+-+-...+ ..+..+. 
...+.+.......++| T Consensus 426 --g~i-s----~et~~~~l~~i~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~dd~~g 481 (481) T protein:vir:10 426 --GGV-S----ESTRLSLLDFIDNPKEELEKMQEEEAQREKQADKRGYGEAFENHLNVDDSNG 481 (481) T ss_pred --ccC-C----hHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhhhhccCCccCCCCCCCCCCCC Confidence 543 3 2445566654 2221110000000 0000000 0011111111122333 No 218 >protein:vir:104500 Length: 537 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214665;genbank:gi:61806306;genbank:GeneID:3294555 Probab=70.30 E-value=0.21 Score=24.29 Aligned_cols=448 Identities=13% Similarity=0.087 Sum_probs=164.4 Q ss_pred CCccc-----cCcccccchhhhcccCCCCcccccchHHH--HHH-HHHHHhhcccccC-CcccHHHHHHHhhChHHHHHH Q lcl|NC_016071. 1 MSTRF-----AQPSEVVKAGNENLAVSRLRTGELGSGAL--SQL-RAESEVMKVEELR-WPCFLATVEAMKQDHTVSTAL 71 (516) Q Consensus 1 ~~~r~-----~~~~~~~~~~~~~p~~~~~~~~e~g~~~~--~~~-~~~~~~~~~~~lr-~~~~~~~y~~m~~D~~v~s~l 71 (516) |.... .+.....+..+.. |+-. .=|++-+ .++ +.+.+.+ +..+ ..+.++.|++|..+|.|-+++ T Consensus 1 ~~~~lfg~~i~~~~~~~~~~s~~---~~~~--~dg~~~~~~~~~~g~~~~~e--~~~~~~~eLI~~YR~ma~~pEvd~Av 73 (537) T protein:vir:10 1 MAQQLFGFSLQRAKKVPKGPSFV---QKDS--LDGSQPIVGGGYFGYSVDFD--GTIRNDHELITRYREMVLNPECDSAV 73 (537) T ss_pred CccccccceeecccccccCCccc---CCCc--ccccceeecccccccccccc--cccchHHHHHHHHHHHhhccchhhHH Confidence 22110 0001111111110 1100 0010000 001 1111111 2222 346899999999999999999 Q ss_pred HHHHHHHhcCCce---eeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHHHHHhhcceeeeEEEeecccccccc Q lcl|NC_016071. 72 DTKYVFVTKAFND---FKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAATFNEYGFSIFEKVYRTESAPSKYA 148 (516) Q Consensus 72 ~~Rk~~v~~~~w~---i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~lda~~~G~S~~Eivw~~~~~~~~~~ 148 (516) +-.-.-+.-.+-. |.++-. +-+.++.+-+.|.+. 
|+ .+..+|+.--+||..+= .|+-| T Consensus 74 ~eIVneaiv~d~~~~pV~i~Ld-~~~~s~~iK~kI~eE---------F~-~Il~ll~F~~~~~e~fR--------~WYVD 134 (537) T protein:vir:10 74 DDVVNETICGNFDDVPISIDLH-NLKQSEKIKKLIRSE---------FD-EILRLLDFDNRAYEIFR--------RWYVD 134 (537) T ss_pred HHhhcceeEecCCCceEEEEec-ccccchHHHHHHHHH---------HH-HHHHHhccchhhhHHHh--------hheee Confidence 9876643221100 000000 001223333333332 22 33356666666666542 23334 Q ss_pred cceeecccc-------------ccCchhcccccceeecCCCcee-eecccc----ccccccccccccccccccccccccc Q lcl|NC_016071. 149 GYITIDKIA-------------FRPQSSLSRSKPWVFDEDGRTL-KGIYQS----KMAFANFQNGLTQISSAMSLVTNLT 210 (516) Q Consensus 149 g~~~~~~l~-------------~r~q~ti~~~~~f~~~~dg~~l-~~~~q~----~~~~~~~~~~~~~~~~~~~~~~~~~ 210 (516) |++.++++. ...|..|.+.|.+.-..+.... ...... ......+... +. .. T Consensus 135 gRi~fhKiid~k~pk~GI~ELr~lDPr~i~~vR~i~~~~~~~~~~~~~~~~v~~~~~eyf~ynp~--------g~---~~ 203 (537) T protein:vir:10 135 GRLFFHKVIDPKKPRQGLVELRYVDPRKIRKVTEYEAKRPEALRTQDLNQQLTQQSASYFLYNPK--------GL---KN 203 (537) T ss_pred eEEEEEEEEeCCCccccceeeeeeCCccceeeEeecccCCccceEEecceeeeecccceeeeccc--------cc---cc Confidence 555444433 2233344443333222221111 111100 0111112111 11 12 Q ss_pred cCCCccccccccEEEEeec--CcCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccC--CC Q lcl|NC_016071. 211 SSADEVFIPINKLMVMSLG--GTESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAI--DP 286 (516) Q Consensus 211 ~~~~~~~iP~~k~i~~~~~--~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~--~~ 286 (516) +...++.||. ..|+|+|. 
-..++++..|.|+++..++==-+.....-.++ .+-.+|.+|+=+-.- =| T Consensus 204 ~~~~~vkI~~-dAI~y~hSGl~d~n~~~i~syLhkAiKp~NQLkm~EDAlVIY--------RitRAPeRRvFYIDVGnLP 274 (537) T protein:vir:10 204 STNQGMKIAP-DSIAYCHSGIQDLNKNMVLSHLHKAIKAVNQLRMIEDSLVIY--------RLSRAPERRIFYIDVGNLP 274 (537) T ss_pred cCCCceeccH-hheeeecccceeCCCCeeeeeehhhhHHHHhhHHHHhhHHHH--------hhhccccceEEEEecCCCC Confidence 2356788988 67888884 34566888999999998875433332222221 122222222211111 12 Q ss_pred CHHHHHHHHHHHHHHHHhhc----ccceEE--------------EeccCcccccccccceeeeeccccCcchhHHHHHHH Q lcl|NC_016071. 287 KSPESEMVQGLMADAANAHA----GEQAYF--------------ILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNS 348 (516) Q Consensus 287 ~~~~~~~l~~l~~~~~~~~~----g~~a~~--------------iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~ 348 (516) .....+. +..++..++. .+..|- -||.= +.....+|+-+ .|+... .-.+=|+| T Consensus 275 k~KAeqY---lr~iM~k~KNklVYDa~TGev~ddrk~msMlEDyWLPRR---eGgrgTEItTL--pGgqnl-gem~DV~Y 345 (537) T protein:vir:10 275 KNKAEQY---LREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRR---EGGRGTEISTL--PGGQNL-GELEDVKY 345 (537) T ss_pred chhHHHH---HHHHHHhccceEEEeccCceecccchhhhhhhhhccccc---CCCcccceeec--cccCCc-ChHHHHHH Confidence 2222222 3333332221 011111 12210 00111233333 222222 23345899 Q ss_pred HHHHHHHHHhcccccccCCccchhhH-HH-HHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCc---cccceEE Q lcl|NC_016071. 349 RKKAILDRFGAGFINLGNDGQGSYNL-SE-SKQSI-HGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSD---EDMPKLK 422 (516) Q Consensus 349 ~d~~Isk~iLGqtLts~~~~~GS~Al-~~-vh~ev-~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~---~~~P~~~ 422 (516) ..+.+-+++--..--.+.+++-+... ++ +-.|+ |...+..-...+...|..-|-..|+.=+ .--+. .--..+. 
T Consensus 346 F~kKLy~aLnVP~SRl~~e~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLilKg-iit~eeW~~i~~~I~ 424 (537) T protein:vir:10 346 FQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFVDLLKTQLILKG-ICSIEEWEEMKEHIQ 424 (537) T ss_pred HHHHHHHHhCCCccccCCCCcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcc-CCCHHHHHHHhhcce Confidence 99999998887764444443322211 22 22333 2333444444444444433333332211 11110 1113344 Q ss_pred ecCcCc------hhHHHHHHHHHHHHhCCcccccHHHHHHHHHHc-CCCC---------------------CCCcccc-- Q lcl|NC_016071. 423 PGLIQE------VDMEGFSKFVQRIGAVGYLPKTPTVINKILEVG-GFDE---------------------EIPEDMS-- 472 (516) Q Consensus 423 ~~~~~~------~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~-Glp~---------------------~~~~~~~-- 472 (516) |+...+ .+.+-+.+++..|..+-=.+-...+.+||++.+ .+.+ |...++. T Consensus 425 ~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~s~dyi~k~ILr~tDeeI~~~~k~I~~E~k~~~~~~p~~~~~~~~ 504 (537) T protein:vir:10 425 FDFIADNYFTELKEIEIRNERMNEVAQMDPYVGKYFSANYIRTKVLKQTESEIKEIDKEIKQEIADGVIMDPQAMQAMEM 504 (537) T ss_pred EEeeecchHHHHHHHHHHHHHHHHHHHhhhhhhcccchHHHHHHHhccCHHHHHHHHHHHHHHhhCCCCCCccccccccc Confidence 433333 344444455555544321222233567776553 3321 1100000 Q ss_pred -cCcccccCCCCCCcccccccccCCCCCcccccccccc Q lcl|NC_016071. 473 -TDELLKLLGQDTSRSGDGMTAGSNGNGTGKISSTRDN 509 (516) Q Consensus 473 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 509 (516) ....++.++....|..++..+. 
.+..++...- T Consensus 505 ~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~ 537 (537) T protein:vir:10 505 GIGDEEPVPEGGEEPQTDPNSAV-----SPADQKRGEL 537 (537) T ss_pred CCCCcccCCCCCCCcccCCccCC-----CCCCccCCCC Confidence 0000011111111111111111 1111111111 No 219 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=68.28 E-value=0.24 Score=23.99 Aligned_cols=401 Identities=12% Similarity=-0.027 Sum_probs=157.1 Q ss_pred CcccccchHHHHHHHHHHHh--hcccccCCcccHHHHHH---------H----------h---h--ChHHHHHHHHHHHH Q lcl|NC_016071. 24 LRTGELGSGALSQLRAESEV--MKVEELRWPCFLATVEA---------M----------K---Q--DHTVSTALDTKYVF 77 (516) Q Consensus 24 ~~~~e~g~~~~~~~~~~~~~--~~~~~lr~~~~~~~y~~---------m----------~---~--D~~v~s~l~~Rk~~ 77 (516) +. ...+..++.. .+.+.+ -+..+.|+- . . + .....-++.+.... T Consensus 1 l~--------~~~i~~~i~~~~~~~~r~--~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~y 70 (451) T protein:vir:10 1 ME--------LEKIRAIISADAARRQEI--LQAKSYYYNKNDILKKGVVVQNRDENPLRNADNRISHNFHEILVDEKASY 70 (451) T ss_pred CC--------HHHHHHHHHHHHHHHHHH--HHHHHHhcccCccccccccccccccccccccccccccchHHHHHHhhhhh Confidence 11 1112112111 011111 001111110 0 0 0 23333334444444 Q ss_pred HhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHH-HHHHhhcceeeeEEEeecccc--cccccceeec Q lcl|NC_016071. 78 VTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSA-ATFNEYGFSIFEKVYRTESAP--SKYAGYITID 154 (516) Q Consensus 78 v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~-lda~~~G~S~~Eivw~~~~~~--~~~~g~~~~~ 154 (516) +.+-+..+.+ +.+++..++++.++++ .|.+....+ .++.-||.+. +++|...... ....|.+.+. 
T Consensus 71 l~G~p~~~~~------~~~~~~~~~~~~~~~n-----~~~~~~~~~~~~~~~~G~a~-~~~y~de~~~~~~~~~~~~~~~ 138 (451) T protein:vir:10 71 MFTYPVLFDI------DNNKELNEKVTDVLGN-----EFTRKAKNLAIEASNCGSAW-LHYWIDEEYSGEQVTNQTFKYG 138 (451) T ss_pred eecccceeec------CCcHHHHHHHHHHhcc-----CHHHHHHHHHHHHhhcCeEE-EEEeecCCcccccccccceeEE Confidence 4444433332 1233455667766642 366666664 4688899776 4666533211 0111333322 Q ss_pred cccccCchhcccccceeecC--CCceeeeccccccccc------cccccccccccccccccccc-----------cCCCc Q lcl|NC_016071. 155 KIAFRPQSSLSRSKPWVFDE--DGRTLKGIYQSKMAFA------NFQNGLTQISSAMSLVTNLT-----------SSADE 215 (516) Q Consensus 155 ~l~~r~q~ti~~~~~f~~~~--dg~~l~~~~q~~~~~~------~~~~~~~~~~~~~~~~~~~~-----------~~~~~ 215 (516) .+.|+. ++ -.|++ ++..+..+|-...... .....+..+-.+.....+.. ..... T Consensus 139 ~i~p~~--~~-----~vydd~~~~~~~~~ir~~~~~~~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~~ 211 (451) T protein:vir:10 139 VVNTEE--II-----PIYRNGIERELEAVIRYYIQLEDVKGQIQKQAYTYVEFWTDKILDKYKFFGVSCCGSQIEHITVQ 211 (451) T ss_pred EEcccc--eE-----EEEcCCCCCceEEEEEEEEeeecccccccceEEEEEEEEeCCeEEEEEecccCcccccccccccc Confidence 222221 10 01121 2222222221100000 00000000000000000000 00000 Q ss_pred cccccccEEEEeecCcCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHH Q lcl|NC_016071. 216 VFIPINKLMVMSLGGTESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQ 295 (516) Q Consensus 216 ~~iP~~k~i~~~~~~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~ 295 (516) ..++..-++.|. +|..|.|.+..+-..-=-=+..+...+..++.+..++.++++..+ .. ..+... 
T Consensus 212 ~~~g~vPvv~~~-----nn~~~~~d~e~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~~~--------~~--~~~~~~ 276 (451) T protein:vir:10 212 HRFNSVPFVEFS-----NNIKKQSDLSKYKKILDLYDRVMSGFANDLEDIQQIIYILENFGG--------ED--TSEFLK 276 (451) T ss_pred CCCCeeeEEEec-----cCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCc--------cc--chhhHH Confidence 111112233332 355678888765332222233556667777788888887765311 11 111111 Q ss_pred HHHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchh-hH Q lcl|NC_016071. 296 GLMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSY-NL 374 (516) Q Consensus 296 ~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~-Al 374 (516) .+. ....+.++...+ .+...++++..... ...+...++++.+.|.+.--+..++.++.|.-|. |+ T Consensus 277 ~~~---------~~~~i~~~~~~~---~~~~~~~~l~~~~~--~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Al 342 (451) T protein:vir:10 277 ELK---------RYKTIKTETDSE---GDSGGLKTMQIEIP--TEARKIILEILKKQIYESGQGLQQDTENFGNASGVAL 342 (451) T ss_pred HHh---------hCCeEEecCcCC---ccCCcceEEeecCC--HHHHHHHHHHHHHHHHHHhCcccccccccccccHHHH Confidence 111 111233332211 12234666655442 2346778999999998876555555443321121 21 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHHHHHHHHHhCCcccccHHHH Q lcl|NC_016071. 375 SESKQSIHGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVI 454 (516) Q Consensus 375 ~~vh~ev~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~ 454 (516) .- ...-....+..-.+.+...| +++++.++.+... .+..-..+.|...-+.|..+.++++.+|+ |. 
++ + T Consensus 343 k~-~~~~l~~k~~~k~~~f~~~l-~~~~~li~~~~~~--~d~~~i~i~f~~~~p~n~~e~~~~~~kl~--g~-iS----~ 411 (451) T protein:vir:10 343 KF-FYRKLELKSGLLETEFRTSF-DKLIKAILYFLGV--TDYKKIQQTYTRNMMSNDLEDADIATKSV--GI-IP----T 411 (451) T ss_pred HH-HHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhCC--CCccceeEEecCCCCCCHHHHHHHHHHHh--cc-Cc----h Confidence 11 11111112223334455555 4566666665432 22233468899999999999999999985 54 33 3 Q ss_pred HHHHHHcCCCC-CCCcccccCcccccCCCCCCcccccccccCCCCCc Q lcl|NC_016071. 455 NKILEVGGFDE-EIPEDMSTDELLKLLGQDTSRSGDGMTAGSNGNGT 500 (516) Q Consensus 455 ~~i~e~~Glp~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 500 (516) +.+.+.++.-. +..+.+-....++...+. .....+..++ T Consensus 412 et~~~~~p~v~d~~~e~~~~~ee~~~~~~~-------~~~~~~~~~~ 451 (451) T protein:vir:10 412 KIILRHHPWVDDVEEAEKLYLEEKKIQASK-------VSDDYNNFTE 451 (451) T ss_pred HHHHHhCCCCCCHHHHHHHHHHHHHHHHHH-------HHhhcCCCCC Confidence 45666665532 211111111111100000 0001111111 No 220 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=67.81 E-value=0.25 Score=23.92 Aligned_cols=434 Identities=8% Similarity=-0.028 Sum_probs=144.5 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhh--cccccCCcccHHHHHH------Hh--hChHH--- Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVM--KVEELRWPCFLATVEA------MK--QDHTV--- 67 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~--~~~~lr~~~~~~~y~~------m~--~D~~v--- 67 (516) |+.....-..-.-..- .+|...+..- .....+..+.... +.+.|+ +..+.|+- +- .++.. 
T Consensus 1 ~~~~~~~~~~~~~~~~---~~p~~~~~~~--~~~~l~~~l~~~~~~~~~rl~--~l~~YY~G~~~~~~~~~~~~~~~~~~ 73 (501) T protein:vir:25 1 MTVPVDVIADAPAADV---EFPEDSMSRE--QLGALVADMWRLHISERQWLD--RIYEYTKGLRGRPEVPEGASDEVKEL 73 (501) T ss_pred CcccchhhhccCcccc---cCCcccCChH--HHHHHHHHHHHHHHHHHHHHH--HHHHHHhcCCCchhccccCChhhhhh Confidence 4332211111110111 1222111100 0011112221111 111110 11111110 00 00110 Q ss_pred --HHHHHHHHHHHhcC-----CceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHH-HHHHHhhcceeeeEEEe Q lcl|NC_016071. 68 --STALDTKYVFVTKA-----FNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARS-AATFNEYGFSIFEKVYR 139 (516) Q Consensus 68 --~s~l~~Rk~~v~~~-----~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~-~lda~~~G~S~~Eivw~ 139 (516) .++..==+..|... .-.|.++ ++... +.+.+.|+. ..|.....+ ..++.-||.|. +.||. T Consensus 74 ~~~~v~n~~~~ivd~~a~~l~~~gf~~~---d~~~~----~~l~~i~~~----N~~d~~~~~~~~~a~i~G~ay-~~v~~ 141 (501) T protein:vir:25 74 AKLSVKNVLSLVRDSFAQNLSVVGYRNA---LAKEN----DPAWEMWQR----NRMDARQAEVHRPALTYGASY-VTVTP 141 (501) T ss_pred HhhhhcChHHHHHHHHHhhhcccceecC---Cccch----HHHHHHHHh----cChhHHHHHHHHHHhhcCceE-EEEec Confidence 01110001111100 0012222 11122 223444432 125666655 56788999975 78886 Q ss_pred ecccccccccceeeccccccCchhcccccce-eecC-C--Cceeeecccccc--------cccccccccccccccccccc Q lcl|NC_016071. 140 TESAPSKYAGYITIDKIAFRPQSSLSRSKPW-VFDE-D--GRTLKGIYQSKM--------AFANFQNGLTQISSAMSLVT 207 (516) Q Consensus 140 ~~~~~~~~~g~~~~~~l~~r~q~ti~~~~~f-~~~~-d--g~~l~~~~q~~~--------~~~~~~~~~~~~~~~~~~~~ 207 (516) -+.+ ..+. ..++.. .| .|++ . .+.+..++-... ....|............+.. 
T Consensus 142 de~~-----~~i~-----~~sp~~-----~~~iy~D~~~~~~~~~ai~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~ 206 (501) T protein:vir:25 142 TDEG-----PVFR-----TRSPRQ-----ILAVYADPSVDAWPQYALETWVAQKDAKPHRRGVLYDDTYMYELDLGEVVL 206 (501) T ss_pred CCCC-----CeEE-----Eecccc-----EEEEEecCCCCcceeEEEEEEeeccccCcceeEEEecCeeEEEEecCceee Confidence 5432 1111 112211 11 1111 0 011111110000 00000000000000000000 Q ss_pred cc--------------------ccCCCccccccccEEEEeecCcCCccccchhHHHHHHHHHHH-HHHHHHHHHHHhhcc Q lcl|NC_016071. 208 NL--------------------TSSADEVFIPINKLMVMSLGGTESNPAGVSPLVGCYRAFREK-ILIENLETIGASKDL 266 (516) Q Consensus 208 ~~--------------------~~~~~~~~iP~~k~i~~~~~~~~g~p~G~gLlr~~~~~~~fK-~~~~~~w~~~~er~g 266 (516) .. ........++..-++-|.+... .++.|.|-+..+- +.+-. +..+...+...+-+. T Consensus 207 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPiv~f~N~~~-~~~~g~sdie~v~-~l~Da~~~~~s~~~~~~e~~a 284 (501) T protein:vir:25 207 GDAGGGQATQQPVNVREVTDVIEHGATFEGKPVCPVVRFVNGRD-ADDMIVGEVAPLI-LLQQAINSVNFDRLIVSRFGA 284 (501) T ss_pred eeccccccccccccccccccccccccccCCccceeeEeccCccc-cCccccchhhhhH-HHHHHHHHHHHHHHHHHHhhc Confidence 00 0000111222333444555443 3677888776532 11111 123333455666666 Q ss_pred ccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHH Q lcl|NC_016071. 267 GGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELV 346 (516) Q Consensus 267 ~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li 346 (516) .|..++.|. +....+ .. .+ .... +....|-+.++ ...+.. ....|...+ T Consensus 285 ~p~~~i~G~---------~~~~~~--~~----~~------~~~~-i~~~~~~~~~~--------~q~~~~-~~~~~~~~l 333 (501) T protein:vir:25 285 NPQRVISGW---------TGSKAE--VL----KA------SALR-VWTFEDPEVKA--------QAFPPA-SVEPYNLIL 333 (501) T ss_pred cHHHHHhCC---------CCCccc--hh----hh------cccc-eeccCCCCceE--------EEeccc-ChHHHHHHH Confidence 665555432 111111 11 00 1111 22333323222 222221 122355556 Q ss_pred HHHHHHHHHHHhcccccccCCc-cchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCcc-ccceEEec Q lcl|NC_016071. 
347 NSRKKAILDRFGAGFINLGNDG-QGSYNLSESKQSIHGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDE-DMPKLKPG 424 (516) Q Consensus 347 ~~~d~~Isk~iLGqtLts~~~~-~GS~Al~~vh~ev~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~-~~P~~~~~ 424 (516) +.+-.+||+.---...+.+... ..|-..-+....-....++.-.+.+...|. ++++.++.+.+...... .-..+.|. T Consensus 334 ~~~i~~i~~~s~~P~~~~~~~~~N~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~-~~~rl~~~~~~~~~~~~~~~i~v~w~ 412 (501) T protein:vir:25 334 EEMLQHVAMVAQISPAQVTGKMINVSAEALAAAEANQQRKLAAKRESFGESWE-QLLRLAAEMDDDPDTAADSGAEVLWR 412 (501) T ss_pred HHHHHHHHhhcCCChhhhccccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhCCCccccceeeeEEec Confidence 6666666664433222222111 112211122223333334445556666674 46676666664322111 12467788 Q ss_pred CcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccc--------cCcccccCCCCCCcccccccccCC Q lcl|NC_016071. 425 LIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMS--------TDELLKLLGQDTSRSGDGMTAGSN 496 (516) Q Consensus 425 ~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~ 496 (516) ...+.++.+.|+++.+|+.+|+ +. +.-+.+..|+.++.-+... ......+.+....+..+ T Consensus 413 ~~~~~s~~~~ada~~kl~~~gi--s~---et~~~~~~g~~~~~ie~~~~~~~e~~~~~~~~~~~~~~~~~~~~------- 480 (501) T protein:vir:25 413 DTEARSFGAVVDGITKLASAGI--PI---EHLLSMVPGMTQQTIQAIKDSLRGGEVKSLVDKLLSNEPAPVPP------- 480 (501) T ss_pred CCCCCCHHHHHHHHHHHHhcCC--CH---HHHHHHcCCCCHHHHHHHHHHHHHHhHHHHHHHhhccCcCCCCC------- Confidence 8889999999999999999985 21 2334556788643211000 00000111110000000 Q ss_pred CCCcccccccccchhhhhcC Q lcl|NC_016071. 
497 GNGTGKISSTRDNSVSNMDN 516 (516) Q Consensus 497 ~~~~~~~~~~~d~~~~~~~~ 516 (516) .+++..+..+++..+-.+ T Consensus 481 --~~~~~~~~~~~~~~~~~~ 498 (501) T protein:vir:25 481 --PPPQAAAQALNEGGVNGN 498 (501) T ss_pred --CCCCCCccccccccCCCC Confidence 000011111111111111 No 221 >protein:vir:101541 Length: 694 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958122;genbank:gi:41057668;genbank:GeneID:2716798 Probab=67.78 E-value=0.25 Score=23.91 Aligned_cols=445 Identities=10% Similarity=0.044 Sum_probs=160.2 Q ss_pred CCcc----------------------------------------ccCcccccchhhhccc--CCCCccc-----ccchHH Q lcl|NC_016071. 1 MSTR----------------------------------------FAQPSEVVKAGNENLA--VSRLRTG-----ELGSGA 33 (516) Q Consensus 1 ~~~r----------------------------------------~~~~~~~~~~~~~~p~--~~~~~~~-----e~g~~~ 33 (516) ||.| +++.+-.+.....-|. .|..|++ +.+..+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (694) T protein:vir:10 1 MSRRNAKKRTQLARTGRRPEVAKAAALAAAATIATAAAQPVPADFARRGALNALDAAPVAEPSPSLRLARQFEVDVSNYT 80 (694) T ss_pred CCccchhhHHHHhhcCCCcchhhhhhhhhhhhhhhcCCCcccCCccccccchhhcccccCCCCcchhhhhhccccccCCC Confidence 3322 2222222221111111 1111111 111000 Q ss_pred HHHHHHHHHhhc-----ccccCC-----cccHHHHHHHhhChHHHHHHHHHHHHHhcCCceeeeCCCCCCh--------- Q lcl|NC_016071. 34 LSQLRAESEVMK-----VEELRW-----PCFLATVEAMKQDHTVSTALDTKYVFVTKAFNDFKVLYNRDSK--------- 94 (516) Q Consensus 34 ~~~~~~~~~~~~-----~~~lr~-----~~~~~~y~~m~~D~~v~s~l~~Rk~~v~~~~w~i~~~~~~d~~--------- 94 (516) ...-+...+... -..|.| --+|-+...|..-+.+.++....-....+ +|. ++..+..+. 
T Consensus 81 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~Gy~~la~laQ~~eyr~~~~~ia~e~~R-~w~-~~~~~~~e~~~~~g~~~~ 158 (694) T protein:vir:10 81 PRERRAASYALDFNGTSMDALSFVTSSGFPGFPTLVLLAQLPEYRAMHEVLADECIR-TWG-EAIGGTKEKADTSGLAAG 158 (694) T ss_pred ccccchhhhhhccCcccccchhhhhccCcchHHHHHHHhhccchhhHHHHHHHHhhc-ccc-eeccccchhhhhhccccc Confidence 000000011110 000111 01344445556667777777777666554 483 332222221 Q ss_pred ------hhHHHHHHHHHHHhhccCcCCHHHHHHHHHHHHhhcceeeeEEEeeccccccccccee----------eccccc Q lcl|NC_016071. 95 ------ASKDAAEFVEYALKNLANQQTLRDIARSAATFNEYGFSIFEKVYRTESAPSKYAGYIT----------IDKIAF 158 (516) Q Consensus 95 ------~~~~~a~~v~~~l~~~~~~~~~~~~l~~~lda~~~G~S~~Eivw~~~~~~~~~~g~~~----------~~~l~~ 158 (516) .+.+..+.++..++++.. |..+...+-.+.+||=++. +...++..-..+.-+. ++.|.+ T Consensus 159 ~~~~~~~d~dqi~~L~~e~erl~V---~~~l~eaik~aRlfGGa~~--~i~I~gdd~~l~~PL~~~~~~I~kGslKGl~V 233 (694) T protein:vir:10 159 GNAASTSDGDQLKQINDEIERLRI---RDAVRTTVIHDQAFGRAHP--YFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRV 233 (694) T ss_pred ccccccccHHHHHHHHHHHHHHHH---HHHHHHHHHhhccccceEE--EEEeecCccccccccccccccccCcceeeeEe Confidence 122556778888887753 4455555557999999983 2222221110011110 011111 Q ss_pred cCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecC------cC Q lcl|NC_016071. 159 RPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGG------TE 232 (516) Q Consensus 159 r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~------~~ 232 (516) .. +||.....-+. .++....++.+.++.+. |..|=..+++.|.-.+ -. T Consensus 234 iD-------p~~vtP~~~n~-----~dP~spdfgkP~~y~V~--------------G~~IH~SRL~~f~g~plPd~LKp~ 287 (694) T protein:vir:10 234 VE-------PYWVTPNNYNS-----INPVADDFYKPSTWWMI--------------GTEVHATRLHTIVSRPVGDMLKPT 287 (694) T ss_pred ec-------ccccccchhhh-----ccchhhccCCCceEEEe--------------ceEEeeeeEEEecCCCchhhhhcc Confidence 11 23322211000 11222222222222211 1123333444333221 12 Q ss_pred CccccchhHHHHHHHHH---HHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccc Q lcl|NC_016071. 
233 SNPAGVSPLVGCYRAFR---EKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQ 309 (516) Q Consensus 233 g~p~G~gLlr~~~~~~~---fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~ 309 (516) -|.+|.++...++..+. -.+.... ++.+ ++... +++. -+.+.-.+. .+.+...+ .++++.++. .. T Consensus 288 y~~~G~Sv~q~~~e~V~~~~rT~~~v~-~Li~-~~~v~---~lk~---dla~~L~~g--~~~~l~~R-~eli~~~Rs-n~ 355 (694) T protein:vir:10 288 YSFAGISMTQLAMPYIDNWLRTRQSVS-DIVK-QFSVS---GILM---DLAQALMPG--ANVDLSMR-AELINRYRD-NR 355 (694) T ss_pred cccCcccHHHHHHHHHHHHHHHHhHHH-HHHH-hhhhH---HHHH---HHHHhhcCh--hHHHHHHH-HHHHHHhcC-cc Confidence 35678898888775322 1111111 1111 00000 0000 000011111 11222222 244444442 23 Q ss_pred eEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhccccc--ccCCccchhhHHHHHHHHHHHHHH Q lcl|NC_016071. 310 AYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFIN--LGNDGQGSYNLSESKQSIHGHFVQ 387 (516) Q Consensus 310 a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLt--s~~~~~GS~Al~~vh~ev~~~~~~ 387 (516) ...+|=++. .+.+.++.+=| ....+|.-.-.+||-+. +-.+| .+.+-.|=-|-|+--..+.-|.++ T Consensus 356 G~~llDk~~-------Eefeq~stslS----GLddVi~qf~q~VAgaa-~IPltkLfGqSPkGlNATGE~D~rnYYD~I~ 423 (694) T protein:vir:10 356 NILFLDKAT-------EEFFQFNTPLS----GLDALQAQAQEQMSAVS-HIPLIKLLGITPTGLNASSEGEIRVWYDYVR 423 (694) T ss_pred ceEEEecCC-------cceEEEecccC----CHHHHHHHHHHHHHhhh-cCchhhhhccCcccccccchhhHHHHHHHHH Confidence 333442222 23344433222 24556666666666442 22222 122223433556655666666666 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCcCCccccceE--EecCcCchh------H-HHHHHHHHHHHhCCcccccHHHHHHHH Q lcl|NC_016071. 388 RDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKL--KPGLIQEVD------M-EGFSKFVQRIGAVGYLPKTPTVINKIL 458 (516) Q Consensus 388 aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~--~~~~~~~~d------l-~~~a~~~~~L~~~G~~~~~~~~~~~i~ 458 (516) +........+-+.|+.-| .+.. ++. 
..|.| +|....+-+ + +..|++++.+++.|++.++ .++ T Consensus 424 s~Qe~~L~p~L~rl~~ii-~rS~-~G~--idp~i~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~-----evr 494 (694) T protein:vir:10 424 AYQRNALQQLMNDVIVMI-QLSL-FGA--VDPSIKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPD-----QVA 494 (694) T ss_pred HHHHHHHHHHHHHHHHHH-HHHh-cCC--CCCcceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHH-----HHH Confidence 655433333223343333 2221 222 23444 443222222 2 3346678899999998873 567 Q ss_pred HHcCCCCCC------C-ccccc-Ccc--cccCCCCCCcccccccccCCCCCcccccccccchhhhhcC Q lcl|NC_016071. 459 EVGGFDEEI------P-EDMST-DEL--LKLLGQDTSRSGDGMTAGSNGNGTGKISSTRDNSVSNMDN 516 (516) Q Consensus 459 e~~Glp~~~------~-~~~~~-~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~ 516 (516) +++.-.+.- + +|++. +.. .........+.+++...+.+++ +.+.-.+.-+++| T Consensus 495 ~rL~~d~~s~Y~~~~D~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~g~~~~~~v~~ 557 (694) T protein:vir:10 495 ARLNTEPDGPYAGKLDANDDPGVPADDDIDGVLTYVQRLAEGGDTGAPGG-----ARAGATAPPTVAN 557 (694) T ss_pred HHHhcCCCcccccccccccCCCcCccchhhhhHhhhcCcccccccCCCCc-----ccccccCCCcccc Confidence 775443211 0 11100 000 0000011112222222222222 1111122222233 No 222 >protein:vir:103458 Length: 524 # NCBI annotation: portal vertex of the head # Family: family:all:1036 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803110;genbank:gi:116326390;genbank:GeneID:4405487 Probab=67.25 E-value=0.26 Score=23.84 Aligned_cols=444 Identities=12% Similarity=0.078 Sum_probs=157.9 Q ss_pred CCccccCcccccchhhhccc-CCCCccc---ccchH---HHHHHHHHHHhh---cccccC-CcccHHHHHHHhhChHHHH Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLA-VSRLRTG---ELGSG---ALSQLRAESEVM---KVEELR-WPCFLATVEAMKQDHTVST 69 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~-~~~~~~~---e~g~~---~~~~~~~~~~~~---~~~~lr-~~~~~~~y~~m~~D~~v~s 69 (516) +........+.- .+..+| +||...- |+-.. +...+.|..+.. 
..+.++ ..+.|+.|++|..+|.|-+ T Consensus 13 ~~~de~~~~~~~--~~~~~S~~~p~~~Dga~e~~~~~~~~a~~~~g~~~~~~g~~e~~~~~~~eLI~~YR~ma~~pEvd~ 90 (524) T protein:vir:10 13 AKMDERNFKDQE--KEDLVSITAPKLDDGAREFEVSSNEAASPYNAAFQTIFGSYEPGMKTTRELIDTYRNLMNNYEVDN 90 (524) T ss_pred ccCcchhhhhhh--ccCCccccCccCCCCceeeeecccccccccceeeeehhcccccccchHHHHHHHHHHHhhccchhh Confidence 111111111111 111111 1111110 11000 000122221111 112222 3458999999999999999 Q ss_pred HHHHHHHHHhcCCc-e--eeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHHHHHhhcceeeeEEEeecccccc Q lcl|NC_016071. 70 ALDTKYVFVTKAFN-D--FKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAATFNEYGFSIFEKVYRTESAPSK 146 (516) Q Consensus 70 ~l~~Rk~~v~~~~w-~--i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~lda~~~G~S~~Eivw~~~~~~~~ 146 (516) +++-.-.-+.-.+- . |.+.- .+.+.++.+-+.+.+. |+ .+..+|+.--+||..+= .|+ T Consensus 91 Av~eIVneaiv~d~~~~pV~l~L-~~~~~s~~iK~kI~ee---------F~-~Il~ll~F~~~~~~~fR--------~WY 151 (524) T protein:vir:10 91 AVSEIVSDAIVYEDDTEVVALNL-DKSKFSPKIKNMMLDE---------FN-DVLNHLSFQRKGSDHFR--------RWY 151 (524) T ss_pred HHHHhhcceeEecCCCceEEEEe-cCcCcchHHHHHHHHH---------HH-HHHHHhccchhhhHHHh--------hhe Confidence 99987654321110 0 01110 0111223333333332 22 33356666666666542 233 Q ss_pred cccceeecccc-------------ccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCC Q lcl|NC_016071. 147 YAGYITIDKIA-------------FRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSA 213 (516) Q Consensus 147 ~~g~~~~~~l~-------------~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 213 (516) -||++.++++. ...|..|+..|......++..- +-.....+.-|..+.... ...-...... 
T Consensus 152 VDgRi~fhKiid~k~pk~GI~Elr~lDPr~i~~vr~i~~~~~~~~~--vi~~~~e~f~Y~~~~~~y----~~~g~~~~~~ 225 (524) T protein:vir:10 152 VDSRIFFHKIIDPKRPKEGIKELRRLDPRQVQYVREIITETEAGTK--IVKGYKEYFIYDTAHESY----ACDGRMYEAG 225 (524) T ss_pred eeeEEEEEEEeeCCCccccceeeeeeCCccceeeeeeccCCCccch--hhcchhhheeeccCcccc----ccCccccCCC Confidence 45555544433 2222233332222222222110 011111112222221110 0111112334 Q ss_pred CccccccccEEEEeecCcCC-c-cccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccC--CCCHH Q lcl|NC_016071. 214 DEVFIPINKLMVMSLGGTES-N-PAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAI--DPKSP 289 (516) Q Consensus 214 ~~~~iP~~k~i~~~~~~~~g-~-p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~--~~~~~ 289 (516) .++.||.+- |+|+|..-.+ + -.=.|.|.++..|+==-+.....-.+ ..+-.+|.+|+=+-.- =|... T Consensus 226 ~~ikI~~dA-I~y~hSGL~d~~~~~i~gyLhkAiKp~NQLkmlEDAlVI--------YRitRAPeRRvFYIDvGnlPk~K 296 (524) T protein:vir:10 226 TKIKIPKAA-IVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAVVI--------YRITRAPDRRVWYVDTGNMPARK 296 (524) T ss_pred cceecchhh-eeeeeccceeCCCCceeccchhhhHHHHhhhHHHhhHHH--------HhhhccccceEEEEecCCCCchh Confidence 677787765 8888843211 0 11237888888776433322221111 1122223222211111 12222 Q ss_pred HHHHHHHHHHHHHHhhc----ccceEE--------------EeccCcccccccccceeeeeccccCcchhHHHHHHHHHH Q lcl|NC_016071. 290 ESEMVQGLMADAANAHA----GEQAYF--------------ILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKK 351 (516) Q Consensus 290 ~~~~l~~l~~~~~~~~~----g~~a~~--------------iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~ 351 (516) .++. +..++..++. ....|- -||.= +.....+|+-+ .|+.... 
-.+=|+|..+ T Consensus 297 AeqY---l~~im~k~KNklvYDa~TGev~ddrk~msMlEDyWLpRR---eGgrgTEItTL--pGgqnlg-em~DV~YF~k 367 (524) T protein:vir:10 297 AAEH---MQHVMNTMKNRVVYDASTGKIKNQQHNMSMTEDYWLQRR---DGKAVTEVDTL--PGADNTG-NMEDVRWFRQ 367 (524) T ss_pred HHHH---HHHHHHhcCceeEEeCCCCeeccchhhhhhHhhhccccc---CCCcccceeec--cccCCcC-hHHHHHHHHH Confidence 2222 2333322211 001111 12210 00111233333 2222222 3345799999 Q ss_pred HHHHHHhcccccccCCccchh---hHHH-HHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCc---cccceEEe Q lcl|NC_016071. 352 AILDRFGAGFINLGNDGQGSY---NLSE-SKQSI-HGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSD---EDMPKLKP 423 (516) Q Consensus 352 ~Isk~iLGqtLts~~~~~GS~---Al~~-vh~ev-~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~---~~~P~~~~ 423 (516) .+-+++--..--...++.|.. ..++ +-.|+ |...++.-...+...|..-|-..|+. .+.--+. .--+.+.| T Consensus 368 kLy~aLnVP~sRl~~d~~~~f~~gr~~EItRDEikF~KFI~rLR~rFs~~f~~~Lk~qLil-Kgiit~eew~~i~~~I~~ 446 (524) T protein:vir:10 368 ALYMALRVPLSRIPQDQQGGVMFDSGTSITRDELTFAKFIRELQHKFEEVFLDPLKTNLLL-KGIITEDEWNDEINNIKI 446 (524) T ss_pred HHHHHhCCchhhcCCCCCccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh-ccCCCHHHHHHHhhcceE Confidence 998888776643332222112 2223 33343 33334444444555554333333332 2111110 11133444 Q ss_pred cCcCc------hhHHHHHHHHHHHHhCCcccccHHHHHHHHHHc-CCCCCCC--cccccCcccc--cCCCCCCccccc Q lcl|NC_016071. 424 GLIQE------VDMEGFSKFVQRIGAVGYLPKTPTVINKILEVG-GFDEEIP--EDMSTDELLK--LLGQDTSRSGDG 490 (516) Q Consensus 424 ~~~~~------~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~-Glp~~~~--~~~~~~~~~~--~~~~~~~~~~~~ 490 (516) +...+ .+.+-+.+++..|..+-=.+-...+.+|+++.+ .+...+- ++...+.+.+ .-+++.....+. 
T Consensus 447 ~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~~~k~I~~E~k~~~~~~~~~~~~~f 524 (524) T protein:vir:10 447 EFHRDSYFTELKEAEILERRINMLTMAEPFIGKYISHRTAMKDILQMTDEEIEQEAKQIEEESKEARFQDPDQEQEDF 524 (524) T ss_pred EeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCCchhhhcC Confidence 43333 333444445555543322222234577887654 4432111 1111111111 011111111111 No 223 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=66.55 E-value=0.27 Score=23.74 Aligned_cols=441 Identities=11% Similarity=-0.014 Sum_probs=161.3 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHh---hcccccCCcccHHHHH--------------HH-- Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEV---MKVEELRWPCFLATVE--------------AM-- 61 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~---~~~~~lr~~~~~~~y~--------------~m-- 61 (516) ...|+..-++..-.....-....... ..+..++.. ...+.+ .+..+.|+ +- T Consensus 17 ~~~~~~~~~n~~~~~~~~~~~~~~~~--------~~i~~~i~~~~~~~~~r~--~~l~~Yy~g~~~i~~~~~~~~~~~~~ 86 (511) T protein:vir:10 17 INYLFNDEANVVYTYDGTESDLLQNV--------NEVSKCIEHHMDYQRPRL--KVLSDYYEGKTKNLVELTRRKEEYMA 86 (511) T ss_pred hhhhhhhhhcCCccCchhhhhcccCH--------HHHHHHHHHHHHhhHHHH--HHHHHHhcccCccccccCcccccccC Confidence 22233222222211110000000000 011111110 000100 00011110 00 Q ss_pred --h-hChHHHHHHHHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHH-HHHhhcceeeeEE Q lcl|NC_016071. 62 --K-QDHTVSTALDTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAA-TFNEYGFSIFEKV 137 (516) Q Consensus 62 --~-~D~~v~s~l~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~l-da~~~G~S~~Eiv 137 (516) + ......-++......+.+-+..+++ + +.++.+++..+++.- .|..+..++. ++.-||. 
+++++ T Consensus 87 ~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~----~---d~~~~~~l~~~~~~n----~~~~~~~~~~~~~~i~G~-ay~~v 154 (511) T protein:vir:10 87 DNRVAHDYASYISDFINGYFLGNPIQYQD----D---DKDVLEAIEAFNDLN----DVESHNRSLGLDLSIYGK-AYEIM 154 (511) T ss_pred cceeecchHHHHHHHHhhhhcccCceeec----C---chHHHHHHHHHHhhc----CHHHHHHHHHHHHHhcCe-eEEEE Confidence 0 1233344445555555555555542 1 223456677766542 3666665544 6888997 45788 Q ss_pred EeecccccccccceeeccccccCchhcccccceeecCC--Cceeeecccccccccccc----cccccccccccccccccc Q lcl|NC_016071. 138 YRTESAPSKYAGYITIDKIAFRPQSSLSRSKPWVFDED--GRTLKGIYQSKMAFANFQ----NGLTQISSAMSLVTNLTS 211 (516) Q Consensus 138 w~~~~~~~~~~g~~~~~~l~~r~q~ti~~~~~f~~~~d--g~~l~~~~q~~~~~~~~~----~~~~~~~~~~~~~~~~~~ 211 (516) |... +|.+.+..+.|+. . ...|++. ++.+..++.......... ..+..+-.+..+..+... T Consensus 155 y~de------dg~~~i~~~~p~~---~----~~vydd~~~~~~~~~vr~~~~~~~d~~~~~~~~~~~iyt~~~i~~~~~~ 221 (511) T protein:vir:10 155 IRNQ------DDETRLYKSDAMS---T----FVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTS 221 (511) T ss_pred EeCC------CCceEEEEEccce---e----EEEEcCCCCCceEEEEEEEEeeecccCccceEEEEEEEeCCcEEEEEec Confidence 8643 3445444433321 1 1133332 233443332111000000 000000011111111111 Q ss_pred CCC----------ccccccc--cEEEEeecCcCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccc Q lcl|NC_016071. 212 SAD----------EVFIPIN--KLMVMSLGGTESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQIL 279 (516) Q Consensus 212 ~~~----------~~~iP~~--k~i~~~~~~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~ 279 (516) ... ..+-|.. -++.|. .|..|.|.+..+-..-=--...+..++..++.+..+++++++.... 
T Consensus 222 ~~~~~~~~~~~~~~~~~~~~~vPvv~f~-----nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~- 295 (511) T protein:vir:10 222 RTNGLKLTPRENGFESHSFERMPITEFS-----NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNL- 295 (511) T ss_pred CCCcccccccccccccccCcceeEEEec-----CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeeccccC- Confidence 111 1111111 123333 3567888888764432222345566777788888999888874321 Q ss_pred ccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhc Q lcl|NC_016071. 280 NKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGA 359 (516) Q Consensus 280 ~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLG 359 (516) +.. +........ +.... ....+-+.+ .......+++++..+. ....+...++.+.+.|.+.--. T Consensus 296 -----~~~--~~~~~~~~~--~~~~~---~~~~~~~~~--~~~~~~~d~~~l~~~~--~~~~~e~~~~~L~~~I~~~s~~ 359 (511) T protein:vir:10 296 -----DPV--EVRKQKEAN--VLFLE---PTVYADSEG--RETEGSVDGGYIYKQY--DVQGTEAYKDRLNSDIHMFTNT 359 (511) T ss_pred -----Cch--hhccchhcc--ceecc---ccccccccc--ccCCCCcceeEEeecC--CHHHHHHHHHHHHHHHHHHhCC Confidence 111 100000000 00000 000000000 0111223455554432 2234677888888888776555 Q ss_pred ccccccCCccchhhHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHh---cCC-cCCc-cccceEEecCcCchh Q lcl|NC_016071. 360 GFINLGNDGQGSYNLSESKQSIHGHF----VQRDIDIIVEAFNKNLIPQLLAL---NDI-RLSD-EDMPKLKPGLIQEVD 430 (516) Q Consensus 360 qtLts~~~~~GS~Al~~vh~ev~~~~----~~aDa~~i~~~ln~~li~~lv~l---N~~-~~~~-~~~P~~~~~~~~~~d 430 (516) ..++.++-++.+ +.+........ +..-.+.+...|. ++++.++.+ ... ..+. 
..-..+.|...-+.| T Consensus 360 P~~~~~~~~~n~---Sg~Al~~~~~~l~~k~~~k~~~f~~~l~-~~~~li~~~~~~~~~~~~~~d~~~i~i~f~~~~p~d 435 (511) T protein:vir:10 360 PNMKDDNFSGTQ---SGEAMKYKLFGLEQRTKTKEGLFTKGLR-RRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKS 435 (511) T ss_pred cccccccccccc---hHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhhCCcccccccceeeEEeCCCCCcC Confidence 545543322111 11121111111 2222333444443 244444443 111 1111 112477888888999 Q ss_pred HHHHHHHHHHHHhCCcccccHHHHHHHHHHcCC-CCCCCcccccCccccc---CCCCCCcccccccccCCCCCccccccc Q lcl|NC_016071. 431 MEGFSKFVQRIGAVGYLPKTPTVINKILEVGGF-DEEIPEDMSTDELLKL---LGQDTSRSGDGMTAGSNGNGTGKISST 506 (516) Q Consensus 431 l~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gl-p~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~ 506 (516) ..+.++++.+|+ |++. .+.+.+.++. +.+..+-+-...+.+. .........++. .+.+....+. T Consensus 436 ~~~~~~~~~kl~--G~iS-----~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~ 503 (511) T protein:vir:10 436 LIEELKAYIDSG--GKIS-----QTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRD-----INDDEQDDDT 503 (511) T ss_pred HHHHHHHHHHHh--ccCc-----HHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhhcccCCCC-----CCCCCCCCcc Confidence 999999999985 6533 3456666653 3221110001111000 000000000000 0111111111 Q ss_pred ccchhhhh Q lcl|NC_016071. 507 RDNSVSNM 514 (516) Q Consensus 507 ~d~~~~~~ 514 (516) ..++.+-- T Consensus 504 ~~~~~~~~ 511 (511) T protein:vir:10 504 KDTVDKKE 511 (511) T ss_pred cCcccccC Confidence 11111111 No 224 >protein:vir:100598 Length: 516 # NCBI annotation: gp20 head portal vertex protein # Family: family:all:1036 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656382;genbank:gi:109290133;genbank:GeneID:4156576 Probab=64.61 E-value=0.3 Score=23.47 Aligned_cols=445 Identities=14% Similarity=0.108 Sum_probs=158.7 Q ss_pred CCccccCc--------ccccchhhhcccCCCCccc--cc--chHHHHHHHHHHH--hhcccccC-CcccHHHHHHHhhCh Q lcl|NC_016071. 
1 MSTRFAQP--------SEVVKAGNENLAVSRLRTG--EL--GSGALSQLRAESE--VMKVEELR-WPCFLATVEAMKQDH 65 (516) Q Consensus 1 ~~~r~~~~--------~~~~~~~~~~p~~~~~~~~--e~--g~~~~~~~~~~~~--~~~~~~lr-~~~~~~~y~~m~~D~ 65 (516) |++=++-- .+..+...+-|+.|..--+ ++ |.. .....|+.+ ....++.+ ..++++.|++|...+ T Consensus 3 ~~~lf~f~~~~d~~~~~~~~~~~~~s~~~p~~~DGa~~i~~~~~-~~~~~g~~~~~~d~~~~~~~~~~LI~~YR~ma~~p 81 (516) T protein:vir:10 3 FLDLFKFWDRVDQNEYDERLKQGHESIATPKKDDGATEIEAREG-ESSYNALMQQFFGIDNNISGTKDLINTYRQLTNNP 81 (516) T ss_pred chHhcccccchhhHHHHhhhcCCCCcccCCCCccCceeeecCcc-cccccceeeeeecccCccccHHHHHHHHHHhhhcc Confidence 22211110 0000000001111111100 11 100 001122211 01122333 245799999999999 Q ss_pred HHHHHHHHHHHHHhcCC-ce--eeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHHHHHhhcceeeeEEEeecc Q lcl|NC_016071. 66 TVSTALDTKYVFVTKAF-ND--FKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAATFNEYGFSIFEKVYRTES 142 (516) Q Consensus 66 ~v~s~l~~Rk~~v~~~~-w~--i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~lda~~~G~S~~Eivw~~~~ 142 (516) .|-++++-.-.-+.-.+ -. |.+. -.+.+.++.+-+.|.+.+ +.+..+|+.--+||..+= T Consensus 82 Evd~Av~eIvneaiv~d~~~~pV~l~-l~~~e~s~sik~kI~eeF----------~~Il~ll~F~~~~~~~fR------- 143 (516) T protein:vir:10 82 EVERAVANIVNEAVVYEKGHKVVSLD-LDDTEFSSSIKDKILEEF----------DEICRLLDASRKLDTLFR------- 143 (516) T ss_pred chhHHHHHhhcceeEecCCCceEEEE-ecccccchHHHHHHHHHH----------HHHHHHhccchhhhHHHH------- Confidence 99999998765432111 00 0110 011112333333343332 234555666666666542 Q ss_pred cccccccceeeccccccCchhcc-----------cccceeec-CCCceeeeccccccccccccccccccccccccccccc Q lcl|NC_016071. 143 APSKYAGYITIDKIAFRPQSSLS-----------RSKPWVFD-EDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLT 210 (516) Q Consensus 143 ~~~~~~g~~~~~~l~~r~q~ti~-----------~~~~f~~~-~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~ 210 (516) .|+-||++.++++..+|..-|. ..|..... .+|...+. 
....+..|..+- ..+...-..- T Consensus 144 -~WYVDgRi~fhKiid~~k~GI~elr~lDPr~i~~vR~i~~~~~~~~~v~~---~~~e~~~Y~~~~----~~~~~~g~~~ 215 (516) T protein:vir:10 144 -RWYIDSRIFFHKIMPNPKEGIVELRRLDPRHVEYYREIVTSDVGGTSVVK---GYREFFVYTTGN----EGYAYNGRLF 215 (516) T ss_pred -hhhhcceEEEEEEecCcccceeeeeeeCCcceeeEEeeecccCcchhhhh---ceeeeeeeecCc----cceecccccc Confidence 2444566665555554443333 22211111 11111110 000111111110 0000000000 Q ss_pred cCCCccccccccEEEEeecCc--CCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccC--CC Q lcl|NC_016071. 211 SSADEVFIPINKLMVMSLGGT--ESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAI--DP 286 (516) Q Consensus 211 ~~~~~~~iP~~k~i~~~~~~~--~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~--~~ 286 (516) +....+.||.+ .|+|+|..- .+...=.|.|.++..|+==-+.....-.++ .+-.+|.+|+=+-.- =| T Consensus 216 ~~~~~ikI~~d-aI~y~hSGl~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIY--------RitRAPeRRvFYIDVGnLP 286 (516) T protein:vir:10 216 EPNTRIKIPRS-AIVYAHSGLQDCSDRGIVGYLHNAVKPANQLKLLEDALVIY--------RITRAPERRVFYIDVGNMP 286 (516) T ss_pred CCCCceecchh-heeeeecCcccCCCCceeceehhhhHhHHhhHHHHhhHHHH--------hhhccccceEEEEecCCCC Confidence 11234566654 577777431 111122588888888764333322212211 122222222211111 12 Q ss_pred CHHHHHHHHHHHHHHHHhhc----ccceEE--------------EeccCcccccccccceeeeeccccCcchhHHHHHHH Q lcl|NC_016071. 287 KSPESEMVQGLMADAANAHA----GEQAYF--------------ILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNS 348 (516) Q Consensus 287 ~~~~~~~l~~l~~~~~~~~~----g~~a~~--------------iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~ 348 (516) ....++. +..++..++. .+..|- -||.= +.....+|+-+ .|+... 
.-.+=|+| T Consensus 287 k~KAeqY---l~~iM~k~KNklvYDa~TGev~ddrk~msMlEDyWLpRR---eGgrgTEItTL--pGgqnl-gem~DV~Y 357 (516) T protein:vir:10 287 NRKATEY---VNGIMQSLKNRVVYDSNTGTVKNQKRNLSMTEDYWLMRR---DGKSVTEVTSL--PGAQTM-GEMDDVRW 357 (516) T ss_pred chhHHHH---HHHHHHhcCceeEEeCCCCeeccchhhhhhHhhhccccc---CCCcccceeec--cccCCc-ChHHHHHH Confidence 2222222 2333322211 001111 12210 00111233333 222222 23345899 Q ss_pred HHHHHHHHHhcccccccCCccchh---hHHHH-HHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCc---cccce Q lcl|NC_016071. 349 RKKAILDRFGAGFINLGNDGQGSY---NLSES-KQSI-HGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSD---EDMPK 420 (516) Q Consensus 349 ~d~~Isk~iLGqtLts~~~~~GS~---Al~~v-h~ev-~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~---~~~P~ 420 (516) ..+.+-+++--..--.+.+++++. ..+++ ..|+ |...+..-...++..|..-|-..|+.=+ .--+. .--+. T Consensus 358 F~kkLy~aLnVP~SRl~~e~~~~~~~Gr~~EItRDEiKF~KFI~rLR~rFs~lF~~~L~~qLilKg-Iit~eeW~~i~~~ 436 (516) T protein:vir:10 358 FNKKLYEALRIPLSRMPRDDGGMVIGGQDMAITRDELDFRKFIVQLQHNFEEIFLDPLKTNLIYKK-IILESEWEEQINN 436 (516) T ss_pred HHHHHHHHhCCCcccccCCCCceeeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcC-CCCHHHHHHHhhc Confidence 999999998887644444443332 22333 3334 3333444444444444433333332211 11100 11133 Q ss_pred EEecCcCc------hhHHHHHHHHHHHHhCCcccccHHHHHHHHHHc-CCCCCCCc--ccccCcccccCCCCCCcccccc Q lcl|NC_016071. 421 LKPGLIQE------VDMEGFSKFVQRIGAVGYLPKTPTVINKILEVG-GFDEEIPE--DMSTDELLKLLGQDTSRSGDGM 491 (516) Q Consensus 421 ~~~~~~~~------~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~-Glp~~~~~--~~~~~~~~~~~~~~~~~~~~~~ 491 (516) +.|+...+ .+.+-+.+++..|..+-=.+-...+.+||++.+ .++..+-. +...+.+.+ .+-...|..+. T Consensus 437 I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~~~k~I~~E~~-~~~~~~p~~e~- 514 (516) T protein:vir:10 437 IKVNFHQDSYYTELKDIETLRQRVDALSQIEPYVGKYVSHDYVMKNILQMTDEQIAQEEKQIEKEAN-VKRFQNPENED- 514 (516) T ss_pred ceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhcCCHhHHHHHHHHHHHhhh-CCCCCCCCccc- Confidence 44443333 333444445555443321122245578887654 55532111 111111111 11001111110 Q ss_pred cccCCCC Q lcl|NC_016071. 
492 TAGSNGN 498 (516) Q Consensus 492 ~~~~~~~ 498 (516) .+ T Consensus 515 -----~f 516 (516) T protein:vir:10 515 -----DF 516 (516) T ss_pred -----cC Confidence 00 No 225 >protein:vir:7208 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049782;genbank:gi:9632594;genbank:GeneID:1258582 Probab=64.29 E-value=0.3 Score=23.43 Aligned_cols=444 Identities=12% Similarity=0.077 Sum_probs=158.3 Q ss_pred CCccccCcccccchhhhccc-CCCCccc---ccchH---HHHHHHHHHHhh---cccccC-CcccHHHHHHHhhChHHHH Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLA-VSRLRTG---ELGSG---ALSQLRAESEVM---KVEELR-WPCFLATVEAMKQDHTVST 69 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~-~~~~~~~---e~g~~---~~~~~~~~~~~~---~~~~lr-~~~~~~~y~~m~~D~~v~s 69 (516) +........+.- .+..+| +||...- |+-.. +...+.|..+.. ..+.++ ..+.|+.|++|..+|.|-+ T Consensus 13 ~~~de~~~~~~~--~~~~~S~~~p~~~Dga~e~~~~~~~~a~~~~g~~~~~~g~~e~~~~~~~eLI~~YR~ma~~pEvd~ 90 (524) T protein:vir:72 13 AKMDERNFKDQE--KEDLVSITAPKLDDGAREFEVSSNEAASPYNAAFQTIFGSYEPGMKTTRELIDTYRNLMNNYEVDN 90 (524) T ss_pred ccCcchhhhhhh--ccCCccccCccCCCCceeeeecccccccccceeeeehhcccccccchHHHHHHHHHHHhhccchhh Confidence 111111111111 111111 1111110 11000 000122221111 112222 3458999999999999999 Q ss_pred HHHHHHHHHhcCCc-e--eeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHHHHHhhcceeeeEEEeecccccc Q lcl|NC_016071. 70 ALDTKYVFVTKAFN-D--FKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAATFNEYGFSIFEKVYRTESAPSK 146 (516) Q Consensus 70 ~l~~Rk~~v~~~~w-~--i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~lda~~~G~S~~Eivw~~~~~~~~ 146 (516) +++-.-.-+.-.+- . |.+.- .+.+.++.+-+.+.+. 
|+ .+..+|+.--+||..+= .|+ T Consensus 91 Av~eIVneaiv~d~~~~pV~l~L-~~~~~s~~iK~kI~ee---------F~-~Il~ll~F~~~~~~~fR--------~WY 151 (524) T protein:vir:72 91 AVSEIVSDAIVYEDDTEVVALNL-DKSKFSPKIKNMMLDE---------FS-DVLNHLSFQRKGSDHFR--------RWY 151 (524) T ss_pred HHHHhhcceeEecCCCceEEEEe-cCcCcchHHHHHHHHH---------HH-HHHHHhccchhhhHHHh--------hhe Confidence 99987654321110 0 01110 0111223333333332 22 33356666666666552 233 Q ss_pred cccceeecccc-------------ccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCC Q lcl|NC_016071. 147 YAGYITIDKIA-------------FRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSA 213 (516) Q Consensus 147 ~~g~~~~~~l~-------------~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 213 (516) -||++.++++. ...|..|+..|......++..- +-.....+.-|..+.... ...-...... T Consensus 152 VDgRi~fhKiid~k~pk~GI~Elr~lDPr~i~~vr~i~~~~~~~~~--vi~~~~e~f~Y~~~~~~y----~~~g~~~~~~ 225 (524) T protein:vir:72 152 VDSRIFFHKIIDPKRPKEGIKELRRLDPRQVQYVREIITETEAGTK--IVKGYKEYFIYDTAHESY----ACDGRMYEAG 225 (524) T ss_pred eeeEEEEEEEEeCCCccccceeeeeeCCccceeeeeeccCCCccch--hhcchhhheeeccCcccc----ccCccccCCC Confidence 45555554433 1222233332222222222110 001111112222221110 0111112334 Q ss_pred CccccccccEEEEeecCcCC-c-cccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccC--CCCHH Q lcl|NC_016071. 214 DEVFIPINKLMVMSLGGTES-N-PAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAI--DPKSP 289 (516) Q Consensus 214 ~~~~iP~~k~i~~~~~~~~g-~-p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~--~~~~~ 289 (516) .++.||.+- |+|+|..-.+ + -.=.|.|.++..|+==-+.....-.+ ..+-.+|.+|+=+-.- =|... 
T Consensus 226 ~~ikI~~dA-I~y~hSGL~d~~~~~i~gyLhkAiKp~NQLkmlEDAlVI--------YRitRAPeRRvFYIDvGnlPk~K 296 (524) T protein:vir:72 226 TKIKIPKAA-VVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAVVI--------YRITRAPDRRVWYVDTGNMPARK 296 (524) T ss_pred cceecchhh-eeeeeccceeCCCCceeccchhhhHhHHhhhHHHhhHHH--------HhhhccccceEEEEecCCCCchh Confidence 677787765 8888843211 0 11237888888776433322221111 1122223222211111 12222 Q ss_pred HHHHHHHHHHHHHHhhc----ccceEE--------------EeccCcccccccccceeeeeccccCcchhHHHHHHHHHH Q lcl|NC_016071. 290 ESEMVQGLMADAANAHA----GEQAYF--------------ILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKK 351 (516) Q Consensus 290 ~~~~l~~l~~~~~~~~~----g~~a~~--------------iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~ 351 (516) .++. +..++..++. ....|- -||.= +.....+|+-+ .|+.... -.+=|+|..+ T Consensus 297 AeqY---l~~im~k~KNklvYDa~TGev~ddrk~msMlEDyWLpRR---eGgrgTEItTL--pGgqnlg-em~DV~YF~k 367 (524) T protein:vir:72 297 AAEH---MQHVMNTMKNRVVYDASTGKIKNQQHNMSMTEDYWLQRR---DGKAVTEVDTL--PGADNTG-NMEDIRWFRQ 367 (524) T ss_pred HHHH---HHHHHHhcCceeEEeCCCCeeccchhhhhhHhhhccccc---CCCcccceeec--cccCCcC-hHHHHHHHHH Confidence 2222 2333322211 001111 12210 00111233333 2222222 3345799999 Q ss_pred HHHHHHhcccccccCCccchh---hHHH-HHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCc---cccceEEe Q lcl|NC_016071. 352 AILDRFGAGFINLGNDGQGSY---NLSE-SKQSI-HGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSD---EDMPKLKP 423 (516) Q Consensus 352 ~Isk~iLGqtLts~~~~~GS~---Al~~-vh~ev-~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~---~~~P~~~~ 423 (516) .+-+++--..--...++.|.. ..++ +-.|+ |...++.-...+...|..-|-..|+. .+.--+. 
.--+.+.| T Consensus 368 kLy~aLnVP~sRl~~d~~~~f~~gr~~EItRDEikF~KFI~rLR~rFs~~f~~~Lk~qLil-Kgiit~eew~~i~~~I~~ 446 (524) T protein:vir:72 368 ALYMALRVPLSRIPQDQQGGVMFDSGTSITRDELTFAKFIRELQHKFEEVFLDPLKTNLLL-KGIITEDEWNDEINNIKI 446 (524) T ss_pred HHHHHhCCchhhcCCCCCccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh-ccCCCHHHHHHHhhcceE Confidence 998888776643332222112 2223 33343 33334444444555554333333332 2111110 11133444 Q ss_pred cCcCc------hhHHHHHHHHHHHHhCCcccccHHHHHHHHHHc-CCCCCCC--cccccCcccc--cCCCCCCccccc Q lcl|NC_016071. 424 GLIQE------VDMEGFSKFVQRIGAVGYLPKTPTVINKILEVG-GFDEEIP--EDMSTDELLK--LLGQDTSRSGDG 490 (516) Q Consensus 424 ~~~~~------~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~-Glp~~~~--~~~~~~~~~~--~~~~~~~~~~~~ 490 (516) +...+ .+.+-+.+++..|..+-=.+-...+.+|+++.+ .+...+- ++...+.+.+ .-+++.....+. T Consensus 447 ~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~~~k~I~~E~k~~~~~~~~~~~~~f 524 (524) T protein:vir:72 447 EFHRDSYFAELKEAEILERRINMLTMAEPFIGKYISHRTAMKDILQMTDEEIEQEAKQIEEESKEARFQDPDQEQEDF 524 (524) T ss_pred EeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCCchhhhcC Confidence 43333 333444445555543322222234578887654 4432111 1111111111 011111111111 No 226 >protein:vir:95449 Length: 584 # NCBI annotation: hypothetical protein ORF047 # Family: family:all:1548 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294640;genbank:gi:149408206;genbank:GeneID:5237016 Probab=63.65 E-value=0.31 Score=23.35 Aligned_cols=443 Identities=11% Similarity=0.036 Sum_probs=166.9 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHHHHhhC---------------h Q lcl|NC_016071. 
1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVEAMKQD---------------H 65 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~D---------------~ 65 (516) |||..+..-...++ -.+++.-+.-|..+-++=+.-+.+|+++++.|..--.+ + T Consensus 1 ~~~~~~~~~~~~~~------------~~~~~~v~~~~~~~~~~r~~~~~~w~el~~y~~a~~~~~~~~~~~~~r~~~~~~ 68 (584) T protein:vir:95 1 MSVKVAELNSLLVR------------DSSAQWVAYLWDRFNNQRRQKIEEWKELRNYVFATDTTTTSNQGLPWKNSTTLP 68 (584) T ss_pred CCcchhhhhhhccc------------cchHHHHHHHHHHHHhhhchhhccCHHHHHHHHhhhhhhhhhcccccccccchh Confidence 88887766555431 13444556677777777667778898888766432222 2 Q ss_pred HHHHHHHHHHHHHh-----cCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHH-HHHHHhhcceeeeEEEe Q lcl|NC_016071. 66 TVSTALDTKYVFVT-----KAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARS-AATFNEYGFSIFEKVYR 139 (516) Q Consensus 66 ~v~s~l~~Rk~~v~-----~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~-~lda~~~G~S~~Eivw~ 139 (516) ++.+........+. +.+| +...+...+.+++..++.++..+.+=-....|...++. +.+++.||.+++=+-|. T Consensus 69 k~~~~~~~i~~~l~~~~Fp~~~w-~~~v~~~~~~~~~~~~~ai~~~i~dkl~e~~~~~~~~~~i~d~~~~G~~~~k~~~~ 147 (584) T protein:vir:95 69 KLCQIRDNLHSNYFSSLFPNDDW-LRWVGYGKGDSTKTKAKAIQAYMSNKCRESHFRTEVSKLIYDYIDYGNAFATVSFE 147 (584) T ss_pred HHHHHHHHHHHHHHHhhcCccce-eeeecCCCchhhHHHHHHHHHHHhhhhhhccHHHHHHHHHHhhccCCceEEEEeEe Confidence 33333333322222 2344 22222222223344466666666433223357666655 45799999999999998 Q ss_pred ecccccccccceeeccccccCchhcccccceeecCCC------ceee--------------------------------- Q lcl|NC_016071. 140 TESAPSKYAGYITIDKIAFRPQSSLSRSKPWVFDEDG------RTLK--------------------------------- 180 (516) Q Consensus 140 ~~~~~~~~~g~~~~~~l~~r~q~ti~~~~~f~~~~dg------~~l~--------------------------------- 180 (516) ..-.... ++........+|-- .++ +--|.||... .-+. 
T Consensus 148 ~~~~e~~-e~~~v~~~~~prie-riS-P~d~~~Dpsa~~i~d~~fivrs~~T~~~L~~l~~~~~~~~y~~d~v~~~~~~~ 224 (584) T protein:vir:95 148 AKYKEMT-DGTLVPDYIGPRLV-RIS-PLDIVFNPLATSISDTFKIVRSVKTKGELMRLAQDEPEQSYWLEALKRREEIC 224 (584) T ss_pred ecceeee-ccccccccccceEE-eeC-hhheeecCCCCCccchhhhhhhhhhHHHHHHHHhhcCccccchHHHHHHHHhc Confidence 7631100 00000000000000 000 0001111111 0000 Q ss_pred -eccc------ccccccc----------ccccccccccccccccc-----------cccCCCccc-------cc--cccE Q lcl|NC_016071. 181 -GIYQ------SKMAFAN----------FQNGLTQISSAMSLVTN-----------LTSSADEVF-------IP--INKL 223 (516) Q Consensus 181 -~~~q------~~~~~~~----------~~~~~~~~~~~~~~~~~-----------~~~~~~~~~-------iP--~~k~ 223 (516) ..+. +.+...+ +..+++.+..-++.+.. +....++.. .| ..-| T Consensus 225 ~~~~~~~~~~~~~~~~~~~d~~~~~~ey~~~~~V~vl~~~g~~~~~~~~e~~~~~iv~v~~g~~iIR~~~np~~~~~~PF 304 (584) T protein:vir:95 225 RHLGGYSVEDFDKAAGFDVDGFGNLYEYYMSDWVEILEFYGDYHDKETGELQTNRIITVVDRSTEVRNESIPTWFGSAPI 304 (584) T ss_pred cCCCCCcccccccccccccccccccccccCCceeEEEeecccccccccCCCcccceEEEEeccEEEEeeecCCCCCCCCE Confidence 0000 0000000 11111111111111100 000000111 22 2257 Q ss_pred EEEeecCcCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHH Q lcl|NC_016071. 224 MVMSLGGTESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAAN 303 (516) Q Consensus 224 i~~~~~~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~ 303 (516) ++..+.+...+.||.|.+..|--.-..++...+.-.--+..+..|++...+ ++ .+ T Consensus 305 ~~~~~~p~~~s~yG~gi~~ll~d~Q~~lna~~r~~iDnl~l~~~pv~k~~~----------~~---------------~~ 359 (584) T protein:vir:95 305 YHVGWRFRPDNLWAMGPLDNLVGMQYRIDHLENAKADAVDLIIQPPLKIIG----------EV---------------EE 359 (584) T ss_pred EEEcceeeeccccCCCchhhhhhHHHHHhHHHHHHHHHHHHhcCcceeecc----------cc---------------ch Confidence 888888889999999999999887777776666655556666666432221 11 11 Q ss_pred hhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHHHH-- Q lcl|NC_016071. 
304 AHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQSI-- 381 (516) Q Consensus 304 ~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~ev-- 381 (516) .+.+ .|..+ .......+.++....+.-...| ..|.+.+..+.......--..|.++ +.+++...+ T Consensus 360 ~~~~--pg~~~------~~~~~~~~q~~~p~a~~~~s~~-~~lq~~e~~me~~sGvp~~~~G~~~----~~~~TAtg~s~ 426 (584) T protein:vir:95 360 FVWG--PGAEI------HLDQGGDVQEIAKNVNYIINAD-NQIQMLEDRMELYAGAPREAMGIRT----PGEKTAFEVQQ 426 (584) T ss_pred hccc--CCcee------ecCCCCCcceecCchhhhhHHH-HHHHHHHHHHHhhhCCChhhccccc----chhhhHHHHHH Confidence 1112 11111 1111111222222211111123 2356666666653333222222221 122222211 Q ss_pred HHHHHHHHHHHHHHHHH----HHHHHHHHHhcCCcCCccccceEE--------ecCcCchhHH----------------- Q lcl|NC_016071. 382 HGHFVQRDIDIIVEAFN----KNLIPQLLALNDIRLSDEDMPKLK--------PGLIQEVDME----------------- 432 (516) Q Consensus 382 ~~~~~~aDa~~i~~~ln----~~li~~lv~lN~~~~~~~~~P~~~--------~~~~~~~dl~----------------- 432 (516) .......-.+.+.+++- +.|+..|..++-.+......+++. |..+..+|++ T Consensus 427 l~naa~~~~r~~~~~f~~~ll~~l~~ll~~~~~~nmd~~~~vr~~n~e~~~~~f~~i~r~Dl~g~~~~va~Ga~~~~~ke 506 (584) T protein:vir:95 427 LGNAAGRIFQEKVTTFEVELLEPVLNAMLETATRNMDGSDVIRVMDTDLGVKEFMSVTREDITANGKIRPIGARHFGKQA 506 (584) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCceeeeccccccccccccChhhhccCeeEEeehhhHHHHHH Confidence 11122222233333333 334444444421111112222222 2222223321 Q ss_pred ----HHHHHHHHHHhCCc-ccc---cHHHHHHHHHHcCCCCCC---CcccccCc--ccccCCCCCCcccccccccCCCCC Q lcl|NC_016071. 433 ----GFSKFVQRIGAVGY-LPK---TPTVINKILEVGGFDEEI---PEDMSTDE--LLKLLGQDTSRSGDGMTAGSNGNG 499 (516) Q Consensus 433 ----~~a~~~~~L~~~G~-~~~---~~~~~~~i~e~~Glp~~~---~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~ 499 (516) .+...++. .+|. +-| ...+.+.+.+..++|.-. ++-...++ ....+.+++... 
-..+.-++++ T Consensus 507 q~~q~l~~ilq~--~~~~~i~p~~~~~~l~~~ladl~~~p~~~~~~~~~~~~~Q~~~q~~~~~~q~~~--~~~~~~~~~~ 582 (584) T protein:vir:95 507 QDLQNLVGIFNS--QIGQMILPHTSGKALATFVDDVTGLQGYEIFRPNVAVAEQAETQSLVAQAQEDL--QLQAQMPAEG 582 (584) T ss_pred HHHHHHHHHHHh--hhhhhccccchHHHHHHHHHHHhCCCcccccCCCcccchhHHHHhhhHHHHHHH--HHHHhhhhcc Confidence 11222221 2232 111 122334455677777311 11011000 000000100000 0000000111 Q ss_pred cc Q lcl|NC_016071. 500 TG 501 (516) Q Consensus 500 ~~ 501 (516) .- T Consensus 583 ~~ 584 (584) T protein:vir:95 583 AI 584 (584) T ss_pred CC Confidence 10 No 227 >protein:vir:7430 Length: 563 # NCBI annotation: gp7 # Family: family:all:6920 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818545;genbank:gi:29566982;genbank:GeneID:1260216 Probab=57.72 E-value=0.43 Score=22.60 Aligned_cols=464 Identities=12% Similarity=0.063 Sum_probs=182.1 Q ss_pred CCccccCcccccchhhhccc--CCCCcccccchHHHHHHHHHHH--hhcccc----cCCcccHHHHHHHhhChHHHHHHH Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLA--VSRLRTGELGSGALSQLRAESE--VMKVEE----LRWPCFLATVEAMKQDHTVSTALD 72 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~--~~~~~~~e~g~~~~~~~~~~~~--~~~~~~----lr~~~~~~~y~~m~~D~~v~s~l~ 72 (516) |---.+|-.+..+.--+.-+ +|..-... ++++.-+.+ .-+..+ +|+.....+| |+-=+-.+. T Consensus 1 m~~~~~q~~p~~~~fp~~~a~wV~~~D~~R-----laaY~ly~d~y~n~~~el~~il~G~dr~~~~-----~ps~r~~V~ 70 (563) T protein:vir:74 1 MPYNHKQYDPAKPFLRGGDDNIVDENDKNR-----VRAYDLYENIYLNSAETLKLVLRGDDSVPIL-----MPSGRKIVE 70 (563) T ss_pred CCccccccCCCcccccccccccCCHHHHHH-----HHHHHHHHHhhcCchhhhhhhcCCCceeeec-----cchHHHHHH Confidence 44433333322221110000 22211110 122221111 111111 3333333322 222223334 Q ss_pred HHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHH-HHHHHHHhhcceeeeEEEeecccccccccce Q lcl|NC_016071. 73 TKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIA-RSAATFNEYGFSIFEKVYRTESAPSKYAGYI 151 (516) Q Consensus 73 ~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l-~~~lda~~~G~S~~Eivw~~~~~~~~~~g~~ 151 (516) + ...+++.+.+|.|++...++...+ .|+..|+++..+..|.-.. 
..-.+|..-|=.||=+.|...... .+++ T Consensus 71 ~-~~~~Lg~~~~~~Ve~~~~de~~~~---avq~~Lr~~~~~e~l~~~~~~~~r~a~vlGDgvf~l~wDp~K~~---g~R~ 143 (563) T protein:vir:74 71 A-VHRFLGVGFDYLVEPDMGDEGIRQ---SLNAYFRTTFKREAIKAKFTSNKRWGLIRGDAHFYIHADPNKKA---GERI 143 (563) T ss_pred H-HHHhcCCCcEEecCccccCcchHH---HHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeecccccc---CCCc Confidence 4 334557778888876654443333 3666666655454565444 345578999999999999853210 1233 Q ss_pred eeccccccC------------------------c----hhcccccceeec--CCCceeeecccccccccccccccccccc Q lcl|NC_016071. 152 TIDKIAFRP------------------------Q----SSLSRSKPWVFD--EDGRTLKGIYQSKMAFANFQNGLTQISS 201 (516) Q Consensus 152 ~~~~l~~r~------------------------q----~ti~~~~~f~~~--~dg~~l~~~~q~~~~~~~~~~~~~~~~~ 201 (516) .++.+-|+. + ..|-+.+.|.|. ++|-....+ -. ...-|..|.+...+ T Consensus 144 rv~~vDP~~~fp~~dpd~v~g~~~v~v~~~~~~pdd~~~~~~r~~~~~~~lndeg~~~~~~-~~--dae~w~lg~wd~r~ 220 (563) T protein:vir:74 144 SVDEVDPRQIFLIEDGSTVVGFHMVDIVQDFRSPDDPSKKLARRRTFRRVRNDEGMFTGRI-SS--ELTHWTLGNWDDRG 220 (563) T ss_pred eEeecCCceeeeccCCCCcccceeeecccCCCCCcchhccceeeeeeeeeeCCCCCcccee-ee--ccchhccccccccC Confidence 333322220 0 001111222221 222100000 00 00001111111111 Q ss_pred ccccc--------cccccCCCcccc--ccc--cEEEEeecCcCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccc Q lcl|NC_016071. 202 AMSLV--------TNLTSSADEVFI--PIN--KLMVMSLGGTESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGI 269 (516) Q Consensus 202 ~~~~~--------~~~~~~~~~~~i--P~~--k~i~~~~~~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~ 269 (516) ..+-. .......+...+ |.. 
-++++...+..+..+|.|.|..+--...--+..+.+....++-.|.|| T Consensus 221 ~~~~~~~~~~~~~~~~~~d~e~~~LP~pi~~iPiv~~~tip~~~s~WG~S~La~ll~~~~eLn~~~Td~s~i~~~tG~pi 300 (563) T protein:vir:74 221 AISDEQARRKEQVRSAQHDEEEEELPEPISQLPLYRWRNKPPQNSSWGTSQLEGMETLAYALNQSLTDEDATIVFQGLGM 300 (563) T ss_pred ccchhhhcccchhhhhhhhchhhhccccccCccEEEcCCCCCcccccchhhHHHHHHHHHHHhhhhhHHHHHHHhcCCCe Confidence 00000 000111122222 222 133455668889999999998887777666667767777777778888 Q ss_pred eeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEeccCcccccc---cccceeeeeccccCcchhHHHHH Q lcl|NC_016071. 270 IELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILPSDMNAQGG---EQYKMSLKGIDGAGKQYSTQELV 346 (516) Q Consensus 270 ~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~---e~~~iel~~~~g~g~~~~~~~li 346 (516) .++-+-. +.+... |.-+-.-|+.|+-++.. +...++.+ +|+-....++.-+ T Consensus 301 ~vl~~~~------p~d~~~------------------g~~~~w~vgpG~i~El~~~~~~g~l~~v--~g~~~l~~~q~Hm 354 (563) T protein:vir:74 301 YVTNASA------PVDPNT------------------GELTDWNIGPMQIVEIAGNRNDNYFERV--SGVQDVSPFQDHM 354 (563) T ss_pred EEecccc------cccccc------------------ccccccccCCceeEeccCCccccceeee--cchhhhHHHHHHH Confidence 7665311 111100 11111123444444433 22334444 3332222344445 Q ss_pred HHHHH-HHHHHHhccc-----ccccCCc-cchhhHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHhc--C Q lcl|NC_016071. 347 NSRKK-AILDRFGAGF-----INLGNDG-QGSYNLSESKQSIHGHF-------VQRDIDIIVEAFNKNLIPQLLALN--D 410 (516) Q Consensus 347 ~~~d~-~Isk~iLGqt-----Lts~~~~-~GS~Al~~vh~ev~~~~-------~~aDa~~i~~~ln~~li~~lv~lN--~ 410 (516) ++++. .|+.. .+| -|.+.+. -+++||- +...-.... 
+.+-.++...-+.+.|++.+-.+- + T Consensus 355 ~~l~eral~~~--s~tPavA~G~vD~~~~~SGiALe-L~L~PL~a~~~ek~l~l~~~mr~~r~~~~~~lL~~~erl~~~g 431 (563) T protein:vir:74 355 KWIDEKGIAEG--SGTPEVAIGRVDVTSAESGISLE-LQLKPLLAANEEKELEMIVVMDQFLHDWMTMWLPAYESDFQEQ 431 (563) T ss_pred HHHHHHHHHhh--ccCcceeecccccccccchhhhh-hhhhHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhHhhhh Confidence 55443 33321 111 1112111 1224431 122222222 223333333333344443333320 0 Q ss_pred C---cCCccccc-----eEEecCcCchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccccCcccc---- Q lcl|NC_016071. 411 I---RLSDEDMP-----KLKPGLIQEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMSTDELLK---- 478 (516) Q Consensus 411 ~---~~~~~~~P-----~~~~~~~~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~~~~~~---- 478 (516) . ++..+..| .++|...-+.|.++..+-+..|+..|++..- -..+.+.+. |.|.|+-+++-.....+ T Consensus 432 ~~~~~~g~~~~~~~~~v~ivf~p~~P~d~~~vv~~~~tl~~aGiiSre-tAv~~L~~~-g~~~pdae~e~~~ie~~~i~~ 509 (563) T protein:vir:74 432 DGSRPFASADLLNECSVVCIFADPMPVNKTQVTQDTLLLQQAHLILRK-MAVAKLRSI-GWEYPEVDDQGNALTDDDIAD 509 (563) T ss_pred cccccccccccCCceEEEEEeCCCCCccHHHHHHHHHHHHHcCchhHH-HHHHHHHhC-CCCCCcHHHHHhhcCHHHHHH Confidence 0 01112222 4569999999998888888899999987642 112222222 66654322221111100 Q ss_pred -cCCCCCCcccccccccCC-CCCcccccccccchhhhhcC Q lcl|NC_016071. 479 -LLGQDTSRSGDGMTAGSN-GNGTGKISSTRDNSVSNMDN 516 (516) Q Consensus 479 -~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~d~~~~~~~~ 516 (516) ...++...+.-|.+++.. |.++.+. 
-..-|-.-+.-| T Consensus 510 ~~~a~a~ad~~~~~~a~~~~g~~~~~~-dd~g~p~~~~~~ 548 (563) T protein:vir:74 510 MLLAEAEADASLGLSAMDNGGAGEQQF-DDQGNPIDQFGN 548 (563) T ss_pred HHHHHhhccCcccceecccCCCCcccc-cccCCchhHcCC Confidence 011111112222222211 1222111 111222333344 No 228 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=56.28 E-value=0.46 Score=22.43 Aligned_cols=422 Identities=11% Similarity=0.016 Sum_probs=154.7 Q ss_pred CCcccc--CcccccchhhhcccCCCCcccccchHH-------HHHH---HHHHHhhcccccCCcccH----H--HHH-HH Q lcl|NC_016071. 1 MSTRFA--QPSEVVKAGNENLAVSRLRTGELGSGA-------LSQL---RAESEVMKVEELRWPCFL----A--TVE-AM 61 (516) Q Consensus 1 ~~~r~~--~~~~~~~~~~~~p~~~~~~~~e~g~~~-------~~~~---~~~~~~~~~~~lr~~~~~----~--~y~-~m 61 (516) |..-.= .+....+.. +-+.-...-..+.=.-. +..+ ..+-.. ..+-+-..... + .+. .- T Consensus 1 ~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~yY~g-~~~i~~~~~~~~~~~~~~~~~~~~ 78 (468) T protein:vir:96 1 MIDIFWPNEKPYHERVV-EQIKPQYETQEEMILRLITKHKENVEDITVGERYYNH-QPDVLFNAPKRNVKGEIDPFKPDW 78 (468) T ss_pred CccccCCcCceeehhee-ecccccccCcHHHHHHHHHHHHHHHHHHHHHHHHhcC-CCcccccccccccccccccccccc Confidence 222100 000000000 00000000000000000 0000 000000 00000000000 0 000 00 Q ss_pred -hhChHHHHHHHHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHH-HHHHHhhcceeeeEEEe Q lcl|NC_016071. 62 -KQDHTVSTALDTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARS-AATFNEYGFSIFEKVYR 139 (516) Q Consensus 62 -~~D~~v~s~l~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~-~lda~~~G~S~~Eivw~ 139 (516) ...+...-++.+....+.+-+..+.+ + +.++.+.+.+++++ .|.+.+.. +.++.-||.++ +++|. 
T Consensus 79 ki~~n~~~~Iv~~~~~~l~g~p~~~~~----~---d~~~~~~l~~~~~n-----~~~~~~~~~~~~~~~~G~~~-~~v~~ 145 (468) T protein:vir:96 79 RMYTNYHQNLVDQKVAYAVANPVTYGT----E---DEKSLKTIQEVLNH-----KWDDKLVDILTAASNKGVEW-IQPYV 145 (468) T ss_pred ccccchHHHHHHHHHhhhccCCceecc----C---ChHHHHHHHHHHhc-----CHHHHHHHHHHHHhhcCeEE-EEEEE Confidence 01344444455444555555544432 1 23445667777643 25555554 45788899975 56775 Q ss_pred ecccccccccceeeccccccCchhcccccceeec--CCCceeeeccccccccc----cccccc-cccccccc-ccc---- Q lcl|NC_016071. 140 TESAPSKYAGYITIDKIAFRPQSSLSRSKPWVFD--EDGRTLKGIYQSKMAFA----NFQNGL-TQISSAMS-LVT---- 207 (516) Q Consensus 140 ~~~~~~~~~g~~~~~~l~~r~q~ti~~~~~f~~~--~dg~~l~~~~q~~~~~~----~~~~~~-~~~~~~~~-~~~---- 207 (516) .. +|.+.+..+.|+ .+. -.|+ ..++.+..++....... .|.... .......+ ++. T Consensus 146 d~------~~~~~i~~~~p~---~~~----~v~~~~~~~~~~~~ir~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 212 (468) T protein:vir:96 146 DE------QGEFKTFRVPAE---QAI----PIWTNKERDELKAFIRLYELDGGERVEYWTANDVTFYELKDGQLIPDYYQ 212 (468) T ss_pred cC------CCceEEEEEccc---ceE----EEEcCCCCCceEEEEEEEEecCceEEEEEeCCeEEEEEEcCCceeecccc Confidence 33 334433333222 111 0122 12232222221100000 000000 00000000 000 Q ss_pred -------ccccCCCccccccccEEEEeecCcCCccccchhHHHHHHHHHHH-HHHHHHHHHHHhhccccceeeeeccccc Q lcl|NC_016071. 208 -------NLTSSADEVFIPINKLMVMSLGGTESNPAGVSPLVGCYRAFREK-ILIENLETIGASKDLGGIIELKIPSQIL 279 (516) Q Consensus 208 -------~~~~~~~~~~iP~~k~i~~~~~~~~g~p~G~gLlr~~~~~~~fK-~~~~~~w~~~~er~g~~~~v~~~pp~~~ 279 (516) ..........+....++.|. +|+.|.|.+..+- ..+-. 
...+..++..++.+..|+.++++..+ T Consensus 213 ~~~~~~~~~~~~~~~~~~~~iPvv~~~-----n~~~g~sd~e~v~-~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~-- 284 (468) T protein:vir:96 213 GEEHVQAHYYVGNKSMSWNRVPFIPFK-----NNPQEVSDLFMYK-TIIDAMDKRLSDTQNTFDEATELIYVLKGYEG-- 284 (468) T ss_pred cccccccceeeccccccCCcccEEEec-----CCCCCCCchHHHH-HHHHHHHHHHHHHHHHHHHhcCceeeeecCCc-- Confidence 00000001111122233332 4678899887743 33322 33556677788888888887775321 Q ss_pred ccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhc Q lcl|NC_016071. 280 NKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGA 359 (516) Q Consensus 280 ~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLG 359 (516) .+. ..... .+. + ...+.++.. +...++++..... ...+...++.+.+.|...--+ T Consensus 285 ----~~~----~~~~~----~~~----~-~~~i~~~~d------~~~~~~~l~~~~~--~~~~~~~~~~l~~~I~~~s~~ 339 (468) T protein:vir:96 285 ----EDL----EEFMY----NLK----Y-YKAINVDGD------GSGGVDTIQIDVP--VQSAKEYLDMLRDYVIEFGQG 339 (468) T ss_pred ----ccc----chhhh----hhh----c-CceEEecCC------CCCcceEEeecCC--hHHHHHHHHHHHHHHHHHhCc Confidence 111 11111 111 1 112333321 1123555554432 234677889998998877555 Q ss_pred ccccccCCccchhhHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCcCchhHHHHH Q lcl|NC_016071. 360 GFINLGNDGQGSYNLSESKQSIH----GHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLIQEVDMEGFS 435 (516) Q Consensus 360 qtLts~~~~~GS~Al~~vh~ev~----~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~~~~dl~~~a 435 (516) ..++.++.+ | - ++-+..+.. ...+..-.+.+.+.|. ++++.++.+.+... 
+..-..+.|....+.|..+.+ T Consensus 340 p~~~~~~~~-~-n-~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~-~~~~li~~~~g~~~-d~~~i~i~f~~~~p~d~~e~a 414 (468) T protein:vir:96 340 VDFQQDKFG-N-S-PSGIALKFMYSNLDLKANKLKNKTLTALQ-ELLQYIIDFYKLSI-KVQDVEITFNFNVMVNELEQS 414 (468) T ss_pred ccccccccc-c-c-hHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhCCCc-ccceeeEEecCCCCcCHHHHH Confidence 444443222 1 1 111121111 1122333445555563 56777777653222 223357788888888877666 Q ss_pred HHHHHHHhCCcccccHHHHHHHHHHcCC-CCCCCcccccCcccccCCCCCCcccccccccCCCCCccc Q lcl|NC_016071. 436 KFVQRIGAVGYLPKTPTVINKILEVGGF-DEEIPEDMSTDELLKLLGQDTSRSGDGMTAGSNGNGTGK 502 (516) Q Consensus 436 ~~~~~L~~~G~~~~~~~~~~~i~e~~Gl-p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 502 (516) ++ ++..|++. ++.+.+.++. ..+..+=+-...+.... ....++...+ ++..++ T Consensus 415 ~~---~~~~g~iS-----~et~i~~l~~v~D~~~E~~ri~~E~~~~----~~~~~~~~~~--~~~~~~ 468 (468) T protein:vir:96 415 QI---GVNSQYLS-----KETVVTNHPWVDDPVAEMERIDQEELAL----PSIEEGLNGK--ENNEPT 468 (468) T ss_pred HH---HHhcCCCc-----hHHHHHhCCCCCCHHHHHHHHHHHHHHH----HHHhhccCCC--CCCCCC Confidence 65 44568643 3445566533 22211100010110000 0111111111 111111 No 229 >protein:vir:98265 Length: 524 # NCBI annotation: gp20 portal vertex of the head # Family: family:all:1036 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239198;genbank:gi:66391673;genbank:GeneID:3416367 Probab=55.88 E-value=0.47 Score=22.39 Aligned_cols=443 Identities=12% Similarity=0.086 Sum_probs=154.9 Q ss_pred CCccccCcccccchhhhcccCCCCc----ccccchHHHHHHHHHHHh---hcccccC-CcccHHHHHHHhhChHHHHHHH Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLR----TGELGSGALSQLRAESEV---MKVEELR-WPCFLATVEAMKQDHTVSTALD 72 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~----~~e~g~~~~~~~~~~~~~---~~~~~lr-~~~~~~~y~~m~~D~~v~s~l~ 72 (516) ++.-...-..-.+...+-|+.|-.- .-|.|.. ..++.|+.+. 
...+..+ ..+.|+.|++|..+|.|-++++ T Consensus 17 ~~~d~~~~~~~~~~~~~s~~~p~~~dGa~~i~~~~~-~~~~~g~~~~~y~~~e~~~~~~~eLI~~YR~ma~~pEvd~Av~ 95 (524) T protein:vir:98 17 AREDEIELEQQLKNDTGSVAPPKNNDGAYEIETDLN-NQKYAGVFQQFYSGQDPAIQNKEQLINTYRGIMSYPEVENAVS 95 (524) T ss_pred hhhhhhhHhhhhcCCcccccCCCCCCCceeecCCCC-cceecceeeeeccccccccchHHHHHHHHHHHhhccchhhHHH Confidence 1111111111111111111111100 0011100 0122332221 1112223 3468999999999999999999 Q ss_pred HHHHHHhcCC-ce--eeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHHHHHhhcc-----------eeeeEEE Q lcl|NC_016071. 73 TKYVFVTKAF-ND--FKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAATFNEYGF-----------SIFEKVY 138 (516) Q Consensus 73 ~Rk~~v~~~~-w~--i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~lda~~~G~-----------S~~Eivw 138 (516) -.-.-+.-.+ -. |.+.-. +.+.++.+-+.|.+.+ + .+..+|+.--+|| -.+.++. T Consensus 96 eIVneaIv~~~~~~pV~l~L~-~~~~s~~iK~kI~eeF---------~-~Il~ll~F~~~~~~~fR~WYVDgRi~fhkii 164 (524) T protein:vir:98 96 EIIDDAIVNEQGKDIITMDLA-KTNFSKAIQDKIVEEF---------D-NVLNIYDFDNMGARLFRDWYVDSRIYFHKIM 164 (524) T ss_pred hhhcceeEecCCCceEEEEec-ccccchHHHHHHHHHH---------H-HHHHHhccchhhhHHHhhhhhcceeEEEEEE Confidence 8765432111 00 011110 1112233333333322 1 2224444444444 4444444 Q ss_pred eecccccccccceeeccccccCchhcccccceeec-CCCceeeeccccccccccccccccccccccccccccccCCCccc Q lcl|NC_016071. 139 RTESAPSKYAGYITIDKIAFRPQSSLSRSKPWVFD-EDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVF 217 (516) Q Consensus 139 ~~~~~~~~~~g~~~~~~l~~r~q~ti~~~~~f~~~-~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 217 (516) .... +.|-..++.|-|| .|++.|-.... .++..- ..+....+.-|..+.... ...-..-....++. 
T Consensus 165 d~~~----~kGI~ELr~lDPr---~i~~vr~~~~~~~~~~~~--v~~~~~e~f~Y~~~~~~~----~~~g~~~~~~~~ik 231 (524) T protein:vir:98 165 HKDE----SKGIRELRQLDPR---CMELIRESITETLDGGVK--VFRGYREFFVYSAPKAGY----TYNGQIYQANQKIK 231 (524) T ss_pred cCCC----CcceeeeeeeCCc---cceeeeeccccccccchh--hccceeeeeeeccCCCcc----ccccceecCCCcee Confidence 3211 1343333333333 22221111111 111110 011111111111111000 00000112234577 Q ss_pred cccccEEEEeecCcC---CccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccC--CCCHHHHH Q lcl|NC_016071. 218 IPINKLMVMSLGGTE---SNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAI--DPKSPESE 292 (516) Q Consensus 218 iP~~k~i~~~~~~~~---g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~--~~~~~~~~ 292 (516) ||.+- |+|+|..-. ++- .|.|.++..|+==-+.....-. -..+-.+|.+|+=+-.- =|....++ T Consensus 232 I~~dA-Ivy~hSGL~d~~~~i--isyLhkAiKp~NQLkm~EDAlV--------IYRitRAPeRRvFYIDvGnlPk~KAeq 300 (524) T protein:vir:98 232 IPRSA-IVYAHSGLEDCSNNI--IGYLHRAVKPANQLRLLEDAMV--------IYRITRAPERRVFYIDVGQMGGNKATQ 300 (524) T ss_pred echhh-eeeeccCcccCCCCe--eeehhHhhHhHHhhHHHHhhHH--------HHhhhccccceEEEEecCCCCchhHHH Confidence 88765 778876432 221 3788888877643333222111 11222233333221111 12222222 Q ss_pred HHHHHHHHHHHhhc----ccceEE--------------EeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHH Q lcl|NC_016071. 293 MVQGLMADAANAHA----GEQAYF--------------ILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAIL 354 (516) Q Consensus 293 ~l~~l~~~~~~~~~----g~~a~~--------------iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Is 354 (516) -+..++..++. ....|- -||.= +.....+|+-+ .|+... 
.-.+=|+|..+.+- T Consensus 301 ---Yl~~im~k~kNklvYDa~TGevrddrk~msMlEDyWLpRR---eGgrgTEItTL--pggqnl-gem~DV~YF~kkLy 371 (524) T protein:vir:98 301 ---YVNNIAQGLKNRVVYDARTGTVKNQQNNLSMTEDYWLMRR---DGKAITEVSTL--PGGQNF-SDMDDIKWFNRKLY 371 (524) T ss_pred ---HHHHHHHhcCceeEeeccCceeeccccccchhhhhccccc---CCCCccceeec--cccCCc-ChHHHHHHHHHHHH Confidence 23444433321 001111 12210 01111233333 222222 23345799999998 Q ss_pred HHHhcccccccCCccchhhH---HHH-HHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCc---cccceEEecCc Q lcl|NC_016071. 355 DRFGAGFINLGNDGQGSYNL---SES-KQSI-HGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSD---EDMPKLKPGLI 426 (516) Q Consensus 355 k~iLGqtLts~~~~~GS~Al---~~v-h~ev-~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~---~~~P~~~~~~~ 426 (516) +++--..--.+.++ |+..+ +++ -.|+ |...+..-...+...|..-|-..|+.=+. --++ .--+.+.|+.. T Consensus 372 ~aLnVP~sRl~~~~-~~f~~Gr~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgi-it~eew~~i~~~I~~~f~ 449 (524) T protein:vir:98 372 EALRVPLSRMPRDD-GGMQIGGGGEITRDELKFSKFIRTLQIQFSPVLSDPLKTNLIAKKI-ITEDEWEENVSKISFVFQ 449 (524) T ss_pred HHhCCCceeccCCC-CccccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcC-CCHHHHHHHhhcceEEEe Confidence 88877664443222 22222 222 2233 23334444444444454333333332221 1111 11133444433 Q ss_pred Cc------hhHHHHHHHHHHHHhCCcccccHHHHHHHHHHc-CCCCCCC--cccccCcccc-c-CCCCCCccccc Q lcl|NC_016071. 427 QE------VDMEGFSKFVQRIGAVGYLPKTPTVINKILEVG-GFDEEIP--EDMSTDELLK-L-LGQDTSRSGDG 490 (516) Q Consensus 427 ~~------~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~-Glp~~~~--~~~~~~~~~~-~-~~~~~~~~~~~ 490 (516) .+ .+.+-+.+++..|..+-=.+-...+.+||++.+ .+...+- .+...+.+.+ + -+++...-.+. 
T Consensus 450 ~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~dyi~k~ILr~tDeei~~~~k~I~~E~k~~~~~~p~~e~~~f 524 (524) T protein:vir:98 450 QDSYYAEVKDIEILERRLNLMSQVEGVVGKYVSHKYIMKEILRMSDEDIDEQAKLIEEESKEERFKNPEAEEENF 524 (524) T ss_pred ecchHHHHHHHHHHHHHHHHHHHhccccccccchHHHHHHHhccCHHHHHHHHHHHHHHHhCCCCcCCccccccC Confidence 33 333444445555544322222345678887654 4442111 1111111111 0 11111111111 No 230 >protein:vir:103219 Length: 201 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277473;genbank:gi:71834115;genbank:GeneID:3562330 Probab=55.03 E-value=0.49 Score=22.29 Aligned_cols=192 Identities=11% Similarity=0.034 Sum_probs=71.4 Q ss_pred HHHHHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEeccCcccccccccce Q lcl|NC_016071. 249 REKILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILPSDMNAQGGEQYKM 328 (516) Q Consensus 249 ~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~i 328 (516) .+|-.++. ....+. +.+ +.+-.+++.+.+.-.. +++| .+.+ .++ T Consensus 1 V~k~~~l~------------------------~~~~~~---~~~-~~~r~~~~~~~~~~~~-~~~l-d~~~------e~~ 44 (201) T protein:vir:10 1 MWKAKGLA------------------------DLCDDS---DGA-ARLRLAQVDNNSGVGQ-AIGI-DADS------EEY 44 (201) T ss_pred CccchHHH------------------------HHhcCC---hHH-HHHHHHHHHHhhhhhh-hhee-ecCC------cce Confidence 11211110 001111 112 2222233333322111 1222 2211 234 Q ss_pred eeeeccccCcchhHHHHHHHHHHHHHHHHhccccc--ccCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016071. 
329 SLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFIN--LGNDGQGSYNLSESKQSIHGHFVQRDIDIIVEAFNKNLIPQLL 406 (516) Q Consensus 329 el~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLt--s~~~~~GS~Al~~vh~ev~~~~~~aDa~~i~~~ln~~li~~lv 406 (516) +.+..+-|| ...++...-.+||-+ .+-.+| .+.+-+|=-|.|+--....-+.+++.......-+.+.|++.++ T Consensus 45 e~~~~~lsG----l~d~l~~~~~~iaa~-s~iP~t~LfG~sp~Glnatge~d~~nyyd~i~~~Qe~~l~p~le~l~~~~~ 119 (201) T protein:vir:10 45 NVLNSDIGG----IDTFLSQKFDRIVAL-SGIHEIILKGKNVGGVSASQNTALETFYGYVDRKRKAELLPLLEFLLPFIV 119 (201) T ss_pred eeeecCcCC----hHHHHHHHHHHHHhH-hcCchhhhcCCCCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 555444443 445666666666654 333333 2223334335566566677777777765443333333444221 Q ss_pred HhcCCcCCccccceEEecCc------Cchh-HHHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCCCcccccCccccc Q lcl|NC_016071. 407 ALNDIRLSDEDMPKLKPGLI------QEVD-MEGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEIPEDMSTDELLKL 479 (516) Q Consensus 407 ~lN~~~~~~~~~P~~~~~~~------~~~d-l~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~~~~~~~~~~~~~ 479 (516) .+.. -.|.|... +..+ .+..|++++++++.|++.++ ...+++++.-.-+.-.+ ...+.... T Consensus 120 ------~~~~--~~~~f~pL~~~s~kekAei~~~~a~a~~~~~~~g~i~~~-e~r~~L~~~~~~~~~~~--~~~~~~~~- 187 (201) T protein:vir:10 120 ------TEQE--WSVEFNPLSQVSDKDKSEILEKNVNSVAALIAAGIIDAD-EARDTLRAISTEVKIGE--GSIQTEVV- 187 (201) T ss_pred ------CCCC--ceEeeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHH-HHHHHHHhcCCcCCCCC--CCCCcccc- Confidence 1111 12333222 2222 24557789999999998773 33444444211111000 00000000 Q ss_pred CCCCCCcccccccccCCCCC Q lcl|NC_016071. 480 LGQDTSRSGDGMTAGSNGNG 499 (516) Q Consensus 480 ~~~~~~~~~~~~~~~~~~~~ 499 (516) ..+...|... +-+. 
T Consensus 188 ~~e~~dp~~~------~~~~ 201 (201) T protein:vir:10 188 INESEDPLDV------SANN 201 (201) T ss_pred ccccCCCCCC------CCCC Confidence 0000000000 0000 No 231 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=46.56 E-value=0.73 Score=21.33 Aligned_cols=447 Identities=9% Similarity=-0.043 Sum_probs=157.4 Q ss_pred CCccccCcccccch-hhhcccCCCCcccccchHHHHHHHHHHHhhccccc-CC-cccHHHH-HHH-hhChHHHHHHHHHH Q lcl|NC_016071. 1 MSTRFAQPSEVVKA-GNENLAVSRLRTGELGSGALSQLRAESEVMKVEEL-RW-PCFLATV-EAM-KQDHTVSTALDTKY 75 (516) Q Consensus 1 ~~~r~~~~~~~~~~-~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~l-r~-~~~~~~y-~~m-~~D~~v~s~l~~Rk 75 (516) |+...- .+-+.+- ....| + +.....+-... .+.+ +. .....-. ... ...+...-++.+.. T Consensus 22 l~~~~i-~~li~~~~~~~~~---r----------~~~l~~YY~g~-~~~i~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~ 86 (506) T protein:vir:94 22 LTPNKI-MKFITHHFNYQRP---R----------LEMLDDYYQGY-NLKILDKQSRRHEDGKADHRATHSFAKYIADFQT 86 (506) T ss_pred CCHHHH-HHHHHHHHHHHHH---H----------HHHHHHHhcCC-CccccccccccccccCCcceeecchHHHHHHHhh Confidence 221100 0000000 00000 0 00111111001 1110 00 0000000 000 12556666666666 Q ss_pred HHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHH-HHHHHhhcceeeeEEEeecccccccccceeec Q lcl|NC_016071. 76 VFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARS-AATFNEYGFSIFEKVYRTESAPSKYAGYITID 154 (516) Q Consensus 76 ~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~-~lda~~~G~S~~Eivw~~~~~~~~~~g~~~~~ 154 (516) ..+.+-+..+.+. ++ ...+.+..+++.- .|...+.. ..++..+|.+. +.+|... +|.+.+. 
T Consensus 87 ~~l~G~p~~~~~~---d~----~~~~~l~~~~~~N----~~~~~~~~~~~~~~~~G~a~-~~v~~de------d~~~~i~ 148 (506) T protein:vir:94 87 SYSVGNPINVKLP---DD----GSNSGFDTFNKAN----DVDAENYDLFLDMSRYGRAY-EYVYRGE------DNEEHLA 148 (506) T ss_pred hhhcccCceeecC---cc----hHHHHHHHHHhcc----CHhHHHHHHHHHHHhcCeEE-EEEEecC------CCeeEEE Confidence 6666666555432 12 2345566666542 25555544 45688899854 6777533 3444443 Q ss_pred cccccCchhcccccceeecC--CCceeeeccccccccccc-----cccccccccccccccccccC-----CCccccc--c Q lcl|NC_016071. 155 KIAFRPQSSLSRSKPWVFDE--DGRTLKGIYQSKMAFANF-----QNGLTQISSAMSLVTNLTSS-----ADEVFIP--I 220 (516) Q Consensus 155 ~l~~r~q~ti~~~~~f~~~~--dg~~l~~~~q~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~-----~~~~~iP--~ 220 (516) .+.|+.-. ..|++ +++.+..++-.......- ...+...-............ ..+..-| . T Consensus 149 ~~~p~~~~-------~v~dd~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~g~ 221 (506) T protein:vir:94 149 KLDPLDTF-------VIYSTDVDPKPIMAVRYHQIELVDDNQVSTINYVPETWTADTYTLYNPTPIMGKMQVDTTKPITT 221 (506) T ss_pred EEcccceE-------EEecCCCCCceEEEEEEEeeeeccCCceeEEEEEEEEEeCceEEEeccccCccceeccccccCCc Confidence 33332110 12222 222333222110000000 00000000000000000000 0011111 1 Q ss_pred ccEEEEeecCcCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccCC-----CCHH--HHHH Q lcl|NC_016071. 221 NKLMVMSLGGTESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAID-----PKSP--ESEM 293 (516) Q Consensus 221 ~k~i~~~~~~~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~-----~~~~--~~~~ 293 (516) --++.|+ +|+.|.|.+..+-...=-=+..+..++..++.+..++.++++.+....+.... +... .... 
T Consensus 222 vPvv~~~-----n~~~~~sd~e~~~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 296 (506) T protein:vir:94 222 FPVVEFK-----NSNFRLGDFENVLPLIDLYDAAQSDTANYMTDLNEAMLIIQGDIDTLFEGSDMMNTIDPNDEDAMAKL 296 (506) T ss_pred cceEEec-----CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhHHHHHhcCccccccchhcccccccccccccccc Confidence 1223333 24456666665543221113344556666666677776766644322111000 0000 0000 Q ss_pred HHHHHHHHHHhhcccceEEEeccCcccccc-cccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchh Q lcl|NC_016071. 294 VQGLMADAANAHAGEQAYFILPSDMNAQGG-EQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSY 372 (516) Q Consensus 294 l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~-e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~ 372 (516) ......+++.... .-.+.++.+...... ....++++..+.. ...+...++.+.+.|.+.--...++.++.++.+ T Consensus 297 ~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~d~~~l~~~~~--~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~- 371 (506) T protein:vir:94 297 AKDKLELIKEMKD--ANMLLLKSGMTVNGTQTSVDAKYINKTYD--VVGSEAYKKRVAGDIHKFSHTPDLTDENFASNS- 371 (506) T ss_pred ccchhHHHhhhhh--cCeeeecccccccCccccccceeeeecCC--HHHHHHHHHHHHHHHHHHhCccccccccccccc- Confidence 0000111111111 112233333221111 1223444443322 234667888889999877655555544322221 Q ss_pred hHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHh----cCCcCCccccceEEecCcCchhHHHHHHHHHHHHhC Q lcl|NC_016071. 373 NLSESKQSIHGHF----VQRDIDIIVEAFNKNLIPQLLAL----NDIRLSDEDMPKLKPGLIQEVDMEGFSKFVQRIGAV 444 (516) Q Consensus 373 Al~~vh~ev~~~~----~~aDa~~i~~~ln~~li~~lv~l----N~~~~~~~~~P~~~~~~~~~~dl~~~a~~~~~L~~~ 444 (516) +-+.-...... +..-.+.+.+.|. ++++.++.+ |...-.+..-..+.|...-+.|..+.++++.+|. 
T Consensus 372 --Sg~Aik~~~~~l~~k~~~k~~~~~~~l~-~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~e~a~~~~kl~-- 446 (506) T protein:vir:94 372 --SGVAMQYKVLGTVELASTKRRMFERGLY-ARYQIISDIENSIHGDWTFDPQELTFTFRDNLPADNISQIKALVQAG-- 446 (506) T ss_pred --hHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhcCCccccccccceEEeCCCCCcCHHHHHHHHHHHh-- Confidence 11111111111 1222233444443 355554443 2111112223678899999999999999999984 Q ss_pred CcccccHHHHHHHHHHcCC-CCCCCcccccCcccc-cCCCCCCcccccccccCCCCCcccccccccchhh Q lcl|NC_016071. 445 GYLPKTPTVINKILEVGGF-DEEIPEDMSTDELLK-LLGQDTSRSGDGMTAGSNGNGTGKISSTRDNSVS 512 (516) Q Consensus 445 G~~~~~~~~~~~i~e~~Gl-p~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 512 (516) |++. .+.+.+.++. +.+..+-+-...+.. ...........+.. +.+.+...-.++... T Consensus 447 g~iS-----~et~~~~lp~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~e~~ 506 (506) T protein:vir:94 447 ATLP-----QKYLYQQLPGVTNPQDIVDMMKEQSANGDYSFDQNGVISND-----GQTNTTATQTDEEVR 506 (506) T ss_pred ccCC-----hHHHHHhCCCCCCHHHHHHHHHHHHHHHhhcchhhcCCCcc-----cCccccccccccCCC Confidence 6533 3455666633 322211000101100 00000000000000 000001111111111 No 232 >protein:vir:96783 Length: 488 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224240;genbank:gi:62362375;genbank:GeneID:3345722 Probab=45.41 E-value=0.77 Score=21.20 Aligned_cols=417 Identities=9% Similarity=0.030 Sum_probs=154.2 Q ss_pred CCcc----ccCcccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccC-------CcccHH--------HHHH- Q lcl|NC_016071. 1 MSTR----FAQPSEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELR-------WPCFLA--------TVEA- 60 (516) Q Consensus 1 ~~~r----~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr-------~~~~~~--------~y~~- 60 (516) ++-| .. ..+-..-..-.|.-..++.+ |+.+++. ..+...|... .-..+. .|+. 
T Consensus 7 ~~~~~~~m~V-~~~hp~y~a~~~~W~~~~d~--g~~~~k~----~g~~YLPk~~~~~~~~~~d~~y~~~~~~~~~~y~~~ 79 (488) T protein:vir:96 7 IKHRGFFMLT-PIYHPDYLVNAPQWLRNLDC--VMDNIKR----KKQTYLPNLGAIPPEAKTDPKVTALAAKIEKDWEDL 79 (488) T ss_pred Eeecceeecc-cccCHHHHHHhhhhhHhhhh--hhHHHHH----hhhhcCCCCCCccccccCcchhhhhhccchhhhHhh Confidence 2222 10 00111111111222222221 2222211 1111222110 000111 1221 Q ss_pred Hh----hChHHHHHHHHHHHHHhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccC-cCCHHHHHHHHH-HHHhhcceee Q lcl|NC_016071. 61 MK----QDHTVSTALDTKYVFVTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLAN-QQTLRDIARSAA-TFNEYGFSIF 134 (516) Q Consensus 61 m~----~D~~v~s~l~~Rk~~v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~-~~~~~~~l~~~l-da~~~G~S~~ 134 (516) .. -=+++.-.++.--..|.+.++.++.+ ++ +++.. ++++... ..+++.++++++ .++.||.+.+ T Consensus 80 ~~~rA~~~n~~~~tl~~l~G~vfrk~p~~~~~---~~---~~l~~----l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~i 149 (488) T protein:vir:96 80 TWRLANYVNIVNPTMNAITGAVMRREPEFDTM---DN---PVLIG----LRDNIDGKGNGIDQECKQALNALQWGSRCGW 149 (488) T ss_pred hhhccccCchhHHHHHHhcchhhccCceeccC---Cc---HHHHH----HHhccCCCCCCHHHHHHHHHHHHHhcCeEEE Confidence 11 13566666666666677666655421 11 22333 3343322 246888888866 4888999887 Q ss_pred eEEEeecccc--------cccccceeeccccccCchhcccccceeecCCCc---e-eeecccccc---cccccccccc-- Q lcl|NC_016071. 135 EKVYRTESAP--------SKYAGYITIDKIAFRPQSSLSRSKPWVFDEDGR---T-LKGIYQSKM---AFANFQNGLT-- 197 (516) Q Consensus 135 Eivw~~~~~~--------~~~~g~~~~~~l~~r~q~ti~~~~~f~~~~dg~---~-l~~~~q~~~---~~~~~~~~~~-- 197 (516) =+-+-.++.+ .+| + +....+..|-. |.++..|+ + .+.++.... .+........ 
T Consensus 150 lVD~P~~~~T~ade~~~~~rP--y-----~~~~~a~~Iin---W~~~~v~G~~~L~~v~lrE~~~~~D~~~~~~~~~~~~ 219 (488) T protein:vir:96 150 LVRSHPESATMADWNKGKKLP--T-----AAFYDALHIID---WEVEYIDGEEKLTYLSLLEDYQERDGGTYVSKQRLIN 219 (488) T ss_pred EEecCCCcCCHHHHHHhcCCc--E-----EEEechhhhcC---cceeccCCceeeEEEEEEEEEEeccCCCcccceEEEE Confidence 6655322110 011 1 11112222221 44443221 0 011111000 0000000000 Q ss_pred --ccccccccccccccCCCccccc---------cccEEEEeecCcCCccccchhHHHHHH----HHHHHHHHHHHHHHHH Q lcl|NC_016071. 198 --QISSAMSLVTNLTSSADEVFIP---------INKLMVMSLGGTESNPAGVSPLVGCYR----AFREKILIENLETIGA 262 (516) Q Consensus 198 --~~~~~~~~~~~~~~~~~~~~iP---------~~k~i~~~~~~~~g~p~G~gLlr~~~~----~~~fK~~~~~~w~~~~ 262 (516) .....+.......+...+...| .=-|+.+. ....+-..+...|..++. ||-..-. -.+..+. T Consensus 220 ~~l~~g~~~v~~~~~~~~~~e~~~~~~g~~~l~~IP~v~~~-~~~~~~~~~~pPLldLA~lnl~Hy~~ssd--~~~il~~ 296 (488) T protein:vir:96 220 HRLVDGLCEFQEVTDDEYSDEWTPVLINSKQSDTIPFFLAS-SQSNEWCIDSTPLTSLAEISLSIYVMNAY--SNKAMIL 296 (488) T ss_pred EEEECcEEEEEEEecCCcccceEeecCCCcccCeeEEEEEe-cCCCCCCCCCCchHHHHHHHHHHHhhhhH--HHHHHHh Confidence 0000111111111111111122 11244332 222333345544445544 3333221 1122222 Q ss_pred hhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhH Q lcl|NC_016071. 263 SKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYST 342 (516) Q Consensus 263 er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~ 342 (516) . +.|+++.++. ..++...+. .. ...+..|.+.....|.| .+++++++++ ... 
T Consensus 297 ~--~~p~lv~~~~-------~~~~~~~~~-----~~--~~g~~~~~~~~~~~~~g---------~~~~~e~~~~---~l~ 348 (488) T protein:vir:96 297 A--NEAKWMVDMG-------DMNKTMASE-----MN--PLGFTLAGRMPYYVKNG---------DVKVIQAQFS---PET 348 (488) T ss_pred c--CCceeeeccC-------CCCcccccc-----cc--cceeeecccccccccCC---------ceeecCCchh---HHH Confidence 2 2333333211 111111110 00 00111233333333444 3555655443 224 Q ss_pred HHHHHHHHHHHHHHHhcccccccCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCC--ccccce Q lcl|NC_016071. 343 QELVNSRKKAILDRFGAGFINLGNDGQGSYNLSESKQSIHGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLS--DEDMPK 420 (516) Q Consensus 343 ~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~vh~ev~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~--~~~~P~ 420 (516) ++.++...++| ..+.+.+++.+...+++.+ .........++.+-+..+++.++ ++++++..+-+.... ...-+. T Consensus 349 ~~~l~~l~~qm-~~~Ga~l~~~~~~~Ta~~~--~~~~~~~~S~L~~~a~~le~al~-~~l~~~A~w~g~~~~~~~~~~~~ 424 (488) T protein:vir:96 349 ENKVEKLFEQA-VKVGASLFTQQSNETATGA--AIRSGSSTASMATLGNNVEDTVR-NMLRFIMRYFEGTNLYVNPDELV 424 (488) T ss_pred HHHHHHHHHHH-HHHhHhhccCCCcchHHHH--HHHHHHhhHHHHHHHHHHHHHHH-HHHHHHHHHcCCCCCCcCccceE Confidence 55566666676 2333355543222223222 22334446678888899999996 588998887532211 111234 Q ss_pred EEecC--c-CchhHHHHHHHHHHHHhCCcccccHHHHHHHHHHcCC--CCCCCcccccCcccccCCCCCCcccccc Q lcl|NC_016071. 421 LKPGL--I-QEVDMEGFSKFVQRIGAVGYLPKTPTVINKILEVGGF--DEEIPEDMSTDELLKLLGQDTSRSGDGM 491 (516) Q Consensus 421 ~~~~~--~-~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Gl--p~~~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (516) |.... . ...| ....+++-++...|.+.. ....+++++ -|+ |....+++ .+..+. 
.+-|+ T Consensus 425 ~~in~dF~~~~ld-~~~~~al~~~~~~G~Is~-~t~~~~L~~-~gvl~~d~~~e~~-~~~ie~--------~g~~~ 488 (488) T protein:vir:96 425 FKLNRDYFDVEVN-PQMLQVAYAAMMEGNLPQ-VSWFELLKR-ARVVRGDMSKEEF-DEHIAE--------LGFGM 488 (488) T ss_pred EEeccCCCCccCC-HHHHHHHHHHHhcCCCCH-HHHHHHHHh-CCcCCccCCHHHH-HHHHhh--------cCCCC Confidence 54331 1 1223 334567778888898664 334455544 455 33222222 111111 11112 No 233 >protein:vir:78589 Length: 695 # NCBI annotation: NUDIX hydrolase # Family: family:all:297 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294854;genbank:gi:149882917;genbank:GeneID:5291060 Probab=42.66 E-value=0.88 Score=20.90 Aligned_cols=439 Identities=10% Similarity=0.030 Sum_probs=163.3 Q ss_pred CCccccCc----ccccchhhhcccCCCCcccc-----cchHHHHHHHHHHHhhc-----ccccCC-----cccHHHHHHH Q lcl|NC_016071. 1 MSTRFAQP----SEVVKAGNENLAVSRLRTGE-----LGSGALSQLRAESEVMK-----VEELRW-----PCFLATVEAM 61 (516) Q Consensus 1 ~~~r~~~~----~~~~~~~~~~p~~~~~~~~e-----~g~~~~~~~~~~~~~~~-----~~~lr~-----~~~~~~y~~m 61 (516) |-.|-+-- .+|.+ | .|.+|+.. .+..+...-+...+.+. -..|.| --+|-+...| T Consensus 46 ~~~~~~~~~~~~~~~~~-----~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~Gy~~la~l 119 (695) T protein:vir:78 46 MGRRGALNALDAAPVAE-----P-SPSLRLARQFEVDVSNYTPRERRAASYALDFNGTSMDALSFVTSSGFPGFPTLVLL 119 (695) T ss_pred hcccccccccccccccC-----C-CcccccceeceeccccCCccccchhhhhhcccccccccchhhhccCcchHHHHHHH Confidence 33332211 11111 1 22333331 11111111111111111 011211 1134555666 Q ss_pred hhChHHHHHHHHHHHHHhcCCceeeeCCCCCCh---------------hhHHHHHHHHHHHhhccCcCCHHHHHHHHHHH Q lcl|NC_016071. 62 KQDHTVSTALDTKYVFVTKAFNDFKVLYNRDSK---------------ASKDAAEFVEYALKNLANQQTLRDIARSAATF 126 (516) Q Consensus 62 ~~D~~v~s~l~~Rk~~v~~~~w~i~~~~~~d~~---------------~~~~~a~~v~~~l~~~~~~~~~~~~l~~~lda 126 (516) ..-+.+.++....-....+ +|. ++..+..+. .+.+..+.++..++++.. 
|..+...+-.+ T Consensus 120 aQ~~eyr~~~~~ia~e~~R-~w~-~~~~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erL~V---~~~l~eaik~a 194 (695) T protein:vir:78 120 AQLPEYRAMHEVLADECIR-TWG-EAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRI---RDAVRTTVIHD 194 (695) T ss_pred hhccchhhHHHHHHHHhhc-ccc-eeccccchhhhhhcccccccccccccHHHHHHHHHHHHHHHH---HHHHHHHHHhh Confidence 6778888888887776654 483 332222221 122556778888887753 44555555579 Q ss_pred HhhcceeeeEEEeecccccccccceee----------ccccccCchhcccccceeecCCCceeeeccccccccccccccc Q lcl|NC_016071. 127 NEYGFSIFEKVYRTESAPSKYAGYITI----------DKIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGL 196 (516) Q Consensus 127 ~~~G~S~~Eivw~~~~~~~~~~g~~~~----------~~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~ 196 (516) .+||=++. +...++..-..+.-+.+ +.|.+.. +||.....-+. .++....++.+.+ T Consensus 195 RlfGGa~~--~i~i~gdd~~l~~PL~~~~~~I~kGslKGl~ViD-------p~~vtP~~~n~-----~dP~spdfgkP~~ 260 (695) T protein:vir:78 195 QAFGRAHP--YFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVE-------PYWVTPNNYNS-----INPVADDFYKPST 260 (695) T ss_pred ccccceEE--EEEeccCccccccccccccccccCcceeeeEeec-------ccccccchhhh-----ccchhhccCCCce Confidence 99999983 33333321100111100 1111111 23322211000 1122222222222 Q ss_pred cccccccccccccccCCCccccccccEEEEeecC------cCCccccchhHHHHHHHHH---HHHHHHHHHHHHHhhccc Q lcl|NC_016071. 197 TQISSAMSLVTNLTSSADEVFIPINKLMVMSLGG------TESNPAGVSPLVGCYRAFR---EKILIENLETIGASKDLG 267 (516) Q Consensus 197 ~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~------~~g~p~G~gLlr~~~~~~~---fK~~~~~~w~~~~er~g~ 267 (516) +.+. |..|=..+++.|+-.+ -.-+.+|.++...++..+. -.+.... ++.+. +... 
T Consensus 261 y~V~--------------G~kIH~SRL~~f~g~plPd~LKp~y~~~GiSv~q~~~e~V~~~~rT~~~v~-~Li~~-~~v~ 324 (695) T protein:vir:78 261 WWMI--------------GTEVHATRLHTIVSRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVS-DIVKQ-FSVS 324 (695) T ss_pred EEEe--------------ceEEeeeeEEEecCCCchhhhhcccccCcccHHHHHHHHHHHHHHHHhHHH-HHHHh-hhhH Confidence 2211 1123333444333221 1235678898888875322 1111111 11110 0000 Q ss_pred cceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcchhHHHHHH Q lcl|NC_016071. 268 GIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVN 347 (516) Q Consensus 268 ~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~ 347 (516) +++.- +.+.-.+. .+.+...+ .++++.++. .....+|=++. .+.+.++.+=| ....+|. T Consensus 325 ---~lk~d---la~~L~~g--~~~~l~~R-~eli~~~Rs-n~G~~llDk~~-------Eefeq~stslS----GLddVi~ 383 (695) T protein:vir:78 325 ---GILMD---LAQALMPG--ANVDLSMR-AELINRYRD-NRNILFLDKAT-------EEFFQFNTPLS----GLDALQA 383 (695) T ss_pred ---HHHHH---HHHhhcCh--hHHHHHHH-HHHHHHhcC-ccceEEEecCC-------cceEEEecccC----CHHHHHH Confidence 00000 00011111 11222222 244444442 23333442222 23344433222 2455666 Q ss_pred HHHHHHHHHHhccccc--ccCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceE--Ee Q lcl|NC_016071. 348 SRKKAILDRFGAGFIN--LGNDGQGSYNLSESKQSIHGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKL--KP 423 (516) Q Consensus 348 ~~d~~Isk~iLGqtLt--s~~~~~GS~Al~~vh~ev~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~--~~ 423 (516) -.-.+||-+. +-.+| .+.+-.|=-|-|+--..+.-|.+++........+-+.|+.-| .+.. ++. 
..|.| +| T Consensus 384 qf~q~VAgaa-~IPltkLfGqSPkGlNATGE~D~rnYYD~I~s~Qe~~L~p~L~rl~~ii-~rS~-~G~--idpdi~~~f 458 (695) T protein:vir:78 384 QAQEQMSAVS-HIPLIKLLGITPTGLNASSEGEIRVWYDYVRAYQRNALQQLMNDVIVMI-QLSL-FGA--VDPSIKWQW 458 (695) T ss_pred HHHHHHHhhh-cCchhhhhccCCccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHh-cCC--CCCcceEEe Confidence 6666666442 22222 122223433556655666666666655433333223343333 2221 222 23444 44 Q ss_pred cCcCchh------H-HHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCC------C-ccccc-Ccc--cccCCCCCCc Q lcl|NC_016071. 424 GLIQEVD------M-EGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEI------P-EDMST-DEL--LKLLGQDTSR 486 (516) Q Consensus 424 ~~~~~~d------l-~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~------~-~~~~~-~~~--~~~~~~~~~~ 486 (516) ....+-+ + +..|++++.+++.|++.++ .+++++.-.+.- + +|++. +.. .........+ T Consensus 459 nPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~-----evr~rL~~d~~s~Y~~~~D~~d~p~~~~~~~~~~~~~~~~~ 533 (695) T protein:vir:78 459 NALRELDDLEVAESRYKQAQSDVLYVQEQVIRPD-----QVAARLNTEPDGPYAGKLDANDDPGVPADDDIDGVLTYVQR 533 (695) T ss_pred CCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHH-----HHHHHHhcCCCcccccccccccCCCcCccchhhhhHhhhcC Confidence 3222222 2 3346678899999998873 567775543211 0 11100 000 0000011112 Q ss_pred ccccccccCCCCCcccccccccchhhhhcC Q lcl|NC_016071. 487 SGDGMTAGSNGNGTGKISSTRDNSVSNMDN 516 (516) Q Consensus 487 ~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~ 516 (516) .+++...+.+++ +.+.-+..-+.+| T Consensus 534 ~~~~~~~~~~~~-----~~~g~~~~~~~~~ 558 (695) T protein:vir:78 534 LAEGGDTGAPGG-----ARAGATAPPTVAN 558 (695) T ss_pred cccccccCCCCC-----CCCCCCCCCceee Confidence 222222222221 1111112222222 No 234 >protein:vir:3648 Length: 695 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705643;genbank:gi:23752328;genbank:GeneID:955749 Probab=39.24 E-value=1 Score=20.52 Aligned_cols=444 Identities=9% Similarity=0.032 Sum_probs=161.9 Q ss_pred CC---------ccccCcccccchhhhcc---cCCCCcccc-----cchHHHHHHHHHHHhhc-----ccccCC-----cc Q lcl|NC_016071. 
1 MS---------TRFAQPSEVVKAGNENL---AVSRLRTGE-----LGSGALSQLRAESEVMK-----VEELRW-----PC 53 (516) Q Consensus 1 ~~---------~r~~~~~~~~~~~~~~p---~~~~~~~~e-----~g~~~~~~~~~~~~~~~-----~~~lr~-----~~ 53 (516) +. ..+++.+-..-.. ..| ..|.+|+.. .+..+...-+...+.+. -..|.| -- T Consensus 33 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~ 111 (695) T protein:vir:36 33 IATAAAAQPVPADFARRGALNALD-AAPVVEPSPSLRLARQFEVDVSNYTPRERRAASYALDFNGTSMDALSFVTSSGFP 111 (695) T ss_pred hhhhccccccchhhhhcccccccc-cccccCCCcccccceeceecccccCccccchhhhhhcccccccccchhhhccCcc Confidence 11 1111111111000 011 122233321 11111111111111111 011211 11 Q ss_pred cHHHHHHHhhChHHHHHHHHHHHHHhcCCceeeeCCCCCCh---------------hhHHHHHHHHHHHhhccCcCCHHH Q lcl|NC_016071. 54 FLATVEAMKQDHTVSTALDTKYVFVTKAFNDFKVLYNRDSK---------------ASKDAAEFVEYALKNLANQQTLRD 118 (516) Q Consensus 54 ~~~~y~~m~~D~~v~s~l~~Rk~~v~~~~w~i~~~~~~d~~---------------~~~~~a~~v~~~l~~~~~~~~~~~ 118 (516) +|-+...|..-+.+.++....-....+ +|. ++..+..+. .+.+..+.++..++++.. |.. T Consensus 112 Gy~~la~laQ~~eyr~~~~~ia~e~~R-~w~-~~~~~~~e~~~~~g~~~~~~~~~~~d~dqik~L~~e~erL~V---~~~ 186 (695) T protein:vir:36 112 GFPTLVLLAQLPEYRAMHEVLADECIR-TWG-EAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRI---RDA 186 (695) T ss_pred hHHHHHHHhhccchhhHHHHHHHHhhc-ccc-eecccchhhhhhccccccccccccCchHHHHHHHHHHHHHHH---HHH Confidence 345556666778888888887766654 483 332222221 122556778888887753 445 Q ss_pred HHHHHHHHHhhcceeeeEEEeecccccccccceee----------ccccccCchhcccccceeecCCCceeeeccccccc Q lcl|NC_016071. 119 IARSAATFNEYGFSIFEKVYRTESAPSKYAGYITI----------DKIAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMA 188 (516) Q Consensus 119 ~l~~~lda~~~G~S~~Eivw~~~~~~~~~~g~~~~----------~~l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~ 188 (516) +...+-.+.+||=++. +...++..-..+.-+.+ +.|.+.. +||.....-+. .++.. 
T Consensus 187 l~eaik~aRlfGGa~~--~i~i~gdd~~l~~PL~~~~~~I~kGslKGl~ViD-------p~~vtP~~~n~-----~dP~s 252 (695) T protein:vir:36 187 VRTTVIHDQAFGRAHP--YFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVE-------PYWVTPNNYNS-----INPVA 252 (695) T ss_pred HHHHHHhhccccceEE--EEEeccCccccccccccccccccCcceeeeEeec-------ccccccchhhh-----ccchh Confidence 5555557999999983 33333321100111100 1111111 23322211000 11222 Q ss_pred cccccccccccccccccccccccCCCccccccccEEEEeecC------cCCccccchhHHHHHHHHH---HHHHHHHHHH Q lcl|NC_016071. 189 FANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGG------TESNPAGVSPLVGCYRAFR---EKILIENLET 259 (516) Q Consensus 189 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~------~~g~p~G~gLlr~~~~~~~---fK~~~~~~w~ 259 (516) ..++.+.++.+. |..|=..+++.|+-.+ -.-|.+|.++...++..+. -.+.... ++ T Consensus 253 pdfgkP~~y~V~--------------G~kIH~SRL~~f~g~plPd~LKp~y~~~GiSv~q~~~e~V~~~~rT~~~v~-~L 317 (695) T protein:vir:36 253 DDFYKPSTWWMI--------------GTEVHATRLHTIVSRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVS-DI 317 (695) T ss_pred hccCCCceEEEe--------------ceEEeeeeEEEecCCCchhhhhcccccCcccHHHHHHHHHHHHHHHHhHHH-HH Confidence 222222222211 1123333444333221 1235678998888775322 1111111 11 Q ss_pred HHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHHHhhcccceEEEeccCcccccccccceeeeeccccCcc Q lcl|NC_016071. 260 IGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAANAHAGEQAYFILPSDMNAQGGEQYKMSLKGIDGAGKQ 339 (516) Q Consensus 260 ~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~~~~~g~~a~~iiP~g~~i~~~e~~~iel~~~~g~g~~ 339 (516) .+. + .+.+++. - +.+.-.+. .+.+...+ .++++.++. .....+|=++. 
.+.+.++.+=| T Consensus 318 i~~-~---~v~~lk~-d--la~aL~~g--~~~~l~~R-~eli~~~Rs-n~G~~llDk~~-------Eefeq~stslS--- 376 (695) T protein:vir:36 318 VKQ-F---SVSGILM-D--LAQALMPG--ANVDLSMR-AELINRYRD-NRNILFLDKAT-------EEFFQFNTPLS--- 376 (695) T ss_pred HHh-h---hHHHHHH-H--HHHhhcCh--hHHHHHHH-HHHHHHhcC-ccceEEEecCC-------cceEEEecccC--- Confidence 110 0 0000000 0 00000111 11222222 244444442 23333442222 23344433222 Q ss_pred hhHHHHHHHHHHHHHHHHhccccc--ccCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccc Q lcl|NC_016071. 340 YSTQELVNSRKKAILDRFGAGFIN--LGNDGQGSYNLSESKQSIHGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDED 417 (516) Q Consensus 340 ~~~~~li~~~d~~Isk~iLGqtLt--s~~~~~GS~Al~~vh~ev~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~ 417 (516) ....+|.-.-.+||-+. +-.+| .+.+-.|=-|-|+--..+.-|.+++........+-+.|+.-| .+.. ++. . T Consensus 377 -GLddVi~qf~q~VAgaa-~IPltkLfGqSPkGlNATGE~D~rnYYD~I~s~Qe~~L~p~L~rl~~ii-~rS~-~G~--i 450 (695) T protein:vir:36 377 -GLDALQAQAQEQMSAVS-HIPLIKLLGITPTGLNASSEGEIRVWYDYVRAYQRNALQQLMNDVIVMI-QLSL-FGA--V 450 (695) T ss_pred -CHHHHHHHHHHHHHhhh-cCchhhhhccCcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHh-cCC--C Confidence 24556666666666442 22222 122223433556655666666666655433333223343333 2221 222 2 Q ss_pred cceE--EecCcCchh------H-HHHHHHHHHHHhCCcccccHHHHHHHHHHcCCCCCC------C-ccccc-Ccc--cc Q lcl|NC_016071. 418 MPKL--KPGLIQEVD------M-EGFSKFVQRIGAVGYLPKTPTVINKILEVGGFDEEI------P-EDMST-DEL--LK 478 (516) Q Consensus 418 ~P~~--~~~~~~~~d------l-~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~Glp~~~------~-~~~~~-~~~--~~ 478 (516) .|.| +|....+-+ + +..|++++.+++.|++.++ .+++++.-.+.- + +|++. +.. .. 
T Consensus 451 dpdi~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~-----evr~rL~~d~~s~Y~~~~D~~d~p~~~~~~~~~ 525 (695) T protein:vir:36 451 DPSIKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPD-----QVAARLNTEPDGPYAGKLDANDDPGVPADDDID 525 (695) T ss_pred CCcceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHH-----HHHHHHhcCCCcccccccccccCCCcCccchhh Confidence 3444 443222222 2 3346678899999998873 567775543211 0 11100 000 00 Q ss_pred cCCCCCCcccccccccCCCCCcccccccccchhhhhcC Q lcl|NC_016071. 479 LLGQDTSRSGDGMTAGSNGNGTGKISSTRDNSVSNMDN 516 (516) Q Consensus 479 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~ 516 (516) .......+.+++...+.+++ +.+.-.+.-+++| T Consensus 526 ~~~~~~~~~~~~~~~~~~~~-----~~~g~~~~~~v~~ 558 (695) T protein:vir:36 526 GVLTYVQRLAEGGDTGAPGG-----ARAGATAPPTVAN 558 (695) T ss_pred hhHhhhcCcccccccCCCCc-----ccccccCCCcccc Confidence 00011112222222222222 1111122222222 No 235 >protein:vir:106282 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944108;genbank:gi:38640152;genbank:GeneID:2658030 Probab=34.31 E-value=1.3 Score=19.96 Aligned_cols=438 Identities=13% Similarity=0.090 Sum_probs=164.0 Q ss_pred CCcccc--------CcccccchhhhcccCCCCcccccchH-----------HHHHHHHHHHhhcccccC-CcccHHHHHH Q lcl|NC_016071. 
1 MSTRFA--------QPSEVVKAGNENLAVSRLRTGELGSG-----------ALSQLRAESEVMKVEELR-WPCFLATVEA 60 (516) Q Consensus 1 ~~~r~~--------~~~~~~~~~~~~p~~~~~~~~e~g~~-----------~~~~~~~~~~~~~~~~lr-~~~~~~~y~~ 60 (516) |.+=++ ......+...+-|+.|- .+=|++ +..-.+.+...+ +.++ ..+.|+.|++ T Consensus 5 ~l~lf~f~~k~~e~~~~~~~~~~~~s~~~p~---~~dGa~~I~~~~~~~~~~~~~~~~~~~~~--~~~~n~~eLI~~YR~ 79 (521) T protein:vir:10 5 FLKLLQPWMKDDEKRVQSDLSDRIDSFAVPD---TADGAIEVDKQIDTTAPKTAIVQSVLGYA--PKIQNTKDLINQYRS 79 (521) T ss_pred hhHHhhhhhhhhhhHHhhhhccCcccccccc---CCCCceeeccCCCccccccchhhhhhccc--cccchHHHHHHHHHH Confidence 111111 11111111111111111 111110 000111222221 1222 2457999999 Q ss_pred HhhChHHHHHHHHHHHHHhcCC-ce--eeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHHHHHhhcceeeeEE Q lcl|NC_016071. 61 MKQDHTVSTALDTKYVFVTKAF-ND--FKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAATFNEYGFSIFEKV 137 (516) Q Consensus 61 m~~D~~v~s~l~~Rk~~v~~~~-w~--i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~lda~~~G~S~~Eiv 137 (516) |..+|.|-++++-.-.-+.-.+ -. +.++-. +.+.++.+-+.|.+. |+ .+..+|+.--+||..+= T Consensus 80 ma~~pEvd~Av~eIvneaiv~d~~~~pV~i~Ld-~~~~s~~iK~kI~ee---------F~-~Il~ll~F~~~~~~~fR-- 146 (521) T protein:vir:10 80 LSKYHEVDNAIDEIINDAIVQEDNRDTVYLDLD-KTDWNESVKEMVREE---------FR-TILKLLKFEREGKRHFR-- 146 (521) T ss_pred HhhccchhhHHHhhhcceEEecCCCceEEEEec-CcccchHHHHHHHHH---------HH-HHHHHhccchhhhHHHh-- Confidence 9999999999998766442221 00 001000 011123333333332 22 33356666666666552 Q ss_pred Eeecccccccccceeecccc-------------ccCchhcccccceeecC-CCceeeecccccccccccccccccccccc Q lcl|NC_016071. 138 YRTESAPSKYAGYITIDKIA-------------FRPQSSLSRSKPWVFDE-DGRTLKGIYQSKMAFANFQNGLTQISSAM 203 (516) Q Consensus 138 w~~~~~~~~~~g~~~~~~l~-------------~r~q~ti~~~~~f~~~~-dg~~l~~~~q~~~~~~~~~~~~~~~~~~~ 203 (516) .|+-||++..+++. ...|..|+..|...... +|.....- ...+.-|.+. .... 
T Consensus 147 ------~WYVDgRi~fHkiid~~~pk~GI~Elr~lDPr~i~~vr~i~k~~~~~~~v~~~---~~e~f~Y~~~---~~~~- 213 (521) T protein:vir:10 147 ------RWYVDSRIYFHKMIDPARPKDGIKELRLLDPRNVEYYRVNLKSNENGNDVYKG---VKEFFTYGAT---EDNR- 213 (521) T ss_pred ------hheeeeeEEEEEEeeCCCccccceeeeeeCCcceeeeeeecCCCCCcchhhcc---ceeeeeeccC---CCce- Confidence 23334444444433 22222333222222221 22111110 0011111100 0000 Q ss_pred ccccccccCCCccccccccEEEEeecC--cCCccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccc Q lcl|NC_016071. 204 SLVTNLTSSADEVFIPINKLMVMSLGG--TESNPAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNK 281 (516) Q Consensus 204 ~~~~~~~~~~~~~~iP~~k~i~~~~~~--~~g~p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k 281 (516) +-.......++.||. ..|+|+|.. ..+.++..|.|.++..|+==-+.....-.++ .+-.+|.+|+=+ T Consensus 214 --~~~~g~~~~~vkI~~-daI~y~hSGL~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIY--------RitRAPeRRvFY 282 (521) T protein:vir:10 214 --YNISGNSNNLVQIPI-DAIVYSHSGKVDIDGKTIVGYLHNVIKPANQLKMLEDAMVIY--------RITRAPERRVFY 282 (521) T ss_pred --ecCCCCCCcceeech-hheeeecccceeCCCCceeccchhhhHhHHhhHHHHhhHHHH--------hhhccccceEEE Confidence 001112355677888 678899854 3356888999999998875433332222221 122222222211 Q ss_pred ccC--CCCHHHHHHHHHHHHHHHHhhc----ccceEE--------------EeccCcccccccccceeeeeccccCcchh Q lcl|NC_016071. 282 AAI--DPKSPESEMVQGLMADAANAHA----GEQAYF--------------ILPSDMNAQGGEQYKMSLKGIDGAGKQYS 341 (516) Q Consensus 282 ~~~--~~~~~~~~~l~~l~~~~~~~~~----g~~a~~--------------iiP~g~~i~~~e~~~iel~~~~g~g~~~~ 341 (516) -.- =|....++. +..+++..+. .+..|- -||.= +.....+|+-+ .|+... . 
T Consensus 283 IDvGnlpk~KAeqY---l~~iM~k~kNklVYDa~TGev~ddrk~msMlEDyWLpRR---eGgrgTEI~TL--pggqnl-g 353 (521) T protein:vir:10 283 IDVGTMPNKKATQH---LNNVMQGLKNRVVYDSSTGKVKNSSNNLAMTEDYWLMRR---DGKATTEVSTL--PGAQSM-G 353 (521) T ss_pred EecCCCCchhHHHH---HHHHHHhcCceEEEeccCceeccchhhhhhHhhhccccc---CCCCccceeec--cccCCc-C Confidence 111 122222222 2333322211 111111 12210 00111233333 222222 2 Q ss_pred HHHHHHHHHHHHHHHHhcccccccCCccc-hh-hHHHHH-HHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCc-- Q lcl|NC_016071. 342 TQELVNSRKKAILDRFGAGFINLGNDGQG-SY-NLSESK-QSI-HGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSD-- 415 (516) Q Consensus 342 ~~~li~~~d~~Isk~iLGqtLts~~~~~G-S~-Al~~vh-~ev-~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~-- 415 (516) -.+=|+|..+.+-+++--..--.+.+++| +. ..+++- .|+ |...++.-...+...|..-|-..|+.=+ .--+. T Consensus 354 em~DV~YF~kkLy~aLnVP~sRl~~e~~~f~~Gr~~EItRDEikF~KFI~rLR~rFs~~f~~~L~~qLilKg-iit~eew 432 (521) T protein:vir:10 354 EMDDVRWFNRKLYESMKIPLSRLPQEGAGVTFGAGNDITRDELQFTKYIRGLQQQFEPIFLNPLRTNLMLKG-KMSVSEW 432 (521) T ss_pred hHHHHHHHHHHHHHHhCCCccccCCCCCceecccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcc-CCCHHHH Confidence 33457999999998888776444434321 11 123322 233 2333444444444444433333332211 11110 Q ss_pred -cccceEEecCcCc------hhHHHHHHHHHHHHhCCc--ccccHHHHHHHHHHc-CCCCCC--CcccccCccccc--CC Q lcl|NC_016071. 416 -EDMPKLKPGLIQE------VDMEGFSKFVQRIGAVGY--LPKTPTVINKILEVG-GFDEEI--PEDMSTDELLKL--LG 481 (516) Q Consensus 416 -~~~P~~~~~~~~~------~dl~~~a~~~~~L~~~G~--~~~~~~~~~~i~e~~-Glp~~~--~~~~~~~~~~~~--~~ 481 (516) .--+.+.|+...+ .+.+-+.+++..|..+-- .+-...+.+||++.+ .++..+ ++++..+.+.+. -+ T Consensus 433 ~~i~~~I~~~f~~Dn~f~ElKe~eil~~R~~~l~~~dp~~yvGky~s~dyi~k~ILr~tDeeik~~~k~I~~E~~~~~~~ 512 (521) T protein:vir:10 433 EEQAENIKVVFSKDSYYEEIKDVEILERRVNLVQTLASAEVTGKYLSHEYVMKNILRMSDEDIKTEREKIDGELKDSVYK 512 (521) T ss_pred HHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHhhcCccccccccchHHHHHHHhcCCHhHHHHHHHHHHHhhhCCCCC Confidence 1113344433333 333444445555543321 222245678887654 555211 111111111111 11 Q ss_pred CCCCccccc Q lcl|NC_016071. 
482 QDTSRSGDG 490 (516) Q Consensus 482 ~~~~~~~~~ 490 (516) ++.....+. T Consensus 513 ~p~~e~~df 521 (521) T protein:vir:10 513 NPEDPMEEF 521 (521) T ss_pred CCcchhhcC Confidence 111121221 No 236 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=31.60 E-value=1.5 Score=19.64 Aligned_cols=443 Identities=8% Similarity=-0.030 Sum_probs=145.9 Q ss_pred CCccccCc-ccccchhhhcccCCCCcccccchHHHHHHHHHHHhhcccccCCcccHHHHH-HH-hhChHHHHHHHHHHHH Q lcl|NC_016071. 1 MSTRFAQP-SEVVKAGNENLAVSRLRTGELGSGALSQLRAESEVMKVEELRWPCFLATVE-AM-KQDHTVSTALDTKYVF 77 (516) Q Consensus 1 ~~~r~~~~-~~~~~~~~~~p~~~~~~~~e~g~~~~~~~~~~~~~~~~~~lr~~~~~~~y~-~m-~~D~~v~s~l~~Rk~~ 77 (516) +.+...-. ..+.+.-+. .......+ +.....+-... .+-+..+....... .. ...+...-++.+.... T Consensus 9 ~~~~~~~~~~~~~~~i~~------~~~~~~~r--~~~~~~yy~g~-~~i~~~~~~~~~~~~~~ki~~n~~~~iv~~~~~~ 79 (489) T protein:vir:99 9 IDYESKLWIDQLKNYISR------FKAEQLER--LKELKRYYLGD-NNIKYRPAKTDKYAADNRIASDFAKYITVFEQGY 79 (489) T ss_pred eCCCCCCCHHHHHHHHHH------HHHHHHHH--HHHHHHHhccc-CccccccccccccCCcceeecchHHHHHHHHhhh Confidence 11110000 000000000 00000000 00011110000 00011100000000 00 0123333344444444 Q ss_pred HhcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHH-HHHHHHhhcceeeeEEEeecccccccccceeeccc Q lcl|NC_016071. 78 VTKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIAR-SAATFNEYGFSIFEKVYRTESAPSKYAGYITIDKI 156 (516) Q Consensus 78 v~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~-~~lda~~~G~S~~Eivw~~~~~~~~~~g~~~~~~l 156 (516) +.+-+..+.+ .++.+-+++..++++- .|..... ...++.-||.++ +++|-.... 
..+|.+.+..+ T Consensus 80 l~g~~~~~~~-------~d~~~~~~l~~~~~~n----~~~~~~~~~~~~~~~~G~~~-~~v~~~~~~--d~~~~~~i~~~ 145 (489) T protein:vir:99 80 MLGVPVEYKN-------ENKDLQAAIDLMSVRN----NEDYHNVKIKTDLSIYGRAY-ELLTVEKID--DKKTEVKLYQL 145 (489) T ss_pred hccCCceeec-------CChhHHHHHHHHHhhc----ChhHHHHHHHHHHhhCCeEE-EEEeeccCc--CCCcceEEEEE Confidence 4444444432 1234456677766542 2544443 345688899765 455532111 01344444433 Q ss_pred cccCchhcccccceeecCC--Cceeeecccccc---------cccccccccccccccc--ccccccccCCCccccccccE Q lcl|NC_016071. 157 AFRPQSSLSRSKPWVFDED--GRTLKGIYQSKM---------AFANFQNGLTQISSAM--SLVTNLTSSADEVFIPINKL 223 (516) Q Consensus 157 ~~r~q~ti~~~~~f~~~~d--g~~l~~~~q~~~---------~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~iP~~k~ 223 (516) .|+.-. ..|++. ++.+..++.... ....|..+....-... +.............+...-+ T Consensus 146 ~p~~~~-------~v~dd~~~~~~~~~i~~~~~~~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~g~vPv 218 (489) T protein:vir:99 146 PAEQTF-------VIYDDTYQRNSLMAVHFYDIDYGSGKRKQIIKAYTSDTIYTYEDYNLETKGMRLKDYEGHFFKGVPV 218 (489) T ss_pred cccceE-------EEEcCCCCCceEEEEEEEEEecCCCceEEEEEEEeCCcEEEEEecCCCcccceecccccccCCceeE Confidence 332210 122211 122222221100 0001111100000000 00000000000011111123 Q ss_pred EEEeecCcCCccccchhHHHHHHHHHHH-HHHHHHHHHHHhhccccceeeeecccccccccCCCCHHHHHHHHHHHHHHH Q lcl|NC_016071. 224 MVMSLGGTESNPAGVSPLVGCYRAFREK-ILIENLETIGASKDLGGIIELKIPSQILNKAAIDPKSPESEMVQGLMADAA 302 (516) Q Consensus 224 i~~~~~~~~g~p~G~gLlr~~~~~~~fK-~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l~~~~~ 302 (516) +.|+ +|+.|.|.+..+. +++-- ...+..++..++-+..++.++++... +..+..+...... ... 
T Consensus 219 v~~~-----n~~~~~s~~~~v~-~liDa~d~~~s~~~~~~~~~~~~~l~i~g~~~--------~~~~~~~~~~~~~-~~~ 283 (489) T protein:vir:99 219 NEYA-----NNEERTGAYESVL-DNIDAYDLSQSELANFQQDSVNALLVIAGNAY--------TGADENDYLDDGR-LNP 283 (489) T ss_pred EEee-----cCCCCCCchhhhH-HHHHHHHHHHHHHHHHHHHhhhhhhhhccCCc--------ccccchhhhhhcc-ccc Confidence 3333 3567888887643 23222 23445566666666666766665321 1111110000000 000 Q ss_pred H----hhcccc--eEEEeccCcccccccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhcccccccCCccchhhHHH Q lcl|NC_016071. 303 N----AHAGEQ--AYFILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFINLGNDGQGSYNLSE 376 (516) Q Consensus 303 ~----~~~g~~--a~~iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLts~~~~~GS~Al~~ 376 (516) + +..+.. -...+..+...... ...++++..... ...+...++++.+.|.+.--+..++.++.++. + +. T Consensus 284 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~l~~~~~--~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n--~-Sg 357 (489) T protein:vir:99 284 NGRLAISIGFKKAQVLILDDNPNPNGV-KPQAYFLKKEYD--TAGSEAYKNRLVADILRFTFTPDTQDMKFSGV--Q-SG 357 (489) T ss_pred ccccccccccccceeeeeccccCcccc-ccceeeeeecCC--hHHHHHHHHHHHHHHHHHhCCccccccccccc--c-hH Confidence 0 000000 01111222111111 123444443222 22356678888888886655444444332211 1 11 Q ss_pred HHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHhcC-CcCCcc-----ccceEEecCcCchhHHHHHHHHHHHHhCCc Q lcl|NC_016071. 377 SKQSIH----GHFVQRDIDIIVEAFNKNLIPQLLALND-IRLSDE-----DMPKLKPGLIQEVDMEGFSKFVQRIGAVGY 446 (516) Q Consensus 377 vh~ev~----~~~~~aDa~~i~~~ln~~li~~lv~lN~-~~~~~~-----~~P~~~~~~~~~~dl~~~a~~~~~L~~~G~ 446 (516) +..+.. ...+..-.+.+...| +++++.++.+-. ..+... 
.-..+.|...-+.|..+.++++.+|+ |+ T Consensus 358 ~Al~~~~~~l~~k~~~k~~~~~~~l-~~~~~li~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~~~~~~kl~--gi 434 (489) T protein:vir:99 358 ESMKYKLMASDNYREKQERLFKKGL-MRRLRLAANIWAIKGNEATTYSLVNDTSIVFTPNLPQNDNEIVTAAQNLY--GI 434 (489) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhhcCCccccccccccceEEeCCCCCcCHHHHHHHHHHHh--cc Confidence 111111 111222223444444 345555544321 111111 12467888888999999999999985 65 Q ss_pred ccccHHHHHHHHHHc-CCCCCCCccccc--Cccc-ccCCCCCCcccccccccCCCCCccccc Q lcl|NC_016071. 447 LPKTPTVINKILEVG-GFDEEIPEDMST--DELL-KLLGQDTSRSGDGMTAGSNGNGTGKIS 504 (516) Q Consensus 447 ~~~~~~~~~~i~e~~-Glp~~~~~~~~~--~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~ 504 (516) +. .+.+.+.+ ++.....++|.. ..+. ......+..... ...+. .+.++..| T Consensus 435 is-----~et~~~~l~~v~~~d~~~E~~ri~~E~~~~~~~~~~~~~~-~~~~~-~~~~~~~p 489 (489) T protein:vir:99 435 VS-----DQTIFEILNTVTGVDAEAELKRLKEEADKKQSLPEPRLVG-DASGQ-EEPTAEKP 489 (489) T ss_pred CC-----HHHHHHhcCCCCchhHHHHHHHHHHHHHHHhccccccccC-CCCCC-cCCCCCCC Confidence 43 23344444 443222222110 0110 101111111000 00000 01111111 No 237 >protein:vir:6896 Length: 523 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861872;genbank:gi:32453663;genbank:GeneID:1494298 Probab=31.13 E-value=1.5 Score=19.58 Aligned_cols=444 Identities=13% Similarity=0.093 Sum_probs=158.1 Q ss_pred CCccccCcccccch------hhhccc-CCCCc-----ccccc-hHHHHHHHHHHHhh---cccccC-CcccHHHHHHHhh Q lcl|NC_016071. 1 MSTRFAQPSEVVKA------GNENLA-VSRLR-----TGELG-SGALSQLRAESEVM---KVEELR-WPCFLATVEAMKQ 63 (516) Q Consensus 1 ~~~r~~~~~~~~~~------~~~~p~-~~~~~-----~~e~g-~~~~~~~~~~~~~~---~~~~lr-~~~~~~~y~~m~~ 63 (516) |++=++--....+. .+..+| +||-. .-+++ ..+...+.+..+.. .-+.++ ..++|+.|++|.. 
T Consensus 5 ~~~lf~f~~~~de~~~~~~~~~~~~S~~~p~~dDGa~~i~~~~~~~~~~~~~~~q~~y~~~e~~~~~~~eLI~~YR~ma~ 84 (523) T protein:vir:68 5 ILSLFAPWAKMDERDYKDQEKENLESITSPKLDDGAKEYEVSENEAQQTYNAMFQRMFGSQEPGLKSTRELIDTYRNLMT 84 (523) T ss_pred hhhhhhhhhhhhhhhhhhhhhccCCCccccCCCCcceeeeccccccccccchhhhhhhhccccccchHHHHHHHHHHHhh Confidence 22211111111110 000111 11111 01111 11111122222211 122233 3468999999999 Q ss_pred ChHHHHHHHHHHHHHhcCC-c--eeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHHHHHhhcceeeeEEEee Q lcl|NC_016071. 64 DHTVSTALDTKYVFVTKAF-N--DFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAATFNEYGFSIFEKVYRT 140 (516) Q Consensus 64 D~~v~s~l~~Rk~~v~~~~-w--~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~lda~~~G~S~~Eivw~~ 140 (516) +|.|-++++-.-.-+.-.+ - -+.++-. +.+.++.+-+.+.+. |+ .+..+|+.--+||..+= T Consensus 85 ~pEvd~Av~eIVneaiv~d~~~~pV~i~Ld-~~~~s~~iK~kI~ee---------F~-~Il~ll~F~~~~~~~fR----- 148 (523) T protein:vir:68 85 NYEVDNAVSEIVSDAIVYEDDTEVVSINLD-NTKFSPNIKSMMLDE---------FN-EVLNHLSFQRKGSDHFR----- 148 (523) T ss_pred ccchhhHHHHhhcceeeecCCCceEEEEec-ccccchHHHHHHHHH---------HH-HHHHHhccchhhhHHHH----- Confidence 9999999998766432211 0 0111111 111233333333333 22 33356666666666542 Q ss_pred cccccccccceeeccccc-------------cCchhcccccceeecCC-Cceeeeccccccccccccccccccccccccc Q lcl|NC_016071. 141 ESAPSKYAGYITIDKIAF-------------RPQSSLSRSKPWVFDED-GRTLKGIYQSKMAFANFQNGLTQISSAMSLV 206 (516) Q Consensus 141 ~~~~~~~~g~~~~~~l~~-------------r~q~ti~~~~~f~~~~d-g~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~ 206 (516) .|+-||++.++++.. ..|..|+..|......+ |.-+ +. ....+.-|..++.. +... 
T Consensus 149 ---~WYVDgRi~fhKiid~k~pk~GI~Elr~lDPr~i~~vr~i~~~~~~g~~v--i~-~~~e~f~Y~~~~~~----~~~~ 218 (523) T protein:vir:68 149 ---RWYVDSRIFFHKIIDPKRPKEGIKELRRLDPRQVQYVREVITTTEAGVKI--VK-GYKEYFIYDTSHES----YACD 218 (523) T ss_pred ---hheeeeEEEEEEEeeCCCccccceeeeeeCCcceeEEEeecCCCCcchhh--hh-hhhhheeecccccc----cccc Confidence 233455555444331 12222322222212211 1111 00 11111111111100 0000 Q ss_pred cccccCCCccccccccEEEEeecCcCC-c-cccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccccccccC Q lcl|NC_016071. 207 TNLTSSADEVFIPINKLMVMSLGGTES-N-PAGVSPLVGCYRAFREKILIENLETIGASKDLGGIIELKIPSQILNKAAI 284 (516) Q Consensus 207 ~~~~~~~~~~~iP~~k~i~~~~~~~~g-~-p~G~gLlr~~~~~~~fK~~~~~~w~~~~er~g~~~~v~~~pp~~~~k~~~ 284 (516) -.......++.||.+- |+|+|..-.+ + -.=.|.|.++..|+==-+.....-.+ ..+-.+|.+|+=+-.- T Consensus 219 g~~~~~~~~ikI~~dA-I~y~hSGL~d~~~~~i~gyLhkAiKp~NQLkmlEDAlVI--------YRitRAPeRRvFYIDv 289 (523) T protein:vir:68 219 GRIYEAGTKIKIPKAA-IVYAHSGLVDCCGKNIIGYLHRAIKPANQLKLLEDAVVI--------YRITRAPDRRVWYVDT 289 (523) T ss_pred ccccCCCcceecchhh-eeeeeccceeCCCCceeccchhhhHHHHhhHHHHhhHHH--------HhhhccccceEEEEec Confidence 1112234577787764 8888843211 0 11237888888776433322221111 1122223322211111 Q ss_pred --CCCHHHHHHHHHHHHHHHHhhc----ccceEE--------------EeccCcccccccccceeeeeccccCcchhHHH Q lcl|NC_016071. 285 --DPKSPESEMVQGLMADAANAHA----GEQAYF--------------ILPSDMNAQGGEQYKMSLKGIDGAGKQYSTQE 344 (516) Q Consensus 285 --~~~~~~~~~l~~l~~~~~~~~~----g~~a~~--------------iiP~g~~i~~~e~~~iel~~~~g~g~~~~~~~ 344 (516) =|....++.+ ..++..++. ....|- -||.= +.....+|+-+ .|+.... 
-.+ T Consensus 290 GnlPk~KAeqYl---~~im~k~kNKlvYDa~TGev~ddrk~msMlEDyWLpRR---eGgrgTEItTL--pGgqnlg-em~ 360 (523) T protein:vir:68 290 GNMPSRKAAEHM---QHVMNTMKNRIAYDATTGKIKNQQHIMSMTEDYWLQRR---DGKAVTEVDTL--PGADNTG-NME 360 (523) T ss_pred CCCCchhHHHHH---HHHHHhhcceeEEeccCCeeccchhhhhhHhhhccccc---CCCcccceeec--cccCCcC-hHH Confidence 1222222222 222222111 001111 12210 00111233333 2222222 334 Q ss_pred HHHHHHHHHHHHHhcccccccCCccchhhH---HH-HHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCc---c Q lcl|NC_016071. 345 LVNSRKKAILDRFGAGFINLGNDGQGSYNL---SE-SKQSI-HGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSD---E 416 (516) Q Consensus 345 li~~~d~~Isk~iLGqtLts~~~~~GS~Al---~~-vh~ev-~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~---~ 416 (516) =|+|..+.+-+++--..--.+.++ |+..+ ++ +-.|+ |...++.-...+...|..-|-..|+. .+.--+. . T Consensus 361 DV~YF~kkLy~aLnVP~sRl~~~~-~~f~~Gr~~EItRDEikF~KFI~rLR~rFs~lf~~~Lk~qLil-Kgiit~eew~~ 438 (523) T protein:vir:68 361 DVRWFRNALYMALRIPITRIPSDQ-GGIQFDAGTSITRDELSFGKFIRELQHKFEEIFLDPLKTNLIL-KGIITEDEWND 438 (523) T ss_pred HHHHHHHHHHHHhCCcceeecCCC-cceecccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh-ccCCCHHHHHH Confidence 589999999998887764443332 22222 22 22333 23334444444444444333333322 1111110 1 Q ss_pred ccceEEecCcCc------hhHHHHHHHHHHHHhCCcccccHHHHHHHHHHc-CCCCCCC--cccccCcccc--cCCCCCC Q lcl|NC_016071. 417 DMPKLKPGLIQE------VDMEGFSKFVQRIGAVGYLPKTPTVINKILEVG-GFDEEIP--EDMSTDELLK--LLGQDTS 485 (516) Q Consensus 417 ~~P~~~~~~~~~------~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~e~~-Glp~~~~--~~~~~~~~~~--~~~~~~~ 485 (516) --+.+.|+...+ .+.+-+.+++..|..+-=.+-...+.+|+++.+ .+...+- ++...+.+.+ .-+++.. T Consensus 439 i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~~~kqI~~E~k~~~~~~p~~ 518 (523) T protein:vir:68 439 EINNIKIKFHRDSYFSELKDAEILERRINMLQMAEPFIGKYISHRTAMKDILQMSDEEIEQEAKQIEEESKEARFQDPDQ 518 (523) T ss_pred HhhcceEeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCCch Confidence 113344433333 333444445555544322222234577887654 4432111 1111111111 0111111 Q ss_pred ccccc Q lcl|NC_016071. 
486 RSGDG 490 (516) Q Consensus 486 ~~~~~ 490 (516) ...+. T Consensus 519 e~~~f 523 (523) T protein:vir:68 519 EQEDF 523 (523) T ss_pred hhhcC Confidence 11111 No 238 >protein:vir:106491 Length: 646 # NCBI annotation: Pas4 # Family: family:all:2798 # MgeID: mge:1680 # MgeName: phiAsp2 # Cross-refs: genbank:acc:YP_024790;genbank:gi:48697405;genbank:GeneID:2846148 Probab=29.12 E-value=1.7 Score=19.34 Aligned_cols=440 Identities=10% Similarity=0.075 Sum_probs=153.3 Q ss_pred CCccccCcccccchhhhcccCCCCcccccchHHHHHH--HHHHHhhcccccCCcccHHHHHHHhhChHHHHHHHHHHHHH Q lcl|NC_016071. 1 MSTRFAQPSEVVKAGNENLAVSRLRTGELGSGALSQL--RAESEVMKVEELRWPCFLATVEAMKQDHTVSTALDTKYVFV 78 (516) Q Consensus 1 ~~~r~~~~~~~~~~~~~~p~~~~~~~~e~g~~~~~~~--~~~~~~~~~~~lr~~~~~~~y~~m~~D~~v~s~l~~Rk~~v 78 (516) |+++.+.+-+-.-..+...-.++......+--+.+.| .+|..-+-.||||+- =..+++.+..-++-+ T Consensus 13 p~~~~~arrr~LtaAsa~l~~~~~~~~kt~~~~~~~WQ~eAW~~~d~vpELry~-----------vgW~~~a~SR~rL~a 81 (646) T protein:vir:10 13 PFGAEVARRIALAGATAQVDLGASSSWKTWKFGNKDWQTEGWRLYDIIPEHHFL-----------AGRIGDSVAQARLYV 81 (646) T ss_pred cccccccchhhhhhccccccCCCcceeecCCCcchhhhHHHHHHHhhhhhHhhH-----------hhhhhhhhceeeeee Confidence 5555554333222222111122211111221223344 345555666776641 122333333222222 Q ss_pred hcCCceeeeCCCCCChhhHHHHHHHHHHHhhccCcCCHHHHHHHHH---HHHhhcceeeeEEEeecccccccccceeecc Q lcl|NC_016071. 79 TKAFNDFKVLYNRDSKASKDAAEFVEYALKNLANQQTLRDIARSAA---TFNEYGFSIFEKVYRTESAPSKYAGYITIDK 155 (516) Q Consensus 79 ~~~~w~i~~~~~~d~~~~~~~a~~v~~~l~~~~~~~~~~~~l~~~l---da~~~G~S~~Eivw~~~~~~~~~~g~~~~~~ 155 (516) ...+ . +..+.+ +..+.++...+...+..-..+ .++|+.+- .=.+.+|-|.+--.... 
.++ T Consensus 82 seid-d-tG~~tg-~v~~~~v~~iv~~~~Gg~~gQ---~qlLkr~~~~ltV~GE~wiv~~~~~~~~-----~~~------ 144 (646) T protein:vir:10 82 TEVD-D-TGEETG-EVQDERIKRLAAVPLGTGSQR---DDNLRLAGLDLAVGGECWIVGEGAATSP-----EAA------ 144 (646) T ss_pred eeec-C-CCCCcC-ccchHHHHHHhhhhccchhhH---HHHHHHHHhheecccceEEeeccccCCC-----CCC------ Confidence 2221 0 011111 122334444444333222111 23444332 23344444421111110 011 Q ss_pred ccccCchhcccccceeecCCCceeeeccccccccccccccccccccccccccccccCCCccccccccEEEEeecCcC-Cc Q lcl|NC_016071. 156 IAFRPQSSLSRSKPWVFDEDGRTLKGIYQSKMAFANFQNGLTQISSAMSLVTNLTSSADEVFIPINKLMVMSLGGTE-SN 234 (516) Q Consensus 156 l~~r~q~ti~~~~~f~~~~dg~~l~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP~~k~i~~~~~~~~-g~ 234 (516) +..|+.+..+- + ...+.. ..+.+| ...++...+.++....++-+.+++. .+ T Consensus 145 ----------~~~W~vvt~~E--v------~~tg~~-----~~i~~p-----~~~~g~~~v~~~~~d~lvRiW~P~Prr~ 196 (646) T protein:vir:10 145 ----------EGSWFVVTGSA--I------SRTGDE-----IAVRRP-----QQRGGSKLVLVDGQDILIRCWRPHPNDT 196 (646) T ss_pred ----------ccceeeecHHH--h------ccCCCe-----eeeecC-----ccCCCCCcceecCCceEEEEecCCcccc Confidence 12344333211 0 000000 011111 0111334555555555544343322 12 Q ss_pred cccchhHHHHHHHHHHHHH-HHHHHHHHHhh-ccccceeeeecccccccccCCCCHHHHHHHHHH-HHHHHHhhcccceE Q lcl|NC_016071. 235 PAGVSPLVGCYRAFREKIL-IENLETIGASK-DLGGIIELKIPSQILNKAAIDPKSPESEMVQGL-MADAANAHAGEQAY 311 (516) Q Consensus 235 p~G~gLlr~~~~~~~fK~~-~~~~w~~~~er-~g~~~~v~~~pp~~~~k~~~~~~~~~~~~l~~l-~~~~~~~~~g~~a~ 311 (516) .+-.|.-|.|.-..-.-.. 
+-+.-+..=.| -|.|+ +..|-...=-....+..........+ +.++..+.--.+++ T Consensus 197 ~epDSpvra~l~~l~Ei~~lt~~I~aaakSRL~GnGv--LfvP~e~s~p~~~~~~a~~~~l~~~l~qaa~tAi~De~S~a 274 (646) T protein:vir:10 197 DQADSFTRSAIVPLREIELLTKREFAELDSRLTGAGI--MFLPEGVDFPRGEEDPAGLAGFMAYLQRAAAASMADQSRAS 274 (646) T ss_pred cCCcchhHHHHHHHHHHHHhhhHhHHHHHHHHhcCce--eeeccccccCCCCCCCcchhHHHHHHHHHHHhhhcCCCCcc Confidence 2333444444433221111 11111111111 12232 23333221111111111111222222 23333444444556 Q ss_pred EEeccCcccc--c-ccccceeeeeccccCcchhHHHHHHHHHHHHHHHHhccccc------ccCCcc-chhhHHHHHHHH Q lcl|NC_016071. 312 FILPSDMNAQ--G-GEQYKMSLKGIDGAGKQYSTQELVNSRKKAILDRFGAGFIN------LGNDGQ-GSYNLSESKQSI 381 (516) Q Consensus 312 ~iiP~g~~i~--~-~e~~~iel~~~~g~g~~~~~~~li~~~d~~Isk~iLGqtLt------s~~~~~-GS~Al~~vh~ev 381 (516) +.||--..+. . +....|..+.- ++ ......|+--|..|.+.-.|=.+. ++..+. +...+++ ++| T Consensus 275 A~vPiia~~P~E~i~~~~~ik~l~f---~~-eite~aiktR~daI~RlA~glDIppE~LLGlgd~NHWtAWqI~d--e~v 348 (646) T protein:vir:10 275 AMVPIMATIPNEMMEHLDKIKPLTF---WS-ELSAEITPMKDKAIARLASSAEIPGEVLTGIGDANHWTAWLISD--EGI 348 (646) T ss_pred ceeeeEEeeChHHHhhhhcceeecc---Cc-hhhHHHhhhHHHHHHHHHhccCCchhheeeccccceeeeeeecc--ccc Confidence 6667433221 1 01112221111 11 113346777888888877665432 221221 2234443 667 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccceEEecCc---CchhHHHHHHHHHHHHhCCcccccHHHHHHHH Q lcl|NC_016071. 382 HGHFVQRDIDIIVEAFNKNLIPQLLALNDIRLSDEDMPKLKPGLI---QEVDMEGFSKFVQRIGAVGYLPKTPTVINKIL 458 (516) Q Consensus 382 ~~~~~~aDa~~i~~~ln~~li~~lv~lN~~~~~~~~~P~~~~~~~---~~~dl~~~a~~~~~L~~~G~~~~~~~~~~~i~ 458 (516) + . ++--...||+.|++|++++.+.--+. .+.+.| .|+||.. ...|. 
..+++ .+++.|.|.- +.++ T Consensus 349 r-H-I~P~l~~ic~AlT~~~Lrp~Le~eGi-~dp~ky-vvW~DaS~Lt~~pd~--~deA~-qa~drGAIt~-----eAlr 416 (646) T protein:vir:10 349 R-W-IRGYLGLIADALTRGFLRRALESMGV-TNPERY-AFAFDTSTLASKPNR--LDEAI-QLHERNLIKD-----EEVV 416 (646) T ss_pred h-h-hhhHHHHHHHHHHhhHHHHHHHHcCC-CChhHe-EEeecCcccccCCCC--cHHHH-HHHHcCCccH-----HHHH Confidence 7 3 77788999999999999999886531 122333 4667643 22332 23333 4567777652 4567 Q ss_pred HHcCCCCCCCccc----------------------------ccCc-ccc--cCCCCCCccc---ccccccCCCCCccc-- Q lcl|NC_016071. 459 EVGGFDEEIPEDM----------------------------STDE-LLK--LLGQDTSRSG---DGMTAGSNGNGTGK-- 502 (516) Q Consensus 459 e~~Glp~~~~~~~----------------------------~~~~-~~~--~~~~~~~~~~---~~~~~~~~~~~~~~-- 502 (516) +.+|+.+...... ..+. ..+ +++..+++.+ +..+.|.+....+. T Consensus 417 k~~Gf~~dd~pt~~E~~~~~~~~~v~~~P~Lil~P~~qa~~~~P~~~~~~lpp~~~~~~dg~~~~~e~~g~~~~~E~~~~ 496 (646) T protein:vir:10 417 KAGAFSVDQMPTVQERAVQILLGLVKTQPDLILDPAIQAALGLPAVQSVGLPPTAAQRTDGDLDDDESEGAPNGGEAPDQ 496 (646) T ss_pred HHhcccccccCChHHHHHHHHHHHhcCCccccccchhhccccCCCcCccccCCcccccccCCCCChhhcCCCCCCccCCC Confidence 7777753211100 0000 000 0011111111 11111111111110 Q ss_pred ccccccchhhhhcC Q lcl|NC_016071. 503 ISSTRDNSVSNMDN 516 (516) Q Consensus 503 ~~~~~d~~~~~~~~ 516 (516) ..+..+.+.+--.+ T Consensus 497 pda~~~~a~~~~~~ 510 (646) T protein:vir:10 497 PDADEARAITAALD 510 (646) T ss_pred CCCCcccccccccc Confidence 00011111111111 Done!