Query lcl|NC_019524.1_cdsid_YP_007007012.1 [gene=F418_gp13] [protein=phage portal protein, lambda family] [protein_id=YP_007007012.1] [location=13097..14767]
Match_columns 556
No_of_seqs 121 out of 438
Neff 7.6
Searched_HMMs 1612
Date Thu Nov 7 19:42:43 2013
Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_13 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_13_vs_rec_db.hhr

No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 protein:vir:6382 Length: 553 # 100.0 9E-185 5E-188 1029.7 53.6 545 7-556 1-550 (553)
2 protein:vir:389 Length: 530 # 100.0 2E-175 1E-178 978.2 53.9 530 4-553 1-530 (530)
3 protein:vir:3420 Length: 533 # 100.0 5E-175 3E-178 976.3 53.3 533 1-553 1-533 (533)
4 protein:vir:96738 Length: 505 100.0 6E-167 4E-170 931.9 48.6 495 7-549 1-505 (505)
5 protein:vir:79538 Length: 502 100.0 4E-166 2E-169 927.4 52.4 490 1-556 11-502 (502)
6 protein:vir:95542 Length: 548 100.0 6E-166 4E-169 926.4 50.1 494 1-556 11-518 (548)
7 protein:vir:10321 Length: 495 100.0 2E-163 1E-166 912.8 50.7 490 4-552 1-495 (495)
8 protein:vir:3153 Length: 467 # 99.9 7.5E-27 4.7E-30 163.8 29.0 419 69-556 1-457 (467)
9 protein:vir:7853 Length: 518 # 99.8 1.1E-21 7.1E-25 135.4 24.3 427 1-556 3-431 (518)
10 protein:vir:3843 Length: 397 # 99.8 8.8E-21 5.4E-24 130.5 27.7 397 4-555 1-397 (397)
11 protein:vir:101648 Length: 518 99.8 6.1E-21 3.8E-24 131.4 24.8 433 1-556 1-457 (518)
12 protein:vir:93610 Length: 454 99.8 7.9E-20 4.9E-23 125.3 27.0 434 1-556 2-443 (454)
13 protein:vir:99452 Length: 651 99.8 1.2E-19 7.6E-23 124.3 24.8 474 1-556 1-542 (651)
14 protein:vir:93943 Length: 409 99.8 1.8E-18 1.1E-21 117.8 27.9 407 1-556 1-408 (409)
15 protein:vir:81072 Length: 432 99.8 1.2E-18 7.2E-22 118.9 26.7 429 1-556 1-429 (432)
16 protein:vir:97060 Length: 432 99.8 2.5E-18 1.5E-21 117.1 27.2 429 1-556 1-429 (432)
17 protein:vir:7987 Length: 456 # 99.8 1.3E-17 8.1E-21 113.2 31.0 436 1-548
1-456 (456) 18 protein:vir:10362 Length: 432 99.7 6E-18 3.7E-21 115.0 28.4 429 1-556 1-429 (432) 19 protein:vir:94426 Length: 409 99.7 8E-18 5E-21 114.3 28.5 407 1-556 1-408 (409) 20 protein:vir:79772 Length: 648 99.7 1.5E-17 9.3E-21 112.8 29.9 466 1-556 11-505 (648) 21 protein:vir:100691 Length: 535 99.7 2.2E-17 1.4E-20 111.9 29.7 471 1-556 1-526 (535) 22 protein:vir:102727 Length: 945 99.7 2.6E-18 1.6E-21 117.0 24.2 466 1-556 43-536 (945) 23 protein:vir:483 Length: 413 # 99.7 1.5E-17 9.1E-21 112.9 28.3 413 4-555 1-413 (413) 24 protein:vir:102080 Length: 429 99.7 5.7E-18 3.5E-21 115.1 26.0 427 4-556 1-427 (429) 25 protein:vir:1266 Length: 416 # 99.7 2.2E-18 1.4E-21 117.4 23.5 415 4-556 1-415 (416) 26 protein:vir:6240 Length: 457 # 99.7 1.1E-17 6.8E-21 113.6 26.4 438 4-556 1-452 (457) 27 protein:vir:107605 Length: 432 99.7 2.1E-17 1.3E-20 112.1 27.3 430 4-556 1-430 (432) 28 protein:vir:105002 Length: 432 99.7 2.1E-17 1.3E-20 112.1 27.3 430 4-556 1-430 (432) 29 protein:vir:102855 Length: 432 99.7 2.1E-17 1.3E-20 112.1 27.3 430 4-556 1-430 (432) 30 protein:vir:4854 Length: 386 # 99.7 2.5E-17 1.5E-20 111.6 27.6 386 4-553 1-386 (386) 31 protein:vir:4454 Length: 414 # 99.7 4.6E-17 2.8E-20 110.2 28.5 414 4-556 1-414 (414) 32 protein:vir:2683 Length: 412 # 99.7 1.6E-17 1E-20 112.6 25.7 408 1-556 1-411 (412) 33 protein:vir:104082 Length: 485 99.7 8.2E-16 5.1E-19 103.3 34.6 465 4-556 1-482 (485) 34 protein:vir:7407 Length: 392 # 99.7 9E-17 5.6E-20 108.6 29.1 386 1-550 2-392 (392) 35 protein:vir:7768 Length: 484 # 99.7 2E-15 1.2E-18 101.2 36.0 453 14-556 1-482 (484) 36 protein:vir:96579 Length: 576 99.7 1.6E-16 1E-19 107.2 29.2 458 1-556 27-531 (576) 37 protein:vir:1326 Length: 457 # 99.7 3.8E-17 2.4E-20 110.6 25.2 438 4-556 1-448 (457) 38 protein:vir:80040 Length: 461 99.7 9.9E-17 6.1E-20 108.3 27.2 450 1-556 1-460 (461) 39 protein:vir:4952 Length: 386 # 99.7 1.5E-16 9E-20 107.4 27.3 386 4-554 1-386 (386) 40 protein:vir:9359 Length: 348 # 99.7 3.8E-17 2.4E-20 110.6 24.2 347 
91-554 1-348 (348) 41 protein:vir:1380 Length: 422 # 99.7 7.5E-17 4.6E-20 109.0 25.7 417 4-551 1-422 (422) 42 protein:vir:96980 Length: 409 99.7 1.7E-16 1.1E-19 107.0 27.7 408 1-556 1-408 (409) 43 protein:vir:2341 Length: 488 # 99.7 1.8E-15 1.1E-18 101.4 33.0 466 1-555 7-488 (488) 44 protein:vir:105064 Length: 421 99.7 3.7E-17 2.3E-20 110.7 23.4 414 1-554 3-421 (421) 45 protein:vir:81218 Length: 423 99.7 6.4E-17 3.9E-20 109.4 24.4 419 4-556 1-422 (423) 46 protein:vir:100150 Length: 437 99.7 7.8E-17 4.8E-20 108.9 24.7 428 4-556 1-437 (437) 47 protein:vir:1431 Length: 419 # 99.7 8.7E-17 5.4E-20 108.6 24.8 413 10-556 1-416 (419) 48 protein:vir:4509 Length: 424 # 99.7 6.1E-16 3.8E-19 104.0 29.1 413 1-553 10-424 (424) 49 protein:vir:80333 Length: 419 99.7 6.4E-17 4E-20 109.4 23.6 412 18-556 1-413 (419) 50 protein:vir:8418 Length: 409 # 99.7 4.5E-16 2.8E-19 104.7 27.5 407 4-556 1-409 (409) 51 protein:vir:98444 Length: 434 99.7 2.6E-15 1.6E-18 100.5 31.6 416 47-552 1-434 (434) 52 protein:vir:94666 Length: 723 99.7 1.3E-16 8.1E-20 107.7 24.4 424 29-556 1-446 (723) 53 protein:vir:4337 Length: 434 # 99.7 1E-16 6.4E-20 108.2 23.7 429 1-556 1-434 (434) 54 protein:vir:5737 Length: 419 # 99.7 4.1E-16 2.5E-19 105.0 26.8 411 4-556 1-413 (419) 55 protein:vir:100249 Length: 431 99.7 2.4E-16 1.5E-19 106.2 25.3 420 1-554 1-431 (431) 56 protein:vir:102118 Length: 409 99.6 1.2E-15 7.5E-19 102.4 28.4 404 18-552 1-409 (409) 57 protein:vir:105819 Length: 456 99.6 2.1E-15 1.3E-18 101.1 29.7 432 1-548 1-456 (456) 58 protein:vir:102602 Length: 456 99.6 2.1E-15 1.3E-18 101.1 29.7 432 1-548 1-456 (456) 59 protein:vir:4156 Length: 542 # 99.6 1.7E-15 1E-18 101.6 29.1 430 1-556 6-470 (542) 60 protein:vir:1023 Length: 392 # 99.6 2.6E-15 1.6E-18 100.6 29.6 387 1-550 2-392 (392) 61 protein:vir:3989 Length: 392 # 99.6 2.6E-15 1.6E-18 100.6 29.6 387 1-550 2-392 (392) 62 protein:vir:95599 Length: 563 99.6 4.5E-16 2.8E-19 104.7 25.4 459 1-556 33-529 (563) 63 protein:vir:99312 Length: 563 99.6 4.5E-16 
2.8E-19 104.7 25.4 459 1-556 33-529 (563) 64 protein:vir:100187 Length: 385 99.6 5.4E-16 3.4E-19 104.3 25.7 380 1-549 1-385 (385) 65 protein:vir:81152 Length: 411 99.6 3.6E-16 2.2E-19 105.3 24.1 406 4-554 1-411 (411) 66 protein:vir:99072 Length: 479 99.6 1.1E-13 7.1E-17 91.5 37.3 452 1-556 9-473 (479) 67 protein:vir:5961 Length: 503 # 99.6 1.6E-13 1E-16 90.7 38.1 458 1-556 12-498 (503) 68 protein:vir:8100 Length: 466 # 99.6 9.2E-16 5.7E-19 103.0 25.5 448 4-555 1-466 (466) 69 protein:vir:1884 Length: 424 # 99.6 6.1E-16 3.8E-19 104.0 23.7 413 4-555 1-424 (424) 70 protein:vir:4194 Length: 540 # 99.6 2.6E-15 1.6E-18 100.5 27.1 442 1-556 2-462 (540) 71 protein:vir:100882 Length: 383 99.6 1.6E-15 1E-18 101.7 25.5 379 4-553 1-383 (383) 72 protein:vir:98396 Length: 441 99.6 2.5E-15 1.5E-18 100.7 26.4 431 1-556 4-441 (441) 73 protein:vir:79984 Length: 441 99.6 2.3E-15 1.5E-18 100.8 25.3 432 1-556 4-441 (441) 74 protein:vir:9408 Length: 441 # 99.6 2.3E-15 1.5E-18 100.8 25.3 432 1-556 4-441 (441) 75 protein:vir:189 Length: 424 # 99.6 3E-15 1.8E-18 100.2 25.2 422 4-555 1-424 (424) 76 protein:vir:63755 Length: 547 99.6 2.7E-14 1.7E-17 94.9 30.3 447 1-556 31-531 (547) 77 protein:vir:94049 Length: 532 99.6 5.1E-15 3.2E-18 98.9 25.8 464 1-556 1-524 (532) 78 protein:vir:4598 Length: 416 # 99.6 3.2E-15 2E-18 100.1 23.9 416 16-556 1-416 (416) 79 protein:vir:81095 Length: 416 99.6 3.2E-15 2E-18 100.1 23.9 416 16-556 1-416 (416) 80 protein:vir:80644 Length: 551 99.6 7E-14 4.3E-17 92.7 31.2 446 1-556 35-535 (551) 81 protein:vir:4828 Length: 382 # 99.6 3E-14 1.9E-17 94.7 28.3 382 4-554 1-382 (382) 82 protein:vir:78641 Length: 278 99.6 7.9E-16 4.9E-19 103.4 18.9 275 91-472 1-278 (278) 83 protein:vir:3868 Length: 417 # 99.5 6.5E-15 4E-18 98.4 23.4 399 4-555 1-417 (417) 84 protein:vir:4995 Length: 384 # 99.5 3.7E-15 2.3E-18 99.7 21.8 378 4-554 1-384 (384) 85 protein:vir:5249 Length: 437 # 99.5 1.6E-13 1E-16 90.7 30.5 420 28-556 1-437 (437) 86 protein:vir:101647 Length: 460 99.5 8.5E-14 
5.3E-17 92.2 28.8 420 4-556 1-460 (460) 87 protein:vir:960 Length: 413 # 99.5 2.2E-14 1.3E-17 95.5 25.5 410 1-554 1-413 (413) 88 protein:vir:4223 Length: 486 # 99.5 1.4E-12 8.7E-16 85.6 34.5 451 10-556 1-481 (486) 89 protein:vir:104259 Length: 403 99.5 2E-14 1.2E-17 95.7 23.9 390 40-553 1-403 (403) 90 protein:vir:2732 Length: 501 # 99.5 3.8E-12 2.4E-15 83.2 36.4 450 1-555 38-501 (501) 91 protein:vir:80796 Length: 574 99.5 8.7E-14 5.4E-17 92.2 26.3 449 1-556 29-530 (574) 92 protein:vir:96494 Length: 501 99.5 4.4E-12 2.7E-15 82.8 35.9 451 1-555 38-501 (501) 93 protein:vir:2427 Length: 485 # 99.5 5.1E-12 3.2E-15 82.5 36.4 450 18-556 1-483 (485) 94 protein:vir:9702 Length: 406 # 99.5 5.5E-14 3.4E-17 93.3 24.1 402 32-555 1-406 (406) 95 protein:vir:8317 Length: 409 # 99.5 1.4E-14 8.4E-18 96.6 20.7 389 1-539 16-409 (409) 96 protein:vir:101494 Length: 527 99.5 3.9E-12 2.4E-15 83.1 33.6 452 27-556 1-520 (527) 97 protein:vir:102239 Length: 527 99.5 4.5E-12 2.8E-15 82.8 33.6 452 27-556 1-520 (527) 98 protein:vir:2500 Length: 501 # 99.4 6.1E-12 3.8E-15 82.1 32.7 446 1-556 23-497 (501) 99 protein:vir:78537 Length: 480 99.4 1.5E-11 9.4E-15 79.9 36.1 434 18-556 1-469 (480) 100 protein:vir:107742 Length: 537 99.4 8.9E-13 5.5E-16 86.7 27.3 461 1-556 25-534 (537) 101 protein:vir:1082 Length: 359 # 99.4 2.6E-13 1.6E-16 89.5 23.4 353 4-508 1-359 (359) 102 protein:vir:95378 Length: 406 99.4 8.7E-13 5.4E-16 86.7 25.5 403 4-556 1-406 (406) 103 protein:vir:95113 Length: 474 99.4 2.8E-11 1.7E-14 78.4 35.4 435 1-555 21-474 (474) 104 protein:vir:9568 Length: 410 # 99.4 2.3E-11 1.4E-14 78.9 32.9 392 18-522 1-410 (410) 105 protein:vir:96068 Length: 765 99.4 3.4E-13 2.1E-16 88.9 22.5 467 1-556 31-541 (765) 106 protein:vir:80134 Length: 403 99.4 6.8E-13 4.2E-16 87.3 23.9 394 4-555 1-403 (403) 107 protein:vir:4898 Length: 502 # 99.4 3.8E-11 2.3E-14 77.7 36.2 448 1-556 39-499 (502) 108 protein:vir:78227 Length: 480 99.4 4.4E-11 2.7E-14 77.4 36.3 434 18-556 1-469 (480) 109 protein:vir:97447 Length: 
474 99.4 5.9E-11 3.7E-14 76.7 35.7 438 1-556 21-474 (474) 110 protein:vir:94498 Length: 474 99.4 5.9E-11 3.7E-14 76.7 35.7 438 1-556 21-474 (474) 111 protein:vir:9507 Length: 395 # 99.4 6.2E-13 3.9E-16 87.5 21.8 388 4-555 1-395 (395) 112 protein:vir:101289 Length: 395 99.4 6.2E-13 3.9E-16 87.5 21.8 388 4-555 1-395 (395) 113 protein:vir:100650 Length: 395 99.4 6.2E-13 3.9E-16 87.5 21.8 388 4-555 1-395 (395) 114 protein:vir:9871 Length: 429 # 99.4 6E-11 3.7E-14 76.6 36.1 420 14-548 1-429 (429) 115 protein:vir:95965 Length: 385 99.4 1.2E-12 7.6E-16 85.9 23.0 378 4-556 1-385 (385) 116 protein:vir:99563 Length: 862 99.3 1.5E-11 9.5E-15 79.9 28.8 457 1-556 48-562 (862) 117 protein:vir:99781 Length: 511 99.3 1E-10 6.4E-14 75.3 35.3 451 1-556 39-511 (511) 118 protein:vir:104338 Length: 422 99.3 1.8E-11 1.1E-14 79.5 27.2 405 26-556 1-421 (422) 119 protein:vir:99916 Length: 504 99.3 1.3E-10 8E-14 74.8 35.7 457 1-556 1-502 (504) 120 protein:vir:79647 Length: 435 99.3 1.7E-11 1.1E-14 79.6 26.2 411 1-556 5-435 (435) 121 protein:vir:99522 Length: 470 99.3 2.8E-10 1.7E-13 73.0 37.1 430 1-555 1-470 (470) 122 protein:vir:80680 Length: 441 99.2 3.5E-10 2.2E-13 72.4 34.4 409 19-556 1-440 (441) 123 protein:vir:96266 Length: 474 99.2 3.5E-10 2.2E-13 72.4 35.3 438 1-553 21-474 (474) 124 protein:vir:95899 Length: 474 99.2 3.5E-10 2.2E-13 72.4 35.3 438 1-553 21-474 (474) 125 protein:vir:96839 Length: 474 99.2 4E-10 2.5E-13 72.1 35.6 442 1-555 1-474 (474) 126 protein:vir:3964 Length: 453 # 99.2 4.5E-10 2.8E-13 71.8 36.6 436 1-556 11-453 (453) 127 protein:vir:107662 Length: 427 99.2 1E-10 6.4E-14 75.3 26.5 410 7-554 1-427 (427) 128 protein:vir:9306 Length: 511 # 99.2 5.2E-10 3.2E-13 71.5 35.1 450 1-555 37-511 (511) 129 protein:vir:93867 Length: 378 99.2 2.5E-12 1.5E-15 84.2 17.1 378 32-556 1-378 (378) 130 protein:vir:106639 Length: 481 99.2 6E-10 3.7E-13 71.1 35.7 439 1-552 30-481 (481) 131 protein:vir:94546 Length: 506 99.2 6.6E-10 4.1E-13 70.9 35.0 453 1-556 22-503 (506) 132 
protein:vir:9751 Length: 422 # 99.2 4.9E-10 3E-13 71.6 28.8 403 23-524 1-422 (422) 133 protein:vir:6210 Length: 394 # 99.2 1.1E-10 7E-14 75.1 25.3 392 4-555 1-394 (394) 134 protein:vir:94002 Length: 378 99.2 2E-12 1.3E-15 84.7 15.6 376 16-556 1-378 (378) 135 protein:vir:93747 Length: 472 99.2 8.1E-10 5E-13 70.4 36.9 445 1-556 1-472 (472) 136 protein:vir:103951 Length: 511 99.2 1E-09 6.2E-13 69.9 36.4 446 1-555 39-511 (511) 137 protein:vir:79043 Length: 479 99.2 1.1E-09 6.6E-13 69.8 36.9 422 18-554 1-479 (479) 138 protein:vir:94742 Length: 409 99.2 1.1E-09 6.7E-13 69.8 30.0 397 2-508 1-409 (409) 139 protein:vir:96240 Length: 511 99.1 1.2E-09 7.3E-13 69.5 35.5 450 1-555 39-511 (511) 140 protein:vir:1236 Length: 483 # 99.1 1.7E-09 1E-12 68.7 36.2 433 1-555 28-483 (483) 141 protein:vir:97336 Length: 492 99.1 1.9E-09 1.2E-12 68.4 35.5 434 1-556 38-492 (492) 142 protein:vir:1661 Length: 378 # 99.1 1.5E-11 9.4E-15 79.9 16.8 378 32-556 1-378 (378) 143 protein:vir:96179 Length: 468 99.1 2.2E-09 1.4E-12 68.0 34.9 428 1-545 20-468 (468) 144 protein:vir:3609 Length: 452 # 99.1 2.4E-09 1.5E-12 67.8 37.3 431 1-553 1-452 (452) 145 protein:vir:98643 Length: 395 99.1 6.9E-10 4.3E-13 70.8 25.2 385 16-555 1-395 (395) 146 protein:vir:97171 Length: 512 99.1 2.8E-09 1.7E-12 67.5 37.8 459 1-555 31-512 (512) 147 protein:vir:106571 Length: 499 99.1 2.8E-09 1.8E-12 67.4 37.0 440 1-556 16-492 (499) 148 protein:vir:94805 Length: 492 99.1 3E-09 1.9E-12 67.3 35.0 432 1-555 37-492 (492) 149 protein:vir:94101 Length: 474 99.1 3.2E-09 2E-12 67.2 38.1 454 1-555 1-474 (474) 150 protein:vir:105889 Length: 474 99.1 3.2E-09 2E-12 67.2 38.1 454 1-555 1-474 (474) 151 protein:vir:105292 Length: 478 99.0 4.3E-09 2.7E-12 66.4 37.2 420 4-552 1-478 (478) 152 protein:vir:102950 Length: 471 99.0 4.5E-09 2.8E-12 66.3 34.3 393 44-553 1-471 (471) 153 protein:vir:103971 Length: 376 99.0 1.8E-10 1.1E-13 74.0 19.9 330 1-465 23-376 (376) 154 protein:vir:94869 Length: 378 99.0 1.1E-10 7E-14 75.1 18.6 378 4-556 1-378 
(378) 155 protein:vir:105782 Length: 449 99.0 2.1E-09 1.3E-12 68.2 25.4 436 1-556 1-449 (449) 156 protein:vir:1634 Length: 409 # 99.0 5.6E-09 3.5E-12 65.8 29.6 394 2-508 1-409 (409) 157 protein:vir:733 Length: 453 # 99.0 5.7E-09 3.5E-12 65.8 34.9 435 1-552 3-453 (453) 158 protein:vir:858 Length: 378 # 99.0 4.7E-11 2.9E-14 77.2 15.3 378 4-556 1-378 (378) 159 protein:vir:4089 Length: 395 # 99.0 1.5E-09 9E-13 69.0 23.3 390 4-555 1-395 (395) 160 protein:vir:107112 Length: 478 99.0 1E-08 6.4E-12 64.4 37.2 423 4-553 1-478 (478) 161 protein:vir:9641 Length: 395 # 98.9 3.2E-09 2E-12 67.1 23.8 383 4-555 1-395 (395) 162 protein:vir:78310 Length: 376 98.9 4.2E-09 2.6E-12 66.5 23.5 371 4-550 1-376 (376) 163 protein:vir:267 Length: 348 # 98.9 8.7E-10 5.4E-13 70.3 19.6 333 1-488 1-348 (348) 164 protein:vir:96366 Length: 511 98.9 1.6E-08 1E-11 63.3 36.1 446 1-556 37-509 (511) 165 protein:vir:78805 Length: 511 98.9 1.6E-08 1E-11 63.3 36.1 446 1-556 37-509 (511) 166 protein:vir:1150 Length: 350 # 98.9 9E-10 5.6E-13 70.2 19.3 327 1-462 1-350 (350) 167 protein:vir:108215 Length: 469 98.9 1.7E-08 1.1E-11 63.2 32.5 439 18-556 1-465 (469) 168 protein:vir:9922 Length: 489 # 98.9 2.1E-08 1.3E-11 62.6 36.6 445 1-554 15-489 (489) 169 protein:vir:79150 Length: 368 98.9 1.1E-09 6.5E-13 69.8 18.5 347 1-461 1-368 (368) 170 protein:vir:79207 Length: 351 98.9 6.9E-10 4.3E-13 70.8 17.4 325 1-465 1-351 (351) 171 protein:vir:8184 Length: 474 # 98.9 2.5E-08 1.6E-11 62.2 31.6 443 1-541 1-474 (474) 172 protein:vir:95806 Length: 440 98.8 4E-08 2.5E-11 61.1 33.3 430 1-555 2-440 (440) 173 protein:vir:79703 Length: 505 98.8 4.8E-08 3E-11 60.7 30.3 446 1-539 1-505 (505) 174 protein:vir:100328 Length: 346 98.8 3.4E-09 2.1E-12 67.0 18.1 326 4-467 1-346 (346) 175 protein:vir:78907 Length: 518 98.8 6.4E-08 4E-11 60.0 36.2 455 1-539 4-518 (518) 176 protein:vir:78191 Length: 351 98.8 2.7E-09 1.7E-12 67.6 17.2 325 1-465 1-351 (351) 177 protein:vir:98567 Length: 340 98.8 4E-09 2.5E-12 66.6 18.2 322 1-466 1-340 
(340) 178 protein:vir:105461 Length: 470 98.8 6.5E-08 4E-11 60.0 35.2 418 18-556 1-470 (470) 179 protein:vir:4782 Length: 522 # 98.7 6.8E-08 4.2E-11 59.9 34.8 449 1-554 1-522 (522) 180 protein:vir:7430 Length: 563 # 98.7 7.2E-08 4.4E-11 59.8 31.4 460 27-556 1-558 (563) 181 protein:vir:3743 Length: 345 # 98.7 1.7E-08 1.1E-11 63.1 20.9 331 4-464 1-345 (345) 182 protein:vir:3780 Length: 345 # 98.6 1.5E-08 9E-12 63.6 18.3 332 1-464 1-345 (345) 183 protein:vir:5691 Length: 344 # 98.6 1E-08 6.4E-12 64.4 17.5 317 1-461 1-344 (344) 184 protein:vir:99853 Length: 488 98.6 1.8E-07 1.1E-10 57.6 29.1 404 4-556 1-410 (488) 185 protein:vir:6058 Length: 344 # 98.6 5.9E-08 3.7E-11 60.2 19.5 317 1-481 1-344 (344) 186 protein:vir:3028 Length: 500 # 98.6 2.9E-07 1.8E-10 56.5 32.8 439 1-553 1-500 (500) 187 protein:vir:9815 Length: 500 # 98.6 2.9E-07 1.8E-10 56.5 32.8 439 1-553 1-500 (500) 188 protein:vir:80959 Length: 499 98.5 4.6E-07 2.9E-10 55.3 36.2 434 18-553 1-499 (499) 189 protein:vir:98853 Length: 219 98.5 1.4E-08 8.5E-12 63.7 14.1 209 207-482 1-219 (219) 190 protein:vir:2013 Length: 344 # 98.5 6.1E-08 3.8E-11 60.2 17.6 317 1-480 1-344 (344) 191 protein:vir:79063 Length: 491 98.4 6.2E-07 3.8E-10 54.6 31.8 406 1-556 7-422 (491) 192 protein:vir:38 Length: 496 # N 98.4 9.3E-07 5.8E-10 53.6 37.0 444 18-553 1-496 (496) 193 protein:vir:102330 Length: 451 98.4 1.1E-06 6.6E-10 53.3 35.3 425 10-534 1-451 (451) 194 protein:vir:98883 Length: 517 98.3 1.6E-06 9.8E-10 52.4 36.1 451 1-553 1-517 (517) 195 protein:vir:1587 Length: 508 # 98.3 1.7E-06 1.1E-09 52.2 35.7 453 1-551 1-508 (508) 196 protein:vir:107880 Length: 491 98.2 2.1E-06 1.3E-09 51.7 31.8 408 1-556 1-422 (491) 197 protein:vir:78749 Length: 337 98.2 1.5E-06 9.5E-10 52.5 19.9 321 4-463 1-337 (337) 198 protein:vir:99232 Length: 526 98.1 4.2E-06 2.6E-09 50.1 32.9 428 1-556 1-447 (526) 199 protein:vir:4698 Length: 251 # 98.1 1.6E-06 9.6E-10 52.4 16.5 248 16-356 1-251 (251) 200 protein:vir:79233 Length: 526 98.0 8.1E-06 5E-09 48.5 33.7 
428 1-556 1-446 (526) 201 protein:vir:8654 Length: 629 # 98.0 8.4E-06 5.2E-09 48.4 21.9 458 1-556 1-520 (629) 202 protein:vir:99088 Length: 629 97.9 1E-05 6.3E-09 47.9 21.6 458 1-556 1-520 (629) 203 protein:vir:103860 Length: 528 97.7 2.4E-05 1.5E-08 45.9 33.5 429 1-556 1-449 (528) 204 protein:vir:1986 Length: 512 # 97.7 2.5E-05 1.5E-08 45.8 33.0 420 1-556 1-437 (512) 205 protein:vir:106027 Length: 629 97.6 3.7E-05 2.3E-08 44.9 21.5 449 1-556 1-512 (629) 206 protein:vir:106491 Length: 646 97.5 4.7E-05 2.9E-08 44.3 22.1 441 1-556 4-501 (646) 207 protein:vir:79511 Length: 448 97.5 6E-05 3.7E-08 43.7 29.8 433 4-556 1-439 (448) 208 protein:vir:97900 Length: 639 97.4 8.1E-05 5E-08 43.0 19.8 462 1-556 1-524 (639) 209 protein:vir:107517 Length: 639 97.4 8.1E-05 5E-08 43.0 19.8 462 1-556 1-524 (639) 210 protein:vir:77981 Length: 448 97.2 0.00013 7.9E-08 42.0 29.8 429 1-556 1-440 (448) 211 protein:vir:106716 Length: 698 97.2 0.00015 9.1E-08 41.6 20.0 457 1-556 63-574 (698) 212 protein:vir:78161 Length: 355 97.1 0.00015 9.5E-08 41.5 25.1 313 161-556 1-339 (355) 213 protein:vir:78589 Length: 695 97.0 0.00023 1.5E-07 40.5 18.7 457 1-556 63-571 (695) 214 protein:vir:3648 Length: 695 # 96.8 0.00035 2.1E-07 39.6 18.5 454 1-556 36-571 (695) 215 protein:vir:106999 Length: 564 96.8 0.00035 2.2E-07 39.5 28.0 490 1-556 1-564 (564) 216 protein:vir:98816 Length: 446 96.7 0.00037 2.3E-07 39.4 24.1 416 1-533 7-446 (446) 217 protein:vir:101541 Length: 694 96.6 0.00044 2.7E-07 39.0 19.7 454 1-556 38-570 (694) 218 protein:vir:102426 Length: 631 96.4 0.00062 3.9E-07 38.2 23.4 458 4-556 1-521 (631) 219 protein:vir:78083 Length: 537 96.0 0.0012 7.4E-07 36.6 36.4 434 14-556 1-525 (537) 220 protein:vir:5839 Length: 533 # 95.6 0.0018 1.1E-06 35.7 25.4 443 1-556 20-519 (533) 221 protein:vir:108049 Length: 524 95.6 0.0019 1.1E-06 35.6 27.4 466 1-549 1-524 (524) 222 protein:vir:103177 Length: 533 95.5 0.002 1.2E-06 35.4 28.7 480 1-556 1-532 (533) 223 protein:vir:104892 Length: 558 94.7 0.0038 2.3E-06 
33.9 30.5 490 1-556 1-542 (558) 224 protein:vir:104500 Length: 537 94.2 0.0051 3.2E-06 33.2 28.7 472 1-556 10-534 (537) 225 protein:vir:103458 Length: 524 94.1 0.0055 3.4E-06 33.0 25.7 472 1-549 1-524 (524) 226 protein:vir:103219 Length: 201 93.9 0.006 3.7E-06 32.8 12.9 195 272-556 1-200 (201) 227 protein:vir:105154 Length: 525 93.7 0.0065 4.1E-06 32.6 19.4 444 37-556 1-518 (525) 228 protein:vir:98265 Length: 524 93.3 0.008 5E-06 32.1 25.7 463 1-549 13-524 (524) 229 protein:vir:102668 Length: 547 93.0 0.0093 5.7E-06 31.7 26.3 444 23-556 1-547 (547) 230 protein:vir:81017 Length: 521 92.9 0.0096 6E-06 31.6 28.5 468 1-549 8-521 (521) 231 protein:vir:94572 Length: 535 92.5 0.011 6.8E-06 31.3 25.9 466 1-551 1-535 (535) 232 protein:vir:80165 Length: 651 91.8 0.014 8.9E-06 30.7 31.1 480 1-556 5-602 (651) 233 protein:vir:106282 Length: 521 91.1 0.018 1.1E-05 30.2 28.4 466 1-549 1-521 (521) 234 protein:vir:80453 Length: 535 90.5 0.02 1.3E-05 29.9 37.0 450 5-556 1-532 (535) 235 protein:vir:95254 Length: 488 90.3 0.021 1.3E-05 29.7 32.2 443 4-556 1-481 (488) 236 protein:vir:6596 Length: 521 # 90.3 0.022 1.3E-05 29.7 28.6 468 1-549 8-521 (521) 237 protein:vir:101806 Length: 516 90.3 0.022 1.3E-05 29.7 28.8 458 1-549 1-516 (516) 238 protein:vir:101189 Length: 516 90.3 0.022 1.3E-05 29.7 28.8 458 1-549 1-516 (516) 239 protein:vir:95315 Length: 559 88.1 0.035 2.1E-05 28.6 22.2 480 1-556 1-559 (559) 240 protein:vir:7208 Length: 524 # 87.8 0.037 2.3E-05 28.5 27.1 472 1-549 1-524 (524) 241 protein:vir:7321 Length: 556 # 87.3 0.04 2.5E-05 28.3 26.7 465 1-543 1-556 (556) 242 protein:vir:5665 Length: 511 # 86.9 0.042 2.6E-05 28.1 30.2 466 1-549 1-511 (511) 243 protein:vir:3361 Length: 535 # 86.3 0.047 2.9E-05 27.9 31.7 450 4-555 1-535 (535) 244 protein:vir:94599 Length: 641 83.6 0.067 4.2E-05 27.0 24.7 470 1-556 20-582 (641) 245 protein:vir:94709 Length: 522 83.2 0.07 4.4E-05 26.9 27.1 458 4-555 1-522 (522) 246 protein:vir:103765 Length: 549 82.7 0.075 4.6E-05 26.8 26.2 459 1-549 1-549 
(549) 247 protein:vir:6896 Length: 523 # 81.5 0.085 5.3E-05 26.5 25.9 469 1-549 1-523 (523) 248 protein:vir:78942 Length: 510 72.0 0.19 0.00012 24.6 26.9 450 10-539 1-510 (510) 249 protein:vir:8883 Length: 543 # 71.3 0.2 0.00012 24.4 29.2 464 11-556 1-542 (543) 250 protein:vir:97265 Length: 513 70.9 0.2 0.00013 24.4 34.7 446 11-556 1-501 (513) 251 protein:vir:100598 Length: 516 65.0 0.29 0.00018 23.5 30.2 460 1-549 1-516 (516) 252 protein:vir:1538 Length: 535 # 63.8 0.31 0.00019 23.4 32.1 458 4-555 1-535 (535) 253 protein:vir:2198 Length: 536 # 56.4 0.46 0.00028 22.4 33.1 446 4-552 1-536 (536) 254 protein:vir:10447 Length: 536 44.3 0.81 0.0005 21.1 33.4 442 4-556 1-533 (536) 255 protein:vir:107404 Length: 555 39.5 1 0.00063 20.6 28.7 456 18-556 1-544 (555) 256 protein:vir:107822 Length: 555 39.5 1 0.00063 20.6 28.7 456 18-556 1-544 (555) 257 protein:vir:98506 Length: 555 39.5 1 0.00063 20.6 28.7 456 18-556 1-544 (555) 258 protein:vir:6322 Length: 510 # 34.6 1.3 0.00079 20.0 30.1 444 10-539 1-510 (510) 259 protein:vir:95149 Length: 501 26.0 2 0.0012 18.9 34.2 443 10-553 1-501 (501) No 1 >protein:vir:6382 Length: 553 # NCBI annotation: portal protein Lambda B # Family: family:all:47 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918995;genbank:gi:34610170;genbank:GeneID:2559575 Probab=100.00 E-value=8.5e-185 Score=1029.72 Aligned_cols=545 Identities=48% Similarity=0.865 Sum_probs=487.7 Q ss_pred hhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHH Q lcl|NC_019524. 
7 TTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVVAVH 86 (556) Q Consensus 7 ~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v~~~ 86 (556) |.+...+........++..+.+...++|+||++++|++++|+|...|+|.++..++.+||+|||||++|||+|+++|+++ T Consensus 1 m~~~~~r~~~~~a~~~~~~~~~~~~~~y~gA~~~~r~~~~w~~~~~s~~~~~~~~~~~lr~RaRdL~rNn~~a~~av~~~ 80 (553) T protein:vir:63 1 MTKVTVRKLSEVTSGRPEQSASLGGGGLEGASRLSRETVSWNPSLRSPDALINPLKRIADARGRDMADNDGFTNGAVGYQ 80 (553) T ss_pred CcchhhhhhcccccccchhhhhhhcccccccccCCCcccccccCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHH Confidence 22222222212222233344455567899999999999999999999999999999999999999999999999999999 Q ss_pred HhhhccCCceeeeecccccc-CCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEEEe Q lcl|NC_019524. 87 RDSIVGSQYKLNAKPNTIVL-GAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLATCE 165 (556) Q Consensus 87 ~~nvVG~Gi~~~~~~~~~~l-g~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~~~ 165 (556) ++||||+||+|+++|++..| |++++++++|++.|+++|++||++++.+||++|++||++||+|++|+++++||||++++ T Consensus 81 ~~nvVG~Gi~~~~~~~~~~l~g~~~~~~~~~~~~ie~~w~~wa~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~~~~ 160 (553) T protein:vir:63 81 RDSIVGAQYRLNSMPDINVIPGATEEWAEEYQTIVEAKFELYAESLACYIDNAAISTFTGLIRLGVVGYVKTGEVLATAE 160 (553) T ss_pred HHhhccCCceeeeccchhhhcCCCHHHHHHHHHHHHHHHHHhcCCccceeeccccCCHHHHHHHHHHHHHhCCceEEEee Confidence 99999999999999999976 99999999999999999999999998899999999999999999999999999999999 Q ss_pred eccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCcccc--CCccccceeeccc Q lcl|NC_019524. 166 WLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTD--MEQWKWGYEPARF 243 (556) Q Consensus 166 ~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~--~~~~~~~rv~~~~ 243 (556) +.+. .+.++||+||+||||+|++++|.+++++|++|||||++|+|+||||++.||||.+. 
...++|.||++++ T Consensus 161 ~~~~-----~~~~~~~~lq~ie~drl~~~~~~~~~~~i~~GVE~d~~Gr~vaY~i~~~hPgd~~~~~~~~~~~~r~~~~~ 235 (553) T protein:vir:63 161 WDRA-----ANRPYATCFQMVSTDRLSNPYQQLDTPTLRRGVQYDKRGRPQGYWIQVAHPGDLYQMAPDMYKWKFVQQSK 235 (553) T ss_pred eccC-----CCCcccceEEEechhhcCCCCCCCCCCeeEeeeEECCCCceEEEEeeccCCCccccccccccceeeecccc Confidence 8753 35679999999999999999999999999999999999999999999999999654 4558899999999 Q ss_pred cCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCccccccccccccccc--ccc Q lcl|NC_019524. 244 DWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGG--FKE 321 (556) Q Consensus 244 ~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~--~~~ 321 (556) +|||++|||+|+++||||+||||||+|||.+|+||++|+||||++|+|+|||++|||++.+++...+..+.+.+. ... T Consensus 236 ~v~a~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~daeL~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~ 315 (553) T protein:vir:63 236 PWGRRQVIHILEPREPDQSRGIADIVSGLKDMRMAKRFKEMSLQNAVINASYAAAIESELPPEFIHSQMSGGSPNADMVG 315 (553) T ss_pred ccChhHheecccccCCCcccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCChhhhhhhcccccccccccc Confidence 999999999999999999999999999999999999999999999999999999999999888776665533332 233 Q ss_pred cccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccchh Q lcl|NC_019524. 
322 IFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSS 401 (556) Q Consensus 322 ~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs 401 (556) ...........+..+...+.|+||+|++|.||++|++++|++|+++|.+|++.+||.||+|+|||||+||+|||++|||| T Consensus 316 ~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~p~~p~~~~~~F~~~~lr~iaaglGi~Ye~lt~D~s~~nYSS 395 (553) T protein:vir:63 316 IFGKYMDALKAYVGGANNIQIDGAKIPHLFPGTKLNLKPMGTPGGVGSEFEASLNRHLASAFGMSYEEFTRDFSKANYSS 395 (553) T ss_pred cccccccccccccccccceeecCceeeecCCCCeeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhhhcccccHHH Confidence 34445555666677788899999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhh Q lcl|NC_019524. 402 ARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKET 481 (556) Q Consensus 402 ~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~ 481 (556) +|++++++|+.++.+|++|+.+||+|||++||++|+++|+|++|++++.+++++|..+.+|++|+|++|+|+||||+||+ T Consensus 396 ~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~a~l~G~i~~p~~~~~~~~~~p~~~~a~~~~~w~~p~~~~iDP~Ke~ 475 (553) T protein:vir:63 396 IQAGIAMTRRFLEGRKKMCADRLATEFFTLWLEEAIAAGEVPMPPGQTRDLFYQPLMKEALSKCEWIGASQGQIDQLKET 475 (553) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCcccchhhcchhhhhhhhceeeecCCccccChHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 
482 EAAILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 482 ~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 556 (556) +|++++|++|++|++++|+++|.||++|++|+++|+++++++||+++.++..........+++++++.+..+++| T Consensus 476 ~A~~~~i~~G~~t~~~~~a~~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 550 (553) T protein:vir:63 476 QAAVMRIDAGLSTYEREIARLGGDFRKSFAQRAREDALLKKYGLTFNLSAKRSLGDGRDAATGIAEDPAAAQTSQ 550 (553) T ss_pred HHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCCCCccccCCCcccCCCCCCCCCCCCccc Confidence 999999999999999999999999999999999999999999999998877665554444444443333333333 No 2 >protein:vir:389 Length: 530 # NCBI annotation: gp4 # Family: family:all:47 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046899;genbank:gi:9630468;genbank:GeneID:1261643 Probab=100.00 E-value=2.1e-175 Score=978.22 Aligned_cols=530 Identities=35% Similarity=0.504 Sum_probs=464.7 Q ss_pred chhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHH Q lcl|NC_019524. 4 VKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVV 83 (556) Q Consensus 4 ~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v 83 (556) || -.. ...+......+.....|++|..+++++++|.|...|+|+++..++.+||+|||||++|||+|+++| T Consensus 1 ~~------~~~---~~~~~~~~~~~~~~~~~~~a~~~~~~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~av 71 (530) T protein:vir:38 1 MK------IPS---LVGPDGKTSLREYAGYHGGGGGFGGQLRGWNPPSESADAALLPNYSRGNARADDLVRNNGYAANAV 71 (530) T ss_pred Cc------cce---eecCccccchHHHhhhhcccCCCCCcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHH Confidence 11 011 111111111112223455666678899999999999999999999999999999999999999999 Q ss_pred HHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEE Q lcl|NC_019524. 
84 AVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLAT 163 (556) Q Consensus 84 ~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~ 163 (556) +++++||||+||+|+++|+...||++++++++|+++|+++|++||++++.+||++|++||++||+|++|++++|||||++ T Consensus 72 ~~~~~nvVG~Gi~~~~~p~~~~l~~~~~~~~~~~~~ie~~w~~W~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~~ 151 (530) T protein:vir:38 72 QLHQDHIVGSFFRLSYRPSWRYLGINEEDSRAFSRDVEAAWNEYAEDDFCGIDAERKRTFTMMIREGVAMHAFNGELCVQ 151 (530) T ss_pred HHHHHHhhCCCceeeeccchhhcCCCHhHHHHHHHHHHHHHHHhhcCCCcEEeeeccCCHHHHHHHHHHHHhhCCceEEE Confidence 99999999999999999999999999999999999999999999999988899999999999999999999999999999 Q ss_pred EeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceeeccc Q lcl|NC_019524. 164 CEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPARF 243 (556) Q Consensus 164 ~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~~~ 243 (556) +++.+ .+++++||+||+||||+|+++++.+++++|++|||||++|+|+||||++.||++.. ...|.++|+++ T Consensus 152 ~~~~~-----~~g~~~~~~lq~ie~d~l~~~~~~~~~~~i~~GIe~d~~Gr~~aY~i~~~~~~~~~---~~~~~~~~~~~ 223 (530) T protein:vir:38 152 ATWDS-----DSTRLFRTQFKMVSPKRVSNPNNIGDTRNCRAGVKINDSGAALGYYVSDDGYPGWM---AQNWTYIPREL 223 (530) T ss_pred eeecc-----CCCCccceEEEEechhhcCCCCCCCCCCeeEeeeEECCCCceEEEEEeeccCCCcc---ccccceeeeee Confidence 99875 35678999999999999999999999999999999999999999999998765433 35789999999 Q ss_pred cCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccc Q lcl|NC_019524. 244 DWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIF 323 (556) Q Consensus 244 ~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~ 323 (556) ++||++|||+|+++||||+||||||+|+|.+|+||++|.+|||++|+|+|||++|||++.++...+...+....+..... 
T Consensus 224 ~v~a~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~ 303 (530) T protein:vir:38 224 PGGRPSFIHVFEPMEDGQTRGANAFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSAMDFILGADNKEQQSK 303 (530) T ss_pred ccChhHeEeeccccCCCcccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeeccCCccccccccccCCccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999888777665544443333 Q ss_pred cccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccchhHH Q lcl|NC_019524. 324 NEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSAR 403 (556) Q Consensus 324 ~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R 403 (556) ............+...+.|+||+|++|.||++|++++|++|+++|++|++.++|.||+|||||||+|++|||++||||+| T Consensus 304 ~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~ye~lt~D~s~~nYSS~R 383 (530) T protein:vir:38 304 LTGWLGEMAAYYSAAPVRLGGARVPHLLPGDSLNLQSAQDTDNGYSTFEQSLLRYIAAGLGVSYEQLSRNYSQMSYSTAR 383 (530) T ss_pred ccccchhhhhcccccceeccCceeeecCCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccccHHHHH Confidence 33333334455566678899999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHH Q lcl|NC_019524. 404 ASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEA 483 (556) Q Consensus 404 ~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A 483 (556) ++++++|+.++.+|++|+.+||+|||++||++|+++|+|++|.+.+. 
.++..+.+|++|+|++|+|+||||+||++| T Consensus 384 ~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~av~~G~i~~p~~~~~---~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~a 460 (530) T protein:vir:38 384 ASANESWAYFMGRRKFVASRQACQMFLCWLEEAIVRRVVTLPSKARF---SFQEARTAWGNANWIGSGRMAIDGLKEVQE 460 (530) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHcCCccCCCCCCC---CchhhHHhhhceeeecCCccccChHHHHHH Confidence 99999999999999999999999999999999999999999998753 336789999999999999999999999999 Q ss_pred HHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCc Q lcl|NC_019524. 484 AILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEE 553 (556) Q Consensus 484 ~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e 553 (556) ++++|++|++|++++|+++|.||++|++|+++|+++++++||+++.++...+.+...+.++++++....- T Consensus 461 ~~~~i~~G~~s~~~~~a~~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~d~~~~a 530 (530) T protein:vir:38 461 AVMLIEAGLSTYEKECAKRGDDYQEIFAQQVRESMERRAAGLNPPAWAAAAFEAGVKKSNEEEQDGARAA 530 (530) T ss_pred HHHHHHcCCCCHHHHHHHcCCCHHHHHHHHHHHHHHHHHcCCCCCCCcccccCCCCCCCCCCCCCCCCCC Confidence 9999999999999999999999999999999999999999999987766554444333333332222222 No 3 >protein:vir:3420 Length: 533 # NCBI annotation: capsid component # Family: family:all:47 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040583;genbank:gi:9626247;genbank:GeneID:2703526 Probab=100.00 E-value=4.7e-175 Score=976.30 Aligned_cols=533 Identities=35% Similarity=0.498 Sum_probs=462.5 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAA 80 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~ 80 (556) |. ++-..+...+++. .++. 
.....|.||++.++++++|+|...|+|.++..++.+|++|||||++|||+++ T Consensus 1 ~~----~p~~~~~~~~~~~----~~~~-~~~~y~~~a~~~~~~~~~w~p~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~ 71 (533) T protein:vir:34 1 MK----TPTIPTLLGPDGM----TSLR-EYAGYHGGGSGFGGQLRSWNPPSESVDAALLPNFTRGNARADDLVRNNGYAA 71 (533) T ss_pred CC----Cchhhhhhccccc----chHH-HHHhhhhccCCCCCcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHH Confidence 11 1111111111111 1111 1122245666778899999999999999999999999999999999999999 Q ss_pred HHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCce Q lcl|NC_019524. 81 GVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEV 160 (556) Q Consensus 81 ~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~ 160 (556) ++|+++++||||+||+|+++|+...||++++++++|+++|+++|+.|+++++.+||++|++||++||+|++|++++|||| T Consensus 72 ~av~~~~~nvVG~Gi~~~~~p~~~~lg~~~~~~~~~~~~ie~~w~~w~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~ 151 (533) T protein:vir:34 72 NAIQLHQDHIVGSFFRLSHRPSWRYLGIGEEEARAFSREVEAAWKEFAEDDCCCIDVERKRTFTMMIREGVAMHAFNGEL 151 (533) T ss_pred HHHHHHHHHhhCCCceeeeccchhhcCCChhHHHHHHHHHHHHHHHhhcCccceeccccccCHHHHHHHHHHHHHhCCce Confidence 99999999999999999999999999999999999999999999999999988899999999999999999999999999 Q ss_pred EEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceee Q lcl|NC_019524. 161 LATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEP 240 (556) Q Consensus 161 f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~ 240 (556) |+++++.+. +++++||+||+||||+|++|+|.+++++|++|||||++|+|+||||++.||++.. 
...|.+++ T Consensus 152 f~~~~~~~~-----~g~~~~~~lq~ie~d~l~~~~~~~~~~~i~~GIe~d~~Gr~~aY~i~~~~~~~~~---~~~~~~~~ 223 (533) T protein:vir:34 152 FVQATWDTS-----SSRLFRTQFRMVSPKRISNPNNTGDSRNCRAGVQINDSGAALGYYVSEDGYPGWM---PQKWTWIP 223 (533) T ss_pred EEEeeeccC-----CCCccceEEEEechhhcCCCCCCCCCCceEeeeEECCCCCeEEEEEeecCCCCcc---ccccceee Confidence 999998763 3567999999999999999999999999999999999999999999998766543 35788999 Q ss_pred ccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCccccccccccccccccc Q lcl|NC_019524. 241 ARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFK 320 (556) Q Consensus 241 ~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~ 320 (556) +++++||++|||+|+++||||+||||+|+|+|.+|+||++|.+|||++|+|+|||++|||++.++....+..+....+.. T Consensus 224 ~~~~v~a~~VlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~ 303 (533) T protein:vir:34 224 RELPGGRASFIHVFEPVEDGQTRGANVFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSAMDFILGANSQEQ 303 (533) T ss_pred eeeccChhHeeeeccccCCCcccCCchHHHHHHHHHHHHHHHHHHHHHHHHhhhheeeeecCCCcccccccccCCCcccc Confidence 99999999999999999999999999999999999999999999999999999999999999999888777665555444 Q ss_pred ccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccch Q lcl|NC_019524. 
321 EIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYS 400 (556) Q Consensus 321 ~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYS 400 (556) ...............+...+.|+||+|++|+||++|++++|++|+++|++|++.++|.||+|+|||||+||+|||++||| T Consensus 304 ~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~~~~f~~~~lr~iAaglGi~ye~lt~D~s~~nYS 383 (533) T protein:vir:34 304 RERLTGWIGEIAAYYAAAPVRLGGAKVPHLMPGDSLNLQTAQDTDNGYSVFEQSLLRYIAAGLGVSYEQLSRNYAQMSYS 383 (533) T ss_pred cccccccchhhhhccCcceeeccCceeeecCCCCeeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhhhcccccHH Confidence 33333333334455566677899999999999999999999999999999999999999999999999999999999999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhh Q lcl|NC_019524. 401 SARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKE 480 (556) Q Consensus 401 s~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke 480 (556) |+|++++++|+.++.+|++|+.+||+|||++||++|+++|.|++|++.... ++..+.+|++|+|++|+|+||||+|| T Consensus 384 S~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~ail~G~i~~p~~~~~~---~~~~~~~~~~~~w~~p~~~~iDP~Ke 460 (533) T protein:vir:34 384 TARASANESWAYFMGRRKFVASRQASQMFLCWLEEAIVRRVVTLPSKARFS---FQEARSAWGNCDWIGSGRMAIDGLKE 460 (533) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCCCccCCC---chhhHHhhhceeeccCCccccChHHH Confidence 999999999999999999999999999999999999999999999987533 25778999999999999999999999 Q ss_pred hHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCc Q lcl|NC_019524. 
481 TEAAILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEE 553 (556) Q Consensus 481 ~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e 553 (556) ++|++++|++|++|++++|+++|.||++|++|+++|+++++++||+++.++...+.+...+.++++.++...- T Consensus 461 ~~a~~~~i~~G~~s~~~~~a~~G~D~~ev~~q~a~e~~~~~~~gl~~~~~~~~~~~s~~~~~~~~~~~~~~~~ 533 (533) T protein:vir:34 461 VQEAVMLIEAGLSTYEKECAKRGDDYQEIFAQQVRETMERRAAGLKPPAWAAAAFESGLRQSTEEEKSDSRAA 533 (533) T ss_pred HHHHHHHHHcCCCCHHHHHHHcCCCHHHHHHHHHHHHHHHHhcCCCCCCCCCcCccCCCCCCCCCCcccCCCC Confidence 9999999999999999999999999999999999999999999999987765443333333222222111111 No 4 >protein:vir:96738 Length: 505 # NCBI annotation: putative phage-related protein # Family: family:all:47 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039817;genbank:gi:126010916;genbank:GeneID:5076248 Probab=100.00 E-value=5.8e-167 Score=931.94 Aligned_cols=495 Identities=18% Similarity=0.206 Sum_probs=422.0 Q ss_pred hhHHHHHhh-----HhhcccchhhhhhhhcchhccccCCCcccccc--cCCCCCHHHHHHHHHHHHHHHHHHHHhcChHH Q lcl|NC_019524. 7 TTRTRAKKA-----VDVVAETATATPMAVGGGMEGAERTTREMFQW--NPSIISPDQQIAQNQDMASARAQDMVQNDGYA 79 (556) Q Consensus 7 ~~r~~a~~a-----~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w--~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a 79 (556) +.|.+.+.. +.-..+...+..+...++|+||+ +++++++| +|...|+|+++..++.+||+|||||++|||+| T Consensus 1 ~~r~~~~~~~~dr~i~~~~~~~~~~~~~~~~~y~aa~-~~r~~~~w~~~~~~~s~~~~i~~~~~~lr~RaRdL~rNn~~a 79 (505) T protein:vir:96 1 MKRAEKKPSLAQRMVNWAWYRYVEPQKNAARAFEAAR-RDRLGKAWLRRASRLSADEEIYADLASLVQRAREQSINNPYA 79 (505) T ss_pred CCCCccccchhhcccchhhhhhHHHHHHhhhhccccc-CCCccccccCCCCCCChHHHHHHHHHHHHHHHHHHHhcChHH Confidence 222222211 11111111222233446799886 47888999 67899999999999999999999999999999 Q ss_pred HHHHHHHHhhhcc-CCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecC Q lcl|NC_019524. 
80 AGVVAVHRDSIVG-SQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTG 158 (556) Q Consensus 80 ~~~v~~~~~nvVG-~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dG 158 (556) +++|+++++|||| +||+|+++++...++. +++++++|+++|+.|++++ +||++|++||++||+|++|++++|| T Consensus 80 ~~av~~~~~nvVG~~Gi~~~~~~~~~~~~~----~~~~~~~ie~~w~~Wa~~~--~~D~~g~~~f~~lq~l~~r~~~~dG 153 (505) T protein:vir:96 80 KRFYQLLKNNVIGPKGMTFQSRVKRRNGKP----DDRANTLIEGNWQQWIKKG--NCDVTGRYHFVTLLHLWMETLARDG 153 (505) T ss_pred HHHHHHHHHHhcCCCcceeeecCCcccccc----cHHHHHHHHHHHHHhcCCc--CcceeccCCHHHHHHHHHHHHhhCC Confidence 9999999999999 6999999998765554 5568999999999999876 4999999999999999999999999 Q ss_pred ceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCC--CCCceEEEEEEECCCCCeEEEEEeecCCCccccCCcccc Q lcl|NC_019524. 159 EVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNV--MDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKW 236 (556) Q Consensus 159 E~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~--~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~ 236 (556) |||+++++.+ +.++||+||+||||+|++|+|. .++++|++|||||++|+|++|||++.||||.+...... T Consensus 154 E~f~~~~~~~-------~~~~~~~lqliepd~l~~~~n~~~~~~~~i~~GIe~d~~Gr~~aY~i~~~hPgd~~~~~~~~- 225 (505) T protein:vir:96 154 EVLVREHRGY-------PNKWGYALQILECDRLDLNYNADLQNGNRIRMSIELDAWERPVAYHLLVNHPGDNSYCYHYA- 225 (505) T ss_pred ceEEEEeecC-------CCCcceEEEEechhhcCCCCCcccCCcCeEEeceEECCCCceEEEEEeecCCCccccccccc- Confidence 9999987643 4579999999999999999875 47889999999999999999999999999976543221 Q ss_pred ceeeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCccccccccccccc Q lcl|NC_019524. 237 GYEPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQ 316 (556) Q Consensus 237 ~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~ 316 (556) ...+++|||++|||+|+++||||+||||+|+|||.+|+||++|.+|||++|+|+|||++|||++.++.... 
T Consensus 226 --~~~~~rvpa~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~------- 296 (505) T protein:vir:96 226 --GQTYERVPADEIIHTFVPWRPHQNRGIPWTHASMVELHHIGEYRKSEMIAAELGAKKVGFYEQDPEAYDQP------- 296 (505) T ss_pred --cccccccCHhHhhhhhcccCCccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCccCCCc------- Confidence 12456788999999999999999999999999999999999999999999999999999999876543211 Q ss_pred ccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhc Q lcl|NC_019524. 317 GGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTK 396 (556) Q Consensus 317 ~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~ 396 (556) ..+..+...+.|+||+|.+|+||++|++++|++|+++|++|++.++|.||+|+|||||+|++|||+ T Consensus 297 --------------~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~~~~f~~~~lr~iaaglgi~ye~lt~D~s~ 362 (505) T protein:vir:96 297 --------------PEDDQGEIVEEVEAGTYQLLPYGIRFKEHKIDHPHTNFGAFVKSSLRGVAAGMGPAYNRLAHDLEG 362 (505) T ss_pred --------------cccccCccccccCCceeeecCCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhccccc Confidence 011233456679999999999999999999999999999999999999999999999999999999 Q ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccc Q lcl|NC_019524. 397 TNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQID 476 (556) Q Consensus 397 ~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iD 476 (556) +||||+|++++++|+.|+.+|++|+.+||+|||++||++++++|+|++|++. 
+..|++|.|++|+|+||| T Consensus 363 ~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~----------~~~~~~~~w~~p~~~~iD 432 (505) T protein:vir:96 363 VNFSSLRSGELDERDLYKLLQFFVVTELLERVAGNLISMSLLTQALPLNMVD----------IDRLSQYAFQPRGWDWVD 432 (505) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCcc----------chhhceeeeccCCccccC Confidence 9999999999999999999999999999999999999999999999999863 356899999999999999 Q ss_pred hhhhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCC Q lcl|NC_019524. 477 EKKETEAAILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDN 549 (556) Q Consensus 477 P~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~ 549 (556) |+||++|++++|++|++|++++|+++|.||++|++|+++|+++++++||.++.++....++....++++.+|+ T Consensus 433 P~Ke~~a~~~~i~~G~~t~~~~~a~~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~d~ 505 (505) T protein:vir:96 433 PAKDSKAHSESIKNRTRSRSSIIRAAGDDPEDVFDEIAWEEQLMRDKGVNPTPPEQESKDATTDEEDDSASDD 505 (505) T ss_pred hHHHHHHHHHHHHcCCCCHHHHHHHcCCCHHHHHHHHHHHHHHHHHcCCCCCCCCCCCCCCCCCCCCCCCCCC Confidence 9999999999999999999999999999999999999999999999999876555433333222222222222 No 5 >protein:vir:79538 Length: 502 # NCBI annotation: putative portal protein # Family: family:all:47 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272517;genbank:gi:148609386;genbank:GeneID:5204374 Probab=100.00 E-value=3.9e-166 Score=927.40 Aligned_cols=490 Identities=19% Similarity=0.236 Sum_probs=438.4 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAA 80 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~ 80 (556) +||.++++|.++|.. .++|+||+. 
+| ..+|.+..+|++.++..++.+||+|||||++|||+|+ T Consensus 11 ~sP~~~~~R~~ar~~---------------~~~y~aa~~-~r-~~~~~~~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~ 73 (502) T protein:vir:79 11 FSPGWKAARLRSRAV---------------IQAYEAVKT-TR-THKARRENRTADQLSQYGAVSLREQARYLDNNHDLVI 73 (502) T ss_pred cChHHHHHHHhhHHH---------------HhhccccCc-cc-ccCCCCCCCChHHHHHHHHHHHHHHHHHHHhcChHHH Confidence 888888877776543 246998864 55 4578899999999999999999999999999999999 Q ss_pred HHHHHHHhhhccC-CceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCc Q lcl|NC_019524. 81 GVVAVHRDSIVGS-QYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGE 159 (556) Q Consensus 81 ~~v~~~~~nvVG~-Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE 159 (556) ++|+++++||||+ ||+|+++++.. +.+++++++++|+++|++|++ .||++|++||++||+|++|+++++|| T Consensus 74 ~av~~~~~nvVG~ggi~~~~~~~~~----~~~~~~~~~~~ie~~w~~Wa~----~~D~~g~~~f~~~q~l~~r~~~~dGE 145 (502) T protein:vir:79 74 GVFDKLEERVVGKNGIIVEPHPVLR----NGAIARDLAAEIRTRWSEWSV----SPEVTGQFTRPMLERLMLRTWLRDGE 145 (502) T ss_pred HHHHHHHHhhccCCceeeeeccCCC----ChhHHHHHHHHHHHHHHHhhc----CcCccccCCHHHHHHHHHHHHHhCCc Confidence 9999999999998 89999998654 556788999999999999997 39999999999999999999999999 Q ss_pred eEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCcccccee Q lcl|NC_019524. 160 VLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYE 239 (556) Q Consensus 160 ~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv 239 (556) ||+++++.+. +.+++++++||+||+||||+|++|.+ ++++|++|||||++|+|++|||++.||||... 
.+| T Consensus 146 ~f~~~~~~~~-~~~~~g~~~~l~lq~iepd~l~~~~~--~~~~i~~GVe~d~~Gr~~aY~i~~~hPgd~~~---~~~--- 216 (502) T protein:vir:79 146 VFAQMVSGRI-NSLTPSAGVHFWLEALEPDFIPMTSD--ESNRLNQGVFVDDWGRPEKYLVYKSRPVSGRQ---MET--- 216 (502) T ss_pred eEEEEeeccc-CccCCCcccceEEEEecchhcCCCCC--CCCeeEeeeEECCCCceEEEEEeecCCCCCcc---cce--- Confidence 9999998654 35678999999999999999999876 57899999999999999999999999998532 334 Q ss_pred eccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccc Q lcl|NC_019524. 240 PARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGF 319 (556) Q Consensus 240 ~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~ 319 (556) .+|||++|||+|+++||||+||||+|+|+|..|+||++|.+|||++|+|+|||++|||++.++.......+ T Consensus 217 ---~rvpA~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~------ 287 (502) T protein:vir:79 217 ---KEVDAERMLHLKFVRRLHQMRGTSLLSGVLIRLSALKEYEDSELTAARIAAALGMYIRKGDGQSYEPDGNG------ 287 (502) T ss_pred ---eEechhheEEeecccCCccccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCCcccccccCC------ Confidence 45677899999999999999999999999999999999999999999999999999999887654433221 Q ss_pred cccccccccccccccccccceecCCcee-eecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhccc Q lcl|NC_019524. 320 KEIFNEYMTGLANYVAQTKNIAIDGAKI-PHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTN 398 (556) Q Consensus 320 ~~~~~~~~~~~~~~~~~~~~~~l~pG~i-~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~n 398 (556) +..+...+.|+||+| .+|.||++|++++|++|+++|++|++.++|.||+|+|||||+||+|||. 
| T Consensus 288 -------------~~~~~~~~~l~pG~i~~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~ye~lt~D~s~-n 353 (502) T protein:vir:79 288 -------------SKENERELTIQPGIIYDDLKPGEEIGMVKSDRPNPNLETFRNGQLRAVAAGSRLSFSSTARNYNG-T 353 (502) T ss_pred -------------CCCccccccccCCccccccCCCceeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhccccc-h Confidence 112344567999986 5799999999999999999999999999999999999999999999986 9 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchh Q lcl|NC_019524. 399 YSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEK 478 (556) Q Consensus 399 YSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~ 478 (556) |||+|++++++|+.|+.+|++|+.+||+|||++||++++++|+|++|++.. +.+|++|+|++|+|+||||+ T Consensus 354 ySs~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~---------~~~~~~~~W~~p~~~~iDP~ 424 (502) T protein:vir:79 354 YSAQRQELVESTDGYLILQDWFIGAVTRPMYRAWLKQAVASGVIRLPRDLD---------RSSLYTAVYSGPVMPWIDPV 424 (502) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCCCC---------chhhcceeeecCCccccChH Confidence 999999999999999999999999999999999999999999999998743 56899999999999999999 Q ss_pred hhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 
479 KETEAAILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 479 Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 556 (556) ||++|++++|++|++|++++|+++|.||++|++|+++|+++++++||+++.++...+.+...+.+.+++.+++++++| T Consensus 425 Ke~~a~~~~i~~Gl~t~~~~~a~~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~e~~~~~~~~e~ 502 (502) T protein:vir:79 425 KEAEAWKIQIRGGAATESDWVRAGGRNPDDVKRRRKAEIDENRKLDLVFDTDPASDKGGSSAATKRQEPQHTDDQSEE 502 (502) T ss_pred HHHHHHHHHHHcCCCCHHHHHHHcCCCHHHHHHHHHHHHHHHHHcCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Confidence 999999999999999999999999999999999999999999999999999887776666666666666666666666 No 6 >protein:vir:95542 Length: 548 # NCBI annotation: Putative portal protein # Family: family:all:47 # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293348;genbank:gi:148912769;genbank:GeneID:5228194 Probab=100.00 E-value=6.1e-166 Score=926.36 Aligned_cols=494 Identities=19% Similarity=0.225 Sum_probs=431.0 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAA 80 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~ 80 (556) +||.++++|.++|.. .++|+||+. +|+...| +...|+|.++..++.+||+|||||++|||+|+ T Consensus 11 ~sP~~a~~R~~ar~~---------------~~~y~aa~~-~r~~~~~-~~~~s~~~~i~~~~~~lr~RaRdL~rNn~~a~ 73 (548) T protein:vir:95 11 LAPELVARRLAAREA---------------IQAYEAARP-GRTHKAK-RQPLGADTSLQKSAVSMREQCRKLDEDHDLVT 73 (548) T ss_pred cchHHHHHHHHhHHH---------------hccccccCc-ccccccc-CCCCChHHHHHHHHHHHHHHHHHHHhcChHHH Confidence 777777777666542 257999976 5666666 45789999999999999999999999999999 Q ss_pred HHHHHHHhhhcc-CCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCc Q lcl|NC_019524. 
81 GVVAVHRDSIVG-SQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGE 159 (556) Q Consensus 81 ~~v~~~~~nvVG-~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE 159 (556) ++|+++++|||| .|+.+++++ |++|++++++|+++|+++|++||++ ||++|++||++||+|++|+++++|| T Consensus 74 ~av~~~~~nvVG~~G~~i~p~~----l~~d~~~a~~l~~~ie~~w~~Wa~~----~D~~g~~~f~~lq~l~~R~~~~dGE 145 (548) T protein:vir:95 74 GLLDRLEERVVGGSGIGVEPLP----LRLDGSVHAELAMEIRSAWAEWSLS----PETSGELTRPQVERLMCRTWLRDGE 145 (548) T ss_pred HHHHHHHHhccCccccceeeee----cCCCHHHHHHHHHHHHHHHHHhhcC----ccccccCCHHHHHHHHHHHHHhCCc Confidence 999999999999 588888776 8999999999999999999999963 9999999999999999999999999 Q ss_pred eEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCcccccee Q lcl|NC_019524. 160 VLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYE 239 (556) Q Consensus 160 ~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv 239 (556) ||++++|.+.. +...+.++||+||+||||+|++|++.. +++|++|||||++|+|+||||++.||||.+.... . T Consensus 146 ~f~~~~~~~~~-~~~~g~~~~~~lqliepd~l~~~~~~~-~~~i~~GIE~D~~Grp~aY~i~~~hPgd~~~~~~-----~ 218 (548) T protein:vir:95 146 GLAQKLMGRVP-NYTFATSVPFALELLEPDYLPFSYNNL-SKGIVQGIERDTWRRKRAYHLLKDHPGNLQTLGG-----S 218 (548) T ss_pred eEEEeeecccc-cccCCcccceEEEEechhhcCCCCCCC-CCceeeeeEECCCCceEEEEEeecCCCccccccc-----c Confidence 99999997654 345788999999999999999998865 5689999999999999999999999999654321 2 Q ss_pred eccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccc Q lcl|NC_019524. 240 PARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGF 319 (556) Q Consensus 240 ~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~ 319 (556) ..+++|||++|||+|+++|+||+||||||+|||.+|+||++|.+|||++|+|+|||++|||++.+++...+. 
T Consensus 219 ~~~~rvpA~~VlHif~~~r~gQ~RGvs~lapvl~~l~~l~~y~dael~~aki~A~~a~fi~~~~~~~~~~~~-------- 290 (548) T protein:vir:95 219 LAVKRVEAERIIHIAYRKRIGQNRGVPMLHAVLIRLADLKDYEESERVAARISAALAMYIKKGNPDSYTVEP-------- 290 (548) T ss_pred cceeeechhHheecccccCCccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCCccccCCC-------- Confidence 345567889999999999999999999999999999999999999999999999999999998876543321 Q ss_pred cccccccccccccccccccceecCCcee-eecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhccc Q lcl|NC_019524. 320 KEIFNEYMTGLANYVAQTKNIAIDGAKI-PHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTN 398 (556) Q Consensus 320 ~~~~~~~~~~~~~~~~~~~~~~l~pG~i-~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~n 398 (556) ....+...+.|+||+| .+|.||++|++++|++|+++|++|++.++|.||+|+|||||+||+||| +| T Consensus 291 ------------~~~~~~~~~~~~pG~iv~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~IAaglGipYe~ltgD~s-~n 357 (548) T protein:vir:95 291 ------------GKDRKNRTIPIAPGMVFDDLEPGEDVGMIESNRPNPFLEGFRNGQLRMIGAGTRSTYSSVSRAYD-GT 357 (548) T ss_pred ------------CcccccccccccCCccccccCCCceeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccc-hh Confidence 1123455667999996 579999999999999999999999999999999999999999999998 59 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchh Q lcl|NC_019524. 
399 YSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEK 478 (556) Q Consensus 399 YSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~ 478 (556) |||+|++++++|+.|+.+|++||.+||+|||++||++|+++|+|++|++.+ +.+|++|+|++|+|+||||+ T Consensus 358 YSS~R~~l~e~~r~~~~~q~~~i~~~~~Pi~~~wle~a~l~G~i~lP~~~~---------~~~~~~~~W~~P~~~~iDP~ 428 (548) T protein:vir:95 358 YSAQRQELVEGWLGYDLLQHEFIDYWCRPVYRSWLQMYLLARKERLPADVD---------HRTLYAAVYQGPVMPWINPM 428 (548) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCCCC---------chhheeeeeecCCccccChH Confidence 999999999999999999999999999999999999999999999998743 46799999999999999999 Q ss_pred hhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCC-CCCCCCCCC---------- Q lcl|NC_019524. 479 KETEAAILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQ-SSNSSESTS---------- 547 (556) Q Consensus 479 Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~-~~~~~~~~~---------- 547 (556) ||++|++++|++|++|++++|+++|.||++|++|+++|+++++++||++++++....... ..++++.+. T Consensus 429 Kea~A~~~~i~~Gl~T~~~~~a~~G~D~~ev~~q~a~E~~~~~~~GL~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 508 (548) T protein:vir:95 429 HEANAWELLVKAGFADEAEVARARGRDPRELKKSRETEIKANRAAGLVFSSDAYHQLVKSGMDPVEAVQKVYLGVGKMLT 508 (548) T ss_pred HHHHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCCcccccccccccCCCCchhhhccccccccc Confidence 999999999999999999999999999999999999999999999999988876544332 222222211 Q ss_pred -CCCCCcCCC Q lcl|NC_019524. 
548 -DNPNEETTQ 556 (556) Q Consensus 548 -~~~~~e~~~ 556 (556) |+.+||-++ T Consensus 509 ~~~~~~~~~~ 518 (548) T protein:vir:95 509 ADEARELVNR 518 (548) T ss_pred cchhHHhhcc Confidence 222222222 No 7 >protein:vir:10321 Length: 495 # NCBI annotation: ORF23 # Family: family:all:47 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758916;genbank:gi:27311190;genbank:GeneID:956137 Probab=100.00 E-value=1.8e-163 Score=912.80 Aligned_cols=490 Identities=17% Similarity=0.223 Sum_probs=421.0 Q ss_pred chhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHH Q lcl|NC_019524. 4 VKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVV 83 (556) Q Consensus 4 ~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v 83 (556) |+-.++- ..++.+.++.+...++|+||+. +++.+.|. ..|+|.++..++.+||+|||||++|||||+++| T Consensus 1 m~~~~~~-------~~a~~~~~~~~~~~~~y~aa~~-~~~~~~~~--~~s~d~~~~~~~~~lr~RaRdl~rNn~~a~~av 70 (495) T protein:vir:10 1 MNMTPSG-------YQSLASGLLVPVGASAYEGASG-GHRWQDIG--DYGPDTAVASGIQTLRARSHHNVRNNPWATNAV 70 (495) T ss_pred CCccccc-------ccccchhhhhHHHhhhhhcccc-CcccCCCC--CCChhHHHHHHHHHHHHHHHHHHhcChHHHHHH Confidence 2222221 1112222223334467999976 55566663 678999999999999999999999999999999 Q ss_pred HHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEE Q lcl|NC_019524. 
84 AVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLAT 163 (556) Q Consensus 84 ~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~ 163 (556) +++++||||+||+|+++++ +++++++|+++|++|++ .||++|++||++||+|++|+++++||||++ T Consensus 71 ~~~~~~vVG~Gi~p~~~~~----------~~~~~~~ie~~w~~wa~----~~D~~g~~~f~~lq~l~~r~~~~dGE~f~~ 136 (495) T protein:vir:10 71 ATWVAAAVGNGLTPRWRMK----------EQELRQELQELWGDWVN----EADFDEVQSFYGLQALVVRTVINSGEAFVI 136 (495) T ss_pred HHHHHhhcCCCcccccCCc----------hHHHHHHHHHHHHHhhc----CcccccccCHHHHHHHHHHHHHhCCceEEE Confidence 9999999999999999875 56899999999999996 399999999999999999999999999999 Q ss_pred EeeccCCCCcCCCcccceEEEEEchhhcCCCCCC---CCCceEEEEEEECCCCCeEEEEEeecCCCccccCC-cccccee Q lcl|NC_019524. 164 CEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNV---MDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDME-QWKWGYE 239 (556) Q Consensus 164 ~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~---~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~-~~~~~rv 239 (556) +++.+ ..++.++||+||+||||+|++|++. +++++|++|||||++|+|++|||++.||||.+... ..+| T Consensus 137 ~~~~~----~~~g~~~~~~lqliepd~l~~~~~~~~~~~g~~i~~GIe~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~--- 209 (495) T protein:vir:10 137 KKPRP----LSEGLSVPLQLQIIEPDMLASDIPDETLPSGGYVKGGIRFSNGGKRKAYCFYRNHPAESSLIGDPVDT--- 209 (495) T ss_pred Eeecc----cCCCCccceEEEEechhhcCCCCCCCCCCCCCEEEeceEECCCCceEEEEEeecCCCcccccccccce--- Confidence 98865 3567899999999999999999875 46889999999999999999999999999965432 2334 Q ss_pred eccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccc Q lcl|NC_019524. 240 PARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGF 319 (556) Q Consensus 240 ~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~ 319 (556) ++|||++|||+|. +||||+||||+|+|++ +|+||++|++|||++|+|+|||++|||++.+++......+... 
T Consensus 210 ---~rvpA~~vlH~f~-~r~gQ~RGis~la~i~-~l~~l~~y~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~--- 281 (495) T protein:vir:10 210 ---VWIKAEHVLHVTV-LTVRSDAGAPWFQLLL-RLNELDQYEDAELVRKKTAALFAAFIQEATADSTGGPTIGQPK--- 281 (495) T ss_pred ---eeechhheEeccc-cCCCcccCcchhHHHH-HHHHhhHHHHHHHHHHHHhhhheeeeecCCCccccccccCccc--- Confidence 4577889999996 7999999999999855 7999999999999999999999999999988766554433221 Q ss_pred cccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccc Q lcl|NC_019524. 320 KEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNY 399 (556) Q Consensus 320 ~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nY 399 (556) .+..+...+.|+||+|++|.||++|++++|++|+++|++|++.+||.||+|+|||||+||+|||++|| T Consensus 282 ------------~~~~~~~~~~l~pG~i~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~Ye~ltgD~s~~nY 349 (495) T protein:vir:10 282 ------------RSKGGKRITGLNPGTLQYLQPGQEVKFSNPADVGTTYEPWLRYQLLSIAKGYGITYEMLTGDLRGVNY 349 (495) T ss_pred ------------cccCcccceecCCceeeecCCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccccH Confidence 12234556789999999999999999999999999999999999999999999999999999999999 Q ss_pred hhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchh Q lcl|NC_019524. 400 SSARASMAETQKYMDSRKK-LVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEK 478 (556) Q Consensus 400 Ss~R~~~~e~~r~~~~~q~-~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~ 478 (556) ||+|++++++|+.++.+|+ ++|.+||+|||++||++++++|+|++|++. 
..+.+|++|+|++|+|+||||+ T Consensus 350 SS~R~~~~e~~r~~~~~q~~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~--------~~~~~~~~~~w~~p~~~~vDP~ 421 (495) T protein:vir:10 350 SSIRAGLLEFRRLCQQVQHHMIIHQFCRPVGRWFMDFAVASGAVVIPDYL--------QRRRYYNRVSWRTPRWEEVDPL 421 (495) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCch--------hhhHhhhccccccCCccccChH Confidence 9999999999999999886 689999999999999999999999999864 3578999999999999999999 Q ss_pred hhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCC Q lcl|NC_019524. 479 KETEAAILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNE 552 (556) Q Consensus 479 Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~ 552 (556) ||++|++++|++|++|++++|+++|.||++|++|+++|+++++++||++++++...+.++..+.+.++..++++ T Consensus 422 Ke~~A~~~~i~~G~~s~~~~~a~~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~p~~~~~~~~~~~~~~~~~~~~e 495 (495) T protein:vir:10 422 KKHLADLGDVRAGFAPISDKQAERGYDMEELFDMISDANQLIDEYDLRLDSDPRYVNGSGAEQKSVMEAALNNE 495 (495) T ss_pred HHHHHHHHHHHcCCCCHHHHHHHcCCCHHHHHHHHHHHHHHHHHcCCCCCCCCCcCCCccCCCCCCCCCCCCCC Confidence 99999999999999999999999999999999999999999999999999887766554444333333333333 No 8 >protein:vir:3153 Length: 467 # NCBI annotation: capsid protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665924;genbank:gi:22091110;genbank:GeneID:951257 Probab=99.94 E-value=7.5e-27 Score=163.80 Aligned_cols=419 Identities=11% Similarity=0.025 Sum_probs=221.7 Q ss_pred HHHHHhcChHHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehh----cccCHH Q lcl|NC_019524. 69 AQDMVQNDGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDAR----RMCTLT 144 (556) Q Consensus 69 aRdl~rNn~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~----g~~~f~ 144 (556) -|+|++||+++.++|+.+.++|.|.+|.+..+.+... ...-...++..+..|-.... .+..+ ..+++. 
T Consensus 1 l~~l~~~n~~v~~ci~~ia~~ia~~p~~i~~~~~~~~-------~~~~~~~~~~~~~~l~~~~p-n~~~~~~~~~~~t~~ 72 (467) T protein:vir:31 1 MAELLEHNETHAKCVHAKSRYVAGFGINIIPHPEAED-------PDRDGEQYERVWDFWFGDDS-NWQVGPMESERATAT 72 (467) T ss_pred ChhhhhcCHHHHHHHHHHHHhhhcCCeEEEEccCccc-------ccchhhhhhhHHHHhhccCC-CccccchhhHhhHHH Confidence 8899999999999999999999999999877643211 11122344444444332210 12222 235778 Q ss_pred HHHHHHhhhheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCC------CCCceEEEEEEECCCCCeEEE Q lcl|NC_019524. 145 GLTRLAVSGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNV------MDTPNLRSGVQLDNNGAALGY 218 (556) Q Consensus 145 ~lq~l~~r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~------~~g~~i~~GIE~d~~Gr~vaY 218 (556) ++...++..++..|.+|+.+++... ..++.|..|+|+.|..-.+. ..+..+..++-.+.+...... T Consensus 73 ~~~~~~~~~l~l~Gn~~i~~~r~~~--------G~~~~l~~l~~~~v~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 144 (467) T protein:vir:31 73 NVLQTAWTDYEAIGWLTIEILTQTD--------GTPTGLAYVPGHTIRKRMDERGFVQLLEEKEKYFGVAGDRYQTNGNG 144 (467) T ss_pred HHHHHHHHHHHhcCCeEEEEEECCC--------CcEEEEEEeCCceeEeeeecceeEeecCCceeeEEeccccceeeccc Confidence 8889999999999999998875322 23678999999998642211 111111111111111111111 Q ss_pred EEeecCCCccccCCccccceeeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHH---hcce Q lcl|NC_019524. 219 WLRKAFPGDPTDMEQWKWGYEPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVV---NATY 295 (556) Q Consensus 219 ~i~~~hpgd~~~~~~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i---~A~~ 295 (556) ++...+........+ ....+|+.+|||+..+...++.+|+|++..++..+. -...++..+++. ++.. 
T Consensus 145 ~~~~~~~~~~~~~~~-------~~~~~~~~diih~r~~~~~~~~~G~s~~~~~~~~i~---~~~~~~~~~~~~f~ng~~p 214 (467) T protein:vir:31 145 DLDPVFVDADDGSTG-------TSVSNPANELIFKRNHSPLYPHYGAPDIIPAVKTIR---GDSAAQDYNIDFFENDGVP 214 (467) T ss_pred ceeeeeeeecccccc-------ceeEeccccEEEecCCCCCCCcccccHHHHHHHHHH---HHHHHHHHHHHHHhccCCC Confidence 111111111111011 123578899999999998999999999998877654 344444444443 4556 Q ss_pred eeeEeccCc--ccccccccccccccccccccccccccccccccc----cceecCCceeeecCCCce-------eeeecCC Q lcl|NC_019524. 296 AASVESELP--SDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQT----KNIAIDGAKIPHLYPGTK-------LKMQPAG 362 (556) Q Consensus 296 ~~fi~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~l~pG~i~~L~pGe~-------i~~~~~~ 362 (556) .++|+.+.+ +..... ................ ....-..|....|..|.+ ++.+++. T Consensus 215 ~gil~~~~~~l~~e~~~-----------~~~~~~~~~~~~~~~~~~~~~~g~~n~~~~~~l~~g~~~~~~~~~~~~ls~~ 283 (467) T protein:vir:31 215 RIAIIVKGAELTEKGRE-----------EMRNLIEDNNEDNHRTAFIETEKIVQNEDYLNLADGADRSDVEIRLEPLTVG 283 (467) T ss_pred ceEEEecCcCCCHHHHH-----------HHHHHHHhhhcchhhhhhhhhcccccccccccccCCCcccccceeEEecccc Confidence 677764322 111100 0000000000000000 000113444555555554 3443332 Q ss_pred CC-CccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcC Q lcl|NC_019524. 363 TP-GGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNY-SSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAG 440 (556) Q Consensus 363 ~p-~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nY-Ss~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G 440 (556) .+ ...|.++.+.+...||+.+|||... .|+.+++|| |++.+....|+ ...++|+.+.| +.++-.. 
T Consensus 284 ~~~d~qf~e~~~~~~~~Ia~~fgVpp~~-lG~~~~~~~~s~~e~~~~~f~-----------~~~l~P~~~~i-e~~ln~~ 350 (467) T protein:vir:31 284 IDEEASFLEFRGRNEHDILKVHDVPPVI-AGVVESGAFSTDAEEQRKEFA-----------EETIQPKQHDF-GELLYEL 350 (467) T ss_pred ChhhHHHHHHHHHHHHHHHHHhCCCHHH-cccCCCCCcccCHHHHHHHHH-----------HHHHHHHHHHH-HHHHHHh Confidence 22 4688999999999999999999875 488877787 45555544443 33456666554 3334333 Q ss_pred CccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHH Q lcl|NC_019524. 441 NVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLI 520 (556) Q Consensus 441 ~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~ 520 (556) .++ .... .....+++..-.....|+.+.+++...++++|+.|+.|+-+..|++|-. + + ... T Consensus 351 l~~--~~~~----------~~~~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~pi~--d----~-~~~ 411 (467) T protein:vir:31 351 VHK--QGLD----------APDWTIEFELAKPDTKLQDVEIASQRVQAMQGLLTVNELRDEFGFEPFP--E----E-HVY 411 (467) T ss_pred hcc--hhhc----------cCCceEEEecchhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC--c----c-ccc Confidence 221 1000 0011235555566678999999999999999999999999999998731 1 0 000 Q ss_pred HHcCCCCC----ccccccCCCCCCCCCCCCCCC------CCCcCCC Q lcl|NC_019524. 521 KSLKLDFT----GKMVEGNSTQSSNSSESTSDN------PNEETTQ 556 (556) Q Consensus 521 ~~~Gl~~~----~~~~~~~~~~~~~~~~~~~~~------~~~e~~~ 556 (556) ...-+... 
..+.........+..+++.++ .+-|++| T Consensus 412 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 457 (467) T protein:vir:31 412 GGETLVAEVTGGSGPGGGIGDQIEQLVEDRADEIIDSYQADLETEQ 457 (467) T ss_pred CCcccccccccccCCCCcccCcCCCCCCCcccchHhhhhhccccch Confidence 00000000 000000000000000000000 0011111 No 9 >protein:vir:7853 Length: 518 # NCBI annotation: gp10 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817460;genbank:gi:29565889;genbank:GeneID:1259085 Probab=99.84 E-value=1.1e-21 Score=135.39 Aligned_cols=427 Identities=10% Similarity=0.038 Sum_probs=237.9 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAA 80 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~ 80 (556) ++-..- ..+++.+...-...-.|. |.+..+. ...+..+-....+.++|.+. T Consensus 3 ~~~~~~-----------~~~p~~~~~~~~~~~~~~-----------~~~~~g~-------~~~~~~~~~~~~~~~~~~V~ 53 (518) T protein:vir:78 3 LANGQT-----------LSAPAMAELSPQMQDSYY-----------YAPAVGM-------QLERQFSLYGGIYKNQPWVR 53 (518) T ss_pred ccCcee-----------eccchhhhhhhhhhhccc-----------ccceece-------ecccccchhhHHhhhhHHHH Confidence 000000 000000000000000111 1111110 01111122223567899999 Q ss_pred HHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCce Q lcl|NC_019524. 81 GVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEV 160 (556) Q Consensus 81 ~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~ 160 (556) .+|+.|.+.|-+.-|++.-+... .. 
.+.....+.....+|+ -.+|..++...++..++..|++ T Consensus 54 acV~~IA~~iA~lp~~l~~~~~~------~~-----~~~~~~~~~~Ll~~PN------~~~t~~~F~~~lv~~lll~Gna 116 (518) T protein:vir:78 54 TVIAKRAQALARLPVKCMFTSGD------TE-----TEEHDTGYAKLLADPC------EYLDPFAFWEWVASTLDIYGET 116 (518) T ss_pred HHHHHHHHhhccCceEEEEEcCC------cc-----ccccchHHHHHHhCCC------CCCCHHHHHHHHHHHHhhcCCe Confidence 99999999999877776543221 10 0111222333445665 3578899999999999999999 Q ss_pred EEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceee Q lcl|NC_019524. 161 LATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEP 240 (556) Q Consensus 161 f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~ 240 (556) |+.+.+... ..+..|..|.|++|. |..|..+-.+.|++....+... T Consensus 117 y~~i~r~~~--------G~~~~L~~l~p~~Vt--------------v~~~~~~~~~~y~~~~~~~~~~------------ 162 (518) T protein:vir:78 117 YLAIQKNKS--------GTPEKLMPMHPSRVA--------------IKRNSRTGRYEYYFQAGAGVGT------------ 162 (518) T ss_pred EEEEEEcCC--------CcEEEEEEECCCceE--------------EEEcCCCCEEEEEEEecCCccc------------ Confidence 999865321 135688899998874 4445556667788764432210 Q ss_pred ccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCccccccccccccccccc Q lcl|NC_019524. 241 ARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFK 320 (556) Q Consensus 241 ~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~ 320 (556) ....+++.+|||+...-.-|..+|+|++..+...+.......+....-.+-.+...++|+.+..- ..+.. 
T Consensus 163 ~~~~~~~~eIiHir~~~~dg~~~G~Spi~~~~~~i~~~~aa~~~~~~~f~Ng~~p~gvl~~~~~l----------s~e~~ 232 (518) T protein:vir:78 163 QLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRL----------SPEAQ 232 (518) T ss_pred eeEEecCCcEEEecCCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecCCCC----------CHHHH Confidence 11246778999998877677889999999888888777777777666666678888898864220 01111 Q ss_pred ccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccch Q lcl|NC_019524. 321 EIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYS 400 (556) Q Consensus 321 ~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYS 400 (556) ........ ....+ .-..|.+..|..|.+++.++.+.-..+|.+..+.....||+.+|||-+.| |+.++.||| T Consensus 233 ~~~k~~~~---~~~~G----~~nag~~~vL~~G~~~~~l~~~~~d~q~le~r~~~~~eIa~afgVPp~~l-g~~~~st~s 304 (518) T protein:vir:78 233 QRLREQFD---RAHAG----SSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIV-HILDRATFS 304 (518) T ss_pred HHHHHHHH---HHhcC----cccCCceeEcCCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHh-ccCCCCCch Confidence 11000000 00000 01357788899999999999876778899999999999999999998866 899999999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhh Q lcl|NC_019524. 401 SARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKE 480 (556) Q Consensus 401 s~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke 480 (556) ++.+..+.|+ ...++|+..+ ++.++....++.-.... .++|-.-.....|.... 
T Consensus 305 n~e~~~~~f~-----------~~tL~P~~~~-ie~eln~~L~~~~~~~~--------------~~~fd~~~Llr~D~~~r 358 (518) T protein:vir:78 305 NISAQMRAFY-----------RDTMAIPIAR-IQSAMDKYVGQYWVRKN--------------RMKFDIDDVIQPDWEAK 358 (518) T ss_pred hHHHHHHHHH-----------HHHHHHHHHH-HHHHHHHhhcccccCcc--------------eEEeechhhhccCHHHH Confidence 8876665554 3335554444 44444443322100000 12222223334599999 Q ss_pred hHHHHHHHHcCCCCHHHHHHHhCCCHHH--HHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 481 TEAAILRIKNGLSTYEAEISRLGGDFRE--VFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 481 ~~A~~~~i~~G~~s~~~~~ae~G~D~e~--v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 556 (556) +++....+++|+.|+-|+-+..|..|-+ -.+++-.-.. +..++..... ....+.++++....+.+..+.+| T Consensus 359 ~~~~~~~~~~G~lT~NE~R~~~gl~pie~~~gD~~~v~~n-~~pl~~~~~~----~~~g~~~~~~~~~~~~~~~~~~~ 431 (518) T protein:vir:78 359 SESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSA-LQPLGATPDG----AVEGEEAPAPKRPASTPVASLDQ 431 (518) T ss_pred HHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeeeccc-ceeccccccc----ccCCCCCCCCCCCCccccccccc Confidence 9999999999999999998888988754 2221100000 0000000000 00000000000001101000000 No 10 >protein:vir:3843 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050149;swissprot:trembl:q9t1f8;genbank:gi:9633041;uniprot:Q9T1F8;genbank:GeneID:1262206 Probab=99.83 E-value=8.8e-21 Score=130.54 Aligned_cols=397 Identities=11% Similarity=0.012 Sum_probs=229.5 Q ss_pred chhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHH Q lcl|NC_019524. 4 VKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVV 83 (556) Q Consensus 4 ~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v 83 (556) |.=.++...+. .+.+.++ ..|.....+... 
...-+.+-+..++-+..+| T Consensus 1 M~~f~~~~~~~--------------------~~~~~~~---~~~~~~~~~~~~--------~~~v~~~~al~~~~V~~~v 49 (397) T protein:vir:38 1 MPLLKLNKSHS--------------------QGFSLND---PDWVNFLTGGEA--------QKYVSADTALKNSDIFSLI 49 (397) T ss_pred Ccchhhhhccc--------------------CcccCCc---hhhhhhhcCCcC--------CceechHHhhccHHHHHHH Confidence 11111110000 0000000 112111110000 0001112233578889999 Q ss_pred HHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEE Q lcl|NC_019524. 84 AVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLAT 163 (556) Q Consensus 84 ~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~ 163 (556) +.|.+.|-+- .++ .+ . .....+-.+|+ -.+|.+++.+..+..++..|+||+. T Consensus 50 ~~ia~~ia~~--p~~--~~-------~-----------~~~~~l~~~PN------~~~s~~~f~~~~~~~lll~Gna~~~ 101 (397) T protein:vir:38 50 MQLSGDLAMV--RYT--SE-------S-----------DRSQSIISNPS------VTANGYSFWQGMFAQLLLDGNCYAY 101 (397) T ss_pred HHHHHHHhhC--ccc--cc-------c-----------cHHHHHHhcCC------CCCCHHHHHHHHHHHhhhcCCEEEE Confidence 9998887642 221 11 0 11234445554 2579999999999999999999999 Q ss_pred EeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceeeccc Q lcl|NC_019524. 164 CEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPARF 243 (556) Q Consensus 164 ~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~~~ 243 (556) +++... ..+..|..|+|+++. |..+.+|..+.|++.-.+++... .. T Consensus 102 i~r~~~--------g~~~~l~~l~~~~v~--------------i~~~~~~~~~~y~~~~~~~~~~~------------~~ 147 (397) T protein:vir:38 102 RHKNTN--------GVDLSWEYLRPSQVQ--------------PMLLQDGSGLIYNINFDEPAIGY------------ME 147 (397) T ss_pred EEECCC--------CcEEEEEEEcCceeE--------------EEEcCCCceEEEEEEeccccccc------------ee Confidence 875321 235788999998873 56778888899999876654321 13 Q ss_pred cCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccc Q lcl|NC_019524. 
244 DWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIF 323 (556) Q Consensus 244 ~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~ 323 (556) .+++.+|||+......++.+|+|++.++...+.......++.....+-.+...++|+.+...... ..... T Consensus 148 ~~~~~eiih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~~~~~e----------~~~~~ 217 (397) T protein:vir:38 148 NVPAADVIHIRLLSKNGGKTGISPLSALINEQQIKDASNELTLKALKQSVTASAVLTIQKGGLLD----------AETRI 217 (397) T ss_pred EecCccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCCCHH----------HHHHH Confidence 47888999999999999999999999999999988888888888888889999999976432110 00000 Q ss_pred cccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccchhHH Q lcl|NC_019524. 324 NEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSAR 403 (556) Q Consensus 324 ~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R 403 (556) ...... . ...-+.|.+..|..|.+++.++.+....+|.+..+.....||+.+|||-..|.++ +. +|+++- T Consensus 218 ~~~~~~----~----~~~~n~~~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~afgVp~~~lg~~-~~-~~~~~e 287 (397) T protein:vir:38 218 ARSKEI----S----KQIHNSDGPVVIDALEDYKPLEVKGNIASLLNQVDWTRDQIAKVYGVPDSYLNGQ-GD-QQSSIT 287 (397) T ss_pred HHHHHH----H----hcccccCCceecCCCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCC-CC-cccHHH Confidence 000000 0 0011356677799999999999887788899999999999999999999977654 33 445432 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHH Q lcl|NC_019524. 404 ASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEA 483 (556) Q Consensus 404 ~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A 483 (556) +. ..+....++|+...| +.++....++ . .. + ...+. 
.-.|+...+++ T Consensus 288 ~~------------~~~~~~~l~P~~~~i-e~~ln~~l~~--~-~~---~----------~~~~~----~~~d~~~~~~~ 334 (397) T protein:vir:38 288 QI------------SGQYAKSLNRYVQAI-VGELNDKLHA--N-IS---A----------NIRFA----IDAMGDQYAST 334 (397) T ss_pred HH------------HHHHHHHHHHHHHHH-HHHHHHhccC--h-hc---c----------ccccc----ccCCHHHHHHH Confidence 11 112234556766554 3344333221 1 00 0 11111 12488888999 Q ss_pred HHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCcCC Q lcl|NC_019524. 484 AILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEETT 555 (556) Q Consensus 484 ~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~ 555 (556) ..+.+++|+.|+.|+-+..|..+-+--+...-+. . ................+.+++++.+|++ T Consensus 335 ~~~~~~~G~~t~nE~R~~lg~~p~~~~d~~~~~~-----~----~~~~~~~~~~~~g~~~~~~~~e~~~~~~ 397 (397) T protein:vir:38 335 ISSSVKGGTIAGNQARFILQNSGYLAKDLPDPEK-----E----PQQAIQLIQQEGGENDGNNSDERGSDPE 397 (397) T ss_pred HHHHHhCCCcCHHHHHHHhCCCCCCCCccccccc-----c----ccccccccccccCCCCCCCCCCCCCCCC Confidence 9999999999999999888987732211000000 0 0000000011011111111111222222 No 11 >protein:vir:101648 Length: 518 # NCBI annotation: gp11 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654766;genbank:gi:109302764;genbank:GeneID:4156082 Probab=99.82 E-value=6.1e-21 Score=131.40 Aligned_cols=433 Identities=9% Similarity=0.034 Sum_probs=238.6 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAA 80 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~ 80 (556) |==..+. ...+++.+...-.....| +|.+... . ...+..+-.-.++.+++.+. 
T Consensus 1 ~~~~~~~---------~~~~p~~~e~~~~~~~~~-----------~~~~~~~---~----~~~~~~~~~~~~a~~~~~V~ 53 (518) T protein:vir:10 1 MLLANGQ---------TLSAPAMAELSPQMQDSY-----------YYAPAVG---M----QLERQFSLYGGIYKNQPWVR 53 (518) T ss_pred CcccCce---------eecCchhhhhhhhhhccc-----------ccccccc---e----ecccccchhhHHHhhhHHHH Confidence 0000000 000000000000000001 1111110 0 00111111223577899999 Q ss_pred HHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCce Q lcl|NC_019524. 81 GVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEV 160 (556) Q Consensus 81 ~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~ 160 (556) .+|+.|.+.|-.--|++.-+... +. .+.....+.....+|+ -.+|.+++...++..++..|++ T Consensus 54 acV~~IA~~iA~lpl~l~~~~~~------~~-----~~~~~~~~~~Ll~~PN------~~~t~~~F~~~lv~~lll~Gna 116 (518) T protein:vir:10 54 TVIAKRAQALARLPVKCMFTSGD------TE-----TEESDTGYAKLLADPC------EYLDPFAFWEWVASTLDIYGET 116 (518) T ss_pred HHHHHHHHhhccCceEEEEEcCC------Cc-----eeccchHHHHHHcCCC------CCCCHHHHHHHHHHHHhhcCCe Confidence 99999999998776766433221 11 1112233344555665 3578999999999999999999 Q ss_pred EEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceee Q lcl|NC_019524. 161 LATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEP 240 (556) Q Consensus 161 f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~ 240 (556) |+.+.+... | .+..|..|.|++|. |+.|..+-.+.|++....+... T Consensus 117 y~~i~r~~~------G--~~~~L~~l~p~~v~--------------v~~~~~~~~~~y~~~~~~~~~~------------ 162 (518) T protein:vir:10 117 YLAIQKNKS------G--TPEKLMPMHPSRVA--------------IKRNSRTGRYEYYFQAGAGVGT------------ 162 (518) T ss_pred EEEEEECCC------C--cEEEEEEECCCceE--------------EEEcCCCCEEEEEEEecCCccc------------ Confidence 999865321 1 35678999998884 4445555567787764432110 Q ss_pred ccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCccccccccccccccccc Q lcl|NC_019524. 
241 ARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFK 320 (556) Q Consensus 241 ~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~ 320 (556) ....+++.+|||+...-.-|..+|+|++..+...+......+++...-.+-.+...++|+.+..-. ++.. T Consensus 163 ~~~~~~~~eViHir~~s~dg~~~G~spi~~a~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~ls----------~e~~ 232 (518) T protein:vir:10 163 QLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLS----------EAAQ 232 (518) T ss_pred eEEEecCCcEEEecCCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecCCCCC----------HHHH Confidence 112467889999988777788899999998888887777777777777777788889998653210 1111 Q ss_pred ccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccch Q lcl|NC_019524. 321 EIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYS 400 (556) Q Consensus 321 ~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYS 400 (556) ........ ....+ .-..|.+..|..|.+++.++.+.-...|.+..+.....||+.+|||-+.| |+.++.||| T Consensus 233 ~~~k~~~~---~~~~G----~~nag~v~vL~~G~~~~~l~~s~~D~q~le~r~~~~~eIa~afgVPp~~l-g~~~~~t~s 304 (518) T protein:vir:10 233 QRLREQFD---RAHSG----SSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIV-HILDRATFS 304 (518) T ss_pred HHHHHHHH---HHhcC----ccccCcceEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHh-ccCCCCCch Confidence 11111000 00000 12356788899999999999877777899999999999999999999866 888899999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhh Q lcl|NC_019524. 401 SARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKE 480 (556) Q Consensus 401 s~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke 480 (556) ++.+..+.|. ...++|+... ++.++....++.-. .. ..++|-.-.....|.... 
T Consensus 305 n~eq~~~~f~-----------~~tL~P~l~~-ie~~ln~~L~~~~~-~~-------------~~~~fd~~~llr~D~~~r 358 (518) T protein:vir:10 305 NISAQMRAFY-----------RDTMAIPIAR-IQSAMDKYVGQYWV-RK-------------NRMKFDIDDVIQPDWEAK 358 (518) T ss_pred hHHHHHHHHH-----------HHHHHHHHHH-HHHHHHHhhccccc-CC-------------ceEEEechhhhccCHHHH Confidence 8866655553 3334554433 34444443332100 00 012333333445699999 Q ss_pred hHHHHHHHHcCCCCHHHHHHHhCCCHHH--HHHHHHHHHHHHHHcCCCCCccc--c-ccCCCCCCCCCCCCC-CCCCC-- Q lcl|NC_019524. 481 TEAAILRIKNGLSTYEAEISRLGGDFRE--VFKQRAREEGLIKSLKLDFTGKM--V-EGNSTQSSNSSESTS-DNPNE-- 552 (556) Q Consensus 481 ~~A~~~~i~~G~~s~~~~~ae~G~D~e~--v~~q~a~E~~~~~~~Gl~~~~~~--~-~~~~~~~~~~~~~~~-~~~~~-- 552 (556) +++....+++|+.|+-|+-+..|..|-+ --+++-.-.. +..++....... . .+.+..+.+.+..+. +++.+ T Consensus 359 ~~~~~~~~~~G~lT~NE~R~~~Gl~pie~~~gD~~~~~~n-~~pl~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~ 437 (518) T protein:vir:10 359 SESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSA-LQPLGATPDGAVEGEEAPAPKRPASTPVASLDQSPPTSV 437 (518) T ss_pred HHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeeeeccc-ceecccccccccCCCCCCCCCCCCccccccccccccccC Confidence 9999999999999999998888988754 1211100000 000110000000 0 000000000000000 00000 Q ss_pred ----------------cCCC Q lcl|NC_019524. 553 ----------------ETTQ 556 (556) Q Consensus 553 ----------------e~~~ 556 (556) ...| T Consensus 438 ~~~~~~~~~~~~~~~~~~~~ 457 (518) T protein:vir:10 438 PGLSPTNSDRSTDSGKTEPR 457 (518) T ss_pred CCCCcccccccccccccchh Confidence 0000 No 12 >protein:vir:93610 Length: 454 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449295;genbank:gi:157166043;interpro:IPR006427;interpro:IPR006944;uniprot:Q6H9U6;genbank:GeneID:5580432 Probab=99.80 E-value=7.9e-20 Score=125.32 Aligned_cols=434 Identities=9% Similarity=-0.007 Sum_probs=237.0 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHH Q lcl|NC_019524. 
1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAA 80 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~ 80 (556) .++-+..++.. +.. + .-+..+|.+...............-..-..+-+.+++-+. T Consensus 2 ~~~~~~~~~~~-~~~-----~-------------------~~~~~~~~~~~~~~~~~~~g~~~~g~~v~~~~al~~~~V~ 56 (454) T protein:vir:93 2 WNLLRRTRKNQ-KSG-----R-------------------DVREAGWTSLFQAVAEPFAGAWQQGVKADPEAVLSFHAVF 56 (454) T ss_pred CCccccCcccc-ccc-----c-------------------cccchhhhhhhhhhhhhhcchhhcCcccChHHhhccHHHH Confidence 22222211110 000 0 0001112111000000000000000011123345667789 Q ss_pred HHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCce Q lcl|NC_019524. 81 GVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEV 160 (556) Q Consensus 81 ~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~ 160 (556) .+|+.|.+.|-.--|++.-+.. ++... +.-...+.....+|+ -.+|.+++....+..++..|++ T Consensus 57 ~~v~~Ia~~iA~lp~~~~~~~~------~g~~~----~~~~~~~~~L~~~PN------~~~t~~~f~~~l~~~lll~Gna 120 (454) T protein:vir:93 57 ACISLISQDIAKMRLRLMQTDA------QGIRR----ETRRGDIARLCRRPN------AQQNRIQFFELWLNAKLRHGNT 120 (454) T ss_pred HHHHHHHHhhccCceEEEEecc------CCccc----hhhhHHHHHHHhcCC------CCCCHHHHHHHHHHHHhhcCce Confidence 9999999999887666643221 11111 111112233344565 2578999999999999999999 Q ss_pred EEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceee Q lcl|NC_019524. 161 LATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEP 240 (556) Q Consensus 161 f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~ 240 (556) |+.+.+... ..+..|..|+|++|.. ..+.+|. +.|.+........ . 
T Consensus 121 ~~~i~r~~~--------G~~~~L~~i~~~~v~v--------------~~~~~g~-~~y~~~~~~~~~~----~------- 166 (454) T protein:vir:93 121 VVLKIRNAR--------GQIKELRILDWNRVEP--------------LVADDGE-VFYRITPDRNCGI----T------- 166 (454) T ss_pred EEEEEECCC--------CcEEEEEEEcCcceEE--------------EEcCCCc-EEEEEEecccccc----c------- Confidence 999875321 2356889999998842 2334454 4466553322110 0 Q ss_pred ccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCccccccccccccccccc Q lcl|NC_019524. 241 ARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFK 320 (556) Q Consensus 241 ~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~ 320 (556) ....+++.+|||+......+...|+|++..+...+......+++...-.+=.+...++|+.+..-. .+.. T Consensus 167 ~~~~~~~~eViH~k~~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~----------~e~~ 236 (454) T protein:vir:93 167 EAVTVPAREVIHDRFNCFFHPLIGLPPVYAAGLAATQGHHIQENSTSFFRNGGRPSGVIEIPGSIT----------EENA 236 (454) T ss_pred eeEEecCcceEEeccCCCCCCceeccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEecCCCCC----------HHHH Confidence 112477889999987777888999999999998888888888877777777788889998653210 1111 Q ss_pred ccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccch Q lcl|NC_019524. 321 EIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYS 400 (556) Q Consensus 321 ~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYS 400 (556) ........ 
....+ -..|.+..|..|.+++.++.+.-..+|.+..+.....||+.+|||-+.| ++..++||| T Consensus 237 ~~~~~~~~---~~~~g-----~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~l-g~~~~~t~s 307 (454) T protein:vir:93 237 KKLKSNWD---SGYTG-----ENAGKTAILSNGAKYNPTTFSPVDSQTVEQLKMTAEIVCSVFRVPAYKI-GVGQPPSSD 307 (454) T ss_pred HHHHHHHH---HHhcc-----cccCCceeccCCceEEEcccChhHHHHHHHHHHHHHHHHHHhCCCHHHc-CCCCCCcch Confidence 11111000 00000 1356677899999999999777777888888999999999999999855 788888999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhh Q lcl|NC_019524. 401 SARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKE 480 (556) Q Consensus 401 s~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke 480 (556) ++.+.... |+...+.|+...+ +.++-...+. +.+. .++|..-.....|.... T Consensus 308 n~e~~~~~-----------f~~~~l~P~~~~i-e~~ln~~L~~-~~~~---------------~~~f~~~~ll~~D~~~r 359 (454) T protein:vir:93 308 NVEALEQQ-----------YYSQCLQTLIESI-ELLLDEALET-GENE---------------STEFDVTTLLRMDSERR 359 (454) T ss_pred hHHHHHHH-----------HHHHHHHHHHHHH-HHHHHHhhcC-CCCc---------------EEEeechhhhccCHHHH Confidence 76554443 3555566665554 4444433222 2111 12232223334588888 Q ss_pred hHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHH--HHHHHHHcCCCCCc-cccccCCCCCCCCC-----CCCCCCCCC Q lcl|NC_019524. 481 TEAAILRIKNGLSTYEAEISRLGGDFREVFKQRAR--EEGLIKSLKLDFTG-KMVEGNSTQSSNSS-----ESTSDNPNE 552 (556) Q Consensus 481 ~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~--E~~~~~~~Gl~~~~-~~~~~~~~~~~~~~-----~~~~~~~~~ 552 (556) +++....+++|+.|+-|+-+..|++|-+--+++-- -.-.+..+|-..+. .+........+.++ +...+.++. 
T Consensus 360 ~~~~~~~~~~G~~T~NE~R~~~gl~pi~ggD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~e~ 439 (454) T protein:vir:93 360 MKTLGDAVKNTLLTPNEARKRENLPPLAGGDALYLQQQNYSLEALSRRDAREDPFASSGKTASVPQAVAASDGNKAITET 439 (454) T ss_pred HHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeeeeccCccchHhhhccCcccCCCCCCccCCCCCCCCCCCCCCCCccCC Confidence 99999999999999999999999988542221100 00011111111000 01111111111111 111111222 Q ss_pred cCCC Q lcl|NC_019524. 553 ETTQ 556 (556) Q Consensus 553 e~~~ 556 (556) +.+. T Consensus 440 ~~d~ 443 (454) T protein:vir:93 440 EHDA 443 (454) T ss_pred ccch Confidence 2222 No 13 >protein:vir:99452 Length: 651 # NCBI annotation: hypothetical protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919077;genbank:gi:119757035;genbank:GeneID:4606105 Probab=99.78 E-value=1.2e-19 Score=124.25 Aligned_cols=474 Identities=10% Similarity=-0.007 Sum_probs=228.4 Q ss_pred CCcchhhhHHHHHhhHh--hcccch-hhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcCh Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVD--VVAETA-TATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDG 77 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~--~~~~~~-~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~ 77 (556) |++-++..+.+.=.+-. ..+... ....++. ..+.+.. + .++.++.-++.. -+.++.+|+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~-~---~~~~~p~~~~~~------------L~~~~e~~~ 62 (651) T protein:vir:99 1 MTDTTGETQETKVHVEGLGGEADLAKSPNSTQI--PDHRIQS-H---NVGVNPPYNPDR------------LAAFLELNE 62 (651) T ss_pred CCCccceeeeeEEEeeccccccccccccccccc--chhhhcc-c---CCCCCCCCCHHH------------HHHHHhcCh Confidence 77777654333211100 000000 0000000 1122221 1 234444445543 578899999 Q ss_pred HHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccc---cceeh-hcccCHHHHHHHHhhh Q lcl|NC_019524. 
78 YAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPE---NWFDA-RRMCTLTGLTRLAVSG 153 (556) Q Consensus 78 ~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~---~~cD~-~g~~~f~~lq~l~~r~ 153 (556) ++..+|+.+++++.|.||.+.+..+...- +.-.++++.+|+.|-.... ..|+. .=..++..+..-++.. T Consensus 63 ~~~~~i~~~~~~iag~g~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~l~~~~~~~n~~~t~~~i~~~~~~D 135 (651) T protein:vir:99 63 TLATGIRKKSRYEVGFGFDLVPAQGVDGD-------DASDAQREVARNFWRGRSSRWQTGPNQAKTPATPERVKELARQD 135 (651) T ss_pred HHHHHHHHHhhhhhccCceeeecccCCCC-------ccchHHHHHHHHHhhccchhhcccccccCCCCCHHHHHHHHHHH Confidence 99999999999999999998887654321 1223455566666533211 11111 1114566665544444 Q ss_pred heecCceEEEEeeccCCCCcCCCcccceEEEEEchhh---------c---------CCCCCCC------------CCceE Q lcl|NC_019524. 154 FLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYR---------M---------SNPNNVM------------DTPNL 203 (556) Q Consensus 154 ~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~dr---------l---------~~~~~~~------------~g~~i 203 (556) +...|-.++.+.... . +. ++.|--++++. + ..|.... .+... T Consensus 136 le~tGna~ieiIrn~-~-----g~--pv~L~~lp~~~~Rv~~~~~~~~~~~~~ll~~~pn~~~~~~~~~~~~q~~~~~~~ 207 (651) T protein:vir:99 136 YHGVGWLALEMLTDI-E-----GR--PVGLAYVPARTVRVRRPQNRFDQPRHPEEGRYVDGDVADIASRGYVQIRNGNRR 207 (651) T ss_pred HHHHhhHhhhhhhcC-c-----cc--hhhhhhcChhheeeecccccccchhhhhhhcccccccchhHHHHHHHHHhcCcc Confidence 444444333321100 0 00 01111111110 0 1111100 00111 Q ss_pred EEEEEECCCCCeEEEEEeec------CCCcccc------CCcc--cccee--eccccCChhHeEeeecccCCCcccCCch Q lcl|NC_019524. 204 RSGVQLDNNGAALGYWLRKA------FPGDPTD------MEQW--KWGYE--PARFDWGRRRVIHIIEALLAGQTRGISE 267 (556) Q Consensus 204 ~~GIE~d~~Gr~vaY~i~~~------hpgd~~~------~~~~--~~~rv--~~~~~v~a~~viH~f~~~r~gQ~RGvs~ 267 (556) ..-+--|.+++.+.++.... ++.+... .... .|... 
-....+++.+|||+..+...+...|+|+ T Consensus 208 ~~~~~g~~~~~~~~~~~~~~~~v~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~eViHir~~~~~~g~~G~sp 287 (651) T protein:vir:99 208 YFGEAGDRYRGQEVVIDESGDEPTIRYREDEESEREPIFVDRETGDVTTGDANGLENRPANELIFIPNPSILEDDYGVPD 287 (651) T ss_pred eEEEeeccccceeeeeccCCcceeEEeccCcceeeeeecccceeeeEEEcCCCceeEecccceEEecCCCCCCCcccccH Confidence 11111233444433322110 0001000 0000 01000 0123478899999988877788999999 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccccccccccccccccccceecC-C-- Q lcl|NC_019524. 268 MVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAID-G-- 344 (556) Q Consensus 268 la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~-p-- 344 (556) +..++..+......++....-.+=.+...++|+-+.+.. ..+.............. ......-|+ + T Consensus 288 l~~a~~~i~~a~~a~~~~~~~f~NG~~p~gil~~~~~~l---------s~e~~~~lr~~~~~~~~--nagk~~vL~~~~~ 356 (651) T protein:vir:99 288 WVSAIRTISADEAAKDYNRDFFDNDTIPRMVIKVTGGEL---------SEESKRDLRQMLNGLRE--ESHRAVVLEVEKF 356 (651) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCC---------CHHHHHHHHHHHHHHhc--cCCceEEeecccc Confidence 999999887777777777766666677788887532210 00111111111000000 000111111 1 Q ss_pred ceeeecCCCceeeeecCCCC-CccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccchhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 345 AKIPHLYPGTKLKMQPAGTP-GGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSARASMAETQKYMDSRKKLVADR 423 (556) Q Consensus 345 G~i~~L~pGe~i~~~~~~~p-~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~ 423 (556) +.+..+..|.+++.++.... ...|.+..+.....||+.+|||-. +.|+.+++|||++.+....|.+ . 
T Consensus 357 ~~~~~~~~g~~~~pls~~~~~D~qfle~r~~~~~eIa~afgVPp~-~lG~~~~~~~sn~E~~~~~f~~-----------~ 424 (651) T protein:vir:99 357 QSQLDEDVEIELEPMGQGISEEMDFRQFREKNEHEIAKVLEVPPV-KIGVTDSANRSNSDQQDKDFAL-----------E 424 (651) T ss_pred cccccccCCceEEEcCcCchhhHHHHHHHHHHHHHHHHHhCCCHH-HhccCCCCCcccHHHHHHHHHH-----------H Confidence 12223455888888875433 578999999999999999999986 5588989999998777766533 3 Q ss_pred HHHHHHHHHHHHHHHcCCcc----CCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHH Q lcl|NC_019524. 424 FASAIYTLWLEEEVNAGNVP----LPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEI 499 (556) Q Consensus 424 ~~~pi~~~~l~~a~l~G~l~----~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ 499 (556) .++|+...| +.++-...++ ..++ .+.+++........|+...+++....+++|+.|+.|+- T Consensus 425 tL~P~~~~i-e~eln~kLl~~~e~~~~~--------------~i~~ef~~~~llr~D~~~~~e~~~~~i~~G~~T~NE~R 489 (651) T protein:vir:99 425 VIQPEQHTF-AEWLYQIIHQQALGVTDW--------------TIEYELRGADQPKQEAQLAEQRVRAMRLAGVGLVDEAR 489 (651) T ss_pred HHHHHHHHH-HHHHHHhhcCccccccCc--------------eEEEEeccchhhhccHHHHHHHHHHHHhCCCcCHHHHH Confidence 345544432 3333332221 1111 01234445556667999999999999999999999998 Q ss_pred HHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCC-------CCCCCCCCCCcCCC Q lcl|NC_019524. 500 SRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNS-------SESTSDNPNEETTQ 556 (556) Q Consensus 500 ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~-------~~~~~~~~~~e~~~ 556 (556) +..|.+|-+- + ..++.+............... 
++++.+.++.|.++ T Consensus 490 ~~lglppi~~------~-----~gd~~l~~~~~~~~g~~~~gge~~~~~~~~~~~~~~~~e~~~ 542 (651) T protein:vir:99 490 EELGLDPLGE------P-----YGEMTLSEFEAEVAGDVAGGGETEAVHEPPEENKIGEREWDT 542 (651) T ss_pred HHhCCCCCCC------c-----cccccccccccccccccccCCCCcccccCccccccccchhhh Confidence 8889877321 0 011111100000000000000 11111111111111 No 14 >protein:vir:93943 Length: 409 # NCBI annotation: ORF010 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239936;genbank:gi:66395598;genbank:GeneID:5131009 Probab=99.76 E-value=1.8e-18 Score=117.82 Aligned_cols=407 Identities=11% Similarity=0.040 Sum_probs=228.4 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAA 80 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~ 80 (556) |++.+-+.|..+-........ .....++ ...|...... .-+.+-+..++.+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~-------------~~~~~~~--~~~~~~~~~~-------------~v~~~~~~~~~~V~ 52 (409) T protein:vir:93 1 MAKENIVTRIKKKLIDNWIDQ-------------STSKLYD--FSPWKNRSFW-------------GVINNTLETNETIF 52 (409) T ss_pred CCccchhhhhhhhhhhhhhcc-------------ccccccc--cccccCcccc-------------ccchhhhhccHHHH Confidence 888888877654211111100 0000000 1122211110 01223355667889 Q ss_pred HHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCce Q lcl|NC_019524. 81 GVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEV 160 (556) Q Consensus 81 ~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~ 160 (556) ++|+.|.+.|-..-|++.-+.+ .. .. 
.++..+..+|+ -.+|.+++....+..++..|++ T Consensus 53 ~ci~~Ia~~ia~lp~~~~~~~~--------~~----~~---~~~~lL~~~PN------~~~t~~~f~~~~~~~lll~Gna 111 (409) T protein:vir:93 53 SAITKLSNSMASLPLKMYEDYK--------VV----NT---EVSDLLTVSPN------NSLSSFDFINQIETIRNEKGNA 111 (409) T ss_pred HHHHHHHHhhhhCceeEeeccc--------cc----cc---hHHHHHhhhcc------cCCCHHHHHHHHHHHHhhcCce Confidence 9999999988876666543211 01 11 22333444554 3568899999999999999999 Q ss_pred EEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceee Q lcl|NC_019524. 161 LATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEP 240 (556) Q Consensus 161 f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~ 240 (556) |+.+.+.. ...+..|..|.|+.+. |..+.++..+.|.+.... | T Consensus 112 y~~i~r~~--------~G~~~~L~~l~~~~v~--------------~~~~~~~~~~~y~~~~~~-g-------------- 154 (409) T protein:vir:93 112 YVLIERDI--------YHQPSKLFLLNPDVVE--------------MLIENQSRELYYSIHAAT-G-------------- 154 (409) T ss_pred EEEEEECC--------CCcEEEEEEEcCceeE--------------EEEeCCCcEEEEEEEcCC-c-------------- Confidence 99876532 1235688889988883 556677888889885332 1 Q ss_pred ccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhccee-eeEeccCcccccccccccccccc Q lcl|NC_019524. 241 ARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYA-ASVESELPSDVVFGQLGMGQGGF 319 (556) Q Consensus 241 ~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~-~fi~~~~~~~~~~~~~~~~~~~~ 319 (556) ....+++.+|||+...-..+...|+|++..+...+.-....... ...-.+.-. ++++.+.. ..++. 
T Consensus 155 ~~~~~~~~eVih~r~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~---~~~~~~~~~~~i~~~~~~----------l~~e~ 221 (409) T protein:vir:93 155 NKLIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTF---NLTEMQKPDSFMLKYGSN----------VGKEK 221 (409) T ss_pred eEEEEccccEEEeCCCCCCCccccccHHHHHHHHHHHHHHHHHH---HHHhcCCCCceEEecCCC----------CCHHH Confidence 11246788999998776778999999987654444332222211 122222222 23332211 01111 Q ss_pred cccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccc Q lcl|NC_019524. 320 KEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNY 399 (556) Q Consensus 320 ~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nY 399 (556) .+....... ... =..|.+..|+.|.+++.++.+.-..+|.+..+.....||+.+|||-+.| ++.+++|| T Consensus 222 ~~~~~~~~~---~~~-------~~~g~~~vl~~g~~~~~l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~l-g~~~~~~~ 290 (409) T protein:vir:93 222 RQQVLEDFK---QYY-------EENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFL-NARSNTNF 290 (409) T ss_pred HHHHHHHHH---HHh-------hcCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHh-CCCCCCCc Confidence 111111000 000 1356788899999999998776667888888899999999999998855 77888999 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhh Q lcl|NC_019524. 400 SSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKK 479 (556) Q Consensus 400 Ss~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~K 479 (556) |++.+....|+. .-+.|+.+.+ +.++....++--.... -..+++-.....-.|+.. 
T Consensus 291 sn~e~~~~~f~~-----------~~l~P~~~~i-e~~l~~~Ll~~~~~~~------------~~~~~fd~~~ll~~d~~~ 346 (409) T protein:vir:93 291 AKNEELNRFYLQ-----------HTLLPIVKQY-EEEFNRKLLTKTDREK------------NRYFKFNVKSYLRADSAT 346 (409) T ss_pred ccHHHHHHHHHH-----------HHHHHHHHHH-HHHHHhhcCCcccccC------------cceEEeechhhhccCHHH Confidence 999776665543 3456666653 3344443332111000 011233333334468999 Q ss_pred hhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 480 ETEAAILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 480 e~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 556 (556) .+++..+.+.+|+.|+-|+-+..|..|-+--+++- ++....+..... ........++++.++ T Consensus 347 ~~~~~~~~~~~G~~T~NE~R~~~g~~p~~ggD~~~----------~~~n~~~~~~~~-----~~~~~~~gG~~n~~e 408 (409) T protein:vir:93 347 QAEVYFKAVRSGYYTINDIREWEDLPPVEGGDKPL----------ISGDLYPIDTPL-----ELRKSLKGGDKNVNE 408 (409) T ss_pred HHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeee----------ecccccccccch-----hhcccccCCCCCcCC Confidence 99999999999999999999999988764211110 111111111111 001111222222222 No 15 >protein:vir:81072 Length: 432 # NCBI annotation: p07 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285677;genbank:gi:148727185;genbank:GeneID:5247117 Probab=99.76 E-value=1.2e-18 Score=118.92 Aligned_cols=429 Identities=14% Similarity=0.087 Sum_probs=231.1 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAA 80 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~ 80 (556) |+|.+.+--.-..++.-........ ..+.+........+... +.++.. . .. -+-+-+..++.+. 
T Consensus 1 ~~~~~~mg~f~r~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~-~~~~~~--g-------~~---v~~~~al~~~~V~ 64 (432) T protein:vir:81 1 MPDEKKLGLFGQLKAMFVPPDPVDI---GGGQTFTPVNATARDLG-IIISDT--G-------AA---VNADAIMRLDAVA 64 (432) T ss_pred CCchhhcchhhhhhhhccccccccc---ccccccccCccchhhhc-cccccc--C-------cc---cchHhhhccHHHH Confidence 8888877644322221111000000 00000000000011111 111000 0 00 1223345668889 Q ss_pred HHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCce Q lcl|NC_019524. 81 GVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEV 160 (556) Q Consensus 81 ~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~ 160 (556) .+|+.|.+.|-..-|++.-+.+. |. ... .-..++..+..+|+ ..+|.+++....+..++..|++ T Consensus 65 ~~i~~Ia~~ia~lp~~~y~~~~~---g~--~~~-----~~~~l~~lL~~~PN------~~~t~~~f~~~l~~~lll~Gna 128 (432) T protein:vir:81 65 ACVKLVSQAIAAMPLTMYMRTPD---GR--KEA-----VNHPLYTLLLDGPN------STQTAFDFWQVVVTRLLLDGTA 128 (432) T ss_pred HHHHHHHHhhhhCceeeEEecCC---cc--eec-----ccchHHHHHHhccc------ccCCHHHHHHHHHHHHhhcCCe Confidence 99999999888766655332211 10 000 00223444545565 2468899999999999999999 Q ss_pred EEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceee Q lcl|NC_019524. 161 LATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEP 240 (556) Q Consensus 161 f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~ 240 (556) |+.+.+.. | .+..|..|.|+.+. |+.|.+|+.. |.+...+ | T Consensus 129 yv~i~~~~-------g--~~~~L~~l~~~~v~--------------v~~~~~g~~~-y~~~~~~-g-------------- 169 (432) T protein:vir:81 129 YVRKVVTD-------G--RIESLQYLANDRLT--------------ITTDPKGNTA-YRYRRTD-G-------------- 169 (432) T ss_pred EEEEEecC-------C--cEEEEEEEcCCceE--------------EEECCCCcEE-EEEEecC-c-------------- Confidence 99876421 1 24678888888773 3445566543 5443211 1 Q ss_pred ccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCccccccccccccccccc Q lcl|NC_019524. 
241 ARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFK 320 (556) Q Consensus 241 ~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~ 320 (556) ....+++++|||+... ......|+|++..+...+.......+....-.+=.+...++|+.+..-. .+.. T Consensus 170 ~~~~~~~~~iih~r~~-~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~----------~e~~ 238 (432) T protein:vir:81 170 QMIDIPKQQIWKIMGY-SLDGENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDRFLT----------DDQY 238 (432) T ss_pred eEEEEccccEEEecCC-CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCCCCC----------HHHH Confidence 1124678899999755 4455899999988777766655555555555555667778888643211 1111 Q ss_pred ccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccch Q lcl|NC_019524. 321 EIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYS 400 (556) Q Consensus 321 ~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYS 400 (556) +.+... . ......|.+..|+.|.+++.++.+.-..+|.+..+....+||..+|||.+.| |+..+.+|| T Consensus 239 ~~~~~~-------~----~~~~nag~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~l-g~~~~~~~~ 306 (432) T protein:vir:81 239 DSFAKK-------V----SGSVEAGRAPLLEGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMI-GHSSAGTTS 306 (432) T ss_pred HHHHHH-------H----hhhhcCCCceecCCCceEEEccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHc-CCcCCcccc Confidence 111111 1 1113467788999999999999877788899999999999999999999855 887776663 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhh Q lcl|NC_019524. 401 SARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKE 480 (556) Q Consensus 401 s~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke 480 (556) ..- .++..+..|+..-+.|+... ++.++....++-..... 
..+++-.-.-.-.|+..- T Consensus 307 ~~s--------n~eq~~~~f~~~tl~P~~~~-ie~~l~~kLl~~~~~~~-------------~~~~fd~~~llr~d~~~r 364 (432) T protein:vir:81 307 WGS--------GIESQQLGFLTMTLSPWLRR-IEQSIALNLLSPAERRR-------------YFADFDTSALLRADSAAR 364 (432) T ss_pred ccc--------hHHHHHHHHHHHHHHHHHHH-HHHHHHhhccCccccCc-------------eEEEeechhhhccCHHHH Confidence 210 12223334455566776665 44455554443211100 112333333334589999 Q ss_pred hHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 481 TEAAILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 481 ~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 556 (556) +++....+++|+.|+-|+-+..|+.|-+--+ .. +-+.....+..... ...+.++.+..++++.++ T Consensus 365 ~~~~~~~~~~G~~t~NE~R~~~glpp~~g~~------~~---~~~~~~~~pl~~~~--~~~~~~~~~~~~n~~~~~ 429 (432) T protein:vir:81 365 SSYYSQLVNNGLMTRDEAREIEGLPKLGGNA------AV---LTVQSAMVPLDSIG--LQASPEPASGLGNQQQDK 429 (432) T ss_pred HHHHHHHHhCCCCCHHHHHHHhCCCCCCCCc------ce---EeecCcccchhhhc--cCCCCCCCCCCCCccccc Confidence 9999999999999999999999988743100 00 00111101110000 000111111111111111 No 16 >protein:vir:97060 Length: 432 # NCBI annotation: putative head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453563;genbank:gi:84662598;genbank:GeneID:5142475 Probab=99.75 E-value=2.5e-18 Score=117.12 Aligned_cols=429 Identities=13% Similarity=0.089 Sum_probs=232.1 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAA 80 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~ 80 (556) |-|++++-..-.-++.-....... .+. + ..|.|........-...-..-..-+.+-+..++.+. 
T Consensus 1 ~~~~~~~g~~~~~~~~~~~~~~~~----------~~~-~-----~~~~~~~~~~~~~~~~~~~~g~~v~~~~a~~~~aV~ 64 (432) T protein:vir:97 1 MPDEKKLGLLGQLKAMFVPPDPVD----------IGG-G-----QTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVA 64 (432) T ss_pred CCCcccCchhhhhHhhcCCccccc----------ccc-c-----cccccCchhhhhhcccccccCcccchHhhhcchHHH Confidence 777776654432222110000000 000 0 001110000000000000000111334455668889 Q ss_pred HHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCce Q lcl|NC_019524. 81 GVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEV 160 (556) Q Consensus 81 ~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~ 160 (556) .+|+.|.+.|-.-=|.+.-+.. ++.. ...=..++..+..+|+ .-+|.+++.+..+..++..|++ T Consensus 65 ~~v~~Ia~~ia~lp~~~y~~~~------~g~~----~~~~~pl~~lL~~~PN------~~~t~~~f~~~l~~~lll~Gna 128 (432) T protein:vir:97 65 ACVKLVSQAVAAMPLMMYMRTP------DGRK----EAVNHPLYTLLLDGPN------STQTAFDFWQVVVTRLLLDGTA 128 (432) T ss_pred HHHHHHHHhhccCceEEEEecC------CCcc----cccccHHHHHHHhccc------ccCCHHHHHHHHHHHHhhcCCe Confidence 9999998888766555432211 1100 0001234666666665 2478999999999999999999 Q ss_pred EEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceee Q lcl|NC_019524. 161 LATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEP 240 (556) Q Consensus 161 f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~ 240 (556) |+.+++.. | .+..|..|.|+.+. |..|.+|+ +.|.+...+ | T Consensus 129 y~~~~~~~-------g--~~~~L~~l~p~~v~--------------v~~~~~g~-~~y~~~~~~-g-------------- 169 (432) T protein:vir:97 129 YVRKVVTD-------G--RIESLQYLANDRLT--------------ITTDTKGN-TAYRYRRTD-G-------------- 169 (432) T ss_pred EEEEEecC-------C--cEEEEEEEcCcceE--------------EEEcCCCc-EEEEEEecC-c-------------- Confidence 99987531 1 24678888888773 34455665 456654321 1 Q ss_pred ccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCccccccccccccccccc Q lcl|NC_019524. 
241 ARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFK 320 (556) Q Consensus 241 ~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~ 320 (556) ....+++++|||+...- .+...|+|++..+...+......+++...-.+=.+...++++.+..-. .+.. T Consensus 170 ~~~~~~~~~iih~r~~~-~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~~~l~----------~e~~ 238 (432) T protein:vir:97 170 QMIDIPRQQIWKIMGYS-LDGENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDRFLT----------DDQY 238 (432) T ss_pred eEEEEccccEEEecCcC-CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEecCCCCC----------HHHH Confidence 11357788999997653 456899999988776665555444444444444566678887643211 1111 Q ss_pred ccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccch Q lcl|NC_019524. 321 EIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYS 400 (556) Q Consensus 321 ~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYS 400 (556) +.+... . ......|.+..|..|.+++.++.+.-..+|.+-.+.....||+.+|||-+.| |+.++.+|+ T Consensus 239 ~~~~~~-------~----~~~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~t~~ 306 (432) T protein:vir:97 239 DSFSKK-------V----SGSVEAGRAPLLEGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMI-GHSSAGTTS 306 (432) T ss_pred HHHHHH-------H----hhhhcCCCceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHc-CCcCCcccc Confidence 111111 0 1112467788899999999998877777888888999999999999998865 777666653 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhh Q lcl|NC_019524. 401 SARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKE 480 (556) Q Consensus 401 s~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke 480 (556) .. ..++..+..|+..-+.|+.+. ++.++....+.-..... ..+++-.-.....|+... 
T Consensus 307 ~~--------s~~e~~~~~f~~~tl~P~~~~-ie~~ln~kLl~~~e~~~-------------~~~~fd~~~llr~d~~~r 364 (432) T protein:vir:97 307 WG--------SGIESQQLGFLTMTLSPWLRR-IEQSIALNLLTPAERRR-------------YFADFDTSALLRADSAAR 364 (432) T ss_pred cc--------hhHHHHHHHHHHHHHHHHHHH-HHHHHhhhccCccccCc-------------eEEEeechhhhccCHHHH Confidence 21 113333444566667776665 55566655553221111 012333333334599999 Q ss_pred hHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 481 TEAAILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 481 ~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 556 (556) +++..+.+++|+.|+-|+-+..|..|.+--+.. +-++....+...... ...+++.+..+++++++ T Consensus 365 ~~~~~~~~~~G~~T~NE~R~~~glpp~~g~~~~---------~~~~~~~~pl~~~~~--~~~~~~~~~~~~~~~~~ 429 (432) T protein:vir:97 365 SSYYSQLVNNGLMTRDEAREIEGLPKLGGNAAV---------LTVQSAMVPLDSIGL--QASPEPASGLGNQQQDK 429 (432) T ss_pred HHHHHHHHhCCCCCHHHHHHHhCCCCCCCCcce---------Eeecccccchhhhcc--cCCCCCCCCCCCccccc Confidence 999999999999999999888898774311000 001100001100000 01111111111122222 No 17 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=99.75 E-value=1.3e-17 Score=113.15 Aligned_cols=436 Identities=11% Similarity=0.062 Sum_probs=227.3 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAA 80 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~ 80 (556) |.|..+..-.. ........ ...+. .....|.-+...-+.+ +... . ..++...+.++. 
.|++ T Consensus 1 ~~~~t~~~~~~--~l~~~~~~-~~~r~-~~l~~Yy~g~~~i~~~----~~~~--~-------~~~~~~~~~~~~--n~~~ 61 (456) T protein:vir:79 1 MTASTPAEWLP--VLTKRIDD-GMSRV-RLLARYSNGDAPLPEL----TRNT--S-------AAWRSFQREART--NWGL 61 (456) T ss_pred CCCCCHHHHHH--HHHHHHHH-HHHHH-HHHHHHHhccCChhhc----Cccc--C-------hhhchhhhhhhc--chHH Confidence 77666654321 11111100 01111 1223444333211111 1111 1 112223333443 3899 Q ss_pred HHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCce Q lcl|NC_019524. 81 GVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEV 160 (556) Q Consensus 81 ~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~ 160 (556) -+|+..+++++|.||+.....+ .+..+.+.+.|+ + + +|...+..+++..+.-|.+ T Consensus 62 ~ivd~~~~~l~~~g~~~~~~~d-----------~~~~~~~~~~~~---~-----n------~~d~~~~~~~~~a~~~G~a 116 (456) T protein:vir:79 62 MVRDSVADRIIPNGITVGGSAD-----------SDLALRARRIWR---D-----N------RMDSVCKQWVKYGLDFGES 116 (456) T ss_pred HHHHHHHhhhccCCeecCCCCC-----------ccHHHHHHHHHH---h-----c------ChhHHHHHHHHHHhhcCee Confidence 9999999999999998653321 112233333332 2 2 4677888999999999999 Q ss_pred EEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEE--CCCCCeEEEEEeec-----------CCCc Q lcl|NC_019524. 161 LATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQL--DNNGAALGYWLRKA-----------FPGD 227 (556) Q Consensus 161 f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~--d~~Gr~vaY~i~~~-----------hpgd 227 (556) |+.. |.... +. .++..++|..+-.-++......+..+|.+ +.++.+.-+-++.. 
-..+ T Consensus 117 ~~~~-~~~ed-----g~---~~i~~~~p~~~~~i~d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~ 187 (456) T protein:vir:79 117 YLTC-WRRDD-----GT---ATITADSPETMVVSVDPLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSS 187 (456) T ss_pred EEEE-eeCCC-----Cc---eEEEEeccceeEEEEcCCCCCceEEEEEEEEecCCceeEEEEEcCCceEEEEEEEEeecc Confidence 9764 43322 21 37899999988544443344445555543 12222211111000 0000 Q ss_pred c----ccCCccccceeeccccCChhHeEeeecc---cCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEe Q lcl|NC_019524. 228 P----TDMEQWKWGYEPARFDWGRRRVIHIIEA---LLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVE 300 (556) Q Consensus 228 ~----~~~~~~~~~rv~~~~~v~a~~viH~f~~---~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~ 300 (556) . .......| ++...+-|.+.. .+..-..|+|.+.+++..+ +.|..+....+..+-.++.-+. T Consensus 188 ~~~~~~~~~~~~~--------~~~~~~~~~~~~~pvv~~~N~~~~gd~e~v~~li---D~~~~~~s~~~~~~~~~a~~~~ 256 (456) T protein:vir:79 188 SRRRLVTRISDSW--------VPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDII---NRINRAELQLLSTMAIQAFRQR 256 (456) T ss_pred ccceeeeccCCce--------eecccccCCCCceeEEEecCCCCCchhhhhHHHH---HHHHHHHHHHHHHHHHHhhHHH Confidence 0 00000111 111122222221 1112357899999987644 3444444443333222221111 Q ss_pred ccCcccccccccccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHH Q lcl|NC_019524. 301 SELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIA 380 (556) Q Consensus 301 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~ia 380 (556) .-.+....... . + ..+. ...........+|.+..+++|.++..++.. 
+-.+|.+.++..++.|+ T Consensus 257 ~~~G~~~~~~~-~-------d----~~g~---~i~~~~~~~~~~~~~~~~~~~~~~~q~~~~-~~~~~~~~l~~~i~~i~ 320 (456) T protein:vir:79 257 ALKSSEHRLPK-V-------D----ENGN---AIDYASIFEAAPGALWELPPGVDIWESQTN-DFTPMLSAIKEHIRQLS 320 (456) T ss_pred HHhcCCccccc-c-------c----cccc---ccchhhhhhhhccccccCCCCcceeeeccc-ChHHHHHHHHHHHHHHH Confidence 11110000000 0 0 0000 000111222467888899999999877755 44678999999999999 Q ss_pred HhcCCCHHHhhchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHH Q lcl|NC_019524. 381 ASLGMSYEQFSRDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRD 460 (556) Q Consensus 381 aglGi~ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~ 460 (556) +..++|.+.+.++.++.|..++|..+.......+..|..|-..+.+ +++..+. +.|.. . . T Consensus 321 ~~t~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~-~~~l~~~---~~g~~---~-------------~ 380 (456) T protein:vir:79 321 SATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEA-ILVKALQ---IEGES---V-------------E 380 (456) T ss_pred hhcCCChhHhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHH---hcCCC---c-------------c Confidence 9999999999999888999999999999999888888888776654 4443322 23321 1 0 Q ss_pred HhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCC Q lcl|NC_019524. 461 ALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSS 540 (556) Q Consensus 461 a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~ 540 (556) .-+.+.|..|.. .+....+.+..+.+.+|+.|.+......|.+++++- +...|+...+..++. ..+... T Consensus 381 ~~i~v~w~~~~~--~s~~~~ada~~kl~~~G~~~~~~~~~~lg~~~~~i~-~~e~~r~~~e~~~~~--~~~~~~------ 449 (456) T protein:vir:79 381 DTVDVSFESPDR--VTLGEKYSAASLAKAAGESWASIRRNILNYNADQIK-QDDLDRAREQITLFA--GNPVQR------ 449 (456) T ss_pred ccceEEeCCCCC--cCHHHHHHHHHHHHhcCCChHHHHHhcCCCCHHHHH-HHHHHHHHHHHHHHh--hhHhhc------ Confidence 124688987754 477889999999999999999887777899998763 222222222222221 011110 Q ss_pred CCCCCCCC Q lcl|NC_019524. 
541 NSSESTSD 548 (556) Q Consensus 541 ~~~~~~~~ 548 (556) ++++.+. T Consensus 450 -~~~~~~~ 456 (456) T protein:vir:79 450 -PQEDGSR 456 (456) T ss_pred -CCCCCCC Confidence 0000000 No 18 >protein:vir:10362 Length: 432 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858954;genbank:gi:32128419;genbank:GeneID:2648396 Probab=99.75 E-value=6e-18 Score=115.00 Aligned_cols=429 Identities=12% Similarity=0.075 Sum_probs=232.3 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAA 80 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~ 80 (556) |-|.+++-..-.-++.-.. +.+ .. .++. ..|.|........-...-..-..-+.+-+..++.+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~-~~~--~~--------~~~~-----~~~~~~~~~~~~~~~~~s~~g~~v~~~~al~~~~V~ 64 (432) T protein:vir:10 1 MPDEKKLGLLGQLKAMFVP-PDP--VD--------IGGG-----QTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVA 64 (432) T ss_pred CCCCcccchhhhhHhhcCC-ccc--cc--------cccc-----cccccCcchhhhhcccccccCcccchhhhhcchHHH Confidence 7777776554322221110 000 00 0000 001110000000000000000111334455678899 Q ss_pred HHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCce Q lcl|NC_019524. 81 GVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEV 160 (556) Q Consensus 81 ~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~ 160 (556) .+|+.|.+.|-.--|++.-+.+. |. +. .. =..+++.+..+|+. 
-+|.+++-...+..++..|++ T Consensus 65 ~~i~~Ia~~ia~lp~~~y~~~~~---g~--~~--~~---~~~l~~lL~~~PN~------~~t~~~f~~~l~~~lll~Gna 128 (432) T protein:vir:10 65 ACVKLVSQAIAAMPLTMYMRTPD---GR--KE--AV---NHPLYTLLLDGPNS------TQTAFDFWQVVVTRLLLDGTA 128 (432) T ss_pred HHHHHHHHhhhhCceeEEEecCC---Cc--cc--cc---ccHHHHHHHhcccc------cCCHHHHHHHHHHHHhhcCCe Confidence 99999999988766665433211 10 00 00 02345556666652 478999999999999999999 Q ss_pred EEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceee Q lcl|NC_019524. 161 LATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEP 240 (556) Q Consensus 161 f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~ 240 (556) |+.+++.. | .+..|..|.|+.+. |..|.+|+ +.|.+...+ | T Consensus 129 y~~~~~~~-------g--~~~~L~~l~~~~v~--------------v~~~~~g~-~~y~~~~~~-g-------------- 169 (432) T protein:vir:10 129 YVRKVVTD-------G--RIESLQYLANDRLT--------------ITTDTKGN-TAYRYRRTD-G-------------- 169 (432) T ss_pred EEEEEecC-------C--cEEEEEEEcCCceE--------------EEEcCCCc-EEEEEEecC-c-------------- Confidence 99876531 1 24678888888873 23345555 345543221 1 Q ss_pred ccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCccccccccccccccccc Q lcl|NC_019524. 241 ARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFK 320 (556) Q Consensus 241 ~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~ 320 (556) ....+++++|||+... ......|+|++..+...+......++....-.+=.+...++++.+..-. .+.. 
T Consensus 170 ~~~~~~~~~iih~~~~-~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~----------~e~~ 238 (432) T protein:vir:10 170 QMIDIPKQQIWKIMGY-SLDGENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDRFLT----------DDQY 238 (432) T ss_pred eEEEEcCccEEEecCC-CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCCCCC----------HHHH Confidence 1124678899999765 4556899999998887776665555555555555677788888653211 1111 Q ss_pred ccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccch Q lcl|NC_019524. 321 EIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYS 400 (556) Q Consensus 321 ~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYS 400 (556) +.+.... . .....|.+..|+.|.+++.++.+.-...|.+-.+....+||+.+|||-+.| |+.++.+|+ T Consensus 239 ~~~~~~~-------~----~~~nag~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~afgVPp~~l-g~~~~~t~~ 306 (432) T protein:vir:10 239 DSFAKKV-------S----GSVEAGRAPLLEGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMI-GHSSAGTTS 306 (432) T ss_pred HHHHHHH-------h----hhhhCCCceecCCCceEEEccCChHHHHHHHHHHHHHHHHHHHhCCCHHHc-CCccCCccc Confidence 1111111 1 113457788899999999998877777888888999999999999999865 777665553 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhh Q lcl|NC_019524. 401 SARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKE 480 (556) Q Consensus 401 s~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke 480 (556) ..- .++..+..|+..-+.|+.+. ++.++....++-..... ..++|-.-.-.-.|+... 
T Consensus 307 ~~s--------n~e~~~~~f~~~tl~P~~~~-ie~~ln~kL~~~~~~~~-------------~~~~fd~~~ll~~d~~~r 364 (432) T protein:vir:10 307 WGS--------GIESQQLGFLSMTLSPWLRR-IEQSIALNLLSPAERRR-------------YFADFDTSALLRADSAAR 364 (432) T ss_pred ccc--------hHHHHHHHHHHHHHHHHHHH-HHHHHHhhhcCccccCc-------------eEEEeechhhhccCHHHH Confidence 210 12333334455566776665 44445444443211110 112333333334599999 Q ss_pred hHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 481 TEAAILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 481 ~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 556 (556) +++..+.+.+|+.|+-|+-+..|..|-+--+.. ..--.++. +..... ....+++ ....++++.++ T Consensus 365 ~~~~~~~~~~G~~T~NE~R~~~glppi~g~~~~-----~~~~~~~~----pl~~~~-~~~~~~~-~~~~~~~~~~~ 429 (432) T protein:vir:10 365 SSYYSQLVNNGLMTRDEAREIEGLPKLGGNAAV-----LTVQSAMV----PLDSIG-LQASPEP-ASGLGNQQQDK 429 (432) T ss_pred HHHHHHHHhCCCCCHHHHHHHhCCCCCCCCcce-----EeecCccc----chhhhc-ccCCCCC-CCCCCCccccc Confidence 999999999999999999999998774311000 00001110 100000 0001111 11111111111 No 19 >protein:vir:94426 Length: 409 # NCBI annotation: ORF009 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240003;genbank:gi:66395665;genbank:GeneID:5133086 Probab=99.74 E-value=8e-18 Score=114.31 Aligned_cols=407 Identities=10% Similarity=0.028 Sum_probs=228.1 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAA 80 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~ 80 (556) |++.+-..|..-.-.-. +.+.+. . ..-+|.+..... ...-+++.+..++.+. 
T Consensus 1 ~~~~~~~~~~k~~~~~~----------------~~~~~~-~-~~~~~~~~~~~~----------~~~v~~~~a~~~~~v~ 52 (409) T protein:vir:94 1 MAKENIVTRIKKKLIDN----------------WIDQSA-S-KLYDFSPWKNKS----------FWGVINNTLETNETIF 52 (409) T ss_pred CcccccchhhhhHHhhh----------------hhcCCc-c-cccccccccCcc----------ccccchhhhhccHHHH Confidence 77776666554321101 111111 1 111222211100 0112344455678889 Q ss_pred HHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCce Q lcl|NC_019524. 81 GVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEV 160 (556) Q Consensus 81 ~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~ 160 (556) .+|+.|.+.|-.--|++.-+.+ ... ..++..+..+|+ -.+|.+++....+..++..|++ T Consensus 53 ~~i~~Ia~~ia~lp~~~~~~~~--------~~~-------~~~~~lL~~~PN------~~~t~~~f~~~~~~~lll~Gna 111 (409) T protein:vir:94 53 SAITKLSNSMASLPLKMYEDYK--------VVN-------TEVSDLLTVSPN------NSLSSFDFINQIETIRNEKGNA 111 (409) T ss_pred HHHHHHHHhhhhCceeEeeccc--------ccc-------hhHHHHHhhhcc------cCCCHHHHHHHHHHHHhhcCCe Confidence 9999999888876666542211 001 112333444554 2468889999999999999999 Q ss_pred EEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceee Q lcl|NC_019524. 161 LATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEP 240 (556) Q Consensus 161 f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~ 240 (556) |+.+.+.. ...+..|..|.|+.+ -|..|.++.++.|.+...+. T Consensus 112 y~~i~r~~--------~G~~~~L~~l~~~~v--------------~v~~~~~~~~~~y~~~~~~g--------------- 154 (409) T protein:vir:94 112 YVLIERDI--------YHQPSKLFLLNPDVV--------------EMLIENQSRELYYSIHAATG--------------- 154 (409) T ss_pred EEEEEECC--------CCcEEEEEEEcCcee--------------EEEEeCCCcEEEEEEEcCCc--------------- Confidence 99876432 123568888998888 35666778888888863321 Q ss_pred ccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcce-eeeEeccCcccccccccccccccc Q lcl|NC_019524. 
241 ARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATY-AASVESELPSDVVFGQLGMGQGGF 319 (556) Q Consensus 241 ~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~-~~fi~~~~~~~~~~~~~~~~~~~~ 319 (556) ....+++.+|||+......+...|+|++..+...+.-.....+. .....+.- .++++.+..- ..+. T Consensus 155 ~~~~~~~~dvih~r~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~---~~~~~~~~~~~i~~~~~~l----------~~e~ 221 (409) T protein:vir:94 155 NKLIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTF---NLTEMQKPDSFMLKYGSNV----------GKEK 221 (409) T ss_pred eEEEEccccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHH---HHHhcCCCCeeEEecCCCC----------CHHH Confidence 01246788999998776778899999887654444332222221 12222222 2333322110 1111 Q ss_pred cccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccc Q lcl|NC_019524. 320 KEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNY 399 (556) Q Consensus 320 ~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nY 399 (556) .+....... ... =..|.+..|..|.+++.++.+.-..+|.+..+....+||+.+|||-+.| ++.+++|| T Consensus 222 ~~~~~~~~~---~~~-------~~~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~~~ 290 (409) T protein:vir:94 222 RQQVLEDFK---QYY-------EENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFL-NARSNTNF 290 (409) T ss_pred HHHHHHHHH---HHh-------hcCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHh-CCCCCCCc Confidence 111111100 000 1356788899999999998777777888989999999999999999866 77788899 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhh Q lcl|NC_019524. 400 SSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKK 479 (556) Q Consensus 400 Ss~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~K 479 (556) |++.+....|. ..-+.|+.+.+- .++....++-..... -..+++-.....-.|+.. 
T Consensus 291 sn~e~~~~~f~-----------~~~l~P~~~~ie-~~ln~~Ll~~~~~~~------------~~~i~fd~~~ll~~d~~~ 346 (409) T protein:vir:94 291 AKNEELNRFYL-----------QHTLLPIVKQYE-EEFNRKLLTKTDREK------------NRYFKFNVKSYLRADSAT 346 (409) T ss_pred ccHHHHHHHHH-----------HHHHHHHHHHHH-HHHHHhhCCcccccC------------cceEEeechhhhccCHHH Confidence 99876666554 344566665543 334433332211100 011233333344469999 Q ss_pred hhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 480 ETEAAILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 480 e~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 556 (556) .+++....+.+|+.|+-|+-+..|..|-+-.+++- ++....+...... .......+++++++ T Consensus 347 ~~~~~~~~~~~G~~T~NE~R~~~g~~p~~ggD~~~----------~~~n~~~~~~~~~-----~~~~~kGG~~n~~e 408 (409) T protein:vir:94 347 QAEVYFKAVRSGYYTINDIREWEDLPPVEGGDKPL----------ISGDLYPIDTPLE-----LRKSLKGGDKNVNE 408 (409) T ss_pred HHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeEe----------ecccccccccchh-----hcccccCCCCCcCC Confidence 99999999999999999998888988764221110 1111111111000 00111222222222 No 20 >protein:vir:79772 Length: 648 # NCBI annotation: portal protein # Family: family:all:3222 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429612;genbank:gi:156564103;genbank:GeneID:5525537 Probab=99.74 E-value=1.5e-17 Score=112.82 Aligned_cols=466 Identities=12% Similarity=0.092 Sum_probs=209.7 Q ss_pred CCcchhhhHHHHH-h---hHhhccc-chhhhhhhhcchhccccCCCccc-----------------ccccCCCCCHHHHH Q lcl|NC_019524. 
1 MKDVKKTTRTRAK-K---AVDVVAE-TATATPMAVGGGMEGAERTTREM-----------------FQWNPSIISPDQQI 58 (556) Q Consensus 1 ~sp~~~~~r~~a~-~---a~~~~~~-~~~~~~~~~~~~y~aa~~~~r~~-----------------~~w~~~~~s~~~~i 58 (556) .|-+..+-|..-- + ++....+ ..++.++..+.+..+....+..+ .+|..+..+.+ T Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~d~~~~~~~r~g~~~~~~~~g~~~~~epp~d~~--- 87 (648) T protein:vir:79 11 WSRISLMWRDEDDDKEPLVLEESMQLGEAPGAMPKGGGGGGSAKRDPKMSLVKRIGLAIMDGGGGGRDFEEPEFDFN--- 87 (648) T ss_pred hhhhhhhccCccccccccccccccccCCCccccCCCCcccccccccchhHHHHHhHHHHHhhcCCccccccCCcCHH--- Confidence 2222111110000 0 0000000 00000000000000000111111 11211222211 Q ss_pred HHHHHHHHHHHHHHHhcChHHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehh Q lcl|NC_019524. 59 AQNQDMASARAQDMVQNDGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDAR 138 (556) Q Consensus 59 ~~~~~~lr~RaRdl~rNn~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~ 138 (556) .| ..++.+||++..||+.+++.|.+.++.+..+-+. ..+.... .....+++ T Consensus 88 -----~l----~~l~~~np~V~~aI~iia~~ia~l~~~i~~~~~~--------~~~~~~~------~~ll~rPn------ 138 (648) T protein:vir:79 88 -----EI----TSAYNTEGYVRQAVDKYIEMMFKADWDFVSKNPN--------AVEYIRM------RFTLMAEA------ 138 (648) T ss_pred -----HH----HHHHhcChHHHHHHHHHHHHHhhCcceEEecCCc--------cchhhHH------HHHhhccC------ Confidence 11 2267889999999999999999988877654321 1111111 11122332 Q ss_pred cccCHHHHHHHHhhhheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEE Q lcl|NC_019524. 139 RMCTLTGLTRLAVSGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGY 218 (556) Q Consensus 139 g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY 218 (556) -.+|.+++....+..++..|.+|+.+.+.... 
.+ ++.+.++...++..-.....-+..+.-|..|.+|.+..| T Consensus 139 ~~~t~~~f~~~l~~~lll~GNAYveiiRd~~G------~~-~~~l~~~~~~~~~~v~~l~pl~p~~v~v~~d~~g~~~~Y 211 (648) T protein:vir:79 139 TQIPTNQLFIEIAEDLVKYCNVVIAKSRAKDA------LP-FQGMNVMGVGDSMPVAGYFPLNLASMKVKRDKFGMIKGW 211 (648) T ss_pred CCCCHHHHHHHHHHHHHhcCCeEEEEEecCCC------cc-chhhhhhhhccccceeeeEeecCceeEEEEcCCCceeee Confidence 23578888888888999999999988754321 11 222333333332211000000011223677899999888 Q ss_pred EEeecCCCccccCCccccceeeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeee Q lcl|NC_019524. 219 WLRKAFPGDPTDMEQWKWGYEPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAAS 298 (556) Q Consensus 219 ~i~~~hpgd~~~~~~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~f 298 (556) ..... |.. ....+++.+|||+......+...|+|++.++...+.......+....--+=.+.-.++ T Consensus 212 ~y~~~--g~~------------~~~~~~~~dIIHik~~~~~d~~~GlSpi~~a~~aI~l~~aa~~~~~~fF~NGa~P~gi 277 (648) T protein:vir:79 212 QQEQE--GQD------------KPQKFKPEDIVHIYYKREKGRAFGTPWLLPALDDIRALRQVEENVLRLVYRNLHPLWH 277 (648) T ss_pred EEEec--CCc------------eeEEecCccEEEEccCCCCCCceeccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEE Confidence 75321 110 1123567899999877778889999999998888876665555554433334555566 Q ss_pred EeccCcccccccccccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHH Q lcl|NC_019524. 299 VESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRN 378 (556) Q Consensus 299 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~ 378 (556) |+.+.+.... +. ......... .. .....+.+|.+.. ..-.+.+.. +....+|.+..+..... 
T Consensus 278 l~~~~~~~~~-e~-------~k~~~e~~~----~~---~~~~~i~gg~v~~--~~~~i~~~~-s~~dlqfle~rk~~~~e 339 (648) T protein:vir:79 278 VKVGLEQEGF-GA-------EEGEVDLVR----GE---VENMDVEGGMVTT--ERVNISSIA-SNQIIDAKEYLKHFEQR 339 (648) T ss_pred EEeCCCccch-HH-------HHHHHHHHH----Hh---ccccccccccccc--ceeeccccC-CHHHHHHHHHHHHHHHH Confidence 7654322110 00 000000000 00 0111122332211 011111111 11123567777889999 Q ss_pred HHHhcCCCHHHhhchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhh Q lcl|NC_019524. 379 IAASLGMSYEQFSRDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMM 458 (556) Q Consensus 379 iaaglGi~ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~ 458 (556) ||+.+|||...| |+..++|||.+-+....+...+...+..+...+-.-+....+.+..+...+. |. T Consensus 340 Ia~aFgVPP~lL-G~~~~ss~stae~~~~~~~~~i~~l~~~i~~~le~~~~~~ll~e~~l~~~l~-~d------------ 405 (648) T protein:vir:79 340 AFTVLGVSELMM-GRGGTASRSTGDNLSSDFKDRIKALQKVMATFINEFMVKEILMEGGFDPVLN-PD------------ 405 (648) T ss_pred HHHHhCCCHhHc-ccCCCccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccc-cc------------ Confidence 999999999855 7777888888755544444444444443322221111111111111111110 11 Q ss_pred HHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHH-------HHHHHHHHHHHHcCCCCCccc Q lcl|NC_019524. 459 RDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGGDFREVF-------KQRAREEGLIKSLKLDFTGKM 531 (556) Q Consensus 459 ~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~-------~q~a~E~~~~~~~Gl~~~~~~ 531 (556) . .+++..-...-.|+.+.++.....+++|+.|+.|+-+..|++|-+-- .+.-.-.......+.. +.++ T Consensus 406 --~--~ieF~~~~Llr~D~~~~a~~~~~l~~~GilT~NEaR~~lGlpPi~~g~~~~~l~~~~~~~~~~~~~~~~~-~~~~ 480 (648) T protein:vir:79 406 --D--KVEFRFNEIDMDSKIKLENQAVFLYEHNAISEDEMRELIGRDPVDDGEGRAKMHLQMVTIAQATALAALA-PTPA 480 (648) T ss_pred --c--eEEEeecccchhhHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCccccccccccchhccccccCC-CCCC Confidence 1 12222233444588889998999999999999999888899884311 1100000000000000 0011 Q ss_pred cccCCCCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 
532 VEGNSTQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 532 ~~~~~~~~~~~~~~~~~~~~~e~~~ 556 (556) .....+........+.++.+..++| T Consensus 481 ~~~~~~a~~eg~~~e~~~~~~~~~~ 505 (648) T protein:vir:79 481 GGSSASASGDKKKKATDNKTKPTNQ 505 (648) T ss_pred CCCCCCccccccccccCCCCCCCCC Confidence 0111111111111111111111222 No 21 >protein:vir:100691 Length: 535 # NCBI annotation: hypothetical protein # Family: family:all:2446 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164747;genbank:gi:56693160;genbank:GeneID:3197324 Probab=99.74 E-value=2.2e-17 Score=111.93 Aligned_cols=471 Identities=9% Similarity=0.032 Sum_probs=229.0 Q ss_pred CCcchhhhHHHH-------------HhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHH Q lcl|NC_019524. 1 MKDVKKTTRTRA-------------KKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASA 67 (556) Q Consensus 1 ~sp~~~~~r~~a-------------~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~ 67 (556) |.=.+-++-... ...-+..+++..+........+.|..-..-..++|.....-.+.. ....| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~---~~~~l-- 75 (535) T protein:vir:10 1 MAILKDLRNAFSLSNKKSTSYIELGDYDKDIVNKAIRPGRASARDTVDGIDIADGNVAGQYSVASISDVL---STKKL-- 75 (535) T ss_pred ChhhHHHHHHHHhhhhhhhhhHHHhhhhHHHHHhhhhhhhhhhhccccccccccCCcccccccCcccccc---CHHHH-- Confidence 222222211110 011112222211111111122222111111112343222222211 22222 Q ss_pred HHHHHHhcChHHHHHHHHHHhhhc-----------cCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhccccccee Q lcl|NC_019524. 68 RAQDMVQNDGYAAGVVAVHRDSIV-----------GSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFD 136 (556) Q Consensus 68 RaRdl~rNn~~a~~~v~~~~~nvV-----------G~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD 136 (556) +.++.++|+++.+|+++.++|- +-|+.+..+ ...-..+.+. 
......+...+..+++.+ T Consensus 76 --~~~~~~~~~~~~~i~t~~~~va~~~~i~~~s~~~~~~~i~l~--~~~~~~~~~~----~~~~~~l~~lL~~~PN~~-- 145 (535) T protein:vir:10 76 --LKAYADNDIVQAIIRTRTNQVLTYSNPSRYNRNGVGFKVELK--DATKVMSKAQ----IKRAHEIEDFIYNTGSEY-- 145 (535) T ss_pred --HHHhccChhHHHHHHHHHHHHHHHHHHHHHhcccCcceeEEE--eccCCCcchh----hhhhhHHHHHHHhCCCCC-- Confidence 2355677888888777776654 222222211 0000111111 112222233344455422 Q ss_pred hhcccCHHHHHHHHhhh-heecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCC-- Q lcl|NC_019524. 137 ARRMCTLTGLTRLAVSG-FLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNG-- 213 (556) Q Consensus 137 ~~g~~~f~~lq~l~~r~-~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~G-- 213 (556) .+++.+|..+...++.. .+.+|.+|+.+..... ..+..|..|+|+.|... .|.+| T Consensus 146 ~~~~~~~~~~~~~lv~d~l~~~g~ay~~i~r~~~--------G~~~~L~~l~p~~V~v~--------------~d~~~~~ 203 (535) T protein:vir:10 146 YEWRDTFPRLLTKIINDMYVQDQINIERIFKNDS--------NELDHFNAVDASKVVIS--------------YSPRSKD 203 (535) T ss_pred CChhHHHHHHHHHHHHHHHhhCCceEEEEEECCC--------CcEEEEEEeCCceeEEE--------------EcCcccc Confidence 23344555566555554 5567888877654321 13567888999888421 12222 Q ss_pred -CeEEEEEeecCCCccccCCccccceeeccccCChhHeEeeecccCCC---cccCCchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 214 -AALGYWLRKAFPGDPTDMEQWKWGYEPARFDWGRRRVIHIIEALLAG---QTRGISEMVSALKQMKMTRNFQEITLQNA 289 (556) Q Consensus 214 -r~vaY~i~~~hpgd~~~~~~~~~~rv~~~~~v~a~~viH~f~~~r~g---Q~RGvs~la~~l~~l~~l~~~~dael~~a 289 (556) -++.|++.... ....+++++|||+...-+.+ ...|+|++..+...+......++....-. 
T Consensus 204 ~~~~~~~~~~~~----------------~~~~~~~~eiih~~~~~~~~~~~~~~G~Spi~~~~~~i~~~~aa~~~~~~~f 267 (535) T protein:vir:10 204 QPRKFEQFVSET----------------KSVKFSERNLTFINYWNLSDTDRRGYGYSPVEASIPLIRAIYDTEQFNARFF 267 (535) T ss_pred CceEEEEEecCc----------------eeEEECcccEEEEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHH Confidence 23334332211 01246678999997644333 44599999888877777766666666666 Q ss_pred HHhcceeeeEeccCcccccccccccccccccccccccccccccccccccceecCCceeeecC-CCceeeeecCCCCCccH Q lcl|NC_019524. 290 VVNATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLY-PGTKLKMQPAGTPGGVG 368 (556) Q Consensus 290 ~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~-pGe~i~~~~~~~p~~~f 368 (556) +=.+.-.++|+.+....... ..+..+.+.... .....+. =..|.+..|. .|.+++.++.+.-...| T Consensus 268 ~ng~~p~giL~~~~~~~~~l------s~e~~e~lk~~~---~~~~~G~----~nag~~~vl~~~g~~~~~l~~~~~D~qf 334 (535) T protein:vir:10 268 SQGGTTRGILVIDQDGDAQA------NQMMLAGIRRQW---TSQGSGL----GGAWKIPILAAKDAKFVNMTQNSRDMEF 334 (535) T ss_pred hccCCccEEEEecCCCCccc------CHHHHHHHHHHH---HHHhcCc----ccccccccccCCCceEEecCCChhHHHH Confidence 66677778888753211100 000000000000 0000000 1245554443 68888888887778889 Q ss_pred HHHHHHHHHHHHHhcCCCHHHhhchhhcccchhHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCC Q lcl|NC_019524. 369 TDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSARASMAETQKYM-DSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPG 447 (556) Q Consensus 369 ~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R~~~~e~~r~~-~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~ 447 (556) .+..+...+.||..+|||.+.| |+..++|||++..+...++... 
+.....++...+.|+..+ ++.++....++--+ T Consensus 335 le~~~~~~~eIa~afgVPp~~l-G~~~~at~sn~~~~~~~~~~s~~E~~~~~~~~~~L~P~l~~-ie~~ln~~Ll~~~~- 411 (535) T protein:vir:10 335 DKFLNFMIYDTAAIFQMQPEEI-NFPNNGGSTGKSGTKSVNEGSTAKAKLESSKDKGLTPLLSF-IEQVINDKIMRYVD- 411 (535) T ss_pred HHHHHHHHHHHHHHhCCCHHHh-ccccCcccccchhhhhhhhhhhHHHHHHHHHHHHHHHHHHH-HHHHHhhhcccccC- Confidence 9999999999999999999866 9999999999999988888765 566677889999996665 56566555443111 Q ss_pred cccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHH---HHHHHHHcC Q lcl|NC_019524. 448 KNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGGDFREVFKQRAR---EEGLIKSLK 524 (556) Q Consensus 448 ~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~---E~~~~~~~G 524 (556) . .+.+++. .-...|+....++..+.+. |..|+-|+-+..|+.|-+--++..- .......-+ T Consensus 412 ~-------------~~~f~f~--~l~~~d~~~r~~~~~~~~~-g~lT~NE~R~~~gl~piegGD~~~~~~~~~~~~~~~~ 475 (535) T protein:vir:10 412 T-------------DYRFSFT--LGDAQDKLQEEQVWKLKLA-NGYFINEYRKDHGLKTVDGLDVPGFIGSAENFINATG 475 (535) T ss_pred C-------------eEEEEec--cccccCHHHHHHHHHHHHc-CCCCHHHHHHHhCCCCCCCccccccccchhhcccccc Confidence 0 0122333 3445688888888877775 5579999888889887542221100 000100000 Q ss_pred CCCCccccccCCCCCCCCC----CC---------CCCCC------CCcCCC Q lcl|NC_019524. 525 LDFTGKMVEGNSTQSSNSS----ES---------TSDNP------NEETTQ 556 (556) Q Consensus 525 l~~~~~~~~~~~~~~~~~~----~~---------~~~~~------~~e~~~ 556 (556) ...+..+...........+ +. 
..++| +.+++| T Consensus 476 ~~~~~~p~~~~~~~~~~~~~~~q~~~~~~~~~~~g~~~~~~~~~~~~~~~~ 526 (535) T protein:vir:10 476 FGQPNVPDSSDDSGSTLGERERQERIQHSKDYEKGKDDPKSPLPKPSESDD 526 (535) T ss_pred cccccCCCCCCCccccCCccccCcccccccccccCCCCCCCCCCcCCCCCc Confidence 0000000000000000000 00 00000 000111 No 22 >protein:vir:102727 Length: 945 # NCBI annotation: portal protein # Family: family:all:2446 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874016;genbank:gi:118197623;genbank:GeneID:4495919 Probab=99.73 E-value=2.6e-18 Score=117.03 Aligned_cols=466 Identities=9% Similarity=-0.053 Sum_probs=224.9 Q ss_pred CCcchhhhHHHHHhhH----------------hhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAV----------------DVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDM 64 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~----------------~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~ 64 (556) ..|.---+..+....+ ....+-..-.+......++... .... ..+....++.- . T Consensus 43 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~kk~~i~~pfkkk~~~~~~d~f~~s~-es~s---~vtsls~pdaf------~ 112 (945) T protein:vir:10 43 FKPLLTYRALAWNSTVVYSIIIFRKNQVLKKEKIIVPYNHQEPPFKFNLFEYSP-ESLM---YLPSISDPDAF------F 112 (945) T ss_pred cchhhhhhhhhccceeeeeeeeehhhhHHHhhcccccccccccchhhhhhhccC-ccce---ecccccCccce------e Confidence 2222111111000000 0000000000000011111100 0000 01111111111 1 Q ss_pred HHHHHHHHHhcChHHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHH Q lcl|NC_019524. 65 ASARAQDMVQNDGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLT 144 (556) Q Consensus 65 lr~RaRdl~rNn~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~ 144 (556) +.+-.++.+.+++.+..+|+.+.+.+-+--|++.-+.+. |......+.. .....+++ +..+|+ .+.++...|. 
T Consensus 113 ~vnVs~~~AlknsaV~scI~~IA~sIAsLPlklYrr~ed---G~~~~~~kk~-~~~hpL~~-LL~rPN--p~mT~~eFwq 185 (945) T protein:vir:10 113 LINLFRKYRFNNDSKLIKVSEIPKKLTSKELEIYKHIED---KHVNYYLKRI-RDARNILE-FLERPD--PYFSEVNSWE 185 (945) T ss_pred eehhhhhhhhccHHHHHHHHHHHhhhccCceEEEEeccc---Cccccccccc-ccchHHHH-HHhCCC--cccChhHHHH Confidence 222345667789999999999999998876665433221 1111000000 01111222 334665 4566666677 Q ss_pred HHHHHHhhhheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecC Q lcl|NC_019524. 145 GLTRLAVSGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAF 224 (556) Q Consensus 145 ~lq~l~~r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~h 224 (556) .+.+..++.++..|.+++.+.+... | .+..|..|+|+++. |..|.+|...-|++.... T Consensus 186 sFl~~Lv~dLLL~GNAYieIiRd~~------G--~ii~L~pLdPs~Vt--------------i~~ddDG~~~y~Yv~~id 243 (945) T protein:vir:10 186 YLLGMVLDDILTIDRGAIVKIRDEQ------G--NLVAITPVDGTTIK--------------PILSEDTGIVVGYVQEVD 243 (945) T ss_pred HHHHHHHHHHhhcCCeEEEEEECCC------C--cEEEEEEECCcceE--------------EEEcCCCcEEEEEEEecC Confidence 7888999999999999998764321 2 35688899998874 233455554444433221 Q ss_pred CCccccCCccccceeeccccCChhHeEeeec-ccCCCccc--CCchhhHHHHHHHHHHHHHHHHHHHHH-HhcceeeeEe Q lcl|NC_019524. 225 PGDPTDMEQWKWGYEPARFDWGRRRVIHIIE-ALLAGQTR--GISEMVSALKQMKMTRNFQEITLQNAV-VNATYAASVE 300 (556) Q Consensus 225 pgd~~~~~~~~~~rv~~~~~v~a~~viH~f~-~~r~gQ~R--Gvs~la~~l~~l~~l~~~~dael~~a~-i~A~~~~fi~ 300 (556) .+ ....+++.+|||++. +..-|..+ |+|++..+...+......++....-.. 
-.+.-.++|+ T Consensus 244 G~--------------~~~~v~a~DvIlhirn~s~DG~~~GyGlSPIeaa~~aI~~alAaek~aar~FskNGa~PsGILs 309 (945) T protein:vir:10 244 GA--------------IVAHFDKRDVVLFRQNLTPDVYMYGYSLPPIEILYKVILSDIFIDKGNLDYYRKGGSIPEGILA 309 (945) T ss_pred Cc--------------eEEEecCCceEEEeccCCCCcccccCCchHHHHHHHHHHHHHHHHHHHHHHHHhCCCccceEEE Confidence 10 011355667776554 44445554 788887766655444433333333321 2244557776 Q ss_pred ccCcccccccccccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHH Q lcl|NC_019524. 301 SELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIA 380 (556) Q Consensus 301 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~ia 380 (556) .+.+........+....+..+........ ...+ -..|....|..|.+++.++.+.....|.+..+.....|| T Consensus 310 vkg~~~~d~k~~~~LseEq~erlKe~wee---~~sG-----~NnG~piVLdeGmef~pLs~s~~DaQfLEsrkfs~eeIA 381 (945) T protein:vir:10 310 IEPPSYKEGDIYPQLSREQLESIQRQLQA---IMMG-----DYTQVPILSGGKFTWIDFKGKRRDMQFKELAEFVARKIC 381 (945) T ss_pred ecCccccccccccccCHHHHHHHHHHHHH---HhCC-----cccccceecCCCceEEEccCChhHHHHHHHHHHHHHHHH Confidence 54332221111111111111111111000 0000 123444468999999999988788899999999999999 Q ss_pred HhcCCCHHHhhchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHH Q lcl|NC_019524. 
381 ASLGMSYEQFSRDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRD 460 (556) Q Consensus 381 aglGi~ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~ 460 (556) +.+|||.+.| |+.++.|||++.+....+ +..-++|+..++ +.++.+..++...+ T Consensus 382 rAFGVPP~lL-G~~e~st~SNiEqq~~~F-----------v~~tL~Pil~~I-EqeLNrkLl~~~eg------------- 435 (945) T protein:vir:10 382 AVYQVSPQDV-GILEGSNKATAEVMASLT-----------KAKGLEPLMATI-SKGFDEVVSEFRNE------------- 435 (945) T ss_pred HHhCCCHHHc-ccCCCCCcchHHHHHHHH-----------HHHHHHHHHHHH-HHHHHHhccccccC------------- Confidence 9999999866 899999999986655544 344445544433 44444332211111 Q ss_pred HhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHH--------HHHHHHHcCCCCCcccc Q lcl|NC_019524. 461 ALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGGDFREVFKQRAR--------EEGLIKSLKLDFTGKMV 532 (556) Q Consensus 461 a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~--------E~~~~~~~Gl~~~~~~~ 532 (556) ..+.++|.... ..|+...+++....+++|+.|+-|+-+..|..|-+--++... +.....+.|-..+ ... T Consensus 436 ~~i~fdFd~ld--l~D~ksraEal~kli~sGiLTiNEvRe~lGLpPIeGGD~lli~~nn~~P~d~~~ka~~ga~p~-q~a 512 (945) T protein:vir:10 436 KDIKLWFKEDD--LEKERDWWNIIQGQLNTGFRSINEARMEKGLEPVPWGDVPFSGLRNWKPEDEQAKAQQGAMPP-QLA 512 (945) T ss_pred ceeEEEecchh--ccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeeeccccccccccccccccCCCCc-ccc Confidence 11234554443 458888999999999999999999988889888542221110 0000001110000 000 Q ss_pred ccCCCCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 533 EGNSTQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 533 ~~~~~~~~~~~~~~~~~~~~e~~~ 556 (556) ......+.+..++.+++.+..++. 
T Consensus 513 q~~~dqp~~kGGe~dEns~~psE~ 536 (945) T protein:vir:10 513 QAMADQPSQQGGGVDENSSVPSEQ 536 (945) T ss_pred cCCCCCCCCCCCCCCCCCCCCCcc Confidence 010111111111111111111111 No 23 >protein:vir:483 Length: 413 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543090;swissprot:trembl:q8w629;genbank:gi:18249902;uniprot:Q8W629;genbank:GeneID:929685 Probab=99.73 E-value=1.5e-17 Score=112.87 Aligned_cols=413 Identities=12% Similarity=0.023 Sum_probs=229.4 Q ss_pred chhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHH Q lcl|NC_019524. 4 VKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVV 83 (556) Q Consensus 4 ~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v 83 (556) |-=. ..+-|+...... ..... ....+.+ ...|... .-+.+.+..++.+.++| T Consensus 1 ~~f~-~~f~r~~~~~~~---~~~~~---~~~~~~~-----~~~~~g~----------------~v~~~~~l~~~~v~~~i 52 (413) T protein:vir:48 1 MFFS-GLFQRKSDAPVT---TPAEL---AEAIGLS-----YDTYTGK----------------RISSQRAMRLTAVYSCV 52 (413) T ss_pred Cccc-hhhccCccCCcc---chHHH---HHhhhcC-----cccccCc----------------eechhhhhccHHHHHHH Confidence 1100 000000000000 00000 0000000 0001100 01234456688999999 Q ss_pred HHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEE Q lcl|NC_019524. 84 AVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLAT 163 (556) Q Consensus 84 ~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~ 163 (556) +.|.+.|-+--|++.-+.+.. . + +..... +...+..+|+ ..++..++....+..++..|++|+. 
T Consensus 53 ~~Ia~~iA~~p~~~~~~~~~~----~-~--~~~~~~---~~~lL~~~PN------~~~t~~~f~~~~~~~lll~Gn~~~~ 116 (413) T protein:vir:48 53 RVLAESVGMLPCSLYKISGTL----K-T--RVVDER---LHKLVSAKPN------GYMTPQEFWELVIVCLCLRGNFYAY 116 (413) T ss_pred HHHHHhhhhCceEEEEecCCc----c-e--eecccH---HHHHHHhhcc------CCCCHHHHHHHHHHHHhhcCceEEE Confidence 999999998777765432110 0 0 000111 2222333443 3578999999999999999999998 Q ss_pred EeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceeeccc Q lcl|NC_019524. 164 CEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPARF 243 (556) Q Consensus 164 ~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~~~ 243 (556) ++... | .+..|..|+|++|. |..|..|.++ |.+.... |. .. T Consensus 117 i~~~~-------g--~~~~L~~l~~~~v~--------------~~~~~~~~~~-y~~~~~~-g~--------------~~ 157 (413) T protein:vir:48 117 KVKAL-------G--EVVELLPIDPGCVE--------------PKLNSQWQPV-YQVTFPD-GS--------------VD 157 (413) T ss_pred EEeCC-------C--cEEEEEEEcCceEE--------------EEEcCCceEE-EEEEecC-ce--------------EE Confidence 76421 2 35678888888874 2344455443 4443211 10 12 Q ss_pred cCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccc Q lcl|NC_019524. 244 DWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIF 323 (556) Q Consensus 244 ~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~ 323 (556) .+++.+|||+...- .....|+|++..+...+.......++.....+=.+...++|+.+..-. .+..+.. T Consensus 158 ~~~~~evih~~~~~-~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~~~----------~e~~~~~ 226 (413) T protein:vir:48 158 VLTQDEIWHVRTLT-LDGLVGLNPIAYAREAISLAAATEEHGARLFGNGAVTSGVLRTEQKLT----------PDAYERL 226 (413) T ss_pred EEccccEEEecCcC-CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCC----------HHHHHHH Confidence 36778999997664 566899999999998888777777777777777788888988754211 0100000 Q ss_pred cccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccchhHH Q lcl|NC_019524. 
324 NEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSAR 403 (556) Q Consensus 324 ~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R 403 (556) .... .....+ .-..|.+..|..|.+++.++.+.-...|.+..+.....||..+|||.+.| ++..++|||++. T Consensus 227 ~~~~---~~~~~g----~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~t~~n~e 298 (413) T protein:vir:48 227 KKDF---EERHTG----LGNAHRPMILEMGLDWKSMALNAEDSQFLETRKFQLEEICRLFRVPLHMV-QNTDRATFNNIE 298 (413) T ss_pred HHHH---HHHhcC----ccccCcceecCCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHh-CCCcCCCcccHH Confidence 0000 000000 11356778899999999999877778888999999999999999999866 677788999987 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHH Q lcl|NC_019524. 404 ASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEA 483 (556) Q Consensus 404 ~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A 483 (556) +..+.|. ...+.|+.+.+ +.++....++ +.... ..+ +++-.-.....|+...+++ T Consensus 299 ~~~~~f~-----------~~~i~P~~~~i-e~~l~~~L~~-~~~~~----------~~~--~~fd~~~l~~~d~~~~~~~ 353 (413) T protein:vir:48 299 ELGLGFI-----------NYSLVPYLTRI-EQRINTGLVR-ESKQG----------KFY--AKFNAGALLRGDMKSRFEA 353 (413) T ss_pred HHHHHHH-----------HHHHHHHHHHH-HHHHHhhccC-ccccC----------CeE--EEEechhhhccCHHHHHHH Confidence 6665553 33455655543 3444444332 11100 001 1221222233599999999 Q ss_pred HHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCcCC Q lcl|NC_019524. 484 AILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEETT 555 (556) Q Consensus 484 ~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~ 555 (556) ....+++|+.|+-|+-+..|..|-+-- +++=++... 
...........++.++.+.++.++ T Consensus 354 ~~~~~~~g~~T~NE~R~~~g~~p~~gg----------D~~~~~~n~--~~~~~~~~~~~~~~~~~~~~~~~~ 413 (413) T protein:vir:48 354 YATGINWGIYSPNDCRDLEDMNPRPGG----------DVYLTPMNM--TTSPSAGDDNGKKKESGDADKTAS 413 (413) T ss_pred HHHHHhCCCcCHHHHHHHhCCCCCCCc----------ceeeccccc--cccccccccCCCCCCCCCccccCC Confidence 999999999999999988898875411 111111111 111111111111222222222222 No 24 >protein:vir:102080 Length: 429 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512313;genbank:gi:89152482;genbank:GeneID:3953073 Probab=99.73 E-value=5.7e-18 Score=115.13 Aligned_cols=427 Identities=11% Similarity=0.049 Sum_probs=233.4 Q ss_pred chhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHH Q lcl|NC_019524. 4 VKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVV 83 (556) Q Consensus 4 ~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v 83 (556) |.=+.+..-.. .+... .. ..-+..+..+..|.....+. ..-+..-+..++.+.++| T Consensus 1 M~~~~~~f~~~--~r~~~----------~~-~~~~~~~~~~~~~~g~~~~~-----------~~v~~~~al~~~~v~~~i 56 (429) T protein:vir:10 1 MDSVKKFFNFE--KRQTS----------QV-IELNKDDEKLLEWLGISPST-----------ISVKGKNALKVATVFACI 56 (429) T ss_pred Cchhhhhhccc--ccCcc----------cc-cccCCChHHHHHHhcCCCCc-----------ceechhhhhccHHHHHHH Confidence 22222221000 00000 00 00000011111121110000 000112234578899999 Q ss_pred HHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEE Q lcl|NC_019524. 84 AVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLAT 163 (556) Q Consensus 84 ~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~ 163 (556) +.+.+.|-..-|++.-+.+. |. +.+ .. ..+...+..+|+ ..+|.+++-+..+..++..|++|+. 
T Consensus 57 ~~ia~~ia~l~~~~~~~~~~---~~--~~~--~~---~~l~~lL~~~PN------~~~t~~~f~~~~~~~lll~Gnay~~ 120 (429) T protein:vir:10 57 KILSESVSKLPLKIYQEDEY---GI--QRG--TK---HYLNNLLRLRPN------PYMSSMNFFGSLEAQKNLYGNSYAN 120 (429) T ss_pred HHHHHhhccCceEEEEecCC---ce--eec--cc---cHHHHHHHhhcc------CCCCHHHHHHHHHHHHhhcCCeEEE Confidence 99999998766665332211 00 000 00 112333334554 3568888999999999999999999 Q ss_pred EeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceeeccc Q lcl|NC_019524. 164 CEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPARF 243 (556) Q Consensus 164 ~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~~~ 243 (556) +.+... ..+..|..|+|+.+..-.+.. |+. ..+..+.|.+ .. .| ... T Consensus 121 i~r~~~--------G~~~~L~~i~~~~v~v~~~~~-------~~~--~~~~~~~~~~-~~-~g--------------~~~ 167 (429) T protein:vir:10 121 IEFDRK--------GKVQALWPIDASKVTVYIDDV-------GLL--NSKTKMWYVV-NT-GG--------------QQR 167 (429) T ss_pred EEECCC--------CcEEEEEEEcCceeEEEEcCc-------ccc--cccceEEEEE-cc-CC--------------eEE Confidence 865321 135788999998885211100 000 0111222322 11 11 112 Q ss_pred cCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccc Q lcl|NC_019524. 244 DWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIF 323 (556) Q Consensus 244 ~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~ 323 (556) .+++++|||+......++..|+|++..+...+.......++.....+-.+...++++.+..-. .+..... 
T Consensus 168 ~~~~~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~l~----------~e~~~~~ 237 (429) T protein:vir:10 168 VLKPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGDLN----------EDAKKVF 237 (429) T ss_pred EEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCC----------HHHHHHH Confidence 467789999988888899999999999999888888888888888888888889988653210 0111111 Q ss_pred cccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccchhHH Q lcl|NC_019524. 324 NEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSAR 403 (556) Q Consensus 324 ~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R 403 (556) ...... ...+ .-..|.+..|+.|.+++.++.+.....|.+..+.....||..+|||...| ++.++.|||++. T Consensus 238 ~~~~~~---~~~g----~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~l-g~~~~~~~sn~e 309 (429) T protein:vir:10 238 RENFES---MSSG----LQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQL-NDLSKATLNNIE 309 (429) T ss_pred HHHHHH---Hhcc----ccccCceeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHh-CCCCCCCcccHH Confidence 110000 0000 01356777899999999998777778888888999999999999999866 788889999876 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHH Q lcl|NC_019524. 404 ASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEA 483 (556) Q Consensus 404 ~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A 483 (556) +....| +...+.|+... ++.++....++-...... 
..+++..-.....|+...+++ T Consensus 310 ~~~~~f-----------~~~~l~P~~~~-ie~~ln~kl~~~~~~~~g------------~~~~fd~~~ll~~d~~~~~~~ 365 (429) T protein:vir:10 310 QQQQQF-----------YTDTLQATLTM-YEQEMTYKLFLDSELDKG------------FYSKFNVDAILRADIKTRYEA 365 (429) T ss_pred HHHHHH-----------HHHHHHHHHHH-HHHHHHHhhcChhhcCCC------------cEEEeechhhhcCCHHHHHHH Confidence 655444 45555664443 444444443321111000 012222333344699999999 Q ss_pred HHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 484 AILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 484 ~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 556 (556) ..+++.+|+.|+-|+-+..|+.|.+-. +++=++....+....... ..+..+++++...+.+| T Consensus 366 ~~~~~~~G~~T~NE~R~~~gl~p~~gg----------D~~~~~~n~~~~d~~~~~-~~k~g~~~~~~~~~~~e 427 (429) T protein:vir:10 366 YRTGIQGGFLKPNEARSKEDLPPEAGG----------DRLLVNGNMLPIDMAGQA-YLKGGDTNGEVSKEGNE 427 (429) T ss_pred HHHHHhCCCcCHHHHHHHhCCCCCCCc----------Ceeeecccccchhhcccc-ccCCCCCCCCCCCCCCC Confidence 999999999999999888898874311 111111111111111111 11111122222222222 No 25 >protein:vir:1266 Length: 416 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690758;genbank:gi:22854998;genbank:GeneID:955213 Probab=99.73 E-value=2.2e-18 Score=117.41 Aligned_cols=415 Identities=10% Similarity=-0.005 Sum_probs=233.5 Q ss_pred chhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHH Q lcl|NC_019524. 4 VKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVV 83 (556) Q Consensus 4 ~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v 83 (556) |- ..|..-++... ..+.. .......+|.....+. .. 
+.-+.+-+..++.+..+| T Consensus 1 m~-~~~~f~~~~~~--------------~~~~~--~~~~~~~~~~~~~~~~------~~---~~v~~~~al~~~~v~~~i 54 (416) T protein:vir:12 1 ML-LERMFEKRSGS--------------SDHED--GFNNILLNMFGGRKTA------SG---ERVSESNSLVQPDIFACV 54 (416) T ss_pred Cc-cchhcccccCc--------------cccCc--cchhHHHHhhcCcccc------cC---ceechhhhhccHHHHHHH Confidence 00 00000000000 00000 0000000110000000 00 001112233468889999 Q ss_pred HHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEE Q lcl|NC_019524. 84 AVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLAT 163 (556) Q Consensus 84 ~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~ 163 (556) +.+.+.|-.-.|++.-+.+. |. +. ..-...+..+..+|+ ..+|..++....+..++..|++|+. T Consensus 55 ~~Ia~~ia~l~~~~~~~~~~---~~--~~-----~~~~~l~~~l~~~PN------~~~t~~~f~~~~v~~lll~Gna~~~ 118 (416) T protein:vir:12 55 NVLSDDIAKLPIHTYKRTDG---GI--ER-----KPEHKSAHAVYARPN------PYMTAFTWKKLMMTHVLTWGNAYSY 118 (416) T ss_pred HHHHHhhhhCceEEEEecCC---cc--cc-----ccccHHHHHHHhhcc------cCCCHHHHHHHHHHHHhhcCCeEEE Confidence 99999999877776433221 00 00 000123344444554 3578999999999999999999998 Q ss_pred EeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceeeccc Q lcl|NC_019524. 164 CEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPARF 243 (556) Q Consensus 164 ~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~~~ 243 (556) +.+... ..+..|..|+|+.+. |..+.++..+.|.+... | . .. T Consensus 119 i~r~~~--------G~~~~L~~l~~~~v~--------------v~~~~~~~~~~~~~~~~--g-------~-------~~ 160 (416) T protein:vir:12 119 IQFGSH--------GYPEALFPLRPDYTN--------------AYVHPTTGMLWYQTVLN--G-------K-------AI 160 (416) T ss_pred EEECCC--------CcEEEEEEECCcceE--------------EEEeCCCcEEEEEEecC--C-------e-------EE Confidence 765321 245678889988873 33456666777766421 1 0 12 Q ss_pred cCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccc Q lcl|NC_019524. 
244 DWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIF 323 (556) Q Consensus 244 ~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~ 323 (556) .+++.+|||+... .++...|+|++..+...+.-.....+....-.+-.+...++|+.+..- .++..+.. T Consensus 161 ~~~~~eiih~~~~-~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~----------~~e~~~~~ 229 (416) T protein:vir:12 161 ELYDYEVLHFKGL-STDGIHGKSPIGVVREHIGAQAAATKYNAKLYKNEATPRGILKVPAFL----------DEKPKENV 229 (416) T ss_pred EecCccEEEecCc-CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCCceEEecCCCC----------CHHHHHHH Confidence 4677899999765 567799999999888888777777777666667778888888864321 01111111 Q ss_pred cccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccchhHH Q lcl|NC_019524. 324 NEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSAR 403 (556) Q Consensus 324 ~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R 403 (556) ... ..+ ....|.+..|+.|.+++.++.+.-...|.+..+.....||..+|||.+.| ++..+.|||++. T Consensus 230 ~~~-------~~~----~~~~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~t~sn~e 297 (416) T protein:vir:12 230 RKE-------WKR----VNKVENIAIIDYGLEYQSISMPLQEAQFVESMKFNKAQISMIYKVPLHKL-NELDKATFSNIE 297 (416) T ss_pred HHH-------HHH----HhcCCCeeecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHh-CCccCCCcccHH Confidence 111 100 12356788899999999998877777888999999999999999999855 788889999987 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHH Q lcl|NC_019524. 404 ASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEA 483 (556) Q Consensus 404 ~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A 483 (556) +....|. ...+.|+...+ +.++-...++-..... 
-..++|..-.....|+...+++ T Consensus 298 ~~~~~f~-----------~~~l~P~~~~i-e~~l~~~l~~~~~~~~------------g~~i~fd~~~l~~~d~~~~~~~ 353 (416) T protein:vir:12 298 HQSIEYV-----------RNTLQPWIVNF-EQELNVKLFLDHDQKS------------GHYVKFNIDSELRGDSKTQAEY 353 (416) T ss_pred HHHHHHH-----------HHHHHHHHHHH-HHHHHHhhcCchhhcC------------CceEEeechhhhccCHHHHHHH Confidence 7666553 44556655543 3334443332221100 0112333333444699999999 Q ss_pred HHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 484 AILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 484 ~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 556 (556) ..+.+++|+.|+-|+-+..|..|-+--++ +-++....+................+.++ +.++ T Consensus 354 ~~~~~~~G~~T~NE~R~~~gl~Pi~ggd~----------~~~~~n~~~~~~~~~~~~~~~~~~~~gge-~~~~ 415 (416) T protein:vir:12 354 LKTLHETGVLNKDEIRELLERNPIENGDK----------YISSLNYVFLDFLEEYQRLKAGGAMKGGD-NKNE 415 (416) T ss_pred HHHHHhCCCcCHHHHHHHhCCCCCCCcce----------eeeccccccccccchhhccccccccCCCC-CcCC Confidence 99999999999999999999988642221 11111111111111111111111111111 1111 No 26 >protein:vir:6240 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813694;swissprot:trembl:q859c3;genbank:gi:29366754;interpro:IPR006427;interpro:IPR006944;uniprot:Q859C3;genbank:GeneID:1258894 Probab=99.72 E-value=1.1e-17 Score=113.57 Aligned_cols=438 Identities=12% Similarity=0.045 Sum_probs=228.5 Q ss_pred chhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHH Q lcl|NC_019524. 4 VKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVV 83 (556) Q Consensus 4 ~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v 83 (556) |.=..+.+-+.. ......+ ....|.+.....-....... 
.-..-+-+-+..++.+..+| T Consensus 1 Mg~~~~l~~~~~---------------~~~~~~~-----~~~~~~~~~~~~~~~~~~~~-~g~~v~~~~al~~~~v~~~i 59 (457) T protein:vir:62 1 MGFWSALFGRGH---------------SPALDAA-----EGRAWEPYDPSIYNLGATAS-SGERVTPHDALQVSAVFASV 59 (457) T ss_pred Cchhhhhhcccc---------------ccccccc-----cccccccchhhhhhcccccc-CCceechHHhhccHHHHHHH Confidence 222221111000 0000000 01112111100000000000 00011123345578889999 Q ss_pred HHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEE Q lcl|NC_019524. 84 AVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLAT 163 (556) Q Consensus 84 ~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~ 163 (556) +.|.+.|-+--|++.-+-+. ..+ ..-...+......++ ..+|.+++.+..+..++..|++|+. T Consensus 60 ~~ia~~iA~lp~~~~~~~~~-----~~~------~~~~~~~~~ll~~pn------~~~t~~~f~~~~~~~l~l~Gna~~~ 122 (457) T protein:vir:62 60 RLLSETIATLPLSTYSKRGG-----TRK------EIDTPEWLDFPNAEP------GGMGRIDILSQTVLSLLLQGNAFLA 122 (457) T ss_pred HHHHHhHhhCceEEEEecCC-----ccc------cccchHHHHhccccC------CCCCHHHHHHHHHHHHhhcCCeEEE Confidence 99999998877776533211 000 011122333333432 2478999999999999999999988 Q ss_pred EeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCC--CceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceeec Q lcl|NC_019524. 164 CEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMD--TPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPA 241 (556) Q Consensus 164 ~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~--g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~ 241 (556) +++.. + -+..|..|.|+++.......+ ...+.....++..|... . T Consensus 123 i~~~~-------g--~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~y~~~~~g~~~------------------------~ 169 (457) T protein:vir:62 123 VRWAG-------P--NIAGLDVLDPTKIHVHMVMVDGLRRKVFEAYDIDADGNEV------------------------L 169 (457) T ss_pred EEeCC-------C--cEEEEEEEcCcceEEEEeccCCccceeEEEEEEccCCcee------------------------E Confidence 75432 1 245788899888853221111 11111112222222111 0 Q ss_pred cccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccc Q lcl|NC_019524. 
242 RFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKE 321 (556) Q Consensus 242 ~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~ 321 (556) ...+++++|||+..+...|...|+|++..+...+.-...-.+....-.+=.+...++|+.+..- ..+... T Consensus 170 ~~~~~~~eiih~r~~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l----------s~e~~~ 239 (457) T protein:vir:62 170 LGWFTPRDVLHIPGMMLPGDFVGCSPISYARESIGLALAAQKYGAHFFRNGAMPGAVVEVPGTM----------SEEGLA 239 (457) T ss_pred EEeeCccceEEecCCCCCCceecccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEcCCCC----------CHHHHH Confidence 1235677999999888888899999998888777766666666666666677788888865321 011111 Q ss_pred cccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccc-- Q lcl|NC_019524. 322 IFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNY-- 399 (556) Q Consensus 322 ~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nY-- 399 (556) ........ ...+ .=..|.+..|..|.+++.++.+.....|.+..+.....||+.+|||-..| |+.++++| T Consensus 240 ~~~~~~~~---~~~G----~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~~~~~ 311 (457) T protein:vir:62 240 RAREAWRA---ANSG----VDNAHRVALLTEGAKFSKVAMSPDEAQFLQTRQFQVPEIARIFGVPPHLI-SDATNSTSWG 311 (457) T ss_pred HHHHHHHH---HhcC----ccccCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHc-CCCCCccccc Confidence 11111000 0000 01357788899999999998776677899999999999999999998755 89988887 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhh Q lcl|NC_019524. 400 SSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKK 479 (556) Q Consensus 400 Ss~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~K 479 (556) |++.+..+. |+..-+.|+.+. ++.++....++-... . ...++|-.-.....|+.. 
T Consensus 312 sn~eq~~~~-----------f~~~~l~P~~~~-ie~~ln~~L~~~~~~-~------------~~~i~fd~~~l~~~d~~~ 366 (457) T protein:vir:62 312 SGLAEQNIA-----------FTMFSLRPWLER-IEAGFNRLLFAETAD-R------------FRFVKFNLDEIKRGAPKE 366 (457) T ss_pred chHHHHHHH-----------HHHHHHHHHHHH-HHHHHHhhhcCcccc-C------------ceEEEeechhhhccCHHH Confidence 434333333 344445776654 444555544432110 0 011233223333459999 Q ss_pred hhHHHHHHHHcCCCCHHHHHHHhCCCHHHHH--HHHHHHHHHHHHcCCCCCcc----ccccCC----CCCCCCCCCCCCC Q lcl|NC_019524. 480 ETEAAILRIKNGLSTYEAEISRLGGDFREVF--KQRAREEGLIKSLKLDFTGK----MVEGNS----TQSSNSSESTSDN 549 (556) Q Consensus 480 e~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~--~q~a~E~~~~~~~Gl~~~~~----~~~~~~----~~~~~~~~~~~~~ 549 (556) .+++..+.+++|+.|+-|+-+..|..|-+-- +++-.-.. +-..|-..... +....+ ....+.+++.+.+ T Consensus 367 r~~~~~~~~~~G~~T~NE~R~~~gl~pi~~g~~D~~~~~~n-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 445 (457) T protein:vir:62 367 RMELWSLGLQNGIYSIDEVRAAEDMTPLPDGLGEKYRVPLN-LGEIGEEPEPEPAPAPPAIDPPAEEPADDEEPDNAEGD 445 (457) T ss_pred HHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeeccc-cccccccccccccCCCccCCCCccCCCCCCCCCCCCCC Confidence 9999999999999999999999998774311 11100000 00011000000 000000 0111111222333 Q ss_pred CCCcCCC Q lcl|NC_019524. 550 PNEETTQ 556 (556) Q Consensus 550 ~~~e~~~ 556 (556) |+++.+| T Consensus 446 ~d~~~~~ 452 (457) T protein:vir:62 446 PDEGETE 452 (457) T ss_pred Ccccccc Confidence 4443333 No 27 >protein:vir:107605 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338186;genbank:gi:77020175;genbank:GeneID:3703736 Probab=99.72 E-value=2.1e-17 Score=112.07 Aligned_cols=430 Identities=11% Similarity=0.058 Sum_probs=235.9 Q ss_pred chhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHH Q lcl|NC_019524. 
4 VKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVV 83 (556) Q Consensus 4 ~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v 83 (556) |.=..|.+..-.. ..+... .... -+.....+..|.....+ ...-+..-+..++.+.++| T Consensus 1 M~~~~r~~~~~~~---~~r~~~------~~~~-~~~~~~~~~~~~g~~~~-----------~~~v~~~~al~~~~v~~~i 59 (432) T protein:vir:10 1 MKIVDSVKKFFNF---EKRQTS------QVIE-LNKDDEKLLEWLGISPS-----------TISVKGKNALKVATVFACI 59 (432) T ss_pred CChHHHHHHhcCc---cccCcc------cccc-cCCchHHHHHHhCCCcC-----------ccccchhhhhccHHHHHHH Confidence 5544444221100 000000 0000 00111122222211110 0001222344578899999 Q ss_pred HHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEE Q lcl|NC_019524. 84 AVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLAT 163 (556) Q Consensus 84 ~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~ 163 (556) +.+.+.|-..-|++.-+.+... +.+ ... .+...+..+|+ ..+|..++....+..++..|++|+. T Consensus 60 ~~ia~~ia~lp~~~~~~~~~~~-----~~~--~~~---~l~~lL~~~PN------~~~t~~~f~~~~~~~lll~Gnay~~ 123 (432) T protein:vir:10 60 KILSESVSKLPLKIYQEDEYGI-----QRG--TKH---YLNNLLRLRPN------PYMSSMNFFGSLEAQKNLYGNSYAN 123 (432) T ss_pred HHHHHhhccCceEEEEecCCce-----eec--ccc---HHHHHHHhhcc------CCCCHHHHHHHHHHHHhhcCCeEEE Confidence 9999999887776643321100 000 001 11222223454 3568889999999999999999999 Q ss_pred EeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceeeccc Q lcl|NC_019524. 164 CEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPARF 243 (556) Q Consensus 164 ~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~~~ 243 (556) +.+... ..+..|..|.|+.|..-.+.. |+. ..+..+.| +... .| ... 
T Consensus 124 i~r~~~--------G~~~~L~~i~~~~v~v~~d~~-------~~~--~~~~~~~y-~~~~-~g--------------~~~ 170 (432) T protein:vir:10 124 IEFDRK--------GKVQALWPIDASKVTVYIDDV-------GLL--NSKTKMWY-VVNT-GG--------------QQR 170 (432) T ss_pred EEECCC--------CcEEEEEEEcCceeEEEEcCc-------ccc--cccceEEE-EEec-CC--------------eEE Confidence 875321 135688889988875211100 111 01112223 3221 11 112 Q ss_pred cCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccc Q lcl|NC_019524. 244 DWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIF 323 (556) Q Consensus 244 ~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~ 323 (556) .+++++|||+......+..+|+|++..+...+.......++.....+-.+...++|+.+..-. .+..... T Consensus 171 ~~~~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~----------~e~~~~~ 240 (432) T protein:vir:10 171 VLKPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGDLN----------EDAKKVF 240 (432) T ss_pred EEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCC----------HHHHHHH Confidence 467889999987777889999999999998888888888888888888888889988653211 0000000 Q ss_pred cccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccchhHH Q lcl|NC_019524. 324 NEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSAR 403 (556) Q Consensus 324 ~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R 403 (556) ..... ....+ .-..|.+..|+.|.+++.++.+.....|.+..+.....||..+|||.+.| ++..+.|||++. 
T Consensus 241 ~~~~~---~~~~g----~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~l-g~~~~~~~s~~e 312 (432) T protein:vir:10 241 RENFE---SMSSG----LQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQL-NDLSKATLNNIE 312 (432) T ss_pred HHHHH---HHhcc----cccCCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHh-CCCCCCCcccHH Confidence 00000 00000 01246677899999999999887788889999999999999999999866 788889999976 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHH Q lcl|NC_019524. 404 ASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEA 483 (556) Q Consensus 404 ~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A 483 (556) +....+ +...++|+... ++.++....++-..... -..+++..-.....|+...+++ T Consensus 313 ~~~~~~-----------~~~~l~P~~~~-ie~~ln~kLl~~~~~~~------------g~~~~fd~~~l~~~d~~~~~~~ 368 (432) T protein:vir:10 313 QQQQQF-----------YTDTLQATLTM-YEQEMTYKLFLDSELDK------------GFYSKFNVDAILRADIKTRYEA 368 (432) T ss_pred HHHHHH-----------HHHHHHHHHHH-HHHHHHHhhcChhhcCC------------CcEEEeechhhhcCCHHHHHHH Confidence 655444 34455564443 34444443332111100 0012333333445699999999 Q ss_pred HHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 484 AILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 484 ~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 556 (556) ..+.+.+|+.|+-|+-+..|..|-+- .+++=++....+......... 
+..+++++...+.+| T Consensus 369 ~~~~~~~G~~t~NE~R~~~g~~pi~g----------gD~~~~~~n~~~~~~~~~~~~-k~~~~~~~~~~~~~~ 430 (432) T protein:vir:10 369 YRTGIQGGFLKPNEARSKEDLPPEAG----------GDRLLVNGNMLPIDMAGQAYL-KGGDTNGEVSKEGNE 430 (432) T ss_pred HHHHHhCCCcCHHHHHHHhCCCCCCC----------CCeEeecccccchhhcccccc-CCCCCCCCCCCCCCC Confidence 99999999999999998889887431 111111111111111111111 111111112222222 No 28 >protein:vir:105002 Length: 432 # NCBI annotation: putative phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459967;genbank:gi:85701382;genbank:GeneID:3882143 Probab=99.72 E-value=2.1e-17 Score=112.07 Aligned_cols=430 Identities=11% Similarity=0.058 Sum_probs=235.9 Q ss_pred chhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHH Q lcl|NC_019524. 4 VKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVV 83 (556) Q Consensus 4 ~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v 83 (556) |.=..|.+..-.. ..+... .... -+.....+..|.....+ ...-+..-+..++.+.++| T Consensus 1 M~~~~r~~~~~~~---~~r~~~------~~~~-~~~~~~~~~~~~g~~~~-----------~~~v~~~~al~~~~v~~~i 59 (432) T protein:vir:10 1 MKIVDSVKKFFNF---EKRQTS------QVIE-LNKDDEKLLEWLGISPS-----------TISVKGKNALKVATVFACI 59 (432) T ss_pred CChHHHHHHhcCc---cccCcc------cccc-cCCchHHHHHHhCCCcC-----------ccccchhhhhccHHHHHHH Confidence 5544444221100 000000 0000 00111122222211110 0001222344578899999 Q ss_pred HHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEE Q lcl|NC_019524. 84 AVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLAT 163 (556) Q Consensus 84 ~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~ 163 (556) +.+.+.|-..-|++.-+.+... +.+ ... .+...+..+|+ ..+|..++....+..++..|++|+. 
T Consensus 60 ~~ia~~ia~lp~~~~~~~~~~~-----~~~--~~~---~l~~lL~~~PN------~~~t~~~f~~~~~~~lll~Gnay~~ 123 (432) T protein:vir:10 60 KILSESVSKLPLKIYQEDEYGI-----QRG--TKH---YLNNLLRLRPN------PYMSSMNFFGSLEAQKNLYGNSYAN 123 (432) T ss_pred HHHHHhhccCceEEEEecCCce-----eec--ccc---HHHHHHHhhcc------CCCCHHHHHHHHHHHHhhcCCeEEE Confidence 9999999887776643321100 000 001 11222223454 3568889999999999999999999 Q ss_pred EeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceeeccc Q lcl|NC_019524. 164 CEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPARF 243 (556) Q Consensus 164 ~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~~~ 243 (556) +.+... ..+..|..|.|+.|..-.+.. |+. ..+..+.| +... .| ... T Consensus 124 i~r~~~--------G~~~~L~~i~~~~v~v~~d~~-------~~~--~~~~~~~y-~~~~-~g--------------~~~ 170 (432) T protein:vir:10 124 IEFDRK--------GKVQALWPIDASKVTVYIDDV-------GLL--NSKTKMWY-VVNT-GG--------------QQR 170 (432) T ss_pred EEECCC--------CcEEEEEEEcCceeEEEEcCc-------ccc--cccceEEE-EEec-CC--------------eEE Confidence 875321 135688889988875211100 111 01112223 3221 11 112 Q ss_pred cCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccc Q lcl|NC_019524. 244 DWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIF 323 (556) Q Consensus 244 ~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~ 323 (556) .+++++|||+......+..+|+|++..+...+.......++.....+-.+...++|+.+..-. .+..... 
T Consensus 171 ~~~~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~----------~e~~~~~ 240 (432) T protein:vir:10 171 VLKPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGDLN----------EDAKKVF 240 (432) T ss_pred EEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCC----------HHHHHHH Confidence 467889999987777889999999999998888888888888888888888889988653211 0000000 Q ss_pred cccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccchhHH Q lcl|NC_019524. 324 NEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSAR 403 (556) Q Consensus 324 ~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R 403 (556) ..... ....+ .-..|.+..|+.|.+++.++.+.....|.+..+.....||..+|||.+.| ++..+.|||++. T Consensus 241 ~~~~~---~~~~g----~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~l-g~~~~~~~s~~e 312 (432) T protein:vir:10 241 RENFE---SMSSG----LQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQL-NDLSKATLNNIE 312 (432) T ss_pred HHHHH---HHhcc----cccCCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHh-CCCCCCCcccHH Confidence 00000 00000 01246677899999999999887788889999999999999999999866 788889999976 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHH Q lcl|NC_019524. 404 ASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEA 483 (556) Q Consensus 404 ~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A 483 (556) +....+ +...++|+... ++.++....++-..... 
-..+++..-.....|+...+++ T Consensus 313 ~~~~~~-----------~~~~l~P~~~~-ie~~ln~kLl~~~~~~~------------g~~~~fd~~~l~~~d~~~~~~~ 368 (432) T protein:vir:10 313 QQQQQF-----------YTDTLQATLTM-YEQEMTYKLFLDSELDK------------GFYSKFNVDAILRADIKTRYEA 368 (432) T ss_pred HHHHHH-----------HHHHHHHHHHH-HHHHHHHhhcChhhcCC------------CcEEEeechhhhcCCHHHHHHH Confidence 655444 34455564443 34444443332111100 0012333333445699999999 Q ss_pred HHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 484 AILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 484 ~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 556 (556) ..+.+.+|+.|+-|+-+..|..|-+- .+++=++....+......... +..+++++...+.+| T Consensus 369 ~~~~~~~G~~t~NE~R~~~g~~pi~g----------gD~~~~~~n~~~~~~~~~~~~-k~~~~~~~~~~~~~~ 430 (432) T protein:vir:10 369 YRTGIQGGFLKPNEARSKEDLPPEAG----------GDRLLVNGNMLPIDMAGQAYL-KGGDTNGEVSKEGNE 430 (432) T ss_pred HHHHHhCCCcCHHHHHHHhCCCCCCC----------CCeEeecccccchhhcccccc-CCCCCCCCCCCCCCC Confidence 99999999999999998889887431 111111111111111111111 111111112222222 No 29 >protein:vir:102855 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338135;genbank:gi:77020228;genbank:GeneID:3703764 Probab=99.72 E-value=2.1e-17 Score=112.07 Aligned_cols=430 Identities=11% Similarity=0.058 Sum_probs=235.9 Q ss_pred chhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHH Q lcl|NC_019524. 4 VKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVV 83 (556) Q Consensus 4 ~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v 83 (556) |.=..|.+..-.. ..+... .... 
-+.....+..|.....+ ...-+..-+..++.+.++| T Consensus 1 M~~~~r~~~~~~~---~~r~~~------~~~~-~~~~~~~~~~~~g~~~~-----------~~~v~~~~al~~~~v~~~i 59 (432) T protein:vir:10 1 MKIVDSVKKFFNF---EKRQTS------QVIE-LNKDDEKLLEWLGISPS-----------TISVKGKNALKVATVFACI 59 (432) T ss_pred CChHHHHHHhcCc---cccCcc------cccc-cCCchHHHHHHhCCCcC-----------ccccchhhhhccHHHHHHH Confidence 5544444221100 000000 0000 00111122222211110 0001222344578899999 Q ss_pred HHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEE Q lcl|NC_019524. 84 AVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLAT 163 (556) Q Consensus 84 ~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~ 163 (556) +.+.+.|-..-|++.-+.+... +.+ ... .+...+..+|+ ..+|..++....+..++..|++|+. T Consensus 60 ~~ia~~ia~lp~~~~~~~~~~~-----~~~--~~~---~l~~lL~~~PN------~~~t~~~f~~~~~~~lll~Gnay~~ 123 (432) T protein:vir:10 60 KILSESVSKLPLKIYQEDEYGI-----QRG--TKH---YLNNLLRLRPN------PYMSSMNFFGSLEAQKNLYGNSYAN 123 (432) T ss_pred HHHHHhhccCceEEEEecCCce-----eec--ccc---HHHHHHHhhcc------CCCCHHHHHHHHHHHHhhcCCeEEE Confidence 9999999887776643321100 000 001 11222223454 3568889999999999999999999 Q ss_pred EeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceeeccc Q lcl|NC_019524. 164 CEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPARF 243 (556) Q Consensus 164 ~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~~~ 243 (556) +.+... ..+..|..|.|+.|..-.+.. |+. ..+..+.| +... .| ... 
T Consensus 124 i~r~~~--------G~~~~L~~i~~~~v~v~~d~~-------~~~--~~~~~~~y-~~~~-~g--------------~~~ 170 (432) T protein:vir:10 124 IEFDRK--------GKVQALWPIDASKVTVYIDDV-------GLL--NSKTKMWY-VVNT-GG--------------QQR 170 (432) T ss_pred EEECCC--------CcEEEEEEEcCceeEEEEcCc-------ccc--cccceEEE-EEec-CC--------------eEE Confidence 875321 135688889988875211100 111 01112223 3221 11 112 Q ss_pred cCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccc Q lcl|NC_019524. 244 DWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIF 323 (556) Q Consensus 244 ~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~ 323 (556) .+++++|||+......+..+|+|++..+...+.......++.....+-.+...++|+.+..-. .+..... T Consensus 171 ~~~~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~----------~e~~~~~ 240 (432) T protein:vir:10 171 VLKPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGDLN----------EDAKKVF 240 (432) T ss_pred EEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCC----------HHHHHHH Confidence 467889999987777889999999999998888888888888888888888889988653211 0000000 Q ss_pred cccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccchhHH Q lcl|NC_019524. 324 NEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSAR 403 (556) Q Consensus 324 ~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R 403 (556) ..... ....+ .-..|.+..|+.|.+++.++.+.....|.+..+.....||..+|||.+.| ++..+.|||++. 
T Consensus 241 ~~~~~---~~~~g----~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~l-g~~~~~~~s~~e 312 (432) T protein:vir:10 241 RENFE---SMSSG----LQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQL-NDLSKATLNNIE 312 (432) T ss_pred HHHHH---HHhcc----cccCCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHh-CCCCCCCcccHH Confidence 00000 00000 01246677899999999999887788889999999999999999999866 788889999976 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHH Q lcl|NC_019524. 404 ASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEA 483 (556) Q Consensus 404 ~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A 483 (556) +....+ +...++|+... ++.++....++-..... -..+++..-.....|+...+++ T Consensus 313 ~~~~~~-----------~~~~l~P~~~~-ie~~ln~kLl~~~~~~~------------g~~~~fd~~~l~~~d~~~~~~~ 368 (432) T protein:vir:10 313 QQQQQF-----------YTDTLQATLTM-YEQEMTYKLFLDSELDK------------GFYSKFNVDAILRADIKTRYEA 368 (432) T ss_pred HHHHHH-----------HHHHHHHHHHH-HHHHHHHhhcChhhcCC------------CcEEEeechhhhcCCHHHHHHH Confidence 655444 34455564443 34444443332111100 0012333333445699999999 Q ss_pred HHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 484 AILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 484 ~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 556 (556) ..+.+.+|+.|+-|+-+..|..|-+- .+++=++....+......... 
+..+++++...+.+| T Consensus 369 ~~~~~~~G~~t~NE~R~~~g~~pi~g----------gD~~~~~~n~~~~~~~~~~~~-k~~~~~~~~~~~~~~ 430 (432) T protein:vir:10 369 YRTGIQGGFLKPNEARSKEDLPPEAG----------GDRLLVNGNMLPIDMAGQAYL-KGGDTNGEVSKEGNE 430 (432) T ss_pred HHHHHhCCCcCHHHHHHHhCCCCCCC----------CCeEeecccccchhhcccccc-CCCCCCCCCCCCCCC Confidence 99999999999999998889887431 111111111111111111111 111111112222222 No 30 >protein:vir:4854 Length: 386 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049394;genbank:gi:9632422;genbank:GeneID:1258515 Probab=99.72 E-value=2.5e-17 Score=111.64 Aligned_cols=386 Identities=9% Similarity=-0.004 Sum_probs=220.0 Q ss_pred chhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHH Q lcl|NC_019524. 4 VKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVV 83 (556) Q Consensus 4 ~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v 83 (556) |+=..+....++ ++. .....+ ..-....|.....+ .. .-+-..+..+|.+.++| T Consensus 1 M~~f~~~~~~~~----~~~------~~~~~~-----~~~~~~~~~~~~~~--------~~---~v~~~~~~~~~~v~~~i 54 (386) T protein:vir:48 1 MPIFNITNLATE----SPP------ISQGGF-----FDITDPDFLSTLNG--------SE---WVSAESALRNSDLFSII 54 (386) T ss_pred Cccccccccccc----ccc------cccccc-----cccccchhcccccC--------Cc---eechhhhhcchHHHHHH Confidence 222221111000 000 000000 00000001000000 00 01112234678899999 Q ss_pred HHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEE Q lcl|NC_019524. 84 AVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLAT 163 (556) Q Consensus 84 ~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~ 163 (556) +.+.+.|-+.-|++.-. .. ..+-.+++ -.+|..++.+..+..++..|++|+. 
T Consensus 55 ~~ia~~ia~~p~~~~~~-----------~~-----------~~l~~~pN------~~~t~~~f~~~~~~~lll~Gna~~~ 106 (386) T protein:vir:48 55 NQLSNDLATVKLTASRK-----------QL-----------QGIIDNPS------NNANRFNFYQSIFAQMLLGGEAFAY 106 (386) T ss_pred HHHHHhhccCceeeccc-----------hh-----------HHHhhcCC------CCCCHHHHHHHHHHHhhhcCcEEEE Confidence 99999998865544311 11 11222443 2478999999999999999999998 Q ss_pred EeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceeeccc Q lcl|NC_019524. 164 CEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPARF 243 (556) Q Consensus 164 ~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~~~ 243 (556) +.+.. ...+..|..|.|+.+. |+.+.+|.++.|++....+... ... T Consensus 107 i~r~~--------~g~~~~L~~l~~~~v~--------------v~~~~~~~~~~y~~~~~~~~~~------------~~~ 152 (386) T protein:vir:48 107 RWRNE--------NGRDMKWEYLRPSQVS--------------FNRLDNKDGIYYNITFDDPRIP------------PKQ 152 (386) T ss_pred EEECC--------CCcEEEEEEecCceeE--------------EEEcCCCceEEEEEEecCcccc------------cee Confidence 76532 1246788999988883 4566788889999865543211 123 Q ss_pred cCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccc Q lcl|NC_019524. 244 DWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIF 323 (556) Q Consensus 244 ~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~ 323 (556) .+++.+|||+......++..|+|++..+...+.......++...-.+=.+...++|+.+...... ..... T Consensus 153 ~~~~~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~ii~~~~~~~~e----------~~~~~ 222 (386) T protein:vir:48 153 HVPQGDVLHFKLLSVDGGLTSVSPLMALSRELNIQKASDKLTLNSLKNALNANGILKIKGGGLLD----------FKTKL 222 (386) T ss_pred EecCccEEEecCCCCCCceeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCHH----------HHHHH Confidence 46788999999888888899999999999888888878877777777788888999875432211 00000 Q ss_pred cccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccchhHH Q lcl|NC_019524. 
324 NEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSAR 403 (556) Q Consensus 324 ~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R 403 (556) ... .... .=..|.+..|..|.+++.++.+....+|.+..+.....||+.+|||-..| ++ ..+||++. T Consensus 223 ~~~------~~~~----~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~--~~~~~~~e 289 (386) T protein:vir:48 223 SRS------RQAM----KQMQGGPLVLDDLEEFTPLEIKSNVSQLLKQADWTTGQFAKVYGIPENVV-GG--QGDQQSSL 289 (386) T ss_pred HHH------HHHh----hcCCCCceecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHh-CC--CCCcccHH Confidence 000 0000 11356778899999999999777777899999999999999999998866 54 34777765 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHH Q lcl|NC_019524. 404 ASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEA 483 (556) Q Consensus 404 ~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A 483 (556) +..+.+ +...+.|+.+. ++.++....++- .. + ... ...--|+..-... T Consensus 290 ~~~~~~-----------~~~~l~P~~~~-ie~~l~~~l~~~---~~---~------------~~~--~~~~~d~~~~~~~ 337 (386) T protein:vir:48 290 EMSLDL-----------YNKAVSRYLRP-FLSELSQKLSCD---VD---A------------DIL--PAVDPTGSNSVSR 337 (386) T ss_pred HHHHHH-----------HHHHHHHHHHH-HHHHHHHhhcch---hh---c------------chh--hhhccChHHHHHH Confidence 554443 34445565444 333443332210 00 0 000 0001255554555 Q ss_pred HHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCc Q lcl|NC_019524. 
484 AILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEE 553 (556) Q Consensus 484 ~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e 553 (556) ....+++|++|+-|+....|..+-+-- |......++.+ +.+ .+|+.++| T Consensus 338 ~~~l~~~g~~t~nE~r~~lg~~~~~~~-----~~~~~~~~~~~--------------~~~--gGd~~~~~ 386 (386) T protein:vir:48 338 INSMVKSGTLAQNQGLYILQQAEILPK-----ELPEGENPNKT--------------TLK--GGEINGED 386 (386) T ss_pred HHHHHhCCCcCHHHHHHHhhcCCCCCc-----cchhhcCCCCC--------------ccC--CCCCCCCC Confidence 567899999999998887776652110 00011111100 000 11111111 No 31 >protein:vir:4454 Length: 414 # NCBI annotation: Portal Protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700377;genbank:gi:23505449;genbank:GeneID:955656 Probab=99.71 E-value=4.6e-17 Score=110.16 Aligned_cols=414 Identities=12% Similarity=0.040 Sum_probs=229.2 Q ss_pred chhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHH Q lcl|NC_019524. 4 VKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVV 83 (556) Q Consensus 4 ~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v 83 (556) |.=..|..-++...... ..... ....+++. ..|....-+ .+.+..++.+..+| T Consensus 1 Mg~f~~lf~r~~~~~~~---~~~~~---~~~~~~~~-----~~~~g~~v~----------------~~~al~~~~v~~~i 53 (414) T protein:vir:44 1 MVFFSGLFQRKSDAPVT---TPAEL---ADAIGLSY-----DTYTGKQIS----------------SQRAMRLTAVFSCV 53 (414) T ss_pred CchhhhhhccCccCccc---chhhH---hHhhccCc-----cccCCceec----------------hhhhhccHHHHHHH Confidence 44443333222111000 00000 00001100 000000001 12234688899999 Q ss_pred HHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEE Q lcl|NC_019524. 
84 AVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLAT 163 (556) Q Consensus 84 ~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~ 163 (556) +.|.+.|-+.-|++.-..+.. .+. ..-..++..+..+|+ ..++..++....+..++..|++|+. T Consensus 54 ~~Ia~~ia~~p~~~~~~~~~~-----~~~-----~~~~~~~~lL~~~PN------~~~t~~~f~~~~~~~~ll~Gna~~~ 117 (414) T protein:vir:44 54 RVLAESVGMLPCNLYHLNGSL-----KQR-----ATGERLHKLISTHPN------GYMTPQEFWELVVTCLCLRGNFYAY 117 (414) T ss_pred HHHHHHhccCceEEEEecCCc-----eee-----cccchHHHHHHhhcc------cCCCHHHHHHHHHHHHhhcCCeEEE Confidence 999999988777764332110 000 001112333334454 3579999999999999999999988 Q ss_pred EeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceeeccc Q lcl|NC_019524. 164 CEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPARF 243 (556) Q Consensus 164 ~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~~~ 243 (556) ++.. .+ .+..|..|.|+.+. |+.+..|+++ |.+.... | ... T Consensus 118 i~~~--~g-------~~~~L~~l~~~~v~--------------~~~~~~~~~~-y~~~~~~-g--------------~~~ 158 (414) T protein:vir:44 118 KVKA--FG-------EVAELLPVDPGCVV--------------PKLNSSWEPV-YQVTFPD-G--------------STD 158 (414) T ss_pred EEeC--CC-------cEEEEEEEcCceEE--------------EEECCCCcEE-EEEEecC-c--------------eEE Confidence 7532 11 24677888888773 3445556543 5443221 1 012 Q ss_pred cCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccc Q lcl|NC_019524. 244 DWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIF 323 (556) Q Consensus 244 ~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~ 323 (556) .++..+|||+... ..+...|+|++..+...+.......++...-.+=.+...++|+.+..-. .+..+.. 
T Consensus 159 ~~~~~evih~~~~-~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~----------~e~~~~~ 227 (414) T protein:vir:44 159 VLSQEDIWHVRTL-TLDGLVGLNPIAYAREAISLAAATEEHGARLFSNGAVTSGVLRTEQTLS----------DQAYERL 227 (414) T ss_pred EEccccEEEecCC-CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCC----------HHHHHHH Confidence 4678899999765 4677999999998888777777766666666666777788888653211 1111111 Q ss_pred cccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccchhHH Q lcl|NC_019524. 324 NEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSAR 403 (556) Q Consensus 324 ~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R 403 (556) .... .....+. =..|.+..|+.|.+++.++.+.-...|.+..+.....||+.+|||-+.| ++.+++|||++. T Consensus 228 ~~~~---~~~~~g~----~n~~~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~l-~~~~~~t~~n~e 299 (414) T protein:vir:44 228 KKDF---EERHTGL----GNAHRPMILEMGLDWKSMALNAEDSQFLETRKFQLEEICRLFRVPLHMV-QNTDRATFNNIE 299 (414) T ss_pred HHHH---HHHhcCc----cccCcceecCCCceEEEccCChHHHHHHHHHHHHHHHHHHHhCCCHHHh-CCCCCCCcccHH Confidence 0100 0000000 1246788899999999999876777888899999999999999999865 677788999975 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHH Q lcl|NC_019524. 404 ASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEA 483 (556) Q Consensus 404 ~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A 483 (556) +.... ++...+.|+.+. ++.++....++-..... 
+.+++-.-.-...|+...+++ T Consensus 300 ~~~~~-----------~~~~~l~P~~~~-ie~~ln~~L~~~~~~~~-------------~~i~fd~~~ll~~d~~~~~~~ 354 (414) T protein:vir:44 300 ELGLG-----------FINYSLVPYLTR-IEQRINTGLVRKSKQGV-------------FYAKFNAGALLRGDMKSRFEA 354 (414) T ss_pred HHHHH-----------HHHHHHHHHHHH-HHHHHHhhcCCccccCc-------------eEEEEechhhhccCHHHHHHH Confidence 54443 345556675554 45555554443211100 012222223333589999999 Q ss_pred HHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 484 AILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 484 ~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 556 (556) ..+.+++|+.|+-|+-+..|.+|-+-- +++=++... ...+.......++.++..+++++- T Consensus 355 ~~~~~~~G~~t~NE~R~~~gl~p~~gg----------D~~~~~~n~---~~~~~~~~~~~~~~~~~~~d~~~~ 414 (414) T protein:vir:44 355 YATGINWGIYSPNDCRDLEDMNPRPGG----------DVYLTPMNM---TTKPSDGSKAGKQKDNANADETTS 414 (414) T ss_pred HHHHHhCCCcCHHHHHHHhCCCCCCCc----------ceecccccc---cccCCccccCCCCCCCCCCCCCCC Confidence 999999999999999988898874311 111000000 000011111111111111222222 No 32 >protein:vir:2683 Length: 412 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075502;genbank:gi:12719431;genbank:GeneID:920150 Probab=99.71 E-value=1.6e-17 Score=112.65 Aligned_cols=408 Identities=10% Similarity=0.043 Sum_probs=221.6 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCc--ccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTR--EMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGY 78 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r--~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~ 78 (556) ||=.+. .....+.......+ +.+. .+.. ....|...... .-+.+.+..+|. 
T Consensus 1 m~~~~~-~~~~~~~~~~~~~~------------~~~~-~~~~~~~~~~~~~~~~~-------------~v~~~~a~~~~~ 53 (412) T protein:vir:26 1 MNVIAK-ENIVTRIKKKLIDN------------WIDQ-STSKLYDFSPWKNRSFW-------------GVINNTLETNET 53 (412) T ss_pred Cccchh-hhhhhhhhhhHhhh------------hhcc-cccccccccccCCcccc-------------ccchhhhhccHH Confidence 433322 11111111111100 1110 0111 01223221111 013344567788 Q ss_pred HHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecC Q lcl|NC_019524. 79 AAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTG 158 (556) Q Consensus 79 a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dG 158 (556) +..+|+.|.+.|-..-|++.-+.+ . ... ..+..+-.+|+ -.+|..++-...+..++..| T Consensus 54 v~~~i~~ia~~iA~lp~~~~~~~~--------~----~~~---~~~~lL~~~PN------~~~t~~~f~~~~~~~lll~G 112 (412) T protein:vir:26 54 IFSAITKLSNSMASLPLKMYEDYK--------V----VNT---EVSDLLTVSPN------NSLSSFDFINQIETIRNEKG 112 (412) T ss_pred HHHHHHHHHHhHhhCceeEeeccc--------c----ccc---hHHHHHHhhcc------cCCCHHHHHHHHHHHHhhcC Confidence 899999999998876665532211 0 111 12333434554 24688899889999999999 Q ss_pred ceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccce Q lcl|NC_019524. 159 EVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGY 238 (556) Q Consensus 159 E~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~r 238 (556) ++|+.+.+... ..+..|.+|.|+.+. |+.|.++..+.|++...+ |. T Consensus 113 nay~~i~r~~~--------G~~~~L~~l~~~~v~--------------v~~~~~~~~~~y~~~~~~-g~----------- 158 (412) T protein:vir:26 113 NAYVLIERDIY--------HQPSKLFLLNPDVVE--------------MLIENQSRELYYSIHAAT-GN----------- 158 (412) T ss_pred ceEEEEEECCC--------CcEEEEEEEcCceeE--------------EEEeCCCcEEEEEEEcCC-ce----------- Confidence 99998764321 235678888888873 566777888889885332 11 Q ss_pred eeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeE-eccCcccccccccccccc Q lcl|NC_019524. 
239 EPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASV-ESELPSDVVFGQLGMGQG 317 (556) Q Consensus 239 v~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi-~~~~~~~~~~~~~~~~~~ 317 (556) ...+++++|||+..+...+...|+|++..+...+.-.....+. .....+.-..+| +.+.. ..+ T Consensus 159 ---~~~~~~~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~a~~~~---~~~~~~~~~~~i~~~~~~----------l~~ 222 (412) T protein:vir:26 159 ---KLIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTF---NLTEMQKPDSFMLKYGSN----------VGK 222 (412) T ss_pred ---EEEEccccEEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHH---HHHhcCCCCceEEecCCC----------CCH Confidence 1246788999998877789999999987654433322222211 222222222233 32211 011 Q ss_pred cccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcc Q lcl|NC_019524. 318 GFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKT 397 (556) Q Consensus 318 ~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~ 397 (556) +..+....... ... =..|.+..|..|.+++.++.+.-..+|.+..+....+||+.+|||-..| ++.+++ T Consensus 223 e~~~~~~~~~~---~~~-------~~~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~afgVPp~~l-g~~~~~ 291 (412) T protein:vir:26 223 EKRQQVLEDFK---QYY-------EENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFL-NARSNT 291 (412) T ss_pred HHHHHHHHHHH---HHh-------hcCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHh-CCCCCC Confidence 11111111100 000 1356788999999999998776667888888899999999999998866 667788 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccch Q lcl|NC_019524. 398 NYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDE 477 (556) Q Consensus 398 nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP 477 (556) |||++.+....|.. .-+.|+... |+.++-...++-..... 
-..+++-...-.-.|+ T Consensus 292 ~~sn~e~~~~~f~~-----------~~l~P~~~~-ie~~ln~kLl~~~~~~~------------~~~~~fd~~~l~~~d~ 347 (412) T protein:vir:26 292 NFAKNEELNRFYLQ-----------HTLLPIVKQ-YEEEFNRKLLTKTDREK------------NRYFKFNVKSYLRADS 347 (412) T ss_pred CcccHHHHHHHHHH-----------HHHHHHHHH-HHHHHHhhcCCcccccC------------cceEEeechhhhccCH Confidence 99999766655543 345665555 34444444433211100 0012222333334599 Q ss_pred hhhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 478 KKETEAAILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 478 ~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 556 (556) ...+++....+++|+.|+-|+-+..|.+|-+-.+ ++=++....+........ .....++++.++ T Consensus 348 ~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~ggD----------~~~~~~n~~~~~~~~~~~-----~~~~gG~~n~~e 411 (412) T protein:vir:26 348 ATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGD----------KPLISGDLYPIDTPLELR-----KSLKGGDKNVNE 411 (412) T ss_pred HHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcC----------eeeecccccccccchhhc-----ccccCCCCCcCC Confidence 9999999999999999999999998988754211 111111111111111101 111111122222 No 33 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=99.71 E-value=8.2e-16 Score=103.28 Aligned_cols=465 Identities=14% Similarity=0.089 Sum_probs=218.5 Q ss_pred chhhhHHHHHhhHhhcccchhhhhhhhcchhccccC-CCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHH Q lcl|NC_019524. 4 VKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAER-TTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGV 82 (556) Q Consensus 4 ~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~-~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~ 82 (556) |.+ +.....-....... ......+..... .-+....|.-...... .+.. 
..-...|++.....|++-+ T Consensus 1 ~~~------~i~~~~~~~~~~~~-~~~l~~~~~~~~~r~~~~~~Yy~G~~~i~-~~~~---~~~~~~~~~~~~~n~~~~i 69 (485) T protein:vir:10 1 MTA------PLPGQEEIEDPAIA-RDEMVSAFEDSTQNLKTNTSYYEAERRPE-AIGV---TVPIQMQSLLAHVGYPRLY 69 (485) T ss_pred CCC------CCCCCCCCCCHHHH-HHHHHHHHHHHHHHHHHHHHHHhcCCcch-hcCC---CCChhhhhhhhhcCcHHHH Confidence 111 00000000000000 000000000000 0000011110000000 0000 0001112333445699999 Q ss_pred HHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEE Q lcl|NC_019524. 83 VAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLA 162 (556) Q Consensus 83 v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~ 162 (556) |+.+++++++.||+.. + +++ ..+.+... |.. .+|...+..+.+..++-|.||+ T Consensus 70 vd~~~~~l~~~g~~~~---~------~~~----~~~~~~~i---~~~-----------N~~d~~~~~~~~~a~i~G~ay~ 122 (485) T protein:vir:10 70 VDSIAERQAVEGFRFG---D------ADE----ADEELWQW---WQA-----------NNLDIEAPLGYTDAYVHGRSYI 122 (485) T ss_pred HHHHHhhhcccceecC---C------Cch----hHHHHHHH---HHh-----------cCHhHHHHHHHHHHhhcCceEE Confidence 9999999999998742 1 111 22333333 322 2688889999999999999998 Q ss_pred EEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEE---CCCCCeEEEEEeecCCCccccCCcccccee Q lcl|NC_019524. 163 TCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQL---DNNGAALGYWLRKAFPGDPTDMEQWKWGYE 239 (556) Q Consensus 163 ~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~---d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv 239 (556) ..............-+. ..+..++|..+-.-++.. 
.+.+..++.+ +..+....++++..+---.+......|.-+ T Consensus 123 ~v~~~e~~~~~~~~~~~-~~i~~~~p~~~~~~~D~~-~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~ 200 (485) T protein:vir:10 123 TISRPDPQIDLGWDPNT-PIIRVEPPTRMYAEIDPR-IGRVSKAIRVAYDAEGNEIQAATLYTPNDIFGWYRVENEWQEW 200 (485) T ss_pred EEeeCCcccccccCCCe-eEEEEEccceeEEEEcCC-CCceeEEEEEEEeeCCCeEEEEEEEeCCeEEEEEEcCCceEEe Confidence 85433221111111112 368889998875333322 2223334332 234455555555432111111111223211 Q ss_pred ec-cccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceee---eEeccCcccccccccccc Q lcl|NC_019524. 240 PA-RFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAA---SVESELPSDVVFGQLGMG 315 (556) Q Consensus 240 ~~-~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~---fi~~~~~~~~~~~~~~~~ 315 (556) .. ...++.--|+|+....+.+..-|+|.+.+.+..| ++.|..+....+.++-.++. +|+-...+.. . T Consensus 201 ~~~~~~~g~vPvv~~~n~~~~~~~~G~s~i~~~v~~l--iDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~-----~-- 271 (485) T protein:vir:10 201 FNNPHGLGVVPVVPIPNRTRLSDLYGTSEITPELRSM--TDAAARILMLMQATAELMGVPQRLIFGIKPEEI-----G-- 271 (485) T ss_pred ccccCCCCcccEEEeccccccCCCCCccchhHHHHHH--HHHHHHHHHHHHHHHHhhcchHHHHhcCCcccc-----c-- Confidence 11 1234555578888888888889999999855444 45555555555554433332 2221111000 0 Q ss_pred cccccccccccccccccccccccceecCCceeeecCCCceeeeecCCC-CCccHHHHHHHHHHHHHHhcCCCHHHhhchh Q lcl|NC_019524. 316 QGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGT-PGGVGTDYEQSLLRNIAASLGMSYEQFSRDY 394 (556) Q Consensus 316 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~-p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~ 394 (556) .....+.....+.+|.|..+ +|+++++.+.+. +-.+|...++..++.+++..++|.+.|.++. 
T Consensus 272 ---------------~~~~~~~~~~~~~~~~i~~~-~~~d~k~~q~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~fg~~~ 335 (485) T protein:vir:10 272 ---------------VDPETGQTLFDAYLARILAF-EDAEGKIQQFSAAELANFTNALDQIAKQVAAYTGLPPQYLSTAA 335 (485) T ss_pred ---------------ccccccchhhhhcccceecc-CCCCceEEeecccchHHHHHHHHHHHHHHhcccCCCHHHhcccc Confidence 00111223334556777666 455666654332 2345777777889999999999999886543 Q ss_pred hcccchh---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCc Q lcl|NC_019524. 395 TKTNYSS---ARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGAS 471 (556) Q Consensus 395 s~~nYSs---~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~ 471 (556) .|.+| +|..+.......+..|..|-..+ +.+++..+ ++..+ ...+.. ..-+.+.|..+. T Consensus 336 --~n~~Sg~Al~~~~~~l~~k~~~k~~~f~~~l-~~~~~l~~--~~~~~-~~~~~~------------~~~i~v~w~~~~ 397 (485) T protein:vir:10 336 --DNPASAEAIRAAESRLIKKVERKNSIFGGAW-EEAMRLAY--RMMKG-GDVPPD------------MLRMETVWRDPS 397 (485) T ss_pred --CchhHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHH--HHhCC-CCCccc------------ceeeeEEecCCC Confidence 46655 56666666666666666665554 34444332 23333 211110 012467887777 Q ss_pred ccccchhhhhHHHHHHHHcC--CCCHHHHHHHhCCCHHHHHHHHHHHH--HHHHHcCCCCCccccccCCCCCCCCCC-CC Q lcl|NC_019524. 472 RGQIDEKKETEAAILRIKNG--LSTYEAEISRLGGDFREVFKQRAREE--GLIKSLKLDFTGKMVEGNSTQSSNSSE-ST 546 (556) Q Consensus 472 ~~~iDP~Ke~~A~~~~i~~G--~~s~~~~~ae~G~D~e~v~~q~a~E~--~~~~~~Gl~~~~~~~~~~~~~~~~~~~-~~ 546 (556) .. +....+.+..+.+.+| +.|.+..+...|.+++++ +++.++. +.....++. +. ............++ ++ T Consensus 398 ~~--~~~~~ada~~kl~~ag~~~~s~et~~~~lg~~~~~~-~~~~~~~ee~~~~~~~~~-~~-~~~~~~~~~~~~~~~~~ 472 (485) T protein:vir:10 398 TP--TYAAKADAASKLYNGGTGVIPRERARKDMGYSIAER-EEMRRWDEEEAAMGLGLI-GT-MVDPNPTVPGSPSPAPA 472 (485) T ss_pred CC--CHHHHHHHHHHHHhccccCCCHHHHHHhCCCCHhHH-HHHHHHHHHHHHHHHHHH-HH-hhccCCCCCCCCCcccc Confidence 55 6677788888888876 888888888889988765 3333222 211111110 00 00000000000001 11 Q ss_pred CCCCCCcCCC Q lcl|NC_019524. 
547 SDNPNEETTQ 556 (556) Q Consensus 547 ~~~~~~e~~~ 556 (556) ++.+.+++.- T Consensus 473 ~~~~~~~~~~ 482 (485) T protein:vir:10 473 PKPAALESGG 482 (485) T ss_pred ccCcCCCCCC Confidence 1111111111 No 34 >protein:vir:7407 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839924;genbank:gi:30089894;genbank:GeneID:1260681 Probab=99.71 E-value=9e-17 Score=108.56 Aligned_cols=386 Identities=9% Similarity=-0.009 Sum_probs=216.7 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHH--HHHHHHHHHHHhcChH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQ--DMASARAQDMVQNDGY 78 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~--~~lr~RaRdl~rNn~~ 78 (556) |-+.... .+..... .. ......|......++ +.... ..-..-+-.-+.+++. T Consensus 2 ~m~~~~~--~~~~~~~---------------------~~-~~~~~~~~~~~~~~~--~~~~~~~~~g~~v~~~~al~~~~ 55 (392) T protein:vir:74 2 ILPILNF--INQTNDP---------------------PE-AGSVQSYFPDGNDAQ--IMESLLGDNNEWVSARAALRNSD 55 (392) T ss_pred cchhhhh--hhcccCc---------------------cc-ccccccccccCchhh--hhhhccCCCCcccchhhhhcchH Confidence 2222111 0000000 00 000111111111111 00000 0000001122346788 Q ss_pred HHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecC Q lcl|NC_019524. 79 AAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTG 158 (556) Q Consensus 79 a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dG 158 (556) +..+|+.+.+.|-+.-|++.-+.. .. .-++++. 
.+|.+++.+..+..++..| T Consensus 56 v~~~v~~ia~~ia~lp~~~~~~~~-----------~~-----------l~~~PN~------~~t~~~f~~~~~~~lll~G 107 (392) T protein:vir:74 56 LFSIILQLSSDLAIVKINAEKKKN-----------QG-----------IIDNPST------NANKHGFWQSMFAQLLLGG 107 (392) T ss_pred HHHHHHHHHHhhccCceeeccchh-----------hh-----------hhhhcCC------CCCHHHHHHHHHHHhhhcC Confidence 999999999999876666542211 11 1224542 4688999999999999999 Q ss_pred ceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccce Q lcl|NC_019524. 159 EVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGY 238 (556) Q Consensus 159 E~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~r 238 (556) ++|+.+++... ..+..|..|+|+.+. |+.|.+|..+.|.+....+... T Consensus 108 na~~~i~r~~~--------G~~~~L~~i~~~~v~--------------v~~~~~~~~~~y~~~~~~~~~~---------- 155 (392) T protein:vir:74 108 EAFAYRWRNAN--------GADMKWEYLRPSQVN--------------TYYFEYENGMYYNITFDDPKIE---------- 155 (392) T ss_pred CEEEEEEECCC--------CcEEEEEEEcCceeE--------------EEEcCCCceEEEEEEecCCccc---------- Confidence 99999875321 235788999998883 5566778888888865443210 Q ss_pred eeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCccccccccccccccc Q lcl|NC_019524. 239 EPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGG 318 (556) Q Consensus 239 v~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~ 318 (556) ....+++++|||+......|...|+|++..+...+.-....+++.....+=.+...++|+.+...... +. 
T Consensus 156 --~~~~~~~~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~~il~~~~~~~~~--------~~ 225 (392) T protein:vir:74 156 --PILQAPQSDLIHMKLLSIDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLS--------DK 225 (392) T ss_pred --eeEEEcCccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCch--------HH Confidence 11246778999999888888899999999888888777777777777777778888888865421110 00 Q ss_pred ccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhccc Q lcl|NC_019524. 319 FKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTN 398 (556) Q Consensus 319 ~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~n 398 (556) ....... ... ..-..|.+..|+.|.+++.++.+....+|.+..+.....||+.+|||-+.| |+.+. + T Consensus 226 ~~~~~~~-------~~~----~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~-~ 292 (392) T protein:vir:74 226 DKASRSR-------SFM----KRSRSGGPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYI-GGQGD-Q 292 (392) T ss_pred HHHHHHH-------HHh----ccccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHh-CCCCC-c Confidence 0000000 000 112457788899999999998777778899999999999999999998866 66543 4 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchh Q lcl|NC_019524. 399 YSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEK 478 (556) Q Consensus 399 YSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~ 478 (556) +|+. +.... ++...+.|+.+.+. .++....++ +. .+ -.-...-.|+. 
T Consensus 293 ~~~~-e~~~~-----------~~~~~l~p~~~~ie-~~l~~~l~~---~~---~~--------------~~~~~~~~d~~ 339 (392) T protein:vir:74 293 QSSI-QQISG-----------MYASALNRYLRPAI-SELEYKLSD---HI---SV--------------NMRPAIDPLGD 339 (392) T ss_pred ccHH-HHHHH-----------HHHHHHHHHHHHHH-HHHHHhccc---hh---cc--------------cchhhhcCCHH Confidence 4432 11111 33444556555433 333332211 10 11 01111224788 Q ss_pred hhhHHHHHHHHcCCCCHHHHHHH---hCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCC Q lcl|NC_019524. 479 KETEAAILRIKNGLSTYEAEISR---LGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNP 550 (556) Q Consensus 479 Ke~~A~~~~i~~G~~s~~~~~ae---~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~ 550 (556) .-+......+++|+.|+.|+.+. .|..+.++-+ ..+++.- +.+ ++.+..| T Consensus 340 ~~~~~~~~l~~~g~~t~near~~~~~~g~~pne~r~----------~enl~~~-------~~G-----d~~~p~p 392 (392) T protein:vir:74 340 NYLSTISTATRWGALAENQATFVLQEAGYIPKDLPA----------PENTNKK-------TTG-----QSNEPVP 392 (392) T ss_pred HHHHHHHHHHhCCCcCHHHHHHHHHhCCCCccccch----------hcCCCCC-------CCC-----CCCCCCC Confidence 78888889999999999987542 2433332210 1122110 000 0011111 No 35 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=99.70 E-value=2e-15 Score=101.20 Aligned_cols=453 Identities=15% Similarity=0.091 Sum_probs=217.1 Q ss_pred hhHhhcccchh----hhh-hh-----hc------chhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcCh Q lcl|NC_019524. 14 KAVDVVAETAT----ATP-MA-----VG------GGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDG 77 (556) Q Consensus 14 ~a~~~~~~~~~----~~~-~~-----~~------~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~ 77 (556) .+....+-..- .+. .. .. ..|.-+.. ....+ ...+-...+.+...+. 
T Consensus 1 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~rl~~l~~Yy~G~~---~i~~~--------------~~~~~~~~~~~~~~~n 63 (484) T protein:vir:77 1 MTSPLQKQENVDPEKAREEMLNLFTERTQDLGDNTAYYESER---RPDAV--------------GVTVPQQMQKLLAHVG 63 (484) T ss_pred CCCcccccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccc---cchhc--------------ccccchhHHhhhhhcC Confidence 11111111000 000 00 00 01211111 01100 0001112233334567 Q ss_pred HHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheec Q lcl|NC_019524. 78 YAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMT 157 (556) Q Consensus 78 ~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~d 157 (556) |++-+|+.++..+++.||+.. + ++...+ .+|+-|.+ .+|...+..+++..+.- T Consensus 64 ~~~~ivd~~~~~l~~~g~~~~---~----------~~~~~~---~l~~i~~~-----------N~~d~~~~~~~~~a~~~ 116 (484) T protein:vir:77 64 YPRLYIDAIAARQELEGFRLG---G----------ADKADE---QLWDWWQA-----------NDLDIESTLGHTDSLVH 116 (484) T ss_pred cHHHHHHHHHhhhccCceecC---C----------cchhHH---HHHHHHHh-----------cCHhHHHHHHHHHHhhc Confidence 999999999999999998742 1 111122 23443322 26788899999999999 Q ss_pred CceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEE--C-CCCCeEEEEEeecCCCccccCCcc Q lcl|NC_019524. 158 GEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQL--D-NNGAALGYWLRKAFPGDPTDMEQW 234 (556) Q Consensus 158 GE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~--d-~~Gr~vaY~i~~~hpgd~~~~~~~ 234 (556) |.+|................+.| .|..++|..+-.-++.. .+.+..+|.+ + ..|....+-++..+---.+..... 
T Consensus 117 G~a~~~v~~~~~~~~~~~~~~~~-~i~~~~p~~~~~~~D~~-~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~ 194 (484) T protein:vir:77 117 GRSYITISKPDPNIDPGVDPEVP-IIRVEPPTNLYAQIDPR-TRQVMRAIRAIEDEEGNEVIGATLYLPNNTVIWNREDG 194 (484) T ss_pred CceEEEEecCCCCcccccccccc-eEEEeccceeEEEecCC-CCceEEEEEEEEeecCCcEEEEEEEecCeEEEEEecCC Confidence 99998864433221111111222 57788888875433322 2345555533 2 233333222222110001111112 Q ss_pred ccceeecc-ccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhccee---eeEeccCccccccc Q lcl|NC_019524. 235 KWGYEPAR-FDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYA---ASVESELPSDVVFG 310 (556) Q Consensus 235 ~~~rv~~~-~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~---~fi~~~~~~~~~~~ 310 (556) .|.-+... ..++.-=|+|+....+.+..-|+|.|.+.+..| ++.|..+....+..+-.++ .+|+-..++.... T Consensus 195 ~~~~~~~~~~~~g~vPvv~f~N~~~~~~~~G~s~i~~~v~~L--~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~- 271 (484) T protein:vir:77 195 QWVQVANVAHNLEMVPVIPIPNRTRLSDLYGTTEITPELRSV--TDAAARTLMLMQATAELMGVPQRLLFGVKGEELGV- 271 (484) T ss_pred ceEeeccccCCCCCcceEEeccccccCccCCcccchHHHHHH--HHHHHHHHHHHHHHHHhhhhhHHHHhCCCcchhcc- Confidence 23322211 122322378888888999999999999755444 3455555544444443332 2232111100000 Q ss_pred ccccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCC-ccHHHHHHHHHHHHHHhcCCCHHH Q lcl|NC_019524. 311 QLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPG-GVGTDYEQSLLRNIAASLGMSYEQ 389 (556) Q Consensus 311 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~-~~f~~F~~~~lr~iaaglGi~ye~ 389 (556) ....+.......+|.|..+ +|+++++.+.+..+ .+|...++..++.+++..++|-+. 
T Consensus 272 ---------------------~~~~~~~~~~~~~~~~~~~-~~~~~~~~q~~~~~~e~~~~~l~~~i~~~s~~~~~p~~~ 329 (484) T protein:vir:77 272 ---------------------DPETGQTLFDAYLARILAF-EDHESKAQQFSAAELRNFVDALDALDRKAAAYTGLPPYY 329 (484) T ss_pred ---------------------cccccchhhhhhhhhhccc-CCCCceeEeecCCChHHHHHHHHHHHHHHhcccCCCHHH Confidence 0011122233445666555 45566665544222 356667788899999999999999 Q ss_pred hhchhhc-ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeee Q lcl|NC_019524. 390 FSRDYTK-TNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWI 468 (556) Q Consensus 390 l~~D~s~-~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~ 468 (556) |.++.++ +|-.+.|..+.......+..|..|-..+.+ +++.. .++..+. ..+.. ..-+.+.|. T Consensus 330 fg~~~~n~~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~-~~~l~--~~~~~~~-~~~~~------------~~~i~v~w~ 393 (484) T protein:vir:77 330 LSFSSENPASAEAIRSSESRLVKTVERKNKIFGGAWEQ-AMRVA--YKVMNGG-DIPPE------------YYRMESIWR 393 (484) T ss_pred hccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHH--HHHhCCC-Ccccc------------cccceEEec Confidence 9766443 345567777777777777777777666544 34433 2343332 11110 012467887 Q ss_pred cCcccccchhhhhHHHHHHHHcC--CCCHHHHHHHhCCCHHHHH--HHHHHHHHHHHHcCCCCCccccccCCCCCCCCCC Q lcl|NC_019524. 469 GASRGQIDEKKETEAAILRIKNG--LSTYEAEISRLGGDFREVF--KQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSE 544 (556) Q Consensus 469 ~p~~~~iDP~Ke~~A~~~~i~~G--~~s~~~~~ae~G~D~e~v~--~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~ 544 (556) .+... +-...+.+..+.+.+| +.|.+..+...|...+++- +++.+|... ...++............+....++ T Consensus 394 ~~~~~--s~~~~ad~~~kl~~~g~gi~s~et~~~~l~~~~~~~~e~~~~~~ee~~-~~~~~~~~~~~~~~~~~~~~~~~~ 470 (484) T protein:vir:77 394 DPSTP--TYAAKADAATKLYNNGQGVIPKERARIDMGYSITEREEMRKWDEEEQA-QGLGLMGTMFGTDPSGGGNPDNPE 470 (484) T ss_pred CCCCC--CHHHHHHHHHHHHhccCCCCCHHHHHhcCCCChhHHHHHHHHHHHHHH-HHHHHHhhhccccccCCCCCCCCC Confidence 77755 4445566666777765 8899888888898655432 233222211 111111101011111111111122 Q ss_pred CCCCCCCCcCCC Q lcl|NC_019524. 
545 STSDNPNEETTQ 556 (556) Q Consensus 545 ~~~~~~~~e~~~ 556 (556) .+...|+.++++ T Consensus 471 ~~~~~~~~~~~~ 482 (484) T protein:vir:77 471 TPEPQPNPAEEA 482 (484) T ss_pred cccccCCCcccc Confidence 222223333333 No 36 >protein:vir:96579 Length: 576 # NCBI annotation: ORF012 # Family: family:all:2446 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238542;genbank:gi:66391267;genbank:GeneID:5130361 Probab=99.70 E-value=1.6e-16 Score=107.16 Aligned_cols=458 Identities=12% Similarity=0.054 Sum_probs=208.5 Q ss_pred CCcchhhhHHHHH--hhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChH Q lcl|NC_019524. 1 MKDVKKTTRTRAK--KAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGY 78 (556) Q Consensus 1 ~sp~~~~~r~~a~--~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~ 78 (556) |...-. -+.+-. ++.. .........++..... .+.-.+.+.....|.... ....+....+.+. +||+ T Consensus 27 ~~~~~~-~~~~~~~~~~~~-~~~~~~~~~~a~~~p~-~~~~~~~~~~~~~p~~~~-------~~~~~~~~l~~~~-~npi 95 (576) T protein:vir:96 27 IDDGLQ-ANIRNIEEKSKE-LNKSLYGKQQAYAEPF-LEVMDTNPEFRTKRSYMK-------NSDNLHDVLKQFG-NNPI 95 (576) T ss_pred cccChh-HHHHHhhhhhhh-hccccCCccchhhcce-eeeeecCCCccccCcchh-------hhhhhHHHHHHhh-cCHH Confidence 111000 000000 0000 0000000000000000 000111000001121111 1112222233333 5799 Q ss_pred HHHHHHHHHhhhcc-----------CCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHH Q lcl|NC_019524. 79 AAGVVAVHRDSIVG-----------SQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLT 147 (556) Q Consensus 79 a~~~v~~~~~nvVG-----------~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq 147 (556) +..+|+.+.+.|-. -|+.+..+ ......+.++..++ ..++. +-.++.. +..++ ++||.++. 
T Consensus 96 v~~~I~~ia~~vA~~~~~~~~~~~~~~~~i~lk--~~~~~~~~~~~~~~-~~l~~-~l~~~~~-~~~p~---~~t~~~f~ 167 (576) T protein:vir:96 96 LNAIILTRSNQVAMYCQPSRYNERGLGFEVRMR--DLDAEPGKKEKEEI-KRIEN-FILNTGR-DKDID---RDSFQSFC 167 (576) T ss_pred HHHHHHHHHHHHHhhhhhhhhccccccceeEEe--cCcCccchhhhHhh-hhHHh-hHhhccC-CCCCc---cccHHHHH Confidence 99999999887753 23332222 11111122221111 11111 1111110 01121 46899999 Q ss_pred HHHhhhheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCc Q lcl|NC_019524. 148 RLAVSGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGD 227 (556) Q Consensus 148 ~l~~r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd 227 (556) ..++..++..|.+|+.+++.+. +...++.|..|+|++|.. ..|.+|....|-..-....+ T Consensus 168 ~~lv~dlll~Gna~~~i~~~rd------~~g~~~~L~pl~p~~V~v--------------~~~~dg~~~~~~~~~~~~~~ 227 (576) T protein:vir:96 168 RKIVRDTYTYDQVNFEKVFNKK------NATTMDKFIAVDPSTIFY--------------ATDKNGKIIKGGKRFVQVIN 227 (576) T ss_pred HHHHHHHHhcCCeEEEEEEecC------CCCceEEEEEeCCceeEE--------------EECCCCceeeeeeEEEEecC Confidence 9999999999999998775432 123467889999998853 22333333222111000000 Q ss_pred cccCCccccceeeccccCChhHeEeeecccCCC---cccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCc Q lcl|NC_019524. 228 PTDMEQWKWGYEPARFDWGRRRVIHIIEALLAG---QTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELP 304 (556) Q Consensus 228 ~~~~~~~~~~rv~~~~~v~a~~viH~f~~~r~g---Q~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~ 304 (556) .. 
....+++.+|||+...-.++ ...|+|++..+...+......++....-.+=.+...++|+.+.+ T Consensus 228 -----~~------~~~~~~~~dii~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~Ng~~p~giL~~~~~ 296 (576) T protein:vir:96 228 -----KK------VVASFTSREMAMGIRNPRTELSSSGYGLSEVEIAMKQFIAYNNTETFNDRFFSHGGTTRGILQIKSE 296 (576) T ss_pred -----Cc------eEEEecccceEEEeecCCCCcccCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCC Confidence 00 11245677888776544443 56799999988877766555555555555556777788875422 Q ss_pred ccccccccccccccccccccccccccccccccccceecCCcee-eecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhc Q lcl|NC_019524. 305 SDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKI-PHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASL 383 (556) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i-~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaagl 383 (556) ... ..+..+.+.... .......-..|.+ ..|..|.+++.++.+.....|.+..+...+.||+.+ T Consensus 297 ~~l--------s~e~~~~lr~~~-------~~~~~G~~nag~~p~vl~~G~~~~~ls~~~~d~qfle~~~~~~~~Ia~af 361 (576) T protein:vir:96 297 QQQ--------SQRALENFKREW-------KSSFSGINGSWQVPVVMADDIKFVNMTPTANDMQFEKWLTYLINIISALY 361 (576) T ss_pred CCC--------CHHHHHHHHHHH-------HHHhccccccccceeecCCCceEEeccCChhhHHHHHHHHHhHHHHHHHh Confidence 110 000000000000 0000001134554 678999999999988888999999999999999999 Q ss_pred CCCHHHhhchhhcc-----------cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCccccc Q lcl|NC_019524. 384 GMSYEQFSRDYTKT-----------NYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRM 452 (556) Q Consensus 384 Gi~ye~l~~D~s~~-----------nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~ 452 (556) |||.+.| |+.+++ |||++.+....| +...++|+..+ ++.++....++ .+... 
T Consensus 362 gVPp~~l-G~~~~~~~~g~~~~~s~t~sn~e~~~~~f-----------~~~tL~P~~~~-ie~~ln~~Ll~--~~~~~-- 424 (576) T protein:vir:96 362 GIDPAEI-GFPNRGGATGGKGGNTLNEADPGKKQQQS-----------QNKGLQPLLRF-IEDLINTHIIS--EYSDK-- 424 (576) T ss_pred CCCHHHc-cccccccccccccccccccccHHHHHHHH-----------HHHHHHHHHHH-HHHHHHhhhch--hccCc-- Confidence 9999866 766544 666665444444 45556665554 44445443332 11110 Q ss_pred ccchhhHHHhhCeeeecCcccccchhhhhHHH--HHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCcc Q lcl|NC_019524. 453 FYDPMMRDALCNAEWIGASRGQIDEKKETEAA--ILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGK 530 (556) Q Consensus 453 ~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~--~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~ 530 (556) +.+++. -.|....++.. ...+.+|+.|+-|+-+..|+.|-+--+++-.-. .+...|-..... T Consensus 425 ----------~~~~f~-----r~d~~~~~e~~~~~~~~~~G~lT~NE~R~~~gl~piegGD~~~~~~-~~~~~~~~~~~~ 488 (576) T protein:vir:96 425 ----------YVFQFV-----GGDTKSELDKIKILQEEVKTYKTVNEARKEKGLKPIEGGDVLLDGS-FIQSMSLNTQKE 488 (576) T ss_pred ----------eEEEec-----cCCHHHHHHHHHHHHHHhcCccCHHHHHHHhCCCCCCCcceecccc-ccccccccccCC Confidence 112221 23555444433 344668999999998888998754222111000 000111000000 Q ss_pred cc-----------------ccCCCCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 531 MV-----------------EGNSTQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 531 ~~-----------------~~~~~~~~~~~~~~~~~~~~e~~~ 556 (556) .. ...+..+..+..++..+.++.++. 
T Consensus 489 ~~e~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~g~~~~~~ 531 (576) T protein:vir:96 489 QYEDTKQKERFDMIQQFLNSPDDEEPQQESTEDKVDGRESNDP 531 (576) T ss_pred CCCCccccccccccccccCCCCCCCCCCCCCCCcccccccccC Confidence 00 000000000000011111111100 No 37 >protein:vir:1326 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047925;swissprot:trembl:q9zxb2;genbank:gi:9631143;uniprot:Q9ZXB2;genbank:GeneID:2715872 Probab=99.69 E-value=3.8e-17 Score=110.60 Aligned_cols=438 Identities=12% Similarity=0.037 Sum_probs=223.4 Q ss_pred chhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHH Q lcl|NC_019524. 4 VKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVV 83 (556) Q Consensus 4 ~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v 83 (556) |.=..|..-| ....+..+.. + ..|.+...+.-........ -..-+.+-+..++-+..+| T Consensus 1 Mg~~~~l~~r---------------~~~~~~~~~~--~---~~~~~~~~~~~~~~~~~~~-g~~V~~~~al~~~~V~~~v 59 (457) T protein:vir:13 1 MGFWSALFGR---------------GHSPALDGIE--A---RAWEPYDPSIYNLGAVAAS-GETVTPHDALQVSAVFASV 59 (457) T ss_pred Cchhhhhhcc---------------cccccccccc--c---ccccccchHHHhhcccccC-CceechHHhhccHHHHHHH Confidence 2222111110 0000001000 0 1121111110000000000 0001122344567789999 Q ss_pred HHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEE Q lcl|NC_019524. 84 AVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLAT 163 (556) Q Consensus 84 ~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~ 163 (556) +.|.+.|-+--|++.-+... ..+ +.....|...... .+ . -++.+++-+..+..++..|++|+. 
T Consensus 60 ~~Ia~~iA~lp~~~~~~~~~-----~~~------~~~~~~l~~~ln~---~~--n-~~t~~~f~~~~~~~lll~Gna~~~ 122 (457) T protein:vir:13 60 RLLSETIATLPLSTYSKRGG-----SRK------EIVTPEWLDYPNA---EP--G-GMGRIDILSQTVLSLLLQGNAFLA 122 (457) T ss_pred HHHHHhhccCceEEEEecCC-----ccc------ccccchHHHhccc---cC--C-CCCHHHHHHHHHHHHhhcCCeEEE Confidence 99999998876665432110 001 1111122222211 11 1 368889999999999999999988 Q ss_pred EeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCC--ceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceeec Q lcl|NC_019524. 164 CEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDT--PNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPA 241 (556) Q Consensus 164 ~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g--~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~ 241 (556) +++.. + -+..|..|.|+++....+..++ ..+..-..++..|..+ . T Consensus 123 i~~~~-------g--~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~y~~~~~~~~~------------------------~ 169 (457) T protein:vir:13 123 VRWQG-------P--NIVGLDVLDPTKIHVHMVMVDGLRRKVFEAYDIDADGNEV------------------------L 169 (457) T ss_pred EEecC-------C--cEEEEEEEccCceEEEEecCCCccceeEEEEEEecCCcee------------------------e Confidence 76431 1 3467889999888532111111 0111111111111110 0 Q ss_pred cccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccc Q lcl|NC_019524. 242 RFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKE 321 (556) Q Consensus 242 ~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~ 321 (556) ...++..+|||+..+...+...|+|++..+...+.-.....+....-.+=.+...++|+.+..-. 
.+..+ T Consensus 170 ~~~~~~~diih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~ls----------~e~~~ 239 (457) T protein:vir:13 170 LGWFTPRDVLHIPGMMLPGDFVGCSPISYARESIGLALAAQKYGSKFFANGAMPGAVVEVPGTMS----------EEGLA 239 (457) T ss_pred EEeeCccceEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEcCCCCC----------HHHHH Confidence 12356779999999988899999999998888877777777666666666778888888653210 11111 Q ss_pred cccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccc-- Q lcl|NC_019524. 322 IFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNY-- 399 (556) Q Consensus 322 ~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nY-- 399 (556) ........ ...+ .=..|.+..|..|.+++.++.+.....|.+..+.....||..+|||-+.| |+.++++| T Consensus 240 ~~~~~~~~---~~~g----~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~~~~~ 311 (457) T protein:vir:13 240 RAREAWRA---ANSG----VDNAHRVALLTEGAKFSKVAMSPDEAQFLQTRQFQVPEIARIFGVPPHLI-SDATNSTSWG 311 (457) T ss_pred HHHHHHHH---HhcC----ccccCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHc-CCCCCccccc Confidence 11111000 0000 01246788899999999998877777888999999999999999998865 88888877 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhh Q lcl|NC_019524. 400 SSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKK 479 (556) Q Consensus 400 Ss~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~K 479 (556) |++.+..+. |+..-+.|+.++ ++.++....++-.. .. ...++|..-.....|+.. 
T Consensus 312 sn~eq~~~~-----------f~~~tl~P~~~~-ie~~ln~~L~~~~~-~~------------~~~i~fd~~~l~~~D~~~ 366 (457) T protein:vir:13 312 SGLAEQNIA-----------FTMFSLRPWLER-IEAGFNRLLFAETA-DR------------FRFVKFNLDEIKRGAPKE 366 (457) T ss_pred chHHHHHHH-----------HHHHHHHHHHHH-HHHHHHHhhcCccc-cC------------ceeEEeechhhhccCHHH Confidence 444433333 445556776654 44455544443111 10 001222222333459999 Q ss_pred hhHHHHHHHHcCCCCHHHHHHHhCCCHHHH--HHHHHHHHHHHHHcCCCCCcc----ccccCCCCCCCCCCCCCCCCCCc Q lcl|NC_019524. 480 ETEAAILRIKNGLSTYEAEISRLGGDFREV--FKQRAREEGLIKSLKLDFTGK----MVEGNSTQSSNSSESTSDNPNEE 553 (556) Q Consensus 480 e~~A~~~~i~~G~~s~~~~~ae~G~D~e~v--~~q~a~E~~~~~~~Gl~~~~~----~~~~~~~~~~~~~~~~~~~~~~e 553 (556) .+++....+++|+.|+-|+-+..|.+|-+- .+++-.-.. +...|-..... +....+....+.++.+.++.++| T Consensus 367 r~~~~~~~~~~G~~T~NE~R~~~gl~Pi~~g~~d~~~~~~n-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~d~ 445 (457) T protein:vir:13 367 RMELWSLGLQNGIYSIDEVRAAEDMTPLPDGLGEKYRVPLN-LGEVGEEPEPEPAPAPPAIEPPAEEPDEEPEPEGKPDD 445 (457) T ss_pred HHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCcccceeeccc-cccccccccccccCCCCCCCCCccccCCCCCCCCCCcc Confidence 999999999999999999988889977321 111100000 00011000000 00000001111111111111111 Q ss_pred CCC Q lcl|NC_019524. 554 TTQ 556 (556) Q Consensus 554 ~~~ 556 (556) +++ T Consensus 446 ~~~ 448 (457) T protein:vir:13 446 EGA 448 (457) T ss_pred ccC Confidence 111 No 38 >protein:vir:80040 Length: 461 # NCBI annotation: gp3 # Family: family:all:297 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468707;genbank:gi:157325287;genbank:GeneID:5601731 Probab=99.69 E-value=9.9e-17 Score=108.33 Aligned_cols=450 Identities=11% Similarity=0.103 Sum_probs=227.6 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccC--CCCCHHHHHHHHHHHHHHHHHHHHhcChH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNP--SIISPDQQIAQNQDMASARAQDMVQNDGY 78 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~--~~~s~~~~i~~~~~~lr~RaRdl~rNn~~ 78 (556) |+=.+..+..+..+. +..+. ....++ | ..+.+....|.. 
.....+ ...++.|++.|++ T Consensus 1 ~~~~~~a~~~~~~~~-------a~~~~-~~~~~~-g-~~~~~d~~~~~~~~~~~~~~----------~~~l~~lY~~~~l 60 (461) T protein:vir:80 1 MYSIDKAKQAKIDSK-------IVNRN-DFMVGH-G-KANSRDKLTRQTPGNGQKLD----------LKACENLYASNSI 60 (461) T ss_pred Cccchhhhhhhhhhh-------hhhhh-HHHhhc-C-CcchhhhhhccccCcccccC----------HHHHHHHHHhCCc Confidence 555554432221110 00011 000011 1 111222222321 111111 2467789999999 Q ss_pred HHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecC Q lcl|NC_019524. 79 AAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTG 158 (556) Q Consensus 79 a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dG 158 (556) ++.+|+.++...+-.|+.+... +++. .+.+++.|++. ....-...+++....-| T Consensus 61 ~r~iVd~~a~d~~r~g~~i~~~--------~~~~----~~~~~~~~~~l--------------~~~~~l~~~~~~~rl~G 114 (461) T protein:vir:80 61 AMNIVDIISEDMVRAGWSLKTD--------NKEM----KKNIESKWRKL--------------KTKDRFQKLYADKRLYG 114 (461) T ss_pred cchhhccchHHhhcCCeeeecC--------CHHH----HHHHHHHHHHh--------------hHHHHHHHHHHhhcccc Confidence 9999999999999999987642 2222 33444444421 23344455555666778 Q ss_pred ceEEEEeeccCCCCcCCCcccce------EEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCC Q lcl|NC_019524. 159 EVLATCEWLNPTGTTMQRRPFGT------AIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDME 232 (556) Q Consensus 159 E~f~~~~~~~~~~~~~~~~~~~l------~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~ 232 (556) .+.+.+....... .....+-|+ .|..|++-... ... ...+.+..-=..+|+|..|+|....++...... 
T Consensus 115 ~a~i~i~v~d~~~-~~~~~~~pl~~~~~~~~~~l~~~~~~---~i~-~~~~~~dp~sp~fg~P~~y~i~~~~~~~~~~~~ 189 (461) T protein:vir:80 115 DGFLSIGVVSSNR-EQADLSTAIDPKTIKSIPYINTFNTQ---KVT-QLYLNQDMFSEHFGEVEFFEVNRVSQLGEEILS 189 (461) T ss_pred cEEEEEEeecCCc-cccCccCCcccccccceeEEEecccc---ccc-hhhhcccCcCcccccceEEEEeccccccccccc Confidence 7766665432211 011111121 22223321110 000 001111111225699999999766544321111 Q ss_pred ccccceeeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCccccccccc Q lcl|NC_019524. 233 QWKWGYEPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQL 312 (556) Q Consensus 233 ~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~ 312 (556) .. ... ....|+.+.|||+....-+++..|+|.|-+++..++++++-......-..- +.+. +++++..... . T Consensus 190 ~~--~~~-~~~~iH~SRii~~~~~~~~~~~~G~S~le~~~~~l~~~~~~~~~~~~l~~~-~~~~-v~k~~~l~~~----~ 260 (461) T protein:vir:80 190 GT--TAS-TSEQIHRSRIIHEQGLRFEGETKGRSIFESLYDIITVMDTSLWSVGQILYD-FAFK-VYKTDDIDAL----N 260 (461) T ss_pred cc--cCc-cceEEccccEEEecCCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHHH-hCCC-ceecchHHhh----h Confidence 00 000 123688899999988899999999999999999888877666554332211 3333 3444322111 0 Q ss_pred ccccccccccccccccccccccccccceec-CCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhh Q lcl|NC_019524. 313 GMGQGGFKEIFNEYMTGLANYVAQTKNIAI-DGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFS 391 (556) Q Consensus 313 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l-~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~ 391 (556) +...... .... .... .-| +..+..+++++.++.+ -++..++.......||++.+||...|. 
T Consensus 261 ~~~~~~~----~~~~-----------~~~~~~~g-~~~~d~~e~~e~~~~~--lsgl~~~l~~~~~~iaa~s~iP~t~L~ 322 (461) T protein:vir:80 261 KDDKANL----TAML-----------DFMFRTEA-LAIIKGDEQLTKESTN--VSGMKDLLDYGWDYLAGAVRMPKTVLK 322 (461) T ss_pred chHHHHH----HHHH-----------HHhcCCce-EEEEcCCcceEEEecC--cCCHHHHHHHHHHHHhhhhcCCeeeee Confidence 0000000 0000 0001 112 3446778999988854 457899999999999999999999888 Q ss_pred chhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCc Q lcl|NC_019524. 392 RDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGAS 471 (556) Q Consensus 392 ~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~ 471 (556) |.-.+ ..||.-.-+.-+...++.+|+..+..+++-+++..+..+...+...-|.+.++... ++..|.+.. T Consensus 323 G~s~g-~~asge~D~~~yyd~i~~~qe~~l~p~le~l~~~i~~s~~~~~~~~~p~~~~~~i~---------f~~L~~~s~ 392 (461) T protein:vir:80 323 GQEAG-TLTGAQYDVMNYYARVSSIQENRLRPQLEYLTRLLMWASDDCGPSIDPDSFEWAIE---------FNPLWNLDS 392 (461) T ss_pred cccCC-ccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccCccccceEEE---------eCCCCCCCH Confidence 87654 44777877888889999999876666666666655543333222222322121111 112344555 Q ss_pred ccccc-hhhhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCC Q lcl|NC_019524. 472 RGQID-EKKETEAAILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNP 550 (556) Q Consensus 472 ~~~iD-P~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~ 550 (556) .+..| ..|-+++..+.+++|+-|+.++- +.+ ..+.|+..++....... ..........+.+ T Consensus 393 kekAe~~~~~a~a~~~~~~~g~is~~e~r-----------~~l------~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~ 454 (461) T protein:vir:80 393 KTDAEVRKLTAEADQIYIVNGVLDPDEVK-----------ETR------FGRFGLENSSKFSGDSA-EIDKLAKLVYDAY 454 (461) T ss_pred HHHHHHHHHHHHHHHHHHhcCCCCHHHHH-----------HHH------HHhcCCCCCccCCCCCc-hhhhhhhhccccc Confidence 55555 34556778888888887776642 211 12345543322111100 0000000111122 Q ss_pred CCcCCC Q lcl|NC_019524. 
551 NEETTQ 556 (556) Q Consensus 551 ~~e~~~ 556 (556) .+|..+ T Consensus 455 ~~e~~~ 460 (461) T protein:vir:80 455 AKKNAD 460 (461) T ss_pred cccCCC Confidence 222222 No 39 >protein:vir:4952 Length: 386 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049928;genbank:gi:9632899;genbank:GeneID:1262075 Probab=99.68 E-value=1.5e-16 Score=107.42 Aligned_cols=386 Identities=9% Similarity=0.008 Sum_probs=217.9 Q ss_pred chhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHH Q lcl|NC_019524. 4 VKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVV 83 (556) Q Consensus 4 ~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v 83 (556) |+=..+....++.... ....+..... ......|.... .+ +..-+..++.+.++| T Consensus 1 M~~f~~~~~~~~~~~~----------~~~~~~~~~~-~~~~~~~~~~~-----~v----------~~~~al~~~~v~~~i 54 (386) T protein:vir:49 1 MPIFNITNLATESPPI----------NQESFFDIAD-SDFLASLNSSE-----WV----------SAENALKNSDLFSII 54 (386) T ss_pred CchhhhhccCCCCccc----------chhhhhhhhh-ccccccccCCc-----ee----------chhhhhccHHHHHHH Confidence 3333332221110000 0000110000 00111121110 01 011123468888999 Q ss_pred HHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEE Q lcl|NC_019524. 84 AVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLAT 163 (556) Q Consensus 84 ~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~ 163 (556) +.+.+.|.+.-|++.-+. .. ..-.+|+ ..+|..++....+..++..|++|+. 
T Consensus 55 ~~ia~~ia~~p~~~~~~~-----------~~-----------~l~~~PN------~~~t~~~f~~~~~~~lll~Gna~~~ 106 (386) T protein:vir:49 55 SQLSNDLATAKITTSRKQ-----------LQ-----------GIVDNPS------NNANRFNFYQSIFAQMLLGGEAFAY 106 (386) T ss_pred HHHHHHhhhCceeeccch-----------hh-----------hhhhccC------CCCCHHHHHHHHHHHhhhcCCEEEE Confidence 999999888766654211 11 1222343 3568999999999999999999998 Q ss_pred EeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceeeccc Q lcl|NC_019524. 164 CEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPARF 243 (556) Q Consensus 164 ~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~~~ 243 (556) +++... ..+..|..|.|+.+. |+.+..|..+.|.+....++.. ... T Consensus 107 i~r~~~--------g~~~~l~~i~~~~v~--------------v~~~~~~~~~~y~~~~~~~~~~------------~~~ 152 (386) T protein:vir:49 107 RWRNDN--------GRDMKWEYLRPSQVS--------------FNRLDNQNGLYYNITFDDPHIA------------PKQ 152 (386) T ss_pred EEECCC--------CcEEEEEEecCceeE--------------EEEcCCCceEEEEEEEcCcccc------------cee Confidence 875332 235788999998874 4455667778887765443321 113 Q ss_pred cCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccc Q lcl|NC_019524. 244 DWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIF 323 (556) Q Consensus 244 ~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~ 323 (556) .+++++|||+..+...+..+|+|++..+...+.......++.....+-.+...++|+.+....... .... T Consensus 153 ~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~----------~~~~ 222 (386) T protein:vir:49 153 HVPQNDILHFRLLSVDGGLTSVSPLMALGREFNIQKASDKLTISALKNALNANGILKIKGGGLLDF----------KTKV 222 (386) T ss_pred EEccccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHHccCCccEEEEeCCCCChHH----------HHHH Confidence 467789999998887888999999999999998888888888888888888899998754322110 0000 Q ss_pred cccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccchhHH Q lcl|NC_019524. 
324 NEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSAR 403 (556) Q Consensus 324 ~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R 403 (556) .... . .. .=..|.+..|+.|.+++.++.+....+|.+..+.....||+.+|||-+.|.++ ..+|++.- T Consensus 223 ~~~~---~---~~----~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~--~~~~~~~~ 290 (386) T protein:vir:49 223 SRSR---Q---AM----KQMQGGPLVLDDLEDFTPLEIKSNVAQLLSQADWTTGQFAKVYGIPESIVGGD--GDQQSSLE 290 (386) T ss_pred HHHH---H---Hh----ccCCCCceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCC--CCccchHH Confidence 0000 0 00 11356788899999999998777777888899999999999999999988543 44666542 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHH Q lcl|NC_019524. 404 ASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEA 483 (556) Q Consensus 404 ~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A 483 (556) +. .+ .-..++..++.|+..+ ++..+.. ++. + ---...-.|+..-... T Consensus 291 ~~-~~-------~~~~~i~~~l~~i~~~-~~~~l~~-~~~---------~--------------~~~~~~~~d~~~~~~~ 337 (386) T protein:vir:49 291 MI-YN-------IYFKSVSRYLRPFVSE-MSKKLSC-EVD---------V--------------DISPAVDPTGSNYISL 337 (386) T ss_pred HH-HH-------HHHHHHHHHHHHHHHH-HHHHhcc-hhc---------c--------------cchhhhccCHHHHHHH Confidence 11 11 1112334444444332 2222211 111 0 0011112367666777 Q ss_pred HHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCcC Q lcl|NC_019524. 484 AILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEET 554 (556) Q Consensus 484 ~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e~ 554 (556) ....+++|+.|+-|+....| +.|+..+..+.. 
...+..+.++.+.++++ T Consensus 338 ~~~l~~~g~~t~nE~r~~l~------------------~~~~~~~~~~~~----~~~~~~~~~gGd~~~~~ 386 (386) T protein:vir:49 338 INSMVKSGTLAQNQGLYILQ------------------QAEILPKELPDG----KNPNRTSLKGGEINEQD 386 (386) T ss_pred HHHHHhCCCcCHHHHHHHHh------------------hCCCCCCcCcch----hccCCCCCCCCCCCCCC Confidence 77889999999988654322 333322111110 01011111111112222 No 40 >protein:vir:9359 Length: 348 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803337;genbank:gi:29028648;genbank:GeneID:1258089 Probab=99.68 E-value=3.8e-17 Score=110.59 Aligned_cols=347 Identities=12% Similarity=0.012 Sum_probs=197.1 Q ss_pred ccC-CceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEEEeeccC Q lcl|NC_019524. 91 VGS-QYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLATCEWLNP 169 (556) Q Consensus 91 VG~-Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~ 169 (556) |.. =|++. + + ++.. . ...++.+..+|+ ..+|..++.+..+..++..|++|+.+.+... T Consensus 1 ia~lp~~~~-~-~------~~~~----~---~~l~~lL~~~PN------~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~ 59 (348) T protein:vir:93 1 MASLPLKMY-E-D------YKVV----N---TEVSDLLTVSPN------NSLSSFDFINQIETIRNEKGNAYVLIERDIY 59 (348) T ss_pred CcccceEeE-e-c------CcCc----c---cHHHHHHHhCCC------CCCCHHHHHHHHHHHHhhcCCeEEEEEECCC Confidence 332 12211 0 0 0000 1 123333444554 2568889989999999999999998765321 Q ss_pred CCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceeeccccCChhH Q lcl|NC_019524. 170 TGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPARFDWGRRR 249 (556) Q Consensus 170 ~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~~~~v~a~~ 249 (556) ..+..|..|.|+.+. |..+..|.++.|.+...+. 
....+++.+ T Consensus 60 --------G~~~~L~~l~~~~v~--------------~~~~~~~~~~~y~~~~~~g---------------~~~~~~~~e 102 (348) T protein:vir:93 60 --------HQPSKLFLLNPDVVE--------------MLIENQSRELYYSIHAATG---------------NKLIVHNMD 102 (348) T ss_pred --------CcEEEEEEEcCCceE--------------EEEeCCCcEEEEEEEcCCC---------------eEEEEcccc Confidence 235788899988873 5566778889888753321 112467889 Q ss_pred eEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccccccccc Q lcl|NC_019524. 250 VIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTG 329 (556) Q Consensus 250 viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 329 (556) |||+......++..|+|++..+...+......... ...-.+.-.++|-+.... ...+.......... T Consensus 103 iih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~---~~~~~~~~~~~i~~~~~~---------l~~e~~~~~~~~~~- 169 (348) T protein:vir:93 103 MLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTF---NLTEMQKPDSFMLKYGSN---------VSTEKRQQVLEDFK- 169 (348) T ss_pred EEEecCCCCCCceeeccHHHHHHHHHHHHHHHHHH---HHHhcCCCceeEEecCCC---------CCHHHHHHHHHHHH- Confidence 99998877789999999887765544332222222 222222222222221111 01111111111000 Q ss_pred cccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccchhHHHHHHHH Q lcl|NC_019524. 330 LANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSARASMAET 409 (556) Q Consensus 330 ~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R~~~~e~ 409 (556) ... 
=..|.+..|..|.+++.++.+.-..+|.+..+...+.||+.+|||-..| ++..++||+++.+....+ T Consensus 170 --~~~-------~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~l-g~~~~~~~~~~e~~~~~~ 239 (348) T protein:vir:93 170 --QYY-------EENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSIFL-NARSNTNFAKNEELNRFY 239 (348) T ss_pred --HHh-------hcCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHh-CCCCCCCcccHHHHHHHH Confidence 000 1456788899999999998776667888999999999999999998866 667778999987655554 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHH Q lcl|NC_019524. 410 QKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIK 489 (556) Q Consensus 410 ~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~ 489 (556) +...+.|+.+. ++.++.+..++-.... .-..++|........|+...+++..+.++ T Consensus 240 -----------~~~~l~P~~~~-ie~~l~~~l~~~~~~~------------~g~~i~fd~~~l~~~d~~~~a~~~~~~~~ 295 (348) T protein:vir:93 240 -----------LQHTLLPIVKQ-YEEEFNRKLLTKTDRE------------KNRYFKFNVKSYLRADSATQAEVYFKAVR 295 (348) T ss_pred -----------HHHHHHHHHHH-HHHHHHHhhCCccccc------------CcceEEeechhhhccCHHHHHHHHHHHHh Confidence 34445665555 4444555444322110 00113444445555799999999999999 Q ss_pred cCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCcC Q lcl|NC_019524. 490 NGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEET 554 (556) Q Consensus 490 ~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e~ 554 (556) +|+.|+-|+-+..|.+|-+--+ +.=++....+...... 
......-+++..+|+ T Consensus 296 ~G~~T~NE~R~~~g~~p~~ggD----------~~~~~~n~~~~~~~~~--~~~~~~gg~~n~~~~ 348 (348) T protein:vir:93 296 SGYYTINDIREWEDLPPVEGGD----------KPLISGDLYPIDTPLE--LRKSLKGGDKNVNES 348 (348) T ss_pred CCCCCHHHHHHHhCCCCCCCcC----------eEeecccccccccchh--hcccccCCCCCcCCC Confidence 9999999999888988754111 1001111111111000 000011111111122 No 41 >protein:vir:1380 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612832;genbank:gi:20065966;genbank:GeneID:935782 Probab=99.68 E-value=7.5e-17 Score=108.99 Aligned_cols=417 Identities=12% Similarity=0.047 Sum_probs=231.7 Q ss_pred chhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHH Q lcl|NC_019524. 4 VKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVV 83 (556) Q Consensus 4 ~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v 83 (556) |.=..|...++......+.... ...++. .++. .-|......+...+. -+-+-.++.+..+| T Consensus 1 MG~f~~lf~~~~~~~~~~~~~~----~~~~~~---~~~~--~~~~~~g~~~~~~v~----------~~~al~~~~v~~ci 61 (422) T protein:vir:13 1 MGFLRGLFNKKNNNDEKRSNYD----EDIGID---ISDS--NFWEKFGIKLNFSVR----------GKRALKENTVYVCT 61 (422) T ss_pred CchhhhhhhccCCccchhhhhh----hccccc---cCcc--hhhhhccccCCcccc----------hhhhhccHHHHHHH Confidence 5445544443332111000000 000000 0000 001100101110010 01112356778999 Q ss_pred HHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEE Q lcl|NC_019524. 84 AVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLAT 163 (556) Q Consensus 84 ~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~ 163 (556) +.+.+.|-...|++.-..+ ... .. .+.+.+..+|+ ..+|..++.+..+..++..|++|+. 
T Consensus 62 ~~ia~~iA~lp~~~~~~~~---------~~~--~~---~~~~lL~~~PN------~~~t~~~f~~~~~~~lll~Gna~~~ 121 (422) T protein:vir:13 62 KIRAESIGKLSLKIYKDKE---------EYK--EH---ELYYLLRYKPN------PLMSSINFWKCLETQRTLKGNAYAY 121 (422) T ss_pred HHHHHhhhhCceEEEecCc---------ccc--cc---hHHHHHhhhcc------cCCCHHHHHHHHHHHHhhcCCeEEE Confidence 9999999887776642211 000 01 12233333443 3578999999999999999999999 Q ss_pred EeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCC-----eEEEEEeecCCCccccCCccccce Q lcl|NC_019524. 164 CEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGA-----ALGYWLRKAFPGDPTDMEQWKWGY 238 (556) Q Consensus 164 ~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr-----~vaY~i~~~hpgd~~~~~~~~~~r 238 (556) +.+... ..+..|..|.|+.|.. ..|.+|. .+.|.+...+ | T Consensus 122 i~r~~~--------G~~~~L~~i~~~~v~~--------------~~~~~~~~~~~~~~~y~~~~~~-g------------ 166 (422) T protein:vir:13 122 IERDRK--------GKIIGLYPINSDNVTK--------------IIDDDNFLSSLSKVWYVVTDKN-G------------ 166 (422) T ss_pred EEECCC--------CcEEEEEEECCcceEE--------------EEcCCcceeccceEEEEEEeCC-C------------ Confidence 875432 1367889999998852 2233332 2344433211 1 Q ss_pred eeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCccccccccccccccc Q lcl|NC_019524. 239 EPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGG 318 (556) Q Consensus 239 v~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~ 318 (556) ....++..+|||+......+...|+|++..+...+.......++...-.+=.+...++|+.+..-. ++ T Consensus 167 --~~~~~~~~eiih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~----------~e 234 (422) T protein:vir:13 167 --KEHKLLPDEMLHFIGDITLDGLIGIKPLDYLRCTIENGRATQEFINKFFKNGLSIKGIVQYVGDLD----------EK 234 (422) T ss_pred --eEEEEcccceEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCCC----------HH Confidence 112467789999998878889999999999998888877777777777777788889988653210 01 Q ss_pred ccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhccc Q lcl|NC_019524. 
319 FKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTN 398 (556) Q Consensus 319 ~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~n 398 (556) .......... ....+. =..|.+..|+.|.+++.++.+....+|.+..+.....||..+|||...| ++..+.| T Consensus 235 ~~~~~~~~~~---~~~~g~----~n~~~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVpp~~l-g~~~~~~ 306 (422) T protein:vir:13 235 AKKIFKKEFE---SMSNGL----ENAHSISLLPFGYQFQPISLSMADAQFLENSKLTKRELAATFGMKSYHL-NDLERAT 306 (422) T ss_pred HHHHHHHHHH---HHhcCc----cccCCceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHh-CCCCCCC Confidence 1111111100 000000 1246778899999999999887788888999999999999999999755 6777889 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchh Q lcl|NC_019524. 399 YSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEK 478 (556) Q Consensus 399 YSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~ 478 (556) ||++.+....+. ..-+.|+.+. ++.++....++-...... ..+++..-.....|+. T Consensus 307 ~sn~e~~~~~f~-----------~~~l~P~~~~-ie~~l~~~Ll~~~~~~~g------------~~i~fd~~~l~r~d~~ 362 (422) T protein:vir:13 307 FNNLTEQQKDFY-----------VTTLQSSLTV-YEQEIQDKLFSQYETLQD------------VKAEFNVDTILRSDIK 362 (422) T ss_pred cccHHHHHHHHH-----------HHHHHHHHHH-HHHHHHHhhCChhhhcCC------------ceEEeechhhhcCCHH Confidence 999766555543 3334554443 233333333321110000 0112222223334888 Q ss_pred hhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCC Q lcl|NC_019524. 479 KETEAAILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPN 551 (556) Q Consensus 479 Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~ 551 (556) ..+++..+++++|+.|+-|+-+..|..|-+-.+++ +.-.++ .+..... 
.+...+.+++++ T Consensus 363 ~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD~~------~~~~n~----~~l~~~~---~~~~~~g~~~g~ 422 (422) T protein:vir:13 363 TRYEAYRIGIQGGFIEANEARRRENLPPVEGGDRL------LVNGNM----IPIEMAG---EQYKKGGEKGGK 422 (422) T ss_pred HHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCee------eeccCc----cchhhcc---cccccCCCcCCC Confidence 89999999999999999999988898885421111 111111 1111111 111111111111 No 42 >protein:vir:96980 Length: 409 # NCBI annotation: ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239857;genbank:gi:66395516;genbank:GeneID:5133013 Probab=99.68 E-value=1.7e-16 Score=106.99 Aligned_cols=408 Identities=10% Similarity=0.025 Sum_probs=223.5 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAA 80 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~ 80 (556) |+=++=..|....-- ..+.+-+- ++ .-+|.+.....- ..-+++.+..++.+. T Consensus 1 ~~~~~~~~~~k~~~~----------------~~~~~~~~-~~-~~~~~~~~~~~~----------~~v~~~~a~~~~~V~ 52 (409) T protein:vir:96 1 MAKENIVTRIKKKLI----------------DNWIDQSA-SK-LYDFSPWKNKSF----------WGVINNTLETNETIF 52 (409) T ss_pred CccccchhhhhhHHh----------------hhhhcccc-cc-ccccccccCccc----------cccchhhHhhhHHHH Confidence 555544444332110 01111111 11 112222111000 011233455677789 Q ss_pred HHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCce Q lcl|NC_019524. 81 GVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEV 160 (556) Q Consensus 81 ~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~ 160 (556) ++|+.|++.|-.--|++.-+.+ .. . 
..++..+..+|+ -.+|..++....+..++..|++ T Consensus 53 ~ci~~ia~~ia~lp~~~~~~~~--------~~----~---~~l~~lL~~~PN------~~~t~~~f~~~~~~~lll~Gna 111 (409) T protein:vir:96 53 SAITKLSNSMASLPLKMYEDYK--------VV----N---TEVSDLLTVSPN------NSLSSFDFINQIETIRNEKGNA 111 (409) T ss_pred HHHHHHHHhhhhCceEEeeccc--------cc----c---hhHHHHHhhhcc------cCCCHHHHHHHHHHHHhhcCce Confidence 9999988888775555532211 00 1 123334444554 3468888888999999999999 Q ss_pred EEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceee Q lcl|NC_019524. 161 LATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEP 240 (556) Q Consensus 161 f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~ 240 (556) |+.+.+.. ...+..|..|.|+.+ -|..|.++..+.|.+...+. T Consensus 112 y~~i~r~~--------~G~~~~L~~l~~~~v--------------~v~~~~~~~~~~y~~~~~~g--------------- 154 (409) T protein:vir:96 112 YVLIERDI--------YHQPSKLFLLNPDVV--------------EMLIENQSRELYYSIHAATG--------------- 154 (409) T ss_pred EEEEEECC--------CCcEEEEEEEcCcee--------------EEEEeCCCcEEEEEEEcCCc--------------- Confidence 99876432 123567888998888 35667788888888753321 Q ss_pred ccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCccccccccccccccccc Q lcl|NC_019524. 241 ARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFK 320 (556) Q Consensus 241 ~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~ 320 (556) ....++.++|||+...-..+...|+|++..+...+.-...... ....-.+.-.++|.+.... ..++.. 
T Consensus 155 ~~~~~~~~evih~r~~~~~~~~~G~s~l~~~~~~i~~~~~~~~---~~~~~~~~~~~~i~~~~~~---------l~~e~~ 222 (409) T protein:vir:96 155 NKLIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRT---FNLTEMQKPDSFMLKYGSN---------VSTEKR 222 (409) T ss_pred eEEEEccccEEEeCCCCCCCccccccHHHHHHHHHHHHHHHHH---HHHHhcCCCceeEEecCCC---------CCHHHH Confidence 1124677899999876677889999988765444332222211 1222222222222221111 011111 Q ss_pred ccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccch Q lcl|NC_019524. 321 EIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYS 400 (556) Q Consensus 321 ~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYS 400 (556) +....... ... =..|.+..|..|.+++.++.+.-..+|.+..+....+||+.+|||-+.| ++..++||| T Consensus 223 ~~~~~~~~---~~~-------~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~~~s 291 (409) T protein:vir:96 223 QQVLEDFK---QYY-------EENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSIFL-NARSNTNFA 291 (409) T ss_pred HHHHHHHH---HHh-------hcCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHh-CCCCCCCcc Confidence 11111000 000 1356788999999999998877777888888999999999999999866 777788999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhh Q lcl|NC_019524. 401 SARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKE 480 (556) Q Consensus 401 s~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke 480 (556) ++.+....| +..-+.|+.+.+ +.++....++-..... 
-..+++-.....-.|+..- T Consensus 292 ~~e~~~~~f-----------~~~~l~P~~~~i-e~~l~~~Ll~~~~~~~------------g~~i~fd~~~ll~~d~~~~ 347 (409) T protein:vir:96 292 KNEELNRFY-----------LQHTLLPIVKQY-EEEFNRKLLTKTDREK------------NRYFKFNVKSYLRADSATQ 347 (409) T ss_pred cHHHHHHHH-----------HHHHHHHHHHHH-HHHHHhhcCCcccccC------------cceEEeechhhhccCHHHH Confidence 986555444 444567766654 3445554443211100 0112333333444588989 Q ss_pred hHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 481 TEAAILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 481 ~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 556 (556) +++....+++|+.|+-|+-+..|..|-+--++ +=++....+....... +. ..+.+++.+++ T Consensus 348 ~e~~~~~~~~G~~T~NE~R~~~g~~pi~ggD~----------~~~~~n~~~~~~~~~~--~~---~~~gG~~n~~e 408 (409) T protein:vir:96 348 AEVYFKAVRSGYYTINDIREWEDLPPVEGGDK----------PLISGDLYPIDTPLEL--RK---SLKGGDKNVNE 408 (409) T ss_pred HHHHHHHHhCCCCCHHHHHHHhCCCCCCCcce----------eeecccccccccchhh--cc---cccCCCCCcCC Confidence 99999999999999999988889887542111 1011111111111000 01 11111111112 No 43 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=99.68 E-value=1.8e-15 Score=101.39 Aligned_cols=466 Identities=13% Similarity=0.076 Sum_probs=230.3 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAA 80 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~ 80 (556) +.|+.-+.+....-... ..+. .....-|+|- . ....+. .....+ . 
+++-.-..|++ T Consensus 7 ~d~~~~i~~L~~~~~~~----~~r~--~~~~~Yy~g~-~---~i~~~~---~~~~~~----~-------~~~~~~~n~~~ 62 (488) T protein:vir:23 7 IDPEKLRDQLLDAFENK----QNEL--KSSKAYYDAE-R---RPDAIG---LAVPLD----M-------RKYLAHVGYPR 62 (488) T ss_pred CCHHHHHHHHHHHHHHH----HHHH--HHHHHHHhcc-c---chhhcC---cccchh----h-------hhhhhhcchHH Confidence 55554434333221111 1111 1111224432 2 122111 111111 1 11112357889 Q ss_pred HHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCce Q lcl|NC_019524. 81 GVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEV 160 (556) Q Consensus 81 ~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~ 160 (556) -+|+.+++..+-.||+............+ +.+ ..+.+|+-|.+ .+|...+..+.+..++-|.+ T Consensus 63 ~ivd~~a~~l~~~Gf~~~~~~~~~~~~~~---d~~---~~~~l~~i~~~-----------N~~~~~~~~~~~~a~i~G~a 125 (488) T protein:vir:23 63 TYVDAIAERQELEGFRIPSANGEEPESGG---END---PASELWDWWQA-----------NNLDIEATLGHTDALIYGTA 125 (488) T ss_pred HHHHHHHHhhhccceeccCCccccccccc---chh---HHHHHHHHHHh-----------cChhHHHHHHHHHHhhcCce Confidence 99999999888888775432222111111 122 22334444432 26888999999999999999 Q ss_pred EEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEE---CCCCCeEEEEEeecCCCccccCCccccc Q lcl|NC_019524. 161 LATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQL---DNNGAALGYWLRKAFPGDPTDMEQWKWG 237 (556) Q Consensus 161 f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~---d~~Gr~vaY~i~~~hpgd~~~~~~~~~~ 237 (556) |+.+..........+....+ .|..++|..+-.-++.. .+.+..||.+ +..|....++++..+---.+......|. 
T Consensus 126 ~~~v~~~~~~~~~~~~~~~~-~i~~~~p~~~~~~~d~~-~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~ 203 (488) T protein:vir:23 126 YITISMPDPEVDFDVDPEVP-LIRVEPPTALYAEVDPR-TRKVLYAIRAIYGADGNEIVSATLYLPDTTMTWLRAEGEWE 203 (488) T ss_pred EEEEecCCcccccCCCCCcc-eEEEeccceeEEEEecC-CCceEEEEEEEEecCCCcEEEEEEEecCcEEEEEecCCceE Confidence 98865443222222222333 58888998875434322 2346666653 3445555555544321111111222333 Q ss_pred eeecc-ccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhccee---eeEeccCcccccccccc Q lcl|NC_019524. 238 YEPAR-FDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYA---ASVESELPSDVVFGQLG 313 (556) Q Consensus 238 rv~~~-~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~---~fi~~~~~~~~~~~~~~ 313 (556) -+... ..++.-=|+|+....+.+-.-|+|.|.+.+..| ++.|..+....+..+-.++ .+|+--..+... T Consensus 204 ~~~~~~h~~g~vPvv~f~n~~~~~~~~G~s~i~~~v~~l--~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~----- 276 (488) T protein:vir:23 204 APTSTPHGLEMVPVIPISNRTRLSDLYGTSEISPELRSV--TDAAAQILMNMQGTANLMAIPQRLIFGAKPEELG----- 276 (488) T ss_pred eccccccCCCCcceEEeccccccCCcCCccchhhhHHHH--HHHHHHHHHHHHHHHHHhhhHHHHHhCCCccccc----- Confidence 22111 234444578888777888889999999755544 3455555555554443333 222211000000 Q ss_pred cccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCC-CCccHHHHHHHHHHHHHHhcCCCHHHhhc Q lcl|NC_019524. 314 MGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGT-PGGVGTDYEQSLLRNIAASLGMSYEQFSR 392 (556) Q Consensus 314 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~-p~~~f~~F~~~~lr~iaaglGi~ye~l~~ 392 (556) .....+.......+|.+..+++|+++++.+.+. 
+-.+|...++..++.|++..++|-+.|.+ T Consensus 277 -----------------~~~~~~~~~~~~~~~~v~~~~~g~~~~~~q~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~ 339 (488) T protein:vir:23 277 -----------------INAETGQRMFDAYMARILAFEGGEGAHAEQFSAAELRNFVDALDALDRKAASYSGLPPQYLSS 339 (488) T ss_pred -----------------ccccccchhhhhhhhhhccCCCCCCceeEecCCCChHHHHHHHHHHHHHHhcccCCCHHHhcc Confidence 001112233345678888999999988876433 33567777788899999999999998866 Q ss_pred hhhcccchhH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeec Q lcl|NC_019524. 393 DYTKTNYSSA---RASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIG 469 (556) Q Consensus 393 D~s~~nYSs~---R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~ 469 (556) +. .|-+|+ |..+.......+..+..|-..+-+ +++..+ +++.| ...+.- ..-+.+.|.. T Consensus 340 ~~--~n~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~-~~~l~~--~~~~~-~~~~~~------------~~~i~v~f~~ 401 (488) T protein:vir:23 340 SS--DNPASAEAIKAAESRLVKKVERKNKIFGGAWEQ-AMRLAY--KMVKG-GDIPTE------------YYRMETVWRD 401 (488) T ss_pred cc--CcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHH--HHhcC-CCcchh------------hccceEEecC Confidence 54 354544 555555555555555555554433 333222 33333 222210 1125678987 Q ss_pred CcccccchhhhhHHHHHHHHcC--CCCHHHHHHHhCCCHH--HHHHHHHHHHHHHHHcCCCCCcc-ccccCCCCCCCCCC Q lcl|NC_019524. 470 ASRGQIDEKKETEAAILRIKNG--LSTYEAEISRLGGDFR--EVFKQRAREEGLIKSLKLDFTGK-MVEGNSTQSSNSSE 544 (556) Q Consensus 470 p~~~~iDP~Ke~~A~~~~i~~G--~~s~~~~~ae~G~D~e--~v~~q~a~E~~~~~~~Gl~~~~~-~~~~~~~~~~~~~~ 544 (556) |... +-...+.+..+.+.+| +.|.+..+...|...+ +.++++.++.+ ....+.. +.- ............+. 
T Consensus 402 ~~~~--s~~~~ada~~kl~~~g~~~~s~et~~~~l~~~~d~~~~~~~~~~~~~-~~~~~~~-~~~~~~~~~~~~~~~~~~ 477 (488) T protein:vir:23 402 PSTP--TYAAKADAAAKLFANGAGLIPRERGWVDMGYTIVEREQMRQWLEQDQ-KQGLGLI-GSLYGASTPEGKPGEAPV 477 (488) T ss_pred CCCC--CHHHHHHHHHHHHhcccccCCHHHHHHhCCCCchHHHHHHHHHHHHH-HHHHHHH-HHHhccCCCcccCCCCCC Confidence 7766 4444555666666655 7888888888896443 33333322221 1111110 000 00001111111111 Q ss_pred CCCCCCCCcCC Q lcl|NC_019524. 545 STSDNPNEETT 555 (556) Q Consensus 545 ~~~~~~~~e~~ 555 (556) ++.++++.... T Consensus 478 ~~~~~~e~~~a 488 (488) T protein:vir:23 478 GEPPAPEPDAA 488 (488) T ss_pred CCCCCCCCCCC Confidence 12222222222 No 44 >protein:vir:105064 Length: 421 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006584;genbank:gi:46402090;genbank:GeneID:2777930 Probab=99.68 E-value=3.7e-17 Score=110.66 Aligned_cols=414 Identities=10% Similarity=0.011 Sum_probs=229.0 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchh---ccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcCh Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGM---EGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDG 77 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y---~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~ 77 (556) +++.-..+. +++ . ....+ .++.... .. ...... +.+.+..++ T Consensus 3 ~~~~~~~~~----~~~--s----------~~~~w~~~~~~~~~~-~~----~~g~~v--------------t~~~al~~~ 47 (421) T protein:vir:10 3 IPQMFEGKK----RSV--S----------GGGFWEAMLGGVRSS-HS----KAGVMI--------------TPETALALS 47 (421) T ss_pred Ccchhcccc----ccc--C----------cchhhHHHhhhhccC-cc----cCCcee--------------chHHhhccH Confidence 222111110 000 0 00000 0000000 00 000000 112345778 Q ss_pred HHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHH--HHHHHHhcccccceehhcccCHHHHHHHHhhhhe Q lcl|NC_019524. 
78 YAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVE--ARFNMAAESPENWFDARRMCTLTGLTRLAVSGFL 155 (556) Q Consensus 78 ~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie--~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~ 155 (556) -+..+|+.|.+.|-.--|++.-+.. ++.. ..+. .++..+..+|+ .-+|.+++....+..++ T Consensus 48 ~v~~~i~~Ia~~iA~lp~~~~~~~~------~g~~-----~~~~~~~l~~lL~~~PN------~~~t~~~f~~~~~~~ll 110 (421) T protein:vir:10 48 AVRACVTLLAESVAQLPVELYRRDK------NGGR-----QRATDHPIYDLIHSQPN------KKDTSFEYFEQQQGLLG 110 (421) T ss_pred HHHHHHHHHHHhhccCceEEEEEcC------CCce-----eecccchHHHHHhhccc------CCCCHHHHHHHHHHHHh Confidence 8999999999998876666532211 1110 0111 23444445555 34688999999999999 Q ss_pred ecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccc Q lcl|NC_019524. 156 MTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWK 235 (556) Q Consensus 156 ~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~ 235 (556) ..|++|+.+++.. ...+..|..|.|+.|.. ..|.+|.+. |.+. +.| T Consensus 111 l~Gna~~~i~r~~--------~G~~~~L~~l~~~~v~v--------------~~~~~g~~~-y~~~--~~g--------- 156 (421) T protein:vir:10 111 LEGNCYSIIDRDG--------KGYPKELIPINPKKVIV--------------LKGPDGMPY-YEIP--EIG--------- 156 (421) T ss_pred hcCCeEEEEEEcC--------CCcEEEEEEecCceEEE--------------EECCCceEE-EEEc--CCC--------- Confidence 9999999986532 12467888888888732 234556543 3331 111 Q ss_pred cceeeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccc Q lcl|NC_019524. 236 WGYEPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMG 315 (556) Q Consensus 236 ~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~ 315 (556) ..+|+++|||+... ..+...|+|++..+...+.......++.....+=.+...++|+.+..-... . 
T Consensus 157 -------~~~~~~eiih~~~~-~~d~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~------~ 222 (421) T protein:vir:10 157 -------ETLPMRMMHHVKVF-SLDGYIGSSPIQTNADVLGLNLAVEEHASAVFRRGATMSGVIERPKEAPAI------K 222 (421) T ss_pred -------cEEchhhEEEecCc-CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEEecCccCcc------C Confidence 13577899998765 456789999999888877776666666666666677788888865321100 0 Q ss_pred cccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhh Q lcl|NC_019524. 316 QGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYT 395 (556) Q Consensus 316 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s 395 (556) ..+..+...... .....+ .-..|.+..|..|.+++.++.+....+|.+..+.....||..+|||-+.| ++.+ T Consensus 223 ~~e~~~~~~~~~---~~~~~g----~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~ 294 (421) T protein:vir:10 223 SQEKIDQLLAKW---TDRYSG----INNMFSVALLQEGMSYKQMSQDNEKAQLLQSRQWGVEEVCRLYKIPPHMV-QMLA 294 (421) T ss_pred CHHHHHHHHHHH---HHHhcC----ccccCcceecCCCceEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHc-CCCc Confidence 000000000000 000000 01246688899999999999887888888888999999999999998754 8888 Q ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCccccc Q lcl|NC_019524. 396 KTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQI 475 (556) Q Consensus 396 ~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~i 475 (556) ++|||++.+....| +..-+.|+... ++.++....++-..... . 
.+++-......- T Consensus 295 ~~t~sn~e~~~~~f-----------~~~tl~P~~~~-ie~~ln~kL~~~~~~~~-----------~--~v~fd~~~l~~~ 349 (421) T protein:vir:10 295 KATNNNIEHQGLQF-----------VMYTLLAWLKR-HEGALQRDLLLPSERRD-----------L--YIEFNVSGLLRG 349 (421) T ss_pred CCccccHHHHHHHH-----------HHHHHHHHHHH-HHHHHhhhccCccccCC-----------e--EEEEechhhhcc Confidence 99999865444433 45556675554 45555555443211111 0 122222232334 Q ss_pred chhhhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCcC Q lcl|NC_019524. 476 DEKKETEAAILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEET 554 (556) Q Consensus 476 DP~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e~ 554 (556) |+...+++..+.+++|+.|+-|+-+..|++|-+--+++- .-++......+ ..........++.+.|+.-.+| T Consensus 350 d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD~~~------~~~n~~~~~~~-~~~~~~~~~~~~~e~d~~~~~~ 421 (421) T protein:vir:10 350 DQKSRYESYALGRQWGWLSVNDIRRMENLPPIAGGDKYL------TPLNMVDSAQI-IPGDKKPTAQQMAEIDTILSRT 421 (421) T ss_pred CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceee------ecccccccccc-ccCCCCcccccCcccccccccC Confidence 899999999999999999999999999988764332211 11111110000 0011111111222222222222 No 45 >protein:vir:81218 Length: 423 # NCBI annotation: gp3, phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456733;genbank:gi:157168376;interpro:IPR006427;interpro:IPR006944;uniprot:Q9MBK2;genbank:GeneID:5580341 Probab=99.67 E-value=6.4e-17 Score=109.38 Aligned_cols=419 Identities=12% Similarity=0.012 Sum_probs=219.6 Q ss_pred chhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHH Q lcl|NC_019524. 4 VKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVV 83 (556) Q Consensus 4 ~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v 83 (556) |.=+.+..-+... ...+....-.+.. ..+...+.. 
..-..+.+.++|.+..+| T Consensus 1 Mg~~~~~~~~~~~--------------~~~~~~~~~~~~~---~~~~~~~~~----------~~~~~~~~~~~~~v~~~i 53 (423) T protein:vir:81 1 MGFLQKLGLAPSV--------------VATPEPIELVGPI---FESLKLSTK----------NMTVEQIWEDQPHLRTVT 53 (423) T ss_pred CchhHhhcccccc--------------ccCcccccccccc---ccccccccc----------hhhHHHHHHhhhHHHHHH Confidence 4444333211100 0000000000000 001111111 012345567999999999 Q ss_pred HHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHH--HHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceE Q lcl|NC_019524. 84 AVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVE--ARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVL 161 (556) Q Consensus 84 ~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie--~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f 161 (556) +.+.+.|-.--|++.-+.. ++.. +++. ..++ ...+|+ --+|..++.+..+..++..|++| T Consensus 54 ~~ia~~ia~lp~~~~~~~~------dg~~-----~~~~~~~~~~-ll~~PN------~~~t~~~f~~~~~~~l~l~Gna~ 115 (423) T protein:vir:81 54 TFIARNVASLQLQAFERVE------DGGR-----ERVREGHLAR-VCKLAN------SDMTMYDLLERTMFDLCLYDEFF 115 (423) T ss_pred HHHHHhHhhCceEEEEEec------CCce-----eeeccchHHH-HhhcCC------CCCCHHHHHHHHHHHHhhcCCeE Confidence 9999988876666532211 1110 1111 1222 223454 34699999999999999999999 Q ss_pred EEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceeec Q lcl|NC_019524. 162 ATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPA 241 (556) Q Consensus 162 ~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~ 241 (556) +.+.... .+ +..+..|.|..+.. +.--+..|.. ..+-|.+....+.++ . T Consensus 116 ~~i~rd~-~~--------~~~~~~l~p~~~~~---------v~~~~~~~~~-~~~~Y~~~~~~~~~g------------~ 164 (423) T protein:vir:81 116 WLLPGDL-GV--------DTPTLDIRPIPVSW---------VQRRAYKDGW-GSLDYIIIESGDNDG------------R 164 (423) T ss_pred EEEEecC-Cc--------CcceEEEeecccce---------eeeeeccCCC-cceEEEEEEecCCCc------------e Confidence 8865321 11 11222333332221 1000111222 335677655444321 1 Q ss_pred cccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccc Q lcl|NC_019524. 
242 RFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKE 321 (556) Q Consensus 242 ~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~ 321 (556) ...+++++|||+..+...+..+|+|++..+...+.....-.++...-.+=.+...++|+.+..... +....+..+ T Consensus 165 ~~~~~~~evih~r~~~~~~~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gvi~~~~~~~~-----~~l~~e~~~ 239 (423) T protein:vir:81 165 SVKVPGERVIHRHGYNPKTMKRGKSPVQSLRDILGEQIEAAIFRAQMWRNGPRPGMVIMRDPESKA-----GKWDAESRT 239 (423) T ss_pred EEEEcccceEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcccC-----ccCCHHHHH Confidence 234778899999988888889999999888877766666666655555556777788876533211 000010000 Q ss_pred cccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccchh Q lcl|NC_019524. 322 IFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSS 401 (556) Q Consensus 322 ~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs 401 (556) ...+.. ..... ...=..|.+..|..|.+++.++.+.-..+|.+..+....+||..+|||-+.| |+.++++||+ T Consensus 240 ~~~~~~---~~~~~---~~~~n~g~~~vl~~g~~~~~l~~s~~d~q~~e~~~~~~~eIa~~fgVPp~~l-g~~~~~t~sn 312 (423) T protein:vir:81 240 RFMANL---RASFS---PKSSDVGGTLLLEDGMKAENFHTTSKDEQTVETTKLSLQTVAQVYGINPTMV-GQLDNANYSN 312 (423) T ss_pred HHHHHH---HHHhc---cccccCCcceecCCCceEEeccCChhhHHHHHHHHhhHHHHHHHhCCCHHHh-cCCCCCCccc Confidence 000000 00000 0011357788899999999998776677888888999999999999998755 8999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhh Q lcl|NC_019524. 402 ARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKET 481 (556) Q Consensus 402 ~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~ 481 (556) +.+....|.. .-+.|+... ++.++....++--. .... . 
+.+++-.....--|+...+ T Consensus 313 ~e~~~~~f~~-----------~~L~P~~~~-ie~~l~~~L~~~~~-~~~~--------~--~~~~fd~~~llr~d~~~r~ 369 (423) T protein:vir:81 313 VREFRKALYG-----------DNLGSWIRI-IQDVMNLFLLPRVG-IDNE--------K--FYFEFNLEEKLRASFEEAA 369 (423) T ss_pred HHHHHHHHHH-----------HHHHHHHHH-HHHHHhhhhcCccc-cccC--------c--cEEEecchhhhccCHHHHH Confidence 7665555543 345565554 34444443332110 0000 0 0012222222334888888 Q ss_pred HHHHHHHH-cCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 482 EAAILRIK-NGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 482 ~A~~~~i~-~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 556 (556) ++..++|. .|+.|+-|+-+..|..|-+--+ ++=.+. +....+.+++..|+++ T Consensus 370 ~~~~~~l~~~G~~T~NE~R~~~gl~p~~gGD----------~~~~p~-------------n~~~~~~~~~~~~~~~ 422 (423) T protein:vir:81 370 EIKRAAVGNVAWMTINEVRAMDNLPSIDGGD----------DLARPL-------------NTEFGDSEDAPGEEVE 422 (423) T ss_pred HHHHHHHhCCCCcCHHHHHHHhCCCCCCCcc----------eeeccc-------------ccccCccCCCCCCCCC Confidence 88888885 5999999988877777654111 110110 0111111111111111 No 46 >protein:vir:100150 Length: 437 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945033;genbank:gi:38707893;genbank:GeneID:2744197 Probab=99.67 E-value=7.8e-17 Score=108.89 Aligned_cols=428 Identities=11% Similarity=0.018 Sum_probs=227.8 Q ss_pred chhhhHHHHHhhHhhcccchhhhhhhhcchhcccc--CCCccc-ccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHH Q lcl|NC_019524. 4 VKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAE--RTTREM-FQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAA 80 (556) Q Consensus 4 ~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~--~~~r~~-~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~ 80 (556) ||-.+ .+.+...... ...|.+.. .++... ..|.....+... .-+.+-+..++.+. 
T Consensus 1 ~~~~~----~~~~~~~~~~--------~~~~~g~~~s~~~~~~~~~~~~~~~~~g~----------~v~~~~al~~~~v~ 58 (437) T protein:vir:10 1 MKQGK----QRALGRIKSS--------FLKWLGVPISLTDGSFWSAWGGMGSSSGE----------TVTADSALQLSAVW 58 (437) T ss_pred CCcch----hhhhhhhHHh--------hhhhcCCcccCCchhHHHhhcccccCCCc----------eechHhhhccHHHH Confidence 43211 1111111000 01111110 001100 011100000000 01122234668889 Q ss_pred HHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCce Q lcl|NC_019524. 81 GVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEV 160 (556) Q Consensus 81 ~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~ 160 (556) .+|+.|.+.|-.--|++.-+.. ++..... .-..++..+..+|+ ..+|.+++....+..++..|++ T Consensus 59 ~ci~~Ia~~ia~lp~~~~~~~~------~g~~~~~---~~~~l~~lL~~~PN------~~~t~~~f~~~~~~~lll~Gna 123 (437) T protein:vir:10 59 SCVRLIAETIATLPLNLYQTKP------DGTRVLA---KQHRLYTVIHSQPN------AENTAAEFWEVIVASMLLWGNG 123 (437) T ss_pred HHHHHHHHHHhhCceeEEEEcC------CCceeec---cccHHHHHhhccCC------cCCCHHHHHHHHHHHHhhcCCe Confidence 9999999998876666533211 1100000 01123344444554 3568999999999999999999 Q ss_pred EEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceee Q lcl|NC_019524. 161 LATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEP 240 (556) Q Consensus 161 f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~ 240 (556) |+.+++.. | .+..|..|.|+.+.. +.+..|.. .|++.... | T Consensus 124 y~~i~r~~-------g--~~~~L~~l~p~~v~i--------------~~~~~g~~-~y~~~~~~-g-------------- 164 (437) T protein:vir:10 124 YARKLRSA-------G--VLIGLELMLPQRTTV--------------KRLTSGAL-QYTYRNVD-G-------------- 164 (437) T ss_pred EEEEEecC-------C--cEEEEEEEcCcceEE--------------EECCCCeE-EEEEEecC-c-------------- Confidence 99876431 1 356788899988742 22344443 35543221 1 Q ss_pred ccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCccccccccccccccccc Q lcl|NC_019524. 
241 ARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFK 320 (556) Q Consensus 241 ~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~ 320 (556) ....++.++|||+...- .+...|+|++..+...+.......+....-.+=.+...++|+.+..-. .+.. T Consensus 165 ~~~~~~~~dIih~r~~~-~d~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~----------~e~~ 233 (437) T protein:vir:10 165 TVSTLAEDDVFHVRGFS-LDGLMGLTPIQYAREVLGNSTAANKTSASVFRNGLRPSGVLSTDQILQ----------KEKR 233 (437) T ss_pred eEEEEccccEEEecCcC-CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCC----------HHHH Confidence 01246788999997654 566899999988877776666666666665566677888888653211 0111 Q ss_pred ccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccc- Q lcl|NC_019524. 321 EIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNY- 399 (556) Q Consensus 321 ~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nY- 399 (556) ....... .....+ .-..|.+..|..|.+++.++.+.-..+|.+-.+...+.||..+|||.+.| |+.+++|| T Consensus 234 ~~~~~~~---~~~~~g----~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~t~~ 305 (437) T protein:vir:10 234 AEIRTDL---AEQFGG----AMQAGKTMVLEAGMKYQAITMNPGDVQLLETRAFNIEEICRWYRVPPFMV-GHSEKSTSW 305 (437) T ss_pred HHHHHHH---HHHhcC----ccccCcceeccCCceEEeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHh-CCCCCcccc Confidence 1100000 000000 12357788899999999999887778888888999999999999999866 88877665 Q ss_pred -hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchh Q lcl|NC_019524. 400 -SSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEK 478 (556) Q Consensus 400 -Ss~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~ 478 (556) |.+.+.... |+..-++|+... ++.++....++ +..... .+ +++-.-..--.|+. 
T Consensus 306 ~sn~e~~~~~-----------f~~~tl~P~~~~-ie~~l~~kll~-~~e~~~----------~~--~~fd~~~ll~~d~~ 360 (437) T protein:vir:10 306 GTGIEQQTLG-----------FLTFTLRPWLTR-IEQAARRSLLR-PGERDQ----------FY--AEFSVEGLLRADSA 360 (437) T ss_pred cchHHHHHHH-----------HHHHHHHHHHHH-HHHHHHhhccC-ccccCc----------eE--EEEechhhhccCHH Confidence 554444333 455556776653 55566555543 211110 01 22222233335888 Q ss_pred hhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHH---HHHHHHHcCCCCCccccccCCCCCCC-CCCCCCCCCCCcC Q lcl|NC_019524. 479 KETEAAILRIKNGLSTYEAEISRLGGDFREVFKQRAR---EEGLIKSLKLDFTGKMVEGNSTQSSN-SSESTSDNPNEET 554 (556) Q Consensus 479 Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~---E~~~~~~~Gl~~~~~~~~~~~~~~~~-~~~~~~~~~~~e~ 554 (556) ..+++....+++|+.|+-|+-+..|+.|-+--++... -..-+++.|-..+ ........ ..+...+++..+. T Consensus 361 ~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~gg~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~ 435 (437) T protein:vir:10 361 GRAAFYSTMTQNGLMTRDECRAKENLPPMGGNAAVLTVQSALLPIDKLGEHTT-----ATAAQDALKAWLYQEEKTRATQ 435 (437) T ss_pred HHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCcceEeecCcccchhhccCcCC-----CcchhccccccCCCCCCCCccc Confidence 9999999999999999999999999887542211100 0001111111100 00011111 1111122222222 Q ss_pred CC Q lcl|NC_019524. 555 TQ 556 (556) Q Consensus 555 ~~ 556 (556) |+ T Consensus 436 e~ 437 (437) T protein:vir:10 436 ER 437 (437) T ss_pred cC Confidence 23 No 47 >protein:vir:1431 Length: 419 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536360;genbank:gi:17975165;genbank:GeneID:929165 Probab=99.67 E-value=8.7e-17 Score=108.63 Aligned_cols=413 Identities=8% Similarity=-0.018 Sum_probs=225.9 Q ss_pred HHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHhh Q lcl|NC_019524. 
10 TRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVVAVHRDS 89 (556) Q Consensus 10 ~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v~~~~~n 89 (556) +.-.+... .....+.-...+|.....+.... .. .+.-+.+-+..++.+.++|+.|.+. T Consensus 1 ~~~~r~~~-----------------~~~~~~~~~~~~~~~~~~g~~~s--~~---~~~vt~~~al~~~~v~~~v~~ia~~ 58 (419) T protein:vir:14 1 MFFSRQLL-----------------SNLGQTQMSAGGWVSALLGSSRS--DS---GQVVTPASALALTVLQNCVTLLAES 58 (419) T ss_pred Cccccccc-----------------ccccccccCcchhhHHhhcCCCc--cC---CcccchHHhhccHHHHHHHHHHHHh Confidence 11000000 00000000111221100000000 00 0001112234577789999999998 Q ss_pred hccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEEEeeccC Q lcl|NC_019524. 90 IVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLATCEWLNP 169 (556) Q Consensus 90 vVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~ 169 (556) |-..-|++.-+.+. ..++. .=..+++.+..+|+ .-+|..++.+..+..++..|++|+.+++... T Consensus 59 iA~lp~~~~~~~~~-----~~~~~-----~~~~l~~lL~~~PN------~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~ 122 (419) T protein:vir:14 59 IAQLPIELYERSGE-----DRKPA-----TDHPLYSILKYEPN------SWQTPFEYQEQSQVAVGLRGNSYSFIDRDSD 122 (419) T ss_pred hccCceEEEEecCC-----ccccc-----cccHHHHHHHhhcc------cCCCHHHHHHHHHHHHhhcCCeEEEEEECCC Confidence 88765655432211 00000 00123344445564 3578899999999999999999999865321 Q ss_pred CCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceeeccccCChhH Q lcl|NC_019524. 170 TGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPARFDWGRRR 249 (556) Q Consensus 170 ~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~~~~v~a~~ 249 (556) | .+..|..|.|++|.. ..|.+|+++ |.+... 
..++.++ T Consensus 123 ------G--~~~~l~pl~~~~v~v--------------~~~~~~~~~-y~~~~~-------------------~~~~~~~ 160 (419) T protein:vir:14 123 ------G--VIQGLYPLDNEAVTV--------------MRGSDLKPV-YRVRGS-------------------DPMPQRL 160 (419) T ss_pred ------C--cEEEEEEecCceEEE--------------EECCCceEE-EEEccC-------------------cccchhh Confidence 1 356788999988842 334455443 443211 1356778 Q ss_pred eEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccccccccc Q lcl|NC_019524. 250 VIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTG 329 (556) Q Consensus 250 viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 329 (556) |+|+...- .+...|+|++..+...+.......+......+=.+...++|+.+...... ..++..+.+... T Consensus 161 i~h~~~~~-~dg~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~------~~~~~~~~~~~~--- 230 (419) T protein:vir:14 161 VHHVRWMS-INGYTGLSPVLLHANAIGHAQAIQQYAGKSFMNGTALSGVIERPKDAPAL------KDQASVDRITDG--- 230 (419) T ss_pred eeEecCcC-CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEecCCCCcc------cCHHHHHHHHHH--- Confidence 99987654 45689999999888877776666666666666677888898865321100 000001100000 Q ss_pred cccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccchhHHHHHHHH Q lcl|NC_019524. 
330 LANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSARASMAET 409 (556) Q Consensus 330 ~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R~~~~e~ 409 (556) ........-..|.+..|..|.+++.++.+.-..+|.+..+.....||+.+|||.+.| ++.++.|||++.+....| T Consensus 231 ----~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVpp~~l-g~~~~~t~s~~E~~~~~f 305 (419) T protein:vir:14 231 ----WNAKFGGSGNAKKVALLQEGMTFRPLSMTNVDAALIDALRLSALDIARIYKIPAHMV-NELERATFSNIEHQSLQF 305 (419) T ss_pred ----HHHHhcCccccCCceecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHh-cCCCCCCcccHHHHHHHH Confidence 000000011247788899999999998776677788888999999999999999865 778888999876555443 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHH Q lcl|NC_019524. 410 QKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIK 489 (556) Q Consensus 410 ~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~ 489 (556) +...+.|+..+ ++.++....++-..... ..++|-.......|+...+++..+.++ T Consensus 306 -----------~~~~L~P~~~~-ie~~l~~kll~~~~~~~-------------~~i~fd~~~l~r~d~~~~~~~~~~~~~ 360 (419) T protein:vir:14 306 -----------VIYTLLPWVKR-HEQAKTRDLLLPSERKQ-------------YFIEYNLAGLLRGDQSSRYAAYAVGRQ 360 (419) T ss_pred -----------HHHHHHHHHHH-HHHHHhhhccCccccCC-------------eEEEEechhhhccCHHHHHHHHHHHHh Confidence 44456665553 45556555442211100 112333333344599999999999999 Q ss_pred cCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCC---CCCCCCcCCC Q lcl|NC_019524. 490 NGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSEST---SDNPNEETTQ 556 (556) Q Consensus 490 ~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~---~~~~~~e~~~ 556 (556) +|+.|+-|+-+..|..|-+--++ +=++....+ ...+.+.+... 
.....+|..+ T Consensus 361 ~G~~T~NE~R~~~gl~p~~gGD~----------~~~~~n~~~----~~~~~~~~~~~~~~~~~~~~e~~~ 416 (419) T protein:vir:14 361 WGWLSINDIRRLENMPPVKGGDI----------YLSPMNMVD----ASKPQQLPVGKSEPTKAAIDEIGR 416 (419) T ss_pred CCCcCHHHHHHHhCCCCCCCcCe----------eeecccccc----ccccccccCCCCCCccccccchhc Confidence 99999999998888887542111 101111000 01111111111 2222223333 No 48 >protein:vir:4509 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599035;genbank:gi:19548993;genbank:GeneID:935206 Probab=99.67 E-value=6.1e-16 Score=103.99 Aligned_cols=413 Identities=12% Similarity=0.043 Sum_probs=226.6 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHH--HHHHHHHHHHhcChH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQD--MASARAQDMVQNDGY 78 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~--~lr~RaRdl~rNn~~ 78 (556) +-|...+- .. ... +..-+...+. ...+.+.. ..... .-+.-+.+-+..++- T Consensus 10 ~~~~~~~~-~~-----~~l--------------f~~~~~~~~~------~~~~~~~~-~~~~~~~~~~~vs~~~al~~~~ 62 (424) T protein:vir:45 10 LWPEGGRV-LL-----DAL--------------FRSKSLENPS------TPITGDAV-DTDGLFRADVYVSPETAMKLAA 62 (424) T ss_pred ecCcchhH-HH-----Hhh--------------ccccCCCCCc------cccchhhh-hhhccccCCceechHHhhccHH Confidence 11111100 00 000 0000000000 00000000 00000 000123445666788 Q ss_pred HHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecC Q lcl|NC_019524. 79 AAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTG 158 (556) Q Consensus 79 a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dG 158 (556) +..+|+.|.+.|-+--|++.-+.+... +.. .. 
..++..+..+|+ .-+|.+++.+..+..++..| T Consensus 63 v~~cv~~Ia~~iA~lp~~v~~~~~~~~-----~~~--~~---~~l~~lL~~~PN------~~~t~~~f~~~~v~~lll~G 126 (424) T protein:vir:45 63 VYSCIYVLSSSLAQMPLHVMRRHKGKV-----EPA--RD---HPAFYLVHDEPN------TWQTSYKWRELKQRHILGWG 126 (424) T ss_pred HHHHHHHHHHHHhhCceEEEEecCCce-----eec--cc---chHHHHHHhhcc------cCCCHHHHHHHHHHHHhhcC Confidence 999999999999987666543222110 000 00 123444445554 35789999999999999999 Q ss_pred ceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccce Q lcl|NC_019524. 159 EVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGY 238 (556) Q Consensus 159 E~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~r 238 (556) ++|+.+.+... ..+..|..|.|+.+. +...+..+-|.+...+. T Consensus 127 na~~~i~r~~~--------G~~~~L~~l~~~~v~----------------i~~~~~~~~y~~~~~~~------------- 169 (424) T protein:vir:45 127 NGYTWVKRNRR--------GEVISLDCCMPWETT----------------LMNTGGRYTYGLYNEYG------------- 169 (424) T ss_pred CeEEEEEEcCC--------CcEEEEEEecCceEE----------------EEEcCCeEEEEEEecCc------------- Confidence 99998764321 235678888887763 11223345566653210 Q ss_pred eeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCccccccccccccccc Q lcl|NC_019524. 239 EPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGG 318 (556) Q Consensus 239 v~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~ 318 (556) ...+++++|||+... .+++..|+|++..+...+.....-+++...-.+=.+.-.++|+.+..-. 
.+ T Consensus 170 ---~~~~~~~eVih~r~~-~~d~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~----------~e 235 (424) T protein:vir:45 170 ---AFAISPDDMIHIRAL-GNNQKMGLSPIMQHAETIGMGMSGQKYTESFFSGNARPAGIVSVKSGLN----------KE 235 (424) T ss_pred ---eEEECcccEEEecCc-CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCCC----------HH Confidence 113677899999865 4688999999998877776666666655555566677788888653210 11 Q ss_pred ccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhccc Q lcl|NC_019524. 319 FKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTN 398 (556) Q Consensus 319 ~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~n 398 (556) ..+....... ....+ ..=..|.+..|+.|.+++.++.+.-...|.+-.+....+||..+|||.+.| ++.++++ T Consensus 236 ~~~~~~~~~~---~~~~g---~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~t 308 (424) T protein:vir:45 236 SWGWLKDQWQ---KASQA---LRRQENKTMLLPADLDYKALTVSPVDAQIIDMMKLNRSMIAGIFNIPAHMI-NDLEKAT 308 (424) T ss_pred HHHHHHHHHH---HHhcc---ccccCCceeEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHh-CCCCCCC Confidence 1111101000 00000 001357788899999999998776677888999999999999999999855 7788889 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchh Q lcl|NC_019524. 399 YSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEK 478 (556) Q Consensus 399 YSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~ 478 (556) ||++.+.... |+..-+.|+.+.+ +.++....++-+..... ..+++..-.....|+. 
T Consensus 309 ~sn~eq~~~~-----------f~~~tL~P~~~~i-e~~ln~kLl~~~e~~~g------------~~i~fd~~~llr~d~~ 364 (424) T protein:vir:45 309 FSNISAQAIQ-----------FVRYTMMPWVTNW-EQELNRRLFTRAELAAG------------YYVRFNLTGLLRGTPQ 364 (424) T ss_pred cccHHHHHHH-----------HHHHHHHHHHHHH-HHHHHHhcCChhhhcCC------------cEEEeechhhhccCHH Confidence 9987554443 4555566766654 33454444432211000 0122222233335899 Q ss_pred hhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCc Q lcl|NC_019524. 479 KETEAAILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEE 553 (556) Q Consensus 479 Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e 553 (556) ..+++..+.+++|+.|+.|+-+..|.+|-+--+ ++=++. ...+......++..++.+++| T Consensus 365 ~r~~~~~~~~~~g~~T~NE~R~~~gl~pi~ggD----------~~~~~~-----n~~~~~~~~~~~~~~~~~~~~ 424 (424) T protein:vir:45 365 ERAQFYHFAITDGWMSRNEARAFEDMNPVEGLD----------EMLVSV-----NAANPAGDFKPPKNDEGKTNE 424 (424) T ss_pred HHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcc----------eeeecc-----cccccccccCCCCCCCCCCCC Confidence 999999999999999999988888888743111 110111 011111111112222222222 No 49 >protein:vir:80333 Length: 419 # NCBI annotation: gp4, phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111083;genbank:gi:134288632;genbank:GeneID:4960580 Probab=99.66 E-value=6.4e-17 Score=109.37 Aligned_cols=412 Identities=8% Similarity=-0.017 Sum_probs=226.4 Q ss_pred hcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHhhhccCCcee Q lcl|NC_019524. 18 VVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVVAVHRDSIVGSQYKL 97 (556) Q Consensus 18 ~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v~~~~~nvVG~Gi~~ 97 (556) +.-.....+. -+.+.....+|.....+.. .. 
...+.-+.+-+.+++.+.++|+.|.+.|-+--|++ T Consensus 1 m~~~~~~~~~---------~~~~~~~~~~~~~~~~g~~----~s-~~~~~v~~~~al~~~~v~~cv~~ia~~ia~lp~~~ 66 (419) T protein:vir:80 1 MFFSRQLLSN---------LGQTQPGSGGWVSALLGSA----RS-EAGQVVTPASALSLTVLQNCVTLLAESIAQLPVEL 66 (419) T ss_pred CCcccccccc---------cCcCCCCcchhhHHhhccc----cc-ccCcccChHHhhccHHHHHHHHHHHHhhccCceEE Confidence 1000000000 0000011112211000000 00 00000122234467889999999999998876665 Q ss_pred eeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEEEeeccCCCCcCCCc Q lcl|NC_019524. 98 NAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLATCEWLNPTGTTMQRR 177 (556) Q Consensus 98 ~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~~~~~~~ 177 (556) .-+... | .++. .=..++..+..+|+. -+|..++.+..+..++..|++|+.+.+... T Consensus 67 ~~~~~~---~--~~~~-----~~~~l~~lL~~~PN~------~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~-------- 122 (419) T protein:vir:80 67 YERSGD---D--RKPA-----TDHPLYSILKYEPNP------WQTPFEYQEQSQVAVGLRGNSYSFIDRDQD-------- 122 (419) T ss_pred EEecCC---C--cccc-----cccHHHHHHHhhccc------CCCHHHHHHHHHHHHhhcCCeEEEEEECCC-------- Confidence 432211 0 0100 001233444455652 468889999999999999999998865321 Q ss_pred ccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceeeccccCChhHeEeeeccc Q lcl|NC_019524. 178 PFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPARFDWGRRRVIHIIEAL 257 (556) Q Consensus 178 ~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~~~~v~a~~viH~f~~~ 257 (556) ..+..|..|.|++|. |..|..|+++ |.+.. 
...++.++|+|+...- T Consensus 123 G~~~~L~~i~~~~v~--------------i~~~~~~~~~-y~~~~-------------------~~~~~~~~i~h~~~~~ 168 (419) T protein:vir:80 123 GVIQGLYPLDNEAVT--------------VMKGPDLKPM-YRVAG-------------------ADPLPQRLVHHVRWMS 168 (419) T ss_pred CcEEEEEEecCceEE--------------EEECCCceEE-EEEcC-------------------ccccchhheEEecCCC Confidence 235789999999985 2334445443 33310 1136778999988654 Q ss_pred CCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccccccccccccccccc Q lcl|NC_019524. 258 LAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQT 337 (556) Q Consensus 258 r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 337 (556) . +-..|+|++..+...+.......++...-.+=.+...++|+.+....... .++..+...+. .... T Consensus 169 ~-d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~------~~~~~~~~~~~-------~~~~ 234 (419) T protein:vir:80 169 I-NGYTGLSPVLLHANAIGHAQAIQQYAGKSFMNGTALSGVIERPTDAPALK------DQASVDRITDG-------WNAK 234 (419) T ss_pred C-CCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEEecCCCCccc------CHHHHHHHHHH-------HHHH Confidence 3 44899999999888887777777777766666788888888653221100 00000000000 0000 Q ss_pred cceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccchhHHHHHHHHHHHHHHHH Q lcl|NC_019524. 
338 KNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSARASMAETQKYMDSRK 417 (556) Q Consensus 338 ~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q 417 (556) ....-..|.+..|+.|.+++.++.+.-..+|.+..+....+||+.+|||.+.| ++.++.|||++.+....| T Consensus 235 ~~g~~n~g~~~vl~~g~~~~~l~~s~~d~q~~e~~~~~~~~Ia~~fgVPp~ll-g~~~~~t~~n~e~~~~~f-------- 305 (419) T protein:vir:80 235 FGGSGNAKKVALLQEGMKFKPLSMTNVDAALIDALRLSALDIARIYKIPAHMV-NELERATFSNIEHQSLQF-------- 305 (419) T ss_pred hcCccccCCceecCCCceEEeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHh-cCCCCCCcccHHHHHHHH-------- Confidence 00011347788899999999999777777889999999999999999999855 788889999986665554 Q ss_pred HHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHH Q lcl|NC_019524. 418 KLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEA 497 (556) Q Consensus 418 ~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~ 497 (556) +..-+.|+... ++.++.+..+.-..... ..++|-.......|+...+++..+.+++|+.|+-| T Consensus 306 ---~~~~l~P~~~~-ie~~l~~kll~~~~~~~-------------~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE 368 (419) T protein:vir:80 306 ---VIYTLLPWVKR-HEQAKTRDLLLPSERKQ-------------YFIEYNLAGLLRGDQSSRYAAYAVGRQWGWLSIND 368 (419) T ss_pred ---HHHHHHHHHHH-HHHHHhhhccCccccCC-------------eEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHH Confidence 33445665554 34455544442111100 01233223333459999999999999999999999 Q ss_pred HHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCC-CCCCCCCCCCCCCcCCC Q lcl|NC_019524. 498 EISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQS-SNSSESTSDNPNEETTQ 556 (556) Q Consensus 498 ~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~-~~~~~~~~~~~~~e~~~ 556 (556) +-+..|.+|-+--+++ =++.. ...... 
.+.+..+.++++.-.++ T Consensus 369 ~R~~~g~~p~~gGD~~----------~~~~n-----~~~~~~~~~~~~~~~~~~~~~~~~ 413 (419) T protein:vir:80 369 IRRLENMPPVKGGDIY----------LSPMN-----MVDASKPQPIPMGKTEPTKAALDE 413 (419) T ss_pred HHHHhCCCCCCCccee----------eeccc-----cccccccccccCCCCCchhhhHHH Confidence 9988888775422211 01100 000000 11111111111111111 No 50 >protein:vir:8418 Length: 409 # NCBI annotation: gp13 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818314;genbank:gi:29566750;genbank:GeneID:1260067 Probab=99.66 E-value=4.5e-16 Score=104.71 Aligned_cols=407 Identities=12% Similarity=0.024 Sum_probs=222.2 Q ss_pred chhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHH Q lcl|NC_019524. 4 VKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVV 83 (556) Q Consensus 4 ~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v 83 (556) |.=..|...+....+... .+.+.. . ....|...... -+.+-+..++.+..+| T Consensus 1 Mgl~~~~f~~~~~~~~~~-----------~~~~~~--~-~~~~~~~~g~~--------------v~~~~al~~~~v~~~v 52 (409) T protein:vir:84 1 MSLFTRIFSGPSEERTLT-----------KISGIP--S-PAEDWAMHGDR--------------PGANSAMTLGAFYACV 52 (409) T ss_pred CchhhhhhcCCCcccccc-----------cccccc--c-ccchhhccCcc--------------cchhhhhccHHHHHHH Confidence 433433332211110000 000000 0 00111111111 0122234578889999 Q ss_pred HHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEE Q lcl|NC_019524. 84 AVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLAT 163 (556) Q Consensus 84 ~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~ 163 (556) +.+.+.+-..-|++.-+.+.. ... ...+.+.+..+++ -.+|.+++.+..+..++..|++|+. 
T Consensus 53 ~~ia~~iA~lp~~~~~~~~~~-----~~~-------~~~l~~lL~~~PN------~~~t~~~f~~~l~~~l~l~Gn~~~~ 114 (409) T protein:vir:84 53 TLLADTVASLSIDAYRKKDNV-----RIP-------VSPAPKLLESTPY------PGLTWFDWLWMLMESLAVTGNAFGY 114 (409) T ss_pred HHHHHhhhhCceEEEEecCCc-----ccc-------cchHHHHhhccCC------CCCCHHHHHHHHHHHHhhcCCeEEE Confidence 999999998777654322210 000 1122233444554 3578999999999999999999988 Q ss_pred EeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceeeccc Q lcl|NC_019524. 164 CEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPARF 243 (556) Q Consensus 164 ~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~~~ 243 (556) +.+.... ..+..|..|+|+.+..-.. -|..++.+ |.++. .+ + . T Consensus 115 i~~~~~~-------g~~~~L~~l~p~~v~v~~~------------~~~~~~~~-~~~~~---~~-----g---------~ 157 (409) T protein:vir:84 115 ISARDEA-------NRPTAIMPIHPDCIHVTDA------------KDEDGDWI-EPVYR---ID-----G---------K 157 (409) T ss_pred EEEECCC-------CceEEEEEEcCceeEEEEc------------CCCcceEE-EEEec---CC-----c---------e Confidence 7554322 2356788899888742110 01112111 11111 11 0 1 Q ss_pred cCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccc Q lcl|NC_019524. 244 DWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIF 323 (556) Q Consensus 244 ~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~ 323 (556) .+++++|||+......|...|+|++..+...+.......+....-.+=.+...++|+.+..-. .+..+.. T Consensus 158 ~~~~~dvih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~----------~e~~~~~ 227 (409) T protein:vir:84 158 VVPNHRIMHIKRYPVAGCALGMSPIEKAASAIGLGLAAERYGLRWFRDSANPSGILSSDADLT----------PDQVKQT 227 (409) T ss_pred EEchhhEEEecCCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecCCCCC----------HHHHHHH Confidence 356789999988888888999999988887777666666666655555677788888653210 0111110 Q ss_pred cccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccc--hh Q lcl|NC_019524. 
324 NEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNY--SS 401 (556) Q Consensus 324 ~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nY--Ss 401 (556) .... .. ..-..|.+..|+.|.+++.++.+.-..+|.+..+.....||..+|||-+.| ++..+.|+ |+ T Consensus 228 ~~~~------~~----~~~n~g~~~vl~~g~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~~~~~sn 296 (409) T protein:vir:84 228 QKQW------IQ----SHHNRRLPAVMSAGIKWQSVSITPNESQFLETRSFQRSEIAMWFRIPPHMI-GDVEKSTSWGTG 296 (409) T ss_pred HHHH------HH----HhccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHh-CCCCCcccccch Confidence 0000 00 011356677899999999999876677888889999999999999998755 77777666 44 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhh Q lcl|NC_019524. 402 ARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKET 481 (556) Q Consensus 402 ~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~ 481 (556) +.+..+ .|+..-+.|+.+. ++.++.+ .| +.+.. +++-.-.....|+...+ T Consensus 297 ~e~~~~-----------~f~~~~l~P~~~~-ie~~l~~-~L--~~g~~---------------i~fd~~~l~~~d~~~~~ 346 (409) T protein:vir:84 297 IEEQGI-----------NFVRHTLLPWLRC-IEQALDT-FL--PRGQF---------------VKFNVDGLMRGDVTARF 346 (409) T ss_pred HHHHHH-----------HHHHHHHHHHHHH-HHHHHHH-hc--cCCCe---------------EEEechhhhccCHHHHH Confidence 333333 3345555666664 3434432 22 22211 12212223345999999 Q ss_pred HHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 482 EAAILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 482 ~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 556 (556) ++....+++|+.|+-|+-+..|+.|-+-- +++=++....+.... 
.+.++.++...+.+.+.++ T Consensus 347 ~~~~~~~~~G~~t~NE~R~~~g~~p~~gg----------D~~~~~~n~~~~~~~--~~~~~~~~~~~~~~~~gn~ 409 (409) T protein:vir:84 347 TAYQMGLQNGIWSVNEVRAWEDAPPIPEG----------DIHLQPMNFVPLGYV--PPEEPAQEPQPNSATEGNK 409 (409) T ss_pred HHHHHHHhCCCcCHHHHHHHhCCCCCCCc----------ceeeecccccccccC--CccccCcCCCCCCccCCCC Confidence 99999999999999999988898885321 111111111111111 1111111111112222333 No 51 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=99.66 E-value=2.6e-15 Score=100.50 Aligned_cols=416 Identities=11% Similarity=0.031 Sum_probs=215.3 Q ss_pred ccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHH Q lcl|NC_019524. 47 WNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNM 126 (556) Q Consensus 47 w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~ 126 (556) +-|.... +.++.+.|-++.| |++-+|+.+++.+++.||+.. + .+. ++.+ |+- T Consensus 1 ~l~~~~~---------~~~~~~~~~~v~n--~~~~ivd~~~~~l~~~gf~~~---d-------~~~----~~~~---~~i 52 (434) T protein:vir:98 1 MLPKNAE---------QAFLDFQRKARTN--FCGLIANASVHRLLALGVTGP---D-------GEP----DTRA---SRW 52 (434) T ss_pred CCCCCcc---------HHHHHhhhhhhcc--chHHHHHHHHhhhccCceecC---C-------Cch----HHHH---HHH Confidence 2222221 1222222333333 999999999999999998632 1 111 2223 333 Q ss_pred HhcccccceehhcccCHHHHHHHHhhhheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEE Q lcl|NC_019524. 127 AAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSG 206 (556) Q Consensus 127 w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~G 206 (556) |.. .+|...+..+++..++-|.+|+.. +..+.+...++.+.+ .|++++|..+-.-++... 
+++..+ T Consensus 53 ~~~-----------N~~d~~~~~~~~~a~i~G~ay~~v-~~~~~~~~~~~~~~~-~I~~~~p~~~~~i~D~~~-~~~~~a 118 (434) T protein:vir:98 53 WQA-----------NRLDSRQKLVWRMAMAQSAGYMLV-GAHPTRTEDNGRPSP-LITMEHPSECIVEYDPET-GEPLVG 118 (434) T ss_pred HHh-----------cChhHHHHHHHHHHhhcCceEEEE-ecCCCcccccCCcee-EEEEeccceeEEEEeCCC-CceEEE Confidence 322 257778889999999999999875 444444444555665 499999998864444332 346777 Q ss_pred EEE---CCCCCeEE--EE------Eee-cCCCccccCCccccce---eecc--ccCChhHeEeeecccCCCcccCCchhh Q lcl|NC_019524. 207 VQL---DNNGAALG--YW------LRK-AFPGDPTDMEQWKWGY---EPAR--FDWGRRRVIHIIEALLAGQTRGISEMV 269 (556) Q Consensus 207 IE~---d~~Gr~va--Y~------i~~-~hpgd~~~~~~~~~~r---v~~~--~~v~a~~viH~f~~~r~gQ~RGvs~la 269 (556) |.+ +..|...+ |+ +.. ...+.........|.. ++.. -..+.-=|+|+....+.+- .|+|.|. T Consensus 119 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~N~~~~~~-~g~sd~e 197 (434) T protein:vir:98 119 LKVWHNDIDGFGYARVFFDDTSFPYRTRERTGARLPWGPDSWVYTGTADSGDVHDLGGMQLVEFARMPDLGE-DPEPEFA 197 (434) T ss_pred EEEEEeccCCceEEEEEEeCcEEEEEEeeccccccccccccceecccccccccCCCCccceEEeccCCCcCc-CCcchhh Confidence 764 33343332 21 111 1111000001011100 0000 1233334778777766544 6999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccccccccccccccccccceecCCceeee Q lcl|NC_019524. 270 SALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPH 349 (556) Q Consensus 270 ~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~ 349 (556) ++|..+..++.-.--.+..+...|.=-.+|+-...+..... .. . ...........+|.|.. 
T Consensus 198 ~vi~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~-----~~-----------~---~~~~~~~~~~~~~~i~~ 258 (434) T protein:vir:98 198 GVLDIQDRVNLGILNRMAASRFSGFRQKWIKGHKFAKRTDP-----AT-----------G---MTVVDQPFVPSPSAVWA 258 (434) T ss_pred hHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccc-----cc-----------c---cchhhhhhhcccccccc Confidence 87666655554433333333333221223331111110000 00 0 00000111234555544 Q ss_pred cCCCceeeeecCC-CCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 350 LYPGTKLKMQPAG-TPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAI 428 (556) Q Consensus 350 L~pGe~i~~~~~~-~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi 428 (556) + +|+++++.+.+ ....+|...++..++.+++..++|-+.+.++.++.|-.++|..+.......+..|..|-..+-+ + T Consensus 259 ~-~~~~~~~~q~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~-~ 336 (434) T protein:vir:98 259 S-EGENTQFGQLDATDLSGFLKEHASDVRDMLTISQTPTYLYATDLVNISADTIGALDILHVAKVREHIASFSEGLES-V 336 (434) T ss_pred C-CCCCceEEEecCcchHHHHHHHHHHHHHHhcccCCCHHHhccccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-H Confidence 4 46667765533 2345677778888999999999999999988777788888888888877777777776655433 3 Q ss_pred HHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCCCHHH Q lcl|NC_019524. 429 YTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGGDFRE 508 (556) Q Consensus 429 ~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~ 508 (556) ++.-+ .+ .|. + . . ..-+.+.|..+... 
+....+++..+.+..|+ +.+.+....|.++++ T Consensus 337 ~rl~~--~~-~g~-~--~----------~--~~~~~v~w~~~~~~--s~~~~ada~~kl~~~g~-~~e~~~~~lg~~~~e 395 (434) T protein:vir:98 337 LALAA--AQ-AGV-P--E----------D--YTEAEVRWANPAHV--TMAVKADAATKLKSIGY-PLDVIAEELDESPAR 395 (434) T ss_pred HHHHH--Hh-cCC-C--h----------h--heeeeEEecCCCCC--CHHHHHHHHHHHHhcCC-cHHHHHHhCCCCHHH Confidence 33222 12 231 1 0 0 01246888777654 77888888888888886 554444445988876 Q ss_pred HHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCC Q lcl|NC_019524. 509 VFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNE 552 (556) Q Consensus 509 v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~ 552 (556) +-+ +..|++.-..+.-.... ......+...+++++.+++ T Consensus 396 ~~r-~~~e~~~~~~~~~~~~~----~~~~~~~g~~~~~~~~~dg 434 (434) T protein:vir:98 396 VRR-IVAGAASQALLAASLLP----APGAPSAGNVPDSGGAVDG 434 (434) T ss_pred HHH-HHHHHHHHHHHHHhhhc----cCCCCCCCCCCcccCCCCC Confidence 543 33332211111100000 0011111112223333333 No 52 >protein:vir:94666 Length: 723 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579205;genbank:gi:93007441;genbank:GeneID:5076785 Probab=99.66 E-value=1.3e-16 Score=107.65 Aligned_cols=424 Identities=10% Similarity=0.019 Sum_probs=222.2 Q ss_pred hhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHhhhccCCceeeeeccccccCC Q lcl|NC_019524. 29 AVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGA 108 (556) Q Consensus 29 ~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~ 108 (556) .++....++.+ ..|.+..... .+...+.+++.+..+|+.|.+.|-..-|++.- . 
T Consensus 1 ~~~~~~~~g~~-----~~~~~~~~~~-------------~~~~~~~~~~~V~acV~~Ia~~iA~lpl~l~~-~------- 54 (723) T protein:vir:94 1 MTTFPSGAGGW-----NAWSADSVFG-------------NGAKGWSNSAVAYRCISMLANNAASVDLVVRG-P------- 54 (723) T ss_pred CcccccCCCcc-----cccccccccc-------------ccHHHHhhhHHHHHHHHHHHHhhccceeEEEc-C------- Confidence 11122121111 1343222111 11223578899999999999888876666531 1 Q ss_pred ChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEEEeeccCCCCcCCCcccceEEEEEch Q lcl|NC_019524. 109 PDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISP 188 (556) Q Consensus 109 ~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~ 188 (556) +++..+ -..++..+-.+|+ -.+|..++-.+.+..++..|++|+.+....+. . ...|..|..|.+ T Consensus 55 ~~~~~~-----~~~l~~lL~~~PN------~~~t~~~f~~~~~~~lll~Gnay~~i~r~~r~---~--~g~p~~l~~l~~ 118 (723) T protein:vir:94 55 DGELDE-----LHPLSQLWNVMPN------RAMPAQVLKALSMTRLQLDGQCHLWLNYNGRT---P--AGVPDEIWYVYD 118 (723) T ss_pred CCccch-----hhHHHHHHhhCCC------CCCCHHHHHHHHHHHHhhcCCeEEEEEecCCc---c--ccceeEEEEecC Confidence 111111 1233444444555 25688999999999999999999998654221 1 234667788876 Q ss_pred hhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceeeccccCChhHeEeeecccCCCcccCCchh Q lcl|NC_019524. 189 YRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPARFDWGRRRVIHIIEALLAGQTRGISEM 268 (556) Q Consensus 189 drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~l 268 (556) +.+... ... 
+.+.-..+..+.|.+...+ | ....+++++|||+......+...|+|++ T Consensus 119 ~~~~v~-~~~-------~~~~~~~~~~~~y~~~~~~-G--------------~~~~~~~~dIiHir~~~~~dg~~G~Spi 175 (723) T protein:vir:94 119 RVTTIV-ATR-------AADAVPQAQIIGYVIERTD-G--------------VRVPVLADEMLWLRFSDPYDPLAVMAPW 175 (723) T ss_pred cceEEe-ecC-------CCccceeeeeeEEEEEecC-c--------------eeEEecccceEEecCCCCCCCcccccHH Confidence 544211 111 1111112234456554321 1 1124678899999877667888999999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccccccccccccccccccceec--CCce Q lcl|NC_019524. 269 VSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAI--DGAK 346 (556) Q Consensus 269 a~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l--~pG~ 346 (556) ..+...+......++....-.+=.+.-.++|+.+.-+...... ....+.....+ ........-| +.+. T Consensus 176 ~~a~~~i~~~~aa~~~~~~~f~NG~~p~giL~~~~l~~e~~~~-------~~~~~~~~~~G---~~Nagk~~vL~g~~~~ 245 (723) T protein:vir:94 176 KAARAAVDADFYAATWQRQSFKNGARPGGVVNLGDMDEQTFTK-------TVAAFRSQVEG---VQNAGRHLLIAGQGSD 245 (723) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEcCCCCHHHHHH-------HHHHHHHHhhc---hhhcCcceeecccccc Confidence 9888777766666555555555556667888754211111100 00111110000 0011111112 2334 Q ss_pred eeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 347 IPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSARASMAETQKYMDSRKKLVADRFAS 426 (556) Q Consensus 347 i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~ 426 (556) +..|..|.+++.++.+.-..+|.+..+....+||+.+|||.+.|.+ .++||++.+....| +...+. 
T Consensus 246 ~~vl~~G~~~~~l~~s~~D~q~le~r~~~~~eIa~afgVPp~~i~~---~st~sN~e~~~~~f-----------~~~tL~ 311 (723) T protein:vir:94 246 GGAAGKGATFTSLSMSPAEMDYINSRMHSAEEVMLAFGIRKDALLG---GSTYENQAEAKAAV-----------WTETLI 311 (723) T ss_pred cccccCCceEEEccCCHHHHHHHHHHHHhHHHHHHHhCCChhHcCC---CCCcccHHHHHHHH-----------HHHHHH Confidence 5567789999998877667789999999999999999999887754 35788765555544 334456 Q ss_pred HHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCCCH Q lcl|NC_019524. 427 AIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGGDF 506 (556) Q Consensus 427 pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~ 506 (556) |+.+. ++.++....++--++ .. .+++-.-.-.--|+...+++....+++|+.|+-|+-+..|+.| T Consensus 312 P~~~~-ie~~ln~~Ll~~~g~-~~-------------~~~f~~~~lLr~D~~~r~~~~~~~v~~G~~T~NE~R~~lglpP 376 (723) T protein:vir:94 312 PQMEV-MASITDLQLLPDIGW-TV-------------EWDFNSVPALQEDLEAQAGRNQGYLVNDVLMVDEVRATIGLDP 376 (723) T ss_pred HHHHH-HHHHHhHhhcccccC-ce-------------EEeecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCC Confidence 65554 444555544432111 10 1111111111238888999999999999999999888889877 Q ss_pred HHHHHH----------H--------HHHHHHHHHcCCCCC--ccccccCCCCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 507 REVFKQ----------R--------AREEGLIKSLKLDFT--GKMVEGNSTQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 507 e~v~~q----------~--------a~E~~~~~~~Gl~~~--~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 556 (556) -+--+. . +.+....+-+.+... .+...+-.+..+.+.-..++.++.+++. 
T Consensus 377 i~gGd~~~~~~p~~~~~a~~~~~~p~~~e~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~ 446 (723) T protein:vir:94 377 LPGGIGQMTLTPYRAQFAPAPAPAPAVEEGAARMLALLERVAADRPLPELPVRATTVLHHDPGPDPQQTL 446 (723) T ss_pred CCCCcccceeccccccccCCCCCCccchhhhHhhhhhccccccccCcCCCCCCCCCCCCCCcccCCchhH Confidence 532210 0 000000000000000 0000000001111111111222222222 No 53 >protein:vir:4337 Length: 434 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061500;genbank:gi:9635589;genbank:GeneID:1262858 Probab=99.66 E-value=1e-16 Score=108.24 Aligned_cols=429 Identities=14% Similarity=0.057 Sum_probs=229.0 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhcccc--CCCc-ccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcCh Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAE--RTTR-EMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDG 77 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~--~~~r-~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~ 77 (556) |+-+-.. +.++ +....+...+++.+-+ .++. ....|.....+.. ..-+.+-+..++ T Consensus 1 ~~~~l~~--~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g----------~~v~~~~al~~~ 59 (434) T protein:vir:43 1 MSKSLGK--VLSS---------ATSAPRSSLFGWGGKTIRLTDGAFWSQFLGRESSSG----------KKVTVDKAMKLS 59 (434) T ss_pred Cccchhh--hhhh---------cccccchhhhcccccccccCchHHHHHHhcCCccCC----------ceechhhhhccH Confidence 3322211 1110 0000001111111000 0000 0001100000000 001122334467 Q ss_pred HHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheec Q lcl|NC_019524. 78 YAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMT 157 (556) Q Consensus 78 ~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~d 157 (556) -+-.+|+.|.+.|-.--|++.-+.. ++...+ ..-..++..+..+++ ..+|..++-+..+..++.. 
T Consensus 60 ~V~~~i~~ia~~ia~lp~~~~~~~~------~g~~~~---~~~~~l~~lL~~~PN------~~~t~~~f~~~~~~~lll~ 124 (434) T protein:vir:43 60 AVWACVRLISTSVAGLPLGVYERKA------DGSRVD---ARSFPLYDVVHNSPN------DDMTAFQFWQAMVASMLLW 124 (434) T ss_pred HHHHHHHHHHHhhhhCceEEEEEcC------CCcccc---ccccHHHHHHhccCC------CCCCHHHHHHHHHHHHhhc Confidence 7889999999888876666532211 111000 001123444445554 3568889999999999999 Q ss_pred CceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccc Q lcl|NC_019524. 158 GEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWG 237 (556) Q Consensus 158 GE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~ 237 (556) |++|+.+++. . | .+..|..|.|+.|. |..|.+|+.. |.+.... | T Consensus 125 Gnay~~i~~~--~-----G--~~~~L~~l~p~~v~--------------~~~~~~g~~~-y~~~~~~-g----------- 168 (434) T protein:vir:43 125 GNAYAEIRRA--A-----G--RPAALDFLLPSRVD--------------LECDENGRLK-YFYTTKK-G----------- 168 (434) T ss_pred CCeEEEEEeC--C-----C--cEEEEEEEcCcceE--------------EEEcCCCeEE-EEEEecC-c----------- Confidence 9999887532 1 2 35678888888873 3445566554 4333211 1 Q ss_pred eeeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccc Q lcl|NC_019524. 238 YEPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQG 317 (556) Q Consensus 238 rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~ 317 (556) ....+++.+|||+... ..+...|+|++..+...+......++....-.+=.+...++|+.+..-. . T Consensus 169 ---~~~~~~~~eVih~~~~-~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~----------~ 234 (434) T protein:vir:43 169 ---ARREIERTNMLHIPAF-TLDGRIGLSAIRYGVDVFGSVMSAEDAANGTFKNGLLPTVAFKVDRILQ----------P 234 (434) T ss_pred ---eEEEEccccEEEecCc-CCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEecCCCCC----------H Confidence 1235678899999765 4566899999999888887777777776666666777888888753211 1 Q ss_pred cccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcc Q lcl|NC_019524. 
318 GFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKT 397 (556) Q Consensus 318 ~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~ 397 (556) +..+....... .....-..|.+..|+.|.+++.++.+....+|.+..+....+||..+|||-+.| |+..+. T Consensus 235 e~~~~~r~~~~--------~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~ 305 (434) T protein:vir:43 235 AQREEFREYVK--------SVSGAMNSGRSPVLEQGITPETIGINPVDAQLLETREHGVIEICRWFGVPPWMI-GQTDKG 305 (434) T ss_pred HHHHHHHHHHH--------HhcCccccCCccccCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHh-CCCcCC Confidence 11111111110 001112356777899999999998877788999999999999999999998755 776554 Q ss_pred c--chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCccccc Q lcl|NC_019524. 398 N--YSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQI 475 (556) Q Consensus 398 n--YSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~i 475 (556) + ||++.+..+.| +..-+.|+...+ +.++....++-..... ..+++-.....-. T Consensus 306 ~~~~s~~e~~~~~f-----------~~~~L~P~~~~i-e~~ln~kL~~~~~~~~-------------~~~~fd~~~llr~ 360 (434) T protein:vir:43 306 SNWGTGLEQQMLAF-----------LTFSISSITNQI-QQCVNKRLLTAPERIR-------------YYAEFSLEGFLKA 360 (434) T ss_pred ccccchHHHHHHHH-----------HHHHHHHHHHHH-HHHHHhhcCChhhhcC-------------ceEEEechhhhcc Confidence 4 67665544444 444566666654 4444444333211100 1133333333345 Q ss_pred chhhhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCcCC Q lcl|NC_019524. 476 DEKKETEAAILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEETT 555 (556) Q Consensus 476 DP~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~ 555 (556) |+...+++....+++|+.|+-|+-+..|..|-+-.+++-- -+++.. 
.+.........+.........++.+.+ T Consensus 361 d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD~~~~------~~n~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~ 433 (434) T protein:vir:43 361 DSAGRAAWYSTMAQNGFMTRNEGRRKENLPELPGGDILTV------QSNLVP-IDQLGQSNKSQAVRAALMNWFSQPEPQ 433 (434) T ss_pred CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeEee------ccCccc-hhhhhccCCCcchhhhhhccCCCCCCC Confidence 9999999999999999999999999999988432221110 011110 000000011111111111111222222 Q ss_pred C Q lcl|NC_019524. 556 Q 556 (556) Q Consensus 556 ~ 556 (556) | T Consensus 434 ~ 434 (434) T protein:vir:43 434 E 434 (434) T ss_pred C Confidence 2 No 54 >protein:vir:5737 Length: 419 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892048;genbank:gi:33770511;goa:Q7Y412;interpro:IPR006427;interpro:IPR006944;uniprot:Q7Y412;genbank:GeneID:1732929;interpro:IPR010994 Probab=99.65 E-value=4.1e-16 Score=104.96 Aligned_cols=411 Identities=10% Similarity=-0.024 Sum_probs=222.8 Q ss_pred chhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHH Q lcl|NC_019524. 4 VKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVV 83 (556) Q Consensus 4 ~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v 83 (556) |.=.+..+-+... ....+.. ...+|......+-..+ -.+. +..++-+..+| T Consensus 1 m~~~~~~~~~~~~-------------~~~~~~~------~~~~~~~~~~~~g~~v---------~~~~-al~~~~v~~~i 51 (419) T protein:vir:57 1 MFIPQFWKGRPSE-------------NRVNWQV------VPGGMRSSSSQAGVII---------TPET-ALALSAVRACV 51 (419) T ss_pred CcchhhhccCCcc-------------ccccccc------cccccccccccCCcee---------chHH-hhccHHHHHHH Confidence 1111111000000 0000000 0011111111110001 0112 23456789999 Q ss_pred HHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHH--HHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceE Q lcl|NC_019524. 
84 AVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVE--ARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVL 161 (556) Q Consensus 84 ~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie--~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f 161 (556) +.+.+.|-+--|++.-+... +.. +.+. .++..+..+|+ --+|..++.+..+..++..|++| T Consensus 52 ~~ia~~ia~lp~~~~~~~~~------g~~-----~~~~~~~l~~lL~~~PN------~~~t~~~f~~~~~~~l~l~Gna~ 114 (419) T protein:vir:57 52 TLLAESVAQLPCVLYRRTEN------GGR-----EIAFDHPLHDLIRYQPN------RKDTAFEYHEQTQGVLGLEGNSY 114 (419) T ss_pred HHHHHhhccCceEEEEEcCC------Cce-----eccccchHHHHHhhccc------cCCCHHHHHHHHHHHHhhcCCeE Confidence 99999988766665322211 000 0111 13344444554 35788899999999999999999 Q ss_pred EEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceeec Q lcl|NC_019524. 162 ATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPA 241 (556) Q Consensus 162 ~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~ 241 (556) +.+++... ..+..|..|+|+.+.. ..|.+|.+ -|.+. +.. T Consensus 115 ~~i~r~~~--------G~~~~L~pl~~~~v~v--------------~~~~~g~~-~y~~~----~~~------------- 154 (419) T protein:vir:57 115 SLIDRNGR--------GDITELIPINPHKVIV--------------LKGPDGMP-YYDIP----SIG------------- 154 (419) T ss_pred EEEEECCC--------CcEEEEEEEcCcceEE--------------EECCCceE-EEEEc----CCc------------- Confidence 98865321 2367889999988842 22334443 24331 110 Q ss_pred cccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccc Q lcl|NC_019524. 242 RFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKE 321 (556) Q Consensus 242 ~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~ 321 (556) ..++.++|||+... ..+...|+|++..+...+.......++...-.+=.+...++|+.+....... 
..+..+ T Consensus 155 -~~~~~~~vih~r~~-~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~------~~e~~~ 226 (419) T protein:vir:57 155 -EILPMRMVHHIKSF-SLDGYIGTSPIQTNPDVLGLGIAVEQHAAQVFARGTTMSGVIERPFEAKAIA------SQAAVD 226 (419) T ss_pred -eEEchhhEEEecCc-CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHccCCccEEEEecCcCCccc------CHHHHH Confidence 13567899999765 4566899999998888777766666666555555677778888653321100 000000 Q ss_pred cccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccchh Q lcl|NC_019524. 322 IFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSS 401 (556) Q Consensus 322 ~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs 401 (556) ...... .....+ .-..|.+..|..|.+++.++.+.....|.+..+.....||+.+|||-+.| ++..+.|||+ T Consensus 227 ~~~~~~---~~~~~g----~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~t~sn 298 (419) T protein:vir:57 227 AILAKW---TERYGG----VRNAFSVGMLQEGMTYKQLSQDNEKAQLLQSRQYTVNEVCRLYKVPPHMI-QDLQKSTNNN 298 (419) T ss_pred HHHHHH---HHHhcc----ccccccceecCCCceEEEcCCChhhHHHHHHHHHHHHHHHHHhCCCHHHh-CCCCCCcccc Confidence 000000 000000 01346778899999999999887788899999999999999999999866 6777789998 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhh Q lcl|NC_019524. 402 ARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKET 481 (556) Q Consensus 402 ~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~ 481 (556) +.+....| +...+.|+...+ +.++....+.- .... . 
..+++-......-|+...+ T Consensus 299 ~e~~~~~f-----------~~~~l~P~~~~i-e~~l~~~ll~~-~~~~----------~--~~i~fd~~~ll~~d~~~~~ 353 (419) T protein:vir:57 299 IEHQGLQY-----------VIYTMLAILKRH-ESAMMRDLLLP-SERR----------D--FYIEFNVSSLLRGDQKSRY 353 (419) T ss_pred HHHHHHHH-----------HHHHHHHHHHHH-HHHHHhhccCc-cccC----------C--eEEEEechhhhccCHHHHH Confidence 75554443 455567755553 44555554431 1111 0 0122222222335899999 Q ss_pred HHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 482 EAAILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 482 ~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 556 (556) ++....+++|+.|+-|+-+..|.+|-+--+++ =++.... +.........+.+++..+.++ T Consensus 354 ~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD~~----------~~~~n~~-----~~~~~~~~~~~~~~~~~~~~~ 413 (419) T protein:vir:57 354 ESYALGRQWGWLSVNDIRRMENLTPIPGGDKY----------LTPLNMV-----DSKALTGIGKATPQQLKDIEA 413 (419) T ss_pred HHHHHHHhCCCcCHHHHHHHhCCCCCCCcCee----------eeccccc-----cccccccccCCCcccCcchhh Confidence 99999999999999999988898875421111 0111100 000101100111111111111 No 55 >protein:vir:100249 Length: 431 # NCBI annotation: gp78 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355414;genbank:gi:77864704;genbank:GeneID:3725971 Probab=99.65 E-value=2.4e-16 Score=106.23 Aligned_cols=420 Identities=11% Similarity=0.025 Sum_probs=224.1 Q ss_pred CCcchhhhHHHHHhh-Hhhc-ccchhhhhhh---hcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019524. 1 MKDVKKTTRTRAKKA-VDVV-AETATATPMA---VGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQN 75 (556) Q Consensus 1 ~sp~~~~~r~~a~~a-~~~~-~~~~~~~~~~---~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rN 75 (556) |.=+.-.+|..+..+ .... .+...+.... .+..+.+. .+.....|.......-. . -+-+-+.. 
T Consensus 1 Mgl~d~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~--~~~~~~~~~~~~~~~g~-------~---v~~~~al~ 68 (431) T protein:vir:10 1 MGLFDFIRREKQPEAQARPHVEPSFQASTPTTSIPGETFEGL--DDPRLKEYIRRGELNGG-------T---GRETRALR 68 (431) T ss_pred CcchhhhhcCcccccccccccccccccccccccccccccccc--cchHHHHhhccCccCcc-------e---echhhhhc Confidence 332222222111110 0000 0000000000 00111111 01111111110000000 0 01122335 Q ss_pred ChHHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhhe Q lcl|NC_019524. 76 DGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFL 155 (556) Q Consensus 76 n~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~ 155 (556) ++.+..+|+.|.+.|-.--|++.-+.+. .+. ..-..++..+..+|+ -.+|..++-...+..++ T Consensus 69 ~~~V~~ci~~Ia~~iA~lp~~v~~~~~~------~~~-----~~~~~~~~lL~~~PN------~~~t~~~f~~~l~~~ll 131 (431) T protein:vir:10 69 NMAVLRCVTLISGTIGMLPMNLISSDDS------KQV-----LTDDPAHRLLKYKPN------DWQTPMEFKSLMQLRAL 131 (431) T ss_pred cHHHHHHHHHHHHhhccCceEEEEecCc------eee-----eccchHHHHHhhccC------CCCCHHHHHHHHHHHHh Confidence 7889999999888888766665432111 000 000223444444554 24688889899999999 Q ss_pred ecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccc Q lcl|NC_019524. 156 MTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWK 235 (556) Q Consensus 156 ~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~ 235 (556) ..|++|+.+.+.. ..+..|..++|+.+. |+.+..|.+ -|.+...+ |. T Consensus 132 l~Gna~~~i~r~~---------g~~~~L~pl~~~~v~--------------~~~~~~~~~-~y~~~~~~-g~-------- 178 (431) T protein:vir:10 132 LDGESMARIVWSG---------NRPIRLIPMDRGSAK--------------GRLTSTWQI-VYDYTTPT-GD-------- 178 (431) T ss_pred hcCCeEEEEEEcC---------CceEEEEEEcCceeE--------------EEEcCCCeE-EEEEEeCC-ce-------- Confidence 9999999986531 135678888877764 344555554 35444221 11 Q ss_pred cceeeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccc Q lcl|NC_019524. 
236 WGYEPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMG 315 (556) Q Consensus 236 ~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~ 315 (556) ...++.++|||+... ..+...|+|++..+...+.-....++....-.+=.+...++|+.+..-. T Consensus 179 ------~~~~~~~dViHir~~-~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~ls--------- 242 (431) T protein:vir:10 179 ------KIELPAREVFHLRDL-SIDGVSGVSRVKLSGNALELAEQAERAASRTFRTGVMAGGAIEVPKELS--------- 242 (431) T ss_pred ------EEEEchhhEEEecCc-CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEecCCCCC--------- Confidence 124778899999765 4667999999988877776666666665555566778888888653211 Q ss_pred cccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhh Q lcl|NC_019524. 316 QGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYT 395 (556) Q Consensus 316 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s 395 (556) ++..+...... .....+ .=..|.+..|+.|.+++.++.+.-..+|.+-.+....+||..+|||.+.| ++.+ T Consensus 243 -~e~~~~~~~~~---~~~~~g----~~n~g~~~vl~~g~~~~~l~~~~~d~q~le~r~~~~~~Ia~~fgVPp~~l-g~~~ 313 (431) T protein:vir:10 243 -DNAYGRMKASV---QENHTG----SENAGSWMLLEEGATAKQFSNTAASAQQIENRNHQIEEVARMYGVPRPLL-MMDD 313 (431) T ss_pred -HHHHHHHHHHH---HHHhcC----ccccCCceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHh-CCCC Confidence 11111100000 000000 01356778899999999998877777888888889999999999999866 6677 Q ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCccccc Q lcl|NC_019524. 396 KTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQI 475 (556) Q Consensus 396 ~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~i 475 (556) +++||++.+..+.|.+ .-+.|+... ++.++....++ +.... -..+++-....--. 
T Consensus 314 ~~t~sn~eq~~~~f~~-----------~tL~P~~~~-ie~~ln~~Ll~-~~~~~------------~~~~~fd~~~llr~ 368 (431) T protein:vir:10 314 TSWGSGIEQLAIFFIQ-----------YGLSHWFVS-WEQAAARAFLP-EKMLG------------QRQFKFNEGALLRG 368 (431) T ss_pred CCccccHHHHHHHHHH-----------HHHHHHHHH-HHHHHHhhccC-hhhcC------------CceEEEechhhhcc Confidence 7899988766666643 345565553 34444444432 11000 01123333333345 Q ss_pred chhhhhHHHHHHHHcCC----CCHHHHHHHhCCCHHHH--HHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCC Q lcl|NC_019524. 476 DEKKETEAAILRIKNGL----STYEAEISRLGGDFREV--FKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDN 549 (556) Q Consensus 476 DP~Ke~~A~~~~i~~G~----~s~~~~~ae~G~D~e~v--~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~ 549 (556) |+...+++..+++.+|+ .|+-|+-+..|.+|-+- .+++ -.+.... . .+++++ T Consensus 369 d~~~r~~~~~~~~~~G~~~g~lT~NE~R~~~gl~p~~~~~gD~~----------~~p~n~~-----~-------~~~~~~ 426 (431) T protein:vir:10 369 TLNDQAAFFSKALGAGGQSPWMKQNEVREMLDLPRADDPVADQL----------RNPMTQK-----Q-------KGSGDE 426 (431) T ss_pred CHHHHHHHHHHHHhcccccCccCHHHHHHHhCCCCCCCccccce----------ecccccc-----c-------CCCCCC Confidence 99999999999887766 79999888888877532 3221 1111100 0 011111 Q ss_pred CCCcC Q lcl|NC_019524. 550 PNEET 554 (556) Q Consensus 550 ~~~e~ 554 (556) ++.-| T Consensus 427 ~p~~~ 431 (431) T protein:vir:10 427 PPATT 431 (431) T ss_pred CCCCC Confidence 11111 No 56 >protein:vir:102118 Length: 409 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699943;genbank:gi:110804051;genbank:GeneID:4206661 Probab=99.65 E-value=1.2e-15 Score=102.38 Aligned_cols=404 Identities=13% Similarity=0.055 Sum_probs=225.1 Q ss_pred hcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHhhhccCCcee Q lcl|NC_019524. 
18 VVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVVAVHRDSIVGSQYKL 97 (556) Q Consensus 18 ~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v~~~~~nvVG~Gi~~ 97 (556) |.=.... ..+.... +........|.....+. ..-+.+-+..++.+.++|+.+.+.|-+--|++ T Consensus 1 m~f~~~~-~~~~~~~-----~~~~~~~~~~~g~~~~~-----------~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~ 63 (409) T protein:vir:10 1 MLFRKGF-KNQSQEI-----SIDDKKILEWLGINPSE-----------TYVNGKSCLKQATVFGCIRILSDNISKLPIKI 63 (409) T ss_pred Ccccccc-cCcCCCC-----CCChHHHHHHhcCCcCc-----------ceechhhhhccHHHHHHHHHHHHhhhhCceEE Confidence 1100000 0000000 00111112222111100 00112234468889999999999999877766 Q ss_pred eeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEEEeeccCCCCcCCCc Q lcl|NC_019524. 98 NAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLATCEWLNPTGTTMQRR 177 (556) Q Consensus 98 ~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~~~~~~~ 177 (556) .-+.+. .+.. .. ..+...+-.+|+ --+|..++....+..++..|++|+.+++... T Consensus 64 ~~~~~~------~~~~--~~---~~l~~lL~~~PN------~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~-------- 118 (409) T protein:vir:10 64 YQKKDG------IKRV--PD---HYLEYLLKLRPN------PYMSSSDFWKCIEVQRNIYGNAYVALDFKKN-------- 118 (409) T ss_pred EEecCC------eeec--cC---chHHHHHhhccC------CCCCHHHHHHHHHHHHhhcCCeEEEEEEcCC-------- Confidence 422110 0000 00 112223334554 3578888888889899999999999875432 Q ss_pred ccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCC-----CeEEEEEeecCCCccccCCccccceeeccccCChhHeEe Q lcl|NC_019524. 178 PFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNG-----AALGYWLRKAFPGDPTDMEQWKWGYEPARFDWGRRRVIH 252 (556) Q Consensus 178 ~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~G-----r~vaY~i~~~hpgd~~~~~~~~~~rv~~~~~v~a~~viH 252 (556) ..+..|..|.|+.+. |..|+.| ..+.|.+... 
-| ....+++.+||| T Consensus 119 G~~~~L~~i~~~~V~--------------v~~~~~~~~~~~~~~~y~~~~~-~g--------------~~~~~~~~evih 169 (409) T protein:vir:10 119 GEIKGLYPLKSDGMK--------------IFVDDTGLLNSENNVWYLYTDD-LG--------------QRHKFMSDEILH 169 (409) T ss_pred CcEEEEEEEcCCceE--------------EEEcCCccccccceEEEEEEeC-Cc--------------eeEEeccccEEE Confidence 235688889988874 2233322 2344544321 11 112467889999 Q ss_pred eecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCccccccccccccccccccccccccccccc Q lcl|NC_019524. 253 IIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLAN 332 (556) Q Consensus 253 ~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 332 (556) +... .++...|+|++..+...+......++......+-.+...++|+.+..-. .+..+........ T Consensus 170 ~r~~-~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~----------~e~~~~~~~~~~~--- 235 (409) T protein:vir:10 170 FKGL-TADGLAGLSVIELLNHLIENGKSSETYLNNFFKNGLQVKGLVQYAGDLN----------PEAEEVFKENFER--- 235 (409) T ss_pred ecCc-CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCCCC----------HHHHHHHHHHHHH--- Confidence 8765 4677999999988888777766666666666666778888888653210 0011111010000 Q ss_pred ccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccchhHHHHHHHHHHH Q lcl|NC_019524. 
333 YVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSARASMAETQKY 412 (556) Q Consensus 333 ~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R~~~~e~~r~ 412 (556) ...+ .-..|.+..|..|.+++.++.+.-...|.+..+.....||+.+|||-..| ++..+.|||++.+....+ T Consensus 236 ~~~g----~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~~~~~~e~~~~~f--- 307 (409) T protein:vir:10 236 MSSG----LKNAHRIAMLPIGYKFEPISQKLVDAQFLENSQLTIRQIASVFGVKMHQL-NDLDRATHSNITEQNREF--- 307 (409) T ss_pred Hhcc----ccccCCceecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHc-CCCCCCccccHHHHHHHH--- Confidence 0000 01357788899999999998877778888999999999999999999866 677788999987665554 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCC Q lcl|NC_019524. 413 MDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGL 492 (556) Q Consensus 413 ~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~ 492 (556) +..-+.|+.+. ++.++....+..+.... -..++|........|+...+++....+.+|+ T Consensus 308 --------~~~~l~P~~~~-ie~~ln~kL~~~~~~~~------------~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~ 366 (409) T protein:vir:10 308 --------YIDTLQSILNM-YELEINYKLFLISEIKN------------GFYSKFNVDTILRADIKTRYESYKEAIQNGF 366 (409) T ss_pred --------HHHHHHHHHHH-HHHHHHHhhcCchhccC------------CcEEEEechhhhccCHHHHHHHHHHHHhCCC Confidence 34445565554 34444443332211100 0112333333344699999999999999999 Q ss_pred CCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCC Q lcl|NC_019524. 493 STYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNE 552 (556) Q Consensus 493 ~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~ 552 (556) .|+-|+-+..|..|-+--+ ++=++....+.... . 
+.....+++ T Consensus 367 ~T~NE~R~~lgl~p~~ggD----------~~~~~~n~~~~~~~-----~--~~~~kgGe~ 409 (409) T protein:vir:10 367 KTPNEIRELEEDEPLEGGD----------VLLINGNMIPVKMA-----G--EQYSKGGEK 409 (409) T ss_pred cCHHHHHHHhCCCCCCCcC----------eeeeccCccchhhc-----c--ccccccCCC Confidence 9999998888887743111 11011110000000 0 000111111 No 57 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=99.64 E-value=2.1e-15 Score=101.10 Aligned_cols=432 Identities=12% Similarity=0.052 Sum_probs=224.1 Q ss_pred CCcchhhh---HHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcCh Q lcl|NC_019524. 1 MKDVKKTT---RTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDG 77 (556) Q Consensus 1 ~sp~~~~~---r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~ 77 (556) |+|..+.. +....-. .+. .+- .....|.-+... ...+.. . ... .++..-+.+ .+. T Consensus 1 ~~~~t~~~~~~~l~~~~~----~~~--~r~-~~l~~Yy~g~~~---i~~~~~-~--~~~-------~~~~~~~k~--~~n 58 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRID----DGM--SRV-RLLARYSNGDAP---LPELTR-N--TSA-------AWRSFQREA--RTN 58 (456) T ss_pred CCCCCHHHHHHHHHHHHH----HHH--HHH-HHHHHHHhcCCC---chhcCc-c--cCh-------hhhhhhhhh--hcc Confidence 77666643 2221111 111 111 112334433221 111111 1 111 111111222 246 Q ss_pred HHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheec Q lcl|NC_019524. 78 YAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMT 157 (556) Q Consensus 78 ~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~d 157 (556) |++-+|+..+..++|.||+.....+ .+. . +.+|+-|.. 
.+|...+..+++..++- T Consensus 59 ~~~~ivd~~~~~l~~~~~~~~~~~d-------~~~----~---~~~~~i~~~-----------N~~d~~~~~~~~~a~i~ 113 (456) T protein:vir:10 59 WGLMVRDSVADRIIPNGITVGGSAD-------SDL----A---LRARRIWRD-----------NRMDSVCKQWVKYGLDF 113 (456) T ss_pred hHHHHHHHHHhhhccCCeecCCCCC-------cch----H---HHHHHHHHh-----------cChhhHHHHHHHHHhhc Confidence 9999999999999999998643321 111 1 233443432 15667788889999999 Q ss_pred CceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEE--CCCCCeEEEEEeec------------ Q lcl|NC_019524. 158 GEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQL--DNNGAALGYWLRKA------------ 223 (556) Q Consensus 158 GE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~--d~~Gr~vaY~i~~~------------ 223 (556) |.+|.. ++....+ . .++.+++|..+-.-++...+..+..+|.+ +.++.+..+-++.. T Consensus 114 G~ay~~-v~~d~~g-----~---~~i~~~~p~~~~~i~d~~~~~~~~~~i~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~ 184 (456) T protein:vir:10 114 GESYLT-CWRRDDG-----T---ATITADSPETMVVSVDPLQPWRIRAAMRWWRDLDAESDFAIVWSGDGWQKFARPCFV 184 (456) T ss_pred CeeEEE-EeeCCCC-----c---eEEEEEccceeEEEEcCCCCcceEEEEEEEEecCCceeEEEEEeccceeEEEEEEEE Confidence 999975 4543222 1 36888999988544554555566666654 22333321111000 Q ss_pred ---CCCccccCCccccceeecc-ccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhccee--- Q lcl|NC_019524. 224 ---FPGDPTDMEQWKWGYEPAR-FDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYA--- 296 (556) Q Consensus 224 ---hpgd~~~~~~~~~~rv~~~-~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~--- 296 (556) -...........|..+... ...+..-| +.|. -..|+|.|.++|.. 
++.|..+....+..+..++ T Consensus 185 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pv-v~~~-----N~~g~gd~e~vi~l---iDa~~~~~s~~~~~~~~~a~~~ 255 (456) T protein:vir:10 185 QSSSRRRLVTRISDSWVPVGDAVVTGSPPPV-VVYQ-----NPDGMGEVEPHIDI---INRINRAELQLLSTMAIQAFRQ 255 (456) T ss_pred eecccceeeeecCCceeeccccCCCCCceeE-EEec-----CCCCCchhhhhHHH---HHHHHHHHHHHHHHHHHhhhHh Confidence 0000000011111111000 00111112 2222 23689999998754 4555555554443332222 Q ss_pred eeEeccCcccccccccccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHH Q lcl|NC_019524. 297 ASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLL 376 (556) Q Consensus 297 ~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~l 376 (556) .+|+-...+.......| . ...........+|.+..+++|.++..++... -.+|....+..+ T Consensus 256 ~~i~G~~~~~~~~d~~g---------------~---~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~-~~~~~~~l~~~i 316 (456) T protein:vir:10 256 RALKSTEHGLPNVDENG---------------N---AIDYASIFEAAPGALWELPPGVDIWESQAND-FTPMLSAIKEHI 316 (456) T ss_pred HhhhccCcccccccccc---------------c---ccchhhhhhhhccccccCCCCcceEEecccC-hhHHHHHHHHHH Confidence 22321111110000000 0 0000111234678888899999988877553 457999999999 Q ss_pred HHHHHhcCCCHHHhhchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccch Q lcl|NC_019524. 377 RNIAASLGMSYEQFSRDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDP 456 (556) Q Consensus 377 r~iaaglGi~ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~ 456 (556) +.|++..++|-+.+.++.++.|-.++|..+.......+..|..|-..+.+ +++..+ + ..|... 
T Consensus 317 ~~~~~~s~~p~~~~~~~~~N~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~-~~rl~~--~-~~g~~~------------- 379 (456) T protein:vir:10 317 RQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEA-ILVKAL--Q-IEGESV------------- 379 (456) T ss_pred HHHHhccCCChHHhcccccChHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHH--H-hcCCCc------------- Confidence 99999999999999888877777888888888888777777777666544 344322 1 234211 Q ss_pred hhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCC Q lcl|NC_019524. 457 MMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNS 536 (556) Q Consensus 457 ~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~ 536 (556) ..-+.+.|..+.. .+....+.+..+.+.+|+.|.+......|.+++++ ++...|+.+-+..++. ..+... T Consensus 380 ---~~~~~v~w~~~~~--~~~~~~ada~~kl~~~gi~~~~~~~~~lg~~~~~i-~~~e~er~~~e~~~~~--~~~~~~-- 449 (456) T protein:vir:10 380 ---EDTVDVSFESPDR--VTLGEKYSAASLAKAAGESWASIRRNILNYNADQI-KQDDLDRAREQITLFA--GNPVQR-- 449 (456) T ss_pred ---ccceeEEecCCCC--cCHHHHHHHHHHHHHcCCChHHHHHhhCCCCHHHH-HHHHHHHHHHHHHHHh--hhhhhc-- Confidence 0125688988754 46788899989999999999887777779998876 3222222222222211 000000 Q ss_pred CCCCCCCCCCCC Q lcl|NC_019524. 537 TQSSNSSESTSD 548 (556) Q Consensus 537 ~~~~~~~~~~~~ 548 (556) ++++.+. T Consensus 450 -----~~~~~~~ 456 (456) T protein:vir:10 450 -----PQEDGSR 456 (456) T ss_pred -----CCCCCCC Confidence 0000000 No 58 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=99.64 E-value=2.1e-15 Score=101.10 Aligned_cols=432 Identities=12% Similarity=0.052 Sum_probs=224.1 Q ss_pred CCcchhhh---HHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcCh Q lcl|NC_019524. 
1 MKDVKKTT---RTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDG 77 (556) Q Consensus 1 ~sp~~~~~---r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~ 77 (556) |+|..+.. +....-. .+. .+- .....|.-+... ...+.. . ... .++..-+.+ .+. T Consensus 1 ~~~~t~~~~~~~l~~~~~----~~~--~r~-~~l~~Yy~g~~~---i~~~~~-~--~~~-------~~~~~~~k~--~~n 58 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRID----DGM--SRV-RLLARYSNGDAP---LPELTR-N--TSA-------AWRSFQREA--RTN 58 (456) T ss_pred CCCCCHHHHHHHHHHHHH----HHH--HHH-HHHHHHHhcCCC---chhcCc-c--cCh-------hhhhhhhhh--hcc Confidence 77666643 2221111 111 111 112334433221 111111 1 111 111111222 246 Q ss_pred HHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheec Q lcl|NC_019524. 78 YAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMT 157 (556) Q Consensus 78 ~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~d 157 (556) |++-+|+..+..++|.||+.....+ .+. . +.+|+-|.. .+|...+..+++..++- T Consensus 59 ~~~~ivd~~~~~l~~~~~~~~~~~d-------~~~----~---~~~~~i~~~-----------N~~d~~~~~~~~~a~i~ 113 (456) T protein:vir:10 59 WGLMVRDSVADRIIPNGITVGGSAD-------SDL----A---LRARRIWRD-----------NRMDSVCKQWVKYGLDF 113 (456) T ss_pred hHHHHHHHHHhhhccCCeecCCCCC-------cch----H---HHHHHHHHh-----------cChhhHHHHHHHHHhhc Confidence 9999999999999999998643321 111 1 233443432 15667788889999999 Q ss_pred CceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEE--CCCCCeEEEEEeec------------ Q lcl|NC_019524. 158 GEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQL--DNNGAALGYWLRKA------------ 223 (556) Q Consensus 158 GE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~--d~~Gr~vaY~i~~~------------ 223 (556) |.+|.. ++....+ . .++.+++|..+-.-++...+..+..+|.+ +.++.+..+-++.. 
T Consensus 114 G~ay~~-v~~d~~g-----~---~~i~~~~p~~~~~i~d~~~~~~~~~~i~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~ 184 (456) T protein:vir:10 114 GESYLT-CWRRDDG-----T---ATITADSPETMVVSVDPLQPWRIRAAMRWWRDLDAESDFAIVWSGDGWQKFARPCFV 184 (456) T ss_pred CeeEEE-EeeCCCC-----c---eEEEEEccceeEEEEcCCCCcceEEEEEEEEecCCceeEEEEEeccceeEEEEEEEE Confidence 999975 4543222 1 36888999988544554555566666654 22333321111000 Q ss_pred ---CCCccccCCccccceeecc-ccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhccee--- Q lcl|NC_019524. 224 ---FPGDPTDMEQWKWGYEPAR-FDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYA--- 296 (556) Q Consensus 224 ---hpgd~~~~~~~~~~rv~~~-~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~--- 296 (556) -...........|..+... ...+..-| +.|. -..|+|.|.++|.. ++.|..+....+..+..++ T Consensus 185 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pv-v~~~-----N~~g~gd~e~vi~l---iDa~~~~~s~~~~~~~~~a~~~ 255 (456) T protein:vir:10 185 QSSSRRRLVTRISDSWVPVGDAVVTGSPPPV-VVYQ-----NPDGMGEVEPHIDI---INRINRAELQLLSTMAIQAFRQ 255 (456) T ss_pred eecccceeeeecCCceeeccccCCCCCceeE-EEec-----CCCCCchhhhhHHH---HHHHHHHHHHHHHHHHHhhhHh Confidence 0000000011111111000 00111112 2222 23689999998754 4555555554443332222 Q ss_pred eeEeccCcccccccccccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHH Q lcl|NC_019524. 297 ASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLL 376 (556) Q Consensus 297 ~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~l 376 (556) .+|+-...+.......| . ...........+|.+..+++|.++..++... 
-.+|....+..+ T Consensus 256 ~~i~G~~~~~~~~d~~g---------------~---~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~-~~~~~~~l~~~i 316 (456) T protein:vir:10 256 RALKSTEHGLPNVDENG---------------N---AIDYASIFEAAPGALWELPPGVDIWESQAND-FTPMLSAIKEHI 316 (456) T ss_pred HhhhccCcccccccccc---------------c---ccchhhhhhhhccccccCCCCcceEEecccC-hhHHHHHHHHHH Confidence 22321111110000000 0 0000111234678888899999988877553 457999999999 Q ss_pred HHHHHhcCCCHHHhhchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccch Q lcl|NC_019524. 377 RNIAASLGMSYEQFSRDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDP 456 (556) Q Consensus 377 r~iaaglGi~ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~ 456 (556) +.|++..++|-+.+.++.++.|-.++|..+.......+..|..|-..+.+ +++..+ + ..|... T Consensus 317 ~~~~~~s~~p~~~~~~~~~N~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~-~~rl~~--~-~~g~~~------------- 379 (456) T protein:vir:10 317 RQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEA-ILVKAL--Q-IEGESV------------- 379 (456) T ss_pred HHHHhccCCChHHhcccccChHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHH--H-hcCCCc------------- Confidence 99999999999999888877777888888888888777777777666544 344322 1 234211 Q ss_pred hhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCC Q lcl|NC_019524. 457 MMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNS 536 (556) Q Consensus 457 ~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~ 536 (556) ..-+.+.|..+.. .+....+.+..+.+.+|+.|.+......|.+++++ ++...|+.+-+..++. ..+... T Consensus 380 ---~~~~~v~w~~~~~--~~~~~~ada~~kl~~~gi~~~~~~~~~lg~~~~~i-~~~e~er~~~e~~~~~--~~~~~~-- 449 (456) T protein:vir:10 380 ---EDTVDVSFESPDR--VTLGEKYSAASLAKAAGESWASIRRNILNYNADQI-KQDDLDRAREQITLFA--GNPVQR-- 449 (456) T ss_pred ---ccceeEEecCCCC--cCHHHHHHHHHHHHHcCCChHHHHHhhCCCCHHHH-HHHHHHHHHHHHHHHh--hhhhhc-- Confidence 0125688988754 46788899989999999999887777779998876 3222222222222211 000000 Q ss_pred CCCCCCCCCCCC Q lcl|NC_019524. 
537 TQSSNSSESTSD 548 (556) Q Consensus 537 ~~~~~~~~~~~~ 548 (556) ++++.+. T Consensus 450 -----~~~~~~~ 456 (456) T protein:vir:10 450 -----PQEDGSR 456 (456) T ss_pred -----CCCCCCC Confidence 0000000 No 59 >protein:vir:4156 Length: 542 # NCBI annotation: portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046965;genbank:gi:9630535;genbank:GeneID:1261709 Probab=99.64 E-value=1.7e-15 Score=101.62 Aligned_cols=430 Identities=9% Similarity=0.017 Sum_probs=205.7 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAA 80 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~ 80 (556) ||=..-........ ....+.... ..+ ...|..++.+.. ....++.+|+++. T Consensus 6 ~~i~s~~~~~~i~~---------------~~~~s~~~~-~~~-~~~~~~pp~~~~------------~la~l~~~n~~v~ 56 (542) T protein:vir:41 6 LSIRSLEKYKAIKR---------------EEVESQALG-ETR-FEEYVEPKVNPL------------VLLSLLQVNPYHA 56 (542) T ss_pred ccccccccchhhhh---------------ccccccccc-ccc-CCccccCCCCHH------------HHHHHHhhcHHHH Confidence 11111100000000 000111110 111 123322222222 1235889999999 Q ss_pred HHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCce Q lcl|NC_019524. 81 GVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEV 160 (556) Q Consensus 81 ~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~ 160 (556) ++|+.+.++|.+.+|++..... 
...++.+ ++ + .++++++....+..++..|.+ T Consensus 57 scI~~ia~~IA~l~~~~~~~~~------------------~~l~~~l---pN--~----~~s~~~f~~~~v~~lll~Gna 109 (542) T protein:vir:41 57 SACSIKANDIIRTGYILEGDDE------------------GVVDEFI---RA--C----KPSFEYVLLRALEDLQVFNYC 109 (542) T ss_pred HHHHHHHHHHhhCceeeecccc------------------hhhhhhc---CC--C----CCCHHHHHHHHHHHHhhcCCe Confidence 9999999999999988753211 1111111 22 2 368999999999999999999 Q ss_pred EEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceee Q lcl|NC_019524. 161 LATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEP 240 (556) Q Consensus 161 f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~ 240 (556) |+.+.+... ..+..|..|+|+.+.... ++..++... .+...-|...-..+.......+ . T Consensus 110 yi~i~rd~~--------G~~~~L~~l~~~~v~v~~---d~~~~~~~~----~~~~~~~~~~y~~~~~~~~~~g------~ 168 (542) T protein:vir:41 110 TLEVVRDDR--------GDPIRFEYIPSHTIRVHK---DGSRYRQTW----DGVNITHFKDYRYEGEINPETG------E 168 (542) T ss_pred EEEEEEcCC--------CcEEEEEEEcCcceEEEE---cCCeeEeee----cCCcceeEEeeccccccccccc------c Confidence 998764321 236788999998885321 122222211 1111122211111111000000 0 Q ss_pred ccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCccccccccccccccccc Q lcl|NC_019524. 241 ARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFK 320 (556) Q Consensus 241 ~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~ 320 (556) ....+++.+|||+......++..|+|++..++..+.-.....+....-.+=.+...++|+.+..-............+.. 
T Consensus 169 ~~~~~~~~eIiHir~~~~~~~~~Glspi~~~~~~i~~~~~~~~~~~~~f~Ng~~p~gIL~~~~~l~de~~~~~~~~~e~~ 248 (542) T protein:vir:41 169 DQDSVGANELVFIHIPSPVCSYYGVPRYVSAAPAILAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDELEEDPDGNPTGR 248 (542) T ss_pred cccccCcccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCccccccccccccCHHHH Confidence 11357889999998888889999999999988877655544444444444456666777754322111000000000000 Q ss_pred ccccccccccccccccccceecCCceeeecC------CCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchh Q lcl|NC_019524. 321 EIFNEYMTGLANYVAQTKNIAIDGAKIPHLY------PGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDY 394 (556) Q Consensus 321 ~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~------pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~ 394 (556) ........ ....+ ..=.+|.+..|. .|-+++.++.+.-...|.+..+.....||+.+|||-..| |+. T Consensus 249 ~~lk~~~~---~~~~g---~~~n~gk~~vL~~~~~~~~g~~~~pl~~~~~d~qfle~~~~~~~~Ia~afgVPp~~l-G~~ 321 (542) T protein:vir:41 249 TVIQALIE---DNFKH---LKEAPHTPLVFSIPGGDTVKVTFTPLNTSQKELSFREYAAEKKYDIAAAHMIDPYRL-GIA 321 (542) T ss_pred HHHHHHHH---HHHhh---hhcccCceeEeeccCCcccceeEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHh-CcC Confidence 00000000 00000 001245555553 566788877766677888888999999999999998855 776 Q ss_pred hcccc--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcc Q lcl|NC_019524. 395 TKTNY--SSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASR 472 (556) Q Consensus 395 s~~nY--Ss~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~ 472 (556) ++.+| |.+.+.... |....++|+.+. ++.++-...++ ..+.. ..+++..... 
T Consensus 322 ~~~t~n~sn~Eq~~~~-----------f~~~tL~P~~~~-ie~~ln~~L~~-~~~~~-------------~~~~f~~~~l 375 (542) T protein:vir:41 322 DTGPLGGNFAEVTRRT-----------YYESVVRPQQNI-ISSILTDFFQV-KFNPK-------------TRFKFNDETL 375 (542) T ss_pred CCcccccccHHHHHHH-----------HHHHHHHHHHHH-HHHHHHhhccc-ccCCc-------------eEEEecchhh Confidence 55544 655544433 455555665554 34444443221 11110 1122322222 Q ss_pred cccchhhhhHHHHHHHHcCCCCHHHHHHHh-CCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCC------ Q lcl|NC_019524. 473 GQIDEKKETEAAILRIKNGLSTYEAEISRL-GGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSES------ 545 (556) Q Consensus 473 ~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~-G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~------ 545 (556) ...| ...+....+++|+.|+.|+-++. |.++-+.. .+...... .........+.+++ T Consensus 376 l~~d---~~~~~~~~v~~GilT~NE~Re~L~g~~pgdd~--------~l~p~~~~-----~~~~~~~~~n~~~~~~~~~~ 439 (542) T protein:vir:41 376 LESD---SVRNCALLVQSGVLTPAEARERLFGLDGGPDI--------FMVPSKGA-----AKSVKRQERNYEKNQIREIR 439 (542) T ss_pred cchH---HHHHHHHHHhCCCCCHHHHHHhhCCCCCCCcc--------cccccccc-----ccccccCCcCCCCCchhhhh Confidence 2223 33445668999999998874332 55542210 00000000 00000000000000 Q ss_pred --------------------CCCCCCCcCCC Q lcl|NC_019524. 546 --------------------TSDNPNEETTQ 556 (556) Q Consensus 546 --------------------~~~~~~~e~~~ 556 (556) ++.+.+-|..| T Consensus 440 k~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~ 470 (542) T protein:vir:41 440 KIYAKYRPRFNEIISSKLSAEEKKKKIDESL 470 (542) T ss_pred hcccccCccccccccccccchhhcccccchh Confidence 00000000000 No 60 >protein:vir:1023 Length: 392 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076677;genbank:gi:13095786;genbank:GeneID:920364 Probab=99.64 E-value=2.6e-15 Score=100.55 Aligned_cols=387 Identities=9% Similarity=-0.029 Sum_probs=214.7 Q ss_pred CCcchh-hhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHH Q lcl|NC_019524. 
1 MKDVKK-TTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYA 79 (556) Q Consensus 1 ~sp~~~-~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a 79 (556) |.+.-. ..|. +.++... ....+..... ......|.... ....+ +-+-+..++.+ T Consensus 2 ~m~~f~~~~~~--~~~~~~~----------~~~~~~~~~~-~~~~~~~~~~~--~~~~v----------~~~~al~~~~v 56 (392) T protein:vir:10 2 ILPILNFINQT--NDPPEVG----------SVQSYFPDGN-DAQIMESLLGD--NNEWV----------SARAALRNSDL 56 (392) T ss_pred cchhhhhhhcc--ccccccc----------ccccccccCc-hhhhhhhhcCC--CCcee----------chHHhhccHHH Confidence 222211 1110 0000000 0000000000 00000000000 00000 11222357889 Q ss_pred HHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCc Q lcl|NC_019524. 80 AGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGE 159 (556) Q Consensus 80 ~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE 159 (556) ..+|+.+.+.|...-|++.-+. ... .-.+++ -.+|.+++....+..++..|+ T Consensus 57 ~~~i~~ia~~ia~lp~~~~~~~-----------~~~-----------l~~~PN------~~~t~~~f~~~~~~~lll~Gn 108 (392) T protein:vir:10 57 FSIILQLSSDLAIVKINAEKKK-----------NQG-----------IIDNPS------TNANKHGFWQSMFAQLLLGGE 108 (392) T ss_pred HHHHHHHHHhhccCceeeccch-----------hhh-----------HhhcCC------CCCCHHHHHHHHHHHhhhcCc Confidence 9999999999988666543211 111 112454 257899999999999999999 Q ss_pred eEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCcccccee Q lcl|NC_019524. 160 VLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYE 239 (556) Q Consensus 160 ~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv 239 (556) +|+.+++... ..+..|..|+|+.+. |+.|.+|..+.|.+....++.. 
T Consensus 109 a~~~i~r~~~--------g~~~~L~~l~~~~v~--------------~~~~~~~~~~~y~~~~~~~~~~----------- 155 (392) T protein:vir:10 109 AFAYRWRNAN--------GADMKWEYLRPSQVN--------------TYYFEYENGMYYNITFDDPKIE----------- 155 (392) T ss_pred EEEEEEECCC--------CcEEEEEEEcCceeE--------------EEEcCCCceEEEEEEecCcccc----------- Confidence 9999875321 235788999988873 5566778888898876543321 Q ss_pred eccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccc Q lcl|NC_019524. 240 PARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGF 319 (556) Q Consensus 240 ~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~ 319 (556) ....+++++|||+......|...|+|++..+...+.-....+++.....+=.+...++|+.+...... +.. T Consensus 156 -~~~~~~~~eiih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~--------~~~ 226 (392) T protein:vir:10 156 -PILQAPQSDLIHMKLLSIDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLS--------DKD 226 (392) T ss_pred -eeEEEccccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCch--------HHH Confidence 11246788999999888778899999999988888777777766666666677788888865321110 000 Q ss_pred cccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccc Q lcl|NC_019524. 320 KEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNY 399 (556) Q Consensus 320 ~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nY 399 (556) ...... 
.....-..|.+..|+.|.+++.++.+....+|.+..+...+.||+.+|||-..| |+.+ .++ T Consensus 227 ~~~~~~-----------~~~~~~~~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~l-g~~~-~~~ 293 (392) T protein:vir:10 227 KASRSR-----------SFMKRSRSGGPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYI-GGQG-DQQ 293 (392) T ss_pred HHHHHH-----------HHhccccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHh-CCCC-Ccc Confidence 000000 001112457788899999999999877778899999999999999999998866 5643 234 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhh Q lcl|NC_019524. 400 SSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKK 479 (556) Q Consensus 400 Ss~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~K 479 (556) |+. ... ..++...+.|+.+.+ +.++....++ .. .+ -.......|+.. T Consensus 294 ~~~-~~~-----------~~f~~~~l~P~~~~i-e~~l~~~L~~---~~---~~--------------d~~~~~~~d~~~ 340 (392) T protein:vir:10 294 SSI-QQI-----------SGMYASALNRYLRPA-ISELEYKLSD---HI---SV--------------NMRPAIDPLGDN 340 (392) T ss_pred cHH-HHH-----------HHHHHHHHHHHHHHH-HHHHHHhccc---cc---cc--------------cchhhhccCHHH Confidence 332 111 123444456655543 3333332211 10 00 001112247777 Q ss_pred hhHHHHHHHHcCCCCHHHHHHH---hCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCC Q lcl|NC_019524. 480 ETEAAILRIKNGLSTYEAEISR---LGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNP 550 (556) Q Consensus 480 e~~A~~~~i~~G~~s~~~~~ae---~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~ 550 (556) -+......+++|+.|+.|+.+- .|.-+.|+-+ ..|++.. +.+.. 
.+..| T Consensus 341 ~~~~~~~l~~~g~~t~nE~r~~l~~~g~~p~e~r~----------~e~l~~~-------~~Gd~-----~~p~p 392 (392) T protein:vir:10 341 YLSTISTATRWGALAENQATFVLQEAGYIPKDLPA----------PENTNKK-------TTGQS-----NEPVP 392 (392) T ss_pred HHHHHHHHHhCCCcCHHHHHHHHHhcCCCccccch----------hcCCCCC-------CCCCC-----CCCCC Confidence 7788889999999999987542 2444433210 1122210 00000 00111 No 61 >protein:vir:3989 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116497;genbank:gi:14251130;genbank:GeneID:921299 Probab=99.64 E-value=2.6e-15 Score=100.55 Aligned_cols=387 Identities=9% Similarity=-0.029 Sum_probs=214.7 Q ss_pred CCcchh-hhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHH Q lcl|NC_019524. 1 MKDVKK-TTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYA 79 (556) Q Consensus 1 ~sp~~~-~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a 79 (556) |.+.-. ..|. +.++... ....+..... ......|.... ....+ +-+-+..++.+ T Consensus 2 ~m~~f~~~~~~--~~~~~~~----------~~~~~~~~~~-~~~~~~~~~~~--~~~~v----------~~~~al~~~~v 56 (392) T protein:vir:39 2 ILPILNFINQT--NDPPEVG----------SVQSYFPDGN-DAQIMESLLGD--NNEWV----------SARAALRNSDL 56 (392) T ss_pred cchhhhhhhcc--ccccccc----------ccccccccCc-hhhhhhhhcCC--CCcee----------chHHhhccHHH Confidence 222211 1110 0000000 0000000000 00000000000 00000 11222357889 Q ss_pred HHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCc Q lcl|NC_019524. 80 AGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGE 159 (556) Q Consensus 80 ~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE 159 (556) ..+|+.+.+.|...-|++.-+. ... 
.-.+++ -.+|.+++....+..++..|+ T Consensus 57 ~~~i~~ia~~ia~lp~~~~~~~-----------~~~-----------l~~~PN------~~~t~~~f~~~~~~~lll~Gn 108 (392) T protein:vir:39 57 FSIILQLSSDLAIVKINAEKKK-----------NQG-----------IIDNPS------TNANKHGFWQSMFAQLLLGGE 108 (392) T ss_pred HHHHHHHHHhhccCceeeccch-----------hhh-----------HhhcCC------CCCCHHHHHHHHHHHhhhcCc Confidence 9999999999988666543211 111 112454 257899999999999999999 Q ss_pred eEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCcccccee Q lcl|NC_019524. 160 VLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYE 239 (556) Q Consensus 160 ~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv 239 (556) +|+.+++... ..+..|..|+|+.+. |+.|.+|..+.|.+....++.. T Consensus 109 a~~~i~r~~~--------g~~~~L~~l~~~~v~--------------~~~~~~~~~~~y~~~~~~~~~~----------- 155 (392) T protein:vir:39 109 AFAYRWRNAN--------GADMKWEYLRPSQVN--------------TYYFEYENGMYYNITFDDPKIE----------- 155 (392) T ss_pred EEEEEEECCC--------CcEEEEEEEcCceeE--------------EEEcCCCceEEEEEEecCcccc----------- Confidence 9999875321 235788999988873 5566778888898876543321 Q ss_pred eccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccc Q lcl|NC_019524. 240 PARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGF 319 (556) Q Consensus 240 ~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~ 319 (556) ....+++++|||+......|...|+|++..+...+.-....+++.....+=.+...++|+.+...... +.. 
T Consensus 156 -~~~~~~~~eiih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~--------~~~ 226 (392) T protein:vir:39 156 -PILQAPQSDLIHMKLLSIDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLS--------DKD 226 (392) T ss_pred -eeEEEccccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCch--------HHH Confidence 11246788999999888778899999999988888777777766666666677788888865321110 000 Q ss_pred cccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccc Q lcl|NC_019524. 320 KEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNY 399 (556) Q Consensus 320 ~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nY 399 (556) ...... .....-..|.+..|+.|.+++.++.+....+|.+..+...+.||+.+|||-..| |+.+ .++ T Consensus 227 ~~~~~~-----------~~~~~~~~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~l-g~~~-~~~ 293 (392) T protein:vir:39 227 KASRSR-----------SFMKRSRSGGPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYI-GGQG-DQQ 293 (392) T ss_pred HHHHHH-----------HHhccccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHh-CCCC-Ccc Confidence 000000 001112457788899999999999877778899999999999999999998866 5643 234 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhh Q lcl|NC_019524. 400 SSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKK 479 (556) Q Consensus 400 Ss~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~K 479 (556) |+. ... ..++...+.|+.+.+ +.++....++ .. .+ -.......|+.. 
T Consensus 294 ~~~-~~~-----------~~f~~~~l~P~~~~i-e~~l~~~L~~---~~---~~--------------d~~~~~~~d~~~ 340 (392) T protein:vir:39 294 SSI-QQI-----------SGMYASALNRYLRPA-ISELEYKLSD---HI---SV--------------NMRPAIDPLGDN 340 (392) T ss_pred cHH-HHH-----------HHHHHHHHHHHHHHH-HHHHHHhccc---cc---cc--------------cchhhhccCHHH Confidence 332 111 123444456655543 3333332211 10 00 001112247777 Q ss_pred hhHHHHHHHHcCCCCHHHHHHH---hCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCC Q lcl|NC_019524. 480 ETEAAILRIKNGLSTYEAEISR---LGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNP 550 (556) Q Consensus 480 e~~A~~~~i~~G~~s~~~~~ae---~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~ 550 (556) -+......+++|+.|+.|+.+- .|.-+.|+-+ ..|++.. +.+.. .+..| T Consensus 341 ~~~~~~~l~~~g~~t~nE~r~~l~~~g~~p~e~r~----------~e~l~~~-------~~Gd~-----~~p~p 392 (392) T protein:vir:39 341 YLSTISTATRWGALAENQATFVLQEAGYIPKDLPA----------PENTNKK-------TTGQS-----NEPVP 392 (392) T ss_pred HHHHHHHHHhCCCcCHHHHHHHHHhcCCCccccch----------hcCCCCC-------CCCCC-----CCCCC Confidence 7788889999999999987542 2444433210 1122210 00000 00111 No 62 >protein:vir:95599 Length: 563 # NCBI annotation: ORF014 # Family: family:all:2446 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240900;genbank:gi:66394963;genbank:GeneID:5132540 Probab=99.64 E-value=4.5e-16 Score=104.71 Aligned_cols=459 Identities=10% Similarity=0.033 Sum_probs=218.8 Q ss_pred CCcchhhhHH--HHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChH Q lcl|NC_019524. 1 MKDVKKTTRT--RAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGY 78 (556) Q Consensus 1 ~sp~~~~~r~--~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~ 78 (556) ++-..--++. ...++.....++... ......+... 
....+ +....+...+ ..-.+ ...||++ T Consensus 33 ~~~~~~~~~~~~~~~~~~~~~~~a~~~---~~~~~~~~~~----~~~~~-~~~~~~~~~l-------~~~l~-~~~~n~i 96 (563) T protein:vir:95 33 IKKIEQDNKEYQDLTKSLYGQQQAYAE---PFIEMMDTNP----EFRDK-RSYMKNEHNL-------HDVLK-KFGNNPI 96 (563) T ss_pred HhhhhccchhHHHHHhhhccCCCcchh---hhHhhhcccc----ccccc-ccCCCCcccH-------HHHHH-HhhcchH Confidence 1100000000 000111000000000 0001111110 01111 1111111111 11122 2336899 Q ss_pred HHHHHHHHHhhhccC---------CceeeeeccccccCCChhHHHHHHHHHHHHHHHHhccc--ccceehhcccCHHHHH Q lcl|NC_019524. 79 AAGVVAVHRDSIVGS---------QYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESP--ENWFDARRMCTLTGLT 147 (556) Q Consensus 79 a~~~v~~~~~nvVG~---------Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~--~~~cD~~g~~~f~~lq 147 (556) ++.+|+++.++|-.. |+.+..+........+.++ ...+. .+..+.... +..++ +.+|.++. T Consensus 97 ~~~~I~t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~~~~~~----~~~~~-~l~~~l~~~~~~~~p~---~~t~~~f~ 168 (563) T protein:vir:95 97 LNAIILTRSNQVAMYCQPARYSEKGLGFEVRLRDLDAEPGRKE----KEEMK-RIEDFIVNTGKDKDVD---RDSFQTFC 168 (563) T ss_pred HHHHHHHHHHHHHHHhhhhhhhcccccceeEEeecCCCcchhh----hhhhH-HHHHHhhhcCCCCCCC---cchHHHHH Confidence 999999998887631 2222222211111111111 11111 112222111 11121 35899999 Q ss_pred HHHhhhheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCc Q lcl|NC_019524. 148 RLAVSGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGD 227 (556) Q Consensus 148 ~l~~r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd 227 (556) +.++..++..|.+++.+++.+. +...+..|..|+|.+|..-.+ .++.... ....|+... 
+ T Consensus 169 ~~lv~~lll~Gn~~~~~~~~rd------~~G~~~~L~pl~p~~V~v~~~-~~g~~~~---------~~~~y~~~~-~--- 228 (563) T protein:vir:95 169 KKIVRDTYIYDQVNFEKVFNKN------NKTKLEKFIAVDPSTIFYATD-KKGKIIK---------GGKRFVQVV-D--- 228 (563) T ss_pred HHHHHHHHhcCCeEEEEEEEec------CCCceEEEEEeCCceeEEEEC-CCCceec---------cceeEEEEe-C--- Confidence 9999999999999987654332 122467889999998853211 1111111 111222211 0 Q ss_pred cccCCccccceeeccccCChhHeEeeecccCCC---cccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCc Q lcl|NC_019524. 228 PTDMEQWKWGYEPARFDWGRRRVIHIIEALLAG---QTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELP 304 (556) Q Consensus 228 ~~~~~~~~~~rv~~~~~v~a~~viH~f~~~r~g---Q~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~ 304 (556) +. ....+++.+|||+...-.++ ...|+|++..+...+......++....-.+=.+...++|+-+.+ T Consensus 229 -----g~------~~~~~~~~evI~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~ng~~p~giL~~~~~ 297 (563) T protein:vir:95 229 -----KR------VVASFTSRELAMGIRNPRTELSSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRSD 297 (563) T ss_pred -----Cc------eeEEecCcceEEEeccCCCCcccCcccchHHHHHHHHHHHHHHHHHHHHHHHHccCCCceEEEeCCC Confidence 00 01235567887665543333 66799999998888877776666666666667778888875432 Q ss_pred ccccccccccccccccccccccccccccccccccceecCCcee-eecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhc Q lcl|NC_019524. 305 SDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKI-PHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASL 383 (556) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i-~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaagl 383 (556) .... .+.......... 
....+ .-..|.+ ..|..|.+++.++.+.-...|.+..+.....||+.+ T Consensus 298 ~~ls--------~e~~~~~~~~~~---~~~~G----~~nagk~~~vl~~G~~~~~l~~~~~d~qfle~~~~~~~~Ia~af 362 (563) T protein:vir:95 298 QQQS--------QHALENFKREWK---SSLSG----INGSWQIPVVMADDIKFVNMTPTANDMQFEKWLNYLINIISALY 362 (563) T ss_pred CCCC--------HHHHHHHHHHHH---HHhcc----ccccccceEEcCCCceEEeccCChhHHHHHHHHHHHHHHHHHHh Confidence 1100 000000000000 00000 0124554 678999999999988788889999999999999999 Q ss_pred CCCHHHhhchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhh Q lcl|NC_019524. 384 GMSYEQFSRDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALC 463 (556) Q Consensus 384 Gi~ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~ 463 (556) |||.+.| |+.++++|++.-.+-.-.+..++..+..++...++|+..+ ++.++....++ .+... . T Consensus 363 gVPp~~l-G~~~~~~~~~~~~~ss~~~sn~e~~~~~f~~~tL~P~l~~-ie~~ln~~L~~--~~~~~------------~ 426 (563) T protein:vir:95 363 GIDPAEI-GFPNRGGATGSKGGSTLNEADPGKKQQQSQNKGLQPLLRF-IEDLVNRHIIS--EYGDK------------Y 426 (563) T ss_pred CCCHHHc-cccccccccccccccchhhccHHHHHHHHHHHHHHHHHHH-HHHHHHhhhch--hcccc------------c Confidence 9999865 8888777755544333333455555666778888887775 55556554432 11110 1 Q ss_pred CeeeecCcccccchhhhhHHH--HHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCc------------ Q lcl|NC_019524. 464 NAEWIGASRGQIDEKKETEAA--ILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTG------------ 529 (556) Q Consensus 464 ~~~w~~p~~~~iDP~Ke~~A~--~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~------------ 529 (556) .++|. -.|.....++. ...+.+|+.|+-|+-+..|+.|-+--+.+- ..+.+.... 
T Consensus 427 ~~~f~-----r~D~~~~~e~~~~~~~~~~G~lT~NE~R~~~gl~Pi~gGD~~~------~~~~~~~~~~~~~~~~~~~~~ 495 (563) T protein:vir:95 427 TFQFV-----GGDTKSATDKLNILKLETQIFKTVNEAREEQGKKPIEGGDIIL------DASFLQGTAQLQQDKQYNDGK 495 (563) T ss_pred EEEec-----cCCHHHHHHHHHHHHHhcCCccCHHHHHHHhCCCCCCCcceee------cccccccccccccccCCCccc Confidence 22332 23555544433 345889999999999999998765222110 000000000 Q ss_pred -----cccc-cCCCCCCCCCCCCCCCC-CCcCCC Q lcl|NC_019524. 530 -----KMVE-GNSTQSSNSSESTSDNP-NEETTQ 556 (556) Q Consensus 530 -----~~~~-~~~~~~~~~~~~~~~~~-~~e~~~ 556 (556) .+.. ........++.+.+.++ ++++++ T Consensus 496 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 529 (563) T protein:vir:95 496 QKERLQMMMSLLEGDNDDSEEGQSTDSSNDDKEI 529 (563) T ss_pred cchhhhhcccccCCCCCCCCCCCCCCCCCCcccc Confidence 0000 00000001111111111 111111 No 63 >protein:vir:99312 Length: 563 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024471;genbank:gi:48696430;genbank:GeneID:2948040 Probab=99.64 E-value=4.5e-16 Score=104.71 Aligned_cols=459 Identities=10% Similarity=0.033 Sum_probs=218.8 Q ss_pred CCcchhhhHH--HHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChH Q lcl|NC_019524. 1 MKDVKKTTRT--RAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGY 78 (556) Q Consensus 1 ~sp~~~~~r~--~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~ 78 (556) ++-..--++. ...++.....++... ......+... ....+ +....+...+ ..-.+ ...||++ T Consensus 33 ~~~~~~~~~~~~~~~~~~~~~~~a~~~---~~~~~~~~~~----~~~~~-~~~~~~~~~l-------~~~l~-~~~~n~i 96 (563) T protein:vir:99 33 IKKIEQDNKEYQDLTKSLYGQQQAYAE---PFIEMMDTNP----EFRDK-RSYMKNEHNL-------HDVLK-KFGNNPI 96 (563) T ss_pred HhhhhccchhHHHHHhhhccCCCcchh---hhHhhhcccc----ccccc-ccCCCCcccH-------HHHHH-HhhcchH Confidence 1100000000 000111000000000 0001111110 01111 1111111111 11122 2336899 Q ss_pred HHHHHHHHHhhhccC---------CceeeeeccccccCCChhHHHHHHHHHHHHHHHHhccc--ccceehhcccCHHHHH Q lcl|NC_019524. 
79 AAGVVAVHRDSIVGS---------QYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESP--ENWFDARRMCTLTGLT 147 (556) Q Consensus 79 a~~~v~~~~~nvVG~---------Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~--~~~cD~~g~~~f~~lq 147 (556) ++.+|+++.++|-.. |+.+..+........+.++ ...+. .+..+.... +..++ +.+|.++. T Consensus 97 ~~~~I~t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~~~~~~----~~~~~-~l~~~l~~~~~~~~p~---~~t~~~f~ 168 (563) T protein:vir:99 97 LNAIILTRSNQVAMYCQPARYSEKGLGFEVRLRDLDAEPGRKE----KEEMK-RIEDFIVNTGKDKDVD---RDSFQTFC 168 (563) T ss_pred HHHHHHHHHHHHHHHhhhhhhhcccccceeEEeecCCCcchhh----hhhhH-HHHHHhhhcCCCCCCC---cchHHHHH Confidence 999999998887631 2222222211111111111 11111 112222111 11121 35899999 Q ss_pred HHHhhhheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCc Q lcl|NC_019524. 148 RLAVSGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGD 227 (556) Q Consensus 148 ~l~~r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd 227 (556) +.++..++..|.+++.+++.+. +...+..|..|+|.+|..-.+ .++.... ....|+... + T Consensus 169 ~~lv~~lll~Gn~~~~~~~~rd------~~G~~~~L~pl~p~~V~v~~~-~~g~~~~---------~~~~y~~~~-~--- 228 (563) T protein:vir:99 169 KKIVRDTYIYDQVNFEKVFNKN------NKTKLEKFIAVDPSTIFYATD-KKGKIIK---------GGKRFVQVV-D--- 228 (563) T ss_pred HHHHHHHHhcCCeEEEEEEEec------CCCceEEEEEeCCceeEEEEC-CCCceec---------cceeEEEEe-C--- Confidence 9999999999999987654332 122467889999998853211 1111111 111222211 0 Q ss_pred cccCCccccceeeccccCChhHeEeeecccCCC---cccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCc Q lcl|NC_019524. 228 PTDMEQWKWGYEPARFDWGRRRVIHIIEALLAG---QTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELP 304 (556) Q Consensus 228 ~~~~~~~~~~rv~~~~~v~a~~viH~f~~~r~g---Q~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~ 304 (556) +. 
....+++.+|||+...-.++ ...|+|++..+...+......++....-.+=.+...++|+-+.+ T Consensus 229 -----g~------~~~~~~~~evI~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~ng~~p~giL~~~~~ 297 (563) T protein:vir:99 229 -----KR------VVASFTSRELAMGIRNPRTELSSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRSD 297 (563) T ss_pred -----Cc------eeEEecCcceEEEeccCCCCcccCcccchHHHHHHHHHHHHHHHHHHHHHHHHccCCCceEEEeCCC Confidence 00 01235567887665543333 66799999998888877776666666666667778888875432 Q ss_pred ccccccccccccccccccccccccccccccccccceecCCcee-eecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhc Q lcl|NC_019524. 305 SDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKI-PHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASL 383 (556) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i-~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaagl 383 (556) .... .+.......... ....+ .-..|.+ ..|..|.+++.++.+.-...|.+..+.....||+.+ T Consensus 298 ~~ls--------~e~~~~~~~~~~---~~~~G----~~nagk~~~vl~~G~~~~~l~~~~~d~qfle~~~~~~~~Ia~af 362 (563) T protein:vir:99 298 QQQS--------QHALENFKREWK---SSLSG----INGSWQIPVVMADDIKFVNMTPTANDMQFEKWLNYLINIISALY 362 (563) T ss_pred CCCC--------HHHHHHHHHHHH---HHhcc----ccccccceEEcCCCceEEeccCChhHHHHHHHHHHHHHHHHHHh Confidence 1100 000000000000 00000 0124554 678999999999988788889999999999999999 Q ss_pred CCCHHHhhchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhh Q lcl|NC_019524. 384 GMSYEQFSRDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALC 463 (556) Q Consensus 384 Gi~ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~ 463 (556) |||.+.| |+.++++|++.-.+-.-.+..++..+..++...++|+..+ ++.++....++ .+... . 
T Consensus 363 gVPp~~l-G~~~~~~~~~~~~~ss~~~sn~e~~~~~f~~~tL~P~l~~-ie~~ln~~L~~--~~~~~------------~ 426 (563) T protein:vir:99 363 GIDPAEI-GFPNRGGATGSKGGSTLNEADPGKKQQQSQNKGLQPLLRF-IEDLVNRHIIS--EYGDK------------Y 426 (563) T ss_pred CCCHHHc-cccccccccccccccchhhccHHHHHHHHHHHHHHHHHHH-HHHHHHhhhch--hcccc------------c Confidence 9999865 8888777755544333333455555666778888887775 55556554432 11110 1 Q ss_pred CeeeecCcccccchhhhhHHH--HHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCc------------ Q lcl|NC_019524. 464 NAEWIGASRGQIDEKKETEAA--ILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTG------------ 529 (556) Q Consensus 464 ~~~w~~p~~~~iDP~Ke~~A~--~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~------------ 529 (556) .++|. -.|.....++. ...+.+|+.|+-|+-+..|+.|-+--+.+- ..+.+.... T Consensus 427 ~~~f~-----r~D~~~~~e~~~~~~~~~~G~lT~NE~R~~~gl~Pi~gGD~~~------~~~~~~~~~~~~~~~~~~~~~ 495 (563) T protein:vir:99 427 TFQFV-----GGDTKSATDKLNILKLETQIFKTVNEAREEQGKKPIEGGDIIL------DASFLQGTAQLQQDKQYNDGK 495 (563) T ss_pred EEEec-----cCCHHHHHHHHHHHHHhcCCccCHHHHHHHhCCCCCCCcceee------cccccccccccccccCCCccc Confidence 22332 23555544433 345889999999999999998765222110 000000000 Q ss_pred -----cccc-cCCCCCCCCCCCCCCCC-CCcCCC Q lcl|NC_019524. 530 -----KMVE-GNSTQSSNSSESTSDNP-NEETTQ 556 (556) Q Consensus 530 -----~~~~-~~~~~~~~~~~~~~~~~-~~e~~~ 556 (556) .+.. 
........++.+.+.++ ++++++ T Consensus 496 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 529 (563) T protein:vir:99 496 QKERLQMMMSLLEGDNDDSEEGQSTDSSNDDKEI 529 (563) T ss_pred cchhhhhcccccCCCCCCCCCCCCCCCCCCcccc Confidence 0000 00000001111111111 111111 No 64 >protein:vir:100187 Length: 385 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025029;genbank:gi:48697262;genbank:GeneID:2948285 Probab=99.64 E-value=5.4e-16 Score=104.28 Aligned_cols=380 Identities=11% Similarity=0.007 Sum_probs=204.0 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhcccc-CCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAE-RTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYA 79 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~-~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a 79 (556) |+=++.. ...++. .....+.... .....+.++.....+ .+-+-.++.+ T Consensus 1 Mg~~~~~--~~~~~~-------------~~~~~~~~~~~~~~~~~~~~~~~~v~----------------~~~al~~~~v 49 (385) T protein:vir:10 1 MGLLTPR--NFNKRK-------------AKNMVYPSNPAFFTTTVGGMQLSYVS----------------ALSALQNTNV 49 (385) T ss_pred Cccccch--hccccc-------------ccccccccchhhhhhhccccCccccC----------------HHHhhccHHH Confidence 2211110 000000 0000000000 000001111111111 1123346778 Q ss_pred HHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCc Q lcl|NC_019524. 80 AGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGE 159 (556) Q Consensus 80 ~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE 159 (556) ..+|+.+.+.+-..-|++.-.. +.. +-.+|+. 
-+|..++....+..++..|+ T Consensus 50 ~~~i~~ia~~ia~~p~~v~~~~------------------~~~----ll~~PN~------~~t~~~f~~~~~~~l~l~Gn 101 (385) T protein:vir:10 50 YSVINRIASDVASAHFKTENTA------------------TLN----RLESPSS------LIGRFSFWQGALMQLCLSGN 101 (385) T ss_pred HHHHHHHHHHHhhCceeeeccc------------------hhh----hhhcCCC------CCCHHHHHHHHHHHhhhcCC Confidence 8999999999988766653211 111 1224542 36899999999999999999 Q ss_pred eEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCcccccee Q lcl|NC_019524. 160 VLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYE 239 (556) Q Consensus 160 ~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv 239 (556) +|+.+... + ++++.++... |+...++..+-|++.....+. T Consensus 102 ~~~~i~r~------------~--~~~~p~~~~~--------------v~~~~~~~~~~~~~~~~~~~~------------ 141 (385) T protein:vir:10 102 DYIPLVGQ------------N--LEHIPNSDVQ--------------INYLPGNMGIVYTVLESNDRP------------ 141 (385) T ss_pred eEEEEEcC------------c--eeEeecCCce--------------EEEEEcCCceEEEEEEcCCce------------ Confidence 99987521 1 2233322221 122222223344443221111 Q ss_pred eccccCChhHeEeeeccc--CCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccc Q lcl|NC_019524. 240 PARFDWGRRRVIHIIEAL--LAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQG 317 (556) Q Consensus 240 ~~~~~v~a~~viH~f~~~--r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~ 317 (556) ...+++++|||+.... .-+..+|+|++..+...+......++......+=.+...++|+.+..-.. . T Consensus 142 --~~~~~~~eiihik~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~gil~~~~~~~~---------~ 210 (385) T protein:vir:10 142 --QMVLRQDQMLHFRLMPDPQYRYLIGRSPLESLQNALNLDDKASKSNMSAMENQINPAGKLTISNYLSD---------G 210 (385) T ss_pred --EEEEccccEEEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCC---------H Confidence 1246778999998643 34578999999998888887777777777777777888889887532110 0 Q ss_pred cccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccH-HHHHHHHHHHHHHhcCCCHHHhhc-hhh Q lcl|NC_019524. 
318 GFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVG-TDYEQSLLRNIAASLGMSYEQFSR-DYT 395 (556) Q Consensus 318 ~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f-~~F~~~~lr~iaaglGi~ye~l~~-D~s 395 (556) +..+........ .. ..-..|.+..|+.|.+++.++.+....+| .+..+....+||..+|||-+.|.+ |.+ T Consensus 211 e~~~~~~~~~~~---~~-----~~~n~~~~~vl~~g~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~~fgVp~~~lg~~~~~ 282 (385) T protein:vir:10 211 KDLESAREEFEK---AN-----TGDNSGRLMVLPDGFDYTQLEMKTDVFKALADNSAYSADQISKAFGVPSDILGGGTST 282 (385) T ss_pred HHHHHHHHHHHH---Hh-----CccccCCccccCCCceEEecCCChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCC Confidence 111111111100 00 01135678889999999999877666665 477888899999999999988865 677 Q ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCccccc Q lcl|NC_019524. 396 KTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQI 475 (556) Q Consensus 396 ~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~i 475 (556) +.+||++-+....+.. .+.|+... ++.++....+. + .+++..-..... T Consensus 283 ~~~~sn~eq~~~~~~~------------~l~P~~~~-ie~~l~~~l~~-~------------------~~~f~~~~ll~~ 330 (385) T protein:vir:10 283 ESQHSNIDQIKATYLA------------NLNSYVNP-IVDELRLKMNA-P------------------DLELDIKDMLDV 330 (385) T ss_pred CcccccHHHHHHHHHH------------HHHHHHHH-HHHHHHHhhCC-c------------------eEEeechhhhcc Confidence 7788887543332211 23444433 23333332221 1 022333334445 Q ss_pred chhhhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCC Q lcl|NC_019524. 476 DEKKETEAAILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDN 549 (556) Q Consensus 476 DP~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~ 549 (556) |+...+++..+.+.+|+.|+-|+-...|..+ ++. 
.+........+..+..+++|+ T Consensus 331 d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p------------------~p~-~~~~~~~~~~~~~~~g~~~dn 385 (385) T protein:vir:10 331 DDSALINQVSNLAKSGVLGAEQAQFILTRSG------------------FLP-DNLPEFKPLTTQVKGGDEGDN 385 (385) T ss_pred CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCc------------------cCC-CCCccccCcccccCCCCCCCC Confidence 9999999999999999999999887777655 221 111111111111222222222 No 65 >protein:vir:81152 Length: 411 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285809;genbank:gi:148747730;genbank:GeneID:5247195 Probab=99.63 E-value=3.6e-16 Score=105.27 Aligned_cols=406 Identities=13% Similarity=0.063 Sum_probs=224.5 Q ss_pred chhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHH Q lcl|NC_019524. 4 VKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVV 83 (556) Q Consensus 4 ~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v 83 (556) |.=..|.+.... + .-+....++.....|.... .-+.+.+.-++-+..+| T Consensus 1 MG~~~~~~~~~~-----~-----------~~~~~~~~~~~~~~~~g~~---------------~~~~~~al~~~~V~~~v 49 (411) T protein:vir:81 1 MGWWSRLTRFFR-----P-----------RNETVDMTNPLLLQWLGVD---------------PDTPRNQLSEATYFACL 49 (411) T ss_pred CchHHHHHhhcc-----C-----------cccccccchHHHHHHhcCc---------------ccChhhhhccHHHHHHH Confidence 333333211000 0 0000011111122221100 01112223356788999 Q ss_pred HHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEE Q lcl|NC_019524. 84 AVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLAT 163 (556) Q Consensus 84 ~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~ 163 (556) +.|.+.|-..-|++.-+.+... .. ..-..+++.+..+|+ --+|..++....+..++..|++|+. 
T Consensus 50 ~~Ia~~iA~lp~~~~~~~~~~~-----~~-----~~~~~l~~lL~~~PN------~~~t~~~f~~~l~~~lll~Gna~~~ 113 (411) T protein:vir:81 50 KILSESLGKLPLKMYQKTERGI-----VK-----SDREELYNLLKLRPN------PYMTSSVFWSTVEMNRNHYGNAYVW 113 (411) T ss_pred HHHHHhHhhCceeEEEecCCce-----ee-----ecccHHHHHHhhccC------CCCCHHHHHHHHHHHHhhcCCeEEE Confidence 9999988877777654322110 00 000123333434554 3568999999999999999999998 Q ss_pred EeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCC-----CeEEEEEeecCCCccccCCccccce Q lcl|NC_019524. 164 CEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNG-----AALGYWLRKAFPGDPTDMEQWKWGY 238 (556) Q Consensus 164 ~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~G-----r~vaY~i~~~hpgd~~~~~~~~~~r 238 (556) +++.. + -+..|..|.|+.+.. ..|+.| ..+-|.+.....| T Consensus 114 i~r~~-------g--~~~~l~~l~~~~v~~--------------~~~~~~~~~~~~~~~~~~~~~~~g------------ 158 (411) T protein:vir:81 114 CQYSG-------P--QLQALWILPSQYVTI--------------VVDDRGLLGEKNAIWYRYNDPYDG------------ 158 (411) T ss_pred EEecC-------C--ceEEEEEECCceEEE--------------EEcCcccccccceEEEEEEecCCc------------ Confidence 76431 1 245678888888742 222222 1222333222111 Q ss_pred eeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCccccccccccccccc Q lcl|NC_019524. 239 EPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGG 318 (556) Q Consensus 239 v~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~ 318 (556) ....+++++|||+......+...|+|++..+...+.......++.....+=.+...++|+.+..-. .+ T Consensus 159 --~~~~~~~~eiih~k~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~----------~e 226 (411) T protein:vir:81 159 --KMYVFRNDEILHFKTSVTFDGITGLSVRDVLKHTVDGALESQKFMNNLYKTGLTGKAVLEYTGDLN----------QE 226 (411) T ss_pred --eEEEEccccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCC----------HH Confidence 112467889999987777889999999999988888888777777777777788889988653211 01 Q ss_pred ccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhccc Q lcl|NC_019524. 
319 FKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTN 398 (556) Q Consensus 319 ~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~n 398 (556) ......... .....+. =..|.+..|..|.+++.++.+.-..+|.+..+.....||+.+|||-+.| |+..+.| T Consensus 227 ~~~~~~~~~---~~~~~g~----~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~t 298 (411) T protein:vir:81 227 ARDRLVKGF---EQFANGS----KNAGKIIPVPLGMKLVPLDIKLTDSQFFELKKYTALQIAAAFGIKPNQI-NDYEKSS 298 (411) T ss_pred HHHHHHHHH---HHHhcCc----cccCCceecCCCceEEEccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHh-CCCCCCC Confidence 100000000 0000000 0246678899999999998776677888888999999999999998855 8888899 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchh Q lcl|NC_019524. 399 YSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEK 478 (556) Q Consensus 399 YSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~ 478 (556) ||++.+....|+ ..-+.|+...+ +.++....++-..... .. .+++-.-.....|+. T Consensus 299 ~~n~e~~~~~f~-----------~~~l~P~~~~i-e~~l~~~ll~~~~~~~----------~~--~~~fd~~~ll~~d~~ 354 (411) T protein:vir:81 299 YASAEAQNLAFY-----------VDTLLYVLKQY-EEEITYKILSNDLISQ----------GH--YFKFNVNVILRADIK 354 (411) T ss_pred chhHHHHHHHHH-----------HHHHHHHHHHH-HHHHHhhcCChhhcCC----------Cc--EEEeechhhhccCHH Confidence 999877655543 33455655543 3333333332110000 00 011212222335888 Q ss_pred hhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCcC Q lcl|NC_019524. 479 KETEAAILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEET 554 (556) Q Consensus 479 Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e~ 554 (556) ..+++....+.+|+.|+-|+-+..|..|.+--+++ =++....+..... 
.+ ...-+|+ T Consensus 355 ~~~~~~~~~~~~g~~t~NE~R~~~gl~p~~ggD~~----------~~~~n~~pl~~~~---~~------~~kgGd~ 411 (411) T protein:vir:81 355 TQMDSLSTAVQNGIMTPNEARDYLDMPADDYGNNL----------MANGNYIPLSMLG---AN------YGKGGDS 411 (411) T ss_pred HHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCee----------eeccCccchhhhh---hh------hccCCCC Confidence 89999999999999999998777788774311110 0010111110000 00 0001111 No 66 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=99.63 E-value=1.1e-13 Score=91.54 Aligned_cols=452 Identities=8% Similarity=-0.042 Sum_probs=219.7 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAA 80 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~ 80 (556) |.++.-....-.+-......+. .+- .....|+-+... ...... ...+ ....+.+.+.+ +.|++ T Consensus 9 l~~~~~~~~~~~~l~~~~~~~~--~r~-~~~~~YY~g~~~---i~~~~~--~~~~--------~~~~~~~~~~~-~n~~~ 71 (479) T protein:vir:99 9 LSSEGLAKYLETKVFPKMNTEC--ERL-DDFEAWTKNGQE---VPDLAT--RHKN--------KEREVLQQLSR-KPWMG 71 (479) T ss_pred CChhHHHHHHHHHHHHHHHHHh--HHH-HHHHHHHhcCCc---cccccc--ccCC--------hhHHHHHHHhh-cCcHH Confidence 5555433222111000111111 111 112234433221 111111 1111 11112222222 36899 Q ss_pred HHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCce Q lcl|NC_019524. 81 GVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEV 160 (556) Q Consensus 81 ~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~ 160 (556) -+|+.++++++..||+.. + .+ ..+.+ |+-|.. 
-+|...+..+++..+.-|.+ T Consensus 72 ~iVd~~~~~l~~~gf~~~---d-------~~----~~~~~---~~i~~~-----------N~~d~~~~~~~~~a~~~G~a 123 (479) T protein:vir:99 72 LMVNSFAQQLIVDGYRKT---G-------TN----ENAKG---WDTWRL-----------NQMDKQQFWLNRAVLTFGYA 123 (479) T ss_pred HHHHHHHhhcccccccCC---C-------ch----hhHHH---HHHHHh-----------cChhHHHHHHHHHHhhcCce Confidence 999999999988888642 1 11 12333 333322 14667788888899999999 Q ss_pred EEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCC-CCCceEEEEEEECCCCCeEEE-----EEeecCCCccccCCcc Q lcl|NC_019524. 161 LATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNV-MDTPNLRSGVQLDNNGAALGY-----WLRKAFPGDPTDMEQW 234 (556) Q Consensus 161 f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~-~~g~~i~~GIE~d~~Gr~vaY-----~i~~~hpgd~~~~~~~ 234 (556) |+.. +.... ..++.+. .++.+++|..+-.-++. .........|+++..+...-| +++...-+.+... T Consensus 124 f~~v-~~~~~--~~d~~g~-~~i~~~~p~~~~~iydd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--- 196 (479) T protein:vir:99 124 FIKV-TSGIS--PLDGTTV-ARIKCIDPRDAFAIWEDPYWDEWPKYLLERQPNGQYWWWTEEDYSIFEFKQGKFIYR--- 196 (479) T ss_pred EEEE-ecCCC--CcCCCCc-eEEEEechhheEEEecCCcccceeeEEEeecCceeEEEEecceEEEEEecCCceeec--- Confidence 9765 32111 1222333 36889999987533322 223345667888877755433 1222221111100 Q ss_pred ccceeeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCccccccccccc Q lcl|NC_019524. 235 KWGYEPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGM 314 (556) Q Consensus 235 ~~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~ 314 (556) ..+|- ..+.-=|+|+....+.+ .+|.|.|.+++..+..+++...-....+...|.--.+|+-...... 
T Consensus 197 --~~~~h--~~g~vPvv~f~n~~~~~-~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~------- 264 (479) T protein:vir:99 197 --ETVSH--DYGHIPFVRYVNVMDLR-GVCYGDVEPLVTVAKAIDKTGLDILLVQHHQSFQIRWATGLMLPEG------- 264 (479) T ss_pred --ccccc--CCCCcceEEeecCCCcC-cCCcchhHHHHHHHHHHHHHHHHHHHHHHHhhchhhhhcCCCcccc------- Confidence 11111 12333377777765553 3799999998776555544433322223222222223321100000 Q ss_pred ccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCC-CccHHHHHHHHHHHHHHhcCCCHHHhhch Q lcl|NC_019524. 315 GQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTP-GGVGTDYEQSLLRNIAASLGMSYEQFSRD 393 (556) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p-~~~f~~F~~~~lr~iaaglGi~ye~l~~D 393 (556) .. .......+..+.|.. .+|+++++.+-+.. -.+|...++..++.|++..++|.+.+ |. T Consensus 265 ---~~---------------~~~~~~~~~~~~i~~-~~~~~~~~~q~~~~~~~~~~~~l~~~i~~i~~~t~~p~~~~-g~ 324 (479) T protein:vir:99 265 ---AN---------------ADQEKMRFAQESMLI-SQNEKASFGAIPAAPLDGLLNAYKESLLEFLALAQLPPHIA-GQ 324 (479) T ss_pred ---cc---------------cchhcccccccccee-ecCCCceEEEecccchHHHHHHHHHHHHHHhccCCCCHHHc-cc Confidence 00 000001112222322 23455555543322 25566667888999999999998865 55 Q ss_pred hhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCccc Q lcl|NC_019524. 394 YTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRG 473 (556) Q Consensus 394 ~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~ 473 (556) .+++|-.++|..+.......+..|..|-..+.+ +++.. .++..+..+. . 
..-+.+.|..|..+ T Consensus 325 ~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~al~~-~~~l~--~~~~~~~~~~----~----------~~~i~~~w~~~~~~ 387 (479) T protein:vir:99 325 IVNVAADALAAGTRQTMQKLFEKQATWKASHNQ-TMRLV--NKIEGRTEEA----T----------DLDFTITWQDVTIQ 387 (479) T ss_pred ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHH--HHHcCCCccc----c----------ceeeeEEecCCCCC Confidence 555555677777777777777777776665544 33332 2333322210 0 11246788766543 Q ss_pred ccchhhhhHHHHHHHHcCCCCHHHHHHHh-CCCHHHHHHHHHHHHHHHHHcC-----CCCCccccccCCCCCCCCCCCCC Q lcl|NC_019524. 474 QIDEKKETEAAILRIKNGLSTYEAEISRL-GGDFREVFKQRAREEGLIKSLK-----LDFTGKMVEGNSTQSSNSSESTS 547 (556) Q Consensus 474 ~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~-G~D~e~v~~q~a~E~~~~~~~G-----l~~~~~~~~~~~~~~~~~~~~~~ 547 (556) +....+.+..+.+.+|+.|.+.+++.. |.|..++ +++.+|++...+.+ +....++.....+.....+.++. T Consensus 388 --s~~~~ad~~~kl~~ag~is~et~l~~l~gv~~~~~-e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 464 (479) T protein:vir:99 388 --SLAQFADAWAKMVESLKIPAEGVWDMIPNLDQSTV-NGWKEIYDREGDFGKYMRKLQNGPDPAEQRGGPNGATNMQQA 464 (479) T ss_pred --CHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHHH-HHHHHHHHHHHHHHHHHHHHhcccCcccccCCCCCCCCCCCC Confidence 667788888888999999999999888 9887654 33333322222211 11111222222222222222233 Q ss_pred CCCCCcCCC Q lcl|NC_019524. 548 DNPNEETTQ 556 (556) Q Consensus 548 ~~~~~e~~~ 556 (556) .+.++++.+ T Consensus 465 ~~~~~~~~~ 473 (479) T protein:vir:99 465 NNKTGEPAS 473 (479) T ss_pred CCCCcchhc Confidence 333344555 No 67 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=99.62 E-value=1.6e-13 Score=90.68 Aligned_cols=458 Identities=9% Similarity=0.028 Sum_probs=227.7 Q ss_pred CCcchhhhHHHHHh----hHhhccc---chhhhhhhhcchhccccCCC--cccccccCCCCCHHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 
1 MKDVKKTTRTRAKK----AVDVVAE---TATATPMAVGGGMEGAERTT--REMFQWNPSIISPDQQIAQNQDMASARAQD 71 (556) Q Consensus 1 ~sp~~~~~r~~a~~----a~~~~~~---~~~~~~~~~~~~y~aa~~~~--r~~~~w~~~~~s~~~~i~~~~~~lr~RaRd 71 (556) |....+..+..+.. ....... ............|.-+...- +....+.-... ......++.. T Consensus 12 ~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~---------~~~~~~~~~~ 82 (503) T protein:vir:59 12 TEELNEIIVESAKEIAEPDTTMIQKLIDEHNPEPLLKGVRYYMCENDIEKKRRTYYDAAGQ---------QLVDDTKTNN 82 (503) T ss_pred HHhHHHhhhhhhhhccchhHHHHHHHHHhhcHHHHHHHHHHhccccchhhccchhcccccc---------cccccccccc Confidence 22222222222111 0111100 00001111222333222100 00000000000 0000001110 Q ss_pred HHhcChHHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHh Q lcl|NC_019524. 72 MVQNDGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAV 151 (556) Q Consensus 72 l~rNn~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~ 151 (556) -..+++++-+|+..+++++|.+++..+. + +.....|+.|..+ +|...+..+. T Consensus 83 -ri~~n~~~~ivd~~~~yl~g~~~~~~~~---------d-------~~~~~~l~~~~~n-----------~~~~~~~~~~ 134 (503) T protein:vir:59 83 -RTSHAWHKLFVDQKTQYLVGEPVTFTSD---------N-------KTLLEYVNELADD-----------DFDDILNETV 134 (503) T ss_pred -eeecchHHHHHHHHHhhhhcCCeeeccC---------c-------HHHHHHHHHHHhc-----------CHHHHHHHHH Confidence 0136799999999999999999887531 1 2333455666431 5667777788 Q ss_pred hhheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEE----CCCCCeEEE-EEeecCCC Q lcl|NC_019524. 
152 SGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQL----DNNGAALGY-WLRKAFPG 226 (556) Q Consensus 152 r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~----d~~Gr~vaY-~i~~~hpg 226 (556) +..+.-|.+|+.+.+ ...+ .+++..++|..+---+.....+.+..+|.+ +..+..+-| .++..+-- T Consensus 135 ~~~~~~G~~~~~v~~-d~dg--------~~~i~~~~p~~~~~i~d~~~~~~~~~~ir~~~~~~~~~~~~~~~evy~~~~i 205 (503) T protein:vir:59 135 KNMSNKGIEYWHPFV-DEEG--------EFDYVIFPAEEMIVVYKDNTRRDILFALRYYSYKGIMGEETQKAELYTDTHV 205 (503) T ss_pred HHHhhCCeEEEEEee-cCCC--------ceEEEEEccceeEEEEeCCCCCceEEEEEEEEEecCCCceEEEEEEEeCCcE Confidence 888999999977643 2211 257899999887543433334556677753 233443322 23322210 Q ss_pred ccccCCccccc------eeeccccCChhHeEeeecccC----CCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhccee Q lcl|NC_019524. 227 DPTDMEQWKWG------YEPARFDWGRRRVIHIIEALL----AGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYA 296 (556) Q Consensus 227 d~~~~~~~~~~------rv~~~~~v~a~~viH~f~~~r----~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~ 296 (556) -.+......|. .......+-.....|.|.... ..-..|+|.|.+++..+..++....-......-.+.-. T Consensus 206 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPiv~~~nn~~~~sd~~~~~~liDa~d~~~s~~~~~~~~~~~~~ 285 (503) T protein:vir:59 206 YYYEKIDGVYQMDYSYGENNPRPHMTKGGQAIGWGRVPIIPFKNNEEMVSDLKFYKDLIDNYDSITSSTMDSFSDFQQIV 285 (503) T ss_pred EEEEEcCCcccccccccccccccceeecceeccCCccceEEecCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhcCCe Confidence 00000000000 000000000011223222110 12346999999988888777766555444444444333 Q ss_pred eeEeccCcccccccccccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHH Q lcl|NC_019524. 297 ASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLL 376 (556) Q Consensus 297 ~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~l 376 (556) .+++...++. .+.....+..+.+..++.+.++++++.+.+...+..+++.+. 
T Consensus 286 ~v~~g~~~~~----------------------------~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~ 337 (503) T protein:vir:59 286 YVLKNYDGEN----------------------------PKEFTANLRYHSVIKVSGDGGVDTLRAEIPVDSAAKELERIQ 337 (503) T ss_pred eEeecCCccc----------------------------cchhhhhhhcccceeccCCCcceeEeccCCHHHHHHHHHHHH Confidence 4444211110 000111244455666777888999999989999999988777 Q ss_pred HHH---HHhcCCCHHHhhchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccc Q lcl|NC_019524. 377 RNI---AASLGMSYEQFSRDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMF 453 (556) Q Consensus 377 r~i---aaglGi~ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~ 453 (556) ..| +...+++.+.++++.|+ .+++..+..........+..+ ...++.+++..+...-..+...... T Consensus 338 ~~i~~~s~~p~~~~~~~~~~~Sg---~Ai~~~~~~l~~k~~~~~~~~-~~~l~~~~~~i~~~~~~~~~~~~~~------- 406 (503) T protein:vir:59 338 DELYKSAQAVDNSPETIGGGATG---PALENLYALLDLKANMAERKI-RAGLRLFFWFFAEYLRNTGKGDFNP------- 406 (503) T ss_pred HHHHHHhcccCCCcccccccccH---HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhccCccccc------- Confidence 666 55555556666665544 466666555555555555554 4455666666555443333322111 Q ss_pred cchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCC--CHHHHHHHHHHHHHHHHHcCCCCCccc Q lcl|NC_019524. 454 YDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGG--DFREVFKQRAREEGLIKSLKLDFTGKM 531 (556) Q Consensus 454 ~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~--D~e~v~~q~a~E~~~~~~~Gl~~~~~~ 531 (556) ...+.+.|..+- ..|-...+++..+++.+|+.|.+..+...+. |+++.++++++|++...+..-... +. T Consensus 407 ------~~~i~i~f~~~~--p~d~~~~~~~~~kl~~~GiiS~et~l~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~-~~ 477 (503) T protein:vir:59 407 ------DKELTMTFTRTR--IQNDSEIVQSLVQGVTGGIMSKETAVARNPFVQDPEEELARIEEEMNQYAEMQGNLL-DD 477 (503) T ss_pred ------ccceeEEeCCCC--CCCHHHHHHHHHHHHhCCCCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhhcccc-Cc Confidence 112456674333 3688888999999999999999999999875 899999999998876655432211 11 Q ss_pred cccCCCCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 
532 VEGNSTQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 532 ~~~~~~~~~~~~~~~~~~~~~e~~~ 556 (556) ..+....+.+.+..++.++.. T Consensus 478 ----~~~~~~~~~~~~~~~~~~~~~ 498 (503) T protein:vir:59 478 ----EGGDDDLEEDDPNAGAAESGG 498 (503) T ss_pred ----cCCCCCCCcCCCCCCcccCCC Confidence 111111111111111112222 No 68 >protein:vir:8100 Length: 466 # NCBI annotation: gp4 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817681;genbank:gi:29566112;genbank:GeneID:1259306 Probab=99.62 E-value=9.2e-16 Score=103.03 Aligned_cols=448 Identities=13% Similarity=0.061 Sum_probs=220.8 Q ss_pred chhhhHHHHHhhH--hhcccchhhhhhhh---cchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChH Q lcl|NC_019524. 4 VKKTTRTRAKKAV--DVVAETATATPMAV---GGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGY 78 (556) Q Consensus 4 ~~~~~r~~a~~a~--~~~~~~~~~~~~~~---~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~ 78 (556) |.=+.|.+.+... .............. +.+|. ...+.++...|.....+ . ..-......+...+..++. T Consensus 1 M~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~g~~~---~--~~~~~g~~v~~~~a~~~~~ 74 (466) T protein:vir:81 1 MRLIDRLLSTRGAAPRMSIDDYAQMLNEFAFNGIGYG-FGGGVPRIQQTLAGPST---E--LAPDTFVGLATQAYQANGP 74 (466) T ss_pred CchhHHHhhccCcccccchhhhhhhhhhhhccccccc-cccccHHHHHhhccccc---c--ccCccccccchhhhhccHH Confidence 5555555543221 11111000000000 00000 00011111111000000 0 0000111234556777899 Q ss_pred HHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecC Q lcl|NC_019524. 79 AAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTG 158 (556) Q Consensus 79 a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dG 158 (556) +..+|+.|.+.|-.-.|++.-+.+...- +..+. .++. 
.-.+|+ -.+|.+++.+..+..++..| T Consensus 75 v~~~i~~Ia~~ia~lp~~~~~~~~~~~~--------~~~~~--~~~~-L~~~PN------~~~t~~~f~~~l~~~lll~G 137 (466) T protein:vir:81 75 VFACMLVRQLVFSSVRFRWQRLRDGKPS--------DTFGS--RDLQ-ILETPW------KGGTTQDMLSRMIQDADLAG 137 (466) T ss_pred HHHHHHHHHHhhccCceEEEEecCCcee--------ecccc--HHHH-HhhCCC------CCCCHHHHHHHHHHHHHhcC Confidence 9999999999998888887644321100 00000 1122 223444 34789999999999999999 Q ss_pred ceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCC-CeEEEEEeecCCCccccCCccccc Q lcl|NC_019524. 159 EVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNG-AALGYWLRKAFPGDPTDMEQWKWG 237 (556) Q Consensus 159 E~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~G-r~vaY~i~~~hpgd~~~~~~~~~~ 237 (556) ++|+.+.+.............+..|..|.|+++. |+.+.+| ..+.|++...-.. .. . T Consensus 138 nay~~i~r~~~g~l~~~~~g~~~~l~~l~~~~v~--------------~~~~~~~~~~~~y~~~~~~~~--~~---~--- 195 (466) T protein:vir:81 138 NSYWTIVDGEFVRMRPDWVDVVVEERMVRGGRGE--------------LGGGQLGWRKVGYLYTEGGRQ--SG---N--- 195 (466) T ss_pred CeEEEEEecCccccccccCcceeEEEEecCcceE--------------EEEcCCCceEEEEEEEecCcc--cc---c--- Confidence 9999987643222111222345678888877763 3444454 3455655422100 00 0 Q ss_pred eeeccccCChhHeEeeecc-cCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCccccccccccccc Q lcl|NC_019524. 238 YEPARFDWGRRRVIHIIEA-LLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQ 316 (556) Q Consensus 238 rv~~~~~v~a~~viH~f~~-~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~ 316 (556) ....++.++|||+... ...+...|+|++..+...+.-....++....-.+=.+...++++.+..-. 
T Consensus 196 ---~~~~~~~~dviHir~~~~~~d~~~G~s~i~~~~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~l~---------- 262 (466) T protein:vir:81 196 ---ESVGFLAEDVVHFAPIPDPLASYRGMSWLTPILREIRADQAMSKHQAKFFDNGATVNLVIKHNPMAD---------- 262 (466) T ss_pred ---ceeeeccccEEEEcCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCCCCC---------- Confidence 1124678899999754 34577899999998888777666666666666666677788888653210 Q ss_pred ccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhh--chh Q lcl|NC_019524. 317 GGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFS--RDY 394 (556) Q Consensus 317 ~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~--~D~ 394 (556) .+..+....... ....+ .=+.|.+..|..|.+++.++.+.-..+|.+..+.+...||..+|||.+.|- .+. T Consensus 263 ~e~~~~~~~~~~---~~~~g----~~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lG~~~~~ 335 (466) T protein:vir:81 263 PAAVKKWADEVN---SKHAG----VDNAWKNLNLYPGADADVVGSNLQEIDFKNVRGGGETRIAAAAGVPPVIVGLSEGL 335 (466) T ss_pred HHHHHHHHHHHH---HHhcC----ccccccceEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcccccCC Confidence 111111111100 00000 012467788999999999998877888999999999999999999988663 245 Q ss_pred hcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccc Q lcl|NC_019524. 395 TKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQ 474 (556) Q Consensus 395 s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~ 474 (556) +.++||++.+....|. ..-+.|+..+| +.++.+..+........ .+++..-.... 
T Consensus 336 ~~st~sn~eq~~~~f~-----------~~tl~P~~~~i-e~~l~~~L~~~~~~~~~-------------~~~f~~~~llr 390 (466) T protein:vir:81 336 AAATYSNYGQARRRLA-----------DGTAHPLWQNL-SGCIGHVMPDMGPDVRL-------------WYDADDVPFLR 390 (466) T ss_pred CccccccHHHHHHHHH-----------HHHHHHHHHHH-HHHHHhhcCCcccCcce-------------EEEecchhhhc Confidence 6789999876666553 44456655554 44444433321111100 01111111222 Q ss_pred cchhhhhHH-------HHHHHHcCCCCHHHHHHH-hCCCHHHHHHHHHHHHHHHHHcCCCCCcc-ccccCCCCCCCCCCC Q lcl|NC_019524. 475 IDEKKETEA-------AILRIKNGLSTYEAEISR-LGGDFREVFKQRAREEGLIKSLKLDFTGK-MVEGNSTQSSNSSES 545 (556) Q Consensus 475 iDP~Ke~~A-------~~~~i~~G~~s~~~~~ae-~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~-~~~~~~~~~~~~~~~ 545 (556) .|....+++ ....+++|+ |+.|+.+. .|.|.. .+.-.|+..... +........ .... T Consensus 391 ~d~~~r~~~~~~~~~~~~~~~~~g~-t~nE~r~~~~~gd~~-----------~~~~~~~~~~~~~~~~~~~~~~--~~~~ 456 (466) T protein:vir:81 391 EDEKDAADIQKVRAETINTLITAGY-EPESVVAAVNSGDLR-----------LLKHTGLTSVQLLPPGVSASAS--SDTP 456 (466) T ss_pred cCHHHHHHHHHHHHHHHHHHHHcCC-ChhhccccccCCccc-----------cccCCCcchhhhcccccccccC--CCCc Confidence 244333333 234566775 55544321 122211 011112111000 000000011 1111 Q ss_pred CCCCCCCcCC Q lcl|NC_019524. 546 TSDNPNEETT 555 (556) Q Consensus 546 ~~~~~~~e~~ 555 (556) .....+++.+ T Consensus 457 ~~~Gg~~ngn 466 (466) T protein:vir:81 457 TSGGADDNGN 466 (466) T ss_pred ccCCCCcCCC Confidence 1111112222 No 69 >protein:vir:1884 Length: 424 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037664;genbank:gi:9634122;genbank:GeneID:1262519 Probab=99.61 E-value=6.1e-16 Score=104.00 Aligned_cols=413 Identities=10% Similarity=-0.013 Sum_probs=223.5 Q ss_pred chhhhHHHHHhhHhhcccchhhhhhhhcch-hcc--------ccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019524. 
4 VKKTTRTRAKKAVDVVAETATATPMAVGGG-MEG--------AERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQ 74 (556) Q Consensus 4 ~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~-y~a--------a~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~r 74 (556) |..- .-.+....++-- .....+ +.+ ++.+++....+..... .-+.+-+. T Consensus 1 ~~~~-----~~~~~~~~~~g~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------------~v~~~~al 58 (424) T protein:vir:18 1 MEEP-----KYTIDLRTNNGW---WARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDS--------------SINDERIL 58 (424) T ss_pred CCCC-----cceEeecCCCch---HHHHHhhhcccccccccccccccccccccccccc--------------cccHHHhh Confidence 0000 000000000000 000000 000 0111111110000000 11334455 Q ss_pred cChHHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhh Q lcl|NC_019524. 75 NDGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGF 154 (556) Q Consensus 75 Nn~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~ 154 (556) .++.+-.+|+.|.+.|-+.-|++.-+.+. ++....... ..++..+-.+|+ --+|.+++-...+..+ T Consensus 59 ~~~~v~~cv~~Ia~~iA~lp~~~~~~~~~-----~~~~~~~~~---~~l~~lL~~~PN------~~~t~~~f~~~~~~~l 124 (424) T protein:vir:18 59 QISTVWRCVSLISTLTACLPLDVFETDQN-----DNRKKVDLS---NPLARLLRYSPN------QYMTAQEFREAMTMQL 124 (424) T ss_pred ccHHHHHHHHHHHHhhccCceEEEEeecC-----Cceeeeccc---cHHHHHHhhccC------CCCCHHHHHHHHHHHH Confidence 67788999999999998777766322110 000000000 123444444554 2478888999999999 Q ss_pred eecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCcc Q lcl|NC_019524. 155 LMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQW 234 (556) Q Consensus 155 ~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~ 234 (556) +..|++|+.+.+... | .+..|..|+|+.+.. + .++..+.|.+... 
| T Consensus 125 ll~Gnay~~i~r~~~------G--~~~~L~pl~~~~V~v--------------~--~~~~~~~y~~~~~--g-------- 170 (424) T protein:vir:18 125 CFYGNAYALVDRNSA------G--DVISLLPLQSANMDV--------------K--LVGKKVVYRYQRD--S-------- 170 (424) T ss_pred hhcCCeEEEEEECCC------C--cEEEEEEecCcceEE--------------E--EcCCeEEEEEEeC--C-------- Confidence 999999999865321 2 356788899888732 1 2234556665421 1 Q ss_pred ccceeeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCccccccccccc Q lcl|NC_019524. 235 KWGYEPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGM 314 (556) Q Consensus 235 ~~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~ 314 (556) ....++.++|||+...- .+...|+|++..+...+......++....-.+=.+.-.+||+.+.+... T Consensus 171 ------~~~~~~~~eIih~r~~~-~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~~l~------- 236 (424) T protein:vir:18 171 ------EYADFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLT------- 236 (424) T ss_pred ------eEEEeccccEEEecCcC-CCCcccccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEEeCCcCCC------- Confidence 11246778999997654 5678999999888777776666666666666667778889987543110 Q ss_pred ccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchh Q lcl|NC_019524. 315 GQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDY 394 (556) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~ 394 (556) ++..+........ ..+ .-..|.+..|+.|.+++.++.+.-...|.+-.+.....||..+|||.+.| ++. 
T Consensus 237 --~e~~~~~~~~~~~----~~~----g~nag~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~l-g~~ 305 (424) T protein:vir:18 237 --EQQRSQVEENFKE----IAG----GPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLV-GDV 305 (424) T ss_pred --HHHHHHHHHHHHH----HhC----CcccCCceeccCCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHh-CCC Confidence 0111110000000 000 11346678899999999999887778889999999999999999998855 888 Q ss_pred hcccc--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcc Q lcl|NC_019524. 395 TKTNY--SSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASR 472 (556) Q Consensus 395 s~~nY--Ss~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~ 472 (556) ++.|| |++.+..+.| +..-+.|+... ++.++....++-. ... . +.+++-.-.. T Consensus 306 ~~~t~~~sn~eq~~~~f-----------~~~tl~P~~~~-ie~~l~~~L~~~~-~~~----------~--~~~~fd~~~l 360 (424) T protein:vir:18 306 EKSTSWGSGIEQQNLGF-----------LQYTLQPYISR-WENSIQRWLIPAK-DVG----------R--IHAEHNLDGL 360 (424) T ss_pred CCcccccccHHHHHHHH-----------HHHHHHHHHHH-HHHHHHhhcCCcc-ccC----------C--eEEEEechhh Confidence 88776 6555444444 44455665554 3445555444321 110 0 0122222333 Q ss_pred cccchhhhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCC Q lcl|NC_019524. 473 GQIDEKKETEAAILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNE 552 (556) Q Consensus 473 ~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~ 552 (556) ...|+...+++..+.+++|+.|+-|+-+..|+.|-+--+ ++=++....+.... . +...|++ T Consensus 361 lr~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~gGD----------~~~~~~n~~~l~~~---~------~~~~p~~ 421 (424) T protein:vir:18 361 LRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGD----------VAMRQSQYVPITDL---G------TNKEPRN 421 (424) T ss_pred hccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcC----------eeeeccCccchHhh---h------ccCCCcc Confidence 446999999999999999999999988888877743111 11011111110000 0 0111111 Q ss_pred cCC Q lcl|NC_019524. 553 ETT 555 (556) Q Consensus 553 e~~ 555 (556) +.. 
T Consensus 422 ~ga 424 (424) T protein:vir:18 422 NGA 424 (424) T ss_pred CCC Confidence 111 No 70 >protein:vir:4194 Length: 540 # NCBI annotation: putative portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071819;genbank:gi:11863102;genbank:GeneID:1257604 Probab=99.61 E-value=2.6e-15 Score=100.51 Aligned_cols=442 Identities=9% Similarity=-0.009 Sum_probs=214.7 Q ss_pred CCcchhhh---HHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcCh Q lcl|NC_019524. 1 MKDVKKTT---RTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDG 77 (556) Q Consensus 1 ~sp~~~~~---r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~ 77 (556) .++....+ |..+.+. . .-..+.. .+ .+..|..++-+.. .| ..++++|+ T Consensus 2 ~~~~~~~~~~~~~~~~~~---~------------~~~~~~~-~~-~~~~~~~pp~~~~--------~L----a~~~~~n~ 52 (540) T protein:vir:41 2 FNYHLSIKSLEKYRAIKG---D------------TDSQALK-ED-RFEEYVEPKVHPL--------VL----LSLLQVNP 52 (540) T ss_pred CCcccChhhccchhhhhc---c------------ccccccc-cC-CCCccccCCCCHH--------HH----HHHHHhcH Confidence 22222211 1100000 0 0011111 12 2244544443332 12 26788999 Q ss_pred HHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheec Q lcl|NC_019524. 78 YAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMT 157 (556) Q Consensus 78 ~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~d 157 (556) ++.++|+.+++.|.+.++++...-. .+ ..+. ++ | .+|+.++...++..++.. 
T Consensus 53 ~v~scI~~ia~~ia~~~~~i~~~~~------------~~--------~~~l--pN--~----~~t~~~f~~~~v~dlll~ 104 (540) T protein:vir:41 53 YHASACSIKANDILRTGYLIDGDDG------------GV--------EELL--RA--C----RPSFEFILLQALEDLQVF 104 (540) T ss_pred HHHHHHHHHHHHHhcCCceEecCcc------------ch--------hhhc--cC--C----CCCHHHHHHHHHHHHHhc Confidence 9999999999999999988754321 01 1111 22 2 358889999999999999 Q ss_pred CceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccc Q lcl|NC_019524. 158 GEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWG 237 (556) Q Consensus 158 GE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~ 237 (556) |.+|+.+.+... ..+..|..|+|+.|.... ++..++ .-.+|..+.|+..-..+.......+. T Consensus 105 Gnayv~i~r~~~--------G~~~~L~~i~~~~V~v~~---~~~~~~----~~~d~~~~~~~~~~~~~~~~~~~~g~--- 166 (540) T protein:vir:41 105 NYCTLEVVRDDQ--------GEPVRLDYIPAHTVRVHR---DGSRYM----QTWDGIHVTYFKDYRYEGEVNPDNGE--- 166 (540) T ss_pred CCeEEEEEECCC--------CcEEEEEEeCCcceEEeE---cCceeE----eeecCceeeeeecccccceeeccccc--- Confidence 999998765321 235789999999985321 122111 12334444444322222211111110 Q ss_pred eeeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccc Q lcl|NC_019524. 238 YEPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQG 317 (556) Q Consensus 238 rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~ 317 (556) ....+++.+|||+......++..|+|++..++..+.....-.+....--+=.+...++|+.+..-............ 
T Consensus 167 ---~~~~~~~~eViHir~~~~~~~~~G~Spi~~~~~~i~~~~~~~~~~~~~f~Ng~~p~giL~~~g~l~~e~~~~~~~~~ 243 (540) T protein:vir:41 167 ---DQDGVGANEIIFIHLPSPICSYYGVPRYLSAAPSILAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDEMELGSDGEP 243 (540) T ss_pred ---cceeecccceEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCcccCchhccchHHHH Confidence 11357889999998887788999999999887777666555555444444566777888765321110000000000 Q ss_pred cccccccccccccccccccccceecCCceeeec------CCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhh Q lcl|NC_019524. 318 GFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHL------YPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFS 391 (556) Q Consensus 318 ~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L------~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~ 391 (556) .....+...... . .....-.+|.+..| ..|-+++.++.+.-...|.+..+.....||+.+|||-..| T Consensus 244 ~~~~~~~~~~~~---~---~~g~~~nag~~~vLe~~~~~~~g~~~~pl~~~~~d~qfle~~~~~~~eIa~afgVPp~~l- 316 (540) T protein:vir:41 244 TGRTVLQGLIED---N---FKYLKEAPHTPLVFSIPGGDTVEVTFTPLNTSQKELSFREYAAEKKHDIAAAHMIDPYRL- 316 (540) T ss_pred HHHHHHHHHHHH---H---hccccccccceEEEecCCCcccceeEEecccchhHHHHHHHHHHHHHHHHHHhCCCHHHc- Confidence 000000000000 0 00001135555555 3577888888777778899999999999999999998755 Q ss_pred chhh--cccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeec Q lcl|NC_019524. 392 RDYT--KTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIG 469 (556) Q Consensus 392 ~D~s--~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~ 469 (556) |+.. ..|||++.+....|.. ..+.|+.+.+ +.++-+..++ ..... +.+++.. 
T Consensus 317 G~~~~~~~n~sn~eq~~~~f~~-----------~tL~P~~~~i-e~~ln~~L~~-~~~~~-------------~~i~f~~ 370 (540) T protein:vir:41 317 GITDVGPLGGNFAEVARRTYYE-----------SVVRPQQEIV-SSVLTDFIQL-KLDPG-------------ARFVFNE 370 (540) T ss_pred CcccCCCCCcccHHHHHHHHHH-----------HHHHHHHHHH-HHHHHHhhhh-ccCCc-------------eEEEecc Confidence 7653 4578888776666543 3334444432 3333332111 10100 1122323 Q ss_pred CcccccchhhhhHHHHHHHHcCCCCHHHHHHHh-CCCHHH-HHHH----HHHHHHHHHHcCCCCCccccccC--CCCCCC Q lcl|NC_019524. 470 ASRGQIDEKKETEAAILRIKNGLSTYEAEISRL-GGDFRE-VFKQ----RAREEGLIKSLKLDFTGKMVEGN--STQSSN 541 (556) Q Consensus 470 p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~-G~D~e~-v~~q----~a~E~~~~~~~Gl~~~~~~~~~~--~~~~~~ 541 (556) .....-|..+ .....+++|+.|+-|+-... |.++-. ..-+ ...+...-++.+-.......... ...+.. T Consensus 371 ~~ll~~D~~~---~~~~lv~~G~lT~NE~Re~L~g~e~gdd~~l~p~n~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~ 447 (540) T protein:vir:41 371 EILMESEFVH---NYALLVQCGVLTPSEVREKLFGLDGGPDMFMVPSSIGKSAMKRQKRNYEKNQINEIKRTYAKYKPRI 447 (540) T ss_pred hhhcchHHHH---HHHHHHhCCCCCHHHHHHHhCcCcCCCcccccccccccccccccccccCCCCccccccccchhcccc Confidence 2222224322 23457899999999985433 555432 1100 00000000000000000000000 000000 Q ss_pred CCCCCCCCCCCcCCC Q lcl|NC_019524. 542 SSESTSDNPNEETTQ 556 (556) Q Consensus 542 ~~~~~~~~~~~e~~~ 556 (556) .+..+++.+.++++. T Consensus 448 ~~~~~~~~~~~~~~~ 462 (540) T protein:vir:41 448 QEIISSESPLEDKKK 462 (540) T ss_pred cCccccccccccccc Confidence 000001111111111 No 71 >protein:vir:100882 Length: 383 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358762;genbank:gi:78000027;genbank:GeneID:3726153 Probab=99.61 E-value=1.6e-15 Score=101.66 Aligned_cols=379 Identities=10% Similarity=-0.008 Sum_probs=201.1 Q ss_pred chhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHH Q lcl|NC_019524. 
4 VKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVV 83 (556) Q Consensus 4 ~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v 83 (556) |.=..+....+. +..+...+....|.....+.. ....-+.+-+..++.+..+| T Consensus 1 Mg~~~~~~~~k~-------------------~~~~~~~~~~~~~~~~~~~~~--------~~~~v~~~~~l~~~~v~~~i 53 (383) T protein:vir:10 1 MGLLTPKNFSKR-------------------NAKNMVYPSNPAFFTTTVGGM--------QLSYVSALSALQNTNVYSVI 53 (383) T ss_pred CCcccccccccc-------------------cccccccccchhhhhhhccCc--------cccccchhHhhcchHHHHHH Confidence 111110000000 000000000011111000000 00000011123367788999 Q ss_pred HHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEE Q lcl|NC_019524. 84 AVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLAT 163 (556) Q Consensus 84 ~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~ 163 (556) +.+.+.+-..-|+..-.. ... .-.+++ --+|..++....+..++..|++|+. T Consensus 54 ~~ia~~ia~~~~~~~~~~------------------~~~----ll~~PN------~~~t~~~f~~~~~~~l~l~Gn~~~~ 105 (383) T protein:vir:10 54 NRIASDVSSAHFKTENTA------------------TLN----RLESPS------SLIGRFSFWQGALMQLCLSGNDYIP 105 (383) T ss_pred HHHHHhhccCceeecccc------------------hhh----hhhCCC------CCCCHHHHHHHHHHHhhhcCCeEEE Confidence 999998887666543111 011 112454 2478999999999999999999988 Q ss_pred EeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceeeccc Q lcl|NC_019524. 164 CEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPARF 243 (556) Q Consensus 164 ~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~~~ 243 (556) +... ++.+-.+++.++ ++..++..+-|.+.....+. .. 
T Consensus 106 i~~~------------~~~~~p~~~~~v----------------~~~~~~~~~~~~~~~~~~~~--------------~~ 143 (383) T protein:vir:10 106 LVGQ------------NLEHIPNSDVQI----------------NYLPGNMGIVYTVLESNDRP--------------KM 143 (383) T ss_pred EEcC------------ceeEeecCcceE----------------EEEEcCCceEEEEEEcCCce--------------EE Confidence 6421 122222222222 22222223335444332221 12 Q ss_pred cCChhHeEeeeccc--CCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccc Q lcl|NC_019524. 244 DWGRRRVIHIIEAL--LAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKE 321 (556) Q Consensus 244 ~v~a~~viH~f~~~--r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~ 321 (556) .+++.+|||+.... .-+...|+|++..+...+.......+....-.+=++...++++.+..-. .++... T Consensus 144 ~~~~~evih~r~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~~~~---------~~e~~~ 214 (383) T protein:vir:10 144 VLRQDQMLHFRLMPDPQYRYLIGRSPLESLQNALNLDDKASKSNMSAMENQINPAGKLTISNYLS---------DGKDLE 214 (383) T ss_pred EEcccceEEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCC---------CHHHHH Confidence 46678999986543 3346789999998888877777766666665565677788888653211 011111 Q ss_pred cccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccH-HHHHHHHHHHHHHhcCCCHHHhhc-hhhcccc Q lcl|NC_019524. 322 IFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVG-TDYEQSLLRNIAASLGMSYEQFSR-DYTKTNY 399 (556) Q Consensus 322 ~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f-~~F~~~~lr~iaaglGi~ye~l~~-D~s~~nY 399 (556) ........ .. 
..-..|.+..|..|.+++.++.+....+| .+..+...+.||..+|||.+.|.+ |.+..+| T Consensus 215 ~~~~~~~~-------~~-~~~n~~~~~vl~~g~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~~~ 286 (383) T protein:vir:10 215 SAREEFEK-------AN-TGDNSGRLMVLPDGFDYTQLEMKTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQH 286 (383) T ss_pred HHHHHHHH-------Hh-CccccCCccccCCCceEEecCCChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCCcc Confidence 11111100 00 01235678889999999999876555665 577888899999999999998865 5677889 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhh Q lcl|NC_019524. 400 SSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKK 479 (556) Q Consensus 400 Ss~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~K 479 (556) |++.+....+ . ..++|+.+.+ +.++....+ .+ .+++..-.....|+.. T Consensus 287 sn~eq~~~~~-----------~-~~l~P~~~~i-e~~l~~~l~-~~------------------~~~f~~~~l~~~d~~~ 334 (383) T protein:vir:10 287 SNIDQIKATY-----------L-ANLNSYVNPI-VDELRLKMN-AP------------------DLELDIKDMLDVDDSI 334 (383) T ss_pred ccHHHHHHHH-----------H-HHHHHHHHHH-HHHHHHhhC-Cc------------------eEEeechhhhccCHHH Confidence 9866433222 2 2356666553 333433221 11 1222233334469999 Q ss_pred hhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCc Q lcl|NC_019524. 480 ETEAAILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEE 553 (556) Q Consensus 480 e~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e 553 (556) .+++....+++|+.|+.|+-+..|..|-+- .+....... 
.+ ..+.+++| T Consensus 335 ~~~~~~~~~~~G~~t~nE~R~~lg~~p~~~-------------------~d~~~~~~~--~~----~~~gGd~e 383 (383) T protein:vir:10 335 LINQVSNLAKSGVLGAEQAQFILTRSGFLP-------------------DNLPEFKPL--TN----ETKGGDDK 383 (383) T ss_pred HHHHHHHHHhCCCcCHHHHHHHhCCCcccC-------------------CcccccCCC--cc----cCCCCCCC Confidence 999999999999999999888888777310 111000000 00 11112222 No 72 >protein:vir:98396 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918929;genbank:gi:119443691;genbank:GeneID:4594558 Probab=99.61 E-value=2.5e-15 Score=100.68 Aligned_cols=431 Identities=11% Similarity=0.027 Sum_probs=221.2 Q ss_pred CCc----chhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019524. 1 MKD----VKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQND 76 (556) Q Consensus 1 ~sp----~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn 76 (556) -+| +.-.-|..+++.+.....-...-.+.........+.-...+.+|..... ..-+-.-+..+ T Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~f~~~e~r~~~~~~~~~~~~~~~~~~~~~~~~-------------~~~~~~~al~~ 70 (441) T protein:vir:98 4 YNTDCYFVDFKSRKQSRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKL-------------RQYKDIEAIRH 70 (441) T ss_pred ecCccceeccccccchhhhhhccccccccccccccCCCcchHHHHHHhhcccccCc-------------cccchhhhhcc Confidence 333 3333333333322221111000000000000000000000001111100 01111122345 Q ss_pred hHHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhhee Q lcl|NC_019524. 77 GYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLM 156 (556) Q Consensus 77 ~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~ 156 (556) +-+-.+|+.|.+.|-.--|++.- + ++. ..-..++..+-.+|+ ..+|.+++-...+..++. 
T Consensus 71 ~~V~acv~~Ia~~iA~lpl~~~~--~-------~~~-----~~~~~~~~lL~~~PN------~~~t~~~f~~~l~~~lll 130 (441) T protein:vir:98 71 SDIFTAVMMIASDLARMPIRVTV--N-------GQI-----NYSDRIVNLLNTRPN------PMYNGYIFKLVVFVSALL 130 (441) T ss_pred HHHHHHHHHHHHhhccCceEEec--C-------Ccc-----cccchHHHHHhcccc------cCCCHHHHHHHHHHHHhh Confidence 56777899988888875444421 1 111 011223444555665 357888988889999999 Q ss_pred cCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCcccc Q lcl|NC_019524. 157 TGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKW 236 (556) Q Consensus 157 dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~ 236 (556) .|++|+.+.+... ..+..|..|.|+.+. |..|.+|++..|......++.. T Consensus 131 ~Gnay~~i~r~~~--------G~~~~L~~i~~~~v~--------------v~~~~~g~~~~~~~~~~~~~~~-------- 180 (441) T protein:vir:98 131 TSHGYIEITRDKT--------GEPMNLTFRKTSEIE--------------LKLDARGRLYYFHQRIDSNGNN-------- 180 (441) T ss_pred cCCeEEEEEEcCC--------CcEEEEEEEcCceeE--------------EEECCCCcEEEEEEEeccCcce-------- Confidence 9999999865321 246788889988873 4555667766544332222210 Q ss_pred ceeeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCccccccccccccc Q lcl|NC_019524. 237 GYEPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQ 316 (556) Q Consensus 237 ~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~ 316 (556) ....+++++|||+...- .+...|+|++..+...+.-....++....-.+=.+...++|+.+..-.. T Consensus 181 ----~~~~~~~~dviHir~~~-~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~~~~~~--------- 246 (441) T protein:vir:98 181 ----IERNVKFEDMLDIKFYS-LDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDN--------- 246 (441) T ss_pred ----eeEEEccccEEEeccCC-CCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCC--------- Confidence 11246788999997653 4558999999888777766555555555555556777888886532110 Q ss_pred ccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhc Q lcl|NC_019524. 
317 GGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTK 396 (556) Q Consensus 317 ~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~ 396 (556) ++..+...... .....+ .=..|.+..|+.|.+++.++.+.-..+|.+..+.....||..+|||-+.| ++ +. T Consensus 247 ~e~~~~~~~~~---~~~~~G----~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~l-g~-~~ 317 (441) T protein:vir:98 247 KKARDRAREEF---HKSFSG----TKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKF-GI-ET 317 (441) T ss_pred HHHHHHHHHHH---HHHhcC----ccccCcceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHc-CC-CC Confidence 00000000000 000000 01246788899999999998776777888889999999999999999977 33 44 Q ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccc Q lcl|NC_019524. 397 TNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQID 476 (556) Q Consensus 397 ~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iD 476 (556) .|+|. .+..+ .++ ..++|+...+ +.++....++-..+. .+++-.-...-.| T Consensus 318 ~~~s~-~q~~~-----------~y~-~tl~P~~~~i-e~~ln~~L~~~~~~~---------------~~~fd~~~llr~d 368 (441) T protein:vir:98 318 ANMSI-TDANL-----------DYL-STLKPYITCV-CAELNFKFNDEYVNR---------------EFKFDTTEIRVVD 368 (441) T ss_pred CCccH-HHHHH-----------HHH-HHHHHHHHHH-HHHHHhhccccccCc---------------eEEEechhhhccC Confidence 45442 11111 122 2466766654 445544433211111 1223333334469 Q ss_pred hhhhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCC---CCCCCCCCCCCCCCCc Q lcl|NC_019524. 477 EKKETEAAILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNST---QSSNSSESTSDNPNEE 553 (556) Q Consensus 477 P~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~---~~~~~~~~~~~~~~~e 553 (556) +...+++....+++|+.|+-|+-+..|..|-+--++.. .-++....+...... 
......+.....+ | T Consensus 369 ~~~~~~~~~~~~~~G~~T~NE~R~~~gl~pi~gGd~~~--------~~~~~n~~~~~~~~~~q~~~~~~~~~~~kgG--e 438 (441) T protein:vir:98 369 EKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSI--------HRVDLNHVNIELVDEYQMNKSRATDKKLKGG--E 438 (441) T ss_pred HHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcce--------EeecccccccccccccccccccccccccCCC--C Confidence 99999999999999999999888877877643111100 000000001111111 1111111111222 2 Q ss_pred CCC Q lcl|NC_019524. 554 TTQ 556 (556) Q Consensus 554 ~~~ 556 (556) .+| T Consensus 439 ~ne 441 (441) T protein:vir:98 439 ENE 441 (441) T ss_pred CCC Confidence 222 No 73 >protein:vir:79984 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430000;genbank:gi:156604055;genbank:GeneID:5525444 Probab=99.60 E-value=2.3e-15 Score=100.80 Aligned_cols=432 Identities=12% Similarity=0.014 Sum_probs=222.7 Q ss_pred CCc----chhhhHHHHHhhHhhcccchhhhhhhhcch-hccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019524. 1 MKD----VKKTTRTRAKKAVDVVAETATATPMAVGGG-MEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQN 75 (556) Q Consensus 1 ~sp----~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~-y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rN 75 (556) -+| |.-.-|...++.+.....-...-.+..... ....... ..+.+|.... +..-+..-+.. T Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~lf~~~e~R~~~~~~~~~~~~~-~~~~~~~~~~-------------~~~~~~~~al~ 69 (441) T protein:vir:79 4 YNTDCYFVDFKSRKQSRKELVVVGIFYKNEKRDLQYNEDDLQMMV-QTLPGFQGTK-------------LRQYKDIEAIR 69 (441) T ss_pred ccCccccccccccccchhhhhccccccccccccccCCCcchHHHH-HHhcccCccc-------------ccccchhhhhc Confidence 344 444444444443222111100000000000 0000000 0000111000 01111112234 Q ss_pred ChHHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhhe Q lcl|NC_019524. 
76 DGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFL 155 (556) Q Consensus 76 n~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~ 155 (556) ++-+-++|+.|.+.|..-=|++.- + ++.. .-...+..+-.+|+ ..+|..++-...+..++ T Consensus 70 ~~~V~~cv~~Ia~~iA~lp~~~~~--~-------~~~~-----~~~~~~~lL~~~PN------~~~t~~~f~~~~~~~ll 129 (441) T protein:vir:79 70 HSDIFTAVMMIASDLARMPIRVTV--N-------GQIN-----YSDRIVNLLNTRPN------PMYNGYIFKLVVFVSAL 129 (441) T ss_pred cHHHHHHHHHHHHhhccCceeeec--C-------cccc-----ccchHHHHHhcccC------cCCCHHHHHHHHHHHHh Confidence 556777899988888875444321 1 1110 11223444444554 34788888888999999 Q ss_pred ecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccc Q lcl|NC_019524. 156 MTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWK 235 (556) Q Consensus 156 ~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~ 235 (556) ..|++|+.+++... ..+..|..|.|+.|. |+.|..|++..|.......+.. T Consensus 130 l~Gnay~~i~r~~~--------G~~~~L~~i~~~~v~--------------v~~d~~g~~~~~~~~~~~~~~~------- 180 (441) T protein:vir:79 130 LTSHGYIEITRDKT--------GEPMNLTFRKTSEIE--------------LKSDARGRLYYFHQRIDSNGNN------- 180 (441) T ss_pred hcCCeEEEEEECCC--------CcEEEEEEEcCceeE--------------EEECCCccEEEEEEEeccCCce------- Confidence 99999999865321 235678889888773 4566677765544332222110 Q ss_pred cceeeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccc Q lcl|NC_019524. 236 WGYEPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMG 315 (556) Q Consensus 236 ~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~ 315 (556) ....+++++|||+... ..+...|+|++..+...+.-....++....-.+=.+...++|+.+..-.. 
T Consensus 181 -----~~~~~~~~dvih~k~~-~~dg~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~-------- 246 (441) T protein:vir:79 181 -----IERNVKFEDMLDIKFY-SLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDN-------- 246 (441) T ss_pred -----eEEEEccccEEEeccC-CCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCCCCC-------- Confidence 1124678899999764 45568999999988887776666666666666667778888886532110 Q ss_pred cccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhh Q lcl|NC_019524. 316 QGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYT 395 (556) Q Consensus 316 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s 395 (556) ++..+...... .....+ .-..|.+..|+.|.+++.++.+....+|.+..+...++||..+|||-..| +. + T Consensus 247 -~e~~e~~r~~~---~~~~~G----~~nag~~~vl~~G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~-~ 316 (441) T protein:vir:79 247 -KKARDRAREEF---HKSFSG----TKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKF-GI-E 316 (441) T ss_pred -HHHHHHHHHHH---HHHhcC----ccccCcceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHc-CC-C Confidence 00000000000 000000 11346778899999999998776677888889999999999999999877 43 3 Q ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCccccc Q lcl|NC_019524. 396 KTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQI 475 (556) Q Consensus 396 ~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~i 475 (556) ..|+|.. +..+ .++ .-+.|+... ++.++....++ .... ..+++-.-...-. 
T Consensus 317 ~~~~s~~-q~~~-----------~~~-~tl~P~~~~-ie~eln~kl~~--~~~~-------------~~~~fd~~~llr~ 367 (441) T protein:vir:79 317 TANMSIT-DANL-----------DYL-STLKPYITC-VCAELNFKFND--EYVN-------------REFKFDTTEIRVV 367 (441) T ss_pred CCCccHH-HHHH-----------HHH-HHHHHHHHH-HHHHHhhhccc--cccC-------------ceEEeechhhhcc Confidence 4455422 1111 122 235665553 44445443321 1100 0122323333446 Q ss_pred chhhhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCC-CCCCCCCCcC Q lcl|NC_019524. 476 DEKKETEAAILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSE-STSDNPNEET 554 (556) Q Consensus 476 DP~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~-~~~~~~~~e~ 554 (556) |+...+++....+++|+.|+.|+-+..|..|-+--++.. .-++....+.......+.+... .+....-.|. T Consensus 368 D~~~~~~~~~~~i~~G~~T~NE~R~~~gl~Pi~ggd~~~--------~~~~~n~~~~~~~~~~~~~~~~~~~~~~kgGe~ 439 (441) T protein:vir:79 368 DEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSI--------HRVDLNHVNIELVDEYQMNKSRATDKKLKGGEE 439 (441) T ss_pred CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcce--------EeecccccccccccccccccccccccccCCCCC Confidence 999999999999999999999988888887754111100 0000001111111111111111 1111111122 Q ss_pred CC Q lcl|NC_019524. 555 TQ 556 (556) Q Consensus 555 ~~ 556 (556) +| T Consensus 440 ~e 441 (441) T protein:vir:79 440 NE 441 (441) T ss_pred CC Confidence 22 No 74 >protein:vir:9408 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803386;genbank:gi:29028698;genbank:GeneID:1258164 Probab=99.60 E-value=2.3e-15 Score=100.80 Aligned_cols=432 Identities=12% Similarity=0.014 Sum_probs=222.7 Q ss_pred CCc----chhhhHHHHHhhHhhcccchhhhhhhhcch-hccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019524. 
1 MKD----VKKTTRTRAKKAVDVVAETATATPMAVGGG-MEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQN 75 (556) Q Consensus 1 ~sp----~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~-y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rN 75 (556) -+| |.-.-|...++.+.....-...-.+..... ....... ..+.+|.... +..-+..-+.. T Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~lf~~~e~R~~~~~~~~~~~~~-~~~~~~~~~~-------------~~~~~~~~al~ 69 (441) T protein:vir:94 4 YNTDCYFVDFKSRKQSRKELVVVGIFYKNEKRDLQYNEDDLQMMV-QTLPGFQGTK-------------LRQYKDIEAIR 69 (441) T ss_pred ccCccccccccccccchhhhhccccccccccccccCCCcchHHHH-HHhcccCccc-------------ccccchhhhhc Confidence 344 444444444443222111100000000000 0000000 0000111000 01111112234 Q ss_pred ChHHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhhe Q lcl|NC_019524. 76 DGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFL 155 (556) Q Consensus 76 n~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~ 155 (556) ++-+-++|+.|.+.|..-=|++.- + ++.. .-...+..+-.+|+ ..+|..++-...+..++ T Consensus 70 ~~~V~~cv~~Ia~~iA~lp~~~~~--~-------~~~~-----~~~~~~~lL~~~PN------~~~t~~~f~~~~~~~ll 129 (441) T protein:vir:94 70 HSDIFTAVMMIASDLARMPIRVTV--N-------GQIN-----YSDRIVNLLNTRPN------PMYNGYIFKLVVFVSAL 129 (441) T ss_pred cHHHHHHHHHHHHhhccCceeeec--C-------cccc-----ccchHHHHHhcccC------cCCCHHHHHHHHHHHHh Confidence 556777899988888875444321 1 1110 11223444444554 34788888888999999 Q ss_pred ecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccc Q lcl|NC_019524. 156 MTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWK 235 (556) Q Consensus 156 ~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~ 235 (556) ..|++|+.+++... ..+..|..|.|+.|. |+.|..|++..|.......+.. 
T Consensus 130 l~Gnay~~i~r~~~--------G~~~~L~~i~~~~v~--------------v~~d~~g~~~~~~~~~~~~~~~------- 180 (441) T protein:vir:94 130 LTSHGYIEITRDKT--------GEPMNLTFRKTSEIE--------------LKSDARGRLYYFHQRIDSNGNN------- 180 (441) T ss_pred hcCCeEEEEEECCC--------CcEEEEEEEcCceeE--------------EEECCCccEEEEEEEeccCCce------- Confidence 99999999865321 235678889888773 4566677765544332222110 Q ss_pred cceeeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccc Q lcl|NC_019524. 236 WGYEPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMG 315 (556) Q Consensus 236 ~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~ 315 (556) ....+++++|||+... ..+...|+|++..+...+.-....++....-.+=.+...++|+.+..-.. T Consensus 181 -----~~~~~~~~dvih~k~~-~~dg~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~-------- 246 (441) T protein:vir:94 181 -----IERNVKFEDMLDIKFY-SLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDN-------- 246 (441) T ss_pred -----eEEEEccccEEEeccC-CCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCCCCC-------- Confidence 1124678899999764 45568999999988887776666666666666667778888886532110 Q ss_pred cccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhh Q lcl|NC_019524. 316 QGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYT 395 (556) Q Consensus 316 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s 395 (556) ++..+...... .....+ .-..|.+..|+.|.+++.++.+....+|.+..+...++||..+|||-..| +. 
+ T Consensus 247 -~e~~e~~r~~~---~~~~~G----~~nag~~~vl~~G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~-~ 316 (441) T protein:vir:94 247 -KKARDRAREEF---HKSFSG----TKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKF-GI-E 316 (441) T ss_pred -HHHHHHHHHHH---HHHhcC----ccccCcceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHc-CC-C Confidence 00000000000 000000 11346778899999999998776677888889999999999999999877 43 3 Q ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCccccc Q lcl|NC_019524. 396 KTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQI 475 (556) Q Consensus 396 ~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~i 475 (556) ..|+|.. +..+ .++ .-+.|+... ++.++....++ .... ..+++-.-...-. T Consensus 317 ~~~~s~~-q~~~-----------~~~-~tl~P~~~~-ie~eln~kl~~--~~~~-------------~~~~fd~~~llr~ 367 (441) T protein:vir:94 317 TANMSIT-DANL-----------DYL-STLKPYITC-VCAELNFKFND--EYVN-------------REFKFDTTEIRVV 367 (441) T ss_pred CCCccHH-HHHH-----------HHH-HHHHHHHHH-HHHHHhhhccc--cccC-------------ceEEeechhhhcc Confidence 4455422 1111 122 235665553 44445443321 1100 0122323333446 Q ss_pred chhhhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCC-CCCCCCCCcC Q lcl|NC_019524. 476 DEKKETEAAILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSE-STSDNPNEET 554 (556) Q Consensus 476 DP~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~-~~~~~~~~e~ 554 (556) |+...+++....+++|+.|+.|+-+..|..|-+--++.. .-++....+.......+.+... .+....-.|. T Consensus 368 D~~~~~~~~~~~i~~G~~T~NE~R~~~gl~Pi~ggd~~~--------~~~~~n~~~~~~~~~~~~~~~~~~~~~~kgGe~ 439 (441) T protein:vir:94 368 DEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSI--------HRVDLNHVNIELVDEYQMNKSRATDKKLKGGEE 439 (441) T ss_pred CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcce--------EeecccccccccccccccccccccccccCCCCC Confidence 999999999999999999999988888887754111100 0000001111111111111111 1111111122 Q ss_pred CC Q lcl|NC_019524. 
555 TQ 556 (556) Q Consensus 555 ~~ 556 (556) +| T Consensus 440 ~e 441 (441) T protein:vir:94 440 NE 441 (441) T ss_pred CC Confidence 22 No 75 >protein:vir:189 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037699;genbank:gi:9634156;genbank:GeneID:1262529 Probab=99.59 E-value=3e-15 Score=100.22 Aligned_cols=422 Identities=9% Similarity=-0.007 Sum_probs=223.3 Q ss_pred chhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHH Q lcl|NC_019524. 4 VKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVV 83 (556) Q Consensus 4 ~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v 83 (556) |..- .....+..++-- -.+.. ..+.+..+.......+.. ..++..... .. .-+.+-+..++.+.++| T Consensus 1 ~~~~-----~~~~~~~~~~g~-~~~~~-~~f~~~~~~~~~~~~~~~-~~~~~~~~~--~~---~v~~~~al~~~~v~~cv 67 (424) T protein:vir:18 1 MEEP-----KYTIDLRTNNGW-WARLK-SWFVGGRLVTPNQGSQTG-PVSAHGYLG--DS---SINDERILQISTVWRCV 67 (424) T ss_pred CCCC-----ccccccCCCCch-HHHHH-hhccccccccccchhhcc-ccccccccc--cc---cccHHHhhccHHHHHHH Confidence 0000 000000000000 00000 000111100100000000 000000000 00 01223345567789999 Q ss_pred HHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEE Q lcl|NC_019524. 84 AVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLAT 163 (556) Q Consensus 84 ~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~ 163 (556) +.|.+.+-+.-|++.-.... +........ ..++..+-.+|+ --+|.+++-...+..++..|++|+. 
T Consensus 68 ~~Ia~~iA~lp~~vy~~~~~-----~~~~~~~~~---~~l~~lL~~~PN------~~~t~~~f~~~~~~~lll~Gnay~~ 133 (424) T protein:vir:18 68 SLISTLTACLPLDVFETDQN-----DNRKKVDLS---NPLARLLRYSPN------QYMTAQEFREAMTMQLCFYGNAYAL 133 (424) T ss_pred HHHHHhhccCceEEEEeccC-----Cceeeeccc---cHHHHHHhhccC------CCCCHHHHHHHHHHHHhhcCCeEEE Confidence 99999998877766322110 000000001 123444444554 2468888888999999999999999 Q ss_pred EeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceeeccc Q lcl|NC_019524. 164 CEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPARF 243 (556) Q Consensus 164 ~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~~~ 243 (556) +.+.. .| .+..|..|+|+.+.. + .++..+.|.+... | ... T Consensus 134 i~r~~------~G--~~~~L~~l~~~~v~v--------------~--~~~~~~~y~~~~~--g--------------~~~ 173 (424) T protein:vir:18 134 VDRNS------AG--DVISLLPLQSANMDV--------------K--LVGKKVVYRYQRD--S--------------EYA 173 (424) T ss_pred EEECC------CC--cEEEEEEecCcceEE--------------E--EcCCeEEEEEEeC--C--------------eEE Confidence 86532 12 356788898888742 1 1234556765421 1 112 Q ss_pred cCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccc Q lcl|NC_019524. 244 DWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIF 323 (556) Q Consensus 244 ~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~ 323 (556) .++.++|||+.... .+...|+|++..+...+......++....-.+=.+...++|+.+..... .+..+.. T Consensus 174 ~~~~~eVihir~~~-~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~l~---------~e~~~~~ 243 (424) T protein:vir:18 174 DFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLT---------EQQRSQV 243 (424) T ss_pred EeccccEEEecCcC-CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCcCCC---------HHHHHHH Confidence 46778999997654 5668999999888777776666666666666666778889987543110 0111111 Q ss_pred cccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccc--hh Q lcl|NC_019524. 
324 NEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNY--SS 401 (556) Q Consensus 324 ~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nY--Ss 401 (556) ...... ..+ .-..|.+..|..|.+++.++.+.-..+|.+..+.....||..+|||.+.| |+.++.|| |+ T Consensus 244 ~~~~~~----~~~----~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~t~~~sn 314 (424) T protein:vir:18 244 EENFKE----IAG----GPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLV-GDVEKSTSWGSG 314 (424) T ss_pred HHHHHH----HhC----CcccCCceeccCCceEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHh-CCCCCccccccc Confidence 010000 000 11346688899999999998877778899999999999999999998755 88888877 65 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhh Q lcl|NC_019524. 402 ARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKET 481 (556) Q Consensus 402 ~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~ 481 (556) +.+..+.| +..-+.|+... ++.++....++ +..... +.+++..-...-.|+...+ T Consensus 315 ~eq~~~~f-----------~~~tl~P~~~~-ie~~ln~~L~~-~~~~~~------------~~~~fd~~~llr~d~~~r~ 369 (424) T protein:vir:18 315 IEQQNLGF-----------LQYTLQPYISR-WENSIQRWLIP-SKDVGR------------LHAEHNLDGLLRGDSASRA 369 (424) T ss_pred HHHHHHHH-----------HHHHHHHHHHH-HHHHHHhhcCC-ccccCC------------eEEEEechhhhccCHHHHH Confidence 55544443 45556776665 34455554433 111110 0122222333345999999 Q ss_pred HHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCcCC Q lcl|NC_019524. 482 EAAILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEETT 555 (556) Q Consensus 482 ~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~ 555 (556) ++....+++|+.|+-|+-+..|++|-+--+ ++=++....+.... ..+. .++++.. 
T Consensus 370 ~~~~~~~~~G~~T~NE~R~~~gl~pi~ggD----------~~~~~~n~~~l~~~---~~~~------~~~~n~a 424 (424) T protein:vir:18 370 AFMKAMGESGLRTINEMRRTDNMPPLPGGD----------VAMRQAQYVPITDL---GTNK------EPRNNGA 424 (424) T ss_pred HHHHHHHhCCCcCHHHHHHHhCCCCCCCcC----------eeeeccCccchhhh---hccC------CccccCC Confidence 999999999999999998888877742111 11000000010000 0000 0111111 No 76 >protein:vir:63755 Length: 547 # NCBI annotation: gp14 # Family: family:all:2446 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547619;genbank:GeneID:3783506 Probab=99.59 E-value=2.7e-14 Score=94.93 Aligned_cols=447 Identities=11% Similarity=0.026 Sum_probs=207.8 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAA 80 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~ 80 (556) |.+... ....++......+.+.... +.+.++. +. ..+|.... ...|++.. +.+.+||++. T Consensus 31 ~~~~~~---~~~~k~~~~~~~~~~~~~~----~~~~~~~-g~---~~~~~~~~--------~~~l~~l~-~~~~~npiv~ 90 (547) T protein:vir:63 31 IQQREQ---EQISKAMNNKEVAYSQPVI----GSMSANP-GF---KTKPSIRN--------NQDLHGVL-KKFGGNIILN 90 (547) T ss_pred hhhhhH---HHHHHhhcccchhhhchhh----heeeccc-cc---ccCCccCC--------hhHHHHHH-HHhhcCHHHH Confidence 222211 1222222222222111111 1111110 00 01121221 12223333 3556789999 Q ss_pred HHHHHHHhhhccC---------CceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHh Q lcl|NC_019524. 81 GVVAVHRDSIVGS---------QYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAV 151 (556) Q Consensus 81 ~~v~~~~~nvVG~---------Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~ 151 (556) .+|+.++++|.+. |+....++....-..+.+. ..-...++ .+..+++-.-+. 
.+.+|.++...++ T Consensus 91 ~~I~~~a~~ia~~~~~~~~~~~~~~~~ir~k~~~~~~~~~~-~~~~~~l~----~~l~~pn~~~~p-~~~s~~~f~~~lv 164 (547) T protein:vir:63 91 AIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHD-EATIKRIE----SFIEKTGVDNDI-NRDSFSSFVKKIV 164 (547) T ss_pred HHHHHHHHHHhhhhhhhhhhccCCCceeEecccccccChhh-HHHHHHHH----HHHHhhCCCCCC-ccchHHHHHHHHH Confidence 9999999998853 2212222211111111111 11112222 223333211111 2358999999999 Q ss_pred hhheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCC----eEEEEEeecCCCc Q lcl|NC_019524. 152 SGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGA----ALGYWLRKAFPGD 227 (556) Q Consensus 152 r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr----~vaY~i~~~hpgd 227 (556) ..++..|.+++.+++... | .+..|..|+|++|..- .+.+|+ .+.|... . .+. T Consensus 165 ~d~ll~Gn~~~~i~rd~~------G--~~~~L~~l~p~~V~~~--------------~~~~g~~~~~~~~y~~~-~-~~~ 220 (547) T protein:vir:63 165 RDTYMYDQVNFEKVFNRN------Q--SMVRFVAKDPTTIFFA--------------TTADGKIPDNGNRFVQV-I-DQK 220 (547) T ss_pred HHHHhhCCEEEEEEECCC------C--cEEEEEEecCceeEEE--------------ECCccccccCceEEEEE-c-CCc Confidence 999999999988765321 2 3567888999888421 112221 1222111 1 110 Q ss_pred cccCCccccceeeccccCChhHeEeeecccC---CCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCc Q lcl|NC_019524. 228 PTDMEQWKWGYEPARFDWGRRRVIHIIEALL---AGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELP 304 (556) Q Consensus 228 ~~~~~~~~~~rv~~~~~v~a~~viH~f~~~r---~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~ 304 (556) ....+++++|||+...-. 
.....|+|++..+...+......++....-.+=.+...++|+-+.+ T Consensus 221 -------------~~~~~~~~eiih~r~n~~~~~~~~~~G~Spi~~~~~~i~~~~~a~~~~~~~f~Ng~~p~giL~~~~~ 287 (547) T protein:vir:63 221 -------------IVATFNAREMAFAVRNPRSDIYATGYGYPELEIALKQFIAHENTEAFNDRFFSHGGTTRGILQIKAA 287 (547) T ss_pred -------------EEEEeccccEEEecccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHcCCCcceEEEecCC Confidence 012467889999965332 3356799999988888877766666666666656677777764432 Q ss_pred ccccccccccccccccccccccccccccccccccceecCCceeeec-CCCceeeeecCCCCCccHHHHHHHHHHHHHHhc Q lcl|NC_019524. 305 SDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHL-YPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASL 383 (556) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L-~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaagl 383 (556) .... .+....+..... ....+. =..|.+..| ..|-+++.++.+.-...|.+..+...+.||..+ T Consensus 288 ~~ls--------~e~~~~lk~~~~---~~~~G~----~nagk~~vl~~~g~~~~~l~~~~~d~qfle~~~~~~~~Ia~af 352 (547) T protein:vir:63 288 QQQS--------QHALEIFKREWK---NSLSGI----NGSWQIPVVSAEDVKFVNMTPSARDMEFEKWLNYLINVISALY 352 (547) T ss_pred CCCC--------HHHHHHHHHHHH---HHhcCc----ccccccccccCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHh Confidence 1100 000000000000 000000 134555444 567788888877777889999999999999999 Q ss_pred CCCHHHhhchhhc----------ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccc Q lcl|NC_019524. 384 GMSYEQFSRDYTK----------TNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMF 453 (556) Q Consensus 384 Gi~ye~l~~D~s~----------~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~ 453 (556) |||.+.| |+.+. +|||++.+... .+....++|+..++ +.++-...++ .... 
T Consensus 353 gVPP~~l-G~~~~~~~~~~~~~s~t~sn~e~~~~-----------~~~~~tL~P~~~~i-e~~ln~~L~~--~~~~---- 413 (547) T protein:vir:63 353 GIDPAEI-NIPNNGGATGSKGGSLNEGNSAEKNQ-----------ASKNKGLQPLLGFI-EDFINKHIVA--EFGD---- 413 (547) T ss_pred CCCHHHc-CcccccccccccccccchhhHHHHHH-----------HHHHHHHHHHHHHH-HHHHHhhccc--ccCC---- Confidence 9999866 44433 24555433333 34556667765554 4455444332 1100 Q ss_pred cchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCCCHH-HHHHHHHH----------------H Q lcl|NC_019524. 454 YDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGGDFR-EVFKQRAR----------------E 516 (556) Q Consensus 454 ~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e-~v~~q~a~----------------E 516 (556) .+.++|...-. -|.. +.......+.+|+.|+-|+-+..|+.|. +--+..-. + T Consensus 414 --------~~~~~f~~~~~--~~~~-~~~~~~~~~~~g~lT~NE~R~~~gl~P~~egGD~~~~~~~~~~~~~~~~~~~~~ 482 (547) T protein:vir:63 414 --------KYTFQFVGGDI--KSEL-ESVKILAEKAKVAMTVNEVRKELNLPGDVIGGDIPLNGVIVQRIGQLMQQEQFE 482 (547) T ss_pred --------ceEEEeecccc--ccHH-HHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCceeecccccccccccccccCCc Confidence 01234443222 2332 3223345778899999999988999873 22111100 0 Q ss_pred HHHHH-HcCCCCCccccccCCCCCCCCC---CCCCCCCCCcC-----C-C Q lcl|NC_019524. 517 EGLIK-SLKLDFTGKMVEGNSTQSSNSS---ESTSDNPNEET-----T-Q 556 (556) Q Consensus 517 ~~~~~-~~Gl~~~~~~~~~~~~~~~~~~---~~~~~~~~~e~-----~-~ 556 (556) .+..+ ...-.. ........+.....+ +..++.+++.+ + . 
T Consensus 483 ~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~ 531 (547) T protein:vir:63 483 HEKQQSNLQMLQ-EQTGNRVSTDVEDIPDGKDTTGDIGKDGQRKDKDNAN 531 (547) T ss_pred cccchhhccccc-cccCCCCCCCCCCCCCCcccCCCcCccccccCccccc Confidence 00000 000000 000000000000000 00000000000 0 0 No 77 >protein:vir:94049 Length: 532 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453629;genbank:gi:84662665;genbank:GeneID:5142559 Probab=99.58 E-value=5.1e-15 Score=98.95 Aligned_cols=464 Identities=11% Similarity=0.066 Sum_probs=219.3 Q ss_pred CC-------------cchhhhHHHHHhhHhhcccchhhhhh-------hhcchhcc---------ccCCCcccccccCCC Q lcl|NC_019524. 1 MK-------------DVKKTTRTRAKKAVDVVAETATATPM-------AVGGGMEG---------AERTTREMFQWNPSI 51 (556) Q Consensus 1 ~s-------------p~~~~~r~~a~~a~~~~~~~~~~~~~-------~~~~~y~a---------a~~~~r~~~~w~~~~ 51 (556) |. -.....|++++++.+-....+++-.. ....++.. ..+.++..-.|.... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~~~~~~~~~~~~~~~ 80 (532) T protein:vir:94 1 MADTDPTPRPEITYATLQQAQRVDAKRATHTSLGLATAHEIDPTAYSPYERNAAQNAMAMDYGLQTGRNGRNALSFVEAT 80 (532) T ss_pred CCCCCCCCCcceehhhhhhHhhhhhhhhhhhhhhhhhhhhhcccccccccccccccccccccccCccccccccccccccc Confidence 22 23344556655554211110000000 00011110 111121111222111 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhccc Q lcl|NC_019524. 52 ISPDQQIAQNQDMASARAQDMVQNDGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESP 131 (556) Q Consensus 52 ~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~ 131 (556) .-... ..-.+++-|++++.+|+.+++..+=.|+.+...-+ ++...+..++++..|++. 
T Consensus 81 ~~~~~-----------~l~a~Y~~~~l~r~~Vd~~aed~~r~~~~i~~~~~-------~~~~~~~~~~i~~~~~~l---- 138 (532) T protein:vir:94 81 SWPGF-----------PTLALLAQLPEYRTMHETPADECVRAWGKITCSSK-------DELAADKATRITQKLEQY---- 138 (532) T ss_pred ccchH-----------HHHHHHHcCchhhhhhccchHHHhhCCceEeeCCc-------cccchHHHHHHHHHHHhh---- Confidence 11111 11237888999999999999999999998875321 122234445556555542 Q ss_pred ccceehhcccCHHHHHHHHhhhheecCceEEEEeeccCCCCcCCCcccc-----------eEEEEEchhhcCCCCCCCCC Q lcl|NC_019524. 132 ENWFDARRMCTLTGLTRLAVSGFLMTGEVLATCEWLNPTGTTMQRRPFG-----------TAIQMISPYRMSNPNNVMDT 200 (556) Q Consensus 132 ~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~~~~~~~~~~-----------l~lq~ie~drl~~~~~~~~g 200 (556) .+.+....+++....-|.+++.+...........-.|+. ..|.+|++..|. |...... T Consensus 139 ----------~v~~~l~~a~~~~rlyG~a~i~i~v~~~~~~~~~~~p~~l~~~~I~~g~~~~l~vld~~~v~-p~~~~~~ 207 (532) T protein:vir:94 139 ----------NVRTLVRTVVIHDQAYGGAHVFPHLKMDGDSVPADAPLLLSPSFVQRGCLIGFATIEPMWLS-PNAYNAT 207 (532) T ss_pred ----------hHHHHHHHHHHhhhcccceEEEEEeccCCccccccccccccccccccceeeEEEeechheec-ccccccc Confidence 222333344444445565555544321100000000000 135556666553 1100000 Q ss_pred ceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceeeccccCChhHeEeeecccCCCcc------cCCchhhHHHHH Q lcl|NC_019524. 201 PNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPARFDWGRRRVIHIIEALLAGQT------RGISEMVSALKQ 274 (556) Q Consensus 201 ~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~------RGvs~la~~l~~ 274 (556) -+--..+|+|..|++.. ...|+.+.|||+-...-|... -|+|.|-+++.. T Consensus 208 -----dp~sp~fg~P~~y~v~~-------------------g~~iH~SRli~f~g~~~p~~~~~~~~~~G~Svlq~~~~~ 263 (532) T protein:vir:94 208 -----DPTLPSFYKPDSWIATS-------------------GKKIHSSRIHTVVGRPVGDMLKAAYSFRGVSISQLAMPY 263 (532) T ss_pred -----cccccccCCceeEEEcc-------------------CeeeccceEEEecCCCchhhhccccccccccHHHHHHHH Confidence 00011468888887631 125778889997554444444 599999999998 Q ss_pred HHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccccccccccccccccccceecCCceeeecCCCc Q lcl|NC_019524. 
275 MKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGT 354 (556) Q Consensus 275 l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe 354 (556) |++++........-..- +.+ .++|+..... . . .+............ ..... .-|.+..-..++ T Consensus 264 l~~~~~t~~~~~~l~~~-~~~-~v~k~~~a~~-l-s------~~~~~~~~~r~~~~------~~~~~-n~g~~~id~~~e 326 (532) T protein:vir:94 264 VDNWLRTRQSVSDTVKQ-FSM-TNLATDMAQL-L-A------PGGAQSLDARLQLF------NLYRD-NRNIGALDKGTE 326 (532) T ss_pred HHHHHHHHHHHHHHHHh-cCC-ceeeechHHh-h-c------chhHHHHHHHHHHH------HhhcC-CccceEEcCCCc Confidence 88887766655442222 222 3344432111 0 0 00000000000000 00000 113332223468 Q ss_pred eeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 355 KLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNY-SSARASMAETQKYMDSRKKLVADRFASAIYTLWL 433 (556) Q Consensus 355 ~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nY-Ss~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l 433 (556) +++.++.+ -++..+........||+++|||.-.|.|- |-..+ |+.-.-+..+...++.+|+..+..++.-+++.++ T Consensus 327 ~~e~~~~~--lsgl~~~l~~~~~~iAaa~~IP~t~LfG~-sp~GlnstGe~D~~~yyd~I~s~Qe~~l~p~le~l~~~l~ 403 (532) T protein:vir:94 327 EIQQTNTP--LSGLDSLQAQSQEQMAAVSHIPLVKLLGI-TPNGLNASSDGEIRVWYDFIAGYQATNLTPLMEWIIDLIQ 403 (532) T ss_pred eeEEEecc--cCCHHHHHHHHHHHHHhHhCCCeeeeecC-CcccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 88888733 44789999999999999999999888774 22344 4456677888889999998776665555555544 Q ss_pred HHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccch-hhhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHH Q lcl|NC_019524. 434 EEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDE-KKETEAAILRIKNGLSTYEAEISRLGGDFREVFKQ 512 (556) Q Consensus 434 ~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP-~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q 512 (556) ..+ -|. .+..+.+. ++-.|.....+-+|- .|-+++....+++|+.|..++....|.+++.-+.. 
T Consensus 404 ~s~--~g~--~~~d~~~~-----------f~pL~~~s~kEkAei~~~~a~a~~~~~~~Gvi~~~Evr~~l~~~~~~~~~~ 468 (532) T protein:vir:94 404 LSE--YGQ--IDPGLAWE-----------WSPLMELDDKELAEVRQLNASTDSTLMELGVIDAKMVQQRLAADPTSGYAG 468 (532) T ss_pred HHh--cCC--CCCCceEE-----------eCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHhcCCcccccc Confidence 322 132 22222111 112344444444443 46778889999999999999998888888654443 Q ss_pred HHHHHHHHH------HcCCCCCccccccCCCCCCCCCCCCCCCCCC------cCCC Q lcl|NC_019524. 513 RAREEGLIK------SLKLDFTGKMVEGNSTQSSNSSESTSDNPNE------ETTQ 556 (556) Q Consensus 513 ~a~E~~~~~------~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~------e~~~ 556 (556) ...+..... +.+-.....+....+....+..+.++|.++. +.-| T Consensus 469 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~ 524 (532) T protein:vir:94 469 ALGERDELDDVEEIAKQLMAAALNPPATAPQTPNPQPDSEDDQTDNQPDAQADPAQ 524 (532) T ss_pred ccccccccccccchhhhhcccccCCCCCCCCCCCCCCCCCCCCCCCccCCCccccc Confidence 322211110 0000000011111111111111111111111 0111 No 78 >protein:vir:4598 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058443;genbank:gi:9635169;genbank:GeneID:1262702 Probab=99.57 E-value=3.2e-15 Score=100.06 Aligned_cols=416 Identities=10% Similarity=0.029 Sum_probs=214.2 Q ss_pred HhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHhhhccCCc Q lcl|NC_019524. 16 VDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVVAVHRDSIVGSQY 95 (556) Q Consensus 16 ~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v~~~~~nvVG~Gi 95 (556) +....+.. .+.....+.....-.....+|...... 
.-+-.-+..++-+-++|+.|.+.+.+.-| T Consensus 1 Mg~f~~~~---~r~~~~~~~~~~~~~~~~~~~~~~~~~-------------~~~~~~al~~~~v~~cv~~Ia~~iA~~p~ 64 (416) T protein:vir:45 1 MGIFYKNE---KRDLQYNEDDLQMMVQTLPGFQGTKLR-------------QYKDIEAIRHSDIFTAVMMIASDLARMPI 64 (416) T ss_pred CCcccccc---cccccCCCcchhHHHHHhccccccCcc-------------ccchhhhhcchHHHHHHHHHHHhhccCce Confidence 11110000 000000000000000000111110000 01111112234456789999888887555 Q ss_pred eeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEEEeeccCCCCcCC Q lcl|NC_019524. 96 KLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLATCEWLNPTGTTMQ 175 (556) Q Consensus 96 ~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~~~~~ 175 (556) ++.- + ++.. .. ...+..+-.+|+ -.+|..++-...+..++..|++|+.+.+.. T Consensus 65 ~~~~--~-------~~~~--~~---~~~~~lL~~~PN------~~~t~~~f~~~~~~~lll~Gna~~~i~r~~------- 117 (416) T protein:vir:45 65 RVTV--N-------GQIN--YS---DRIVNLLNTRPN------PMYNGYIFKLVVFVSALLTSHGYIEITRDK------- 117 (416) T ss_pred EEec--C-------cccc--cc---chHHHHHhcccc------cCCCHHHHHHHHHHHHhhcCCeEEEEEECC------- Confidence 5431 1 1110 01 122333444554 356888888899999999999999976532 Q ss_pred CcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceeeccccCChhHeEeeec Q lcl|NC_019524. 176 RRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPARFDWGRRRVIHIIE 255 (556) Q Consensus 176 ~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~~~~v~a~~viH~f~ 255 (556) ...+..|..|.|+.+. |..|..|++..|+......+.. ....++.++|||+.. 
T Consensus 118 -~G~~~~L~~i~~~~v~--------------v~~~~~g~~~~~~~~~~~~~~~------------~~~~~~~~evihir~ 170 (416) T protein:vir:45 118 -TGEPMNLTFRKTSEIE--------------LKSDARGRLYYFHQRIDSNGNN------------IERNVKFEDMLDIKF 170 (416) T ss_pred -CCcEEEEEEEcCceeE--------------EEECCCccEEEEEEEecCCCce------------eEEEEccccEEEecc Confidence 1235678889888884 4567778877665543332211 112477889999975 Q ss_pred ccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccccccccccccccc Q lcl|NC_019524. 256 ALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVA 335 (556) Q Consensus 256 ~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 335 (556) . ..++..|+|++..+...+.-.....++.....+=.+...++|+.+..-.. ++..+...+ ... T Consensus 171 ~-~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~---------~~~~~~~~~-------~~~ 233 (416) T protein:vir:45 171 Y-SLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDN---------KKARDRARE-------EFH 233 (416) T ss_pred C-CCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCC---------HHHHHHHHH-------HHH Confidence 4 56789999999998887777666666666666667778888886532110 000000000 000 Q ss_pred cccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccchhHHHHHHHHHHHHHH Q lcl|NC_019524. 336 QTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSARASMAETQKYMDS 415 (556) Q Consensus 336 ~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R~~~~e~~r~~~~ 415 (556) ....-.-..|.+..|..|.+++.++.+.-..+|.+..+.....||+.+|||-..|.. +..++| ..+..+. 
T Consensus 234 ~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~--~~~~~~-~~~~~~~------- 303 (416) T protein:vir:45 234 KSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGI--ETANMS-ITDANLD------- 303 (416) T ss_pred HHhcCccccCceeecCCCceeEeccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCC--CCCCcc-HHHHHHH------- Confidence 000001134667789999999999877666678888899999999999999987633 344443 2222111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCH Q lcl|NC_019524. 416 RKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTY 495 (556) Q Consensus 416 ~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~ 495 (556) ++ +-+.|+... ++.++....++--.+. .+++..-.....|+...+++..+.+.+|+.|+ T Consensus 304 ----~~-~~l~P~~~~-ie~~ln~~l~~~~~~~---------------~~~f~~~~l~~~D~~~~~~~~~~~~~~G~~T~ 362 (416) T protein:vir:45 304 ----YL-STLKPYITC-VCAELNFKFNDEYVNR---------------EFKFDTTEIRVVDEKTQAEIDKINIDSGKMNI 362 (416) T ss_pred ----HH-HHHHHHHHH-HHHHHhhhccccccCc---------------eEEEechhhhccCHHHHHHHHHHHHhCCCcCH Confidence 11 235665553 3444444322110010 12222223344599999999999999999999 Q ss_pred HHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 496 EAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 496 ~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 556 (556) -|+-+..|..|-+--+.... .+ -+... 
+.+.....+.......+.+.+.+ |.+| T Consensus 363 NE~R~~~gl~p~~~gd~~~~---~~-~~n~~-~~~~~~~~~~~~~~~~~~~~kgG--e~n~ 416 (416) T protein:vir:45 363 DEIRQRDGLAPIPGGNGSIH---RV-DLNHV-NIELVDEYQMNKSRATDKKLKGG--EENE 416 (416) T ss_pred HHHHHHhCCCCCCCCCcceE---ee-ccccc-ccccccccCcccccccccccCCC--CCCC Confidence 99888878777441111000 00 00000 00000000111111111111222 2222 No 79 >protein:vir:81095 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429872;genbank:gi:156603925;genbank:GeneID:5525315 Probab=99.57 E-value=3.2e-15 Score=100.06 Aligned_cols=416 Identities=10% Similarity=0.029 Sum_probs=214.2 Q ss_pred HhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHhhhccCCc Q lcl|NC_019524. 16 VDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVVAVHRDSIVGSQY 95 (556) Q Consensus 16 ~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v~~~~~nvVG~Gi 95 (556) +....+.. .+.....+.....-.....+|...... .-+-.-+..++-+-++|+.|.+.+.+.-| T Consensus 1 Mg~f~~~~---~r~~~~~~~~~~~~~~~~~~~~~~~~~-------------~~~~~~al~~~~v~~cv~~Ia~~iA~~p~ 64 (416) T protein:vir:81 1 MGIFYKNE---KRDLQYNEDDLQMMVQTLPGFQGTKLR-------------QYKDIEAIRHSDIFTAVMMIASDLARMPI 64 (416) T ss_pred CCcccccc---cccccCCCcchhHHHHHhccccccCcc-------------ccchhhhhcchHHHHHHHHHHHhhccCce Confidence 11110000 000000000000000000111110000 01111112234456789999888887555 Q ss_pred eeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEEEeeccCCCCcCC Q lcl|NC_019524. 96 KLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLATCEWLNPTGTTMQ 175 (556) Q Consensus 96 ~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~~~~~ 175 (556) ++.- + ++.. .. ...+..+-.+|+ -.+|..++-...+..++..|++|+.+.+.. 
T Consensus 65 ~~~~--~-------~~~~--~~---~~~~~lL~~~PN------~~~t~~~f~~~~~~~lll~Gna~~~i~r~~------- 117 (416) T protein:vir:81 65 RVTV--N-------GQIN--YS---DRIVNLLNTRPN------PMYNGYIFKLVVFVSALLTSHGYIEITRDK------- 117 (416) T ss_pred EEec--C-------cccc--cc---chHHHHHhcccc------cCCCHHHHHHHHHHHHhhcCCeEEEEEECC------- Confidence 5431 1 1110 01 122333444554 356888888899999999999999976532 Q ss_pred CcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceeeccccCChhHeEeeec Q lcl|NC_019524. 176 RRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPARFDWGRRRVIHIIE 255 (556) Q Consensus 176 ~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~~~~v~a~~viH~f~ 255 (556) ...+..|..|.|+.+. |..|..|++..|+......+.. ....++.++|||+.. T Consensus 118 -~G~~~~L~~i~~~~v~--------------v~~~~~g~~~~~~~~~~~~~~~------------~~~~~~~~evihir~ 170 (416) T protein:vir:81 118 -TGEPMNLTFRKTSEIE--------------LKSDARGRLYYFHQRIDSNGNN------------IERNVKFEDMLDIKF 170 (416) T ss_pred -CCcEEEEEEEcCceeE--------------EEECCCccEEEEEEEecCCCce------------eEEEEccccEEEecc Confidence 1235678889888884 4567778877665543332211 112477889999975 Q ss_pred ccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccccccccccccccc Q lcl|NC_019524. 256 ALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVA 335 (556) Q Consensus 256 ~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 335 (556) . ..++..|+|++..+...+.-.....++.....+=.+...++|+.+..-.. ++..+...+ ... T Consensus 171 ~-~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~---------~~~~~~~~~-------~~~ 233 (416) T protein:vir:81 171 Y-SLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDN---------KKARDRARE-------EFH 233 (416) T ss_pred C-CCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCC---------HHHHHHHHH-------HHH Confidence 4 56789999999998887777666666666666667778888886532110 000000000 000 Q ss_pred cccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccchhHHHHHHHHHHHHHH Q lcl|NC_019524. 
336 QTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSARASMAETQKYMDS 415 (556) Q Consensus 336 ~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R~~~~e~~r~~~~ 415 (556) ....-.-..|.+..|..|.+++.++.+.-..+|.+..+.....||+.+|||-..|.. +..++| ..+..+. T Consensus 234 ~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~--~~~~~~-~~~~~~~------- 303 (416) T protein:vir:81 234 KSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGI--ETANMS-ITDANLD------- 303 (416) T ss_pred HHhcCccccCceeecCCCceeEeccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCC--CCCCcc-HHHHHHH------- Confidence 000001134667789999999999877666678888899999999999999987633 344443 2222111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCH Q lcl|NC_019524. 416 RKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTY 495 (556) Q Consensus 416 ~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~ 495 (556) ++ +-+.|+... ++.++....++--.+. .+++..-.....|+...+++..+.+.+|+.|+ T Consensus 304 ----~~-~~l~P~~~~-ie~~ln~~l~~~~~~~---------------~~~f~~~~l~~~D~~~~~~~~~~~~~~G~~T~ 362 (416) T protein:vir:81 304 ----YL-STLKPYITC-VCAELNFKFNDEYVNR---------------EFKFDTTEIRVVDEKTQAEIDKINIDSGKMNI 362 (416) T ss_pred ----HH-HHHHHHHHH-HHHHHhhhccccccCc---------------eEEEechhhhccCHHHHHHHHHHHHhCCCcCH Confidence 11 235665553 3444444322110010 12222223344599999999999999999999 Q ss_pred HHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 496 EAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 496 ~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 556 (556) -|+-+..|..|-+--+.... .+ -+... 
+.+.....+.......+.+.+.+ |.+| T Consensus 363 NE~R~~~gl~p~~~gd~~~~---~~-~~n~~-~~~~~~~~~~~~~~~~~~~~kgG--e~n~ 416 (416) T protein:vir:81 363 DEIRQRDGLAPIPGGNGSIH---RV-DLNHV-NIELVDEYQMNKSRATDKKLKGG--EENE 416 (416) T ss_pred HHHHHHhCCCCCCCCCcceE---ee-ccccc-ccccccccCcccccccccccCCC--CCCC Confidence 99888878777441111000 00 00000 00000000111111111111222 2222 No 80 >protein:vir:80644 Length: 551 # NCBI annotation: gp23 # Family: family:all:2446 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468463;genbank:gi:157325038;genbank:GeneID:5601615 Probab=99.57 E-value=7e-14 Score=92.71 Aligned_cols=446 Identities=11% Similarity=0.038 Sum_probs=204.2 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAA 80 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~ 80 (556) +.+...-.- .++........ .+....+-.+....++ ++....+ ..|++-. +.+.+||+++ T Consensus 35 ~~~~~~~~~---~k~~~~~~~a~-~~~~~~~~~~~~~~~~-------r~~~~~~--------~~l~~~~-~~~~~npiv~ 94 (551) T protein:vir:80 35 IQQREQEQI---SKAMNNKEVAY-SQPVIGSMSANPGFKT-------KPSIRNN--------QDLHGVL-KKFGGNIILN 94 (551) T ss_pred cccccHHHH---HHhhccCccee-ecccccceecCccccc-------CccccCh--------hHHHHHH-HHhhcCHHHH Confidence 112211110 01110000000 0000000111111110 1111111 1122222 2456689999 Q ss_pred HHHHHHHhhhccC-----------CceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHH Q lcl|NC_019524. 81 GVVAVHRDSIVGS-----------QYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRL 149 (556) Q Consensus 81 ~~v~~~~~nvVG~-----------Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l 149 (556) .||+.++++|.+. |+....+.. +.+....-..+++.. ..+..+++-.-+. .+.||.++... 
T Consensus 95 ~~I~~ia~~IA~~~~~~~~~~~g~~~~i~~kd~------~~~~~~~~~~~~~~i-~~~l~~pn~~~~p-~~~s~~~f~~~ 166 (551) T protein:vir:80 95 AIINTRSNQVSMYCKPARHSEKGVGFEVRLKDL------DKKPTSHDEATIKRI-ESFIEKTGVDNDI-NRDSFSSFVKK 166 (551) T ss_pred HHHHHHHHHHhhhhhhhhhhcCCCCceEEeccc------CcccChhHHHHHHHH-HHHHHhcCCCCCC-ccchHHHHHHH Confidence 9999999998742 333322210 001111111111111 2233333311111 13589999999 Q ss_pred HhhhheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCC----eEEEEEeecCC Q lcl|NC_019524. 150 AVSGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGA----ALGYWLRKAFP 225 (556) Q Consensus 150 ~~r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr----~vaY~i~~~hp 225 (556) ++..++..|.+++.+++... | .+..|..|+|++|.. ..+.+|. .+.|... .. T Consensus 167 lv~dlll~Gnay~~i~rd~~------G--~~~~L~~l~p~~V~v--------------~~~~~g~~~~~~~~y~~~--~~ 222 (551) T protein:vir:80 167 IVRDTYMYDQVNFEKVFNRN------Q--SMVRFVAKDPTTIFF--------------ATTADGKIPDNGNRFVQV--ID 222 (551) T ss_pred HHHHHHhcCCEEEEEEECCC------C--cEEEEEEeCCceeEE--------------EECCccccccCceEEEEE--eC Confidence 99999999999988765321 2 367889999988842 1222222 1222221 11 Q ss_pred CccccCCccccceeeccccCChhHeEeeeccc---CCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEecc Q lcl|NC_019524. 226 GDPTDMEQWKWGYEPARFDWGRRRVIHIIEAL---LAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESE 302 (556) Q Consensus 226 gd~~~~~~~~~~rv~~~~~v~a~~viH~f~~~---r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~ 302 (556) +. 
....+++++|||+...- ......|+|++..+...+.......+....-.+=.+...++|+-+ T Consensus 223 g~-------------~~~~~~~~eiiH~~~n~~~~~~~~~~G~spi~~a~~~i~~~~a~~~~~~~~f~Ng~~p~giL~~~ 289 (551) T protein:vir:80 223 QK-------------IVATFNAREMAFAVRNPRSDIYATGYGYPELEIALKQFIAHENTEAFNDRFFSHGGTTRGILQIK 289 (551) T ss_pred Cc-------------EEEEEcccceEEecccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHcCCCcceEEEEc Confidence 10 01246778999997543 334567999998888777666655555555555556677777643 Q ss_pred CcccccccccccccccccccccccccccccccccccceecCCceeeec-CCCceeeeecCCCCCccHHHHHHHHHHHHHH Q lcl|NC_019524. 303 LPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHL-YPGTKLKMQPAGTPGGVGTDYEQSLLRNIAA 381 (556) Q Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L-~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaa 381 (556) .+.... .+....+..... ....+. =..|.+..| ..|-+++.++.+.....|.+..+...+.||+ T Consensus 290 ~~~~lt--------~e~~~~lk~~~~---~~~~G~----~nag~~~vl~~~g~~~~~l~~~~~D~qfle~~~~~~~~Ia~ 354 (551) T protein:vir:80 290 AAQQQS--------QHALEIFKREWK---NSLSGI----NGSWQIPVVSAEDVKFVNMTPSARDMEFEKWLNYLINVISA 354 (551) T ss_pred CCCCCC--------HHHHHHHHHHHH---HHhcCc----cccCccccccCCCceEEEccCChhHHHHHHHHHHHHHHHHH Confidence 221100 000000000000 000000 134555444 5678999888777788899999999999999 Q ss_pred hcCCCHHHhhchhhc----------ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccc Q lcl|NC_019524. 382 SLGMSYEQFSRDYTK----------TNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWR 451 (556) Q Consensus 382 glGi~ye~l~~D~s~----------~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~ 451 (556) .+|||.+.| |+.+. +|||.+.+.... +....++|+..+ ++.++....++ .... 
T Consensus 355 aFgVPp~~l-G~~~~~~~~~~~~~s~t~sn~e~~~~~-----------f~~~tL~P~~~~-ie~~ln~~L~~--~~~~-- 417 (551) T protein:vir:80 355 LYGIDPAEI-NIPNNGGATGSKGGSLNEGNSAEKNQA-----------SKNKGLQPLLGF-IEDFINKHIVA--EFGD-- 417 (551) T ss_pred HhcCCHHHc-CcccccccccccccccchhhHHHHHHH-----------HHHHHHHHHHHH-HHHHHHhhhcc--ccCC-- Confidence 999998866 44332 356665444333 445556665554 35555554332 1100 Q ss_pred cccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCCCHH-HHHHHHH---------------- Q lcl|NC_019524. 452 MFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGGDFR-EVFKQRA---------------- 514 (556) Q Consensus 452 ~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e-~v~~q~a---------------- 514 (556) .+.+++...- .-|.. +.......+.+|+.|+-|+-+..|++|. +--+..- T Consensus 418 ----------~~~f~f~~~~--~~~~~-~~~~~~~~~~~g~lT~NE~R~~~gl~P~~egGD~~~~~~~~~~~~~~~~~~~ 484 (551) T protein:vir:80 418 ----------KYTFQFVGGD--IKSEL-ESVKILAEKAKVAMTVNEVRKELNLPGDVIGGDIPLNGVIVQRIGQLMQQEQ 484 (551) T ss_pred ----------ceEEEeeccC--hhhHH-HHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCceeecccccccccccccccC Confidence 0123333221 12322 2222335677899999999999999872 3111110 Q ss_pred HHHHHHHHc--CCC-CCccccccCC---CCCCCCCCCCCCCCCC--cCC-C Q lcl|NC_019524. 515 REEGLIKSL--KLD-FTGKMVEGNS---TQSSNSSESTSDNPNE--ETT-Q 556 (556) Q Consensus 515 ~E~~~~~~~--Gl~-~~~~~~~~~~---~~~~~~~~~~~~~~~~--e~~-~ 556 (556) .+.+..++. .+. .......+.+ +..++...+.+++... 
|++ + T Consensus 485 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~ 535 (551) T protein:vir:80 485 FEHEKQQSNLQMLQEQTGNRVSTDVEDIPDGKDTTGDIGKDGQRKDKDNAN 535 (551) T ss_pred cchhhhhhccccccCcCCCCCCCCCCCCCCccccCCCccccccccCccccc Confidence 011111100 011 0000000000 0000000000000000 111 0 No 81 >protein:vir:4828 Length: 382 # NCBI annotation: ORF24 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038325;genbank:gi:9634651;genbank:GeneID:1262630 Probab=99.56 E-value=3e-14 Score=94.69 Aligned_cols=382 Identities=9% Similarity=0.033 Sum_probs=205.7 Q ss_pred chhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHH Q lcl|NC_019524. 4 VKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVV 83 (556) Q Consensus 4 ~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v 83 (556) |.=..|...++. ... ..+..... ......|.. +..+. ..-+..++.+.++| T Consensus 1 Mg~f~~~~~~~~------------~~~-~~~~~~~~-~~~~~~~~~-----~~~v~----------~~~~l~~~~v~~~i 51 (382) T protein:vir:48 1 MPIFNLATESPP------------DNQ-GGFFDVVD-SDFLASLKG-----NEWVS----------AETALRNSDLFSII 51 (382) T ss_pred CccccccccCCc------------ccc-cccccchh-hhccccccC-----Ccccc----------hHhhhccHHHHHHH Confidence 222222111100 000 00000000 000111111 00000 01113467889999 Q ss_pred HHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEE Q lcl|NC_019524. 84 AVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLAT 163 (556) Q Consensus 84 ~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~ 163 (556) +.+.+.|-..-|++.-. .+..+ -.+|+ -.+|.+++....+..++..|++|+. 
T Consensus 52 ~~ia~~ia~~~~~~~~~-----------~~~~L-----------~~~PN------~~~t~~~f~~~l~~~l~l~Gna~~~ 103 (382) T protein:vir:48 52 NQLSNDLATVKLITSRK-----------KLQGI-----------VDNPS------NNANRFNFYQSIFAQMLLGGEAFAY 103 (382) T ss_pred HHHHHhhccCceeeecc-----------hhhhh-----------hhhcC------CCCCHHHHHHHHHHHhhhcCCEEEE Confidence 99999998765554321 11111 12343 2468999999999999999999998 Q ss_pred EeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceeeccc Q lcl|NC_019524. 164 CEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPARF 243 (556) Q Consensus 164 ~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~~~ 243 (556) ++.... ..+..|..|.|+.+. |+.+.+|..+.|.+....+... ... T Consensus 104 i~rd~~--------G~~~~l~~i~~~~v~--------------v~~~~~~~~~~y~~~~~~~~~~------------~~~ 149 (382) T protein:vir:48 104 RWRNEN--------GRDMKWEYLRPSQVS--------------FNRLDNKDGIYYNITFDDPRIP------------PKQ 149 (382) T ss_pred EEECCC--------CcEEEEEEEcCceeE--------------EEEcCCCCeEEEEEEecCcccc------------cee Confidence 864321 235688999999884 4556777788888765543211 123 Q ss_pred cCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccc Q lcl|NC_019524. 244 DWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIF 323 (556) Q Consensus 244 ~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~ 323 (556) .+++++|||+..+...|...|+|++.++...+.......++.....+=.+...++|+.+..... +..... T Consensus 150 ~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~----------e~~~~~ 219 (382) T protein:vir:48 150 HVPQNDVLHFRLLSVDGGMTSVSPLMALSRELDIQKASGNLTINSLKNALNANGILKIKGGGLL----------DFKTKL 219 (382) T ss_pred EEcCccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCh----------HHHHHH Confidence 4677899999988888999999999999999988888888888877778888888887533111 000000 Q ss_pred cccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccchhHH Q lcl|NC_019524. 
324 NEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSAR 403 (556) Q Consensus 324 ~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R 403 (556) .. . .... .=..|.+..|..|.+++.++.+.-..+|.+..+...+.||..+|||-..| |+.. ++++.. T Consensus 220 ~~---~---~~~~----~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~afgVp~~~l-g~~~--~~~~~~ 286 (382) T protein:vir:48 220 SR---S---RQAM----KQMQGGPLVLDDLEDFTPLEIKSNVSQLLKQADWTTGQFAKVYGIPDNVV-GGQG--DQQSSL 286 (382) T ss_pred HH---H---HHhh----ccCCCCeeEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHh-CCCC--CcccHH Confidence 00 0 0000 11357788899999999999877778898999999999999999998766 5432 233332 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHH Q lcl|NC_019524. 404 ASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEA 483 (556) Q Consensus 404 ~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A 483 (556) +.... ++...+.|+...+.+ ++....++ .. . .......+ .|...-... T Consensus 287 ~~~~~-----------~~~~~l~p~~~~i~~-~l~~~l~~--~~-~---------~~~~~~~~--------~~~~~~~~~ 334 (382) T protein:vir:48 287 EMSSD-----------LYSKAVSRYLRPFLS-ELSQKLSC--DV-D---------ADIFPAVD--------PTGSNYISR 334 (382) T ss_pred HHHHH-----------HHHHHHHHHHHHHHH-HHHHHhcC--hh-h---------hhhhhhhc--------cchhHHHHH Confidence 22222 334445554444322 22221111 00 0 00000001 122222222 Q ss_pred HHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCcC Q lcl|NC_019524. 484 AILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEET 554 (556) Q Consensus 484 ~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e~ 554 (556) ....+++|+.|+.|+....+ ..|+..+..+.... 
..++.+-+|+ ++++ T Consensus 335 ~~~l~~~g~~t~~e~r~~l~------------------~~g~~~~~~~~~~~----~~~~~~GGd~-~~~~ 382 (382) T protein:vir:48 335 INSLVKTGTLAQNQGLYILQ------------------QAEILPKELPNGEN----PNSTLKGGEE-DGQD 382 (382) T ss_pred HHHHhhcCccCHHHHHHHHh------------------hCCCCCcchhhhhc----CCCCCCCCCC-CCCC Confidence 23567889999888754321 22322111111000 0000011111 1111 No 82 >protein:vir:78641 Length: 278 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429941;genbank:gi:156603995;genbank:GeneID:5525387 Probab=99.55 E-value=7.9e-16 Score=103.37 Aligned_cols=275 Identities=12% Similarity=0.053 Sum_probs=166.9 Q ss_pred ccC-CceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEEEeeccC Q lcl|NC_019524. 91 VGS-QYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLATCEWLNP 169 (556) Q Consensus 91 VG~-Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~ 169 (556) |+. =|++.-+ +++.. ...+..+..+|+ -.+|+.++-...+..++..|++|+.++... T Consensus 1 ia~l~~~~~~~--------~~~~~-------~~l~~lL~~~PN------~~~t~~~f~~~~~~~ll~~Gna~~~i~r~~- 58 (278) T protein:vir:78 1 MASLPLKMYED--------YKVVN-------TEVSDLLTVSPN------NSLSSFDFINQIETIRNEKGNAYVLIERDI- 58 (278) T ss_pred CccceeEEEec--------Ccccc-------cHHHHHHHhcCC------CCCCHHHHHHHHHHHHhhcCCEEEEEEECC- Confidence 332 2222111 11111 123344445665 246899999999999999999999876532 Q ss_pred CCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceeeccccCChhH Q lcl|NC_019524. 170 TGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPARFDWGRRR 249 (556) Q Consensus 170 ~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~~~~v~a~~ 249 (556) . ..+..|..|.|++|. 
|+.+..|.++.|.+...+ | ....+++.+ T Consensus 59 -----~--G~~~~l~~l~~~~v~--------------v~~~~~~~~~~y~~~~~~-g--------------~~~~~~~~e 102 (278) T protein:vir:78 59 -----Y--HQPSKLFLLNPDVVE--------------MLIENQSRELYYSIHAAT-G--------------NKLIVHNMD 102 (278) T ss_pred -----C--CcEEEEEEECCceeE--------------EEEcCCCceEEEEEEcCC-c--------------eEEEEcccc Confidence 1 235788999999883 566788889999885322 1 112467889 Q ss_pred eEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccccccccc Q lcl|NC_019524. 250 VIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTG 329 (556) Q Consensus 250 viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 329 (556) |||+..+...++..|+|++..+...+..........+. .....-.++++.+..- .++......+.. T Consensus 103 vih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~--~~~~~~~~i~~~~~~l----------~~e~~~~~~~~~-- 168 (278) T protein:vir:78 103 MLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFNLT--EMQKPDSFMLKYGSNV----------GKEKRQQVLEDF-- 168 (278) T ss_pred EEEECCCCCCCCeeeccHHHHHHHHHHHHHHHHHHHHH--HhcCCCcEEEEeCCCC----------CHHHHHHHHHHH-- Confidence 99999888889999999999988777665554443322 2222233444433211 011111100100 Q ss_pred cccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccchhHHHHHHHH Q lcl|NC_019524. 330 LANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSARASMAET 409 (556) Q Consensus 330 ~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R~~~~e~ 409 (556) .... 
-..|.+..|+.|.+++.++.+....+|.+..+...+.||+++|||-+ +.|+..+.|||++++....+ T Consensus 169 -~~~~-------~~~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~-~lg~~~~~~~sn~~~~~~~~ 239 (278) T protein:vir:78 169 -KQYY-------EENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSV-FLNARSNTNFAKNEELNRFY 239 (278) T ss_pred -HHHh-------ccCCCceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHH-HhCCCCCCCcccHHHHHHHH Confidence 0000 13577889999999999998878888999999999999999999966 55888889999997766555 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCc--ccccccchhhHHHhhCeeeecCcc Q lcl|NC_019524. 410 QKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGK--NWRMFYDPMMRDALCNAEWIGASR 472 (556) Q Consensus 410 ~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~--~~~~~~~~~~~~a~~~~~w~~p~~ 472 (556) . ...++|+.+.+- .++-...++-.... ..+.|+ .-+. T Consensus 240 ~-----------~~~l~P~~~~i~-~~ln~~L~~~~e~~~g~~~~f~--------------~~~l 278 (278) T protein:vir:78 240 L-----------QHTLLPIVKQYE-EEFNRKLLTKTDREKIGILNLT--------------LNLI 278 (278) T ss_pred H-----------HHHHHHHHHHHH-HHHHhhcCChhHhcCCceEEEe--------------cccC Confidence 3 444666555433 33443333211100 011111 1111 No 83 >protein:vir:3868 Length: 417 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680485;swissprot:trembl:q8ltc2;genbank:gi:22296525;interpro:IPR006427;interpro:IPR006944;uniprot:Q8LTC2;genbank:GeneID:951699 Probab=99.55 E-value=6.5e-15 Score=98.36 Aligned_cols=399 Identities=9% Similarity=-0.016 Sum_probs=202.9 Q ss_pred chhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccc---------cCCCCCHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019524. 4 VKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQW---------NPSIISPDQQIAQNQDMASARAQDMVQ 74 (556) Q Consensus 4 ~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w---------~~~~~s~~~~i~~~~~~lr~RaRdl~r 74 (556) |+=. +..... . ...| .+.....-. + . -+. 
T Consensus 1 m~~~-------------------------~~~~~~--~--~~~~~~~~~~~~~~~~~~g~~~----------~--~-~Al 38 (417) T protein:vir:38 1 MKLF-------------------------RGLATE--V--DPHWADHLLDSGVIPSFRGGYL----------G--I-SAL 38 (417) T ss_pred Cccc-------------------------cccccC--C--CccchhhhcccccccccCCcee----------c--h-hhc Confidence 1111 000000 0 0111 011110000 0 0 123 Q ss_pred cChHHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhh Q lcl|NC_019524. 75 NDGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGF 154 (556) Q Consensus 75 Nn~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~ 154 (556) .++-+-++|+.|.+.|-.--|++.-+.. ++... . ..+...+..+|+ -.+|..++....+..+ T Consensus 39 ~~~~V~~cv~~ia~~iA~lp~~~~~~~~------~~~~~---~---~~~~~lL~~~PN------~~~t~~~f~~~~~~~l 100 (417) T protein:vir:38 39 RNSDVLTAVSIVSGDVSRFPLVITDSST------DEVID---L---ANIEYLMNTKVN------KRLSAYQWKFPMMVNA 100 (417) T ss_pred ccHHHHHHHHHHHHhhccCeeEEEEcCC------cceec---c---chHHHHHhcccC------cCCCHHHHHHHHHHHH Confidence 5666788999999988875454432211 11110 0 122223334554 3578999999999999 Q ss_pred eecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCcc Q lcl|NC_019524. 155 LMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQW 234 (556) Q Consensus 155 ~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~ 234 (556) +..|++|+.++..... ..|..|..|.|+.+. |+.+..|+. -|.+.....+. T Consensus 101 ll~Gn~y~~i~r~~~g-------~~~~~l~~l~p~~v~--------------v~~~~~~~~-~y~~~~~~~~~------- 151 (417) T protein:vir:38 101 ILTGNAYSRIVRDPIT-------NEPAMFEFYAPSQTQ--------------VDTSDPDNI-IYRFTPYNSSM------- 151 (417) T ss_pred hhcCCeEEEEEEcCCC-------CEEEEEEEeCCceEE--------------EEEcCCCeE-EEEEEEcCCcE------- Confidence 9999999998753221 245678888888874 233344443 35554321110 Q ss_pred ccceeeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCccccccccccc Q lcl|NC_019524. 
235 KWGYEPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGM 314 (556) Q Consensus 235 ~~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~ 314 (556) ...+++.+|||+...- .+...|+|++..+...+.......++...-.+=.+...++++.+..- T Consensus 152 -------~~~~~~~dviH~r~~~-~d~~~G~s~l~~~~~~i~~~~~~~~~~~~~f~ng~~p~~il~~~~~l--------- 214 (417) T protein:vir:38 152 -------QKVCGFEDVIHWKFFS-YDTIMGRSPLLSLGDEIGLQESGVSTLQKFFKSGLKGSIIKAKESRL--------- 214 (417) T ss_pred -------EEEecCcceEEecCCC-CCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCC--------- Confidence 1235678999998763 45689999998887777665555555555445555666666643211 Q ss_pred ccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchh Q lcl|NC_019524. 315 GQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDY 394 (556) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~ 394 (556) ..+..+.... ....... .-..|.+..|..|.+++.++.+.-..+|.+..+...+.||..+|||-+.| |+ T Consensus 215 -~~e~~~~~~~-------~~~~~~~-g~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~l-g~- 283 (417) T protein:vir:38 215 -SAEARQKIRE-------DFERAQA-GADAGSPIIVDATMDYQPLEVDTNVLNLINSNNYSTAQIAKALRVPAYRL-AQ- 283 (417) T ss_pred -CHHHHHHHHH-------HHHHHhc-ccccCCceeccCCceEEEccCCHHHHHHHHHHHhhHHHHHHHhCCCHHHh-CC- Confidence 1111111111 0000010 01356777899999999988665566788888888999999999999877 43 Q ss_pred hcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccc Q lcl|NC_019524. 395 TKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQ 474 (556) Q Consensus 395 s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~ 474 (556) +.+||++.+....| +..-+.|+.+.+ +.++-...++ +.... . +.++|- ... 
T Consensus 284 -~~~~s~~e~~~~~~-----------~~~tl~P~~~~i-e~~l~~~Ll~-~~~~~-----------~-~~~~fd---~~~ 334 (417) T protein:vir:38 284 -NSPNQSVKQLADDY-----------IRNDLPFYFEPI-TSEFELKLLD-DAQRH-----------Q-YCIGFD---TKS 334 (417) T ss_pred -CCcchhHHHHHHHH-----------HHHHHHHHHHHH-HHHHHhhhcC-hhhcc-----------c-ceEEec---hhh Confidence 46888875555444 344455655553 3344443332 11100 0 011221 112 Q ss_pred cchhhhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHH--HHHHHHHHHHHHcC-CCCCcccc-----ccCCCCCCC-CCCC Q lcl|NC_019524. 475 IDEKKETEAAILRIKNGLSTYEAEISRLGGDFREVF--KQRAREEGLIKSLK-LDFTGKMV-----EGNSTQSSN-SSES 545 (556) Q Consensus 475 iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~--~q~a~E~~~~~~~G-l~~~~~~~-----~~~~~~~~~-~~~~ 545 (556) ++.+ +..+..+++++|+.|+-|+-+..|..|-+-- +++.. -+. ++.+.... .....+..+ .+.. T Consensus 335 l~~~-~~~~~~~~~~~G~~T~NE~R~~~gl~pi~~g~~d~~~~------~~n~~~~d~~~~~~~~~~~~~kgg~~~~~~~ 407 (417) T protein:vir:38 335 VNGL-PIADVNTAVNGGLWTGNEGRAELGKKPLKDPNMDRIQS------TLNTVFLDQKEAYQAEHAAELKGGDTNAKGN 407 (417) T ss_pred hhHH-HHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeeee------cccccccccccccccccccccCCCCCCCCCC Confidence 3332 3445677899999999999888888774311 11100 000 00000000 000011111 1111 Q ss_pred CCCCCCCcCC Q lcl|NC_019524. 546 TSDNPNEETT 555 (556) Q Consensus 546 ~~~~~~~e~~ 555 (556) .+.++++++. T Consensus 408 ~~~~~~~~~~ 417 (417) T protein:vir:38 408 QNGSGTNANS 417 (417) T ss_pred CcCCCCcCCC Confidence 0111111111 No 84 >protein:vir:4995 Length: 384 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049969;genbank:gi:9632941;genbank:GeneID:1262104 Probab=99.54 E-value=3.7e-15 Score=99.68 Aligned_cols=378 Identities=10% Similarity=0.018 Sum_probs=209.3 Q ss_pred chhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcc-cccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHH Q lcl|NC_019524. 
4 VKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTRE-MFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGV 82 (556) Q Consensus 4 ~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~-~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~ 82 (556) |.=..|....+. .+... ......+ .+.. +..|... ..+. .+. +..++.+..+ T Consensus 1 Mglf~~~~~~~~----~~~~~---~~~~~~~-----~~~~~~~~~~~~-----~~v~---------~~~-al~~~~V~~~ 53 (384) T protein:vir:49 1 MPIFNITNLATE----SPPSN---QDSFFDI-----TDPEFLDALNGS-----EWVS---------AET-ALKNSDLFSI 53 (384) T ss_pred CccccccccCcc----ccccc---chhhccc-----cchhhcccccCC-----ceec---------hhh-hhccHHHHHH Confidence 222221111000 00000 0000000 0000 1111110 0000 011 2357788999 Q ss_pred HHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEE Q lcl|NC_019524. 83 VAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLA 162 (556) Q Consensus 83 v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~ 162 (556) |+.+.+.|-+.-|++.-+. ... --.+|+ ..+|.+++....+..++..|++|+ T Consensus 54 i~~Ia~~ia~l~~~~~~~~-----------~~~-----------l~~~PN------~~~t~~~f~~~l~~~lll~Gna~~ 105 (384) T protein:vir:49 54 ISQLSNDLATAKITTSRKQ-----------LQG-----------IVDNPS------NNANRFNFYQSIFAQMLLGGEAFA 105 (384) T ss_pred HHHHHHHHhhCceeeecch-----------hhh-----------hhhccC------CCCCHHHHHHHHHHHhhhcCCeEE Confidence 9999999998777664221 011 112344 347999999999999999999999 Q ss_pred EEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceeecc Q lcl|NC_019524. 163 TCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPAR 242 (556) Q Consensus 163 ~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~~ 242 (556) .+++... ..+..|..|.|+++. |..+.++..+.|.+....+... .. 
T Consensus 106 ~i~r~~~--------g~~~~L~~l~~~~v~--------------v~~~~~~~~~~y~~~~~~~~~~------------~~ 151 (384) T protein:vir:49 106 YRWRNEN--------GRDMKWEYLRPSQVS--------------FNRLDNQNGLYYNITFDDPRIP------------PK 151 (384) T ss_pred EEEECCC--------CcEEEEEEEcCceeE--------------EEEcCCCceEEEEEEecCcccc------------ce Confidence 9876331 235788999998884 3334556677888875543211 12 Q ss_pred ccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCccccccccccccccccccc Q lcl|NC_019524. 243 FDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEI 322 (556) Q Consensus 243 ~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~ 322 (556) ..++.++|||+......+...|+|++.++...+.....-.++.....+-.+...++++.+....... .... T Consensus 152 ~~~~~~eVih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~---------~~~~ 222 (384) T protein:vir:49 152 QHVPQGDILHFRLLSVDGGLTSVSPLMALGRELNIQKASDKLTLNALKNALNANGILKIKGGGLLDF---------KTKQ 222 (384) T ss_pred eEecCccEEEecCCCCCCceeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCChHH---------HHHH Confidence 3467789999998887888999999999888888777777777777777788888888754322110 0000 Q ss_pred ccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchh-hcccchh Q lcl|NC_019524. 323 FNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDY-TKTNYSS 401 (556) Q Consensus 323 ~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~-s~~nYSs 401 (556) .. .... 
..-..|.+..|..|.+++.++.+.-..+|.+..+.....||..+|||-+.|..+- ...||++ T Consensus 223 ~~-------~~~~----~~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~~~~ 291 (384) T protein:vir:49 223 SR-------SRQA----MKQMQGGPLVLDDLEDFTPLEIKSNVAQLLSQADWTTGQFAKVYGIPESVVGGEGDKQSSLEM 291 (384) T ss_pred HH-------HHHh----cccCCccceecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCccccHHH Confidence 00 0000 1124677889999999999987777788888889999999999999998774421 1123333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchh-hh Q lcl|NC_019524. 402 ARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEK-KE 480 (556) Q Consensus 402 ~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~-Ke 480 (556) ++ .....++...++|+.+++ +.++... +.. . ..+... .|+. .+ T Consensus 292 ~~-----------~~~~~~i~~~l~pi~~~i-~~~l~~~-l~~---------------------~-~~~~~~-~~~~~~~ 335 (384) T protein:vir:49 292 IY-----------NIYFKAVSRFLRPFVSEL-SKKLSCE-VDA---------------------D-ILPAVD-PTGSNYI 335 (384) T ss_pred HH-----------HHHHHHHHHHHHHHHHHH-HHHhchh-hhh---------------------h-hhhhhh-ccchHHH Confidence 22 223345666677766654 3333221 100 0 000000 0111 11 Q ss_pred hHHHHHHHHcCCCCHHHHHHHh---CCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCcC Q lcl|NC_019524. 481 TEAAILRIKNGLSTYEAEISRL---GGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEET 554 (556) Q Consensus 481 ~~A~~~~i~~G~~s~~~~~ae~---G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e~ 554 (556) ...+ ..+++|++|+.++.... |..+.|+. +..|++. 
-++.+.+++= T Consensus 336 ~~~~-~l~~~~~~t~~e~~~~l~~~g~~~ne~r----------~~~~~~p-----------------~~gGd~~~~~ 384 (384) T protein:vir:49 336 GLIN-SMVKTGTLAQNQGLYVLQQAEILPKDLP----------EGETDST-----------------LKGGETNEQY 384 (384) T ss_pred HHHH-HHhhcCcccHHHHHHHHhhCCCCChhHH----------HHcCCCC-----------------CCCCCCCCCC Confidence 1222 35788899988886654 33333211 1223321 0111111111 No 85 >protein:vir:5249 Length: 437 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852754;genbank:gi:31544029;interpro:IPR006445;uniprot:Q7Y5U6;genbank:GeneID:2753529 Probab=99.54 E-value=1.6e-13 Score=90.67 Aligned_cols=420 Identities=11% Similarity=0.035 Sum_probs=210.0 Q ss_pred hhhcchhcc--ccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHhhhccCCceeeeeccccc Q lcl|NC_019524. 28 MAVGGGMEG--AERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVVAVHRDSIVGSQYKLNAKPNTIV 105 (556) Q Consensus 28 ~~~~~~y~a--a~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~ 105 (556) +...-+|.. ++-.+++...|...... .. ... .....+++-|++++.+|+.+....+-.|+.+... T Consensus 1 ~~~~D~~~~~~~~~g~~~~~~~~~~~~~-~~---~~~----~~l~a~Y~~~~l~~~~vd~~a~d~~r~~~~i~~~----- 67 (437) T protein:vir:52 1 MKFFDGIKSLALKLGSKQEQTYYSPSLS-LT---DDL----VQLEALWRDNWIANKVCIKRPEDMVRNWREIYSN----- 67 (437) T ss_pred CchhhhhHhHHhcCCCccccceeecCcc-cc---ccH----HHHHHHHHhCchhhHHhhcchHHhhcCCceEecC----- Confidence 111111111 11111122223211111 00 111 1344678999999999999999999999998752 Q ss_pred cCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEEEeeccCCCC--cC-CCcccceE Q lcl|NC_019524. 106 LGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLATCEWLNPTGT--TM-QRRPFGTA 182 (556) Q Consensus 106 lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~~--~~-~~~~~~l~ 182 (556) +.+.+ ..+++++.|++. .+.+...-+++..-.-|.+++.+........ +. .+. .-. 
T Consensus 68 -d~~~~----~~~~~~~~~~~l--------------~~~~~l~~a~~~~rl~G~a~i~i~~d~~~~~~pl~~~~~--~~~ 126 (437) T protein:vir:52 68 -DLNSK----QLDLFTKFERSL--------------KLRETLTKALQWSSLYGSVGLLVVTDSQNTSAPLKPTER--LKR 126 (437) T ss_pred -CCCHH----HHHHHHHHHHhh--------------cHHHHHHHHHHhcccccceEEEEEecCCCcccccccCCc--eeE Confidence 11111 123344444432 2333444444433333445555433211000 00 011 123 Q ss_pred EEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceeeccccCChhHeEeeec---ccCC Q lcl|NC_019524. 183 IQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPARFDWGRRRVIHIIE---ALLA 259 (556) Q Consensus 183 lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~~~~v~a~~viH~f~---~~r~ 259 (556) |.+|++..+. |....+ ..+--..+|+|..|+|.. +. ....|+.++|||+-. +... T Consensus 127 ~~v~~~~~v~-~~~~~~-----~dp~s~~fg~p~~y~v~~---~~-------------~~~~iH~SRii~~~~~~~~~~~ 184 (437) T protein:vir:52 127 LIILPKWKIS-PTGTKD-----DDVLSPNFGRYSEYSILG---GS-------------QSITVHHSRLIILNANDAPLSD 184 (437) T ss_pred EEEechhhcc-cccccc-----ccccccccCcceEEEEec---CC-------------cceeEccceeEEecCccCCCcc Confidence 7788887764 111111 112223579999999852 11 112578889999842 4566 Q ss_pred CcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccccccccccccccccccc Q lcl|NC_019524. 260 GQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKN 339 (556) Q Consensus 260 gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 339 (556) .+.-|+|.|-+++..+++++.-......-.. .+.+. +++.+.... ..+.+..+......+.. . 
T Consensus 185 ~~~~G~s~le~~~~~i~~~~~~~~~~~~l~~-~~~~~-v~k~~~l~~----~l~~~~~~~~~~~~~~~-----------~ 247 (437) T protein:vir:52 185 NDIWGVSDLEKIIDVLKRFDSASVNVGDLIF-ESKID-IFKIAGLSD----KIAAGMENEVASVISAV-----------Q 247 (437) T ss_pred ccccCCchHHHHHHHHHHHHHHHHHHHHHHH-HcCCC-ceecchHHH----HhcCCcHHHHHHHHHHH-----------H Confidence 8889999999999888877665554432211 22222 334321111 11111110000000000 0 Q ss_pred eecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccchhHHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 340 IAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSARASMAETQKYMDSRKKL 419 (556) Q Consensus 340 ~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~ 419 (556) ..-..+.+..+..+++++.++.+ -++..++.......||++.+||...|.|-- -...+|.-.-+..+...++.+|+. T Consensus 248 ~~~~~~~~~~~d~~~~~e~~~~~--~sgl~~~l~~~~~~iaaa~~iP~t~L~G~s-~~Glasge~D~~~yyd~i~~~Qe~ 324 (437) T protein:vir:52 248 EIKSATNSLLLDAENEYDRKELT--FTGLKDLLTEFRNAVAGAADMPVTILFGQS-VSGLASGDEDIQNYHEAIRRLQET 324 (437) T ss_pred HhcCCCceEEEcCCcceEEEecC--cCCHHHHHHHHHHHHHHHhcCchhhhcCcC-cccccccHHHHHHHHHHHHHHHHH Confidence 00123445566778999888754 347889999999999999999999887753 456777777788888888998876 Q ss_pred HHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccch-hhhhHHHHHHHHcCCCCHHHH Q lcl|NC_019524. 420 VADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDE-KKETEAAILRIKNGLSTYEAE 498 (556) Q Consensus 420 lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP-~Ke~~A~~~~i~~G~~s~~~~ 498 (556) .+ +|+.++.++.-+++.-.++|..+.+. 
++..|.+.....+|- .|-+++....+++|+.|+.++ T Consensus 325 ~l----~p~le~l~~~i~~~~~g~~~~~~~~~-----------f~pL~~~s~kekae~~~~~a~a~~~~~~~g~i~~~e~ 389 (437) T protein:vir:52 325 RL----RPIFEIIDPLICNELFGGLPADWWFE-----------FVPLTTVKQEQQINMLNTFATAANTLIQNGVLNEYQI 389 (437) T ss_pred HH----HHHHHHHHHHHHHHhcCCCCCcceEE-----------eCCcCCcCHHHHHHHHHHHHHHHHHHHhcCCCCHHHH Confidence 44 45555555544444323344432211 122455555555554 577778888888888887665 Q ss_pred HHHhCCCHHHHHHHHHHHHHHHHHcCCC---CCccccccCCCCC-----CCCCCCCCCCCCCcCCC Q lcl|NC_019524. 499 ISRLGGDFREVFKQRAREEGLIKSLKLD---FTGKMVEGNSTQS-----SNSSESTSDNPNEETTQ 556 (556) Q Consensus 499 ~ae~G~D~e~v~~q~a~E~~~~~~~Gl~---~~~~~~~~~~~~~-----~~~~~~~~~~~~~e~~~ 556 (556) ..+ + ++.|+- .+.++........ .+++...+..+.+-..| T Consensus 390 r~~-----------L-------~~~g~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 437 (437) T protein:vir:52 390 ANE-----------L-------RESGLFANISAEHIEELKNADEFAGNFEEPEKMEGAQVQNSEDQ 437 (437) T ss_pred HHH-----------H-------HhcCCCCCCCccccccccCCCCCCCccCCCCCCCCCCCCCCCCC Confidence 333 1 122221 0111111111000 01111111111111111 No 86 >protein:vir:101647 Length: 460 # NCBI annotation: phage portal protein # Family: family:all:26542 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112492;genbank:gi:53793592;uniprot:Q5ZGG1;genbank:GeneID:3101755 Probab=99.54 E-value=8.5e-14 Score=92.24 Aligned_cols=420 Identities=9% Similarity=0.014 Sum_probs=212.6 Q ss_pred chhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccc---cCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHH Q lcl|NC_019524. 4 VKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQW---NPSIISPDQQIAQNQDMASARAQDMVQNDGYAA 80 (556) Q Consensus 4 ~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w---~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~ 80 (556) |... 
..+++...+ .........| .....++- -.....-+.+.+.++|.+- T Consensus 1 ~~~~----~~~~~~~~~-----------------~~~~~~~~~~~~~~g~~~~~~------~~~~~~~~~~~a~~~~~v~ 53 (460) T protein:vir:10 1 MANR----IIRALRELT-----------------GLDNKFNDAFIKYIGQTFTKY------DNNGKTYLEQGYNINPDVY 53 (460) T ss_pred Cchh----HHHHHhhhh-----------------ccCCCchHHHHHhhccccCCC------ccchhhhhHHHHhcchHHH Confidence 1111 011111100 0000011112 11111100 0011224556688999999 Q ss_pred HHHHHHHhhhccCCceeeeeccccccCCChhHH------HHHH--------------HHHHHHHHHHhcccccceehhcc Q lcl|NC_019524. 81 GVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWG------EEFQ--------------EVVEARFNMAAESPENWFDARRM 140 (556) Q Consensus 81 ~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~------~~~~--------------~~ie~~~~~w~~~~~~~cD~~g~ 140 (556) ++|+.+.+.|-+-=|.+.-+... |...+.. ..+. ...+..+.....+|+ .. T Consensus 54 ~~v~~ia~~iA~lp~~v~~~~~~---g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~PN------~~ 124 (460) T protein:vir:10 54 SCISQMAAKTVAVPYTIKVVKDT---KAYQQLNNLNISTKGLYSFTQSLQKNRLDTKAFSETEKAFPLESPN------PT 124 (460) T ss_pred HHHHHHHHhhhhCceEEEeccCC---ccchhhhhhhhhhhhhHHHHHHhhcchhhhcccchhHHHHHHhCCC------CC Confidence 99999999998765555432211 1000000 0000 011122333333454 45 Q ss_pred cCHHHHHHHHhhhheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEE--- Q lcl|NC_019524. 141 CTLTGLTRLAVSGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALG--- 217 (556) Q Consensus 141 ~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~va--- 217 (556) +|.+++-...+..++..|++|+.+++.... .....+..|..|.|+.+..- .|.+|.++. 
T Consensus 125 ~t~~~f~~~~~~~lll~Gnay~~i~r~~~~----~~~G~~~~L~~l~~~~v~v~--------------~~~~~~~~~~~~ 186 (460) T protein:vir:10 125 QTWADIYSLYKTYMRLNGNCYFYLMSPDDG----INAGVPSQMYVLPAHLIKIV--------------LKDDINLLSTDS 186 (460) T ss_pred CCHHHHHHHHHHHHhhcCCeEEEEEecCCC----ccCceeEEEEEEcCceEEEE--------------EcCCCceeeeee Confidence 799999999999999999999998764321 12235678899999888531 222232222 Q ss_pred ----EEEeecCCCccccCCccccceeeccccCChhHeEeeecccCC-----CcccCCchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 218 ----YWLRKAFPGDPTDMEQWKWGYEPARFDWGRRRVIHIIEALLA-----GQTRGISEMVSALKQMKMTRNFQEITLQN 288 (556) Q Consensus 218 ----Y~i~~~hpgd~~~~~~~~~~rv~~~~~v~a~~viH~f~~~r~-----gQ~RGvs~la~~l~~l~~l~~~~dael~~ 288 (556) |.+.. .| ....+++++|||+...... +..+|+|++..+...+.......+..... T Consensus 187 ~~~~~~~~~--~g--------------~~~~~~~~evih~r~~~~~~~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~ 250 (460) T protein:vir:10 187 PIKSYMLIQ--GD--------------QFIEFNEDEVIHTKYANPNFDLQGSHLYGMSPIRAILRNINSQNSTIDNNVKT 250 (460) T ss_pred eeeEEEEec--Cc--------------eeEEecccceEEEecCCCCcccccCccccccHHHHHHHHHHHHHHHHHHHHHH Confidence 22211 00 1124677899999765433 57899999988887777766666666655 Q ss_pred HHHhcceeeeEeccCcccccccccccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccH Q lcl|NC_019524. 289 AVVNATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVG 368 (556) Q Consensus 289 a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f 368 (556) .+=.+...++++.+..-. ++.......... ....+. 
=..|.+..|..|.+++.++.+....+| T Consensus 251 f~ng~~~~~i~~~~~~l~----------~e~~~~~~~~~~---~~~~g~----~n~g~~~vl~~g~~~~~l~~~~~d~q~ 313 (460) T protein:vir:10 251 MQNGGVFGFIHGGSTGLT----------QPQADSLKQRLT---EMDKSP----DRLSQIAGASGEIAFTKISLNTDELKP 313 (460) T ss_pred HhcCCCcceeeecCCCCC----------HHHHHHHHHHHH---HHhcCc----cccCCceecCCCceEEEccCChhHHHH Confidence 555566656655432110 011111111000 000000 124667789999999999988778888 Q ss_pred HHHHHHHHHHHHHhcCCCHHHhhchhhc--ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCC Q lcl|NC_019524. 369 TDYEQSLLRNIAASLGMSYEQFSRDYTK--TNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPP 446 (556) Q Consensus 369 ~~F~~~~lr~iaaglGi~ye~l~~D~s~--~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~ 446 (556) .+..+.....||..+|||-+.| |+..+ .|||++.+....|. ..-+.|+...+ +.++....++- . T Consensus 314 ~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~t~~~sn~e~~~~~f~-----------~~~l~P~~~~i-e~~ln~kl~~~-~ 379 (460) T protein:vir:10 314 FDYLKYDQKAICNALGWSDKLL-NNNEGGGLNTGNLEEERKRVV-----------TDNIQPDLVIL-KQAFDKKFIKR-F 379 (460) T ss_pred HHHHHHHHHHHHHHhCCCHHHh-CCCCCCCCccccHHHHHHHHH-----------HHHHHHHHHHH-HHHHHHhhcCc-c Confidence 9999999999999999999855 55433 47888766655553 34445544443 33333332211 0 Q ss_pred CcccccccchhhHHHhhCeeeecCcccccchh-hhhHHHHHHHHcCCCCHHHHHHHhCCCHH--HHHHHHHHHHHHHHHc Q lcl|NC_019524. 447 GKNWRMFYDPMMRDALCNAEWIGASRGQIDEK-KETEAAILRIKNGLSTYEAEISRLGGDFR--EVFKQRAREEGLIKSL 523 (556) Q Consensus 447 ~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~-Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e--~v~~q~a~E~~~~~~~ 523 (556) .... -..+++-. .-++.+ .+.++....+++|+.|+-|+-+..|.+|- +-.+ ++ T Consensus 380 ~~~~-----------~~~i~~d~---~~l~~l~~d~~~~~~~~~~g~~T~NE~R~~~g~~pi~~~~gD----------~~ 435 (460) T protein:vir:10 380 KGYE-----------NAVIEWDI---SELPEMQTDMVAMASWLNTIPVTPNEIRIAMKYETLNQDGMD----------IV 435 (460) T ss_pred cccC-----------CceEEeec---chhhhHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCCC----------ee Confidence 0000 00111111 112222 35566667899999999887777776652 1100 00 Q ss_pred CCCCCccccccCCCCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 
524 KLDFTGKMVEGNSTQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 524 Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 556 (556) =++. +-.+....+++..+..+..+| T Consensus 436 ~~~~--------n~~~~~~~~~~~~~~~~nq~~ 460 (460) T protein:vir:10 436 FMPS--------NKVRIDDVSNNLIDSAFNQNQ 460 (460) T ss_pred eecc--------cccchhhcccccCCCcccCCC Confidence 0000 000111111111122222222 No 87 >protein:vir:960 Length: 413 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076614;genbank:gi:13095722;genbank:GeneID:920279 Probab=99.54 E-value=2.2e-14 Score=95.50 Aligned_cols=410 Identities=10% Similarity=-0.023 Sum_probs=212.7 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAA 80 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~ 80 (556) |+=|..++ ....+................... ...+.+....++ ....+..-...-..+++.+. T Consensus 1 ~~~~~~~~---~~~~m~~F~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~v~ 64 (413) T protein:vir:96 1 MPGVSEIR---KDKNLKFFNNKRSPTEESKAKDEI--------PKAPQVVMTLPN-----FFKELISDGYTKLSDSPEVR 64 (413) T ss_pred CCccchhh---hhhcCCccccCCCcchhhhhhccc--------cccccccccchh-----hHhhhccchhHHHhhchHHH Confidence 33333332 111111111110000000000000 000001111111 00111111112246689999 Q ss_pred HHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCce Q lcl|NC_019524. 81 GVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEV 160 (556) Q Consensus 81 ~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~ 160 (556) .+|+.+++.|....|.+..+.. +++. .... 
.+...+..+++ ..+|.+++....+..++..|++ T Consensus 65 ~cI~~ia~~ia~~~~~~~~~~~------~~~~--~~~~---~~~~ll~~~PN------~~~t~~~f~~~~~~~lll~Gn~ 127 (413) T protein:vir:96 65 MAVDCIADLVSNMTIQLMQNGE------TGDK--RIKN---DLSRVVDIEPN------KYLSRKTFIQWLVRSMLLEGNG 127 (413) T ss_pred HHHHHHHHhhccCceEEEEecC------CCcc--cccc---HHHHHHHhccc------cCCCHHHHHHHHHHHHhhcCCe Confidence 9999999999988887754322 1111 1111 12223333443 3578999999999999999999 Q ss_pred EEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceee Q lcl|NC_019524. 161 LATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEP 240 (556) Q Consensus 161 f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~ 240 (556) |+.+...... ..+..|..|+|+.+..-. +. .-+.|.+.... T Consensus 128 ~~~i~r~~~g-------~~~~~L~~l~~~~v~~~~--------------~~--~~~~y~~~~~~---------------- 168 (413) T protein:vir:96 128 NAVVKPQVSG-------DKIIGLTPISPYKVTFNV--------------SD--DDLDYSITFDN---------------- 168 (413) T ss_pred EEEEEEcCCC-------CceEEEEEecCceeEEEE--------------cC--CeEEEEEeecC---------------- Confidence 9998753221 124578889888874211 11 12334442110 Q ss_pred ccccCChhHeEeeec-ccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccc Q lcl|NC_019524. 241 ARFDWGRRRVIHIIE-ALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGF 319 (556) Q Consensus 241 ~~~~v~a~~viH~f~-~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~ 319 (556) ...+..+|||+.. +..-++..|+|++..+...+.......+....-.+=.+...++|+.+..-. .+. 
T Consensus 169 --~~~~~~evih~k~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~----------~e~ 236 (413) T protein:vir:96 169 --KEYDPSTLLHFVLNPSIERPFIGTGYKVALKDIVGNLKQASVTKKGFMASEYMPNLIVSVDSDSD----------ELS 236 (413) T ss_pred --cEEchhhEEEEeccCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCCC----------HHH Confidence 1245679999964 455578899999999988888888888888887888888889988653211 111 Q ss_pred cccccccccccccccccccceecCCceeeec-CCCceeeeecC-CCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcc Q lcl|NC_019524. 320 KEIFNEYMTGLANYVAQTKNIAIDGAKIPHL-YPGTKLKMQPA-GTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKT 397 (556) Q Consensus 320 ~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L-~pGe~i~~~~~-~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~ 397 (556) .+...... .....+. =..|.+..| ..|.+++.+.+ +.-...|.+..+...+.||..+|||-..|. .- T Consensus 237 ~~~~~~~~---~~~~~g~----~n~g~~~vl~~~~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg-~~--- 305 (413) T protein:vir:96 237 DEEGRENF---EEMYLKR----KEAGKPWIIPEGMVNVQQIKPLTLNDLAINDAVTLDKKTVAGIFGVPAFLLG-VG--- 305 (413) T ss_pred HHHHHHHH---HHHhcCc----cccCceeeecCCcccccccccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcC-CC--- Confidence 11100000 0000000 013444444 34444444332 223567778888999999999999998773 21 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccch Q lcl|NC_019524. 398 NYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDE 477 (556) Q Consensus 398 nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP 477 (556) ++ .++. +..++..-+.|+.+. ++.++-...++ .+.. 
+++..-.....|+ T Consensus 306 ~~--~~~~-----------~~~~~~~~l~P~~~~-ie~~ln~~ll~--~~~~---------------~~fd~~~ll~~d~ 354 (413) T protein:vir:96 306 TY--NKDE-----------FNNFINTKIMSIAQV-IQQTYNKLIVE--EDMY---------------FSLNPRSLYNYSL 354 (413) T ss_pred cc--hHHH-----------HHHHHHHHHHHHHHH-HHHHHHHhhCC--CCcE---------------EEEechhhhccCH Confidence 11 1111 122455556776666 34445444332 2211 1222222334688 Q ss_pred hhhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCcC Q lcl|NC_019524. 478 KKETEAAILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEET 554 (556) Q Consensus 478 ~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e~ 554 (556) ...+++....+++|+.|+-|+-+..|.+|.+- .+++=++....+ .....+....+.+|| T Consensus 355 ~~~~~~~~~~~~~G~~t~NE~R~~~g~~p~~~----------gd~~~~~~n~~~--------~~~~~~~~~~~~~dt 413 (413) T protein:vir:96 355 TEMVSAGAQMTQLNALRRNEFRNWVGMPPDAE----------MDDLLVLENYLQ--------QKDLVNQKKLIQDET 413 (413) T ss_pred HHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC----------cceeeecccccc--------hhhcccccCCCCCCC Confidence 88999999999999999999988889888531 111111111111 111111122223333 No 88 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=99.52 E-value=1.4e-12 Score=85.58 Aligned_cols=451 Identities=15% Similarity=0.143 Sum_probs=211.1 Q ss_pred HHHHhhHhhcccchhh---h-h---hhhc--------chhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019524. 10 TRAKKAVDVVAETATA---T-P---MAVG--------GGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQ 74 (556) Q Consensus 10 ~~a~~a~~~~~~~~~~---~-~---~~~~--------~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~r 74 (556) +..+. ......-.. . . .... ..|+-+.. ..... +. ... .. .+++-. 
T Consensus 1 ~~~~~--~~~~e~~~~~~~~~~l~~~~~~~~~r~~~l~~YY~G~~---~i~~~-~~--~~~----~~-------~~~~~~ 61 (486) T protein:vir:42 1 MTAPL--PGMEEIEDPAVVREEMISAFEDASKDLASNTSYYDAER---RPEAI-GV--TVP----RE-------MQQLLA 61 (486) T ss_pred CCCCC--CCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHhcccC---cchhc-cc--ccc----hh-------Hhhhhh Confidence 11110 111000000 0 0 0000 01111111 00000 00 000 00 122222 Q ss_pred cChHHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhh Q lcl|NC_019524. 75 NDGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGF 154 (556) Q Consensus 75 Nn~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~ 154 (556) ...|++-+|+.++...+..||+..- +++ ..+.+ |+-|.. + +|...+..+++.. T Consensus 62 v~n~~~~iVd~~~~~l~~~g~~~~~---------~~~----~~~~~---~~i~~~-----N------~~d~~~~~~~~~a 114 (486) T protein:vir:42 62 HVGYPRLYVDSVAERQAVEGFRLGD---------ADE----ADEEL---WQWWQA-----N------NLDIEAPLGYTDA 114 (486) T ss_pred ccchHHHHHHHHHhhhcccceecCC---------Cch----hHHHH---HHHHHh-----c------ChhHHHHHHHHHH Confidence 3468899999999988888886421 111 12223 333322 1 5777888899999 Q ss_pred eecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEE--C-CCCCeEEEEEeecCCCccccC Q lcl|NC_019524. 155 LMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQL--D-NNGAALGYWLRKAFPGDPTDM 231 (556) Q Consensus 155 ~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~--d-~~Gr~vaY~i~~~hpgd~~~~ 231 (556) ++-|.+|+.+-.........+..+. ..+++++|..+-.-++.. .+.+..+|.+ + +.+....+-++...---.+.. 
T Consensus 115 ~~~G~ay~~v~~~e~~~~~~~~~~~-~~i~~~~p~~~~~i~d~~-~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~ 192 (486) T protein:vir:42 115 YVHGRSFITISKPDPQLDLGWDQNV-PIIRVEPPTRMHAEIDPR-INRVSKAIRVAYDKEGNEIQAATLYTPMETIGWFR 192 (486) T ss_pred hhcCceEEEEecCCcccccccCCCe-eEEEEecccceEEEEeCC-CCCeEEEEEEEEecCCCeEEEEEEEcCCcEEEEEe Confidence 9999999876433222111222333 378899999876444422 2346666643 3 334443333332210000011 Q ss_pred Cccccc---eeeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhccee---eeEeccCcc Q lcl|NC_019524. 232 EQWKWG---YEPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYA---ASVESELPS 305 (556) Q Consensus 232 ~~~~~~---rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~---~fi~~~~~~ 305 (556) ....|. ++| ..++.-=|+++....+.+-.-|.|.+.+-+..|- +.|..+....+..+-.++ .+|+-...+ T Consensus 193 ~~~~~~~~~~~~--h~~g~vPvv~~~n~~~~~~~~G~s~i~~~v~~li--Da~~~~~s~~~~~~e~~a~p~~~i~G~~~~ 268 (486) T protein:vir:42 193 ADGEWAEWFNVP--HGLGVVPVVPLPNRTRLSDLYGTSEITPELRSMT--DAAARILMLMQATAELMGVPQRLIFGIKPE 268 (486) T ss_pred cCCcEEeeccee--cCCCCceEEEeccccccCCCCCcccchhhHHHHH--HHHHHHHHHHHHHHHhhcchHHHhhcCCcc Confidence 111221 222 1234344677766667777779999997444332 344444333333332222 122210000 Q ss_pred cccccccccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCC-CCccHHHHHHHHHHHHHHhcC Q lcl|NC_019524. 306 DVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGT-PGGVGTDYEQSLLRNIAASLG 384 (556) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~-p~~~f~~F~~~~lr~iaaglG 384 (556) .. .. ....+.......+|.+..++ +.++++.+.+. 
.-.+|...++..++.+++..+ T Consensus 269 ~~-----~~-----------------~~~~~~~~~~~~~~~~~~~~-~~~~~~~q~~~~~~e~~~~~l~~~i~~~s~~~~ 325 (486) T protein:vir:42 269 EI-----GV-----------------DSETGQTLFDAYLARILAFE-DAEGKIQQFSAAELANFTNALDQIAKQVAAYTG 325 (486) T ss_pred cc-----cc-----------------ccccccchhhhhhchhcccC-CCCceEEeecccCHHHHHHHHHHHHHHHhcccC Confidence 00 00 00111222234566666554 44566654332 234566667777999999999 Q ss_pred CCHHHhhchhhc-ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhh Q lcl|NC_019524. 385 MSYEQFSRDYTK-TNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALC 463 (556) Q Consensus 385 i~ye~l~~D~s~-~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~ 463 (556) +|-+.|.++.++ +|-.++|..+.......+..|..|-..+-+ +++.. .+++.+ ...+.. + .=+ T Consensus 326 ~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~-~~~l~--~~~~~~-~~~~~d-----~-------~~i 389 (486) T protein:vir:42 326 LPPQYLSTAADNPASAEAIRAAESRLIKKVERKNLMFGGAWEE-AMRIA--YRIMKG-GDVPPD-----M-------LRM 389 (486) T ss_pred CCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHH--HHHhcC-CCcccc-----c-------eee Confidence 999988766432 345566666666666666666666555433 33322 234333 222211 0 014 Q ss_pred CeeeecCcccccchhhhhHHHHHHHHc--CCCCHHHHHHHhCCCHHHH--HHHHHHHHHHHHHcCCCCCccccccCCCCC Q lcl|NC_019524. 464 NAEWIGASRGQIDEKKETEAAILRIKN--GLSTYEAEISRLGGDFREV--FKQRAREEGLIKSLKLDFTGKMVEGNSTQS 539 (556) Q Consensus 464 ~~~w~~p~~~~iDP~Ke~~A~~~~i~~--G~~s~~~~~ae~G~D~e~v--~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~ 539 (556) .+.|..+... +-...+.+..+.+.+ |+.|.+-.....|...+++ ++++++|++..-. .+. .......+... 
T Consensus 390 ~v~w~~~~~~--s~~~~ad~~~kl~~~~~g~~s~et~~~~lg~~~d~~~e~~~~~~e~~~~~~-~~~--~~~~~~~~~~~ 464 (486) T protein:vir:42 390 ETVWRDPSTP--TYAAKADAATKLYGNGQGVIPRERARIDMGYSVKEREEMRRWDEEEAAMGL-GLL--GTMVDADPTVP 464 (486) T ss_pred eEEecCCCCC--CHHHHHHHHHHHHhcccCCCCHHHHHhcCCCChhHHHHHHHHHHHHHHHHH-HHH--HHhhcCCCCCC Confidence 6788777644 666677888777776 7888888888889865543 3444444332211 110 00001111111 Q ss_pred CCCCCCCCCCCCCcCCC Q lcl|NC_019524. 540 SNSSESTSDNPNEETTQ 556 (556) Q Consensus 540 ~~~~~~~~~~~~~e~~~ 556 (556) ......+.+.+++.+++ T Consensus 465 ~~~~~~~~~~~~~~~~~ 481 (486) T protein:vir:42 465 GSPSPTAPPKPQPAIES 481 (486) T ss_pred CCCCCCCCCCCCcccCC Confidence 12222222333333333 No 89 >protein:vir:104259 Length: 403 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006980;genbank:gi:46401881;genbank:GeneID:2777676 Probab=99.52 E-value=2e-14 Score=95.72 Aligned_cols=390 Identities=12% Similarity=0.098 Sum_probs=200.8 Q ss_pred CCcccccccCCCCCHHHHHHHHHHHHHH-------HHHHHHhcChHHHHHHHHHHhhhccCCceeeeeccccccCCChhH Q lcl|NC_019524. 40 TTREMFQWNPSIISPDQQIAQNQDMASA-------RAQDMVQNDGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGW 112 (556) Q Consensus 40 ~~r~~~~w~~~~~s~~~~i~~~~~~lr~-------RaRdl~rNn~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~ 112 (556) .+ +.+|-....+|...+..+...+.. -+...+..++.+..+|+.|.+.|..--+++..+..... +... T Consensus 1 mg--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~v~~cv~~Ia~~ia~~p~~v~~~~~~~~---~~~~ 75 (403) T protein:vir:10 1 MG--FKSWITEKLNPGQRIIRDMEPVSHRTNRKPFTTGQAYSKIEILNRTANMVIDSAAECSYTVGDKYNIVT---YANG 75 (403) T ss_pred Cc--chhhhhhccchhhhhhhcccccccccCCcccccHHHHHHHHHHHHHHHHHHHHHhhCceeEeecccccc---cccc Confidence 00 111211111111111111111110 01123345677888999999888776676654322110 0000 Q ss_pred HHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcC Q lcl|NC_019524. 
113 GEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMS 192 (556) Q Consensus 113 ~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~ 192 (556) ..-...+..+..+|+ --+|.+++....+..++..|++|+.+. +..|.+|.++.+. T Consensus 76 -----~~~~~l~~lL~~~PN------~~~t~~~f~~~~~~~~ll~Gnayi~~~--------------~~~l~~l~~~~~~ 130 (403) T protein:vir:10 76 -----VKTKTLDTLLNVRPN------PFMDISTFRRLVVTDLLFEGCAYIYWD--------------GTSLYHVPAALMQ 130 (403) T ss_pred -----cccchHHHHHhhCCC------CCCCHHHHHHHHHHHHhhcCCeEEEEe--------------CceeEeecCcceE Confidence 001122333344554 357899999999999999999998752 1235667666553 Q ss_pred CCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceeeccccCChhHeEeeeccc----CCCcccCCchh Q lcl|NC_019524. 193 NPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPARFDWGRRRVIHIIEAL----LAGQTRGISEM 268 (556) Q Consensus 193 ~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~~~~v~a~~viH~f~~~----r~gQ~RGvs~l 268 (556) |+.|..+.. .|++.... ...+..+|||+.... ..+...|+|++ T Consensus 131 --------------v~~~~~~~~-~~~~~~~~------------------~~~~~~eiih~~~~~~~~~~~~~~~G~s~i 177 (403) T protein:vir:10 131 --------------VEADANKFI-KKFIFNNQ------------------INYRVDEIIFIKDNSYVCGTNSQISGQSRV 177 (403) T ss_pred --------------EEEcCCceE-EEEEecCc------------------eeecccceEEecccccccCCCCCcccccHH Confidence 223333322 22222110 123456899987543 34778899998 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccccccccccccccccccceecCCceee Q lcl|NC_019524. 269 VSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIP 348 (556) Q Consensus 269 a~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~ 348 (556) ..+...+.-.....++...-.+-++...++|+.+..- .++..+....... ....+ .=..|.+. 
T Consensus 178 ~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l----------~~e~~~~~~~~~~---~~~~g----~~n~g~~~ 240 (403) T protein:vir:10 178 ATVIDSLEKRSKMLNFKEKFLDNGTVIGLILETDEIL----------NKKLRERKQEELQ---LDYNP----STGQSSVL 240 (403) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCC----------CHHHHHHHHHHHH---HHhCC----cccCccee Confidence 8888777766666666655555566677888854321 1111111111000 00000 11246778 Q ss_pred ecCCCceeeeecCC--CCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 349 HLYPGTKLKMQPAG--TPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSARASMAETQKYMDSRKKLVADRFAS 426 (556) Q Consensus 349 ~L~pGe~i~~~~~~--~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~ 426 (556) .|..|.+++.++-. .....|.+..+.....||..+|||.+.| ++ .+||++.+... .|+...+. T Consensus 241 vl~~g~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~---~~~sn~e~~~~-----------~f~~~tl~ 305 (403) T protein:vir:10 241 ILDGGMKAKPYSQISSFKDLDFKEDIEGFNKSICLAFGVPQVLL-DG---GNNANIRPNIE-----------LFYYMTII 305 (403) T ss_pred ecCCCceeEEecccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHc-CC---CCCcCHHHHHH-----------HHHHHHHH Confidence 89999999988632 2355788888899999999999999876 43 35665544333 33445566 Q ss_pred HHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCCCH Q lcl|NC_019524. 427 AIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGGDF 506 (556) Q Consensus 427 pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~ 506 (556) |+...+ +.++... | +.. +.| +...-.....|....+++....+++|+.|+-|+-...|..| T Consensus 306 P~~~~i-e~~l~~~-L----~~~-~~~------------d~~~~~~l~~D~~~~~~~~~~~~~~G~lT~NE~R~~~gl~p 366 (403) T protein:vir:10 306 PMLNKL-TSSLTFF-F----GYK-ITP------------NTKEVAALTPDKEAEAKHLTSLVNNGIITGNEARSELNLEP 366 (403) T ss_pred HHHHHH-HHHHHHh-c----Cce-eee------------ccchhhhcccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCC Confidence 665553 3334332 2 011 111 10000111237788888888999999999999988889998 Q ss_pred HHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCc Q lcl|NC_019524. 
507 REVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEE 553 (556) Q Consensus 507 e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e 553 (556) -+ .....++=++.+.........+ ....++...++.| T Consensus 367 i~--------~~~~d~~~~p~n~~~~~~~~~~--~e~~~~~~~~~g~ 403 (403) T protein:vir:10 367 LD--------DEQMNKIRIPANVAGSATGVSG--QEGGRPKGSTEGD 403 (403) T ss_pred CC--------cccccccccccccccccccCCC--CcCCCCCCCcCCC Confidence 42 0112222233222211111111 1111111122222 No 90 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=99.51 E-value=3.8e-12 Score=83.16 Aligned_cols=450 Identities=10% Similarity=-0.008 Sum_probs=220.1 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAA 80 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~ 80 (556) +.+..-+... ..........+. .....|+-+.. +.-.. ........ ..+. | .-+++++ T Consensus 38 ~~~~~~l~~~-----i~~~~~~~~~r~-~~l~~yY~g~~-~~i~~--~~~~~~~~---~~~~-----k-----i~~n~~k 95 (501) T protein:vir:27 38 VNNWELLKNF-----INHHKLRQAPRI-QELLDYARGEN-HDVLQ--FGRRKDRE---MADK-----R-----AVHNYGR 95 (501) T ss_pred cccHHHHHHH-----HHHHHHHHHHHH-HHHHHHhcCCC-ccccc--cCccCccc---cccc-----e-----eccchHH Confidence 1111111111 000000000011 11123332221 11110 00000000 0000 1 1378999 Q ss_pred HHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCce Q lcl|NC_019524. 81 GVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEV 160 (556) Q Consensus 81 ~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~ 160 (556) -+|+..+.+++|.++++....+ + -++.+...|++|+.. 
.+|...+..+.+..+.-|.+ T Consensus 96 ~Ivd~~~~yl~g~p~~~~~~d~--------~----~~~~~~~~l~~~~~~----------n~~~~~~~~~~~~~~~~G~a 153 (501) T protein:vir:27 96 MISKFKTGYLAGNPIRVEYDDN--------D----NNSQNDDTIKRIGRI----------NDIDSHNRTLIRDLSQTGRA 153 (501) T ss_pred HHHHHHhhhhcccCeeEecCCc--------c----chHHHHHHHHHHHHh----------cChhHHHHHHHHHHhhCCeE Confidence 9999999999999987765321 1 133444555555543 25788888999999999999 Q ss_pred EEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEE----CCCCCeEEEEEeecCCCccccCCcccc Q lcl|NC_019524. 161 LATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQL----DNNGAALGYWLRKAFPGDPTDMEQWKW 236 (556) Q Consensus 161 f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~----d~~Gr~vaY~i~~~hpgd~~~~~~~~~ 236 (556) |+.+.... .+ .+++..++|..+---++......+..+|.+ +..+...-+.|+..+---. ......| T Consensus 154 ~~~vy~de-d~--------~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~~~~vyt~~~v~~-~~~~~~~ 223 (501) T protein:vir:27 154 YEVIYRNE-YD--------ETRIKRLNPLETFVIYDNSLEDNSIAAVRYYNRGTLQNAKDVVEIYTNEHIYT-LDASDDF 223 (501) T ss_pred EEEEEeCC-CC--------ceEEEEEccceeEEEecCCCCCceEEEEEEEEeeecCCcEEEEEEEeCCeEEE-EEeCCce Confidence 98754322 11 257899999988543433334456777754 2223333333443321000 0000011 Q ss_pred ce---ee-ccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCccccccccc Q lcl|NC_019524. 237 GY---EP-ARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQL 312 (556) Q Consensus 237 ~r---v~-~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~ 312 (556) .- .| .+..|| |+++... ..|+|.|.+++..+..++....-......-.+.-..+++....... 
T Consensus 224 ~~~~~~~~~~g~vP---vv~~~nn-----~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~----- 290 (501) T protein:vir:27 224 NEISVTTHAFGTVP---ITEFLNN-----VDGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPK----- 290 (501) T ss_pred eeccccccCCCccc---EEEecCC-----CCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccCCc----- Confidence 11 11 012233 4554432 3689999998888877776555444444433333333432211100 Q ss_pred ccccccccccccccccccccccccccceec-CCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCH---H Q lcl|NC_019524. 313 GMGQGGFKEIFNEYMTGLANYVAQTKNIAI-DGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSY---E 388 (556) Q Consensus 313 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l-~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~y---e 388 (556) ++... .......+.+ .+|....+..+-++++++.+.+...+..+.+.+.+.|..-.++|- + T Consensus 291 ----~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~ 355 (501) T protein:vir:27 291 ----GMQAS-----------DMKRTRLMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNIPDMSDT 355 (501) T ss_pred ----ccchh-----------hhhhcCceeecccccccCCCCCcceeeeeccCCHHHHHHHHHHHHHHHHHHhCCcccCcc Confidence 00000 0001111122 233344456677899999888888999999999999888888773 2 Q ss_pred HhhchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeee Q lcl|NC_019524. 389 QFSRDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWI 468 (556) Q Consensus 389 ~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~ 468 (556) .++++ +|-.+++..+..........+..|-. .++.+++..+...-..+...-+. ..-+.+.|. 
T Consensus 356 ~~~~n---~Sg~Al~~~~~~l~~ka~~~~~~~~~-~l~~~~~li~~~~~~~~~~~~~d-------------~~~i~v~f~ 418 (501) T protein:vir:27 356 NFSGN---TSGEALKYKLFGLDQDRVDTQSQFTQ-GLKRRYRLAARIGSLVNEFKDFD-------------ESLLKITFT 418 (501) T ss_pred ccccC---chHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhhcccccccc-------------cccceEEeC Confidence 33333 33345555554444444444444433 34445555444322222211100 012456774 Q ss_pred cCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCC--CHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCC Q lcl|NC_019524. 469 GASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGG--DFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSEST 546 (556) Q Consensus 469 ~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~--D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~ 546 (556) ++-. .|-...+++..++ .|+.|.+..+...+. |+++-++++.+|++.....+...+.. ..........+... T Consensus 419 ~~~p--~n~~e~ad~~~kl--~g~iS~et~l~~l~~v~D~~~E~eri~~E~~e~~~~~~~~~~~--~~~~~~~d~~~~~~ 492 (501) T protein:vir:27 419 PNLP--KSLNEQVSILTGL--GGQVSQETALSLSGLVESPNEELDKINKEVSEIDFKGYSNDFN--EHVGKYTDEVKETH 492 (501) T ss_pred CCCC--cCHHHHHHHHHHH--hccCcHHHHHHhCCCCCCHHHHHHHHHHHHHhhhHhhhcCccc--cccccccCCCCCCc Confidence 3322 4555555554443 699999999998864 79999999999987655444332111 11111122222222 Q ss_pred CCCCCCcCC Q lcl|NC_019524. 547 SDNPNEETT 555 (556) Q Consensus 547 ~~~~~~e~~ 555 (556) +++.++.+| T Consensus 493 ~d~~e~~~~ 501 (501) T protein:vir:27 493 TDDFERAYE 501 (501) T ss_pred cccccccCC Confidence 222222222 No 91 >protein:vir:80796 Length: 574 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504121;genbank:gi:158079308;genbank:GeneID:5666445 Probab=99.50 E-value=8.7e-14 Score=92.19 Aligned_cols=449 Identities=11% Similarity=0.038 Sum_probs=206.0 Q ss_pred CCcchhh-------hHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 
1 MKDVKKT-------TRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMV 73 (556) Q Consensus 1 ~sp~~~~-------~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~ 73 (556) .++.+.. .....++++.......+. .......+..+.++. |....+. ++ .+.. ... T Consensus 29 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~-------~~~~~~~-----~~---~~~l-~~~ 91 (574) T protein:vir:80 29 LREIDTNVVNNEPYSMESIEKGMNGKTTAYMQ-PIIGEMSVNPGYKTK-------PSIRNSQ-----DL---HKTL-KKF 91 (574) T ss_pred cchhhhhhhhccCCCHHHHHHhHhhhcccccc-hhhhhccccccccCc-------CccCCcc-----cH---HHHH-Hhh Confidence 1111111 101112222111111000 000011111221111 1111111 11 1111 222 Q ss_pred hcChHHHHHHHHHHhhhc-----------cCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceeh-hccc Q lcl|NC_019524. 74 QNDGYAAGVVAVHRDSIV-----------GSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDA-RRMC 141 (556) Q Consensus 74 rNn~~a~~~v~~~~~nvV-----------G~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~-~g~~ 141 (556) .+|+++..+|++..+.|- |-++.+..+-. +.+....-......+++...... ++. -.+. T Consensus 92 ~~~~iv~~~i~~~~~~V~~~~~~i~~~ia~lp~~i~~kd~------~~~~~~~~~~~~~~l~~ll~~~~---~~~nP~~~ 162 (574) T protein:vir:80 92 GNNIILNAIINTRSNQVSMYCKPARNSETGVGYEIRLKDI------EAEPTSHDIANIKRIESFLENTA---QFRDPNRD 162 (574) T ss_pred ccChhHHHHHHHHHHHHHHHHHHHHhhhccCceEEEEecc------CCCccchhhhhhhHHHHHHhccC---CCCCCccc Confidence 356888777777766553 34555543311 11111111122222222221111 121 1346 Q ss_pred CHHHHHHHHhhhheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEe Q lcl|NC_019524. 142 TLTGLTRLAVSGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLR 221 (556) Q Consensus 142 ~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~ 221 (556) +|.++.+.++..++..|.+|+.+++... | .+..|..|+|..|..-.+. ++.. ...+ .+-|.+. 
T Consensus 163 s~~ef~~~lv~~lll~Gnayi~i~r~~~------G--~~~~L~pl~p~~V~v~~d~-~~~~-------~~~~-~~y~~~~ 225 (574) T protein:vir:80 163 NFTTFCKKLVRATYMYDQVNFEKVFDKD------G--NFIKFDTVDPTTIFLATNG-EGKL-------IKNG-ERFVQVI 225 (574) T ss_pred cHHHHHHHHHHHHHhcCCeEEEEEECCC------C--cEEEEEEEcCceeEEEEcC-cccc-------ccCc-eEEEEEe Confidence 8999999999999999999998775321 2 3578899999998532111 1100 0111 2222221 Q ss_pred ecCCCccccCCccccceeeccccCChhHeEeeecccCCC---cccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeee Q lcl|NC_019524. 222 KAFPGDPTDMEQWKWGYEPARFDWGRRRVIHIIEALLAG---QTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAAS 298 (556) Q Consensus 222 ~~hpgd~~~~~~~~~~rv~~~~~v~a~~viH~f~~~r~g---Q~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~f 298 (556) . +. ....+++.+|||+...-.++ ...|+|++..+...+......++....-.+=.+...++ T Consensus 226 ~---g~-------------~~~~~~~~eiih~~~~~~~~~~~~~~G~spi~~a~~~i~~~~~a~~~~~~~f~ng~~p~gi 289 (574) T protein:vir:80 226 D---NR-------------IVAKFNERELAFAVRNPRADIEVGQYGYPELEIALKQFIAHENTEVFNDRFFSHGGTTRGI 289 (574) T ss_pred C---Cc-------------eEEEEccccEEEEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceE Confidence 1 00 01235778999997654443 44699999888877776666666666666666777888 Q ss_pred EeccCcccccccccccccccccccccccccccccccccccceecCCcee-eecCCCceeeeecCCCCCccHHHHHHHHHH Q lcl|NC_019524. 299 VESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKI-PHLYPGTKLKMQPAGTPGGVGTDYEQSLLR 377 (556) Q Consensus 299 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i-~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr 377 (556) |+.+.+.... .+....+.+... ....+. -..|.+ +.+..|.+++.++.+.-...|.+..+.... 
T Consensus 290 l~~~~~~~ls--------~e~~~~lk~~~~---~~~~G~----~n~g~~~vl~~~G~~~~~l~~s~~D~qfle~~~~~~~ 354 (574) T protein:vir:80 290 LHVKTGQQQS--------QQALDIFRREWR---SSLAGI----NGSWQIPVVSAEDVKFVNMTPSANDMQFEKWLNYLIN 354 (574) T ss_pred EEeCCCCCCC--------HHHHHHHHHHHH---HHhccc----cccccceeecCCCceEEEccCChhHHHHHHHHHHHHH Confidence 8754321110 000000000000 000000 124444 445678999999888788889999999999 Q ss_pred HHHHhcCCCHHHhhchhhc----------ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCC Q lcl|NC_019524. 378 NIAASLGMSYEQFSRDYTK----------TNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPG 447 (556) Q Consensus 378 ~iaaglGi~ye~l~~D~s~----------~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~ 447 (556) .||+.+|||.+.| |+.+. +|||++.+... .++...++|+..+ ++.++.+..++ .. T Consensus 355 ~Ia~afgVPp~~l-G~~~~~t~~gs~~~~~n~sn~E~~~~-----------~f~~~tL~P~~~~-ie~~ln~~Ll~--~~ 419 (574) T protein:vir:80 355 VISALYGIDPAEI-NFPNNGGATGSKGGSLNEGNSKEKMQ-----------ASQNKGLQPLLRF-IEDTVNTYIVA--EF 419 (574) T ss_pred HHHHHhCCCHHHh-cccccccccccccccccchhHHHHHH-----------HHHHHHHHHHHHH-HHHHHHhhhhh--hc Confidence 9999999999855 55543 35565544443 3455666775554 45555554432 11 Q ss_pred cccccccchhhHHHhhCeeeecCcccccchhhhhHH--HHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCC Q lcl|NC_019524. 448 KNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEA--AILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKL 525 (556) Q Consensus 448 ~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A--~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl 525 (556) ... ...+|... |.....+. ....+.+|+.|+-|+-+..|+.|-+--+.+..-.. +..++. T Consensus 420 ~~~------------~~~~f~~~-----d~~~~~~~~~~~~~~~~G~lT~NE~R~~lgl~Pi~gGD~~~~~~n-~~~~~~ 481 (574) T protein:vir:80 420 GEK------------YQFQFRGG-----DLSAQLDKLKIIEQEGKVFRTVNEIRHDKGLEPIKGGDVILNGVH-IQAIGQ 481 (574) T ss_pred CCc------------eEEEeccc-----chhhHHHHHHHHHHHhCCccCHHHHHHHhCCCCCCCCCEeeeccc-eeeccc Confidence 100 11233322 33333332 23467899999999999999987542221110000 000000 Q ss_pred CCC------------------ccccccCCCCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 
526 DFT------------------GKMVEGNSTQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 526 ~~~------------------~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 556 (556) ... .............+.+.+.|+.+.+.++ T Consensus 482 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~d~~~~~~~~ 530 (574) T protein:vir:80 482 ALQEEQLEYQRSQDRLNRLLELSGGDVEQPEPEEPKDSQNDTDVSFQDE 530 (574) T ss_pred ccccccCCccchhccccccccccCCCCCCCCCCCCCCccccccchhhhh Confidence 000 0000000000000000111111111111 No 92 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=99.50 E-value=4.4e-12 Score=82.83 Aligned_cols=451 Identities=10% Similarity=0.009 Sum_probs=221.3 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAA 80 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~ 80 (556) +.+..-+... ..........+..-...-|.|-. +.-. .+...... .+ + +.-.-+++++ T Consensus 38 ~~~~~~i~~~-----i~~~~~~~~~r~~~~~~yY~g~~--~~i~---~~~~~~~~-----~~------~-~~ri~~n~~k 95 (501) T protein:vir:96 38 VNNWELLKNF-----INHHKLRQAPRIQELLDYARGEN--HDVL---KSGRRKDN-----EM------A-DKRAVHNYGR 95 (501) T ss_pred CChHHHHHHH-----HHHHHHHHHHHHHHHHHHhcCCC--Cccc---CccccCcc-----cc------c-cceeecchHH Confidence 2221111111 11000000011111112244321 1111 11110000 00 0 0012478999 Q ss_pred HHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCce Q lcl|NC_019524. 81 GVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEV 160 (556) Q Consensus 81 ~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~ 160 (556) -+|+..+.+++|.+++....- ++. ++.+.+.++.|+.. 
.+|...+..+.+..+.-|.+ T Consensus 96 ~Ivd~~~~yl~g~p~~~~~~~--------~~~----~~~~~~~l~~~~~~----------n~~~~~~~~~~~~~~~~G~a 153 (501) T protein:vir:96 96 MISKFKTGYLAGNPIRVEYDD--------NDD----NSQNDDAIKRIGRI----------NDLDSLNRTLIRDLSQTGRA 153 (501) T ss_pred HHHHHHhhhhcccCeeEeeCC--------ccc----hhHHHHHHHHHHHh----------cCHHHHHHHHHHHHhhcCeE Confidence 999999999999998876531 122 23334444444432 25778888889999999999 Q ss_pred EEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEE----CCCCCeEEEEEeecCCCccccC-Cccc Q lcl|NC_019524. 161 LATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQL----DNNGAALGYWLRKAFPGDPTDM-EQWK 235 (556) Q Consensus 161 f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~----d~~Gr~vaY~i~~~hpgd~~~~-~~~~ 235 (556) |+.+.. ...+ .+++..++|..+-.-++....+.+..+|.+ +..+....|.++..+ ..+.. .... T Consensus 154 ~~~v~~-dedg--------~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~vyt~~--~i~~~~~~~~ 222 (501) T protein:vir:96 154 YEVIYR-SEYD--------ETRIKRLSPLETFVIYDNSLEDNSIAAVRYYNRGTLQSAKDVVEIYTDE--HIYTLDASDD 222 (501) T ss_pred EEEEEE-cCCC--------ceEEEEEccceeEEEEcCCCCCceEEEEEEEEeecCCCcEEEEEEEcCC--cEEEEeeCCC Confidence 977543 2211 257899999988543443334567788765 233443334444332 11110 0111 Q ss_pred cc---eee-ccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccc Q lcl|NC_019524. 236 WG---YEP-ARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQ 311 (556) Q Consensus 236 ~~---rv~-~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~ 311 (556) +. ..| .+..+| |+++... ..|+|.|.+++..+..++....-......-.+.-..+++-...... 
T Consensus 223 ~~~~~~~~~~~g~vP---vv~~~nn-----~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~G~~~~~~---- 290 (501) T protein:vir:96 223 FNEISVTTHAFGTVP---ITEYLNN-----IDGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPK---- 290 (501) T ss_pred ceeccccccCCCccc---eEEecCC-----ccCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecccccCc---- Confidence 11 111 112233 5555432 3699999997777765555443333333333333344432211110 Q ss_pred cccccccccccccccccccccccccccceec-CCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHh Q lcl|NC_019524. 312 LGMGQGGFKEIFNEYMTGLANYVAQTKNIAI-DGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQF 390 (556) Q Consensus 312 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l-~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l 390 (556) ++.. ........+.+ .++......++-++++++.+.+...+..+.+.+.+.|....++|-..+ T Consensus 291 -----~~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~ 354 (501) T protein:vir:96 291 -----GMQA-----------SDMKRTRLMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNTPDMSD 354 (501) T ss_pred -----ccch-----------hhhhhcCeeeecccccccccccCcceeeEeccCCHHHHHHHHHHHHHHHHHHhCCcccCc Confidence 0000 00001111112 233333456677899999898999999999999999888888773222 Q ss_pred hchh-hcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeec Q lcl|NC_019524. 391 SRDY-TKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIG 469 (556) Q Consensus 391 ~~D~-s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~ 469 (556) +.+ ++.|-.+++..+..........+..|-..+ +.+++..+...-..+...-+. ..-+.+.|.. 
T Consensus 355 -~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l-~~~~~li~~~~~~~~~~~~~d-------------~~~i~i~f~~ 419 (501) T protein:vir:96 355 -TNFSGNTSGEALKYKLFGLDQDRVDTQSQFTKGL-KRRYRLAARIGSLVNEFKDFD-------------ESLLKITFTP 419 (501) T ss_pred -ccccccchHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhcccccccc-------------cccceEEeCC Confidence 122 223334555555555555555555554443 444554444333333211111 0124677744 Q ss_pred CcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCC--CHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCC Q lcl|NC_019524. 470 ASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGG--DFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTS 547 (556) Q Consensus 470 p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~--D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~ 547 (556) +-. .|-...+++..+. .|+.|.+..++..+. |+++.++++.+|++.+...+......+. .........+.+. T Consensus 420 ~~p--~n~~e~ad~~~kl--~g~iS~et~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~--~~~~~~~~~e~~~ 493 (501) T protein:vir:96 420 NLP--KSLNEQVSILTGL--GGQVSQETALSLSGLVESPNEELDKINKEMSEIDFKGYSNDFNEH--VGKYTDEVKETHT 493 (501) T ss_pred CCC--cCHHHHHHHHHHH--hccCchHHHHHhCCCCCCHHHHHHHHHHHHHHhhccccccchhhc--ccccCCcCCCCCC Confidence 333 4666666665555 589999999999865 8999999999998876544433222111 1111111111222 Q ss_pred CCCCCcCC Q lcl|NC_019524. 548 DNPNEETT 555 (556) Q Consensus 548 ~~~~~e~~ 555 (556) ++++.+-+ T Consensus 494 d~~e~~~~ 501 (501) T protein:vir:96 494 DDFEREYE 501 (501) T ss_pred CccccccC Confidence 22222222 No 93 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=99.50 E-value=5.1e-12 Score=82.49 Aligned_cols=450 Identities=15% Similarity=0.106 Sum_probs=212.0 Q ss_pred hcccchh-------hh------hhhhc--------chhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019524. 
18 VVAETAT-------AT------PMAVG--------GGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQND 76 (556) Q Consensus 18 ~~~~~~~-------~~------~~~~~--------~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn 76 (556) +.++... .. ..... ..|.-+... ... ....... .+ +++-.-. T Consensus 1 ~~~~i~~~~~~~~~~~~~~~L~~~~~~~~~r~~~~~~YY~G~~~---i~~---~~~~~~~----~~-------~~~~~~~ 63 (485) T protein:vir:24 1 MTAPLPGQEEIADPAIARDEMVSAFEDQNQNLRSNTSYYEAERR---PEA---IGVTVPV----QM-------QSLLAHV 63 (485) T ss_pred CCCCCCCCCcccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCc---hhh---cCcccch----hh-------hhhhhcc Confidence 1111100 00 11000 001111110 000 0000011 11 1122224 Q ss_pred hHHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhhee Q lcl|NC_019524. 77 GYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLM 156 (556) Q Consensus 77 ~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~ 156 (556) .|++-+|+..++.+++.||+..- +....+.+...|+ . + +|...+..+.+..++ T Consensus 64 n~~~~ivd~~~~~l~~~g~~~~~-------------~~~~~~~l~~i~~---~-----N------~~d~~~~~~~~~a~i 116 (485) T protein:vir:24 64 GYPRLYVDSIAERQAVEGFRLGD-------------ADEADEELWQWWQ---A-----N------NLDIEAPLGYTDAYV 116 (485) T ss_pred chHHHHHHHHhhhhccCceecCC-------------CchhHHHHHHHHH---h-----c------ChhHHHHHHHHHHhh Confidence 68999999999999999987431 1122233333332 2 1 577888899999999 Q ss_pred cCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEE---CCCCCeEEEEEeecCCCccccCCc Q lcl|NC_019524. 157 TGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQL---DNNGAALGYWLRKAFPGDPTDMEQ 233 (556) Q Consensus 157 dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~---d~~Gr~vaY~i~~~hpgd~~~~~~ 233 (556) -|.+|+.+..........++.+.+ .+..++|+.+-.-++... +.+..+|.+ +..+....+.++...---.+.... 
T Consensus 117 ~G~ay~~v~~~~~~~~~~~~~~~~-~i~~~~p~~~~~i~D~~~-~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~ 194 (485) T protein:vir:24 117 HGRSYITISRPDPQIDLGWDPNVP-LIRVEPPTRMYAEIDPRI-GRPAKAIRVAYDAEGNEIQAATLYTPNETFGWFRAE 194 (485) T ss_pred cCceEEEEecCCcccccccCCCcc-eEEEeccceeEEEeeCCc-CceeEEEEEEEeecCCeEEEEEEEcCCcEEEEEecC Confidence 999998864432211111122222 688899987743333222 234444432 233444444444332110111112 Q ss_pred cccceeec-cccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCccccccccc Q lcl|NC_019524. 234 WKWGYEPA-RFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQL 312 (556) Q Consensus 234 ~~~~rv~~-~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~ 312 (556) ..|.-+.. ....+.-=|+|+....+.+-.-|.|.|++.+..| ++.|..+....+.++-.++.-+.--.+... ... T Consensus 195 ~~~~~~~~~~h~~g~vPvv~f~n~~~~~~~~G~s~i~~~v~~l--iDa~~~~~s~~~~~~~~~a~p~~~i~G~~~--~~~ 270 (485) T protein:vir:24 195 GEWVEWFSDPHGLGAVPVVPLPNRTRLSDLYGTSEITPELRSM--TDAAARILMLMQATAELMGVPQRLIFGIKP--EEI 270 (485) T ss_pred CceEeecccccCCCcccEEEeccCcccCCcCCcccchhhHHHH--HHHHHHHHHHHHHHHHhhcchhhhhccCCc--ccc Confidence 22321111 1234444578887777777778999999755444 345555555555444443322111101000 000 Q ss_pred ccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCC-ccHHHHHHHHHHHHHHhcCCCHHHhh Q lcl|NC_019524. 313 GMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPG-GVGTDYEQSLLRNIAASLGMSYEQFS 391 (556) Q Consensus 313 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~-~~f~~F~~~~lr~iaaglGi~ye~l~ 391 (556) +. ....+.....+.+|.|..++ |+++++.+.+..+ .+|...++..++.++...++|-+.|. 
T Consensus 271 ~~-----------------~~~~~~~~~~~~~~~i~~~~-~~~~~~~q~~~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg 332 (485) T protein:vir:24 271 GV-----------------DPETGQTLFDAYLARILAFE-DAEGKIQQFSAAELANFTNALDQIAKQVAAYTGLPPQYLS 332 (485) T ss_pred cc-----------------ccccccchhhhcccceeccC-CCCceEEeecccchHHHHHHHHHHHHHHhcccCCCHHHhc Confidence 00 00112233345677776654 4566665543222 35666667779999999999999887 Q ss_pred chhhc-ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecC Q lcl|NC_019524. 392 RDYTK-TNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGA 470 (556) Q Consensus 392 ~D~s~-~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p 470 (556) ++..+ +|-.+.|..+.......+..|..|-..+- .+++..+ ++..+ ...+.. ..-+.+.|..| T Consensus 333 ~~~~n~~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~-~~~~l~~--~~~~~-~~~~~d------------~~~i~v~f~~~ 396 (485) T protein:vir:24 333 TAADNPASAEAIRAAESRLIKKVERKNAIFGGAWE-EAMRLAY--RLMKG-GDVPPD------------MLRMETVWRDP 396 (485) T ss_pred cccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHH--HHhcC-CCCccc------------cceeeEEecCC Confidence 66432 34556677777776667777766655543 3444322 33333 222211 11246788766 Q ss_pred cccccchhhhhHHHHHHHHcC--CCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCC Q lcl|NC_019524. 471 SRGQIDEKKETEAAILRIKNG--LSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSD 548 (556) Q Consensus 471 ~~~~iDP~Ke~~A~~~~i~~G--~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~ 548 (556) ... +-...+.+..+.+.+| +.|.+..+...|.+++++ +++.+..+...+.+............... ..+...+ T Consensus 397 ~~~--s~~~~ad~~~kl~~~g~~~~s~et~~~~l~~~~d~~-~e~~~~~ee~~~~~~~~~~~~~~~~~~~~--~~~~~~e 471 (485) T protein:vir:24 397 STP--TYAAKADAATKLYGNGQGVIPRERARKDMGYSIAER-EEMRRWDEEEAAMGLGLLGTMVDADPTVP--GSPNPTP 471 (485) T ss_pred CCC--CHHHHHHHHHHHHhcccccCCHHHHHhhCCCCHhHH-HHHHHHHHHHhhhhhhHHHhhcccCCCCC--CCCCCCC Confidence 654 4555666677777765 788888888889987664 34443332222222211000000000000 0011111 Q ss_pred CCCCc-C---CC Q lcl|NC_019524. 
549 NPNEE-T---TQ 556 (556) Q Consensus 549 ~~~~e-~---~~ 556 (556) +++++ + .+ T Consensus 472 ~~~~~~~~~~~~ 483 (485) T protein:vir:24 472 APKPQPAIEGGD 483 (485) T ss_pred CCCCccCCCCCC Confidence 11111 1 11 No 94 >protein:vir:9702 Length: 406 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795464;genbank:gi:28876227;genbank:GeneID:1257772 Probab=99.49 E-value=5.5e-14 Score=93.28 Aligned_cols=402 Identities=10% Similarity=0.007 Sum_probs=201.6 Q ss_pred chhccccCCCcccccccCCCCCHHHHHHHHH--HHHHHHHHHHHhcChHHHHHHHHHHhhhccCCceeeeeccccccCCC Q lcl|NC_019524. 32 GGMEGAERTTREMFQWNPSIISPDQQIAQNQ--DMASARAQDMVQNDGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAP 109 (556) Q Consensus 32 ~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~--~~lr~RaRdl~rNn~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~ 109 (556) .++..- ++... .+.+..+..-. .....-+..-+..++-+-.+|+.|.+.|-.-=|+..- .+ T Consensus 1 m~~f~~-~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~Al~~~~V~~~i~~Ia~~iA~lp~~~~~--------~~ 63 (406) T protein:vir:97 1 MSFFQP-LGTSK--------VSYDDYISSVLAGDVSQKYLGVSALKNSDILTATSIIAGDIARFPLVKKD--------VN 63 (406) T ss_pred Cccccc-cCCCC--------CCcchHHHHHhcCCCCcccccchhhccHHHHHHHHHHHHhhhhCeeEEEe--------cC Confidence 011100 11000 00000000000 0000001111234566777888888877754333221 11 Q ss_pred hhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEEEeeccCCCCcCCCcccceEEEEEchh Q lcl|NC_019524. 110 DGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPY 189 (556) Q Consensus 110 ~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~d 189 (556) ++... + ..++..+..+|+ ..+|..++-...+..++..|++|+.+...... 
..+..|..|.|+ T Consensus 64 g~~~~---~--~~~~~lL~~~PN------~~~t~~~f~~~~~~~l~l~Gnay~~i~r~~~~-------g~~~~L~~i~p~ 125 (406) T protein:vir:97 64 GDIIH---D--EDINYLLNVKST------SNASARTWKFAMAVNAILTGNSFSRILRDPKT-------NQALQFQFYRPS 125 (406) T ss_pred ccccc---c--chHHHHhhccCC------CCCCHHHHHHHHHHHHhhcCCeEEEEEecCCC-------CeEEEEEEECCC Confidence 11110 0 123333333444 46799999999999999999999987643211 235688888888 Q ss_pred hcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceeeccccCChhHeEeeecccCCCcccCCchhh Q lcl|NC_019524. 190 RMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPARFDWGRRRVIHIIEALLAGQTRGISEMV 269 (556) Q Consensus 190 rl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la 269 (556) .|.. +.+..|+ +-|.+.....+ ....+++.+|||+...- .+...|+|++. T Consensus 126 ~v~v--------------~~~~~~~-~~y~~~~~~~~--------------~~~~~~~~evih~r~~~-~dg~~G~spi~ 175 (406) T protein:vir:97 126 ETTV--------------EETDNHE-IVYTFTDMLTA--------------KQVKCFAHDVIHWKFFS-HDTILGRSPLL 175 (406) T ss_pred eeEE--------------EEcCCce-EEEEEEecCCc--------------eEEEEccccEEEecCCC-CCCcccccHHH Confidence 7742 2333333 44665432211 11246788999997543 45577999998 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccccccccccccccccccceecCCceeee Q lcl|NC_019524. 270 SALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPH 349 (556) Q Consensus 270 ~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~ 349 (556) .+...+.-...-.++...-.+=.+.-.++++.+..- .++..+..... ...... .-..|.+.. 
T Consensus 176 ~~~~~i~~~~a~~~~~~~~f~ng~~~~~i~~~~~~l----------~~e~~~~~~~~-------~~~~~~-g~n~g~~~v 237 (406) T protein:vir:97 176 SLGDEIDLQTGGINTLIKFFKDGFSSGILTMKGAQL----------SGDARQRARQE-------FEKMRE-GSVGGSPLV 237 (406) T ss_pred HHHHHHHHHHHHHHHHHHHHhccCCCceEEecCCCC----------CHHHHHHHHHH-------HHHHhc-ccccCceee Confidence 877777655555555544444444444444432210 01111111000 000000 113467778 Q ss_pred cCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 350 LYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIY 429 (556) Q Consensus 350 L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~ 429 (556) |..|.+++.++.+.-..+|-+..+....+||+.+|||-+.| |. +++||++.+....| +...+.|+. T Consensus 238 l~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~afgVPp~~l-g~--~~~~~~~e~~~~~f-----------~~~~l~P~~ 303 (406) T protein:vir:97 238 FDSTMEYTPLEIDTNVLQLITSNNFSTAQIAKALRVPSYKL-GV--NSPNQSVAQLMEDY-----------VTNDLPFYF 303 (406) T ss_pred cCCCceEEEccCCHHHHHHHHHHHhhHHHHHHHhCCCHHHc-CC--CCCcchHHHHHHHH-----------HHHHHHHHH Confidence 99999999998765556677778888999999999999977 42 45777654444333 344456666 Q ss_pred HHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCCCHHHH Q lcl|NC_019524. 430 TLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGGDFREV 509 (556) Q Consensus 430 ~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v 509 (556) ..+ +.++....++ +..... +.+++--. -|...++++..+.+.+|+.|.-|+-...|..|-+- T Consensus 304 ~~i-e~~l~~kll~-~~~~~~------------~~i~fd~~----~~~~~~~~~~~~~~~~g~~T~NE~R~~~g~~p~~~ 365 (406) T protein:vir:97 304 DAI-TSELGLKTLN-DKDRRL------------YHIEFDTR----SVTGRNVDEIVKLVNNQILTPNQGLVELGKQKSTD 365 (406) T ss_pred HHH-HHHHhhhhcC-hhhccc------------eeEEEecC----ccchhhHHHHHHHHhCCCcCHHHHHHHhCCCCCCC Confidence 653 4444333332 211110 11222111 25666788888999999999999988888876422 Q ss_pred --HHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCcCC Q lcl|NC_019524. 
510 --FKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEETT 555 (556) Q Consensus 510 --~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~ 555 (556) .+++-- -++... .+...........+..+.+.+.+++++ T Consensus 366 ~~gD~~~~------~~n~~~-~~~~~~~~~~~~~~~~gg~~~~~~~~~ 406 (406) T protein:vir:97 366 PNMDRYQS------SLNYVF-LDKKEEYQDKVGIKGKGGEVNAEEDKS 406 (406) T ss_pred CCCCeEee------ccCccc-hhcccccccccccccCCCCCCCCCCCC Confidence 111100 011100 000000011111111111222222222 No 95 >protein:vir:8317 Length: 409 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817885;genbank:gi:29566318;genbank:GeneID:1259513 Probab=99.49 E-value=1.4e-14 Score=96.60 Aligned_cols=389 Identities=10% Similarity=-0.006 Sum_probs=200.3 Q ss_pred CCcchhhhHHHHHhhHhhcc--cchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVA--ETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGY 78 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~--~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~ 78 (556) ++|-.+..-.. .-.++++. +...+..... ..+.+...-+-....|.+.. ...-+...+..++. T Consensus 16 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~g~~~~~~~~~-------------~~~~t~~~~~~~~~ 80 (409) T protein:vir:83 16 LPNDNGPVDYN-PGDPDMVEFRGPEEEPEARA-LPWIRPTAWSGYPESWATPS-------------WGSAQDKLRTLIDV 80 (409) T ss_pred ccccccccccc-CCCCceeeccCCCcchhhhh-cccccccccccccccccccC-------------ccccchhhHhhhHH Confidence 11111110000 00000000 0000000000 00000000000001222211 11233445667888 Q ss_pred HHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecC Q lcl|NC_019524. 79 AAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTG 158 (556) Q Consensus 79 a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dG 158 (556) +-.+|+.|.+.|-.-=|.+.- + ++.. .........+++ -.+|..++-...+..++. 
| T Consensus 81 v~acV~~Ia~~iA~lpl~~~~--~-------~~~~-------~~~~~ll~~~PN------~~~t~~~f~~~l~~~lll-G 137 (409) T protein:vir:83 81 AWACIDLNASVLSSMPIYRMR--N-------GRII-------DSVAWMSNPDPE------VYTSWQEFAKQLFWDFQL-G 137 (409) T ss_pred HHHHHHHHHHhhccCceEEee--C-------Cccc-------cchhhhcccCCC------CCCCHHHHHHHHHHHHhh-C Confidence 999999999987764343321 1 1111 111111223444 335777777777777777 8 Q ss_pred ceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccce Q lcl|NC_019524. 159 EVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGY 238 (556) Q Consensus 159 E~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~r 238 (556) .+|+........ ..+..|..|+|+++ -|+.+.+|+ +-|.+... T Consensus 138 nay~~~i~r~~~-------G~~~~L~pl~p~~v--------------~v~~~~~g~-~~y~~~~~--------------- 180 (409) T protein:vir:83 138 EAFVLPMAHGSD-------GYPIRFRVVPPWLV--------------NVELKKGAR-REYRIGGL--------------- 180 (409) T ss_pred CcEEEEEEECCC-------CcEEEEEEECCcce--------------EEEEcCCce-EEEEEccc--------------- Confidence 898765432221 24678888988766 245565554 33655211 Q ss_pred eeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCccccccccccccccc Q lcl|NC_019524. 239 EPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGG 318 (556) Q Consensus 239 v~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~ 318 (556) ...++|||+......+...|+|++..+...+.-....++....-.+=++.-.++++.+..- ..+ T Consensus 181 ------~~~~eiiHir~~~~~~~~~G~spi~~~~~~i~~~~a~~~~~~~~f~nga~p~gil~~~~~l----------s~e 244 (409) T protein:vir:83 181 ------NVTDEILHIRYQGNTADAHGHGPLESAAPRQVVIGLLQKYVQNLAETGGVPLYWLGVERRL----------SET 244 (409) T ss_pred ------cCccceEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEeecCCCC----------CHH Confidence 1235899998777778889999998877776655555554444444456677777754321 111 Q ss_pred ccccccccccccccccccccceecCCceeeecCCCcee-eeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhh--chhh Q lcl|NC_019524. 
319 FKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKL-KMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFS--RDYT 395 (556) Q Consensus 319 ~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i-~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~--~D~s 395 (556) ..+........ ... -..|....|.+|.++ +..+-+.-..+|.+-.+...++||..+|||.+.|- +|.+ T Consensus 245 ~~~~~~~~~~~---~~~------~nag~~~il~~g~~~~~~~~~s~~d~q~le~r~~~~~eIa~~fgVPp~llg~~~~~~ 315 (409) T protein:vir:83 245 EAVDLMDRWIE---SRS------KYAGHPALVTGGATLNQAKSMSAQDLSLMELTQFNEARIAILLGVPPFLVGLPGATG 315 (409) T ss_pred HHHHHHHHHHH---hhC------CccCccceecCCcccccccCCCHHHHHHHHHHHhhHHHHHHHhCCCHHHccCCCCcc Confidence 11111111100 000 034555557777665 34443433456777788899999999999988662 3677 Q ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCccccc Q lcl|NC_019524. 396 KTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQI 475 (556) Q Consensus 396 ~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~i 475 (556) ..+||++.+....|++ .-+.|+..+ ++.++.+..++ .+. .+.| ..-.-.-- T Consensus 316 ~~tysn~eq~~~~f~~-----------~tL~P~~~~-ie~~l~~~Ll~--~~~-~~~f--------------~~~~llr~ 366 (409) T protein:vir:83 316 SLTYSNIEQLFSFHDR-----------SSLRPKATA-VMAALDRWALP--SPQ-HLEL--------------NRDDYTRP 366 (409) T ss_pred ccccccHHHHHHHHHH-----------HHHHHHHHH-HHHHHHHhhCC--CCc-EEEe--------------ehhhhhcc Confidence 7889998877777643 344554444 34444444332 221 1111 11122234 Q ss_pred chhhhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCC Q lcl|NC_019524. 476 DEKKETEAAILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQS 539 (556) Q Consensus 476 DP~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~ 539 (556) |+...+++....+++|+.|+-|+-+..|.+|.+--+.+ ..++. 
T Consensus 367 d~~~r~~~~~~~~~~G~lT~NE~R~~~glpp~~ggd~l---------------------~~~gv 409 (409) T protein:vir:83 367 SLVERATAYKIMIEAGVMEPNEARAMERLHSEAAAVRL---------------------SGGGV 409 (409) T ss_pred CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCccc---------------------CCCCC Confidence 88888999999999999999988766666554422221 11111 No 96 >protein:vir:101494 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655388;genbank:gi:109522576;genbank:GeneID:4157566 Probab=99.48 E-value=3.9e-12 Score=83.13 Aligned_cols=452 Identities=10% Similarity=0.012 Sum_probs=236.4 Q ss_pred hhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHH--HHHHHHhcC--h-----------HHHHHHHHHHhhhc Q lcl|NC_019524. 27 PMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASA--RAQDMVQND--G-----------YAAGVVAVHRDSIV 91 (556) Q Consensus 27 ~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~--RaRdl~rNn--~-----------~a~~~v~~~~~nvV 91 (556) +-...+.|+.++.- +....-.|..-+... ..+|++ -=-+++.|| . ..+-+......++| T Consensus 1 ~~~~~~~~~~~~~~-~~g~~~~p~~v~~~d-----~~Rl~aY~l~~~~y~n~~~~~~~~lrg~~~~~~r~~~~ps~~~~~ 74 (527) T protein:vir:10 1 MGQDKRQYGSTQQL-RAGEANFPNAVTDFD-----KARLASYRLYEDMYLTNTSDYQVILRGGDEGDQRPIYVPNGEKLI 74 (527) T ss_pred CCccccccCCCcCc-CCccccCcccCCHHH-----HHHHHHHHHHHHHhcCchhheeeecCCccccccceeeehhhHHhh Confidence 22223456654432 111111233333332 222222 233556665 1 12334444557888 Q ss_pred cCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEEEeeccCCC Q lcl|NC_019524. 92 GSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLATCEWLNPTG 171 (556) Q Consensus 92 G~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~ 171 (556) |..-++....-.. .++.. .++++..-+.|++.. 
+++++-....|-.++-||...+++|-+..+ T Consensus 75 ~~~~~~~~~g~~~---~~~~~----~e~v~~~lr~~~~~e----------~l~~~~~~~~r~~~vlGDg~f~l~wD~~k~ 137 (527) T protein:vir:10 75 EAKMRFLGQGLKW---EFSKK----DAKVDDAIKVLFDRE----------NWEQKFESLKRWTEIRGDYVLLLIGDDEKD 137 (527) T ss_pred CCcceeeccCccc---cccch----hHHHHHHHHHHHHHh----------hhHHHHHHHHHhhhhhcceeEEEeeccCCC Confidence 8765554332110 11222 334566667787752 366666667777788899888999875332 Q ss_pred CcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEE--------------------------ECCCCCeEE-EEEeecC Q lcl|NC_019524. 172 TTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQ--------------------------LDNNGAALG-YWLRKAF 224 (556) Q Consensus 172 ~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE--------------------------~d~~Gr~va-Y~i~~~h 224 (556) . +. ...|..+||-.+-- ...+.+..-+.||. .|+.|.|+. +.+|... T Consensus 138 ~---~~--R~~v~~~DP~~~f~-~ed~d~~~~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt~ 211 (527) T protein:vir:10 138 E---GS--RLSLHEVDPSTYFP-YEDPRYPGQVLGVYLVDEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYTE 211 (527) T ss_pred c---CC--CceEeecCcceeee-eecCCCCCceeeEEEeeeccCCccccccceehhhhhhhhhcCcccccccCcceeeee Confidence 1 11 13566677666531 12222222333443 233333331 1111100 Q ss_pred ----CCccccC-----Cccccce-----eecc--ccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 225 ----PGDPTDM-----EQWKWGY-----EPAR--FDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQN 288 (556) Q Consensus 225 ----pgd~~~~-----~~~~~~r-----v~~~--~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~ 288 (556) +|++... ....+.. +... ..++--=|+|+-.--+++-+.|.|.|+=+|..+.-|+.-..-+... 
T Consensus 212 ~~w~lg~w~d~~e~p~~~~~~~~~~~~~~l~~lp~pi~fiPvV~~~t~p~~~~~WG~S~La~ll~l~deLn~~~Td~s~i 291 (527) T protein:vir:10 212 ELYEPGKWDDRPESPLEPDDIKKLSTLTEEEPLPEQITTLPVFHFRGHPIMNAMFGRSGLAGLESLIASVNQTMTDEDLI 291 (527) T ss_pred ceeeccccccccccccchhhhhhhcCceeeecccCCCCccceEeecCCCccccccChhhHhHHHHHHHHHhhhhhHHHHH Confidence 1111000 0000000 0000 0122223678766678999999999998888888876655555555 Q ss_pred HHHhcceeeeEeccCcccccccccccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccH Q lcl|NC_019524. 289 AVVNATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVG 368 (556) Q Consensus 289 a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f 368 (556) .+..+.-...++ ..+..+ . ........+.||+|.+|..|-++..++.......| T Consensus 292 s~~sG~Pi~~~t-g~~~vd---~----------------------~G~~~~~~VgPG~iweL~e~ak~~~v~~~~~la~~ 345 (527) T protein:vir:10 292 MVFGGLGFYATD-SAPPRD---S----------------------RGNMVPWTISPLGMVEHGQNNKIYRVNGVASLEPS 345 (527) T ss_pred HHHhCCceeeec-cccccc---c----------------------cCCcCccccCCceeEecCCCcceeeccchhhhHHH Confidence 444443333333 221100 0 00112346889999999999999999987778889 Q ss_pred HHHHHHHHHHHHHhcCCCHHHhhc--hhhcccchhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHH-HHHHHHHc-CCcc Q lcl|NC_019524. 369 TDYEQSLLRNIAASLGMSYEQFSR--DYTKTNYSSARASMAETQKYMDSRK-KLVADRFASAIYTL-WLEEEVNA-GNVP 443 (556) Q Consensus 369 ~~F~~~~lr~iaaglGi~ye~l~~--D~s~~nYSs~R~~~~e~~r~~~~~q-~~lv~~~~~pi~~~-~l~~a~l~-G~l~ 443 (556) ..++..+.+.|+...++|-..+ | |-++ =.|.=+-.++........| ++++..+.+.-|.. |+...+.. ..+. 
T Consensus 346 ~~h~~~L~~~l~~vA~~PavA~-G~vD~s~--~~SG~ALeL~L~PLlar~~rk~L~~~~vqrq~~~~~~~~~L~aye~v~ 422 (527) T protein:vir:10 346 QTHMTKAEEAMQQTKGIPDIAV-GVVDAAV--AESGIALDLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVG 422 (527) T ss_pred HHHHHHHHHHHHHhhcCCeeee-ccccCCc--CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcc Confidence 9999999999999999997655 5 5443 3333333444444454433 34544444433322 43332222 1111 Q ss_pred CCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHh----C-CCHHHHHHHHHHHHH Q lcl|NC_019524. 444 LPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRL----G-GDFREVFKQRAREEG 518 (556) Q Consensus 444 ~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~----G-~D~e~v~~q~a~E~~ 518 (556) +-+ .. ...-++|.|-+|- -.|-.+-++...+++++|+-|++..+++. | .|+++-++++..|++ T Consensus 423 ~~d-~~---------~~~~v~ivf~p~l--P~D~~avie~v~tL~~aGi~S~~tAv~~L~~~~g~eD~E~E~~~I~~era 490 (527) T protein:vir:10 423 IDD-AD---------KKLTVTITFRDPK--PVNSEKRFNQLLQLWEAGLIPAKKLTEELSKIMGFELTEEDFKQATEDKK 490 (527) T ss_pred cCC-Cc---------cccceEEEecccC--CCCHHHHHHHHHHHHHcCchhHHHHHHHHHhccCCCChHHHHHHHHHHHH Confidence 111 00 0112478887773 36999999999999999999999998776 5 599999999999887 Q ss_pred HHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 519 LIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 519 ~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 556 (556) .+...-.... ++. .+.. 
.++..-+++++.| T Consensus 491 ~~a~a~a~A~-~~~------~a~~-~~~~g~~~~~~d~ 520 (527) T protein:vir:10 491 TQGIAQAEAA-DPF------GAQM-AAEQGIPDEEDDQ 520 (527) T ss_pred HHhHHhhhhc-Cch------hhhh-ccccCCCCCCccc Confidence 7755433211 111 1111 1111122222222 No 97 >protein:vir:102239 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655205;genbank:gi:109522785;genbank:GeneID:4157478 Probab=99.47 E-value=4.5e-12 Score=82.80 Aligned_cols=452 Identities=10% Similarity=0.011 Sum_probs=236.4 Q ss_pred hhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHH--HHHHHHhcC--h-----------HHHHHHHHHHhhhc Q lcl|NC_019524. 27 PMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASA--RAQDMVQND--G-----------YAAGVVAVHRDSIV 91 (556) Q Consensus 27 ~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~--RaRdl~rNn--~-----------~a~~~v~~~~~nvV 91 (556) +-...+.|+.++.- +....-.|..-+... ..+|++ -=-+++.|| . ..+-+......++| T Consensus 1 ~~~~~~~~~~~~~~-~~g~~~~p~~v~~~d-----~~Rl~aY~l~~~~y~n~~~~~~~~lrg~~~~~~r~~~~ps~~~~~ 74 (527) T protein:vir:10 1 MGQDKRQYGSTQQL-RAGEANFPNAVTDFD-----KARLASYRLYEDMYLTNTSDYQVILRGGDEGDQRPIYVPNGEKLI 74 (527) T ss_pred CCccccccCCCcCc-CCccccCcccCCHHH-----HHHHHHHHHHHHHhcCchhheeeecCCccccccceeeehhhHHhh Confidence 22223456654432 111111233333332 222222 233556665 1 12334444557888 Q ss_pred cCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEEEeeccCCC Q lcl|NC_019524. 92 GSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLATCEWLNPTG 171 (556) Q Consensus 92 G~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~ 171 (556) |..-++....-.. .++.. .++++..-+.|++.. 
+++++-....+-.++-||...+++|-+..+ T Consensus 75 ~~~~~~~~~g~~~---~~~~~----~e~v~~~lr~~~~~e----------~l~~~~~~~~r~~~vlGDg~f~l~wD~~k~ 137 (527) T protein:vir:10 75 EAKMRFLGQGLKW---EFSKK----DAKVDDAIRVLFDRE----------NWEQKFESLKRWTEIRGDYVLLLIGDDEKD 137 (527) T ss_pred CCcceeeccCccc---cccch----hHHHHHHHHHHHHHh----------hhHHHHHHHHHhhhhhcceeEEEeeccCCC Confidence 8765554332110 11222 334566667787752 366666667777788899888999875332 Q ss_pred CcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEE--------------------------ECCCCCeEE-EEEeecC Q lcl|NC_019524. 172 TTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQ--------------------------LDNNGAALG-YWLRKAF 224 (556) Q Consensus 172 ~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE--------------------------~d~~Gr~va-Y~i~~~h 224 (556) . +. ...|..+||-.+-- ...+.+..-+.||. .|+.|.|+. +.+|... T Consensus 138 ~---~~--R~~v~~~DP~~~f~-~ed~d~~~~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt~ 211 (527) T protein:vir:10 138 E---GS--RLSLHEVDPSTYFP-YEDPRYPGQVLGVYLVDEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYTE 211 (527) T ss_pred c---CC--CceEeecCcceeee-eecCCCCCceeeEEEeeeccCCccccccceehhhhhhhhhcCcccccccCcceeeee Confidence 1 11 13566677666531 12222222333443 233333331 1111100 Q ss_pred ----CCccccC-----Cccccce-----eecc--ccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 225 ----PGDPTDM-----EQWKWGY-----EPAR--FDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQN 288 (556) Q Consensus 225 ----pgd~~~~-----~~~~~~r-----v~~~--~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~ 288 (556) +|++... ....+.. +... ..++--=|+|+-.--+++-+.|.|.|+=+|..+.-|+.-..-+... 
T Consensus 212 ~~w~lg~w~d~~e~p~~~~~~~~~~~~~~l~~lp~pi~fiPvV~~~t~p~~~~~WG~S~La~ll~l~deLn~~~Td~s~i 291 (527) T protein:vir:10 212 ELYEPGKWDDRPESPLEPDDIKKLSTLTEEEPLPEQITTLPVFHFRGHPIMNAMFGRSGLAGLESLIASVNQTMTDEDLI 291 (527) T ss_pred ceeeccccccccccccchhhhhhhcCceeeecccCCCCccceEeecCCCccccccChhhHhHHHHHHHHHhhhhhHHHHH Confidence 1111000 0000000 0000 0122223678766678999999999998888888776655555555 Q ss_pred HHHhcceeeeEeccCcccccccccccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccH Q lcl|NC_019524. 289 AVVNATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVG 368 (556) Q Consensus 289 a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f 368 (556) .+..+.-...++ ..+..+ . ........+.||+|.+|..|-++..++.......| T Consensus 292 s~~sG~Pi~~~t-g~~~vd---~----------------------~G~~~~~~VgPG~iweL~e~ak~~~v~~~~~la~~ 345 (527) T protein:vir:10 292 MVFGGLGFYATD-SAPPRD---S----------------------RGNMVPWTISPLGMVEHGQNNKIYRVNGVASLEPS 345 (527) T ss_pred HHHhCCceeeec-cccccc---c----------------------cCCcCccccCCceeEecCCCcceeeccchhhhHHH Confidence 444443333333 221100 0 00112346889999999999999999987778889 Q ss_pred HHHHHHHHHHHHHhcCCCHHHhhc--hhhcccchhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHH-HHHHHHHc-CCcc Q lcl|NC_019524. 369 TDYEQSLLRNIAASLGMSYEQFSR--DYTKTNYSSARASMAETQKYMDSRK-KLVADRFASAIYTL-WLEEEVNA-GNVP 443 (556) Q Consensus 369 ~~F~~~~lr~iaaglGi~ye~l~~--D~s~~nYSs~R~~~~e~~r~~~~~q-~~lv~~~~~pi~~~-~l~~a~l~-G~l~ 443 (556) ..++..+.+.|+...++|-..+ | |-++ =.|.=+-.++........| ++++..+.+.-|.. |+...+.. ..+. 
T Consensus 346 ~~h~~~L~~~l~~vA~~PavA~-G~vD~s~--~~SG~ALeL~L~PLlar~~rk~L~~~~Vqrq~~~~~~~~~L~aye~v~ 422 (527) T protein:vir:10 346 QTHMNKAEEAMQQTKGIPDIAV-GVVDAAV--AESGIALDLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVG 422 (527) T ss_pred HHHHHHHHHHHHHhhcCCeeee-ccccCCc--CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcc Confidence 9999999999999999997655 5 5443 3333333444444454433 34544444433322 43332222 1111 Q ss_pred CCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHh----C-CCHHHHHHHHHHHHH Q lcl|NC_019524. 444 LPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRL----G-GDFREVFKQRAREEG 518 (556) Q Consensus 444 ~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~----G-~D~e~v~~q~a~E~~ 518 (556) +-+ .. ...-++|.|-+|- -.|-.+-++...+++++|+-|++..+++. | .|+++-++++..|++ T Consensus 423 ~~d-~~---------~~~~v~ivf~p~l--P~D~~avie~v~tL~~aGiiS~etAv~~L~~~~g~eD~E~E~~~I~~era 490 (527) T protein:vir:10 423 IDD-AD---------KKLTVTITFRDPK--PVNNEKRFAQLLELWEAGLIPAKKLTEELSKIMGFELTEEDFRQATEDKK 490 (527) T ss_pred cCC-Cc---------cccceEEEecccC--CCCHHHHHHHHHHHHHcCchhHHHHHHHHHhccCCCchHHHHHHHHHHHH Confidence 111 00 0112478887773 36999999999999999999999998776 5 599999999999987 Q ss_pred HHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 519 LIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 519 ~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 556 (556) .+...-.... ++. .+.. 
.++..-+++++.| T Consensus 491 ~~a~a~a~a~-~~~------~a~~-~~~~g~~~~~~d~ 520 (527) T protein:vir:10 491 TQGIAQAEAA-DPF------GAQM-AAEQGIPDEEDDQ 520 (527) T ss_pred HHhHHhhhhc-Cch------hhhh-ccccCCCCCCccc Confidence 7755433211 111 1111 1111122222222 No 98 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=99.45 E-value=6.1e-12 Score=82.06 Aligned_cols=446 Identities=10% Similarity=-0.006 Sum_probs=207.5 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAA 80 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~ 80 (556) |++..-..-. +.-...... ...+-.. ...|.-+....+.+ +.... +.++...+-++. .|++ T Consensus 23 ~~~~~~~~l~--~~l~~~~~~-~~~rl~~-l~~YY~G~~~~~~~----~~~~~---------~~~~~~~~~~v~--n~~~ 83 (501) T protein:vir:25 23 MSREQLGALV--ADMWRLHIS-ERQWLDR-IYEYTKGLRGRPEV----PEGAS---------DEVKELAKLSVK--NVLS 83 (501) T ss_pred CChHHHHHHH--HHHHHHHHH-HHHHHHH-HHHHHhcCCCchhc----cccCC---------hhhhhhHhhhhc--ChHH Confidence 3332221111 111100000 0011111 12233222211111 11111 112222222222 4999 Q ss_pred HHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCce Q lcl|NC_019524. 81 GVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEV 160 (556) Q Consensus 81 ~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~ 160 (556) -+|+.+++.++..||+.. + ++. ++ .+|+-|.. 
+ +|...+..+.+..++-|.+ T Consensus 84 ~ivd~~a~~l~~~gf~~~---d-------~~~----~~---~l~~i~~~-----N------~~d~~~~~~~~~a~i~G~a 135 (501) T protein:vir:25 84 LVRDSFAQNLSVVGYRNA---L-------AKE----ND---PAWEMWQR-----N------RMDARQAEVHRPALTYGAS 135 (501) T ss_pred HHHHHHHhhhcccceecC---C-------ccc----hH---HHHHHHHh-----c------ChhHHHHHHHHHHhhcCce Confidence 999999998888888643 1 111 22 23443432 1 4777788889999999999 Q ss_pred EEEEeeccCCCCcCCCcccceEEEEEchhhcCCCC-CCCCCceEEEEEEE----CCCCCeEEE------EEeecCCCccc Q lcl|NC_019524. 161 LATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPN-NVMDTPNLRSGVQL----DNNGAALGY------WLRKAFPGDPT 229 (556) Q Consensus 161 f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~-~~~~g~~i~~GIE~----d~~Gr~vaY------~i~~~hpgd~~ 229 (556) |... +... ++ + .+..++|..+-.=+ +.....++..+|.+ +..+....+ +++..+..+.. T Consensus 136 y~~v-~~de-----~~---~-~i~~~sp~~~~~iy~D~~~~~~~~~ai~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~ 205 (501) T protein:vir:25 136 YVTV-TPTD-----EG---P-VFRTRSPRQILAVYADPSVDAWPQYALETWVAQKDAKPHRRGVLYDDTYMYELDLGEVV 205 (501) T ss_pred EEEE-ecCC-----CC---C-eEEEeccccEEEEEecCCCCcceeEEEEEEeeccccCcceeEEEecCeeEEEEecCcee Confidence 9765 3321 11 2 58889998884222 22233345666653 222221111 12222221110 Q ss_pred cC--Cccccceeecc--------------ccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019524. 230 DM--EQWKWGYEPAR--------------FDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNA 293 (556) Q Consensus 230 ~~--~~~~~~rv~~~--------------~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A 293 (556) .. ....|...+.. ..++.==|+|+....+. 
..+|.|.|.+++..+..++.-.--.+..+...| T Consensus 206 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPiv~f~N~~~~-~~~g~sdie~v~~l~Da~~~~~s~~~~~~e~~a 284 (501) T protein:vir:25 206 LGDAGGGQATQQPVNVREVTDVIEHGATFEGKPVCPVVRFVNGRDA-DDMIVGEVAPLILLQQAINSVNFDRLIVSRFGA 284 (501) T ss_pred eeeccccccccccccccccccccccccccCCccceeeEeccCcccc-CccccchhhhhHHHHHHHHHHHHHHHHHHHhhc Confidence 00 00000000000 00111126665554443 457999999877555544443322222222211 Q ss_pred ceeeeEeccCcccccccccccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCC-CccHHHHH Q lcl|NC_019524. 294 TYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTP-GGVGTDYE 372 (556) Q Consensus 294 ~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p-~~~f~~F~ 372 (556) .=-.+|+ |...+ ......+..|.|..++ |+++++.+-+.. -.+|...+ T Consensus 285 ~p~~~i~------------G~~~~------------------~~~~~~~~~~~i~~~~-~~~~~~~q~~~~~~~~~~~~l 333 (501) T protein:vir:25 285 NPQRVIS------------GWTGS------------------KAEVLKASALRVWTFE-DPEVKAQAFPPASVEPYNLIL 333 (501) T ss_pred cHHHHHh------------CCCCC------------------ccchhhhcccceeccC-CCCceEEEecccChHHHHHHH Confidence 1101111 10000 0011234566666664 555665543322 24577889 Q ss_pred HHHHHHHHHhcCCCHHHhhchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCccccc Q lcl|NC_019524. 373 QSLLRNIAASLGMSYEQFSRDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRM 452 (556) Q Consensus 373 ~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~ 452 (556) +..++.|++-.++|-+.+.+..++.|--+++..+.......+..|..|-..+.+ +++.. .++..+... .. 
T Consensus 334 ~~~i~~i~~~s~~P~~~~~~~~~N~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~-~~rl~--~~~~~~~~~----~~--- 403 (501) T protein:vir:25 334 EEMLQHVAMVAQISPAQVTGKMINVSAEALAAAEANQQRKLAAKRESFGESWEQ-LLRLA--AEMDDDPDT----AA--- 403 (501) T ss_pred HHHHHHHHhhcCCChhhhccccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHH--HHHhCCCcc----cc--- Confidence 999999999999999988776555555677777777777777777777776654 44432 233332211 00 Q ss_pred ccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHh-CCCHHHHHHHHHHHHHHHHHcCCCCCccc Q lcl|NC_019524. 453 FYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRL-GGDFREVFKQRAREEGLIKSLKLDFTGKM 531 (556) Q Consensus 453 ~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~-G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~ 531 (556) ..-+.+.|..+... +....+.+..+.+.+|+ |.+..+.+. |.++.++ +++.++++.-+..++...... T Consensus 404 -------~~~i~v~w~~~~~~--s~~~~ada~~kl~~~gi-s~et~~~~~~g~~~~~i-e~~~~~~~e~~~~~~~~~~~~ 472 (501) T protein:vir:25 404 -------DSGAEVLWRDTEAR--SFGAVVDGITKLASAGI-PIEHLLSMVPGMTQQTI-QAIKDSLRGGEVKSLVDKLLS 472 (501) T ss_pred -------ceeeeEEecCCCCC--CHHHHHHHHHHHHhcCC-CHHHHHHHcCCCCHHHH-HHHHHHHHHHhHHHHHHHhhc Confidence 01246788776654 55777888888888887 667777775 9998774 223333222222222110001 Q ss_pred cccCCCCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 532 VEGNSTQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 532 ~~~~~~~~~~~~~~~~~~~~~e~~~ 556 (556) ..+.+....+.++....+++.+++- T Consensus 473 ~~~~~~~~~~~~~~~~~~~~~~~~~ 497 (501) T protein:vir:25 473 NEPAPVPPPPPQAAAQALNEGGVNG 497 (501) T ss_pred cCcCCCCCCCCCCCccccccccCCC Confidence 1111111222222222222222222 No 99 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=99.44 E-value=1.5e-11 Score=79.91 Aligned_cols=434 Identities=13% Similarity=0.095 Sum_probs=207.9 Q ss_pred hcccchhhhhhhh-----------cchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHH Q lcl|NC_019524. 
18 VVAETATATPMAV-----------GGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVVAVH 86 (556) Q Consensus 18 ~~~~~~~~~~~~~-----------~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v~~~ 86 (556) +....-....... ...|.-+.. ....+ ...... .. +++-.-..|++-+|+.. T Consensus 1 ~~t~~d~i~~L~~~~~~~~~r~~~~~~Yy~G~~---~i~~~---~~~~~~----~~-------~~~~~~~n~~~~ivd~~ 63 (480) T protein:vir:78 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTR---RLKTI---GIGAPP----EL-------AYLDVQPGWVATYLRTL 63 (480) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccc---cchhc---ccccch----hh-------hhhhhhcchHHHHHHHH Confidence 2222211111100 011221111 11111 001111 11 11112356899999999 Q ss_pred HhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEEEee Q lcl|NC_019524. 87 RDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLATCEW 166 (556) Q Consensus 87 ~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~ 166 (556) ++.+++.||+.. + +++..+.+...|+ . .+|...+..+++..+.-|.+|..+-. T Consensus 64 ~~~l~~~g~~~~---~----------d~~~~~~l~~i~~---~-----------N~~~~~~~~~~~~a~~~G~ay~~v~~ 116 (480) T protein:vir:78 64 SDRLDIEGFRIS---E----------DSEGLEELWNWWQ---A-----------NDLDEESVLGHDDSLTFGRAYITVSH 116 (480) T ss_pred HhhhccCceecC---C----------CchhHHHHHHHHH---h-----------cCHHHHHHHHHHHHhhcCceEEEeec Confidence 999999998642 1 1122344444333 2 15777888999999999999977532 Q ss_pred ccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEE----ECCCCCeEEEEEeecCCCcccc-----CCccccc Q lcl|NC_019524. 167 LNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQ----LDNNGAALGYWLRKAFPGDPTD-----MEQWKWG 237 (556) Q Consensus 167 ~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE----~d~~Gr~vaY~i~~~hpgd~~~-----~~~~~~~ 237 (556) ..... .+.... .++..++|..+-.-++......+..+|. .|..|...-+.++.. +..+. .....|. 
T Consensus 117 ~~~~~--~d~~~~-~~i~~~~p~~~~~i~D~~~~~~~~~~i~~~~~~d~~~~~~~~~~y~~--~~~~~~~~~~~~~~~~~ 191 (480) T protein:vir:78 117 PDVES--GDPAGI-PLIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLP--DETVPLRRNGGLNDQWV 191 (480) T ss_pred Ccccc--CCCCCe-eEEEEEcccceEEEEcCCCccceEEEEEEEEeecCCcceEEEEEEeC--CeEEEEEecCCCccccc Confidence 11110 111122 4688999988753333333334455554 345554433223222 11110 0011111 Q ss_pred --eeeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhccee---eeEeccCccccccccc Q lcl|NC_019524. 238 --YEPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYA---ASVESELPSDVVFGQL 312 (556) Q Consensus 238 --rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~---~fi~~~~~~~~~~~~~ 312 (556) .-+....+|.-=|+|+....+.+..-|+|.|.+-|..| ++.|.......+...-.++ .+|+-...+.. T Consensus 192 ~~~~~~~~~~g~vPvv~f~n~~~~~~~~G~sdi~~~i~~l--~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~----- 264 (480) T protein:vir:78 192 VDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKV--TDAASRTLMNLQSASQILGTPLRVISGVTTDEL----- 264 (480) T ss_pred ccccccccCCCCcceEEeecccccCCccCccchhHHHHHH--HHHHHHHHHHHHHHHHhhcchhhhhhCCCcccc----- Confidence 11111224444578888888888888999998754444 3344444444444433222 22221100000 Q ss_pred ccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCC-CccHHHHHHHHHHHHHHhcCCCHHHhh Q lcl|NC_019524. 313 GMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTP-GGVGTDYEQSLLRNIAASLGMSYEQFS 391 (556) Q Consensus 313 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p-~~~f~~F~~~~lr~iaaglGi~ye~l~ 391 (556) . ...+.......+|.+..+. |+++++.+.+.. -.+|...++..+..++...++|.+.|. 
T Consensus 265 -------~------------~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~fg 324 (480) T protein:vir:78 265 -------T------------NDGENTTLDIYYGRILTLA-SEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLS 324 (480) T ss_pred -------c------------cccccchhhhhhhhhccCC-CCCceEEecCccCHHHHHHHHHHHHHHHhcccCCCHHHhc Confidence 0 0001112234566666665 556776664432 245777778899999999999999887 Q ss_pred chhhcccch---hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeee Q lcl|NC_019524. 392 RDYTKTNYS---SARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWI 468 (556) Q Consensus 392 ~D~s~~nYS---s~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~ 468 (556) ++. .|.| +++..+.......+..|..|-..+-+ +++. -.+++.+..+. . ..-+.+.|. T Consensus 325 ~~~--~n~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~-~~rl--~~~~~~~~~~~-~-------------~~~i~v~w~ 385 (480) T protein:vir:78 325 SSS--ENPASAEAIIATDSRIVKMAERKGRIFGGAWER-AMRI--AMQIMGREVTE-E-------------YTRLETVWR 385 (480) T ss_pred ccc--CchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHH--HHHHcCCCccc-c-------------ceeeeEEec Confidence 653 3444 44555555555556666666555433 3332 22343332211 0 011467887 Q ss_pred cCcccccchhhhhHHHHHHHHcC--CCCHHHHHHHhCCCHHHHHH--HHHHHHHH--HHHcCCCCCccccccCCCCCCCC Q lcl|NC_019524. 469 GASRGQIDEKKETEAAILRIKNG--LSTYEAEISRLGGDFREVFK--QRAREEGL--IKSLKLDFTGKMVEGNSTQSSNS 542 (556) Q Consensus 469 ~p~~~~iDP~Ke~~A~~~~i~~G--~~s~~~~~ae~G~D~e~v~~--q~a~E~~~--~~~~Gl~~~~~~~~~~~~~~~~~ 542 (556) .|... +-...+.+..+.+.+| +.|.+.+....|.+.+++-+ ++.++... ...+.-+. .+..... T Consensus 386 ~~~~~--s~~~~ad~~~kl~~~g~~~~s~et~~~~lg~~~d~~~e~~~~~~~~~~~~~~~~~~~~--------~~~~~~~ 455 (480) T protein:vir:78 386 DPSTP--TVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTT--------KAQADAT 455 (480) T ss_pred CCCCC--CHHHHHHHHHHHHHhcccCCCHHHHHhcCCCCHhHHHHHHHHHHHHHHHHHHHhhccc--------cCCCccc Confidence 77655 4556677777777766 66776666777988665533 22222211 11111110 1111111 Q ss_pred CCCCCCCCCCcCCC Q lcl|NC_019524. 
543 SESTSDNPNEETTQ 556 (556) Q Consensus 543 ~~~~~~~~~~e~~~ 556 (556) +++...++++++.+ T Consensus 456 ~~~~~~~~~~~~~~ 469 (480) T protein:vir:78 456 PKPTVTETKTETQT 469 (480) T ss_pred cCCCCCCCCCccCC Confidence 11111122223332 No 100 >protein:vir:107742 Length: 537 # NCBI annotation: gp28 # Family: family:all:297 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024875;genbank:gi:48697517;genbank:GeneID:2948359 Probab=99.44 E-value=8.9e-13 Score=86.66 Aligned_cols=461 Identities=11% Similarity=0.013 Sum_probs=205.3 Q ss_pred CCcch----hhhHHHHHhhHh-hcc-cchhh--hhh----hhcchhccc------------cCCCcccccccCCCCCHHH Q lcl|NC_019524. 1 MKDVK----KTTRTRAKKAVD-VVA-ETATA--TPM----AVGGGMEGA------------ERTTREMFQWNPSIISPDQ 56 (556) Q Consensus 1 ~sp~~----~~~r~~a~~a~~-~~~-~~~~~--~~~----~~~~~y~aa------------~~~~r~~~~w~~~~~s~~~ 56 (556) |+... +++...+..... .++ +.... ... ....+-++. .........|.....=.. T Consensus 25 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 103 (537) T protein:vir:10 25 VGIFGAGDDEKPFTRAQLVHQTMMAIRDHAIAMMPKVDGSHPDMAMDGLDVEGGTFSAYANPNLSEGLVLWYAQQAFIG- 103 (537) T ss_pred cCCCcccchhhHHHHHHhhhhccCCCCCccCcccccccccccchhccccccchhhhhhhccccccchhhhhccccCCcc- Confidence 22221 111111111100 000 00000 000 000011110 000001112222221111 Q ss_pred HHHHHHHHHHHHHHHHHhcChHHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhccccccee Q lcl|NC_019524. 57 QIAQNQDMASARAQDMVQNDGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFD 136 (556) Q Consensus 57 ~i~~~~~~lr~RaRdl~rNn~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD 136 (556) . .--.+++-|++++.+|+.+++..+-.|+.+...-+ .+...+..+.+++.|++.. 
T Consensus 104 -----~-----~l~a~Y~~~~l~r~iVd~~A~d~~r~~~~i~~~~~-------~~~~~~~~~~l~~~~~~l~-------- 158 (537) T protein:vir:10 104 -----H-----QMCALIATHWLVNKACSQMPRDAMRKGYKIISDDG-------NELDPKDAKFIDRYDRAFN-------- 158 (537) T ss_pred -----H-----HHHHHHHhCchhhhhhhhhhHHhhcCCceeecCCc-------ccccHHHHHHHHHHHHHhh-------- Confidence 1 12246889999999999999999999998865311 1112233455555555421 Q ss_pred hhcccCHHHHHHHHhhhheecCceEEEEeec-cCCCCcCCC------cccce-EEEEEchhhcCCCCCCCCCceEEEEEE Q lcl|NC_019524. 137 ARRMCTLTGLTRLAVSGFLMTGEVLATCEWL-NPTGTTMQR------RPFGT-AIQMISPYRMSNPNNVMDTPNLRSGVQ 208 (556) Q Consensus 137 ~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~-~~~~~~~~~------~~~~l-~lq~ie~drl~~~~~~~~g~~i~~GIE 208 (556) -..-+...+..+++- |...+.+... ......+.. .+-.+ .|.+|+|..+...... .+..-.- T Consensus 159 -----~~~~l~~a~~~~rly-G~~~i~i~v~~~D~~~~~~Pl~~~~i~kg~~k~l~vidp~~~~~~~~~----~~~~dp~ 228 (537) T protein:vir:10 159 -----IKKHAIQFVRKGRIF-GIRIALFKVDSPDPYYYEKPFNIDGVMPGAYKGIVQIDPYWCAPLLDA----QASSNPV 228 (537) T ss_pred -----HHHHHHHHHHhcccc-cceEEEEeecCcCCcccccccccccccccceeEEEEechhhcccccch----hhhccCC Confidence 112233344444554 4444333221 111000000 00001 2445666555311000 0000011 Q ss_pred ECCCCCeEEEEEeecCCCccccCCccccceeeccccCChhHeEeeecccCCCccc------CCchhhHHHHHHHHHHHHH Q lcl|NC_019524. 209 LDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPARFDWGRRRVIHIIEALLAGQTR------GISEMVSALKQMKMTRNFQ 282 (556) Q Consensus 209 ~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~R------Gvs~la~~l~~l~~l~~~~ 282 (556) =..+|+|..|+|. ...|+.+.|||+-...-|...+ |+|.|-+++..|++++.-. T Consensus 229 sp~fg~P~~y~v~--------------------g~~iH~SRli~f~g~~~p~~~~~~~~~~G~Svlq~~~~~l~~~~~t~ 288 (537) T protein:vir:10 229 SMHFYEPTYWLIN--------------------GKKYHRSHLAIYINDEVVDFLKPSYIYGGVPLPQQIMERVYAAERTA 288 (537) T ss_pred ccccCCceeeeec--------------------CeEecceeEEEecCCCCchhhhcccCcccccHHHHHHHHHHHHHHHH Confidence 1246888888762 1246778898875444455544 9999999998888765544 Q ss_pred HHHHHHHHHhcceeeeEeccCcccccccccccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCC Q lcl|NC_019524. 
283 EITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAG 362 (556) Q Consensus 283 dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~ 362 (556) .....-..- +.+. +++.+.... ... .+ ......... . . ..=.-|++..-..+++++.++. T Consensus 289 ~~~~~l~~~-~~~~-v~k~~~~~~-----l~~--~~---~~~~r~~~~----~--~-~r~n~g~~~id~e~e~~e~~~~- 348 (537) T protein:vir:10 289 NEGPMLAMT-KRQT-VLKVDAAQV-----LAN--KQ---QFDETMSWW----T--A-TRDNYQVRVVDKDNEDVVQIDT- 348 (537) T ss_pred HHHHHHHHh-cCCc-eeeechHHh-----hcC--HH---HHHHHHHHH----H--h-hcCCcceeEecCCCceeEEEec- Confidence 433222221 2222 223221110 000 00 000000000 0 0 0002345555556799998884 Q ss_pred CCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCC Q lcl|NC_019524. 363 TPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNY-SSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGN 441 (556) Q Consensus 363 ~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nY-Ss~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~ 441 (556) +-++..+........||+.+|||...|.|--. ..+ |+.-.-+..+...++.+|+.| +|+.+..++..+.+.- T Consensus 349 -~lsgl~~~l~~~~~~iAa~~~IP~t~L~G~sp-~GlnatGe~D~~~yyd~I~~~Qe~l-----~p~l~~l~~ll~~~~~ 421 (537) T protein:vir:10 349 -TLNDLDKVIMNQYQLVCAIARTPAPKMLGTVP-TGFNSTGDYEEASYHEECESTQDDM-----RPLIDRHHQLVCRSHL 421 (537) T ss_pred -cCCCHHHHHHHHHHHHHhhhCCCceeeccCCc-cccccchhHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHhcC Confidence 45578999999999999999999998877321 233 556777888888999998753 3444444433333322 Q ss_pred ccCCCCcccccccchhhHHHhhCeeeecCcccccch-hhhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHH--HH Q lcl|NC_019524. 442 VPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDE-KKETEAAILRIKNGLSTYEAEISRLGGDFREVFKQRARE--EG 518 (556) Q Consensus 442 l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP-~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E--~~ 518 (556) .+ +.++.+. ++..|.+-..+.+|- .|-+++....+++|+.|+.++....+.+++.....+-.. .. 
T Consensus 422 ~~-~~~~~i~-----------f~pL~~~s~kEkAei~~~~a~a~~~~~~~G~i~~~Evr~~L~~~~~~g~~~l~~~~~~e 489 (537) T protein:vir:10 422 RK-RIRVKVE-----------FPPMDAPKESERADTFLKKMQAAKLAFEMGAVDGVDVNEYLRMDPTLGFTSITPAMRPT 489 (537) T ss_pred CC-CcceEEE-----------eCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHhccCccccccccCCCChh Confidence 22 2221110 112333444444442 556688999999999999999888887766544433211 11 Q ss_pred HHHHcCCCCCccccccCCCCC-------CCCCCCCCCCCCCcCCC Q lcl|NC_019524. 519 LIKSLKLDFTGKMVEGNSTQS-------SNSSESTSDNPNEETTQ 556 (556) Q Consensus 519 ~~~~~Gl~~~~~~~~~~~~~~-------~~~~~~~~~~~~~e~~~ 556 (556) ..+++.+.....+......+. .....++.+++.+-.+. T Consensus 490 d~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~ 534 (537) T protein:vir:10 490 DAEDIDVDDEGKPVRIIEDQPAPSEMFGATSSGESANDPRDSGAA 534 (537) T ss_pred hhhcccCCccCCcCCCCCCCCCccccCCCCccccccCCCccCccc Confidence 112222221111111111111 11111111111110000 No 101 >protein:vir:1082 Length: 359 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076736;genbank:gi:13095846;genbank:GeneID:920394 Probab=99.42 E-value=2.6e-13 Score=89.54 Aligned_cols=353 Identities=10% Similarity=0.110 Sum_probs=191.6 Q ss_pred chhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHH Q lcl|NC_019524. 4 VKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVV 83 (556) Q Consensus 4 ~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v 83 (556) |.=..+ +.. +......+|.+...+.... .....+. .+. 
+..++-+-.+| T Consensus 1 M~~~~~------------------------f~~--r~~~~~~~~~~~~~~~~~~---~~~~~v~-~~~-al~~~av~~cv 49 (359) T protein:vir:10 1 MSILNP------------------------FER--RSSITPNNYYPFMVQNGSI---VPNSLVD-ATE-ALKNSDLYAVT 49 (359) T ss_pred Ccccch------------------------hhc--cccCCCCcchhhhhccccc---cCCcccC-HHH-hhcchHHHHHH Confidence 111111 110 0000011121111000000 0000011 011 22345567899 Q ss_pred HHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEE Q lcl|NC_019524. 84 AVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLAT 163 (556) Q Consensus 84 ~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~ 163 (556) +.|.+.|-+.=|+ + . ........+|+ -.+|..++-...+..++..|++|+. T Consensus 50 ~~ia~~ia~~p~~-----~-------~-----------~~~~~L~~~PN------~~~t~~~f~~~~~~~lll~Gnay~~ 100 (359) T protein:vir:10 50 SLISSDIAGTRFI-----G-------N-----------QVFTSVLNNPS------HLTNAFSFWQTAILNLLLNGNVFLA 100 (359) T ss_pred HHHHHhhhcCccc-----c-------c-----------hHHHHHhhccc------ccCCHHHHHHHHHHhccccCceEEE Confidence 9998887754221 0 0 11122233454 2478889999999999999999988 Q ss_pred EeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceeeccc Q lcl|NC_019524. 164 CEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPARF 243 (556) Q Consensus 164 ~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~~~ 243 (556) +++.. ...+..|..|.|+.+. |+.+.. .+.|++.....+. .. T Consensus 101 i~r~~--------~g~~~~l~~l~~~~v~--------------i~~~~~--~~~y~~~~~~~~~--------------~~ 142 (359) T protein:vir:10 101 ILKGD--------NSLMKELRLIPSNAIT--------------IDLTDD--TLTYEVNQFDDYP--------------SA 142 (359) T ss_pred EEECC--------CCeEEEEEEeCCceEE--------------EEEcCC--eEEEEEEecCCce--------------EE Confidence 76432 1245678888887763 222222 2557664322111 12 Q ss_pred cCChhHeEeeeccc----CCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccc Q lcl|NC_019524. 
244 DWGRRRVIHIIEAL----LAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGF 319 (556) Q Consensus 244 ~v~a~~viH~f~~~----r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~ 319 (556) .+++.+|||+.... -.+...|+|++..+...+.......+....-.+=.+.-.++++.+.+.. .++. T Consensus 143 ~~~~~evih~~~~~~~~~~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~l---------~~e~ 213 (359) T protein:vir:10 143 KYNASEMIHVKIMAYGVDTLHNLVGHSPLESLTSEIGQQKEANRLSLSTLKGALNPTSVVKVPQGTL---------SSEA 213 (359) T ss_pred EEcccceEEeccCCCCCCccCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCC---------CHHH Confidence 46788999997653 2366789999888777776666655555555555666778888653211 0011 Q ss_pred cccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchh--hcc Q lcl|NC_019524. 320 KEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDY--TKT 397 (556) Q Consensus 320 ~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~--s~~ 397 (556) .+........ . .+ .-..|.+..|+.|.+++.++.+.-..+|.+-.+.....||..+|||-+.| ++. ++. T Consensus 214 ~~~~~~~~~~---~-~~----~~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~l-g~~~~~~~ 284 (359) T protein:vir:10 214 KDSIRKEFEK---A-NG----GNNSGRVMVLDQSADFSTVSINADVANYLNSMNWGRTQIAKAFGVSDSYL-NGTGDQQS 284 (359) T ss_pred HHHHHHHHHH---H-hC----ccccCCceecCCCcceeeecCCHHHHHHHHHHHHHHHHHHHHhCCCHHHh-CCCCcccc Confidence 1101110000 0 00 01346788899999999998766666788888999999999999999977 332 345 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccch Q lcl|NC_019524. 398 NYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDE 477 (556) Q Consensus 398 nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP 477 (556) +|+++++-... ++...++|+-++ |+.++.+. +.+..+ .....|. 
T Consensus 285 ~~~~~e~~~~~-----------~l~~~l~p~~~~-l~~~l~~~-~~~~~~-----------------------~~~~~d~ 328 (359) T protein:vir:10 285 SLDQIKDLYVN-----------ALNRFIEPLISE-LRIKCDSS-IGVDMS-----------------------PITDYSN 328 (359) T ss_pred cHHHHHHHHHH-----------HHHHHHHHHHHH-HHHHhhhh-hcccch-----------------------hhhhcCH Confidence 66665443322 234444554433 33333322 211111 0011244 Q ss_pred hhhhHHHHHHHHcCCCCHHHHHHHhCCCHHH Q lcl|NC_019524. 478 KKETEAAILRIKNGLSTYEAEISRLGGDFRE 508 (556) Q Consensus 478 ~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~ 508 (556) ..........+++|+.|+-|+-+..|..|-= T Consensus 329 ~~~~~~~~~~~~~G~~t~NE~R~~l~~~pv~ 359 (359) T protein:vir:10 329 SVFKADILNWVKEGIIEPTEAKTLLESKGII 359 (359) T ss_pred HHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Confidence 4444555678999999999988877877755 No 102 >protein:vir:95378 Length: 406 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764474;genbank:gi:115334628;genbank:GeneID:5179265 Probab=99.41 E-value=8.7e-13 Score=86.71 Aligned_cols=403 Identities=11% Similarity=-0.051 Sum_probs=208.6 Q ss_pred chhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHH Q lcl|NC_019524. 4 VKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVV 83 (556) Q Consensus 4 ~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v 83 (556) |.=..+.+- . ......+. .......|....... ...-+...+..++.+..+| T Consensus 1 Mg~f~~~~~--~----~~~~~~~~------------~~~~~~~~~~~~~~~----------~~~~~~~~~~~~~~v~~~i 52 (406) T protein:vir:95 1 MGLFDRWRR--T----KRKSKIRA------------DTGYVGLFMSGEDVS----------FLVPGYVRLSDNPEVRMAV 52 (406) T ss_pred Ccchhhhcc--c----cccccccc------------cchhhhhhccCcccC----------ccccCHHHHhhcHHHHHHH Confidence 222222110 0 00000000 000001111100000 0000111233579999999 Q ss_pred HHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEE Q lcl|NC_019524. 
84 AVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLAT 163 (556) Q Consensus 84 ~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~ 163 (556) +.+.+.+...-|.+.-+.+. + .+... ......+..+|+ -.+|.+++....+..++..|++++. T Consensus 53 ~~ia~~ia~~~~~~~~~~~~------~--~~~~~---~~~~~~l~~~PN------~~~t~~~f~~~~~~~~ll~g~g~a~ 115 (406) T protein:vir:95 53 HKIADLISSMTIYLMQNTED------G--DIRIR---NELSRKIDITPY------SLMTRKSWMYNIVYTMLLDGEGNSV 115 (406) T ss_pred HHHHHhhccCceEEEEecCC------c--ceeec---chHHHHHhhccC------CCCCHHHHHHHHHHHHHhcCCceEE Confidence 99999999877766433221 0 00111 123344556665 2478999999999999999887554 Q ss_pred EeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceeeccc Q lcl|NC_019524. 164 CEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPARF 243 (556) Q Consensus 164 ~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~~~ 243 (556) +...+. +...+..|..|+|+.+. |+.+..| |.+... .. T Consensus 116 ~~~~~~------~~g~~~~l~~i~~~~v~--------------~~~~~~~----~~~~~~------------------~~ 153 (406) T protein:vir:95 116 VFPKYT------ADGLIDELVPLTPSKVN--------------FLDTPDG----YQVLYG------------------GQ 153 (406) T ss_pred EEEEEC------CCCcEEEEEEEcCceeE--------------EEEcCCe----EEEEec------------------cE Confidence 332221 11235678888888774 2233333 333210 01 Q ss_pred cCChhHeEeeecc-cCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCccccccccccccccccccc Q lcl|NC_019524. 244 DWGRRRVIHIIEA-LLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEI 322 (556) Q Consensus 244 ~v~a~~viH~f~~-~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~ 322 (556) .+++.+|||+... .......|+|++..+...+.-.....+......+-.+...++|+.+..-.. +.... 
T Consensus 154 ~~~~~evih~~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~l~~----------e~~~~ 223 (406) T protein:vir:95 154 TFNYDEVLHFIYNPDPERPYIGRGYRVVLKDIADNLKQATATKKSFMSGKYMPSLIVKVDAATAE----------LSSEE 223 (406) T ss_pred EEchhHEEEeeccCCCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCH----------HHHHH Confidence 3567899999864 444567899999998888887777777777777777888888887542110 00000 Q ss_pred ccccccccccccccccceecCCc-eeeecCCCceeeeecC-CCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccch Q lcl|NC_019524. 323 FNEYMTGLANYVAQTKNIAIDGA-KIPHLYPGTKLKMQPA-GTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYS 400 (556) Q Consensus 323 ~~~~~~~~~~~~~~~~~~~l~pG-~i~~L~pGe~i~~~~~-~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYS 400 (556) ... .........-..| .++....|++++.+.+ +.-..+|.+..+.....||..+|||-+.| |.-+ +. T Consensus 224 ~~~-------~~~~~~~g~~n~~~~~v~~~~~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVp~~~l-g~~~---~~ 292 (406) T protein:vir:95 224 GRN-------AVFKKYLQATEAGQPWIIPAELLEVEQVKPLSLKDIAINEAVELDKRTVAGMFGVPAFLL-GIGE---FN 292 (406) T ss_pred HHH-------HHHHHhccccccCCceeecCCCccccccccCChhHHHHHHHHHHHHHHHHHHhCCCHHHc-CCCC---ch Confidence 000 0000000011223 3334456677665443 33456788888999999999999998866 3221 11 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhh Q lcl|NC_019524. 401 SARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKE 480 (556) Q Consensus 401 s~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke 480 (556) ++ .+..++..-+.|+.+.+ +.++-...+.-.++ .+++-.-.....|.... 
T Consensus 293 --~~-----------~~~~~~~~~l~P~~~~i-e~~l~~~l~~~~~~----------------~~~fd~~~l~~~d~~~~ 342 (406) T protein:vir:95 293 --RD-----------EYNNFINSTILPIAKGI-EQELTRKLLISPDL----------------YFKFNPRSLYAYDLKEL 342 (406) T ss_pred --HH-----------HHHHHHHHHHHHHHHHH-HHHHHHhcCCCCCc----------------EEEeechhhhcCCHHHH Confidence 11 11224556677766554 44454444432221 12222233344588889 Q ss_pred hHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 481 TEAAILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 481 ~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 556 (556) +++....+++|+.|+.|+-+..|.+|.+-. +++=++....+....+. ....++.+.+++++++| T Consensus 343 ~~~~~~l~~~G~~t~NE~R~~~gl~p~~~g----------d~~~~~~n~~~~~~~~~--~~~~k~g~~~~~~~~~~ 406 (406) T protein:vir:95 343 AEVGSNMYVRGIMEGNEVRDWLGLSPKEGL----------SELVILENYIPLDKIGD--QSKLKGGDNSGADGQTD 406 (406) T ss_pred HHHHHHHHhCCCcCHHHHHHHhCCCCCCCc----------ceeeeccCccchhhccc--ccccCCCCCCCCCCCCC Confidence 999999999999999999888898875321 11111111111111111 11111212222233333 No 103 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=99.41 E-value=2.8e-11 Score=78.45 Aligned_cols=435 Identities=7% Similarity=-0.030 Sum_probs=208.6 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAA 80 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~ 80 (556) |-|.....+..-.......... ..+. .....|.-+... -..+. 
.......+.+ ..++ ..--.+++++ T Consensus 21 ~~~~~~~~~~~i~~~i~~~~~~-~~~~-~~~~~Yy~g~~~-i~~r~-~~~~~~~~~~--------~~~~-~~ki~~n~~~ 87 (474) T protein:vir:95 21 LKPQFETQEEMIIRLIDDHRKQ-LDKI-TVGQRYYDKDND-IVKQM-KKVDVYGNID--------YDKP-DWRITTNFHQ 87 (474) T ss_pred hhhccCChHHHHHHHHHHHHHH-HHHH-HHHHHHhcccCc-hhccc-cccccccccc--------cccc-cceeccchHH Confidence 5555554444333332221111 1111 112234432210 00000 0000000000 0011 1111357999 Q ss_pred HHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCce Q lcl|NC_019524. 81 GVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEV 160 (556) Q Consensus 81 ~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~ 160 (556) -++++.+.++.|.+++.... + +.....|+.|.++ +|......+.+....-|.+ T Consensus 88 ~Ivd~~~~~l~g~p~~~~~~---------d-------~~~~~~l~~~~~n-----------~~~~~~~e~~~~~~~~G~~ 140 (474) T protein:vir:95 88 NLVDQKVSYVASKPVTYSCE---------D-------ESVLKIIHDVLDT-----------RWDNKLIDILTATSNKGID 140 (474) T ss_pred HHHHHHHhhhccCCceeccC---------c-------hHHHHHHHHHHhc-----------cHHHHHHHHHHHHhhcCcE Confidence 99999999999998876531 1 2233455555431 3555566667888899999 Q ss_pred EEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEEC---CCC-------CeEEEEEeecCCCcccc Q lcl|NC_019524. 161 LATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLD---NNG-------AALGYWLRKAFPGDPTD 230 (556) Q Consensus 161 f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d---~~G-------r~vaY~i~~~hpgd~~~ 230 (556) |+++.+ ...+ -+++..++|+.+---+.....+.+..+|.+- ... ..+-||.+..-.-.... 
T Consensus 141 ~~~v~~-d~~~--------~~~i~~~~p~~~~~v~d~~~~~~~~~~i~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~ 211 (474) T protein:vir:95 141 WLQVYI-NENG--------EMKLFRVPAEQAIPIWVDKEREELKSFIRYYKFNNEEKVEFWTDTTVTYYVLENGGLIPDY 211 (474) T ss_pred EEEEEe-cCCC--------ceEEEEEcccceEEEEcCCCCCceEEEEEEEEEcCeeEEEEEeCCeEEEEEEcCCcccccc Confidence 977543 2211 2578899998774323333333455566541 111 11222222211000000 Q ss_pred CCccccce---e-eccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCccc Q lcl|NC_019524. 231 MEQWKWGY---E-PARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSD 306 (556) Q Consensus 231 ~~~~~~~r---v-~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~ 306 (556) .....+.. . ..+..+| |+|+... .-|+|.|.+++..+..++....-......-.+.--.+++--.++ T Consensus 212 ~~~~~~~~~~~~~~~~g~iP---vv~~~nn-----~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~- 282 (474) T protein:vir:95 212 YYGANHIQSHFSNGNWGRVP---FIAFKNN-----PEEVSDIWMYKSLIDAIDKRLSDAQNMFDESVELIYILKGYEGQ- 282 (474) T ss_pred ccCcccccccccccCCCccc---eEeecCC-----CCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcc- Confidence 00000000 0 0111222 4554432 35899999988888777654444433333222222222211000 Q ss_pred ccccccccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCC Q lcl|NC_019524. 307 VVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMS 386 (556) Q Consensus 307 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ 386 (556) .. 
+.....+..+.+..+..|.++++++.+.+...+..+++.+.+.|....++| T Consensus 283 --------~~-------------------~~~~~~~~~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p 335 (474) T protein:vir:95 283 --------DL-------------------EEFMRGLKYYKAINVDGDGGVETIQVEVPVSSTKEYIDLMRAYIMEFGQGV 335 (474) T ss_pred --------cc-------------------hhhhhhhhccceeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCc Confidence 00 001112345566778889999999988899999999999999998888876 Q ss_pred ---HHHhhchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhh Q lcl|NC_019524. 387 ---YEQFSRDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALC 463 (556) Q Consensus 387 ---ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~ 463 (556) ++.++++. |-.+.|..+..........+..|-.. ++.+++..++ ++..... ..-+ T Consensus 336 ~~~~~~~~~n~---Sg~Alk~~~~~l~~k~~~k~~~~~~~-l~~~~~li~~--~~g~~~d----------------~~~i 393 (474) T protein:vir:95 336 DFQTDKFGSAP---SGIALKFLYGNLDLKANKLKNKATVA-IQELIGFIID--FNNLKMD----------------VKDI 393 (474) T ss_pred ccccccccccc---hHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHH--HhCCCcc----------------ccee Confidence 23344443 33345555554444444444444332 3333333222 2222111 0113 Q ss_pred CeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCC--CHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCC Q lcl|NC_019524. 464 NAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGG--DFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSN 541 (556) Q Consensus 464 ~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~--D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~ 541 (556) .+.|... .|.-+.+.....+++|+.|.+..+...+. |+++.++++.+|++...+.=-.. .....+ T Consensus 394 ~v~f~~~-----~p~d~~e~a~~~~~~g~iS~et~i~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~--------~~~~~d 460 (474) T protein:vir:95 394 EISFNFN-----RMMNDAEQSQIIAQSQYLSRETLVKSSPLVDDYKAELERIEQEQMEYNKQLPNL--------DDGGAD 460 (474) T ss_pred eEEeccC-----CCcCHHHHHHHHHhcCCCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhccccc--------ccccCC Confidence 4566332 34445555556778999999999999875 89999999998876553321111 000111 Q ss_pred CCCCCCCCCCCcCC Q lcl|NC_019524. 
542 SSESTSDNPNEETT 555 (556) Q Consensus 542 ~~~~~~~~~~~e~~ 555 (556) ..++.++..+++++ T Consensus 461 ~~~~~~~~~~~~~~ 474 (474) T protein:vir:95 461 GAQQQERSNDKESE 474 (474) T ss_pred CCcCCCCCccCCCC Confidence 11111111111111 No 104 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=99.40 E-value=2.3e-11 Score=78.93 Aligned_cols=392 Identities=11% Similarity=-0.021 Sum_probs=219.7 Q ss_pred hcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHhhhccCCcee Q lcl|NC_019524. 18 VVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVVAVHRDSIVGSQYKL 97 (556) Q Consensus 18 ~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v~~~~~nvVG~Gi~~ 97 (556) +..+..+... ...|+-+.+.-+.+ +.+. -+.++.+.| ++ .+|++-+|+.+++..+=.||+. T Consensus 1 l~~~~~r~~~---~~~yY~g~~~~~~~------~~~~-------p~~~~~~~~-~v--~nw~~~~Vds~a~rl~~~Gf~~ 61 (410) T protein:vir:95 1 MNLYQSRVNL---RYKHYAMQHYEAPT------GITI-------PAHIRAKYQ-AV--LGWAAKGVDSLADRLIFRAFAN 61 (410) T ss_pred CCcchhhHHH---HHHHhcCCCCcccc------chhc-------cHHHHhHHH-hh--cchhHHHHHHhHhhhccccccC Confidence 1112111111 12344332211111 1111 122333333 33 4788999999999888788852 Q ss_pred eeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEEEeeccCCCCcCCCc Q lcl|NC_019524. 98 NAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLATCEWLNPTGTTMQRR 177 (556) Q Consensus 98 ~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~~~~~~~ 177 (556) ++ + .+|+-|.. .+|...+..+.+..++-|.+|+.. +... ++. 
T Consensus 62 ---~d-------~-----------~l~~i~~~-----------N~ld~~~~~~~~~al~~G~sf~~v-~~~~-----d~~ 103 (410) T protein:vir:95 62 ---DD-------F-----------NVTEIFDR-----------NNPDIFFDSAILSALIGSCSFVYI-SKGE-----DDE 103 (410) T ss_pred ---CC-------c-----------hHHHHHhh-----------cChHHHHHHHHHHHHHhCceeEEE-ecCC-----CCc Confidence 11 1 14554432 268888999999999999999885 3221 121 Q ss_pred ccceEEEEEchhhcCCCCCCCCCceEEEEEEE---CCCCCeEEEEEeecCCCccccC-CccccceeeccccCChhHeEee Q lcl|NC_019524. 178 PFGTAIQMISPYRMSNPNNVMDTPNLRSGVQL---DNNGAALGYWLRKAFPGDPTDM-EQWKWGYEPARFDWGRRRVIHI 253 (556) Q Consensus 178 ~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~---d~~Gr~vaY~i~~~hpgd~~~~-~~~~~~rv~~~~~v~a~~viH~ 253 (556) ..|+.++|..+---++. ..+.+..++.+ |+.|.++...++.. +..+.. ....+..++ ...|.-=|+|+ T Consensus 104 ---~~i~~~sP~~~~~i~Dp-~~~~~~~al~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~--~~~g~vPvV~f 175 (410) T protein:vir:95 104 ---VRLQVIESSNATGVIDP-ITGLLVEGYAVLARDDYNRPTLEAYFEP--NATHFIPKDGEPYSVT--NETGIPLLVPV 175 (410) T ss_pred ---eEEEEEcccceEEEEeC-CCCceEEEEEEEEecCCCeEEEEEEEeC--CcEEEEeeCCcccccc--CCCCCcceEEe Confidence 36899999998754443 33567777653 56677776666543 221111 111222232 23455557888 Q ss_pred ecccCCCcccCCchhh-HHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCccccccccccccccccccccccccccccc Q lcl|NC_019524. 254 IEALLAGQTRGISEMV-SALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLAN 332 (556) Q Consensus 254 f~~~r~gQ~RGvs~la-~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 332 (556) +...+.+-.=|.|.++ |++..+..+++-.--.+..+...|.=-.+|+--++ . T Consensus 176 ~n~~~l~~~~G~s~I~~~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~------------d--------------- 228 (410) T protein:vir:95 176 IHRPDAVRPFGRSRITRAGMYYQKYAKRTLERADITAEFYSWPQKYILGLDP------------D--------------- 228 (410) T ss_pred cccccCCccCCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhheeeccCC------------C--------------- Confidence 8777777677888876 55555554444333333333332222122221000 0 Q ss_pred ccccccceecCCceeeecCCCc---eeee--ecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccchh---HHH Q lcl|NC_019524. 
333 YVAQTKNIAIDGAKIPHLYPGT---KLKM--QPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSS---ARA 404 (556) Q Consensus 333 ~~~~~~~~~l~pG~i~~L~pGe---~i~~--~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs---~R~ 404 (556) ...........|.|..++..+ .+++ ++.... .+|.+.++.+.+.||+-.++|-+.|.... .|-|| +++ T Consensus 229 -~~~~~~~~~~~~~i~~~~~~~~~~~~~v~q~~~~~l-~~~~~~l~~l~~~~a~~s~lP~~~lg~~~--~NpsSa~Al~a 304 (410) T protein:vir:95 229 -AEPMEKWKATVSSLLTISSSDKGVKPSVGQFTTASM-SPFTEQLRTAAAGFAGEMGLTLDDLGFVS--DNPSSVEAIKA 304 (410) T ss_pred -CCcCchhhhhhhhheeccCCCCCCcceEEecCCCCh-HHHHHHHHHHHHHHhhhcCCCHHHhcccc--CchhHHHHHHH Confidence 001111224456777776543 3555 544433 46888899999999999999999887654 35554 677 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeec---Ccccccchhhhh Q lcl|NC_019524. 405 SMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIG---ASRGQIDEKKET 481 (556) Q Consensus 405 ~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~---p~~~~iDP~Ke~ 481 (556) +.....+..+..|..|-..+.+ +++. -.++..+.-..|... .-+.+.|.+ |.-.++ ...+ T Consensus 305 ~~~~L~~ka~~k~~~fg~~l~~-~~rl--a~~i~~~~~~~~~~~------------~~~~v~W~p~~d~~~~s~--a~~a 367 (410) T protein:vir:95 305 SHENLRLAGRKAQRSLGAGLLN-VAYV--AACLRDEFRYTRSQF------------VRTAVKWEPLFEADANTM--TMIG 367 (410) T ss_pred HHHHHHHHHHHHHHHHHHHHHH-HHHH--HHHHhcCCCCccccc------------ceeeEEeeecCCcchhhH--HHHH Confidence 7777777777777777776644 4442 223433332222110 013456862 332222 5566 Q ss_pred HHHHHHHHc--CCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 
482 EAAILRIKN--GLSTYEAEISRLGGDFREVFKQRAREEGLIKS 522 (556) Q Consensus 482 ~A~~~~i~~--G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~ 522 (556) .|..+.+.+ |+.+.+-.....|.+++++.++..+|.+..-+ T Consensus 368 Da~~Kl~~a~~g~~~~~~~~~~lg~~~~~~~~~~~~e~~~~g~ 410 (410) T protein:vir:95 368 DGVVKLNQALPGYINAETIRDLTGIAGDMSAKPVVSEGGSNGE 410 (410) T ss_pred HHHHHHHHhccCCccHHHHHHhcCCChHHHHHHHHHHHHhCCC Confidence 666777787 78777777777899999988887777654433 No 105 >protein:vir:96068 Length: 765 # NCBI annotation: conserved hypothetical protein ORF017 # Family: family:all:297 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294434;genbank:gi:149408331;genbank:GeneID:5237187 Probab=99.40 E-value=3.4e-13 Score=88.95 Aligned_cols=467 Identities=8% Similarity=0.002 Sum_probs=210.6 Q ss_pred CCcchhhhHHH---HHhhHhhcccchh---hhhh-hhcchhccccC--CCccccccc-CCCCCHHHHHHHHHH----HHH Q lcl|NC_019524. 1 MKDVKKTTRTR---AKKAVDVVAETAT---ATPM-AVGGGMEGAER--TTREMFQWN-PSIISPDQQIAQNQD----MAS 66 (556) Q Consensus 1 ~sp~~~~~r~~---a~~a~~~~~~~~~---~~~~-~~~~~y~aa~~--~~r~~~~w~-~~~~s~~~~i~~~~~----~lr 66 (556) --|.+++.|.. +...-...++... .... ....+-+++.. ....+.... ....+.+.....+.- ..- T Consensus 31 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~g 110 (765) T protein:vir:96 31 HDPLDPMIKLGKIRGWNVEPEKAPVIRSVKDFLEPGLSVAMDSAYGDGPTPAAKAAAGGQNPYVVPTMLQDWYNSQGFIG 110 (765) T ss_pred CCCcccchhHHHHhhcccccccCCCCCCCCcccCcccceeccccccccccchHHHhhhccCccchhhHHHhhhcccCCcc Confidence 34444444443 2111001011000 0000 00011122210 000011000 111111111111110 000 Q ss_pred HHHHHHHhcChHHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHH Q lcl|NC_019524. 67 ARAQDMVQNDGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGL 146 (556) Q Consensus 67 ~RaRdl~rNn~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~l 146 (556) -..-.|++-|++++.+|+.+....+-.|+.+.... ++...+..+.+++.|++. .+.+. 
T Consensus 111 yql~alY~~~~l~rkiVd~pAeDa~R~g~~I~~~~--------~e~~~~~~~~l~~~~~rl--------------~v~~~ 168 (765) T protein:vir:96 111 YQACAIISQHWLVDKACSMSGEDAARNGWELKSDG--------RKLSDEQSALIARRDMEF--------------RVKDN 168 (765) T ss_pred HHHHHHHHhCchhhhhhhcchHHhhcCCceeecCc--------cccCHHHHHHHHHHHHHh--------------hHHHH Confidence 11235799999999999999999999999886421 122233445566665543 22333 Q ss_pred HHHHhh-hheecCceEEEEeeccCCCCcCCCccc--------ce-EEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeE Q lcl|NC_019524. 147 TRLAVS-GFLMTGEVLATCEWLNPTGTTMQRRPF--------GT-AIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAAL 216 (556) Q Consensus 147 q~l~~r-~~~~dGE~f~~~~~~~~~~~~~~~~~~--------~l-~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~v 216 (556) ...+++ +.+-.|-+.+++.-......++ .|+ .+ .|.+|++-.+..-.. + ....-.--..+|+|. T Consensus 169 l~ea~~~~RlyGga~i~i~i~~~D~~~l~--~PL~~~~I~kg~~kgl~vldp~~~~~~~v---~-e~~~Dp~sp~fg~P~ 242 (765) T protein:vir:96 169 LVELNRFKNVFGVRIALFVVESDDPDYYE--KPFNPDGIAPGSYKGISQIDPYWAMPQLT---A-ESTADPSAEHFYEPD 242 (765) T ss_pred HHHHHHHhhhceeeEEEEEecccCcchhh--ccccccccccceeeEEEEechhhcccccc---h-hccccccccccCcce Confidence 334444 3333333333332211111010 011 11 133444433321000 0 000011112457776 Q ss_pred EEEEeecCCCccccCCccccceeeccccCChhHeEeeecccCCCccc------CCchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 217 GYWLRKAFPGDPTDMEQWKWGYEPARFDWGRRRVIHIIEALLAGQTR------GISEMVSALKQMKMTRNFQEITLQNAV 290 (556) Q Consensus 217 aY~i~~~hpgd~~~~~~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~R------Gvs~la~~l~~l~~l~~~~dael~~a~ 290 (556) .|+|. ...|+.+.|||+-...-+...+ |+|.|-+++..|++++.-......-.. 
T Consensus 243 ~y~i~--------------------g~~IH~SRli~~~g~~lpd~lk~~~~~~G~Svlq~~yd~I~~~~~t~~~~a~Ll~ 302 (765) T protein:vir:96 243 FWIIS--------------------GKKYHRSHLVVVRGPQPPDILKPTYIFGGIPLTQRIYERVYAAERTANEAPLLAM 302 (765) T ss_pred eeeec--------------------CceeccceEEEecCCCchhhhccccCccCccHHHHHHHHHHHHHHHHHHHHHHHH Confidence 66653 1247778888874444445554 999999999888877665543333222 Q ss_pred HhcceeeeEeccCcccccccccccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHH Q lcl|NC_019524. 291 VNATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTD 370 (556) Q Consensus 291 i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~ 370 (556) -++ +. +++.+.... .+.. ........... ...... | +..+..+++++.++.+ -++..+ T Consensus 303 k~~-~~-v~k~~~~~~-----l~~~-~~l~~r~~~~~----------~~r~n~-g-~~~id~ee~~e~~s~~--lsgl~d 360 (765) T protein:vir:96 303 SKR-TS-TIHVDVEKA-----IANE-DAFNARLAFWI----------ANRDNH-G-VKVIGIDETMEQFDTN--LSDFDS 360 (765) T ss_pred Hhc-cc-eeeechHhh-----hccH-HHHHHHHHHHH----------HhcCCc-e-eEEecCCcceeEEecc--cCCHHH Confidence 222 22 233322111 0000 00000000000 000011 2 3447778999988854 447899 Q ss_pred HHHHHHHHHHHhcCCCHHHhhch-hhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcc Q lcl|NC_019524. 371 YEQSLLRNIAASLGMSYEQFSRD-YTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKN 449 (556) Q Consensus 371 F~~~~lr~iaaglGi~ye~l~~D-~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~ 449 (556) ++...+..||++.+||...|.|- .++.| ||.-.-+..|...++.+|+..+ +|+.++.++.-+++|.++ ..+. 
T Consensus 361 ~l~~~~~~iAaas~IP~t~LfGqsp~Gln-ATGe~D~~nYyD~I~s~Qe~~l----~p~le~L~~li~~s~~i~--~d~~ 433 (765) T protein:vir:96 361 VIMNQYQLVAAIAKTPATKLLGTSPKGFN-ATGEHETISYHEELESIQEHIF----DPLLERHYLLLAKSESID--VQLE 433 (765) T ss_pred HHHHHHHHHHhhhCCCeeeeccCCccccc-CcchHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHhcCCC--Ccce Confidence 99999999999999999888874 34444 6667778888899999997754 445555555555667653 2222 Q ss_pred cccccchhhHHHhhCeeeecCcccccch-hhhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCC Q lcl|NC_019524. 450 WRMFYDPMMRDALCNAEWIGASRGQIDE-KKETEAAILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFT 528 (556) Q Consensus 450 ~~~~~~~~~~~a~~~~~w~~p~~~~iDP-~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~ 528 (556) +. ++-.|.+...+-+|- .|-+++....+++|+.|..++..+.+.|++--+..+..+.. ..+-++... T Consensus 434 i~-----------FnpL~~~sekEkAei~~k~Aea~~~~~~~Gvis~dEvR~~L~~~~~~g~~~l~d~~~-e~~~~~~pe 501 (765) T protein:vir:96 434 IV-----------WNPVDSTTSQQQAELNNKKAATDEIYINSGVVSPDEVRERLRDDPRSGYNRLTDDQA-ETEPGMSPE 501 (765) T ss_pred EE-----------eCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHhccccCCCCCCCcccc-ccccCCCcc Confidence 11 123455555555554 56778999999999999988888766554311111111000 000111100 Q ss_pred c------------cccccCCCCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 529 G------------KMVEGNSTQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 529 ~------------~~~~~~~~~~~~~~~~~~~~~~~e~~~ 556 (556) . ..........+++...++..++.++.. 
T Consensus 502 ~~~~~~~~~~~~~~~~~e~~~~~a~p~~~eg~~~~~~~~p 541 (765) T protein:vir:96 502 NLAELEKAGAQSAKAKGEAERAEAQAGAVEGAGDPVPAAP 541 (765) T ss_pred ccccccCCCcccccccCccccccCCCCccCCCCcccccCC Confidence 0 000000000000000000000000000 No 106 >protein:vir:80134 Length: 403 # NCBI annotation: Phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425602;genbank:gi:155042935;genbank:GeneID:5469563 Probab=99.39 E-value=6.8e-13 Score=87.29 Aligned_cols=394 Identities=12% Similarity=0.018 Sum_probs=197.8 Q ss_pred chhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHH Q lcl|NC_019524. 4 VKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVV 83 (556) Q Consensus 4 ~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v 83 (556) |.=....+ ++ ..+... ....|.....+.+ .+-.....-..++|.+..+| T Consensus 1 Mg~~~~f~-~k--------------------~~~~~~--~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~V~~~I 49 (403) T protein:vir:80 1 MGLFNFFR-RK--------------------TRSEPT--NAISWFLTQEAYD--------TLAIPGYTRLSDNPEVRMAV 49 (403) T ss_pred Cccccccc-cc--------------------cccccc--chhhhhccccccc--------ccccchhhhhhhhHHHHHHH Confidence 11110000 00 000000 0011111110000 00000111134478889999 Q ss_pred HHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheec--CceE Q lcl|NC_019524. 84 AVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMT--GEVL 161 (556) Q Consensus 84 ~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~d--GE~f 161 (556) +.+.+.|-.-.|++.-+.+. +. ..... ..+..+-.+|+ ..+|.+++-...+..++.. 
|.++ T Consensus 50 ~~ia~~iA~~p~~~~~~~~~------g~--~~~~~---~~~~lL~~~PN------~~~t~~~f~~~~v~~~ll~~~Gna~ 112 (403) T protein:vir:80 50 HKIAELISSMTIHLMQNTDN------GD--IRIKN---ELSRKIDINPY------SLMTRKAWMYNIVYTMLLDGEGNSV 112 (403) T ss_pred HHHHHhhhhCceEEEEecCC------ce--eecCC---hHHHHHhccCC------cCCCHHHHHHHHHHHHhhcCCccEE Confidence 99999888777776432211 11 01111 12233333554 3467888888888887775 6688 Q ss_pred EEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceeec Q lcl|NC_019524. 162 ATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPA 241 (556) Q Consensus 162 ~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~ 241 (556) +.+.+.. ...+..|..|+|+.+. |..|..|..+.|-. T Consensus 113 i~~~~~~--------~g~~~~L~~l~p~~v~--------------~~~~~~g~~~~y~~--------------------- 149 (403) T protein:vir:80 113 VFPKYTT--------SGLIDELIPLAPSKVS--------------FVDTDTGYQIWYQG--------------------- 149 (403) T ss_pred EEEEEcC--------CCcEEEEEEEcCCeeE--------------EEEcCCceEEEEee--------------------- Confidence 7765422 1235678888887763 34455554444421 Q ss_pred cccCChhHeEeeec-ccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCccccccccccccccccc Q lcl|NC_019524. 242 RFDWGRRRVIHIIE-ALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFK 320 (556) Q Consensus 242 ~~~v~a~~viH~f~-~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~ 320 (556) +..+.++|||+.. +..-+...|+|++..+...+.......+......+=.+...++|+.+....... ..... T Consensus 150 -~~~~~~eiih~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~------~~~~~ 222 (403) T protein:vir:80 150 -KAYNYDEVLHFIVNPDPEKPYMGRGYRVVLKDIVNNLKQATTTKKSFMSGKYMPSLIVKVDAATAELS------SEEGR 222 (403) T ss_pred -cccchhhEEEEeccCCCcCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCChHH------HHHHH Confidence 1245679999885 444566789998877666666666666666666666677778887654311100 00000 Q ss_pred ccccccccccccccccccceecCCceeeec-CCCc---eeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhh-chhh Q lcl|NC_019524. 
321 EIFNEYMTGLANYVAQTKNIAIDGAKIPHL-YPGT---KLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFS-RDYT 395 (556) Q Consensus 321 ~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L-~pGe---~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~-~D~s 395 (556) ...... . ......|.+..+ .++. +++.+++ -...|.+..+.....||..+|||-+.|. ++++ T Consensus 223 ~~~~~~-------~----~~~~~~g~~~~~~~~~~~~~~~~~l~~--~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~ 289 (403) T protein:vir:80 223 NAVFKK-------Y----LEASEAGQPWIIPAELLDVEQVKPLSL--KDLAIHETVELDKRTVAGIFGVPAFLLGVGKYD 289 (403) T ss_pred HHHHHH-------H----hhhhhcCCeeeecccccccceeccCCH--HHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCcc Confidence 000000 0 000112333223 2333 4443332 3467888889999999999999988773 3433 Q ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCccccc Q lcl|NC_019524. 396 KTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQI 475 (556) Q Consensus 396 ~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~i 475 (556) +.+|++ |+..-+.|+.+.+ +.++.+..+.-+.+ .+++-.-..... T Consensus 290 ~~~~~~------------------f~~~~l~P~~~~i-e~~l~~kll~~~~~----------------~~~f~~~~ll~~ 334 (403) T protein:vir:80 290 KDEYNN------------------FINSTILPIAKGI-EQELTRKLLISPDL----------------YFKFNPRSLYAY 334 (403) T ss_pred HHHHHH------------------HHHHHHHHHHHHH-HHHHHHhccCCCCc----------------EEEeechhhhcc Confidence 322221 4455567766654 44555555543221 112222223335 Q ss_pred chhhhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCC-CCCCCCCCcC Q lcl|NC_019524. 476 DEKKETEAAILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSE-STSDNPNEET 554 (556) Q Consensus 476 DP~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~-~~~~~~~~e~ 554 (556) |+...+++....+++|+.|+-|+-+..|..|.+--+ ++=++....|..... ..+... 
++..+.+.++ T Consensus 335 d~~~~~~~~~~~~~~Gi~t~NE~R~~~gl~p~~ggd----------~~~~~~n~~pl~~~~--~~~~~k~ge~~~~~~~~ 402 (403) T protein:vir:80 335 DLKELAEVGSNMYVRGLMEGNEVRDWLGLSPKEGLS----------ELVILENYIPLDKIG--DQNKLKGGEKGGADGQT 402 (403) T ss_pred CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC----------eEeecccccchhhcc--chhhccCCCCCCCCCCC Confidence 999999999999999999999998888988754211 111111111111111 111111 1222222222 Q ss_pred C Q lcl|NC_019524. 555 T 555 (556) Q Consensus 555 ~ 555 (556) + T Consensus 403 ~ 403 (403) T protein:vir:80 403 D 403 (403) T ss_pred C Confidence 2 No 107 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=99.39 E-value=3.8e-11 Score=77.72 Aligned_cols=448 Identities=8% Similarity=-0.025 Sum_probs=214.4 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAA 80 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~ 80 (556) ......++.. ..........+-.....-|.|-. +.-... +...... ++ +.-.-++|++ T Consensus 39 ~~~~~~i~~~-----i~~h~~~~~~rl~~l~~yY~g~~--~~i~~~--~~~~~~~------------~~-~~ki~~n~~k 96 (502) T protein:vir:48 39 VNNWELLKNF-----INHHKLRQAPRIQELLDYARGEN--HDVLKS--GRRKDNE------------MA-DKRAVHNYGR 96 (502) T ss_pred cccHHHHHHH-----HHHHHHHHHHHHHHHHHHhcCCC--cccccc--ccccccc------------cc-cceeecchHH Confidence 0110111110 00000000001111112244321 111110 0000000 00 0012368999 Q ss_pred HHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCce Q lcl|NC_019524. 81 GVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEV 160 (556) Q Consensus 81 ~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~ 160 (556) -+|+..+.+++|.+++....- ++. ++.+...|+.|+.. 
.+|...+..+.+..+.-|.+ T Consensus 97 ~Ivd~~~~yl~g~p~~~~~~d--------~~~----~~~~~~~l~~~~~~----------N~~~~~~~~~~~~~~~~G~a 154 (502) T protein:vir:48 97 MISKFKTGYLAGNPIRVEYDD--------NED----NSQNDDAIKRIGRI----------NDIDTHNRNLIRDLSQTGRA 154 (502) T ss_pred HHHHHHhhhhcccCeeEecCC--------ccc----hhHHHHHHHHHHhh----------cCHhHHHHHHHHHHhhcCeE Confidence 999999999999998876431 111 34445555555443 25888889999999999999 Q ss_pred EEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEE----CCCCCeEEEEEeecCCCccccC-Cccc Q lcl|NC_019524. 161 LATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQL----DNNGAALGYWLRKAFPGDPTDM-EQWK 235 (556) Q Consensus 161 f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~----d~~Gr~vaY~i~~~hpgd~~~~-~~~~ 235 (556) |+.+. ....+ .+++..++|..+-.-++......+..+|.+ +..+...-+.++..+ ..+.. .... T Consensus 155 ~~~v~-~dedg--------~~~i~~~~p~~~~~vydd~~~~~~~~~ir~~~~~~~~~~~~~~~iyt~~--~i~~~~~~~~ 223 (502) T protein:vir:48 155 YEVIY-RSEYD--------ETRIKRLSPLETFVIYDNSLEDNSIAAVRYYNRGTLQNAKDVVEIYTNQ--HIYTLDASDS 223 (502) T ss_pred EEEEE-eCCCC--------ceEEEEEcccceEEEEcCCCCCceEEEEEEEEEeecCCcEEEEEEEeCC--eEEEEEeCCc Confidence 97754 32211 268899999988533333333457777764 222333333344332 11110 1111 Q ss_pred cceeec----cccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccc Q lcl|NC_019524. 236 WGYEPA----RFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQ 311 (556) Q Consensus 236 ~~rv~~----~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~ 311 (556) |..+.. +..|| |+++.. -..|+|.|.+++..+..++....-......--+.-..+++....... 
T Consensus 224 ~~~~~~~~~~~g~vP---vv~~~n-----n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~---- 291 (502) T protein:vir:48 224 FNEISVTPHAFGTVP---ITEFLN-----NADGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPQ---- 291 (502) T ss_pred eeeccceecCCCccc---eEEecC-----CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccccc---- Confidence 111111 11233 444433 34799999998887766655443333333322322233332111100 Q ss_pred cccccccccccccccccccccccccccceec-CCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHh Q lcl|NC_019524. 312 LGMGQGGFKEIFNEYMTGLANYVAQTKNIAI-DGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQF 390 (556) Q Consensus 312 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l-~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l 390 (556) +.... .......+.+ .++......++-++++++.+.+...+..+++.+.+.|....++|-... T Consensus 292 -----~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~ 355 (502) T protein:vir:48 292 -----GMQAS-----------DMKRTRLMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNKDIHVFTNTPDMSD 355 (502) T ss_pred -----ccchh-----------hhhhcceeeccccccccccccCcceeEeeecCCHHHHHHHHHHHHHHHHHHhCCCCcCc Confidence 00000 0000111111 222233345677899999888889999999999999988888773211 Q ss_pred hchh-hcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeec Q lcl|NC_019524. 391 SRDY-TKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIG 469 (556) Q Consensus 391 ~~D~-s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~ 469 (556) ..+ ++.|-.+++..+..........+..|-.. ++.+++..+...-..+...-.. 
..-+.+.|.+ T Consensus 356 -~~~~~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~-l~~~~~li~~~~~~~~~~~~~d-------------~~~i~i~f~~ 420 (502) T protein:vir:48 356 -NHFSGNASGEALKYKLFGLDQDRVDTQSQFTQG-LKRRYRLAARIGSLVNEFKDFD-------------ESRLKITFTP 420 (502) T ss_pred -cccccCchHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhhcccccccc-------------cccceEEeCC Confidence 222 22333355555444444444444444333 3334443333222222211000 0114567743 Q ss_pred CcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCC--CHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCC Q lcl|NC_019524. 470 ASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGG--DFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTS 547 (556) Q Consensus 470 p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~--D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~ 547 (556) +-. .|....+++..++ .|+.|.+..+...|. |+++-+++++.|++.......+..... ....+..+ .. T Consensus 421 ~~p--~d~~e~a~~~~kl--~g~iS~et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~--~~~~~~d~----~~ 490 (502) T protein:vir:48 421 NLP--KSLYEQVSILNDL--GGQVSQETALSLSGLVENPTEELDKINEESSKIDFKGYPSYFYD--NVGKYTDE----VK 490 (502) T ss_pred CCC--cCHHHHHHHHHHH--hccCcHHHHHHhCCCCCCHHHHHHHHHHHHHhhhhhcccccccc--cccccCCC----cc Confidence 322 4766666666554 699999999999874 899999999998876554443321111 11111111 11 Q ss_pred CCCCCcCCC Q lcl|NC_019524. 548 DNPNEETTQ 556 (556) Q Consensus 548 ~~~~~e~~~ 556 (556) +++++|.+. T Consensus 491 e~~~~~~~~ 499 (502) T protein:vir:48 491 ETHTDDFER 499 (502) T ss_pred CCCCcCcCC Confidence 122222222 No 108 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=99.38 E-value=4.4e-11 Score=77.39 Aligned_cols=434 Identities=13% Similarity=0.112 Sum_probs=206.4 Q ss_pred hcccchhhhhhhh-----------cchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHH Q lcl|NC_019524. 
18 VVAETATATPMAV-----------GGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVVAVH 86 (556) Q Consensus 18 ~~~~~~~~~~~~~-----------~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v~~~ 86 (556) +....-....... ...|.-+.. ..... ...... .+ +++-.-..|++-+|+.. T Consensus 1 ~~t~~~~i~~L~~~~~~~~~r~~~l~~Yy~G~~---~i~~~---~~~~~~-------~~----~~~~~~~n~~~~ivd~~ 63 (480) T protein:vir:78 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTR---RLKTI---GIGAPP-------EL----AYLDVQPGWVATYLRTL 63 (480) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccc---ccccc---ccccch-------hH----hhhhhhcchHHHHHHHH Confidence 3332221111100 011221111 11110 001110 11 11112235889999999 Q ss_pred HhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEEEee Q lcl|NC_019524. 87 RDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLATCEW 166 (556) Q Consensus 87 ~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~ 166 (556) +++.++.||+..- +.+..+.+...|+ . -+|...+..+++..+.-|.+|..+-. T Consensus 64 ~~~l~~~g~~~~~-------------d~~~~~~l~~i~~---~-----------N~~d~~~~~~~~~a~~~G~ay~~v~~ 116 (480) T protein:vir:78 64 SDRLDIEGFRISE-------------DSEGLEELWNWWQ---A-----------NDLDEESVLGHDDSLTFGRSYITVSH 116 (480) T ss_pred HhhhccCceecCC-------------CchhHHHHHHHHH---h-----------cCHHHHHHHHHHHHhhcCceEEEEec Confidence 9999988886431 1122333433332 1 15677888899999999999977532 Q ss_pred ccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEE----CCCCCeEEEEEeecCCCccccC-----Cccccc Q lcl|NC_019524. 167 LNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQL----DNNGAALGYWLRKAFPGDPTDM-----EQWKWG 237 (556) Q Consensus 167 ~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~----d~~Gr~vaY~i~~~hpgd~~~~-----~~~~~~ 237 (556) ..... .+.... .++..++|..+-.-++......+..+|.+ |..|.+.-+-++.. +..+.. ....|. 
T Consensus 117 ~~~~~--~d~~g~-~~i~~~~p~~~~~~~D~~~~~~~~~~i~~~~~~~~~~~~~~~~~y~~--~~~~~~~~~~~~~~~~~ 191 (480) T protein:vir:78 117 PDVES--GDPAGI-PLIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLP--DETVPLRRNGGLNDQWV 191 (480) T ss_pred Ccccc--CCCCCe-eEEEEEcccceEEEEcCCCccceEEEEEEEEeecCCCceEEEEEEeC--CeEEEEEecCCCccccc Confidence 11110 011112 46889999888543333334455666654 34454432222221 111100 001111 Q ss_pred e--eeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhccee---eeEeccCccccccccc Q lcl|NC_019524. 238 Y--EPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYA---ASVESELPSDVVFGQL 312 (556) Q Consensus 238 r--v~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~---~fi~~~~~~~~~~~~~ 312 (556) - -+....++.-=|+|+....+.+-.-|+|.|.+-+..| ++.|.......+...-.++ .+|+-...+.. T Consensus 192 ~~~~~~~~~~g~vPvv~f~n~~~~~~~~G~s~i~~~v~~l--~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~----- 264 (480) T protein:vir:78 192 VDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKV--TDAASRTLMNLQSASQILGTPLRVISGVTTDEL----- 264 (480) T ss_pred cccccccCCCCCcceEEeecccccCCccCcccchhhHHHH--HHHHHHHHHHHHHHHHhhcchhhhhhcCCcccc----- Confidence 1 0111124444578888878888889999999744443 3333333333333332222 23321100000 Q ss_pred ccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCC-CccHHHHHHHHHHHHHHhcCCCHHHhh Q lcl|NC_019524. 313 GMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTP-GGVGTDYEQSLLRNIAASLGMSYEQFS 391 (556) Q Consensus 313 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p-~~~f~~F~~~~lr~iaaglGi~ye~l~ 391 (556) . ...+.......+|.|..+. |+++++.+.+.. -.+|...++..++.|++..++|.+.|. 
T Consensus 265 -----------~--------~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~g 324 (480) T protein:vir:78 265 -----------T--------NDGENTTLDIYYGRILTLA-SEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLS 324 (480) T ss_pred -----------c--------cccccchhhhhhhhhccCC-CCCceEEecCccCHHHHHHHHHHHHHHHhcccCCChHHhc Confidence 0 0011112234566666664 566777665543 345777778889999999999999886 Q ss_pred chhhcccchh---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeee Q lcl|NC_019524. 392 RDYTKTNYSS---ARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWI 468 (556) Q Consensus 392 ~D~s~~nYSs---~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~ 468 (556) ++. .|-+| +|..+.......+..+..|-..+.+- ++ |-.+++.+.. +.. ..-+.+.|. T Consensus 325 ~~~--~n~~Sg~Alk~~~~~l~~ka~~~~~~f~~~l~~~-~~--l~~~~~g~~~--~~~------------~~~i~v~f~ 385 (480) T protein:vir:78 325 SSS--ENPASAEAIIATDSRIVKMAERKGRIFGGAWERA-MR--IAMQIMGREV--TEE------------YTRLETVWR 385 (480) T ss_pred ccc--CcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HH--HHHHHcCCCc--ccc------------ceeeeEEec Confidence 653 34444 45555555555555566665555443 33 2223333221 110 011457887 Q ss_pred cCcccccchhhhhHHHHHHHHcC--CCCHHHHHHHhCCCHHHHHH--HHHHHHH--HHHHcCCCCCccccccCCCCCCCC Q lcl|NC_019524. 469 GASRGQIDEKKETEAAILRIKNG--LSTYEAEISRLGGDFREVFK--QRAREEG--LIKSLKLDFTGKMVEGNSTQSSNS 542 (556) Q Consensus 469 ~p~~~~iDP~Ke~~A~~~~i~~G--~~s~~~~~ae~G~D~e~v~~--q~a~E~~--~~~~~Gl~~~~~~~~~~~~~~~~~ 542 (556) .|..+ +-.+.+.+..+.+.+| +.|.+.++...|.+.+++-+ +..+|.. .+..+ .... . ..+...+ T Consensus 386 ~~~~~--s~~~~ad~~~kl~~~g~~~~s~et~~~~lg~~~d~~~~~~~~~~e~~~~~~~~~--~~~~-~----~~~~~~~ 456 (480) T protein:vir:78 386 DPSTP--TVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTL--YSTT-K----AQADATP 456 (480) T ss_pred CCCCC--CHHHHHHHHHHHHHhccccCCHHHHHhcCCCCHhHHHHHHHHHHHHHHHHHHHh--hccc-c----ccCCCCC Confidence 76655 5667777777777776 67777777788987765432 2222211 11111 1100 0 0111111 Q ss_pred CCCCCCCCCCcCCC Q lcl|NC_019524. 
543 SESTSDNPNEETTQ 556 (556) Q Consensus 543 ~~~~~~~~~~e~~~ 556 (556) .++.++++.+ +++ T Consensus 457 ~~~~~~~~~~-~~~ 469 (480) T protein:vir:78 457 KPTVTETKTE-TQT 469 (480) T ss_pred CCCCCCCCCc-ccc Confidence 1111222111 111 No 109 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=99.36 E-value=5.9e-11 Score=76.67 Aligned_cols=438 Identities=8% Similarity=-0.006 Sum_probs=208.2 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAA 80 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~ 80 (556) |-|........-.+........ ..+. .....|.-+...-..-..+.-.....+. .++ ..---+++++ T Consensus 21 ~~~~~~~~~~~i~~~i~~~~~~-~~~~-~~~~~YY~g~~~i~~~~~~~~~~~~~~~----------~~~-~~ki~~n~~k 87 (474) T protein:vir:97 21 LKPQFETQEEMIVRLIDDHRKQ-LDKI-TVGQRYYDKDNDIVKQMKKVDVHGNIDY----------DKP-DWRITTNFHQ 87 (474) T ss_pred hhhcccCHHHHHHHHHHHHHHH-HHHH-HHHHHHhccccchhcccchhcccccccc----------ccC-cceeecchHH Confidence 3344333222222221111100 0111 1122333222100000000000000000 000 0011258999 Q ss_pred HHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCce Q lcl|NC_019524. 81 GVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEV 160 (556) Q Consensus 81 ~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~ 160 (556) -+|+..+.+++|.+++.... 
+ +.....|+.|..+ +|...+..+.+..+.-|.+ T Consensus 88 ~Ivd~~~~~l~g~p~~~~~~---------d-------~~~~~~l~~~~~n-----------~~~~~~~e~~~~~~~~G~~ 140 (474) T protein:vir:97 88 NLVDQKVSYVASKPVTYSCE---------D-------ENVLKVIHDVLDT-----------RWDNKLIDILTATSNKGID 140 (474) T ss_pred HHHHHHHhhhhcCCceeccC---------c-------HHHHHHHHHHHhc-----------cHHHHHHHHHHHHhhcCce Confidence 99999999999999877532 1 2233445555431 3555666667888899999 Q ss_pred EEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCC--------Cc--ccc Q lcl|NC_019524. 161 LATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFP--------GD--PTD 230 (556) Q Consensus 161 f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hp--------gd--~~~ 230 (556) ++++. ....+ -+++..++|+.+-.-++......+..+|++-.......+.++...- +. ... T Consensus 141 ~~~~~-~d~~~--------~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~yt~~~~~~y~~~~~~~~~~~ 211 (474) T protein:vir:97 141 WLQVY-INENG--------EMKLFRVPAEQAIPIWVDKEREELKSFIRYYKFNNEEKVEFWTDTTVTYYVLENGGLIPDY 211 (474) T ss_pred EEEEE-ecCCC--------eeEEEEEcccceEEEEcCCCCCceEEEEEEEEecCeEEEEEEeCCeEEEEEEcCCcccccc Confidence 97754 32211 2578899999886434434345566777653322112222221110 00 000 Q ss_pred CCcccccee-eccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccc Q lcl|NC_019524. 231 MEQWKWGYE-PARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVF 309 (556) Q Consensus 231 ~~~~~~~rv-~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~ 309 (556) .....+... ......+.-=|+++... .-|+|.|.+++..+..++....-......-.+.-..+++-..++. 
T Consensus 212 ~~~~~~~~~~~~~~~~g~vPvv~~~nn-----~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~~~~--- 283 (474) T protein:vir:97 212 YYGANHVQSHFSNGNWGRVPFIAFKNN-----PEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKGYEGED--- 283 (474) T ss_pred ccCcCcccccccccCCCccceEEecCC-----cCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCccc--- Confidence 000000000 00000111114444332 358999999888887766544444333332222222333111000 Q ss_pred cccccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCC--- Q lcl|NC_019524. 310 GQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMS--- 386 (556) Q Consensus 310 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~--- 386 (556) . +.....+..+.+..+..|.++++++.+.+...+..+.+.+.+.|....++| T Consensus 284 ------~-------------------~~~~~~~~~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~ 338 (474) T protein:vir:97 284 ------L-------------------EEFMRGLKYYKAINVDGDGGVETIQVEVPVSSTKEYIDLMRVYIMEFGQGVDFQ 338 (474) T ss_pred ------c-------------------hhhhhhhhccceeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccC Confidence 0 000112445666778889999999999999999999999999988888776 Q ss_pred HHHhhchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCee Q lcl|NC_019524. 387 YEQFSRDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAE 466 (556) Q Consensus 387 ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~ 466 (556) .+.++++.| --+.+..+..........+..|-. .++.+++..++ ++..... . .-+.+. 
T Consensus 339 ~~~~~~n~S---g~Al~~~~~~l~~k~~~k~~~~~~-~l~~~~~li~~--~~~~~~d----~------------~~i~v~ 396 (474) T protein:vir:97 339 TDKFGSAPS---GIALKFLYGNLDLKANKLKNKATV-AIQELISFIID--FNNLKTD----V------------KDIEIS 396 (474) T ss_pred ccccccccH---HHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHH--HhCCCcc----c------------ceeeEE Confidence 333433332 224554444444444444444333 33444433222 2222111 0 012456 Q ss_pred eecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCC--CHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCC Q lcl|NC_019524. 467 WIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGG--DFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSE 544 (556) Q Consensus 467 w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~--D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~ 544 (556) |.. -.|.-+++.....+++|+.|.+..+...+. |+++.++++.+|++...+.--.. .....+ .+ T Consensus 397 f~~-----~~p~~~~e~a~~~~~~g~iS~et~l~~l~~v~D~~~E~eri~~E~~~~~~~~~~~--------~~~~~~-~~ 462 (474) T protein:vir:97 397 FNF-----NRMMNDAEQSQIIAQSQYLSRETLVKSSPLVDDYKAELERIEQEQMEYNKQLPNL--------DDGGAD-GA 462 (474) T ss_pred ecc-----CcccCHHHHHHHHHHcCCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhcccc--------CCCCCC-Cc Confidence 632 234445556666788999999999999975 99999999999987654321111 011111 11 Q ss_pred CCCCCCCCcCCC Q lcl|NC_019524. 545 STSDNPNEETTQ 556 (556) Q Consensus 545 ~~~~~~~~e~~~ 556 (556) .+.+.++++.+| T Consensus 463 ~~~~~~~~~~~e 474 (474) T protein:vir:97 463 QQQEGSNNKESE 474 (474) T ss_pred ccCCCCcccccC Confidence 112222222222 No 110 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=99.36 E-value=5.9e-11 Score=76.67 Aligned_cols=438 Identities=8% Similarity=-0.006 Sum_probs=208.2 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHH Q lcl|NC_019524. 
1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAA 80 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~ 80 (556) |-|........-.+........ ..+. .....|.-+...-..-..+.-.....+. .++ ..---+++++ T Consensus 21 ~~~~~~~~~~~i~~~i~~~~~~-~~~~-~~~~~YY~g~~~i~~~~~~~~~~~~~~~----------~~~-~~ki~~n~~k 87 (474) T protein:vir:94 21 LKPQFETQEEMIVRLIDDHRKQ-LDKI-TVGQRYYDKDNDIVKQMKKVDVHGNIDY----------DKP-DWRITTNFHQ 87 (474) T ss_pred hhhcccCHHHHHHHHHHHHHHH-HHHH-HHHHHHhccccchhcccchhcccccccc----------ccC-cceeecchHH Confidence 3344333222222221111100 0111 1122333222100000000000000000 000 0011258999 Q ss_pred HHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCce Q lcl|NC_019524. 81 GVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEV 160 (556) Q Consensus 81 ~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~ 160 (556) -+|+..+.+++|.+++.... + +.....|+.|..+ +|...+..+.+..+.-|.+ T Consensus 88 ~Ivd~~~~~l~g~p~~~~~~---------d-------~~~~~~l~~~~~n-----------~~~~~~~e~~~~~~~~G~~ 140 (474) T protein:vir:94 88 NLVDQKVSYVASKPVTYSCE---------D-------ENVLKVIHDVLDT-----------RWDNKLIDILTATSNKGID 140 (474) T ss_pred HHHHHHHhhhhcCCceeccC---------c-------HHHHHHHHHHHhc-----------cHHHHHHHHHHHHhhcCce Confidence 99999999999999877532 1 2233445555431 3555666667888899999 Q ss_pred EEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCC--------Cc--ccc Q lcl|NC_019524. 161 LATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFP--------GD--PTD 230 (556) Q Consensus 161 f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hp--------gd--~~~ 230 (556) ++++. ....+ -+++..++|+.+-.-++......+..+|++-.......+.++...- +. ... 
T Consensus 141 ~~~~~-~d~~~--------~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~yt~~~~~~y~~~~~~~~~~~ 211 (474) T protein:vir:94 141 WLQVY-INENG--------EMKLFRVPAEQAIPIWVDKEREELKSFIRYYKFNNEEKVEFWTDTTVTYYVLENGGLIPDY 211 (474) T ss_pred EEEEE-ecCCC--------eeEEEEEcccceEEEEcCCCCCceEEEEEEEEecCeEEEEEEeCCeEEEEEEcCCcccccc Confidence 97754 32211 2578899999886434434345566777653322112222221110 00 000 Q ss_pred CCcccccee-eccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccc Q lcl|NC_019524. 231 MEQWKWGYE-PARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVF 309 (556) Q Consensus 231 ~~~~~~~rv-~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~ 309 (556) .....+... ......+.-=|+++... .-|+|.|.+++..+..++....-......-.+.-..+++-..++. T Consensus 212 ~~~~~~~~~~~~~~~~g~vPvv~~~nn-----~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~~~~--- 283 (474) T protein:vir:94 212 YYGANHVQSHFSNGNWGRVPFIAFKNN-----PEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKGYEGED--- 283 (474) T ss_pred ccCcCcccccccccCCCccceEEecCC-----cCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCccc--- Confidence 000000000 00000111114444332 358999999888887766544444333332222222333111000 Q ss_pred cccccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCC--- Q lcl|NC_019524. 310 GQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMS--- 386 (556) Q Consensus 310 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~--- 386 (556) . 
+.....+..+.+..+..|.++++++.+.+...+..+.+.+.+.|....++| T Consensus 284 ------~-------------------~~~~~~~~~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~ 338 (474) T protein:vir:94 284 ------L-------------------EEFMRGLKYYKAINVDGDGGVETIQVEVPVSSTKEYIDLMRVYIMEFGQGVDFQ 338 (474) T ss_pred ------c-------------------hhhhhhhhccceeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccC Confidence 0 000112445666778889999999999999999999999999988888776 Q ss_pred HHHhhchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCee Q lcl|NC_019524. 387 YEQFSRDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAE 466 (556) Q Consensus 387 ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~ 466 (556) .+.++++.| --+.+..+..........+..|-. .++.+++..++ ++..... . .-+.+. T Consensus 339 ~~~~~~n~S---g~Al~~~~~~l~~k~~~k~~~~~~-~l~~~~~li~~--~~~~~~d----~------------~~i~v~ 396 (474) T protein:vir:94 339 TDKFGSAPS---GIALKFLYGNLDLKANKLKNKATV-AIQELISFIID--FNNLKTD----V------------KDIEIS 396 (474) T ss_pred ccccccccH---HHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHH--HhCCCcc----c------------ceeeEE Confidence 333433332 224554444444444444444333 33444433222 2222111 0 012456 Q ss_pred eecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCC--CHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCC Q lcl|NC_019524. 467 WIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGG--DFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSE 544 (556) Q Consensus 467 w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~--D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~ 544 (556) |.. -.|.-+++.....+++|+.|.+..+...+. |+++.++++.+|++...+.--.. .....+ .+ T Consensus 397 f~~-----~~p~~~~e~a~~~~~~g~iS~et~l~~l~~v~D~~~E~eri~~E~~~~~~~~~~~--------~~~~~~-~~ 462 (474) T protein:vir:94 397 FNF-----NRMMNDAEQSQIIAQSQYLSRETLVKSSPLVDDYKAELERIEQEQMEYNKQLPNL--------DDGGAD-GA 462 (474) T ss_pred ecc-----CcccCHHHHHHHHHHcCCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhcccc--------CCCCCC-Cc Confidence 632 234445556666788999999999999975 99999999999987654321111 011111 11 Q ss_pred CCCCCCCCcCCC Q lcl|NC_019524. 
545 STSDNPNEETTQ 556 (556) Q Consensus 545 ~~~~~~~~e~~~ 556 (556) .+.+.++++.+| T Consensus 463 ~~~~~~~~~~~e 474 (474) T protein:vir:94 463 QQQEGSNNKESE 474 (474) T ss_pred ccCCCCcccccC Confidence 112222222222 No 111 >protein:vir:9507 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835554;genbank:gi:30043953;genbank:GeneID:1260535 Probab=99.36 E-value=6.2e-13 Score=87.50 Aligned_cols=388 Identities=10% Similarity=0.033 Sum_probs=189.4 Q ss_pred chhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHH Q lcl|NC_019524. 4 VKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVV 83 (556) Q Consensus 4 ~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v 83 (556) |+=..+.+-++ .......+| .....-+-..+..++.+..+| T Consensus 1 Mg~f~~lf~~~------------------------~~~~~~~~~---------------~~~~~v~~~~~~~~~~v~~~i 41 (395) T protein:vir:95 1 MSILEKIFKTR------------------------KDITYMLDL---------------DMIEDLSQQAYVKRLAIDSCI 41 (395) T ss_pred CchhhhhhccC------------------------ccccccccc---------------hhccccchhhhhhhHHHHHHH Confidence 22222111100 000000111 111111223345678899999 Q ss_pred HHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEE Q lcl|NC_019524. 84 AVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLAT 163 (556) Q Consensus 84 ~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~ 163 (556) +.|++.|-..-|++.-.- ... -...+..+..+|+ ..+|..++....+..++..|++|+. 
T Consensus 42 ~~Ia~~iA~~p~~~~~~~--------~~~-------~~~~~~ll~~~PN------~~~t~~~f~~~~~~~lll~g~~~~~ 100 (395) T protein:vir:95 42 EFVARAVAQSHFKVLEGN--------RIQ-------KNDVYYKLNIKPN------TDLSSDSFWQQVIYKLIYDNEVLIV 100 (395) T ss_pred HHHHHhhccceeEeccCC--------ccc-------cchHHHHHHhccC------cCCCHHHHHHHHHHHHhhCCceEEE Confidence 999999998766654210 000 1123333444554 3567888878888899999999987 Q ss_pred EeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceeeccc Q lcl|NC_019524. 164 CEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPARF 243 (556) Q Consensus 164 ~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~~~ 243 (556) .++.. ..+ .+++..+. +... ...+..+|.++.++- .. T Consensus 101 ~~~~~--------~~~-----~~~~~~~~-~~~~--~~~~~~~~~~~~~~~---------------------------~~ 137 (395) T protein:vir:95 101 VSDSK--------ELL-----IADSFYRE-EYAL--YDDIFKDVTVKDYTY---------------------------QR 137 (395) T ss_pred EecCC--------CeE-----ecCCccce-eEee--cCcceeEEEEcCcee---------------------------ee Confidence 64321 011 12211111 0000 011222332222110 11 Q ss_pred cCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccc Q lcl|NC_019524. 244 DWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIF 323 (556) Q Consensus 244 ~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~ 323 (556) .+++.+|||+...-..+...|.|++..+...+. ..+.+.+-++...++|+.+.... .++..+.. T Consensus 138 ~~~~~evih~~~~~~~~~~~G~spi~~~~~~~~-------~~~~~~~~~~~~~gii~~~~~~~---------~~e~~~~~ 201 (395) T protein:vir:95 138 TFTMQEVIYLKYNNNKVTHFVESLFEDYGKIFG-------RMIGAQLKNYQIRGILKSASSAY---------DEKNIEKL 201 (395) T ss_pred eeccccEEEEccCCCCcccccchHHHHHHHHHH-------HHHHHHHhcCCCceEEEeCCCCC---------CHHHHHHH Confidence 356789999988778888899988766544432 33334455677778887653211 01111111 Q ss_pred cccccccccccccccceecCCceeeecCCCceeeeecCCCCCc-----cHHHHHHHHHHHHHHhcCCCHHHhhchhhccc Q lcl|NC_019524. 
324 NEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGG-----VGTDYEQSLLRNIAASLGMSYEQFSRDYTKTN 398 (556) Q Consensus 324 ~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~-----~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~n 398 (556) ....... . ....-....|..|..|.+++.++-+.-.. +|.+..+...++||..+|||-+.|. .+ T Consensus 202 ~~~~~~~---~---~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~-----~~ 270 (395) T protein:vir:95 202 QAFTNKL---F---NTFNKNQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIY-----GE 270 (395) T ss_pred HHHHHHH---h---ccccccCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhc-----Cc Confidence 1100000 0 00011344577799999999876443222 4666677889999999999998774 36 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchh Q lcl|NC_019524. 399 YSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEK 478 (556) Q Consensus 399 YSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~ 478 (556) ||++.+.... ++..-+.|+...+- .++-...++ +.. +....+..+ -...-.|+. T Consensus 271 ~sn~e~~~~~-----------~~~~~l~P~~~~ie-~~l~~kL~~-~~~-----------~~~~~~f~~--~~l~~~D~~ 324 (395) T protein:vir:95 271 TADLEKNTLV-----------FEKFCLTPLLKKIQ-NELNAKLIT-QSM-----------YLKDTRIEI--VGVNKKDPL 324 (395) T ss_pred ccCHHHHHHH-----------HHHHHHHHHHHHHH-HHHHHhhcC-hhh-----------hcccceecc--hhhhccCHH Confidence 6665443333 34455666666543 334333332 111 000111222 222335888 Q ss_pred hhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCC--CCCCCCCCcCC Q lcl|NC_019524. 479 KETEAAILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSE--STSDNPNEETT 555 (556) Q Consensus 479 Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~--~~~~~~~~e~~ 555 (556) ..+++....+++|+.|+-|+-+..|..|-+-- ..++.=++....+............. 
..+.+.+++.+ T Consensus 325 ~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~g--------~~d~~~~~~n~~~~~~~~~~~~~~~~~~~kgg~~~~~g~ 395 (395) T protein:vir:95 325 QYAEAIDKLVSSGSFTRNEVRIMLGEEPSDNP--------ELDEYLITKNYEKANSGENDEKEKDENTLKGGDEDESGD 395 (395) T ss_pred HHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCC--------CCceeeeccccccccccccccCcccccccCCCCCCCCCC Confidence 88999999999999999999888898875310 00011111111111110000000000 00111111111 No 112 >protein:vir:101289 Length: 395 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908829;genbank:gi:118725093;genbank:GeneID:4555860 Probab=99.36 E-value=6.2e-13 Score=87.50 Aligned_cols=388 Identities=10% Similarity=0.033 Sum_probs=189.4 Q ss_pred chhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHH Q lcl|NC_019524. 4 VKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVV 83 (556) Q Consensus 4 ~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v 83 (556) |+=..+.+-++ .......+| .....-+-..+..++.+..+| T Consensus 1 Mg~f~~lf~~~------------------------~~~~~~~~~---------------~~~~~v~~~~~~~~~~v~~~i 41 (395) T protein:vir:10 1 MSILEKIFKTR------------------------KDITYMLDL---------------DMIEDLSQQAYVKRLAIDSCI 41 (395) T ss_pred CchhhhhhccC------------------------ccccccccc---------------hhccccchhhhhhhHHHHHHH Confidence 22222111100 000000111 111111223345678899999 Q ss_pred HHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEE Q lcl|NC_019524. 84 AVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLAT 163 (556) Q Consensus 84 ~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~ 163 (556) +.|++.|-..-|++.-.- ... -...+..+..+|+ ..+|..++....+..++..|++|+. 
T Consensus 42 ~~Ia~~iA~~p~~~~~~~--------~~~-------~~~~~~ll~~~PN------~~~t~~~f~~~~~~~lll~g~~~~~ 100 (395) T protein:vir:10 42 EFVARAVAQSHFKVLEGN--------RIQ-------KNDVYYKLNIKPN------TDLSSDSFWQQVIYKLIYDNEVLIV 100 (395) T ss_pred HHHHHhhccceeEeccCC--------ccc-------cchHHHHHHhccC------cCCCHHHHHHHHHHHHhhCCceEEE Confidence 999999998766654210 000 1123333444554 3567888878888899999999987 Q ss_pred EeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceeeccc Q lcl|NC_019524. 164 CEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPARF 243 (556) Q Consensus 164 ~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~~~ 243 (556) .++.. ..+ .+++..+. +... ...+..+|.++.++- .. T Consensus 101 ~~~~~--------~~~-----~~~~~~~~-~~~~--~~~~~~~~~~~~~~~---------------------------~~ 137 (395) T protein:vir:10 101 VSDSK--------ELL-----IADSFYRE-EYAL--YDDIFKDVTVKDYTY---------------------------QR 137 (395) T ss_pred EecCC--------CeE-----ecCCccce-eEee--cCcceeEEEEcCcee---------------------------ee Confidence 64321 011 12211111 0000 011222332222110 11 Q ss_pred cCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccc Q lcl|NC_019524. 244 DWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIF 323 (556) Q Consensus 244 ~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~ 323 (556) .+++.+|||+...-..+...|.|++..+...+. ..+.+.+-++...++|+.+.... .++..+.. T Consensus 138 ~~~~~evih~~~~~~~~~~~G~spi~~~~~~~~-------~~~~~~~~~~~~~gii~~~~~~~---------~~e~~~~~ 201 (395) T protein:vir:10 138 TFTMQEVIYLKYNNNKVTHFVESLFEDYGKIFG-------RMIGAQLKNYQIRGILKSASSAY---------DEKNIEKL 201 (395) T ss_pred eeccccEEEEccCCCCcccccchHHHHHHHHHH-------HHHHHHHhcCCCceEEEeCCCCC---------CHHHHHHH Confidence 356789999988778888899988766544432 33334455677778887653211 01111111 Q ss_pred cccccccccccccccceecCCceeeecCCCceeeeecCCCCCc-----cHHHHHHHHHHHHHHhcCCCHHHhhchhhccc Q lcl|NC_019524. 
324 NEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGG-----VGTDYEQSLLRNIAASLGMSYEQFSRDYTKTN 398 (556) Q Consensus 324 ~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~-----~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~n 398 (556) ....... . ....-....|..|..|.+++.++-+.-.. +|.+..+...++||..+|||-+.|. .+ T Consensus 202 ~~~~~~~---~---~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~-----~~ 270 (395) T protein:vir:10 202 QAFTNKL---F---NTFNKNQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIY-----GE 270 (395) T ss_pred HHHHHHH---h---ccccccCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhc-----Cc Confidence 1100000 0 00011344577799999999876443222 4666677889999999999998774 36 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchh Q lcl|NC_019524. 399 YSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEK 478 (556) Q Consensus 399 YSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~ 478 (556) ||++.+.... ++..-+.|+...+- .++-...++ +.. +....+..+ -...-.|+. T Consensus 271 ~sn~e~~~~~-----------~~~~~l~P~~~~ie-~~l~~kL~~-~~~-----------~~~~~~f~~--~~l~~~D~~ 324 (395) T protein:vir:10 271 TADLEKNTLV-----------FEKFCLTPLLKKIQ-NELNAKLIT-QSM-----------YLKDTRIEI--VGVNKKDPL 324 (395) T ss_pred ccCHHHHHHH-----------HHHHHHHHHHHHHH-HHHHHhhcC-hhh-----------hcccceecc--hhhhccCHH Confidence 6665443333 34455666666543 334333332 111 000111222 222335888 Q ss_pred hhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCC--CCCCCCCCcCC Q lcl|NC_019524. 479 KETEAAILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSE--STSDNPNEETT 555 (556) Q Consensus 479 Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~--~~~~~~~~e~~ 555 (556) ..+++....+++|+.|+-|+-+..|..|-+-- ..++.=++....+............. 
..+.+.+++.+ T Consensus 325 ~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~g--------~~d~~~~~~n~~~~~~~~~~~~~~~~~~~kgg~~~~~g~ 395 (395) T protein:vir:10 325 QYAEAIDKLVSSGSFTRNEVRIMLGEEPSDNP--------ELDEYLITKNYEKANSGENDEKEKDENTLKGGDEDESGD 395 (395) T ss_pred HHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCC--------CCceeeeccccccccccccccCcccccccCCCCCCCCCC Confidence 88999999999999999999888898875310 00011111111111110000000000 00111111111 No 113 >protein:vir:100650 Length: 395 # NCBI annotation: 77ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958604;genbank:gi:41189523;genbank:GeneID:2743796 Probab=99.36 E-value=6.2e-13 Score=87.50 Aligned_cols=388 Identities=10% Similarity=0.033 Sum_probs=189.4 Q ss_pred chhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHH Q lcl|NC_019524. 4 VKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVV 83 (556) Q Consensus 4 ~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v 83 (556) |+=..+.+-++ .......+| .....-+-..+..++.+..+| T Consensus 1 Mg~f~~lf~~~------------------------~~~~~~~~~---------------~~~~~v~~~~~~~~~~v~~~i 41 (395) T protein:vir:10 1 MSILEKIFKTR------------------------KDITYMLDL---------------DMIEDLSQQAYVKRLAIDSCI 41 (395) T ss_pred CchhhhhhccC------------------------ccccccccc---------------hhccccchhhhhhhHHHHHHH Confidence 22222111100 000000111 111111223345678899999 Q ss_pred HHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEE Q lcl|NC_019524. 84 AVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLAT 163 (556) Q Consensus 84 ~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~ 163 (556) +.|++.|-..-|++.-.- ... -...+..+..+|+ ..+|..++....+..++..|++|+. 
T Consensus 42 ~~Ia~~iA~~p~~~~~~~--------~~~-------~~~~~~ll~~~PN------~~~t~~~f~~~~~~~lll~g~~~~~ 100 (395) T protein:vir:10 42 EFVARAVAQSHFKVLEGN--------RIQ-------KNDVYYKLNIKPN------TDLSSDSFWQQVIYKLIYDNEVLIV 100 (395) T ss_pred HHHHHhhccceeEeccCC--------ccc-------cchHHHHHHhccC------cCCCHHHHHHHHHHHHhhCCceEEE Confidence 999999998766654210 000 1123333444554 3567888878888899999999987 Q ss_pred EeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceeeccc Q lcl|NC_019524. 164 CEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPARF 243 (556) Q Consensus 164 ~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~~~ 243 (556) .++.. ..+ .+++..+. +... ...+..+|.++.++- .. T Consensus 101 ~~~~~--------~~~-----~~~~~~~~-~~~~--~~~~~~~~~~~~~~~---------------------------~~ 137 (395) T protein:vir:10 101 VSDSK--------ELL-----IADSFYRE-EYAL--YDDIFKDVTVKDYTY---------------------------QR 137 (395) T ss_pred EecCC--------CeE-----ecCCccce-eEee--cCcceeEEEEcCcee---------------------------ee Confidence 64321 011 12211111 0000 011222332222110 11 Q ss_pred cCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccc Q lcl|NC_019524. 244 DWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIF 323 (556) Q Consensus 244 ~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~ 323 (556) .+++.+|||+...-..+...|.|++..+...+. ..+.+.+-++...++|+.+.... .++..+.. T Consensus 138 ~~~~~evih~~~~~~~~~~~G~spi~~~~~~~~-------~~~~~~~~~~~~~gii~~~~~~~---------~~e~~~~~ 201 (395) T protein:vir:10 138 TFTMQEVIYLKYNNNKVTHFVESLFEDYGKIFG-------RMIGAQLKNYQIRGILKSASSAY---------DEKNIEKL 201 (395) T ss_pred eeccccEEEEccCCCCcccccchHHHHHHHHHH-------HHHHHHHhcCCCceEEEeCCCCC---------CHHHHHHH Confidence 356789999988778888899988766544432 33334455677778887653211 01111111 Q ss_pred cccccccccccccccceecCCceeeecCCCceeeeecCCCCCc-----cHHHHHHHHHHHHHHhcCCCHHHhhchhhccc Q lcl|NC_019524. 
324 NEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGG-----VGTDYEQSLLRNIAASLGMSYEQFSRDYTKTN 398 (556) Q Consensus 324 ~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~-----~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~n 398 (556) ....... . ....-....|..|..|.+++.++-+.-.. +|.+..+...++||..+|||-+.|. .+ T Consensus 202 ~~~~~~~---~---~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~-----~~ 270 (395) T protein:vir:10 202 QAFTNKL---F---NTFNKNQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIY-----GE 270 (395) T ss_pred HHHHHHH---h---ccccccCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhc-----Cc Confidence 1100000 0 00011344577799999999876443222 4666677889999999999998774 36 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchh Q lcl|NC_019524. 399 YSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEK 478 (556) Q Consensus 399 YSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~ 478 (556) ||++.+.... ++..-+.|+...+- .++-...++ +.. +....+..+ -...-.|+. T Consensus 271 ~sn~e~~~~~-----------~~~~~l~P~~~~ie-~~l~~kL~~-~~~-----------~~~~~~f~~--~~l~~~D~~ 324 (395) T protein:vir:10 271 TADLEKNTLV-----------FEKFCLTPLLKKIQ-NELNAKLIT-QSM-----------YLKDTRIEI--VGVNKKDPL 324 (395) T ss_pred ccCHHHHHHH-----------HHHHHHHHHHHHHH-HHHHHhhcC-hhh-----------hcccceecc--hhhhccCHH Confidence 6665443333 34455666666543 334333332 111 000111222 222335888 Q ss_pred hhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCC--CCCCCCCCcCC Q lcl|NC_019524. 479 KETEAAILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSE--STSDNPNEETT 555 (556) Q Consensus 479 Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~--~~~~~~~~e~~ 555 (556) ..+++....+++|+.|+-|+-+..|..|-+-- ..++.=++....+............. 
..+.+.+++.+ T Consensus 325 ~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~g--------~~d~~~~~~n~~~~~~~~~~~~~~~~~~~kgg~~~~~g~ 395 (395) T protein:vir:10 325 QYAEAIDKLVSSGSFTRNEVRIMLGEEPSDNP--------ELDEYLITKNYEKANSGENDEKEKDENTLKGGDEDESGD 395 (395) T ss_pred HHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCC--------CCceeeeccccccccccccccCcccccccCCCCCCCCCC Confidence 88999999999999999999888898875310 00011111111111110000000000 00111111111 No 114 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=99.36 E-value=6e-11 Score=76.61 Aligned_cols=420 Identities=9% Similarity=0.002 Sum_probs=211.5 Q ss_pred hhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHhhhccC Q lcl|NC_019524. 14 KAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVVAVHRDSIVGS 93 (556) Q Consensus 14 ~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v~~~~~nvVG~ 93 (556) -.......--. .-......|... ..|..... .|.........++. .-..+++++-+|+..+.+++|. T Consensus 1 l~~~~l~~~i~-~~~~~~~r~~~l-------~~yy~g~~----~il~~~~~~~~~~~-~ki~~n~~~~ivd~~~~~l~g~ 67 (429) T protein:vir:98 1 MTKDLLSELIQ-KHRSFNLSYSAY-------KQLYEGDH----AILQQKQKEQYKPD-NRLVVNFAKYIVDTFNGYFIGV 67 (429) T ss_pred CCHHHHHHHHH-HHHHHHHHHHHH-------HHHhcccc----ccccccccccCCCc-ceeecchHHHHHHHHhhhhccc Confidence 11111100000 000011122211 01110000 00000000000010 0113568999999999999998 Q ss_pred CceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEEEeeccCCCCc Q lcl|NC_019524. 94 QYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLATCEWLNPTGTT 173 (556) Q Consensus 94 Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~~~ 173 (556) +++..+. . +.....|+.|++. .+|...+..+.+..+.-|.+|+.+.. 
...+ T Consensus 68 ~~~~~~~---------~-------~~~~~~l~~~~~~----------n~~~~~~~~~~~~~~~~G~~~~~v~~-d~~g-- 118 (429) T protein:vir:98 68 PVQTSHE---------N-------KQVSNYLELLDGY----------NDQDDNNAELSKICSIYGHGYELVFN-DENA-- 118 (429) T ss_pred CceeecC---------C-------hHHHHHHHHHHhh----------cCHhHHHHHHHHHHhhcCeEEEEEEe-cCCC-- Confidence 8776542 1 2233345555443 25777888899999999999987643 2221 Q ss_pred CCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEEC-CCCCeEEEEEeecCCCccccCCccccceeecc-ccCChhHeE Q lcl|NC_019524. 174 MQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLD-NNGAALGYWLRKAFPGDPTDMEQWKWGYEPAR-FDWGRRRVI 251 (556) Q Consensus 174 ~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d-~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~~-~~v~a~~vi 251 (556) -+++.+++|..+-.-+.......+..+|.+- ..+....+.++...---.+......|..+... ..++.==|+ T Consensus 119 ------~~~~~~~~p~~~~~v~dd~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv 192 (429) T protein:vir:98 119 ------EAGITYLTPLEAFIVYDDSIRQKPLFAVRYFYNKGGVLEGSYSDASNITYFKDGEKGIEIGESEPHPFDGVPMI 192 (429) T ss_pred ------cEEEEEEcccceEEEEeCCCCCceEEEEEEEEecCceEEEEEEeCceEEEEEecCCceEecccccccCCccceE Confidence 2578999999986444444445577777663 33444444444332211111111222221111 011111255 Q ss_pred eeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccccccccccc Q lcl|NC_019524. 252 HIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLA 331 (556) Q Consensus 252 H~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 331 (556) |+... 
..|.|.|.+++..+..++....-......-.+.-..+++-...++ + T Consensus 193 ~~~n~-----~~g~sd~e~v~~liD~~d~~~s~~~~~~~~~~~p~~~i~g~~~~~-----------~------------- 243 (429) T protein:vir:98 193 EYVEN-----EERQSLLASVVTLINAFNKAISEKANDVEYFADAYLKILGAELDD-----------E------------- 243 (429) T ss_pred EecCC-----CCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCCCc-----------c------------- Confidence 54432 369999999888777666644444433333332223333110000 0 Q ss_pred cccccccceecCCceeeecCC----CceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccchhHHHHHH Q lcl|NC_019524. 332 NYVAQTKNIAIDGAKIPHLYP----GTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSARASMA 407 (556) Q Consensus 332 ~~~~~~~~~~l~pG~i~~L~p----Ge~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R~~~~ 407 (556) ....+..+.+..++. +-++++++.+.+...+..+++.+.+.|....++|-... ..++++|-.+.+..+. T Consensus 244 ------~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~-~~~gn~Sg~Al~~~~~ 316 (429) T protein:vir:98 244 ------TLKSLRDTRIINLKDTDAQQLTVEFLQKPDADATQEHLLDRLENLIFRTAMVANISD-ESFGTASGIALRYRLQ 316 (429) T ss_pred ------hhhhHhhCceeeccCCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccCc-cccccchHHHHHHHHH Confidence 000122223333322 23678888888888899999999999999998884333 3345566666777666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHH Q lcl|NC_019524. 408 ETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILR 487 (556) Q Consensus 408 e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~ 487 (556) .........|..|-.. ++.+++..+.. +.. .+... 
.+ .-+.+.|..+- ..|....+++..++ T Consensus 317 ~l~~k~~~~~~~~~~~-l~~~~~li~~~--~~~----~~~~~--d~-------~~i~v~f~~~~--p~~~~~~a~~~~kl 378 (429) T protein:vir:98 317 AMDNLAKTKERKFMSG-MNRRYKLIASY--PTS----KIGPK--DW-------IGIKYKFTRNL--PANLLEESQIAGNL 378 (429) T ss_pred HHHHHHHHHHHHHHHH-HHHHHHHHHHH--hcc----CCCcc--cc-------ccceEEeCCCC--CcCHHHHHHHHHHH Confidence 6666666666665544 34455544442 221 11100 00 01356774332 24655555555543 Q ss_pred HHcCCCCHHHHHHHhCC--CHHHHHHHHHHHHHHHHHc-CCCCCccccccCCCCCCCCCCCCCC Q lcl|NC_019524. 488 IKNGLSTYEAEISRLGG--DFREVFKQRAREEGLIKSL-KLDFTGKMVEGNSTQSSNSSESTSD 548 (556) Q Consensus 488 i~~G~~s~~~~~ae~G~--D~e~v~~q~a~E~~~~~~~-Gl~~~~~~~~~~~~~~~~~~~~~~~ 548 (556) +|+.|.+.++...|. |+++.++++++|++..-+. .. ....+..++..| T Consensus 379 --~g~is~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~-----------~~~~~~~~~~~~ 429 (429) T protein:vir:98 379 --AGIVSEETQVGVLSIVENPQKEIERKNSDKSTLISRQAG-----------GLNGQNTTTILE 429 (429) T ss_pred --hccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHh-----------hhcCCCCCCCCC Confidence 799999999999875 8999999999998755221 10 001111111111 No 115 >protein:vir:95965 Length: 385 # NCBI annotation: ORF011 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239800;genbank:gi:66395461;genbank:GeneID:5132882 Probab=99.35 E-value=1.2e-12 Score=85.87 Aligned_cols=378 Identities=9% Similarity=0.016 Sum_probs=187.3 Q ss_pred chhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHH Q lcl|NC_019524. 4 VKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVV 83 (556) Q Consensus 4 ~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v 83 (556) |.=..+ ..+ .. ..+. ... 
+...+..-+-..+..++.+..+| T Consensus 1 Mg~f~~------------------------~f~-~~-----~~~~---~~~------~~~~~~~~~~~~a~~~~~v~~~i 41 (385) T protein:vir:95 1 MGLFDS------------------------VFK-RH-----SELS---WMY------DLEFLQDKSKKAYLKQIALNTVV 41 (385) T ss_pred Cchhhh------------------------hhc-cC-----cccc---ccc------chhhhhccchhhhhhhHHHHHHH Confidence 110000 010 00 0000 000 11111112223344678889999 Q ss_pred HHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEE Q lcl|NC_019524. 84 AVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLAT 163 (556) Q Consensus 84 ~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~ 163 (556) +.+.+.|...-|++.-+- ... .. .....+..+|+ ..+|..++....+..++..|++|+. T Consensus 42 ~~ia~~ia~~p~~~~~~~--------~~~----~~---~l~~lL~~~PN------~~~t~~~f~~~~~~~l~l~Gna~i~ 100 (385) T protein:vir:95 42 EMVARTISQSEFRVMKNN--------TKE----KG---TLYYLLNVRPN------RNQNAVDFWQKFIFKLIMDNEVLVV 100 (385) T ss_pred HHHHHHHcccceeeeecC--------ccc----cc---hHHHHHhcccC------cCCCHHHHHHHHHHHHhhcCceEEE Confidence 999999988766654211 011 11 12222323443 4678999999999999999999987 Q ss_pred EeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceeeccc Q lcl|NC_019524. 164 CEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPARF 243 (556) Q Consensus 164 ~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~~~ 243 (556) +.... + .+ ..+....+..+. .. ..+...|.++..+ ... T Consensus 101 ~~~~~-------~-~~-~~~~~~~~~~~~-----~~-~~~~~~~~~~~~~---------------------------~~~ 138 (385) T protein:vir:95 101 KNDEG-------H-FF-VADDFEKEDELG-----LY-SHRFTNVLVNDFE---------------------------FKR 138 (385) T ss_pred EecCC-------C-ee-eccccccccccc-----cc-cccceeeeecccc---------------------------eee Confidence 64211 0 11 111111111110 00 0011111111110 012 Q ss_pred cCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccc Q lcl|NC_019524. 
244 DWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIF 323 (556) Q Consensus 244 ~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~ 323 (556) .+++.+|||+..+-..+...|.|++..+...+.. .+.+..-++...++++.+..... .++..... T Consensus 139 ~~~~~eiih~~~~~~~~~~~G~s~~~~~~~~i~~-------~~~~~~~~~~~~g~l~~~~~~~~--------~~e~~~~~ 203 (385) T protein:vir:95 139 VFTMDDVIYLKYNNQKLDAFSLGLFEDYGEIFGR-------MIDLQMLNNQIRGILKVDATKFY--------NKEKQKEL 203 (385) T ss_pred eeccccEEEecCCCCCcccccchHHHHHHHHHHH-------HHHHHHhcCCCceEEEeCCccCC--------CHHHHHHH Confidence 3667899999998888889999988776655432 22233344555566664321110 01110100 Q ss_pred cccccccccccccccceecCCceeeecCCCceeeeecCCCC------CccHHHHHHHHHHHHHHhcCCCHHHhhchhhcc Q lcl|NC_019524. 324 NEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTP------GGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKT 397 (556) Q Consensus 324 ~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p------~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~ 397 (556) ....... ..+ ..-..+.|..|..|.+++.++.... ...|.+..+.....||..+|||-+.|. . T Consensus 204 ~~~~~~~---~~g---~~~~~~~i~~l~~g~~~~~l~~~~~~~~s~~d~~~~e~~~~~~~~Ia~~fgVpp~~l~-----~ 272 (385) T protein:vir:95 204 QAYIDTL---FDA---FQNNTIAVVPLTEGLAYEEHSNRGAAQSAQQFSELNELKKTVLTDVARMIGVPPSLVL-----G 272 (385) T ss_pred HHHHHHH---hhh---hhhcCCceEEcCCCceeEeecccccccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhc-----C Confidence 1100000 000 0013556888999999998764322 356888889999999999999998884 4 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccch Q lcl|NC_019524. 398 NYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDE 477 (556) Q Consensus 398 nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP 477 (556) +||++.+.. ..++...+.|+...+ +.++-...++- ... ... 
.+++..-.....|+ T Consensus 273 ~~sn~e~~~-----------~~~~~~~l~P~~~~i-e~~l~~~L~~~-~~~----------~~~--~~~fd~~~l~~~D~ 327 (385) T protein:vir:95 273 EMADLEKTI-----------ESYLQFCINPLLRKI-EAELNSKFFYQ-DEY----------LND--DMHIKVVGIDKRDP 327 (385) T ss_pred CCcCHHHHH-----------HHHHHHHHHHHHHHH-HHHHHhhcCCh-hhc----------ccc--eEEEechhhhccCH Confidence 666654332 344555566755543 44444444331 110 000 12232333344588 Q ss_pred hhhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCC-CCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 478 KKETEAAILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNS-TQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 478 ~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~-~~~~~~~~~~~~~~~~e~~~ 556 (556) ...+++..+.+++|+.|+-|+-+..|..|-+. +- ++....+. -..... .+.+|. ++| T Consensus 328 ~~~~~~~~~~~~~g~lt~NE~R~~~g~~p~~~------------~~-----gd~~~~~~n~~~~~~----~kgge~-~~e 385 (385) T protein:vir:95 328 LKLSEAIDKLVASGTFTRNQVRIMTGEEPADD------------PE-----LDKFIITKNLQSADA----FKGGES-NEE 385 (385) T ss_pred HHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC------------CC-----Cceeeecccceeccc----ccCCCC-CCC Confidence 88999999999999999999987777766321 00 11000000 000000 111111 111 No 116 >protein:vir:99563 Length: 862 # NCBI annotation: minor head protein-like protein # Family: family:all:297 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039808;genbank:gi:126011058;genbank:GeneID:4818258 Probab=99.35 E-value=1.5e-11 Score=79.88 Aligned_cols=457 Identities=11% Similarity=0.045 Sum_probs=196.0 Q ss_pred CCcchhhh---------------HHHHHhhHhh-cccchh----hhhh-----hhcchhccccCCC---------ccccc Q lcl|NC_019524. 1 MKDVKKTT---------------RTRAKKAVDV-VAETAT----ATPM-----AVGGGMEGAERTT---------REMFQ 46 (556) Q Consensus 1 ~sp~~~~~---------------r~~a~~a~~~-~~~~~~----~~~~-----~~~~~y~aa~~~~---------r~~~~ 46 (556) +.|.++.+ |+.+-.+++. .++..+ .+.+ -....|-.+..+. 
+...+ T Consensus 48 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~a~~~~~~~~~~~~~Dgl~n~~~~lG~~~~~s~y~~~~~~~~ 127 (862) T protein:vir:99 48 VQKEKPNPIIRSVKDFPFVEISDSVNAKSVSGKNFAMDSAVRSAIKAITGFAMDDGGGAPVPIGAEGKQSSYAVPEALQD 127 (862) T ss_pred cccccCCCCCCcccccccccccccccchhhhhhhhcchhhcchhhhhhhhhhhhcchhhhhhccccccccccccchhccc Confidence 22221111 1111111100 000000 0000 0000111111110 11122 Q ss_pred ccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHH Q lcl|NC_019524. 47 WNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNM 126 (556) Q Consensus 47 w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~ 126 (556) |.....-++. ..-.|++-|++++.+|+.+++-++-.|+.+..-.+. ++...+..+++++.|++ T Consensus 128 ~~~~~~f~gy-----------ql~alY~~~~larkiVd~pAeDatR~g~~I~~~~d~------~e~~~e~~~~ie~~~~r 190 (862) T protein:vir:99 128 WYLSQGFIGH-----------QACALIAQHWLVDKACSLAGEDAIRNGWHLKSLGEG------EEIDEESLEKFKAIDVE 190 (862) T ss_pred cccccCcccH-----------HHHHHHHhCchhhhhhhhhhHHHhhCCceEeecCcc------cccCHHHHHHHHHHHHH Confidence 3222221111 123478999999999999999999999998864321 12223344555555554 Q ss_pred HhcccccceehhcccCHHHHHHHHhh-hheecCceEEEEeeccCCCCcCCCcccc---------eEEEEEchhhcCCCCC Q lcl|NC_019524. 127 AAESPENWFDARRMCTLTGLTRLAVS-GFLMTGEVLATCEWLNPTGTTMQRRPFG---------TAIQMISPYRMSNPNN 196 (556) Q Consensus 127 w~~~~~~~cD~~g~~~f~~lq~l~~r-~~~~dGE~f~~~~~~~~~~~~~~~~~~~---------l~lq~ie~drl~~~~~ 196 (556) . ...+...-+++ ..+..|-+++.++-...+..++ .|+. ..|.+|++..+..... T Consensus 191 L--------------~v~~~l~eair~~RLyGga~ililv~~~D~~~Ls--qPLn~e~I~kG~lkgl~vlDp~w~~p~~v 254 (862) T protein:vir:99 191 F--------------KVKENLIEFNRFKNVFGIRVAIFVVDSEDPDYYE--KPFNPDGITPGSYRGISQIDPYWMMPMLT 254 (862) T ss_pred h--------------hHHHHHHHHHHhcccccceEEEEEecCcCchhhh--cCcCcccccccceeEEEEechhhhccccc Confidence 2 12222223333 4455554443332211111110 0110 1245566555531100 Q ss_pred CCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceeeccccCChhHeEeeecccCCCccc------CCchhhH Q lcl|NC_019524. 
197 VMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPARFDWGRRRVIHIIEALLAGQTR------GISEMVS 270 (556) Q Consensus 197 ~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~R------Gvs~la~ 270 (556) ..+.+-.--..+|+|..|+|. ...|+.+.|||+-...-|+..+ |+|.|-. T Consensus 255 ----~~~~~Dp~sp~yGkP~~y~I~--------------------g~~IH~SRliif~g~~vpd~lk~ay~f~G~SvLe~ 310 (862) T protein:vir:99 255 ----AESTADPSSQFFYEPEFWIIS--------------------GQKYHRSHLIIARGPQPADILKPTYIFGGIPLVQR 310 (862) T ss_pred ----ccccccccccccCCceeeeec--------------------CeeeccceeEEecCCCchhhhhccCCccCccHHHH Confidence 000111111245777777663 1247778888875555566665 9999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhccee-eeEeccCcccccccccccccccccccccccccccccccccccceecCCceeee Q lcl|NC_019524. 271 ALKQMKMTRNFQEITLQNAVVNATYA-ASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPH 349 (556) Q Consensus 271 ~l~~l~~l~~~~dael~~a~i~A~~~-~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~ 349 (556) ++..|++.+.-. .-.+.+.-.+. -+++.+..... .. .... ..-... -...... -| +.. T Consensus 311 iyd~L~~~d~t~---~saa~Ll~ka~l~v~ktd~l~~l-----~~-ed~l----~~r~~~------~~~~rdN-~G-i~l 369 (862) T protein:vir:99 311 IYERVYAAERTA---NEAPLLAMNKRTTAIHTDTAKAI-----AN-EDKF----IQRLMF------WVRYRDN-HA-VKV 369 (862) T ss_pred HHHHHHHHHHHH---HHHHHHHHHhccceeechhHhhh-----cc-HHHH----HHHHHH------HHhccCc-ce-eEE Confidence 888777655443 33333332222 33343322111 00 0000 000000 0000011 12 445 Q ss_pred cCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 
350 LYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNY-SSARASMAETQKYMDSRKKLVADRFASAI 428 (556) Q Consensus 350 L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nY-Ss~R~~~~e~~r~~~~~q~~lv~~~~~pi 428 (556) |..+++++.++.+ -++..+.+......||++.+||.-.|.|-- -..+ ||.-.-+.-|...++.+|+..+..+ T Consensus 370 iD~eEe~e~ls~s--lSGL~dll~~~~q~IAaas~IP~tiLfGqs-paGlnATGE~D~~nYyD~I~s~QE~~L~P~---- 442 (862) T protein:vir:99 370 LGTDETMEQFDTS--LADFDAVIMGQYQLVASIAKTPATKLLGTA-PKGFNSTGEFETISYHEELESIQEHVYMPF---- 442 (862) T ss_pred ecCCCceeEEecc--cCChHHHHHHHHHHHHhhhCCCceeecccC-cccccCchHHHHHHHHHHHHHHHHHHHHHH---- Confidence 7778999988744 347889999999999999999998887742 2344 4566677778889999997655544 Q ss_pred HHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccch-hhhhHHHHHHHHcCCCCHHHHHHHhCCCHH Q lcl|NC_019524. 429 YTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDE-KKETEAAILRIKNGLSTYEAEISRLGGDFR 507 (556) Q Consensus 429 ~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP-~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e 507 (556) .++.+....++ +.+|..+.+. ++-.|.+-....+|- .|-+++....+++|+.|..++..+.-.+.. T Consensus 443 LerL~~li~~~--lg~~~d~~ie-----------FnpL~~~sekEkAEi~kk~Aea~~~lv~sGvispdEvR~~L~~~~~ 509 (862) T protein:vir:99 443 LQRHYLISRLS--LGIQHEIDVV-----------MEPVASMTAQQQADLNKTKAEGGKVLIDGGVISPDEERNRIRDDKR 509 (862) T ss_pred HHHHHHHHHHh--cCCCCcceEE-----------eCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHhcCC Confidence 44443332222 2233322211 112333433433443 345567778888888887777654211000 Q ss_pred HHHHHHHHHHHHHHHcCCCCCc-cccccCCCCCCCCCCCCCCCCC----CcCCC Q lcl|NC_019524. 508 EVFKQRAREEGLIKSLKLDFTG-KMVEGNSTQSSNSSESTSDNPN----EETTQ 556 (556) Q Consensus 508 ~v~~q~a~E~~~~~~~Gl~~~~-~~~~~~~~~~~~~~~~~~~~~~----~e~~~ 556 (556) -.+..+-.|.- -...+...+. .......+...+++.++...+. 
-|..| T Consensus 510 ~g~~~l~ded~-E~d~~~~~e~~~~~e~~g~a~~~ap~de~~aga~~~~~e~d~ 562 (862) T protein:vir:99 510 SGYNRLTKEDA-EETPGASPENLAAYQKAGAAQETASAKETQAGAAVTTAEGDQ 562 (862) T ss_pred cCCCCCCcccc-cccCCCCcccccccccCCcccccccccccccccCCccccCCc Confidence 00000000000 0000000000 0000000000000000000000 00001 No 117 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=99.33 E-value=1e-10 Score=75.34 Aligned_cols=451 Identities=9% Similarity=0.002 Sum_probs=208.2 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAA 80 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~ 80 (556) +...+.+...-..- ... ...+.. ....|.-+.. ..+... ....+. ...+. | --+++++ T Consensus 39 ~~~~~~i~~~i~~~----~~~-~~~r~~-~l~~Yy~g~~--~i~~~~---~~~~~~-~~~~~-----k-----i~~n~~k 96 (511) T protein:vir:99 39 LQNVNEVSKYIEHH----MDY-QRPRLK-VLSDYYEGKT--KNLVEL---TRRKEE-YMADN-----R-----VAHDYAS 96 (511) T ss_pred hccHHHHHHHHHHH----HHh-hHHHHH-HHHHHhcccC--cccccc---Cccccc-ccCcc-----e-----eecchHH Confidence 11111111110000 000 001111 1123432221 111000 000000 00010 1 2258999 Q ss_pred HHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCce Q lcl|NC_019524. 81 GVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEV 160 (556) Q Consensus 81 ~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~ 160 (556) -+|+..+.+++|..++.... + +.+...|+.|... 
+ +|......+.+....-|.+ T Consensus 97 ~Iv~~~~~yl~g~p~~~~~~------------d----~~~~~~l~~~~~~----n------~~~~~~~~~~~~~~i~G~a 150 (511) T protein:vir:99 97 YISDFINGYFLGNPIQYQDD------------D----KDVLEAIEAFNDL----N------DVESHNRSLGLDLSIYGKA 150 (511) T ss_pred HHHHHHHhhhcccCceeecC------------c----hHHHHHHHHHHhh----c------CHhHHHHHHHHHHHhcCee Confidence 99999999999988776532 1 2233445555443 1 4667777788888999999 Q ss_pred EEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECC--------CCCeEEEEEeecCC------- Q lcl|NC_019524. 161 LATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDN--------NGAALGYWLRKAFP------- 225 (556) Q Consensus 161 f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~--------~Gr~vaY~i~~~hp------- 225 (556) +.++.+ ...+ .+++.+++|..+---++......+..+|.+-. .+...-+.++..+- T Consensus 151 ~~~vy~-ded~--------~~~i~~~~p~~~~~vyd~~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~ 221 (511) T protein:vir:99 151 YELMIR-NQDD--------ETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTS 221 (511) T ss_pred EEEEEe-CCCC--------ceEEEEEccceeEEEEcCCCCCceEEEEEEEEeeecccCccceEEEEEEEeCCcEEEEEec Confidence 987543 2211 25889999988753333333345667776511 12222222222210 Q ss_pred -CccccCCccccceee-ccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccC Q lcl|NC_019524. 226 -GDPTDMEQWKWGYEP-ARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESEL 303 (556) Q Consensus 226 -gd~~~~~~~~~~rv~-~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~ 303 (556) +.............+ .+..|| |+++... ..|.|.|.+++..+..++....-......-.+.-..+++... 
T Consensus 222 ~~~~~~~~~~~~~~~~~~~g~vP---vv~~~nn-----~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~ 293 (511) T protein:vir:99 222 RTNGLKLTPRENGFESHSFERMP---ITEFSNN-----ERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNL 293 (511) T ss_pred CCccccccccccccccCCCCccc---eEEecCC-----CCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhchhhhhccCc Confidence 000000000000000 011232 5555443 258899999887776665543332222211111111222100 Q ss_pred cccccccccccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhc Q lcl|NC_019524. 304 PSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASL 383 (556) Q Consensus 304 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaagl 383 (556) .... .+. . .................+......+|-++++++.+.+...+..+++.+.+.|.... T Consensus 294 ~~~~---------~~~-~------~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s 357 (511) T protein:vir:99 294 NLDP---------VEV-R------KQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFT 357 (511) T ss_pred ccCc---------hhh-c------ccccccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHh Confidence 0000 000 0 00000000000011122333345668889999999889999999999999888877 Q ss_pred CCC---HHHhhchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHH Q lcl|NC_019524. 384 GMS---YEQFSRDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRD 460 (556) Q Consensus 384 Gi~---ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~ 460 (556) ++| .+.++++. |-.+++..+..........+..|-..+ +.+++..+...-..+.+..+... 
T Consensus 358 ~~P~~~~~~~~gn~---Sg~Alk~~~~~l~~ka~~k~~~~~~~l-~~~~~li~~~~~~~~~~~~~~~~------------ 421 (511) T protein:vir:99 358 NTPNMKDDNFSGTQ---SGEAMKYKLFGLEQRTKTKEGLFTKGL-RRRAKLLETILKNTRSIDVSKDF------------ 421 (511) T ss_pred CCcccccccccccc---hHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhcCCccccccc------------ Confidence 766 33344443 344666666666555555555554443 44555444433233333322110 Q ss_pred HhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCC--CHHHHHHHHHHHHHHHHHcCCCCCccccccCCCC Q lcl|NC_019524. 461 ALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGG--DFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQ 538 (556) Q Consensus 461 a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~--D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~ 538 (556) .-+.+.|...-. .|-...+++..++ .|+.|.+..+...+. |+++.++++++|++...+.-..... ...... T Consensus 422 ~~i~i~f~~~~p--~n~~e~~~~~~kl--~GiiS~et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~---~~~~~~ 494 (511) T protein:vir:99 422 NTVRYVYNRNLP--KSLIEELKAYIDS--GGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKNMY---QDPRNI 494 (511) T ss_pred ccceEEeCCCCC--cCHHHHHHHHHHH--hccCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhccc---ccCCCC Confidence 013566754322 4655556555544 599999999999864 8999999999998754433221110 000111 Q ss_pred CCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 539 SSNSSESTSDNPNEETTQ 556 (556) Q Consensus 539 ~~~~~~~~~~~~~~e~~~ 556 (556) ....+++++++.. |.+| T Consensus 495 ~~~~~~~~~~~~~-d~~e 511 (511) T protein:vir:99 495 NDDEQDDSTKDSI-DKKE 511 (511) T ss_pred CCCCCCCCCcCcc-cccC Confidence 1111111111111 1111 No 118 >protein:vir:104338 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398967;genbank:gi:81343951;genbank:GeneID:3778870 Probab=99.31 E-value=1.8e-11 Score=79.51 Aligned_cols=405 Identities=12% Similarity=0.047 Sum_probs=198.9 Q ss_pred hhhhhcchhc-cccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHhhhccCCceeeeecccc Q lcl|NC_019524. 
26 TPMAVGGGME-GAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVVAVHRDSIVGSQYKLNAKPNTI 104 (556) Q Consensus 26 ~~~~~~~~y~-aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v~~~~~nvVG~Gi~~~~~~~~~ 104 (556) .-+.-++.-. +.+.++....++ +..... .....+++-|++++.+|+.+....+-.|+.+... T Consensus 1 ~~~~D~~~n~~~gg~~~~~~~~~-~~~~~~------------~~l~a~Y~~~~l~~~~Vd~~aed~~r~g~~i~~~---- 63 (422) T protein:vir:10 1 MVKTDSYANIFLGGSDGSEIYGS-LQNQAP------------TILASLYADNALVRRIIDTIPETALAAGFHIDGI---- 63 (422) T ss_pred CccchhhHHHHcCCCCCccccCc-ccccCH------------HHHHHHHHhChhhHHHHhhhhHHHhcCCccccCC---- Confidence 0000011000 011111111111 111111 2345789999999999999999999999987532 Q ss_pred ccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEEEeeccCCCCcC----CCcccc Q lcl|NC_019524. 105 VLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLATCEWLNPTGTTM----QRRPFG 180 (556) Q Consensus 105 ~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~~~~----~~~~~~ 180 (556) ++ .++++..|++ +.+.+...-+++....-|.+.+.+...... ..+ ....+. T Consensus 64 ----~~------~~~~~~~~~~--------------l~~~~~l~~a~~~~rl~G~a~i~i~v~d~~-~~~~Pl~~~g~~~ 118 (422) T protein:vir:10 64 ----DD------EPAFWSRWDD--------------LEMTQNINDAWSWARLFGGAAIVAIVKDNR-ALTSPVREGAELE 118 (422) T ss_pred ----CH------HHHHHHHHHH--------------hhHHHHHHHHHHhhccccceEEEEEecCCC-CccccccccCcee Confidence 11 1234444443 233444455555555667776666543321 111 111111 Q ss_pred eEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceeeccccCChhHeEeeec----- Q lcl|NC_019524. 181 TAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPARFDWGRRRVIHIIE----- 255 (556) Q Consensus 181 l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~~~~v~a~~viH~f~----- 255 (556) .|.+|++..|. |.. . ...+.=..+|+|..|+|....++- ...|+.+.|||+-. 
T Consensus 119 -~l~v~d~~~i~-~~~-~-----~~dp~s~~fg~P~~y~v~~~~~~~--------------~~~iH~SRli~~~g~~~p~ 176 (422) T protein:vir:10 119 -TVRVYDRTQVK-VQT-R-----EENPRNARFGEPLTYRITTNESDM--------------FYDVHYSRIHIIDGERIPN 176 (422) T ss_pred -eEEeecccccc-chh-c-----ccCccccccCcceEEEEecCCCCc--------------ceeeccceeEEeCCCCchh Confidence 35667777663 211 1 112222357999999997432211 13577888888732 Q ss_pred -ccCCCcccCCchhhH-HHHHHHHHHHHHHHHHHHHHHh--cceeeeEeccCcccccccccccccccccccccccccccc Q lcl|NC_019524. 256 -ALLAGQTRGISEMVS-ALKQMKMTRNFQEITLQNAVVN--ATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLA 331 (556) Q Consensus 256 -~~r~gQ~RGvs~la~-~l~~l~~l~~~~dael~~a~i~--A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 331 (556) .......-|.|+|.. ++..+++++.-. --.+.+. +.+ .+++.+.-.. ..+.+.... .... T Consensus 177 ~~~~~~~~~G~S~l~~~~~~~i~~~~~~~---~~~~~l~~~~~~-~v~~~~~l~~----~~~~~~~~~--------~~~~ 240 (422) T protein:vir:10 177 VMRRQNDGWGRSVLSSDILDSIKDYTNCE---RLATQLLKRKQQ-AVWKAKGLAE----LCDDSEGFG--------AARL 240 (422) T ss_pred hhcccCCcccchhHHHHHHHHHHHHHHHH---HHHHHHHHHhcc-ccccchhHHH----hcCCccchH--------HHHH Confidence 234555568998876 456566554433 3333321 122 2233321111 110000000 0000 Q ss_pred cccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccc-hhHHHHHHHHH Q lcl|NC_019524. 332 NYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNY-SSARASMAETQ 410 (556) Q Consensus 332 ~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nY-Ss~R~~~~e~~ 410 (556) ........ .-..+.+.....+++++.++.+ -++..+........||++.|||.-.|.|--. ..+ |+.-.-+..+. 
T Consensus 241 r~~~~~~~-~~~~~~~~l~~~~e~~e~~~~~--lsgl~~~~~~~~~~iaaa~~IP~t~L~G~s~-~Glnatgd~d~~~yy 316 (422) T protein:vir:10 241 RLAQVDNN-SGVGQAIGIDAESEEYSVLNSD--IGGIDAFLDKKFDRIVALSGIHEIILKNKNV-GGVSSSQNTALETFH 316 (422) T ss_pred HHHHHHHh-cCCccceeEecCCcceEEEecc--cCChHHHHHHHHHHHHhhhCCCeeeeccCCc-ccccccchHHHHHHH Confidence 00000000 0113445556678999988744 3478999999999999999999988877543 344 45666777888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccch-hhhhHHHHHHHH Q lcl|NC_019524. 411 KYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDE-KKETEAAILRIK 489 (556) Q Consensus 411 r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP-~Ke~~A~~~~i~ 489 (556) ..++.+|+..+..++..++... +++..+. +. ++..|.+...+-+|- .|.+++....++ T Consensus 317 d~i~~~Qe~~l~p~l~~l~~~i----~~s~~~~------~~-----------f~pL~~~sekekaei~~~~a~a~~~~~~ 375 (422) T protein:vir:10 317 KLVDRKRNAELLPILEFLIPFI----VNAEEWS------VE-----------FNPLAQESSKDKAEILEKNVNSIAALIA 375 (422) T ss_pred HHHHHHHHHHHHHHHHHHHHHh----cccCCcE------EE-----------eCCCCCCCHHHHHHHHHHHHHHHHHHHh Confidence 8999999776655555554433 3332111 10 122444444444443 466677777777 Q ss_pred cCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 490 NGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 490 ~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 556 (556) +|+.|..++.. .+... 
....|+..+..+ ....+.+..++|+++.++ T Consensus 376 ~g~i~~~e~r~-----------~L~~~---~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~ 421 (422) T protein:vir:10 376 AGAMDIDEARD-----------TLRTI---APEVKINDGSVE-------TEVTISETSNDPLEVPTD 421 (422) T ss_pred cCCCCHHHHHH-----------Hhhhh---cccccCCCCCCc-------cccchhhcCCCCCCCCCC Confidence 77766655543 33211 122333211111 111111112222333333 No 119 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=99.31 E-value=1.3e-10 Score=74.79 Aligned_cols=457 Identities=10% Similarity=0.014 Sum_probs=206.6 Q ss_pred CCcchhhhHHHHHhhHhhcccchh-h---hhhhhcc--------hhccccCCCcccccccCCCCCHHHHHHHHHHHHHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETAT-A---TPMAVGG--------GMEGAERTTREMFQWNPSIISPDQQIAQNQDMASAR 68 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~-~---~~~~~~~--------~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~R 68 (556) |-|..---....-+...+...... . ....... .|+-+.+ ....+ + .+.. ..+ T Consensus 1 ~~~~~~~~~~~~~~~~~l~~~e~~~i~~L~~~~~~~~~r~~~l~~YY~G~~---~i~~~-~--~~~p----~~~------ 64 (504) T protein:vir:99 1 MTEETTSASKFTFRIPELNDDVVDKVNGLYQQLVDRTPRNLLRASFYDGKY---AIRQI-G--NLIP----PEY------ 64 (504) T ss_pred CCccCCcccccccccCCCCHHHHHHHHHHHHHHHHHhHHHHHHHHHHhccc---cchhc-c--cccc----HHH------ Confidence 333322111111111111000000 0 0000000 0110000 00000 0 0000 011 Q ss_pred HHHHHhcChHHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHH Q lcl|NC_019524. 69 AQDMVQNDGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTR 148 (556) Q Consensus 69 aRdl~rNn~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~ 148 (556) +++.--..|++-+|+.+++.++..||+.. + ++. .-+.+|+-|.. + +|...+. 
T Consensus 65 -~~~~~v~n~~~~iVd~~a~rl~~~Gf~~~---d-------~~~------~~~~l~~i~~~-----N------~ld~~~~ 116 (504) T protein:vir:99 65 -LRTATVLGWSAKAVDTLARRCNLESFVWP---D-------GDY------GSIGGPDVWDE-----N------FFATKAN 116 (504) T ss_pred -HHHhhccCcHHHHHHHHHhhhccceeeCC---C-------CCh------hhHHHHHHHHh-----c------ChhhHHH Confidence 11112346788899999999988998642 1 111 11234544432 2 4777888 Q ss_pred HHhhhheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEE---ECCCCCeEEEEEeecCC Q lcl|NC_019524. 149 LAVSGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQ---LDNNGAALGYWLRKAFP 225 (556) Q Consensus 149 l~~r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE---~d~~Gr~vaY~i~~~hp 225 (556) .+.+..++-|.+|+.. +... ++.+. ..|+.++|..+-.-++. ..+.+..++. .|..|.+...-++. | T Consensus 117 ~~~~~a~iyG~af~~v-~~~~-----d~~~~-~~I~~~sP~~~~~iyD~-~~~~~~~a~~~~~~d~~g~~~~~~~y~--~ 186 (504) T protein:vir:99 117 NAMVSSLIHGPAFLIN-TEGG-----AGEPD-SLIHVKSAMQATGEWNS-RRNAMDSLLSITSRDAEGHPTGIALYE--D 186 (504) T ss_pred HHHHHHHhhCceeEEE-ecCC-----CCCce-eEEEEeccceeEEEEeC-CCCceeEEEEEEEecCCCeEEEEEEEc--C Confidence 9999999999999775 4321 22222 25888999988543432 2344556654 46777766433332 2 Q ss_pred CccccC---CccccceeeccccCChhHeEeeecccCCCcccCCchhh-HHHHHHHHHHHHHHHHHHHHHHhccee---ee Q lcl|NC_019524. 226 GDPTDM---EQWKWGYEPARFDWGRRRVIHIIEALLAGQTRGISEMV-SALKQMKMTRNFQEITLQNAVVNATYA---AS 298 (556) Q Consensus 226 gd~~~~---~~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la-~~l~~l~~l~~~~dael~~a~i~A~~~---~f 298 (556) +..+.. ....|..-......+.+ |+|+....+.+-.=|.|.+. 
+++..+ +.|.......+..+-.|+ .+ T Consensus 187 ~~~~~~~~~~~~~~~~~~~~~~~gvP-vV~~~n~~~~~~~~G~sei~~~v~~l~---Da~~~~~~~~~~~~e~~a~p~r~ 262 (504) T protein:vir:99 187 GVTVTADMDDDGDWHADVRTHKLGVP-VEVLPYKPREDRPLGSSRITRPVMSLQ---QRALKGCIRMDGHADVYSFPQLI 262 (504) T ss_pred CcEEEEEEcCCceeeeccccCCCCcc-eEEecccccCccccCcccchhhHHHHH---HHHHHHHHHHHHHHHHhcchhhh Confidence 221111 11222211111222333 88888877777667889887 444443 333333333332222222 12 Q ss_pred EeccCcccccccccccccccccccccccccccccccccccceecCCceeeecCC--------CceeeeecCCC-CCccHH Q lcl|NC_019524. 299 VESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYP--------GTKLKMQPAGT-PGGVGT 369 (556) Q Consensus 299 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~p--------Ge~i~~~~~~~-p~~~f~ 369 (556) |.--.+++.. .. +...........+.|..+++ |.++++.+.+. .-.+|. T Consensus 263 i~G~~~~~~~--------~~--------------d~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~q~~~~~l~~~~ 320 (504) T protein:vir:99 263 LLGADAKNFR--------NK--------------DGSMKPAWQIALARVFALPDDEDEPDAARARADVKQFPASSPQPHI 320 (504) T ss_pred hccCCccccc--------cc--------------cccccchhhhhhhhhhcCCCccccccccCccceeeecCCCChHHHH Confidence 2110000000 00 00011111223455555554 33455443222 123566 Q ss_pred HHHHHHHHHHHHhcCCCHHHhhchhhcc---cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCC Q lcl|NC_019524. 370 DYEQSLLRNIAASLGMSYEQFSRDYTKT---NYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPP 446 (556) Q Consensus 370 ~F~~~~lr~iaaglGi~ye~l~~D~s~~---nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~ 446 (556) ..++..++.|++-.++|-+.| |..+.. |--++++.+....+..+..|..|-..+.+ +++.. .++..+.-..+. 
T Consensus 321 ~~l~~~i~~~a~~t~~P~~~l-G~~~~~n~sSa~Ai~~~~~~L~~ka~~k~~~f~~~l~~-~~rla--~~~~~~~~~~~~ 396 (504) T protein:vir:99 321 EMLEQIAMMFSGETSIPVESL-GFSNRANPTSADAYIASREDLIAEAEGATDDWSPAFRR-SMIRA--LAIKNGLDRIPP 396 (504) T ss_pred HHHHHHHHHHHhhhCCCHHHh-cccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHH--HHHhcCCCcccc Confidence 677888999999999998766 333333 33466777777777777777777666544 44432 234443211221 Q ss_pred CcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCC--CCHHHHHHHh-CCCHHHHHHHHHHHHHHHHHc Q lcl|NC_019524. 447 GKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGL--STYEAEISRL-GGDFREVFKQRAREEGLIKSL 523 (556) Q Consensus 447 ~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~--~s~~~~~ae~-G~D~e~v~~q~a~E~~~~~~~ 523 (556) .. .-+.+.|..+... +-...+.|..+.+.+|. ...++.+.++ |.+++++.+..+. ++..... T Consensus 397 ~~------------~~~~v~w~d~~~~--s~a~~aDa~~Kl~~ag~~l~~~~~~l~~~lg~~~~ei~r~~~e-~~~~~~~ 461 (504) T protein:vir:99 397 EW------------KTIDSKFRSPLYL--SKAAQADAGAKMLGAGPEWLKETEVGLELLGLTPQQAKRALAE-RRRASSV 461 (504) T ss_pred cc------------ccceeEecCCCcc--CHHHHHHHHHHHHhhccccccchHHHHhhcCCCHHHHHHHHHH-HHHHhhH Confidence 10 1145678776655 44566777788888885 4455666554 9999987543322 2221111 Q ss_pred CCC---CCccccc--cCCCCC---CCCCCCCCCCCCCcCCC Q lcl|NC_019524. 524 KLD---FTGKMVE--GNSTQS---SNSSESTSDNPNEETTQ 556 (556) Q Consensus 524 Gl~---~~~~~~~--~~~~~~---~~~~~~~~~~~~~e~~~ 556 (556) ++. .+..+.. ...... 
..+..++........+| T Consensus 462 ~~~~~l~~~~~~~~~~~~~~~~~~~e~a~~~~~~~~~~p~~ 502 (504) T protein:vir:99 462 SIIEALNRRQQEAATAGEDQDQGAGEPPANEPPAALGRPTL 502 (504) T ss_pred HHHHHHhcccCCCCCCCCCCCcCCCCCCCCCCCccCCCccc Confidence 110 0000000 000000 00001111111222222 No 120 >protein:vir:79647 Length: 435 # NCBI annotation: PorT # Family: family:all:297 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285520;genbank:gi:148734503;genbank:GeneID:5220005 Probab=99.30 E-value=1.7e-11 Score=79.57 Aligned_cols=411 Identities=10% Similarity=0.047 Sum_probs=200.7 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhcc--ccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEG--AERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGY 78 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~a--a~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~ 78 (556) |+.-+..-|+ .-+|.- .++.+.....|.+... .+ . ...-.+++-|++ T Consensus 5 m~~~~~~~~~--------------------~D~~~~~~~~~~g~~~~~~~~~~~-~~------~----~~l~~~Y~~~~l 53 (435) T protein:vir:79 5 MSDKVKAITK--------------------EDGYNEIFGSKDGTFRPNAFYMQR-AA------F----KALSQFYEEDGM 53 (435) T ss_pred cccccccchh--------------------hcchhhhhcccccccccCcccCCc-CC------H----HHHHHHHhcCch Confidence 4433211111 011210 1122222223332221 11 1 123567899999 Q ss_pred HHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecC Q lcl|NC_019524. 79 AAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTG 158 (556) Q Consensus 79 a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dG 158 (556) ++.+|+.+.+..+..|+.+...- + .+++++.|++. 
...+...-+++....-| T Consensus 54 ~~~~Vd~~aed~~r~g~~i~g~~-----------~---~~~~~~~~~~l--------------~~~~~l~~a~~~~rl~G 105 (435) T protein:vir:79 54 ARRIVDVIPEEMVTPGFKVDGVK-----------N---EKSFKSRWDEL--------------RLNAKIIDALSWSRLFG 105 (435) T ss_pred hhhhhccchHHhhcCCceecCCC-----------h---HHHHHHHHHHh--------------hHHHHHHHHHHhhhccc Confidence 99999999999999998865321 1 13455555532 22233334444445556 Q ss_pred ceEEEEeeccCCCCcCCCcccce-------EEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccC Q lcl|NC_019524. 159 EVLATCEWLNPTGTTMQRRPFGT-------AIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDM 231 (556) Q Consensus 159 E~f~~~~~~~~~~~~~~~~~~~l-------~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~ 231 (556) .+.+.+...... ..+ -|| .|.+|++..|. |.. .+ ...--..+|+|..|+|... +.. T Consensus 106 ~~~i~i~~~d~~-~~~----~Pl~~~g~i~~i~v~d~~~i~-~~~-~~-----~dp~sp~fg~P~~y~v~~~---~~~-- 168 (435) T protein:vir:79 106 GSAILAVVADNK-MLK----SPVKPGAQLEDIRVYDRYQIT-IHE-RE-----TNARSVRYGEPKLYKISPG---GDI-- 168 (435) T ss_pred cEEEEEEecCCC-Ccc----cccccCCceeeEEeechhhcc-chh-hc-----cCCcccccCcceEEEEecC---CCC-- Confidence 665555443211 111 122 45677777663 211 11 1112235799999998532 111 Q ss_pred CccccceeeccccCChhHeEeee------cccCCCcccCCchhh-HHHHHHHHHHHHHHHHHHHHHHh--cceeeeEecc Q lcl|NC_019524. 232 EQWKWGYEPARFDWGRRRVIHII------EALLAGQTRGISEMV-SALKQMKMTRNFQEITLQNAVVN--ATYAASVESE 302 (556) Q Consensus 232 ~~~~~~rv~~~~~v~a~~viH~f------~~~r~gQ~RGvs~la-~~l~~l~~l~~~~dael~~a~i~--A~~~~fi~~~ 302 (556) ....|+.++|||+- .....+..-|.|+|. +++..+++.+. ++.-.+.+. +.+. 
+++.+ T Consensus 169 ---------~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~e~~~~~l~~~~~---~~~~~~~l~~~~~~~-v~~~~ 235 (435) T protein:vir:79 169 ---------PEFFVHYSRICIIDGERVSNEKRRQNDGWGASILNKRLIEAIVDYNY---CQELATQLLRRKQQA-VWKAR 235 (435) T ss_pred ---------CceEEcceeEEEecCCcchhhhccccCcccchHHHHHHHHHHHHHHH---HHHHHHHHHHHhcCc-cccch Confidence 12357888899873 344556677888774 56665655544 333333332 2222 22322 Q ss_pred CcccccccccccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHh Q lcl|NC_019524. 303 LPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAAS 382 (556) Q Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaag 382 (556) .-. ...+.+.... ....-. ........ ..|.+.....+++++.++.+ -++..+........||++ T Consensus 236 ~l~----~~~~~~~~~~--~~~~r~------~~~~~~~~-~~~~~~i~~~~e~~e~~~~~--lsgl~~~~~~~~~~iaaa 300 (435) T protein:vir:79 236 DLA----LMCDDEEGRY--AARLRL------AQVDDESG-VGKAIGIDATDEEYEVLNSD--VSGVPEFLQEKIDRIVAL 300 (435) T ss_pred hHH----HhhcCccchH--HHHHHH------HHHHHhcC-CCCceeEecCCcceEEEecc--cCCHHHHHHHHHHHHHhh Confidence 111 1111000000 000000 00000000 13445455667889888744 457899999999999999 Q ss_pred cCCCHHHhhchhhcccc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHH Q lcl|NC_019524. 383 LGMSYEQFSRDYTKTNY-SSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDA 461 (556) Q Consensus 383 lGi~ye~l~~D~s~~nY-Ss~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a 461 (556) .|||.-.|.|--.+ .. |+.-.-+..+...++.+|+..+..++..+++.. +++ |. +. .. 
T Consensus 301 ~~IP~t~L~G~s~~-glnstgd~d~~~yyd~i~~~Qe~~l~p~l~~l~~li----~~s-----~d-~~--~~-------- 359 (435) T protein:vir:79 301 TGIHEIIIKNKNTG-GVSASQNTALETFYKLIDRKRVEDYKPILEFLLPFM----ISE-----TE-WS--IE-------- 359 (435) T ss_pred hCCCeeeeccCCcc-ccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----hcC-----CC-Ce--EE-------- Confidence 99999888775433 33 556777888899999999776655555544433 222 11 11 00 Q ss_pred hhCeeeecCcccccch-hhhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCC Q lcl|NC_019524. 462 LCNAEWIGASRGQIDE-KKETEAAILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSS 540 (556) Q Consensus 462 ~~~~~w~~p~~~~iDP-~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~ 540 (556) ++.-|.+...+.+|- .|.+++....+++|+.++.++-. .+ +....+.|+...... . -.. T Consensus 360 -f~pL~~~sekEkAei~~~~a~a~~~~~~~g~i~~~e~r~-----------~L---~~~~~~~~~~~~~~~-~----~~~ 419 (435) T protein:vir:79 360 -FEPLSVPSDKDKAEIMAKNVESVVKLKAEQAINLKETRD-----------TL---RSICPDLKIMDNDNI-E----LPE 419 (435) T ss_pred -eCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHH-----------HH---HHhccccCCCCcccc-c----CCc Confidence 122344444444443 56677777777777777665542 22 123334444321111 0 001 Q ss_pred CCCCCCCCCCCCcCCC Q lcl|NC_019524. 541 NSSESTSDNPNEETTQ 556 (556) Q Consensus 541 ~~~~~~~~~~~~e~~~ 556 (556) ....+.++.+|...+| T Consensus 420 ~~d~~~~~~~e~g~~~ 435 (435) T protein:vir:79 420 PEDLDPEPGQEGGLNK 435 (435) T ss_pred cccCCCCCCCCCCCCC Confidence 1111111111222222 No 121 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=99.26 E-value=2.8e-10 Score=72.98 Aligned_cols=430 Identities=10% Similarity=0.001 Sum_probs=201.9 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCC--CcccccccCCCCCHHHHHHHHHHHHHHH---HHHHHh- Q lcl|NC_019524. 
1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERT--TREMFQWNPSIISPDQQIAQNQDMASAR---AQDMVQ- 74 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~--~r~~~~w~~~~~s~~~~i~~~~~~lr~R---aRdl~r- 74 (556) |...+.-+ .+.-+... -+....+.+ .-....|.......+.| ..+.+. T Consensus 1 ~~~~~~~~------------------------~~~~~~~~~~~~~~~~~~~--~~i~~~i~~~~~~~~~~~~~l~~Yy~g 54 (470) T protein:vir:99 1 MKDINYGR------------------------DKVTGNSSFIFPKGEKLTS--NELLGFIAYNETVLKPRYRENMKLYLG 54 (470) T ss_pred CccccCCc------------------------ccccCCceEEeCCCCCcCH--HHHHHHHHHHHHhhHHHHHHHHHHhcc Confidence 22222211 11101000 000000000 00111111111111111 111111 Q ss_pred --------------c----ChHHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhccccccee Q lcl|NC_019524. 75 --------------N----DGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFD 136 (556) Q Consensus 75 --------------N----n~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD 136 (556) | +++++-+|+..+++++|.+++..+.-+ ....+.+ ++-|.. T Consensus 55 ~~~i~~~~~~~~~~~~ki~~n~~~~Ivd~~~~~l~g~p~~~~~~~d-----------~~~~~~l---~~~~~~------- 113 (470) T protein:vir:99 55 KHKILTAPEKETGADNRIVVNSAKYVVDVYNGYFCGIEPKLALLND-----------SSKIDEI---ARWNRQ------- 113 (470) T ss_pred ccccccCcccccCCcceeecchHHHHHHHHhhhhccCCeeEeeCCc-----------hhHHHHH---HHHHHh------- Confidence 2 369999999999999999988765321 1112233 332321 Q ss_pred hhcccCHHHHHHHHhhhheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEE---CCCC Q lcl|NC_019524. 137 ARRMCTLTGLTRLAVSGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQL---DNNG 213 (556) Q Consensus 137 ~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~---d~~G 213 (556) .+|......+.+..+.-|.+|+.+.. ... 
+ -+++..++|+.+-.-++......+..+|.+ +..+ T Consensus 114 ----n~~~~~~~~~~~~~~~~G~~~~~v~~-d~d-----g---~~~i~~~~p~~~~~i~d~~~~~~~~~~vr~~~~~~~~ 180 (470) T protein:vir:99 114 ----ENFFDTINEISKQCDIFGRSIASIYQ-GED-----A---RPHLMYSSPNHAFIIYDDTVQRQPLAFVHYQIDNSNN 180 (470) T ss_pred ----cCHhHHHHHHHHHHHhcCeeEEEEEe-CCC-----C---eEEEEEEccceeEEEEcCCCCcceEEEEEEEEEecCC Confidence 25777778888889999999987543 221 1 157899999988533332333446666654 2222 Q ss_pred CeEEE-EEeecCCCccccCC----ccccceee-ccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 214 AALGY-WLRKAFPGDPTDME----QWKWGYEP-ARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQ 287 (556) Q Consensus 214 r~vaY-~i~~~hpgd~~~~~----~~~~~rv~-~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~ 287 (556) ...-| .++.. ...+... ...+..+. ....++.-=|+++.. -..|+|.|.+++..+..++....-... T Consensus 181 ~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~g~sd~e~v~~liDa~~~~~s~~~~ 253 (470) T protein:vir:99 181 WTDAYGVIQYA--DKFYKFKGYDIEEDTNAAGYAINPYGLVPAVEFFE-----NEERQGIFDSIKTLINALDKVISQKAN 253 (470) T ss_pred eeEEEEEEEec--CeEEEEEecccccccccccccccCCCccceEeecC-----CCCCCcchHhHHHHHHHHHHHHHHHHH Confidence 22222 22211 1111000 00000000 000111111344333 236999999988887766665554444 Q ss_pred HHHHhcceeeeEeccCcccccccccccccccccccccccccccccccccccceecCCceeeec-----CCCceeeeecCC Q lcl|NC_019524. 288 NAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHL-----YPGTKLKMQPAG 362 (556) Q Consensus 288 ~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L-----~pGe~i~~~~~~ 362 (556) ...-.+.--.+++-...++. 
+ .+.....+....+..+ ..|-++++++.+ T Consensus 254 ~~~~~~~~~~~i~g~~~~~~----------~----------------~g~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ 307 (470) T protein:vir:99 254 QVEYFDNAYMYMIGFKLPED----------D----------------EGNPKFDFKNNRVLYVSQLDPDTNPQIGFIAKP 307 (470) T ss_pred HHHHhcCceeeeecCCcccc----------c----------------ccchhhhhhhcceeeecCCCCCCCCcceEEeec Confidence 44444433344442211110 0 0000111222222222 245568888888 Q ss_pred CCCccHHHHHHHHHHHHHHhcCCCHHHhhchhh-cccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCC Q lcl|NC_019524. 363 TPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYT-KTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGN 441 (556) Q Consensus 363 ~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s-~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~ 441 (556) .+...+..+.+.+.+.|....++|-... ..++ +.|-.+++..+..........+..|-. .++.+++..+...-..+. T Consensus 308 ~~~~~~~~~~~~l~~~i~~~s~~p~~~~-~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~-~l~~~~~li~~~~~~~~~ 385 (470) T protein:vir:99 308 DADQMQENLIQHLTDFIFMMAMVPNIQD-KNFAGNSSGVALQYKLFAMKNKADSKERKFDK-SLMQLYRIVLATLFNNKQ 385 (470) T ss_pred CChHHHHHHHHHHHHHHHHHhCCccccc-cccccCchHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhccCC Confidence 8888999999999999999999983222 2232 233334444444443334444444433 334455544433222222 Q ss_pred ccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhC-CCHHHHHHHHHHHHHHH Q lcl|NC_019524. 442 VPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRLG-GDFREVFKQRAREEGLI 520 (556) Q Consensus 442 l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G-~D~e~v~~q~a~E~~~~ 520 (556) .... ..-+.+.|.++- + .|....+++.... .|+.|.+..+...| .|+++-++++.+|++.. 
T Consensus 386 ~~~~--------------~~~i~v~f~~~~-p-~~~~e~a~~~~kl--~giis~et~l~~l~~vd~~~E~eri~~E~~~~ 447 (470) T protein:vir:99 386 DQEL--------------WSELDFKFTRNL-P-EDMASAIDNAKNA--EGIVSKKTQLGMIPDIEPDAEMKQIAKEKADA 447 (470) T ss_pred cccc--------------cccceEEeCCCC-C-cCHHHHHHHHHHH--hccCCHHHHHHhCCCCCHHHHHHHHHHHHHHH Confidence 2110 012456773332 2 5777777776665 48999999998885 58898899999887644 Q ss_pred HHcCCCCCccccccCCCCCCCCCCCCCCCCCCcCC Q lcl|NC_019524. 521 KSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEETT 555 (556) Q Consensus 521 ~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~ 555 (556) .+.-.. .....+..+.++++|.+ T Consensus 448 ~~~~~~------------~~~~~d~~~~d~~~ee~ 470 (470) T protein:vir:99 448 IKQTQQ------------LSMPIDILKRDNNAEEE 470 (470) T ss_pred HHHHHh------------hcCCCCcCCCCCCccCC Confidence 322110 00111111111111111 No 122 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=99.24 E-value=3.5e-10 Score=72.42 Aligned_cols=409 Identities=11% Similarity=0.030 Sum_probs=202.0 Q ss_pred cccchh--hh------hhhhcc-----hhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHH Q lcl|NC_019524. 19 VAETAT--AT------PMAVGG-----GMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVVAV 85 (556) Q Consensus 19 ~~~~~~--~~------~~~~~~-----~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v~~ 85 (556) ...... .. .....+ .|.-+... ...+ ...... . .+++-.-+.|++-+|+. T Consensus 1 ~~~~~~~~i~~l~~~~~~~~~r~~~l~~Yy~G~~~---i~~~---~~~~~~----~-------~~~~k~~~n~~~~ivd~ 63 (441) T protein:vir:80 1 MNSDELALIEGMYDRIQRLSSWHCCIEGYYEGSNR---VRDL---GVAIPP----E-------LQRVQTVVSWPGIAVDA 63 (441) T ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCc---chhc---Ccccch----h-------hhhhhhhcchHHHHHHH Confidence 111110 11 111000 22211111 1111 011110 1 12223345688899999 Q ss_pred HHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEEEe Q lcl|NC_019524. 
86 HRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLATCE 165 (556) Q Consensus 86 ~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~~~ 165 (556) .++.+++.||+.. + .+.+.+.|+ . .+|..++..+.+..+.-|.+|+.. T Consensus 64 ~~~~l~~~g~~~~---d--------------~~~l~~i~~---~-----------n~~~~~~~~~~~~~~~~G~a~~~v- 111 (441) T protein:vir:80 64 LEERLDWLGWTNG---D--------------GYGLDGVYA---A-----------NRLATASCDVHLDALIFGLSFVAI- 111 (441) T ss_pred HHhhhccccccCC---C--------------hHHHHHHHH---h-----------cCHHHHHHHHHHHHhhcCeeEEEE- Confidence 9999988887521 1 122333332 1 268899999999999999999874 Q ss_pred eccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECC-CCCeEEEEEeecCCCccc---cCCccccceeec Q lcl|NC_019524. 166 WLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDN-NGAALGYWLRKAFPGDPT---DMEQWKWGYEPA 241 (556) Q Consensus 166 ~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~-~Gr~vaY~i~~~hpgd~~---~~~~~~~~rv~~ 241 (556) +.... +. .++..++|+.+..-++...+..+...+.++. .+...-..++. ++..+ ......|..+.. T Consensus 112 ~~d~~-----g~---~~i~~~~p~~~~~i~d~~~~~~~~~~~~~~~~~~~~~~~~vy~--~~~~~~~~~~~~~~~~~~~~ 181 (441) T protein:vir:80 112 IPHGD-----GT---VSVRPQSPKNCTGKFSADGSRLDAGLVVQQTCDPEVVEAELLL--PDVIVQVERRGSREWVEVDR 181 (441) T ss_pred EeCCC-----Cc---eEEEEEccceEEEEEeCCCCceeEEEEEEEEecCceEEEEEEe--cCeEEEEEEcCCcceeeccc Confidence 43221 22 3789999999864333333333333333321 12111111221 11111 011122332221 Q ss_pred c-ccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhccee---eeEeccCcccccccccccccc Q lcl|NC_019524. 242 R-FDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYA---ASVESELPSDVVFGQLGMGQG 317 (556) Q Consensus 242 ~-~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~---~fi~~~~~~~~~~~~~~~~~~ 317 (556) . ..++.--|+|+....+.+..-|.|.|+..+..|-| .|.......+.+.-.++ .+|+-...++. 
T Consensus 182 ~~~~~g~vPvv~~~n~~~~~~~~G~s~l~~~v~~liD--a~~~~~s~~~~~~~~~~~~~~~i~G~~~~~~---------- 249 (441) T protein:vir:80 182 IPNVLGAVPLVPIVNRRRTSRIDGRSEITRSIRAYTD--EAVRTLLGQSVNRDFYAYPQRWVTGVSADEF---------- 249 (441) T ss_pred cccCCCceeEEEeeccccCCccCCcccchhhHHHHHH--HHHHHHHHHHHHHHhhcCceeeeecCCcccc---------- Confidence 1 13444458998888899999999999875544433 44444444333332222 33331110000 Q ss_pred cccccccccccccccccccccceecCCceeeecCCCce---eeeecCCCCCccHHHH---HHHHHHHHHHhcCCCHHHhh Q lcl|NC_019524. 318 GFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTK---LKMQPAGTPGGVGTDY---EQSLLRNIAASLGMSYEQFS 391 (556) Q Consensus 318 ~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~---i~~~~~~~p~~~f~~F---~~~~lr~iaaglGi~ye~l~ 391 (556) ......+.+|.|..+.++++ +++.+.+ .++.+.| ++..++.|+...++|.+.+. T Consensus 250 ------------------~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~--~~~~~~~~~~l~~~i~~~~~~~~~p~~~~g 309 (441) T protein:vir:80 250 ------------------SQPGWVLSMASVWAVDKDDDGDTPNVGSFP--VNSPTPYSDQMRLLAQLTAGEAAVPERYFG 309 (441) T ss_pred ------------------ccchhhhcccccccCCCCCCCCcceeEecC--ccchHHHHHHHHHHHHHHhcccCCCHHHhc Confidence 00112245666766665543 4443322 3344555 55559999999999988885 Q ss_pred chhhc-ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecC Q lcl|NC_019524. 392 RDYTK-TNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGA 470 (556) Q Consensus 392 ~D~s~-~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p 470 (556) +...+ +|-.++|..+..........+..|-..+.+.+. +-.+++......+.. 
..-+.+.|..+ T Consensus 310 ~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~---l~~~~~~~~~~~~~~------------~~~i~~~f~~~ 374 (441) T protein:vir:80 310 FITSNPPSGEALAAEESRLVKRAERRQTSFGQGWLSVGF---LAAKALDSRVDEADF------------FGDVGLRWRDA 374 (441) T ss_pred cCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHhcCCCccccc------------ceeeeEEeCCC Confidence 54321 245567777777777777777777776655432 222333322211110 01135678776 Q ss_pred cccccchhhhhHHHHHHHHcCCCCHH--HHHHHhCCCHHHHHHHHHHHHHHHHH-cCCCCCccccccCCCCCCCCCCCCC Q lcl|NC_019524. 471 SRGQIDEKKETEAAILRIKNGLSTYE--AEISRLGGDFREVFKQRAREEGLIKS-LKLDFTGKMVEGNSTQSSNSSESTS 547 (556) Q Consensus 471 ~~~~iDP~Ke~~A~~~~i~~G~~s~~--~~~ae~G~D~e~v~~q~a~E~~~~~~-~Gl~~~~~~~~~~~~~~~~~~~~~~ 547 (556) .. .|-...+++..+.+.+|..+.+ ..+...|...+++ +++.+|++...+ ++-.... T Consensus 375 ~~--~~~~e~ad~~~kl~~~g~~~~s~~~~~~~l~~~~~e~-~~~~~e~~e~~~~~~~~~~~------------------ 433 (441) T protein:vir:80 375 ST--PTRAATADAVTKLVGAGILPADSRTVLEMLGLDDVQV-EAVMRHRAESSDPLAVLAGA------------------ 433 (441) T ss_pred CC--cCHHHHHHHHHHHHhcCcccccHHHHHHhCCCCHHHH-HHHHHHHHHHHHHHHHHhhh------------------ Confidence 65 4778888999999999986544 3344447765543 455555433322 1100000 Q ss_pred CCCCCcCCC Q lcl|NC_019524. 548 DNPNEETTQ 556 (556) Q Consensus 548 ~~~~~e~~~ 556 (556) .+.+++| T Consensus 434 --~~~~~~~ 440 (441) T protein:vir:80 434 --ISRQTNE 440 (441) T ss_pred --hhccccc Confidence 1111111 No 123 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=99.24 E-value=3.5e-10 Score=72.40 Aligned_cols=438 Identities=8% Similarity=-0.029 Sum_probs=202.4 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHH Q lcl|NC_019524. 
1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAA 80 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~ 80 (556) +.+........-.+....... ...+. .....|.-+.. .-+. ++...... .... ..++.. --.+++++ T Consensus 21 ~~~~~~~~~~~i~~~i~~~~~-~~~~~-~~l~~Yy~g~~--~i~~--~~~~~~~~----~~~~--~~~~~~-ki~~n~~k 87 (474) T protein:vir:96 21 MKPKVETQEEMIIRLINNHKQ-KLKDI-NVGQKYYDKDN--DINY--QAYKQDLH----GNID--YTKPDW-RITTNFHQ 87 (474) T ss_pred ccccccchHHHHHHHHHHHHH-HHHHH-HHHHHHhcccC--cccc--ccchhhhc----cccc--cccccc-ccccchHH Confidence 111111111111111111111 01111 11223332221 1110 01000000 0000 001110 01357999 Q ss_pred HHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCce Q lcl|NC_019524. 81 GVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEV 160 (556) Q Consensus 81 ~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~ 160 (556) -+|+..+.+++|.+++..+. + +.....++.|..+ +|.....-+.+....-|.+ T Consensus 88 ~Iv~~~~~yl~g~p~~~~~~---------~-------~~~~~~l~~~~~n-----------~~~~~~~~l~~~~~~~G~~ 140 (474) T protein:vir:96 88 NLVDQKVSYVAGKPVTYAHD---------D-------DKVLDVIHQVLDT-----------RWDNKLIDILTAASNKGID 140 (474) T ss_pred HHHHhhhhhhcccCceeccC---------C-------hHHHHHHHHHHhc-----------cHHHHHHHHHHHHhhCCeE Confidence 99999999999998887542 1 1223344555421 4566666667888999999 Q ss_pred EEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccC--------- Q lcl|NC_019524. 161 LATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDM--------- 231 (556) Q Consensus 161 f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~--------- 231 (556) |.++ +....+ .+++..++|+.+---+.......+..+|++-..-....|.++..+---.+.. 
T Consensus 141 ~~~~-~~d~~~--------~~~i~~~~p~~~~~v~d~~~~~~~~a~ir~~~~~~~~~~~vy~~~~i~~~~~~~~~~~~~~ 211 (474) T protein:vir:96 141 WLQV-YINEDG--------ELKLFRVPAEQAIPIWTDKEREQLNAFIRIFTFNGETKVEYWTAETVTYYVYENGGLIPDF 211 (474) T ss_pred EEEe-eeCCCC--------ceEEEEEcccceEEEEcCCCCCceEEEEEEEeecCeeEEEEEeCCeEEEEEEcCCceeecc Confidence 9775 433221 2588899998885433333334466666542211112222222110000000 Q ss_pred -Cccccceeecc-ccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccc Q lcl|NC_019524. 232 -EQWKWGYEPAR-FDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVF 309 (556) Q Consensus 232 -~~~~~~rv~~~-~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~ 309 (556) ....+..+... ..++.==|+++... ..|.|.|.+++..+..++....-......-.+.-..+++.-.++ T Consensus 212 ~~~~~~~~~~~~~~~~~~vPvv~~~nn-----~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~---- 282 (474) T protein:vir:96 212 YYGDEHIQTHFSTGSWERVPFIAFKNN-----PEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILRGYEGE---- 282 (474) T ss_pred ccccccccCcccccCCCccceEEecCC-----CCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcCCCcc---- Confidence 00000000000 00111113444332 46999999988877666643332222222111111222210000 Q ss_pred cccccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCH-- Q lcl|NC_019524. 
310 GQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSY-- 387 (556) Q Consensus 310 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~y-- 387 (556) + .+.....+..+.+..+..|.++++++.+.+...+..+.+.+.+.|....++|- T Consensus 283 --------~----------------~~~~~~~~~~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~ 338 (474) T protein:vir:96 283 --------D----------------LSEFMEGLKYYKAINVSSDGGVETIQVEVPVASTKEYLDMMRAYIVEFGQGVDFQ 338 (474) T ss_pred --------c----------------ccchhhhhhccceeeccCCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcCcc Confidence 0 00111123445566778889999999999999999999999999999988772 Q ss_pred -HHhhchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCee Q lcl|NC_019524. 388 -EQFSRDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAE 466 (556) Q Consensus 388 -e~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~ 466 (556) +.++++ .|--+.|..+..........+..|- ..++.+++..+. ++..... ..-+.+. T Consensus 339 ~~~~~~n---~Sg~Alk~~~~~l~~k~~~~~~~~~-~~l~~~~~~i~~--~~g~~~d----------------~~~i~i~ 396 (474) T protein:vir:96 339 TDKFGSA---TSGIALKFLYTNLNLKANKLKNKAN-VALQELMQFILD--FNKIKLD----------------AKEIEIT 396 (474) T ss_pred ccccccc---cHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHH--HhCCCcc----------------cceeeEE Confidence 223332 2333455444444344444444433 333444443322 2221110 0123466 Q ss_pred eecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCC--CHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCC Q lcl|NC_019524. 467 WIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGG--DFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSE 544 (556) Q Consensus 467 w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~--D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~ 544 (556) |..+ .|.-+........++|+.|.+..+...+. |+++.++++++|++...+. 
+....+ ......+++ T Consensus 397 f~~~-----~p~~~~e~a~~~~~~giiS~et~~~~lp~v~D~~~E~eri~~E~~~~~~~-~~~~~~-----~~~~~~~~~ 465 (474) T protein:vir:96 397 FNFN-----VMVNDLEQSQIGAQSQYLSKETLVRHHPWVDDPKAELERLDEEQLELNKQ-LPNLDD-----GGADGAQQQ 465 (474) T ss_pred ecCC-----CccCHHHHHHHHHHcCCCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHhh-cccccc-----ccCCCCCCc Confidence 6433 33334444445677899999999999975 9999999999887654332 111000 011111222 Q ss_pred CCCCCCCCc Q lcl|NC_019524. 545 STSDNPNEE 553 (556) Q Consensus 545 ~~~~~~~~e 553 (556) .++++.+.| T Consensus 466 ~~~~~~e~~ 474 (474) T protein:vir:96 466 QQSENNQSK 474 (474) T ss_pred CCCCccccC Confidence 222222222 No 124 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=99.24 E-value=3.5e-10 Score=72.40 Aligned_cols=438 Identities=8% Similarity=-0.029 Sum_probs=202.4 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAA 80 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~ 80 (556) +.+........-.+....... ...+. .....|.-+.. .-+. ++...... .... ..++.. --.+++++ T Consensus 21 ~~~~~~~~~~~i~~~i~~~~~-~~~~~-~~l~~Yy~g~~--~i~~--~~~~~~~~----~~~~--~~~~~~-ki~~n~~k 87 (474) T protein:vir:95 21 MKPKVETQEEMIIRLINNHKQ-KLKDI-NVGQKYYDKDN--DINY--QAYKQDLH----GNID--YTKPDW-RITTNFHQ 87 (474) T ss_pred ccccccchHHHHHHHHHHHHH-HHHHH-HHHHHHhcccC--cccc--ccchhhhc----cccc--cccccc-ccccchHH Confidence 111111111111111111111 01111 11223332221 1110 01000000 0000 001110 01357999 Q ss_pred HHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCce Q lcl|NC_019524. 
81 GVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEV 160 (556) Q Consensus 81 ~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~ 160 (556) -+|+..+.+++|.+++..+. + +.....++.|..+ +|.....-+.+....-|.+ T Consensus 88 ~Iv~~~~~yl~g~p~~~~~~---------~-------~~~~~~l~~~~~n-----------~~~~~~~~l~~~~~~~G~~ 140 (474) T protein:vir:95 88 NLVDQKVSYVAGKPVTYAHD---------D-------DKVLDVIHQVLDT-----------RWDNKLIDILTAASNKGID 140 (474) T ss_pred HHHHhhhhhhcccCceeccC---------C-------hHHHHHHHHHHhc-----------cHHHHHHHHHHHHhhCCeE Confidence 99999999999998887542 1 1223344555421 4566666667888999999 Q ss_pred EEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccC--------- Q lcl|NC_019524. 161 LATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDM--------- 231 (556) Q Consensus 161 f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~--------- 231 (556) |.++ +....+ .+++..++|+.+---+.......+..+|++-..-....|.++..+---.+.. T Consensus 141 ~~~~-~~d~~~--------~~~i~~~~p~~~~~v~d~~~~~~~~a~ir~~~~~~~~~~~vy~~~~i~~~~~~~~~~~~~~ 211 (474) T protein:vir:95 141 WLQV-YINEDG--------ELKLFRVPAEQAIPIWTDKEREQLNAFIRIFTFNGETKVEYWTAETVTYYVYENGGLIPDF 211 (474) T ss_pred EEEe-eeCCCC--------ceEEEEEcccceEEEEcCCCCCceEEEEEEEeecCeeEEEEEeCCeEEEEEEcCCceeecc Confidence 9775 433221 2588899998885433333334466666542211112222222110000000 Q ss_pred -Cccccceeecc-ccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccc Q lcl|NC_019524. 232 -EQWKWGYEPAR-FDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVF 309 (556) Q Consensus 232 -~~~~~~rv~~~-~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~ 309 (556) ....+..+... ..++.==|+++... 
..|.|.|.+++..+..++....-......-.+.-..+++.-.++ T Consensus 212 ~~~~~~~~~~~~~~~~~~vPvv~~~nn-----~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~---- 282 (474) T protein:vir:95 212 YYGDEHIQTHFSTGSWERVPFIAFKNN-----PEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILRGYEGE---- 282 (474) T ss_pred ccccccccCcccccCCCccceEEecCC-----CCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcCCCcc---- Confidence 00000000000 00111113444332 46999999988877666643332222222111111222210000 Q ss_pred cccccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCH-- Q lcl|NC_019524. 310 GQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSY-- 387 (556) Q Consensus 310 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~y-- 387 (556) + .+.....+..+.+..+..|.++++++.+.+...+..+.+.+.+.|....++|- T Consensus 283 --------~----------------~~~~~~~~~~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~ 338 (474) T protein:vir:95 283 --------D----------------LSEFMEGLKYYKAINVSSDGGVETIQVEVPVASTKEYLDMMRAYIVEFGQGVDFQ 338 (474) T ss_pred --------c----------------ccchhhhhhccceeeccCCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcCcc Confidence 0 00111123445566778889999999999999999999999999999988772 Q ss_pred -HHhhchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCee Q lcl|NC_019524. 388 -EQFSRDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAE 466 (556) Q Consensus 388 -e~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~ 466 (556) +.++++ .|--+.|..+..........+..|- ..++.+++..+. ++..... ..-+.+. 
T Consensus 339 ~~~~~~n---~Sg~Alk~~~~~l~~k~~~~~~~~~-~~l~~~~~~i~~--~~g~~~d----------------~~~i~i~ 396 (474) T protein:vir:95 339 TDKFGSA---TSGIALKFLYTNLNLKANKLKNKAN-VALQELMQFILD--FNKIKLD----------------AKEIEIT 396 (474) T ss_pred ccccccc---cHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHH--HhCCCcc----------------cceeeEE Confidence 223332 2333455444444344444444433 333444443322 2221110 0123466 Q ss_pred eecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCC--CHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCC Q lcl|NC_019524. 467 WIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGG--DFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSE 544 (556) Q Consensus 467 w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~--D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~ 544 (556) |..+ .|.-+........++|+.|.+..+...+. |+++.++++++|++...+. +....+ ......+++ T Consensus 397 f~~~-----~p~~~~e~a~~~~~~giiS~et~~~~lp~v~D~~~E~eri~~E~~~~~~~-~~~~~~-----~~~~~~~~~ 465 (474) T protein:vir:95 397 FNFN-----VMVNDLEQSQIGAQSQYLSKETLVRHHPWVDDPKAELERLDEEQLELNKQ-LPNLDD-----GGADGAQQQ 465 (474) T ss_pred ecCC-----CccCHHHHHHHHHHcCCCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHhh-cccccc-----ccCCCCCCc Confidence 6433 33334444445677899999999999975 9999999999887654332 111000 011111222 Q ss_pred CCCCCCCCc Q lcl|NC_019524. 545 STSDNPNEE 553 (556) Q Consensus 545 ~~~~~~~~e 553 (556) .++++.+.| T Consensus 466 ~~~~~~e~~ 474 (474) T protein:vir:95 466 QQSENNQSK 474 (474) T ss_pred CCCCccccC Confidence 222222222 No 125 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=99.23 E-value=4e-10 Score=72.13 Aligned_cols=442 Identities=7% Similarity=-0.052 Sum_probs=195.3 Q ss_pred CCcc-hhhhHHHHHhhHhhcccchhhh-----h-----hhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHH Q lcl|NC_019524. 1 MKDV-KKTTRTRAKKAVDVVAETATAT-----P-----MAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARA 69 (556) Q Consensus 1 ~sp~-~~~~r~~a~~a~~~~~~~~~~~-----~-----~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~Ra 69 (556) |... 
.+..-.++..-+.+........ . ......|.-. .+...+.++....... ..........++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~---~~Yy~g~~~i~~~~~~-~~~~~~~~~~~~ 76 (474) T protein:vir:96 1 MIVIFWPNEKPYHERVVEQIKPKYETQEEMIIRLINDHKPKIDDITVG---ERYYNHDPDVLRLAPK-LDNKGEIDPLKP 76 (474) T ss_pred CeeeccCCCchhhhhHHHHhhhccCChHHHHHHHHHHHHHHHHHHHHH---HHHhccCCcchhccch-hccccccccccc Confidence 2211 1111112222221111110000 0 0000011100 0000011000000000 000000000011 Q ss_pred HHHHhcChHHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHH Q lcl|NC_019524. 70 QDMVQNDGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRL 149 (556) Q Consensus 70 Rdl~rNn~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l 149 (556) ..- -.+++++-+++..+.+++|.+++..+. +++.. ..++.|... +|...... T Consensus 77 ~~k-i~~n~~~~Ivd~~~~~l~g~p~~~~~~------------d~~~~----~~l~~~~~n-----------~~~~~~~~ 128 (474) T protein:vir:96 77 DWR-MFTNYHQNLVDQKVAYAVANPVTFSSD------------DDKSL----KTIQEVLNH-----------KWDDKLVD 128 (474) T ss_pred chh-cccchHHHHHHhhhhhhcccCceeecC------------chHHH----HHHHHHHhc-----------CHHHHHHH Confidence 100 125799999999999999998887542 11222 334444321 23333344 Q ss_pred HhhhheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecC----- Q lcl|NC_019524. 150 AVSGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAF----- 224 (556) Q Consensus 150 ~~r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~h----- 224 (556) +.+....-|.++.++. ....+ .+++..++|+.+-.-+.......+..+|.+=.......|+++... 
T Consensus 129 ~~~~~~~~G~~~~~~y-~d~~~--------~~~i~~~~p~~~~~v~d~~~~~~~~~~vr~~~~~~~~~~~~yt~~~v~~~ 199 (474) T protein:vir:96 129 ILTAASNKGIEWLQPY-IDENG--------EFKTFRVPAEQAIPIWTNKERDTLKAFIRYYRLDGAERVEYWTDSDVTYY 199 (474) T ss_pred HHHHHHhcCeeEEEEE-ecCCC--------ceEEEEEcccceEEEEcCCCCCceEEEEEEEeecCceEEEEEeCCeEEEE Confidence 4567788899997654 32221 257888999888533333333456666643211111112222111 Q ss_pred ---CCccccCCcccc---------ceee-ccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 225 ---PGDPTDMEQWKW---------GYEP-ARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVV 291 (556) Q Consensus 225 ---pgd~~~~~~~~~---------~rv~-~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i 291 (556) -+.......... ...+ .+..+| |+|+... ..|+|.|.+++..+..++....-......- T Consensus 200 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP---vv~~~nn-----~~g~sd~e~v~~liDa~d~~~S~~~~~~~~ 271 (474) T protein:vir:96 200 EYQDGILIPDYYHGEEHIQSHYYVGNKRVSWGRVP---FIPFKNN-----PQEMSDLFMYKTIIDAMDKRLSDTQNTFDE 271 (474) T ss_pred EecCCceeeccccccccccccccccccccCCCcee---EEEeccC-----CCCCCcHHHHHHHHHHHHHHHHHHHHHHHH Confidence 000000000000 0000 011122 4444332 358999999888876665544333322222 Q ss_pred hcceeeeEeccCcccccccccccccccccccccccccccccccccccceecCCceeeecC-CCceeeeecCCCCCccHHH Q lcl|NC_019524. 292 NATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLY-PGTKLKMQPAGTPGGVGTD 370 (556) Q Consensus 292 ~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~-pGe~i~~~~~~~p~~~f~~ 370 (556) .+.--.+++.-.++. . +.....+..+.+..|. .|-++++++.+.+...+.. T Consensus 272 ~~~~~lv~~g~~~~~---------~-------------------~~~~~~~~~~~~i~~~~~~~~~~~l~~~~~~~~~~~ 323 (474) T protein:vir:96 272 STELIYILKGYEGQD---------L-------------------DEFMRNLKYYKAINVDGDGSGVDTIQIEVPVQSSKE 323 (474) T ss_pred hccceeeeecCCccc---------c-------------------cchhhhhhcCceEEecCCCCceeEEeecCChHHHHH Confidence 222222232111000 0 0011124445555555 4678999999989999999 Q ss_pred HHHHHHHHHHHhcCCCHHHhhchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCccc Q lcl|NC_019524. 
371 YEQSLLRNIAASLGMSYEQFSRDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNW 450 (556) Q Consensus 371 F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~ 450 (556) +++.+.+.|....++|-...-+.-++.|--+.+..+..........+..|-. .++.+++..++ ++...... T Consensus 324 ~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~-~l~~~~~~i~~--~~~~~~~~------ 394 (474) T protein:vir:96 324 YLDMLRDYVIEFGQGVDFQQDKFGNSPSGIALKFMYSNLDLKANKLKNKTLT-ALQELLQYIID--FYKLNIKV------ 394 (474) T ss_pred HHHHHHHHHHHHhCCccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHH--HhCCCccc------ Confidence 9999999999999877322211111223334444443333333333333333 33444443332 22221110 Q ss_pred ccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCC--CHHHHHHHHHHHHHHHHHcCCCCC Q lcl|NC_019524. 451 RMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGG--DFREVFKQRAREEGLIKSLKLDFT 528 (556) Q Consensus 451 ~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~--D~e~v~~q~a~E~~~~~~~Gl~~~ 528 (556) .-+.+.|... .|.-+.......+++|+.|.+..+...+. |+++.++++.+|++...+. ++ T Consensus 395 ----------~~i~i~f~~~-----~p~~~~e~~~~~~~ag~iS~et~~~~~~~v~d~~~E~~ri~~E~~e~~~~-~~-- 456 (474) T protein:vir:96 395 ----------QDVEITFNFN-----VMVNELEQSQIGVQSQYLSKETVVTNHPWVDDPVAELERIEQDNIDFNKQ-LP-- 456 (474) T ss_pred ----------ceeeEEeccC-----CCcCHHHHHHHHHhcCCCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhc-cc-- Confidence 0124556333 23344455556778999999999999864 8999999999887654332 10 Q ss_pred ccccccCCCCCCCCCCCCCCCCCCcCC Q lcl|NC_019524. 529 GKMVEGNSTQSSNSSESTSDNPNEETT 555 (556) Q Consensus 529 ~~~~~~~~~~~~~~~~~~~~~~~~e~~ 555 (556) .. 
...+.+...+.++||+ T Consensus 457 --------~~-~~~~~~~~~d~~~e~~ 474 (474) T protein:vir:96 457 --------PL-EGDANGRAQDNESETN 474 (474) T ss_pred --------cc-ccccccccCCCcccCC Confidence 00 0111112222233444 No 126 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=99.22 E-value=4.5e-10 Score=71.81 Aligned_cols=436 Identities=11% Similarity=-0.001 Sum_probs=215.9 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAA 80 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~ 80 (556) |++-..+....-......... ...+... ...|.-+.. .-....+.. .. ..+. | + .+++++ T Consensus 11 ~p~d~~~~~~~l~~~i~~~~~-~~~r~~~-~~~yy~g~~-~i~~~~~~~-~~------~~~~-----k---i--~~n~~~ 70 (453) T protein:vir:39 11 FPKDEPITNEVVTKFMEKHRL-EVARYEY-LKNMYRGIM-AIDAEPTKD-LW------KPDN-----R---L--TVNFTK 70 (453) T ss_pred cCCCCCCCHHHHHHHHHHHHH-HHHHHHH-HHHHhhccC-chhcCCCcc-cc------Cccc-----e---e--ecchHH Confidence 333333222222222111100 0011111 122332221 111111100 00 0011 1 2 357999 Q ss_pred HHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCce Q lcl|NC_019524. 81 GVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEV 160 (556) Q Consensus 81 ~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~ 160 (556) -+|+..+.+++|.+++..+.. +...+.+...|+ . 
.+|...+..+.+..+.-|.+ T Consensus 71 ~ivd~~~~~l~g~~~~~~~~d------------~~~~~~l~~i~~---~-----------N~~~~~~~~~~~~~~~~G~~ 124 (453) T protein:vir:39 71 YIVDTFTGYFNGIPVKKSHSD------------KETLSKLQEFDN---L-----------NDMEDEESELAKMACIYGRA 124 (453) T ss_pred HHHHHHhhhhcccCceeccCC------------hHHHHHHHHHHH---h-----------cChhHHHHHHHHHHhhcCeE Confidence 999999999999998776421 122333333333 2 15777888888999999999 Q ss_pred EEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCC-CCeEEEEEeecCCCccccCCcccccee Q lcl|NC_019524. 161 LATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNN-GAALGYWLRKAFPGDPTDMEQWKWGYE 239 (556) Q Consensus 161 f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~-Gr~vaY~i~~~hpgd~~~~~~~~~~rv 239 (556) |+.+.. ... + .+++.+++|+.+-.-++......+..+|++... +...-+.++...---.+......|.-+ T Consensus 125 ~~~v~~-d~~-----g---~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~~yt~~~i~~~~~~~~~~~~~ 195 (453) T protein:vir:39 125 FELLYQ-NEE-----T---QTNVIYNTPENMFMVYDDTIKQEPLFAVRYGYDDDYKLYGEVYTKETTYALNGTMGFYNMT 195 (453) T ss_pred EEEEEe-cCC-----C---ceEEEEEcccceEEEecCCCCCeEEEEEEEEEeCCeEEEEEEEeCCeEEEEEecCCceeee Confidence 977543 221 1 257899999998654544445668888887543 333323333221100111111222211 Q ss_pred ec----cccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccc Q lcl|NC_019524. 240 PA----RFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMG 315 (556) Q Consensus 240 ~~----~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~ 315 (556) .. +..+| |+++.. ...|.|.|.+++..+..++....-......-.+.--.+++-...++. 
T Consensus 196 ~~~~~~~g~vP---vv~~~n-----~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~p~~~~~g~~~~~~-------- 259 (453) T protein:vir:39 196 EQAPNPFDDLP---VVEFYF-----NEERMSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFLGAAVEEE-------- 259 (453) T ss_pred cccccCCCcee---EEEecC-----CCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCCCch-------- Confidence 11 11222 455443 34699999987777765555443333333232222223321100000 Q ss_pred cccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhh Q lcl|NC_019524. 316 QGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYT 395 (556) Q Consensus 316 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s 395 (556) .. .. ........+. +. ....+|.++++++.+.+...+..+.+.+.+.|..-.++|-... ..++ T Consensus 260 ---~~---~~--------~~~~~~~~~~-~~-~~~~~~~~~~~lt~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~-~~~g 322 (453) T protein:vir:39 260 ---DL---KN--------IRSNRVINYY-GE-SSEAKNVDVKFLEKPDSDSQTENLLDRLTKLIFQTTMVANISD-ESFG 322 (453) T ss_pred ---hh---hh--------hhhcceeeec-CC-CCCCCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccc-cccc Confidence 00 00 0000111111 11 1134677889998888888999999999998888777774322 3455 Q ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCccccc Q lcl|NC_019524. 396 KTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQI 475 (556) Q Consensus 396 ~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~i 475 (556) ++|-.+++..+..........|..|-.. ++.+++..++..-..|.- . .+ .-+.+.|..+-. . 
T Consensus 323 n~Sg~Al~~~~~~l~~ka~~~~~~~~~~-l~~~~~li~~~~~~~~~~---~-----~~-------~~i~v~f~~~~p--~ 384 (453) T protein:vir:39 323 SSSGVSLAYKLQAMSNLALSFQRKFQSS-LNSRYKLYCELSTNVSNK---E-----AW-------KDIEYTFTRNEP--K 384 (453) T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhccCCc---c-----cc-------ccceEEeCCCCC--c Confidence 6666677777766666666666665443 455666554432222211 0 00 013567754332 3 Q ss_pred chhhhhHHHHHHHHcCCCCHHHHHHHhCC--CHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCc Q lcl|NC_019524. 476 DEKKETEAAILRIKNGLSTYEAEISRLGG--DFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEE 553 (556) Q Consensus 476 DP~Ke~~A~~~~i~~G~~s~~~~~ae~G~--D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e 553 (556) |-...+++.. ...|+.|.+..+...|. |+++.++++.+|++......... ....++..++.+++ T Consensus 385 ~~~~~a~~~~--kl~g~is~et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~------------~~~~~~~~~~~~~~ 450 (453) T protein:vir:39 385 DIKEQAETAN--ILMGITSQETALSVISVIPDVQAEMEKIKKEEASTAIFDKDK------------QPSEKGTDTVVPET 450 (453) T ss_pred CHHHHHHHHH--HHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhc------------cCCCCCCCCCCCCc Confidence 4444444433 34799999999999875 89999999999887554332110 00111111111111 Q ss_pred CCC Q lcl|NC_019524. 554 TTQ 556 (556) Q Consensus 554 ~~~ 556 (556) .+| T Consensus 451 ~~e 453 (453) T protein:vir:39 451 NEE 453 (453) T ss_pred CCC Confidence 111 No 127 >protein:vir:107662 Length: 427 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003893;genbank:gi:45686310;genbank:GeneID:2773002 Probab=99.22 E-value=1e-10 Score=75.35 Aligned_cols=410 Identities=9% Similarity=0.059 Sum_probs=194.7 Q ss_pred hhHHHHHhhHhhcccchhhhhhhhcchhc---cccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHH Q lcl|NC_019524. 7 TTRTRAKKAVDVVAETATATPMAVGGGME---GAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVV 83 (556) Q Consensus 7 ~~r~~a~~a~~~~~~~~~~~~~~~~~~y~---aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v 83 (556) ++... .-+|. +.+..+.. 
.+...+- ++- .--.+++-|++++++| T Consensus 1 ~~~~~-------------------~d~~~~~~~~~~~~~~----~~~~~~~------~~~----~l~a~Y~~~~l~~~~V 47 (427) T protein:vir:10 1 MKIVK-------------------HDGYNDIFNGGADGSP----KPFFMSD------ASY----HVGSFYNDNATAKRIV 47 (427) T ss_pred CCccc-------------------cchHHHHhhcCCCCcc----cCccccC------chH----HHHHHHHcCchhhhhh Confidence 11110 01111 11111111 1111111 111 2246799999999999 Q ss_pred HHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEE Q lcl|NC_019524. 84 AVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLAT 163 (556) Q Consensus 84 ~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~ 163 (556) +.+.+.++..|+.+... + + .+++++.|++. .+.....-+++....-|.+.+. T Consensus 48 d~~aed~~r~g~~i~g~---------~--~---~~~~~~~~~~l--------------~~~~~l~~a~~~~rl~G~a~i~ 99 (427) T protein:vir:10 48 DVIPEEMVTAGFKMSGV---------K--D---EKEFKSLWDSY--------------KLDSSLVDLLCWARLYGGAAMV 99 (427) T ss_pred ccchHHhhcCCccccCc---------c--H---HHHHHHHHHHh--------------hHHHHHHHHHHhccccceeEEE Confidence 99999999999887531 1 1 13455555532 2333334444444455666655 Q ss_pred EeeccCCCC----cCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCcccccee Q lcl|NC_019524. 164 CEWLNPTGT----TMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYE 239 (556) Q Consensus 164 ~~~~~~~~~----~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv 239 (556) +........ ...+... .|.+|++..|. |.. . . .-.-=..+|+|..|+|... ... T Consensus 100 i~v~d~~~l~~p~~~~g~l~--~l~v~d~~~~~-~~~-~----~-~dp~s~~fg~P~~y~v~~~---~~~---------- 157 (427) T protein:vir:10 100 AIIKDNRMLTSQAKPGAKLE--GVRVYDRFAIT-VEK-R----V-TNARSPRYGEPEIYKVSPG---DNM---------- 157 (427) T ss_pred EEecCCCccccccCCCccee--EEEEechhccc-ccc-c----c-cCccccccCcceEEEEecC---CCC---------- Confidence 544321110 0111112 45667776663 211 1 0 1111124699999998622 110 Q ss_pred eccccCChhHeEeeecc------cCCCcccCCchhh-HHHHHHHHHHHHHHHHHHHHHHhccee-eeEeccCcccccccc Q lcl|NC_019524. 
240 PARFDWGRRRVIHIIEA------LLAGQTRGISEMV-SALKQMKMTRNFQEITLQNAVVNATYA-ASVESELPSDVVFGQ 311 (556) Q Consensus 240 ~~~~~v~a~~viH~f~~------~r~gQ~RGvs~la-~~l~~l~~l~~~~dael~~a~i~A~~~-~fi~~~~~~~~~~~~ 311 (556) ....|+.++|||+-.. ......-|.|.|. ++...++ +|+.+.--.+.+.=.+. .++|.+.-.. . T Consensus 158 -~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~~~~~~~i~---~~~~~~~~~~~l~~k~~~~v~k~~~l~~----~ 229 (427) T protein:vir:10 158 -QPYLIHHSRVFIADGERVAQQARKQNQGWGASVLNKSLIDAIC---DYDYCESLATQILRRKQQAVWKVKGLAE----M 229 (427) T ss_pred -cceEEccccEEEecCCCchhhhcccCCcccchhhhHHHHHHHH---HHHHHHHHHHHHHHHhccccccchhHHH----H Confidence 1235788889987322 3344555787765 4444444 45544444444322211 2334321111 0 Q ss_pred cccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhh Q lcl|NC_019524. 312 LGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFS 391 (556) Q Consensus 312 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~ 391 (556) ...+..... .......-..... ..|.+.....+++++.++.+ -++..+........||++.+||.-.|. T Consensus 230 ~~~~~~~~~--------~~~r~~~~~~~~~-~~~~~~l~~~~e~~e~~~~~--lsgl~~~~~~~~~~iaaa~~IP~t~L~ 298 (427) T protein:vir:10 230 CDDDDAQYA--------ARLRLAQVDDNSG-VGRAIGIDAETEEYDVLNSD--ISGVPEFLSSKMDRIVSLSGIHEIIIK 298 (427) T ss_pred hcCccchHH--------HHHHHHHHHHhcC-cccceeeecCCCceeEEecc--cCChHHHHHHHHHHHHhhhCCCeeeec Confidence 000000000 0000000000001 13444555678999988744 357899999999999999999998887 Q ss_pred ch-hhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecC Q lcl|NC_019524. 392 RD-YTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGA 470 (556) Q Consensus 392 ~D-~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p 470 (556) |- .++.| |+.-.-+..+...++.+|+..+..++..++... +++..+ .+ . ++..|.+. 
T Consensus 299 G~sp~Gln-stgd~D~~nyyd~i~~~Qe~~l~p~l~~l~~~i----~~s~~~------~~--~---------f~pL~~~s 356 (427) T protein:vir:10 299 NKNVGGVS-ASQNTALETFYKLVDRKREEDYRPLLEFLLPFI----VDEEEW------SI--E---------FEPLSVPS 356 (427) T ss_pred cCCccccc-cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----hcCCCc------EE--E---------eCCCCCCC Confidence 74 22223 556777888889999999876555555444432 333211 11 0 12356666 Q ss_pred cccccch-hhhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCC Q lcl|NC_019524. 471 SRGQIDE-KKETEAAILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDN 549 (556) Q Consensus 471 ~~~~iDP-~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~ 549 (556) ..+.+|- .|-+++....+++|+.++.++. +.+... ...-|+.........-.......++..++. T Consensus 357 ~kEkaei~~~~a~a~~~~~~~gvi~~~e~r-----------~~L~~~---~~~~~~~~~~~~~~e~~~~~~e~~p~~~e~ 422 (427) T protein:vir:10 357 KKEESEITKNNVESVTKAITEQIIDLEEAR-----------DTLRSI---APEFKLKDGNNINIREPEETTEPEPGLGEK 422 (427) T ss_pred HHHHHHHHHHHHHHHHHHHhcCCCCHHHHH-----------HHHHhh---hccccCCCCccccccccchhcCCCCCCCCC Confidence 6665544 5677888888888887766653 222111 122333211100000000011111111111 Q ss_pred CCCcC Q lcl|NC_019524. 550 PNEET 554 (556) Q Consensus 550 ~~~e~ 554 (556) .++|+ T Consensus 423 ~~d~~ 427 (427) T protein:vir:10 423 LEDEN 427 (427) T ss_pred CCCCC Confidence 11111 No 128 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=99.21 E-value=5.2e-10 Score=71.48 Aligned_cols=450 Identities=9% Similarity=-0.017 Sum_probs=208.7 Q ss_pred CCcch--hhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChH Q lcl|NC_019524. 1 MKDVK--KTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGY 78 (556) Q Consensus 1 ~sp~~--~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~ 78 (556) ..+.. .+... 
+...-.....+.. ....|.-+.. ..+..+... .+. ...+. | + -+++ T Consensus 37 ~~~~~~~~i~~~-----i~~~~~~~~~r~~-~l~~Yy~g~~--~il~~~~~~---~~~-~~~~~-----k---i--~~n~ 94 (511) T protein:vir:93 37 DLLQNVNEVSKY-----IEHHMDYQRPRLK-VLSDYYEGKT--KNLVELTRR---KEE-YMADN-----R---V--AHDY 94 (511) T ss_pred hhhccHHHHHHH-----HHHHHHhhHHHHH-HHHHHhcccC--ccccccCcC---ccc-ccCcc-----e---e--ecch Confidence 11111 11111 0000000001111 1123433321 111111110 000 00111 1 1 2589 Q ss_pred HHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecC Q lcl|NC_019524. 79 AAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTG 158 (556) Q Consensus 79 a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dG 158 (556) ++-+++..+.+++|.+++.... + +.....|+.|... .+|...+..+.+....-| T Consensus 95 ~k~Iv~~~~~yl~g~p~~~~~~---------d-------~~~~~~l~~~~~~----------n~~~~~~~~~~~~~~~~G 148 (511) T protein:vir:93 95 ASYISDFINGYFLGNPIQYQDD---------D-------KDVLEVIEAFNDL----------NDVESHNRSLGLDLSIYG 148 (511) T ss_pred HHHHHHHHhhhhcccCeeeccC---------C-------hHHHHHHHHHHhh----------cCHhHHHHHHHHHHHhcC Confidence 9999999999999998887532 1 2233445555543 257777888888999999 Q ss_pred ceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECC--------CCCeEEEEEeecCCCcccc Q lcl|NC_019524. 159 EVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDN--------NGAALGYWLRKAFPGDPTD 230 (556) Q Consensus 159 E~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~--------~Gr~vaY~i~~~hpgd~~~ 230 (556) .++..+.... .+ .+++.+++|..+-.-++......++.+|.+-. .+...-+.|+...---.+. 
T Consensus 149 ~ay~~vy~de-~~--------~~~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~iyt~~~i~~~~ 219 (511) T protein:vir:93 149 KAYELMIRNQ-DD--------ETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYL 219 (511) T ss_pred eeEEEEEeCC-CC--------ceEEEEEccceeEEEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEE Confidence 9998764322 11 25788899988753333333345667776521 1222223333322100000 Q ss_pred CCc--------cccceee-ccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEec Q lcl|NC_019524. 231 MEQ--------WKWGYEP-ARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVES 301 (556) Q Consensus 231 ~~~--------~~~~rv~-~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~ 301 (556) ..+ ......| .+..|| |+++... .+|.|.|.+++..+..++....-......-.+.-..+++- T Consensus 220 ~~~~~~~~~~~~~~~~~~~~~g~vP---vv~~~nn-----~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G 291 (511) T protein:vir:93 220 TSRTNGLKLTPRENGFESHSFERMP---ITEFSNN-----ERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKG 291 (511) T ss_pred ecCCCccccccccccccccCCCccc---eEEecCC-----CCCCCchhhHHHHHHHHHHHHHHHHHHHHHhhCcceeeec Confidence 000 0000011 112233 4554432 3689999998887765554333222222222222223332 Q ss_pred cCcccccccccccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHH Q lcl|NC_019524. 302 ELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAA 381 (556) Q Consensus 302 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaa 381 (556) ..+.... +....... ............+....+.+|-++++++.+.+...+..+.+.+.+.|.. 
T Consensus 292 ~~~~~~~---------~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~L~~~I~~ 355 (511) T protein:vir:93 292 NLNLDPV---------EVRKQKEA-------NVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHM 355 (511) T ss_pred CcccCch---------hhcccccc-------cceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHH Confidence 1111000 00000000 0000000011122333466788999999998899999999999999888 Q ss_pred hcCCC---HHHhhchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhh Q lcl|NC_019524. 382 SLGMS---YEQFSRDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMM 458 (556) Q Consensus 382 glGi~---ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~ 458 (556) -.++| .+.++++. |--+++..+..........+..|-.. ++..++..+...-..+.+..+... T Consensus 356 ~s~~P~~~~~~~~~n~---Sg~Al~~~~~~l~~k~~~k~~~f~~~-l~~~~~li~~~l~~~~~~~~~~d~---------- 421 (511) T protein:vir:93 356 FTNTPNMKDDNFSGTQ---SGEAMKYKLFGLEQRTKTKEGLFTKG-LRRRAKLLETILKNTWSIDANKDF---------- 421 (511) T ss_pred HhCCcccccccccccc---hHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhccCccccccc---------- Confidence 88877 23344443 33356655555555555555555443 344555444332223333221110 Q ss_pred HHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCC--CHHHHHHHHHHHHHHHHHcCC-CCCccccccC Q lcl|NC_019524. 459 RDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGG--DFREVFKQRAREEGLIKSLKL-DFTGKMVEGN 535 (556) Q Consensus 459 ~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~--D~e~v~~q~a~E~~~~~~~Gl-~~~~~~~~~~ 535 (556) .-+.+.|..+-. .|-...+++..++ .|+.|.+..+...+. |+++.++++.+|++...+.-. ....++ T Consensus 422 --~~i~~~f~~~~p--~n~~e~~~~~~kl--~g~iS~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~---- 491 (511) T protein:vir:93 422 --NTVRYVYNRNLP--KSLIEELKAYIDS--GGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDP---- 491 (511) T ss_pred --ccceEEeCCCCC--CCHHHHHHHHHHH--hccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhhcccCC---- Confidence 013556743222 5766666665554 699999999988864 899999999998764433211 110110 Q ss_pred CCCCCCCCCCCCCCCCCcCC Q lcl|NC_019524. 
536 STQSSNSSESTSDNPNEETT 555 (556) Q Consensus 536 ~~~~~~~~~~~~~~~~~e~~ 555 (556) .......+++++++..++.+ T Consensus 492 ~~~~~~~~~~~~~~~~~~~~ 511 (511) T protein:vir:93 492 RDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred CCCCCCCCCCcccccccccC Confidence 11111111111111111111 No 129 >protein:vir:93867 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764264;genbank:gi:115315577;genbank:GeneID:5141561 Probab=99.21 E-value=2.5e-12 Score=84.21 Aligned_cols=378 Identities=8% Similarity=-0.027 Sum_probs=186.4 Q ss_pred chhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHhhhccCCceeeeeccccccCCChh Q lcl|NC_019524. 32 GGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDG 111 (556) Q Consensus 32 ~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~ 111 (556) .++.+....-. ..+.+.....-. +. .......|.+++.++|+.|++.|-..-|++.-+.+.. +..+. T Consensus 1 Mg~f~~~~~f~--~~~~~~~~~~~~----~~-----~~~~~~~~~~~v~~~i~~Ia~~iA~lp~~~~~~~~~~--~~~~~ 67 (378) T protein:vir:93 1 MNLFGKVVSFS--RGKLNNDTQRVT----AW-----QNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSD--VGSDT 67 (378) T ss_pred Cccchhhhhhh--ccccCCCcceee----ec-----ccchhHHHHHHHHHHHHHHHhhhhhCceeeEEEcccc--ccccc Confidence 01111110000 001111100000 00 0112345778899999999999988777654332210 00000 Q ss_pred HHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhc Q lcl|NC_019524. 112 WGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRM 191 (556) Q Consensus 112 ~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl 191 (556) ...... ..++..+-.+|+ -.+|.+++-...+..++..|++|+.+.+... 
T Consensus 68 ~~~~~~---~~l~~lL~~~PN------~~~t~~~f~~~~~~~lll~Gn~~i~~~~~~~---------------------- 116 (378) T protein:vir:93 68 LISMAG---SDLDEVLNWSPK------GERNSMDFWRKVIKKLLRAPYVDLYAVFDDN---------------------- 116 (378) T ss_pred cccccc---chHHHHHhhcCC------CCCCHHHHHHHHHHHHhhcCceEEEEEeecC---------------------- Confidence 000001 122233333554 2568999999999999999999987643221 Q ss_pred CCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceeeccccCChhHeEeeecccCCCcccCCchhhHH Q lcl|NC_019524. 192 SNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPARFDWGRRRVIHIIEALLAGQTRGISEMVSA 271 (556) Q Consensus 192 ~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~ 271 (556) .|+++.++. .+. ...++.++|||+..+. +-..|++.+..+ T Consensus 117 --------------------~g~~~~l~~-----~~~-------------~~~~~~~diih~r~~~--~~~~~~s~l~~~ 156 (378) T protein:vir:93 117 --------------------TGELLDLLF-----ADD-------------KKEYKTEELVRLTSPF--YINEDTSILDNA 156 (378) T ss_pred --------------------CceEEEEEe-----cCC-------------eeEeccceeEEecCcc--ccchhhHHHHHH Confidence 122222111 000 1135667999997653 334466665443 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccccccccccccccccccceecCCceeeecC Q lcl|NC_019524. 272 LKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLY 351 (556) Q Consensus 272 l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~ 351 (556) +.. +..+.-++...++++.+..-... . .....+.+... ..+.. ..-..|.+..|. T Consensus 157 ~~~-----------i~~~~~~~~~~g~l~~~~~l~~~--~----~~~~~~~~~~~-------~~~~~-~~~~~~~~~~l~ 211 (378) T protein:vir:93 157 LAS-----------IQTKLEQGKLRGLLKINAFLDID--N----TQEYREKALTT-------IKNMQ-EGSSYNGLTPVD 211 (378) T ss_pred HHH-----------HHHHHhcCcccceeeeCCcCCHH--H----HHHHHHHHHHH-------HHHhh-cccccccceEcC Confidence 322 22333355666777754321000 0 00000000000 00000 011356788999 Q ss_pred CCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 
352 PGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTL 431 (556) Q Consensus 352 pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~ 431 (556) .|.+++.++.+....++ ...+.+.++||..+|||...|.+. ||. + ....++..-+.|+... T Consensus 212 ~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fgVPp~~l~g~-----~~e--~-----------~~~~f~~~tl~P~~~~ 272 (378) T protein:vir:93 212 NKTEIVELKKDYSVLNK-DEIDLIKSELLTGYFMNENILLGT-----ATQ--E-----------QQIYFYNSTIIPLLIQ 272 (378) T ss_pred CCceEEEccCChhhhhH-HHHHHHHHHHHHHhCCCHHHhcCC-----cHH--H-----------HHHHHHHHHHHHHHHH Confidence 99999999877666666 556788999999999999887442 221 1 1123455666776665 Q ss_pred HHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHH Q lcl|NC_019524. 432 WLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGGDFREVFK 511 (556) Q Consensus 432 ~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~ 511 (556) ++.++.+..++-..... .. .......+.+-.......|+...+++....+.+|+.|+-|+-+..|..|-+--+ T Consensus 273 -ie~~l~~kLl~~~er~~-~~-----~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~ggD 345 (378) T protein:vir:93 273 -LEKELTYKLISTNRRRV-VK-----GNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGGD 345 (378) T ss_pred -HHHHHHhhcCChhHhhh-hh-----hcccccceeeccchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC Confidence 44455555443210000 00 000011234445556667999999999999999999999998888988764321 Q ss_pred HHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 512 QRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 512 q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 556 (556) . +=++....+... 
.............++|+++| T Consensus 346 ~----------~~~~~n~~~~~~--~~~~~~~~~~~~~~~e~~n~ 378 (378) T protein:vir:93 346 V----------YIANLNAVAVKN--LSDLQGSRKDVTSTDETNNQ 378 (378) T ss_pred e----------eeeccccccccc--hhhhcCccCCCCCCCCCCCC Confidence 1 111111111110 11111111122223334444 No 130 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=99.20 E-value=6e-10 Score=71.13 Aligned_cols=439 Identities=9% Similarity=-0.009 Sum_probs=208.0 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAA 80 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~ 80 (556) |.|..-. ..+.........+......-|.|-. .... .+............. | -.+++++ T Consensus 30 ~~~~~i~------~~i~~~~~~~~~~~~~~~~yY~g~~-~~i~----~~~~~~~~~~~~~~~-----k-----i~~n~~~ 88 (481) T protein:vir:10 30 LKEENLR------NFISRHQTEQVPRLEMLESYYLNRN-TDIL----AGERRLQKYGDKADH-----R-----AVHNYAK 88 (481) T ss_pred cCHHHHH------HHHHHHHHHHHHHHHHHHHHhcCCC-cccc----cCccccccccccccc-----e-----eecchHH Confidence 4442211 1111000000001111112244321 1110 000000000000000 1 1478999 Q ss_pred HHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCce Q lcl|NC_019524. 81 GVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEV 160 (556) Q Consensus 81 ~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~ 160 (556) -+|+..+.+++|.+++..+. ++...+.+...|+ . 
.+|...+..+.+..+.-|.+ T Consensus 89 ~ivd~~~~~l~g~~~~~~~~------------d~~~~~~l~~~~~----~----------n~~~~~~~~~~~~~~~~G~~ 142 (481) T protein:vir:10 89 YVSRFIVGYLTGNPITITHQ------------DNQTNDKIIELND----L----------NDADEVNSDLALNLSIYGRA 142 (481) T ss_pred HHHHHHHhhhccCCceEecC------------ChhHHHHHHHHHH----h----------cChhHHHHHHHHHHHhcCeE Confidence 99999999999988876642 1122333333332 1 14777888888999999999 Q ss_pred EEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEE----CCCCCeE-EEEEeecCCCccccCCccc Q lcl|NC_019524. 161 LATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQL----DNNGAAL-GYWLRKAFPGDPTDMEQWK 235 (556) Q Consensus 161 f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~----d~~Gr~v-aY~i~~~hpgd~~~~~~~~ 235 (556) |+.+.. ... + -+++..++|+.+---++......+..+|.+ |..+..+ -+.++...---.+...+.. T Consensus 143 ~~~~~~-d~d-----g---~~~i~~~~p~~~~~v~d~~~~~~~~~~i~~~~~~~~~~~~~~~~~~y~~~~i~~~~~~~~~ 213 (481) T protein:vir:10 143 YEIVYR-DFE-----D---RDTFKVLDPKSTFVVYDQTLDKKVVAGVRYFEKQDKDKVPVQHVEVYTTDKIYYIEIKGGT 213 (481) T ss_pred EEEEEe-CCC-----C---eEEEEEEcccceEEEEcCCCCCceEEEEEEEEEeeCCCceEEEEEEEecCeEEEEEecCCc Confidence 977543 221 1 257899999888432333333456667643 3333333 2233322111111112233 Q ss_pred cceeec----cccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccc Q lcl|NC_019524. 236 WGYEPA----RFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQ 311 (556) Q Consensus 236 ~~rv~~----~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~ 311 (556) |..+.. ...|| |+|+... .+|+|.+.+++..+..++....-......-.+.-..+++-...... 
T Consensus 214 ~~~~~~~~~~~g~vP---vv~~~n~-----~~g~~~~~~v~~lida~~~~~s~~~~~~~~~~~~~~~~~g~~~~~~---- 281 (481) T protein:vir:10 214 YHRVEEVEHYYNDVP---IIEYLND-----QFKQGDFENVIALIDLYDSAQSDTANYMTDLNDAMLAIIGNVDLDS---- 281 (481) T ss_pred eeecccccccCCcee---EEEeecC-----CCCCCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCcCCCc---- Confidence 332211 12344 6665543 3689999988777776766543333222222222233331111000 Q ss_pred cccccccccccccccccccccccccccceecCCceee-ecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHh Q lcl|NC_019524. 312 LGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIP-HLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQF 390 (556) Q Consensus 312 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~-~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l 390 (556) +... .........+.++... ....+-++++++.+.+...+..+++.+.+.|....++|-. . T Consensus 282 ------~~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~-~ 343 (481) T protein:vir:10 282 ------EDAK-----------AFRDANMIHLEPGTNANGSEGKAEVKYVYKQYDVAGVEAYKKRLQNDIHKYTNTPDL-N 343 (481) T ss_pred ------cchh-----------hhhhccceeccccccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccc-c Confidence 0000 0000011112222211 1233557888888888899999999998888888887732 1 Q ss_pred hchhhc-ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeec Q lcl|NC_019524. 391 SRDYTK-TNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIG 469 (556) Q Consensus 391 ~~D~s~-~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~ 469 (556) .+.+++ .|-.+++..+..........|..|-..+ +.+++..+..+-..+.-.. ....+.+.|.. 
T Consensus 344 ~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l-~~~~~li~~~~~~~~~~~~--------------~~~~i~v~f~~ 408 (481) T protein:vir:10 344 DEQFSGVQSGESMKYKLFGLEQVRAIKERLFKKGL-MKRYKLLLNNVNLTGLKQH--------------NYAELTITFTP 408 (481) T ss_pred ccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhccCCCcc--------------ccceeeEEeCC Confidence 223322 2223444444444444444554444443 3344443333212221100 01235678854 Q ss_pred CcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCC--CHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCC Q lcl|NC_019524. 470 ASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGG--DFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTS 547 (556) Q Consensus 470 p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~--D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~ 547 (556) +- ..|-...+++..++ .|+.|.+..+...+. |+++-++++.+|++...+.-.... -+......++. T Consensus 409 ~~--~~~~~~~a~~~~kl--~g~is~et~~~~l~~i~d~~~E~~ri~~E~~~~~~~~~~~~--------~~~~~~~~~~~ 476 (481) T protein:vir:10 409 NL--PKSMMESINAFNAL--SGGVSESTRLSLLDFIDNPKEELEKMQEEEAQREKQADKRG--------YGEAFENHLNV 476 (481) T ss_pred CC--CcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhhhhcc--------CCccCCCCCCC Confidence 43 34666666666554 588999999998875 899999999888866554322110 00111112222 Q ss_pred CCCCC Q lcl|NC_019524. 548 DNPNE 552 (556) Q Consensus 548 ~~~~~ 552 (556) |++++ T Consensus 477 dd~~g 481 (481) T protein:vir:10 477 DDSNG 481 (481) T ss_pred CCCCC Confidence 22222 No 131 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=99.20 E-value=6.6e-10 Score=70.90 Aligned_cols=453 Identities=8% Similarity=0.015 Sum_probs=208.5 Q ss_pred CCcchhhhHHHHHhhHhhc-ccchhhhhhhhcch-hccccCCCcccccccCCC--CCHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVV-AETATATPMAVGGG-MEGAERTTREMFQWNPSI--ISPDQQIAQNQDMASARAQDMVQND 76 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~-~~~~~~~~~~~~~~-y~aa~~~~r~~~~w~~~~--~s~~~~i~~~~~~lr~RaRdl~rNn 76 (556) |.+..-. .-+... ... 
..+- ..... |+|-. . ..+..+.... .-++ . -..+ T Consensus 22 l~~~~i~------~li~~~~~~~-~~r~-~~l~~YY~g~~-~-~i~~~~~~~~~~~~~~------~----------ki~~ 75 (506) T protein:vir:94 22 LTPNKIM------KFITHHFNYQ-RPRL-EMLDDYYQGYN-L-KILDKQSRRHEDGKAD------H----------RATH 75 (506) T ss_pred CCHHHHH------HHHHHHHHHH-HHHH-HHHHHHhcCCC-c-cccccccccccccCCc------c----------eeec Confidence 2221100 000000 000 0011 11123 44322 1 1111111110 1011 0 0246 Q ss_pred hHHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhhee Q lcl|NC_019524. 77 GYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLM 156 (556) Q Consensus 77 ~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~ 156 (556) ++++-+|+..+.+++|.+++..+. ++ .....|+.|.+. .+|...+..+.+..+. T Consensus 76 n~~~~Iv~~~~~~l~G~p~~~~~~---------d~-------~~~~~l~~~~~~----------N~~~~~~~~~~~~~~~ 129 (506) T protein:vir:94 76 SFAKYIADFQTSYSVGNPINVKLP---------DD-------GSNSGFDTFNKA----------NDVDAENYDLFLDMSR 129 (506) T ss_pred chHHHHHHHhhhhhcccCceeecC---------cc-------hHHHHHHHHHhc----------cCHhHHHHHHHHHHHh Confidence 899999999999999998876542 11 122345555543 2577778888888999 Q ss_pred cCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEE----CCCCCeEE---EE--EeecCCCc Q lcl|NC_019524. 157 TGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQL----DNNGAALG---YW--LRKAFPGD 227 (556) Q Consensus 157 dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~----d~~Gr~va---Y~--i~~~hpgd 227 (556) -|.+|+++.+.. .+ .+++.+++|..+---+.......+..+|.+ +..+..+. 
|| ++...--- T Consensus 130 ~G~a~~~v~~de-d~--------~~~i~~~~p~~~~~v~dd~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~yt~~~~~ 200 (506) T protein:vir:94 130 YGRAYEYVYRGE-DN--------EEHLAKLDPLDTFVIYSTDVDPKPIMAVRYHQIELVDDNQVSTINYVPETWTADTYT 200 (506) T ss_pred cCeEEEEEEecC-CC--------eeEEEEEcccceEEEecCCCCCceEEEEEEEeeeeccCCceeEEEEEEEEEeCceEE Confidence 999998764422 11 257888999888543333333446677763 22332222 11 11111000 Q ss_pred cccCCccccceee----ccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhc--c---eeee Q lcl|NC_019524. 228 PTDMEQWKWGYEP----ARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNA--T---YAAS 298 (556) Q Consensus 228 ~~~~~~~~~~rv~----~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A--~---~~~f 298 (556) .+......|.-+. .+..+| |+++... ..|+|.|.+++..+..++....-......-.+ . ++.. T Consensus 201 ~~~~~~~~~~~~~~~~~~~g~vP---vv~~~n~-----~~~~sd~e~~~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~ 272 (506) T protein:vir:94 201 LYNPTPIMGKMQVDTTKPITTFP---VVEFKNS-----NFRLGDFENVLPLIDLYDAAQSDTANYMTDLNEAMLIIQGDI 272 (506) T ss_pred EeccccCccceeccccccCCccc---eEEecCC-----CCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhHHHHHhcCc Confidence 0000111111110 011122 3444332 36899999988776555443222222111111 1 1110 Q ss_pred EeccCcccccccccccccccccccccccccccccccccccceecCCce-eeecCCCceeeeecCCCCCccHHHHHHHHHH Q lcl|NC_019524. 299 VESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAK-IPHLYPGTKLKMQPAGTPGGVGTDYEQSLLR 377 (556) Q Consensus 299 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~-i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr 377 (556) .....+... .........+.......................+.++. 
+.....+-++++++.+-+...+..+++.+.+ T Consensus 273 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~ 351 (506) T protein:vir:94 273 DTLFEGSDM-MNTIDPNDEDAMAKLAKDKLELIKEMKDANMLLLKSGMTVNGTQTSVDAKYINKTYDVVGSEAYKKRVAG 351 (506) T ss_pred cccccchhc-cccccccccccccccccchhHHHhhhhhcCeeeecccccccCccccccceeeeecCCHHHHHHHHHHHHH Confidence 000000000 00000000000000000000000011112223333332 3344566789999988899999999999999 Q ss_pred HHHHhcCCCH---HHhhchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-cCCccCCCCcccccc Q lcl|NC_019524. 378 NIAASLGMSY---EQFSRDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVN-AGNVPLPPGKNWRMF 453 (556) Q Consensus 378 ~iaaglGi~y---e~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l-~G~l~~p~~~~~~~~ 453 (556) .|....++|- +.++++.| --+.+..+..........+..|-.. ++.+++..+...-. .+.... . T Consensus 352 ~I~~~s~~p~~~~~~~~~n~S---g~Aik~~~~~l~~k~~~k~~~~~~~-l~~~~~li~~~~~~~~~~~~~-d------- 419 (506) T protein:vir:94 352 DIHKFSHTPDLTDENFASNSS---GVAMQYKVLGTVELASTKRRMFERG-LYARYQIISDIENSIHGDWTF-D------- 419 (506) T ss_pred HHHHHhCccccccccccccch---HHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhcCCcccc-c------- Confidence 9998888773 34444443 3356655555555555555555443 34444443332221 221110 0 Q ss_pred cchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCC--CHHHHHHHHHHHHHHHHHcCCCCCccc Q lcl|NC_019524. 454 YDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGG--DFREVFKQRAREEGLIKSLKLDFTGKM 531 (556) Q Consensus 454 ~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~--D~e~v~~q~a~E~~~~~~~Gl~~~~~~ 531 (556) ..-+.+.|..+-. .|....+++..++ .|+.|.+..+...+. |+++.++++++|++...+. .. 
T Consensus 420 ------~~~i~i~f~~~~p--~d~~e~a~~~~kl--~g~iS~et~~~~lp~v~d~~~E~~ri~~E~~~~~~~-~~----- 483 (506) T protein:vir:94 420 ------PQELTFTFRDNLP--ADNISQIKALVQA--GATLPQKYLYQQLPGVTNPQDIVDMMKEQSANGDYS-FD----- 483 (506) T ss_pred ------cccceEEeCCCCC--cCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHhhc-ch----- Confidence 0113566744333 4767666665554 699999999999854 7999999999988654332 11 Q ss_pred cccCCCCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 532 VEGNSTQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 532 ~~~~~~~~~~~~~~~~~~~~~e~~~ 556 (556) .....+.. .+.++.++++++ T Consensus 484 ----~~~~~~~~-~~~~~~~~~~~~ 503 (506) T protein:vir:94 484 ----QNGVISND-GQTNTTATQTDE 503 (506) T ss_pred ----hhcCCCcc-cCcccccccccc Confidence 11111122 222223333333 No 132 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=99.19 E-value=4.9e-10 Score=71.64 Aligned_cols=403 Identities=11% Similarity=-0.025 Sum_probs=209.9 Q ss_pred hhhhhhhhcchhccc--cCCCcccccccCCCCCHHHHHHH-HHHHHHHHHHHHHhcChHHHHHHHHHHhhhccCCceeee Q lcl|NC_019524. 23 ATATPMAVGGGMEGA--ERTTREMFQWNPSIISPDQQIAQ-NQDMASARAQDMVQNDGYAAGVVAVHRDSIVGSQYKLNA 99 (556) Q Consensus 23 ~~~~~~~~~~~y~aa--~~~~r~~~~w~~~~~s~~~~i~~-~~~~lr~RaRdl~rNn~~a~~~v~~~~~nvVG~Gi~~~~ 99 (556) +...........-.. .+..+ ...|.-...... .+.. .-+.++...+ ++ .+|.+-+|+.+++..+=.||+. T Consensus 1 m~~~~i~~L~~~~~~~~~r~~~-~~~yy~g~~~~~-~~~~~~p~~~~~~~~-~v--~nw~~~~Vd~~a~rl~~~Gf~~-- 73 (422) T protein:vir:97 1 MNYMGMGYLRRKLALFKTGVDK-RYRYYAMDDRDD-TRSIVMPNNVREMYR-SV--LEWTAKGVDSLADRIIFREFTN-- 73 (422) T ss_pred CChHHHHHHHHHHHHHHHHHHH-HHHHHhcCCChh-hcCccccHHHHHHHH-hh--cchhHHHHHHHHhccccceeeC-- Confidence 111111111111111 12221 233332222111 1111 1233433333 33 3678889999988777778753 Q ss_pred eccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEEEeeccCCCCcCCCccc Q lcl|NC_019524. 
100 KPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLATCEWLNPTGTTMQRRPF 179 (556) Q Consensus 100 ~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~~~~~~~~~ 179 (556) ++ . .+|+-|.. .+|...|..+.+..++-|.+|+..-.... ++ . T Consensus 74 -~d-----------~-------~l~~~w~~-----------N~ld~~~~~~~~~al~~G~sf~~v~~~~~-----~~--~ 116 (422) T protein:vir:97 74 -DD-----------F-------NAWEIFKA-----------NNPDIFFDTAIQSALIASCCFVYIMPGAE-----DG--L 116 (422) T ss_pred -Cc-----------h-------hHHHHHHh-----------cChHHHHHHHHHHHHHhcceeEEEeeCCC-----CC--e Confidence 11 1 24555543 25778888999999999999988632211 12 2 Q ss_pred ceEEEEEchhhcCCCCCCCCCceEEEEEE---ECCCCCeEEEEEeecCCCccccCCccccceeeccccCChhHeEeeecc Q lcl|NC_019524. 180 GTAIQMISPYRMSNPNNVMDTPNLRSGVQ---LDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPARFDWGRRRVIHIIEA 256 (556) Q Consensus 180 ~l~lq~ie~drl~~~~~~~~g~~i~~GIE---~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~~~~v~a~~viH~f~~ 256 (556) | .+++++|..+---++ +..+.+..++. .|..|.+.+.+++..+-- .+......|.++|- ..+.-=|+|+... T Consensus 117 p-~i~~~sp~~~~~i~D-~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~--~~g~vPvv~~~n~ 191 (422) T protein:vir:97 117 P-KMQVIEASKATGILD-PTTFLLTEGYAILESDSNGNPTLEAYFTDKDI-WYYPKKGKPYNIKN--PTGHPLLVPIIHR 191 (422) T ss_pred e-EEEEechhhEEEEEe-CCCCcceeeEEEEEecCCCcEEEEEEEcCceE-EEEcCCCccccccC--CCCCcceEEeccc Confidence 2 589999998864443 22345555554 567788766444433211 12223334444432 3344458888888 Q ss_pred cCCCcccCCchhh-HHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccccccccccccccc Q lcl|NC_019524. 257 LLAGQTRGISEMV-SALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVA 335 (556) Q Consensus 257 ~r~gQ~RGvs~la-~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 335 (556) .+.+-.=|.|.++ |+|..+..+++-.--.+..+...|.=-.+|+--++ .+ . 
T Consensus 192 ~~~~~~~G~s~I~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~------------d~----------------~ 243 (422) T protein:vir:97 192 PDAVRPFGRSRITKAGMYHQKAAKRTLERAEVTAEFYSFPQKYVLGMDP------------DA----------------K 243 (422) T ss_pred CCCccccCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcccCc------------cc----------------c Confidence 8888778888875 45555544444333333333322221122211000 00 0 Q ss_pred cccceecCCceeeecCCC---ceeeeecCCCCC-ccHHHHHHHHHHHHHHhcCCCHHHhhchhhc-ccchhHHHHHHHHH Q lcl|NC_019524. 336 QTKNIAIDGAKIPHLYPG---TKLKMQPAGTPG-GVGTDYEQSLLRNIAASLGMSYEQFSRDYTK-TNYSSARASMAETQ 410 (556) Q Consensus 336 ~~~~~~l~pG~i~~L~pG---e~i~~~~~~~p~-~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~-~nYSs~R~~~~e~~ 410 (556) .........|.|..+++. +.+++.+.+..+ .+|.+-++..++.+|+-.++|-+.|.++..+ +|=-++++.+.... T Consensus 244 ~~~~~~~~~~~i~~~~~de~~~~~~v~q~~~~~l~~~~~~l~~~~~~~a~~s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~ 323 (422) T protein:vir:97 244 PMEKWRATVSTLLEISKDEDGDKPTVGQFTTASMAPFMEHLKMYASLFAGGSGLTLDDLGFPSDNPSSVESIKAAHENLR 323 (422) T ss_pred cCchhhhhhhhhhccCCCCCCCcceeeecCCCChhHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHH Confidence 011112345677777654 445553333222 4688888999999999999999988776542 33446777777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHh--hCeeeecCcccccch---hhhhHHHH Q lcl|NC_019524. 411 KYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDAL--CNAEWIGASRGQIDE---KKETEAAI 485 (556) Q Consensus 411 r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~--~~~~w~~p~~~~iDP---~Ke~~A~~ 485 (556) +..+..|..|-..+.+ +++.. .++..+.-..+ ..| +.+.|. |..+ .|. ...+.|.. T Consensus 324 ~ka~~k~~~fg~~l~~-~~rla--~~~~~~~~~~~--------------~~~~~~~~~w~-p~~~-~~~~s~a~~aDa~~ 384 (422) T protein:vir:97 324 AAGRKAQRSFSSGFLN-VAYIA--VCLRDEFPYLR--------------NQFMDTVIKWE-PLFE-ADANMLTLVGDGAI 384 (422) T ss_pred HHHHHHHHHHHHHHHH-HHHHH--HHHhcCCcccc--------------hhhccceEEEc-cCCC-CChHHHHHHHHHHH Confidence 7778888888777655 44432 23444322211 223 345664 1111 232 23345556 Q ss_pred HHHHc--CCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcC Q lcl|NC_019524. 
486 LRIKN--GLSTYEAEISRLGGDFREVFKQRAREEGLIKSLK 524 (556) Q Consensus 486 ~~i~~--G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~G 524 (556) +.+.+ |+.+.+-+..+.|.+..+.-.++.++. +.-| T Consensus 385 Kl~~a~~~~~~~~~~~~~lg~~~~~~~~~~~~~~---~~d~ 422 (422) T protein:vir:97 385 KLNQAIPGFMDADVIRDLTGVKGADKPIPAITEV---TTDG 422 (422) T ss_pred HHHhhccccccHHHHHHHcCCCchhHHHHHHHhh---hccC Confidence 66777 787777777777986443332222221 2223 No 133 >protein:vir:6210 Length: 394 # NCBI annotation: Portal protein # Family: family:all:10882 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852590;genbank:gi:31415850;genbank:GeneID:1489208 Probab=99.19 E-value=1.1e-10 Score=75.14 Aligned_cols=392 Identities=9% Similarity=0.073 Sum_probs=192.8 Q ss_pred chhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHH Q lcl|NC_019524. 4 VKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVV 83 (556) Q Consensus 4 ~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v 83 (556) |.=..|.+.+.. ......+ ...++.....+.. . ..-+.+-+..++.+..+| T Consensus 1 MGl~~~~~~~~~-------------------~~~~~~~-~~~~~~~~~~~~~------~---~~vt~~~al~~~~v~~~i 51 (394) T protein:vir:62 1 MGLRDRFSNYLF-------------------KKAEKRG-YLDNVLGKSIRYS------G---VYVTDSNILQSSDVYELL 51 (394) T ss_pred Cchhhhhhhhcc-------------------CCCCchh-hhhhhhhcccccC------c---cccChhhhhccHHHHHHH Confidence 222222211000 0000000 0000000000000 0 000111233568899999 Q ss_pred HHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEE Q lcl|NC_019524. 84 AVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLAT 163 (556) Q Consensus 84 ~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~ 163 (556) +.|.+.+-.--|++.-+ +++ +..+ ...+... .+|+ -.+|..++....+..++..|++|+. 
T Consensus 52 ~~Ia~~iA~lp~~v~~~--------~g~---~~~~--~~~~~Ll-~~PN------~~~t~~~f~~~~~~~lll~Gn~~~~ 111 (394) T protein:vir:62 52 QDISNQMVLADIVVEDE--------FGN---EIKD--DIALQIL-RNPN------NYLTQSEFIKLMTNTYLLEGETFPI 111 (394) T ss_pred HHHHHhhcccceEEEcC--------CCc---ccch--hhHHHHh-ccCC------CCCCHHHHHHHHHHHHHhcCCeEEE Confidence 99999999877766422 111 1111 1122222 3443 3578999999999999999999988 Q ss_pred EeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceeeccc Q lcl|NC_019524. 164 CEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPARF 243 (556) Q Consensus 164 ~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~~~ 243 (556) +.. . +.. ++. .+ . ++.|..+ .|++.. . .. T Consensus 112 i~~-~---------~~~----~~~--~~----------~----~~~~~~~---~~~~~~-~-----------------~~ 140 (394) T protein:vir:62 112 LNG-A---------QIH----LAS--NV----------F----TELDDNL---VEHFNI-G-----------------GH 140 (394) T ss_pred Eec-c---------eee----ccc--cc----------e----EEECCce---EEEEee-C-----------------CE Confidence 631 1 111 110 00 0 1233322 122211 0 02 Q ss_pred cCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccc Q lcl|NC_019524. 244 DWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIF 323 (556) Q Consensus 244 ~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~ 323 (556) .+++.+|||+...- .+...|+|++..+...|.....-.++...-.+=.+...++++.+..-... ++..+.. T Consensus 141 ~~~~~eiih~r~~~-~d~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~--------~~~~~~~ 211 (394) T protein:vir:62 141 EIPPCMIRHVKNIG-ADHLRGKGILDLGRDTLEGVMSAEKTLTDKYKKGGLLTFLLNLDAHINPQ--------NGAQSKL 211 (394) T ss_pred EechhheEEecCcC-CCCccccChHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEEeCCCCCcC--------HHHHHHH Confidence 36778999987654 56789999998888777766666666666555567777888865331100 0000000 Q ss_pred cccccccccccccccceecCCceeeecCCCceee--eecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccchh Q lcl|NC_019524. 
324 NEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLK--MQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSS 401 (556) Q Consensus 324 ~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~--~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs 401 (556) - ..........-..|.+..|..|.+++ .++.+.-...|.+..+...+.||..+|||...| ++. +||+ T Consensus 212 ~-------~~~~~~~~g~~n~g~~~vl~~g~~~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~---~~sn 280 (394) T protein:vir:62 212 I-------NAILDQLESIDEARSVKMIPLGKGYSIDTLKSPLDDEKTLAYLNVYKKDLGKFLGINVDTY-TEL---IKED 280 (394) T ss_pred H-------HHHHHHhccccccCceeEeeCCCceeEEecCCCcchHHHHHHHHHHHHHHHHHhCCCHHHc-CCC---CCcC Confidence 0 00000001112356666677777555 455554566788888899999999999999876 443 4554 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhh Q lcl|NC_019524. 402 ARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKET 481 (556) Q Consensus 402 ~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~ 481 (556) +.+.. ..++..-+.|+...+ +.++....+.-..+.. +.+++- ...-.|+...+ T Consensus 281 ~e~~~-----------~~~~~~~l~P~~~~i-e~~l~~kll~~~~~~~-------------~~~~fd--~~~~~~~~~~~ 333 (394) T protein:vir:62 281 IEKAM-----------MYIHNKAVRPIMKNF-EDHLSLLFYAQNSGKR-------------IKFKIN--ILDFVTYSNKT 333 (394) T ss_pred HHHHH-----------HHHHHHHHHHHHHHH-HHHHhhhhcCccccCc-------------eEEEec--hhhhcCHHHHH Confidence 32222 334555667766664 3344444332211111 112221 12224566677 Q ss_pred HHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCcCC Q lcl|NC_019524. 482 EAAILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEETT 555 (556) Q Consensus 482 ~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~ 555 (556) ++..+.+++|+.|+-|+-+..|.+|-+- .+ ..++=++... 
.+-...+...+....++++.+ T Consensus 334 ~~~~~~~~~g~~T~NE~R~~~gl~p~~~-----~~---gd~~~~~~n~-----~~~~~~~~~~~~~kgge~~en 394 (394) T protein:vir:62 334 NIGYNLVRTAITSPDNVADMLGFPKQNT-----KE---SQAIYISNDV-----TEIGKKEATDGSLGGGEENEN 394 (394) T ss_pred HHHHHHHhCCCcCHHHHHHHhCCCCCCC-----CC---CCeeeccccc-----ccccccccccccCCCCCCCCC Confidence 8888999999999998888888876421 00 0011011000 001111111122222222111 No 134 >protein:vir:94002 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764318;genbank:gi:115315632;genbank:GeneID:5176589 Probab=99.18 E-value=2e-12 Score=84.70 Aligned_cols=376 Identities=9% Similarity=0.010 Sum_probs=184.2 Q ss_pred Hhhcccchhhhhhhhcchhcccc-CCCc-ccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHhhhccC Q lcl|NC_019524. 16 VDVVAETATATPMAVGGGMEGAE-RTTR-EMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVVAVHRDSIVGS 93 (556) Q Consensus 16 ~~~~~~~~~~~~~~~~~~y~aa~-~~~r-~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v~~~~~nvVG~ 93 (556) +..... .. ++.... .+.+ +...|. .+....|++++..+|+.|.+.|-.- T Consensus 1 Mg~f~~-------~~--~~~~~~~~~~~~~~~~~~--------------------~~~~~~~~~~v~~~v~~IA~~iA~l 51 (378) T protein:vir:94 1 MNLFGK-------VV--SFSRGKLNNDTQRVTAWQ--------------------NEAVEYTSAFVTNIHNKIANEITKV 51 (378) T ss_pred CCcccc-------ch--hcccccccCCcceeeeec--------------------cchhHHHHHHHHHHHHHHHhhhhhC Confidence 111100 00 000000 0000 011111 0113346678899999999998876 Q ss_pred CceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEEEeeccCCCCc Q lcl|NC_019524. 94 QYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLATCEWLNPTGTT 173 (556) Q Consensus 94 Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~~~ 173 (556) -|++.-+... -+..+........ .++..+-.+|+ -.++.+++....+..++..|++|+.+.+... 
T Consensus 52 p~~~~~~~~~--~~~~~~~~~~~~~---~l~~lL~~~PN------~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~---- 116 (378) T protein:vir:94 52 EFNHVKYKKS--DVGSDTLISMAGS---DLDEVLNWSPK------GERNSMDFWRKVIKKLLSAPYVDLYAVFDDN---- 116 (378) T ss_pred ceeeEEEccc--Ccccccccccccc---hHHHHHhhcCC------CCCCHHHHHHHHHHHHhhcCceEEEEEeeCC---- Confidence 6654322110 0000000000011 12223333454 3578999999999999999999987654321 Q ss_pred CCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceeeccccCChhHeEee Q lcl|NC_019524. 174 MQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPARFDWGRRRVIHI 253 (556) Q Consensus 174 ~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~~~~v~a~~viH~ 253 (556) .|+++.++. .+. ...++.++|||+ T Consensus 117 --------------------------------------~g~~~~l~p-----~~~-------------~~~~~~~diiH~ 140 (378) T protein:vir:94 117 --------------------------------------TGELLDLLF-----ADD-------------KKEYKPEELVRL 140 (378) T ss_pred --------------------------------------CceEEEEEe-----cCC-------------eeEeeeeeeEEe Confidence 122221111 000 012455699999 Q ss_pred ecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccccccccccccc Q lcl|NC_019524. 254 IEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANY 333 (556) Q Consensus 254 f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 333 (556) ..+ .+-..|+|.+..+...+ ..+.-++...++++.+..-... ......+.+...... T Consensus 141 ~~~--~~~~~g~s~l~~~~~~i-----------~~~~~~~~~~gil~~~~~l~~~------~~~~~~~~~~~~~~~---- 197 (378) T protein:vir:94 141 TSP--FYINEDTSILDNALASI-----------QTKLEQGKLRGLLKINAFLDID------NTQEYREKALTTIKN---- 197 (378) T ss_pred cCc--CCccchhHHHHHHHHHH-----------HHHHhcccccceeeeCCcCCHH------HHHHHHHHHHHHHHH---- Confidence 865 34456887776555433 2223345566777764321100 000000111110000 Q ss_pred cccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccchhHHHHHHHHHHHH Q lcl|NC_019524. 
334 VAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSARASMAETQKYM 413 (556) Q Consensus 334 ~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R~~~~e~~r~~ 413 (556) . ...-..|.+..|..|.+++.++.+....++ ...+.+..+||..+|||...|.+ +||. +. T Consensus 198 ---~-~~~~~~g~~~vl~~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fgVP~~~l~~-----~~se------~~---- 257 (378) T protein:vir:94 198 ---M-QEGSSYNGLTPVDNKTEIVELKKDYSVLNK-DEIDLIKSELLTGYFMNENILLG-----TASQ------EQ---- 257 (378) T ss_pred ---h-hcccccccceecCCCceEEEccCChhhhhH-HHHHHHHHHHHHHhCCCHHHhcC-----ChHH------HH---- Confidence 0 000135668899999999998877666676 45678889999999999988844 3331 11 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCCC Q lcl|NC_019524. 414 DSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGLS 493 (556) Q Consensus 414 ~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~ 493 (556) ...++..-+.|+..+| +.++-...++-..- .... .......+.+-.......|+...+++....+.+|+. T Consensus 258 ---~~~f~~~tL~P~~~~i-e~~l~~~Ll~~~er-~~g~-----~~~~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~ 327 (378) T protein:vir:94 258 ---QIYFYNSTIIPLLIQL-EKELTYKLISTNRR-RVVK-----GNLYYERIIVDNQLFKFATLKELIDLYHENINGPIF 327 (378) T ss_pred ---HHHHHHHHHHHHHHHH-HHHHHhhcCChhHh-hhhh-----hcccccceeecchhhhhcCHHHHHHHHHHHHhCCCc Confidence 1124555667766553 44454444321100 0000 000011234445555567999999999999999999 Q ss_pred CHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 494 TYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 494 s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 556 (556) |.-|+-+..|..|-+--+ ++=++....+... 
....+........++|+++| T Consensus 328 T~NE~R~~~gl~p~~gGD----------~~~~~~n~~~~~~--~~~~~~~~~~~~~~~e~~n~ 378 (378) T protein:vir:94 328 TQNQLLVKMGEQPIEGGD----------VYIANLNAVAVKN--LSDLQGSRKDVTSTDETNNQ 378 (378) T ss_pred CHHHHHHHhCCCCCCCCC----------eeeeccccccccc--chhhcCCcCCCCCCCCCCCC Confidence 999999988988764211 1111111111111 11111111122223333333 No 135 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=99.18 E-value=8.1e-10 Score=70.42 Aligned_cols=445 Identities=10% Similarity=0.032 Sum_probs=202.1 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhh-c-chhc-cccCCCcccccccCCCCCHHHHHH-HHHHHHHHHHHHH---- Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAV-G-GGME-GAERTTREMFQWNPSIISPDQQIA-QNQDMASARAQDM---- 72 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~-~-~~y~-aa~~~~r~~~~w~~~~~s~~~~i~-~~~~~lr~RaRdl---- 72 (556) |+|.-...-..-...........-...... . ..+. -..+-.+ +..+.... .+|. ...........+- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~-~~~YY~g~----~~i~~~~~~~~~~~~~~~~~~~ 75 (472) T protein:vir:93 1 MYPSQPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHLEKLPEISI-GQEYYEQR----PDIVKEPKPVDATGAVDPLKPD 75 (472) T ss_pred CCCCCCcchhhhhceeeecCchhhHHHHHHHHHHHHHHHHHHHHH-HHHHhccc----cccccccchhhccccccccccc Confidence 666554322111000000000000000000 0 0000 0000000 00000000 0000 0000000000000 Q ss_pred -HhcChHHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHh Q lcl|NC_019524. 73 -VQNDGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAV 151 (556) Q Consensus 73 -~rNn~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~ 151 (556) -..+++++-+|+..+.+++|.+++.... + +.....|+.|..+ +|...+..+. 
T Consensus 76 ~ri~~n~~~~ivd~~~~~l~g~~~~~~~~---------d-------~~~~~~l~~~~~n-----------~~~~~~~~~~ 128 (472) T protein:vir:93 76 DRMITNFHANLVDQKVSYIVGKPIAFKHT---------D-------DEVVKRIDEVLGN-----------RFDDKLHSVL 128 (472) T ss_pred cccccchHHHHHHHHhhhhcccCeeeccC---------C-------hHHHHHHHHHHhc-----------cHHHHHHHHH Confidence 0125899999999999999998877532 1 1223345555431 3566666677 Q ss_pred hhheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEE---CCCCC-------eEEEEEe Q lcl|NC_019524. 152 SGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQL---DNNGA-------ALGYWLR 221 (556) Q Consensus 152 r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~---d~~Gr-------~vaY~i~ 221 (556) +..+.-|.+|+.+. ....+ .+++.+++|..+-.-+.......+..+|++ +.... .+.|+++ T Consensus 129 ~~~~~~G~~~~~v~-~d~d~--------~~~i~~~~p~~~~~i~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~~~~~~~ 199 (472) T protein:vir:93 129 TGASNKGIEWLHPY-LDEEG--------EFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKVTVNYYVY 199 (472) T ss_pred HHHhhcCeEEEEEE-ECCCC--------ceEEEEEcccceEEEEcCCCCCceEEEEEEEEeecceeEEEEecCeEEEEEE Confidence 88899999997754 32211 257888999887543333333445666644 11111 1112221 Q ss_pred ecCCCcccc---CCccccceeeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeee Q lcl|NC_019524. 222 KAFPGDPTD---MEQWKWGYEPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAAS 298 (556) Q Consensus 222 ~~hpgd~~~---~~~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~f 298 (556) .. +.... .....|.-......++.-=|+++... 
..|.|.|.+++..+..++....-......-.+.-..+ T Consensus 200 ~~--~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn-----~~g~s~~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~ 272 (472) T protein:vir:93 200 EN--GSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNN-----DLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYV 272 (472) T ss_pred ec--CeeeecccccccccccccccCCCCCcceEEecCC-----CCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeE Confidence 11 00000 00000000000000111113333322 3589999998777766654433333333322222233 Q ss_pred EeccCcccccccccccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHH Q lcl|NC_019524. 299 VESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRN 378 (556) Q Consensus 299 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~ 378 (556) ++.-..+. .+.....+..+.+..+..|.++++++.+.+...+..+.+.+.+. T Consensus 273 ~~g~~~~~----------------------------~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~ 324 (472) T protein:vir:93 273 LTNYDDQE----------------------------LPEFKRLLRYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQK 324 (472) T ss_pred eecCCccc----------------------------chhhHHHHhhccccccCCCCcceeEeecCCHHHHHHHHHHHHHH Confidence 33111000 00011123455566678888999998888899999999999888 Q ss_pred HHHhcCCC---HHHhhchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccc Q lcl|NC_019524. 379 IAASLGMS---YEQFSRDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYD 455 (556) Q Consensus 379 iaaglGi~---ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~ 455 (556) |..-.++| .+.++++. |-.+++..+..........+..|-.. ++.+++..++ ++ | .+... 
T Consensus 325 i~~~s~~p~~~~~~~~~n~---Sg~Al~~~~~~l~~ka~~~~~~~~~~-l~~~~~li~~--~~-~---~~~~~------- 387 (472) T protein:vir:93 325 IMLFGQAVDFSSDKFGSAP---SGVALEFLYTNLNLKADKLARKAKVA-IQELLWFVFE--HF-D---IKGEH------- 387 (472) T ss_pred HHHHhCCCCCCccccccCc---hHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHH--Hh-C---CCccc------- Confidence 88887766 23344433 33356655555555555555554333 2333333222 11 2 12111 Q ss_pred hhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCC--CHHHHHHHHHHHHHHHHHcCCCCCccccc Q lcl|NC_019524. 456 PMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGG--DFREVFKQRAREEGLIKSLKLDFTGKMVE 533 (556) Q Consensus 456 ~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~--D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~ 533 (556) .-+.+.|... .+ .|-...+++..+. .|+.|.+..+...+. |+++.++++.+|++...+.--.. T Consensus 388 -----~~i~v~f~~~-~p-~~~~~~~~~~~k~--~giis~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~------ 452 (472) T protein:vir:93 388 -----KDVDISFNYN-KV-ANTELQVQTAQQS--MGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNL------ 452 (472) T ss_pred -----ceeeEEeCCC-CC-CCHHHHHHHHHHH--hccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhccCc------ Confidence 1135677332 22 4555555555543 699999999999864 89999999999876544331111 Q ss_pred cCCCCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 534 GNSTQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 534 ~~~~~~~~~~~~~~~~~~~e~~~ 556 (556) ....... ..+.+++++++.| T Consensus 453 --~~~~~d~-~~~~~~~~~~~~e 472 (472) T protein:vir:93 453 --DDGGADG-AQQQERSNNKESE 472 (472) T ss_pred --CcccCCC-CCCCCCCCcccCC Confidence 1111111 1122222333333 No 136 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=99.16 E-value=1e-09 Score=69.93 Aligned_cols=446 Identities=8% Similarity=-0.007 Sum_probs=207.3 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHH Q lcl|NC_019524. 
1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAA 80 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~ 80 (556) +.....+.+.-- ..-.....+.. ....|.-+.. .....+.. ..+. ...+. | -.+++++ T Consensus 39 ~~~~~~i~~~i~-----~~~~~~~~r~~-~l~~Yy~g~~--~i~~~~~~---~~~~-~~~~~-----k-----i~~n~~k 96 (511) T protein:vir:10 39 LQNVNEVSKCIE-----HHMDYQRPRLK-VLSDYYEGKT--KNLVELTR---RKEE-YMADN-----R-----VAHDYAS 96 (511) T ss_pred ccCHHHHHHHHH-----HHHHhhHHHHH-HHHHHhcccC--ccccccCc---cccc-ccCcc-----e-----eecchHH Confidence 111122221110 00000001111 1123432221 11111111 0000 00000 1 2258999 Q ss_pred HHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCce Q lcl|NC_019524. 81 GVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEV 160 (556) Q Consensus 81 ~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~ 160 (556) -+++..+.+++|.+++..+. + +.....|+.|... + +|...+..+.+....-|.+ T Consensus 97 ~Iv~~~~~yl~g~p~~~~~~------------d----~~~~~~l~~~~~~----n------~~~~~~~~~~~~~~i~G~a 150 (511) T protein:vir:10 97 YISDFINGYFLGNPIQYQDD------------D----KDVLEAIEAFNDL----N------DVESHNRSLGLDLSIYGKA 150 (511) T ss_pred HHHHHHhhhhcccCceeecC------------c----hHHHHHHHHHHhh----c------CHHHHHHHHHHHHHhcCee Confidence 99999999999988877542 1 1223345555443 1 4667777788888999999 Q ss_pred EEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECC-------CCCeE-EEEEeecCCCccccCC Q lcl|NC_019524. 161 LATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDN-------NGAAL-GYWLRKAFPGDPTDME 232 (556) Q Consensus 161 f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~-------~Gr~v-aY~i~~~hpgd~~~~~ 232 (556) +..+.. ...+ -+++.+++|..+-.-++......++.+|++-. ....+ -+.++..+---.+... 
T Consensus 151 y~~vy~-dedg--------~~~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~d~~~~~~~~~~~iyt~~~i~~~~~~ 221 (511) T protein:vir:10 151 YEIMIR-NQDD--------ETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTS 221 (511) T ss_pred EEEEEe-CCCC--------ceEEEEEccceeEEEEcCCCCCceEEEEEEEEeeecccCccceEEEEEEEeCCcEEEEEec Confidence 977543 2211 25788899988754344333345677776521 11222 2223332210000000 Q ss_pred cccc-----c---eee-ccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccC Q lcl|NC_019524. 233 QWKW-----G---YEP-ARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESEL 303 (556) Q Consensus 233 ~~~~-----~---rv~-~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~ 303 (556) ...+ . ..| .+..+| |+++... ..|.|.|.+++..+..++....-......-.+.-..+++-.. T Consensus 222 ~~~~~~~~~~~~~~~~~~~~~vP---vv~f~nn-----~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~ 293 (511) T protein:vir:10 222 RTNGLKLTPRENGFESHSFERMP---ITEFSNN-----ERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNL 293 (511) T ss_pred CCCcccccccccccccccCccee---EEEecCC-----CCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeeccc Confidence 0000 0 000 011222 4444332 268899999888776555433222222211121112222111 Q ss_pred cccccccccccccccccccccccccccccccccccceec-----CCceeeecCCCceeeeecCCCCCccHHHHHHHHHHH Q lcl|NC_019524. 304 PSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAI-----DGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRN 378 (556) Q Consensus 304 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l-----~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~ 378 (556) .... .+... ........+ ..+......+|-++++++.+.+...+..+.+.+.+. 
T Consensus 294 ~~~~---------~~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~ 352 (511) T protein:vir:10 294 NLDP---------VEVRK------------QKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSD 352 (511) T ss_pred cCCc---------hhhcc------------chhccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHH Confidence 1000 00000 000001111 122233456678899999888999999999999998 Q ss_pred HHHhcCCC---HHHhhchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccc Q lcl|NC_019524. 379 IAASLGMS---YEQFSRDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYD 455 (556) Q Consensus 379 iaaglGi~---ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~ 455 (556) |....++| .+.++++.| --++|..+..........+..|-..+ +..++..+...-..+.+..+... T Consensus 353 I~~~s~~P~~~~~~~~~n~S---g~Al~~~~~~l~~k~~~k~~~f~~~l-~~~~~li~~~~~~~~~~~~~~d~------- 421 (511) T protein:vir:10 353 IHMFTNTPNMKDDNFSGTQS---GEAMKYKLFGLEQRTKTKEGLFTKGL-RRRAKLLETILKNTRSIDANKDF------- 421 (511) T ss_pred HHHHhCCcccccccccccch---HHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhhCCccccccc------- Confidence 88877766 344544443 34666666666555655555554443 33444433322222222211100 Q ss_pred hhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCC--CHHHHHHHHHHHHHHHHHcCCCCCccccc Q lcl|NC_019524. 456 PMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGG--DFREVFKQRAREEGLIKSLKLDFTGKMVE 533 (556) Q Consensus 456 ~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~--D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~ 533 (556) .-+.+.|..+-. .|-...+++..++ .|+.|.+..+...+. |+++.++++++|++...+.-... ... T Consensus 422 -----~~i~i~f~~~~p--~d~~~~~~~~~kl--~G~iS~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~---~~~ 489 (511) T protein:vir:10 422 -----NTVRYVYNRNLP--KSLIEELKAYIDS--GGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKG---IYK 489 (511) T ss_pred -----ceeeEEeCCCCC--cCHHHHHHHHHHH--hccCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhh---ccc Confidence 013456754433 4666667666665 489999999999875 89999999999977543321110 000 Q ss_pred cCCCCCCCCCCCCCCCCCCcCC Q lcl|NC_019524. 
534 GNSTQSSNSSESTSDNPNEETT 555 (556) Q Consensus 534 ~~~~~~~~~~~~~~~~~~~e~~ 555 (556) .........+++++++..++.+ T Consensus 490 ~~~~~~~~~~~~~~~~~~~~~~ 511 (511) T protein:vir:10 490 DPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred CCCCCCCCCCCCcccCcccccC Confidence 0011111111111111111111 No 137 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=99.16 E-value=1.1e-09 Score=69.79 Aligned_cols=422 Identities=11% Similarity=-0.024 Sum_probs=206.3 Q ss_pred hcccchhhhhhhh-cchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHH----------------------- Q lcl|NC_019524. 18 VVAETATATPMAV-GGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMV----------------------- 73 (556) Q Consensus 18 ~~~~~~~~~~~~~-~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~----------------------- 73 (556) +..- ..... ..-|.. ... +..|.. .-.+..+...+..-..+-.+.+ T Consensus 1 ~~~~----~~~~~~~~~~~~--~~~--~~~~~~--~~i~~~~~~~~~~~~~~~~~yy~g~~~i~~~~~~~~~~~~~~~~~ 70 (479) T protein:vir:79 1 MLNI----YISETDLIKVQL--KKE--STINLV--KVIEHYILKHRPEKYKQGEEYYYGNTDVNNKRRYYLLDGAKVDDF 70 (479) T ss_pred CCCc----eecccceEeecc--ccC--ChhHHH--HHHHHHHhhhhHHHHHHHHHHhccCCccccccccccccccccccc Confidence 1000 00000 000000 000 000000 0000000000000001111111 Q ss_pred ------hcChHHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHH Q lcl|NC_019524. 74 ------QNDGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLT 147 (556) Q Consensus 74 ------rNn~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq 147 (556) -.+++++-+|+..+.++.|.+++.... 
++.+...|+.|..+ +|...+ T Consensus 71 ~~~~~ki~~~~~~~Ivd~~~~~l~g~p~~~~~~----------------~~~~~~~~~~~~~n-----------~~~~~~ 123 (479) T protein:vir:79 71 TKVNNKAINNYHKLLVDQKVGYSVGNPIVFNAD----------------DDNLTKLLNDLLGE-----------EFDDTI 123 (479) T ss_pred ccCcceeecchHHHHHHHHHhhhhcCCceeccC----------------CHHHHHHHHHHHhc-----------CHHHHH Confidence 137889999999999999998877532 12344566666431 567777 Q ss_pred HHHhhhheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEE----CCCCCeEEE-EEee Q lcl|NC_019524. 148 RLAVSGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQL----DNNGAALGY-WLRK 222 (556) Q Consensus 148 ~l~~r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~----d~~Gr~vaY-~i~~ 222 (556) ..+++..+.-|.+|.++.+ +..+ -+++..++|+.+---+.......+..+|++ +..|..+-| .++. T Consensus 124 ~~~~~~~~~~G~~~~~v~~-d~~~--------~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~y~~~~~~~~~~~~~e~y~ 194 (479) T protein:vir:79 124 TELYLNASNKGVEWLHPYI-NRKG--------EFKYVIIPAEEAIPIWDSKRQRELVAFIRFYYIEDIDGNKIKRVEYYT 194 (479) T ss_pred HHHHHHHHhcCeEEEEEEe-CCCC--------ceEEEEEccceeEEEEeCCCCCceEEEEEEEEEeecCCceEEEEEEEe Confidence 7778889999999977643 2221 258899999888533333333446667753 334443322 1111 Q ss_pred cCCCccccCCccccc-------------------eee-ccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHH Q lcl|NC_019524. 223 AFPGDPTDMEQWKWG-------------------YEP-ARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQ 282 (556) Q Consensus 223 ~hpgd~~~~~~~~~~-------------------rv~-~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~ 282 (556) .+-.-.+......+. .+. ....++.-=|+|+... ..|+|.|..++..+..++... 
T Consensus 195 ~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn-----~~g~sd~~~v~~liDa~d~~~ 269 (479) T protein:vir:79 195 ENDITYFIERGNSFIQEFLYDEYGKMTDIQEGHFRINNKEQGWGKVPFIPFKNN-----EKCVSDLTFYKSLIDIYDNNI 269 (479) T ss_pred CCcEEEEEecCCcccccccccccccccccccccccccccccCCCcccEEEecCC-----CCCCcchhhhHHHHHHHHHHH Confidence 110000000000000 000 0000111123444332 358999998887776666543 Q ss_pred HHHHHHHHHhcceeeeEeccCcccccccccccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCC Q lcl|NC_019524. 283 EITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAG 362 (556) Q Consensus 283 dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~ 362 (556) .-......-.+.-..+++.-.++. . +.....+..+.+..+.+|.++++++.+ T Consensus 270 S~~~~~~~~~~~~~~v~~g~~~~~---------~-------------------~~~~~~~~~~~~i~~~~~~~~~~l~~~ 321 (479) T protein:vir:79 270 STLADNLDEIQEVIYVLKEYPGTS---------L-------------------QEFIDNIRYYKSIKVDGGGGVDKLEIN 321 (479) T ss_pred HHHHHHHHHhhCceeeeecCCccc---------c-------------------ccchhhhhhccceecCCCCcceEEecc Confidence 332222222111112222110000 0 001112445666678889999999999 Q ss_pred CCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCc Q lcl|NC_019524. 363 TPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNV 442 (556) Q Consensus 363 ~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l 442 (556) .+...+..+++.+.+.|....++|-... ..++++|-.+.+..+..........+..|.. .++.+++..+...-..+.. 
T Consensus 322 ~~~~~~~~~~~~l~~~i~~~s~~p~~~~-~~~gn~Sg~Ai~~~~~~l~~k~~~~~~~~~~-~l~~~~~li~~~~~~~~~~ 399 (479) T protein:vir:79 322 IPVEAKKELLDRLEKNIIIFGQGVNPES-QNTGDKSGVALKFLYSLLDLKCSKTEKKFKK-AIRELLWFVCEYLKISGNK 399 (479) T ss_pred CCHHHHHHHHHHHHHHHHHHhCcccccc-ccccchhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhccCCC Confidence 9999999999999999988888774333 2344445556666555555555555555444 3455555544432222221 Q ss_pred cCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCC--CHHHHHHHHHHHHHHH Q lcl|NC_019524. 443 PLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGG--DFREVFKQRAREEGLI 520 (556) Q Consensus 443 ~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~--D~e~v~~q~a~E~~~~ 520 (556) ... ..-+.+.|...- ..|-...+++..++ .|+.|.+..+...+. |+++.++++++|++.. T Consensus 400 ~~~--------------~~~i~i~f~~~~--p~~~~~~a~~~~kl--~g~iS~et~l~~l~~v~d~~~E~~ri~~E~~~~ 461 (479) T protein:vir:79 400 SYD--------------YKTVQITFNHSM--IINEAEKIDMAAKS--TGIVSDETIVSNHPWVEDVNDELERLKKQEDTQ 461 (479) T ss_pred ccc--------------cccceEEeCCCC--CcCHHHHHHHHHHH--hccCcHHHHHHhCCCCCCHHHHHHHHHHHHHHH Confidence 110 112356663322 24655555554443 699999999999874 8999999999998765 Q ss_pred HHcCCCCCccccccCCCCCCCCCCCCCCCCCCcC Q lcl|NC_019524. 521 KSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEET 554 (556) Q Consensus 521 ~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e~ 554 (556) .+..-.. ....++..+|| T Consensus 462 ~~~~~~~----------------~~~~~~~~~e~ 479 (479) T protein:vir:79 462 KEYDDLI----------------PNNQDGVIDET 479 (479) T ss_pred HHHHhcc----------------CcccCCCcCcC Confidence 4432111 11122222222 No 138 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=99.16 E-value=1.1e-09 Score=69.76 Aligned_cols=397 Identities=12% Similarity=-0.021 Sum_probs=205.3 Q ss_pred CcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHH Q lcl|NC_019524. 
2 KDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAG 81 (556) Q Consensus 2 sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~ 81 (556) =..+.+.+...+-. .+. .+... ...|.-+...-+.+ +... -+.++.+.| ++ .+|++- T Consensus 1 ~~~~~i~~L~~~~~----~~~--~r~~~-~~~yY~g~~~~~~~------~~~~-------p~~~~~~~~-~v--~nw~~~ 57 (409) T protein:vir:94 1 MTEKGIGYLRFKLS----VHK--RRAEM-RYDQYAMKYVDRFK------GITI-------PQALSQQYR-SI--LGWCAK 57 (409) T ss_pred CCHHHHHHHHHHHH----HHh--HHHHH-HHHHhcccCchhhc------Chhh-------hHHHHHHHh-hh--cchhHH Confidence 11122222221110 011 11111 12333222211111 0011 112333333 23 368899 Q ss_pred HHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceE Q lcl|NC_019524. 82 VVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVL 161 (556) Q Consensus 82 ~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f 161 (556) +|+.+++.++=.||+. ++ +.+|+-|.. .+|...+..+.+..+.-|.+| T Consensus 58 iVds~a~rl~~~Gf~~---~d------------------~~l~~i~~~-----------N~ld~~~~~~~~~aliyG~sf 105 (409) T protein:vir:94 58 GVDSLADRLVFREFEN---DD------------------FTVNEIFEE-----------NNPDIFFDSAVLSSLIASCSF 105 (409) T ss_pred HHHHhHhhcccCcccC---Cc------------------hHHHHHHHh-----------cChhHHHHHHHHHHHHhccee Confidence 9999988777778752 11 124554433 257788889999999999999 Q ss_pred EEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEE---CCCCCeEEEEEeecCCCccccCCccccce Q lcl|NC_019524. 162 ATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQL---DNNGAALGYWLRKAFPGDPTDMEQWKWGY 238 (556) Q Consensus 162 ~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~---d~~Gr~vaY~i~~~hpgd~~~~~~~~~~r 238 (556) +.. +... ++. ..|+.++|..+-.-++. ..+.+..++.+ |..|.++.+-++..+---.+......|.. 
T Consensus 106 ~~v-~~~~-----dg~---~~i~~~sp~~~~~i~D~-~~~~~~~a~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 175 (409) T protein:vir:94 106 TYI-SKGE-----NDA---VRLQVIEAVNATGIIDP-ITGLLTEGYAVLERDENNNVVLEAHFLPDRTDYYYRDSRNNIS 175 (409) T ss_pred EEE-ecCC-----CCc---eEEEEeccceEEEEEec-CCCceeeeEEEEEecCCCceEEEEEEecCcEEEEEecCceeEe Confidence 875 3221 122 37899999888644432 23456667654 34565544322221111111122334444 Q ss_pred eeccccCChhHeEeeecccCCCcccCCchhh-HHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccc Q lcl|NC_019524. 239 EPARFDWGRRRVIHIIEALLAGQTRGISEMV-SALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQG 317 (556) Q Consensus 239 v~~~~~v~a~~viH~f~~~r~gQ~RGvs~la-~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~ 317 (556) +|- ..+.-=|+|++...+.+-.=|.|.++ |++..+..+.+-.--.+..+...|.=-.+|+--+. + T Consensus 176 ~~n--~~g~vPvV~f~n~~~~~~~~G~s~I~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~------------d 241 (409) T protein:vir:94 176 IAN--PTGHPLLVPIIHRPDAVRPFGRSRITRSGMYWQSNAKRTLERADVTAEFYSFPQKYVTGLSD------------D 241 (409) T ss_pred eeC--CCCCcceEEeccccccccccCccccchhHHHHHHHHHHHHHHHHHHHHHhcChhheeEecCC------------C Confidence 442 33444488888877777777899886 45555555544443333333333322122221000 0 Q ss_pred cccccccccccccccccccccceecCCceeeecCC---CceeeeecCCCC-CccHHHHHHHHHHHHHHhcCCCHHHhhch Q lcl|NC_019524. 318 GFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYP---GTKLKMQPAGTP-GGVGTDYEQSLLRNIAASLGMSYEQFSRD 393 (556) Q Consensus 318 ~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~p---Ge~i~~~~~~~p-~~~f~~F~~~~lr~iaaglGi~ye~l~~D 393 (556) ..........++.|..++. |+.+++.+.+.. -.+|.+.++.+++.+|+-.++|-+.|.+. 
T Consensus 242 ----------------~~~~~~~~~~~~~i~~~~~d~dg~~~~v~q~~~~~l~~~~~~l~~~~~~~a~~t~lP~~~lg~~ 305 (409) T protein:vir:94 242 ----------------AEPMETWKATVSSMLQFTKDEDGDKPTLGQFTQPSMSPFTEQLRTAAAGFAGETGLTLDDLGFV 305 (409) T ss_pred ----------------CcccchhhhhHHHhhcCCCCCCCCCceEEecCCCChhHHHHHHHHHHHHHhhhcCCCHHHhccc Confidence 0001112234566777753 444555332222 24688888999999999999999988776 Q ss_pred hhc-ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcc Q lcl|NC_019524. 394 YTK-TNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASR 472 (556) Q Consensus 394 ~s~-~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~ 472 (556) ..+ +|=-++++.+....+..+..|..|-..+.+ +++ |-.++..+.-..|..+ .-+.+.|.+.-. T Consensus 306 ~~NpsSa~Al~a~~~~L~~~a~~k~~~fg~~~~~-~~r--la~~i~~~~~~~~~~~------------~~~~v~W~p~~~ 370 (409) T protein:vir:94 306 SDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLN-VAY--LAACLRDDAPYLREQF------------RKTKPKWEPLFE 370 (409) T ss_pred cCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHH--HHHHHhCCCCcccccc------------ccceEEeccCCC Confidence 532 334456777777777777777777666544 444 3334444432222110 013566862211 Q ss_pred ccc-chhhhhHHHHHHHHcC--CCCHHHHHHHhCCCHHH Q lcl|NC_019524. 
473 GQI-DEKKETEAAILRIKNG--LSTYEAEISRLGGDFRE 508 (556) Q Consensus 473 ~~i-DP~Ke~~A~~~~i~~G--~~s~~~~~ae~G~D~e~ 508 (556) ..+ .-...+.|..+.+.+| +.+.+-.....|.+-.+ T Consensus 371 ~~~~~~a~~aDa~~Kl~~ag~~~~~~~~~~~~lG~~~~d 409 (409) T protein:vir:94 371 ADASMLSLIGDGAIKLNQAIPEFINKDTIRDLTGIEGGE 409 (409) T ss_pred cchHHHHHHHHHHHHHHHhcccccchhHHHHHcCCCCCC Confidence 111 1134467788889998 55556555666987766 No 139 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=99.15 E-value=1.2e-09 Score=69.53 Aligned_cols=450 Identities=8% Similarity=-0.017 Sum_probs=205.3 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAA 80 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~ 80 (556) +...+.+...--.- .. . ...+.. ....|.-+.. .....+.. ..+.. ..+. | --+++++ T Consensus 39 ~~~~~~i~~~i~~~---~~-~-~~~r~~-~l~~Yy~g~~--~i~~~~~~---~~~~~-~~~~-----k-----i~~n~~k 96 (511) T protein:vir:96 39 LQNVNEVSKYIEHH---MD-Y-QRPRLK-VLSDYYEGKT--KNLVELTR---RKEEY-MADN-----R-----VAHDYAS 96 (511) T ss_pred hccHHHHHHHHHHH---HH-h-hHHHHH-HHHHHhcccC--ccccccCc---Ccccc-cCcc-----e-----eecchHH Confidence 11111111110000 00 0 001111 1123432221 11111110 00000 0000 1 1258999 Q ss_pred HHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCce Q lcl|NC_019524. 81 GVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEV 160 (556) Q Consensus 81 ~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~ 160 (556) -+++..+.++.|.+++.... ++..++ .++.|... 
.+|......+.+....-|.+ T Consensus 97 ~Iv~~~~~yl~g~p~~~~~~------------~~~~~~----~l~~~~~~----------n~~~~~~~~~~~~~~i~G~a 150 (511) T protein:vir:96 97 YISDFINGYFLGNPIQYQDD------------DKDVLE----AIEAFNDL----------NDVESHNRSLGLDLSIYGKA 150 (511) T ss_pred HHHHHHHhhhccCCceeecC------------chHHHH----HHHHHHhh----------cCHHHHHHHHHHHHHhcCee Confidence 99999999999988776532 112233 34444332 14777778888888999999 Q ss_pred EEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECC-------C-CCeEEEEEeecCCCccccCC Q lcl|NC_019524. 161 LATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDN-------N-GAALGYWLRKAFPGDPTDME 232 (556) Q Consensus 161 f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~-------~-Gr~vaY~i~~~hpgd~~~~~ 232 (556) +.++.. ...+ .+++.+++|..+---++......++.+|.+-. . +...-+.++..+---.+... T Consensus 151 ~~~vy~-ded~--------~~~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~d~~~~~~~~~~~iyt~~~i~~~~~~ 221 (511) T protein:vir:96 151 YELMIR-NQDD--------ETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTS 221 (511) T ss_pred EEEEEe-CCCC--------ceEEEEEccceeEEEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEec Confidence 977543 2211 25788899988753333333345677776521 1 12222223332210000000 Q ss_pred ccccc--------eee-ccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccC Q lcl|NC_019524. 233 QWKWG--------YEP-ARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESEL 303 (556) Q Consensus 233 ~~~~~--------rv~-~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~ 303 (556) ...|. ..+ .+..|| |+++... ..|.|.|.+++..+..++....-......-.+.-..+++... 
T Consensus 222 ~~~~~~~~~~~~~~~~~~~~~vP---vv~~~nn-----~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~ 293 (511) T protein:vir:96 222 RTNGLKLTPRENGFESHSFERMP---ITEFSNN-----ERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNL 293 (511) T ss_pred CCCcccccccccccccccCCcee---eEEecCC-----CCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCc Confidence 00000 000 011122 4444332 368999999888877666544333333322222222333211 Q ss_pred cccccccccccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhc Q lcl|NC_019524. 304 PSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASL 383 (556) Q Consensus 304 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaagl 383 (556) .... .+... ................+......+|-++++++.+.+...+..+.+.+.+.|..-. T Consensus 294 ~~~~---------~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s 357 (511) T protein:vir:96 294 NLDP---------VEVRK-------QKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFT 357 (511) T ss_pred cCCc---------hhhcc-------cccccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHh Confidence 1000 00000 0000000000001112223345567789999988889999999999999887777 Q ss_pred CCC---HHHhhchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHH Q lcl|NC_019524. 384 GMS---YEQFSRDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRD 460 (556) Q Consensus 384 Gi~---ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~ 460 (556) ++| .+.++++.| --+++..+..........+..|-.. ++..++..+...-..+.+..+.. . 
T Consensus 358 ~~p~~~~~~~~~n~S---g~Al~~~~~~l~~k~~~k~~~~~~~-l~~~~~li~~~~~~~~~~~~~~d------------~ 421 (511) T protein:vir:96 358 NTPNMKDDNFSGTQS---GEAMKYKLFGLEQRTKTKEGLFTKG-LRRRAKLLETILKNTWSIDANKD------------F 421 (511) T ss_pred CCcccccccccccch---HHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhhcCcccccc------------c Confidence 766 344444443 3356666555555555555554443 34444444332222233221110 0 Q ss_pred HhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCC--CHHHHHHHHHHHHHHHHHcCCC-CCccccccCCC Q lcl|NC_019524. 461 ALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGG--DFREVFKQRAREEGLIKSLKLD-FTGKMVEGNST 537 (556) Q Consensus 461 a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~--D~e~v~~q~a~E~~~~~~~Gl~-~~~~~~~~~~~ 537 (556) .-+.+.|..+-. .|-...+++...+ .|+.|.+..+...+. |+++.++++.+|++...+.-.. ....+ .. T Consensus 422 ~~i~~~f~~~~p--~n~~e~~~~~~kl--~G~iS~et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~----~~ 493 (511) T protein:vir:96 422 NTVRYVYNRNLP--KSLIEELKAYIDS--GGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDP----RD 493 (511) T ss_pred ccceEEeCCCCC--CCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhccccCC----CC Confidence 113566753333 4666666655544 799999999998874 8999999999997654332111 10010 11 Q ss_pred CCCCCCCCCCCCCCCcCC Q lcl|NC_019524. 538 QSSNSSESTSDNPNEETT 555 (556) Q Consensus 538 ~~~~~~~~~~~~~~~e~~ 555 (556) .....+++++++..++.+ T Consensus 494 ~~~~~~~~~~~~~~~~~~ 511 (511) T protein:vir:96 494 INDDEQDDDTKDTVDKKE 511 (511) T ss_pred CCCCCCCCcccccccccC Confidence 111111111111111111 No 140 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=99.12 E-value=1.7e-09 Score=68.68 Aligned_cols=433 Identities=9% Similarity=0.009 Sum_probs=204.1 Q ss_pred CCcchh-hhHHHHHhhHhhcccchhhhhhhhcchhccccCC--CcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcCh Q lcl|NC_019524. 
1 MKDVKK-TTRTRAKKAVDVVAETATATPMAVGGGMEGAERT--TREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDG 77 (556) Q Consensus 1 ~sp~~~-~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~--~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~ 77 (556) ..|-.. .-...-.+........ ..+. .....|+-+... .+....|. +.. .. ..++.. -..++ T Consensus 28 ~~~~~~e~~~~~i~~~i~~~~~~-~~r~-~~l~~YY~g~~~i~~~~~~~~~------~~~----~~--~~~~~~-ki~~n 92 (483) T protein:vir:12 28 RTNNKPETLEEMIVRYIKQHLEK-LPEI-SIGQEYYEQRPDIVKEPKPVDA------TGA----VD--PLKPDD-RMITN 92 (483) T ss_pred ccCCchhhHHHHHHHHHHHHHHH-HHHH-HHHHHHhccccccccccccccc------ccc----cc--cccccc-ccccc Confidence 111111 0000001111111110 1111 112234322211 00000000 000 00 001100 01368 Q ss_pred HHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheec Q lcl|NC_019524. 78 YAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMT 157 (556) Q Consensus 78 ~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~d 157 (556) +++-+|+..+.+++|.+++.... + +.....|+.|..+ +|...+..+.+..+.- T Consensus 93 ~~k~Ivd~~~~~l~G~p~~~~~~---------d-------~~~~~~l~~~~~n-----------~~~~~~~~~~~~~~~~ 145 (483) T protein:vir:12 93 FHANLVDQKVSYIVGKPIAFKHT---------D-------DEVVKRIDEVLGN-----------RFDDKLHSVLTGASNK 145 (483) T ss_pred hHHHHHHHHhhhhcccCceeccC---------C-------hHHHHHHHHHHhc-----------cHHHHHHHHHHHHhhC Confidence 99999999999999998876532 1 1223345555431 3455555566778889 Q ss_pred CceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEE---CCCCC-------eEEEEEeecCCCc Q lcl|NC_019524. 158 GEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQL---DNNGA-------ALGYWLRKAFPGD 227 (556) Q Consensus 158 GE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~---d~~Gr-------~vaY~i~~~hpgd 227 (556) |.++..+.+ ...+ .+++.+++|..+-.-+.....+.+..+|.+ +.... .+-|++... ... 
T Consensus 146 G~~y~~v~~-d~d~--------~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~y~~~~v~~~~~~~-~~~ 215 (483) T protein:vir:12 146 GIEWLHPYL-DEEG--------EFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKVTVNYYVYEN-GSL 215 (483) T ss_pred CeEEEEEEE-cCCC--------ceEEEEEcccceEEEEcCCCCCceEEEEEEEEeecceEEEEEecCeEEEEEEeC-Cee Confidence 999977543 3221 257899999988543333333446666654 11111 122222211 000 Q ss_pred --cccCCccccce--ee-ccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEecc Q lcl|NC_019524. 228 --PTDMEQWKWGY--EP-ARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESE 302 (556) Q Consensus 228 --~~~~~~~~~~r--v~-~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~ 302 (556) ........|.. .+ .+..|| |+.+.. -..|.|.|.+++..+..++....-......-.+.-..+++.- T Consensus 216 ~~~~~~~~~~~~~~~~~~~~g~vP---vv~~~n-----n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~ 287 (483) T protein:vir:12 216 IPDYSNNLENSKTHFSTGSWGKIP---FIPFKN-----NDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNY 287 (483) T ss_pred eecccccccccccccccCCCCccc---eEEecC-----CCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecC Confidence 00000000000 00 011122 333332 236899999988777655543333333233222222333321 Q ss_pred CcccccccccccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHh Q lcl|NC_019524. 303 LPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAAS 382 (556) Q Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaag 382 (556) ..+.. +.....+..+.+..+..|.++++++.+.+...+..+.+.+.+.|... 
T Consensus 288 ~~~~~----------------------------~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~ 339 (483) T protein:vir:12 288 DDQEL----------------------------PEFKRLLRYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLF 339 (483) T ss_pred Ccccc----------------------------hhHHHhhhhccccccCCCCcceEEeecCCHHHHHHHHHHHHHHHHHH Confidence 11000 00011234455666788899999998989999999999988888888 Q ss_pred cCCC---HHHhhchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhH Q lcl|NC_019524. 383 LGMS---YEQFSRDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMR 459 (556) Q Consensus 383 lGi~---ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~ 459 (556) .++| .+.++++.|+ .+++..+..........+..|-.. ++.+++..++ ++ | .+... T Consensus 340 s~~p~~~~~~~~~n~Sg---~Al~~~~~~l~~k~~~~~~~f~~~-l~~~~~li~~--~~-~---~~~~~----------- 398 (483) T protein:vir:12 340 GQAVDFSSDKFGSAPSG---VALEFLYTNLNLKADKLARKAKVA-IQELLWFVFE--HF-D---IKGEH----------- 398 (483) T ss_pred hCCCCCCccccccCcHH---HHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHH--Hh-c---CCCcc----------- Confidence 7766 3444444433 355555555555555555554333 3444443322 22 2 22211 Q ss_pred HHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCC--CHHHHHHHHHHHHHHHHHcCCCCCccccccCCC Q lcl|NC_019524. 460 DALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGG--DFREVFKQRAREEGLIKSLKLDFTGKMVEGNST 537 (556) Q Consensus 460 ~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~--D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~ 537 (556) .-+.+.|...-. .|-...+++..++ .|+.|.+..+...+. |+++.++++.+|++...+.--.. .. T Consensus 399 -~~i~v~f~~~~p--~~~~~~a~~~~kl--~GiiS~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~--------~~ 465 (483) T protein:vir:12 399 -KDVDISFNYNKV--ANTELQVQTAQQS--MGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNL--------DD 465 (483) T ss_pred -ceeeEEeCCCCC--CCHHHHHHHHHHH--hccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhcccc--------cc Confidence 123567744333 4666666655554 699999999999874 89999999999876543321111 11 Q ss_pred CCCCCCCCCCCCCCCcCC Q lcl|NC_019524. 
538 QSSNSSESTSDNPNEETT 555 (556) Q Consensus 538 ~~~~~~~~~~~~~~~e~~ 555 (556) .......++++.+++|++ T Consensus 466 ~~~d~~~~~~~~~~~e~e 483 (483) T protein:vir:12 466 GGADGAQQQERSNNKESE 483 (483) T ss_pred cccCCcccCCCCCcccCC Confidence 111112222222222222 No 141 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=99.11 E-value=1.9e-09 Score=68.42 Aligned_cols=434 Identities=9% Similarity=0.036 Sum_probs=201.1 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAA 80 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~ 80 (556) +.........--.+....... ...+. .....|.-+.. . ... .+.....+. .....++.. -.-+++++ T Consensus 38 ~~~~~~~~~~~i~~~i~~~~~-~~~r~-~~l~~YY~g~~-~-i~~--~~~~~~~~~------~~~~~~~~~-ri~~n~~k 104 (492) T protein:vir:97 38 TNNKPETLEEMIVRYIKQHLE-KLPEI-SIGQEYYEQRP-D-IVK--EPKPVDATG------AVDPLKPDD-RMITNFHA 104 (492) T ss_pred CCCchhhHHHHHHHHHHHHHH-HHHHH-HHHHHHhcccC-c-ccc--ccccccccc------ccccccccc-ccccchHH Confidence 010001100000011111000 00111 11123332221 1 100 000000000 000001111 01268999 Q ss_pred HHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCce Q lcl|NC_019524. 81 GVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEV 160 (556) Q Consensus 81 ~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~ 160 (556) -+|+..+.+++|.+++.... 
+ +.....|+.|..+ +|...+..+.+..+.-|.+ T Consensus 105 ~Ivd~~~~yl~g~p~~~~~~---------d-------~~~~~~l~~~~~n-----------~~~~~~~~~~~~~~~~G~a 157 (492) T protein:vir:97 105 NLVDQKVSYIVGKPIAFKHT---------D-------DEVVKRIDEVLGN-----------RFDDKLHSVLTGASNKGIE 157 (492) T ss_pred HHHHHHhhhhcccCceeccC---------c-------hHHHHHHHHHHhc-----------cHHHHHHHHHHHHhhcCeE Confidence 99999999999998876531 1 1233345555431 3445556667788889999 Q ss_pred EEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEE---CCCCC-------eEEEEEeecCCCcccc Q lcl|NC_019524. 161 LATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQL---DNNGA-------ALGYWLRKAFPGDPTD 230 (556) Q Consensus 161 f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~---d~~Gr-------~vaY~i~~~hpgd~~~ 230 (556) |..+ +....+ .+++.+++|..+-.-+.....+.+..+|++ +.... .+-||++.. +.... T Consensus 158 ~~~v-~~d~dg--------~~~~~~~~p~~~~~i~d~~~~~~~~~~vr~~~~~~~~~~~~y~~~~v~~~~~~~--~~~~~ 226 (492) T protein:vir:97 158 WLHP-YLDEEG--------EFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKVTVNYYVYEN--GSLIP 226 (492) T ss_pred EEEE-EecCCC--------ceEEEEEcccceEEEEcCCCCCceEEEEEEEeeccceeEEEEecCeEEEEEEec--Ceeee Confidence 9875 432211 257889999888433333333446666654 11111 111222211 10000 Q ss_pred ---CCcccccee--e-ccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCc Q lcl|NC_019524. 231 ---MEQWKWGYE--P-ARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELP 304 (556) Q Consensus 231 ---~~~~~~~rv--~-~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~ 304 (556) .....|... + .+..|| |+++... ..|+|.|.+++..+..++....-......-.+.-..+++. 
.+ T Consensus 227 ~~~~~~~~~~~~~~~~~~g~vP---vv~~~nn-----~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~~~g-~~ 297 (492) T protein:vir:97 227 DYSNNLENSKTHFSTGSWGKIP---FIPFKNN-----DLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLKN-YD 297 (492) T ss_pred cccccccccccccccCCCCCcc---eEEecCC-----CCCCCchHhHHHHHHHHHHHHHHHHHHHHHhccceeeeec-CC Confidence 000000000 0 001111 3333332 3589999987777654443222221111111111122221 11 Q ss_pred ccccccccccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcC Q lcl|NC_019524. 305 SDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLG 384 (556) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglG 384 (556) ... .+.....+....+..+..|.++++++.+.+...+..+.+.+.+.|..-.+ T Consensus 298 ~~~---------------------------~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~L~~~I~~~s~ 350 (492) T protein:vir:97 298 DQE---------------------------LPEFKRLLRYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQ 350 (492) T ss_pred ccc---------------------------chhHHHHHhhccceecCCCCcceeEeccCCHHHHHHHHHHHHHHHHHHhC Confidence 000 00011123455566778889999999888999999999998888877777 Q ss_pred CC---HHHhhchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHH Q lcl|NC_019524. 385 MS---YEQFSRDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDA 461 (556) Q Consensus 385 i~---ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a 461 (556) +| .+.++++.| -.+.|..+..........+..|-.. ++.+++..++ ++ | .+... . 
T Consensus 351 ~p~~~~~~~~~n~S---g~Al~~~~~~l~~ka~~~~~~f~~~-l~~~~~li~~--~~-~---~~~~~------------~ 408 (492) T protein:vir:97 351 AVDFSSDKFGSAPS---GVALEFLYTNLNLKADKLARKAKVA-IQELLWFVFE--HF-D---IKGEH------------K 408 (492) T ss_pred CCCCCccccccCcH---HHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHH--Hh-c---CCccc------------c Confidence 66 344544443 3456665555555555555555443 3444444332 22 2 22211 1 Q ss_pred hhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCC--CHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCC Q lcl|NC_019524. 462 LCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGG--DFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQS 539 (556) Q Consensus 462 ~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~--D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~ 539 (556) -+.+.|...-. .|-...+++..+. .|+.|.+..+...+. |+++.++++.+|.+...+. +.. ... . T Consensus 409 ~i~v~f~~~~p--~~~~e~a~~~~kl--~G~iS~et~l~~l~~v~d~~~Eleri~~E~~~~~~~-~~~-------~~~-~ 475 (492) T protein:vir:97 409 DVDISFNYNKV--ANTELQVQTAQQS--MGIVSHETVLENHPFVEDLQAELERIEQEQTEYNKQ-LPN-------LDD-G 475 (492) T ss_pred eeeEEecCCCC--CCHHHHHHHHHHH--hccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHh-hhc-------ccc-C Confidence 23567744333 3555555555544 699999999999974 8999999999987654332 110 011 1 Q ss_pred CCCCCCCCCCCCCcCCC Q lcl|NC_019524. 540 SNSSESTSDNPNEETTQ 556 (556) Q Consensus 540 ~~~~~~~~~~~~~e~~~ 556 (556) ......+.++++++++| T Consensus 476 ~~~~~~~~~~~~~~~~e 492 (492) T protein:vir:97 476 GADSAQQQERSNNKESE 492 (492) T ss_pred CCCCCcccccccccccC Confidence 11122222333333333 No 142 >protein:vir:1661 Length: 378 # NCBI annotation: unknown # Family: family:all:2379 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044950;genbank:gi:9629657;genbank:GeneID:1261302 Probab=99.10 E-value=1.5e-11 Score=79.89 Aligned_cols=378 Identities=9% Similarity=-0.035 Sum_probs=184.1 Q ss_pred chhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHhhhccCCceeeeeccccccCCChh Q lcl|NC_019524. 
32 GGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDG 111 (556) Q Consensus 32 ~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~ 111 (556) .++.+...+-. ..+.+...+.-.. . ....+..|.+++..+|+.|++.|-..-|++--+.+.. +..+. T Consensus 1 Mg~f~~~~~~~--~~~~~~~~~~~~~----~-----~~~~~~~~~~~v~~~i~~Ia~~iA~l~~~~~~~~~~~--~~~~~ 67 (378) T protein:vir:16 1 MNLFGKVVSFS--RGKLNNDTQRVTA----W-----QNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSD--VGSDT 67 (378) T ss_pred Cccchhhhhhh--cccccCCcceeee----c-----ccchhhHHHHHHHHHHHHHHhhhhhCceeEEEEcccc--ccccc Confidence 11111111000 0011111100000 0 0012345788899999999999887666543221110 00000 Q ss_pred HHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhc Q lcl|NC_019524. 112 WGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRM 191 (556) Q Consensus 112 ~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl 191 (556) ...... ..++..+-.+|+ -.+|.+++....+..++..|++|+.+.+.... T Consensus 68 ~~~~~~---~~l~~lL~~~PN------~~~t~~~f~~~~~~~lll~Gna~i~~~~d~~~--------------------- 117 (378) T protein:vir:16 68 LISMAG---SDLDEVLNWSPK------GERNSMDFWRKVIKKLLRAPYVDLYAVFDDNT--------------------- 117 (378) T ss_pred cccccc---chHHHHHhhcCC------CCCCHHHHHHHHHHHHhhcCceEEEEEeecCC--------------------- Confidence 000011 122233333554 25789999999999999999999986543211 Q ss_pred CCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceeeccccCChhHeEeeecccCCCcccCCchhhHH Q lcl|NC_019524. 192 SNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPARFDWGRRRVIHIIEALLAGQTRGISEMVSA 271 (556) Q Consensus 192 ~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~ 271 (556) |+++.++ |.+ ....++.++|||+..+. 
.-.+|++.+..+ T Consensus 118 ---------------------g~~~~l~-----~~~-------------~~~~~~~~diih~r~~~--~~~~~~s~l~~~ 156 (378) T protein:vir:16 118 ---------------------GELLDLL-----FAD-------------DKKEYKPEELVRLTSPF--YINEDTSILDNA 156 (378) T ss_pred ---------------------ceEEEEE-----ecC-------------CeeEecccceEEecCcc--CccchhHHHHHH Confidence 1111111 000 00124567999997542 334566666544 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccccccccccccccccccceecCCceeeecC Q lcl|NC_019524. 272 LKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLY 351 (556) Q Consensus 272 l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~ 351 (556) +..+ ..+.-++.+.++++.+..-... . .....+.+..... + ....-..|.+..|. T Consensus 157 ~~~i-----------~~~~~~~~~~g~l~~~~~l~~~--~----~~~~~~~~~~~~~-------~-~~~~~~~g~~~vl~ 211 (378) T protein:vir:16 157 LASI-----------QTKLEQGKLRGLLKINAFLDID--N----TQEYREKALTTIK-------N-MQEGSSYNGLTPVD 211 (378) T ss_pred HHHH-----------HHHHhcCccceeeEeCCcCCHH--H----HHHHHHHHHHHHH-------H-hhcccccccceEcC Confidence 4322 2222345566777754321100 0 0000011111000 0 00011356788999 Q ss_pred CCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 352 PGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTL 431 (556) Q Consensus 352 pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~ 431 (556) .|.+++.++.+....++. ..+.+..+||..+|||...|.++ |+. +. ...++..-+.|+... 
T Consensus 212 ~g~~~~~l~~~~~~~~~~-~~~~~~~~Ia~~fgVPp~~l~g~-----~~e------~~-------~~~f~~~tl~P~~~~ 272 (378) T protein:vir:16 212 NKTEIVELKKDYSVLNKD-EIDLIKSELLTGYFMNENILLGT-----ASQ------EQ-------QIYFYNSTIIPLLIQ 272 (378) T ss_pred CCceEEEccCChhhhhHH-HHHHHHHHHHHHhCCCHHHhcCC-----chH------HH-------HHHHHHHHHHHHHHH Confidence 999999988776666764 46788899999999999888542 321 11 123455556776655 Q ss_pred HHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHH Q lcl|NC_019524. 432 WLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGGDFREVFK 511 (556) Q Consensus 432 ~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~ 511 (556) ++.++-...++-...... ........+.+-.......|+...+++....+.+|+.|+-|+-+..|..|-+-- T Consensus 273 -ie~~l~~kLl~~~e~~~~------~~~~~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~gg- 344 (378) T protein:vir:16 273 -LEKELTYKLISTNRRRVV------KGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGG- 344 (378) T ss_pred -HHHHHHhhcCChhhhhhh------hhcccccceeeccchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCC- Confidence 344454444431100000 000001234455556667799999999999999999999999888898885421 Q ss_pred HHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 512 QRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 512 q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 556 (556) +++=++....+.... 
............++|+++| T Consensus 345 ---------D~~~~~~n~~~~~~~--~~~~~~~~~~~~~~e~~ne 378 (378) T protein:vir:16 345 ---------DVYIANLNAVAVKNL--SDLQGSRKDVTSTDETNNQ 378 (378) T ss_pred ---------CeEeeccccccccch--hhhcCccCCCCCCCCCCCC Confidence 111111111111111 0111111112223333333 No 143 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=99.10 E-value=2.2e-09 Score=68.01 Aligned_cols=428 Identities=9% Similarity=0.015 Sum_probs=192.4 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHH-HHHhcChHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQ-DMVQNDGYA 79 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaR-dl~rNn~~a 79 (556) +-+.....-..-.+....... ..........|.-+... -... +...... ......++. -+ -++++ T Consensus 20 ~~~~~~~~~~~i~~~i~~~~~--~~~~~~~~~~yY~g~~~-i~~~---~~~~~~~------~~~~~~~~~~ki--~~n~~ 85 (468) T protein:vir:96 20 IKPQYETQEEMILRLITKHKE--NVEDITVGERYYNHQPD-VLFN---APKRNVK------GEIDPFKPDWRM--YTNYH 85 (468) T ss_pred ccccccCcHHHHHHHHHHHHH--HHHHHHHHHHHhcCCCc-cccc---ccccccc------cccccccccccc--ccchH Confidence 222222211111111111100 01111112334433211 0000 0000000 000000111 12 25699 Q ss_pred HHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCc Q lcl|NC_019524. 80 AGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGE 159 (556) Q Consensus 80 ~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE 159 (556) +-+++..+.+++|.+++..+. +++..+.+ +.|..+ +|.....-+.+....-|. 
T Consensus 86 ~~Iv~~~~~~l~g~p~~~~~~------------d~~~~~~l----~~~~~n-----------~~~~~~~~~~~~~~~~G~ 138 (468) T protein:vir:96 86 QNLVDQKVAYAVANPVTYGTE------------DEKSLKTI----QEVLNH-----------KWDDKLVDILTAASNKGV 138 (468) T ss_pred HHHHHHHHhhhccCCceeccC------------ChHHHHHH----HHHHhc-----------CHHHHHHHHHHHHhhcCe Confidence 999999999999988876532 12223333 333221 344455555677888999 Q ss_pred eEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEE---CCC--------CCeEEEEEeecCCCcc Q lcl|NC_019524. 160 VLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQL---DNN--------GAALGYWLRKAFPGDP 228 (556) Q Consensus 160 ~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~---d~~--------Gr~vaY~i~~~hpgd~ 228 (556) ++.+.. ....+ .+++..++|+.+-.-+.....+.+..+|.+ +.. ++..-|.......... T Consensus 139 ~~~~v~-~d~~~--------~~~i~~~~p~~~~~v~~~~~~~~~~~~ir~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 209 (468) T protein:vir:96 139 EWIQPY-VDEQG--------EFKTFRVPAEQAIPIWTNKERDELKAFIRLYELDGGERVEYWTANDVTFYELKDGQLIPD 209 (468) T ss_pred EEEEEE-EcCCC--------ceEEEEEcccceEEEEcCCCCCceEEEEEEEEecCceEEEEEeCCeEEEEEEcCCceeec Confidence 997754 32221 257889999887432322223334445532 221 1111111111000000 Q ss_pred c---cCCccccceeecc-ccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCc Q lcl|NC_019524. 229 T---DMEQWKWGYEPAR-FDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELP 304 (556) Q Consensus 229 ~---~~~~~~~~rv~~~-~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~ 304 (556) . ......+..+... ...+.-=|+++.. -..|+|.|.+++..+..++...........-.+.-..+++--.. 
T Consensus 210 ~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n-----~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~ 284 (468) T protein:vir:96 210 YYQGEEHVQAHYYVGNKSMSWNRVPFIPFKN-----NPQEVSDLFMYKTIIDAMDKRLSDTQNTFDEATELIYVLKGYEG 284 (468) T ss_pred ccccccccccceeeccccccCCcccEEEecC-----CCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCc Confidence 0 0000001000000 0111111333333 24699999998888777766554444333333322233331111 Q ss_pred ccccccccccccccccccccccccccccccccccceecCCceeeecC--CCceeeeecCCCCCccHHHHHHHHHHHHHHh Q lcl|NC_019524. 305 SDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLY--PGTKLKMQPAGTPGGVGTDYEQSLLRNIAAS 382 (556) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~--pGe~i~~~~~~~p~~~f~~F~~~~lr~iaag 382 (556) +. .+.....+..+.+..+. .|-++++++.+.+...+..+.+.+.+.|... T Consensus 285 ~~----------------------------~~~~~~~~~~~~~i~~~~d~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~ 336 (468) T protein:vir:96 285 ED----------------------------LEEFMYNLKYYKAINVDGDGSGGVDTIQIDVPVQSAKEYLDMLRDYVIEF 336 (468) T ss_pred cc----------------------------cchhhhhhhcCceEEecCCCCCcceEEeecCChHHHHHHHHHHHHHHHHH Confidence 00 00011112333444433 4557899998889999999999999999988 Q ss_pred cCCCHHHhhchhh-cccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHH Q lcl|NC_019524. 383 LGMSYEQFSRDYT-KTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDA 461 (556) Q Consensus 383 lGi~ye~l~~D~s-~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a 461 (556) .++|--.. ..++ +.|-.+.+..+..........+..| ...++.+++..+. ++...+. + . 
T Consensus 337 s~~p~~~~-~~~~~n~Sg~Alk~~~~~l~~k~~~k~~~~-~~~l~~~~~li~~--~~g~~~d------------~----~ 396 (468) T protein:vir:96 337 GQGVDFQQ-DKFGNSPSGIALKFMYSNLDLKANKLKNKT-LTALQELLQYIID--FYKLSIK------------V----Q 396 (468) T ss_pred hCcccccc-cccccchHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHH--HhCCCcc------------c----c Confidence 88772111 1121 2233344444433333333444333 3333444443322 2221111 0 1 Q ss_pred hhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCC--CHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCC Q lcl|NC_019524. 462 LCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGG--DFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQS 539 (556) Q Consensus 462 ~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~--D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~ 539 (556) -+.+.|... .+ .| +.+.....+++|+.|.+..+...+. |+++-++++++|++...+.--.+.. . T Consensus 397 ~i~i~f~~~-~p-~d---~~e~a~~~~~~g~iS~et~i~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~---------~ 462 (468) T protein:vir:96 397 DVEITFNFN-VM-VN---ELEQSQIGVNSQYLSKETVVTNHPWVDDPVAEMERIDQEELALPSIEEGLNG---------K 462 (468) T ss_pred eeeEEecCC-CC-cC---HHHHHHHHHhcCCCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHhhccCC---------C Confidence 134566322 22 34 3333445678899999999988854 8999999999998877654322211 0 Q ss_pred CCCCCC Q lcl|NC_019524. 540 SNSSES 545 (556) Q Consensus 540 ~~~~~~ 545 (556) .+.++. T Consensus 463 ~~~~~~ 468 (468) T protein:vir:96 463 ENNEPT 468 (468) T ss_pred CCCCCC Confidence 000000 No 144 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=99.09 E-value=2.4e-09 Score=67.80 Aligned_cols=431 Identities=13% Similarity=0.049 Sum_probs=209.6 Q ss_pred CC--cchhhhHHHH-HhhHhhccc-----chhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 1 MK--DVKKTTRTRA-KKAVDVVAE-----TATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDM 72 (556) Q Consensus 1 ~s--p~~~~~r~~a-~~a~~~~~~-----~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl 72 (556) |. |++-..=..- ......... 
............|.-+.. .-+. ....... ... +.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~~~~Yy~g~~--~i~~---~~~~~~~---~~~--------~ki 64 (452) T protein:vir:36 1 MKYKPPKLMTFSKDEPITVEVVTKFMEKHKLEVARYEYLKNMYLGIM--AIDD---EPAKDSW---KPD--------NRL 64 (452) T ss_pred CcccCceeEEcCCccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcccc--cccc---Ccccccc---Ccc--------cee Confidence 22 2221100000 000111111 000101111123332211 0100 0000000 000 012 Q ss_pred HhcChHHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhh Q lcl|NC_019524. 73 VQNDGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVS 152 (556) Q Consensus 73 ~rNn~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r 152 (556) ..++++-+|+..+.+++|.+++..+. ++ ...+ .|+.|... .+|...+..+.+ T Consensus 65 --~~n~~~~ivd~~~~~l~g~~~~~~~~---------d~---~~~~----~l~~~~~~----------n~~~~~~~~~~~ 116 (452) T protein:vir:36 65 --AVNFTKYIVDTFTGYFNGIPVKKSHS---------DK---EILT----KLQEFDNL----------NDMEDEESELAK 116 (452) T ss_pred --ecchHHHHHHHHhhhhcccCceeecC---------Ch---hHHH----HHHHHHhh----------cChhHHHHHHHH Confidence 25799999999999999999877542 11 1223 34444332 257788888899 Q ss_pred hheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEE--CCCCCeEEEEEeecCCCcccc Q lcl|NC_019524. 153 GFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQL--DNNGAALGYWLRKAFPGDPTD 230 (556) Q Consensus 153 ~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~--d~~Gr~vaY~i~~~hpgd~~~ 230 (556) ..+.-|.+|..+.. ...+ .+++..++|+.+-.-++......+..+|.+ +..+ ..-++++...---.+. 
T Consensus 117 ~~~~~G~~~~~v~~-d~~g--------~~~i~~~~p~~~~~v~d~~~~~~~~~~i~~~~~~~~-~~~~~vyt~~~i~~~~ 186 (452) T protein:vir:36 117 MACIYGRAFEFLYQ-DEDT--------QTNVVYNSPENMFMVYDDTVKQEPLFAVRYGVDEDK-KLQGEVYTLLETIKIS 186 (452) T ss_pred HHHhcCeEEEEEEe-cCCC--------eeEEEEEcccceEEEEcCCCCCceEEEEEEEEecCc-eEEEEEEecCeEEEEE Confidence 99999999977543 2211 258899999988533333333446666654 2222 2222333321100001 Q ss_pred CCccccceee----ccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCccc Q lcl|NC_019524. 231 MEQWKWGYEP----ARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSD 306 (556) Q Consensus 231 ~~~~~~~rv~----~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~ 306 (556) .....|..+. .+..+| |+++... ..|+|.|.+++..+..++....-......-.+.--.+++-...++ T Consensus 187 ~~~~~~~~~~~~~~~~g~iP---vv~~~n~-----~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~p~~~~~g~~~~~ 258 (452) T protein:vir:36 187 GENDEISFGEGTYNPYPDLP---VVEFYFN-----EERMSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFLGAAVEE 258 (452) T ss_pred EcCCceEEecceeccCCccc---EEEecCC-----CCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCcCc Confidence 1111121111 112233 5554432 359999999888877776655544444433333333333111000 Q ss_pred ccccccccccccccccccccccccccccccccceecCCceeeecC-----CCceeeeecCCCCCccHHHHHHHHHHHHHH Q lcl|NC_019524. 307 VVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLY-----PGTKLKMQPAGTPGGVGTDYEQSLLRNIAA 381 (556) Q Consensus 307 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~-----pGe~i~~~~~~~p~~~f~~F~~~~lr~iaa 381 (556) +. ...+.++.+..+. .|.++++++.+.+...+..+.+.+.+.|.. 
T Consensus 259 ----------~~--------------------~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~ 308 (452) T protein:vir:36 259 ----------ED--------------------LKNIRSNRVINYYADGEGKNVDVKFLEKPDSDSQTENLLDRLTKLIFQ 308 (452) T ss_pred ----------hh--------------------hhhhhhcceEEecCCCCccCCcceeEeecCCHHHHHHHHHHHHHHHHH Confidence 00 0011122222222 234688888888889999999999999999 Q ss_pred hcCCCHHHhhchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHH Q lcl|NC_019524. 382 SLGMSYEQFSRDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDA 461 (556) Q Consensus 382 glGi~ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a 461 (556) ..++|.... ..++++|-.+++..+..........+..|-. .++.+++..+...-..|.-. .+ . T Consensus 309 ~s~~p~~~~-~~~gn~Sg~Al~~~~~~l~~k~~~~~~~~~~-~l~~~~~li~~~~~~~~~~~--------~~-------~ 371 (452) T protein:vir:36 309 TTMVANISD-ESFGSSSGVSLAYKLQAMSNLALSFQRKFQS-SLNSRYKLFCELSTNVSNKD--------SW-------K 371 (452) T ss_pred HhCccccCc-ccccCCcHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhccCCcc--------cc-------c Confidence 999885433 3456666667776666665556555555433 34445555444332222110 00 0 Q ss_pred hhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCC--CHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCC Q lcl|NC_019524. 462 LCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGG--DFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQS 539 (556) Q Consensus 462 ~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~--D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~ 539 (556) -+.+.|..+- ..|....+++.. ...|+.|.+..+...|. |+++.++++.+|++...+.... .... T Consensus 372 ~i~i~f~~~~--p~d~~~~a~~~~--k~~g~iS~et~~~~~~~~~d~~~E~~ri~~E~~~~~~~~~~---------~~~~ 438 (452) T protein:vir:36 372 DIEYTFTRNE--PKDIKEQAETAN--ILMGITSQETALSVISVIPDVQAEMEKIKKEEASTAIFDKD---------KQPS 438 (452) T ss_pred cceEEeCCCC--CcCHHHHHHHHH--HHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhh---------ccCC Confidence 1357775432 245444444433 34799999999999875 8999999999988654332110 0001 Q ss_pred CCCCCCCCCCCCCc Q lcl|NC_019524. 
540 SNSSESTSDNPNEE 553 (556) Q Consensus 540 ~~~~~~~~~~~~~e 553 (556) .....++.++.++| T Consensus 439 ~~~~~~~~~~~~~e 452 (452) T protein:vir:36 439 EKGTDTVVSETNEE 452 (452) T ss_pred CCcccccCccccCC Confidence 11111111111111 No 145 >protein:vir:98643 Length: 395 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039921;genbank:gi:126011096;genbank:GeneID:4818479 Probab=99.08 E-value=6.9e-10 Score=70.81 Aligned_cols=385 Identities=10% Similarity=0.031 Sum_probs=174.5 Q ss_pred HhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHhhhccCCc Q lcl|NC_019524. 16 VDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVVAVHRDSIVGSQY 95 (556) Q Consensus 16 ~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v~~~~~nvVG~Gi 95 (556) +... ....-.++.. ..+ . .+...+-..+...+..++.+..+|+.+.+.|-..-| T Consensus 1 MGlf-------------~~~~~~~~~~--~~~---~--------~~~~~~~~~~~~~~~~~~~v~~~I~~ia~~iA~lp~ 54 (395) T protein:vir:98 1 MGIL-------------DFFSFKKSGT--LSD---D--------DSGSTTSEKLTNVVLKEDALYKCVNYLARIISKSTF 54 (395) T ss_pred Ccch-------------hhhcCCCccc--ccc---c--------ccchhhhhhcchhhhhhHHHHHHHHHHHHHHhhCce Confidence 1000 0000000000 000 0 011122234444555678899999999999988777 Q ss_pred eeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEEEeeccCCCCcCC Q lcl|NC_019524. 96 KLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLATCEWLNPTGTTMQ 175 (556) Q Consensus 96 ~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~~~~~ 175 (556) ++.-. +.. .... ...+..+..+|+ ..+|.+++..+.+..++..|++|+.++... 
T Consensus 55 ~~~~~-~~~-----~~~~-------~~~~~lL~~~PN------~~~t~~~f~~~~~~~lll~Gnayi~~~~~~------- 108 (395) T protein:vir:98 55 RLKTP-EKL-----TENQ-------KDWLYWINTKAN------PNQSASQFWVEVIQKLLVDGETLIFVIPGK------- 108 (395) T ss_pred eEEec-CCc-----cccc-------chHHHHHhhcCC------CCCCHHHHHHHHHHHHhhcCceEEEEEeCC------- Confidence 66432 110 0001 122333334555 356889999999999999999998875421 Q ss_pred CcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceeeccccCChhHeEeeec Q lcl|NC_019524. 176 RRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPARFDWGRRRVIHIIE 255 (556) Q Consensus 176 ~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~~~~v~a~~viH~f~ 255 (556) . +. + ++-....... ...+...|.++. |.+ ....+.++|||+.. T Consensus 109 -~-----~~-~-~~~~~~~~~~--~~~~~~~~~~~~------~~~---------------------~~~~~~~evih~k~ 151 (395) T protein:vir:98 109 -G-----IY-V-ADSFTQDKKI--SGSQFKVSRVQG------QTY---------------------EKTFTFDQVIYLKN 151 (395) T ss_pred -c-----ee-c-CCcccccccc--cCcccceeeecC------cee---------------------eeEecCccEEEecC Confidence 0 11 1 1111100000 001111222211 111 11245568999876 Q ss_pred ccCCCcccCCchhhH----HHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccccccccccc Q lcl|NC_019524. 256 ALLAGQTRGISEMVS----ALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLA 331 (556) Q Consensus 256 ~~r~gQ~RGvs~la~----~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 331 (556) .. ++...+.+.+.. ++...-....+..+..-. .-.+...+.++...... .........+..... T Consensus 152 ~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~- 219 (395) T protein:vir:98 152 DN-SDLMSKVESLWEEYGELLGHVINNQKIANQIRFT-MIPPKDKVRERAQENSD---------GGRQSKSDKDFFKRT- 219 (395) T ss_pred CC-CCccccccchhhhHHHHHHHHHHHHHHHHHHHHh-hccccccccccccccCC---------cHHHHHHHHHHHHHH- Confidence 54 333333332222 222221221111111000 01111111111110000 000000000000000 Q ss_pred cccccccceecCCceeeecCCCceeeeecCCCCCc------cHHHHHHHHHHHHHHhcCCCHHHhhchhhcccchhHHHH Q lcl|NC_019524. 
332 NYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGG------VGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSARAS 405 (556) Q Consensus 332 ~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~------~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R~~ 405 (556) . + ...-..+.|..|+.|.+++.++...... .|....+....+||..+|||-+.|. .+||++.+. T Consensus 220 --~-~--~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~~q~~e~~~~~~~~Ia~~fgVP~~~l~-----~~~sn~e~~ 289 (395) T protein:vir:98 220 --V-E--KIRTESVVGIPVTANTNYEEYGSKNTGAVKSYVDDIKKLKDQYMAEFAEMLGIPISLLH-----GDIADNQKN 289 (395) T ss_pred --H-h--hhhcCCcceeecCCCceeEecccccccccChhHHHHHHHHHHHHHHHHHHhCCCHHHhc-----CCcccHHHH Confidence 0 0 0011355677899999999887543322 5666777888999999999999884 466665443 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHHHH Q lcl|NC_019524. 406 MAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAI 485 (556) Q Consensus 406 ~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~ 485 (556) .+ .++...+.|+...+ +.++-...++--.... ....++. .....|+...+++.. T Consensus 290 ~~-----------~f~~~tl~P~~~~i-e~~l~~kll~~~~~~~------------g~~f~~~--~l~~~d~~~~~~~~~ 343 (395) T protein:vir:98 290 YE-----------LLLEGPIESLITNI-VDGLEYAIFDKSETLQ------------GSFIKVT--GLKNYDLFSISNQAD 343 (395) T ss_pred HH-----------HHHHHHHHHHHHHH-HHHHHHhcCChhhhcC------------cceeeeh--hhhccCHHHHHHHHH Confidence 33 34566677766654 4445444443110000 0111221 234458888999999 Q ss_pred HHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCcCC Q lcl|NC_019524. 486 LRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEETT 555 (556) Q Consensus 486 ~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~ 555 (556) +.+++|+.|+-|+-+..|..|-+-- ..+++=++....+ . 
++..++..++..| T Consensus 344 ~~~~~G~~T~NE~R~~~g~~Pi~~~--------~gD~~~~~~n~~~--------~--~~~gge~~~~~~~ 395 (395) T protein:vir:98 344 KLISSGFVFIDEVREEIGLPELPDG--------LGKVLYMTKNYES--------V--LERGGEVDEEVET 395 (395) T ss_pred HHHhCCCcCHHHHHHHhCCCCCCCC--------CCceeeeccccee--------c--ccccCCCCCCCCC Confidence 9999999999999888888774210 0011111111111 1 1111122222222 No 146 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=99.08 E-value=2.8e-09 Score=67.51 Aligned_cols=459 Identities=8% Similarity=-0.010 Sum_probs=210.9 Q ss_pred CCcchhhhHHHHHhhHhhccc---chhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcCh Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAE---TATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDG 77 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~---~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~ 77 (556) |++.....-.-.......... ....+. .....|.-+.. .....+.. ..+. ...+. | --.+ T Consensus 31 ~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~-~~l~~YY~g~~--~i~~~~~~---~~~~-~~~~~-----k-----i~~n 93 (512) T protein:vir:97 31 YDGTESDLLQNINEVSKYIEHHMDYQRPRL-KVLSDYYEGKT--KNLVELTR---RKEE-YMADN-----R-----VAHD 93 (512) T ss_pred cCchhhhhhhhHHHHHHHHHHHHHhhHHHH-HHHHHHhcccC--ccccccCc---cccc-ccCcc-----e-----eecc Confidence 222221111111111111100 000111 11223432221 11111111 0010 00110 1 1258 Q ss_pred HHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheec Q lcl|NC_019524. 78 YAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMT 157 (556) Q Consensus 78 ~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~d 157 (556) +++-+|+..+.+++|.+++..+. + +.....|+.|... 
.+|...+..+.+..+.- T Consensus 94 ~~k~Ivd~~~~yl~g~p~~~~~~------------d----~~~~~~l~~~~~~----------n~~~~~~~~~~~~~~i~ 147 (512) T protein:vir:97 94 YASYISDFINGYFLGNPIQCQDD------------D----KDVLEAIEAFNDL----------NDVESHNRSLGLDLSIY 147 (512) T ss_pred hHHHHHHHHhhhhcccCceeccC------------C----hHHHHHHHHHHhh----------cCHHHHHHHHHHHHHhc Confidence 99999999999999998887542 1 1223345555432 25777788888889999 Q ss_pred CceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECC----C----CCeEEEEEeecCC---- Q lcl|NC_019524. 158 GEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDN----N----GAALGYWLRKAFP---- 225 (556) Q Consensus 158 GE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~----~----Gr~vaY~i~~~hp---- 225 (556) |.+|..+.. ...+ .+++..++|..+-.-++......+..+|++-. . +....+.|+..+- T Consensus 148 G~ay~~vy~-ded~--------~~~i~~~~p~~~~~iyd~~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~ 218 (512) T protein:vir:97 148 GKAYELMIR-NQDD--------ETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRY 218 (512) T ss_pred CeEEEEEEe-CCCC--------ceEEEEEcccceEEEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEE Confidence 999987543 2211 25789999988754344333345677776421 1 1111112222210 Q ss_pred --CccccCCccccceeeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccC Q lcl|NC_019524. 226 --GDPTDMEQWKWGYEPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESEL 303 (556) Q Consensus 226 --gd~~~~~~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~ 303 (556) .+........+.--+..-.++.-=|+++... ..|.|.|.+++..+..++....-......-.+.-..+++... 
T Consensus 219 ~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn-----~~~~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~ 293 (512) T protein:vir:97 219 LTSRTNGLKLTPRENGFESHSFERMPITEFSNN-----ERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNL 293 (512) T ss_pred EecCCCcccccccccccccccCcccceEeecCC-----CCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCc Confidence 0000000000000000001111124444332 368899999888877666544333333222222223333211 Q ss_pred cccccccccccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhc Q lcl|NC_019524. 304 PSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASL 383 (556) Q Consensus 304 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaagl 383 (556) +... ... ........ ..........++.......|-++++++.+.+...+..+.+.+.+.|..-. T Consensus 294 ~~~~-~~~--------~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s 358 (512) T protein:vir:97 294 NLDP-VEV--------RKQKEANV------LFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFT 358 (512) T ss_pred cCCc-hhh--------hhhhhccc------ccccccchhhcccccCCCCCcceEEEeecCCHHHHHHHHHHHHHHHHHHh Confidence 1100 000 00000000 00000011122333345678889999999999999999999999998888 Q ss_pred CCCH---HHhhchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHH Q lcl|NC_019524. 384 GMSY---EQFSRDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRD 460 (556) Q Consensus 384 Gi~y---e~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~ 460 (556) ++|- +.++++. 
|--+++..+..........+..|-..+ +.+++..+...-..+.+..+..+ T Consensus 359 ~~p~~~~~~~~gn~---Sg~Al~~~~~~l~~ka~~k~~~f~~~l-~~~~~li~~~~~~~~~~~~~~d~------------ 422 (512) T protein:vir:97 359 NTPNMKDDNFSGTQ---SGEAMKYKLFGLEQRTKTKEGLFTKGL-RRRAKLLETILKNTRSIDANKDF------------ 422 (512) T ss_pred CCcccCcccccccc---hHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhcCCccccccc------------ Confidence 7763 3344443 333566665555555555555554443 33444443332223333221110 Q ss_pred HhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCC--CHHHHHHHHHHHHHHHHHcCCC-CCccccccCCC Q lcl|NC_019524. 461 ALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGG--DFREVFKQRAREEGLIKSLKLD-FTGKMVEGNST 537 (556) Q Consensus 461 a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~--D~e~v~~q~a~E~~~~~~~Gl~-~~~~~~~~~~~ 537 (556) .-+.+.|..+-. .|-...+++.... .|+.|.+..+...+. |+++.++++++|++...+.-.. ...++ .+ T Consensus 423 ~~i~~~f~~~~p--~~~~e~~~~~~kl--~giiS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~~~~~~----~~ 494 (512) T protein:vir:97 423 NTVRYVYNRNLP--KSLIEELKAYIDS--GGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDP----RD 494 (512) T ss_pred ccceEEeCCCCC--cCHHHHHHHHHHH--hccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhcccCCC----CC Confidence 013567754333 4655556555544 599999999988874 8999999999997754332111 10111 11 Q ss_pred CCCCCCCCCCCCCCCcCC Q lcl|NC_019524. 538 QSSNSSESTSDNPNEETT 555 (556) Q Consensus 538 ~~~~~~~~~~~~~~~e~~ 555 (556) .....+++++++..++.+ T Consensus 495 ~~~~~~~~~~~~~~~~~~ 512 (512) T protein:vir:97 495 INDDEQDDDTKDTVDKKE 512 (512) T ss_pred CCCCCCCCCccccccccC Confidence 111111111111111111 No 147 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=99.08 E-value=2.8e-09 Score=67.44 Aligned_cols=440 Identities=7% Similarity=-0.060 Sum_probs=199.4 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHH Q lcl|NC_019524. 
1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAA 80 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~ 80 (556) +++.. +.+. ...... ...+. .....|.-+.. ...........-++ .-..+++++ T Consensus 16 ~~~~~-i~~~-----i~~~~~-~~~~~-~~l~~Yy~g~~--~i~~~~~~~~~~~~----------------~ki~~n~~~ 69 (499) T protein:vir:10 16 PNIEA-INYA-----IRELQN-RKKRL-DKLSDYYNGKQ--EIEKHEFDNATVEA----------------ANVMVNHAK 69 (499) T ss_pred CCHHH-HHHH-----HHHHHH-HHHHH-HHHHHHhcccc--chhcCCcCcCCCCc----------------ceeecchHH Confidence 22111 1111 111100 01111 11233432221 11111000000000 011246899 Q ss_pred HHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCce Q lcl|NC_019524. 81 GVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEV 160 (556) Q Consensus 81 ~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~ 160 (556) -+|+..+.+++|..++..+.. ....+ . +++|... .+|...+..+.+..+..|.+ T Consensus 70 ~Iv~~~~~~l~g~p~~~~~~~------------~~~~~---~-l~~~~~~----------n~~~~~~~~~~~~~~~~G~~ 123 (499) T protein:vir:10 70 YITDMNVGFMTGNPVKYVAEK------------GKNID---D-ILEVFNQ----------IDIHKHDIELEKDLSVFGYG 123 (499) T ss_pred HHHHHHhhhhcccCceeecCC------------hhHHH---H-HHHHHhh----------cCHhHHHHHHHHHHHhcCce Confidence 999999999999887765421 11122 2 3333322 14666677788889999999 Q ss_pred EEEEeeccCCCCcCC---------CcccceEEEEEchhhcCCCCCCCCCceEEEEEEE----CCC-CCeE-EEEEeecCC Q lcl|NC_019524. 161 LATCEWLNPTGTTMQ---------RRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQL----DNN-GAAL-GYWLRKAFP 225 (556) Q Consensus 161 f~~~~~~~~~~~~~~---------~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~----d~~-Gr~v-aY~i~~~hp 225 (556) |.++.... .+.... .....+++..|+|..+-.-+....+..+..+|.+ |.. +..+ .+.|+... 
T Consensus 124 ~~~v~~~~-~g~~~~~~~~~~~~~~~~~~~~~~~v~p~~~~~v~~d~~~~~~~~~i~~~~~~~~~~~~~~~~~~iyt~~- 201 (499) T protein:vir:10 124 YELLYLKK-TDPISVRDELGNEKLTPNTELKIEVIDPRATVVVCDDTVEHDPLFAVFTQEKKDLEGNTNGYSITVYMPQ- 201 (499) T ss_pred EEEEEecc-cccccccccccccccccccceEEEEEcccceEEEecCCCCcceEEEEEEEEEeecCCCceEEEEEEEeCC- Confidence 98764432 221000 0011367889999877433333334445555543 222 2222 22232211 Q ss_pred CccccC--C-----cccc---ceee-ccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_019524. 226 GDPTDM--E-----QWKW---GYEP-ARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNAT 294 (556) Q Consensus 226 gd~~~~--~-----~~~~---~rv~-~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~ 294 (556) ..+.+ . ...+ ...| .+..|| |+++... ..|+|.|.+++..+..++....-......-.+. T Consensus 202 -~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vP---vv~~~n~-----~~~~~d~e~v~~liD~~~~~~S~~~~~~~~~~~ 272 (499) T protein:vir:10 202 -RIVEYRTKTTMEVSANDPIVYDGENLFGAVP---IIEFRNN-----EERQGDFEQLISLIDAYNLLQTDRISDKEAFVD 272 (499) T ss_pred -eEEEEEecCCccccCcceecccccCCCCccc---eEEecCC-----CCCCCchHhHHHHHHHHHHHHHHHHHHHHHhcC Confidence 10000 0 0000 0000 012222 4544432 358899998777765555433222222222222 Q ss_pred eeeeEeccCcccccccccccccccccccccccccccccccccccceecCCceeee--cCCCceeeeecCCCCCccHHHHH Q lcl|NC_019524. 295 YAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPH--LYPGTKLKMQPAGTPGGVGTDYE 372 (556) Q Consensus 295 ~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~--L~pGe~i~~~~~~~p~~~f~~F~ 372 (556) --.+++-...+ +. ......+..|.+.. ...|.++++++.+.+...+..+. 
T Consensus 273 ~~lv~~G~~~~------------~~----------------~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~ 324 (499) T protein:vir:10 273 ALLVTFGFGLG------------DD----------------KDDIQRLKRGAIEAPPREEGADIEWLTKSFDETQVNLLS 324 (499) T ss_pred ceeeeecCccc------------cc----------------cchhhhhhhcceeccCCCCCCcceEEeccCCHHHHHHHH Confidence 12223211000 00 00011233344443 45777899999888889999999 Q ss_pred HHHHHHHHHhcCCC---HHHhhchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcc Q lcl|NC_019524. 373 QSLLRNIAASLGMS---YEQFSRDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKN 449 (556) Q Consensus 373 ~~~lr~iaaglGi~---ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~ 449 (556) +.+.+.|..-.++| .+.++++.|++ +++..+..........+..|-. .++.+++..++..-..|. .. T Consensus 325 ~~l~~~I~~~s~~p~~~~~~~~gn~Sg~---Al~~~~~~l~~k~~~k~~~~~~-~l~~~~~li~~~~~~~~~------~~ 394 (499) T protein:vir:10 325 QSIENDIHKISYVPNMNDEKFMGNVSGE---AMKFKLFGLENLLSIKQRYFFD-GLRRRLKLIQTIVNIKGA------ND 394 (499) T ss_pred HHHHHHHHHHhCcccCCchhhcccchHH---HHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhccCC------cc Confidence 99999887777665 45555555443 4444444333333333333332 334444444432211221 00 Q ss_pred cccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCC--CHHHHHHHHHHHHHHHHHcCCC- Q lcl|NC_019524. 450 WRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGG--DFREVFKQRAREEGLIKSLKLD- 526 (556) Q Consensus 450 ~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~--D~e~v~~q~a~E~~~~~~~Gl~- 526 (556) + ..-+.+.|..+-. .|....+++..++ +|+.|.+.++...+. |+++.++++.+|++...+.-.. T Consensus 395 -----d----~~~i~i~f~~~~p--~n~~e~~~~~~kl--~g~iS~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~ 461 (499) T protein:vir:10 395 -----D----ASGCKISLVANIP--SNLSDVVNNVKNA--DGIIPRKYTYSWLPDVDNPQDVIDEMNQQDAETIKKNQEA 461 (499) T ss_pred -----c----cccceEEeCCCCC--CCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhh Confidence 0 0124567744443 4666666666655 799999999999865 7999999998887653222111 Q ss_pred C-CccccccCCCCCCCCCCCCCCCCCC--cCCC Q lcl|NC_019524. 
527 F-TGKMVEGNSTQSSNSSESTSDNPNE--ETTQ 556 (556) Q Consensus 527 ~-~~~~~~~~~~~~~~~~~~~~~~~~~--e~~~ 556 (556) + ...+... ......+++++++++. ++++ T Consensus 462 ~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~ 492 (499) T protein:vir:10 462 LRGQDPDRL--ELEDKQDDSSENDKEAGSNHNQ 492 (499) T ss_pred hccCCCCCC--CCCCCCcccCCCCCCCcccccc Confidence 0 0111110 0111111111111111 2222 No 148 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=99.07 E-value=3e-09 Score=67.28 Aligned_cols=432 Identities=9% Similarity=0.018 Sum_probs=200.7 Q ss_pred CCcchh-hhHHHHHhhHhhcccchhhhhhhhcchhccccCC--CcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcCh Q lcl|NC_019524. 1 MKDVKK-TTRTRAKKAVDVVAETATATPMAVGGGMEGAERT--TREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDG 77 (556) Q Consensus 1 ~sp~~~-~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~--~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~ 77 (556) +-+-.. ....--.+....... ...+.. ....|+-+... .+....|... .. . ..++. .---++ T Consensus 37 ~~~~~~~~~~~~i~~~i~~~~~-~~~r~~-~l~~YY~g~~~I~~~~~~~~~~~----~~------~--~~~~~-~ri~~n 101 (492) T protein:vir:94 37 RTNNKPETLEEMIVRYIKQHLE-KLPEIS-IGQEYYEQRPDIVKEPKPVDATG----AV------D--PLKPD-DRMITN 101 (492) T ss_pred ccCCchhhHHHHHHHHHHHHHH-HHHHHH-HHHHHhccccccccccccccccc----cc------c--ccccc-cccccc Confidence 111000 000000011111000 011111 12234322211 0000111000 00 0 00110 001368 Q ss_pred HHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheec Q lcl|NC_019524. 78 YAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMT 157 (556) Q Consensus 78 ~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~d 157 (556) +++-+|++.+.+++|.+++.... 
+ +.....|+.|..+ +|...+..+.+..+.- T Consensus 102 ~~k~Ivd~~~~yl~G~p~~~~~~------------d----~~~~~~l~~~~~n-----------~~~~~~~~~~~~a~~~ 154 (492) T protein:vir:94 102 FHANLVDQKVSYIVGKPIAFKHT------------D----DEVVKRIDEVLGN-----------RFDDKLHSVLTGASNK 154 (492) T ss_pred hHHHHHHHHHhhhcccCceeccC------------c----hHHHHHHHHHHhc-----------cHHHHHHHHHHHHhhC Confidence 99999999999999988776532 1 1233345555431 3555666677888899 Q ss_pred CceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEE---CCCCC-------eEEEEEeecCCCc Q lcl|NC_019524. 158 GEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQL---DNNGA-------ALGYWLRKAFPGD 227 (556) Q Consensus 158 GE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~---d~~Gr-------~vaY~i~~~hpgd 227 (556) |.+|..+. ....+ .+++.+++|..+-.-+.....+.+..+|++ +.... .+.||+... +. T Consensus 155 G~a~~~v~-~d~dg--------~~~~~~~~p~~~~~v~d~~~~~~~~a~ir~~~~~~~~~~~~y~~~~v~~~~~~~--~~ 223 (492) T protein:vir:94 155 GIEWLHPY-LDEEG--------EFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKVTVNYYVYEN--GS 223 (492) T ss_pred CeEEEEEE-ecCCC--------ceEEEEEcccceEEEEcCCCCCceEEEEEEEeeccceeEEEEecCeEEEEEEec--Ce Confidence 99997754 32211 257889999887543333334446666654 12111 222332221 10 Q ss_pred cc--cCCccccceee----ccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEec Q lcl|NC_019524. 228 PT--DMEQWKWGYEP----ARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVES 301 (556) Q Consensus 228 ~~--~~~~~~~~rv~----~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~ 301 (556) .. ......+..+. .+..|| |+.+.. -..|+|.|.+++..+..++....-......-.+.-..+++. 
T Consensus 224 ~~~~~~~~~~~~~~~~~~~~~g~vP---vv~~~n-----n~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g 295 (492) T protein:vir:94 224 LIPDYSNNLENSKTHFSTGSWGKIP---FIPFKN-----NDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLKN 295 (492) T ss_pred eeeccccccccccccccccCCCccc---eEEecC-----CCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeec Confidence 00 00000000000 001111 222222 23699999998877765555332222222211111122221 Q ss_pred cCcccccccccccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHH Q lcl|NC_019524. 302 ELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAA 381 (556) Q Consensus 302 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaa 381 (556) .+.. + .+.....+....+..+..|.++++++.+.+...+..+.+.+...|.. T Consensus 296 -~~~~-----------~----------------~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~ 347 (492) T protein:vir:94 296 -YDDQ-----------E----------------LPEFKRLLRYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIML 347 (492) T ss_pred -CCcc-----------c----------------chhhHHHHhhccceecCCCCcceeEeccCCHHHHHHHHHHHHHHHHH Confidence 1100 0 00011123445666778889999999898999999999998887777 Q ss_pred hcCCC---HHHhhchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhh Q lcl|NC_019524. 382 SLGMS---YEQFSRDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMM 458 (556) Q Consensus 382 glGi~---ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~ 458 (556) -.++| .+.++++.|+ -+++..+..........+..|-..+ +.+++..++ ++ | .+... T Consensus 348 ~s~~p~~~~~~~~~n~Sg---~Al~~~~~~l~~k~~~k~~~f~~~l-~~~~~li~~--~~-~---~~~~~---------- 407 (492) T protein:vir:94 348 FGQAVDFSSDKFGSAPSG---VALEFLYTNLNLKADKLARKAKVAI-QELLWFVFE--HF-D---IKGEH---------- 407 (492) T ss_pred HhCCcCCCccccccCchH---HHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHH--Hh-c---CCccc---------- Confidence 77665 3445444433 3556555555555555555554433 334443322 22 1 22211 Q ss_pred HHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCC--CHHHHHHHHHHHHHHHHHcCCCCCccccccCC Q lcl|NC_019524. 
459 RDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGG--DFREVFKQRAREEGLIKSLKLDFTGKMVEGNS 536 (556) Q Consensus 459 ~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~--D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~ 536 (556) .-+.+.|..+-. .|-...+++..+. .|+.|.+..+...|. |+++.++++.+|++...+.--.+ . T Consensus 408 --~~i~v~f~~~~p--~~~~e~~~~~~kl--~giiS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~--------~ 473 (492) T protein:vir:94 408 --KDVDISFNYNKV--ANTELQVQTAQQS--MGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNL--------D 473 (492) T ss_pred --ceeeEEecCCCC--CCHHHHHHHHHHH--hccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhcccc--------c Confidence 113567744433 3555555555544 599999999999875 89999999999976553321111 0 Q ss_pred CCCCCCCCCCCCCCCCcCC Q lcl|NC_019524. 537 TQSSNSSESTSDNPNEETT 555 (556) Q Consensus 537 ~~~~~~~~~~~~~~~~e~~ 555 (556) ....+..+++++..+.|++ T Consensus 474 ~~~~~~~~~~~~~~~~e~e 492 (492) T protein:vir:94 474 DGGADSAQQQERSNNKESE 492 (492) T ss_pred cccCCCCccccCCccccCC Confidence 1111111111111111222 No 149 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=99.07 E-value=3.2e-09 Score=67.18 Aligned_cols=454 Identities=8% Similarity=-0.021 Sum_probs=213.0 Q ss_pred CCcchhhhHHHHHh-hHhhccc----chhhhh-hhhcch-hccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKK-AVDVVAE----TATATP-MAVGGG-MEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMV 73 (556) Q Consensus 1 ~sp~~~~~r~~a~~-a~~~~~~----~~~~~~-~~~~~~-y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~ 73 (556) |.=++=+.-....- .+..... ....+. ...... |.+-............... .+............++. 
.= T Consensus 1 ~~~~~~~~~~~~~~~~~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~-~k 78 (474) T protein:vir:94 1 MTLYKLIDDIEAQGILPKHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEK-EDFETGGNVRRLDVSVN-NK 78 (474) T ss_pred CchHHHHhhccccCCCHHHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhh-hhhhhcccccccccCcc-cc Confidence 22222221111110 0000000 000000 111112 2221100000000000000 00000000000111110 01 Q ss_pred hcChHHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhh Q lcl|NC_019524. 74 QNDGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSG 153 (556) Q Consensus 74 rNn~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~ 153 (556) -.+++++-+|+..+.+++|..++.....+ ..-++.+...|++|... + +|......+.+. T Consensus 79 i~~n~~~~ivd~~~~yl~g~pv~~~~~~~-----------~~~~e~~~~~l~~~~~~----n------~~~~~~~~~~~~ 137 (474) T protein:vir:94 79 LNNSFDSEIVDTRVGYLHGVPVTYDLDEN-----------AEKNEKLKKFITNFAIR----N------SVDDEDSEIGKM 137 (474) T ss_pred cccchHHHHHHhHhhheeccceeEeeCCC-----------CcchHHHHHHHHHHHhh----c------CHhHHHHHHHHH Confidence 13679999999999999998887765321 12234555566666543 2 567777888888 Q ss_pred heecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEE----ECCCCCeEEEEEeecCCCccc Q lcl|NC_019524. 154 FLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQ----LDNNGAALGYWLRKAFPGDPT 229 (556) Q Consensus 154 ~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE----~d~~Gr~vaY~i~~~hpgd~~ 229 (556) ...-|.+|..+ +....+ .+++.+++|..+-.-++. .+ ....+|. 
.+..+...-|++.--.+...+ T Consensus 138 ~~~~G~a~~~~-~~d~~~--------~~~~~~i~p~~~~~v~d~-~~-~~~~~i~~~~~~~~~~~~~~~~~~~y~~~~~~ 206 (474) T protein:vir:94 138 AAICGYGARLA-YIDTNG--------DIRIKNIDPYNVIFVGDN-IL-EPTYSLRYFYEKDDDNGTDYVYAEFYDNAYYY 206 (474) T ss_pred HhhcCeEEEEE-EeCCCC--------eeEEEEEcccceEEEEcC-CC-ceEEEEEEEEEeeCCCceEEEEEEEEcCceEE Confidence 99999999775 433222 268899999987322221 11 1223332 233333222222211111111 Q ss_pred cC---Cccccceeecc-ccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcc Q lcl|NC_019524. 230 DM---EQWKWGYEPAR-FDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPS 305 (556) Q Consensus 230 ~~---~~~~~~rv~~~-~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~ 305 (556) .. ....|..+... ..++.-=|+|+.. -..|+|.|.+++..+..++....-......-.+.-..+|+-...+ T Consensus 207 ~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g~~~~ 281 (474) T protein:vir:94 207 VFRGEGIDALQEVGRYEHLFDYNPLFGVPN-----NKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLRGMGMS 281 (474) T ss_pred EEeecCCCcccccccccCCCCccceEEecC-----CCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhccCCCC Confidence 00 00111111111 0122112455443 346999999987777666654443333332222222233211000 Q ss_pred cccccccccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCC Q lcl|NC_019524. 306 DVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGM 385 (556) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi 385 (556) + + . 
.......|.|..+..|-++++++.+.+...+..+.+.+.+.|....++ T Consensus 282 ~-----------~-------~-----------~~~~~~~~~i~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~ 332 (474) T protein:vir:94 282 E-----------E-------M-----------IQETQKSGAFELFDKDMDVKYLTKDVNDTMIENHLDRIEKNIMRFAKS 332 (474) T ss_pred c-----------h-------h-----------hhhhhhcceeEecCCCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCC Confidence 0 0 0 000123566777788899999999999999999999999999888887 Q ss_pred CH---HHhhchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHh Q lcl|NC_019524. 386 SY---EQFSRDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDAL 462 (556) Q Consensus 386 ~y---e~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~ 462 (556) |- +.++++ .|-.+.+..+..........+..|-.. ++.+++..+...-..|.-..+..+ .- T Consensus 333 p~~~~~~~~~n---~Sg~Al~~~~~~l~~k~~~~~~~~~~~-l~~~~~li~~~l~~~~~~~~~~~~------------~~ 396 (474) T protein:vir:94 333 VNFNSDEFNGN---VPIIGMKLKLMALENKCMTFERKMTAM-LRYQFKVILSALKRKGYNLDDDSY------------LN 396 (474) T ss_pred ccccccccccc---chHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhhccCCCCcccc------------cc Confidence 63 333333 444466665555544455555444333 344555544432222221111100 01 Q ss_pred hCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhC--CCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCC Q lcl|NC_019524. 463 CNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRLG--GDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSS 540 (556) Q Consensus 463 ~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G--~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~ 540 (556) +.+.|..+- ..|....+++..+. .|+.|.+..+...+ .|+++.++++++|++...+.=... .. T Consensus 397 i~~~f~~~~--p~d~~e~a~~~~kl--~g~iS~et~~~~l~~v~d~~~E~eri~~E~~e~~~~~~~~--------~~--- 461 (474) T protein:vir:94 397 LIFKFTRNI--PVNKLEESQVLINL--KGQVSERTRLGQSQLVDDVDYELDEMEKESLEFNDKLPDI--------DE--- 461 (474) T ss_pred ceEEeCCCC--CCCHHHHHHHHHHH--hccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhcccc--------cC--- Confidence 345664333 36777777777665 59999999999987 499999999999987655431110 00 Q ss_pred CCCCCCCCCCCCcCC Q lcl|NC_019524. 
541 NSSESTSDNPNEETT 555 (556) Q Consensus 541 ~~~~~~~~~~~~e~~ 555 (556) .+..++..+.|++ T Consensus 462 --~~~~~~~~~~~s~ 474 (474) T protein:vir:94 462 --GDANDKSQNNQSE 474 (474) T ss_pred --CCcCCCCccccCC Confidence 0011111111111 No 150 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=99.07 E-value=3.2e-09 Score=67.18 Aligned_cols=454 Identities=8% Similarity=-0.021 Sum_probs=213.0 Q ss_pred CCcchhhhHHHHHh-hHhhccc----chhhhh-hhhcch-hccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKK-AVDVVAE----TATATP-MAVGGG-MEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMV 73 (556) Q Consensus 1 ~sp~~~~~r~~a~~-a~~~~~~----~~~~~~-~~~~~~-y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~ 73 (556) |.=++=+.-....- .+..... ....+. ...... |.+-............... .+............++. .= T Consensus 1 ~~~~~~~~~~~~~~~~~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~-~k 78 (474) T protein:vir:10 1 MTLYKLIDDIEAQGILPKHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEK-EDFETGGNVRRLDVSVN-NK 78 (474) T ss_pred CchHHHHhhccccCCCHHHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhh-hhhhhcccccccccCcc-cc Confidence 22222221111110 0000000 000000 111112 2221100000000000000 00000000000111110 01 Q ss_pred hcChHHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhh Q lcl|NC_019524. 74 QNDGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSG 153 (556) Q Consensus 74 rNn~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~ 153 (556) -.+++++-+|+..+.+++|..++.....+ ..-++.+...|++|... + +|......+.+. 
T Consensus 79 i~~n~~~~ivd~~~~yl~g~pv~~~~~~~-----------~~~~e~~~~~l~~~~~~----n------~~~~~~~~~~~~ 137 (474) T protein:vir:10 79 LNNSFDSEIVDTRVGYLHGVPVTYDLDEN-----------AEKNEKLKKFITNFAIR----N------SVDDEDSEIGKM 137 (474) T ss_pred cccchHHHHHHhHhhheeccceeEeeCCC-----------CcchHHHHHHHHHHHhh----c------CHhHHHHHHHHH Confidence 13679999999999999998887765321 12234555566666543 2 567777888888 Q ss_pred heecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEE----ECCCCCeEEEEEeecCCCccc Q lcl|NC_019524. 154 FLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQ----LDNNGAALGYWLRKAFPGDPT 229 (556) Q Consensus 154 ~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE----~d~~Gr~vaY~i~~~hpgd~~ 229 (556) ...-|.+|..+ +....+ .+++.+++|..+-.-++. .+ ....+|. .+..+...-|++.--.+...+ T Consensus 138 ~~~~G~a~~~~-~~d~~~--------~~~~~~i~p~~~~~v~d~-~~-~~~~~i~~~~~~~~~~~~~~~~~~~y~~~~~~ 206 (474) T protein:vir:10 138 AAICGYGARLA-YIDTNG--------DIRIKNIDPYNVIFVGDN-IL-EPTYSLRYFYEKDDDNGTDYVYAEFYDNAYYY 206 (474) T ss_pred HhhcCeEEEEE-EeCCCC--------eeEEEEEcccceEEEEcC-CC-ceEEEEEEEEEeeCCCceEEEEEEEEcCceEE Confidence 99999999775 433222 268899999987322221 11 1223332 233333222222211111111 Q ss_pred cC---Cccccceeecc-ccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcc Q lcl|NC_019524. 230 DM---EQWKWGYEPAR-FDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPS 305 (556) Q Consensus 230 ~~---~~~~~~rv~~~-~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~ 305 (556) .. ....|..+... ..++.-=|+|+.. 
-..|+|.|.+++..+..++....-......-.+.-..+|+-...+ T Consensus 207 ~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g~~~~ 281 (474) T protein:vir:10 207 VFRGEGIDALQEVGRYEHLFDYNPLFGVPN-----NKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLRGMGMS 281 (474) T ss_pred EEeecCCCcccccccccCCCCccceEEecC-----CCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhccCCCC Confidence 00 00111111111 0122112455443 346999999987777666654443333332222222233211000 Q ss_pred cccccccccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCC Q lcl|NC_019524. 306 DVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGM 385 (556) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi 385 (556) + + . .......|.|..+..|-++++++.+.+...+..+.+.+.+.|....++ T Consensus 282 ~-----------~-------~-----------~~~~~~~~~i~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~ 332 (474) T protein:vir:10 282 E-----------E-------M-----------IQETQKSGAFELFDKDMDVKYLTKDVNDTMIENHLDRIEKNIMRFAKS 332 (474) T ss_pred c-----------h-------h-----------hhhhhhcceeEecCCCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCC Confidence 0 0 0 000123566777788899999999999999999999999999888887 Q ss_pred CH---HHhhchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHh Q lcl|NC_019524. 386 SY---EQFSRDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDAL 462 (556) Q Consensus 386 ~y---e~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~ 462 (556) |- +.++++ .|-.+.+..+..........+..|-.. 
++.+++..+...-..|.-..+..+ .- T Consensus 333 p~~~~~~~~~n---~Sg~Al~~~~~~l~~k~~~~~~~~~~~-l~~~~~li~~~l~~~~~~~~~~~~------------~~ 396 (474) T protein:vir:10 333 VNFNSDEFNGN---VPIIGMKLKLMALENKCMTFERKMTAM-LRYQFKVILSALKRKGYNLDDDSY------------LN 396 (474) T ss_pred ccccccccccc---chHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhhccCCCCcccc------------cc Confidence 63 333333 444466665555544455555444333 344555544432222221111100 01 Q ss_pred hCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhC--CCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCC Q lcl|NC_019524. 463 CNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRLG--GDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSS 540 (556) Q Consensus 463 ~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G--~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~ 540 (556) +.+.|..+- ..|....+++..+. .|+.|.+..+...+ .|+++.++++++|++...+.=... .. T Consensus 397 i~~~f~~~~--p~d~~e~a~~~~kl--~g~iS~et~~~~l~~v~d~~~E~eri~~E~~e~~~~~~~~--------~~--- 461 (474) T protein:vir:10 397 LIFKFTRNI--PVNKLEESQVLINL--KGQVSERTRLGQSQLVDDVDYELDEMEKESLEFNDKLPDI--------DE--- 461 (474) T ss_pred ceEEeCCCC--CCCHHHHHHHHHHH--hccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhcccc--------cC--- Confidence 345664333 36777777777665 59999999999987 499999999999987655431110 00 Q ss_pred CCCCCCCCCCCCcCC Q lcl|NC_019524. 541 NSSESTSDNPNEETT 555 (556) Q Consensus 541 ~~~~~~~~~~~~e~~ 555 (556) .+..++..+.|++ T Consensus 462 --~~~~~~~~~~~s~ 474 (474) T protein:vir:10 462 --GDANDKSQNNQSE 474 (474) T ss_pred --CCcCCCCccccCC Confidence 0011111111111 No 151 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=99.04 E-value=4.3e-09 Score=66.43 Aligned_cols=420 Identities=9% Similarity=0.015 Sum_probs=191.6 Q ss_pred chhhhHHH----HHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHH--HHHHHHHHh--- Q lcl|NC_019524. 
4 VKKTTRTR----AKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMA--SARAQDMVQ--- 74 (556) Q Consensus 4 ~~~~~r~~----a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~l--r~RaRdl~r--- 74 (556) |.-..+-. ...-++.. ...+. +. ..| ....+......+ ..+.+..+. T Consensus 1 ~~~~~~~~~~~~~~e~~~~~--------------~~~~~-~~---~~~------i~~~i~~~~~~~~~~~~~~~yY~g~~ 56 (478) T protein:vir:10 1 MISINWPWDKPYHEQVVEQI--------------KPKYE-TQ---EEM------ILRLVREHKENIDNITMGERYYNHHP 56 (478) T ss_pred CccccCCCCchhHHHHHHHH--------------hhccC-Cc---HHH------HHHHHHHHHHHHHHHHHHHHHhcCCC Confidence 33332200 00000000 00000 00 000 000000000000 011111111 Q ss_pred ------------------------cChHHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcc Q lcl|NC_019524. 75 ------------------------NDGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAES 130 (556) Q Consensus 75 ------------------------Nn~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~ 130 (556) .+++++-+|++.+.+++|.+++.... .+ ...+.+. .|... T Consensus 57 ~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~---------~d---~~~~~l~----~~~~n 120 (478) T protein:vir:10 57 DILDAPPKRDVNGDYDETKPDWRMYTNYHQNLVDQKVAYAVANPVTFGVD---------ND---KALKQIQ----HTLNH 120 (478) T ss_pred chhccccccccccccccccccceeccchHHHHHHHHHhhhccCCeeeecC---------Ch---HHHHHHH----HHHhc Confidence 24688999999999999988876532 11 2223332 33221 Q ss_pred cccceehhcccCHHHHHHHHhhhheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEEC Q lcl|NC_019524. 
131 PENWFDARRMCTLTGLTRLAVSGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLD 210 (556) Q Consensus 131 ~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d 210 (556) +|...+.-+.+..+.-|.+|+.+ |....+ .+++.+++|+.+---+.....+.+..+|.+- T Consensus 121 -----------~~~~~~~~~~~~~~~~G~~~~~~-~~d~~g--------~~~~~~~~p~~~~~i~d~~~~~~~~~~v~~~ 180 (478) T protein:vir:10 121 -----------KWDDKLVDILTAASNKGIEWVQP-YVDEEG--------EFKTFRVPAEQAVPIWTNKERDELQAFIRVY 180 (478) T ss_pred -----------CHHHHHHHHHHHHHhcCeEEEEE-EecCCC--------eeEEEEEcccceEEEEcCCCCCceEEEEEEE Confidence 45666666678889999999775 433222 2578899998875323322233456666431 Q ss_pred C-CCC---------eEEEEEeecCCCcccc----CCcccccee----eccccCChhHeEeeecccCCCcccCCchhhHHH Q lcl|NC_019524. 211 N-NGA---------ALGYWLRKAFPGDPTD----MEQWKWGYE----PARFDWGRRRVIHIIEALLAGQTRGISEMVSAL 272 (556) Q Consensus 211 ~-~Gr---------~vaY~i~~~hpgd~~~----~~~~~~~rv----~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l 272 (556) . .+. .+-||.+..-...... .....+..+ -.+..+| |+|+... ..|.|.|.+++ T Consensus 181 ~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP---vv~~~n~-----~~g~sd~~~v~ 252 (478) T protein:vir:10 181 ELDGAERVEYWTKDDVTYYELKEGQLIPDFYRSDDHIQPHYYQGNKLMSWGRVP---FIPFKNN-----PQEVSDLFMYK 252 (478) T ss_pred EecCceEEEEEeCCeEEEEEEcCCeeeccccccccccccceecccccccCCccc---eEEeccC-----CCCCCcHHHHH Confidence 1 111 1112222110000000 000000000 0112223 5555443 36999999977 Q ss_pred HHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccccccccccccccccccceecCCceeeec-- Q lcl|NC_019524. 273 KQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHL-- 350 (556) Q Consensus 273 ~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L-- 350 (556) ..+..++....-......-.+.--.+++--.++. . . 
.....+..+.+..+ T Consensus 253 ~liDa~~~~~S~~~~~~~~~~~p~~~~~g~~~~~------------~----~------------~~~~~~~~~~~~~~~~ 304 (478) T protein:vir:10 253 TIIDALDKRLSDTQNTFDESVELIYILKGYEGED------------M----K------------DFMHNLKYYKAISVAG 304 (478) T ss_pred HHHHHHHHHHHHHHHHHHHhhCceeeeecCCccc------------c----c------------hhhhhhhhcceEEecC Confidence 7776666544333333222222122222111000 0 0 00011222333333 Q ss_pred CCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCC---HHHhhchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 351 YPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMS---YEQFSRDYTKTNYSSARASMAETQKYMDSRKKLVADRFASA 427 (556) Q Consensus 351 ~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~---ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~p 427 (556) ..|-++++++.+.+...+..+++.+.+.|....++| .+.++++. |-.+.+..+..........+..|-.. ++. T Consensus 305 ~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~---Sg~Al~~~~~~l~~k~~~~~~~~~~~-l~~ 380 (478) T protein:vir:10 305 ESGSGVDTIKVEVPIDSVKEYTKMLRDYIIEFGQGVDFQQDKFGNSP---SGIALKFMYSNLDLKANKLKNKTLTA-LQE 380 (478) T ss_pred CCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCccccCcccccccc---HHHHHHHHHHHHHHHHHHHHHHHHHH-HHH Confidence 467788999888889999999999988888888866 23333333 33355554444433344444443333 333 Q ss_pred HHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCC--C Q lcl|NC_019524. 428 IYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGG--D 505 (556) Q Consensus 428 i~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~--D 505 (556) +++..+ +++.+.+.. .-+.+.|.. ..+ .|....+++..+ .+|+.|.+..+...|. 
| T Consensus 381 ~~~li~--~~~g~~~~~----------------~~i~i~f~~-~~p-~d~~e~a~~~~k--l~g~iS~et~~~~l~~v~D 438 (478) T protein:vir:10 381 LLQYII--DFYRLDVKV----------------QDIEITFNF-NVM-VNELENSQIAMN--STGLLSKETILSNHAWVED 438 (478) T ss_pred HHHHHH--HHhCCCccc----------------ccceEEecC-CCC-CCHHHHHHHHHH--HhCCCChHHHHHhCCCCCC Confidence 443322 222222211 123567733 333 466555555443 4899999999999874 8 Q ss_pred HHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCC Q lcl|NC_019524. 506 FREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNE 552 (556) Q Consensus 506 ~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~ 552 (556) +++.++++++|++...+.-.... .......+++++++.+| T Consensus 439 ~~~E~~ri~~E~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~ 478 (478) T protein:vir:10 439 PVAEMERIEQENIELNQQLPDIE-------EGLNGEQQRQSENNQPE 478 (478) T ss_pred HHHHHHHHHHHHHHHHhhccccc-------cccCCCCCCCCCCCCCC Confidence 99999999998765544311110 11111111111111111 No 152 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=99.03 E-value=4.5e-09 Score=66.33 Aligned_cols=393 Identities=6% Similarity=-0.068 Sum_probs=195.7 Q ss_pred cccccCCCCCHHHHHHHHHHHHH------HHHHHHHh-----------------------------------cChHHHHH Q lcl|NC_019524. 44 MFQWNPSIISPDQQIAQNQDMAS------ARAQDMVQ-----------------------------------NDGYAAGV 82 (556) Q Consensus 44 ~~~w~~~~~s~~~~i~~~~~~lr------~RaRdl~r-----------------------------------Nn~~a~~~ 82 (556) |. .......|..-..... .+.++.+. 
.+++++-+ T Consensus 1 ~~-----~e~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~hdi~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~I 75 (471) T protein:vir:10 1 ME-----IEVIKKIISSQMVKHGKFVSQAAEAEKYYRNENDIKRKRKPADKKGAENEAKAEDNAFRNADNRISHNWHQLL 75 (471) T ss_pred CC-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccchhhhhcccccccccccccccccceeccchhHHH Confidence 00 0001111111111111 11122221 24688888 Q ss_pred HHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEE Q lcl|NC_019524. 83 VAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLA 162 (556) Q Consensus 83 v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~ 162 (556) |+..+.+++|..++..+. ++.....|+.|..+ +|...+..+.+....-|.++. T Consensus 76 vd~~~~yl~G~p~~~~~~----------------~~~~~~~l~~~~~n-----------~~~~~~~~~~~~~~~~G~~~~ 128 (471) T protein:vir:10 76 LDQKKAYALTYPPTFDVD----------------DKKVNDMIVDVLGD-----------DYERISKQLCVNAGNAGIAWL 128 (471) T ss_pred HHhhhhhhcccCceeccC----------------ChHHHHHHHHHHhc-----------CHHHHHHHHHHHHhhCCeEEE Confidence 999999999987776532 12233445555331 456667777888999999997 Q ss_pred EEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEE----C-CCCCeE-EEEEeecCCCccccCCc--- Q lcl|NC_019524. 163 TCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQL----D-NNGAAL-GYWLRKAFPGDPTDMEQ--- 233 (556) Q Consensus 163 ~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~----d-~~Gr~v-aY~i~~~hpgd~~~~~~--- 233 (556) .+.+... ++ .+++..++|..+-.-+.......+..+|++ + ..+..+ -++++..+---.+.... T Consensus 129 ~v~~d~~-----~g---~~~~~~~~p~~~~~i~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~vy~~~~~~~y~~~~~~~ 200 (471) T protein:vir:10 129 HVWKDAS-----DN---SFRYACVDSKEVIPIYSKSLDKKSIGVLRVYSSIDETDGKNYTVYEYWNDKECSFYRHEKEKP 200 (471) T ss_pred EEEeeCC-----CC---eeEEEEEcccceEEEEcCCCCCceEEEEEEEEeeccCCCceeEEEEEEeCCcEEEEEecCCcc Confidence 7544221 11 268899999987533333333445666643 2 223322 22232221100000000 Q ss_pred -----------------ccc---ceee-ccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019524. 
234 -----------------WKW---GYEP-ARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVN 292 (556) Q Consensus 234 -----------------~~~---~rv~-~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~ 292 (556) ..+ ..++ .+..|| |+|+... .+|+|.|.+++..+..++....-......-. T Consensus 201 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP---vv~~~n~-----~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~ 272 (471) T protein:vir:10 201 LEELETFQAISLIDTMNGDRSSDNSFKHDFGLVP---FIPFKNN-----EIETNDLKPIKDLVDVYDKVFSGFVNDTDDV 272 (471) T ss_pred cccccccccccccccccccccccccccCCCCcee---EEEeccC-----CCCCCchHHHHHHHHHHHHHHHHHHHHHHHh Confidence 000 0000 111222 4444332 4689999987777665554332222222211 Q ss_pred cceeeeEeccCcccccccccccccccccccccccccccccccccccceecC-Cceeee----cCCCceeeeecCCCCCcc Q lcl|NC_019524. 293 ATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAID-GAKIPH----LYPGTKLKMQPAGTPGGV 367 (556) Q Consensus 293 A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~-pG~i~~----L~pGe~i~~~~~~~p~~~ 367 (556) +--..+++-...+. .++ ....+. .++|.. ...|-++++++.+.+... T Consensus 273 ~~~~lv~~g~~~~~---------~~~-------------------~~~~~~~~~~i~~~~~~~~~~~~~~~l~~~~~~~~ 324 (471) T protein:vir:10 273 QEVIFVLTNYGGQD---------KQE-------------------FLEDLKRYKMIKMDNDGMGDQSGVTTIAIDIPTEA 324 (471) T ss_pred hCceeeeecCCccc---------cch-------------------hHHHhhcCCeEEecCCCCccCccceEEeecCChHH Confidence 21112333211000 000 000111 222221 123458899999999999 Q ss_pred HHHHHHHHHHHHHHhcCCCHHHhhchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCC Q lcl|NC_019524. 368 GTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPG 447 (556) Q Consensus 368 f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~ 447 (556) +..+++.+.+.|....++|-... ..++++|-.+++..+......+...+..|-..+ +.+++..+. ++.. . .. 
T Consensus 325 ~~~~~~~l~~~I~~~s~tp~~~~-~~~gn~Sg~Alk~~~~~l~~k~~~~~~~~~~~l-~~~~~li~~--~~~~-~---d~ 396 (471) T protein:vir:10 325 RNLILERTKKQIFISGQGVNPET-DKLGNSSGVALKFLYSLLELKAGNMETQFRSGY-ATLVKMILK--HLGL-S---DK 396 (471) T ss_pred HHHHHHHHHHHHHHHhCCcCCCc-ccccCccHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHH--Hhcc-C---CC Confidence 99999999999999988773322 234555556677766666655666666655544 444544333 2221 1 11 Q ss_pred cccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCC--CHHHHHHHHHHHHHHHHHcCC Q lcl|NC_019524. 448 KNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGG--DFREVFKQRAREEGLIKSLKL 525 (556) Q Consensus 448 ~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~--D~e~v~~q~a~E~~~~~~~Gl 525 (556) .-+.+.|...- ..|....+++..+. .|+.|.+.++...+. |+++.++++++|++...+.- T Consensus 397 -------------~~i~i~f~~~~--p~n~~e~~~~~~kl--~g~iS~et~~~~~p~v~D~~~E~eri~~E~~~~~~~~- 458 (471) T protein:vir:10 397 -------------LKIKQTWTRNS--INNDTEMAQVVSTL--ATITSRENVAKSNPIVEDWQDELRLQKAEQEGRSEKL- 458 (471) T ss_pred -------------ceeEEEeCCCC--CCCHHHHHHHHHHH--hccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhcc- Confidence 01346664433 34666666655543 699999999999875 89999999999887553321 Q ss_pred CCCccccccCCCCCCCCCCCCCCCCCCc Q lcl|NC_019524. 526 DFTGKMVEGNSTQSSNSSESTSDNPNEE 553 (556) Q Consensus 526 ~~~~~~~~~~~~~~~~~~~~~~~~~~~e 553 (556) + + ..+..++++-| T Consensus 459 ~-~--------------~~~~~~~~e~~ 471 (471) T protein:vir:10 459 Y-D--------------MEEVEHESEVE 471 (471) T ss_pred c-c--------------cCCCCCccccC Confidence 0 0 00000011111 No 153 >protein:vir:103971 Length: 376 # NCBI annotation: pbsx family phage portal protein # Family: family:all:196 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293752;genbank:gi:72537722;genbank:GeneID:3608098 Probab=99.03 E-value=1.8e-10 Score=73.98 Aligned_cols=330 Identities=10% Similarity=0.051 Sum_probs=153.6 Q ss_pred CCcchhhhHHHHHhhHhhcccchh--hhhhhhcchhcccc--CCCc---------ccccccCCCCCHHHHHHHHHHHHHH Q lcl|NC_019524. 
1 MKDVKKTTRTRAKKAVDVVAETAT--ATPMAVGGGMEGAE--RTTR---------EMFQWNPSIISPDQQIAQNQDMASA 67 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~--~~~~~~~~~y~aa~--~~~r---------~~~~w~~~~~s~~~~i~~~~~~lr~ 67 (556) +-||+..+..+....+........ ...+...+++.... -+++ .+..|..++-+.. .| T Consensus 23 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~fg~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~--------~L-- 92 (376) T protein:vir:10 23 VLRMSKRRSRAPRTFAAAPNPSAGSAAPARAEVFTFDDPTPVMNRAEILDYVECWSNGEWFEPPVSFA--------GL-- 92 (376) T ss_pred cccchhccCCCcccchhhhhHhhhccCcceeEEEEcCCceeccCcchhhhhhhhhhcCceecCCCCHH--------HH-- Confidence 333333221111100000000000 00000001110000 0000 0111222222211 11 Q ss_pred HHHHHHhcChHHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHH Q lcl|NC_019524. 68 RAQDMVQNDGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLT 147 (556) Q Consensus 68 RaRdl~rNn~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq 147 (556) -.|.+-|++..++|...++.+.+ .++|. ..+|..++. T Consensus 93 --a~~~~~~~~h~s~l~~k~n~l~~-~~~Pn----------------------------------------p~lT~~~f~ 129 (376) T protein:vir:10 93 --AKSFRASTHHSSALFFKANVLAS-TFRPH----------------------------------------RWLSRHAFE 129 (376) T ss_pred --HHHHhhhHHhhhhHHHHhHHHHh-ccCCC----------------------------------------CCCCHHHHH Confidence 14566677777777655444433 22222 234455555 Q ss_pred HHHhhhheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCc Q lcl|NC_019524. 148 RLAVSGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGD 227 (556) Q Consensus 148 ~l~~r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd 227 (556) +++. .++.-|.+++.++... ...++.|..|.|.++.. +. |.. .|+..... 
+ T Consensus 130 ~~v~-d~ll~Gnay~~~~rn~--------~G~~~~L~pl~~~~vr~------------~~--d~~----~~~~~~~~-~- 180 (376) T protein:vir:10 130 RWAL-DFLTFGNGYLERRRNM--------VGGTLRLEPALAKYVRR------------KA--DFN----GFVYVNGW-Q- 180 (376) T ss_pred HHHH-HHHhcCCeEEEEEECC--------CCCEEEEEEeCCcceEE------------Ee--eCC----eEEEEEcC-C- Confidence 5554 5677899998876432 12467888888877632 11 111 23333210 0 Q ss_pred cccCCccccceeeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHH---hcceeeeEeccCc Q lcl|NC_019524. 228 PTDMEQWKWGYEPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVV---NATYAASVESELP 304 (556) Q Consensus 228 ~~~~~~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i---~A~~~~fi~~~~~ 304 (556) ....+++.+|||+..+...++..|+|++..++..+. -=..|++-+.+. .+...++|+..++ T Consensus 181 -------------~~~~~~~~eViHir~~~~~~~~yGls~~~~a~~si~---l~~aa~~f~~~~f~NGa~pggIl~~~d~ 244 (376) T protein:vir:10 181 -------------ERHEFEPDSVFQLVRPDINQEVYGLPEYLSSLHSAW---LNESSTLFRRKYYENGSHAGFILYMTDA 244 (376) T ss_pred -------------eEEEEccccEEEecCCCCCCCcccccHHHHHHHHHH---HHHHHHHHHHHHHhccCCCceEEEecCC Confidence 012356789999999887789999999888776543 223344433333 3445555554322 Q ss_pred ccccccccccccccccccccccccccccccccccceecCCceeeec-----CCCceeeeecCCCCCccHHHHHHHHHHHH Q lcl|NC_019524. 305 SDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHL-----YPGTKLKMQPAGTPGGVGTDYEQSLLRNI 379 (556) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L-----~pGe~i~~~~~~~p~~~f~~F~~~~lr~i 379 (556) .. .+++.+........ ..+. 
-..|.+..| +.|-+++.++......+|.+..+....+| T Consensus 245 ~l---------~~e~~~~lr~~~~~----~~G~----~N~~~~~vl~~~g~~~Gi~~~pls~~~~d~qf~e~k~~~~~eI 307 (376) T protein:vir:10 245 AQ---------KQDDVDNMRDALKN----AKGP----GNFRNVFMYAPGGKKDGIQLIPVSEVAAKDEFFNIKNVTRDDL 307 (376) T ss_pred CC---------CHHHHHHHHHHHHH----hcCc----cccCceeEecCCCCccceEEEEccCCHHHHHHHHHHHHhHHHH Confidence 11 01111111111100 0000 112344444 34667888877777888999999999999 Q ss_pred HHhcCCCHHHhhchhhc--ccchhHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccch Q lcl|NC_019524. 380 AASLGMSYEQFSRDYTK--TNYSSARASMAETQKYM-DSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDP 456 (556) Q Consensus 380 aaglGi~ye~l~~D~s~--~nYSs~R~~~~e~~r~~-~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~ 456 (556) |+.+|||.. +.|...+ .+||++.+....|.+.. ..++..|. ++.. ||.. +.+.|+.. T Consensus 308 a~af~VPp~-llGi~~~~t~~~sn~eq~~~~f~~~~L~Pl~~~ie-eln~-----~L~~-------------~~~~F~~~ 367 (376) T protein:vir:10 308 LAAHRVPPQ-LLGIVPSNSGGFGTPDTAARVFGRNEIRPLQARFA-ELND-----WLGE-------------EVVRFDDY 367 (376) T ss_pred HHHhCCCHH-HhcccCCCCCCcccHHHHHHHHHHHHHHHHHHHHH-HHHh-----hccc-------------cccccChh Confidence 999999987 5576654 46999988877776543 34444331 1111 1110 11122222 Q ss_pred hhHHHhhCe Q lcl|NC_019524. 457 MMRDALCNA 465 (556) Q Consensus 457 ~~~~a~~~~ 465 (556) ...+.-.++ T Consensus 368 ~Llr~d~ka 376 (376) T protein:vir:10 368 EIPPAPVAA 376 (376) T ss_pred HhhcccccC Confidence 111111111 No 154 >protein:vir:94869 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762515;genbank:gi:115304214;genbank:GeneID:5141182 Probab=99.02 E-value=1.1e-10 Score=75.13 Aligned_cols=378 Identities=7% Similarity=-0.044 Sum_probs=183.4 Q ss_pred chhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHH Q lcl|NC_019524. 
4 VKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVV 83 (556) Q Consensus 4 ~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v 83 (556) |.=..|.+ ++.+. .........+..... +...|++++..+| T Consensus 1 M~if~~~~-------------------~~~~~--~~~~~~~~~~~~~~~------------------~~~~~~~~v~~~v 41 (378) T protein:vir:94 1 MNLFGKVV-------------------SFSRG--KLNNDTQRVTAWQNE------------------AVEYTSAFVTNIH 41 (378) T ss_pred CchhHHhH-------------------hhhhc--ccccCcceeeeeecc------------------hhhhhhHHHHHHH Confidence 22222111 11110 100001111111011 1234556788999 Q ss_pred HHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEE Q lcl|NC_019524. 84 AVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLAT 163 (556) Q Consensus 84 ~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~ 163 (556) +.|.+.|-.-=|++.-+... -+..+....... ..++..+-.+|+ -.+|.+++-.+.+..++..|++|+. T Consensus 42 ~~Ia~~iA~lp~~~~~~~~~--~~~~~~~~~~~~---~~l~~lLn~~PN------~~~t~~~f~~~~~~~lll~Gnayi~ 110 (378) T protein:vir:94 42 NKIANEITKVEFNHVKYKKS--DVGSDTLISMAG---SDLDEVLNWSSK------GERNSMEFWQKVIKKLLTTRYIDLY 110 (378) T ss_pred HHHHHhHhhCceeeeeeccc--cccccccccccc---chHHHHHhhcCC------CCCCHHHHHHHHHHHHhhcCCeEEE Confidence 99999988754433221110 000000000001 112233334554 2568889999999999999999976 Q ss_pred EeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceeeccc Q lcl|NC_019524. 164 CEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPARF 243 (556) Q Consensus 164 ~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~~~ 243 (556) .++. |..|+++-++.... 
.+ T Consensus 111 ~i~~------------------------------------------~~~g~~~~~~~~~~------------------~~ 130 (378) T protein:vir:94 111 PIFD------------------------------------------SETGELLDLLFAND------------------KK 130 (378) T ss_pred EEee------------------------------------------CCCCcEEEEEEecC------------------cE Confidence 4321 12333332222110 12 Q ss_pred cCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccc Q lcl|NC_019524. 244 DWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIF 323 (556) Q Consensus 244 ~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~ 323 (556) .++.++|||+..+...+ .+++.+..++. .+..+.-++...++++.+..-... ......+.+ T Consensus 131 ~~~~~dvih~~~~~~~~--~~~~~~~~~~~-----------~~~~~~~~~~~~g~l~~~~~l~~~------~~~~~~e~~ 191 (378) T protein:vir:94 131 EYKPEELVRLTSPFYIN--EDTSILDNALA-----------SIQTKLEQGKLRGLLKINAFLDID------NTQEYREKA 191 (378) T ss_pred EechhceeeecCcCCcc--cchhHHHHHHH-----------HHHHHHhhCCcccceeeCCcCCHH------HHHHHHHHH Confidence 35677999997654433 34444433321 122233345556777754321100 000000111 Q ss_pred cccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccchhHH Q lcl|NC_019524. 324 NEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSAR 403 (556) Q Consensus 324 ~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R 403 (556) ..... .....-..|.+..|..|.+++.++.+.-..++ ..++.+..+||..+|||-+.|.+++ +. T Consensus 192 ~~~~~--------~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fgvPp~~l~g~~-----~e-- 255 (378) T protein:vir:94 192 LATIK--------NMQEGSSYNGLTPVDNKTEIVELKKDYSVLNK-DEIDLIKSELLTGYFMNENILLGTA-----TQ-- 255 (378) T ss_pred HHHHH--------HhhcccccccceeccCCceEEEccCChHHhhH-HHHHHHHHHHHHHhCCCHHHhcCCc-----hH-- Confidence 10000 00011234668899999999999866555565 5568888999999999998885533 21 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHH Q lcl|NC_019524. 
404 ASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEA 483 (556) Q Consensus 404 ~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A 483 (556) + ....++..-+.|+... ++.++-...++-....... ....+..+.+-.......|+...+++ T Consensus 256 ~-----------~~~~f~~~tl~P~~~~-ie~~l~~~Ll~~~e~~~g~------~~~~~~~~~f~~~~l~~~d~~~~~e~ 317 (378) T protein:vir:94 256 E-----------QQIYFYNSTIIPLLIQ-LEKELTYKLISTNRRRVVK------GNLYYERIIVDNQLFKFATLKELIDL 317 (378) T ss_pred H-----------HHHHHHHHHHHHHHHH-HHHHHHhhcCChhHhhhhh------hhcccceeEeecchhhhcCHHHHHHH Confidence 0 1123555667776665 3445555444321100000 00011223444555566799999999 Q ss_pred HHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 484 AILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 484 ~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 556 (556) ....+++|+.|+-|+-+..|..|-+--++ +=++....+... ....+........++|+++| T Consensus 318 ~~~~~~~G~~t~NE~R~~~g~~p~~ggd~----------~~~~~n~~~~~~--~~~~~~~~~~~~~~~e~~n~ 378 (378) T protein:vir:94 318 YHENINGPIFTQNQLLVKMGEQPIEGGDV----------YIANLNAVAVKN--LSDLQGNRKDVTSTDETNNQ 378 (378) T ss_pred HHHHHhCCCcCHHHHHHHhCCCCCCCCCe----------eeecccccchhc--chhcccccCCCCCCCCCCCC Confidence 99999999999999988889888542211 111111111111 11111122222334445555 No 155 >protein:vir:105782 Length: 449 # NCBI annotation: gp5 # Family: family:all:6783 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224143;genbank:gi:62362218;genbank:GeneID:3342535 Probab=99.02 E-value=2.1e-09 Score=68.18 Aligned_cols=436 Identities=11% Similarity=0.058 Sum_probs=183.7 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHH Q lcl|NC_019524. 
1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAA 80 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~ 80 (556) ||-- + ..++.......+.....-+.....++-.+.+.+.|....-+ .. .....| ..+++.|++|+ T Consensus 1 ~~~~--~-----~~~~~~~~~~~~~~~~rd~l~~~~~glg~~r~~~~~~~g~~-~~---~~~~~l----~~~Yr~~~ia~ 65 (449) T protein:vir:10 1 MTDK--L-----TLAVNHALNDARMARARMGLMVPTMGLDNKRHSAWCEYGFP-EL---VTYENL----YSLYRRGGIAH 65 (449) T ss_pred Cchh--h-----HHHHhhhcchhHHHHHHHHHHHHHhcCCcccchhhhhcCCc-cc---CCHHHH----HHHHhcCchhH Confidence 3321 1 12222222111110000001111111111122334322211 11 122222 33899999999 Q ss_pred HHHHHHHhhhccCCceeeeeccccccCCChhHH---HHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheec Q lcl|NC_019524. 81 GVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWG---EEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMT 157 (556) Q Consensus 81 ~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~---~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~d 157 (556) .+|+.+++-.-=.+.... -|.+.+.. ..|..++++++.. + -|..+......+++-. T Consensus 66 ~iVd~~~d~~~~~~~~i~-------~g~~~~~~~~~~~~e~~~~~l~~~-------------~-~~~~l~ea~~~~rl~G 124 (449) T protein:vir:10 66 GAVEKLVGKCWQTNPEII-------EGDDADDSEDETSWEKKSKQVFTN-------------R-LWRSFAEADRRRLVGR 124 (449) T ss_pred HHHHhhhhhhhhcCcccc-------cCccccchhhhHHHHHHHHHHHHH-------------H-HHHHHHHHHHhhhccC Confidence 999999875422211110 01111111 1233333333211 0 0222223333344444 Q ss_pred CceEEEEeeccCCCCcCCCccc-ce-EEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccc Q lcl|NC_019524. 158 GEVLATCEWLNPTGTTMQRRPF-GT-AIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWK 235 (556) Q Consensus 158 GE~f~~~~~~~~~~~~~~~~~~-~l-~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~ 235 (556) |-+++. ..........+-.+- ++ +|.++....|. +. . .....--..+|+|..|+|...-||... 
T Consensus 125 ga~i~i-~v~d~~~l~~Pl~~~~~i~~i~v~~~~~i~-~~-----~-~~~dp~sp~yg~P~~y~v~~~~~g~~~------ 190 (449) T protein:vir:10 125 YAGILL-HIRDEKDWNLPATKGRGLQKVSVSWAGSLK-VA-----E-WDTGINSKTYGQPKLWKYTERLPNGSS------ 190 (449) T ss_pred cEEEEE-EecCCCCCCcccccCcceeeEEeeccccCC-hh-----h-hhcCCCCCCCCCceEEEEeeeccCCCc------ Confidence 444433 322211111110000 00 12222222221 10 0 111111235799999999877666432 Q ss_pred cceeeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHH----HHHHH-HHh-cceeeeEeccCcccccc Q lcl|NC_019524. 236 WGYEPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEI----TLQNA-VVN-ATYAASVESELPSDVVF 309 (556) Q Consensus 236 ~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~da----el~~a-~i~-A~~~~fi~~~~~~~~~~ 309 (556) ....|+.+.|||+-... ..|+|.|-|+...+.+++.-... -+..+ +.. -.|...++ . .... T Consensus 191 -----~~~~iH~SRl~~~~~~~----~~g~~~L~~~yn~l~~~~~~~~~~a~~~l~~~~rq~~~~~~~~~~--~--~~l~ 257 (449) T protein:vir:10 191 -----RRVDIHPDRVFILGDYS----EDAIGFLEPAYNAFVSLEKVEGGSGESFLKNAARQLNVNFEKEID--F--TNLA 257 (449) T ss_pred -----cceeeccceeEeecCCC----CCChhHHHHHHHHhhhHHHhhhhHHHHHHHHHHHHHhhhhhhhhh--h--hhhh Confidence 12358888899875432 33899999999888777664321 11111 110 00101110 0 0000 Q ss_pred cccccccccccccccccccccccccccccceecCCce-eeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHH Q lcl|NC_019524. 310 GQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAK-IPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYE 388 (556) Q Consensus 310 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~-i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye 388 (556) ...+.+.++..+..... 
...+.-|+ ..-+..+++++.++.+ + ++..++.......||++.|||.- T Consensus 258 ~~~~~~~e~~~~~~~~~------------~~~~~~~~~~~~i~~~~d~~~~~~~-~-sgl~d~l~~~~q~iaaa~~IP~t 323 (449) T protein:vir:10 258 SLYGVSIDELQDKFNEV------------AGEINRGNDVLMTTQGATVTPLVTS-V-ADPTATYNVNLQTAAAGVDIPTR 323 (449) T ss_pred HHhhCCchHHHHHHHHH------------HHHHhccchheeecCCcceEEEecc-c-CChhHHHHHHHHHHHHHhCCCee Confidence 11111111000000000 00111111 1123467778877754 3 47889999999999999999998 Q ss_pred HhhchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeee Q lcl|NC_019524. 389 QFSRDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWI 468 (556) Q Consensus 389 ~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~ 468 (556) .|.|--. ...+|. .-+.-+...++.+|+. ++|+.++.++..+.++.+..|..+.+. ++.-|. T Consensus 324 ~L~Gqsp-~glnst-~D~~nyyd~i~~~Q~~-----l~p~le~l~~~l~~s~~g~~~~d~~i~-----------f~pL~~ 385 (449) T protein:vir:10 324 ILIGNQQ-AERSST-EDQKYFNARCQSRRVD-----LSFEIEDFCDKLIELKIIDAVAKKAVI-----------WDDLNE 385 (449) T ss_pred eeeccCc-cccccc-hhHHHHHHHHHHHHHh-----hhHHHHHHHHHHHHhhcCCCCCceeEE-----------eCCCCC Confidence 8888543 345554 4455555566665543 367777777777778777666543221 123555 Q ss_pred cCcccccch-hhhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCC Q lcl|NC_019524. 469 GASRGQIDE-KKETEAAILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTS 547 (556) Q Consensus 469 ~p~~~~iDP-~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~ 547 (556) +-..+..|- .|-++|....+++|.... .+.+|+- +.+|...+. ......+++++ T Consensus 386 ~t~kEkAei~k~~A~a~~~~~~ag~~~~--------~~~~EiR----------~~~~~~~~~-------~~~~~~e~~de 440 (449) T protein:vir:10 386 QTGTEKLTNAKTMGEINQTMLGSGDNPA--------FSREEIR----------TAAGYDNDD-------EEPLGEEDGDE 440 (449) T ss_pred CCHHHHHHHHHHHHHHHHHHHHccccCC--------cCHHHHH----------HHhcccCCC-------CCCCCCCCCcc Confidence 655555543 334666667777773211 1222221 122332211 00011111111 Q ss_pred CCCCCcCCC Q lcl|NC_019524. 
548 DNPNEETTQ 556 (556) Q Consensus 548 ~~~~~e~~~ 556 (556) .++..++.- T Consensus 441 ~~~~~d~~a 449 (449) T protein:vir:10 441 EDKATDSAA 449 (449) T ss_pred ccccCCcCC Confidence 111111111 No 156 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=99.01 E-value=5.6e-09 Score=65.81 Aligned_cols=394 Identities=12% Similarity=-0.015 Sum_probs=206.7 Q ss_pred CcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHH Q lcl|NC_019524. 2 KDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAG 81 (556) Q Consensus 2 sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~ 81 (556) =..+.+.+...+-.- +. .+......-|+| ...-+.+ +.+ .-+.++.+.| ++ .+|.+- T Consensus 1 ~~~~~i~~L~~~~~~----~~--~r~~~~~~yY~g-~~~~~~~------~~~-------~p~~~~~~~~-~v--~nw~~~ 57 (409) T protein:vir:16 1 MTEKGIGYLRFKLSV----HK--RRAEMRYEQYAM-KHVDRFK------GIT-------IPQALSQQYR-SI--LGWCAK 57 (409) T ss_pred CCHHHHHHHHHHHHH----Hh--HHHHHHHHHHhc-cCchhhc------chh-------hhHHHHHHHh-hh--cChhHH Confidence 111122222211110 00 111111122443 2211111 011 1122333333 33 368899 Q ss_pred HHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceE Q lcl|NC_019524. 82 VVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVL 161 (556) Q Consensus 82 ~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f 161 (556) +|+.+++.++=.||+. + + +.+|+-|.. 
.+|...+..+.+..++-|.+| T Consensus 58 iVds~a~rl~~~Gf~~---~--------d----------~~l~~i~~~-----------N~ld~~~~~~~~~al~yG~sf 105 (409) T protein:vir:16 58 GVDSLADRLVFREFEN---D--------D----------FTVNEIFEE-----------NNPDIFFDSTVLSALIASCSF 105 (409) T ss_pred HHHHhHhhcccccccC---c--------c----------hHHHHHHHh-----------cChhHHHHHHHHHHHHhCcee Confidence 9999988777778752 1 1 124554433 268888999999999999999 Q ss_pred EEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEE---CCCCCeEEEEEeecCCCccccCCccccce Q lcl|NC_019524. 162 ATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQL---DNNGAALGYWLRKAFPGDPTDMEQWKWGY 238 (556) Q Consensus 162 ~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~---d~~Gr~vaY~i~~~hpgd~~~~~~~~~~r 238 (556) +.. +... ++. ..|+.++|..+-.-++. ..+++..++.+ |..|.++.+-++..+---.+......|.. T Consensus 106 ~~v-~~~~-----dg~---~~i~~~sP~~~~~i~D~-~~~~~~~a~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 175 (409) T protein:vir:16 106 TYI-SKGE-----NDA---VRLQVIEATNATGIIDP-ITGLLTEGYAVLERDENNNVVLEAHFLPDRTDYYYRDSRNNIS 175 (409) T ss_pred EEE-ecCC-----CCc---eEEEEEcccceEEEeec-ccccceeeeEEEEecCCCceEEEEEEecCcEEEEEecCccccc Confidence 875 3221 122 37899999988644432 23455566653 34566555433322211111222334444 Q ss_pred eeccccCChhHeEeeecccCCCcccCCchhh-HHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccc Q lcl|NC_019524. 239 EPARFDWGRRRVIHIIEALLAGQTRGISEMV-SALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQG 317 (556) Q Consensus 239 v~~~~~v~a~~viH~f~~~r~gQ~RGvs~la-~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~ 317 (556) +|- ..+.-=|+|++...+.+-.=|.|.++ |++..+..+.+-.--.+..+...|.=-.+|+--+. 
+ T Consensus 176 ~~~--~~g~vPvV~f~n~~~~~~~~G~seI~~~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~------------d 241 (409) T protein:vir:16 176 IAN--PTGNPLLVPIIHRPDAVRPFGRSRITRSGMYWQSNAKRTLERADVTAEFYSFPQKYVTGLSD------------D 241 (409) T ss_pred eec--CCCCcceEEecccccccccCCccccchhHHHHHHHHHHHHHHHHHHHHHhcChhheeEecCC------------C Confidence 442 23444488888877777667999987 56655555555444333444333322223321000 0 Q ss_pred cccccccccccccccccccccceecCCceeeecC---CCceeee--ecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhc Q lcl|NC_019524. 318 GFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLY---PGTKLKM--QPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSR 392 (556) Q Consensus 318 ~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~---pGe~i~~--~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~ 392 (556) ..........+|.|..++ .|+.+++ ++.+.. .+|.+.++.+.+.+|+-.++|-+.|.. T Consensus 242 ----------------~~~~~~~~~~~~~i~~~~~d~~g~~~~v~q~~~~~l-~~~~~~l~~~~~~~a~~s~lP~~~lg~ 304 (409) T protein:vir:16 242 ----------------AEPMETWKATVSSMLQFTKDEDGDKPTLGQFTQPSM-SPFTEQLRTAAAGFAGETGLTLDDLGF 304 (409) T ss_pred ----------------CCccchhhhhhhHhhccCCCCCCCCceEEecCCCCh-hHHHHHHHHHHHHHhhhcCCCHHHccc Confidence 001111234457777775 3455555 443332 478899999999999999999998876 Q ss_pred hhhc-ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHh--hCeeeec Q lcl|NC_019524. 393 DYTK-TNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDAL--CNAEWIG 469 (556) Q Consensus 393 D~s~-~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~--~~~~w~~ 469 (556) ...+ +|=-++++++....+..+..|..|-..+.+ +++. -.++..+.-..| ..+ +.+.|.+ T Consensus 305 ~~~NpsSa~Ai~a~~~~L~~ka~~k~~~fg~~l~~-~~rl--a~~~~~~~~~~~--------------~~~~~~~v~W~~ 367 (409) T protein:vir:16 305 VSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLN-VAYL--AACLRDDVPYLR--------------EQFSKTKPKWEP 367 (409) T ss_pred ccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHH--HHHHhcCCCccc--------------hhhccceEEecC Confidence 6532 233356777777777777777777666544 4442 223433321111 222 3567864 Q ss_pred Ccccc-cchhhhhHHHHHHHHcC-CCCHHHHHHH-hCCCHHH Q lcl|NC_019524. 
470 ASRGQ-IDEKKETEAAILRIKNG-LSTYEAEISR-LGGDFRE 508 (556) Q Consensus 470 p~~~~-iDP~Ke~~A~~~~i~~G-~~s~~~~~ae-~G~D~e~ 508 (556) .--+. -.-...+.|..+.+.+| .-..++++.+ .|.+-.+ T Consensus 368 ~~~~~~~s~a~~aDa~~Kl~~a~~~~~~~~v~~~~~g~~~~d 409 (409) T protein:vir:16 368 LFEADASMLSLIGDGAIKLNQAIPEFINKDTIRDLTGIKGAE 409 (409) T ss_pred CCCcchhhHHHHHHHHHHHHhhcccccchhHHHHhccCCCCC Confidence 32221 11245677788888887 3333455544 4876666 No 157 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=99.01 E-value=5.7e-09 Score=65.80 Aligned_cols=435 Identities=10% Similarity=0.008 Sum_probs=214.6 Q ss_pred CCcchhhhHHHHH--------hhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAK--------KAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDM 72 (556) Q Consensus 1 ~sp~~~~~r~~a~--------~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl 72 (556) |-|++-.....-+ ....... ....+.. ....|.-+.. .-... . ..... ..+ ..+ T Consensus 3 ~~~~~~~~~~~~~~~~~~~i~~~i~~~~-~~~~r~~-~~~~yy~g~~--~i~~~--~-~~~~~---~~~--------~ki 64 (453) T protein:vir:73 3 LKPIKLMTYSRDEEITDKVVNDFMKKHQ-EEVERYE-YLGNMYKGIM--EISSQ--K-AKDSW---KPD--------NRL 64 (453) T ss_pred cccceeeeccccccCCHHHHHHHHHHHH-HHHHHHH-HHHHHhcccc--chhcC--C-CCCcc---Ccc--------cee Confidence 4444443322111 0000000 0011111 1123332221 11110 0 00000 000 012 Q ss_pred HhcChHHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhh Q lcl|NC_019524. 73 VQNDGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVS 152 (556) Q Consensus 73 ~rNn~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r 152 (556) .+++++-+|+..+.+++|.+++..+. + +.....|+.|... 
.+|...+..+.+ T Consensus 65 --~~n~~~~ivd~~~~~l~g~~~~~~~~---------d-------~~~~~~l~~~~~~----------n~~~~~~~~~~~ 116 (453) T protein:vir:73 65 --TNNFAKYIVDTFVGYFNGIPIKKTHD---------D-------KSVLEAMQLFDNL----------NDMEDEESELAK 116 (453) T ss_pred --ecchHHHHHHHhhhhhcccCceeecC---------C-------hHHHHHHHHHHHh----------cChhHHHHHHHH Confidence 35799999999999999999887542 1 2233345555432 257778888889 Q ss_pred hheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEE--ECCCCCeEEEEEeecCCCcccc Q lcl|NC_019524. 153 GFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQ--LDNNGAALGYWLRKAFPGDPTD 230 (556) Q Consensus 153 ~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE--~d~~Gr~vaY~i~~~hpgd~~~ 230 (556) ..+.-|.+|..+.. ...+ .+++..++|+.+-.-+....+......|. +|..|.-. ..|+..+---.+. T Consensus 117 ~~~~~G~~~~~v~~-d~~~--------~~~i~~~~p~~~~~v~dd~~~~~~~~~i~~~~~~~~~~~-~~vyt~~~i~~~~ 186 (453) T protein:vir:73 117 IACVYGRAYELMYQ-NEST--------ESEVIYCSPLNVFMVYDDSIKQKPLFAVYYGFDEEGNLS-GTVYTLLETISIT 186 (453) T ss_pred HHHhcCeEEEEEEe-CCCC--------ceEEEEEcccceEEEEeCCCCceeEEEEEEEEecCceEE-EEEEeCCeEEEEE Confidence 99999999987543 2221 24788899988854343333444555554 34445432 2233221100001 Q ss_pred CCccccceee----ccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCccc Q lcl|NC_019524. 231 MEQWKWGYEP----ARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSD 306 (556) Q Consensus 231 ~~~~~~~rv~----~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~ 306 (556) .....|.-+. .+..+| |+++.. 
-..|.|.+.+++..+..++....-......-.+.-..+++-...++ T Consensus 187 ~~~~~~~~~~~~~~~~g~vP---vv~~~n-----~~~g~s~~~~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~~~~~ 258 (453) T protein:vir:73 187 GKAGEVKFGESTYNVYSDLP---IVEYNF-----NEERQSIFEPVHSLINSYNKVTSEKANDVEYFSDQYLVFLGAEVDE 258 (453) T ss_pred ecCCceEEccceeccCCcee---EEEecC-----CCCCCcchhhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCc Confidence 1111121111 011222 444332 2368899998777666555544333333333332223443211111 Q ss_pred ccccccccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCC Q lcl|NC_019524. 307 VVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMS 386 (556) Q Consensus 307 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ 386 (556) ..... .... .........++......++.++++.+.+.+..++..+.+.+...|..-.++| T Consensus 259 ~~~~~-----------~~~~--------~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p 319 (453) T protein:vir:73 259 EDAKN-----------IKDN--------RLINFFDKNSNGQGTNAAKVDVKFLDKPDSDVQTENLLNRLERSIFQFTMAA 319 (453) T ss_pred hhhhc-----------cccc--------ccccccccccccccccccCceeEEeeecCCHHHHHHHHHHHHHHHHHHhCCc Confidence 00000 0000 0000111233444455677889999988888999999999998888877776 Q ss_pred HHHhhchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCee Q lcl|NC_019524. 387 YEQFSRDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAE 466 (556) Q Consensus 387 ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~ 466 (556) --.. ..++++|-.+++..+..........|..|-..+ +.+++.++...-..|. +.. + .-+.+. 
T Consensus 320 ~~~~-~~~gn~Sg~Al~~~~~~l~~ka~~~~~~~~~~l-~~~~~li~~~~~~~~~---~~~-----~-------~~i~v~ 382 (453) T protein:vir:73 320 NISD-ENFGNSSGVALAYKLQAMSNLALSFQRKFQSAL-NRRYSLWSSLSTNASN---KDA-----W-------KDIEYT 382 (453) T ss_pred ccCc-ccccCccHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhccCC---ccc-----c-------ccceEE Confidence 3222 234556666677776666666666666665544 4455554432211221 100 0 113567 Q ss_pred eecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCC--CHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCC Q lcl|NC_019524. 467 WIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGG--DFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSE 544 (556) Q Consensus 467 w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~--D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~ 544 (556) |..+-. .|....+++..+. .|+.|.+..+...+. |+++.++++.+|++...+...... + ..++ T Consensus 383 f~~~~p--~~~~~~a~~~~k~--~giis~et~~~~~~~~~d~~~E~~ri~~E~~~~~~~~~~~~--~---------~~~~ 447 (453) T protein:vir:73 383 FTRNEP--KDIKEQAETANIL--KGITSEETALSVISVIPDVQAEMEKIKKKKLLQLSLTRTSN--L---------VRMK 447 (453) T ss_pred eCCCCC--CCHHHHHHHHHHH--hccCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhcc--C---------Ccch Confidence 844332 4655555555544 489999999998876 899999999999876544322110 0 0000 Q ss_pred CCCCCCCC Q lcl|NC_019524. 545 STSDNPNE 552 (556) Q Consensus 545 ~~~~~~~~ 552 (556) . +..+- T Consensus 448 ~--~~~~~ 453 (453) T protein:vir:73 448 Q--MRGNL 453 (453) T ss_pred h--hhcCC Confidence 0 00000 No 158 >protein:vir:858 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047117;genbank:gi:9630570;genbank:GeneID:1261758 Probab=98.99 E-value=4.7e-11 Score=77.21 Aligned_cols=378 Identities=7% Similarity=-0.031 Sum_probs=181.3 Q ss_pred chhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHH Q lcl|NC_019524. 
4 VKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVV 83 (556) Q Consensus 4 ~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v 83 (556) |.=.. +...+.+ +.........|.. ..++...|++.+.++| T Consensus 1 M~~f~-------------------k~~~~~~--~~~~~~~~~~~~~------------------~~~~~~~~~~~v~~~v 41 (378) T protein:vir:85 1 MNLFG-------------------KVVSFSR--GKLNNDTQRVTAW------------------QNEAVEYTSAFVTNIH 41 (378) T ss_pred Cchhh-------------------hhhhhhh--cccccCCcceeee------------------eccchhhhhHHHHHHH Confidence 11111 1111111 1111111111111 1112345778889999 Q ss_pred HHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEE Q lcl|NC_019524. 84 AVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLAT 163 (556) Q Consensus 84 ~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~ 163 (556) +.|.+.|-.--|.+.-+... +...+ ......-..+...+-.+|+ -.+|.+++-...+..++..|++++. T Consensus 42 ~~ia~~iA~lp~~~~~~~~~---~~~~~--~~~~~~~~~l~~lL~~~PN------~~~t~~~f~~~~~~~lll~Gnayi~ 110 (378) T protein:vir:85 42 NKIANEITKVEFNHVKYKKS---DVGSD--TLISMAGSDLDEVLNWSYK------GEHNSMEFWQKVIKKLLCTRYVDLY 110 (378) T ss_pred HHHHHhHhhCceeEEEEecc---ccccc--cccccccchHHHHHhccCC------CCCCHHHHHHHHHHHHhhcCCeEEE Confidence 99999988765555322110 00000 0000011122333434554 2478899999999999999999976 Q ss_pred EeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceeeccc Q lcl|NC_019524. 164 CEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPARF 243 (556) Q Consensus 164 ~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~~~ 243 (556) +++. |..|+.+.++... . .. 
T Consensus 111 ~i~~------------------------------------------~~~g~~~~~~~~~---------~---------~~ 130 (378) T protein:vir:85 111 PIFD------------------------------------------SETGELLDLLFAN---------D---------KK 130 (378) T ss_pred Eeec------------------------------------------CCCceEEEEEecC---------C---------CE Confidence 4321 1233333222211 0 01 Q ss_pred cCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccc Q lcl|NC_019524. 244 DWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIF 323 (556) Q Consensus 244 ~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~ 323 (556) .+...+|||+..+..... +++.+..++. .+..+.-++...++++.+..-... ......+.+ T Consensus 131 ~~~~~dvih~~~~~~~~~--~~~~~~~a~~-----------~~~~~~~~~~~~g~l~~~~~l~~~------~~~~~~~~~ 191 (378) T protein:vir:85 131 EYKPEELVRLVSPFYINE--DTSILDNALA-----------SIQTKLEQGKLRGLLKINAFLDID------NTQEYREKA 191 (378) T ss_pred EEcccceEEEecCcCccc--hhhHHHHHHH-----------HHHHHHhcCCcceEEEeCCcCCHH------HHHHHHHHH Confidence 233468999986654333 2333332222 122233345667777754321100 000000100 Q ss_pred cccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccchhHH Q lcl|NC_019524. 324 NEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSAR 403 (556) Q Consensus 324 ~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R 403 (556) ..... .....-..|.+..|..|.+++.++.+....++ .+.+.+..+||..+|||-+.|.+ +|+. T Consensus 192 ~~~~~--------~~~~~~~~g~~~vl~~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fgVPp~~l~~-----s~~e-- 255 (378) T protein:vir:85 192 LATIK--------NMQEGSSYNGLTPVDNKTEIVELKKDYSVLNK-DEIELIKSELLTGYFMNENILLG-----TATQ-- 255 (378) T ss_pred HHHHH--------HhhcccccccceecCCCceEEeccCChhhhhH-HHHHHHHHHHHHHhCCCHHHhcC-----CchH-- Confidence 00000 00011235678899999999998765555555 45677788999999999988854 3321 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHH Q lcl|NC_019524. 
404 ASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEA 483 (556) Q Consensus 404 ~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A 483 (556) +. ...++..-+.|+..++ +.++....|+-..-.. .. .......+.+-...-...|+...+++ T Consensus 256 ~~-----------~~~f~~~tL~P~~~~i-e~~l~~kLl~~~er~~-~~-----~~~~~~~~~f~~~~l~~~d~~~~~~~ 317 (378) T protein:vir:85 256 EQ-----------QIYFYNSTIIPLLIQL-EKELTYKLISTNRRRV-VK-----GNLYYERIIVDNQLFKFATLKELIDL 317 (378) T ss_pred HH-----------HHHHHHHHHHHHHHHH-HHHHHhhcCChhhhhh-hh-----hccccceeeecchhhhhcCHHHHHHH Confidence 11 1124556677766654 4444444432110000 00 00001123333445555699999999 Q ss_pred HHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 484 AILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 484 ~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 556 (556) ....+.+|+.|+-|+-+..|..|-+--+ ++=++....+.. ..............++++++| T Consensus 318 ~~~~~~~G~~T~NE~R~~lgl~p~~gGD----------~~~~~~N~~~~~--~~~~~~~~~~~~~~~~e~~n~ 378 (378) T protein:vir:85 318 YHENINGPIFTQNQLLVKMGEQPIEGGD----------IYIANLNAVAVK--NLSDLQGSRKDVASTDETNNQ 378 (378) T ss_pred HHHHHhCCCcCHHHHHHHhCCCCCCCCC----------eEeecccccccc--cchhhcCccCCCCCCCCCCCC Confidence 9999999999999998888888754211 111111111111 111111222223344445555 No 159 >protein:vir:4089 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510984;swissprot:trembl:q8w606;genbank:gi:17488506;uniprot:Q8W606;genbank:GeneID:1260314 Probab=98.98 E-value=1.5e-09 Score=69.04 Aligned_cols=390 Identities=9% Similarity=0.012 Sum_probs=174.0 Q ss_pred chhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHH Q lcl|NC_019524. 
4 VKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVV 83 (556) Q Consensus 4 ~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v 83 (556) |.=..|..- . +....++.. ...|. ..+.. ..-..+.+..++.+..+| T Consensus 1 Mg~~~~~~~-------------------~-~~~~~~~~~-~~~~~--~~~~~----------~~~~~~~~l~~~~v~~~v 47 (395) T protein:vir:40 1 MGFKSWVSG-------------------F-FNEEQRTLN-LTDTV--WCSIP----------SEKLKELSIKKWAIDSCA 47 (395) T ss_pred CchHHHHHh-------------------h-hcccccccc-cccch--hhccc----------cccchhhhhhhHHHHHHH Confidence 222111110 0 000011111 11111 11111 111122344567788999 Q ss_pred HHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEE Q lcl|NC_019524. 84 AVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLAT 163 (556) Q Consensus 84 ~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~ 163 (556) +.|.+.|-..-|++.-+- + ..... .+..+-.+|+ ..+|.+++....+..++..|++|+. T Consensus 48 ~~Ia~~ia~~p~~~~~~~--------~----~~~~~---~~~lL~~~PN------~~~t~~~f~~~~~~~lll~Gnay~~ 106 (395) T protein:vir:40 48 NKIANTLSCAEVLTYEKG--------E----EVRKK---NWYMFNVEAN------QNQNATEFWKKAIYKLVYDNEALIF 106 (395) T ss_pred HHHHHHHhhCceeeccCC--------c----cccch---HHHHHHhcCC------CCCCHHHHHHHHHHHHhhcCceEEE Confidence 999998887666553210 1 11111 2223333554 3578999999999999999999987 Q ss_pred EeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceeeccc Q lcl|NC_019524. 164 CEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPARF 243 (556) Q Consensus 164 ~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~~~ 243 (556) +.... + .+........... .......|.++.++ + .. 
T Consensus 107 ~~~~~--------------~-~~~~~~~~~~~~~--~~~~~~~v~~~~~~------~---------------------~~ 142 (395) T protein:vir:40 107 MQDEY--------------I-YVADSFTKNDKSL--YENTYTEVTLKDLT------L---------------------KK 142 (395) T ss_pred EecCc--------------e-eecCCcccccccc--ccceeeeeeecCce------e---------------------ee Confidence 64211 0 0111111111000 00111222222110 0 12 Q ss_pred cCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhccee--eeEeccCcccccccccccccccccc Q lcl|NC_019524. 244 DWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYA--ASVESELPSDVVFGQLGMGQGGFKE 321 (556) Q Consensus 244 ~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~--~fi~~~~~~~~~~~~~~~~~~~~~~ 321 (556) .++..+|||+......+ .+.+.+++.....+ ....+.+..-++... +.++..... ..+... T Consensus 143 ~~~~~evih~r~~~~~~----~~~~~~l~~~~~~~---~~~~~~~~~~~~~~~~~l~~~~~~~~----------~~~~~~ 205 (395) T protein:vir:40 143 EFKESEVLHLTLNNESI----KSIIDGFYLLYGDL---LTAAVNKYKKLNSRKIIVKLKAMFGQ----------TPEAEE 205 (395) T ss_pred eeccccEEEeecCCCCc----cccchhHHHHHHHH---HHHHHHHHHhcCCCCceEEEecccCC----------CHHHHH Confidence 35677999986433222 22222222222111 111122222222222 222221110 000000 Q ss_pred cccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHH---HHHHHHHHHhcCCCHHHhhchhhccc Q lcl|NC_019524. 322 IFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYE---QSLLRNIAASLGMSYEQFSRDYTKTN 398 (556) Q Consensus 322 ~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~---~~~lr~iaaglGi~ye~l~~D~s~~n 398 (556) ....... .... ...-..|.+..|..|.+++.++.+....+|.+.. 
+.+.++||..+|||-..|.+ + T Consensus 206 ~~~~~~~---~~~~---~~~~~~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~~~~Ia~~fgVPp~~l~~-----~ 274 (395) T protein:vir:40 206 KLRLMLS---ERMK---KFLAEGDSALPVEDGMEIDELAGDSKIAESRDIKKMIDDVFEMVANSFNIPLGLAKG-----D 274 (395) T ss_pred HHHHHHH---HHHH---HhhccCCceeecCCCceEEeccCChhhhhHHHHHHHHHHHHHHHHHHhCCCHHHhcC-----C Confidence 0000000 0000 0011356677899999999999877777786544 44578999999999998754 4 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchh Q lcl|NC_019524. 399 YSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEK 478 (556) Q Consensus 399 YSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~ 478 (556) ||++.+. ...++...+.|+.+.+ +.++-...++-..... -..+++-.-.....|+. T Consensus 275 ~sn~e~~-----------~~~f~~~~L~P~~~~i-e~~l~~kLl~~~~~~~------------g~~i~fd~~~ll~~d~~ 330 (395) T protein:vir:40 275 TVGLSEQ-----------VNSFLMFSINPIAEMF-TDEGNRKFYGRDSVLE------------RTYMKLDTTRIKVQDIQ 330 (395) T ss_pred CcCHHHH-----------HHHHHHHHHHHHHHHH-HHHHHHhcCChhhhcC------------CceEEEechhhhccCHH Confidence 5554332 2345566667766654 4445444443211100 00122222233445888 Q ss_pred hhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCcCC Q lcl|NC_019524. 479 KETEAAILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEETT 555 (556) Q Consensus 479 Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~ 555 (556) ..+++..+.+.+|+.|+-|+-+..|.+|-+- ...+++=++....+. ...... 
.++.+++.++.++ T Consensus 331 ~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~~--------~~gD~~~~~~n~~~~---~~~~~~-~kgge~~~~~~~~ 395 (395) T protein:vir:40 331 EIASSMDVLFHIGVNTIDDNLRMIGREPVMS--------PETQERFVTKNYAPL---GENEED-LKGGDINENKGDS 395 (395) T ss_pred HHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC--------CCCceeeeccccccc---cccccc-cCCCCCCCCcCCC Confidence 8888889999999999999988888887541 000010011110000 000111 1111111111111 No 160 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=98.95 E-value=1e-08 Score=64.37 Aligned_cols=423 Identities=8% Similarity=-0.021 Sum_probs=188.9 Q ss_pred chhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHH---HHHHHHHHHHHHh------ Q lcl|NC_019524. 4 VKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQN---QDMASARAQDMVQ------ 74 (556) Q Consensus 4 ~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~---~~~lr~RaRdl~r------ 74 (556) |....|-.-..--...- ..-..-+.-. ..| ....+... +..+.. ..+.+. T Consensus 1 ~~~~~~~~~~~~~~~~~----------~~~~~~~~~~----~~~------i~~~i~~~~~~~~r~~~-~~~Yy~g~~~i~ 59 (478) T protein:vir:10 1 MISINWPWDKPYHEQVV----------EQIKPKYETQ----EEM------ILRLVREHKENIDNITM-GERYYNHHPDIL 59 (478) T ss_pred CccccccCCchhhhHHH----------HHhhhccCCh----HHH------HHHHHHHHHHHHHHHHH-HHHHhccccccc Confidence 22222111000000000 0000000000 000 00111110 111100 011111 Q ss_pred ---------------------cChHHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhccccc Q lcl|NC_019524. 75 ---------------------NDGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPEN 133 (556) Q Consensus 75 ---------------------Nn~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~ 133 (556) .+++++-+|++.+.+++|.+++..+- . ++..+. 
++.|..+ T Consensus 60 ~~~~~~~~~~~~~~~~~~~ki~~n~~k~ivd~~~~yl~g~p~~~~~~---------~---~~~~~~----l~~~~~n--- 120 (478) T protein:vir:10 60 DAPFKRDVNGDYDETKPDWRMYTNYHQNLVDQKVAYAVANPVTFGVD---------N---DKALKQ----IQHTLNH--- 120 (478) T ss_pred ccchhhhcccccccccccceeccchHHHHHHHHhhhhcccCceeecC---------C---hHHHHH----HHHHHhc--- Confidence 25799999999999999998887532 1 122222 3333321 Q ss_pred ceehhcccCHHHHHHHHhhhheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEE---C Q lcl|NC_019524. 134 WFDARRMCTLTGLTRLAVSGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQL---D 210 (556) Q Consensus 134 ~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~---d 210 (556) +|.....-+.+....-|.+++++.+ ...+ .+++..++|+.+---+.....+.+..+|.+ + T Consensus 121 --------~~~~~~~~~~~~~~~~G~~~~~v~~-d~~~--------~~~~~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~ 183 (478) T protein:vir:10 121 --------KWDDKLVDILTAASNKGIEWVQPYV-DEEG--------EFKTFRVPAEQAVPIWTNKERDELQAFIRVYELD 183 (478) T ss_pred --------cHHHHHHHHHHHHhhCCeEEEEEEe-cCCC--------ceEEEEEcccceEEEEcCCCCCceEEEEEEEeee Confidence 3555555566788889999977543 3221 268899999987432222222335555522 1 Q ss_pred CCCC-------eEEEEEeecCCC-ccccC--Cc-ccccee---e-ccccCChhHeEeeecccCCCcccCCchhhHHHHHH Q lcl|NC_019524. 211 NNGA-------ALGYWLRKAFPG-DPTDM--EQ-WKWGYE---P-ARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQM 275 (556) Q Consensus 211 ~~Gr-------~vaY~i~~~hpg-d~~~~--~~-~~~~rv---~-~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l 275 (556) .... .+-||.+..-.. ..... .+ ..+..+ + ....+| |+|+... 
.+|+|.|.+++..+ T Consensus 184 ~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vP---vv~~~n~-----~~g~sd~e~v~~li 255 (478) T protein:vir:10 184 GAERVEYWTKDDVTFYELKEGQLIPDFYRSEDHIQPHYYQGNKLMSWGRVP---FIPFKNN-----PQEVSDLFMYKTII 255 (478) T ss_pred CceEEEEEeCCcEEEEEecCCeeeccccccccccccceecccccccCCcce---EEEeccC-----CCCCCcHHHHHHHH Confidence 1111 122222221100 00000 00 000000 0 011122 4444443 37999999977777 Q ss_pred HHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccccccccccccccccccceecCCceee--ecCCC Q lcl|NC_019524. 276 KMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIP--HLYPG 353 (556) Q Consensus 276 ~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~--~L~pG 353 (556) ..++....-......-.+.-..+++-..++. . .. ....+....+. ....| T Consensus 256 Da~~~~~S~~~~~~~~~~~~~~~~~g~~~~~---------~-------~~------------~~~~~~~~~~~~~~~~~~ 307 (478) T protein:vir:10 256 DALDKRLSDTQNTFDESVELIYILKGYEGED---------M-------KD------------FMHNLKYYKAISVAGESG 307 (478) T ss_pred HHHHHHHHHHHHHHHHhhCcceeeecCCccc---------c-------cc------------hhhhhhhCceeEecCCCC Confidence 6666544333333222221112232111000 0 00 00012222222 23456 Q ss_pred ceeeeecCCCCCccHHHHHHHHHHHHHHhcCCC---HHHhhchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 354 TKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMS---YEQFSRDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYT 430 (556) Q Consensus 354 e~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~---ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~ 430 (556) .++++++.+.+...+..+++.+.+.|....++| .+.++++.|+ -+.+..+..........+..|-. 
.++.+++ T Consensus 308 ~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg---~Ai~~~~~~l~~k~~~~~~~~~~-~l~~~~~ 383 (478) T protein:vir:10 308 SGVDTIKVEVPIDSVKEYTKMLRDYIIEFGQGVDFQQDKFGNSPSG---IALKFMYSNLDLKANKLKNKTLT-ALQELLQ 383 (478) T ss_pred CcceEEeecCCHHHHHHHHHHHHHHHHHHhCCcCcCccccccchHH---HHHHHHHHHHHHHHHHHHHHHHH-HHHHHHH Confidence 789999988889999999999999988888876 2333333332 24444433333333333333322 2333333 Q ss_pred HHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCC--CHHH Q lcl|NC_019524. 431 LWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGG--DFRE 508 (556) Q Consensus 431 ~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~--D~e~ 508 (556) ..++ ++.+.... .-+.+.|.. ..+ .|....+++ .....|+.|.+..+...+. |+++ T Consensus 384 li~~--~~~~~~d~----------------~~i~i~f~~-~~p-~~~~e~~~~--~~~~~g~iS~et~i~~~~~v~d~~~ 441 (478) T protein:vir:10 384 YIID--FYRLDVRV----------------QDIEITFNF-NVM-VNELENSQI--AMNSTGLLSKETILGNHSWVQDPVA 441 (478) T ss_pred HHHH--HhCCCccc----------------ccceEEeCC-CCC-CCHHHHHHH--HHHHhCCCChHHHHHhCCCCCCHHH Confidence 3222 33322110 113567733 333 454444444 3445899999999998875 8999 Q ss_pred HHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCc Q lcl|NC_019524. 509 VFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEE 553 (556) Q Consensus 509 v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e 553 (556) .++++.+|++.....-... 
..+....+.+.+++.++| T Consensus 442 E~~ri~~E~~~~~~~~~~~--------~~~~~d~~~~~~~d~~~e 478 (478) T protein:vir:10 442 EMERIEQENIELNQQLPDI--------EEGLNDEQQRQSEDNQSE 478 (478) T ss_pred HHHHHHHHHHHHHHhcccc--------CCCCcccccccCcCCCCC Confidence 9999999987765432211 011111111111111112 No 161 >protein:vir:9641 Length: 395 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795403;genbank:gi:28876176;genbank:GeneID:1257709 Probab=98.95 E-value=3.2e-09 Score=67.14 Aligned_cols=383 Identities=12% Similarity=0.030 Sum_probs=174.7 Q ss_pred chhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHH Q lcl|NC_019524. 4 VKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVV 83 (556) Q Consensus 4 ~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v 83 (556) |.=+.+ .. ..+...... ......+...+-..+..++.+.++| T Consensus 1 Mgl~d~-------------------------~~--------~~~~~~~~~-----~~~~~~~~~~~~~~~l~~~~v~~~i 42 (395) T protein:vir:96 1 MGILDF-------------------------FS--------FKKSGTLSD-----DDSGSTTSEKLTNVVLKEDALYKCV 42 (395) T ss_pred Ccchhh-------------------------hc--------CCCCccccc-----cccccchhhhcchhhhhhHHHHHHH Confidence 111000 00 000000000 0011122233445556778889999 Q ss_pred HHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEE Q lcl|NC_019524. 84 AVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLAT 163 (556) Q Consensus 84 ~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~ 163 (556) +.|.+.|-..-|++.-+ + -....... +...+..+|+ -.+|..++....+..++..|++|+. 
T Consensus 43 ~~Ia~~ia~lp~~v~~~-~-----~~~~~~~~-------~~~lL~~~PN------~~~t~~~f~~~l~~~lll~Gna~~~ 103 (395) T protein:vir:96 43 NYLARIISKSTFRIKAP-E-----KLTENQKD-------WLYWINTKAN------PNQSASQFWVEVVQKLLVDGETLIF 103 (395) T ss_pred HHHHHhhccceeEEEeC-C-----ccccccch-------HHHHHhhcCC------CCCCHHHHHHHHHHHHhhcCceEEE Confidence 99999998876666432 1 01111111 1222223554 2468889999999999999999998 Q ss_pred EeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceeeccc Q lcl|NC_019524. 164 CEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPARF 243 (556) Q Consensus 164 ~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~~~ 243 (556) +.... . + .-++........ ...+...|.++. |.+ .. T Consensus 104 ~~~~~--~-----------~--~~~~~~~~~~~~--~~~~~~~v~~~~------~~~---------------------~~ 139 (395) T protein:vir:96 104 VIPGK--G-----------I--YVADAFTQDKKL--SGNKFKVSRVQG------QTY---------------------EK 139 (395) T ss_pred EEcCC--c-----------e--ecCCcccccccc--ccceeeeeeecc------cee---------------------ee Confidence 75421 0 0 111111111100 001222222221 111 11 Q ss_pred cCChhHeEeeecccCCCcccCCchhh---HHHHHHHHHHHHHHHH---HHHHHHhcceeeeEeccCcccccccccccccc Q lcl|NC_019524. 244 DWGRRRVIHIIEALLAGQTRGISEMV---SALKQMKMTRNFQEIT---LQNAVVNATYAASVESELPSDVVFGQLGMGQG 317 (556) Q Consensus 244 ~v~a~~viH~f~~~r~gQ~RGvs~la---~~l~~l~~l~~~~dae---l~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~ 317 (556) .++..+|||+......+...+-+.+. ++|.....+.....+. +...+-.+...+.++...+... . T Consensus 140 ~~~~~dvih~k~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~ 210 (395) T protein:vir:96 140 IFTFDQVIYLKNDNSDLMLKVESLWEEYGELLGHVINNQKIANQIRFTMTPPKDKVRERAQENSDGGRQP---------K 210 (395) T ss_pred EeccCceEEecccCCccccccccccchHHHHHHHHHHHHHHHHHHHHHhhhcccccccceeeccCchhhH---------H Confidence 35667999997665444444433322 2222222221111111 0111111111122221111000 0 Q ss_pred cccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHH------HHHHHHHHHhcCCCHHHhh Q lcl|NC_019524. 
318 GFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYE------QSLLRNIAASLGMSYEQFS 391 (556) Q Consensus 318 ~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~------~~~lr~iaaglGi~ye~l~ 391 (556) ........... ...-..+.+..|..|.+++.++.+....++.+.. +....+||..+|||-+.|. T Consensus 211 ~~~~~~~~~~~----------~~~~~~~~v~~l~~g~~~~~l~~~~~d~q~~e~~~~~~~~~~~~~eIa~~fgVPp~~l~ 280 (395) T protein:vir:96 211 SDKDFFKRTIE----------KIRTESVVGIPVTANTNYEEYGSKNTGSVKSYVDDIKKLKDQYMAEFAEMLGIPISLLH 280 (395) T ss_pred HHHHHHHHHHH----------HhhcCCcceEEccCCceeEecccChhhhhhhhHHHHHHHHHHHHHHHHHHhCCCHHHhc Confidence 00000000000 0011345577799999999988776666554433 3457899999999999884 Q ss_pred chhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCc Q lcl|NC_019524. 392 RDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGAS 471 (556) Q Consensus 392 ~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~ 471 (556) .+||++.+... .|+..-+.|+...| +.++....++-...... ..... -. T Consensus 281 -----~~~sn~e~~~~-----------~f~~~~L~P~~~~i-e~~l~~~Ll~~~e~~~~------------~~f~~--~~ 329 (395) T protein:vir:96 281 -----GDIADNQKNYE-----------LLLEGPIESLITNI-VDGLEYAIFDKSETLEG------------SFIKV--TG 329 (395) T ss_pred -----CCCccHHHHHH-----------HHHHHHHHHHHHHH-HHHHHhhcCChhhhcCc------------eeEee--cc Confidence 46666543332 34555667766664 44454444431111000 01111 22 Q ss_pred ccccchhhhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCC Q lcl|NC_019524. 472 RGQIDEKKETEAAILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPN 551 (556) Q Consensus 472 ~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~ 551 (556) ....|+...+++....+++|+.|+-|+-+..|..|-+- ...+++=++... 
.+.+ +..+|+.+ T Consensus 330 l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~pi~~--------~~gD~~~~~~N~--------~~~~--~~gge~~~ 391 (395) T protein:vir:96 330 LKNYDLFSISSQADKLISSGFVFIDEVREEIGLPELPD--------GLGKVLYMTKNY--------ESVL--ERGGEVDE 391 (395) T ss_pred hhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--------CCCceeeecccc--------eech--hccCCCCC Confidence 33459999999999999999999999888888777431 000011011000 0000 01111111 Q ss_pred CcCC Q lcl|NC_019524. 552 EETT 555 (556) Q Consensus 552 ~e~~ 555 (556) +..| T Consensus 392 ~~~~ 395 (395) T protein:vir:96 392 EVET 395 (395) T ss_pred CCCC Confidence 1111 No 162 >protein:vir:78310 Length: 376 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468642;genbank:gi:157325220;genbank:GeneID:5601655 Probab=98.92 E-value=4.2e-09 Score=66.48 Aligned_cols=371 Identities=10% Similarity=0.059 Sum_probs=169.2 Q ss_pred chhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHH Q lcl|NC_019524. 4 VKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVV 83 (556) Q Consensus 4 ~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v 83 (556) |.=..|..-+ +....|..... ....-+-..+-.++.+..+| T Consensus 1 Mg~f~~l~~~----------------------------~~~~~~~~~~~-----------~~~~~~~~~~l~~~~v~~~i 41 (376) T protein:vir:78 1 MGFFSELFKR----------------------------NKEIEWMWDLD-----------FLEDKTTKVYLKKMALNTCV 41 (376) T ss_pred Cchhhhhhcc----------------------------CCccccccchh-----------hccccchhhhhhhHHHHHHH Confidence 2111111100 00001100000 00001111223456788999 Q ss_pred HHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEE Q lcl|NC_019524. 84 AVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLAT 163 (556) Q Consensus 84 ~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~ 163 (556) +.+.+.+...-|++.-+- ... . 
...+..+..+|+ ..+|..++....+..++..|++|+. T Consensus 42 ~~Ia~~ia~~p~~~~~~~--------~~~----~---~~l~~ll~~~PN------~~~t~~~f~~~~~~~lll~Gn~~~~ 100 (376) T protein:vir:78 42 KHIARTIAKSDFRLKNGE--------TSV----R---DKLYYKLNIRPN------TDMSSSSFWEKVIYKLIYDNECLIV 100 (376) T ss_pred HHHHHhhcccceeecccc--------ccc----c---chHHHHHhhccc------cCCCHHHHHHHHHHHHhHcCcEEEE Confidence 999999988766654210 000 1 123333444554 3578899999999999999999988 Q ss_pred EeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceeeccc Q lcl|NC_019524. 164 CEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPARF 243 (556) Q Consensus 164 ~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~~~ 243 (556) +..... .++..+-.+.+..+ ......+|.++.++ + .. T Consensus 101 ~~r~~~--------~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~------~---------------------~~ 137 (376) T protein:vir:78 101 LSDTDD--------FLIADSYVRKEFAF--------FPDVFEGVTVKDYR------Y---------------------NR 137 (376) T ss_pred EEeCCC--------eeeccceeecccce--------eeeeeeeeeeecce------e---------------------ee Confidence 754221 11111111111111 00111122221111 0 01 Q ss_pred cCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccc Q lcl|NC_019524. 244 DWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIF 323 (556) Q Consensus 244 ~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~ 323 (556) .++.++|||+.....+ |.+.+.+.+... .......+.+..-+....+.++...+.. ..++..+.. 
T Consensus 138 ~~~~~evih~~~~~~~----~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~e~~~~~ 202 (376) T protein:vir:78 138 NFSMDDVIFLEYGNER----LSAFTDGMFEDY---GELFGKMIRAQMRNFQIRGAVNFKMAGV--------ADKDKQTKL 202 (376) T ss_pred eeccccEEEeccCCCC----chhhhhHHHHHH---HHHHHHHHHHHHhcCCCceeEEEccCCC--------CCHHHHHHH Confidence 3567799998654433 333333333322 2222233333333333333322211100 000000001 Q ss_pred cccccccccccccccceecCCceeeecCCCceeeeecCCCCCc-----cHHHHHHHHHHHHHHhcCCCHHHhhchhhccc Q lcl|NC_019524. 324 NEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGG-----VGTDYEQSLLRNIAASLGMSYEQFSRDYTKTN 398 (556) Q Consensus 324 ~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~-----~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~n 398 (556) .+.... ......-..+.|..|..|.+++.++.+.... +|.+..+....+||..+|||-+.|.+ + T Consensus 203 ~~~~~~------~~~g~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~fgVPp~~l~~-----~ 271 (376) T protein:vir:78 203 QEYIDK------VYASFNNNEIAIVPQLEGFNYEEFGTTSVNNSQSFDEVKKLRKEMIDYVASILGIPSSLLHG-----D 271 (376) T ss_pred HHHHHH------HhccccccCcceEEcCCCceEEeeccCccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhCC-----C Confidence 110000 0000011345678899999999887654332 56677788899999999999998854 5 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchh Q lcl|NC_019524. 399 YSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEK 478 (556) Q Consensus 399 YSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~ 478 (556) ||++.+.. ..|+..-+.|+...+ +.++-+..++ +... ...++. . ...-.|+. 
T Consensus 272 ~s~~e~~~-----------~~f~~~~l~P~~~~i-e~~l~~kll~-~~~~--~~~~~~---~----------~ll~~d~~ 323 (376) T protein:vir:78 272 MADLSNNM-----------KAYMEYCIDPLTKKL-EDELNAKLFT-FSEF--LAGEHI---K----------IIHKKDII 323 (376) T ss_pred CCCHHHHH-----------HHHHHHHHHHHHHHH-HHHHHhhhCC-cccc--eecccc---h----------hhcccCHH Confidence 55543332 234555566755553 4444444433 2111 110110 1 11234888 Q ss_pred hhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCC Q lcl|NC_019524. 479 KETEAAILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNP 550 (556) Q Consensus 479 Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~ 550 (556) ..+++....+++|+.|+-|+-+..|.+|-+ ++....--...+-.+-+.-++++ T Consensus 324 ~~~~~~~~~~~~G~~t~NE~R~~lg~~p~~-------------------~g~~d~~~~~~n~~~~~~~~e~g 376 (376) T protein:vir:78 324 ENAEAVDKLVASGSFNRNEVRELLGAERVD-------------------NPELDKYLITKNYQSADEGGEDG 376 (376) T ss_pred HHHHHHHHHHhCCCcCHHHHHHHhCCCCCC-------------------CCCCceeeeccCceehhccccCC Confidence 899999999999999998877666665511 01000000011111111112222 No 163 >protein:vir:267 Length: 348 # NCBI annotation: putative capsid portal protein # Family: family:all:196 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536647;genbank:gi:17975125;genbank:GeneID:929081 Probab=98.91 E-value=8.7e-10 Score=70.26 Aligned_cols=333 Identities=10% Similarity=0.046 Sum_probs=161.2 Q ss_pred CCcchhhhHHHHHhhHhhcccchh--hhhh---hhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETAT--ATPM---AVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQN 75 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~--~~~~---~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rN 75 (556) |+.-....-.....+ ...+-..- +... .....|-.--..+ ...|..++-+.... 
-.|.+- T Consensus 1 ~~~~~~~~~~~~~~~-~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~--~~~~~epp~~~~~L------------a~l~~~ 65 (348) T protein:vir:26 1 MTEQLIHSHTTDGTE-SKSVYSFDPNPEPVDTNSWMTRYCELFYND--FDDYWEPPISLKGL------------AEIANA 65 (348) T ss_pred CCccccchhhccccC-CceEEEecCCCeeecCcchHHHHHHHHhcC--CCccccCCCCHHHH------------HHHHhh Confidence 432222110000000 00000000 0000 0001121111111 22344444443211 123455 Q ss_pred ChHHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhhe Q lcl|NC_019524. 76 DGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFL 155 (556) Q Consensus 76 n~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~ 155 (556) |++-.++|..-++.+.++ ++|.+ .+|..++.+++. .++ T Consensus 66 n~~h~~~i~~k~N~l~~~-~~Pn~----------------------------------------~~t~~~f~~~~~-d~l 103 (348) T protein:vir:26 66 NGYHGSLLKARANYVAGR-FMNGG----------------------------------------GLPMYKMNSACW-DYF 103 (348) T ss_pred hhhhhhhHhhhhhHHhhc-ccCCC----------------------------------------CCCHHHHHHHHH-HHH Confidence 777777777666655542 33322 234444545543 567 Q ss_pred ecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccc Q lcl|NC_019524. 156 MTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWK 235 (556) Q Consensus 156 ~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~ 235 (556) ..|.+++.++.... .-++.|..|.+..+.. ..+|. .|++... | T Consensus 104 l~Gnay~~~~rn~~--------G~~~~L~~l~~~~v~~----------------~~d~~--~~~~~~~--g--------- 146 (348) T protein:vir:26 104 GLGMSAFVKIRSYL--------KNVIALEPLPMVHMRK----------------RKNGD--FVQLLRN--N--------- 146 (348) T ss_pred hcCCeEEEEEEcCC--------CcEEEEEEecCceeEe----------------eecCc--EEEEEec--C--------- Confidence 77999988764321 2356788887765531 12232 1222211 1 Q ss_pred cceeeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHH---hcceeeeEeccCccccccccc Q lcl|NC_019524. 
236 WGYEPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVV---NATYAASVESELPSDVVFGQL 312 (556) Q Consensus 236 ~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i---~A~~~~fi~~~~~~~~~~~~~ 312 (556) ....+++.+|||+..+.--.+..|+|++..++..+.-- ..|++-+.+. .+.-.++|+...+.. T Consensus 147 -----~~~~f~~~dIiHir~~~~~~~~~Gls~~~~a~~si~l~---~~a~~~~~~~f~NGa~pg~Il~~~~~~l------ 212 (348) T protein:vir:26 147 -----EQKVFKAKDVIFIPQYDPQQQIYGLPDYLGSIQSSLLN---RDATLFRRRYYLNGAHMGFIFYATDPNL------ 212 (348) T ss_pred -----eEEEEcCccEEEEcCCCCCCCcccccHHHHHHHHHHHH---HHHHHHHHHHHhccCCCceEEEecCCCC------ Confidence 01245678999999887678899999998888765422 3344444443 344455554332211 Q ss_pred ccccccccccccccccccccccccccceecCCceeeec-----CCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCH Q lcl|NC_019524. 313 GMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHL-----YPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSY 387 (556) Q Consensus 313 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L-----~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~y 387 (556) ..++.+........ ..+. -..|.+..| ..|.+++.++.......|.+..+.....||+.+|||- T Consensus 213 ---s~e~~~~lk~~~~~----~~G~----~n~~~~~vl~~~g~~~Gi~~~pis~~~~d~qf~e~k~~t~~dIa~af~VPp 281 (348) T protein:vir:26 213 ---SEADEKALKEKIAS----SKGI----GNFRSMFVNIPNGKEKGIQLIPVGDIATKDEFERIKNITAQDIFVGHRFPA 281 (348) T ss_pred ---CHHHHHHHHHHHHH----hcCc----ccccceeEEcCCCCccceeEEEccCChhHHHHHHHHHhhHHHHHHHhCCCH Confidence 11111111111100 0000 012344445 4566788888777778899999999999999999998 Q ss_pred HHhhchh--hcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCe Q lcl|NC_019524. 388 EQFSRDY--TKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNA 465 (556) Q Consensus 388 e~l~~D~--s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~ 465 (556) ..+ |.. +..+||++.+....|.+.. +.|+.+.|.++ +. -.+.+|....+. 
| T Consensus 282 ~ll-Gi~~~~~~~~sn~e~~~~~f~~~~-----------l~P~~~~ie~~-ln-~~l~~~~~~~~~-f------------ 334 (348) T protein:vir:26 282 GMG-GMLPQQGANVPDPLKVSQVYDFYE-----------VIPVCKRFMDA-VN-NDPEIPDNLKLK-F------------ 334 (348) T ss_pred HHc-cccCCCCCccccHHHHHHHHHHHH-----------HHHHHHHHHHH-Hh-hhhCCCCccEEE-E------------ Confidence 754 543 3478999877776665433 44555554332 22 223334332111 1 Q ss_pred eeecCcccccchhhhhHHHHHHH Q lcl|NC_019524. 466 EWIGASRGQIDEKKETEAAILRI 488 (556) Q Consensus 466 ~w~~p~~~~iDP~Ke~~A~~~~i 488 (556) -+||.+|..... +| T Consensus 335 --------dl~~~~e~~~~~-a~ 348 (348) T protein:vir:26 335 --------NLNPGVESANGS-AV 348 (348) T ss_pred --------ecCcccccchhh-cC Confidence 135555443322 22 No 164 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=98.91 E-value=1.6e-08 Score=63.28 Aligned_cols=446 Identities=9% Similarity=-0.018 Sum_probs=201.4 Q ss_pred CC--cchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChH Q lcl|NC_019524. 1 MK--DVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGY 78 (556) Q Consensus 1 ~s--p~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~ 78 (556) .. ..+.+...-.. .... ...+.. ....|.-+.. .... ......+. ...+. | --+++ T Consensus 37 ~~~~~~~~i~~~i~~----~~~~-~~~r~~-~l~~Yy~g~~--~il~---~~~~~~~~-~~~~~-----k-----i~~n~ 94 (511) T protein:vir:96 37 DLLQNVNEVSKYIEH----HMDY-QRPRLK-VLSDYYEGKT--KNLV---ELTRRKEE-YMADN-----R-----VAHDY 94 (511) T ss_pred hhhcCHHHHHHHHHH----HHHh-hhHHHH-HHHHHhhccC--cccc---ccCccccc-ccCcc-----e-----eecch Confidence 11 11111111000 0000 000111 1123432221 1110 00000000 00010 1 12589 Q ss_pred HHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecC Q lcl|NC_019524. 
79 AAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTG 158 (556) Q Consensus 79 a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dG 158 (556) ++-+++..+.+++|..++.... ++...+. ++.|... + +|......+.+..+.-| T Consensus 95 ~k~Iv~~~~~yl~g~p~~~~~~------------d~~~~~~----l~~~~~~----n------~~~~~~~~~~~~~~~~G 148 (511) T protein:vir:96 95 ASYISDFINGYFLGNPIQYQDD------------DKDVLEA----IEAFNDL----N------DVESHNRSLGLDLSIYG 148 (511) T ss_pred HHHHHHHHhhhhcccCceeecC------------chHHHHH----HHHHHhh----c------ChhHHHHHHHHHHHhcC Confidence 9999999999999988776532 1122333 3333332 1 46666777778888999 Q ss_pred ceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECC----C----CCeEEEEEeecCCCcccc Q lcl|NC_019524. 159 EVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDN----N----GAALGYWLRKAFPGDPTD 230 (556) Q Consensus 159 E~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~----~----Gr~vaY~i~~~hpgd~~~ 230 (556) .+|..+. ....+ .+++.+++|..+-.-++......+..+|++-. . +...-+.++..+---.+. T Consensus 149 ~a~~~vy-~d~dg--------~~~i~~~~p~~~~~v~dd~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~ 219 (511) T protein:vir:96 149 KAYELMI-RNQDD--------ETRLYKSDAMSTFIIYDNTVERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYL 219 (511) T ss_pred eeEEEEE-eCCCC--------ceEEEEEcccceEEEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEE Confidence 9997754 32211 25889999988853343333345677776411 1 112222333322100000 Q ss_pred CCcccc-----cee-eccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCc Q lcl|NC_019524. 231 MEQWKW-----GYE-PARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELP 304 (556) Q Consensus 231 ~~~~~~-----~rv-~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~ 304 (556) ..+..+ ..+ +....++.--|+++... ..|.|.|.+++..+..++....-......-.+.-..+++.... 
T Consensus 220 ~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~-----~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~ 294 (511) T protein:vir:96 220 TNRTNGLKLTPRENSFESHSFERMPITEFSNN-----ERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLN 294 (511) T ss_pred ecCCCcccccccccccccCcCcccceEEecCC-----CCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecCcc Confidence 000000 000 00001111124444332 3588999998777765554333222222221211122221111 Q ss_pred ccccccccccccccccccccccccccccccccccceecCCce-----eeecCCCceeeeecCCCCCccHHHHHHHHHHHH Q lcl|NC_019524. 305 SDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAK-----IPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNI 379 (556) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~-----i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~i 379 (556) ... .+. . .........+.++. -....+|-++++++.+.+...+..+.+.+.+.| T Consensus 295 ~~~---------~~~-~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I 353 (511) T protein:vir:96 295 LDP---------VEV-R-----------KQKEANVLFLEPTVYVDAEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDI 353 (511) T ss_pred CCc---------hhh-c-----------ccccccceeccccceeccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHH Confidence 000 000 0 00000111111111 123455778899998888899999999999988 Q ss_pred HHhcCCC---HHHhhchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccch Q lcl|NC_019524. 380 AASLGMS---YEQFSRDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDP 456 (556) Q Consensus 380 aaglGi~---ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~ 456 (556) ..-.++| .+.++++. |--+++..+..........+..|-..+ +..++..+...-..+.+..+... 
T Consensus 354 ~~~s~~P~~~~~~~~~n~---Sg~Al~~~~~~l~~ka~~~~~~f~~~l-~~~~~li~~~~~~~~~~~~~~~~-------- 421 (511) T protein:vir:96 354 HMFTNTPNMKDDNFSGTQ---SGEAMKYKLFGLEQRTKTKEGLFTKGL-RRRAKLLETILKNTRSIDANKDF-------- 421 (511) T ss_pred HHHhCCcccccccccccc---HHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhcCCCcccccc-------- Confidence 8877766 34444443 333566666555555555555554443 44454443322222322211100 Q ss_pred hhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCC--CHHHHHHHHHHHHHHHHHcCCC-CCccccc Q lcl|NC_019524. 457 MMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGG--DFREVFKQRAREEGLIKSLKLD-FTGKMVE 533 (556) Q Consensus 457 ~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~--D~e~v~~q~a~E~~~~~~~Gl~-~~~~~~~ 533 (556) .-+.+.|..+-. .|-...+++..++ .|+.|.+..+...+. |+++.++++.+|++...+.-.. ...++. T Consensus 422 ----~~i~~~f~~~~p--~n~~e~~d~~~kl--~G~iS~et~l~~l~~v~d~~~El~ri~~E~~~~~~~~~~~~~~~~~- 492 (511) T protein:vir:96 422 ----NTVRYVYNRNLP--KSLIEELKAYIDS--GGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPR- 492 (511) T ss_pred ----ccceEEeCCCCC--cCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhccccCCC- Confidence 013567754433 4655566655554 599999999998875 8999999999987654332111 111110 Q ss_pred cCCCCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 534 GNSTQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 534 ~~~~~~~~~~~~~~~~~~~e~~~ 556 (556) .... ++..++.+++.+| T Consensus 493 ---~~~~---~~~~~~~~~~~~e 509 (511) T protein:vir:96 493 ---DIND---DEQDDDTKDTVDK 509 (511) T ss_pred ---CCCC---CCCCCCccCcccc Confidence 1111 1111122221112 No 165 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=98.91 E-value=1.6e-08 Score=63.28 Aligned_cols=446 Identities=9% Similarity=-0.018 Sum_probs=201.4 Q ss_pred CC--cchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChH Q lcl|NC_019524. 
1 MK--DVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGY 78 (556) Q Consensus 1 ~s--p~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~ 78 (556) .. ..+.+...-.. .... ...+.. ....|.-+.. .... ......+. ...+. | --+++ T Consensus 37 ~~~~~~~~i~~~i~~----~~~~-~~~r~~-~l~~Yy~g~~--~il~---~~~~~~~~-~~~~~-----k-----i~~n~ 94 (511) T protein:vir:78 37 DLLQNVNEVSKYIEH----HMDY-QRPRLK-VLSDYYEGKT--KNLV---ELTRRKEE-YMADN-----R-----VAHDY 94 (511) T ss_pred hhhcCHHHHHHHHHH----HHHh-hhHHHH-HHHHHhhccC--cccc---ccCccccc-ccCcc-----e-----eecch Confidence 11 11111111000 0000 000111 1123432221 1110 00000000 00010 1 12589 Q ss_pred HHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecC Q lcl|NC_019524. 79 AAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTG 158 (556) Q Consensus 79 a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dG 158 (556) ++-+++..+.+++|..++.... ++...+. ++.|... + +|......+.+..+.-| T Consensus 95 ~k~Iv~~~~~yl~g~p~~~~~~------------d~~~~~~----l~~~~~~----n------~~~~~~~~~~~~~~~~G 148 (511) T protein:vir:78 95 ASYISDFINGYFLGNPIQYQDD------------DKDVLEA----IEAFNDL----N------DVESHNRSLGLDLSIYG 148 (511) T ss_pred HHHHHHHHhhhhcccCceeecC------------chHHHHH----HHHHHhh----c------ChhHHHHHHHHHHHhcC Confidence 9999999999999988776532 1122333 3333332 1 46666777778888999 Q ss_pred ceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECC----C----CCeEEEEEeecCCCcccc Q lcl|NC_019524. 159 EVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDN----N----GAALGYWLRKAFPGDPTD 230 (556) Q Consensus 159 E~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~----~----Gr~vaY~i~~~hpgd~~~ 230 (556) .+|..+. ....+ .+++.+++|..+-.-++......+..+|++-. . +...-+.++..+---.+. 
T Consensus 149 ~a~~~vy-~d~dg--------~~~i~~~~p~~~~~v~dd~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~ 219 (511) T protein:vir:78 149 KAYELMI-RNQDD--------ETRLYKSDAMSTFIIYDNTVERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYL 219 (511) T ss_pred eeEEEEE-eCCCC--------ceEEEEEcccceEEEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEE Confidence 9997754 32211 25889999988853343333345677776411 1 112222333322100000 Q ss_pred CCcccc-----cee-eccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCc Q lcl|NC_019524. 231 MEQWKW-----GYE-PARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELP 304 (556) Q Consensus 231 ~~~~~~-----~rv-~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~ 304 (556) ..+..+ ..+ +....++.--|+++... ..|.|.|.+++..+..++....-......-.+.-..+++.... T Consensus 220 ~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~-----~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~ 294 (511) T protein:vir:78 220 TNRTNGLKLTPRENSFESHSFERMPITEFSNN-----ERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLN 294 (511) T ss_pred ecCCCcccccccccccccCcCcccceEEecCC-----CCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecCcc Confidence 000000 000 00001111124444332 3588999998777765554333222222221211122221111 Q ss_pred ccccccccccccccccccccccccccccccccccceecCCce-----eeecCCCceeeeecCCCCCccHHHHHHHHHHHH Q lcl|NC_019524. 305 SDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAK-----IPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNI 379 (556) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~-----i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~i 379 (556) ... .+. . .........+.++. 
-....+|-++++++.+.+...+..+.+.+.+.| T Consensus 295 ~~~---------~~~-~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I 353 (511) T protein:vir:78 295 LDP---------VEV-R-----------KQKEANVLFLEPTVYVDAEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDI 353 (511) T ss_pred CCc---------hhh-c-----------ccccccceeccccceeccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHH Confidence 000 000 0 00000111111111 123455778899998888899999999999988 Q ss_pred HHhcCCC---HHHhhchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccch Q lcl|NC_019524. 380 AASLGMS---YEQFSRDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDP 456 (556) Q Consensus 380 aaglGi~---ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~ 456 (556) ..-.++| .+.++++. |--+++..+..........+..|-..+ +..++..+...-..+.+..+... T Consensus 354 ~~~s~~P~~~~~~~~~n~---Sg~Al~~~~~~l~~ka~~~~~~f~~~l-~~~~~li~~~~~~~~~~~~~~~~-------- 421 (511) T protein:vir:78 354 HMFTNTPNMKDDNFSGTQ---SGEAMKYKLFGLEQRTKTKEGLFTKGL-RRRAKLLETILKNTRSIDANKDF-------- 421 (511) T ss_pred HHHhCCcccccccccccc---HHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhcCCCcccccc-------- Confidence 8877766 34444443 333566666555555555555554443 44454443322222322211100 Q ss_pred hhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCC--CHHHHHHHHHHHHHHHHHcCCC-CCccccc Q lcl|NC_019524. 457 MMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGG--DFREVFKQRAREEGLIKSLKLD-FTGKMVE 533 (556) Q Consensus 457 ~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~--D~e~v~~q~a~E~~~~~~~Gl~-~~~~~~~ 533 (556) .-+.+.|..+-. .|-...+++..++ .|+.|.+..+...+. |+++.++++.+|++...+.-.. ...++. 
T Consensus 422 ----~~i~~~f~~~~p--~n~~e~~d~~~kl--~G~iS~et~l~~l~~v~d~~~El~ri~~E~~~~~~~~~~~~~~~~~- 492 (511) T protein:vir:78 422 ----NTVRYVYNRNLP--KSLIEELKAYIDS--GGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPR- 492 (511) T ss_pred ----ccceEEeCCCCC--cCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhccccCCC- Confidence 013567754433 4655566655554 599999999998875 8999999999987654332111 111110 Q ss_pred cCCCCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 534 GNSTQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 534 ~~~~~~~~~~~~~~~~~~~e~~~ 556 (556) .... ++..++.+++.+| T Consensus 493 ---~~~~---~~~~~~~~~~~~e 509 (511) T protein:vir:78 493 ---DIND---DEQDDDTKDTVDK 509 (511) T ss_pred ---CCCC---CCCCCCccCcccc Confidence 1111 1111122221112 No 166 >protein:vir:1150 Length: 350 # NCBI annotation: predicted capsid packaging protein # Family: family:all:196 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490599;genbank:gi:17313219;genbank:GeneID:927315 Probab=98.90 E-value=9e-10 Score=70.19 Aligned_cols=327 Identities=11% Similarity=0.044 Sum_probs=148.4 Q ss_pred CCcchhhhHHHHHhhHhhccc--chhhhhhhhcchhcccc--CCCc---------ccccccCCCCCHHHHHHHHHHHHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAE--TATATPMAVGGGMEGAE--RTTR---------EMFQWNPSIISPDQQIAQNQDMASA 67 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~--~~~~~~~~~~~~y~aa~--~~~r---------~~~~w~~~~~s~~~~i~~~~~~lr~ 67 (556) ||=-+..++.+.-..+...+. ......+...+++.... -+++ ....|..+.-+... | T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~y~~~~~~~~~~~pp~~~~~--------l-- 70 (350) T protein:vir:11 1 MSKRRSHRRQQPVTVQSAQEGEFIPRQGGRAEAFTFGDPMPVLDGRGILDYLECWPNGRWYEPPLSMEG--------L-- 70 (350) T ss_pred CCccccCCCcCccccCCcchhhhccccccceEEEEeCCceeecCcchhhHHHHHhhcCccccCCCCHHH--------H-- Confidence 433222111100000000000 00000000001110000 0000 01122222222221 1 Q ss_pred HHHHHHhcChHHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHH Q lcl|NC_019524. 
68 RAQDMVQNDGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLT 147 (556) Q Consensus 68 RaRdl~rNn~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq 147 (556) | .|.+-|++..++|...++.+.++ ++| ...+|.+++. T Consensus 71 -a-~~~~~~~~h~~~l~~k~n~l~~~-~~P----------------------------------------n~~~t~~~f~ 107 (350) T protein:vir:11 71 -A-KSVGSSVYLQSGLKFKRNMLAKT-FIP----------------------------------------HRLLSRATFE 107 (350) T ss_pred -H-HHHhhhhhhccchhhhhhhhhhc-ccC----------------------------------------CCCCCHHHHH Confidence 1 22344555555555444443331 111 2234555565 Q ss_pred HHHhhhheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCc Q lcl|NC_019524. 148 RLAVSGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGD 227 (556) Q Consensus 148 ~l~~r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd 227 (556) +++. ..+..|.+++.++.... .-++.|..|.+..+.. ..++.. ||.... .+ T Consensus 108 ~~v~-d~ll~Gnay~~~~rn~~--------G~~~~L~~l~~~~vr~----------------~~~~~~--~~~~~~-~~- 158 (350) T protein:vir:11 108 QFSL-DWLTFGSAYLEQPRSRL--------GTRMPLQAPLAKYMRR----------------GTDLET--FYQVRS-WK- 158 (350) T ss_pred HHHH-HHHhcCCeEEEEEEcCC--------CCEEEEEEeCCceeEe----------------eecCCe--EEEEee-CC- Confidence 6554 66788999998764321 2356788888877642 111211 222221 11 Q ss_pred cccCCccccceeeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHH---hcceeeeEeccCc Q lcl|NC_019524. 228 PTDMEQWKWGYEPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVV---NATYAASVESELP 304 (556) Q Consensus 228 ~~~~~~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i---~A~~~~fi~~~~~ 304 (556) ....+++.+|||+..+..-++..|+|++..++..+.--. .+++-+.+. 
.+...++|+.+.+ T Consensus 159 -------------~~~~~~~~eVihir~~~~~~~~yGls~~~~a~~si~l~~---~a~~~~~~~f~NGa~~~gil~~~~~ 222 (350) T protein:vir:11 159 -------------DEHEFEKGSVIQLREADINQEIYGVPEWFCALQSALLNE---SATLFRRKYYNNGSHAGFILYMTDA 222 (350) T ss_pred -------------eEEEECcccEEEeCCCCCCCCcccccHHHHHHHHHHHHH---HHHHHHHHHHhccCCCceEEEecCC Confidence 112467789999998887788999999888776665322 334333333 4455566664432 Q ss_pred ccccccccccccccccccccccccccccccccccceecCCceeeec-----CCCceeeeecCCCCCccHHHHHHHHHHHH Q lcl|NC_019524. 305 SDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHL-----YPGTKLKMQPAGTPGGVGTDYEQSLLRNI 379 (556) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L-----~pGe~i~~~~~~~p~~~f~~F~~~~lr~i 379 (556) ... +++.+........ ..+. -..|.+..| +.|-+++.+....-..+|.+..+....+| T Consensus 223 ~ls---------~e~~~~l~~~~~~----~~G~----~N~~~~~v~~~~g~~~g~~~~pl~~~~~d~qf~e~k~~~~~eI 285 (350) T protein:vir:11 223 AQN---------EEDIDALRTALKT----AKGP----GNFRNLFVYAPNGKKEGIQLIPVSEVAAKDEFGSIKNISRDDQ 285 (350) T ss_pred CCC---------HHHHHHHHHHHHH----hcCc----cccCceeeecCCCCccceEEEEcCCChhHHHHHHHHHHhHHHH Confidence 111 1111111111100 0000 122344444 34567887776766778999999999999 Q ss_pred HHhcCCCHHHhhchhhc--ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchh Q lcl|NC_019524. 380 AASLGMSYEQFSRDYTK--TNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPM 457 (556) Q Consensus 380 aaglGi~ye~l~~D~s~--~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~ 457 (556) |+.+|||-+ +.|+..+ .+||++.+....|.+.. +.|+.+.|.+ +.. .| +. +...|..+ T Consensus 286 a~a~~VPp~-llGi~~~~t~~~sn~e~~~~~f~~~~-----------L~P~~~~ie~--ln~-~l--~~--~~~~F~~~- 345 (350) T protein:vir:11 286 LAGLRVYPQ-LMGVVPQNAGGFGSISDAAAVWASLE-----------LAPMQTRLQQ--VNE-MI--GE--EVVRFAQF- 345 (350) T ss_pred HHHhCCCHH-HhcccCCCCCCcCCHHHHHHHHHHHH-----------HHHHHHHHHH--HHh-hc--Cc--cccccCcc- Confidence 999999987 5566544 56999888777765543 2333333321 111 11 00 00111110 Q ss_pred hHHHh Q lcl|NC_019524. 
458 MRDAL 462 (556) Q Consensus 458 ~~~a~ 462 (556) ...++ T Consensus 346 ~~~~l 350 (350) T protein:vir:11 346 DAPGL 350 (350) T ss_pred cccCC Confidence 01111 No 167 >protein:vir:108215 Length: 469 # NCBI annotation: gp6 # Family: family:all:2372 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552335;genbank:gi:160700655;genbank:GeneID:5758935 Probab=98.90 E-value=1.7e-08 Score=63.17 Aligned_cols=439 Identities=13% Similarity=0.033 Sum_probs=204.4 Q ss_pred hcccchhhhhhhhcchhccccCC-CcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHhhhccCCce Q lcl|NC_019524. 18 VVAETATATPMAVGGGMEGAERT-TREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVVAVHRDSIVGSQYK 96 (556) Q Consensus 18 ~~~~~~~~~~~~~~~~y~aa~~~-~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v~~~~~nvVG~Gi~ 96 (556) +..++..+..+.. .+|- +++. ....+-|...... .+++ +.......+++...++++.+++++...-|.|..++ T Consensus 1 ~~~~~~~~~p~~~-~g~~-~~~~~~~~~~~~~~~e~~--~~lr--~~~~~~ly~~m~e~D~~i~s~l~~rk~av~~~~w~ 74 (469) T protein:vir:10 1 MTERVKTAAPVSE-AGYV-FGSGVVDGWTVWDPFEQT--PELQ--WPQSVAVYSRMDNEDSRVTSLLEAISLPIRSTPWR 74 (469) T ss_pred CCCcccCCCCccc-hhhh-hhcccccchhhccccccc--cccc--cccchHHHHHHHhhChHHHHHHHHHHHHHhcCCce Confidence 1111111111100 1111 1110 0001111110000 0111 12334467788889999999999999999998887 Q ss_pred eeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEEEeeccCCCCcCCC Q lcl|NC_019524. 97 LNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLATCEWLNPTGTTMQR 176 (556) Q Consensus 97 ~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~~~~~~ 176 (556) +.+.- .+++.++.+.+.+...+..-. .....|+..++.+|.....-.+-..+.=|=+++-++|...... 
.+| T Consensus 75 v~p~~------~~~e~~~~~~~~L~~~~~~~~-~~~~~~~~~~~~~w~~~l~~~l~~a~~~G~s~~Eivw~~~~~~-~dG 146 (469) T protein:vir:10 75 IRANG------ASDEVTEFVSRNLMVPIDGED-DVRNPGRSRGRFSWAEHLEEVTSPTLQFGHAVFEQVYRPRNQS-PDG 146 (469) T ss_pred EecCC------CCHHHHHHHHHHHHhhhhhhh-hhhhhhhhhccccHHHHHHHHHHHhhhhCceeeeeeeeccccc-CCC Confidence 76532 244444445554444433221 1223577788999999988888777778999999998754321 233 Q ss_pred cccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCC-c-cccCCccccceeeccccCChh-HeEee Q lcl|NC_019524. 177 RPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPG-D-PTDMEQWKWGYEPARFDWGRR-RVIHI 253 (556) Q Consensus 177 ~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpg-d-~~~~~~~~~~rv~~~~~v~a~-~viH~ 253 (556) .-.+-+|....+..+. =+.||.++..+.++-...... . ......... ..+|.. -|+|. T Consensus 147 ~~~~~~l~~rp~~~i~-------------~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~------~~lp~~k~i~~~ 207 (469) T protein:vir:10 147 RFWLRKLAPRPQWTIS-------------KFNVAPDGGLESIEQIAPPARTRGSLYVANIAP------PEIPVNRLVVYT 207 (469) T ss_pred ceeeeeeeecCcccce-------------eeeeccCCceeeeeecCcccccccccccCCCCc------cccccCcEEEEE Confidence 3333333333322221 145566666666653221100 0 000000111 123333 46677 Q ss_pred ecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccccccccccccc Q lcl|NC_019524. 254 IEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANY 333 (556) Q Consensus 254 f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 333 (556) +. .+.|..-|.+.|.++.-...-........+.-...-++=..+.|.+.+.. ++.... 
T Consensus 208 ~~-~~~g~p~g~gLlr~~~~~~~fK~~~~~~w~~f~EryG~P~~vgky~~~a~----------~~ek~~----------- 265 (469) T protein:vir:10 208 RN-KRPGQWQGKSILRSAYKHWLLKDKLLRIEAATAERNGMGIPVGTASSATD----------EDEVRK----------- 265 (469) T ss_pred ec-CCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCcceEEecCCCCC----------HHHHHH----------- Confidence 65 46889999999988765533333333333333333222223333322111 000000 Q ss_pred cccccceecCC--ceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccchhHHHHHHHHHH Q lcl|NC_019524. 334 VAQTKNIAIDG--AKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSARASMAETQK 411 (556) Q Consensus 334 ~~~~~~~~l~p--G~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R~~~~e~~r 411 (556) .-.....+.. .....++.|.+|+++++...+..|..|.+.+-++|+..+--. .||.+-.+.|||.+-.-...+.. T Consensus 266 -l~~a~~~~~~g~~a~~iip~~~~ie~~ea~g~~~~~~~li~~~d~~Isk~iLG~--tlTs~~~gGS~a~~~vh~ev~~d 342 (469) T protein:vir:10 266 -MAALARSVRGGINAGVGLAQGQILELLGVSGNLPDIRRAIEGHDRSIALSGLAH--FLNLDGKGGSYALASVLEDPFTQ 342 (469) T ss_pred -HHHHHHHHhcCCceEEEccCCceEEEeecCCCchHHHHHHHHHHHHHHHHHhcc--cccccCccchhhHHHHHHHHHHH Confidence 0011111222 224558999999999999888999999999999998875432 35655444566655444444444 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcC Q lcl|NC_019524. 412 YMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNG 491 (556) Q Consensus 412 ~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G 491 (556) ..+.....+...+-+-+..+++. ++ .+.... +-+..|- ..+. 
|....+++....+.+| T Consensus 343 ~~~sDa~~i~~tln~~li~~l~~---lN----~g~~~~------------~P~~~~~--~~e~-~~~~~a~~i~~l~~~G 400 (469) T protein:vir:10 343 AVHAYATSICRIANQHIIEDLVD---IN----FGVDTP------------APVLTFD--PIGS-RQDLTAAAVKLLYDAG 400 (469) T ss_pred HHHHHHHHHHHHHHHHHHHHHHH---hc----CCCCCC------------ccEEEec--CCCC-cHHHHHHHHHHHHhcC Confidence 44555555555554555555543 12 111100 0112331 1121 2223355555666777 Q ss_pred CCC----HHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccC----------------CCCCCCCCCCCCCCCC Q lcl|NC_019524. 492 LST----YEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGN----------------STQSSNSSESTSDNPN 551 (556) Q Consensus 492 ~~s----~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~----------------~~~~~~~~~~~~~~~~ 551 (556) +.. .++.++ +++||+.+.+..... ....+...+.....+. T Consensus 401 ~~~~~~~~~~~~~--------------------e~~gip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 460 (469) T protein:vir:10 401 VFDDDPAVKRAIR--------------------QRFNLPSELNDTPSAEPEEPAAVPNQSAAPARTRSSGNADARARAPK 460 (469) T ss_pred CccCccccHHHHH--------------------HHhCCCCCCCCcccccchhcccCCCCCccccccCCCCCcccccccCC Confidence 732 222233 344554322111100 0000000011111111 Q ss_pred CcCCC Q lcl|NC_019524. 552 EETTQ 556 (556) Q Consensus 552 ~e~~~ 556 (556) .+..+ T Consensus 461 ~~~~~ 465 (469) T protein:vir:10 461 ADQGV 465 (469) T ss_pred ChHHh Confidence 11111 No 168 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=98.88 E-value=2.1e-08 Score=62.63 Aligned_cols=445 Identities=9% Similarity=-0.039 Sum_probs=201.2 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAA 80 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~ 80 (556) +.|-.-..-...... . ...+.. ....|.-+.. .-....-.+...-++ . 
| + -+++++ T Consensus 15 ~~~~~~~~~i~~~~~-~-----~~~r~~-~~~~yy~g~~-~i~~~~~~~~~~~~~------~-----k---i--~~n~~~ 70 (489) T protein:vir:99 15 LWIDQLKNYISRFKA-E-----QLERLK-ELKRYYLGDN-NIKYRPAKTDKYAAD------N-----R---I--ASDFAK 70 (489) T ss_pred CCHHHHHHHHHHHHH-H-----HHHHHH-HHHHHhcccC-ccccccccccccCCc------c-----e---e--ecchHH Confidence 333322211111110 0 001111 1122332221 100000001001011 1 1 2 368999 Q ss_pred HHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCce Q lcl|NC_019524. 81 GVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEV 160 (556) Q Consensus 81 ~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~ 160 (556) -+|+..+.+++|.+++..+. + +.+...++.|... .+|...+..+.+..+.-|.+ T Consensus 71 ~iv~~~~~~l~g~~~~~~~~---------d-------~~~~~~l~~~~~~----------n~~~~~~~~~~~~~~~~G~~ 124 (489) T protein:vir:99 71 YITVFEQGYMLGVPVEYKNE---------N-------KDLQAAIDLMSVR----------NNEDYHNVKIKTDLSIYGRA 124 (489) T ss_pred HHHHHHhhhhccCCceeecC---------C-------hhHHHHHHHHHhh----------cChhHHHHHHHHHHhhCCeE Confidence 99999999999998876542 1 2233344444432 25777778888899999999 Q ss_pred EEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEE-----CCCCCeEEEEEeecCCCccccC--Cc Q lcl|NC_019524. 161 LATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQL-----DNNGAALGYWLRKAFPGDPTDM--EQ 233 (556) Q Consensus 161 f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~-----d~~Gr~vaY~i~~~hpgd~~~~--~~ 233 (556) |.++.... ..+ +. -.+++..++|+.+-.-++......+..+|.+ ++.+...-+.++..+ ..+.. .. 
T Consensus 125 ~~~v~~~~-~~d---~~-~~~~i~~~~p~~~~~v~dd~~~~~~~~~i~~~~~~~~~~~~~~~~~~y~~~--~i~~~~~~~ 197 (489) T protein:vir:99 125 YELLTVEK-IDD---KK-TEVKLYQLPAEQTFVIYDDTYQRNSLMAVHFYDIDYGSGKRKQIIKAYTSD--TIYTYEDYN 197 (489) T ss_pred EEEEeecc-CcC---CC-cceEEEEEcccceEEEEcCCCCCceEEEEEEEEEecCCCceEEEEEEEeCC--cEEEEEecC Confidence 98765432 111 11 2367899999987433333333446666653 122233333343322 11100 00 Q ss_pred ---cccc---eee-ccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEecc-Ccc Q lcl|NC_019524. 234 ---WKWG---YEP-ARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESE-LPS 305 (556) Q Consensus 234 ---~~~~---rv~-~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~-~~~ 305 (556) ..+. .+| .+..+| |+|+... ..|.|.|.+++..+..++....-......-.+.-..+|+.. .++ T Consensus 198 ~~~~~~~~~~~~~~~~g~vP---vv~~~n~-----~~~~s~~~~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~g~~~~~ 269 (489) T protein:vir:99 198 LETKGMRLKDYEGHFFKGVP---VNEYANN-----EERTGAYESVLDNIDAYDLSQSELANFQQDSVNALLVIAGNAYTG 269 (489) T ss_pred CCcccceecccccccCCcee---EEEeecC-----CCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhhccCCccc Confidence 0000 000 011122 5665543 35889999887776666554433333332222111222211 111 Q ss_pred cccccccccccccccccccccccccccccccccceecCCceeeecCC-------CceeeeecCCCCCccHHHHHHHHHHH Q lcl|NC_019524. 306 DVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYP-------GTKLKMQPAGTPGGVGTDYEQSLLRN 378 (556) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~p-------Ge~i~~~~~~~p~~~f~~F~~~~lr~ 378 (556) ......... . .. ...........+..+.+..+.+ +.++++++.+.+...+..+++.+.+. 
T Consensus 270 ~~~~~~~~~-----~---~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~ 336 (489) T protein:vir:99 270 ADENDYLDD-----G---RL-----NPNGRLAISIGFKKAQVLILDDNPNPNGVKPQAYFLKKEYDTAGSEAYKNRLVAD 336 (489) T ss_pred ccchhhhhh-----c---cc-----ccccccccccccccceeeeeccccCccccccceeeeeecCChHHHHHHHHHHHHH Confidence 100000000 0 00 0000000111123344444333 34678888888888999999999999 Q ss_pred HHHhcCCCH---HHhhchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHc-CCccCCCCccccccc Q lcl|NC_019524. 379 IAASLGMSY---EQFSRDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNA-GNVPLPPGKNWRMFY 454 (556) Q Consensus 379 iaaglGi~y---e~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~-G~l~~p~~~~~~~~~ 454 (556) |..-.++|- +.++++ .|-.+.+..+..........+..| ...++.+++..++..-.. +....... T Consensus 337 i~~~s~~p~~~~~~~~~n---~Sg~Al~~~~~~l~~k~~~k~~~~-~~~l~~~~~li~~~~~~~~~~~~~~~~------- 405 (489) T protein:vir:99 337 ILRFTFTPDTQDMKFSGV---QSGESMKYKLMASDNYREKQERLF-KKGLMRRLRLAANIWAIKGNEATTYSL------- 405 (489) T ss_pred HHHHhCCccccccccccc---chHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhhcCCccccccc------- Confidence 988887762 223333 334455555554444444444443 333454554443322111 11111000 Q ss_pred chhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCC----CHHHHHHHHHHHHHHHHHcCCCCCcc Q lcl|NC_019524. 455 DPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGG----DFREVFKQRAREEGLIKSLKLDFTGK 530 (556) Q Consensus 455 ~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~----D~e~v~~q~a~E~~~~~~~Gl~~~~~ 530 (556) ..-+.+.|..+- ..|....+++..++ .|+.|.+..+...+. |+++.++++++|.+...+. .- .. T Consensus 406 -----~~~i~v~f~~~~--p~d~~~~~~~~~kl--~giis~et~~~~l~~v~~~d~~~E~~ri~~E~~~~~~~--~~-~~ 473 (489) T protein:vir:99 406 -----VNDTSIVFTPNL--PQNDNEIVTAAQNL--YGIVSDQTIFEILNTVTGVDAEAELKRLKEEADKKQSL--PE-PR 473 (489) T ss_pred -----cccceEEeCCCC--CcCHHHHHHHHHHH--hccCCHHHHHHhcCCCCchhHHHHHHHHHHHHHHHhcc--cc-cc Confidence 001345563222 24666666666654 599999999988754 7788888888877543322 11 00 Q ss_pred ccccCCCCCCCCCCCCCCCCCCcC Q lcl|NC_019524. 
531 MVEGNSTQSSNSSESTSDNPNEET 554 (556) Q Consensus 531 ~~~~~~~~~~~~~~~~~~~~~~e~ 554 (556) ..+..+. +.++++++. T Consensus 474 -----~~~~~~~---~~~~~~~~p 489 (489) T protein:vir:99 474 -----LVGDASG---QEEPTAEKP 489 (489) T ss_pred -----ccCCCCC---CcCCCCCCC Confidence 0011101 111111111 No 169 >protein:vir:79150 Length: 368 # NCBI annotation: bacteriophage gpQ # Family: family:all:196 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165254;genbank:gi:145708079;genbank:GeneID:5247161 Probab=98.87 E-value=1.1e-09 Score=69.81 Aligned_cols=347 Identities=10% Similarity=0.043 Sum_probs=162.1 Q ss_pred CCcchhhhHHH-HHhhHhhcccchhhhhhhhcchhccccCCCccccccc---CCCCCHHHHHHHHHHHHHHHHHHHHhc- Q lcl|NC_019524. 1 MKDVKKTTRTR-AKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWN---PSIISPDQQIAQNQDMASARAQDMVQN- 75 (556) Q Consensus 1 ~sp~~~~~r~~-a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~---~~~~s~~~~i~~~~~~lr~RaRdl~rN- 75 (556) || ..++.+ .+.+............ ....++.+.+ +..|. |...-...++.... .++.| T Consensus 1 m~---~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~-~~~~~fg~p~~~~~~~~~~~~~--------~~~~~~ 63 (368) T protein:vir:79 1 MS---RNKTRRAARAASAHVRTANTDAP-----TEHHTDRAAQ-AEVFSFGDPVEVLDRRELLDYV--------ECMRMG 63 (368) T ss_pred CC---ccccccchhccCcccccccccCc-----chhhccccCc-eEEEEcCCceeecchhhHHHHH--------HHHhcc Confidence 33 322222 1111111111100000 0111111121 11121 11010111111111 23333 Q ss_pred ----ChHHHHHHHHHH-hhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHH Q lcl|NC_019524. 76 ----DGYAAGVVAVHR-DSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLA 150 (556) Q Consensus 76 ----n~~a~~~v~~~~-~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~ 150 (556) .|+...++..+. .|+-+.+. +..+ .-+-.. . 
+.....+|..++..++ T Consensus 64 ~~~~~pi~~~~la~~~~~~~~h~~~-~~~~-----------------~n~l~l----~------~~Pn~~~t~~~f~~l~ 115 (368) T protein:vir:79 64 QWYEPPMPWDGLARSFRAAAHHSSA-VYVK-----------------RNILVS----T------FIPHPLLSRATFERLV 115 (368) T ss_pred chhccCcCHHHHHHHHhhccccchh-hhhh-----------------cchhhh----h------cCCCcCCCHHHHHHHH Confidence 344444444433 33322211 1110 111111 1 2344556777776664 Q ss_pred hhhheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCcccc Q lcl|NC_019524. 151 VSGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTD 230 (556) Q Consensus 151 ~r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~ 230 (556) ..++..|++|+.++... ...++.|..|.+..+... .++. .|+..... + T Consensus 116 -~d~ll~Gnay~~~~r~~--------~G~~~~L~~l~~~~v~~~----------------~~~~--~~~~~~~~-~---- 163 (368) T protein:vir:79 116 -LDWQVFGNAYLERRENV--------LGGTIRLDTPLAKYVRRG----------------LDLN--TYFFVQNW-Q---- 163 (368) T ss_pred -HHHhhcCCeEEEEEEcC--------CCCEEEEEEeCcccceee----------------ccCC--EEEEEecC-C---- Confidence 57889999999876432 123567888888777421 1111 12222111 0 Q ss_pred CCccccceeeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCccccccc Q lcl|NC_019524. 231 MEQWKWGYEPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFG 310 (556) Q Consensus 231 ~~~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~ 310 (556) ....+++.+|||+..+...++..|+|++..++..+.--..-......--+=.+...++|+.+.+.. T Consensus 164 ----------~~~~~~~~dIihir~~~~~~~~yGlsp~~~a~~si~l~~aa~~~~~~~~~NGa~~~gil~~~~~~l---- 229 (368) T protein:vir:79 164 ----------QPYTFAAGSVFHLQEPDINQEVYGLPEYLSALNATWLNESATLFRRRYYKNGSHAGFILYMTDAAQ---- 229 (368) T ss_pred ----------eEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCC---- Confidence 012356779999998887888999999888776555333322222222222445556665432211 Q ss_pred ccccccccccccccccccccccccccccceecCCceeeec-----CCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCC Q lcl|NC_019524. 
311 QLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHL-----YPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGM 385 (556) Q Consensus 311 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L-----~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi 385 (556) .++..+........ ..+ .-..|.+..| +.|-+++.++...-...|.+..+....+||+.+|| T Consensus 230 -----~~e~~~~lk~~~~~----~~G----~~N~g~~~vl~~~g~~~g~~~~pls~~~~d~qf~e~k~~~~~eIa~af~V 296 (368) T protein:vir:79 230 -----KQEDVDTLREAMKS----AKG----PGNFRNLFMYAPNGKKDGIQLLPVSEVAAKDEFWNIKNVTRDDQLAAHRV 296 (368) T ss_pred -----CHHHHHHHHHHHHH----hcC----CcccCceeEecCCCCccceeEEEcCCCHHHHHHHHHHHHhHHHHHHHhCC Confidence 01111111111100 000 0123445555 45668888887777788888899999999999999 Q ss_pred CHHHhhchhhc--ccchhHHHHHHHHHHHH-HHHHHHHHHHHHHHH---HHHHHHHHHHcCCccCCCCcccccccchhhH Q lcl|NC_019524. 386 SYEQFSRDYTK--TNYSSARASMAETQKYM-DSRKKLVADRFASAI---YTLWLEEEVNAGNVPLPPGKNWRMFYDPMMR 459 (556) Q Consensus 386 ~ye~l~~D~s~--~nYSs~R~~~~e~~r~~-~~~q~~lv~~~~~pi---~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~ 459 (556) |...| |...+ .+||++.+....|++.. ..+++.|. ++..-+ +-+|-+..++++..+.++... .| T Consensus 297 Pp~ll-Gi~~~~t~~~sn~e~~~~~f~~~~l~Pl~~~ie-~ln~~l~~e~~rF~~~~l~~~D~~a~a~~~--------~r 366 (368) T protein:vir:79 297 PPQLM-GIIPNNTGGFGDVEKAAMVFARNEVKPLQDRLL-AINDWIGDEVVRFAPYALGGHDQPAAAPGG--------QR 366 (368) T ss_pred CHHHc-cccCCCCCccccHHHHHHHHHHHHHHHHHHHHH-HHHhccCcceeeechhHhhcccccccCCcc--------cc Confidence 98744 65544 35999999888887644 45555442 222211 112334445555544443211 11 Q ss_pred HH Q lcl|NC_019524. 
460 DA 461 (556) Q Consensus 460 ~a 461 (556) .| T Consensus 367 sa 368 (368) T protein:vir:79 367 SA 368 (368) T ss_pred cC Confidence 11 No 170 >protein:vir:79207 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111036;genbank:gi:134288763;genbank:GeneID:4960726 Probab=98.86 E-value=6.9e-10 Score=70.81 Aligned_cols=325 Identities=10% Similarity=0.036 Sum_probs=156.1 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcch---------------hccccCCCcccccccCCCCCHHHHHHHHHHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGG---------------MEGAERTTREMFQWNPSIISPDQQIAQNQDMA 65 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~---------------y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~l 65 (556) ||--|..+|...-.++......... .+..... |...- .+..|..++-+... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~p~~v~~~~~~~~~~~~~----~~~~~~~pp~~~~~--------- 66 (351) T protein:vir:79 1 MSKRRSRAPRTFAAAPNPSAGSAAP-ARAEVFTFDDPTPVMNRAEILDYVECW----SNGEWFEPPVSFAG--------- 66 (351) T ss_pred CCCCCCCCCCCCCCCCchhhhhccc-ceeEEEEcCCceeecCcchhhhhhhhh----hcCceecCCCCHHH--------- Confidence 5533332221110100000000000 0000111 11110 01223333332211 Q ss_pred HHHHHHHHhcChHHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHH Q lcl|NC_019524. 66 SARAQDMVQNDGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTG 145 (556) Q Consensus 66 r~RaRdl~rNn~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~ 145 (556) --.|.+-|++..++|..-++.+.+. 
++| ...+|.++ T Consensus 67 ---la~~~~~~~~h~~~l~~k~n~l~~~-~~P----------------------------------------np~~t~~~ 102 (351) T protein:vir:79 67 ---LAKSFRASTHHSSALFFKANVLAST-FRP----------------------------------------HRWLSRHA 102 (351) T ss_pred ---HHHHHhhhHhhhhhhhhhhhHHhhc-ccC----------------------------------------CCCCCHHH Confidence 1134556677777776554444431 222 22345555 Q ss_pred HHHHHhhhheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCC Q lcl|NC_019524. 146 LTRLAVSGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFP 225 (556) Q Consensus 146 lq~l~~r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hp 225 (556) |..++. ..+..|.+++.++.... .-++.|..|.|.++.... |.. .|++.... T Consensus 103 f~~~v~-d~ll~Gnay~~~~r~~~--------G~~~~L~~l~~~~v~~~~--------------~~~----~~~~~~~~- 154 (351) T protein:vir:79 103 FERWAL-DFLTFGNGYLERRRNMV--------GGTLRLEPALAKYVRRKA--------------DFS----GFVYVNGW- 154 (351) T ss_pred HHHHHH-HHHhcCCeEEEEEECCC--------CCEEEEEEeCCcceeeee--------------cCC----eEEEEecC- Confidence 655554 66788999988764321 235788888888774211 111 13333211 Q ss_pred CccccCCccccceeeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHH---hcceeeeEecc Q lcl|NC_019524. 226 GDPTDMEQWKWGYEPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVV---NATYAASVESE 302 (556) Q Consensus 226 gd~~~~~~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i---~A~~~~fi~~~ 302 (556) + ....+++.+|||+..+..-++..|+|++..++..+. -=..+++-+.+. .|...++|+.+ T Consensus 155 g--------------~~~~~~~~eIihir~~~~~~~~yGl~~~~~a~~si~---l~~~a~~~~~~~f~NGa~pg~il~~~ 217 (351) T protein:vir:79 155 Q--------------ERHEFEPDSVFQLVRPDINQEVYGLPEYLSSLHSAW---LNESSTLFRRKYYENGSHAGFILYMT 217 (351) T ss_pred c--------------eEEEEcCccEEEeCCCCCCCCcccccHHHHHHHHHH---HHHHHHHHHHHHHhccCCCceEEEec Confidence 0 012356779999998887789999998877665544 234455444443 34445555543 Q ss_pred CcccccccccccccccccccccccccccccccccccceecCCceeeec-----CCCceeeeecCCCCCccHHHHHHHHHH Q lcl|NC_019524. 
303 LPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHL-----YPGTKLKMQPAGTPGGVGTDYEQSLLR 377 (556) Q Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L-----~pGe~i~~~~~~~p~~~f~~F~~~~lr 377 (556) .+... .++.+........ ..+. -..|.+..+ +.|-+++.+.......+|.+..+.... T Consensus 218 ~~~ls---------~e~~~~lk~~~~~----~~G~----~N~~~~~v~~~~g~~~gi~~~pl~~~~~d~ef~e~k~~s~~ 280 (351) T protein:vir:79 218 DAAQK---------QDDVDNMRDALKN----AKGP----GNFRNVFMYAPGGKKDGIQLIPVSEVAAKDEFFNIKNVTRD 280 (351) T ss_pred CCCCC---------HHHHHHHHHHHHH----hcCc----cccCceeEecCCCCccceEEEEcCCChhHHHHHHHHHHhHH Confidence 32110 1111111111110 0000 012333333 456788888877777889999999999 Q ss_pred HHHHhcCCCHHHhhchhhc--ccchhHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCccccccc Q lcl|NC_019524. 378 NIAASLGMSYEQFSRDYTK--TNYSSARASMAETQKYM-DSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFY 454 (556) Q Consensus 378 ~iaaglGi~ye~l~~D~s~--~nYSs~R~~~~e~~r~~-~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~ 454 (556) +|++.+|||...+ |...+ .+||++......|.+.. ..+|..+. ++. .||. .+...|+ T Consensus 281 eI~~a~~VPp~ll-Gi~~~~t~~~~n~e~~~~~f~~~~l~Pl~~~ie-~ln-----~~lg-------------~~~~~F~ 340 (351) T protein:vir:79 281 DLLAAHRVPPQLL-GIVPSNSGGFGTPDTAARVFGRNEIRPLQARFA-ELN-----DWLG-------------DEVVTFD 340 (351) T ss_pred HHHHHhCCCHHHh-cccCCCCCCcccHHHHHHHHHHHHHHHHHHHHH-HHH-----hhcC-------------cceeeeC Confidence 9999999998855 65433 56999988888776543 34444332 111 1221 1122233 Q ss_pred chhhHHHhhCe Q lcl|NC_019524. 
455 DPMMRDALCNA 465 (556) Q Consensus 455 ~~~~~~a~~~~ 465 (556) .+...+.-..+ T Consensus 341 ~~~llr~d~~a 351 (351) T protein:vir:79 341 DYEIPPAPVAA 351 (351) T ss_pred hhhhccccccC Confidence 22211111111 No 171 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=98.86 E-value=2.5e-08 Score=62.24 Aligned_cols=443 Identities=10% Similarity=0.006 Sum_probs=210.3 Q ss_pred CCcchhhh-----HHHHHhhHhhcccchhhhhh-hhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019524. 1 MKDVKKTT-----RTRAKKAVDVVAETATATPM-AVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQ 74 (556) Q Consensus 1 ~sp~~~~~-----r~~a~~a~~~~~~~~~~~~~-~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~r 74 (556) |-|++..+ ..-......+......-+.+ .....|.-+.+. .... + .++. ..+..++ . T Consensus 1 ~~~~~~~~~~gl~~~~~~~~~~L~~~~~~~~~~~~~~~~Yy~G~~~---~~~~-~--~~~p----~~~r~~~-------~ 63 (474) T protein:vir:81 1 MIQQQTVRIPSLSNDENALINGLLAQIENLRWKNLLRTSYYENKRT---IQYV-G--TLIP----PQYFNLG-------L 63 (474) T ss_pred CcCCCcCcCCCCChhHHHHHHHHHHHHHHHhhHHHHHHHHhccCCC---hhhc-c--cccc----HHHHHHH-------h Confidence 44444321 11100000111110001111 112234433221 1111 1 1111 1222221 1 Q ss_pred cChHHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhh Q lcl|NC_019524. 75 NDGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGF 154 (556) Q Consensus 75 Nn~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~ 154 (556) -.+|.+-+|+.+++..+-.||+.- + ++.+ -..+|+-|..+ +|...+..+.+.. 
T Consensus 64 v~nw~~~~Vd~~a~rl~~~Gf~~~---d-------~~~~------~~~l~~iw~~N-----------~ld~~~~~~~~~a 116 (474) T protein:vir:81 64 VLGWTGKAVDALARRCNLEGFVWP---D-------GDLD------SLGGTEVVDDN-----------HLLSEIDSAIVAA 116 (474) T ss_pred hcChHHHHHHHHHhhhcccceECC---C-------CCcc------chHHHHHHHhc-----------ChhHHHHHHHHHH Confidence 358899999999999999998742 1 1111 12356555432 5778889999999 Q ss_pred eecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEE---EECCCCCeEEEEEeecCCCccc-- Q lcl|NC_019524. 155 LMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGV---QLDNNGAALGYWLRKAFPGDPT-- 229 (556) Q Consensus 155 ~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GI---E~d~~Gr~vaY~i~~~hpgd~~-- 229 (556) +.-|.+|+..-... ++.+. ..++.++|.++---++.. .+.+..++ +.|..|.+...-++. |+..+ T Consensus 117 l~~G~sf~~V~~~~------d~~~~-~~i~~~sp~~~~~~~D~~-~~~~~~al~~~~~~~~g~~~~~~ly~--~~~~~~~ 186 (474) T protein:vir:81 117 MQHGPAFLINTVGE------DDEPE-ALIHVKDASEATGEWNRR-RRGLNNLLSIIDKDKEGKVLSLALYL--DNETVTA 186 (474) T ss_pred HhhCceeEEEecCC------CCCce-eEEEEeccceEEEEEeCC-CCcceeeeEEEEEcCCCcEEEEEEEe--CCcEEEE Confidence 99999997753222 12222 358899999886444322 23455555 456777765432222 22211 Q ss_pred c--CCccccceeeccccCChhHeEeeecccCCCcccCCchhh-HHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCccc Q lcl|NC_019524. 230 D--MEQWKWGYEPARFDWGRRRVIHIIEALLAGQTRGISEMV-SALKQMKMTRNFQEITLQNAVVNATYAASVESELPSD 306 (556) Q Consensus 230 ~--~~~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la-~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~ 306 (556) . 
..+..|..-......+.+ |+++....+.+-.=|.|.++ |+|..+..+.+-.--.+..+...|.=--+|.--.+++ T Consensus 187 ~~~~~~~~w~~~~~~~~~gvP-vV~~~n~~~~~~~~G~s~i~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~~~~~ 265 (474) T protein:vir:81 187 QRDKATLKWQVDRDEHVYGVP-AQVLPYKPAPKRPFGQSRITKPMMGLQDAGVRELARREGHMDVFSYPEFWLLGADESA 265 (474) T ss_pred EEcCccceeeeccCCCCCCcc-eEEecccccccCcCCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhheeecCChhh Confidence 0 112223221111234444 88888776665555888876 5555444444333333333333222222222111100 Q ss_pred ccccccccccccccccccccccccccccccccceecCCceeeecCCCce----------eeeecCCCCCccHHHHHHHHH Q lcl|NC_019524. 307 VVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTK----------LKMQPAGTPGGVGTDYEQSLL 376 (556) Q Consensus 307 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~----------i~~~~~~~p~~~f~~F~~~~l 376 (556) . . +. +......-....|.|..++++++ +.-++.. .-.+|.+.++.++ T Consensus 266 ~------------~----d~------d~~~~~~~~~~~~~i~~~~~d~d~~~~~~~~~~~~q~~~a-~l~~~~~~l~~~~ 322 (474) T protein:vir:81 266 L------------K----NA------DGTIKSVWEARLGRIKGLPDDADADIPQLARADVKQFPAA-SPDAHWSDINGLA 322 (474) T ss_pred c------------c----cc------cccccchhhhhHHHHhcCCCcccccccccccccccccCCC-ChhHHHHHHHHHH Confidence 0 0 00 00001111123455666666654 2333322 2346778889999 Q ss_pred HHHHHhcCCCHHHhhchhhcccch---hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccc Q lcl|NC_019524. 377 RNIAASLGMSYEQFSRDYTKTNYS---SARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMF 453 (556) Q Consensus 377 r~iaaglGi~ye~l~~D~s~~nYS---s~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~ 453 (556) +.+|+-.|+|-+.|. -.+-.|-| +++++.....+..+..|..|-..+.+ +++.-+ ++. |..+... 
T Consensus 323 ~~~a~~t~iP~~~lG-~~~~~np~SaeAi~a~~~~l~~kae~k~~~fg~~l~~-~~rla~--~i~-~~~~~~~------- 390 (474) T protein:vir:81 323 KLFAREASLPDTAVA-ISGLSNPTSAESYDASQYELIAEAEGAVDDFTPALRK-AFIRAL--AMK-NKVAIDE------- 390 (474) T ss_pred HHHHhhhCCCHHHhc-ccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHH--HHh-CCCCccc------- Confidence 999999999988762 22223444 66666777777777777778777655 344322 333 3322110 Q ss_pred cchhhHHH--hhCeeeecCcccccchhhhhHHHHHHHHcCCCC-HHHHHHH-hCCCHHHHHHHHHHHHHHHHHcCCCCCc Q lcl|NC_019524. 454 YDPMMRDA--LCNAEWIGASRGQIDEKKETEAAILRIKNGLST-YEAEISR-LGGDFREVFKQRAREEGLIKSLKLDFTG 529 (556) Q Consensus 454 ~~~~~~~a--~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s-~~~~~ae-~G~D~e~v~~q~a~E~~~~~~~Gl~~~~ 529 (556) .+.. -+.+.|..|..+ .....+.|..+.+.+|..- .++...+ .|.+++++-+... +++.....++. +. T Consensus 391 ----~~~~~~~~~v~W~d~~~~--s~a~~aDa~~Kl~~a~~~~~~~~~~~~~lg~t~~~i~~~~~-~~~~~~~~~~~-~~ 462 (474) T protein:vir:81 391 ----IPDEWKSIDAKWRDPRYL--SKSAQADAGMKQLAAVPWLAETEVGLELIGLTPQQARRAMA-DKRRVQGRGTL-QA 462 (474) T ss_pred ----cchhhccceeEecCCCcc--CHHHHHHHHHHHHhcccCCCcHHHHHhhcCCCHHHHHHHHH-HHHHHhHHHHH-HH Confidence 0111 245678777665 4466788888889988433 3344445 4999887643221 11111111110 00 Q ss_pred cccccCCCCCCC Q lcl|NC_019524. 530 KMVEGNSTQSSN 541 (556) Q Consensus 530 ~~~~~~~~~~~~ 541 (556) -......++.++ T Consensus 463 l~~~~~~~~~aq 474 (474) T protein:vir:81 463 LIDRSNNGATAQ 474 (474) T ss_pred HHhcCCCCCCCC Confidence 000001111111 No 172 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=98.81 E-value=4e-08 Score=61.13 Aligned_cols=430 Identities=9% Similarity=0.024 Sum_probs=200.2 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHH Q lcl|NC_019524. 
1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAA 80 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~ 80 (556) |+.....++.+-.+. ..-|.|-.. ..... ......+. ... | --+++++ T Consensus 2 ~~~~~~~~~~r~~~l---------------~~yy~g~~~--~~~~~--~~~~~~~~---~~~-----k-----i~~n~~~ 49 (440) T protein:vir:95 2 LAAFLGSQKQRLAIL---------------ASYAQGDNF--SILSG--HRRLDDEK---ADY-----R-----VRHKWGG 49 (440) T ss_pred hhhHHHHHHHHHHHH---------------HHHhccCCc--ccccc--cccccccC---Ccc-----e-----eecchHH Confidence 333333332222111 112443211 00000 00000000 000 0 1378999 Q ss_pred HHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCce Q lcl|NC_019524. 81 GVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEV 160 (556) Q Consensus 81 ~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~ 160 (556) -+|++.+.++.|.+++..... +.++ +..+. |++|++. .+|...+..+.+..+.-|.+ T Consensus 50 ~ivd~~~~~l~g~~~~~~~~~---------~~~~---~~~~~-l~~~~~~----------n~~~~~~~~~~~~~~~~G~a 106 (440) T protein:vir:95 50 YISSFATGYVIGNPVSIGVME---------GGSA---DQLST-IKDIEWQ----------NDINALNSDLAFDASVYGRA 106 (440) T ss_pred HHHHhhhhheeccCceEeeCC---------CccH---HHHHH-HHHHHHh----------cCHhHHHHHHHHHHhhcCeE Confidence 999999999999887765321 1122 22222 3333322 25778888888899999999 Q ss_pred EEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccc---cCCccccc Q lcl|NC_019524. 161 LATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPT---DMEQWKWG 237 (556) Q Consensus 161 f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~---~~~~~~~~ 237 (556) |+++.... . + .+++..++|+.+-.-++......+..+|.+=......-++|+...---.+ ......|. 
T Consensus 107 ~~~~~~d~-~-----~---~~~i~~~~p~~~~~~~d~~~~~~~~~~i~~~~~~~~~~~~vyt~~~~~~~~~~~~~~~~~~ 177 (440) T protein:vir:95 107 YEYHFRDK-D-----K---VDRVVLISPLEMFVIRDLTVEQNIIAAVHLPIYADKVNMTVYTKDKVITYKPYSNNSVRLV 177 (440) T ss_pred EEEEEecC-C-----C---ceEEEEEcccceEEEEcCCCCCceEEEEEEEEecCceEEEEEeCCeEEEEEEecCCcccee Confidence 98864322 1 1 14788999998864343333345666666421111111222221100000 00001111 Q ss_pred eeecc-ccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCccccccccccccc Q lcl|NC_019524. 238 YEPAR-FDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQ 316 (556) Q Consensus 238 rv~~~-~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~ 316 (556) .+... ..++.==|+|+... .-|+|.|.+++..+..++.-..-......-.+.-..+++-....... . T Consensus 178 ~~~~~~~~~g~vPvv~~~n~-----~~g~sd~e~v~~lida~~~~~s~~~~~~~~~~~~~~v~~g~~~~~~~-------~ 245 (440) T protein:vir:95 178 VDDVKKHSYNDVPVVEWWNN-----RFRMGDYESEISLIDAYDAGQSDTANYMSDLNDAMLLVKGDLDGIKL-------S 245 (440) T ss_pred ecceeeccCceeeEEEeeCC-----CCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcceeeeecccccCCC-------C Confidence 11100 00110013444332 24889999988877666554433333333323222333321110000 0 Q ss_pred ccccccccccccccccccccccceecCC-ceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCH---HHhhc Q lcl|NC_019524. 317 GGFKEIFNEYMTGLANYVAQTKNIAIDG-AKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSY---EQFSR 392 (556) Q Consensus 317 ~~~~~~~~~~~~~~~~~~~~~~~~~l~p-G~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~y---e~l~~ 392 (556) ++.... ........+.. 
........|-++++++.+.+...+..+++.+.+.|....++|- +.+++ T Consensus 246 ~e~~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~ 314 (440) T protein:vir:95 246 PEDAAK-----------MKDANMLFLKTGISTTGQQTTADASYIYKQYDVNGTEAYKNRLANDIHRFSRIPNLDDDRFNS 314 (440) T ss_pred ccchhh-----------hhhccceecccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccc Confidence 000000 00000000101 1111134556788998888889999999999999999888773 33444 Q ss_pred hhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcc Q lcl|NC_019524. 393 DYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASR 472 (556) Q Consensus 393 D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~ 472 (556) +. |-.+.+..+..........|..|-..+ +.+++..+. ++...- +... + ...+.+.|..+-. T Consensus 315 n~---Sg~Al~~~~~~l~~k~~~k~~~~~~~l-~~~~~li~~--~~~~~~----~~~~----~----~~~v~i~f~~~~p 376 (440) T protein:vir:95 315 TS---SGIALLYKMIGLEQVRKDKETYFTKAL-RRRYELISN--IHKAIN----GPVI----E----ANKLTFTFHPNIP 376 (440) T ss_pred cc---hHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHH--HHhhcC----Cccc----c----cccceEEeCCCCC Confidence 33 444666666666666666666554443 334443222 222110 0000 0 1124567754443 Q ss_pred cccchhhhhHHHHHHHHcCCCCHHHHHHHhC-CCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCC Q lcl|NC_019524. 473 GQIDEKKETEAAILRIKNGLSTYEAEISRLG-GDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPN 551 (556) Q Consensus 473 ~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G-~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~ 551 (556) .|-...+++..+. .|+.|.+..+...+ .|.++-++++.+|++......-. .. ...++.. T Consensus 377 --~~~~~~ad~~~kl--~g~iS~et~~~~l~~~d~~~E~~ri~~E~~~~~~~~~~------------~~----~~~~~~~ 436 (440) T protein:vir:95 377 --QDVWTEIKAYIEA--GGEISQETLMENASFTDYKTEHSRILKQGGSSDLEIGQ------------IV----GDADVGQ 436 (440) T ss_pred --CCHHHHHHHHHHH--hccCcHHHHHHhCCCCCcHHHHHHHHHHHHHhhhhHHh------------hc----cCCCCCC Confidence 4666666665554 68999999999885 46666667777776543211100 00 0011111 Q ss_pred CcCC Q lcl|NC_019524. 
552 EETT 555 (556) Q Consensus 552 ~e~~ 555 (556) +|.| T Consensus 437 ~~~e 440 (440) T protein:vir:95 437 ADTE 440 (440) T ss_pred cCCC Confidence 1111 No 173 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=98.79 E-value=4.8e-08 Score=60.72 Aligned_cols=446 Identities=9% Similarity=-0.009 Sum_probs=197.6 Q ss_pred CCcchhhhHHHHHhh----Hhhcc-------cc-hhhhhh---hhcchhccccCCCcccccccCCCCCHHHHHHHHHHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKA----VDVVA-------ET-ATATPM---AVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMA 65 (556) Q Consensus 1 ~sp~~~~~r~~a~~a----~~~~~-------~~-~~~~~~---~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~l 65 (556) |+=-..+++..-+-. +...- +. ....+. .....|..+.. .|..... .+ ... T Consensus 1 m~~~~~ik~~~~~~~~~~~~~~~~~~i~d~~~i~~~~~~~~~i~~~~~~Y~g~~------~~l~~~~-~~----~~~--- 66 (505) T protein:vir:79 1 MAFWDTLKNLFRKGSAAVGMTKSLGQIIDDPRINLPADEVERIARDKRYYMDDF------KQVTHKN-SY----GDT--- 66 (505) T ss_pred CchHHHHHHHHHHhhhhhcchhhhhhhhcccCCCCCHHHHHHHHHHHHHhcCCC------ccccccc-cC----CCc--- Confidence 332222222221100 00000 00 000000 00011111100 0000000 00 000 Q ss_pred HHHHHHHHhcChHHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHH Q lcl|NC_019524. 66 SARAQDMVQNDGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTG 145 (556) Q Consensus 66 r~RaRdl~rNn~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~ 145 (556) +.| ....-++++.+++.+++.|.|.-.++.. + + +...+.|+++.+. .+|.. 
T Consensus 67 --~~~-~~~slnl~~~i~~~~A~ll~~e~~~i~~--~-------d-------~~~~e~l~~i~~~----------n~f~~ 117 (505) T protein:vir:79 67 --QKH-ELQSVNVTKLASAKLASLIFNEQCQVTV--S-------D-------ETANDFLDDVFQQ----------NDFYT 117 (505) T ss_pred --ccc-ceeecchHHHHHHHHHhhhcCCCceeec--C-------C-------hHHHHHHHHHHHh----------ccHHH Confidence 000 0112268999999999999997433332 1 1 1222334444432 14666 Q ss_pred HHHHHhhhheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhc-CCCCCCCCCceEEEEEEEC-----CCCCeEEEE Q lcl|NC_019524. 146 LTRLAVSGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRM-SNPNNVMDTPNLRSGVQLD-----NNGAALGYW 219 (556) Q Consensus 146 lq~l~~r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl-~~~~~~~~g~~i~~GIE~d-----~~Gr~vaY~ 219 (556) ...-++...+.-|..++++.+-. + ..+|..+.|+.+ +...+ .+.+...+.+- ..+.-+=|. T Consensus 118 ~~~~~~e~a~a~G~~~~k~~~D~--~--------~~~i~~v~ad~~~P~~~d---~~~~~~~a~~~~~~~~~~~~~~~yt 184 (505) T protein:vir:79 118 TFEEKLEEWIALGSGCVRPYVDS--G--------KIKLAWATADQVYPLQAD---TNQVNELAIASRTTEVENHRTIYYT 184 (505) T ss_pred HHHHHHHHHhhcCCeEEEEEEeC--C--------ceEEEEEcCCeeEEEEEc---CCCeEEEEEEEEEEEecCCcceEEE Confidence 66666666777777776765531 1 257888888864 32111 12222222210 001111121 Q ss_pred EeecC-----------------CCcccc----CC-ccccceeecc---ccCChhHeEeeecc----cCCCcccCCchhhH Q lcl|NC_019524. 220 LRKAF-----------------PGDPTD----ME-QWKWGYEPAR---FDWGRRRVIHIIEA----LLAGQTRGISEMVS 270 (556) Q Consensus 220 i~~~h-----------------pgd~~~----~~-~~~~~rv~~~---~~v~a~~viH~f~~----~r~gQ~RGvs~la~ 270 (556) +...| ..+..+ .. -.+|..+... ..++++-+.|+.-+ ...+..-|+|.|+. T Consensus 185 ~lE~h~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~ 264 (505) T protein:vir:79 185 LLEFHQWDHGDYVITNELYRSEAAETVGINVPLNSLEQYEGLEPQVKITGLKHPLFAFYRNKGANNKNFTSPMGMSLIDN 264 (505) T ss_pred EEEEEEecCceEEEEEEEEecCCCCccCcccchhhcccccccCcceeecCCCcceEEEecCCcccccccCCccCCchhhh Confidence 22222 111000 00 0112222111 23556666665432 34455679999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcceeeeE-----eccCcccccccccccccccccccccccccccccccccccceecCCc Q lcl|NC_019524. 
271 ALKQMKMTRNFQEITLQNAVVNATYAASV-----ESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGA 345 (556) Q Consensus 271 ~l~~l~~l~~~~dael~~a~i~A~~~~fi-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG 345 (556) ++..+..|+.-.+.-...-+. +.-..+| ++...... .......+. +... ..-.......+ T Consensus 265 ~~~~id~lD~~~s~~~~e~~~-g~~~i~v~~~~l~~~~~~~~-------~~~~~~~~~--fd~~----~~~y~~~~~~~- 329 (505) T protein:vir:79 265 SYTVIDAINRTHDQFVDEVKK-GQRRLIVPAEWLKTGSSYGG-------QASETHPPM--FDPD----ETVYQAMYGDA- 329 (505) T ss_pred hHHHHHHHHHHHHHHHHHHHh-cccceeechHHhcccCCCCc-------ccccccccC--CCcc----ceeeeeccCCC- Confidence 999999998754433332222 2222232 21111100 000000000 0000 00000001111 Q ss_pred eeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccchhHHHHHHHHHHHHHHHHH--HHHHH Q lcl|NC_019524. 346 KIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSARASMAETQKYMDSRKK--LVADR 423 (556) Q Consensus 346 ~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~--~lv~~ 423 (556) .+..++.++|.-+..+|..-...+++.|....|+++..++-|-+++ .++-+...+..+.+..+.. ..+.. T Consensus 330 ------~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~--~TAtei~s~~~~l~~t~~~~~~~~~~ 401 (505) T protein:vir:79 330 ------SEVGFHDATSPIRVADYQATMDFFLREFENQTGLSQGTFTTSPSGI--QTATEVVTNNSQTYQTRSSYITQVEK 401 (505) T ss_pred ------CCCceEEecccCCHHHHHHHHHHHHHHHHHHhCCChhhcCCCcccc--chHHHHHHHHhHHHHHHHHHHHHHHH Confidence 1345888998888888999999999999999999999887664432 2343443344444443332 23455 Q ss_pred HHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHh- Q lcl|NC_019524. 424 FASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRL- 502 (556) Q Consensus 424 ~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~- 502 (556) .++.+....++.+-+-+..+. +.+... ......-+.+.| .--..+|+.++++.....+.+|++|.+..+.+. 
T Consensus 402 al~~li~~i~~~~~~~~~~~~-g~~~~~----~~~~~~~i~v~f--~d~i~~d~~~~~~~~~~~v~~Gi~s~e~~l~~~~ 474 (505) T protein:vir:79 402 TIKALTYAILELASVPSFYAD-GQARWT----GDVDSLDITINF--NDGVFVDQESKRAADLQAVQAQVMPKKQFLMRNY 474 (505) T ss_pred HHHHHHHHHHHHHHHhccccc-cccccc----CCCCceeEEEEe--CCCCCCCHHHHHHHHHHHHHcCCCCHHHHHHhcC Confidence 556666666665544332110 000000 000000123344 233357999999999999999999999988777 Q ss_pred CCCHHHHHHHHHHHHHHHHH-cCCCCCccccccCCCCC Q lcl|NC_019524. 503 GGDFREVFKQRAREEGLIKS-LKLDFTGKMVEGNSTQS 539 (556) Q Consensus 503 G~D~e~v~~q~a~E~~~~~~-~Gl~~~~~~~~~~~~~~ 539 (556) |.|-+++.+++++ +++ .+...| ....-.+. T Consensus 475 ~~~eeea~~el~r----i~~E~~~~~p---~~~~~gg~ 505 (505) T protein:vir:79 475 GLDEEEADEWLAQ----IDAENSTAEP---EFNQFGGD 505 (505) T ss_pred CCChHHHHHHHHH----HHHhccccCC---CchhccCC Confidence 8887776443332 211 111100 00000000 No 174 >protein:vir:100328 Length: 346 # NCBI annotation: capsid portal protein Q # Family: family:all:196 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655469;genbank:gi:109289937;genbank:GeneID:4157371 Probab=98.76 E-value=3.4e-09 Score=66.98 Aligned_cols=326 Identities=10% Similarity=0.071 Sum_probs=146.5 Q ss_pred chhhhHHHHHhhHhhcccchhhhhhhhcchhcccc------------CCCcccccccCCCCCHHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 4 VKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAE------------RTTREMFQWNPSIISPDQQIAQNQDMASARAQD 71 (556) Q Consensus 4 ~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~------------~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRd 71 (556) |+...+..+.+ .....+ . .....+++.... ..-.....|..++-+... | | . 
T Consensus 1 m~~~~~~~~~~-~~~~~~--~--~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~pp~~~~~--------l---a-~ 63 (346) T protein:vir:10 1 MKKQLRKNLTQ-NDRLQP--Q--AQTEIFSFGDPIPVLDRADILNYLECSAMYEKWYNPPMSFDG--------L---A-K 63 (346) T ss_pred CCcccCCCCCc-cccccc--c--cCeEEEecCCcceecCchhHHHHHHHhhcCCceEecCCCHHH--------H---H-H Confidence 33322111100 000000 0 000001110000 000011223222222211 0 1 1 Q ss_pred HHhcChHHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHh Q lcl|NC_019524. 72 MVQNDGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAV 151 (556) Q Consensus 72 l~rNn~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~ 151 (556) |.+.|++-.+++..-.+.+. . +.+.++ ..+|..++.+++. T Consensus 64 l~~~~~~h~~~i~~k~n~l~------------------------------~----l~~~Pn------~~~t~~~f~~~~~ 103 (346) T protein:vir:10 64 SLRSSTHHESAIITKANILL------------------------------S----TCEVDS------RYLSRRDLSSFVK 103 (346) T ss_pred HHHhhhhcchhhhhhhhhHH------------------------------H----HHhCCC------CCCCHHHHHHHHH Confidence 23333433333322111111 1 122232 3556677777654 Q ss_pred hhheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccC Q lcl|NC_019524. 152 SGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDM 231 (556) Q Consensus 152 r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~ 231 (556) .++..|.+++.+.+... .-++.|..|.+..+.. + .+.++ ..|.+. .++. T Consensus 104 -d~ll~Gnay~~i~r~~~--------G~~~~L~pl~~~~v~~------------~--~~~~~--~~~~~~--~~~g---- 152 (346) T protein:vir:10 104 -DYLVFGNAYFEVVRNRL--------GQVQRIESPLAKYVRK------------G--LEAGQ--FYYVPQ--RFDH---- 152 (346) T ss_pred -HHHhcCCeEEEEEEcCC--------CcEEEEEEecCCceEE------------E--EcCCe--EEEEEE--ccCC---- Confidence 67889999988764321 1356788888877742 1 11111 112221 1111 Q ss_pred CccccceeeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccc Q lcl|NC_019524. 
232 EQWKWGYEPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQ 311 (556) Q Consensus 232 ~~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~ 311 (556) ....++..+|||+..+...++..|+|++..++..+.-...-+.....--+=.+...++++..++... T Consensus 153 ---------~~~~~~~~dIih~r~~~~~~~~~G~~~~~~a~~si~l~~~a~~~~~~~~~NG~~~~~il~~~d~~l~---- 219 (346) T protein:vir:10 153 ---------QEHEFAKGSIYHLLEPDINQDIYGLPQYLSALQSAWLNESATLFRRKYFLNGAHAGFVFYMSDASQK---- 219 (346) T ss_pred ---------eEEEEecccEEEecCCCCCCCeeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCC---- Confidence 0124567799999999877899999988877766554443333332222224445566654332110 Q ss_pred cccccccccccccccccccccccccccceecCCceeeecCC-----CceeeeecCCCCCccHHHHHHHHHHHHHHhcCCC Q lcl|NC_019524. 312 LGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYP-----GTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMS 386 (556) Q Consensus 312 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~p-----Ge~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ 386 (556) +++.+........ ..+ .-..|.+..|.| |-+++.++...-...|.+..+.....||+.+||| T Consensus 220 -----~e~~~~i~~~~~~----~~g----~~n~~~~~vl~~~~~~~gi~~~pis~~~~d~qf~e~k~~~~~~I~~af~VP 286 (346) T protein:vir:10 220 -----QEDVENIRQQLKQ----SKG----VGNFKNLFVHAPNGKKDGIQIIPIADVSAKDEFFNIKNVSRDDVLAAHRVP 286 (346) T ss_pred -----HHHHHHHHHHHHH----hcC----ccccCceeEecCCCCccceeEEecCCChhHHHHHHHHHHhHHHHHHHhCCC Confidence 1111111111100 000 012244455544 5567777766666778888999999999999999 Q ss_pred HHHhhchhhc--ccchhHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhh Q lcl|NC_019524. 387 YEQFSRDYTK--TNYSSARASMAETQKY-MDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALC 463 (556) Q Consensus 387 ye~l~~D~s~--~nYSs~R~~~~e~~r~-~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~ 463 (556) ...+ |...+ .+||++......+.+. +..++..|. ++.. ||-. 
+.+.|+ ....+ T Consensus 287 p~ll-G~~~~~~~~~s~~e~~~~~f~~~~l~P~~~~ie-e~n~-----~L~~-------------e~i~F~----~~~ll 342 (346) T protein:vir:10 287 PQLM-GIIPNNTGGFGNVADAAEVFFITEIEPLQERLK-EFNQ-----WLGQ-------------EVIKFK----PSKLL 342 (346) T ss_pred HHHh-cccCCCCCCcccHHHHHHHHHHHHHHHHHHHHH-HHHh-----hccc-------------ceeeec----hhhhc Confidence 9854 65543 4699888777776553 234444332 2111 1111 111121 12222 Q ss_pred Ceee Q lcl|NC_019524. 464 NAEW 467 (556) Q Consensus 464 ~~~w 467 (556) +..= T Consensus 343 ~~~~ 346 (346) T protein:vir:10 343 QRTQ 346 (346) T ss_pred ccCC Confidence 2211 No 175 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=98.75 E-value=6.4e-08 Score=60.01 Aligned_cols=455 Identities=9% Similarity=0.022 Sum_probs=196.0 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCC-Ccc---cccccCCCCCHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERT-TRE---MFQWNPSIISPDQQIAQNQDMASARAQDMVQND 76 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~-~r~---~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn 76 (556) .+-|+.. ..+.-...-........... ..-|...... .+. ..-|.... ++ .......+. T Consensus 4 ~~~~~~~--i~~w~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~w~~~~-~~-------------~~~~~~~~~ 66 (518) T protein:vir:78 4 WSVMTRF--IKGWLNGKPNGSEPELIPKY-LPLVPDNQKEWSKDSYLTSLWAQGY-VP-------------TVHDKLMNS 66 (518) T ss_pred hhhHHHH--HHHhhcCCCCccchhccHHH-hhhcccchhhhhhhhhhhhhcccCC-CC-------------ccccccccC Confidence 3333332 12221100000000000000 0001111000 000 01121110 11 111233577 Q ss_pred hHHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhhee Q lcl|NC_019524. 77 GYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLM 156 (556) Q Consensus 77 ~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~ 156 (556) ++++.+++.+++.|.|...+.... +.+...++.+++.++..++ . 
..|+....-.+...+. T Consensus 67 ~l~~~i~~~~A~ll~~e~~~i~v~------~~~~~d~e~~~~~l~~il~----~----------n~f~~~~~~~~e~a~a 126 (518) T protein:vir:78 67 GTGNEIVVVAAEYISGKPLSIDVT------GVNGSKDENLTKQLKEALR----I----------DNFDSKSVKIVELAGG 126 (518) T ss_pred ChHHHHHHHHHHhhcCCCceEEec------CccccCcHHHHHHHHHHHH----h----------ccHHHHHHHHHHHhhc Confidence 889999999999999975443321 1111112333444444333 1 2456655555556666 Q ss_pred cCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEEC---CCCCeEEEEEeecCC-------- Q lcl|NC_019524. 157 TGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLD---NNGAALGYWLRKAFP-------- 225 (556) Q Consensus 157 dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d---~~Gr~vaY~i~~~hp-------- 225 (556) -|.++++..|... -.++..++++.+- |. ..+| .+..-|-+. ..++..-|+....|- T Consensus 127 ~G~~~~k~~~d~~----------~~~i~~v~ad~~~-P~-~~~g-~~~~~~f~~~~~~~~k~~~y~~lE~he~~~~~~~~ 193 (518) T protein:vir:78 127 SGVSAVKINILNG----------RPSISVHSSSQFW-ID-FKNN-EPFRFNFFEEIPTSNKADIYYLVESREIKQWDKEG 193 (518) T ss_pred cCceEEEEEEECC----------eeEEEEEcCCeeE-EE-eecC-cEEEEEEEEEeecCCcceeEEEEEeecccccccee Confidence 6777777666321 1478888888774 22 1222 233322211 123333444333331 Q ss_pred -----------------CccccCC----------ccccceeecc----ccCChhHeEeeecc----cCCCcccCCchhhH Q lcl|NC_019524. 226 -----------------GDPTDME----------QWKWGYEPAR----FDWGRRRVIHIIEA----LLAGQTRGISEMVS 270 (556) Q Consensus 226 -----------------gd~~~~~----------~~~~~rv~~~----~~v~a~~viH~f~~----~r~gQ~RGvs~la~ 270 (556) ++..... ...|..++.. .-.+..-+.|++.+ ...+-.-|+|.|+- T Consensus 194 ~~~~~~~I~n~ly~~~~~~~v~~~~~~~~~~l~~~~~~~~~~e~~~~~tg~~~~~~~~~~n~~~N~~~~~splG~S~~~~ 273 (518) T protein:vir:78 194 KKLSGGFVTYSVIKIDGDKTTPISAERLPEQITSYLHTNDIQLNHSVSIGLKSMGAYLINNSPSNTRYPHLNLGESDLSQ 273 (518) T ss_pred ecccceeEEEEEeeecCcccccccccccccccccccccccCccceeeccCCccceEEeeccccccccccCCCcCcchHhh Confidence 1110000 0011111100 01112223343322 12234459999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCccccccccccccccc-ccccccccccccccccccccceecCCceeee Q lcl|NC_019524. 
271 ALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGG-FKEIFNEYMTGLANYVAQTKNIAIDGAKIPH 349 (556) Q Consensus 271 ~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~ 349 (556) ++..+..|+.-.+.-...-+. +.-..||-...= ...+.+.+. ....+.. . ...-..+ .| . T Consensus 274 ~~~~id~lD~~~s~~~~e~~~-g~~~i~v~~~~l-----~~~~~~~~~~~~~~fd~---~------~~~y~~i-~~---~ 334 (518) T protein:vir:78 274 CTNYLFAVDYFFTVYMREGEK-TKTKIAASERMF-----RKKVNKSTDKEEWSMNV---D------EDYFMQF-KG---T 334 (518) T ss_pred hhHHHHHHHHHHHHHHHHHHh-CCceeeechhHh-----ccCCCCCCCccccccCC---C------CceEEEe-cC---c Confidence 999999998877654444333 444444421110 000000000 0000000 0 0000000 01 1 Q ss_pred cCCCc----eeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 350 LYPGT----KLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSARASMAETQKYMDSRKKLVADRFA 425 (556) Q Consensus 350 L~pGe----~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~ 425 (556) +..|. .|+.++|.-+..+|..=+..+|+.|..+.|+||..+..|-...+=+.++.....-.+.+...|.. ++..+ T Consensus 335 ~~~~~~~~~~i~~~~~~Ir~e~~~~~~~~~l~~~~~~~G~s~~tfg~~~~~~TATei~s~~~~~~~t~~~~~~~-~e~al 413 (518) T protein:vir:78 335 LDAGAKLNDMIQFMQGDFRDGSYRETMEYFAQKAVSKSGYNPATFNLGNREVKATEIWSLQDATVRKIEKKKRL-IQNVY 413 (518) T ss_pred CCCCCccccceeeeecccChHHHHHHHHHHHHHHHHhhCCChhhcCcccccccHHHHHHHHHHHHHHHHHHHHH-HHHHH Confidence 11222 47888888888889998999999999999999998865432322223333333333333334433 33434 Q ss_pred HHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHh--C Q lcl|NC_019524. 426 SAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRL--G 503 (556) Q Consensus 426 ~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~--G 503 (556) .-+....++.+ ............ 
....-+.+.|-- -..+|+..+++.....+.+|++|.+..++++ | T Consensus 414 ~~l~~~i~~l~--~~~~~~~~~~~~-------~~~~~v~i~f~D--~i~~D~~~~~~~~~~~v~aGimS~e~~i~~~~~~ 482 (518) T protein:vir:78 414 EQMLWDFLYLL--TGGTNNKEKAIM-------RDEIRVIIEFPD--PMSVNLNELSSTLNNMNSALAMSVEEKVKLIHPK 482 (518) T ss_pred HHHHHHHHHHH--HhhcCccccccC-------CCceeEEEEeCC--CCCCCHHHHHHHHHHHHhcCCCCHHHHHHHhCCC Confidence 44444333321 211110000000 000112345533 3457999999999999999999999999886 4 Q ss_pred CCHHHHHHHHHHHHHHHHHcCCCCCccc---cccCCCCC Q lcl|NC_019524. 504 GDFREVFKQRAREEGLIKSLKLDFTGKM---VEGNSTQS 539 (556) Q Consensus 504 ~D~e~v~~q~a~E~~~~~~~Gl~~~~~~---~~~~~~~~ 539 (556) .|-+++.+++++=++ |.+......| ....+.+. T Consensus 483 ~~deea~~e~~ri~~---E~~~~~~~~p~~~~g~~~~~g 518 (518) T protein:vir:78 483 WEDEEIQAEVKRIYL---ENAIGEVPDPEAIGGMETKGG 518 (518) T ss_pred CCHHHHHHHHHHHHH---HhcccCCCCCccccCCCCCCC Confidence 555544443333111 2221111111 11111111 No 176 >protein:vir:78191 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111155;genbank:gi:134288732;genbank:GeneID:4960651 Probab=98.75 E-value=2.7e-09 Score=67.59 Aligned_cols=325 Identities=10% Similarity=0.042 Sum_probs=155.3 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcch---------------hccccCCCcccccccCCCCCHHHHHHHHHHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGG---------------MEGAERTTREMFQWNPSIISPDQQIAQNQDMA 65 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~---------------y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~l 65 (556) ||--|..+|...-.++......... .+..... |...-. +..|..++-+... 
T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~p~~v~~~~~~~~~~~~~~----~~~~~~pp~~~~~--------- 66 (351) T protein:vir:78 1 MSKRRSRAPRTFAAAPNPSAGSAAP-ARAEVFTFDDPTPVMNRAEILDYVECWS----NGEWFEPPVSFAG--------- 66 (351) T ss_pred CCCCCCCCCCCCCCCCchhhhhccc-ceeEEEEcCCceeecCcchhhhhhhhhc----cCceecCCCCHHH--------- Confidence 5533332221110100000000000 0000111 111110 1223333333221 Q ss_pred HHHHHHHHhcChHHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHH Q lcl|NC_019524. 66 SARAQDMVQNDGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTG 145 (556) Q Consensus 66 r~RaRdl~rNn~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~ 145 (556) --.|.+-|++..++|..-++.+.++ ++|. .-+|.++ T Consensus 67 ---la~~~~~~~~h~~~l~~k~n~l~~~-~~Pn----------------------------------------~~~t~~~ 102 (351) T protein:vir:78 67 ---LAKSFRASTHHSSALFFKANVLAST-FRPH----------------------------------------RWLSRHA 102 (351) T ss_pred ---HHHHHhhhHhhhhhhhhhhhHHhhc-ccCC----------------------------------------CCCCHHH Confidence 1134556677777776655554442 2221 2235555 Q ss_pred HHHHHhhhheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCC Q lcl|NC_019524. 146 LTRLAVSGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFP 225 (556) Q Consensus 146 lq~l~~r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hp 225 (556) |.+++. ..+..|.+++.++... ...++.|..|.+..+.. +. |.. .|++.... T Consensus 103 f~~~~~-d~ll~Gnay~~~~rn~--------~G~~~~L~pl~~~~v~~------------~~--~~~----~~~~~~~~- 154 (351) T protein:vir:78 103 FERWAL-DFLTFGNGYLERRRNM--------VGGTLRLEPALAKYVRR------------KA--DFS----GFVYVNGW- 154 (351) T ss_pred HHHHHH-HHHhcCCeEEEEEECC--------CCCEEEEEEecCcceEE------------ee--eCC----eEEEEecC- Confidence 655554 5667799998876432 12356788888776632 11 111 24433210 Q ss_pred CccccCCccccceeeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHH---hcceeeeEecc Q lcl|NC_019524. 
226 GDPTDMEQWKWGYEPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVV---NATYAASVESE 302 (556) Q Consensus 226 gd~~~~~~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i---~A~~~~fi~~~ 302 (556) + ....+++.+|||+..+.--.+..|+|++..++..+. --..+++-+.+. .+...++++.. T Consensus 155 ~--------------~~~~~~~~eVihir~~~~~~~~yGl~~~~~a~~si~---l~~~a~~~~~~~f~NGa~pggIl~~~ 217 (351) T protein:vir:78 155 Q--------------ERHEFAPDSVFQLVRPDINQEVYGLPEYLSSLHSAW---LNESSTLFRRKYYENGSHAGFILYMT 217 (351) T ss_pred C--------------eEEEEccccEEEEcCCCCCCCcccccHHHHHHHHHH---HHHHHHHHHHHHHhccCCCceEEEec Confidence 1 012466789999998877789999998877766543 334455555544 34445555543 Q ss_pred CcccccccccccccccccccccccccccccccccccceecCCceeeec-----CCCceeeeecCCCCCccHHHHHHHHHH Q lcl|NC_019524. 303 LPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHL-----YPGTKLKMQPAGTPGGVGTDYEQSLLR 377 (556) Q Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L-----~pGe~i~~~~~~~p~~~f~~F~~~~lr 377 (556) .+... ++..+........ ..+. =..|.+..+ +.|-+++.++......+|.+..+.... T Consensus 218 ~~~ls---------~e~~~~lr~~~~~----~~G~----~N~~~~~v~~~~g~~~g~k~~pls~~~~d~qf~e~k~~~~~ 280 (351) T protein:vir:78 218 DAAQK---------QDDVDNMRDALKN----AKGP----GNFRNVFMYAPGGKKDGIQLIPVSEVAAKDEFFNIKNVTRD 280 (351) T ss_pred CCCCC---------HHHHHHHHHHHHH----hcCc----ccccceeeecCCCCccceeEEEcCCChhHHHHHHHHHHhHH Confidence 22110 1111111111100 0000 122444444 346678887777677889999999999 Q ss_pred HHHHhcCCCHHHhhchhhc--ccchhHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCccccccc Q lcl|NC_019524. 378 NIAASLGMSYEQFSRDYTK--TNYSSARASMAETQKYM-DSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFY 454 (556) Q Consensus 378 ~iaaglGi~ye~l~~D~s~--~nYSs~R~~~~e~~r~~-~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~ 454 (556) +||+.+|||...+ |...+ .+||++.+....|.+.. ..+|..|. ++. . ||.. 
+.+.|+ T Consensus 281 eIa~a~~VPp~ll-Gi~~~~t~~~sn~e~~~~~f~~~~l~P~~~~ie-e~n----~-~l~~-------------~~~~F~ 340 (351) T protein:vir:78 281 DLLAAHRVPPQLL-GIVPSNSGGFGTPDTAARVFGRNEIRPLQARFA-ELN----D-WLGD-------------EVVRFD 340 (351) T ss_pred HHHHHhCCCHHHh-cccCCCCCCcccHHHHHHHHHHHHHHHHHHHHH-HHH----h-hcCc-------------cceecC Confidence 9999999998754 66544 56999988888776543 34444332 111 1 2111 112222 Q ss_pred chhhHHHhhCe Q lcl|NC_019524. 455 DPMMRDALCNA 465 (556) Q Consensus 455 ~~~~~~a~~~~ 465 (556) .....+.-.++ T Consensus 341 ~~~Llr~d~ka 351 (351) T protein:vir:78 341 DYEIPPAPVAA 351 (351) T ss_pred hhhhccccccC Confidence 22211111111 No 177 >protein:vir:98567 Length: 340 # NCBI annotation: gp1 # Family: family:all:196 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958056;genbank:gi:41057353;genbank:GeneID:2744238 Probab=98.75 E-value=4e-09 Score=66.60 Aligned_cols=322 Identities=11% Similarity=0.068 Sum_probs=153.2 Q ss_pred CCcchhhhHHHHHhhHhhcccchh-----hhhhh---hcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETAT-----ATPMA---VGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDM 72 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~-----~~~~~---~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl 72 (556) || |..+|.++.......++... +.... ....|-... .+..|..++-+... - ..| T Consensus 1 m~--~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~----~~~~~~~pp~~~~~-----------l-a~l 62 (340) T protein:vir:98 1 MS--KRKPRKAVAMTASAPQKMEAFTFGEPVPVLDKRDILDYVECI----SNGKWYEPPVSFSG-----------L-AKS 62 (340) T ss_pred CC--CCCCCccccccccCccceeEEEcCCceeecCcchhhhhhhhh----hcCceecCCCCHHH-----------H-HHH Confidence 55 22222221110000000000 00000 000111110 11223333333221 1 234 Q ss_pred HhcChHHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhh Q lcl|NC_019524. 
73 VQNDGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVS 152 (556) Q Consensus 73 ~rNn~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r 152 (556) .+-|++-.++|...++.+.++ ++| ...+|..++.+++. T Consensus 63 ~~a~~~h~s~i~~k~n~l~~~-~~P----------------------------------------n~~lt~~~f~~~~~- 100 (340) T protein:vir:98 63 LRSAVHHSSPIYVKRNVLAST-YIP----------------------------------------HPLLSRQDFSRFAL- 100 (340) T ss_pred HHhccccchhhhhhhhHHhhc-cCC----------------------------------------CCCCCHHHHHHHHH- Confidence 455566666665544444331 222 22334445555554 Q ss_pred hheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCC Q lcl|NC_019524. 153 GFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDME 232 (556) Q Consensus 153 ~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~ 232 (556) ..+..|.+++.+++.. ..-++.|..+.+..+.. ...|. ..|++. . ++ T Consensus 101 d~ll~Gnay~~~~rn~--------~G~~~~L~pl~~~~vr~----------------~~~~~-~~~~~~-~--~~----- 147 (340) T protein:vir:98 101 DYLVFGNAFLEQRHSV--------TGQLIKLLTSPAKYTRR----------------GVDDS-VFWFVE-N--FT----- 147 (340) T ss_pred HHHhcCCeEEEEEECC--------CCcEEEEEEeCCceEEE----------------cccCc-EEEEEe-c--CC----- Confidence 5677899998876432 12356777777666532 12222 123332 1 11 Q ss_pred ccccceeeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHH---hcceeeeEeccCcccccc Q lcl|NC_019524. 233 QWKWGYEPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVV---NATYAASVESELPSDVVF 309 (556) Q Consensus 233 ~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i---~A~~~~fi~~~~~~~~~~ 309 (556) ....+++.+|||+..+.-..+..|+|++..++..+- -=..+++-+.+. .|.-.++++.+.+... 
T Consensus 148 --------~~~~~~~~eViHir~~~~~~~~~Gls~~~~a~~si~---l~~aa~~~~~~~f~NGa~pg~il~~~~~~ls-- 214 (340) T protein:vir:98 148 --------QPHEFAPDTVFHLLEPDINQEIYGLPEYLSALNSAW---LNESATLFRRKYYQNGAHAGYIMYVTDPAQS-- 214 (340) T ss_pred --------eEEEEccccEEEEcCCCCCCCcccccHHHHHHHHHH---HHHHHHHHHHHHHhccCCCceEEEecCCCCC-- Confidence 012356789999998876788999999988776543 223455555554 2334444543322111 Q ss_pred cccccccccccccccccccccccccccccceecCCceeeec-----CCCceeeeecCCCCCccHHHHHHHHHHHHHHhcC Q lcl|NC_019524. 310 GQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHL-----YPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLG 384 (556) Q Consensus 310 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L-----~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglG 384 (556) +++.+........ ..+. -..|.+..| +.|-+++.++......+|.+..+.....||+.+| T Consensus 215 -------~e~~~~lk~~~~~----~~G~----~n~~~~~vl~~~g~~~g~~~~pls~~~~d~qf~e~k~~~~~eIa~a~~ 279 (340) T protein:vir:98 215 -------ATDVESLRDAMRN----SKGL----GNFKNLFFYSPNGKPDGIKIVPLSEVATKDDFFNIKKASAADLMDAHR 279 (340) T ss_pred -------HHHHHHHHHHHHH----hcCc----cccCceeEecCCCCccceEEEEcCCChhHHHHHHHHHhhHHHHHHHhC Confidence 1111111111100 0000 012344444 4577788888777788999999999999999999 Q ss_pred CCHHHhhchhhc--ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHh Q lcl|NC_019524. 385 MSYEQFSRDYTK--TNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDAL 462 (556) Q Consensus 385 i~ye~l~~D~s~--~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~ 462 (556) ||.. +.|+..+ .+||++.+....|++.. +.|+.+.|.+ ++..|.. +.+.|. .... T Consensus 280 VPp~-llGi~~~~t~~~sn~e~~~~~f~~~~-----------l~Pl~~~iee---~n~~L~~----e~~rF~----~~~l 336 (340) T protein:vir:98 280 VPFQ-LMGGKPENIGSLGDVEKVAKVFVRNE-----------LSPLQDRFRE---VNDWLGM----EVIRFK----EYTL 336 (340) T ss_pred CCHH-HhcccCCCCCccccHHHHHHHHHHHH-----------HHHHHHHHHH---HHhcccc----cccccC----cccc Confidence 9987 5566543 46998877777665543 3444444321 1122210 111121 1233 Q ss_pred hCee Q lcl|NC_019524. 
463 CNAE 466 (556) Q Consensus 463 ~~~~ 466 (556) ++.+ T Consensus 337 ~~~d 340 (340) T protein:vir:98 337 DNPE 340 (340) T ss_pred ccCC Confidence 3443 No 178 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=98.75 E-value=6.5e-08 Score=59.99 Aligned_cols=418 Identities=7% Similarity=-0.036 Sum_probs=191.5 Q ss_pred hcccchhhh-h------hhhcchhccccC--CCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHh Q lcl|NC_019524. 18 VVAETATAT-P------MAVGGGMEGAER--TTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVVAVHRD 88 (556) Q Consensus 18 ~~~~~~~~~-~------~~~~~~y~aa~~--~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v~~~~~ 88 (556) +-.....-. . ......|....+ .+.+ .-++......+ ........-.+..+.--.+++++-+|+..+. T Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~-~I~~~~~~~~~--~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~ 77 (470) T protein:vir:10 1 MELDALKKLIQNTSTSRNDLINNYKQAVNYYENKT-DITTRNNGKAK--LNKEGKKDPLRSADNRIPSNFYQLLVDQEAG 77 (470) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccc-chhccccchhc--ccccccccccccCCcccccchHHHHHHhhhh Confidence 000000000 0 000000110000 0000 00000000000 0000000000000001137788899999999 Q ss_pred hhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEEEeecc Q lcl|NC_019524. 89 SIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLATCEWLN 168 (556) Q Consensus 89 nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~ 168 (556) +++|..++..+.- ....+.+...|+. +|........+.+...|.++..+.+.. 
T Consensus 78 yl~G~p~~~~~~d------------~~~~~~l~~~~~~---------------~~~~~~~~l~~~~~~~G~a~~~~y~d~ 130 (470) T protein:vir:10 78 YVASVFPDIDVGK------------DADNKKIIDVLGD---------------DRALTLNGLLVDSSNAGRAWLHYWIDE 130 (470) T ss_pred heeccceeeecCc------------hHHHHHHHHHHhh---------------hHHHHHHHHHHHHhhcCeeEEEEEecC Confidence 9999887765421 1223334333321 223333334566778899998764422 Q ss_pred CCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEE----CCCCCeE--EEEEeecCCCccccCC---------- Q lcl|NC_019524. 169 PTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQL----DNNGAAL--GYWLRKAFPGDPTDME---------- 232 (556) Q Consensus 169 ~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~----d~~Gr~v--aY~i~~~hpgd~~~~~---------- 232 (556) .+ -+++.+++|..+---+.....+.+..+|.+ |..+... .|.++..+--..+... T Consensus 131 -~~--------~~~~~~~~p~~~~~v~d~~~~~~~~a~ir~y~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~ 201 (470) T protein:vir:10 131 -DG--------NFRYGIIQPDQITPIYATTLDNKLLGILRSYKQLDPDSGKYFTVHEYWTDKEAQFFRTNATDSTVIEPY 201 (470) T ss_pred -CC--------ceEEEEEcccceEEEEcCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCcEEEEEeecCcceecccc Confidence 21 268899999877543443444556667754 3444322 2223221111110000 Q ss_pred --------------------ccccceeeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019524. 233 --------------------QWKWGYEPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVN 292 (556) Q Consensus 233 --------------------~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~ 292 (556) ...|.+|| |+|+... ..|+|.|.+++..+..++....-......-. T Consensus 202 ~~~~~~~~~~~~~~~~~~~~~~~~g~vP---------vv~~~nn-----~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~ 267 (470) T protein:vir:10 202 NIITSYDLSAGYETGQSNTLKHNFGRVP---------FIEFSKN-----KYRLPELNKYKGLIDAYDDIYNGFINDLDDV 267 (470) T ss_pred ccccccccccccccccccccccCCCeee---------EEEeecC-----CCCCCchhHHHHHHHHHHHHHHHHHHHHHHh Confidence 01122222 3444443 3699999988877766555444333333322 Q ss_pred cceeeeEeccCcccccccccccccccccccccccccccccccccccceecCCceeeecC-----CCceeeeecCCCCCcc Q lcl|NC_019524. 
293 ATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLY-----PGTKLKMQPAGTPGGV 367 (556) Q Consensus 293 A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~-----pGe~i~~~~~~~p~~~ 367 (556) +--..+++.-..+. .+ +. ...+....+..+. .|-++++++.+.+... T Consensus 268 ~~~~lvl~g~~~~~---------~~-------~~------------~~~~~~~~~i~~~~~~~~~~~~~~~lt~~~~~~~ 319 (470) T protein:vir:10 268 QTVILVLTNYGGAD---------LH-------QF------------MNDLRKYKSIKINNTGNGDNSGVDKLQIDIPVEA 319 (470) T ss_pred cCcceeeecCCccc---------cc-------hh------------hhhhhhcCeEeccCCCCCcCceeEEEeecCChHH Confidence 22222333111000 00 00 0012222222222 2456889999999999 Q ss_pred HHHHHHHHHHHHHHhcCCCHHHhhchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCC Q lcl|NC_019524. 368 GTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPG 447 (556) Q Consensus 368 f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~ 447 (556) +..+++.+.+.|-.-.++|--.. ..+++.|-.+.+.-+..........+..|-.. ++.+++..+. ++.. ... T Consensus 320 ~~~~~~~L~~~I~~~s~~p~~~~-~~~gn~Sg~Alk~~~~~l~~k~~~~~~~~~~~-l~~~~~~i~~--~l~~-~~~--- 391 (470) T protein:vir:10 320 RDDALKITRKNIFLFGQGIDPAN-FESSNASGVAIKMLYSHLELKAAKTQTYFEHA-INELVRAIMR--YLNF-SDA--- 391 (470) T ss_pred HHHHHHHHHHHHHHHhCCCCCCc-cccccchHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHH--Hhcc-cCc--- Confidence 99999999999988877773322 23444555566655555554454444444333 3334443332 2221 111 Q ss_pred cccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCC--CHHHHHHHHHHHHHHHHHcCC Q lcl|NC_019524. 448 KNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGG--DFREVFKQRAREEGLIKSLKL 525 (556) Q Consensus 448 ~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~--D~e~v~~q~a~E~~~~~~~Gl 525 (556) + ..-+.+.|..+-. .|..-.++.... ..|+.|.+..+...+. 
|+++.++++++|++......- T Consensus 392 -d----------~~~i~i~f~~~~p--~d~~e~~~~~~~--~~g~iS~et~l~~~p~v~D~~~E~eri~~E~~e~~~~~~ 456 (470) T protein:vir:10 392 -D----------KRHISQHWTRTKV--EDSLTKAQIVST--VANYSSKEAVAKANPIVDDWQQELKDLAKDKEENDPYSN 456 (470) T ss_pred -c----------cceeeEEeccCCC--CCHHHHHHHHHH--HhccCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhc Confidence 0 0113456644333 466655554444 4799999999999875 899999999999876544321 Q ss_pred CCCccccccCCCCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 526 DFTGKMVEGNSTQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 526 ~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 556 (556) .+ ...++. ..+.+| T Consensus 457 ~~-------------~~~~~~----~~dde~ 470 (470) T protein:vir:10 457 QA-------------DELNGK----GVNDEQ 470 (470) T ss_pred cc-------------cccCCC----CCCCCC Confidence 11 001111 111111 No 179 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=98.75 E-value=6.8e-08 Score=59.89 Aligned_cols=449 Identities=10% Similarity=-0.024 Sum_probs=201.5 Q ss_pred CCcchhhhHHHHHhhHhhcccc-----------hhhhh----hhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAET-----------ATATP----MAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMA 65 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~-----------~~~~~----~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~l 65 (556) |+=...+++..-+......... ....+ .....-|.+ .... ...|. .....+ T Consensus 1 m~~~~~~k~~~~k~~~~~~~~~~~~i~~~~~i~~~~~~~~~i~~~~~~y~g-~~~~--~~~~~-~~~~~~---------- 66 (522) T protein:vir:47 1 MSLFQKVKDFFSRGRYYMQTSNLNSILEHPKIAVTQEEYDRIKRNLVYYQS-KWDD--VQYKN-TDGDIK---------- 66 (522) T ss_pred CchHHHHHHHHHHHHHHhhcccchhccccCCCCCCHHHHHHHHHHHHHhcC-Cccc--ccccc-cCcchh---------- Confidence 4433333333221110000000 00000 000000111 0000 00000 000000 Q ss_pred HHHHHHHHhcChHHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHH Q lcl|NC_019524. 
66 SARAQDMVQNDGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTG 145 (556) Q Consensus 66 r~RaRdl~rNn~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~ 145 (556) .+-...-++++.+++.+++.|.|.-.+... + + +..++ .|+..... ..|+. T Consensus 67 ----~~~~~slnl~~~i~~~~A~lv~~e~~~i~v--~-------d---~~~~~----~l~~~l~~----------n~f~~ 116 (522) T protein:vir:47 67 ----SRPMNHLPIARTASKKIASLVYNEQATITT--K-------N---EILQK----FLDDMLTN----------DRFNK 116 (522) T ss_pred ----cccceecchHHHHHHHHhhhhcCCcceeec--C-------C---hHHHH----HHHHHHhh----------cchHH Confidence 001122289999999999999996443332 1 1 12223 33332221 25667 Q ss_pred HHHHHhhhheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEEC-----CCCCeEEEEE Q lcl|NC_019524. 146 LTRLAVSGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLD-----NNGAALGYWL 220 (556) Q Consensus 146 lq~l~~r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d-----~~Gr~vaY~i 220 (556) +....+...+.-|...+++.|.. + ..++..+.+|.+- |. ..+++.+..++-|- ...+-+-|++ T Consensus 117 ~~~~~~e~a~a~G~~a~k~~~d~--~--------~~~i~~v~ad~~~-P~-~~~~~~~~e~a~~~~~~~~~~~~~~~yt~ 184 (522) T protein:vir:47 117 NFERYLESCLALGGLAMRPYIDG--D--------KVRVAFIQAPVFF-PL-ESNTQDVSSAAILTKTIKSEGRKNVYYTL 184 (522) T ss_pred HHHHHHHHhhccCCEEEEEEEcC--C--------ceEEEEEcCCceE-EE-EEcCCceEEEEEEEEEEeecccceeEEEE Confidence 66666666666676666766532 1 2467778887663 21 12233344444332 1222333444 Q ss_pred eecCC-----------------C----------------ccccCCc-cccceeecc---ccCChhHeEeeecc----cCC Q lcl|NC_019524. 221 RKAFP-----------------G----------------DPTDMEQ-WKWGYEPAR---FDWGRRRVIHIIEA----LLA 259 (556) Q Consensus 221 ~~~hp-----------------g----------------d~~~~~~-~~~~rv~~~---~~v~a~~viH~f~~----~r~ 259 (556) ...|- + ....... .+|.-++.. ..++++-++|+..+ ... 
T Consensus 185 lE~he~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~e~~~l~~~~~~~~~~~Plf~y~~~~~~N~~~~ 264 (522) T protein:vir:47 185 VEFHEWVTADGQETGSTNDKKYYRITNELYRSDVNDVLGQRVNLSELDKYKNLEPVTVFENLSRPLFTYLKTPGMNNKDI 264 (522) T ss_pred EEEeeecccccccccccccCCceEEEEEEeecCCCcccCccccccccccccCCCCceEeCCCCcceEEEecCCccccccc Confidence 44431 0 0000000 012222111 12344445554333 233 Q ss_pred CcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeE-----eccCccccccccccccccccccccccccccccccc Q lcl|NC_019524. 260 GQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASV-----ESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYV 334 (556) Q Consensus 260 gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 334 (556) +-.-|+|.|+-++..|+.|+.--+.-...-+.+ ....|| ++..+... ...... ..+.. T Consensus 265 ~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g-~~~i~v~~~~l~~~~~~~~--------g~~~~~--~~fd~------ 327 (522) T protein:vir:47 265 NSPLGLSIFDNAKTTIDFINRSYDEFMWEVRMG-QRRVIVPEHLTQRQYQRPD--------GTIDFR--PRFDV------ 327 (522) T ss_pred CCCcCCchhhhhHHHHHHHHHHHHHHHHHHHhc-cceeecchHHhccCCCCCC--------cccccc--cccCc------ Confidence 556799999999999999997554433222221 112222 22111100 000000 00000 Q ss_pred ccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccchhHHHHHHHHHHHHH Q lcl|NC_019524. 335 AQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSARASMAETQKYMD 414 (556) Q Consensus 335 ~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R~~~~e~~r~~~ 414 (556) +.. .+.+... ....|..|+.++|.-...+|..=++.+|+.|....|++|..++-|-++ -.++-....+..+.+. 
T Consensus 328 -~~~--~f~~~~~-~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~gls~~tf~~~~~~--~kTAtEi~s~~~~~~~ 401 (522) T protein:vir:47 328 -EQN--VYMQIGG-SSMDAGGITDLTSPIRANDYILAISEGLKLFEMQIGVSSGMFTFDGQG--MKTATEIVSENSDTYQ 401 (522) T ss_pred -ccc--eEeecCC-CCCCCCcceeeccccChHHHHHHHHHHHHHHHHHhCCCccccCccccc--cccHHHHHHHHHHHHH Confidence 000 0111110 123455688899988888999889999999999999999988766433 3444455445555554 Q ss_pred HHHH--HHHHHHHHHHHHHHHHHHH----HcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHH Q lcl|NC_019524. 415 SRKK--LVADRFASAIYTLWLEEEV----NAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRI 488 (556) Q Consensus 415 ~~q~--~lv~~~~~pi~~~~l~~a~----l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i 488 (556) .+.. ..+...++.+...-++.+- .+|..+ .. .-+.+.|- --..+|+.++++.....+ T Consensus 402 t~~~~~~~~~~al~~lv~~i~~l~~~~~~~~~~~~--~~-------------~~i~v~f~--D~i~~D~~~~~~~~~~~v 464 (522) T protein:vir:47 402 MRSSIVALVEQSIKELCVSMCELGKAVGVYSGEIP--EL-------------DDISVNLD--DGVFTDRHAELDYWAKMV 464 (522) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCCC--Cc-------------ceeEEEcC--CCCCCCHHHHHHHHHHHH Confidence 4332 1233333444443333322 222211 11 11334553 333569999999999999 Q ss_pred HcCCCCHHHHHHHh-CCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCcC Q lcl|NC_019524. 489 KNGLSTYEAEISRL-GGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEET 554 (556) Q Consensus 489 ~~G~~s~~~~~ae~-G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e~ 554 (556) .+|+.|.++.+.+. |.+-++..+++++-++.... ..+...+.. +.. ..+...+++|. 
T Consensus 465 ~aG~~s~e~~i~~~~g~~eeea~~el~ri~~E~~~-~~~~~~~~~----~~~----~~~~~~~d~~~ 522 (522) T protein:vir:47 465 AAGFSTKKRAIGKTLNISGVEAEKELNAINSELLP-MNDAELAIY----GMH----DQNEEKADDKG 522 (522) T ss_pred hcCCCCHHHHHHhcCCCChHHHHHHHHHHHHhhcc-CCCCCCCCC----CCC----CcccccCCCCC Confidence 99999999988777 88877654444433221111 111111111 011 11111111111 No 180 >protein:vir:7430 Length: 563 # NCBI annotation: gp7 # Family: family:all:6920 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818545;genbank:gi:29566982;genbank:GeneID:1260216 Probab=98.74 E-value=7.2e-08 Score=59.76 Aligned_cols=460 Identities=11% Similarity=0.051 Sum_probs=210.1 Q ss_pred hhhhcchhccccCC-CcccccccCCCCCHHHHHHHH-HHH---------------HHHHHHHHHhcChHHHHHHHHHHhh Q lcl|NC_019524. 27 PMAVGGGMEGAERT-TREMFQWNPSIISPDQQIAQN-QDM---------------ASARAQDMVQNDGYAAGVVAVHRDS 89 (556) Q Consensus 27 ~~~~~~~y~aa~~~-~r~~~~w~~~~~s~~~~i~~~-~~~---------------lr~RaRdl~rNn~~a~~~v~~~~~n 89 (556) +-.--+.|.-.++. .-..+.|-+ ++..-+-. .+. ++++ -.+--++|+++-+|++ +.+ T Consensus 1 m~~~~~q~~p~~~~fp~~~a~wV~----~~D~~RlaaY~ly~d~y~n~~~el~~il~G~-dr~~~~~ps~r~~V~~-~~~ 74 (563) T protein:vir:74 1 MPYNHKQYDPAKPFLRGGDDNIVD----ENDKNRVRAYDLYENIYLNSAETLKLVLRGD-DSVPILMPSGRKIVEA-VHR 74 (563) T ss_pred CCccccccCCCcccccccccccCC----HHHHHHHHHHHHHHHhhcCchhhhhhhcCCC-ceeeeccchHHHHHHH-HHH Confidence 10111122222210 001233321 11110000 000 1111 1233578899999999 668 Q ss_pred hccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEEEeeccC Q lcl|NC_019524. 90 IVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLATCEWLNP 169 (556) Q Consensus 90 vVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~ 169 (556) +.|.|++...-+.. .++...++++..++.|+++. ++.++-....|-.++-||...+++|-+. 
T Consensus 75 ~Lg~~~~~~Ve~~~--------~de~~~~avq~~Lr~~~~~e----------~l~~~~~~~~r~a~vlGDgvf~l~wDp~ 136 (563) T protein:vir:74 75 FLGVGFDYLVEPDM--------GDEGIRQSLNAYFRTTFKRE----------AIKAKFTSNKRWGLIRGDAHFYIHADPN 136 (563) T ss_pred hcCCCcEEecCccc--------cCcchHHHHHHHHHHHHHHh----------hhHHHHHHHHHhhhhhcceeEEEeeccc Confidence 88999888754432 12333466899999999863 4666666777777888999889998643 Q ss_pred CCCcCCCcccceEEEEEchhhcC-----------------CCCCCCCC--ceEEEE-----EEECCCCCeE-EE--EEee Q lcl|NC_019524. 170 TGTTMQRRPFGTAIQMISPYRMS-----------------NPNNVMDT--PNLRSG-----VQLDNNGAAL-GY--WLRK 222 (556) Q Consensus 170 ~~~~~~~~~~~l~lq~ie~drl~-----------------~~~~~~~g--~~i~~G-----IE~d~~Gr~v-aY--~i~~ 222 (556) .+. + -.+.+..+||..+- ..+..++. ..| -+ .+.|+.|..+ +| -.-. T Consensus 137 K~~---g--~R~rv~~vDP~~~fp~~dpd~v~g~~~v~v~~~~~~pdd~~~~~-~r~~~~~~~lndeg~~~~~~~~dae~ 210 (563) T protein:vir:74 137 KKA---G--ERISVDEVDPRQIFLIEDGSTVVGFHMVDIVQDFRSPDDPSKKL-ARRRTFRRVRNDEGMFTGRISSELTH 210 (563) T ss_pred ccc---C--CCceEeecCCceeeeccCCCCcccceeeecccCCCCCcchhccc-eeeeeeeeeeCCCCCccceeeeccch Confidence 221 1 12355666666441 11111110 011 12 2334444311 12 0000 Q ss_pred cCCC--ccccCCccccceeeccccCChhHe-------------Eeeecc-cCCCcccCCchhhHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 223 AFPG--DPTDMEQWKWGYEPARFDWGRRRV-------------IHIIEA-LLAGQTRGISEMVSALKQMKMTRNFQEITL 286 (556) Q Consensus 223 ~hpg--d~~~~~~~~~~rv~~~~~v~a~~v-------------iH~f~~-~r~gQ~RGvs~la~~l~~l~~l~~~~dael 286 (556) -.+| |........|.+.+.....-..+. ||+|.. 
-.++-+.|.|.|+-+|..+..|..-..=+- T Consensus 211 w~lg~wd~r~~~~~~~~~~~~~~~~~~~d~e~~~LP~pi~~iPiv~~~tip~~~s~WG~S~La~ll~~~~eLn~~~Td~s 290 (563) T protein:vir:74 211 WTLGNWDDRGAISDEQARRKEQVRSAQHDEEEEELPEPISQLPLYRWRNKPPQNSSWGTSQLEGMETLAYALNQSLTDED 290 (563) T ss_pred hccccccccCccchhhhcccchhhhhhhhchhhhccccccCccEEEcCCCCCcccccchhhHHHHHHHHHHHhhhhhHHH Confidence 0111 111111122222222221211221 344654 578899999999999999888855432222 Q ss_pred HHHHHhc-ceeeeEeccCcccccccccccccccccccccccccccccccccccceecCCceeeecCCCce---eeeecCC Q lcl|NC_019524. 287 QNAVVNA-TYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTK---LKMQPAG 362 (556) Q Consensus 287 ~~a~i~A-~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~---i~~~~~~ 362 (556) ....+.. -|.++ ....+.+. ........++.||+|.+|....+ +..++.. T Consensus 291 ~i~~~tG~pi~vl-~~~~p~d~-------------------------~~g~~~~w~vgpG~i~El~~~~~~g~l~~v~g~ 344 (563) T protein:vir:74 291 ATIVFQGLGMYVT-NASAPVDP-------------------------NTGELTDWNIGPMQIVEIAGNRNDNYFERVSGV 344 (563) T ss_pred HHHHhcCCCeEEe-cccccccc-------------------------ccccccccccCCceeEeccCCccccceeeecch Confidence 2222221 22222 11111110 00111224589999999985544 5555543 Q ss_pred CCCccHHHHHHHHH-HHHHHhcCCCHHHhhc--hhhcccchhHHHHH--HHHHHHHHHHH--H-HHHHHHHHHHHHHHHH Q lcl|NC_019524. 363 TPGGVGTDYEQSLL-RNIAASLGMSYEQFSR--DYTKTNYSSARASM--AETQKYMDSRK--K-LVADRFASAIYTLWLE 434 (556) Q Consensus 363 ~p~~~f~~F~~~~l-r~iaaglGi~ye~l~~--D~s~~nYSs~R~~~--~e~~r~~~~~q--~-~lv~~~~~pi~~~~l~ 434 (556) ..-..+-..++-+. |.|+-..++|--. .| |.++ +=|++...+ ...-..+...+ . -...+|..-.+.+||. 
T Consensus 345 ~~l~~~q~Hm~~l~eral~~~s~tPavA-~G~vD~~~-~~SGiALeL~L~PL~a~~~ek~l~l~~~mr~~r~~~~~~lL~ 422 (563) T protein:vir:74 345 QDVSPFQDHMKWIDEKGIAEGSGTPEVA-IGRVDVTS-AESGISLELQLKPLLAANEEKELEMIVVMDQFLHDWMTMWLP 422 (563) T ss_pred hhhHHHHHHHHHHHHHHHHhhccCccee-eccccccc-ccchhhhhhhhhHHHHhhhhhHHHHHHHHHHHHHHHHHHHHH Confidence 22233334444333 4677767777543 35 7665 445544332 22222111111 1 1234444445666664 Q ss_pred H---HHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHh---CCCHHH Q lcl|NC_019524. 435 E---EVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRL---GGDFRE 508 (556) Q Consensus 435 ~---a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~---G~D~e~ 508 (556) . ..+.|.-+-+.+....- . ..-+.|.|-++-. +|-.+=++-..+.+++|+-|++...++. |..+.+ T Consensus 423 ~~erl~~~g~~~~~~g~~~~~----~--~~~v~ivf~p~~P--~d~~~vv~~~~tl~~aGiiSretAv~~L~~~g~~~pd 494 (563) T protein:vir:74 423 AYESDFQEQDGSRPFASADLL----N--ECSVVCIFADPMP--VNKTQVTQDTLLLQQAHLILRKMAVAKLRSIGWEYPE 494 (563) T ss_pred HHHhHhhhhcccccccccccC----C--ceEEEEEeCCCCC--ccHHHHHHHHHHHHHcCchhHHHHHHHHHhCCCCCCc Confidence 3 33456655544433220 0 1112455644433 6888888888999999999999997666 866655 Q ss_pred HHHHHHH-HHHHHHHcCCC--CCccccccCCCCCCCCCCCCCCCCCCc-----------------CCC Q lcl|NC_019524. 509 VFKQRAR-EEGLIKSLKLD--FTGKMVEGNSTQSSNSSESTSDNPNEE-----------------TTQ 556 (556) Q Consensus 509 v~~q~a~-E~~~~~~~Gl~--~~~~~~~~~~~~~~~~~~~~~~~~~~e-----------------~~~ 556 (556) +-.++.+ |.+.+..+=+. ...++. 
...+-.+.+-+|...+| -+| T Consensus 495 ae~e~~~ie~~~i~~~~~a~a~ad~~~----~~~a~~~~g~~~~~~dd~g~p~~~~~~~~~~~~~~~~ 558 (563) T protein:vir:74 495 VDDQGNALTDDDIADMLLAEAEADASL----GLSAMDNGGAGEQQFDDQGNPIDQFGNPVEIPPDVTQ 558 (563) T ss_pred HHHHHhhcCHHHHHHHHHHHhhccCcc----cceecccCCCCcccccccCCchhHcCCcccCCccccc Confidence 4444322 22222111000 000111 11111111111111111 111 No 181 >protein:vir:3743 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043484;genbank:gi:9628619;genbank:GeneID:1261113 Probab=98.73 E-value=1.7e-08 Score=63.15 Aligned_cols=331 Identities=10% Similarity=0.024 Sum_probs=156.4 Q ss_pred chhhhHHHHHhh-Hhhcccc--hh--hhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChH Q lcl|NC_019524. 4 VKKTTRTRAKKA-VDVVAET--AT--ATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGY 78 (556) Q Consensus 4 ~~~~~r~~a~~a-~~~~~~~--~~--~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~ 78 (556) |+.-+...++.. ....+.. .. ...-.....|.+..-. .+..|..++-+... .| .|.+-|++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~--~~~~~~epp~~~~~-----------la-~~~~~~~~ 66 (345) T protein:vir:37 1 MKTNVKTDNKKGIVIAPINDRTFSLSEITASPALDYVGIGFD--ENYNCYLPPVNRHA-----------LA-KLPHQNAQ 66 (345) T ss_pred CCccccccchhhhcCCCceEEEeecCCcccchhhcccceeee--cCCccccCCCCHHH-----------HH-HHhhcchh Confidence 222211111000 0000000 00 0000011123322111 13446666655332 12 23466777 Q ss_pred HHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecC Q lcl|NC_019524. 79 AAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTG 158 (556) Q Consensus 79 a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dG 158 (556) -.++|..-++.+.+ .++|. ..+|..++.+++. 
..+..| T Consensus 67 h~~~i~~k~n~l~~-~~~Pn----------------------------------------~~~t~~~f~~~v~-d~ll~G 104 (345) T protein:vir:37 67 HGGILHSRANMVSA-TYEGG----------------------------------------KALSKMEMRALCL-NLIQFG 104 (345) T ss_pred hcchhhhhhhHHhh-ccCCC----------------------------------------CCCCHHHHHHHHH-HHHhcC Confidence 77777666655554 22221 1234455544543 567789 Q ss_pred ceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccce Q lcl|NC_019524. 159 EVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGY 238 (556) Q Consensus 159 E~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~r 238 (556) ++++.++.... .-++.|..|.|..+... .+|.... ++...+.. ..+ T Consensus 105 nay~~i~rn~~--------G~~~~L~pl~~~~vr~~----------------~d~~~~~-~~~~~~~~----~~g----- 150 (345) T protein:vir:37 105 DVGLLKVRNGF--------GQVVRLVPLSSLYLRVH----------------KDGGYSY-LMKKSLYD----TAQ----- 150 (345) T ss_pred CeEEEEEECCC--------CCEEEEEEecCceeEEe----------------ecCCeeE-EEeeeeec----cCc----- Confidence 99998875321 13567888887766421 1111111 11100000 000 Q ss_pred eeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHH---hcceeeeEeccCcccccccccccc Q lcl|NC_019524. 239 EPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVV---NATYAASVESELPSDVVFGQLGMG 315 (556) Q Consensus 239 v~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i---~A~~~~fi~~~~~~~~~~~~~~~~ 315 (556) .....++.+|||+..+.--++..|+|++..++..+- -=+.+++-+.+. .+...++++...+... T Consensus 151 --~~~~~~~~eViHir~~~~~~~~~Gl~~~~~a~~si~---l~~~a~~~~~~~f~NGa~~~~Il~~t~~~l~-------- 217 (345) T protein:vir:37 151 --EIYRYDAKDIIFIKLYDPMQQVYGSPDYVGGIQSAL---LNSDATVFRRRYFSNGAHMGFILYSTDPDLT-------- 217 (345) T ss_pred --eEEEEccccEEEEcCCCCCCCcccchHHHHHHHHHH---HHHHHHHHHHHHHhccCCcceEEEeCCCCCC-------- Confidence 011356779999998876688899998887766553 223455555554 3444455553322111 Q ss_pred cccccccccccccccccccccccceecCCceeeec----CCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhh Q lcl|NC_019524. 
316 QGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHL----YPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFS 391 (556) Q Consensus 316 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L----~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~ 391 (556) +++.+........ . .+. .. ....++.. +.|-+++.++......+|.+..+.....||+.+|||...+ T Consensus 218 -~e~~~~lk~~~~~---~-~g~--~n-~~~~~i~~~~g~~~G~~~~pl~~~~~d~qf~e~k~~~~~dI~~a~~VPp~li- 288 (345) T protein:vir:37 218 -EEMEEEIARKISE---S-KGV--GN-FRSMFVNIAGGHPDGLKVIPIGDTGTKDEFANIKNISAQDVLTAHRFPAGLS- 288 (345) T ss_pred -HHHHHHHHHHHHH---h-cCc--cc-cCceeEecCCCCccceeEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHh- Confidence 1111111111100 0 000 00 01122222 3456788888877788899999999999999999998754 Q ss_pred chhhc--ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhC Q lcl|NC_019524. 392 RDYTK--TNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCN 464 (556) Q Consensus 392 ~D~s~--~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~ 464 (556) |+..+ .+||++.+....|.+ .-+.|+.++|.++ +- -.+.+++.. ...|+.+ ..++ T Consensus 289 Gi~~~~t~~~s~~e~~~~~f~~-----------~~l~P~~~~ie~~-ln-~~~e~~~~~-~i~F~~~----~l~k 345 (345) T protein:vir:37 289 GIIPTNTGGLGDPLKYREVYHY-----------DEVMPLQEIIAET-IN-QDPEIKNLL-KIKFREQ----NFAK 345 (345) T ss_pred ccccCCCCCcccHHHHHHHHHH-----------HHHHHHHHHHHHH-hh-hhhccCCcc-eEEECch----hhcC Confidence 76643 468888766665544 3345555544332 21 122333322 2223222 2223 No 182 >protein:vir:3780 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536820;genbank:gi:17981829;genbank:GeneID:929208 Probab=98.65 E-value=1.5e-08 Score=63.55 Aligned_cols=332 Identities=10% Similarity=0.032 Sum_probs=151.2 Q ss_pred CCcchhhhHHHHHhhHhhcccchhh--hhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChH Q lcl|NC_019524. 
1 MKDVKKTTRTRAKKAVDVVAETATA--TPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGY 78 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~--~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~ 78 (556) |..-+..+......+.......... ........|.+-... .+..|..++-+... -| .|.+-|++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~y~~~~~~--~~~~~~epp~~~~~-----------la-~l~~~~~~ 66 (345) T protein:vir:37 1 MKTNVKTDNKKGIVIAPINDRTFSLNEISASPALDYVGIGFD--ENYNCYLPPVNRHA-----------LA-KLPHQNAQ 66 (345) T ss_pred CCCCccccchhhcccCcceeEEeecCCcccccchhhhhhhhc--CCccccCCCCCHHH-----------HH-HHhhcccc Confidence 3222221111000000000000000 000011122221110 12335444444221 11 23344555 Q ss_pred HHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecC Q lcl|NC_019524. 79 AAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTG 158 (556) Q Consensus 79 a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dG 158 (556) ..++|..-++.+.+. +.....+|.+++.+++. ..+..| T Consensus 67 h~~~i~~k~n~l~~~-----------------------------------------~~Pn~~lt~~~f~~~~~-d~ll~G 104 (345) T protein:vir:37 67 HGGILHSRANMVSSL-----------------------------------------YEGGKALSRMDMRALCL-NLIQFG 104 (345) T ss_pred cccceeeechHHHhh-----------------------------------------ccCCCCCCHHHHHHHHH-HHHhcC Confidence 555543322222211 12233456666766654 677889 Q ss_pred ceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEee-cCCCccccCCccccc Q lcl|NC_019524. 159 EVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRK-AFPGDPTDMEQWKWG 237 (556) Q Consensus 159 E~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~-~hpgd~~~~~~~~~~ 237 (556) .+++.++.... ..++.|..|.+..+.. + . ++... |++.. .+.+. 
+ T Consensus 105 nay~~~~rn~~--------G~~~~L~pl~~~~vr~------------~--~--d~~~~-~~~~~~~~~~~-----g---- 150 (345) T protein:vir:37 105 DVGLLKVRNGF--------GQVVRLVPLSSLYLRV------------R--K--DGGYS-YLMKKSLYDTA-----Q---- 150 (345) T ss_pred CeEEEEEEcCC--------CcEEEEEEEcCceeEE------------E--E--eCCee-EEEEEeEecCC-----c---- Confidence 99988765321 2356788888776631 1 1 11111 11111 11111 0 Q ss_pred eeeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHH---hcceeeeEeccCccccccccccc Q lcl|NC_019524. 238 YEPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVV---NATYAASVESELPSDVVFGQLGM 314 (556) Q Consensus 238 rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i---~A~~~~fi~~~~~~~~~~~~~~~ 314 (556) ....+++.+|||+..+....+..|+|++..++..+.- -+.+++-+.+. .+.-.++|+.+++... T Consensus 151 ---~~~~~~~~dVihir~~~~~~~~~Gls~~~~a~~si~l---~~~a~~~~~~~f~NG~~p~~Il~~~d~~l~------- 217 (345) T protein:vir:37 151 ---EIYRYDAKDIIFIKLYDPMQQVYGSPDYVGGIQSALL---NSDATVFRRRYFSNGAHMGFILYSTDPDLT------- 217 (345) T ss_pred ---eEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHH---HHHHHHHHHHHHhccCCcceEEEecCCCCC------- Confidence 1124677899999988877889999998887665432 33455544443 3344455554322110 Q ss_pred ccccccccccccccccccccccccceecCCceeeec-----CCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHH Q lcl|NC_019524. 315 GQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHL-----YPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQ 389 (556) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L-----~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~ 389 (556) +++.+........ ..+. -..+.+..+ +.|-+++.++......+|.+..+.....||+.+|||... 
T Consensus 218 --~e~~~~lk~~~~~----~~g~----~n~~~~~i~~p~g~~~G~~~~pls~~~~d~qf~e~k~~~~~dIa~a~~VPp~l 287 (345) T protein:vir:37 218 --EEMEEEIARKISE----SKGV----GNFRSMFVNIANGHPDGLKVIPIGDTGTKDEFANIKNISAQDVLTAHRFPAGL 287 (345) T ss_pred --HHHHHHHHHHHHH----hcCc----ccccceEEEcCCCcccceEEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHH Confidence 0111111000000 0000 012233333 356778888777677888888899999999999999885 Q ss_pred hhchhh--cccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhC Q lcl|NC_019524. 390 FSRDYT--KTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCN 464 (556) Q Consensus 390 l~~D~s--~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~ 464 (556) + |... ..+||++.+....|.+. -+.|+.+++.+ ++.+ .+.+++... ..|+.+. ..+ T Consensus 288 l-Gi~~~~~~~~~~~e~~~~~f~~~-----------~l~P~~~~ie~-~ln~-~~~~~~~~~-i~F~~~~----L~~ 345 (345) T protein:vir:37 288 S-GIIPTNTGGLGDPLKYREVYHYD-----------EVMPLQEIIAE-TINQ-DPEIKNLLK-IKFREQN----FAK 345 (345) T ss_pred h-CccCCCCCCcccHHHHHHHHHHH-----------HHHHHHHHHHH-Hhhh-hccCCCcce-EEecchh----hcC Confidence 4 5543 35788887766665443 23454443322 2221 233443322 2222211 111 No 183 >protein:vir:5691 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839850;genbank:gi:30065705;genbank:GeneID:1260599 Probab=98.65 E-value=1e-08 Score=64.36 Aligned_cols=317 Identities=12% Similarity=0.099 Sum_probs=150.3 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcch---------------hccccCCCcccccccCCCCCHHHHHHHHHHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGG---------------MEGAERTTREMFQWNPSIISPDQQIAQNQDMA 65 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~---------------y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~l 65 (556) ||=-+..+.. ..... .... ......+. |-..-. +..|..++-+... 
T Consensus 1 ~~~~~~~~~~---~~~~~--~~~~-~~~~~~~~~~~p~~v~~~~~~~~~~~~~~----~~~~~~pp~~~~~--------- 61 (344) T protein:vir:56 1 MSKKKGKTPQ---PAAKT--MTAS-APKMEAFTFGEPVPVLDRRDILDYVECIS----NGRWYEPPVSFTG--------- 61 (344) T ss_pred CCCCCCCCCc---hhhHH--hhcC-CCceEEEEcCCceeecCcchhhhHHHhhh----cCccccCCCCHHH--------- Confidence 4433322111 10000 0000 00000111 111100 1223322222221 Q ss_pred HHHHHHHHhcChHHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHH Q lcl|NC_019524. 66 SARAQDMVQNDGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTG 145 (556) Q Consensus 66 r~RaRdl~rNn~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~ 145 (556) - ..|.+-|++-.++|...++.+.++ ++|. ..+|..+ T Consensus 62 --l-a~~~~a~~~h~s~i~~k~n~l~~~-~~Pn----------------------------------------p~~t~~~ 97 (344) T protein:vir:56 62 --L-AKSLRAAVHHSSPIYVKRNILAST-FIPH----------------------------------------PWLSQQD 97 (344) T ss_pred --H-HHHHhhhhhhCccceehhhhHHhh-cCCC----------------------------------------CCCCHHH Confidence 1 123445555555555544444331 2222 2344555 Q ss_pred HHHHHhhhheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCC Q lcl|NC_019524. 146 LTRLAVSGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFP 225 (556) Q Consensus 146 lq~l~~r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hp 225 (556) +.+++. .++..|.+++.+++.. ...++.|..|.+.++... ..+. .||+.... T Consensus 98 f~~~~~-d~ll~Gnay~~~~rn~--------~G~~~~L~pl~~~~v~~~----------------~~~~--~~~~~~~~- 149 (344) T protein:vir:56 98 FSRFVL-DFLVFGNAFLEKRYST--------TGKVIRLETSPAKYTRRG----------------VEED--VYWWVPSF- 149 (344) T ss_pred HHHHHH-HHHhcCCeEEEEEECC--------CCcEEEEEEeCCceeEEe----------------ecCC--EEEEEecC- Confidence 555543 6677899999876432 123567888887776421 1111 13332210 Q ss_pred CccccCCccccceeeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHH---hcceeeeEecc Q lcl|NC_019524. 
226 GDPTDMEQWKWGYEPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVV---NATYAASVESE 302 (556) Q Consensus 226 gd~~~~~~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i---~A~~~~fi~~~ 302 (556) | ....+++.+|||+..+...++..|+|++..++..+. -=..+++-+.+. .|.-.++|+.+ T Consensus 150 g--------------~~~~~~~~dIiHir~~~~~~~~~Gls~~~~a~~si~---l~~~a~~~~~~~f~NGa~pg~Il~~~ 212 (344) T protein:vir:56 150 N--------------EPTAFAPGSVFHLLEPDINQELYGLPEYLSALNSAW---LNESATLFRRKYYENGAHAGYIMYVT 212 (344) T ss_pred C--------------eEEEEcCccEEEECCCCCCCCcccccHHHHHHHHHH---HHHHHHHHHHHHHhccCCCceEEEec Confidence 1 112456789999998887788999999887776544 233445544444 34455666543 Q ss_pred CcccccccccccccccccccccccccccccccccccceecCCceeeec------CCCceeeeecCCCCCccHHHHHHHHH Q lcl|NC_019524. 303 LPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHL------YPGTKLKMQPAGTPGGVGTDYEQSLL 376 (556) Q Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L------~pGe~i~~~~~~~p~~~f~~F~~~~l 376 (556) .+... +++.+....... ... .-..|+...| +.|-+++.++......+|.+..+... T Consensus 213 d~~ls---------~e~~~~lk~~~~-------~~~--g~~~~r~l~l~~p~g~~~G~~~~pis~~~~d~qf~e~k~~s~ 274 (344) T protein:vir:56 213 DAVQD---------RNDIEMLRENMV-------KSK--GRNNFKNLFLYAPQGKADGIKIIPLSEVATKDDFFNIKKASA 274 (344) T ss_pred CCCCC---------HHHHHHHHHHHH-------Hhc--CCCCccceEEecCCCCccceeEEEcCCChHHHHHHHHHHhhH Confidence 32111 111111111110 000 0123333333 35778888888778888999999999 Q ss_pred HHHHHhcCCCHHHhhchhhc--ccchhHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccc Q lcl|NC_019524. 377 RNIAASLGMSYEQFSRDYTK--TNYSSARASMAETQKYM-DSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMF 453 (556) Q Consensus 377 r~iaaglGi~ye~l~~D~s~--~nYSs~R~~~~e~~r~~-~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~ 453 (556) .+||+.+|||-..+ |...+ .+|+++.+....|.+.. ..+|..+ +++ ..||. ...+..-.+ ... 
T Consensus 275 ~eIa~afrVPp~ll-Gi~~~~t~~~~n~eq~~~~f~~~tL~Pl~~~i-e~~-----n~~l~----~~~~~F~~y---~l~ 340 (344) T protein:vir:56 275 ADLLDAHRIPFQLM-GGKPENVGSLGDIEKVAKVFVRNELIPLQDRI-REI-----NGWIG----QEVIRFKNY---SLD 340 (344) T ss_pred HHHHHHhCCCHHHh-ccCCCCCCccccHHHHHHHHHHHHHHHHHHHH-HHH-----Hhhhc----cccccCCCc---ccc Confidence 99999999999844 65544 46898887777765543 2333332 111 11221 111221111 000 Q ss_pred cchhhHHH Q lcl|NC_019524. 454 YDPMMRDA 461 (556) Q Consensus 454 ~~~~~~~a 461 (556) .+ ++ T Consensus 341 ~~----~~ 344 (344) T protein:vir:56 341 TD----NG 344 (344) T ss_pred cc----CC Confidence 00 11 No 184 >protein:vir:99853 Length: 488 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164068;genbank:gi:56692600;genbank:GeneID:3192581 Probab=98.62 E-value=1.8e-07 Score=57.58 Aligned_cols=404 Identities=13% Similarity=0.070 Sum_probs=190.9 Q ss_pred chhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHH-HHHHhcChHHHHH Q lcl|NC_019524. 4 VKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARA-QDMVQNDGYAAGV 82 (556) Q Consensus 4 ~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~Ra-Rdl~rNn~~a~~~ 82 (556) |+ +.+. ....++ .+..+ .....|......++..+...+..-.-+. ++|.+ ++++.++ T Consensus 1 v~------------~~~l---~~e~at----~~~~~--d~~~~~~~~l~~~~~~il~~a~~g~~~~y~~l~~-D~~i~s~ 58 (488) T protein:vir:99 1 ME------------KPAL---GREIAT----SGDGR--DITRPFISGLQVPNDSILQRRGGNDLRVYEEILS-DAQVKTV 58 (488) T ss_pred CC------------ccch---hHHHHH----HHhhh--hhhccccCCCCCCChHHHHhhccCCHHHHHHHhh-ChHHHHH Confidence 11 1100 000000 00000 0112222222223333332222110112 44544 8999999 Q ss_pred HHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEE Q lcl|NC_019524. 
83 VAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLA 162 (556) Q Consensus 83 v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~ 162 (556) +++...-|.|.-+.+++.- +...+++..+.|++.++ +.+|..+...++-+ +.-|=+++ T Consensus 59 l~~rk~av~~~~w~i~p~~-------~~~~~~~~ae~v~~~l~--------------~~~~~~~l~~~lda-~~~G~s~~ 116 (488) T protein:vir:99 59 WGQRQLAVVSREWKVEAGG-------DRPIDQAAAEHLEQQLQ--------------RVGWDRVTSKMLFG-VFYGYAVS 116 (488) T ss_pred HHHHHHHHhcCCceEEcCC-------CChHHHHHHHHHHHHHh--------------CCCHHHHHHHHHhh-hhhcceeE Confidence 9999999999888887642 22345555555655443 13677777777754 55788888 Q ss_pred EEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceeecc Q lcl|NC_019524. 163 TCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPAR 242 (556) Q Consensus 163 ~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~~ 242 (556) .++|... ++...+-+|..+.+.++ .||.+|+.+ |. ...++.++ T Consensus 117 Ei~w~~~-----~g~~~~~~l~~r~~~~f----------------~~d~~~~l~-~~-~~~~~~~g-------------- 159 (488) T protein:vir:99 117 ELIYGRD-----DRYITLEAIKVRNRRRF----------------RYDQDGGLR-LL-TPNNMFEG-------------- 159 (488) T ss_pred EEEEeec-----CCeeeEeeeeeecccce----------------eecCCCceE-Ee-ccCCCCCc-------------- Confidence 8888653 33344445666665544 244444322 11 11222111 Q ss_pred ccCCh--hHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCccccccccccccccccc Q lcl|NC_019524. 243 FDWGR--RRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFK 320 (556) Q Consensus 243 ~~v~a--~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~ 320 (556) ..+|. .-|+|.+.. +.|..-|.+.|.++.-...-..........-...-++=..+.|.+..+. ..+.. 
T Consensus 160 ~~lp~~~~~i~~~~~~-~~g~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~~a---------~~~ek 229 (488) T protein:vir:99 160 EPCPAPYFWHFSTGAD-NDDEPYGLGLAHWLYWPVFFKRNGIKFWLIFLDKFGMPTAVGRYDDKTA---------TPEDK 229 (488) T ss_pred cccccCceEEEEeecC-CCCCcccchHHHHHHHHHHHHHhhHHHHHHHHHHcCCceeeeecCCCCC---------CHHHH Confidence 01222 235555544 4788899999988766533333333333333333333334444321100 00000 Q ss_pred ccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCcc-HHHHHHHHHHHHHHhc-CCCHHHhhchhhccc Q lcl|NC_019524. 321 EIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGV-GTDYEQSLLRNIAASL-GMSYEQFSRDYTKTN 398 (556) Q Consensus 321 ~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~-f~~F~~~~lr~iaagl-Gi~ye~l~~D~s~~n 398 (556) . ..-.....+.......++.|.+|++++....++. |..|.+.+-++|+..+ |- .||.+-.+.| T Consensus 230 ~------------~l~~av~~~~~~~~~viP~~~~ie~~ea~~~~~~~~~~li~~~d~~Isk~iLGq---tlts~~~~Gs 294 (488) T protein:vir:99 230 A------------KLLAALHAIQTDSAIIMPAGMQAELLEAGRSGTADYKTLHDTMDATIAKVGLGQ---VASTQGTPGR 294 (488) T ss_pred H------------HHHHHHHHHhcCcEEEecCCceeEEeecCCCChHHHHHHHHHHHHHHHHHHhhh---hhcccccccc Confidence 0 0011123355566777899999999997655554 8999999999998775 53 4666654447 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchh Q lcl|NC_019524. 399 YSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEK 478 (556) Q Consensus 399 YSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~ 478 (556) ||.+..-........+.....+...+-+-+...++.. + .|.... ..+.|- --.-.|.. 
T Consensus 295 ~a~~~vh~~v~~d~~~aDa~~i~~tln~~li~~l~~~---N----~~~~~~-------------p~~~~~--~~e~edl~ 352 (488) T protein:vir:99 295 LGNDDLQADVRLDLVKADADLICESFNLGPARWLTEW---N----FPGAQP-------------PRVYRV--IEEPEDIT 352 (488) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---C----cCCcCC-------------ceeEec--CCCcccHH Confidence 7766555544444555555555555544444444432 1 121110 011111 11223555 Q ss_pred hhhHHHHHHHHc-CCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 479 KETEAAILRIKN-GLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 479 Ke~~A~~~~i~~-G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 556 (556) .-+++..+.+++ |+.=..+ ...+++||+.+........+ ..+....+...+.+.... T Consensus 353 ~~a~~~~~l~~~~G~~i~~~--------------------~i~e~~Gip~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~ 410 (488) T protein:vir:99 353 AKAERDEKVFRMSGFRPTRG--------------------YVQETYGVEVESTQAEATAP-TPSTEFAEGDQPSDPAAA 410 (488) T ss_pred HHHHHHHHHHhhcCCCCCHH--------------------HHHHHcCCCCcccccccccC-CCcccCCCCCCCCCchHH Confidence 556666666665 5532222 23355666643322111111 111111111111111111 No 185 >protein:vir:6058 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878199;genbank:gi:33438898;genbank:GeneID:1457733 Probab=98.56 E-value=5.9e-08 Score=60.21 Aligned_cols=317 Identities=13% Similarity=0.102 Sum_probs=147.9 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcch---------------hccccCCCcccccccCCCCCHHHHHHHHHHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGG---------------MEGAERTTREMFQWNPSIISPDQQIAQNQDMA 65 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~---------------y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~l 65 (556) ||.-+..++..+.. ... ........+. |...-. ...|..++-+.. 
T Consensus 1 m~~~~~~~~~~~~~-----~~~-~~~~~~~~~~f~~p~~v~~~~~~~~~~~~~~----~~~~~~pp~~~~---------- 60 (344) T protein:vir:60 1 MSKKKGKTLQPAAK-----KMT-ASAPKMEAFTFGEPVPVLDRRDILDYVECIS----NGRWYEPPISFT---------- 60 (344) T ss_pred CCcccCCCCCchHH-----hhc-CCcCcEEEEEcCCceeecCCcchhHHHHhhh----cCccccCCCCHH---------- Confidence 44433322111100 000 0000000011 111100 122322222222 Q ss_pred HHHHHHHHhcChHHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHH Q lcl|NC_019524. 66 SARAQDMVQNDGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTG 145 (556) Q Consensus 66 r~RaRdl~rNn~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~ 145 (556) +-| .|.+-|++-.++|...++.+.++ ++| ...+|..+ T Consensus 61 -~la-~~~~a~~~h~~~i~~k~n~l~~~-~~P----------------------------------------n~~~t~~~ 97 (344) T protein:vir:60 61 -GLA-KSLRAAVHHSSPIYVKRNILAST-FIP----------------------------------------HPWLSQQD 97 (344) T ss_pred -HHH-HHHHhhhhhccchhhhhhHHHhh-ccC----------------------------------------CCCCCHHH Confidence 112 33445555556655544443331 222 22345555 Q ss_pred HHHHHhhhheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCC Q lcl|NC_019524. 146 LTRLAVSGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFP 225 (556) Q Consensus 146 lq~l~~r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hp 225 (556) +++++ ...+..|.+++.+++... .-++.|..|.+..+.. + ..+. .||.... T Consensus 98 f~~~~-~d~ll~Gnay~~i~rn~~--------G~~~~L~~l~~~~vr~------------~----~~~~--~~~~v~~-- 148 (344) T protein:vir:60 98 FSRFV-LDFLVFGNAFLEKRYSTT--------GKVIRLETSPAKYTRR------------G----VEED--VYWWVPS-- 148 (344) T ss_pred HHHHH-HHHHhcCCeEEEEEECCC--------CcEEEEEEcCcceEEE------------e----ecCC--eEEEEcc-- Confidence 65554 366788999988764321 2356778887766632 1 1111 1332211 Q ss_pred CccccCCccccceeeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHH---hcceeeeEecc Q lcl|NC_019524. 
226 GDPTDMEQWKWGYEPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVV---NATYAASVESE 302 (556) Q Consensus 226 gd~~~~~~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i---~A~~~~fi~~~ 302 (556) + + ....+++.+|||+..+...++..|+|++..++..+. --..+++-+.+. .+.-.++|+.+ T Consensus 149 ~------~-------~~~~~~~~eIiHir~~~~~~~~yGlsp~~~a~~si~---l~~~a~~~~~~~f~NG~~pg~il~~~ 212 (344) T protein:vir:60 149 F------N-------EPTAFAPGSVFHLLEPDINQELYGLPEYLSALNSAW---LNESATLFRRKYYENGAHAGYIMYVT 212 (344) T ss_pred C------C-------eEEEEcCccEEEEcCCCCCCCcccccHHHHHHHHHH---HHHHHHHHHHHHHhccCCCceEEEec Confidence 0 0 012456789999998887789999999887776544 333445444443 34455566543 Q ss_pred CcccccccccccccccccccccccccccccccccccceecCCce--eeec----CCCceeeeecCCCCCccHHHHHHHHH Q lcl|NC_019524. 303 LPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAK--IPHL----YPGTKLKMQPAGTPGGVGTDYEQSLL 376 (556) Q Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~--i~~L----~pGe~i~~~~~~~p~~~f~~F~~~~l 376 (556) .+... +++.+........ .. ....|. |+.. ..|-+++.++......+|.+..+... T Consensus 213 ~~~ls---------~e~~~~ik~~~~~----~~-----g~~~~r~~~l~~p~g~~~g~~~~pis~~~~d~qf~e~k~~~~ 274 (344) T protein:vir:60 213 DAVQD---------RNDIEMLRENMVK----SK-----GRNNFKNLFLYAPQGKADGIKIIPLSEVATKDDFFNIKKASA 274 (344) T ss_pred CcCCC---------HHHHHHHHHHHHH----hc-----CCCCCcceEEecCCCCccceeEEEcCCChhHHHHHHHHHhhH Confidence 22111 1111111111100 00 011222 2222 34667888887777788999999999 Q ss_pred HHHHHhcCCCHHHhhchhhc--ccchhHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccc Q lcl|NC_019524. 377 RNIAASLGMSYEQFSRDYTK--TNYSSARASMAETQKYM-DSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMF 453 (556) Q Consensus 377 r~iaaglGi~ye~l~~D~s~--~nYSs~R~~~~e~~r~~-~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~ 453 (556) ..||+.+|||-. +.|...+ .+|+++.+....|.+.. ..++..|. + +..||.+.+ +.... 
T Consensus 275 ~eIa~af~VPp~-llGi~~~~t~~~~n~e~~~~~f~~~~L~Pl~~~~e-~-----ln~~lg~~~----i~F~~------- 336 (344) T protein:vir:60 275 ADLLDAHRIPFQ-LMGGKPENVGSLGDIEKVAKVFVRNELIPLQDRIR-E-----INGWLGQEV----IRFKN------- 336 (344) T ss_pred HHHHHHhCCCHH-HhcccCCCCCccccHHHHHHHHHHHHHHHHHHHHH-H-----HHHhcCCcc----cccCc------- Confidence 999999999997 4466543 36999888777775543 23333221 1 112221110 11100 Q ss_pred cchhhHHHhhCeeeecCcccccchhhhh Q lcl|NC_019524. 454 YDPMMRDALCNAEWIGASRGQIDEKKET 481 (556) Q Consensus 454 ~~~~~~~a~~~~~w~~p~~~~iDP~Ke~ 481 (556) +. -..+|+ T Consensus 337 ~~--------------------l~~~d~ 344 (344) T protein:vir:60 337 YS--------------------LDTDNG 344 (344) T ss_pred cc--------------------cCCCCC Confidence 00 011111 No 186 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=98.56 E-value=2.9e-07 Score=56.46 Aligned_cols=439 Identities=11% Similarity=0.036 Sum_probs=200.5 Q ss_pred CCcchhhhHHHHHhhHhhcc----------cc-hh----hhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVA----------ET-AT----ATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMA 65 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~----------~~-~~----~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~l 65 (556) |+=...++...-+....+.. +- .. .+......-|.+ ..... .-|...+.. . T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g-~~~~~--~~~~~~~~~-~---------- 66 (500) T protein:vir:30 1 MGVIQKIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKS-DWDSV--LYLNTDGET-K---------- 66 (500) T ss_pred CchHHHHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcC-CCCCc--ccccCCCCc-c---------- Confidence 33333333222111000000 00 00 111111122332 11111 111111000 0 Q ss_pred HHHHHHHHhcChHHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHH Q lcl|NC_019524. 
66 SARAQDMVQNDGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTG 145 (556) Q Consensus 66 r~RaRdl~rNn~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~ 145 (556) .| -...-++++.+++.+++.|.|.-.+... + + +...+.|+++.+. .+|.. T Consensus 67 ---~~-~~~slnl~~~i~~~~A~lv~~e~~~i~~--~-------d-------~~~~~~l~~il~~----------n~f~~ 116 (500) T protein:vir:30 67 ---KR-DLNHLPIARTAAKKIASLVFNEQAEIKV--D-------D-------DAANEFISETLKN----------DRFNK 116 (500) T ss_pred ---cC-ceeecchHHHHHHHHhhhhcCCcceEec--C-------C-------hHHHHHHHHHHhh----------ccHHH Confidence 00 0112279999999999999996433322 1 1 2233344544432 25777 Q ss_pred HHHHHhhhheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEE-----CCCCCeEEEEE Q lcl|NC_019524. 146 LTRLAVSGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQL-----DNNGAALGYWL 220 (556) Q Consensus 146 lq~l~~r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~-----d~~Gr~vaY~i 220 (556) ...-++...+.-|.+++++.+-.. ..+|..+.++.+- |. ..+++.+..++-+ ...|+.+-|.. T Consensus 117 ~~~~~~e~a~a~G~~~~k~~~d~~----------~~~I~~v~ad~~~-P~-~~d~~~~~~~a~~~~~~~~~~~~~~~yt~ 184 (500) T protein:vir:30 117 NFERYLESCLALGGLAMRPYVDGD----------KVRVAFVQAPVFL-PL-QSNTQDVSSAAVVIKSVKTINGKEVYYTL 184 (500) T ss_pred HHHHHHHHHhhcCCEEEEEEEeCC----------ceEEEEEcCCeeE-EE-EEcCCCeEEEEEEEEEeeeecCCceEEEE Confidence 666666666777777777655321 1467888888652 21 1122222222221 12233333323 Q ss_pred eecCC---CccccCC------------c------cccceeec---cccCChhHeEeeecc----cCCCcccCCchhhHHH Q lcl|NC_019524. 221 RKAFP---GDPTDME------------Q------WKWGYEPA---RFDWGRRRVIHIIEA----LLAGQTRGISEMVSAL 272 (556) Q Consensus 221 ~~~hp---gd~~~~~------------~------~~~~rv~~---~~~v~a~~viH~f~~----~r~gQ~RGvs~la~~l 272 (556) ...|- ++.+... + .-|.-.+. 
...++.+-+.|+.-+ ...+..-|+|.|+.++ T Consensus 185 lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~ 264 (500) T protein:vir:30 185 IEFHEWQSSDDYVISNELYRSDDKAKVGSRVPLSEVYKDLKDEAKVTDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAK 264 (500) T ss_pred EEEEEEeCCceeEEEEEEEecccccccCcccccccccCCcCcceEeccCCCccEEEecCCccccccCCCccCCchhhhhH Confidence 32221 1100000 0 00111111 123455555555433 2345567999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhcceeeeE-----eccCcccccccccccccccccccccccccccccccccccceecCCcee Q lcl|NC_019524. 273 KQMKMTRNFQEITLQNAVVNATYAASV-----ESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKI 347 (556) Q Consensus 273 ~~l~~l~~~~dael~~a~i~A~~~~fi-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i 347 (556) ..+..|+.-.+.-...-+. +.-..++ +...... .+ ....... -+.....-..+ ++ T Consensus 265 ~lid~lD~~~s~~~~e~~~-g~~~i~v~~~~l~~~~~~~----------~g--~~~~~~~----~d~~~~~~~~~-~~-- 324 (500) T protein:vir:30 265 TTIDFINTTYDEFMWEVKM-GQRRVAVPESLTALTVRTT----------DG--DVVPRPR----FESDQNVYIRM-GG-- 324 (500) T ss_pred HHHHHHHHHHHHHHHHHHh-CcceeeechHHhcccCCCC----------Cc--cccCCcc----cCCCcceEEEc-CC-- Confidence 9999998766554443333 2223333 2111100 00 0000000 00000000001 00 Q ss_pred eecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccchhHHHHHHHHHHHHHHH---HHHHHHHH Q lcl|NC_019524. 348 PHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSARASMAETQKYMDSR---KKLVADRF 424 (556) Q Consensus 348 ~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~---q~~lv~~~ 424 (556) ....+..|+.++|.-+..+|..=...+|+.|....|+++..++-|-+++ .++-+......+.+..+ |.. +... 
T Consensus 325 -~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~--~TAtei~s~~~~~~~t~~~~~~~-~~~a 400 (500) T protein:vir:30 325 -RDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSM--KTATEIVSENSDTYQMRNSIVAL-VEQS 400 (500) T ss_pred -CCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCcc--ccHHHHHHHHHHHHHHHHHHHHH-HHHH Confidence 1233556888888888888888889999999999999999998776543 23444434444444333 332 2333 Q ss_pred HHHHHHHHHHHH----HHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHH Q lcl|NC_019524. 425 ASAIYTLWLEEE----VNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEIS 500 (556) Q Consensus 425 ~~pi~~~~l~~a----~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~a 500 (556) ++.+....++.+ +..|.++ ... -+.+.|- --..+|+.++++.....+.+|+.|.+..+. T Consensus 401 l~~lv~~il~~~~~~~~~~~~~~--~~~-------------~v~v~f~--d~i~~d~~~~~~~~~~~v~aGi~s~~~~i~ 463 (500) T protein:vir:30 401 LKELVISIFEIAKAYDLYQSEVP--SMD-------------NISISLD--DGVFTDRDAELDYWIKVVNAGFGTREMAIQ 463 (500) T ss_pred HHHHHHHHHHHHHHHhhcCCCCC--CCc-------------ceEEEeC--CCCCCCHHHHHHHHHHHHHcCCCCHHHHHH Confidence 333444333322 2223222 111 1346663 224579999999999999999999999887 Q ss_pred Hh-CCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCc Q lcl|NC_019524. 501 RL-GGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEE 553 (556) Q Consensus 501 e~-G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e 553 (556) +. |.+-+|+.+++++-++. . .+... . 
.++..+.=.| T Consensus 464 ~~~g~~eeea~~~l~~i~~E---~-~~~~~------------~-~~~~~~~~g~ 500 (500) T protein:vir:30 464 KVLNVTEEKAQEIAAEINTG---I-VDEIN------------Q-QRTDTHLYGE 500 (500) T ss_pred hcCCCCHHHHHHHHHHHHHh---c-cccCC------------C-CCccccccCC Confidence 76 88766654443332221 1 11000 0 0000011111 No 187 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=98.56 E-value=2.9e-07 Score=56.46 Aligned_cols=439 Identities=11% Similarity=0.036 Sum_probs=200.5 Q ss_pred CCcchhhhHHHHHhhHhhcc----------cc-hh----hhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVA----------ET-AT----ATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMA 65 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~----------~~-~~----~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~l 65 (556) |+=...++...-+....+.. +- .. .+......-|.+ ..... .-|...+.. . T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g-~~~~~--~~~~~~~~~-~---------- 66 (500) T protein:vir:98 1 MGVIQKIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKS-DWDSV--LYLNTDGET-K---------- 66 (500) T ss_pred CchHHHHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcC-CCCCc--ccccCCCCc-c---------- Confidence 33333333222111000000 00 00 111111122332 11111 111111000 0 Q ss_pred HHHHHHHHhcChHHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHH Q lcl|NC_019524. 66 SARAQDMVQNDGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTG 145 (556) Q Consensus 66 r~RaRdl~rNn~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~ 145 (556) .| -...-++++.+++.+++.|.|.-.+... + + +...+.|+++.+. .+|.. 
T Consensus 67 ---~~-~~~slnl~~~i~~~~A~lv~~e~~~i~~--~-------d-------~~~~~~l~~il~~----------n~f~~ 116 (500) T protein:vir:98 67 ---KR-DLNHLPIARTAAKKIASLVFNEQAEIKV--D-------D-------DAANEFISETLKN----------DRFNK 116 (500) T ss_pred ---cC-ceeecchHHHHHHHHhhhhcCCcceEec--C-------C-------hHHHHHHHHHHhh----------ccHHH Confidence 00 0112279999999999999996433322 1 1 2233344544432 25777 Q ss_pred HHHHHhhhheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEE-----CCCCCeEEEEE Q lcl|NC_019524. 146 LTRLAVSGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQL-----DNNGAALGYWL 220 (556) Q Consensus 146 lq~l~~r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~-----d~~Gr~vaY~i 220 (556) ...-++...+.-|.+++++.+-.. ..+|..+.++.+- |. ..+++.+..++-+ ...|+.+-|.. T Consensus 117 ~~~~~~e~a~a~G~~~~k~~~d~~----------~~~I~~v~ad~~~-P~-~~d~~~~~~~a~~~~~~~~~~~~~~~yt~ 184 (500) T protein:vir:98 117 NFERYLESCLALGGLAMRPYVDGD----------KVRVAFVQAPVFL-PL-QSNTQDVSSAAVVIKSVKTINGKEVYYTL 184 (500) T ss_pred HHHHHHHHHhhcCCEEEEEEEeCC----------ceEEEEEcCCeeE-EE-EEcCCCeEEEEEEEEEeeeecCCceEEEE Confidence 666666666777777777655321 1467888888652 21 1122222222221 12233333323 Q ss_pred eecCC---CccccCC------------c------cccceeec---cccCChhHeEeeecc----cCCCcccCCchhhHHH Q lcl|NC_019524. 221 RKAFP---GDPTDME------------Q------WKWGYEPA---RFDWGRRRVIHIIEA----LLAGQTRGISEMVSAL 272 (556) Q Consensus 221 ~~~hp---gd~~~~~------------~------~~~~rv~~---~~~v~a~~viH~f~~----~r~gQ~RGvs~la~~l 272 (556) ...|- ++.+... + .-|.-.+. ...++.+-+.|+.-+ ...+..-|+|.|+.++ T Consensus 185 lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~ 264 (500) T protein:vir:98 185 IEFHEWQSSDDYVISNELYRSDDKAKVGSRVPLSEVYKDLKDEAKVTDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAK 264 (500) T ss_pred EEEEEEeCCceeEEEEEEEecccccccCcccccccccCCcCcceEeccCCCccEEEecCCccccccCCCccCCchhhhhH Confidence 32221 1100000 0 00111111 123455555555433 2345567999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhcceeeeE-----eccCcccccccccccccccccccccccccccccccccccceecCCcee Q lcl|NC_019524. 
273 KQMKMTRNFQEITLQNAVVNATYAASV-----ESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKI 347 (556) Q Consensus 273 ~~l~~l~~~~dael~~a~i~A~~~~fi-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i 347 (556) ..+..|+.-.+.-...-+. +.-..++ +...... .+ ....... -+.....-..+ ++ T Consensus 265 ~lid~lD~~~s~~~~e~~~-g~~~i~v~~~~l~~~~~~~----------~g--~~~~~~~----~d~~~~~~~~~-~~-- 324 (500) T protein:vir:98 265 TTIDFINTTYDEFMWEVKM-GQRRVAVPESLTALTVRTT----------DG--DVVPRPR----FESDQNVYIRM-GG-- 324 (500) T ss_pred HHHHHHHHHHHHHHHHHHh-CcceeeechHHhcccCCCC----------Cc--cccCCcc----cCCCcceEEEc-CC-- Confidence 9999998766554443333 2223333 2111100 00 0000000 00000000001 00 Q ss_pred eecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccchhHHHHHHHHHHHHHHH---HHHHHHHH Q lcl|NC_019524. 348 PHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSARASMAETQKYMDSR---KKLVADRF 424 (556) Q Consensus 348 ~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~---q~~lv~~~ 424 (556) ....+..|+.++|.-+..+|..=...+|+.|....|+++..++-|-+++ .++-+......+.+..+ |.. +... T Consensus 325 -~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~--~TAtei~s~~~~~~~t~~~~~~~-~~~a 400 (500) T protein:vir:98 325 -RDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSM--KTATEIVSENSDTYQMRNSIVAL-VEQS 400 (500) T ss_pred -CCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCcc--ccHHHHHHHHHHHHHHHHHHHHH-HHHH Confidence 1233556888888888888888889999999999999999998776543 23444434444444333 332 2333 Q ss_pred HHHHHHHHHHHH----HHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHH Q lcl|NC_019524. 425 ASAIYTLWLEEE----VNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEIS 500 (556) Q Consensus 425 ~~pi~~~~l~~a----~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~a 500 (556) ++.+....++.+ +..|.++ ... -+.+.|- --..+|+.++++.....+.+|+.|.+..+. 
T Consensus 401 l~~lv~~il~~~~~~~~~~~~~~--~~~-------------~v~v~f~--d~i~~d~~~~~~~~~~~v~aGi~s~~~~i~ 463 (500) T protein:vir:98 401 LKELVISIFEIAKAYDLYQSEVP--SMD-------------NISISLD--DGVFTDRDAELDYWIKVVNAGFGTREMAIQ 463 (500) T ss_pred HHHHHHHHHHHHHHHhhcCCCCC--CCc-------------ceEEEeC--CCCCCCHHHHHHHHHHHHHcCCCCHHHHHH Confidence 333444333322 2223222 111 1346663 224579999999999999999999999887 Q ss_pred Hh-CCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCc Q lcl|NC_019524. 501 RL-GGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEE 553 (556) Q Consensus 501 e~-G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e 553 (556) +. |.+-+|+.+++++-++. . .+... . .++..+.=.| T Consensus 464 ~~~g~~eeea~~~l~~i~~E---~-~~~~~------------~-~~~~~~~~g~ 500 (500) T protein:vir:98 464 KVLNVTEEKAQEIAAEINTG---I-VDEIN------------Q-QRTDTHLYGE 500 (500) T ss_pred hcCCCCHHHHHHHHHHHHHh---c-cccCC------------C-CCccccccCC Confidence 76 88766654443332221 1 11000 0 0000011111 No 188 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=98.49 E-value=4.6e-07 Score=55.31 Aligned_cols=434 Identities=12% Similarity=0.030 Sum_probs=195.9 Q ss_pred hcccchh-hhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhc--------------------- Q lcl|NC_019524. 18 VVAETAT-ATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQN--------------------- 75 (556) Q Consensus 18 ~~~~~~~-~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rN--------------------- 75 (556) +...--. .+.......|.. + .. 
+....++..+......-..+.+.+++| T Consensus 1 m~~~~~~~~~~~~~~~~~~~---~---~~---~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~ 71 (499) T protein:vir:80 1 MINQIIAGVKGVMRRMGLLK---S---LK---DVTDHKKVNANDEDYKYIDMWKRLYQGNYAEWHNLNYEHNGNPVNRRQ 71 (499) T ss_pred ChhHHHHHHHHHHHHhcccc---c---hh---hhhcCCCCcCCHHHHHHHHHHHHHhcCCcchhhccccccCCCccccce Confidence 2211100 011110011100 0 00 000000000011111112223333332 Q ss_pred --ChHHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhh Q lcl|NC_019524. 76 --DGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSG 153 (556) Q Consensus 76 --n~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~ 153 (556) .++++.+++.+++.+.|.-.+++. + + +...+.|++|... .+|.....-++.. T Consensus 72 ~s~n~~~~iv~~~a~~l~~ep~~i~~--~-------d-------~~~~e~l~~~~~~----------n~f~~~~~~~~~~ 125 (499) T protein:vir:80 72 LSMNLPKVTAKYMSKLLFNEKVKINI--D-------D-------ETAEEFVLNVLKT----------NGFTKNMERYIEY 125 (499) T ss_pred eecchHHHHHHHHHHhhhCCcceEee--C-------C-------HHHHHHHHHHHhh----------ccHHHHHHHHHHH Confidence 478899999999999997554433 1 1 2223334444332 2466666666666 Q ss_pred heecCceEEEEeeccCCCCcCCCcccceEEEEEchhhc-CCCCCCCCCceEEEE---------------EEECCC--CCe Q lcl|NC_019524. 154 FLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRM-SNPNNVMDTPNLRSG---------------VQLDNN--GAA 215 (556) Q Consensus 154 ~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl-~~~~~~~~g~~i~~G---------------IE~d~~--Gr~ 215 (556) .+.-|.+++++.+... + .+++..++|+.+ +...+ .+.+..- +|+-.+ +.- T Consensus 126 a~~~G~~~~~~~~D~~-~--------~~~i~~v~a~~~~Pi~~d---~~~~~~~~f~~~~~~~~~~y~~lE~h~~~~~~~ 193 (499) T protein:vir:80 126 GEAMGGFVIKVYHDGN-K--------NVKVSFATADCMYPLSND---SENVDECLIANSFHKNNKYYKLLEWNEWKGEKE 193 (499) T ss_pred HhhcCcEEEEEEECCC-C--------cEEEEEEcCCceEEEEec---CCCeEEEEEEEEEeecCeEEEEEEEEEecccce Confidence 7778888887665321 1 257899999875 32222 1112111 222111 111 Q ss_pred EEEEEee----cCCCccccCCc---cccceeecc---ccCChhHeEeeecc----cCCCcccCCchhhHHHHHHHHHHHH Q lcl|NC_019524. 
216 LGYWLRK----AFPGDPTDMEQ---WKWGYEPAR---FDWGRRRVIHIIEA----LLAGQTRGISEMVSALKQMKMTRNF 281 (556) Q Consensus 216 vaY~i~~----~hpgd~~~~~~---~~~~rv~~~---~~v~a~~viH~f~~----~r~gQ~RGvs~la~~l~~l~~l~~~ 281 (556) ..|+|.. ...++..+..- .-|.-++.. ..++++-++|+.-+ ...+..-|+|.|+.++..+..|+.- T Consensus 194 ~~y~I~n~~~~~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~ 273 (499) T protein:vir:80 194 EVYTVTTELYQSDDPNELGGKVSLKLLFNDIEPVVPLPSLTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLM 273 (499) T ss_pred eeEEEEEEEEeccCccccCcccchhhhccCcCCceeecCCCccceEeecCCccccccCCCccCCchHhhHHHHHHHHHHH Confidence 2233321 11111000000 001111111 12344444444333 2456667999999999999999876 Q ss_pred HHHHHHHHHHhcceeeeEeccCcccccccccccccccccccccccccccccccccccceecCCceeeecCCCceeeeecC Q lcl|NC_019524. 282 QEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPA 361 (556) Q Consensus 282 ~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~ 361 (556) ...-...-. .+.-..|+-... ........++ ....+... ......-++ ..-..|..|+.++| T Consensus 274 ~s~~~~e~~-~~~~~i~v~~~~-----l~~~~~~~g~---~~~~~~~~-------~~~~~~~~~--~~~~~~~~i~~~~~ 335 (499) T protein:vir:80 274 FDSYYQEFK-LGKKKVLVPSSF-----VKTAVNLDGS---TTQYFDST-------DEAFFLYQG--EQDDNGKAIKDISV 335 (499) T ss_pred HHHHHHHHH-hcccceecchhh-----hhccCCCCCC---cccCCCcc-------cceeeEeec--cCCCCcCceeEecC Confidence 655322221 222222321000 0000000000 00000000 000000011 11122345888999 Q ss_pred CCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhccc-chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-- Q lcl|NC_019524. 362 GTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTN-YSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVN-- 438 (556) Q Consensus 362 ~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~n-YSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l-- 438 (556) .-+..+|..-++.+++.|..+.|+++..++.|-+++. 
=..++.............+..+ ...++.+....+..+-+ T Consensus 336 ~ir~e~~~~~l~~~l~~i~~~~g~s~~~fg~~~~g~~TAtei~s~~~~l~~~~~~~~~~~-~~~l~~l~~~il~~~~~~~ 414 (499) T protein:vir:80 336 EIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENGLKTATEVVSEKSETYQTKNSHSQLI-EQGIKEMIVSILEVGKLIK 414 (499) T ss_pred cCChHHHHHHHHHHHHHHHHhcCCChhhcCCCcccchhHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhc Confidence 9888899899999999999999999999987765431 1122223333333333333333 33334444444433222 Q ss_pred --cCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHh-CCCHHHH---HHH Q lcl|NC_019524. 439 --AGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRL-GGDFREV---FKQ 512 (556) Q Consensus 439 --~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~-G~D~e~v---~~q 512 (556) .|....+ .-+.+.| .--..+|+..+++.....+.+|+.|.+..+... |.+=++. +++ T Consensus 415 ~~~~~~~~~---------------~~v~v~f--~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~d~ea~~el~~ 477 (499) T protein:vir:80 415 AYDGDTVEL---------------DTITVDF--DDSIAQDEDTTINRYTTAKNQGMIPLKIALQRAWNITEAEADEWAEM 477 (499) T ss_pred cccCCCCCc---------------cceEEEe--CCCCCCCHHHHHHHHHHHHHcCCCCHHHHHhhcCCCChHHHHHHHHH Confidence 2222111 1134555 333446999999999999999999999998877 8764443 223 Q ss_pred HHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCc Q lcl|NC_019524. 513 RAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEE 553 (556) Q Consensus 513 ~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e 553 (556) +++|+. 
..++ .++..+..+++| T Consensus 478 i~~E~~----~~~~---------------~~d~~g~~ge~e 499 (499) T protein:vir:80 478 LAKEKQ----AEIP---------------NNDMTGIFGEEE 499 (499) T ss_pred HHHHhh----cCCC---------------CCCccccCCCCC Confidence 332321 1111 011112222233 No 189 >protein:vir:98853 Length: 219 # NCBI annotation: hypothetical protein # Family: family:all:196 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654729;genbank:gi:109302914;genbank:GeneID:4156058 Probab=98.49 E-value=1.4e-08 Score=63.69 Aligned_cols=209 Identities=11% Similarity=0.089 Sum_probs=111.9 Q ss_pred EEECCCCCeEEEEEeecCCCccccCCccccceeeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 207 VQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITL 286 (556) Q Consensus 207 IE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael 286 (556) |...++|. +-|...... .+ ..+ ....+++++|||+..+...+...|+|++..++..+.--. .++. T Consensus 1 ~r~~~dg~-~~y~~~~~~-~~---~~g-------~~~~~~~~eilH~r~~~~~~~~~Glspi~~a~~~i~~~~---aa~~ 65 (219) T protein:vir:98 1 MRVCKDGN-YKYLMKKSL-YD---TKS-------EIYEYNKNDVIFIKLYDPMQQVYGSPDYVGGITSALLNS---DATI 65 (219) T ss_pred CceeecCe-EEEEEecce-ec---CCc-------eeEEeccccEEEecCCCCCCCcceecHHHHHHHHHHHHH---HHHH Confidence 56666775 344442111 00 001 123578899999998887899999999888765554221 2222 Q ss_pred HHHH---HhcceeeeEeccCcccccccccccccccccccccccccccccccccccceecCCceeeec-----CCCceeee Q lcl|NC_019524. 287 QNAV---VNATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHL-----YPGTKLKM 358 (556) Q Consensus 287 ~~a~---i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L-----~pGe~i~~ 358 (556) -+++ =.+.-.++|+.+.+.. ..+..+........ ..+. -..+++..+ ..|-+++. 
T Consensus 66 ~~~~~f~Ng~~p~gil~~~~~~l---------~~e~~~~~~~~~~~----~~g~----~n~~~~~l~~~gg~~~G~~~~~ 128 (219) T protein:vir:98 66 FRRRYYSNGAHMGFILYSTDPDM---------TEEMEDEIAERIRD----SKGV----GNFRSMFVNIAGGHPDGLKVIP 128 (219) T ss_pred HHHHHHhcCCCCceEEEeCCCCC---------CHHHHHHHHHHHHH----hcCc----ccccceeEecCCCCccceeEEE Confidence 2222 1345556665443211 11111111111100 0000 011333333 34778888 Q ss_pred ecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhch--hhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 359 QPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRD--YTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEE 436 (556) Q Consensus 359 ~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D--~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a 436 (556) ++.+....+|.+..+....+||+.+|||.+.| |. -+.++||++.+..+.| +..-++|+..+ ++.+ T Consensus 129 ~~~~~~d~qfle~rk~~~~eIa~~fgVPp~~l-G~~~~~~~~~sn~eq~~~~f-----------~~~tL~P~~~~-ie~~ 195 (219) T protein:vir:98 129 IGDTGQKDEFANIKNISAQDVLTSHRFPPGLS-GIIPVNTAGLGDPLKIREAY-----------QADEVLPLQEI-IAES 195 (219) T ss_pred ccCCHHHHHHHHHHHhhHHHHHHHhCCCHHHc-ccccCCCCCccCHHHHHHHH-----------HHHHHHHHHHH-HHHH Confidence 88887888999999999999999999999865 43 3567899876555544 44455675554 4444 Q ss_pred HHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhH Q lcl|NC_019524. 437 VNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETE 482 (556) Q Consensus 437 ~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~ 482 (556) +-+. +.+|..... 
.|.++ -.+|.+ T Consensus 196 ln~~-~~~~~~~~~-~F~~~--------------------~~~d~~ 219 (219) T protein:vir:98 196 INSD-YEIKSALKV-NFKQP--------------------EKRDKN 219 (219) T ss_pred hhhh-hcCCCccEE-eecCc--------------------ccccCC Confidence 5332 334443221 12221 111222 No 190 >protein:vir:2013 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046757;genbank:gi:9630328;genbank:GeneID:1261529 Probab=98.48 E-value=6.1e-08 Score=60.15 Aligned_cols=317 Identities=12% Similarity=0.097 Sum_probs=146.9 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcch---------------hccccCCCcccccccCCCCCHHHHHHHHHHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGG---------------MEGAERTTREMFQWNPSIISPDQQIAQNQDMA 65 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~---------------y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~l 65 (556) ||=-+..+...+..+ .. ....+...+. |-..-. ...|..++-+... T Consensus 1 ~~~~~~~~~~~~~~~-----~~-~~~~~~~~~~f~~p~~v~~~~~~~~~~~~~~----~~~~~~pp~~~~~--------- 61 (344) T protein:vir:20 1 MSKKKGKTPQPAAKT-----MT-ASGPKMEAFTFGEPVPVLDRRDILDYVECIS----NGRWYEPPVSFTG--------- 61 (344) T ss_pred CCcccCCCCcchhhh-----hh-ccCCceEEEEcCCceEecCcchhhhhhhhhh----cCceecCCCCHHH--------- Confidence 443333211111000 00 0000000011 111100 1223322322221 Q ss_pred HHHHHHHHhcChHHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHH Q lcl|NC_019524. 
66 SARAQDMVQNDGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTG 145 (556) Q Consensus 66 r~RaRdl~rNn~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~ 145 (556) - ..|.+-|++-.++|...++.+.+ -++| ...+|..+ T Consensus 62 --l-a~~~~a~~~h~~~i~~k~n~l~~-~~~P----------------------------------------n~~lt~~~ 97 (344) T protein:vir:20 62 --L-AKSLRAAVHHSSPIYVKRNILAS-TFIP----------------------------------------HPWLSQQD 97 (344) T ss_pred --H-HHHHhhhhhhCccceehhhhHHH-hccC----------------------------------------CCCCCHHH Confidence 1 12344455555555544443333 1222 12344455 Q ss_pred HHHHHhhhheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCC Q lcl|NC_019524. 146 LTRLAVSGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFP 225 (556) Q Consensus 146 lq~l~~r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hp 225 (556) +++++ ...+..|.+++.++.... ..++.|..+.+..+.. + .++.. ||.... T Consensus 98 f~~~~-~d~ll~Gnay~~i~rn~~--------G~~~~L~pl~~~~vr~------------~----~~~~~--~~~~~~-- 148 (344) T protein:vir:20 98 FSRFV-LDFLVFGNAFLEKRYSTT--------GKVIRLETSPAKYTRR------------G----VEEDV--YWWVPS-- 148 (344) T ss_pred HHHHH-HHHHhcCCeEEEEEECCC--------CcEEEEEEcCCceeEe------------e----ecCCE--EEEEcc-- Confidence 55544 366778999998764321 2356777777766632 1 11111 333211 Q ss_pred CccccCCccccceeeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHH---hcceeeeEecc Q lcl|NC_019524. 226 GDPTDMEQWKWGYEPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVV---NATYAASVESE 302 (556) Q Consensus 226 gd~~~~~~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i---~A~~~~fi~~~ 302 (556) +. ....+++.+|||+..+...++..|+|++..++..+. -=..+++-+.+. 
.+.-.++|+.+ T Consensus 149 ~~-------------~~~~~~~~eIiHir~~~~~~~~yGls~~~~a~~si~---l~~~a~~~~~~~f~NGa~p~~Il~~~ 212 (344) T protein:vir:20 149 FN-------------EPTAFAPGSVFHLLEPDINQELYGLPEYLSALNSAW---LNESATLFRRKYYENGAHAGYIMYVT 212 (344) T ss_pred CC-------------eEEEEcCccEEEeCCCCCCCCcccccHHHHHHHHHH---HHHHHHHHHHHHHhccCCCceEEEec Confidence 10 012456789999998877788999999887665543 334455555544 34445555543 Q ss_pred CcccccccccccccccccccccccccccccccccccceecCCce--eeec----CCCceeeeecCCCCCccHHHHHHHHH Q lcl|NC_019524. 303 LPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAK--IPHL----YPGTKLKMQPAGTPGGVGTDYEQSLL 376 (556) Q Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~--i~~L----~pGe~i~~~~~~~p~~~f~~F~~~~l 376 (556) .+... +++.+........ .. .-..|. |+.. +.|-+++.++......+|.+..+... T Consensus 213 d~~l~---------~e~~~~ik~~~~~----~~-----g~~n~r~l~l~~p~g~~~gi~~~pis~~~~d~qf~e~k~~s~ 274 (344) T protein:vir:20 213 DAVQD---------RNDIEMLRENMVK----SK-----GRNNFKNLFLYAPQGKADGIKIIPLSEVATKDDFFNIKKASA 274 (344) T ss_pred CcCCC---------HHHHHHHHHHHHH----hc-----CCCCccceEEecCCCCccceeEEEcCCChhHHHHHHHHHhhH Confidence 22111 1111111111100 00 011222 2222 34677888887777888999999999 Q ss_pred HHHHHhcCCCHHHhhchhhc--ccchhHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccc Q lcl|NC_019524. 377 RNIAASLGMSYEQFSRDYTK--TNYSSARASMAETQKYM-DSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMF 453 (556) Q Consensus 377 r~iaaglGi~ye~l~~D~s~--~nYSs~R~~~~e~~r~~-~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~ 453 (556) ..||+.+|||.+.+ |+..+ .+|+++.+....|.+.. ..+|..|. + ...||.. ..+ .| T Consensus 275 ~eIa~af~VPp~ll-Gi~~~~t~~~~n~e~~~~~f~~~~l~P~~~~~e-~-----in~~lg~----~~i---------~F 334 (344) T protein:vir:20 275 ADLLDAHRIPFQLM-GGKPENVGSLGDIEKVAKVFVRNELIPLQDRIR-E-----INGWLGQ----EVI---------RF 334 (344) T ss_pred HHHHHHhCCCHHHh-ccCCCCCCccccHHHHHHHHHHHHHHHHHHHHH-H-----HHHhcCC----ccc---------cc Confidence 99999999999854 76543 45888877776665433 23333321 1 1122211 101 11 Q ss_pred cchhhHHHhhCeeeecCcccccchhhh Q lcl|NC_019524. 
454 YDPMMRDALCNAEWIGASRGQIDEKKE 480 (556) Q Consensus 454 ~~~~~~~a~~~~~w~~p~~~~iDP~Ke 480 (556) ..+ -+|.-.| T Consensus 335 ~~~-----------------~l~~~d~ 344 (344) T protein:vir:20 335 KNY-----------------SLDTDND 344 (344) T ss_pred Ccc-----------------ccccCCC Confidence 110 0121112 No 191 >protein:vir:79063 Length: 491 # NCBI annotation: gp3 # Family: family:all:313 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111203;genbank:gi:134288841;genbank:GeneID:4960737 Probab=98.45 E-value=6.2e-07 Score=54.62 Aligned_cols=406 Identities=9% Similarity=0.063 Sum_probs=189.9 Q ss_pred CCcchhhhHHHHHhhHh-hcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVD-VVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYA 79 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~-~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a 79 (556) +.|-++.++.+.+++.. ..+. ....++- . ...+..|. .+. +.........--+++. .++++ T Consensus 7 ~~~g~~~~~~~~~~~~~~~ia~--------~~~~~~~----~-~~~~~~p~---~~~-il~~~~~~~~~y~~m~-~D~~i 68 (491) T protein:vir:79 7 VSPTEFVKFGEPDKSLSSQIAT--------RARSIDF----F-ALGMYLPN---PDP-VLKALGKDIRVYRELR-ADAHV 68 (491) T ss_pred CCCCCcccccccchhHHHHHhh--------hcccccc----c-cccccCcc---hhH-HHhhccCCHHHHHHHh-hChHH Confidence 33444443332222211 1100 0011110 0 00111111 122 2222222234556776 59999 Q ss_pred HHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCc Q lcl|NC_019524. 80 AGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGE 159 (556) Q Consensus 80 ~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE 159 (556) .+++++...-|.|.-+.+++.-+ + ++..+.+++.++. ++|..+...++. 
.+.-|= T Consensus 69 ~s~l~~Rk~av~~~~w~i~~~~~------~----~~~a~~i~e~l~~--------------~~~~~~i~~~ld-a~~~G~ 123 (491) T protein:vir:79 69 GGCVRRRKAAVKALEWGLDRGKA------K----SRVAKSIADVFAD--------------LDLSRIATEMLD-AVLYGY 123 (491) T ss_pred HHHHHHHHHHHhCCCcEEecCCC------C----HHHHHHHHHHHhc--------------CCHHHHHHHHHH-hhhhcc Confidence 99999999999998777765321 1 2234444444332 357777776664 455788 Q ss_pred eEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCcccccee Q lcl|NC_019524. 160 VLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYE 239 (556) Q Consensus 160 ~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv 239 (556) +++-+.|... ++.-.+-+|..+.++++. ||..|+.+ |. ...+..++. -+ T Consensus 124 s~~Ei~w~~~-----~g~~~~~~l~~r~~~~f~----------------~d~~~~l~-l~-~~~~~~~g~--------~l 172 (491) T protein:vir:79 124 QPMEITWGKV-----GNYIVPIDVVGKPADWFV----------------YDPENQLR-FR-SKEHWVQGE--------EL 172 (491) T ss_pred eeEEEEEeec-----CCeeeEEeeeeeccccee----------------eccCCceE-Ee-ecCCCCCce--------ee Confidence 8888888653 344455567777776553 23333322 11 111111110 01 Q ss_pred eccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccc Q lcl|NC_019524. 240 PARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGF 319 (556) Q Consensus 240 ~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~ 319 (556) |. -.-|+|. ...+.|..-|.+.|.++.-...-........+.-...-++=..+.|.+.+... +. 
T Consensus 173 p~-----~k~i~~~-~~~~~g~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~~G~P~~igky~~~a~~----------~e 236 (491) T protein:vir:79 173 PA-----RKFLVPR-QEATYLNPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVGKHPRSASD----------AE 236 (491) T ss_pred cC-----CCeEEEE-ecCCCCCcccchhHHHHHHHHHHHHhhHHHHHHHHHHcCCCeEEEecCCCCCH----------HH Confidence 11 1235555 44457888999999887665544444444444444444443344444322110 00 Q ss_pred cccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCc---cHHHHHHHHHHHHHHhc-CCCHHHhhchhh Q lcl|NC_019524. 320 KEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGG---VGTDYEQSLLRNIAASL-GMSYEQFSRDYT 395 (556) Q Consensus 320 ~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~---~f~~F~~~~lr~iaagl-Gi~ye~l~~D~s 395 (556) ... .-.....+.......++.|.+|+++++...++ -|..|.+.+=+.|+..+ |-+ ||.|- T Consensus 237 k~~------------l~~al~~~~~~a~~viP~~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~iLGqt---lTt~~- 300 (491) T protein:vir:79 237 TNL------------LLDRLEDMVQDAVAVIPDDSSIEIKEAAGKSGSADVYERLLHFCRGEVSIALLGQN---QTTEA- 300 (491) T ss_pred HHH------------HHHHHHHHhcCeEEEecCCceeEEEeccCCCCChhHHHHHHHHHHHHHHHHHhhhh---hccCc- Confidence 000 01112235556677799999999998764332 38888888888888764 333 66663 Q ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCccccc Q lcl|NC_019524. 396 KTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQI 475 (556) Q Consensus 396 ~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~i 475 (556) +.||+.+-.-........+.....+...+-+ +...++. +++- +.|. 
..|.....+-+ T Consensus 301 ~gs~a~~~vh~~v~~~i~~~D~~~i~~tln~-li~~l~~---~N~~-~~~~------------------p~f~~~e~ee~ 357 (491) T protein:vir:79 301 TSTRASAQAGLEVTDDIRDGDKAIVVEAMNM-LIRWICD---LNFD-GAAR------------------PVFDMWEQEQV 357 (491) T ss_pred ccchhhHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHH---hcCC-CCCc------------------ceEeecCcCch Confidence 5678777655444444555555565555544 5554443 2321 1111 01222222222 Q ss_pred chhhhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCC----CCCCCCCC Q lcl|NC_019524. 476 DEKKETEAAILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSS----ESTSDNPN 551 (556) Q Consensus 476 DP~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~----~~~~~~~~ 551 (556) + .+-+++..+.+..|+.=..+.+ .+++||+.+.......+....++. ..+...++ T Consensus 358 ~-~~~a~~~~~L~~~G~~i~~~~~--------------------~e~~Gip~~~~~e~~~~~~~~~~~~~~~~~~~~~~~ 416 (491) T protein:vir:79 358 D-EIQAGRDEKLTRAGARFTPAYF--------------------KRAYNLQDGDLDERPLPVSAVDAVGAASFAEFEAPD 416 (491) T ss_pred h-HHHHHHHHHHHhCCCccCHHHH--------------------HHHhCCCCCCCCccccCcCcccccccccccccCCCC Confidence 2 1235556666677764322222 245566543221111111111110 01111111 Q ss_pred CcCC-C Q lcl|NC_019524. 552 EETT-Q 556 (556) Q Consensus 552 ~e~~-~ 556 (556) .+.. . T Consensus 417 ~~~~d~ 422 (491) T protein:vir:79 417 QDALDA 422 (491) T ss_pred CcchHH Confidence 1110 0 No 192 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=98.38 E-value=9.3e-07 Score=53.65 Aligned_cols=444 Identities=12% Similarity=0.011 Sum_probs=197.5 Q ss_pred hcccchh-hhhhhhcchhccccCCCccc-ccccCCCCCHHHHHHHHHHHHHHHHHHHH---------------hcChHHH Q lcl|NC_019524. 
18 VVAETAT-ATPMAVGGGMEGAERTTREM-FQWNPSIISPDQQIAQNQDMASARAQDMV---------------QNDGYAA 80 (556) Q Consensus 18 ~~~~~~~-~~~~~~~~~y~aa~~~~r~~-~~w~~~~~s~~~~i~~~~~~lr~RaRdl~---------------rNn~~a~ 80 (556) +.-.--. .+.......+...- .... ..+.+.+.+--.-+...+.-....-..+. ...++++ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~i~~~~~yy~g~~~~~~~~~~~~~~~~~~~~~~~~n~~k 78 (496) T protein:vir:38 1 MINQIIAGVKGVMRRMGLLKAL--KDVKDHKKVNANDEDYKYIDMWKRLYQGHYAEWHNLNYEHNGNPVNRRQLSMNLPK 78 (496) T ss_pred ChhHHHHHHHHHHHHhccchhh--HHHHhcCCCcCCHHHHHHHHHHHHHhcCCCchhhcchhccCCCccccceeecchHH Confidence 1111100 00000000111100 0000 11111111100111111111111000000 1147899 Q ss_pred HHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCce Q lcl|NC_019524. 81 GVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEV 160 (556) Q Consensus 81 ~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~ 160 (556) -+++.+++.+.|..+++... + +...+.|++|... .+|.....-++...+.-|.+ T Consensus 79 ~i~~~~a~~l~~~p~~i~~~---------d-------~~~~e~l~~~~~~----------n~f~~~~~~~~~~a~~~G~~ 132 (496) T protein:vir:38 79 VTAKYMSKLLFNEKVKINID---------D-------KAAEEFVLNVLKT----------NGFTKNMERYIEYGEAMGGF 132 (496) T ss_pred HHHHHHhhhhhCCcceEeeC---------C-------hHHHHHHHHHHhc----------cCHHHHHHHHHHHHhhhCcE Confidence 99999999999976554431 1 2233344555432 25888788888888888998 Q ss_pred EEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEE---EECCCCCe---------------EEEEEee Q lcl|NC_019524. 161 LATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGV---QLDNNGAA---------------LGYWLRK 222 (556) Q Consensus 161 f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GI---E~d~~Gr~---------------vaY~i~~ 222 (556) +++..+.. .+ .+++..++|+.+= |.- .+++.+...+ ++...|.. 
+-|.+++ T Consensus 133 ~~~~~~D~-~~--------~~~i~~v~~~~~~-P~~-~~~~~~~~~~f~~~~~~~~~~y~~le~h~~~~~~~~I~~~~y~ 201 (496) T protein:vir:38 133 VIKVYHDG-NK--------NVKVSFATADCMY-PLS-NDSENVDECVIANSFHKNNKYYTLLEWNEWQGDVYTVTTELYQ 201 (496) T ss_pred EEEEEEcC-CC--------cEEEEEEcccceE-EEE-ecCCcEEEEEEEEEEEeCCeEEEEEEEEEEeCceEEEEEEEEe Confidence 88875532 11 2578889998652 211 1112222222 22222221 1111122 Q ss_pred cCCCccccCCc---cccceee---ccccCChhHeEeeec-----ccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 223 AFPGDPTDMEQ---WKWGYEP---ARFDWGRRRVIHIIE-----ALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVV 291 (556) Q Consensus 223 ~hpgd~~~~~~---~~~~rv~---~~~~v~a~~viH~f~-----~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i 291 (556) ...++..+..- .-|.-++ ....++..- +++|. ....+..-|+|.|+-++..+..++.-...-. ...- T Consensus 202 ~~~~~~~g~~v~~~~~~~~~~~~~~~~~~~~~~-f~~~~~~~~N~~~~~~p~G~Sd~~~~~~lid~ld~~~s~~~-~~~~ 279 (496) T protein:vir:38 202 SDDPNELGTKVSLTLLFDDIEPVVPLPDFTRPT-FIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYY-QEFK 279 (496) T ss_pred cCCccccCccccccccccccccceeecCCCcce-EEEecCCcccccccCCcCCCchHhhHHHHHHHHHHHHHHHH-HHHh Confidence 11111000000 0000000 011223332 33332 2245666799999999998888865544322 1111 Q ss_pred hcceeeeEeccCcccccccccccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHH Q lcl|NC_019524. 292 NATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDY 371 (556) Q Consensus 292 ~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F 371 (556) .+.-..|+-. . .....+...++. ...+.. 
....+..+.......+..++.+++.-+..+|..= T Consensus 280 ~~~~~i~v~~----~-~l~~~~~~~g~~---~~~~~~---------~~~~~~~~~~~~~~~~~~i~~~~~~i~~e~~~~~ 342 (496) T protein:vir:38 280 LGKKKVLVPS----S-FVKTAVNLDGST---TQYFDS---------TDEAFFLYQGDQDDNGKAIKDISVEIRSTEFIES 342 (496) T ss_pred hcccceecch----H-HhhccCCCCCcc---ccCCCC---------ccceEEEeecCCCcccccceeeccccCHHHHHHH Confidence 1222222210 0 000000000000 000000 0001112222233445568888988888888888 Q ss_pred HHHHHHHHHHhcCCCHHHhhchhhccc-chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHcCCccCCC Q lcl|NC_019524. 372 EQSLLRNIAASLGMSYEQFSRDYTKTN-YSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEE----VNAGNVPLPP 446 (556) Q Consensus 372 ~~~~lr~iaaglGi~ye~l~~D~s~~n-YSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a----~l~G~l~~p~ 446 (556) .+.+++.+....|+|+..++.|-+++. =..++.............+.. ....++.+++..++.+ +..|....+. T Consensus 343 l~~~l~~i~~~~g~~~~~f~~~~~g~~tAtei~~~~~~l~~~~~~~~~~-~~~~l~~l~~~il~~~~~~~~~~g~~~~~~ 421 (496) T protein:vir:38 343 INAMLRIYAMQVGLSAGTFTFDENGLKTATEVVSEKSETYQTKNSHSQL-IEQGIKEMIVSILEVGKFIEAYSGEVVELD 421 (496) T ss_pred HHHHHHHHHHhhCCChhhcCCCccccchHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhcCCCCCcc Confidence 999999999999999999988766542 222333333333333333333 3444445555444432 2233322111 Q ss_pred CcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHh-CCCHHHHHHHHHHHHHHHHHcCC Q lcl|NC_019524. 447 GKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRL-GGDFREVFKQRAREEGLIKSLKL 525 (556) Q Consensus 447 ~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~-G~D~e~v~~q~a~E~~~~~~~Gl 525 (556) -+.+.|.- -..+|+..+++.....+.+|+.|.+.++... |.+-+++-+++++.++ |... 
T Consensus 422 ---------------~i~v~f~d--~i~~d~~~~~~~~~~~~~~GiiS~et~l~~~~~~~d~ea~~el~ri~~---E~~~ 481 (496) T protein:vir:38 422 ---------------TITVDFDD--SIAQDEDTTINRYTNAKNQGMIPLKIALQRAWNITEAEADEWAEMLAK---EKQA 481 (496) T ss_pred ---------------ceEEEeCC--CCCCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCChHHHHHHHHHHHH---hhhc Confidence 12345542 2345999999999999999999999998776 7765554444333322 1111 Q ss_pred CCCccccccCCCCCCCCCCCCCCCCCCc Q lcl|NC_019524. 526 DFTGKMVEGNSTQSSNSSESTSDNPNEE 553 (556) Q Consensus 526 ~~~~~~~~~~~~~~~~~~~~~~~~~~~e 553 (556) ..+ ..+..+..+++| T Consensus 482 ~~~-------------~~d~~~~~~~~e 496 (496) T protein:vir:38 482 EMP-------------NNDMNGIFGEEE 496 (496) T ss_pred cCc-------------cccccCCCCCCC Confidence 110 001111111112 No 193 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=98.36 E-value=1.1e-06 Score=53.34 Aligned_cols=425 Identities=8% Similarity=-0.092 Sum_probs=199.4 Q ss_pred HHHHhhHhhcccchhhh-hhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHh Q lcl|NC_019524. 10 TRAKKAVDVVAETATAT-PMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVVAVHRD 88 (556) Q Consensus 10 ~~a~~a~~~~~~~~~~~-~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v~~~~~ 88 (556) .-...-......-...+ .......|.-+.. .-....+......... ..++ +.--.+++++-+|+..+. T Consensus 1 l~~~~i~~~i~~~~~~~~r~~~~~~YY~g~~-~i~~~~~~~~~~~~~~---------~~~~-~~ki~~n~~~~Ivd~~~~ 69 (451) T protein:vir:10 1 MELEKIRAIISADAARRQEILQAKSYYYNKN-DILKKGVVVQNRDENP---------LRNA-DNRISHNFHEILVDEKAS 69 (451) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhcccC-cccccccccccccccc---------cccc-ccccccchHHHHHHhhhh Confidence 22222222222211111 1122333443321 1000101000000000 0000 001126888999999999 Q ss_pred hhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEEEeecc Q lcl|NC_019524. 
89 SIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLATCEWLN 168 (556) Q Consensus 89 nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~ 168 (556) ++.|..++..+.. .+.....|+.|..+ +|...+..+.+....-|.++....+.. T Consensus 70 yl~G~p~~~~~~~---------------~~~~~~~~~~~~~n-----------~~~~~~~~~~~~~~~~G~a~~~~y~de 123 (451) T protein:vir:10 70 YMFTYPVLFDIDN---------------NKELNEKVTDVLGN-----------EFTRKAKNLAIEASNCGSAWLHYWIDE 123 (451) T ss_pred heecccceeecCC---------------cHHHHHHHHHHhcc-----------CHHHHHHHHHHHHhhcCeEEEEEeecC Confidence 9999887765421 12233455555431 567777778888899999998764432 Q ss_pred CCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEE-----CCCCCe-----EEEEEeecCCCccccC----Ccc Q lcl|NC_019524. 169 PTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQL-----DNNGAA-----LGYWLRKAFPGDPTDM----EQW 234 (556) Q Consensus 169 ~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~-----d~~Gr~-----vaY~i~~~hpgd~~~~----~~~ 234 (556) .. .......-.+++..|+|+.+---+.....+.+..+|.+ |..|.. .-++++...--..+.. ... T Consensus 124 ~~-~~~~~~~~~~~~~~i~p~~~~~vydd~~~~~~~~~ir~~~~~~~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~ 202 (451) T protein:vir:10 124 EY-SGEQVTNQTFKYGVVNTEEIIPIYRNGIERELEAVIRYYIQLEDVKGQIQKQAYTYVEFWTDKILDKYKFFGVSCCG 202 (451) T ss_pred Cc-ccccccccceeEEEEcccceEEEEcCCCCCceEEEEEEEEeeecccccccceEEEEEEEEeCCeEEEEEecccCccc Confidence 21 11111111357888999887422222333446666643 333332 1223333321100000 000 Q ss_pred ccc---eee-ccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCccccccc Q lcl|NC_019524. 235 KWG---YEP-ARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFG 310 (556) Q Consensus 235 ~~~---rv~-~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~ 310 (556) .+. .+| .+..+| |+++... .-|+|.|.+++..+..++....-......-.+--..+++.-.++. 
T Consensus 203 ~~~~~~~~~~~~g~vP---vv~~~nn-----~~~~~d~e~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~~~~~---- 270 (451) T protein:vir:10 203 SQIEHITVQHRFNSVP---FVEFSNN-----IKKQSDLSKYKKILDLYDRVMSGFANDLEDIQQIIYILENFGGED---- 270 (451) T ss_pred cccccccccCCCCeee---EEEeccC-----CCCCCchhhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCccc---- Confidence 000 000 011222 3444332 248899999887776555432222211111111112222110000 Q ss_pred ccccccccccccccccccccccccccccceecCCceeeecC-----CCceeeeecCCCCCccHHHHHHHHHHHHHHhcCC Q lcl|NC_019524. 311 QLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLY-----PGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGM 385 (556) Q Consensus 311 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~-----pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi 385 (556) .. .....+..+.+..+. .|-++++++.+.+...+..+.+.+.+.|....++ T Consensus 271 -----~~-------------------~~~~~~~~~~~i~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~ 326 (451) T protein:vir:10 271 -----TS-------------------EFLKELKRYKTIKTETDSEGDSGGLKTMQIEIPTEARKIILEILKKQIYESGQG 326 (451) T ss_pred -----ch-------------------hhHHHHhhCCeEEecCcCCccCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCc Confidence 00 000112222222222 3457899998889999999999999999999998 Q ss_pred CHHHhhchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCe Q lcl|NC_019524. 386 SYEQFSRDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNA 465 (556) Q Consensus 386 ~ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~ 465 (556) |.-.. ..++++|-.+.+..+..........+..|-.. ++.+++..+. ++ |... + . -+.+ T Consensus 327 p~~~~-~~~gn~Sg~Alk~~~~~l~~k~~~k~~~f~~~-l~~~~~li~~--~~-~~~d---~------~-------~i~i 385 (451) T protein:vir:10 327 LQQDT-ENFGNASGVALKFFYRKLELKSGLLETEFRTS-FDKLIKAILY--FL-GVTD---Y------K-------KIQQ 385 (451) T ss_pred ccccc-cccccccHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHH--Hh-CCCC---c------c-------ceeE Confidence 84222 23455555566666666655555555554333 3444443332 22 2111 0 0 1345 Q ss_pred eeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCC--CHHHHHHHHHHHHHHHHHcCCCCCcccccc Q lcl|NC_019524. 
466 EWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGG--DFREVFKQRAREEGLIKSLKLDFTGKMVEG 534 (556) Q Consensus 466 ~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~--D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~ 534 (556) .|..+-. .|-...+++..++ .|+.|.+..+...+. |+++.++++++|.+.....-.. ...+... T Consensus 386 ~f~~~~p--~n~~e~~~~~~kl--~g~iS~et~~~~~p~v~d~~~e~~~~~ee~~~~~~~~~~-~~~~~~~ 451 (451) T protein:vir:10 386 TYTRNMM--SNDLEDADIATKS--VGIIPTKIILRHHPWVDDVEEAEKLYLEEKKIQASKVSD-DYNNFTE 451 (451) T ss_pred EecCCCC--CCHHHHHHHHHHH--hccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHh-hcCCCCC Confidence 6644422 4655555555554 489999999999975 8898888887776544321110 0000000 No 194 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=98.30 E-value=1.6e-06 Score=52.40 Aligned_cols=451 Identities=11% Similarity=0.008 Sum_probs=203.4 Q ss_pred CCcchhhhHHHHHhhHhhcccc-----------hh----hhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAET-----------AT----ATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMA 65 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~-----------~~----~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~l 65 (556) |+=....+...-+....+.... .. .+......-|.|-. ..+......... T Consensus 1 m~~~~~ik~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~I~~w~~~Y~g~~------~~~~~~~~~~~~--------- 65 (517) T protein:vir:98 1 MKVIQRIKNFFKRGGYALSGQTLKSINDHEKINIDPNELARIERNLRQYEGDY------PQVEYINSQGKI--------- 65 (517) T ss_pred CchHHHHHHHHHHHHHHhcccchhHhhcCCceecCHHHHHHHHHHHHHhcCCC------cccccccccccc--------- Confidence 4433333333322111110000 00 00000000011100 000000000000 Q ss_pred HHHHHHHHhcChHHHHHHHHHHhhhccCCceeeeec-cc-cccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCH Q lcl|NC_019524. 66 SARAQDMVQNDGYAAGVVAVHRDSIVGSQYKLNAKP-NT-IVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTL 143 (556) Q Consensus 66 r~RaRdl~rNn~~a~~~v~~~~~nvVG~Gi~~~~~~-~~-~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f 143 (556) +.|. ..--++++.+.+.+++.|++.--++...- +. 
.....++ +..++.+.+.++ ..+| T Consensus 66 --~~~~-~~sl~~~~~i~~~~A~Ll~~e~~~i~v~d~~~~~~~~~~~---~~~~e~l~~i~~--------------~n~f 125 (517) T protein:vir:98 66 --QERD-YMTLNLRKLSADVLSGLVFNEQCEVYVSDAKDEEKKDNSF---KTAHEFIQHVFQ--------------HNKF 125 (517) T ss_pred --cccc-eeecCcHHHHHHHhhhhhcCCcceEEecccccccccccch---hHHHHHHHHHHH--------------hccH Confidence 0000 00126788899999999988532222110 00 0000011 112222322222 1246 Q ss_pred HHHHHHHhhhheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEE----E-CCCCCeEEE Q lcl|NC_019524. 144 TGLTRLAVSGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQ----L-DNNGAALGY 218 (556) Q Consensus 144 ~~lq~l~~r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE----~-d~~Gr~vaY 218 (556) +....-++...+.-|.+++++.|... ..+|..+.+|.+- |. ..+.+.+..++- . ...++.+-| T Consensus 126 ~~~~~~~~e~a~a~G~~a~k~~~d~~----------~~~I~~v~ad~~~-Pl-~~~~~~v~~~ai~~~~~~~~~~~~~~Y 193 (517) T protein:vir:98 126 IKNLSDYLEPTFALGGLTVRPYVDNG----------EIEFSWALANAFY-PL-RSNSNGISEGVMKSVTTKVIGNKTVYY 193 (517) T ss_pred HHHHHHHHHHHhhhCCEEEEEEEeCC----------eeEEEEEcCCeeE-EE-EecCCCeEEEEEEEEEEEeecCCceEE Confidence 66666666677778898888776421 2468888888772 21 223344555541 1 122333444 Q ss_pred EEeecCCCcc------ccC--------C-----------ccccceeecc---ccCChhHeEeeecc----cCCCcccCCc Q lcl|NC_019524. 219 WLRKAFPGDP------TDM--------E-----------QWKWGYEPAR---FDWGRRRVIHIIEA----LLAGQTRGIS 266 (556) Q Consensus 219 ~i~~~hpgd~------~~~--------~-----------~~~~~rv~~~---~~v~a~~viH~f~~----~r~gQ~RGvs 266 (556) .+...|--+. .+. . ..-|.-++.. 
..++.+-+.|+..+ ...+-..|+| T Consensus 194 t~lE~H~~~~~~~~~~~y~I~n~ly~s~~~~~lG~~v~L~~~~e~l~~~~~~~g~~~Plf~y~~~p~~N~~~~~splG~S 273 (517) T protein:vir:98 194 TLLEFHEWEKTEEGESLYVITNELYKSDNEGEIGKRIPLEELYEGMQEKTYIQGLSRPLFNYLKPSGFNNINPHSPLGLG 273 (517) T ss_pred EEEEEEecCceeccCCcEEEEEEEEecCCCccccccccccccccCCCcceeECCCCcceEEEecCCcccccccCCCCCCc Confidence 4444442110 000 0 0001111111 22445545554443 2335567999 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeE-----eccCccccccccccccccccccccccccccccccccccccee Q lcl|NC_019524. 267 EMVSALKQMKMTRNFQEITLQNAVVNATYAASV-----ESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIA 341 (556) Q Consensus 267 ~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 341 (556) .|+-++..|+.|+.--+.-...-+. +.-..|+ +...... +...+ ....... . -..... T Consensus 274 ~~~~a~~~~d~lD~~~s~~~~e~~~-g~~~i~vp~~~l~~~~~~~------g~~~~-------~~~d~~~-~--~y~~~~ 336 (517) T protein:vir:98 274 ITDNSVSTLKKINDTYDQFWWEIKM-GQRTVFVSDVMLRTVPDES------GMPPP-------QVFDPDV-N--VYKSIR 336 (517) T ss_pred hhhhhHHHHHHHHHHHHHHHHHHHh-CCcceecChhhhccccCCC------CcccC-------CCCCccc-c--eeeecc Confidence 9999999999999766554433333 2223333 1111110 00000 0000000 0 000001 Q ss_pred cCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccchhHHHHHHHHHHHHHHHHH--H Q lcl|NC_019524. 342 IDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSARASMAETQKYMDSRKK--L 419 (556) Q Consensus 342 l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~--~ 419 (556) +.. .+..++.++|.-...+|..=...+|+.|....|+||..++-|-. .-.++-....+..+.+..+.. . 
T Consensus 337 ~~~-------~~~~i~~~~~~iR~e~~~~~~~~~L~~i~~~~Gls~~t~~~~~~--~~kTATEi~s~~~~~~~t~~~~~~ 407 (517) T protein:vir:98 337 MGT-------DEEFVKDVTHDIRTEQYKEAINQALRTLEMELKLSVGTFSFDGR--SMKTATEIVSENDLTYRTRNDHVY 407 (517) T ss_pred CCC-------CCCceeeeccccchHHHHHHHHHHHHHHHHHhCCCccccccccc--ccccHHHHHHHHHHHHHHHHHHHH Confidence 111 13347788888777788888999999999999999999976643 334555555555555554432 1 Q ss_pred HHHHHHHHHHHHHHHHH----HHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCH Q lcl|NC_019524. 420 VADRFASAIYTLWLEEE----VNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTY 495 (556) Q Consensus 420 lv~~~~~pi~~~~l~~a----~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~ 495 (556) .+...+..+...-+..+ +..|.++... -+.+.|--. ...|+.++++.....+.+|+.|. T Consensus 408 ~~~~aL~~lv~~i~~l~~~~~~~~~~~~~~~---------------~v~v~f~D~--i~~D~~~~~~~~~~~v~aG~ms~ 470 (517) T protein:vir:98 408 EVEQFIKGLVISVLELAKTYKLFGGEIPSAE---------------HIGVDFDDG--VFQDRSALLRFYGQAKTFGFIPT 470 (517) T ss_pred HHHHHHHHHHHHHHHHHHHHhhcCCCCCCCc---------------ceEEEcCCC--CCCCHHHHHHHHHHHHhcCCCCH Confidence 23333333333333222 2334332111 123555332 34699999999999999999999 Q ss_pred HHHHHHh-CCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCc Q lcl|NC_019524. 496 EAEISRL-GGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEE 553 (556) Q Consensus 496 ~~~~ae~-G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e 553 (556) ...+.+. |.+-++..+++++-++ |.. .. ++.. 
.+.+....-.+++| T Consensus 471 ~~~i~~~~g~~eeeA~~e~~~i~~---E~~-~~--~~~~------~~~~~~~~~~gd~e 517 (517) T protein:vir:98 471 VEAIQRIFKVPKKTAEQWLEEIRK---DQI-EL--DPVT------ISQRAQKRMFGDEE 517 (517) T ss_pred HHHHHHhCCCChHHHHHHHHHHHH---hcc-cc--CCCC------ccccccCCCCCCCC Confidence 9998887 9887665333332211 111 11 1110 01111111111222 No 195 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=98.28 E-value=1.7e-06 Score=52.20 Aligned_cols=453 Identities=10% Similarity=0.019 Sum_probs=199.2 Q ss_pred CCcchhhhHHHHHhhHhh--------------cc--cchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDV--------------VA--ETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDM 64 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~--------------~~--~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~ 64 (556) |+=-..+++..-+..-.+ .+ .....+......-|.+-. +.... .+.... T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~ri~~~~~~y~g~~---~~~~~-~~~~~~----------- 65 (508) T protein:vir:15 1 MGLIQRIKDLFWKGAAATGVTGSLSKITDDPRISIDPDEYVRIQTDLDYYSDKL---QYIHY-QASDGI----------- 65 (508) T ss_pred CChHHHHHHHHHHHHHHhccccchHHhhcccccccCHHHHHHHHHHHHHhcCCC---ccccc-ccCCCC----------- Confidence 433333333221100000 00 000011111111233211 11100 000000 Q ss_pred HHHHHHHHHhcChHHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHH Q lcl|NC_019524. 65 ASARAQDMVQNDGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLT 144 (556) Q Consensus 65 lr~RaRdl~rNn~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~ 144 (556) +-..+...-++++.+++.+++.|.|.-.+....- ++.+ ++. ++++.+. -+|. 
T Consensus 66 ---~~~~~~~sln~~~~i~~~~A~lv~~e~~~i~v~~--------~~~~---~e~----l~~il~~----------n~f~ 117 (508) T protein:vir:15 66 ---KKKRLKNTINMAKTAARRIASVVFNEKAEIHVKD--------NNEA---DKF----LNDVLED----------NDFK 117 (508) T ss_pred ---ccccceeecchHHHHHHHHHhhhhCCCceEEeCC--------chHH---HHH----HHHHHHh----------ccHH Confidence 0000122337889999999999999643333211 1111 122 2222221 2466 Q ss_pred HHHHHHhhhheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhc-CCCCCCCCCceEEEEEEECC-----CCCeEEE Q lcl|NC_019524. 145 GLTRLAVSGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRM-SNPNNVMDTPNLRSGVQLDN-----NGAALGY 218 (556) Q Consensus 145 ~lq~l~~r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl-~~~~~~~~g~~i~~GIE~d~-----~Gr~vaY 218 (556) ....-++...+.-|.++++..+-.. ..+|..+.|+.+ +...+ .+.+...+-+.. .++-.-| T Consensus 118 ~~~~~~~e~a~a~G~~~~k~~~d~~----------~~~i~~v~ad~~~P~~~d---~~~~~~~af~~~~~~~~~~~~~~y 184 (508) T protein:vir:15 118 NKFEEALEKGVALGGFAMRPYIDGN----------HIKIAWVRADQFYPLQSN---TNDISEAAIASRTQRTESNQTKYY 184 (508) T ss_pred HHHHHHHHHHhhcCceEEEEEEeCC----------eeEEEEEcCCeeEEEEEc---CCCeEEEEEEEEEEeecCCCceEE Confidence 6666666677777888777665321 357888888774 32111 122222222111 1111112 Q ss_pred EEeecC-------------------C---CccccCCc-cccceeecc---ccCChhHeEeeecc----cCCCcccCCchh Q lcl|NC_019524. 219 WLRKAF-------------------P---GDPTDMEQ-WKWGYEPAR---FDWGRRRVIHIIEA----LLAGQTRGISEM 268 (556) Q Consensus 219 ~i~~~h-------------------p---gd~~~~~~-~~~~rv~~~---~~v~a~~viH~f~~----~r~gQ~RGvs~l 268 (556) .....| + |....... -+|.-++.. ..++++-++|+.-+ ...+..-|+|.| T Consensus 185 t~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~e~~~l~~~~~~~g~~~p~f~y~~~~~~N~~~~~splG~S~~ 264 (508) T protein:vir:15 185 TLLEFHQWQDNGSYQITNELYKSDSPDIVGNQVPLSTLPVYKELAPQVTISGLQRPLFAYFKTPGANNINIESPLGLGVV 264 (508) T ss_pred EEEEEEEEecCcceEEEEEEEecCCchhcCcccchhhcccccCCCcceEecCCCcceeEEecCCccccccCCCCcCCchH Confidence 222222 1 00000000 001111111 12334444444332 234566799999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccccccccccccccccccceecCCceee Q lcl|NC_019524. 
269 VSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIP 348 (556) Q Consensus 269 a~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~ 348 (556) +-++..+..++.-.+.-...-+ .+.-..++-...- ...+ . ....+.. ....-..+.. T Consensus 265 ~~~~~lid~lD~~~s~~~~e~~-~~~~~i~v~~~~l-----~~d~---~-~~~~~~~---------~~~~~~~~~~---- 321 (508) T protein:vir:15 265 DNAKHVLDDINDTHDQFIWEIR-LGQKHIAVQPGML-----RFDD---E-HKPTFDT---------EQNVYVGVLS---- 321 (508) T ss_pred hhhHHHHHHHHHHHHHHHHHHH-hcccceeechHHh-----cCCC---C-CccccCC---------CCeeEEeccC---- Confidence 9999999999866554333332 2222222211100 0000 0 0000000 0000000111 Q ss_pred ecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccchhHHHHHHHHHHHHHHHHH--HHHHHHHH Q lcl|NC_019524. 349 HLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSARASMAETQKYMDSRKK--LVADRFAS 426 (556) Q Consensus 349 ~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~--~lv~~~~~ 426 (556) .-..|..|+.++|.-...+|..-++.+++.|....|+++..++-|-+++ .++-+...+..+.+..+.. ..+...++ T Consensus 322 ~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~~~~~~gls~~~f~~~~~~~--~TAtei~s~~~~~~~t~~~~~~~~~~al~ 399 (508) T protein:vir:15 322 DDNNGLGVKDMTTPIRTVQYKDAIDHFIKEFEVQIGLSTGTFSYSNDGV--KTATEVVSNNSMTYQTRSSYLTMVEKAID 399 (508) T ss_pred CCCCCCceeEeecccChHHHHHHHHHHHHHHHHHhCCCchhcccccCcc--ccHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1245677999999988889999999999999999999999887664432 3444444444455543322 23344445 Q ss_pred HHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHh-CCC Q lcl|NC_019524. 427 AIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRL-GGD 505 (556) Q Consensus 427 pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~-G~D 505 (556) .+.+..++.+-+-+.... +.+......+ ....-+.+.|-- -...|+.++++.....+.+|+.|.+..+.+. 
|.| T Consensus 400 ~lv~~il~l~~~~~~~~~--g~~~~~~~~~-~~~~~v~v~f~D--~i~~d~~~~~~~~~~~v~aGi~s~e~~i~~~~g~~ 474 (508) T protein:vir:15 400 ELCQSIFELANAGALFDD--GKPLFTLDSA-SQPLDIECHFDD--GVFVNKDKQLEEDAKVLAIGALSKQTFLQRNYGMT 474 (508) T ss_pred HHHHHHHHHHHHhccccc--cccccccccc-cCCcceEEEeCC--CCCCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCC Confidence 555554544433222110 0000000000 000012344422 2346999999999999999999999998877 888 Q ss_pred HHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCC Q lcl|NC_019524. 506 FREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPN 551 (556) Q Consensus 506 ~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~ 551 (556) -+++.+++++=++. .....+.++ ...+.++.++| T Consensus 475 deea~~el~ri~~E---~~~~~~~~~---------~~~~~~g~~ge 508 (508) T protein:vir:15 475 DEQAAEELAKIQSE---APTDTFEGG---------RSAILNGGDGE 508 (508) T ss_pred hHHHHHHHHHHHHh---ccccCcccc---------ccccCCCCCCC Confidence 77654333332111 110100100 11111111111 No 196 >protein:vir:107880 Length: 491 # NCBI annotation: gp29 # Family: family:all:313 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024702;genbank:gi:48696939;genbank:GeneID:2845968 Probab=98.25 E-value=2.1e-06 Score=51.74 Aligned_cols=408 Identities=11% Similarity=0.061 Sum_probs=185.3 Q ss_pred CCc------chhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019524. 1 MKD------VKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQ 74 (556) Q Consensus 1 ~sp------~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~r 74 (556) |+| .++.++...++.+... .++.. +..... +.......++..+ ..+......-+++. 
T Consensus 1 m~~~i~~~~g~p~~~~~~~~~~~~~--------ia~~~------~~~~~~-~~~~~~~~~~~iL-r~~~~~~~~y~~m~- 63 (491) T protein:vir:10 1 MSKGLWVSPTEFVTFGEPDKSLSSQ--------IATRA------RSIDFF-ALGMYLPNPDPVL-KALGKDIRVYRELR- 63 (491) T ss_pred CCCceeCCCCCccCcccCChHHHHH--------HHhhh------cccccc-cccCCccchHHHH-HhcCCCHHHHHHHh- Confidence 322 2222222211111100 00000 000000 0111222233222 22222233445666 Q ss_pred cChHHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhh Q lcl|NC_019524. 75 NDGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGF 154 (556) Q Consensus 75 Nn~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~ 154 (556) .++++.+++++...-|.|.-+.+.+.-+ + .+..+.+++.++ +++|..+...++- . T Consensus 64 ~D~~i~s~l~~Rk~av~~~~w~i~~~~~------~----~~~~e~v~e~l~--------------~~~~~~~l~~~ld-a 118 (491) T protein:vir:10 64 ADAHVGGCVRRRKAAVKALEWGLDRGKA------K----SRVAKSIADVFA--------------DLDLSRIVTEMLD-A 118 (491) T ss_pred hChHHHHHHHHHHHHHhCCCcEEecCCC------C----HHHHHHHHHHHh--------------cCCHHHHHHHHHH-h Confidence 5999999999999999998877765321 1 222344443332 2357777777774 4 Q ss_pred eecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCcc Q lcl|NC_019524. 155 LMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQW 234 (556) Q Consensus 155 ~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~ 234 (556) +.-|=++..++|... ++...+-+|..+.+.++.. |..|+.+ | +.+.+..++.. T Consensus 119 ~~~G~s~~Ei~w~~~-----~g~~~~~~l~~r~~~~f~~----------------d~~~~l~-~-~~~~~~~~g~~---- 171 (491) T protein:vir:10 119 VLYGYQPMEITWGKV-----GNYIVPIDVVGKPADWFVY----------------DPENQLR-F-RSKDHWMQGEE---- 171 (491) T ss_pred hhhcceeEEEEEeec-----CCeeEEEEeeeecccceee----------------ccCCceE-E-ecCCCCCCcce---- Confidence 567888888888653 3444556777777766642 2222211 1 11111111100 Q ss_pred ccceeeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCccccccccccc Q lcl|NC_019524. 
235 KWGYEPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGM 314 (556) Q Consensus 235 ~~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~ 314 (556) +|. ..-|+|.+.. +.|..-|.+.|.++.-...-..........-...-++=..+.|.+.+... T Consensus 172 ----l~~-----~k~i~~~~~~-~~~~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~a~~------- 234 (491) T protein:vir:10 172 ----LPA-----RKFLVPRQEA-TYLNPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVGKHPRSASD------- 234 (491) T ss_pred ----ecC-----CCEEEEEecC-CCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEecCCCCCH------- Confidence 111 1235666444 57888999999887665544444444444434333333334443322110 Q ss_pred ccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCc---cHHHHHHHHHHHHHHhcCCCHHHhh Q lcl|NC_019524. 315 GQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGG---VGTDYEQSLLRNIAASLGMSYEQFS 391 (556) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~---~f~~F~~~~lr~iaaglGi~ye~l~ 391 (556) +..+ ..-.....+.......++.|.+|+++++...++ .|..|.+.+=+.|+..+-- +.|| T Consensus 235 ---~ek~------------~l~~al~~~~~~a~~viP~~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~iLG--qtlT 297 (491) T protein:vir:10 235 ---GEKN------------LLLDCLEDMVQDAVAVVPDDSSIEIKEAAGKTGSADVYERLLHFCRGEVSIALLG--QNQT 297 (491) T ss_pred ---HHHH------------HHHHHHHHHhcCcEEEecCCceeEEEecCCCCCChhHHHHHHHHHHHHHHHHHhh--hhcc Confidence 0000 001112234555667789999999998764332 3888888888888876431 3466 Q ss_pred chhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCc Q lcl|NC_019524. 
392 RDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGAS 471 (556) Q Consensus 392 ~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~ 471 (556) .|- +.||+.+-.-........+.....+...+- .+...+++ +++- +.+ +....|..+ T Consensus 298 t~~-~gs~a~~~vh~~v~~di~~~D~~~i~~tln-~li~~l~~---~N~~-~~~----------------~p~f~~~~~- 354 (491) T protein:vir:10 298 TEA-TSTRASAQAGLEVTDDIRDGDKAVVSEAMN-MLIRWICD---LNFD-GAD----------------RPVFDMWEQ- 354 (491) T ss_pred cCc-ccchhHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHH---hcCC-CCC----------------cceEEecCc- Confidence 663 557776655444444444555555444443 35554443 2211 111 001222211 Q ss_pred ccccchhhhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCC----CCC Q lcl|NC_019524. 472 RGQIDEKKETEAAILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSE----STS 547 (556) Q Consensus 472 ~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~----~~~ 547 (556) +-+| .+-+++..+.+..|+.=..+. ..+++||+.+.......+....+... .+. T Consensus 355 -~e~~-~~~a~~~~~L~~~G~~i~~~~--------------------i~e~~Gip~~~~~~~~~~~~~~~~~~~~~~~~~ 412 (491) T protein:vir:10 355 -EQVD-EIQAGRDQKLTQAGARFTPAY--------------------FKRAYNLQDGDLDERPLPVSAVDTVGAASFAEF 412 (491) T ss_pred -Cchh-HHHHHHHHHHHhCCCcCCHHH--------------------HHHHhCCCCCCcCccccccCCCCCccccccccc Confidence 2122 223444555666666433222 23455655432211111111111000 011 Q ss_pred CCCCCcC-CC Q lcl|NC_019524. 548 DNPNEET-TQ 556 (556) Q Consensus 548 ~~~~~e~-~~ 556 (556) ..++.+. 
++ T Consensus 413 ~~~~~~~~d~ 422 (491) T protein:vir:10 413 EAPDQDALDA 422 (491) T ss_pred CCCCCCchHH Confidence 1111100 01 No 197 >protein:vir:78749 Length: 337 # NCBI annotation: putative portal protein # Family: family:all:196 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285643;genbank:gi:148727149;genbank:GeneID:5220095 Probab=98.24 E-value=1.5e-06 Score=52.45 Aligned_cols=321 Identities=11% Similarity=0.070 Sum_probs=144.9 Q ss_pred chhhhHHHHHhhHhhcccchh---hhhhhhc---chhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcCh Q lcl|NC_019524. 4 VKKTTRTRAKKAVDVVAETAT---ATPMAVG---GGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDG 77 (556) Q Consensus 4 ~~~~~r~~a~~a~~~~~~~~~---~~~~~~~---~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~ 77 (556) |+..+..++.....-...... +.....+ ..|-.--.. ....|..++-+... .| .|.+-|+ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~--~~~~~~~pP~~~~~-----------La-~l~~~~~ 66 (337) T protein:vir:78 1 MTKRQQQPAQAAASSPRPSVVFSMPEAIDPTAWMTDYTGVFYN--PYGEYYQPPIDRKG-----------LA-KVARANA 66 (337) T ss_pred CCCcccCcccccccCceeEEEecCcccccCcchhHhhhhhhhc--cCcceecCCCCHHH-----------HH-HHhhcch Confidence 332221111111000000000 0000000 011110000 01223333333221 11 2334466 Q ss_pred HHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheec Q lcl|NC_019524. 78 YAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMT 157 (556) Q Consensus 78 ~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~d 157 (556) +-.+++..-.+.+.+ .+.+. -..+++++ -..+.. 
T Consensus 67 ~h~~~L~~k~N~~~~-~f~~~--------------------------------------------~~~~~~~~-~d~ll~ 100 (337) T protein:vir:78 67 HHGAILMARRNMVAG-RFTNQ--------------------------------------------RATITAFV-HNYLQF 100 (337) T ss_pred hhhhHHHhhhccccc-cCcCc--------------------------------------------HHHHHHHH-HHHHhh Confidence 666665554433322 12111 01122332 245677 Q ss_pred CceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccc Q lcl|NC_019524. 158 GEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWG 237 (556) Q Consensus 158 GE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~ 237 (556) |.+++.+++.. ..-++.|..|.+..+.. ..+|+. |++... +. T Consensus 101 GNay~~~~rn~--------~G~~~~L~pl~~~~v~~----------------~~d~~~--~~~~~~--~~---------- 142 (337) T protein:vir:78 101 GDGGLLKLRNS--------FGQVVGLHPLSSVYLRR----------------REDGCF--VYLQQG--KP---------- 142 (337) T ss_pred CCeEEEEEECC--------CCcEEEEEEeCCceeEe----------------eeCCeE--EEEEcC--Cc---------- Confidence 99998876532 12356788887766532 112221 222211 10 Q ss_pred eeeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHH---hcceeeeEeccCccccccccccc Q lcl|NC_019524. 238 YEPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVV---NATYAASVESELPSDVVFGQLGM 314 (556) Q Consensus 238 rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i---~A~~~~fi~~~~~~~~~~~~~~~ 314 (556) ...+++.+|||+..+...++..|+|++..++..+.- -..+++-+.+. .+...++|+.+.+... T Consensus 143 ----~~~~~~~eIiHik~~~~~~~~~Gls~~~~a~~si~l---~~aa~~~~~~~f~NGa~p~~il~~~~~~l~------- 208 (337) T protein:vir:78 143 ----NLIYRPDDVIWLAQYDPEQQVYGMPDYLGGLQSALL---NQDATLFRRRYFLNGAHMGFIFYATDPNMD------- 208 (337) T ss_pred ----eEEECCccEEEECCCCCCCCcccccHHHHHHHHHHH---HHHHHHHHHHHHhccCCCceeEEcCCCCCC------- Confidence 124667899999998877899999988777765543 23444444443 3445556654322111 Q ss_pred ccccccccccccccccccccccccceecCCceeeec-----CCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHH Q lcl|NC_019524. 
315 GQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHL-----YPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQ 389 (556) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L-----~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~ 389 (556) .+..+........ ..+. -..+.+..+ +.|-+++.++...-..+|.+..+.....||+.+|||-+. T Consensus 209 --~e~~~~lk~~~~~----~~G~----~n~~~~~v~~~~g~~~Gi~~~pis~~~~d~qfle~k~~s~~eIa~a~~VPp~l 278 (337) T protein:vir:78 209 --DDTEEEMKEMIAN----SKGV----GNFRSMFVNIPDGKPDGIKLIPVGDIATKDEFAAIKGITAQDVLTAHRYPPAL 278 (337) T ss_pred --HHHHHHHHHHHHH----hcCc----ccccceEEEcCCCCccceeEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHH Confidence 1111111111100 0000 012333334 345678888877777889898999999999999999985 Q ss_pred hh--chhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhh Q lcl|NC_019524. 390 FS--RDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALC 463 (556) Q Consensus 390 l~--~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~ 463 (556) +- .+-+..+|+++.+....|. ..-+.|+.++|.++ +..-.+ |.. ....|. ....+.+ T Consensus 279 lGi~~~~~~~~~~n~e~~~~~f~-----------~~~L~P~~~~ie~~-~n~~ll--~~~-~~~~f~--~~~~~~~ 337 (337) T protein:vir:78 279 AGIIPTNGGGGLGDPEKYDATYA-----------RNEVLPLCELVQDA-INSAGL--PRA-LWVTFR--ETIGAAV 337 (337) T ss_pred cccccCCCcCccccHHHHHHHHH-----------HHHHHHHHHHHHHH-HhhhcC--Chh-hceecc--ccccccC Confidence 52 2444568888766655553 33445555554432 222122 211 000111 1112222 No 198 >protein:vir:99232 Length: 526 # NCBI annotation: putative portal protein # Family: family:all:313 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950451;genbank:gi:119953652;genbank:GeneID:4643092 Probab=98.12 E-value=4.2e-06 Score=50.06 Aligned_cols=428 Identities=11% Similarity=-0.002 Sum_probs=188.2 Q ss_pred CC---cchh--hhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHH--HHHH-----HHHHHHHH Q lcl|NC_019524. 
1 MK---DVKK--TTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQ--QIAQ-----NQDMASAR 68 (556) Q Consensus 1 ~s---p~~~--~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~--~i~~-----~~~~lr~R 68 (556) |+ +-.+ .++..- +.+ .++ .-+. ..+.+..+......|.. .|.. +......- T Consensus 1 ~~~~~d~~g~p~~~~~~-~~~------~~~--------~~~~--~~~~~~~~~~~gltp~~l~~iLr~a~~gd~~~~~~L 63 (526) T protein:vir:99 1 MAQIVDVYGNPIRTQQL-REP------QTS--------RLAG--LAKEFAQHPAKGLTPAKLARILVEAEQGNLQAQAEL 63 (526) T ss_pred CCeeECCCCCccccccc-cch------hhh--------hhhh--hhhhhcccCcCCCCHHHHHHHHHhhhCCCHHHHHHH Confidence 11 0011 100000 000 000 0000 01122222222223322 1111 23444456 Q ss_pred HHHHHhcChHHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHH Q lcl|NC_019524. 69 AQDMVQNDGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTR 148 (556) Q Consensus 69 aRdl~rNn~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~ 148 (556) .+++...++++.+++++...-|.|.-+.+.+.-+ +...++...+.+++.|+ + .-+|..+.. T Consensus 64 ~e~m~e~D~~i~s~l~~Rk~av~~~~w~I~p~~~------~~~~~~~~a~~v~~~l~----~---------~~~~~~~i~ 124 (526) T protein:vir:99 64 FMDMEERDAHLFAEMSKRKRAILGLDWAVEPPRN------ASAAEKADADYLHELLL----D---------LEGLEDLLL 124 (526) T ss_pred HHHHHhhChHHHHHHHHHHHHHhCCCceEecCCC------CCHHHHHHHHHHHHHHh----c---------ccCHHHHHH Confidence 6778889999999999999999998766665322 11234444455544443 1 114777777 Q ss_pred HHhhhheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCcc Q lcl|NC_019524. 149 LAVSGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDP 228 (556) Q Consensus 149 l~~r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~ 228 (556) .++. .+--|=+++-++|... ++.-.+-++..+.+.+... |..++.. 
...+.+..++ T Consensus 125 ~~ld-a~~~G~s~~Eivw~~~-----~g~~~~~~l~~r~~~~f~~----------------~~~~~~~--l~~~~~~~~g 180 (526) T protein:vir:99 125 DALD-GIGHGYSCIELEWALQ-----GREWMPLAFHHRPQSWFQL----------------NPEDQNE--LRLRDNSPAG 180 (526) T ss_pred HHHH-hhhhcceeEEEEEeec-----CCceeEEEeeeecccceee----------------ccCCCcE--EEecCCCCCc Confidence 7765 4556888888888653 3444555676676665532 2222111 1111111111 Q ss_pred ccCCccccceeeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCccccc Q lcl|NC_019524. 229 TDMEQWKWGYEPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVV 308 (556) Q Consensus 229 ~~~~~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~ 308 (556) .. +| |..-|+|.+ ..+.|..-|.+.|.++.-...--.......+.-...-++=..+.|.+.+.. T Consensus 181 ~~--------l~-----~~k~i~~~~-~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~a~-- 244 (526) T protein:vir:99 181 EA--------LQ-----PFGWIIHRP-RARSGYVARSGLFRVLAWPYLFRHYATSDLAEMLEIYGLPIRLGKYPPGTA-- 244 (526) T ss_pred ee--------ec-----CCCeEEEee-cCCcCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCceEEEecCCCCC-- Confidence 00 11 123577775 445788888888877643322111111111122222222223333322110 Q ss_pred ccccccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCC-ccHHHHHHHHHHHHHHh-cCCC Q lcl|NC_019524. 309 FGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPG-GVGTDYEQSLLRNIAAS-LGMS 386 (556) Q Consensus 309 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~-~~f~~F~~~~lr~iaag-lGi~ 386 (556) ++.. ...-.....+.......++.|.+|++++....+ ..|..|.+.+=+.|+.. 
||-+ T Consensus 245 --------~~ek------------~~L~~av~~i~~d~~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~iLGqt 304 (526) T protein:vir:99 245 --------DEEK------------ATLLRAVTGLGHAAAGIIPETMAIDFQQAAQGSSEPFLAMMRQSEDAISKAVLGGT 304 (526) T ss_pred --------HHHH------------HHHHHHHHHHhhCcEEEecCCceeEEeecCCCCHHHHHHHHHHHHHHHHHHHhhhh Confidence 0000 001111234556677889999999999976544 34899999999999988 5644 Q ss_pred HHHhhchhhc---ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhh Q lcl|NC_019524. 387 YEQFSRDYTK---TNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALC 463 (556) Q Consensus 387 ye~l~~D~s~---~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~ 463 (556) ||.|..+ .||+.+..-........+.....+...+-+-+...++. +++--..|. ..+. T Consensus 305 ---lTs~~~~g~~gS~a~g~vh~~v~~di~~aDa~~i~~tln~~Li~~l~~---~N~~~~~~~-------------~~~p 365 (526) T protein:vir:99 305 ---LTSTTSQSGGGAFALGQVHNEVRHDLLASDARQLAATLSRDLLWPLLV---LNRPGSPDV-------------RRAP 365 (526) T ss_pred ---hccccccCcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---hCCCCcCCc-------------cccc Confidence 5444332 23443332222233333444455555554445555543 222111110 0011 Q ss_pred CeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHh-CCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCC-CCCC Q lcl|NC_019524. 464 NAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRL-GGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNST-QSSN 541 (556) Q Consensus 464 ~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~-G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~-~~~~ 541 (556) ++.| ......|...-+++....+..|+.=..+.++++ |....+--++. |.....+...+.. .... T Consensus 366 ~~~~--~~~e~eDl~~~a~~~~~L~~~G~~i~~~~i~e~~Gip~~~~~e~~-----------l~~~~~~~~~~~~~~~~~ 432 (526) T protein:vir:99 366 RLVF--DLREQADITSMAQSIPALVNVGLEIPSAWVYDKLGIPQPAKNEPV-----------LRSAAQPAILSRQHGQRV 432 (526) T ss_pred eEEe--CCCCcccHHHHHHHHHHHHhCCCccCHHHHHHHhCCCCCCCcccc-----------cCCCCCCccccccccccc Confidence 1222 223445666677788888888885444444333 54211100000 0000000000000 0000 Q ss_pred CCCCCCCCCCCcCCC Q lcl|NC_019524. 
542 SSESTSDNPNEETTQ 556 (556) Q Consensus 542 ~~~~~~~~~~~e~~~ 556 (556) ........+...+.+ T Consensus 433 ~~~~~~~~~~~~~~~ 447 (526) T protein:vir:99 433 AALATIVGPRYGDQQ 447 (526) T ss_pred ccccccccccCcchh Confidence 000000111101111 No 199 >protein:vir:4698 Length: 251 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061630;genbank:gi:9635717;genbank:GeneID:1262980 Probab=98.05 E-value=1.6e-06 Score=52.43 Aligned_cols=248 Identities=8% Similarity=0.005 Sum_probs=132.7 Q ss_pred HhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHH--HHHHHHHHHHHHhcChHHHHHHHHHHhhhccC Q lcl|NC_019524. 16 VDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQN--QDMASARAQDMVQNDGYAAGVVAVHRDSIVGS 93 (556) Q Consensus 16 ~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~--~~~lr~RaRdl~rNn~~a~~~v~~~~~nvVG~ 93 (556) +...-. .+ .|... +.+.....-...... ......-+.+-+.+++.+..+|+.+.+.|-.- T Consensus 1 MglF~~------------~~-----~r~~~-~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~iA~l 62 (251) T protein:vir:46 1 MGIFYK------------NE-----KRDLQ-YNEDDLQMMVQTLPSFQGTKLRQYKDIEAIRHSDIFTAVMMIASDLARM 62 (251) T ss_pred CCcccc------------cc-----ccccC-CCccchhhhhhhhccccCcCcceechhhhhccHHHHHHHHHHHHhHhhC Confidence 111100 00 00000 000000000000000 00000111233456788899999999999887 Q ss_pred CceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEEEeeccCCCCc Q lcl|NC_019524. 94 QYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLATCEWLNPTGTT 173 (556) Q Consensus 94 Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~~~ 173 (556) -|++.-.- +... . ..+++.+..+++ -.+|.+++....+..++..|++|+.+.+... 
T Consensus 63 p~~~~~~~---------~~~~--~---~~~~~ll~~~Pn------~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~---- 118 (251) T protein:vir:46 63 PIRVTVNG---------QINY--S---DRIVNLLNTRPN------PMYNGYIFKLVVFVSALLTSHGYIEITRDKT---- 118 (251) T ss_pred ceEEeeCc---------cccc--c---chHHHHHhccCC------CCCCHHHHHHHHHHHHhhcCCeEEEEEECCC---- Confidence 77664221 0000 0 123344445554 3568889999999999999999999865321 Q ss_pred CCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceeeccccCChhHeEee Q lcl|NC_019524. 174 MQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPARFDWGRRRVIHI 253 (556) Q Consensus 174 ~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~~~~v~a~~viH~ 253 (556) ..+..|..|+|++|. |+.|.+|++..|+.....+... ....++.++|||+ T Consensus 119 ----G~~~~L~~i~~~~v~--------------v~~~~~g~~~~~~~~~~~~~~g------------~~~~~~~~diiH~ 168 (251) T protein:vir:46 119 ----GEPMNLTFRKTSEIE--------------LKSDARGRLYYFHQRIDSNGNN------------IERNVKFEDMLDI 168 (251) T ss_pred ----CcEEEEEEECCceEE--------------EEECCCCcEEEEEEEeccCCcc------------eeEEECCccEEEe Confidence 236788999998883 4566777776655443322211 1235778899999 Q ss_pred ecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccccc-ccccccc Q lcl|NC_019524. 254 IEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIFNE-YMTGLAN 332 (556) Q Consensus 254 f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~ 332 (556) ...- .+...|+|++..+...|.-....++......+=.+...++++.+..-.. ++..+.... +... T Consensus 169 r~~~-~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~---------~e~~~~~~~~~~~~--- 235 (251) T protein:vir:46 169 KFYS-LDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDN---------KKARDRAREEFPKV--- 235 (251) T ss_pred cCcC-CCCeeecCHHHHHHHHHHHHHHHHHHHHHHHHccCCCcEEEEeCCCCCC---------HHHHHHHHHHHHHH--- Confidence 8764 5668999999999999988888888888888888899999997643110 000000000 0000 Q ss_pred ccccccceecCCceeeecCCCcee Q lcl|NC_019524. 
333 YVAQTKNIAIDGAKIPHLYPGTKL 356 (556) Q Consensus 333 ~~~~~~~~~l~pG~i~~L~pGe~i 356 (556) ..+.. ..|.+. .|.+= T Consensus 236 -~~g~~----n~g~~~---~gm~~ 251 (251) T protein:vir:46 236 -LVELN----KLGKLS---YSMNQ 251 (251) T ss_pred -hcCcc----cccccc---cccCC Confidence 00000 012111 12111 No 200 >protein:vir:79233 Length: 526 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469155;genbank:gi:157834998;genbank:GeneID:5648814 Probab=97.99 E-value=8.1e-06 Score=48.51 Aligned_cols=428 Identities=11% Similarity=0.011 Sum_probs=189.4 Q ss_pred CC---cchhh--hHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHH--HHHH-----HHHHHHHH Q lcl|NC_019524. 1 MK---DVKKT--TRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQ--QIAQ-----NQDMASAR 68 (556) Q Consensus 1 ~s---p~~~~--~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~--~i~~-----~~~~lr~R 68 (556) |+ ...+. ++.. .+.+ ..+. -+ ...+.++.+......|.. .|.. +......- T Consensus 1 ~~~~~d~~g~p~~~~~-~~~~------~~~~--------~~--~~~~~~~~~~~~gltp~~l~~il~~a~~gd~~~~~~L 63 (526) T protein:vir:79 1 MAQIVDVYGNPIRPQQ-LREP------QTSR--------LA--GLAKEFAQHPAKGLTPAKLARILVEAEQGNLQAQAEL 63 (526) T ss_pred CCeeeCCCCCccCccc-cchh------hhhh--------hh--hhhhhcccCCCCCcCHHHHHHHHHHhhCCCHHHHHHH Confidence 11 00110 0000 0000 0000 00 001122222222223322 1111 23334456 Q ss_pred HHHHHhcChHHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHH Q lcl|NC_019524. 69 AQDMVQNDGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTR 148 (556) Q Consensus 69 aRdl~rNn~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~ 148 (556) .+++..+++++.+++++...-|.|.-+.+.+.-+ +...++...+.++..|. + .-+|..+.. 
T Consensus 64 ~edm~e~D~~i~s~l~~Rk~av~~~~w~I~p~~~------~~~~~~~~a~~v~~~l~----~---------~~~~~~~i~ 124 (526) T protein:vir:79 64 FMDMEERDAHLFAEMSKRKRAILGLDWAVEPPRN------ASAAEKADADYLHELLL----D---------LEGLEDLLL 124 (526) T ss_pred HHHHHhhChHHHHHHHHHHHHHhCCCceEecCCC------CChHHHHHHHHHHHHHh----c---------ccCHHHHHH Confidence 6678889999999999999999997766665322 11234444555554443 1 114777777 Q ss_pred HHhhhheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCcc Q lcl|NC_019524. 149 LAVSGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDP 228 (556) Q Consensus 149 l~~r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~ 228 (556) .++.+ +--|=+++-++|... +|.-.+-+|....+.++.. |..++.. ...+.+..++ T Consensus 125 ~~ldA-~~~G~s~~Ei~w~~~-----~g~~~~~~l~~r~~~~F~~----------------~~~~~~~--l~~~~~~~~g 180 (526) T protein:vir:79 125 DALDG-IGHGYSCIELEWALQ-----GREWMPLAFHHRPQSWFQL----------------NPEDQNE--LRLRDNSPAG 180 (526) T ss_pred HHHhh-hhhcceeEEEEEeec-----CCceeEEEeeeecccceEe----------------ccCCCcE--EEecCCCCCc Confidence 77654 456888888888653 3344555666666655432 2222110 0011111111 Q ss_pred ccCCccccceeeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCccccc Q lcl|NC_019524. 229 TDMEQWKWGYEPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVV 308 (556) Q Consensus 229 ~~~~~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~ 308 (556) .. +| |..-|+|.+ ..+.|..-|.+.|.++.-...--.......+.-...-++=..+.|.+.+.. 
T Consensus 181 ~~--------l~-----~~k~iv~~~-~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~F~E~yG~P~~igky~~~a~-- 244 (526) T protein:vir:79 181 EA--------LQ-----PFGWIIHRP-RARSGYVARSGLFRVLAWPYLFRHYATSDLAEMLEIYGLPIRLGKYPPGTA-- 244 (526) T ss_pred ee--------ec-----CCceEEEee-cCCcCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCceEEEecCCCCC-- Confidence 00 11 123477775 445788888888877643322111111111121222222223333322110 Q ss_pred ccccccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCC-ccHHHHHHHHHHHHHHh-cCCC Q lcl|NC_019524. 309 FGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPG-GVGTDYEQSLLRNIAAS-LGMS 386 (556) Q Consensus 309 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~-~~f~~F~~~~lr~iaag-lGi~ 386 (556) ++.. ...-.....|....+..++.|.+|++++....+ ..|..|.+.+=+.|+.. ||-+ T Consensus 245 --------~~ek------------~~L~~av~~i~~da~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~iLGqt 304 (526) T protein:vir:79 245 --------DEEK------------ATLLRAVTGLGHAAAGIIPETMAIDFQQAAQGSSEPFLAMMRQSEDAISKAVLGGT 304 (526) T ss_pred --------HHHH------------HHHHHHHHHHhcCcEEEecCCceeEEeecCCCCHHHHHHHHHHHHHHHHHHHhhhh Confidence 0000 001112234556677889999999999976544 34899999999999988 5654 Q ss_pred HHHhhchhhc---ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhh Q lcl|NC_019524. 387 YEQFSRDYTK---TNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALC 463 (556) Q Consensus 387 ye~l~~D~s~---~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~ 463 (556) ||.|.++ .||+.+..-........+.....+...+-+-+...++. +++ |...+. ..+. 
T Consensus 305 ---lTs~~~~g~~gS~a~g~vh~~v~~di~~aDa~~i~~tln~~Li~~l~~---~N~----~~~~~~---------~~~p 365 (526) T protein:vir:79 305 ---LTSTTSQSGGGAFALGQVHNEVRHDILASDARQLAATLSRDLLWPLLV---LNR----PGSPDV---------RRAP 365 (526) T ss_pred ---hccccccCcchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---hCC----CCcCCc---------cccc Confidence 5544322 24544433333333344455555555554555555544 221 111100 0011 Q ss_pred CeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHh-CCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCC Q lcl|NC_019524. 464 NAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRL-GGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNS 542 (556) Q Consensus 464 ~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~-G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~ 542 (556) ++.| ......|-..-+++....+.+|+.=..+.++++ |...-+-.+.+ + +-. ..+........... T Consensus 366 ~~~~--~~~e~eDl~~~a~~~~~L~~~G~~i~~~~i~e~~gip~~~~~e~~------l---~~~--~~~~~~~~~~~~~~ 432 (526) T protein:vir:79 366 RLVF--DLREQADITSMAQSIPALVNVGLEIPSAWVYDKLGIPQPAKNEPV------L---RPA--AQPAILSRQHGQRV 432 (526) T ss_pred eEEe--CCCCcccHHHHHHHHHHHHhCCCcCCHHHHHHHhCCCCCCCchhh------c---ccc--CCcccccccccccc Confidence 1222 223445666677888888888886554444443 54211100000 0 000 00000000000000 Q ss_pred CCCCCCCCCCcCCC Q lcl|NC_019524. 543 SESTSDNPNEETTQ 556 (556) Q Consensus 543 ~~~~~~~~~~e~~~ 556 (556) ............+| T Consensus 433 ~~~~~~~~~~~~~~ 446 (526) T protein:vir:79 433 AALATIVGPRYGDQ 446 (526) T ss_pred ccccccccccCchh Confidence 00000001111111 No 201 >protein:vir:8654 Length: 629 # NCBI annotation: gp12 # Family: family:all:2798 # MgeID: mge:156 # MgeName: Rosebush # Cross-refs: genbank:acc:NP_817773;genbank:gi:29566205;genbank:GeneID:1259465 Probab=97.98 E-value=8.4e-06 Score=48.42 Aligned_cols=458 Identities=14% Similarity=0.087 Sum_probs=203.4 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhcccc--CCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChH Q lcl|NC_019524. 
1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAE--RTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGY 78 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~--~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~ 78 (556) |.+-.- |--+.++..++ . ...++..||+ -+.+....|.....|...+...+...+..-.=.|---.+| T Consensus 1 ma~~~l----r~~rrpk~~p~--~----~r~~al~aas~~i~~p~~~~~ks~~~~~~~~WQ~eAW~~~d~v~Elry~vgW 70 (629) T protein:vir:86 1 MAPTSL----RIVRRPKSEPV--S----TRQRALVAASQPVENPGKAFRKAMGSSTRTDWQEDAWKAYDAVGELRYYVGW 70 (629) T ss_pred CCccce----eeeecCCCCCh--h----hhhhhhhhhhhccccccchhhhhcCCCchhhhhHHHHHHHHhhhhHHHHhhh Confidence 222110 00011111111 1 1112233332 2233334555666666666655555444433333333344 Q ss_pred HHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecC Q lcl|NC_019524. 79 AAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTG 158 (556) Q Consensus 79 a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dG 158 (556) ..+++..+.-++ + -|.| -+.....++++ .....+++.+.-.. =..|.+--.+|.+.+...+-+-| T Consensus 71 ~~~s~Sr~rL~a-s-~idp--Dtg~ptg~i~e--~~~~~~~v~~~v~~---------i~gG~lgqa~lLkr~~~~ltV~G 135 (629) T protein:vir:86 71 RSSSASRVRLIA-S-AIDP--DTGLPTGSIDE--DDRVGARVQQIVNQ---------IAGGALGQAQLIKRVVEQLTVAG 135 (629) T ss_pred hhhhhceeeeEe-e-eecC--CCCCCccccCC--CchhHHHHHHHHHh---------hcCChhhHHHHHHHHHhheeccc Confidence 444444332221 1 0111 11111112222 11122333333221 25688888999999999999999 Q ss_pred ceEEEEeeccCCCCcCCCcccce-EEEEEchhhcCCCCCC-----CCCceEEEEEEECCCCCeEEEEEeecCCCccccCC Q lcl|NC_019524. 159 EVLATCEWLNPTGTTMQRRPFGT-AIQMISPYRMSNPNNV-----MDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDME 232 (556) Q Consensus 159 E~f~~~~~~~~~~~~~~~~~~~l-~lq~ie~drl~~~~~~-----~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~ 232 (556) |+.+.....+... .+....+. .-+.+-++-|.+..+. 
+.| .=.||+..--.+ +.|++.||.....- T Consensus 136 E~wiv~~~~~~~~--~d~~~~~~~eW~~vt~~ei~~~~~~~~i~lP~g----~~~e~~~~~d~l-~RiW~P~Prr~~e~- 207 (629) T protein:vir:86 136 ETWVAILFTDKSR--LDSNGNPVPEWLALTPEEVRASEKKTIIELPTG----DKHEFRDGLDGM-FRVWNPRARRAREP- 207 (629) T ss_pred ceEEEEeecCCCc--cCCCCcchhhheeechHHhhhccCceeeEcCCC----CcceeeCCCceE-EEeeCCCcccccCC- Confidence 9999887554322 22222221 3456777776543221 222 123454444444 67777777542110 Q ss_pred ccccceeeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHH---HHHHHHHHHhcceeeeEeccCcccccc Q lcl|NC_019524. 233 QWKWGYEPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQ---EITLQNAVVNATYAASVESELPSDVVF 309 (556) Q Consensus 233 ~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~---dael~~a~i~A~~~~fi~~~~~~~~~~ 309 (556) -|+.-++|..++.|-+.. .+....-.+.+-+ .|+-.+..-.... T Consensus 208 --------------------------------DSpvra~l~~l~Ei~~lt~~i~aaakSRL~gnGv-lflP~e~slP~~~ 254 (629) T protein:vir:86 208 --------------------------------DSPVRANLDSLKEIVRTTKTIANASKSRLIGNGV-VFVPHEMSLPSMN 254 (629) T ss_pred --------------------------------cchhHHHHHHHHHHHHhhhHHHHHHHHHHhhCce-eeeccCcccCccC Confidence 123333444444444333 3333333333322 3333222111111 Q ss_pred cccccccccccc--cccc-----c-----------ccccccccccccceecCCceeeecCCCceeeee---cCC-CCCcc Q lcl|NC_019524. 310 GQLGMGQGGFKE--IFNE-----Y-----------MTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQ---PAG-TPGGV 367 (556) Q Consensus 310 ~~~~~~~~~~~~--~~~~-----~-----------~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~---~~~-~p~~~ 367 (556) ...+....+... .... + +.....-+.-.+ |+---|||-++-+ .-. .-+.. 
T Consensus 255 ~p~~~n~pg~~~p~~~~~pa~~~l~~~l~q~a~tAi~De~S~aA~vP--------iia~~P~E~i~~i~hlkf~~ei~e~ 326 (629) T protein:vir:86 255 APVASNKPGAPAPPILGTPAVQQLQELLFQVAQTAYDDEDSMAALIP--------MFAAAPGELIKNVTHLKFDNQVTEV 326 (629) T ss_pred CCCCCCCCCcccccccccchHHHHHHHHHHHHhhhhcCCCCccceee--------eeEeechHHhcCeeEEeecCchhHH Confidence 111111000000 0000 0 000000111111 1222334333322 111 12222 Q ss_pred HHHHHHHHHHHHHHhcCCCHHHhhchhhcccchhHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHcCCccC Q lcl|NC_019524. 368 GTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSARASMAETQKYMDSRKKL---VADRFASAIYTLWLEEEVNAGNVPL 444 (556) Q Consensus 368 f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~---lv~~~~~pi~~~~l~~a~l~G~l~~ 444 (556) -..-....++.+|+|+.||-|.|+|==|++|-=|+=+.--+ ..|.+ .+.-+|+-|+.-||.-++.+--|+ T Consensus 327 aiktR~daI~RlA~glDippE~LLGlGsd~NHWsAWqI~de------dvrlHI~P~l~~ic~AlT~~~Lrp~Le~eGiD- 399 (629) T protein:vir:86 327 AIKTRNDAIARLAMGLDVSPERLLGLGSNSNHWSAWQIGDE------DVRLHILPPVEMLCEAITNQVLRTVLMREGID- 399 (629) T ss_pred HHhhHHHHHHHHHhccCCchhhheeccCCccceEEEEeccc------ceeeecchHHHHHHHHHHhhHHHHHHHHhCCC- Confidence 33445778899999999999999984367776554322222 23332 356789999999998888874442 Q ss_pred CCCcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHH--------HHHHHH Q lcl|NC_019524. 445 PPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGGDFREVF--------KQRARE 516 (556) Q Consensus 445 p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~--------~q~a~E 516 (556) | ..| +-|.-..-..+||-|--+| +.+.+.|.-|-+......|.+-++.. +|++.. T Consensus 400 p--------------~kY--vvW~DaS~Lt~dPd~~deA-~~a~drGAIt~eAlrk~lGf~eD~~yd~tt~E~~~~~a~d 462 (629) T protein:vir:86 400 P--------------NAY--VVWHDASQLTVDPDKTDEA-RDAFDRGAITAEAMVKMLGLADDTVYDFTTPEGWAQWARD 462 (629) T ss_pred H--------------HHh--EeeecCcccccCCCCcHHH-HHHHHcCCcCHHHHHHHhcCccccccCCCchHHHHHHHHH Confidence 2 233 6899999999999987665 56789999999988888887664444 344443 Q ss_pred HHHHH-------------HcCCCCCccccccCCCCCCCCCCCCCCC-----CCCcCCC Q lcl|NC_019524. 
517 EGLIK-------------SLKLDFTGKMVEGNSTQSSNSSESTSDN-----PNEETTQ 556 (556) Q Consensus 517 ~~~~~-------------~~Gl~~~~~~~~~~~~~~~~~~~~~~~~-----~~~e~~~ 556 (556) .-... -.++.++...+...+.......++.++. |+.|++. T Consensus 463 ~V~~~P~Li~~~a~l~~~~a~~~~P~~~~~~pp~~e~~~~dE~sga~~~~ep~te~d~ 520 (629) T protein:vir:86 463 RVGQDPNLLPTLAVLIPELADVEFPTPTVALPPAEEQDGDEEASGASRREEPDTEDDA 520 (629) T ss_pred hhhhCcchhhhhhhhhhhhcccccCccCCCCCccccCCCcccccCCCcCCCCCCCCCC Confidence 21111 1122222111111111111111111111 1111111 No 202 >protein:vir:99088 Length: 629 # NCBI annotation: gp12 # Family: family:all:2798 # MgeID: mge:1608 # MgeName: Qyrzula # Cross-refs: genbank:acc:YP_655692;genbank:gi:109521770;genbank:GeneID:4157810 Probab=97.93 E-value=1e-05 Score=47.95 Aligned_cols=458 Identities=14% Similarity=0.087 Sum_probs=202.5 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhcccc--CCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAE--RTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGY 78 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~--~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~ 78 (556) |.+-.- |--+.++..++ . ...++..||+ -+.+....|.....|...+...+...+..-.=.|---.+| T Consensus 1 ma~~~l----r~~rrpk~~p~--~----~r~~al~aas~~i~~p~~~~~ks~~~~~~~~WQ~eAW~~~d~v~Elry~vgW 70 (629) T protein:vir:99 1 MAPTSL----RIVRRPKSEPV--S----TRQRALVAASQPVENPGKAFRKAMGSSTRTDWQDDAWKAYDAVGELRYYVGW 70 (629) T ss_pred CCccce----eeeecCCCCCh--h----hhhhhhhhhhhcccccchhhhhhcCCCchhhhhHHHHHHHHhhhhHHHHhhh Confidence 222110 00011111111 1 1112233332 1233334455566666666555555444433333333344 Q ss_pred HHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecC Q lcl|NC_019524. 79 AAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTG 158 (556) Q Consensus 79 a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dG 158 (556) ..+++..+.-++ + -|.| -+.....++++ .....+++.+.-.. 
=+.|.+--.+|.+.+...+-+-| T Consensus 71 ~~~s~Sr~rL~a-s-~idp--Dtg~ptg~i~e--~~~~~~~v~~~v~~---------i~gG~lgqa~lLkr~~~~ltV~G 135 (629) T protein:vir:99 71 RSSSASRVRLIA-S-AIDP--DTGLPTGSIDE--DDRVGARVQQIVNQ---------IAGGALGQAQLIKRVVEQLTVAG 135 (629) T ss_pred hhhhhceeeeEe-e-eecC--CCCCCccccCC--CchhHHHHHHHHHh---------hcCChhhHHHHHHHHHhheeccc Confidence 444444332221 1 0111 11111112222 11122333333221 25688888999999999999999 Q ss_pred ceEEEEeeccCCCCcCCCcccce-EEEEEchhhcCCCCCC-----CCCceEEEEEEECCCCCeEEEEEeecCCCccccCC Q lcl|NC_019524. 159 EVLATCEWLNPTGTTMQRRPFGT-AIQMISPYRMSNPNNV-----MDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDME 232 (556) Q Consensus 159 E~f~~~~~~~~~~~~~~~~~~~l-~lq~ie~drl~~~~~~-----~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~ 232 (556) |+.+.....+... .+....+. .-+.+-++-|.+..+. +.| .=.||+..--.+ +.|++.||.....- T Consensus 136 E~wiv~~~~~~~~--~d~~~~~~~eW~~vt~~ei~~~~~~~~i~lP~g----~~~e~~~~~d~l-~RiW~P~Prr~~e~- 207 (629) T protein:vir:99 136 ETWVAILFTDKSR--LDSNGNPVPEWLALTPEEVRASEKKTIIELPTG----DKHEFRDGLDGM-FRVWNPRARRAREP- 207 (629) T ss_pred ceEEEEeecCCCc--cCCCCcchhhheeechHHhhhccCceeEEcCCC----CccceeCCCceE-EEeeCCCcccccCC- Confidence 9999887554322 22222221 3456777777643221 222 122444333333 67777777542110 Q ss_pred ccccceeeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHH---HHHHHHHHHhcceeeeEeccCcccccc Q lcl|NC_019524. 233 QWKWGYEPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQ---EITLQNAVVNATYAASVESELPSDVVF 309 (556) Q Consensus 233 ~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~---dael~~a~i~A~~~~fi~~~~~~~~~~ 309 (556) -|+.-++|..++.|-+.. .+....-.+.+-+ .|+-.+..-.... 
T Consensus 208 --------------------------------DSpvra~l~~l~Ei~~lt~~i~aaakSRL~gnGv-lflP~e~slP~~~ 254 (629) T protein:vir:99 208 --------------------------------DSPVRANLDSLKEIVRTTKTIANASKSRLIGNGV-VFVPHEMSLPSMN 254 (629) T ss_pred --------------------------------cchhHHHHHHHHHHHHhhhHHHHHHHHHHhhCce-eEeccCcccCccC Confidence 123333444444444333 3333333333322 3333222111111 Q ss_pred cccccccccccc--cccc-----c-----------ccccccccccccceecCCceeeecCCCceeeee---cCC-CCCcc Q lcl|NC_019524. 310 GQLGMGQGGFKE--IFNE-----Y-----------MTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQ---PAG-TPGGV 367 (556) Q Consensus 310 ~~~~~~~~~~~~--~~~~-----~-----------~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~---~~~-~p~~~ 367 (556) ...+....+... .... + +.....-+.-.+ |+---|||-++-+ .-. .-+.. T Consensus 255 ~p~~~n~pg~~~p~~~~~pa~~~l~~~l~q~a~tAi~De~S~aA~vP--------iia~~P~E~i~~i~hlkf~~ei~e~ 326 (629) T protein:vir:99 255 APVASNKPGAPAPPILGTPAVQQLQELLFQVAQTAYDDEDSMAALIP--------MFAAAPGELIKNVTHLKFDNQVTEV 326 (629) T ss_pred CCCCCCCCCcccccccccchHHHHHHHHHHHHhhhhcCCCCccceee--------eeEeechHHhcCeeEEeecCchhHH Confidence 111111000000 0000 0 000000111111 1222334333322 111 12222 Q ss_pred HHHHHHHHHHHHHHhcCCCHHHhhchhhcccchhHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHcCCccC Q lcl|NC_019524. 368 GTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSARASMAETQKYMDSRKKL---VADRFASAIYTLWLEEEVNAGNVPL 444 (556) Q Consensus 368 f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~---lv~~~~~pi~~~~l~~a~l~G~l~~ 444 (556) -..-....++.+|+|+.||-|.|+|==|++|-=|+=+.--+ ..|.+ .+.-+|+-|+.-||.-++.+--|+ T Consensus 327 aiktR~daI~RlA~glDippE~LLGlGsd~NHWsAWqI~de------dvrlHI~P~l~~ic~AlT~~~Lrp~Le~eGiD- 399 (629) T protein:vir:99 327 AIKTRNDAIARLAMGLDVSPERLLGLGSNSNHWSAWQIGDE------DVRLHILPPVEMLCEAITNQVLRTVLMREGID- 399 (629) T ss_pred HHhhHHHHHHHHHhccCCchhhheeccCCccceEEEEeccc------ceeeecchhHHHHHHHHHhhHHHHHHHHhCCC- Confidence 33445778899999999999999984367776554322222 23332 356789999999998888874442 Q ss_pred CCCcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHH--------HHHHHH Q lcl|NC_019524. 
445 PPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGGDFREVF--------KQRARE 516 (556) Q Consensus 445 p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~--------~q~a~E 516 (556) | ..| +-|.-..-..+||-|--+| +.+.+.|.-|-+......|.+-++.. +|++.. T Consensus 400 p--------------~kY--vvW~DaS~Lt~dPd~~deA-~~a~drGAIt~eAlrk~lGf~eD~~yd~tt~E~~~~~a~d 462 (629) T protein:vir:99 400 P--------------NAY--VVWHDASQLTVDPDKTDEA-RDAFDRGAITAEAMVKMLGLADDTVYDFTTPEGWAQWARD 462 (629) T ss_pred H--------------HHh--EeeecCcccccCCCCcHHH-HHHHHcCCccHHHHHHHhcCccccccCCCchHHHHHHHHH Confidence 2 233 6899999999999987665 56788999999988888887664444 344443 Q ss_pred HHHHH-------------HcCCCCCccccccCCCCCCCCCCCCCCC-----CCCcCCC Q lcl|NC_019524. 517 EGLIK-------------SLKLDFTGKMVEGNSTQSSNSSESTSDN-----PNEETTQ 556 (556) Q Consensus 517 ~~~~~-------------~~Gl~~~~~~~~~~~~~~~~~~~~~~~~-----~~~e~~~ 556 (556) .-... -.++.++...+...+.......++.++. |+.|++. T Consensus 463 ~V~~~P~Li~~~a~l~~~~a~~~~P~~~~~~pp~~e~~~~dE~sga~~~~ep~te~d~ 520 (629) T protein:vir:99 463 RVGQDPNLLPTLAVLIPELADVEFPTPTVALPPAEEQDGDEEASGASRREEPDTEDDA 520 (629) T ss_pred hhhhCcchhhhhhhhhhhhcccccCccCCCCCccccCCCcccccCCCcCCCCCCCCCC Confidence 21111 1122222111111111111111111111 1111111 No 203 >protein:vir:103860 Length: 528 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938234;genbank:gi:38229139;genbank:GeneID:2648175 Probab=97.73 E-value=2.4e-05 Score=45.91 Aligned_cols=429 Identities=8% Similarity=-0.021 Sum_probs=187.0 Q ss_pred CC--------cchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHH---HHH----HHHHHH Q lcl|NC_019524. 1 MK--------DVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQ---QIA----QNQDMA 65 (556) Q Consensus 1 ~s--------p~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~---~i~----~~~~~l 65 (556) |+ |++.. .. .+..+++. + ...+.+..+......|.. 
.++ .+...+ T Consensus 1 ~~~~~d~~g~p~~~~-------~~---~~~~~~~~--------~--~~~~~~~~~~~~gltp~~l~~il~~a~~gd~~~~ 60 (528) T protein:vir:10 1 MAAIVDIYGNPLRTQ-------QL---RKQQTAHL--------A--GLAKEFANHPAKGLTPAKLAHILIEAEQGHLQAQ 60 (528) T ss_pred CCeeECCCCCccccc-------cc---cchhhhhh--------h--hhhhhhcccCCCCCCHHHHHHHHHhhhCCCHHHH Confidence 11 11110 00 00000000 0 001112222222223321 111 234445 Q ss_pred HHHHHHHHhcChHHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHH Q lcl|NC_019524. 66 SARAQDMVQNDGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTG 145 (556) Q Consensus 66 r~RaRdl~rNn~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~ 145 (556) ..-.+++...++++.+++++...-|.|.-+++.+.-+ +...+++..+.+++.+.. .-+|.. T Consensus 61 ~~L~~~m~e~D~~i~s~l~~Rk~av~~~~w~I~p~~~------~~~~~~~~a~~v~~~l~~-------------~~~f~~ 121 (528) T protein:vir:10 61 AELFMDMEERDAHLFAEMSKRKRAVLGLDWTIEPPRN------ASAAEKADAEYLHELLLD-------------LEGIED 121 (528) T ss_pred HHHHHHHHhhChHHHHHHHHHHHHHhcCCceEecCCC------CCHHHHHHHHHHHHHHhC-------------CccHHH Confidence 5566777789999999999999999998777765322 112234444555444431 114777 Q ss_pred HHHHHhhhheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCC Q lcl|NC_019524. 146 LTRLAVSGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFP 225 (556) Q Consensus 146 lq~l~~r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hp 225 (556) +...++.+ +.-|=+++-+.|... ++...+-++..+.+.++. ||..++.+ +.+ +.++ T Consensus 122 ~i~~~lda-~~~G~s~~Ei~w~~~-----~g~~~~~~~~~r~~~~f~----------------~~~~~~~~-l~~-~~~~ 177 (528) T protein:vir:10 122 LMLDCMDG-VGHGYSAIELDWSLQ-----GREWLPQAFDHRPQSWFQ----------------LNPDDQDE-LRL-RDNS 177 (528) T ss_pred HHHHHHhh-hhhcceeEEEEEeec-----CCceeEEEeeeeccccee----------------eccCCCcE-Eec-cCCC Confidence 77766653 556888888888643 344555567767765543 22222221 111 1111 Q ss_pred CccccCCccccceeeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcc Q lcl|NC_019524. 
226 GDPTDMEQWKWGYEPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPS 305 (556) Q Consensus 226 gd~~~~~~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~ 305 (556) .++.. +| |..-|+|.+ ..+.|..-|.+.|.++.-...--.......+.-...-++=..+.|.+.+. T Consensus 178 ~~g~~--------l~-----~~k~iv~~~-~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~a 243 (528) T protein:vir:10 178 IAGEV--------LQ-----PFGWIMHKP-RSRSGYVARSGLFRVLAWPYLFKHYSTADLAEMLEIYGLPIRLGKYPPGT 243 (528) T ss_pred CCcee--------ec-----CCCeEEEee-cCCCCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCCeEEEecCCCC Confidence 11000 11 223577765 44577778888888764433322222222222222223222333433211 Q ss_pred cccccccccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCc-cHHHHHHHHHHHHHHhcC Q lcl|NC_019524. 306 DVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGG-VGTDYEQSLLRNIAASLG 384 (556) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~-~f~~F~~~~lr~iaaglG 384 (556) . ++... ..-.....+.......++.|.+|++++....++ .|..|.+.+-+.|+..+- T Consensus 244 ~----------~~ek~------------~L~~al~~i~~~~~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~iL 301 (528) T protein:vir:10 244 P----------DEEKV------------TLLRAVTGLGHAAAGIIPESMSIDFQEASKGSAEPFMAMMRWCDDSMSKAIL 301 (528) T ss_pred C----------HHHHH------------HHHHHHHHHhhCcEEEecCCceeEEeecCCCChhHHHHHHHHHHHHHHHHHh Confidence 0 00000 001112335555667789999999999764443 489999999999988763 Q ss_pred CCHHHhhchhh---cccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHH Q lcl|NC_019524. 385 MSYEQFSRDYT---KTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDA 461 (556) Q Consensus 385 i~ye~l~~D~s---~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a 461 (556) - +.||.+.. ..||+.+..-........+.....+...+-+-+..+++. ++ .++..+ ... 
T Consensus 302 G--qtlTs~~~~g~~gS~Alg~vh~~v~~di~~aDa~~i~~tln~~li~~l~~---~N----~~~~~~---------~~~ 363 (528) T protein:vir:10 302 G--GTLTSQTSESGGGAYALGQVHNEVRHDLLAADARQLAATLSRDLLWPLLV---LN----RSGNLD---------ARR 363 (528) T ss_pred h--hhhhccccccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---hC----CCCCCC---------ccc Confidence 2 35655542 234444322222223333444444444444444444433 12 111100 001 Q ss_pred hhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHh-CCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCC Q lcl|NC_019524. 462 LCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRL-GGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSS 540 (556) Q Consensus 462 ~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~-G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~ 540 (556) +.++.|. .....|..+-+++....+..|+.=..+.++++ |....+--+++.. ...... ..+. ...+... T Consensus 364 ~p~~~~~--~~e~eDl~~~a~~~~~L~~~G~~i~~~~i~e~~gip~p~~~e~~~~------~~~~~~-~~~~-~~~~~~~ 433 (528) T protein:vir:10 364 APRLVFD--LKDRADLAAMATSLPPLVKLGVQVPVNWVQEQLGIPLPANGEAVLG------DQAGAG-IAQL-SRRPGPR 433 (528) T ss_pred cceEEec--CCCcccHHHHHHHHHHHHhCCCCCCHHHHHHHhCCCCCCCCccccc------CCCccc-cccc-Ccccccc Confidence 1122222 22345666677888888899985444444333 5321110000000 000000 0000 0000000 Q ss_pred CCCCCCCCCCCCcCCC Q lcl|NC_019524. 541 NSSESTSDNPNEETTQ 556 (556) Q Consensus 541 ~~~~~~~~~~~~e~~~ 556 (556) .....+...+.....+ T Consensus 434 ~~~~~~~~~~~~~~~~ 449 (528) T protein:vir:10 434 IAALAQVIGPRYRDQE 449 (528) T ss_pred cccccccccccccccc Confidence 0000001111111111 No 204 >protein:vir:1986 Length: 512 # NCBI annotation: Hypothetical protein # Family: family:all:313 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050633;genbank:gi:9633520;genbank:GeneID:2636304 Probab=97.72 E-value=2.5e-05 Score=45.82 Aligned_cols=420 Identities=10% Similarity=0.002 Sum_probs=185.5 Q ss_pred CC---cchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHH---HHH----HHHHHHHHHHH Q lcl|NC_019524. 
1 MK---DVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQ---QIA----QNQDMASARAQ 70 (556) Q Consensus 1 ~s---p~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~---~i~----~~~~~lr~RaR 70 (556) || .-.+.+-.+ .. .. ....+. -+ ...+.+.+....+.+|.. .++ .+......-.. T Consensus 1 m~~~~d~~g~p~~~--~~--~~-~~~~~~--------~~--~~~~~~~~~~~~gltp~~l~~iL~~a~~gd~~~~~~L~~ 65 (512) T protein:vir:19 1 MGRILDISGQPFDF--DD--EM-QSRSDE--------LA--MVMKRTQEHPSSGVTPNRAAQMLRDAERGDLTAQADLAF 65 (512) T ss_pred CcceeCCCCCcccc--cc--cc-ccccch--------hc--ccchhhccccccCCCHHHHHHHHHHhhCCCHHHHHHHHH Confidence 11 001100000 00 00 000000 00 000111111111122221 111 12233333467 Q ss_pred HHHhcChHHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHH Q lcl|NC_019524. 71 DMVQNDGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLA 150 (556) Q Consensus 71 dl~rNn~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~ 150 (556) ++...++++.+++++...-|.|.-+++.+.-+ ....+++..+.+++.+.. .-+|..+...+ T Consensus 66 dm~~~D~hi~s~l~~Rk~av~~~~w~I~p~~~------~~~~~~~~a~~v~~~l~~-------------~~~f~~~~~~l 126 (512) T protein:vir:19 66 DMEEKDTHLFSELSKRRLAIQALEWRIAPARD------ASAQEKKDADMLNEYLHD-------------AAWFEDALFDA 126 (512) T ss_pred HHHhhChHHHHHHHHHHHHHhCCCceEecCCC------CCHHHHHHHHHHHHHHhc-------------CCCHHHHHHHH Confidence 88889999999999999999998776665322 122345555666555431 22577777777 Q ss_pred hhhheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCcccc Q lcl|NC_019524. 151 VSGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTD 230 (556) Q Consensus 151 ~r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~ 230 (556) +. .+.-|=+++-+.|... ++...|-++..+++.++.... .++..++. +.+..++.. 
T Consensus 127 ld-A~~~G~s~~Ei~w~~~-----~g~~~~~~~~~r~~~~f~~~~--~~~~~lr~----------------~~~~~~G~~ 182 (512) T protein:vir:19 127 GD-AILKGYSMQEIEWGWL-----GKMRVPVALHHRDPALFCANP--DNLNELRL----------------RDASYHGLE 182 (512) T ss_pred Hh-hhhhcceeeeeEeeee-----CCceeeeeeeeeccccceecc--CCCcEEEe----------------cCCCCCcee Confidence 64 4556777888888543 344455667777776654211 11122221 111111100 Q ss_pred CCccccceeeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCccccccc Q lcl|NC_019524. 231 MEQWKWGYEPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFG 310 (556) Q Consensus 231 ~~~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~ 310 (556) +| |..-|+|.+.. +.|..-|.+.|.++.-...-........+.-...-++=..+.|.+.+... T Consensus 183 --------l~-----~~k~i~~~~~~-~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~a~~--- 245 (512) T protein:vir:19 183 --------LQ-----PFGWFMHRAKS-RTGYVGTNGLVRTLIWPFIFKNYSVRDFAEFLEIYGLPMRVGKYPTGSTN--- 245 (512) T ss_pred --------ec-----CCceEEEeccC-CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHcCCCeeEEecCCCCCH--- Confidence 11 12347777755 47888888888765433322222222222222232332334443322110 Q ss_pred ccccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCcc-HHHHHHHHHHHHHHh-cCCCHH Q lcl|NC_019524. 311 QLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGV-GTDYEQSLLRNIAAS-LGMSYE 388 (556) Q Consensus 311 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~-f~~F~~~~lr~iaag-lGi~ye 388 (556) +..+ ..-.....+.......++.|.+|++++....+.. |..|.+.+-+.|+.. 
||-+ T Consensus 246 -------~ek~------------~L~~al~~~~~~a~~iiP~~~~ie~~ea~~~~~~~y~~li~~~d~~Isk~iLGqt-- 304 (512) T protein:vir:19 246 -------REKA------------TLMQAVMDIGRRAGGIIPMGMTLDFQSAADGQSDPFMAMIGWAEKAISKAILGGT-- 304 (512) T ss_pred -------HHHH------------HHHHHHHHHhhCcEEEecCCceEEEeecCCCCHHHHHHHHHHHHHHHHHHHhhhh-- Confidence 0000 0011122345566777899999999997654444 899999999999976 5654 Q ss_pred Hhhchh-hcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeee Q lcl|NC_019524. 389 QFSRDY-TKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEW 467 (556) Q Consensus 389 ~l~~D~-s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w 467 (556) ||.+- ++.|||.+..-........+.....+...+-+-+...++. +++ +...+. ..+-++.| T Consensus 305 -lTs~~g~~Gs~a~~~vh~ev~~di~~aDa~~i~~tln~~li~~l~~---~N~----~~~~~~---------~~~p~~~f 367 (512) T protein:vir:19 305 -LTTEAGDKGARSLGEVHDEVRREIRNADVGQLARSINRDLIYPLLA---LNS----DSTIDI---------NRLPGIVF 367 (512) T ss_pred -hcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---hCC----CCCCCc---------cccceEEe Confidence 44443 2334665544444444455555566666655555555443 221 111100 00011122 Q ss_pred ecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccC----CCCCCCCC Q lcl|NC_019524. 468 IGASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGN----STQSSNSS 543 (556) Q Consensus 468 ~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~----~~~~~~~~ 543 (556) - .....|..+.+++..+. ..|+.=..+ ...+++||+.+.+..... ........ T Consensus 368 ~--~~e~eDl~~~a~~~~~l-~~G~~i~~~--------------------~i~e~~Gip~~~~~e~~~~~~~~~~~~~~~ 424 (512) T protein:vir:19 368 D--TSEAGDITALSDAIPKL-AAGMRIPVS--------------------WIQEKLHIPQPVGDEAVFTIQPVVPDNGSQ 424 (512) T ss_pred c--CCChhhHHHHHHHHHHH-hcCCCCCHH--------------------HHHHHhCCCCCCCccccccCCCcccccccc Confidence 1 11223444444433332 245533222 233455665432211100 00000000 Q ss_pred CCCCCCCCCcCCC Q lcl|NC_019524. 
544 ESTSDNPNEETTQ 556 (556) Q Consensus 544 ~~~~~~~~~e~~~ 556 (556) .......+...+| T Consensus 425 ~~~~~~~~~~~~~ 437 (512) T protein:vir:19 425 KEAALSAEDIPQE 437 (512) T ss_pred ccccccccCCCch Confidence 0111111111111 No 205 >protein:vir:106027 Length: 629 # NCBI annotation: gp9 # Family: family:all:2798 # MgeID: mge:1505 # MgeName: Cooper # Cross-refs: genbank:acc:YP_654906;genbank:gi:109392362;genbank:GeneID:4157055 Probab=97.61 E-value=3.7e-05 Score=44.85 Aligned_cols=449 Identities=16% Similarity=0.099 Sum_probs=191.1 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccC--CCCCHHHHHHHHHHHHHHHHHHHHhcC-- Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNP--SIISPDQQIAQNQDMASARAQDMVQND-- 76 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~--~~~s~~~~i~~~~~~lr~RaRdl~rNn-- 76 (556) |.+-. + ++.+++.- ....++..+|+....-...... .+.+.+.++..+.. ...|++-.- T Consensus 1 ma~~~----------l-rv~rrpk~--~p~~r~l~aasqp~~P~~~~~~~~~g~~~~~~WQ~eAW----~~~d~VgElry 63 (629) T protein:vir:10 1 MAAST----------L-RVSRRPKG--SPARRSLTAASQPMEPGRTPSRQVAGTVVRTSWQNEAW----ECMDLVGELRY 63 (629) T ss_pred CCccc----------e-eEEecCCC--ccceeeeccccCCCCcchhhchhhhhhhhhhhhhHHHH----HHHHhhhhHHH Confidence 11110 0 11111100 0112333344322110000101 11122333333322 333333222 Q ss_pred --hHHHHHHHHHHhhhccCCceeeeeccccccCCCh--hHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhh Q lcl|NC_019524. 77 --GYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPD--GWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVS 152 (556) Q Consensus 77 --~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~--~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r 152 (556) +|..+++.++.-.+ + -|.| -+.....++.+ -....+.+.++.. +.|.+--.+|++.+.. 
T Consensus 64 yvgW~~ss~Sr~rL~a-s-~idp--Dtg~ptg~i~ed~p~~~~v~~~v~~i-------------agG~lGqaqLlkr~~~ 126 (629) T protein:vir:10 64 YVGWRASSCSRVELIA-S-ELDP--DTGKPTGGIRDDDPDGLRFLEIVKTM-------------AGGPLGQAQLQKRAAE 126 (629) T ss_pred HhhhhhhhheeeeEEE-e-eecC--CCCCCccccccCchhHHHHHHHHHHh-------------cCccchHHHHHHHHHh Confidence 22222222221111 0 0110 11111112211 1122333333332 6688888999999999 Q ss_pred hheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECC-------CCCeEEEEEeecCC Q lcl|NC_019524. 153 GFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDN-------NGAALGYWLRKAFP 225 (556) Q Consensus 153 ~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~-------~Gr~vaY~i~~~hp 225 (556) .+-+-||..+.++- ++....++..-+ .-.++..+.|.+... ..+ -||... .|+-+=+.|++.|| T Consensus 127 ~ltV~GE~~i~il~--~~~~~pd~~~r~-~W~vVt~~Ei~~kg~----g~~--~i~lpdg~~he~~~~~D~l~RvW~P~P 197 (629) T protein:vir:10 127 CLTVPGEHRICLLD--QGDKNPDGSVRH-NWYVVTNDEVKNKGA----GKT--DIELPDGTIHEYSKGRDVMFRVWNPRP 197 (629) T ss_pred heeccCceEEEEee--cCCCCCCccccc-ceeeecHHHhccccC----cee--EEEcCCCceeeeeCCCeeEEEeeCCCc Confidence 99999997666542 222223333332 456677777764321 111 133321 22333346666666 Q ss_pred CccccCCccccceeeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcce--eeeEeccC Q lcl|NC_019524. 226 GDPTDMEQWKWGYEPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATY--AASVESEL 303 (556) Q Consensus 226 gd~~~~~~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~--~~fi~~~~ 303 (556) ..... --|+.-.+|..|+.|-....-...++|--.+= ..|+-.+. T Consensus 198 rr~~e---------------------------------~DSpvra~l~~lrEi~r~tk~i~~aakSRL~gnGvlflP~e~ 244 (629) T protein:vir:10 198 RRAKE---------------------------------PDSPVRACLDSLREIIRTTKKIRNASKSRLIGNGVVFLPQEL 244 (629) T ss_pred ccccC---------------------------------CcchhHHHHHHHHHHHHhhhHhHHHHHhHHhhCceeEeccCc Confidence 53211 11333445555555544443333333322222 23343322 Q ss_pred ccccccccccccccccc----------cccccc-----ccccccccccccceecCCce--eeecCCCceeeeecCCCCCc Q lcl|NC_019524. 
304 PSDVVFGQLGMGQGGFK----------EIFNEY-----MTGLANYVAQTKNIAIDGAK--IPHLYPGTKLKMQPAGTPGG 366 (556) Q Consensus 304 ~~~~~~~~~~~~~~~~~----------~~~~~~-----~~~~~~~~~~~~~~~l~pG~--i~~L~pGe~i~~~~~~~p~~ 366 (556) .-.......+..+.+.. +++... +....+... .... |+---|||-++-+..=.=.+ T Consensus 245 slp~~~ap~~~~~Pg~~~p~~~g~aa~d~l~~~l~q~a~aAi~De~S-------~aA~vPiia~vP~E~l~~ikhLkf~~ 317 (629) T protein:vir:10 245 SLPRATAPVADNQPGAPVPIVDGVAAADELSNLLFQTAAAAVDDEDS-------QAALIPLLATVPGEHLQKIFHLKIGN 317 (629) T ss_pred ccccccCCCCCCCCcccccccCCCcchHHHHHHHHHHHHhhhcCCCC-------ccceeeeEEeechHHhcCeeeeeecC Confidence 21111111111111100 000000 000000000 0111 22233455544443333333 Q ss_pred cHH----HHHHHHHHHHHHhcCCCHHHhhchhhcccchhHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHc Q lcl|NC_019524. 367 VGT----DYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSARASMAETQKYMDSRKKL---VADRFASAIYTLWLEEEVNA 439 (556) Q Consensus 367 ~f~----~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~---lv~~~~~pi~~~~l~~a~l~ 439 (556) .++ .-.+-.++.+|+|+.||-|.|+|==|++|-=|+=+.--| ..|.+ .+.-+|+-|++-||.-++.+ T Consensus 318 eite~~iktR~daI~RlAmglDispErLLGlGsd~NHWsAWqI~de------dvrlHI~P~l~~ic~Ait~~~Lrp~L~~ 391 (629) T protein:vir:10 318 EITEVEIKTRNDAIARLAMGLDVSPERLLGLGSNSNHWSAWQIGDE------DVQLHIKPVMEVLCAAIYREVLVATLRA 391 (629) T ss_pred chhHHHHhhHHHHHHHHHhccCCChhheeeccCCccceeeEEeccc------ceeeecchHHHHHHHHHHhHHHHHHHHH Confidence 333 334678899999999999999985477876555332222 23332 35678999999999988887 Q ss_pred CCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCCCHHHH--------HH Q lcl|NC_019524. 440 GNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGGDFREV--------FK 511 (556) Q Consensus 440 G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v--------~~ 511 (556) --|+ | ..| +-|.-..-..+||-|--+| +.+-+.|.-|-+..-+..|.+-++. 
++ T Consensus 392 eGiD-p--------------~~Y--vvw~DaS~Lt~dPd~~deA-~~a~drGaIt~eAlRr~lG~~~dd~y~~~t~~~~q 453 (629) T protein:vir:10 392 EGID-P--------------DRY--VLWYDASGLTVDPDKTDEA-TAAKEQGAITHEAYRRYLGLADEDGYDLETLEGAQ 453 (629) T ss_pred hCCC-H--------------HHh--EeeecCcccccCCCCcHHH-HHHHHcCCccHHHHHHHhccccccCCCcCCcHHHH Confidence 5442 2 223 6899999999999987665 5677889999888666666544443 34 Q ss_pred HHHHHHH--------HHHHcCCC------CCccccccCCCCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 512 QRAREEG--------LIKSLKLD------FTGKMVEGNSTQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 512 q~a~E~~--------~~~~~Gl~------~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 556 (556) ++|+..- +...++++ ++..+....++...+.+++.+-++++-++| T Consensus 454 ~~A~~~v~~~P~Li~~~apll~~~l~~i~~P~p~~a~~~~~~~~~~~E~~~~~~e~~~e 512 (629) T protein:vir:10 454 AWARDAIVADPSLIKVLAPLLTDELAEIDWPEPPAALPPGEDDQADEEQDTTGSEPSTE 512 (629) T ss_pred HHHHHHhcCCCchhhhhhhhcCCccccccccCCCCcCCCCCcccCccccCCCCCCcCCC Confidence 4444431 11122222 111111111111111111111111111111 No 206 >protein:vir:106491 Length: 646 # NCBI annotation: Pas4 # Family: family:all:2798 # MgeID: mge:1680 # MgeName: phiAsp2 # Cross-refs: genbank:acc:YP_024790;genbank:gi:48697405;genbank:GeneID:2846148 Probab=97.55 E-value=4.7e-05 Score=44.32 Aligned_cols=441 Identities=15% Similarity=0.035 Sum_probs=187.4 Q ss_pred CCc----chhhhHHHHHhhHhhcccchhhhhhhhcchhcccc---CCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 1 MKD----VKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAE---RTTREMFQWNPSIISPDQQIAQNQDMASARAQDMV 73 (556) Q Consensus 1 ~sp----~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~---~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~ 73 (556) +.| +.+..+..+| .++..+|+ .++. ...|. 
+..+.+.+...+...+..--=.|- T Consensus 4 ~rPk~~p~~p~~~~~ar-----------------rr~LtaAsa~l~~~~-~~~~k-t~~~~~~~WQ~eAW~~~d~vpELr 64 (646) T protein:vir:10 4 LKPKSAPPEPFGAEVAR-----------------RIALAGATAQVDLGA-SSSWK-TWKFGNKDWQTEGWRLYDIIPEHH 64 (646) T ss_pred cCCCCCCCCcccccccc-----------------hhhhhhccccccCCC-cceee-cCCCcchhhhHHHHHHHhhhhhHh Confidence 111 1111111110 11122222 1111 11122 333444444444333332222222 Q ss_pred hcChHHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhh Q lcl|NC_019524. 74 QNDGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSG 153 (556) Q Consensus 74 rNn~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~ 153 (556) --.+|..+++..+.-+ -+.++..-+ .+++.+. ++++..-.. =..|..-=.+|.+.+... T Consensus 65 y~vgW~~~a~SR~rL~--------aseiddtG~-~tg~v~~---~~v~~iv~~---------~~Gg~~gQ~qlLkr~~~~ 123 (646) T protein:vir:10 65 FLAGRIGDSVAQARLY--------VTEVDDTGE-ETGEVQD---ERIKRLAAV---------PLGTGSQRDDNLRLAGLD 123 (646) T ss_pred hHhhhhhhhhceeeee--------eeeecCCCC-CcCccch---HHHHHHhhh---------hccchhhHHHHHHHHHhh Confidence 2233333333332211 122331111 1222222 133333221 133444557788888999 Q ss_pred heecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCce--EEEE-E-EECCCCCeEEEEEeecCCCccc Q lcl|NC_019524. 154 FLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPN--LRSG-V-QLDNNGAALGYWLRKAFPGDPT 229 (556) Q Consensus 154 ~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~--i~~G-I-E~d~~Gr~vaY~i~~~hpgd~~ 229 (556) +-+-||+.++-..-. .....+.. ..+++..+.|.....+.+... +..| . || .+++.+-+.|++.||.... 
T Consensus 124 ltV~GE~wiv~~~~~-~~~~~~~~----~W~vvt~~Ev~~tg~~~~i~~p~~~~g~~~v~-~~~~d~lvRiW~P~Prr~~ 197 (646) T protein:vir:10 124 LAVGGECWIVGEGAA-TSPEAAEG----SWFVVTGSAISRTGDEIAVRRPQQRGGSKLVL-VDGQDILIRCWRPHPNDTD 197 (646) T ss_pred eecccceEEeecccc-CCCCCCcc----ceeeecHHHhccCCCeeeeecCccCCCCCcce-ecCCceEEEEecCCccccc Confidence 999999998631111 11111111 357788888854322110000 0000 0 11 1344556666666665321 Q ss_pred cCCccccceeeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccc Q lcl|NC_019524. 230 DMEQWKWGYEPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVF 309 (556) Q Consensus 230 ~~~~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~ 309 (556) . --|+.-++|..++.|-+...-...+++--.+=.+++--+.+ ..+ T Consensus 198 e---------------------------------pDSpvra~l~~l~Ei~~lt~~I~aaakSRL~GnGvLfvP~e--~s~ 242 (646) T protein:vir:10 198 Q---------------------------------ADSFTRSAIVPLREIELLTKREFAELDSRLTGAGIMFLPEG--VDF 242 (646) T ss_pred C---------------------------------CcchhHHHHHHHHHHHHhhhHhHHHHHHHHhcCceeeeccc--ccc Confidence 1 01333344444444444433333333322222222222222 111 Q ss_pred ccccccccccccccccc-----ccccccccccccceecCCce--eeecCCCc------eeeeecCCC-CCccHHHHHHHH Q lcl|NC_019524. 310 GQLGMGQGGFKEIFNEY-----MTGLANYVAQTKNIAIDGAK--IPHLYPGT------KLKMQPAGT-PGGVGTDYEQSL 375 (556) Q Consensus 310 ~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~l~pG~--i~~L~pGe------~i~~~~~~~-p~~~f~~F~~~~ 375 (556) .... ......+++... +....+... .... |+---||| .||.+.-.+ -+..-..-.... T Consensus 243 p~~~-~~~a~~~~l~~~l~qaa~tAi~De~S-------~aA~vPiia~~P~E~i~~~~~ik~l~f~~eite~aiktR~da 314 (646) T protein:vir:10 243 PRGE-EDPAGLAGFMAYLQRAAAASMADQSR-------ASAMVPIMATIPNEMMEHLDKIKPLTFWSELSAEITPMKDKA 314 (646) T ss_pred CCCC-CCCcchhHHHHHHHHHHHhhhcCCCC-------ccceeeeEEeeChHHHhhhhcceeeccCchhhHHHhhhHHHH Confidence 1110 010011111110 011110000 1111 22334555 444222221 222334556778 Q ss_pred HHHHHHhcCCCHHHhhchhhcccchhHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccc Q lcl|NC_019524. 
376 LRNIAASLGMSYEQFSRDYTKTNYSSARASMAETQKYMDSRKK--LVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMF 453 (556) Q Consensus 376 lr~iaaglGi~ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~--~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~ 453 (556) ++.+|+|+.||-|.|+| +.++|.=++=+.--+ ..|+ =.+.-+|+-|+.-||.-++.+--|+-|.- T Consensus 315 I~RlA~glDIppE~LLG-lgd~NHWtAWqI~de------~vrHI~P~l~~ic~AlT~~~Lrp~Le~eGi~dp~k------ 381 (646) T protein:vir:10 315 IARLASSAEIPGEVLTG-IGDANHWTAWLISDE------GIRWIRGYLGLIADALTRGFLRRALESMGVTNPER------ 381 (646) T ss_pred HHHHHhccCCchhheee-ccccceeeeeeeccc------cchhhhhHHHHHHHHHHhhHHHHHHHHcCCCChhH------ Confidence 99999999999999998 446777666222222 2342 13567899999999999988866654431 Q ss_pred cchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCCCH------HHHHHHHHHHHHHHH------ Q lcl|NC_019524. 454 YDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGGDF------REVFKQRAREEGLIK------ 521 (556) Q Consensus 454 ~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~------e~v~~q~a~E~~~~~------ 521 (556) =+-|.-+.-..+||-|--+| +.+.+.|.-|-+......|.+- +|-..|+++...... T Consensus 382 ----------yvvW~DaS~Lt~~pd~~deA-~qa~drGAIt~eAlrk~~Gf~~dd~pt~~E~~~~~~~~~v~~~P~Lil~ 450 (646) T protein:vir:10 382 ----------YAFAFDTSTLASKPNRLDEA-IQLHERNLIKDEEVVKAGAFSVDQMPTVQERAVQILLGLVKTQPDLILD 450 (646) T ss_pred ----------eEEeecCcccccCCCCcHHH-HHHHHcCCccHHHHHHHhcccccccCChHHHHHHHHHHHhcCCcccccc Confidence 15788889999999987665 5678889999888888777654 444455444332211 Q ss_pred -----HcCCCCCccccccCCCCCCCCCCCCCC-----------CCCCcCC--C Q lcl|NC_019524. 522 -----SLKLDFTGKMVEGNSTQSSNSSESTSD-----------NPNEETT--Q 556 (556) Q Consensus 522 -----~~Gl~~~~~~~~~~~~~~~~~~~~~~~-----------~~~~e~~--~ 556 (556) .+|++ ..+....++...++.+++.+ +++.+.. 
+ T Consensus 451 P~~qa~~~~P--~~~~~~lpp~~~~~~dg~~~~~e~~g~~~~~E~~~~pda~~ 501 (646) T protein:vir:10 451 PAIQAALGLP--AVQSVGLPPTAAQRTDGDLDDDESEGAPNGGEAPDQPDADE 501 (646) T ss_pred chhhccccCC--CcCccccCCcccccccCCCCChhhcCCCCCCccCCCCCCCc Confidence 12221 11111111111111111111 1111000 1 No 207 >protein:vir:79511 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1870 # MgeName: P74-26 # Cross-refs: genbank:acc:YP_001468055;genbank:gi:157265497;genbank:GeneID:5600628 Probab=97.48 E-value=6e-05 Score=43.74 Aligned_cols=433 Identities=10% Similarity=0.001 Sum_probs=183.0 Q ss_pred chhhhHHHHHhhHhhcccchhhhhhhhcchhcccc--CCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHH Q lcl|NC_019524. 4 VKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAE--RTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAG 81 (556) Q Consensus 4 ~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~--~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~ 81 (556) |....|...+.++........+... ..+.. ..+....+..+. ..+. |.+... ...--+++.+ ++++.+ T Consensus 1 m~k~~~k~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~g~~~~--~~~~-iLr~~~-~~~ly~~m~~-D~hi~s 70 (448) T protein:vir:79 1 MAKRGRKPKELVPGPGSIDPSDVPK-----LEGASVPVMSTSYDVVVDR--EFDE-LLQGKD-GLLVYHKMLS-DGTVKN 70 (448) T ss_pred CCCCCCCCccccCcccccccccchh-----hhhhhhhhccccccccccc--chhH-hhcccc-chHHHHHHhh-ChHHHH Confidence 2222222111111100000000000 00000 000000111111 1111 222111 2345677776 999999 Q ss_pred HHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceE Q lcl|NC_019524. 82 VVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVL 161 (556) Q Consensus 82 ~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f 161 (556) ++++...-|.|.-+.+.+.- +...+.+..+.+ .+|-.... 
-..++.+|.++...++-+ +.-|=++ T Consensus 71 ~l~~Rk~av~~~~w~v~p~~-------~~~~~~~~ae~v----~~~l~~~~---~~~~~~~f~~~~~~~lda-~~~G~s~ 135 (448) T protein:vir:79 71 ALNYIFGRIRSAKWYVEPAS-------TDPEDIAIAAFI----HAQLGIDD---ASVGKYPFGRLFAIYENA-YIYGMAA 135 (448) T ss_pred HHHHHHHHHhcCCceEecCC-------CCHHHHHHHHHH----HHHhhhhh---hhhccCCHHHHHHHHHHh-hhhccee Confidence 99999999999888876531 122233333333 33433221 134577899988887754 4567777 Q ss_pred EEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceeec Q lcl|NC_019524. 162 ATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPA 241 (556) Q Consensus 162 ~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~ 241 (556) +.++|.... +|.-.+-+|..+.+..+. =+.||.+|+.+-+.. .++... .... .. T Consensus 136 ~Eivw~~~~----~g~~~~~~l~~r~~~~~~-------------~f~~~~d~~l~~~~~--~~~~~~---~~~~----~~ 189 (448) T protein:vir:79 136 GEIVLTLGA----DGKLILDKIVPIHPFNID-------------EVLYDEEGGPKALKL--SGEVKG---GSQF----VS 189 (448) T ss_pred EEEEeeecC----CCceecccccccCCcccc-------------ceeeecCCceEEeec--CCcccc---cccC----CC Confidence 777775321 121111122222222121 134555555443221 111100 0000 01 Q ss_pred cccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccc Q lcl|NC_019524. 242 RFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKE 321 (556) Q Consensus 242 ~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~ 321 (556) ...+|..-+||.... +.|..=|.+.|.++.-...-........+.-...-++=..+.|.+.+.... ..... 
T Consensus 190 ~~~lP~~~~i~~~~~-~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vgky~~ga~~~--------~~~~~ 260 (448) T protein:vir:79 190 GLEIPIWKTVVFLHN-DDGSFTGQSALRAAVPHWLAKRALILLINHGLERFMIGVPTLTIPKSVRQG--------TKQWE 260 (448) T ss_pred ccccccceEEEEecC-ccCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCceEEEecCCCCCcC--------HHHHH Confidence 123566678887654 678888888887765533322222222222222222212233322111000 00000 Q ss_pred cccccccccccccccccceec--CCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccc Q lcl|NC_019524. 322 IFNEYMTGLANYVAQTKNIAI--DGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNY 399 (556) Q Consensus 322 ~~~~~~~~~~~~~~~~~~~~l--~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nY 399 (556) ....-...+ ..-....++.|.+|++++....+++|..|.+.+=++|+..+-- +.||.|-.+-+| T Consensus 261 ------------~l~~av~~i~~g~~a~~iiP~~~~ie~~ea~~~~~~~~~~i~~~d~~Isk~iLG--qtlTs~~~~g~~ 326 (448) T protein:vir:79 261 ------------AAKEIVKNFVQKPRHGIILPDDWKFDTVDLKSAMPDAIPYLTYHDAGIARALGI--DFNTVQLNMGVQ 326 (448) T ss_pred ------------HHHHHHHHHhcCCceEEEecCCceEEEEecCCCcccHHHHHHHHHHHHHHHHhh--hhhccccccchh Confidence 000011112 1223456999999999999988899999999999999887642 346666544455 Q ss_pred hhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCc-cCCCCcccccccchhhHHHhhCeeeecCcccccch Q lcl|NC_019524. 400 SSARASMAETQ-KYMDSRKKLVADRFASAIYTLWLEEEVNAGNV-PLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDE 477 (556) Q Consensus 400 Ss~R~~~~e~~-r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l-~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP 477 (556) +++-....+.+ ...+.....+...|.+-+..++++.- =|.. 
+.| ++.| ...+| T Consensus 327 ~~~~~~~~~v~~~~~~aDa~~i~~tln~~li~~l~~lN--fg~~~~~P------------------~~~f-----~~~e~ 381 (448) T protein:vir:79 327 AINIGEFVSLTQQTIISLQREFASAVNLYLIPKLVLPN--WPSATRFP------------------RLTF-----EMEER 381 (448) T ss_pred hhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc--CCCcCCCc------------------EEEe-----cCCCh Confidence 55443333333 23344545555555555555554311 0111 111 1111 12233 Q ss_pred hhhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 478 KKETEAAILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 478 ~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 556 (556) . |+++....+.. + ...+.+.+ .+. .+++|++.+.... ...+.. ..+...+.++...++ T Consensus 382 ~-Dl~~~a~~~~~----l----~~~~~~~~----~~~-----~~~~~~p~~~~~~-~~~a~~--~~~~~~~~~~~~~~~ 439 (448) T protein:vir:79 382 N-DFSAAANLMGM----L----INAVKDSE----DIP-----TELKALIDALPSK-MRRALG--VVDEVREAVRQPADS 439 (448) T ss_pred H-HHHHHHHHhhh----h----hccchhhH----HHH-----HHhhcCCCCCCCc-cccccC--CCCcccccccCCccc Confidence 2 33333322221 1 11122222 222 2356776432211 111111 111111111111111 No 208 >protein:vir:97900 Length: 639 # NCBI annotation: gp8 # Family: family:all:2798 # MgeID: mge:1482 # MgeName: Orion # Cross-refs: genbank:acc:YP_655104;genbank:gi:109391854;genbank:GeneID:4157263 Probab=97.38 E-value=8.1e-05 Score=43.02 Aligned_cols=462 Identities=14% Similarity=0.048 Sum_probs=188.1 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCC-cccccccCC-CCCHHHHHHHHHHHHHHHHHHHHhcChH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTT-REMFQWNPS-IISPDQQIAQNQDMASARAQDMVQNDGY 78 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~-r~~~~w~~~-~~s~~~~i~~~~~~lr~RaRdl~rNn~~ 78 (556) |.+- . .|--+.++...+. ...++..+|++-. .-.+.|... 
..|+++++..+...+..-.=.|---.+| T Consensus 1 ma~~-~---lr~~rrpk~~p~~------~rr~~ltaAsq~~~~p~~~~kt~~~~~ar~~WQ~eAW~~~d~v~Elry~vgW 70 (639) T protein:vir:97 1 MAAT-S---LRVVRRPKGSAPA------ARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYVSW 70 (639) T ss_pred CCcc-c---eeeeecCCCCCcc------hhhHHHhhhhhccCCcccchhhhccccchhhhhhhhhhhhhhhhhHHHHhhh Confidence 2221 0 0000111111110 0111122222111 001223222 3344444444333222221122222233 Q ss_pred HHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecC Q lcl|NC_019524. 79 AAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTG 158 (556) Q Consensus 79 a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dG 158 (556) ..+++..+.-++ + -|.|- +...-.+++.|.+....+ +...-+. =+.|.+--.+|.+.+...+-+-| T Consensus 71 ~~~s~sr~rL~a-s-~idpD--tg~PtG~V~~E~d~~~~~-v~~~v~~---------iagG~lGqa~llkr~~~~ltV~G 136 (639) T protein:vir:97 71 RANSCSRTTLIP-S-AIDPD--TGLPTGEVDIEEDPDAQT-VADYVKG---------IADGPLGQAALIKRAVECMTVVG 136 (639) T ss_pred hhhhhceeeeEe-e-eeccc--cCCCCCccccccccCcch-HHHHHHh---------hcCccchHHHHHHHHHhheeccc Confidence 333333222111 1 11111 111111122222222222 2222221 26688888999999999999999 Q ss_pred ceEEEEeeccCCC-CcCCCcccceEEEEEchhhcCCCC-CC-----CCCceEEEEEEECCCCCeEEEEEeecCCCccccC Q lcl|NC_019524. 159 EVLATCEWLNPTG-TTMQRRPFGTAIQMISPYRMSNPN-NV-----MDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDM 231 (556) Q Consensus 159 E~f~~~~~~~~~~-~~~~~~~~~l~lq~ie~drl~~~~-~~-----~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~ 231 (556) |+.+..+..+..+ .++...+.. .-+++..+.|.+.. +. ++|. +-||+. |..+=+.|++.||..... 
T Consensus 137 E~wi~~l~r~~k~~~~~~~~~~~-~W~vvs~~Ei~~~~~~~~~i~lPdG~----~he~~~-~~d~l~RvW~P~prr~~e- 209 (639) T protein:vir:97 137 EVWIAVLIRQEKDPVTGLAAPRA-RWYAVTREEIKSKAGETAEISLPDGK----THEFNR-DLDSLVRIWNPRPRKASQ- 209 (639) T ss_pred ceEEEEEEecCccccCccccccc-ceeeeeHHHhcccCCCeeEeecCCCC----CccccC-CCceEEEEeCCCcccccC- Confidence 9998866543321 111111111 34667777775321 11 1221 124433 334446677777654211 Q ss_pred CccccceeeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHH---HHHHHHHHHhcceeeeEeccCccccc Q lcl|NC_019524. 232 EQWKWGYEPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQ---EITLQNAVVNATYAASVESELPSDVV 308 (556) Q Consensus 232 ~~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~---dael~~a~i~A~~~~fi~~~~~~~~~ 308 (556) --|+.-.+|..++.|-+.. .+....-.+.+-+ .|+-.+..-... T Consensus 210 --------------------------------~dSpvra~l~~l~Ei~~~t~~i~aaakSRl~gnGv-lfvP~els~p~~ 256 (639) T protein:vir:97 210 --------------------------------ATSPVRACLETLREIERTTRKIKNAAKSRVMNNGV-LFVPAEMSLPAA 256 (639) T ss_pred --------------------------------CcchhHHHHHHHHHHHHhhhHHHHHHHHHHhhCce-eeeccccCCCCc Confidence 0123334444444444433 3333333333222 343322211110 Q ss_pred ccc--ccccc--cccccccc------cc--------ccccccccccccceecCCceeeecCCCceeeeecCCCCCccHH- Q lcl|NC_019524. 309 FGQ--LGMGQ--GGFKEIFN------EY--------MTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGT- 369 (556) Q Consensus 309 ~~~--~~~~~--~~~~~~~~------~~--------~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~- 369 (556) ... .+.++ +....... .+ +....+.....- +-| |+---|||-++-+..=.=.+.++ T Consensus 257 ~~p~~~~~~~~pg~~v~~~~~~~a~d~l~~~l~qaa~tai~De~S~aA---~vP--iia~~p~E~l~~ikhl~f~~ei~e 331 (639) T protein:vir:97 257 QAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAA---YIP--LVASVAAEHLEKVQHIKFGNEVTE 331 (639) T ss_pred cccccccccccCcccccccCCccchHHHHHHHHHHHHhhhcCCCCccc---eee--eeEeechHHhcCeeeeeecCchhH Confidence 000 00000 00000000 00 000000000000 001 22233455444333322223333 Q ss_pred ---HHHHHHHHHHHHhcCCCHHHhhchhhcccchhHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHcCCcc Q lcl|NC_019524. 
370 ---DYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSARASMAETQKYMDSRKKL---VADRFASAIYTLWLEEEVNAGNVP 443 (556) Q Consensus 370 ---~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~---lv~~~~~pi~~~~l~~a~l~G~l~ 443 (556) .-....++.+|+|+.||-|.|+|= +++|-=|+=+.--+ ..|.+ .+.-+|+-|+.-||.-++.+--++ T Consensus 332 ~aiktR~daI~RlA~glDi~pE~LLGl-~d~NHWsAWqI~de------dvrlHI~P~l~~icdAlT~~~Lrp~Le~eGvD 404 (639) T protein:vir:97 332 VEIKTRIDAITRLAMGLDVSPERLLGM-SKGNHWSAWAIGDE------DVQLHIKPVMDLICQAIYNDILTPLLAREGID 404 (639) T ss_pred HHHhhHHHHHHHHHhccCCchhheeec-ccccceEEEEeccc------ceeeecchhHHHHHHHHHhhHHHHHHHHhCCC Confidence 344678899999999999999985 78886554322222 23332 356788999999998888874442 Q ss_pred CCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCCCHHH--------HHHHHHH Q lcl|NC_019524. 444 LPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGGDFRE--------VFKQRAR 515 (556) Q Consensus 444 ~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~--------v~~q~a~ 515 (556) | ..| +-|.-..-..+||-|--+| +.+.+.|.-|-+..-...|.+-++ -+++++. T Consensus 405 -p--------------~kY--vvW~DaS~Lt~dPd~~deA-~qa~drGAIt~eAlR~~lG~~edd~yd~~t~e~~~~~A~ 466 (639) T protein:vir:97 405 -P--------------TKY--ILWYDASGLTSDPDLSDEA-VEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAA 466 (639) T ss_pred -H--------------HHh--EeeecCcccccCCCCcHHH-HHHHHcCCccHHHHHHHhccccccCCCCCCcHHHHHHHH Confidence 2 233 7899999999999987665 567788888888666666654442 3333443 Q ss_pred HHH--------HHHHcCCCCCccccccCCCCCCCCCCCCCC---------CCCCcCCC Q lcl|NC_019524. 516 EEG--------LIKSLKLDFTGKMVEGNSTQSSNSSESTSD---------NPNEETTQ 556 (556) Q Consensus 516 E~~--------~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~---------~~~~e~~~ 556 (556) ..- +..-++.+.-....-+.++....+.+++.+ .++.+++. 
T Consensus 467 ~~V~~~P~li~~~apl~~P~lq~~e~ptp~~a~~~a~~~~~~de~~ga~~~~ePdte~ 524 (639) T protein:vir:97 467 DVVTKNPELIAMYAPLLSSQLAGIEFPQPANAIESTREDEEDDEDSGARQQREPQTED 524 (639) T ss_pred HHhcCCcchhhhhhhccCccceecccCCCCCCCCCCCCCCCcccccCCCCCcCCCccc Confidence 321 111111110000000011111111111111 11111111 No 209 >protein:vir:107517 Length: 639 # NCBI annotation: gp8 # Family: family:all:2798 # MgeID: mge:1481 # MgeName: PG1 # Cross-refs: genbank:acc:NP_943786;genbank:gi:38638411;genbank:GeneID:2657197 Probab=97.38 E-value=8.1e-05 Score=43.02 Aligned_cols=462 Identities=14% Similarity=0.048 Sum_probs=188.1 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCC-cccccccCC-CCCHHHHHHHHHHHHHHHHHHHHhcChH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTT-REMFQWNPS-IISPDQQIAQNQDMASARAQDMVQNDGY 78 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~-r~~~~w~~~-~~s~~~~i~~~~~~lr~RaRdl~rNn~~ 78 (556) |.+- . .|--+.++...+. ...++..+|++-. .-.+.|... ..|+++++..+...+..-.=.|---.+| T Consensus 1 ma~~-~---lr~~rrpk~~p~~------~rr~~ltaAsq~~~~p~~~~kt~~~~~ar~~WQ~eAW~~~d~v~Elry~vgW 70 (639) T protein:vir:10 1 MAAT-S---LRVVRRPKGSAPA------ARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYVSW 70 (639) T ss_pred CCcc-c---eeeeecCCCCCcc------hhhHHHhhhhhccCCcccchhhhccccchhhhhhhhhhhhhhhhhHHHHhhh Confidence 2221 0 0000111111110 0111122222111 001223222 3344444444333222221122222233 Q ss_pred HHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecC Q lcl|NC_019524. 79 AAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTG 158 (556) Q Consensus 79 a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dG 158 (556) ..+++..+.-++ + -|.|- +...-.+++.|.+....+ +...-+. 
=+.|.+--.+|.+.+...+-+-| T Consensus 71 ~~~s~sr~rL~a-s-~idpD--tg~PtG~V~~E~d~~~~~-v~~~v~~---------iagG~lGqa~llkr~~~~ltV~G 136 (639) T protein:vir:10 71 RANSCSRTTLIP-S-AIDPD--TGLPTGEVDIEEDPDAQT-VADYVKG---------IADGPLGQAALIKRAVECMTVVG 136 (639) T ss_pred hhhhhceeeeEe-e-eeccc--cCCCCCccccccccCcch-HHHHHHh---------hcCccchHHHHHHHHHhheeccc Confidence 333333222111 1 11111 111111122222222222 2222221 26688888999999999999999 Q ss_pred ceEEEEeeccCCC-CcCCCcccceEEEEEchhhcCCCC-CC-----CCCceEEEEEEECCCCCeEEEEEeecCCCccccC Q lcl|NC_019524. 159 EVLATCEWLNPTG-TTMQRRPFGTAIQMISPYRMSNPN-NV-----MDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDM 231 (556) Q Consensus 159 E~f~~~~~~~~~~-~~~~~~~~~l~lq~ie~drl~~~~-~~-----~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~ 231 (556) |+.+..+..+..+ .++...+.. .-+++..+.|.+.. +. ++|. +-||+. |..+=+.|++.||..... T Consensus 137 E~wi~~l~r~~k~~~~~~~~~~~-~W~vvs~~Ei~~~~~~~~~i~lPdG~----~he~~~-~~d~l~RvW~P~prr~~e- 209 (639) T protein:vir:10 137 EVWIAVLIRQEKDPVTGLAAPRA-RWYAVTREEIKSKAGETAEISLPDGK----THEFNR-DLDSLVRIWNPRPRKASQ- 209 (639) T ss_pred ceEEEEEEecCccccCccccccc-ceeeeeHHHhcccCCCeeEeecCCCC----CccccC-CCceEEEEeCCCcccccC- Confidence 9998866543321 111111111 34667777775321 11 1221 124433 334446677777654211 Q ss_pred CccccceeeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHH---HHHHHHHHHhcceeeeEeccCccccc Q lcl|NC_019524. 232 EQWKWGYEPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQ---EITLQNAVVNATYAASVESELPSDVV 308 (556) Q Consensus 232 ~~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~---dael~~a~i~A~~~~fi~~~~~~~~~ 308 (556) --|+.-.+|..++.|-+.. .+....-.+.+-+ .|+-.+..-... 
T Consensus 210 --------------------------------~dSpvra~l~~l~Ei~~~t~~i~aaakSRl~gnGv-lfvP~els~p~~ 256 (639) T protein:vir:10 210 --------------------------------ATSPVRACLETLREIERTTRKIKNAAKSRVMNNGV-LFVPAEMSLPAA 256 (639) T ss_pred --------------------------------CcchhHHHHHHHHHHHHhhhHHHHHHHHHHhhCce-eeeccccCCCCc Confidence 0123334444444444433 3333333333222 343322211110 Q ss_pred ccc--ccccc--cccccccc------cc--------ccccccccccccceecCCceeeecCCCceeeeecCCCCCccHH- Q lcl|NC_019524. 309 FGQ--LGMGQ--GGFKEIFN------EY--------MTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGT- 369 (556) Q Consensus 309 ~~~--~~~~~--~~~~~~~~------~~--------~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~- 369 (556) ... .+.++ +....... .+ +....+.....- +-| |+---|||-++-+..=.=.+.++ T Consensus 257 ~~p~~~~~~~~pg~~v~~~~~~~a~d~l~~~l~qaa~tai~De~S~aA---~vP--iia~~p~E~l~~ikhl~f~~ei~e 331 (639) T protein:vir:10 257 QAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAA---YIP--LVASVAAEHLEKVQHIKFGNEVTE 331 (639) T ss_pred cccccccccccCcccccccCCccchHHHHHHHHHHHHhhhcCCCCccc---eee--eeEeechHHhcCeeeeeecCchhH Confidence 000 00000 00000000 00 000000000000 001 22233455444333322223333 Q ss_pred ---HHHHHHHHHHHHhcCCCHHHhhchhhcccchhHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHcCCcc Q lcl|NC_019524. 
370 ---DYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSARASMAETQKYMDSRKKL---VADRFASAIYTLWLEEEVNAGNVP 443 (556) Q Consensus 370 ---~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~---lv~~~~~pi~~~~l~~a~l~G~l~ 443 (556) .-....++.+|+|+.||-|.|+|= +++|-=|+=+.--+ ..|.+ .+.-+|+-|+.-||.-++.+--++ T Consensus 332 ~aiktR~daI~RlA~glDi~pE~LLGl-~d~NHWsAWqI~de------dvrlHI~P~l~~icdAlT~~~Lrp~Le~eGvD 404 (639) T protein:vir:10 332 VEIKTRIDAITRLAMGLDVSPERLLGM-SKGNHWSAWAIGDE------DVQLHIKPVMDLICQAIYNDILTPLLAREGID 404 (639) T ss_pred HHHhhHHHHHHHHHhccCCchhheeec-ccccceEEEEeccc------ceeeecchhHHHHHHHHHhhHHHHHHHHhCCC Confidence 344678899999999999999985 78886554322222 23332 356788999999998888874442 Q ss_pred CCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCCCHHH--------HHHHHHH Q lcl|NC_019524. 444 LPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGGDFRE--------VFKQRAR 515 (556) Q Consensus 444 ~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~--------v~~q~a~ 515 (556) | ..| +-|.-..-..+||-|--+| +.+.+.|.-|-+..-...|.+-++ -+++++. T Consensus 405 -p--------------~kY--vvW~DaS~Lt~dPd~~deA-~qa~drGAIt~eAlR~~lG~~edd~yd~~t~e~~~~~A~ 466 (639) T protein:vir:10 405 -P--------------TKY--ILWYDASGLTSDPDLSDEA-VEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAA 466 (639) T ss_pred -H--------------HHh--EeeecCcccccCCCCcHHH-HHHHHcCCccHHHHHHHhccccccCCCCCCcHHHHHHHH Confidence 2 233 7899999999999987665 567788888888666666654442 3333443 Q ss_pred HHH--------HHHHcCCCCCccccccCCCCCCCCCCCCCC---------CCCCcCCC Q lcl|NC_019524. 516 EEG--------LIKSLKLDFTGKMVEGNSTQSSNSSESTSD---------NPNEETTQ 556 (556) Q Consensus 516 E~~--------~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~---------~~~~e~~~ 556 (556) ..- +..-++.+.-....-+.++....+.+++.+ .++.+++. 
T Consensus 467 ~~V~~~P~li~~~apl~~P~lq~~e~ptp~~a~~~a~~~~~~de~~ga~~~~ePdte~ 524 (639) T protein:vir:10 467 DVVTKNPELIAMYAPLLSSQLAGIEFPQPANAIESTREDEEDDEDSGARQQREPQTED 524 (639) T ss_pred HHhcCCcchhhhhhhccCccceecccCCCCCCCCCCCCCCCcccccCCCCCcCCCccc Confidence 321 111111110000000011111111111111 11111111 No 210 >protein:vir:77981 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1843 # MgeName: P23-45 # Cross-refs: genbank:acc:YP_001467939;genbank:gi:157265380;genbank:GeneID:5600471 Probab=97.22 E-value=0.00013 Score=41.95 Aligned_cols=429 Identities=10% Similarity=0.001 Sum_probs=181.6 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCccccccc-CCCCCHHHHHHHHHHHHHHHHHHHHhcChHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWN-PSIISPDQQIAQNQDMASARAQDMVQNDGYA 79 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~-~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a 79 (556) |+=-+..++..... +....++..+ .+.++ +..-....|. ......+..+ ... ....-.+++.+ ++++ T Consensus 1 m~kk~~k~~~~~~~-~~~~~~~~~~-------~~~~~-~~~~~~~~~~g~~~~~~~~iL-r~~-~~~~ly~~m~~-D~hi 68 (448) T protein:vir:77 1 MAKRGRKPKELVPG-PGSIDPSDVP-------KLEGA-SVPVMSTSYDVVVDREFDELL-QGK-DGLLVYHKMLS-DGTV 68 (448) T ss_pred CCCCCCCCcccCCc-ccccchhhhh-------hhccc-hhhhcccccccccccchhHhh-ccc-cchHHHHHHhh-ChHH Confidence 32222221111000 0000000000 00111 0000000010 0111122222 221 12345678876 8999 Q ss_pred HHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCc Q lcl|NC_019524. 80 AGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGE 159 (556) Q Consensus 80 ~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE 159 (556) .+++++...-|.|.-+++.+.- ++..+++.. +..++|-.... 
+ ..++.+|.++...++ ..+.-|= T Consensus 69 ~s~l~~Rk~av~~~~w~v~p~~-------~~~~d~~~a----e~v~~~l~~~~--~-~~~~~~f~~~i~~~l-da~~~G~ 133 (448) T protein:vir:77 69 KNALNYIFGRIRSAKWYVEPAS-------TDPEDIAIA----AFIHAQLGIDD--A-SVGKYPFGRLFAIYE-NAYIYGM 133 (448) T ss_pred HHHHHHHHHHHhcCCceEecCC-------CCHHHHHHH----HHHHHHhhchh--h-hhccCCHHHHHHHHH-Hhhhhcc Confidence 9999999999999877776521 112233333 33444443321 1 124678999888876 4566788 Q ss_pred eEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCcccccee Q lcl|NC_019524. 160 VLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYE 239 (556) Q Consensus 160 ~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv 239 (556) +++-++|.... +|.-.+-+|..+.+..+. =..||.+|+.+-...- ++.. .... . T Consensus 134 s~~Eivw~~~~----dg~~~~~~l~~r~~~~~~-------------~f~~~~~~~l~~~~~~--~~~~---~~~~----~ 187 (448) T protein:vir:77 134 AAGEIVLTLGA----DGKLILDKIVPIHPFNID-------------EVLYDEEGGPKALKLS--GEVK---GGSQ----F 187 (448) T ss_pred eeEEEEEeecC----CCceeeccccccCCCccc-------------eeeeecCCceEEEecC--Cccc---cccc----C Confidence 88888886421 122122222222222111 1345555554432211 1100 0000 0 Q ss_pred eccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccc Q lcl|NC_019524. 240 PARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGF 319 (556) Q Consensus 240 ~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~ 319 (556) .....+|...++|.... +.|..-|.+.|.++.-...--.......+.-...-++=..+.|.+.+... .+.. 
T Consensus 188 ~~~~~lP~~~~i~~~~~-~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vgky~~ga~~--------~~~~ 258 (448) T protein:vir:77 188 VNGLEIPIWKTVVFLHN-DDGSFTGQSALRAAVPHWLAKRALILLINHGLERFMIGVPTLTIPKSVRQ--------GTKQ 258 (448) T ss_pred CCccccccceEEEEecC-CcCCcccchHHHHHHHHHHHHHhhHHHHHHHHHHcCCceeEEecCCCCCC--------CHHH Confidence 01124566778887654 56899999988776543332222222222222222221122232211100 0000 Q ss_pred cccccccccccccccccccceecC--CceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcc Q lcl|NC_019524. 320 KEIFNEYMTGLANYVAQTKNIAID--GAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKT 397 (556) Q Consensus 320 ~~~~~~~~~~~~~~~~~~~~~~l~--pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~ 397 (556) .. ....-...+. .-....++.|.+|++++....+++|..|.+.+=++|+..+.-. .||.|-.+- T Consensus 259 ~~------------~l~~av~~i~~g~~a~~iiP~g~~ie~~ea~~~~~~~~~~i~~~d~~Isk~iLGq--tlTs~~~~g 324 (448) T protein:vir:77 259 WE------------AAKEIVKNFVQKPRHGIILPDDWKFDTVDLKSAMPDAIPYLTYHDAGIARALGID--FNTVQLNMG 324 (448) T ss_pred HH------------HHHHHHHHHhcCCceEEEecCCceEEEEecCCCccCHHHHHHHHHHHHHHHHhcc--ccccccccc Confidence 00 0000111121 2234558999999999999888999999999999999886533 466664332 Q ss_pred cchhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHc-CCc-cCCCCcccccccchhhHHHhhCeeeecCcccc Q lcl|NC_019524. 398 NYSSARASMAET-QKYMDSRKKLVADRFASAIYTLWLEEEVNA-GNV-PLPPGKNWRMFYDPMMRDALCNAEWIGASRGQ 474 (556) Q Consensus 398 nYSs~R~~~~e~-~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~-G~l-~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~ 474 (556) +|+++-....+. ....+.....+...|.+-+..+++. ++ |.. +.| ++.| .. 
T Consensus 325 ~~~~~~~~~~~v~~~~~~aDa~~i~~tln~~Li~~l~~---lNfg~~~~~P------------------~~~f-----~~ 378 (448) T protein:vir:77 325 VQAVNIGEFVSLTQQTIISLQREFASAVNLYLIPKLVL---PNWPGATRFP------------------RLTF-----EM 378 (448) T ss_pred hhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---hcCCCCCCCC------------------EEEe-----cC Confidence 333333222222 2233444445555555555555544 22 211 111 1111 11 Q ss_pred cchhhhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccc-cCCCCCCCCCCCCCCCCCC- Q lcl|NC_019524. 475 IDEKKETEAAILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVE-GNSTQSSNSSESTSDNPNE- 552 (556) Q Consensus 475 iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~-~~~~~~~~~~~~~~~~~~~- 552 (556) .+| .|+++....+. ++.. ...+++||+...+... +.+++...+....+.+++. T Consensus 379 ~e~-eDl~~~a~~~~----~l~~--------------------~~~~~~~ip~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 433 (448) T protein:vir:77 379 EER-NDFSAAANLMG----MLIN--------------------AVKDSEDIPTELKALIDALPSKMRRALGVVDEVREAV 433 (448) T ss_pred CCh-hhHHHHHHHhH----HHHH--------------------HHHHHhcCCccCCcCCCCCchhcccccCCCCCCCchh Confidence 122 34443333322 1111 1335667764322111 1111111111111111111 Q ss_pred ---cCCC Q lcl|NC_019524. 553 ---ETTQ 556 (556) Q Consensus 553 ---e~~~ 556 (556) -++. T Consensus 434 ~~~~~~~ 440 (448) T protein:vir:77 434 RQPADSR 440 (448) T ss_pred hcchhhH Confidence 1111 No 211 >protein:vir:106716 Length: 698 # NCBI annotation: gp18 # Family: family:all:297 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944326;genbank:gi:38638625;genbank:GeneID:2657345 Probab=97.16 E-value=0.00015 Score=41.59 Aligned_cols=457 Identities=11% Similarity=0.036 Sum_probs=182.7 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhc--cccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGME--GAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGY 78 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~--aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~ 78 (556) -||--.+.+..- ++...-+...+ ++..++++ ++... 
.+ +|....+=+- .+.+..|.. +|- T Consensus 63 ~~~~~~~~~~~~---~~~~~~~~~~~-~~~~~~~~~~~~~~~--~l-~~~~~~~F~G---y~~la~laQ--------~~e 124 (698) T protein:vir:10 63 PSPSLRLARQFE---VDVSNYTPRER-RAASYALDFNGTSMD--AL-SFVTSSGFPG---FPTLVLLAQ--------LPE 124 (698) T ss_pred CCccccccccce---eccccCCcccc-chhhhhhcccccccc--cc-hhhhccCcch---HHHHHHHhh--------ccc Confidence 122111111100 00000000000 11112222 22111 11 3432211111 122222222 233 Q ss_pred HHHHHHHHHhhhccCCceeeee----cccccc--------CCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHH Q lcl|NC_019524. 79 AAGVVAVHRDSIVGSQYKLNAK----PNTIVL--------GAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGL 146 (556) Q Consensus 79 a~~~v~~~~~nvVG~Gi~~~~~----~~~~~l--------g~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~l 146 (556) .+..+.++..-.+-..+..... ++..-+ ..+.++. ++|++.|++..-. .-+ T Consensus 125 yr~~~~~ia~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqi----~~L~~e~erl~V~-------------~~l 187 (698) T protein:vir:10 125 YRAMHEVLADECIRTWGEAIGGTKEKADTSGLAAGGNAASTSDGDQL----KQINDEIERLRIR-------------DAV 187 (698) T ss_pred hhhHHHHHHHHhhcccceeccccchhhhhhcccccccccccccHHHH----HHHHHHHHHHHHH-------------HHH Confidence 3334444444333322211100 000000 0122333 3444444433221 122 Q ss_pred HHHHhhhheecCceEEEEeeccCCCCcCCCccc---ceE--------EEEEchhhcCCCCCCCCCceEEEEEEECCCCCe Q lcl|NC_019524. 147 TRLAVSGFLMTGEVLATCEWLNPTGTTMQRRPF---GTA--------IQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAA 215 (556) Q Consensus 147 q~l~~r~~~~dGE~f~~~~~~~~~~~~~~~~~~---~l~--------lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~ 215 (556) ...+.-+.+-.|-+.+...-..... ++ .|+ +.+ |-+||+..|. |......+.+. 
..+|+| T Consensus 188 ~eai~~aRlfGGa~~~i~I~gdd~~-l~--~PL~~~~~~I~kGslKGL~ViDp~~vt-P~~~n~~dP~s-----pdfgkP 258 (698) T protein:vir:10 188 RTTVIHDQAFGRAHPYFKIKGDDQI-MD--TPLVPRPYTVPKGSFQGLRVVEPYWVT-PNNYNSINPVA-----DDFYKP 258 (698) T ss_pred HHHHHhcccccceEEEEEeecCccc-cc--cccccccccccCccceeeeeecccccc-cchhhhccchh-----hccCCC Confidence 2333334455555433322221100 00 111 112 6667777664 21100001111 257888 Q ss_pred EEEEEeecCCCccccCCccccceeeccccCChhHeEeeecc------cCCCcccCCchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 216 LGYWLRKAFPGDPTDMEQWKWGYEPARFDWGRRRVIHIIEA------LLAGQTRGISEMVSALKQMKMTRNFQEITLQNA 289 (556) Q Consensus 216 vaY~i~~~hpgd~~~~~~~~~~rv~~~~~v~a~~viH~f~~------~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a 289 (556) ..|+|.-. .|+++.++.+.-. +..-+..|+|..-.++..+...+.-.+....-. T Consensus 259 ~~y~V~G~--------------------~IH~SRL~~~vg~pvpd~LKp~y~f~G~Sv~q~~~e~V~~~~rT~~~v~~Li 318 (698) T protein:vir:10 259 STWWMIGS--------------------EVHATRLHTIVSRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIV 318 (698) T ss_pred ceEEEecc--------------------eecceeEEEecCCCchhhhcchhccCCccHHHHHHHHHHHHHHHhhhHHHHH Confidence 88888411 2333333211111 122335699988888887776654433322211 Q ss_pred HHhcceeeeEeccCcccccccccccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHH Q lcl|NC_019524. 290 VVNATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGT 369 (556) Q Consensus 290 ~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~ 369 (556) + ...+.++ +++.. ..+..+........-... .......+..+.. 
...|++..++ .+-++.+ T Consensus 319 ~-~~~~~~l-~~dla-----~aL~~g~~~~l~~R~eli---------~~~Rsn~G~~llD-k~~Eefeq~s--t~lSGLd 379 (698) T protein:vir:10 319 K-QFSVSGI-LMDLA-----QALTPGANVDLSMRAELI---------NRYRDNRNILFLD-KATEEFFQFN--TPLSGLD 379 (698) T ss_pred H-HhhHHHH-HHHHH-----HhcCChhhHHHHHHHHHH---------HHhcCccceEEEe-cCCcceEEEe--cCcCCHH Confidence 1 1111121 11110 001000000000000000 0001223333332 3578888776 4678999 Q ss_pred HHHHHHHHHHHHhcCCCHHHhhchhhcccc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCc Q lcl|NC_019524. 370 DYEQSLLRNIAASLGMSYEQFSRDYTKTNY-SSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGK 448 (556) Q Consensus 370 ~F~~~~lr~iaaglGi~ye~l~~D~s~~nY-Ss~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~ 448 (556) +-+......||...+||.--|.|-- =..+ |+.-..+.-++..++..|+..+..+++.++......+ -|.++ | .. T Consensus 380 dVi~qf~q~VAgaa~IPltkLfGqS-PkGlNATGE~D~rnYYD~I~s~Qe~~L~p~L~rl~~ii~rS~--~G~id-p-~i 454 (698) T protein:vir:10 380 ALQAQAQEQMSAVSHIPLIKLLGIT-PTGLNASSEGEIRVWYDYVRAYQRNALQQLMNDVIVMIQLSL--FGAVD-P-SI 454 (698) T ss_pred HHHHHHHHHHHhhhcCchhhhhccC-CcccCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh--cCCCC-C-cc Confidence 9999999999999999988887732 3456 5777788889999999998776666666555432211 25542 2 11 Q ss_pred ccccccchhhHHHhhCeeeecCcccccch-hhhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHH--------------- Q lcl|NC_019524. 449 NWRMFYDPMMRDALCNAEWIGASRGQIDE-KKETEAAILRIKNGLSTYEAEISRLGGDFREVFKQ--------------- 512 (556) Q Consensus 449 ~~~~~~~~~~~~a~~~~~w~~p~~~~iDP-~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q--------------- 512 (556) . +. ++.-|++-..+..|- .|.++++...++.|+-++.++....-.|++=.... 
T Consensus 455 ~--~~---------fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~d~~d~p~~~~~~~ 523 (698) T protein:vir:10 455 K--WQ---------WNALRELDDLEVAEARYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPGAPADDD 523 (698) T ss_pred e--EE---------eCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhccCCCccccccCCcccCCCCCCCc Confidence 1 11 123566666666666 67778888888888888888776654444333311 Q ss_pred HHHHHHHH---HHcCCCCC-ccc---cccCCCCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 513 RAREEGLI---KSLKLDFT-GKM---VEGNSTQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 513 ~a~E~~~~---~~~Gl~~~-~~~---~~~~~~~~~~~~~~~~~~~~~e~~~ 556 (556) +..+.... .+-|=.-. ..+ +..++.+++..+..-+.++.+-..| T Consensus 524 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 574 (698) T protein:vir:10 524 IDGVLTYVQRMAEGGDTGAPTAPGGARAGATAPPAAANVNANANPREAGAQ 574 (698) T ss_pred chHHHhhhcCCcCCCCcccccccccccCCCCCCcccccccCCCCccccCcc Confidence 00110000 00000000 000 0000000000111111111111111 No 212 >protein:vir:78161 Length: 355 # NCBI annotation: hypothetical protein # Family: family:all:2372 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294798;genbank:gi:149882819;genbank:GeneID:5309189 Probab=97.15 E-value=0.00015 Score=41.51 Aligned_cols=313 Identities=12% Similarity=-0.006 Sum_probs=134.3 Q ss_pred EEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceee Q lcl|NC_019524. 161 LATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEP 240 (556) Q Consensus 161 f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~ 240 (556) +.-++|... +|.-.+-+|....+..+... .++..|+.+.+... ++.+. 
+...+| T Consensus 1 v~Eivw~~~-----~g~~~~~~l~~r~~~~~~~f-------------~~~~~~~l~~~~~~--~~~g~------~~~~lp 54 (355) T protein:vir:78 1 MFEQVYRIE-----NGRARLGKLAWRPPRTISRF-------------DVAPDGGLVAIEQW--GVFGK------ATVRIP 54 (355) T ss_pred CeEEEEEee-----CCeEEEeeeeecCccceeee-------------eeccCCceeEEEec--CCCCC------Ccceec Confidence 556677542 33344555666666555421 24444444433322 11110 111122 Q ss_pred ccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHh-ccee-eeEeccCccccccccccccccc Q lcl|NC_019524. 241 ARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVN-ATYA-ASVESELPSDVVFGQLGMGQGG 318 (556) Q Consensus 241 ~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~-A~~~-~fi~~~~~~~~~~~~~~~~~~~ 318 (556) . ..-|+|.+.. +.|..-|.+.|.++.-...-........+.-...- .-+. +..+...+..... T Consensus 55 ~-----~kfi~~~~~~-~~g~p~G~gLlr~~~w~~~fK~~~~~~w~~f~Er~g~g~p~~~~~~~~~~~~~d--------- 119 (355) T protein:vir:78 55 V-----DRLVVFVNER-EGANWLGQSLLRQAYKNWLLKDRFLRIQALVGERNGLGVPIYQGAPLPEAIARD--------- 119 (355) T ss_pred c-----CCEEEEEeCC-CCCCccchhhHHHHHHHHHHHHhhHHHHHHHHHHcCCCceEEEecCCCCcccch--------- Confidence 2 2356777664 58888899988887665444333333333333322 1111 2222111100000 Q ss_pred ccccccccccccccccccccceecCCc--eeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhh- Q lcl|NC_019524. 319 FKEIFNEYMTGLANYVAQTKNIAIDGA--KIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYT- 395 (556) Q Consensus 319 ~~~~~~~~~~~~~~~~~~~~~~~l~pG--~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s- 395 (556) ........... ......-...+..| ....++.|.+|+++++.....+|..+.+.+=++|+..+.-. 
.||.+-+ T Consensus 120 -~~~~~~~~~~~-~~~l~~~~~~i~~g~~a~~iip~g~~ie~~ea~g~~~~~~~~i~~~d~~Isk~iLGq--tlTs~~~~ 195 (355) T protein:vir:78 120 -TARAEQWLNDQ-KEEGLQLAKEFRAGEAAGGYIPHGANFTLTGVQGKLPEMDGPIRYHDEQIARAVLAH--FLTLGGDK 195 (355) T ss_pred -hhhHHHHHHHH-HHHHHHHHHHhhCCcceeEeecCCceEEEeecCCCcccHHHHHHHHHHHHHHHHhhh--hhccccCC Confidence 00000000000 00000001112223 45568999999999998888899999999999999887543 4555442 Q ss_pred -cccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccc Q lcl|NC_019524. 396 -KTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQ 474 (556) Q Consensus 396 -~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~ 474 (556) +.||+.+-.-........+.....+...|.+-+..+++. ++ .+...+. -...| ..-+- T Consensus 196 ~gGS~Alg~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~---lN----~~~~~~~------------P~~~~--~~~~~ 254 (355) T protein:vir:78 196 STGSYALGDTFASFFTGSLNAVMKHIADVTQQHVVEDLVD---QN----WGPEEPA------------PRLVP--AQLGK 254 (355) T ss_pred ccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---hc----CCCCCCC------------CEEEe--cCcCh Confidence 235555544444444455566666666666656665544 22 1111000 01111 11111 Q ss_pred cchhhhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccC--------CCCCCCCC--- Q lcl|NC_019524. 475 IDEKKETEAAILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGN--------STQSSNSS--- 543 (556) Q Consensus 475 iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~--------~~~~~~~~--- 543 (556) .| .+-+++....+..|+.-..+.. +.-..+++||+.+....... ........ T Consensus 255 ~~-~~~a~~~~~l~~~G~~~~~~~~----------------~~~~~e~~gip~p~~~~~~~~~~~~~~~~~~~~~~~~~~ 317 (355) T protein:vir:78 255 EQ-PVTAEAIRALVECGAFTADPEL----------------EKDLRARYGLPAPAERDDGADAAAAKAAGRRRAKRLPGQ 317 (355) T ss_pred hH-HHHHHHHHHHHhCCCccccHHH----------------HHHHHHHhCCCCCCCCCcccCCccccccccccccccCCc Confidence 22 2235566667777765443211 11122345554322110000 00000000 Q ss_pred ----CC-----CCCCCCCcCCC Q lcl|NC_019524. 
544 ----ES-----TSDNPNEETTQ 556 (556) Q Consensus 544 ----~~-----~~~~~~~e~~~ 556 (556) +. ..+++.+.+.+ T Consensus 318 ~~~~~~~a~~~~a~~~~~~~~~ 339 (355) T protein:vir:78 318 RQGAALPSRSPRADPPRRRGPL 339 (355) T ss_pred cccccccccCCCCCChhhhHHH Confidence 00 01111111111 No 213 >protein:vir:78589 Length: 695 # NCBI annotation: NUDIX hydrolase # Family: family:all:297 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294854;genbank:gi:149882917;genbank:GeneID:5291060 Probab=96.97 E-value=0.00023 Score=40.49 Aligned_cols=457 Identities=10% Similarity=0.019 Sum_probs=181.3 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhc--cccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGME--GAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGY 78 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~--aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~ 78 (556) -||--.+.+..- ++...-+...+ ++..++++ ++... .+ +|....+=+- .+.+..|.. +|- T Consensus 63 ~~~~~~~~~~~~---~~~~~~~~~~~-~~~~~~~~~~~~~~~--~l-~~~~~~~F~G---y~~la~laQ--------~~e 124 (695) T protein:vir:78 63 PSPSLRLARQFE---VDVSNYTPRER-RAASYALDFNGTSMD--AL-SFVTSSGFPG---FPTLVLLAQ--------LPE 124 (695) T ss_pred CCcccccceece---eccccCCcccc-chhhhhhcccccccc--cc-hhhhccCcch---HHHHHHHhh--------ccc Confidence 111111110000 00000000000 11112222 22111 11 3432211111 122222222 233 Q ss_pred HHHHHHHHHhhhccCCceeeee----cccccc--------CCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHH Q lcl|NC_019524. 79 AAGVVAVHRDSIVGSQYKLNAK----PNTIVL--------GAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGL 146 (556) Q Consensus 79 a~~~v~~~~~nvVG~Gi~~~~~----~~~~~l--------g~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~l 146 (556) .+..+.++..-.+-..+..... ++..-+ ..+.++. ++|++.|++..-. 
..+ T Consensus 125 yr~~~~~ia~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqi----~~L~~e~erL~V~-------------~~l 187 (695) T protein:vir:78 125 YRAMHEVLADECIRTWGEAIGGTKEKADTSGLAAGGNAASTSDGDQL----KQINDEIERLRIR-------------DAV 187 (695) T ss_pred hhhHHHHHHHHhhcccceeccccchhhhhhcccccccccccccHHHH----HHHHHHHHHHHHH-------------HHH Confidence 3344444444333322211100 000000 0122333 3444444433211 112 Q ss_pred HHHHhhhheecCceEEEEeeccCCCCcCCCccc---ceE--------EEEEchhhcCC-CCCCCCCceEEEEEEECCCCC Q lcl|NC_019524. 147 TRLAVSGFLMTGEVLATCEWLNPTGTTMQRRPF---GTA--------IQMISPYRMSN-PNNVMDTPNLRSGVQLDNNGA 214 (556) Q Consensus 147 q~l~~r~~~~dGE~f~~~~~~~~~~~~~~~~~~---~l~--------lq~ie~drl~~-~~~~~~g~~i~~GIE~d~~Gr 214 (556) ...+.-+.+-.|-+.+...-..... ++ .|+ +.+ |.+||+..|.- .++.. +.+. ..+|+ T Consensus 188 ~eaik~aRlfGGa~~~i~i~gdd~~-l~--~PL~~~~~~I~kGslKGl~ViDp~~vtP~~~n~~--dP~s-----pdfgk 257 (695) T protein:vir:78 188 RTTVIHDQAFGRAHPYFKIKGDDQI-MD--TPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNSI--NPVA-----DDFYK 257 (695) T ss_pred HHHHHhhccccceEEEEEeccCccc-cc--cccccccccccCcceeeeEeecccccccchhhhc--cchh-----hccCC Confidence 2333334444454432222111100 00 111 112 56677776631 11110 0011 25688 Q ss_pred eEEEEEeecCCCccccCCccccceeeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_019524. 215 ALGYWLRKAFPGDPTDMEQWKWGYEPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNAT 294 (556) Q Consensus 215 ~vaY~i~~~hpgd~~~~~~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~ 294 (556) |..|+|.-. .-+..|+-.+...|-++++ +..-+.-|+|..-.++..+.....-.+.-..-.+ ... 
T Consensus 258 P~~y~V~G~---------kIH~SRL~~f~g~plPd~L-----Kp~y~~~GiSv~q~~~e~V~~~~rT~~~v~~Li~-~~~ 322 (695) T protein:vir:78 258 PSTWWMIGT---------EVHATRLHTIVSRPVGDML-----KPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIVK-QFS 322 (695) T ss_pred CceEEEece---------EEeeeeEEEecCCCchhhh-----hcccccCcccHHHHHHHHHHHHHHHHhHHHHHHH-hhh Confidence 888887411 0111111111111222222 1122345888888877777765443332221111 111 Q ss_pred eeeeEeccCcccccccccccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHH Q lcl|NC_019524. 295 YAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQS 374 (556) Q Consensus 295 ~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~ 374 (556) +.++ +++.. ..+..+........-... .......+..+.. ...|++..++ .+-++.++-+.. T Consensus 323 v~~l-k~dla-----~~L~~g~~~~l~~R~eli---------~~~Rsn~G~~llD-k~~Eefeq~s--tslSGLddVi~q 384 (695) T protein:vir:78 323 VSGI-LMDLA-----QALMPGANVDLSMRAELI---------NRYRDNRNILFLD-KATEEFFQFN--TPLSGLDALQAQ 384 (695) T ss_pred hHHH-HHHHH-----HhhcChhHHHHHHHHHHH---------HHhcCccceEEEe-cCCcceEEEe--cccCCHHHHHHH Confidence 1121 21110 000000000000000000 0001223333333 3578888776 467899999999 Q ss_pred HHHHHHHhcCCCHHHhhchhhcccc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHc--CCccCCCCcccc Q lcl|NC_019524. 375 LLRNIAASLGMSYEQFSRDYTKTNY-SSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNA--GNVPLPPGKNWR 451 (556) Q Consensus 375 ~lr~iaaglGi~ye~l~~D~s~~nY-Ss~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~--G~l~~p~~~~~~ 451 (556) ....||+..|||.--|.|- |=..+ |+.-..+.-++..++..|+..+..+++.++... .++ |.++ |. .. 
T Consensus 385 f~q~VAgaa~IPltkLfGq-SPkGlNATGE~D~rnYYD~I~s~Qe~~L~p~L~rl~~ii----~rS~~G~id-pd-i~-- 455 (695) T protein:vir:78 385 AQEQMSAVSHIPLIKLLGI-TPTGLNASSEGEIRVWYDYVRAYQRNALQQLMNDVIVMI----QLSLFGAVD-PS-IK-- 455 (695) T ss_pred HHHHHHhhhcCchhhhhcc-CCccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHhcCCCC-Cc-ce-- Confidence 9999999999998777773 23456 567778888999999999877666665555443 333 5542 21 11 Q ss_pred cccchhhHHHhhCeeeecCcccccch-hhhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCcc Q lcl|NC_019524. 452 MFYDPMMRDALCNAEWIGASRGQIDE-KKETEAAILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGK 530 (556) Q Consensus 452 ~~~~~~~~~a~~~~~w~~p~~~~iDP-~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~ 530 (556) +. ++.-|.+-..+..|- .|.++++...++.|+-+..++......|++=....... .-++-|++.+.+ T Consensus 456 ~~---------fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~D---~~d~p~~~~~~~ 523 (695) T protein:vir:78 456 WQ---------WNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLD---ANDDPGVPADDD 523 (695) T ss_pred EE---------eCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhcCCCcccccccc---cccCCCcCccch Confidence 11 133566666676676 77888888888888888888877766655444321000 011111111111 Q ss_pred cccc------------------CCCCCCCCCCCCC----CCCCCcCCC Q lcl|NC_019524. 531 MVEG------------------NSTQSSNSSESTS----DNPNEETTQ 556 (556) Q Consensus 531 ~~~~------------------~~~~~~~~~~~~~----~~~~~e~~~ 556 (556) .... +.++...++.-.. 
-.+.+-.-| T Consensus 524 ~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~ag~~ 571 (695) T protein:vir:78 524 IDGVLTYVQRLAEGGDTGAPGGARAGATAPPTVANVNANVKPREAGAQ 571 (695) T ss_pred hhhhHhhhcCcccccccCCCCCCCCCCCCCCceeeeeccccccccCCC Confidence 0000 0000000000000 000000000 No 214 >protein:vir:3648 Length: 695 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705643;genbank:gi:23752328;genbank:GeneID:955749 Probab=96.78 E-value=0.00035 Score=39.56 Aligned_cols=454 Identities=11% Similarity=0.052 Sum_probs=181.3 Q ss_pred CCcchhhhHHHHHhh----Hhh---ccc-----------------chhhhhhhhcchhc--cccCCCcccccccCCCCCH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKA----VDV---VAE-----------------TATATPMAVGGGME--GAERTTREMFQWNPSIISP 54 (556) Q Consensus 1 ~sp~~~~~r~~a~~a----~~~---~~~-----------------~~~~~~~~~~~~y~--aa~~~~r~~~~w~~~~~s~ 54 (556) -.-.++++.-.||+. ++. +-+ ++..+ ++..++++ ++... .+ +|....+=+ T Consensus 36 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~--~l-~~~~~~~F~ 111 (695) T protein:vir:36 36 AAAAQPVPADFARRGALNALDAAPVVEPSPSLRLARQFEVDVSNYTPRER-RAASYALDFNGTSMD--AL-SFVTSSGFP 111 (695) T ss_pred hccccccchhhhhcccccccccccccCCCcccccceeceecccccCcccc-chhhhhhcccccccc--cc-hhhhccCcc Confidence 111111111222211 000 000 00000 01112222 11111 11 343221111 Q ss_pred HHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHhhhccCCceeeee----cccccc--------CCChhHHHHHHHHHHH Q lcl|NC_019524. 55 DQQIAQNQDMASARAQDMVQNDGYAAGVVAVHRDSIVGSQYKLNAK----PNTIVL--------GAPDGWGEEFQEVVEA 122 (556) Q Consensus 55 ~~~i~~~~~~lr~RaRdl~rNn~~a~~~v~~~~~nvVG~Gi~~~~~----~~~~~l--------g~~~~~~~~~~~~ie~ 122 (556) - .+.+..|.. +|-.+..+.++..-.+-..+..... ++..-+ ..++++.+. 
|++ T Consensus 112 G---y~~la~laQ--------~~eyr~~~~~ia~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqik~----L~~ 176 (695) T protein:vir:36 112 G---FPTLVLLAQ--------LPEYRAMHEVLADECIRTWGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQ----IND 176 (695) T ss_pred h---HHHHHHHhh--------ccchhhHHHHHHHHhhcccceecccchhhhhhccccccccccccCchHHHHH----HHH Confidence 1 122222222 2333334444443333322111100 000000 012233333 444 Q ss_pred HHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEEEeeccCCCCcCCCccc---ceE--------EEEEchhhc Q lcl|NC_019524. 123 RFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLATCEWLNPTGTTMQRRPF---GTA--------IQMISPYRM 191 (556) Q Consensus 123 ~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~~~~~~~~~---~l~--------lq~ie~drl 191 (556) .|++..- +..+...+.-+.+-.|-+.+...-..... ++ .|+ +.+ |.+||+..| T Consensus 177 e~erL~V-------------~~~l~eaik~aRlfGGa~~~i~i~gdd~~-l~--~PL~~~~~~I~kGslKGl~ViDp~~v 240 (695) T protein:vir:36 177 EIERLRI-------------RDAVRTTVIHDQAFGRAHPYFKIKGDDQI-MD--TPLVPRPYTVPKGSFQGLRVVEPYWV 240 (695) T ss_pred HHHHHHH-------------HHHHHHHHHhhccccceEEEEEeccCccc-cc--cccccccccccCcceeeeEeeccccc Confidence 4333221 11122333334444454432222111100 00 111 112 566777766 Q ss_pred CC-CCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceeeccccCChhHeEeeeccc------CCCcccC Q lcl|NC_019524. 192 SN-PNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPARFDWGRRRVIHIIEAL------LAGQTRG 264 (556) Q Consensus 192 ~~-~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~~~~v~a~~viH~f~~~------r~gQ~RG 264 (556) .- .++.. +.+. ..+|+|..|+|.-. .|+++.++.+.... ..-+.-| T Consensus 241 tP~~~n~~--dP~s-----pdfgkP~~y~V~G~--------------------kIH~SRL~~f~g~plPd~LKp~y~~~G 293 (695) T protein:vir:36 241 TPNNYNSI--NPVA-----DDFYKPSTWWMIGT--------------------EVHATRLHTIVSRPVGDMLKPTYSFAG 293 (695) T ss_pred ccchhhhc--cchh-----hccCCCceEEEece--------------------EEeeeeEEEecCCCchhhhhcccccCc Confidence 31 11110 0011 25688888887411 23333332221111 1223458 Q ss_pred CchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccccccccccccccccccceecCC Q lcl|NC_019524. 
265 ISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDG 344 (556) Q Consensus 265 vs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~p 344 (556) +|..-.++..+.....-.+.-..-.+ ...+.++ +++.. ..+..+........-... .......+ T Consensus 294 iSv~q~~~e~V~~~~rT~~~v~~Li~-~~~v~~l-k~dla-----~aL~~g~~~~l~~R~eli---------~~~Rsn~G 357 (695) T protein:vir:36 294 ISMTQLAMPYIDNWLRTRQSVSDIVK-QFSVSGI-LMDLA-----QALMPGANVDLSMRAELI---------NRYRDNRN 357 (695) T ss_pred ccHHHHHHHHHHHHHHHHhHHHHHHH-hhhHHHH-HHHHH-----HhhcChhHHHHHHHHHHH---------HHhcCccc Confidence 88888877777665443333221111 1111111 21110 000000000000000000 00012233 Q ss_pred ceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccc-hhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 345 AKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNY-SSARASMAETQKYMDSRKKLVADR 423 (556) Q Consensus 345 G~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nY-Ss~R~~~~e~~r~~~~~q~~lv~~ 423 (556) ..+.. ...|++..++ .+-++.++-+......||+..|||.--|.|- |=..+ |+.-..+.-++..++..|+..+.. T Consensus 358 ~~llD-k~~Eefeq~s--tslSGLddVi~qf~q~VAgaa~IPltkLfGq-SPkGlNATGE~D~rnYYD~I~s~Qe~~L~p 433 (695) T protein:vir:36 358 ILFLD-KATEEFFQFN--TPLSGLDALQAQAQEQMSAVSHIPLIKLLGI-TPTGLNASSEGEIRVWYDYVRAYQRNALQQ 433 (695) T ss_pred eEEEe-cCCcceEEEe--cccCCHHHHHHHHHHHHHhhhcCchhhhhcc-CcccccccchhhHHHHHHHHHHHHHHHHHH Confidence 33333 3578888776 4678999999999999999999998777773 23456 567778888999999999877666 Q ss_pred HHHHHHHHHHHHHHHc--CCccCCCCcccccccchhhHHHhhCeeeecCcccccch-hhhhHHHHHHHHcCCCCHHHHHH Q lcl|NC_019524. 424 FASAIYTLWLEEEVNA--GNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDE-KKETEAAILRIKNGLSTYEAEIS 500 (556) Q Consensus 424 ~~~pi~~~~l~~a~l~--G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP-~Ke~~A~~~~i~~G~~s~~~~~a 500 (556) +++.++... .++ |.++ | ... +. ++.-|.+-..+..|- .|.++++...++.|+-+..++.. 
T Consensus 434 ~L~rl~~ii----~rS~~G~id-p-di~--~~---------fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~ 496 (695) T protein:vir:36 434 LMNDVIVMI----QLSLFGAVD-P-SIK--WQ---------WNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPDQVAA 496 (695) T ss_pred HHHHHHHHH----HHHhcCCCC-C-cce--EE---------eCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHH Confidence 665555443 333 5542 2 111 11 133566777777776 77888888888888888888887 Q ss_pred HhCCCHHHHHHHHHHHHHHHHHcCCCCCcccc------------------ccCCCCCCCCCCCCC----CCCCCcCCC Q lcl|NC_019524. 501 RLGGDFREVFKQRAREEGLIKSLKLDFTGKMV------------------EGNSTQSSNSSESTS----DNPNEETTQ 556 (556) Q Consensus 501 e~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~------------------~~~~~~~~~~~~~~~----~~~~~e~~~ 556 (556) ....|++=....... .-++-|++.+.+.. ..+.++...++.... -++.+-.-| T Consensus 497 rL~~d~~s~Y~~~~D---~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~v~~~~~~~~~~~ag~~ 571 (695) T protein:vir:36 497 RLNTEPDGPYAGKLD---ANDDPGVPADDDIDGVLTYVQRLAEGGDTGAPGGARAGATAPPTVANVNANVNPREAGAQ 571 (695) T ss_pred HHhcCCCcccccccc---cccCCCcCccchhhhhHhhhcCcccccccCCCCcccccccCCCcccccccccCccccCCC Confidence 766655444321000 00111111111100 000000000000000 000000001 No 215 >protein:vir:106999 Length: 564 # NCBI annotation: portal vertex protein gp20 # Family: family:all:1036 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195138;genbank:gi:58532915;interpro:IPR010823;uniprot:Q5GQN4;genbank:GeneID:3260496 Probab=96.77 E-value=0.00035 Score=39.54 Aligned_cols=490 Identities=13% Similarity=0.114 Sum_probs=245.6 Q ss_pred CCcchhhhHHHHH--hhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHH-HHHHHHHHHHHHHHHHHhcCh Q lcl|NC_019524. 1 MKDVKKTTRTRAK--KAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQ-QIAQNQDMASARAQDMVQNDG 77 (556) Q Consensus 1 ~sp~~~~~r~~a~--~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~-~i~~~~~~lr~RaRdl~rNn~ 77 (556) ||-.=+..-..-. ...+.+++ --+++++-. ..+|......... 
....+-..|..+=|.| .++| T Consensus 1 m~~lfgf~i~~~~~~~~~S~vpp----------~~~~~~~~i---~~g~~g~~v~~~g~~~~~n~~eLI~~YR~m-a~~p 66 (564) T protein:vir:10 1 MSQLFGFLINEKEGQKGQSPVPP----------NDEASVSTV---AGGYFGTYVDTSGGQNSRNEYELIRRYRDM-SLHP 66 (564) T ss_pred CcchhcceeeeeccCCCCCcccC----------CcCCChhhh---hccccceeeecccccchhhHHHHHHHHHHH-hhcc Confidence 3332222111100 00011110 011221110 1122111111000 0124567889999999 7899 Q ss_pred HHHHHHHHHHhhhccC--CceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhhe Q lcl|NC_019524. 78 YAAGVVAVHRDSIVGS--QYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFL 155 (556) Q Consensus 78 ~a~~~v~~~~~nvVG~--Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~ 155 (556) -+-.||+-|++-+|=. +=.|. .++..-+ +..+.+.+.|...|+.-. ..++|..--.-.+|.|. T Consensus 67 EVd~Av~eIVneaIv~d~~~~pV-~vdL~~~----~~s~siK~kI~eEF~~Il----------~ll~F~~~~~e~fR~WY 131 (564) T protein:vir:10 67 EVDSAIDEIVNEFVVNDGDDKPV-EVDLQNL----EIGSGVKKKIRDEFNRIL----------RMMNFNVNAHEIIRNWY 131 (564) T ss_pred chhhHHHHhhcceeEecCCCceE-EEEeccc----CcchHHHHHHHHHHHHHH----------HHhccchhhhHHHhhhh Confidence 9999999998887642 22222 2222212 235566777888887644 34677777778999999 Q ss_pred ecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCC---CCC--CCCCceEEEEEE-ECCCCCeEEEEEeecC--CCc Q lcl|NC_019524. 156 MTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSN---PNN--VMDTPNLRSGVQ-LDNNGAALGYWLRKAF--PGD 227 (556) Q Consensus 156 ~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~---~~~--~~~g~~i~~GIE-~d~~Gr~vaY~i~~~h--pgd 227 (556) +||-+|..++..... ++.| . ..|+.|+|-.+.. ... ..++..|..|+. +..++...-|++++.. +|. 
T Consensus 132 VDgRi~fHkiid~~~--pk~G--I-~eLr~lDPr~i~~vr~i~~~~~~~~~~v~k~~~~~~~y~~~~Eyy~Ynp~~~~g~ 206 (564) T protein:vir:10 132 VDGRSHYHKVIDLDN--PKKG--I-LELRYIDSLKIRKVRQKLKDVDPNRKEIEKGTALQYDYGDFIEYYIYNPKGFAGN 206 (564) T ss_pred hcceEEEEEEeeCCC--hhhh--h-hhhhhhcccceeeeeeeccccccccceeeeeeeeeccccccccceeeccccccCc Confidence 999999776553221 1222 1 3678888886542 111 234556777765 4577888888888532 221 Q ss_pred cccCCccccceeeccccCChhHeEeeecccCC-CcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCccc Q lcl|NC_019524. 228 PTDMEQWKWGYEPARFDWGRRRVIHIIEALLA-GQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSD 306 (556) Q Consensus 228 ~~~~~~~~~~rv~~~~~v~a~~viH~f~~~r~-gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~ 306 (556) .....+....-.....+++.+.|.|.-...-. .-.-=+|.|+++++.+.+|.=.+||.++=...-|-=--++.=+.+.- T Consensus 207 ~~~~~~~~~~~~~~~ikI~~daI~y~hSGL~d~~~~~i~gyLhkAIKp~NQLkmlEDAlVIYRitRAPeRRvFYIDVGnL 286 (564) T protein:vir:10 207 IPMVTGSMDWSNQEGIKIASDAIAQSTSGLMDLNKKMTLSFLHKAIKSLNQLRMIEDSLVIYRLSRAPERRIFYIDVGNL 286 (564) T ss_pred ccccccccccccccceeechhhcceecccceeCCCCceeccchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCC Confidence 11111111111112357888899988876533 22223889999999999999999999887665553321111111111 Q ss_pred c---ccccc------------ccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHH Q lcl|NC_019524. 307 V---VFGQL------------GMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDY 371 (556) Q Consensus 307 ~---~~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F 371 (556) . +.++. -....+......-.+++..+..-..+. +| -|.+|+.+.....-+. 
.+- T Consensus 287 Pk~KAeqYlr~iM~k~KNklVYDa~TGevrddrk~msMlEDyWLPRRe----Gg------rgTEItTLpGgqnLge-m~D 355 (564) T protein:vir:10 287 PKVKAEQYLRDVMSRYRNKLVYDGQTGEIRDDKKHMSMLEDFWLPRRE----GG------RGTEITTLPGGQNLGE-LKD 355 (564) T ss_pred CchhHHHHHHHHHHhcCceEEEeccCceecccchhhhhHhhhcccccC----CC------cccceeeccccCCcch-HHH Confidence 1 00000 011222223333344444444443332 22 2444444443322222 234 Q ss_pred HHHHHHHHHHhcCCCHHHhhchhhcccchhHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCC Q lcl|NC_019524. 372 EQSLLRNIAASLGMSYEQFSRDYTKTNYSSARAS-----MAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPP 446 (556) Q Consensus 372 ~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R~~-----~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~ 446 (556) +....+.+=.+|+||-.-|..|= .+|+-.|.+ -+.|.+.+.++|..|..-|.+++-. ..+|.|.+.. . T Consensus 356 V~YF~kKLY~aLnVP~SRl~~e~--~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~----qLiLKgiit~-e 428 (564) T protein:vir:10 356 VEYFKKKLYNSLNLPPSRLTDDN--KAFNLGKSTEILRDELKFTKFIGRLRKRFAQLFHDILKT----QLILKGIITP-E 428 (564) T ss_pred HHHHHHHHHHHhCCCcccccCCC--ceeecccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHH----hhhhccCCCH-H Confidence 55566666778999988776552 233333332 4668888888898887777766544 3577787742 1 Q ss_pred CcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHc------CCC---CH----HHHHHHhCCCHHHHHHHH Q lcl|NC_019524. 447 GKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKN------GLS---TY----EAEISRLGGDFREVFKQR 513 (556) Q Consensus 447 ~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~------G~~---s~----~~~~ae~G~D~e~v~~q~ 513 (556) .|. ...-...|....-.+.-.+||++--..++.. .+- |. 
+.+++..-.+.++...|+ T Consensus 429 eW~----------~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~ILr~tDeei~~~~kqI 498 (564) T protein:vir:10 429 DWD----------DMEEHIQYDFLFDNHFNELKEQEMQLQRVNLATQMDPFVGKYFSTEYIRRKILMQTENEFKEIDKQM 498 (564) T ss_pred HHH----------HHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHHHHHHHHHHH Confidence 111 1112345555666677888887765554432 111 22 233444456666677777 Q ss_pred HHHHHHHHHcCCCCCcc------ccccCC---------------------CCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 514 AREEGLIKSLKLDFTGK------MVEGNS---------------------TQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 514 a~E~~~~~~~Gl~~~~~------~~~~~~---------------------~~~~~~~~~~~~~~~~e~~~ 556 (556) +.|... |+-.++. +....+ +..+.++++++.++.+++++ T Consensus 499 ~~E~k~----~~~~~P~e~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~ 564 (564) T protein:vir:10 499 KSDIES----GLAIDPIQVNMLDDMEKQNQAFAPELQAAQDDLAAEREIKKLNSAPKPPPSQQSKSQSNK 564 (564) T ss_pred HHHhhc----CCCCCchhhhcCCCccCCCCcCCcchhhhccccccccChhhhccCCCCCCCCCCcCcCCC Confidence 777552 3321110 000000 00000111111122222222 No 216 >protein:vir:98816 Length: 446 # NCBI annotation: hypothetical protein # Family: family:all:32558 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851097;genbank:gi:117530254;genbank:GeneID:4484480 Probab=96.74 E-value=0.00037 Score=39.37 Aligned_cols=416 Identities=11% Similarity=-0.004 Sum_probs=168.3 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHH---HHHHHHHHHhcCh Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDM---ASARAQDMVQNDG 77 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~---lr~RaRdl~rNn~ 77 (556) -.|...+.|.-+- . +...+ .+.+|. ++|.-++..... 
-..-.+++.+.++ T Consensus 7 ~~p~~~~~~~~~~---------------------~-~~~~~-~~~g~~----~~D~~lr~~gg~~~~~~~l~~~m~e~D~ 59 (446) T protein:vir:98 7 NAPTPAIRRRTIY---------------------A-MEHLG-LATSYL----SEDGGYKRAGKPTYQQLSAWDEAAQTEP 59 (446) T ss_pred CCCchhhhhhhhh---------------------c-cccch-hhcccC----CcchHhhhcCCChHHHHHHHHHHHhcch Confidence 1122222211110 0 00001 112221 223222222111 1234689999999 Q ss_pred HHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheec Q lcl|NC_019524. 78 YAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMT 157 (556) Q Consensus 78 ~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~d 157 (556) ++++.+++...-|.|.-+++.+. +.+ ..+.|+..++. . .|..+...+ ...+.- T Consensus 60 ~v~s~l~~Rk~av~~~~w~V~p~--------~~~----~a~~v~~~l~~--------~------~~~~~~~~~-ldai~~ 112 (446) T protein:vir:98 60 IIAQGLDSIALSVLNKVGPYQHG--------DKR----IKKFIDDQLRN--------R------AKTWISHCV-KSIMTY 112 (446) T ss_pred HHHHHHHHHHHHhhcCCceecCc--------cHH----HHHHHHHHHhh--------c------CchhHHHHH-HHHHhh Confidence 99999999999999976666542 222 23334333321 1 122222222 244557 Q ss_pred CceEEEEeeccCCCCcCCCcccceEE----EEEchhhcCCCCCCCCCceEEEEEEE------CCCCCeEEEEEeecCCCc Q lcl|NC_019524. 158 GEVLATCEWLNPTGTTMQRRPFGTAI----QMISPYRMSNPNNVMDTPNLRSGVQL------DNNGAALGYWLRKAFPGD 227 (556) Q Consensus 158 GE~f~~~~~~~~~~~~~~~~~~~l~l----q~ie~drl~~~~~~~~g~~i~~GIE~------d~~Gr~vaY~i~~~hpgd 227 (556) |=++.-++|....+.. +|-++ ..+.+-.+...++. ...+..|... +....++... +.++ T Consensus 113 G~s~~Eivw~~~~g~~-----~p~~~~d~~~~~~~~~~r~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~ 181 (446) T protein:vir:98 113 GFSLSEQIYAHGARDN-----MPATVLDDIVNYHPLQVMLIAND--NGRIVDGDTVTASQYKSGYWVPLPPY----RIGD 181 (446) T ss_pred CceeeeEEEeeccccc-----ccchhhccccccccccceeeecc--CCccccccccchhhcccccccCcccc----hhhh Confidence 8888888887544321 22211 11222222111110 1111112111 1111111111 1111 Q ss_pred cccCCccccceeeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccc Q lcl|NC_019524. 
228 PTDMEQWKWGYEPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDV 307 (556) Q Consensus 228 ~~~~~~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~ 307 (556) ....... ......+|...++|+-..-+.|..-|.+.|.++.-...=......-.+.-...-.+=..+-|-+.+... T Consensus 182 ~~~~~~~----~g~~~~iP~~kfi~~~~~~~~~~p~G~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vGkyp~ga~~ 257 (446) T protein:vir:98 182 PPKKVDV----VGSHVRLPSHKRLFINYNTKGNNPWGTSCLTSVLDYSIFKRAFRDMMLIALDRYGTPLIYVIVPPGNTG 257 (446) T ss_pred hhhhccc----CcccccccccceEEEEecCCCCCccccchHHHHHHHHHHHHhhHHHHHHHHhHcCCceeEEeecCCCCc Confidence 1100000 001234566565555555678889999877665543322222222222222222222222222211000 Q ss_pred cccccccccccccccccccc--ccccc-ccccccceecCCceee---ecCCCceeeeecCCCCCc-cHHHHHHHHHHHHH Q lcl|NC_019524. 308 VFGQLGMGQGGFKEIFNEYM--TGLAN-YVAQTKNIAIDGAKIP---HLYPGTKLKMQPAGTPGG-VGTDYEQSLLRNIA 380 (556) Q Consensus 308 ~~~~~~~~~~~~~~~~~~~~--~~~~~-~~~~~~~~~l~pG~i~---~L~pGe~i~~~~~~~p~~-~f~~F~~~~lr~ia 380 (556) ...+...... ..... -...-..+.-..|-|. .++.|.+|+++++...++ .|..+.+.+=++|+ T Consensus 258 ----------~~~~~~~~~~~~~~~~~~L~~av~~~~~da~~ii~~~~~P~g~eie~~ea~~~~~~~~~~~i~~~d~~Is 327 (446) T protein:vir:98 258 ----------VVEEAPDGTEITTTIAEQAEDALRRLSTDSGLVLTQLSKEQPVQVGALTTGNNFSDSFERAISLCDNNML 327 (446) T ss_pred ----------ccccchhHHHHHHHHHHHHHHHHHhccccceeeeecccCCCCceEEeeccccCChhhHHHHHHHHHHHHH Confidence 0000000000 00000 0000001111233332 358999999999876543 59999999999999 Q ss_pred HhcCCCHHHhhchh-hcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhH Q lcl|NC_019524. 381 ASLGMSYEQFSRDY-TKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMR 459 (556) Q Consensus 381 aglGi~ye~l~~D~-s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~ 459 (556) .++....-.|+.+- ++.||+.+..-..-+....+.....+...+-+-+..++++ +++ ++..+. 
T Consensus 328 kaiLg~~Ltl~~~~~~~GS~ala~vh~~V~~d~~~aDa~~i~~tln~~Li~~l~~---lNf----~~~~~~--------- 391 (446) T protein:vir:98 328 MGMGIPNLLVQNRETTFGTGRASEIQLELFDGKINSIFDTVIHAFTEQVIGNLIR---LNF----DPALYP--------- 391 (446) T ss_pred HHHhcccccccccccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---hCC----Cccccc--------- Confidence 98755432222221 1234443322211122223344444555554445544443 221 111000 Q ss_pred HHhhCeeeecCcc---cccchhhhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccc Q lcl|NC_019524. 460 DALCNAEWIGASR---GQIDEKKETEAAILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVE 533 (556) Q Consensus 460 ~a~~~~~w~~p~~---~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~ 533 (556) ....+-.|.. +-.|-.+-+++....+..|+.++. .+....+++||+.. ++.. T Consensus 392 ---~~~~~~~~~~~~~e~eDl~~~a~~~~~L~~~G~~~p~------------------~~~~ire~~giP~~-~~~~ 446 (446) T protein:vir:98 392 ---LASNTGYITRLPGRATDLAALVEAIKQMHDMGFLVDG------------------DKDHIRSITGLPDA-ISST 446 (446) T ss_pred ---cccccccceeccCChhhHHHHHHHHHHHHhCCccccc------------------cHHHHHHHhCcCCC-CCCC Confidence 0000001111 234555567777777888876421 12223456777531 1111 No 217 >protein:vir:101541 Length: 694 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958122;genbank:gi:41057668;genbank:GeneID:2716798 Probab=96.65 E-value=0.00044 Score=38.98 Aligned_cols=454 Identities=11% Similarity=0.049 Sum_probs=180.6 Q ss_pred CCcchhh-------------------hHHH--HHhhHhhcccchhhhhhhhcchhc--cccCCCcccccccCCCCCHHHH Q lcl|NC_019524. 1 MKDVKKT-------------------TRTR--AKKAVDVVAETATATPMAVGGGME--GAERTTREMFQWNPSIISPDQQ 57 (556) Q Consensus 1 ~sp~~~~-------------------~r~~--a~~a~~~~~~~~~~~~~~~~~~y~--aa~~~~r~~~~w~~~~~s~~~~ 57 (556) --||..- +..+ ..-.++...-++..+ ++..++.+ +++. 
..+ +|....+=+- T Consensus 38 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~--~~l-~~~~~~~F~G-- 111 (694) T protein:vir:10 38 AQPVPADFARRGALNALDAAPVAEPSPSLRLARQFEVDVSNYTPRER-RAASYALDFNGTSM--DAL-SFVTSSGFPG-- 111 (694) T ss_pred CCcccCCccccccchhhcccccCCCCcchhhhhhccccccCCCcccc-chhhhhhccCcccc--cch-hhhhccCcch-- Confidence 1111111 0000 000000000000000 00111222 1111 111 2322111111 Q ss_pred HHHHHHHHHHHHHHHHhcChHHHHHHHHHHhhhccCCceeeee----cccccc--------CCChhHHHHHHHHHHHHHH Q lcl|NC_019524. 58 IAQNQDMASARAQDMVQNDGYAAGVVAVHRDSIVGSQYKLNAK----PNTIVL--------GAPDGWGEEFQEVVEARFN 125 (556) Q Consensus 58 i~~~~~~lr~RaRdl~rNn~~a~~~v~~~~~nvVG~Gi~~~~~----~~~~~l--------g~~~~~~~~~~~~ie~~~~ 125 (556) .+.+..|.. +|-.+..+.++..-.+-..+..... ++..-+ ..+.++. ++|++.|+ T Consensus 112 -y~~la~laQ--------~~eyr~~~~~ia~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqi----~~L~~e~e 178 (694) T protein:vir:10 112 -FPTLVLLAQ--------LPEYRAMHEVLADECIRTWGEAIGGTKEKADTSGLAAGGNAASTSDGDQL----KQINDEIE 178 (694) T ss_pred -HHHHHHHhh--------ccchhhHHHHHHHHhhcccceeccccchhhhhhcccccccccccccHHHH----HHHHHHHH Confidence 122222222 2333344444444333322211100 000000 0122333 34444444 Q ss_pred HHhcccccceehhcccCHHHHHHHHhhhheecCceEEEEeeccCCCCcCCCccc---ceE--------EEEEchhhcCC- Q lcl|NC_019524. 126 MAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLATCEWLNPTGTTMQRRPF---GTA--------IQMISPYRMSN- 193 (556) Q Consensus 126 ~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~~~~~~~~~---~l~--------lq~ie~drl~~- 193 (556) +..-. ..+...+.-+.+-.|-+.+...-..... ++ .|+ +.+ |.+||+..|.- T Consensus 179 rl~V~-------------~~l~eaik~aRlfGGa~~~i~I~gdd~~-l~--~PL~~~~~~I~kGslKGl~ViDp~~vtP~ 242 (694) T protein:vir:10 179 RLRIR-------------DAVRTTVIHDQAFGRAHPYFKIKGDDQI-MD--TPLVPRPYTVPKGSFQGLRVVEPYWVTPN 242 (694) T ss_pred HHHHH-------------HHHHHHHHhhccccceEEEEEeecCccc-cc--cccccccccccCcceeeeEeecccccccc Confidence 33211 1122333334444454432222121100 00 111 112 56677776631 Q ss_pred CCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceeeccccCChhHeEeeeccc------CCCcccCCch Q lcl|NC_019524. 
194 PNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPARFDWGRRRVIHIIEAL------LAGQTRGISE 267 (556) Q Consensus 194 ~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~~~~v~a~~viH~f~~~------r~gQ~RGvs~ 267 (556) .++.. +.+. ..+|+|..|+|.-. .|+++.++.+.... ..-+.-|+|. T Consensus 243 ~~n~~--dP~s-----pdfgkP~~y~V~G~--------------------~IH~SRL~~f~g~plPd~LKp~y~~~G~Sv 295 (694) T protein:vir:10 243 NYNSI--NPVA-----DDFYKPSTWWMIGT--------------------EVHATRLHTIVSRPVGDMLKPTYSFAGISM 295 (694) T ss_pred hhhhc--cchh-----hccCCCceEEEece--------------------EEeeeeEEEecCCCchhhhhcccccCcccH Confidence 11110 0011 25688888887411 23333332221111 1223458888 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccccccccccccccccccceecCCcee Q lcl|NC_019524. 268 MVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKI 347 (556) Q Consensus 268 la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i 347 (556) .-.++..+.....-.+.-..-.+ ...+.++ +++.. ..+..+........-... .......+..+ T Consensus 296 ~q~~~e~V~~~~rT~~~v~~Li~-~~~v~~l-k~dla-----~~L~~g~~~~l~~R~eli---------~~~Rsn~G~~l 359 (694) T protein:vir:10 296 TQLAMPYIDNWLRTRQSVSDIVK-QFSVSGI-LMDLA-----QALMPGANVDLSMRAELI---------NRYRDNRNILF 359 (694) T ss_pred HHHHHHHHHHHHHHHhHHHHHHH-hhhhHHH-HHHHH-----HhhcChhHHHHHHHHHHH---------HHhcCccceEE Confidence 88777777665443332221111 1111111 21110 000000000000000000 00012233333 Q ss_pred eecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccc-hhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 348 PHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNY-SSARASMAETQKYMDSRKKLVADRFAS 426 (556) Q Consensus 348 ~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nY-Ss~R~~~~e~~r~~~~~q~~lv~~~~~ 426 (556) .. 
...|++..++ .+-++.++-+......||+..|||.--|.|- |=..+ |+.-..+.-++..++..|+..+..+++ T Consensus 360 lD-k~~Eefeq~s--tslSGLddVi~qf~q~VAgaa~IPltkLfGq-SPkGlNATGE~D~rnYYD~I~s~Qe~~L~p~L~ 435 (694) T protein:vir:10 360 LD-KATEEFFQFN--TPLSGLDALQAQAQEQMSAVSHIPLIKLLGI-TPTGLNASSEGEIRVWYDYVRAYQRNALQQLMN 435 (694) T ss_pred Ee-cCCcceEEEe--cccCCHHHHHHHHHHHHHhhhcCchhhhhcc-CcccccccchhhHHHHHHHHHHHHHHHHHHHHH Confidence 33 3578888776 4678999999999999999999998777773 23456 567778888999999999877666665 Q ss_pred HHHHHHHHHHHHc--CCccCCCCcccccccchhhHHHhhCeeeecCcccccch-hhhhHHHHHHHHcCCCCHHHHHHHhC Q lcl|NC_019524. 427 AIYTLWLEEEVNA--GNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDE-KKETEAAILRIKNGLSTYEAEISRLG 503 (556) Q Consensus 427 pi~~~~l~~a~l~--G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP-~Ke~~A~~~~i~~G~~s~~~~~ae~G 503 (556) .++... .++ |.++ | ... +. ++.-|.+-..+..|- .|.++++...++.|+-+..++..... T Consensus 436 rl~~ii----~rS~~G~id-p-~i~--~~---------fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~ 498 (694) T protein:vir:10 436 DVIVMI----QLSLFGAVD-P-SIK--WQ---------WNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPDQVAARLN 498 (694) T ss_pred HHHHHH----HHHhcCCCC-C-cce--EE---------eCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHh Confidence 555443 333 5542 2 111 11 133566777777776 77888888888888888888887766 Q ss_pred CCHHHHHHHHHHHHHHHHHcCCCCCcccc------------------ccCCCCCCCCCCCCC----CCCCCcCCC Q lcl|NC_019524. 504 GDFREVFKQRAREEGLIKSLKLDFTGKMV------------------EGNSTQSSNSSESTS----DNPNEETTQ 556 (556) Q Consensus 504 ~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~------------------~~~~~~~~~~~~~~~----~~~~~e~~~ 556 (556) .|++=....... .-++-|++.+.+.. ..+.++...++.... 
-++.+-.-| T Consensus 499 ~d~~s~Y~~~~D---~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~v~~~~~~~~~~~ag~~ 570 (694) T protein:vir:10 499 TEPDGPYAGKLD---ANDDPGVPADDDIDGVLTYVQRLAEGGDTGAPGGARAGATAPPTVANVNANVNPREAGAQ 570 (694) T ss_pred cCCCcccccccc---cccCCCcCccchhhhhHhhhcCcccccccCCCCcccccccCCCcccccccccCccccCCC Confidence 655444321000 00111111111100 000000000000000 000000000 No 218 >protein:vir:102426 Length: 631 # NCBI annotation: gp11 # Family: family:all:2798 # MgeID: mge:1618 # MgeName: Pipefish # Cross-refs: genbank:acc:YP_655288;genbank:gi:109521851;genbank:GeneID:4157741 Probab=96.44 E-value=0.00062 Score=38.16 Aligned_cols=458 Identities=16% Similarity=0.121 Sum_probs=192.8 Q ss_pred chhhhHHHHHhhHhhcccchhhhhhhhcchhccccC-C---CcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHH Q lcl|NC_019524. 4 VKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAER-T---TREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYA 79 (556) Q Consensus 4 ~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~-~---~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a 79 (556) |.+....|--+.+....+. ..++..+|+. - ++. ..|. ...+.++++..+...+..-.=.|---.+|. T Consensus 1 ~~a~~~lr~~rrpkg~~~a-------~~r~L~aAs~~~~dpg~~-~~~~-~g~~~~~~WQ~eAW~~~d~v~Elry~vgW~ 71 (631) T protein:vir:10 1 MAATQSLRLVRRPKGGRPA-------PSRALTAASQPLPDPSQV-FSKS-TGISRNSDWQTDAWEAVDLVGELRYYVGWR 71 (631) T ss_pred CCcccceeeeecCCCCCcc-------chhhhhhhhccccchhhh-hhhh-cCCcccchhhHHHHHHHHhhhhHHHHhhhh Confidence 3222222111111111110 0112222221 1 111 2222 333455554444444333222222223344 Q ss_pred HHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCc Q lcl|NC_019524. 80 AGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGE 159 (556) Q Consensus 80 ~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE 159 (556) .+++..+.-++ + -|.| -+...-.++.++.. . ..++.+.-.. 
=..|.+.-.+|.+-....+-+-|| T Consensus 72 ~~s~sr~rL~a-s-~idp--Dtg~ptg~iee~~~-~-~~~v~~~~~~---------i~gG~lgQ~~llkrl~~~ltV~GE 136 (631) T protein:vir:10 72 ASSCSRCRLVA-S-ELDE--NTGLPTGGISEDNT-E-GERVREIVSK---------IADGTLGQAALTKRVVECLTVPGE 136 (631) T ss_pred hhhhceeeeEe-e-eecc--CCCCCccccccCCc-h-hHHHHHHHHh---------cCCCcchHHHHHHHHHhheecccc Confidence 44433332211 1 0111 00011111221100 0 1222222221 267888889999999999999999 Q ss_pred eEEEEeeccCCCCcCCCcccce-----EEEEEchhhcCCCCCC-------CCCceEEEEEEECCCCCeEEEEEeecCCCc Q lcl|NC_019524. 160 VLATCEWLNPTGTTMQRRPFGT-----AIQMISPYRMSNPNNV-------MDTPNLRSGVQLDNNGAALGYWLRKAFPGD 227 (556) Q Consensus 160 ~f~~~~~~~~~~~~~~~~~~~l-----~lq~ie~drl~~~~~~-------~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd 227 (556) +.+.+...+..+. ...+-+. ..+++..+.+...... +.|. .+ || ..|..+.+.|++.||.. T Consensus 137 ~wiv~l~~p~~~~--~~~pd~~~r~~~~W~~vt~~ei~~~~~g~g~~v~lp~g~--~h--~~-~~~~D~l~RiW~P~prr 209 (631) T protein:vir:10 137 LWIVILTRPVKGA--PAQPDGSVRTRQEWYAVSKEEIKKSNKGSGTNIVLPTGE--EH--EF-VKGTDIIFRVWIPKPRK 209 (631) T ss_pred eEEEEEeccCcCC--CCCcccccccccceeeccHHHHhcccCcccceeecCCCC--cc--ce-ecCCceEEEeeCCCccc Confidence 9998765443211 1111111 4567777777532111 1111 00 12 13445777778777764 Q ss_pred cccCCccccceeeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHH---HHHHHHHHHhcceeeeEeccCc Q lcl|NC_019524. 228 PTDMEQWKWGYEPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQ---EITLQNAVVNATYAASVESELP 304 (556) Q Consensus 228 ~~~~~~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~---dael~~a~i~A~~~~fi~~~~~ 304 (556) ...- -|+.-++|..++.|-+.. .+....-.+.+- ..|+-.+.. 
T Consensus 210 ~~e~---------------------------------dSpvra~l~~l~Ei~~~t~~i~aaakSRl~gnG-vlflP~els 255 (631) T protein:vir:10 210 ASEP---------------------------------DSPVRAVLDSIREIVRTTKTIANASKSRLIGNG-VLFVPHEMS 255 (631) T ss_pred ccCC---------------------------------cchhHHHHHHHHHHHHhhhHHHHHHHHHHhhCc-eeEeccccc Confidence 2111 123333444444444333 333333333332 234433221 Q ss_pred ccccccccccccccccccccccc--------------cccccccccccceecCCceeeecCCCceeeee---cCC-CCCc Q lcl|NC_019524. 305 SDVVFGQLGMGQGGFKEIFNEYM--------------TGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQ---PAG-TPGG 366 (556) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~--------------~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~---~~~-~p~~ 366 (556) -...........+........+. ....+.....-. -| |+---|||-++-+ .-. .-+. T Consensus 256 ~P~~~~~~~~~~g~~v~~~~g~pa~~~l~~~l~q~a~tai~De~S~aA~---vP--ii~~~p~E~i~~i~hlkf~~ei~e 330 (631) T protein:vir:10 256 LPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAF---IP--VIAGVPGEQIKDVKHIRFDNEITE 330 (631) T ss_pred cCCCCCCCCCcCCccCCccccchhHHHHHHHHHHHHhhhhcCCCCccce---ee--eeEeechHHhcCeeEEeecCchhH Confidence 11000000000000000001110 000000000000 00 1222334333322 211 1222 Q ss_pred cHHHHHHHHHHHHHHhcCCCHHHhhchhhcccchhHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHcCCcc Q lcl|NC_019524. 
367 VGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSARASMAETQKYMDSRKKL---VADRFASAIYTLWLEEEVNAGNVP 443 (556) Q Consensus 367 ~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~---lv~~~~~pi~~~~l~~a~l~G~l~ 443 (556) .-..-....++.+|+|+.||-|.|+|==|++|-=|+=+.--| ..|.+ .+.-+|+-|+.-||.-++.+--|+ T Consensus 331 ~aiktR~daI~RlA~glDi~pE~LLGlGsd~NHWsAWqI~de------dVrlHI~P~l~lic~AlT~q~Lrp~Le~eGvD 404 (631) T protein:vir:10 331 VAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDE------DVQLHIAPVMEIFCQALTDQILRVTLAREGID 404 (631) T ss_pred HHHhhHHHHHHHHHhccCCchhhheeccCCccceEEEEeccc------ceeeecchHHHHHHHHHHhhHHHHHHHHhCCC Confidence 233445778899999999999999984367776554222222 23332 356789999999999888874442 Q ss_pred CCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHH--------HHHH Q lcl|NC_019524. 444 LPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGGDFREVFK--------QRAR 515 (556) Q Consensus 444 ~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~--------q~a~ 515 (556) | ..| +-|.-+.-..+||-|--+| +.+.+.|.-|-+......|.+-++..+ .++. T Consensus 405 -p--------------~kY--vvW~DaS~Lt~dPdr~deA-~qa~drGAIt~eAlrk~lGf~eDd~yd~~t~e~~~~~a~ 466 (631) T protein:vir:10 405 -P--------------SKY--VVWYDPSQLTIDPDKSDEA-KFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQ 466 (631) T ss_pred -H--------------HHh--EeeecCcccccCCCCcHHH-HHHHHcCCcCHHHHHHHhcCchhcccCcCchHHHHHHHH Confidence 2 233 7899999999999987665 567899999999988888877665532 2222 Q ss_pred HHH--------HHHHc------CCCCCccccccCCCCCCCCCCCC-CCCCCCcCCC Q lcl|NC_019524. 516 EEG--------LIKSL------KLDFTGKMVEGNSTQSSNSSEST-SDNPNEETTQ 556 (556) Q Consensus 516 E~~--------~~~~~------Gl~~~~~~~~~~~~~~~~~~~~~-~~~~~~e~~~ 556 (556) ..- .+..+ .+.++.. 
....+.++.+.++++ .++..+++++ T Consensus 467 ~av~~dpaLip~lApl~~~~~~~v~~P~~-~a~~~~g~ed~~~~~~~~~g~~epdt 521 (631) T protein:vir:10 467 DAVSKDPTLIPMLAPLIAGVLKQIEFPQQ-QAIDSGGNEDTSDADDLDDGEQEPDT 521 (631) T ss_pred HHhhcccCcchhhHHHHHHHhhhccCCCC-CCCCCCCCCccccccccccCCCCCCC Confidence 210 00110 0111111 000111111111110 1111111111 No 219 >protein:vir:78083 Length: 537 # NCBI annotation: gp3 # Family: family:all:125 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468787;genbank:gi:157325368;genbank:GeneID:5601845 Probab=95.96 E-value=0.0012 Score=36.61 Aligned_cols=434 Identities=8% Similarity=-0.031 Sum_probs=179.9 Q ss_pred hhHhhcccchhhhh------------------hhhcchhccccCC--CcccccccCCCCCHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 14 KAVDVVAETATATP------------------MAVGGGMEGAERT--TREMFQWNPSIISPDQQIAQNQDMASARAQDMV 73 (556) Q Consensus 14 ~a~~~~~~~~~~~~------------------~~~~~~y~aa~~~--~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~ 73 (556) ..+.+......... ......|.-+... .+... |.....- ......+ .+.- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~YY~g~h~Il~r~~~-~~~~~~~--------~~~d~~~-~nnk 70 (537) T protein:vir:78 1 MTSPLLNKPIDQLGGLLNTEITTYMASNHIKWAHIGENYYNQENDIEKSRIF-YMNDKGQ--------LREDNYA-SNVK 70 (537) T ss_pred CCcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhcccc-ccccccc--------ccccccc-cccc Confidence 11111111100000 0001112211100 00000 0000000 0000000 0001 Q ss_pred hcChHHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhh Q lcl|NC_019524. 74 QNDGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSG 153 (556) Q Consensus 74 rNn~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~ 153 (556) -.+.+++-+|+..+.+++|..++..+. ++.+ +.+...|+.|... +|........+. 
T Consensus 71 i~~nf~k~Ivd~~~~yl~G~Pv~~~~~---------d~~~----~e~~~~l~~~~~~-----------~~~~~~~el~~~ 126 (537) T protein:vir:78 71 ISHGFFTELVDQLAQYLLSNGVEVKVK---------DEDN----TQLDEILQEYFDE-----------DFQATIDTLVTN 126 (537) T ss_pred cccchHHHHHHHHhhhhcccCceeecC---------cchh----HHHHHHHHHHhhc-----------cHHHHHHHHHHH Confidence 234688899999999999988877542 2222 3344445555321 344444445566 Q ss_pred heecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCC--ceEEEEE--EE---CCCCCeE-EEEEeecCC Q lcl|NC_019524. 154 FLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDT--PNLRSGV--QL---DNNGAAL-GYWLRKAFP 225 (556) Q Consensus 154 ~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g--~~i~~GI--E~---d~~Gr~v-aY~i~~~hp 225 (556) +..-|.++..+. ....+ .+++..|+|..+=.-++.... ..++.-. +. +..+..+ -+.++.... T Consensus 127 ~s~~G~ay~~~y-~de~~--------~~~~~~i~p~~~~pv~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~ 197 (537) T protein:vir:78 127 ASKKGFEGIFAR-TTSEG--------KLKFQTVDGLTLIPVFDDYGVLKMIIRWYSEIRYSTKQQSTETIWHADVWNEEA 197 (537) T ss_pred HhhcCeeEEEee-ecCCC--------ceEEEEEccceeEEEEcCCCCceeEEEEEeeeeccccccCcceEEEEEEEcCCc Confidence 788999987754 33222 368899998875211111110 0111110 01 1111111 111111110 Q ss_pred CccccCC------------------------------------------ccccceeeccccCChhHeEeeecccCCCccc Q lcl|NC_019524. 226 GDPTDME------------------------------------------QWKWGYEPARFDWGRRRVIHIIEALLAGQTR 263 (556) Q Consensus 226 gd~~~~~------------------------------------------~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~R 263 (556) --.+... ...|.+ || |+++... .- T Consensus 198 i~~y~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~g~------iP---vv~f~nn-----~~ 263 (537) T protein:vir:78 198 VCYYIQDDEGVSTTYKLDEAYNPNPAPHVLAIEESTDADFEDTDGYQVLGRSYSK------FP---FQLLYNN-----KD 263 (537) T ss_pred EEEEEecCCcccccccccccccccccceeeeccccccccccccccccccccCCcc------ee---EEEeccC-----cc Confidence 0000000 001111 11 3333332 24 Q ss_pred CCchhhHHHHHHHHHHHHHHHHHHHHHHhccee---eeEeccCcccccccccccccccccccccccccccccccccccce Q lcl|NC_019524. 
264 GISEMVSALKQMKMTRNFQEITLQNAVVNATYA---ASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNI 340 (556) Q Consensus 264 Gvs~la~~l~~l~~l~~~~dael~~a~i~A~~~---~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 340 (556) |+|.|..++..+..++ .+.-..+....-|+ .+++--..+ .. .+.. . T Consensus 264 ~~sd~e~v~~LiDayd---~~~S~~an~~~~~~~~ilvi~g~~~~---------~~-------~~~~------------~ 312 (537) T protein:vir:78 264 GMSDVKRVKSIIDDYD---VMNCFLSNNLQDFSEAIYVVKGFSGD---------ST-------DKLR------------Q 312 (537) T ss_pred CCCchhhhHHHHHHHH---HHHHhhhhHHHHhcCceeeeecCCCc---------cc-------hhHH------------H Confidence 7899988776664433 33333332222222 122210000 00 0000 0 Q ss_pred ec-CCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCC--HHHhhchhhcccchhHHHHHHHHHHHHHHHH Q lcl|NC_019524. 341 AI-DGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMS--YEQFSRDYTKTNYSSARASMAETQKYMDSRK 417 (556) Q Consensus 341 ~l-~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~--ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q 417 (556) .+ .-+.|..-..|-++++++.+-+...+..+.+.+.+.|-.-..+| -+.. ++++|-.|.|.-+..........+ T Consensus 313 ~l~~~~~i~v~~d~~~v~~l~~~~~~~~~e~~ld~L~~~I~~~s~~~~~~~~~---~gn~SGvAlk~~~~~l~~ka~~ke 389 (537) T protein:vir:78 313 NIKAKKMIGVNGDNAGMEIQTVSIPYEARKAKMDIDVENIYRSGMGFNSTAVG---DGNVTNVVIKSRYTLLAMKARKME 389 (537) T ss_pred HHhhcCceeecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHhcCCCCCcccc---ccCCcHHHHHHHHhhHHHHHHHHH Confidence 11 12334333456789999999999988888888888665443222 1222 334444566665555544444444 Q ss_pred HHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHH Q lcl|NC_019524. 418 KLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEA 497 (556) Q Consensus 418 ~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~ 497 (556) ..| ...++..++..+...-..|....- ...+.+.|...-. .|-...++...+.+..|+.|.+. 
T Consensus 390 ~~f-~~~l~~~~~~i~~~~~~~~~~~~d--------------~~~i~i~f~~~~P--~n~~e~a~~~~~l~~~giiS~eT 452 (537) T protein:vir:78 390 TSL-RKVLRWCADMVVSDIALRGLGEYD--------------SNDICFEIEPHVL--ANELDIATTRKTEAETEALKIGN 452 (537) T ss_pred HHH-HHHHHHHHHHHHHHHhhcCCcccc--------------cceeeEEeccCCC--CCHHHHHHHHHHHHhcCcchHHH Confidence 444 334555666555543333321110 0123556654432 57777778777788999999999 Q ss_pred HHHHhC--CCHHHHHHHHHHHHHH-H-------HH-----cCCCCCccccccCCCCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 498 EISRLG--GDFREVFKQRAREEGL-I-------KS-----LKLDFTGKMVEGNSTQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 498 ~~ae~G--~D~e~v~~q~a~E~~~-~-------~~-----~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 556 (556) +++..+ .|++. .+.+.+|.+. . ++ .....+..+..........+++.+.+++..|++- T Consensus 453 ~l~~~p~vdd~e~-ek~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~ 525 (537) T protein:vir:78 453 IMTVAPRIGDDET-LKLIAEELDLDYNELKDALAEQDAQSLDVSPDVQAMLDGLPVNANQPPVDPNQPVADPNV 525 (537) T ss_pred HHHhCCCCCCHHH-HHHHHHHHHhhhhhhhhhhhhhcccccCcCcchhhhcCCCCCCCCCCCCCccCCCCCCCC Confidence 999987 46543 3333333211 0 01 1111111111111111111111222222222222 No 220 >protein:vir:5839 Length: 533 # NCBI annotation: similar to portal vertex protein of head # Family: family:all:1036 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835625;genbank:gi:30044028 Probab=95.61 E-value=0.0018 Score=35.70 Aligned_cols=443 Identities=12% Similarity=0.034 Sum_probs=198.3 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAA 80 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~ 80 (556) ||++-+.. +-...+.+..- ........++.++ +.++.. . ...+...|..+=|.|+.++|-+. 
T Consensus 20 ~~~~~~~~---~p~~~dG~s~i--~~~~~~~~~~~~~------~~~~~g----g---~~~n~~eLI~~YR~ma~~~pEVd 81 (533) T protein:vir:58 20 LSPMYGMG---APHGAGGSSMI--PINMYHPFATAGY------ASRFYG----G---IEFNRFFLYDMYDRMDYTDPLIS 81 (533) T ss_pred hchhhccc---CccCCCCCccc--cCCCCcchhhhhh------hhhhhc----c---ccccHHHHHHHHHHhhccCcchh Confidence 44444431 00000110000 0000000111110 011110 0 12356779999999999999999 Q ss_pred HHHHHHHhhhccC--CceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecC Q lcl|NC_019524. 81 GVVAVHRDSIVGS--QYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTG 158 (556) Q Consensus 81 ~~v~~~~~nvVG~--Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dG 158 (556) .||+.|++-+|=. +=.|. ..+...+ +.++.+.+.|.. .++|..--.-.+|.|.+|| T Consensus 82 ~AideIvneaiv~d~~~~pV-~v~l~~~----e~s~~iK~kI~~-----------------lldf~~~~~~~fR~WYVDG 139 (533) T protein:vir:58 82 TVLDIIADECTIPNENGNIV-DVVTKDI----ELAKAILSYLDY-----------------VINIEKNAYPIIRNMIKYG 139 (533) T ss_pred hHHHhhhceeeEecCCCcee-Eeecccc----cccHHHHHHHHH-----------------HhcchhhhhHHHHhhhhcc Confidence 9999999888752 21121 2221111 122333333322 3456666678899999999 Q ss_pred ceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccce Q lcl|NC_019524. 159 EVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGY 238 (556) Q Consensus 159 E~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~r 238 (556) -+|..+...++.. |- ..|+.|+|-.+..= + -|+.+ .-|++++ |+......+. T Consensus 140 riy~Hkiik~~k~----GI---~elr~lDPr~i~~v---------r-~~~t~-----~eyyvy~--~~~~~~~s~~---- 191 (533) T protein:vir:58 140 DMFLHILEKGSDG----TI---EKFQVVSPYIFSKR---------Y-NPETD-----TWYYVIT--DVYRNVVSGY---- 191 (533) T ss_pred eeEEEeccCCccc----ch---hhheecCCeeeEEE---------E-eeccc-----eEEEeec--ccccccccCc---- Confidence 9997775432221 11 27888998888531 1 12222 3455553 2222111111 Q ss_pred eeccccCChhHeEeeecc-cCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcc--eeeeEeccC-ccccccccc-- Q lcl|NC_019524. 
239 EPARFDWGRRRVIHIIEA-LLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNAT--YAASVESEL-PSDVVFGQL-- 312 (556) Q Consensus 239 v~~~~~v~a~~viH~f~~-~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~--~~~fi~~~~-~~~~~~~~~-- 312 (556) ...++|.+.|+|+-.. ....-.-++|.|+.+++.+.+|.=.+||.++=...-|- =..+|.... |...+.+.. T Consensus 192 --~~~kI~~daI~y~~SGl~d~~~~~iisyLhkAiKp~NQLkmiEDAlVIYRisRAPeRRvFYIDVGNlpk~KAeqYl~~ 269 (533) T protein:vir:58 192 --FNEDIPEEDVIHFSHKIDTNFFPYGRSYLESARAIWNQLRLMEDALMLYRVVRSVDRRVFYVDVGNVPPDKINEYLTN 269 (533) T ss_pred --cccccchhheeeeeeccccCCCCceehhhhHHHHHHHHHHHHHHHHHHHhhcCChhheEEEEeecCCCccCHHHHHHH Confidence 1247889999999988 45556677899999999999999999998877666661 122333211 111111110 Q ss_pred ----------ccccccccccccccc---cccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHH Q lcl|NC_019524. 313 ----------GMGQGGFKEIFNEYM---TGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNI 379 (556) Q Consensus 313 ----------~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~i 379 (556) -....+...+..-.+ ++..+..-..+ ++ .-|.+|+.+... . -+=.+-+....+.+ T Consensus 270 im~k~kNklvYDa~TGev~ddrk~m~~~sMlEDyWLpRR----eG------grgTEI~TLpGg-~-lgemeDV~YF~kkL 337 (533) T protein:vir:58 270 IAMQYKRDYWVRNNQNQFLGIDNYFSIESILKDYFIPRR----GD------RRAVEIDILQGS-K-VDLAEDVEYMLNRL 337 (533) T ss_pred HHHhcccceEEeccCCeEeeccchhhhhhhHhhhccccc----CC------CccceeeecCCC-C-CCcHHHHHHHHHHH Confidence 011111111111111 22222222222 11 234555555432 2 23345667777888 Q ss_pred HHhcCCCHHHhhchhhcccc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhh Q lcl|NC_019524. 380 AASLGMSYEQFSRDYTKTNY-SSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMM 458 (556) Q Consensus 380 aaglGi~ye~l~~D~s~~nY-Ss~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~ 458 (556) =.+|+||-.-|..|- +.+= +.+----+.|.+.+.++|..|..-|.+ ..+|+|.+.. 
T Consensus 338 y~ALnVP~sRl~~e~-~fgr~~eItRDEiKF~KFI~rLR~rF~~ll~~--------qLilk~iit~-------------- 394 (533) T protein:vir:58 338 ISALKVPKAFIGYEG-DVNAKNTLATQDIKFNNTIKRIQGFFVEELER--------MVRMNKEFAD-------------- 394 (533) T ss_pred HHHhCCCeeecCCCC-CCccchhhhHHHHHHHHHHHHHHHHHHHHHhc--------ccccccCcch-------------- Confidence 889999977665442 1111 111112234777777777766543322 3456666542 Q ss_pred HHHhhCeeeecCcccccchhhhhHHHHHHHHc-----CCCCHHHHHHHh-CCCHHHHHHHHHHHHHHHHH--cCCCCCcc Q lcl|NC_019524. 459 RDALCNAEWIGASRGQIDEKKETEAAILRIKN-----GLSTYEAEISRL-GGDFREVFKQRAREEGLIKS--LKLDFTGK 530 (556) Q Consensus 459 ~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~-----G~~s~~~~~ae~-G~D~e~v~~q~a~E~~~~~~--~Gl~~~~~ 530 (556) ..| .|....-.+.-.+||++--..+|.. +.-...-+.+.- ... ++...|.+ ..-++ .|+-...+ T Consensus 395 -eew---~~~f~~Dn~f~ElKe~Eil~~Ri~~l~~~dpyvgk~yi~k~ILr~t-dei~~q~e---~ie~E~~~~~~~~~~ 466 (533) T protein:vir:58 395 -QDF---RLVMNRSNSIVEGERFAVIEQRIGIAERLKGWVREDWIYSNILQIP-YDLKPQEE---VAEAAGGGGLFDTGG 466 (533) T ss_pred -hhe---eeeeeccchHHHHHHHHHHHHHHHHHHHhcchhhHHHHHHHHhcCC-hhhhHHHH---HHHHhhcCCCCCCCC Confidence 122 3444445556777776654444332 111111111111 111 13333221 11111 22211110 Q ss_pred ccccC-C---------CCCCCCCCC---CCCCCC--------------CcCCC Q lcl|NC_019524. 531 MVEGN-S---------TQSSNSSES---TSDNPN--------------EETTQ 556 (556) Q Consensus 531 ~~~~~-~---------~~~~~~~~~---~~~~~~--------------~e~~~ 556 (556) ..... + +...+..++ .+..+. 
++.++ T Consensus 467 ~~~e~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~ 519 (533) T protein:vir:58 467 FGEETTPADFLGERGSPIESPRGRTEFDFGTEGGEELGGELNLGGAFEEFEEE 519 (533) T ss_pred cccccCCcccCccccCcccCCCChhhHhcccCCcccccccccccccchhhhhh Confidence 00000 0 000000000 000000 00000 No 221 >protein:vir:108049 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595296;genbank:gi:161622602;genbank:GeneID:5783768 Probab=95.56 E-value=0.0019 Score=35.57 Aligned_cols=466 Identities=11% Similarity=0.067 Sum_probs=226.7 Q ss_pred CCcchhhhHHHH------Hh----hHhhcccchhhhhhhhcchhccccCC--CcccccccCCC----CCHHHHHHHHHHH Q lcl|NC_019524. 1 MKDVKKTTRTRA------KK----AVDVVAETATATPMAVGGGMEGAERT--TREMFQWNPSI----ISPDQQIAQNQDM 64 (556) Q Consensus 1 ~sp~~~~~r~~a------~~----a~~~~~~~~~~~~~~~~~~y~aa~~~--~r~~~~w~~~~----~s~~~~i~~~~~~ 64 (556) |..-..+-.... +. .....+.. ...--..+||.-- +-....+..-. .+.+ ....+-.. T Consensus 1 ~~~~~~~~~lf~f~~~~de~~~~~~~~~~~~S-----~~~p~~~dGa~~I~~~~~~~~~~~~~q~~y~~~e-~~~~~~~e 74 (524) T protein:vir:10 1 MANFNTILSFLKPWANEDEKEYKQQINNNLES-----VTAPKLDDGAREIETQEQNIPYNALMQQMFGSNE-PEVKNTRE 74 (524) T ss_pred CCchhhHHHHhhhhhcchhhhhhhhhccCCCc-----cccCCCCCCceeeccCcccccchhhhhhhhhccc-chhhhHHH Confidence 221111111110 00 00000000 0001112332110 00011111110 0111 12335677 Q ss_pred HHHHHHHHHhcChHHHHHHHHHHhhhccC--CceeeeeccccccCCCh-hHHHHHHHHHHHHHHHHhcccccceehhccc Q lcl|NC_019524. 65 ASARAQDMVQNDGYAAGVVAVHRDSIVGS--QYKLNAKPNTIVLGAPD-GWGEEFQEVVEARFNMAAESPENWFDARRMC 141 (556) Q Consensus 65 lr~RaRdl~rNn~~a~~~v~~~~~nvVG~--Gi~~~~~~~~~~lg~~~-~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~ 141 (556) |..+=|.| .++|-+-.||+-|++-+|=. +=.|. .+.++. +..+.+.+.|...|+.-. 
..+ T Consensus 75 LI~~YR~m-a~~pEvd~Av~eIVneaiv~d~~~~pV------~l~Ld~~~~s~siK~kI~eeF~~Il----------~ll 137 (524) T protein:vir:10 75 LIDTYRNL-MNNYEVDNAVQEIVSDAIVYEDDKEVV------ALNLDGTDFSQSIKDKILAEFSEVL----------NLL 137 (524) T ss_pred HHHHHHHH-hhccchhhHHHHhhcceeEecCCCceE------EEEecccCcchHHHHHHHHHHHHHH----------HHh Confidence 89999999 78999999999998887742 21221 122222 235667777888887644 346 Q ss_pred CHHHHHHHHhhhheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCC----CCCCCCceEEEEEEECCCCCeEE Q lcl|NC_019524. 142 TLTGLTRLAVSGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNP----NNVMDTPNLRSGVQLDNNGAALG 217 (556) Q Consensus 142 ~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~----~~~~~g~~i~~GIE~d~~Gr~va 217 (556) +|..--.-.+|.|.+||-+|..++..+.. ++.|- ..|..|+|-.+..- ....++..++.|++ - T Consensus 138 ~F~~~~~~~fR~WYVDgRi~fHkiid~~~--pk~GI---~Elr~lDPr~i~~vr~i~~~~~~~~~vi~~~~--------e 204 (524) T protein:vir:10 138 NFQRKGTDHFQRWYVDSRIFFHKIINPKK--MKDGV---QELRRLDPRQVQYIREIVTRMEDGVKIVDGYR--------E 204 (524) T ss_pred ccchhhhHHHhhheeeceEEEEEEeeCCC--ccccc---eeeeeeCCccceeeeeecccCcccchhhcchh--------h Confidence 67777778899999999999877654221 11221 36777888877421 11223333444433 4 Q ss_pred EEEeecCCCccc-cCCccccceeeccccCChhHeEeeecccCCCcc-cCCchhhHHHHHHHHHHHHHHHHHHHHHHhcce Q lcl|NC_019524. 218 YWLRKAFPGDPT-DMEQWKWGYEPARFDWGRRRVIHIIEALLAGQT-RGISEMVSALKQMKMTRNFQEITLQNAVVNATY 295 (556) Q Consensus 218 Y~i~~~hpgd~~-~~~~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~-RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~ 295 (556) |+++. ||... ...+. ........++|.+.|+|+-...-+.-. 
-=+|.|+++++.+.+|.=.+||.++=...-|-= T Consensus 205 ~f~Y~--~~~~~~~~~~~-~~~~~~~ikI~~dAIvy~~SGL~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPe 281 (524) T protein:vir:10 205 FFVYD--TGHESYCADGR-IYSAGTKVKIPRAAVVYAHSGLLDCCGKNIIGYLQRAIKPANQLKLMEDAMVIYRITRAPD 281 (524) T ss_pred heeec--CCCcccccCcc-eecCCcceecchhheeeeccCcccCCCCceeccchHhhHHHHhhHHHHhhHHHHhhhcccc Confidence 55653 33221 11111 111112357889999998887644332 337999999999999999999998876655533 Q ss_pred eeeEeccCcccc---ccccc------------ccccccccccccccccccccccccccceecCCceeeecCCCceeeeec Q lcl|NC_019524. 296 AASVESELPSDV---VFGQL------------GMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQP 360 (556) Q Consensus 296 ~~fi~~~~~~~~---~~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~ 360 (556) --++.=+.+.-. +.+.. -....+...+....+++..+..-..+. +| -|.+|+.+. T Consensus 282 RRvFYIDVGnlPk~KAeqYl~~im~k~kNKlvYDa~TGev~ddrk~msMlEDyWLpRRe----Gg------rgTEItTLp 351 (524) T protein:vir:10 282 RRVFYIDTGNMPSRKAAAQMQHIMNTMKNRVVYDASTGKIKNQQHNMSMTEDYWLQRRD----GK------AVTEVDTMP 351 (524) T ss_pred ceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeccCCeeccchhhhhhHhhhcccccC----CC------Cccceeecc Confidence 211111111111 00000 011222233333444444444444432 22 244444444 Q ss_pred CCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccchhHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 361 AGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSARA-----SMAETQKYMDSRKKLVADRFASAIYTLWLEE 435 (556) Q Consensus 361 ~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R~-----~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~ 435 (556) ....-+. .+-+....+.+=.+|+||-.-|-.+= +..|+-.|. --+.|.+.+.++|..|..-|.+++-. . 
T Consensus 352 Ggqnlge-m~DV~YF~kkLy~aLnVP~sRl~~e~-~~~f~~gr~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~----q 425 (524) T protein:vir:10 352 GATGMSD-MDDVLYFRTALYRALRIPESRIPSES-NSGVMFDAGTAITRDELKFAKWIRQLQNKFEEIFLDPLKT----N 425 (524) T ss_pred ccCCcCh-HHHHHHHHHHHHHHhCCCchhccCCC-CccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHH----h Confidence 3322223 34455566667778999988773321 012333333 34567788888888887777766544 3 Q ss_pred HHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHc-----CCC----CHH----HHHHHh Q lcl|NC_019524. 436 EVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKN-----GLS----TYE----AEISRL 502 (556) Q Consensus 436 a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~-----G~~----s~~----~~~ae~ 502 (556) .+|.|.+.. ..|. ...-...|.-..-.+.-.+||++--..++.. +.. |.+ .+++.. T Consensus 426 LilKgiit~-eew~----------~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~t 494 (524) T protein:vir:10 426 LILKKIITE-DEWE----------REINNIKVTFNRDSYFSEMKDAEIMERRINMLTMAEPFIGKYISHQTAMKDFLQMT 494 (524) T ss_pred hhhccCCCH-HHHH----------HHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHHHHHHhccC Confidence 577787742 1111 1112345555666677888887755554432 100 111 222222 Q ss_pred CCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCC Q lcl|NC_019524. 503 GGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDN 549 (556) Q Consensus 503 G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~ 549 (556) -.+.++...|++.|. +.|+-. 
++++++++- T Consensus 495 Deei~~~~k~I~~E~----k~~~~~-------------~~~~~~~~f 524 (524) T protein:vir:10 495 DEEINQEAKQIEEES----KEARFQ-------------NPDEEEEDF 524 (524) T ss_pred HHHHHHHHHHHHHHh----hcCCCC-------------CCChhhhcC Confidence 333333333333332 223321 111111111 No 222 >protein:vir:103177 Length: 533 # NCBI annotation: gp131 # Family: family:all:1036 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717798;genbank:gi:113200635;genbank:GeneID:4239186 Probab=95.50 E-value=0.002 Score=35.44 Aligned_cols=480 Identities=13% Similarity=0.136 Sum_probs=229.3 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAA 80 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~ 80 (556) ||-.=+..-.+....+...+... -...+|+.-- ...+....-.+.+.. ..+-..|..+=|.|+ ++|-+- T Consensus 1 m~~lfg~~i~~~~~~~~~~s~~~-------~~~~dg~~~i--~~~~~~~~~~~~e~~-~~~~~eLI~~YR~ma-~~pEvd 69 (533) T protein:vir:10 1 MSQLFGFSLERAKKAPKGPSFVQ-------KDNLDGSQPV--SGGGYYGYTVDFDGQ-VRNEYQLISRYREMV-LQPECD 69 (533) T ss_pred CccccccccccccccccCCCCCC-------CCccccccee--ecccccceeeecccc-cchHHHHHHHHHHHh-hccchh Confidence 44433333222222211111100 0011221100 000111111111111 123456777778776 578889 Q ss_pred HHHHHHHhhhccC--CceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecC Q lcl|NC_019524. 81 GVVAVHRDSIVGS--QYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTG 158 (556) Q Consensus 81 ~~v~~~~~nvVG~--Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dG 158 (556) +||+-|++-+|=. +=.|. .++ |+-+ +..+.+.+.|...|+.-. 
..++|..--.-.+|.|.+|| T Consensus 70 ~Av~eIVneaiv~d~~~~pV-~i~---Ld~~-~~s~~iK~kI~eEF~~Il----------~ll~F~~~~~e~fR~WYVDg 134 (533) T protein:vir:10 70 SAVDDIVNETICGNFDDVPV-SVE---LSNL-KVSDKIKKLIREEFGEIL----------RLLDFENRSYEIFRRWYVDG 134 (533) T ss_pred hHHHHhhcceeeecCCCceE-EEE---eccc-ccchHHHHHHHHHHHHHH----------HHhccchhhhHHHhhhhhcc Confidence 9999888877642 22221 111 1111 245667777888887644 34677777778999999999 Q ss_pred ceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCC--C--CCCCCce-EEEEEEECCCCCeEEEEEeecCCCccccCCc Q lcl|NC_019524. 159 EVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNP--N--NVMDTPN-LRSGVQLDNNGAALGYWLRKAFPGDPTDMEQ 233 (556) Q Consensus 159 E~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~--~--~~~~g~~-i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~ 233 (556) -+|..++..+.. ++.|- ..|..|||-.+..= . ..+++.. +..+.+ -.+.-.-|++++ |......++ T Consensus 135 Ri~fHkiid~~~--pk~GI---~ELr~lDPr~i~~vr~i~~~~~~~~~~~~~~~~--v~~~~~eyf~Yn--p~g~~~~~~ 205 (533) T protein:vir:10 135 RLFYHKVIDPDN--PQGGL---IELRYIDPRKIRKINETEQKRPEQLRGLPLNQQ--LSPKSAEYFLYD--PKGLKNSTT 205 (533) T ss_pred eEEEEEEecCCC--ccccc---eeeeeccccceeeeeeeeccCCCccceeecchh--hhccceeeeeec--cccccccCC Confidence 999776554211 12221 35777888777521 1 1122211 112222 245556688884 443333222 Q ss_pred cccceeeccccCChhHeEeeecccCCCcccC--CchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccc---c Q lcl|NC_019524. 234 WKWGYEPARFDWGRRRVIHIIEALLAGQTRG--ISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDV---V 308 (556) Q Consensus 234 ~~~~rv~~~~~v~a~~viH~f~~~r~gQ~RG--vs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~---~ 308 (556) + ..++|.+-|..+.... ..=..| +|.|+++++.+.+|.=.+||.++=...-|-=--++.=+.+.-. 
+ T Consensus 206 ~-------~vkI~~dAI~y~hSGl-~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KA 277 (533) T protein:vir:10 206 Q-------GLKIAPDSICYVHSGI-MDLNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKA 277 (533) T ss_pred C-------ceecchhheeeeeccc-eeCCCCceeccchHhHHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhH Confidence 1 2356665444433333 333344 7999999999999999999998876655533211111111111 0 Q ss_pred cccc------------ccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHH Q lcl|NC_019524. 309 FGQL------------GMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLL 376 (556) Q Consensus 309 ~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~l 376 (556) .++. -....+......-.+++..+..-..+. +| -|.+|+.+.....-+. .+-+.... T Consensus 278 eqYlr~iM~k~KNklVYDa~TGev~ddrk~msMlEDyWLPRRe----Gg------rgTEItTLpGgqnLge-m~DV~YF~ 346 (533) T protein:vir:10 278 EQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRRE----GG------RGTEITTLPGGQNLGE-LEDVKYFQ 346 (533) T ss_pred HHHHHHHHHhccceEEEeccCceecccchhhhhHhhhcccccC----CC------CccceeeccccCCcCh-HHHHHHHH Confidence 0000 011222222333344444444443332 22 2444444443322223 34455666 Q ss_pred HHHHHhcCCCHHHhhchhhcccchhHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccc Q lcl|NC_019524. 377 RNIAASLGMSYEQFSRDYTKTNYSSARA-----SMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWR 451 (556) Q Consensus 377 r~iaaglGi~ye~l~~D~s~~nYSs~R~-----~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~ 451 (556) +.+=.+|+||-.-|..+ + .|+-.|. --+.|.+.+.++|..|..-|.+++-. ..+|.|.+.. ..|. 
T Consensus 347 kKLY~aLnVP~SRl~~e-~--~f~~Gr~~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~----qLiLKgiit~-eeW~-- 416 (533) T protein:vir:10 347 KKLYKSLNVPGSRLETE-T--TFNVGRAAEITRDEVKFQKFVARLRKRFSELFTDLLKT----QLVLKGVISI-EEWD-- 416 (533) T ss_pred HHHHHHhCCCccccCCC-C--cccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHH----hhhhccCCCH-HHHH-- Confidence 66777899998877654 2 3444444 34677788888888887777766544 3577787742 1111 Q ss_pred cccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHc------CCC---CHH----HHHHHhCCCHHHHHHHHHHHHH Q lcl|NC_019524. 452 MFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKN------GLS---TYE----AEISRLGGDFREVFKQRAREEG 518 (556) Q Consensus 452 ~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~------G~~---s~~----~~~ae~G~D~e~v~~q~a~E~~ 518 (556) ...-...|....-.+.-.+||++--..++.. .+- |.+ .+++..-.+.++...|++.|. T Consensus 417 --------~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~ILr~tDeei~~~~kqI~~E~- 487 (533) T protein:vir:10 417 --------QMKEHIQYDYIADNYFAELKEIEIRNERMNQVATMDPFVGKYFSVEYMRRQVLKQTDVEMKEIDKQIESEM- 487 (533) T ss_pred --------HHhhcceEeeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHHHHHHHHHHHHHHH- Confidence 1112345555666677888887755544432 111 222 223333444445555555553 Q ss_pred HHHHcCCCCCcccc-cc------CCCCCCCCCC--CCCCCCCC-cCCC Q lcl|NC_019524. 519 LIKSLKLDFTGKMV-EG------NSTQSSNSSE--STSDNPNE-ETTQ 556 (556) Q Consensus 519 ~~~~~Gl~~~~~~~-~~------~~~~~~~~~~--~~~~~~~~-e~~~ 556 (556) +.|+-.++... 
.+ +..+....++ ++..+|++ -..| T Consensus 488 ---k~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 532 (533) T protein:vir:10 488 ---ESGIIADPAAEMDPAMAAGDPDAGGAPAEEVAPEGPDPSDERKAE 532 (533) T ss_pred ---hCCCCCCCcchhhHHhcCCCCCcCCcccccCCCCCCCcchhhccC Confidence 34444322111 00 0111111111 11111111 1222 No 223 >protein:vir:104892 Length: 558 # NCBI annotation: T4-like capsid assembly protein # Family: family:all:1036 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214363;genbank:gi:61806003;genbank:GeneID:3294412 Probab=94.69 E-value=0.0038 Score=33.88 Aligned_cols=490 Identities=12% Similarity=0.089 Sum_probs=232.5 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAA 80 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~ 80 (556) |+-.=+..-...+.......... .--..+|+.. -..+++....-+.+. ...+-..|..+=|.| .++|-+- T Consensus 1 m~~lfgf~~~~~~~~~~~~~s~~------~p~~ddg~~~--~~~~g~~~~~~~~~~-~~~~~~eLI~~YR~m-a~~pEvd 70 (558) T protein:vir:10 1 MAKLFGFSIEETQKKSTSIISPV------PKNNEDGVDN--FISSGFYGQYVDIEG-AYRSEYDLIRRYREM-ALHPEAD 70 (558) T ss_pred CcchhcchhhhhhhhccCCcccc------CCCccccccc--eeccceeeeeecccc-hhhhHHHHHHHHHHH-hhccchh Confidence 33333322211111111100000 0012333311 112333332222222 234567889999999 7899999 Q ss_pred HHHHHHHhhhccC--CceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecC Q lcl|NC_019524. 81 GVVAVHRDSIVGS--QYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTG 158 (556) Q Consensus 81 ~~v~~~~~nvVG~--Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dG 158 (556) +||+-|++-+|=. +=.|. .++..-+ +..+.+.++|...|+.-.. 
.++|..--.-.+|.|.+|| T Consensus 71 ~Av~eIVneaiv~d~~~~pV-~i~Ld~~----~~s~~iK~kI~eEF~~Il~----------ll~F~~~~~e~fR~WYVDg 135 (558) T protein:vir:10 71 GAIEDVVNEAIVSDLYDSPV-EVELSNL----NASNTLKKKIREEFRYIKE----------MMDFDKKSHEIFRNWYVDG 135 (558) T ss_pred hHHHHhhcceeEecCCCceE-EEEeccc----CcchHHHHHHHHHHHHHHH----------HhccchhhhHHHhhheeee Confidence 9999999887752 22221 1111111 1245677888888886432 4567777778899999999 Q ss_pred ceEEEEeeccCCCCcCCCcccceEEEEEchhhcCC----CCCCCCCc---eEEEEEEECCCCCeEEEEEeecCCCccccC Q lcl|NC_019524. 159 EVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSN----PNNVMDTP---NLRSGVQLDNNGAALGYWLRKAFPGDPTDM 231 (556) Q Consensus 159 E~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~----~~~~~~g~---~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~ 231 (556) -+|..++..... ++.|- ..|..|+|-.+.. .....+++ .+..+.++--...-.-|+++. |+..... T Consensus 136 RiyfHKiid~k~--pk~GI---~ELr~lDPr~i~~Vr~i~~~~~~~~~~~~~~~~~~~~~~~~~~eyy~Y~--~~~~~~~ 208 (558) T protein:vir:10 136 RVFYLKVIDTKN--PQEGI---QDLRYIDPLKIKFIRQEKRKPGNQDPAIRVRSEQDVVPNPEFEEFYIYT--PKVQHPT 208 (558) T ss_pred EEEEEEEEeCCC--ccccc---eeeeeeCcccceeeeeeccccccccceeeeecccceeeccceeEeeeec--CCccccc Confidence 999887663221 11121 3677788887731 11111111 122222221223444566663 3322111 Q ss_pred CccccceeeccccCChhHeEeeecccCCCcccC--CchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccc-- Q lcl|NC_019524. 232 EQWKWGYEPARFDWGRRRVIHIIEALLAGQTRG--ISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDV-- 307 (556) Q Consensus 232 ~~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~RG--vs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~-- 307 (556) ............+++.+ .||+-...-.+..-| +|.|+++++.+.+|.=.+||.++=...-|-=--++.=+.+.-. 
T Consensus 209 ~~~~~~~~~~~vkI~~d-AI~y~hSGL~d~~~~~i~syLhkAIKp~NQLkmlEDAlVIYRitRAPERRvFYIDVGnLPk~ 287 (558) T protein:vir:10 209 GMVGQMGGKNSIKIAKD-SITMCTSGLVDRNKNRVLSYLHKAIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKV 287 (558) T ss_pred ccceeecCCCceeechh-heeeecccceecCCCeeeecchHhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCch Confidence 11000000011244544 555555433444444 7999999999999999999998876655533211111111111 Q ss_pred -ccccc------------ccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHH Q lcl|NC_019524. 308 -VFGQL------------GMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQS 374 (556) Q Consensus 308 -~~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~ 374 (556) +.++. -....+......-.+++..+..-..+. +| -|.+|+.+.....-+. .+-+.. T Consensus 288 KAeqYlr~iM~k~KNklVYDa~TGev~ddrk~msMlEDyWLpRRe----Gg------rgTEItTLpGgqnLge-m~DV~Y 356 (558) T protein:vir:10 288 KAEQYLKEVMSRYRNKLVYDANTGEVRDDRKFMSMMEDFWLPRRE----GG------RGTEITTLPGGQNLGE-LSDVDY 356 (558) T ss_pred hHHHHHHHHHHhccceEEEeccCceecccchhhhhHhhhcccccC----CC------CccceeeccccCCcch-HHHHHH Confidence 00000 011222222333344444444443332 22 2444444443322222 234555 Q ss_pred HHHHHHHhcCCCHHHhhchhhcccchhHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcc Q lcl|NC_019524. 375 LLRNIAASLGMSYEQFSRDYTKTNYSSARA-----SMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKN 449 (556) Q Consensus 375 ~lr~iaaglGi~ye~l~~D~s~~nYSs~R~-----~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~ 449 (556) ..+-+=.+|+||-.-|..+ + .|+-.|. --+.|.+.+.++|..|..-|.+++-. ..+|.|.+..- .|. 
T Consensus 357 F~kKLy~aLnVP~SRl~~e-~--~f~~Gr~~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~----qLilKgiit~e-eW~ 428 (558) T protein:vir:10 357 FQKKLYRALGVPESRIAAE-G--GFNLGRSSEILRDELKFAKFVGRLRKRFAAMFNDMLKT----QLVLKNIVTPE-DWK 428 (558) T ss_pred HHHHHHHHhCCCccccCCC-C--cccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHH----hhhhccCCCHH-HHH Confidence 6666677899998877655 2 3444444 24567888888888887777766544 25777777531 111 Q ss_pred cccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHc-----CCC----CHH----HHHHHhCCCHHHHHHHHHHH Q lcl|NC_019524. 450 WRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKN-----GLS----TYE----AEISRLGGDFREVFKQRARE 516 (556) Q Consensus 450 ~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~-----G~~----s~~----~~~ae~G~D~e~v~~q~a~E 516 (556) ...-...|....-.+.-.+||++--..++.. +.. |.+ .+++..-.+.++...|++.| T Consensus 429 ----------~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~ILr~tDeeI~~~~kqI~~E 498 (558) T protein:vir:10 429 ----------TMEDHIQYDFLYDNQFAELKESELMEGRLGMLATIEPYIGKYYSTEYVRKRVLRQTDMEIEEIDTQIEDE 498 (558) T ss_pred ----------HHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHHHHHHHHHHHHHH Confidence 1112345555666677888887755554432 111 222 22233334444444555544 Q ss_pred HHHHHHcCCCCCccccccCC-----CCCCCCCCCCCCCCCCcC---CC Q lcl|NC_019524. 517 EGLIKSLKLDFTGKMVEGNS-----TQSSNSSESTSDNPNEET---TQ 556 (556) Q Consensus 517 ~~~~~~~Gl~~~~~~~~~~~-----~~~~~~~~~~~~~~~~e~---~~ 556 (556) . +.|+-.++....+-. .++....++-+..+.+.. 
++ T Consensus 499 ~----k~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 542 (558) T protein:vir:10 499 I----QKGIIPDPSQIDPITGEPLPQEGDPAMEGMGEQPVDPDLEAQA 542 (558) T ss_pred H----hCCCCCCccccChhhccccCccCCchhccCCCCCcccccccch Confidence 3 345443322111111 110111111111111111 11 No 224 >protein:vir:104500 Length: 537 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214665;genbank:gi:61806306;genbank:GeneID:3294555 Probab=94.21 E-value=0.0051 Score=33.16 Aligned_cols=472 Identities=12% Similarity=0.131 Sum_probs=222.6 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAA 80 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~ 80 (556) |.+.+...+.. ..+. -...+|+.. -...+......+.+.. ..+-..|..+=|.|+ ++|-+- T Consensus 10 i~~~~~~~~~~-----s~~~----------~~~~dg~~~--~~~~~~~g~~~~~e~~-~~~~~eLI~~YR~ma-~~pEvd 70 (537) T protein:vir:10 10 LQRAKKVPKGP-----SFVQ----------KDSLDGSQP--IVGGGYFGYSVDFDGT-IRNDHELITRYREMV-LNPECD 70 (537) T ss_pred eecccccccCC-----cccC----------CCcccccce--eecccccccccccccc-cchHHHHHHHHHHHh-hccchh Confidence 22222211111 0000 011122110 0011111111111111 123356777888876 578889 Q ss_pred HHHHHHHhhhccC--CceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecC Q lcl|NC_019524. 81 GVVAVHRDSIVGS--QYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTG 158 (556) Q Consensus 81 ~~v~~~~~nvVG~--Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dG 158 (556) +||+-|++-+|=. +=.|. .++..-+ +..+.+.++|...|+.-. 
..++|..--.-.+|.|.+|| T Consensus 71 ~Av~eIVneaiv~d~~~~pV-~i~Ld~~----~~s~~iK~kI~eEF~~Il----------~ll~F~~~~~e~fR~WYVDg 135 (537) T protein:vir:10 71 SAVDDVVNETICGNFDDVPI-SIDLHNL----KQSEKIKKLIRSEFDEIL----------RLLDFDNRAYEIFRRWYVDG 135 (537) T ss_pred hHHHHhhcceeEecCCCceE-EEEeccc----ccchHHHHHHHHHHHHHH----------HHhccchhhhHHHhhheeee Confidence 9999888877642 22221 2221111 235567777888887644 34677777778999999999 Q ss_pred ceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCC-----eEEEEEeecCCCccccCCc Q lcl|NC_019524. 159 EVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGA-----ALGYWLRKAFPGDPTDMEQ 233 (556) Q Consensus 159 E~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr-----~vaY~i~~~hpgd~~~~~~ 233 (556) -+|..++..+.. ++.|- ..|..|+|-.+..-..... ...++++....++ -.-|.+++ |......+ T Consensus 136 Ri~fhKiid~k~--pk~GI---~ELr~lDPr~i~~vR~i~~--~~~~~~~~~~~~~~v~~~~~eyf~yn--p~g~~~~~- 205 (537) T protein:vir:10 136 RLFFHKVIDPKK--PRQGL---VELRYVDPRKIRKVTEYEA--KRPEALRTQDLNQQLTQQSASYFLYN--PKGLKNST- 205 (537) T ss_pred EEEEEEEEeCCC--ccccc---eeeeeeCCccceeeEeecc--cCCccceEEecceeeeecccceeeec--cccccccC- Confidence 999887663221 11121 3677888888742111000 0011222222222 23455553 43333322 Q ss_pred cccceeeccccCChhHeEeeecccCCCc--ccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccc---c Q lcl|NC_019524. 234 WKWGYEPARFDWGRRRVIHIIEALLAGQ--TRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDV---V 308 (556) Q Consensus 234 ~~~~rv~~~~~v~a~~viH~f~~~r~gQ--~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~---~ 308 (556) .. ..++|.+ .||+-...-.+. .-.+|.|+++++.+.+|.=.+||.++=...-|-=--++.=+.+.-. 
+ T Consensus 206 ~~------~vkI~~d-AI~y~hSGl~d~n~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KA 278 (537) T protein:vir:10 206 NQ------GMKIAPD-SIAYCHSGIQDLNKNMVLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKA 278 (537) T ss_pred CC------ceeccHh-heeeecccceeCCCCeeeeeehhhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhH Confidence 22 2356664 444444322222 3458999999999999999999998766655533211111111111 0 Q ss_pred cccc------------ccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHH Q lcl|NC_019524. 309 FGQL------------GMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLL 376 (556) Q Consensus 309 ~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~l 376 (556) .++. -....+......-.+++..+..-..+. +| -|.+|+.+.....-+. .+-+.... T Consensus 279 eqYlr~iM~k~KNklVYDa~TGev~ddrk~msMlEDyWLPRRe----Gg------rgTEItTLpGgqnlge-m~DV~YF~ 347 (537) T protein:vir:10 279 EQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRRE----GG------RGTEISTLPGGQNLGE-LEDVKYFQ 347 (537) T ss_pred HHHHHHHHHhccceEEEeccCceecccchhhhhhhhhcccccC----CC------cccceeeccccCCcCh-HHHHHHHH Confidence 0000 011222222333344444444443332 22 2444444443322223 34455666 Q ss_pred HHHHHhcCCCHHHhhchhhcccchhHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccc Q lcl|NC_019524. 377 RNIAASLGMSYEQFSRDYTKTNYSSARA-----SMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWR 451 (556) Q Consensus 377 r~iaaglGi~ye~l~~D~s~~nYSs~R~-----~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~ 451 (556) +.+=.+|+||-.-|..+ + .|+-.|. --+.|.+.+.++|..|..-|.+++-. ..+|.|.+.. ..|. 
T Consensus 348 kKLy~aLnVP~SRl~~e-~--~f~~Gr~~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~----qLilKgiit~-eeW~-- 417 (537) T protein:vir:10 348 KKLYKALNVPSSRLETE-T--TFNIGRAAEITRDEVKFQKFIARLRKRFSELFVDLLKT----QLILKGICSI-EEWE-- 417 (537) T ss_pred HHHHHHhCCCccccCCC-C--cccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHH----hhhhccCCCH-HHHH-- Confidence 66777899998877654 2 3444444 34567788888888887777766544 2577777742 1111 Q ss_pred cccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHc------CCC---CH----HHHHHHhCCCHHHHHHHHHHHHH Q lcl|NC_019524. 452 MFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKN------GLS---TY----EAEISRLGGDFREVFKQRAREEG 518 (556) Q Consensus 452 ~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~------G~~---s~----~~~~ae~G~D~e~v~~q~a~E~~ 518 (556) ...-...|....-.+.-.+||++--..++.. .+- |. +.+++..-.+.++...|++.|.. T Consensus 418 --------~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~s~dyi~k~ILr~tDeeI~~~~k~I~~E~k 489 (537) T protein:vir:10 418 --------EMKEHIQFDFIADNYFTELKEIEIRNERMNEVAQMDPYVGKYFSANYIRTKVLKQTESEIKEIDKEIKQEIA 489 (537) T ss_pred --------HHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhhcccchHHHHHHHhccCHHHHHHHHHHHHHHhh Confidence 1112345555666677888887755544432 111 22 22333334455555555655543 Q ss_pred HHHHcCCCCCccccccC--CCCCCCCCCCCCCCCCCcCC---------C Q lcl|NC_019524. 519 LIKSLKLDFTGKMVEGN--STQSSNSSESTSDNPNEETT---------Q 556 (556) Q Consensus 519 ~~~~~Gl~~~~~~~~~~--~~~~~~~~~~~~~~~~~e~~---------~ 556 (556) .|+-.++...... 
..+...+-++.+.+|+.|.+ - T Consensus 490 ----~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 534 (537) T protein:vir:10 490 ----DGVIMDPQAMQAMEMGIGDEEPVPEGGEEPQTDPNSAVSPADQKR 534 (537) T ss_pred ----CCCCCCcccccccccCCCCcccCCCCCCCcccCCccCCCCCCccC Confidence 2433221110000 00000111111112221111 1 No 225 >protein:vir:103458 Length: 524 # NCBI annotation: portal vertex of the head # Family: family:all:1036 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803110;genbank:gi:116326390;genbank:GeneID:4405487 Probab=94.07 E-value=0.0055 Score=32.97 Aligned_cols=472 Identities=11% Similarity=0.062 Sum_probs=225.8 Q ss_pred CCc--------chhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCC----CCHHHHHHHHHHHHHHH Q lcl|NC_019524. 1 MKD--------VKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSI----ISPDQQIAQNQDMASAR 68 (556) Q Consensus 1 ~sp--------~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~----~s~~~~i~~~~~~lr~R 68 (556) |++ -....=.............. +.+...-++++.....+-...+|.... ++.+.. ..+-..|..+ T Consensus 1 m~~~~L~~~~~w~~~de~~~~~~~~~~~~S~-~~p~~~Dga~e~~~~~~~~a~~~~g~~~~~~g~~e~~-~~~~~eLI~~ 78 (524) T protein:vir:10 1 MKFNVLSLFAPWAKMDERNFKDQEKEDLVSI-TAPKLDDGAREFEVSSNEAASPYNAAFQTIFGSYEPG-MKTTRELIDT 78 (524) T ss_pred CCCchhhHhhccccCcchhhhhhhccCCccc-cCccCCCCceeeeecccccccccceeeeehhcccccc-cchHHHHHHH Confidence 222 11111000000000000000 000011111111110000001222111 111111 2356778889 Q ss_pred HHHHHhcChHHHHHHHHHHhhhccC--CceeeeeccccccCCCh-hHHHHHHHHHHHHHHHHhcccccceehhcccCHHH Q lcl|NC_019524. 69 AQDMVQNDGYAAGVVAVHRDSIVGS--QYKLNAKPNTIVLGAPD-GWGEEFQEVVEARFNMAAESPENWFDARRMCTLTG 145 (556) Q Consensus 69 aRdl~rNn~~a~~~v~~~~~nvVG~--Gi~~~~~~~~~~lg~~~-~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~ 145 (556) =|.| .++|-+-.||+-|++-+|=. +=.|. .+.++. +..+.+.+.|...|+.-. ..++|.. 
T Consensus 79 YR~m-a~~pEvd~Av~eIVneaiv~d~~~~pV------~l~L~~~~~s~~iK~kI~eeF~~Il----------~ll~F~~ 141 (524) T protein:vir:10 79 YRNL-MNNYEVDNAVSEIVSDAIVYEDDTEVV------ALNLDKSKFSPKIKNMMLDEFNDVL----------NHLSFQR 141 (524) T ss_pred HHHH-hhccchhhHHHHhhcceeEecCCCceE------EEEecCcCcchHHHHHHHHHHHHHH----------HHhccch Confidence 9999 78999999999998887742 21121 122211 234566777888887543 3467777 Q ss_pred HHHHHhhhheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCC----CCCCCCceEEEEEEECCCCCeEEEEEe Q lcl|NC_019524. 146 LTRLAVSGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNP----NNVMDTPNLRSGVQLDNNGAALGYWLR 221 (556) Q Consensus 146 lq~l~~r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~----~~~~~g~~i~~GIE~d~~Gr~vaY~i~ 221 (556) --.-.+|.|.+||-+|..++..+.. ++.|- ..|..|+|-.+..- ....++..++.| -.-|+++ T Consensus 142 ~~~~~fR~WYVDgRi~fhKiid~k~--pk~GI---~Elr~lDPr~i~~vr~i~~~~~~~~~vi~~--------~~e~f~Y 208 (524) T protein:vir:10 142 KGSDHFRRWYVDSRIFFHKIIDPKR--PKEGI---KELRRLDPRQVQYVREIITETEAGTKIVKG--------YKEYFIY 208 (524) T ss_pred hhhHHHhhheeeeEEEEEEEeeCCC--ccccc---eeeeeeCCccceeeeeeccCCCccchhhcc--------hhhheee Confidence 7778899999999999887664221 11121 36777888877421 112233333333 3346665 Q ss_pred ecCCCccccCCccccceeeccccCChhHeEeeecccCCCcc-cCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEe Q lcl|NC_019524. 222 KAFPGDPTDMEQWKWGYEPARFDWGRRRVIHIIEALLAGQT-RGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVE 300 (556) Q Consensus 222 ~~hpgd~~~~~~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~-RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~ 300 (556) +.. ...+...+..+ ......++|.+-|.|.-...-+.-. -=+|.|+++++.+.+|.=.+||.++=...-|-=--++. 
T Consensus 209 ~~~-~~~y~~~g~~~-~~~~~ikI~~dAI~y~hSGL~d~~~~~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFY 286 (524) T protein:vir:10 209 DTA-HESYACDGRMY-EAGTKIKIPKAAIVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAVVIYRITRAPDRRVWY 286 (524) T ss_pred ccC-ccccccCcccc-CCCcceecchhheeeeeccceeCCCCceeccchhhhHHHHhhhHHHhhHHHHhhhccccceEEE Confidence 531 22222222110 0112346777777777665433222 33799999999999999999999887665553321111 Q ss_pred ccCcccc---ccccc------------ccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCC Q lcl|NC_019524. 301 SELPSDV---VFGQL------------GMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPG 365 (556) Q Consensus 301 ~~~~~~~---~~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~ 365 (556) =+.+.-. +.+.. -....+...+....+++..+..-..+. +| -|.+|+.+.....- T Consensus 287 IDvGnlPk~KAeqYl~~im~k~KNklvYDa~TGev~ddrk~msMlEDyWLpRRe----Gg------rgTEItTLpGgqnl 356 (524) T protein:vir:10 287 VDTGNMPARKAAEHMQHVMNTMKNRVVYDASTGKIKNQQHNMSMTEDYWLQRRD----GK------AVTEVDTLPGADNT 356 (524) T ss_pred EecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccchhhhhhHhhhcccccC----CC------cccceeeccccCCc Confidence 1111111 00000 012222233334445555444444442 22 24444444433222 Q ss_pred ccHHHHHHHHHHHHHHhcCCCHHHhhchhh-cccc---hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCC Q lcl|NC_019524. 366 GVGTDYEQSLLRNIAASLGMSYEQFSRDYT-KTNY---SSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGN 441 (556) Q Consensus 366 ~~f~~F~~~~lr~iaaglGi~ye~l~~D~s-~~nY---Ss~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~ 441 (556) +. .+-+....+.+=.+|+||-.-|..|-+ +.|+ |.+----+.|.+.+.+.|..|..-|.+++-. ..+|.|. 
T Consensus 357 ge-m~DV~YF~kkLy~aLnVP~sRl~~d~~~~f~~gr~~EItRDEikF~KFI~rLR~rFs~~f~~~Lk~----qLilKgi 431 (524) T protein:vir:10 357 GN-MEDVRWFRQALYMALRVPLSRIPQDQQGGVMFDSGTSITRDELTFAKFIRELQHKFEEVFLDPLKT----NLLLKGI 431 (524) T ss_pred Ch-HHHHHHHHHHHHHHhCCchhhcCCCCCccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHH----hhhhccC Confidence 23 344556666677789999988865532 2333 2222334567788888888887777776544 3577787 Q ss_pred ccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHc-----CCC----CHH----HHHHHhCCCHHH Q lcl|NC_019524. 442 VPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKN-----GLS----TYE----AEISRLGGDFRE 508 (556) Q Consensus 442 l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~-----G~~----s~~----~~~ae~G~D~e~ 508 (556) +.. ..|. ...-...|....-.+.-.+||++--..++.. +.- |.+ .+++..-.+.++ T Consensus 432 it~-eew~----------~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~ 500 (524) T protein:vir:10 432 ITE-DEWN----------DEINNIKIEFHRDSYFTELKEAEILERRINMLTMAEPFIGKYISHRTAMKDILQMTDEEIEQ 500 (524) T ss_pred CCH-HHHH----------HHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHHHHHHhccCHHHHHH Confidence 742 1111 1112345555666677888887755554432 100 111 222222333333 Q ss_pred HHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCC Q lcl|NC_019524. 509 VFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDN 549 (556) Q Consensus 509 v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~ 549 (556) ...|++.|. +.|+-. 
+++++++|- T Consensus 501 ~~k~I~~E~----k~~~~~-------------~~~~~~~~f 524 (524) T protein:vir:10 501 EAKQIEEES----KEARFQ-------------DPDQEQEDF 524 (524) T ss_pred HHHHHHHHh----hcCCCC-------------CCchhhhcC Confidence 333343332 223321 111111111 No 226 >protein:vir:103219 Length: 201 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277473;genbank:gi:71834115;genbank:GeneID:3562330 Probab=93.92 E-value=0.006 Score=32.78 Aligned_cols=195 Identities=10% Similarity=0.023 Sum_probs=94.3 Q ss_pred HHHHHHHHHHH---HHHHHHHHHhcceeeeEeccCcccccccccccccccccccccccccccccccccccceecCCceee Q lcl|NC_019524. 272 LKQMKMTRNFQ---EITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIP 348 (556) Q Consensus 272 l~~l~~l~~~~---dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~ 348 (556) +.+++.|.+.. +.++. ++++. +.. .. .+ -+++. T Consensus 1 V~k~~~l~~~~~~~~~~~~-~r~~~----~~~------------------------------------~~--~~-~~~~~ 36 (201) T protein:vir:10 1 MWKAKGLADLCDDSDGAAR-LRLAQ----VDN------------------------------------NS--GV-GQAIG 36 (201) T ss_pred CccchHHHHHhcCChHHHH-HHHHH----HHH------------------------------------hh--hh-hhhhe Confidence 11111111111 11111 11110 000 00 00 12333 Q ss_pred ecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccch-hHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 349 HLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYS-SARASMAETQKYMDSRKKLVADRFASA 427 (556) Q Consensus 349 ~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYS-s~R~~~~e~~r~~~~~q~~lv~~~~~p 427 (556) ....+|+++.++. +-++..+.+......||+..|||.-.|.|-- -...+ +.-.-+.-|+..++..|+..+..++.. 
T Consensus 37 ld~~~e~~e~~~~--~lsGl~d~l~~~~~~iaa~s~iP~t~LfG~s-p~Glnatge~d~~nyyd~i~~~Qe~~l~p~le~ 113 (201) T protein:vir:10 37 IDADSEEYNVLNS--DIGGIDTFLSQKFDRIVALSGIHEIILKGKN-VGGVSASQNTALETFYGYVDRKRKAELLPLLEF 113 (201) T ss_pred eecCCcceeeeec--CcCChHHHHHHHHHHHHhHhcCchhhhcCCC-CccccccchhHHHHHHHHHHHHHHHHHHHHHHH Confidence 3445677777764 4568999999999999999999999888743 34564 566677789999999998765555554 Q ss_pred HHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccch-hhhhHHHHHHHHcCCCCHHHHHHHhCCCH Q lcl|NC_019524. 428 IYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDE-KKETEAAILRIKNGLSTYEAEISRLGGDF 506 (556) Q Consensus 428 i~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP-~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~ 506 (556) +++. |. .|..+.+. ++.-|.+-.....+- .|.+++....+++|+.|..++..+ T Consensus 114 l~~~--------~~--~~~~~~~~-----------f~pL~~~s~kekAei~~~~a~a~~~~~~~g~i~~~e~r~~----- 167 (201) T protein:vir:10 114 LLPF--------IV--TEQEWSVE-----------FNPLSQVSDKDKSEILEKNVNSVAALIAAGIIDADEARDT----- 167 (201) T ss_pred HHHh--------hc--CCCCceEe-----------eCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHH----- Confidence 4441 22 23222111 122444444444332 345677777777777776655432 Q ss_pred HHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 507 REVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 507 e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 556 (556) +..+ ...|...+. ......+..++++|+++++. 
T Consensus 168 ------L~~~----~~~~~~~~~-------~~~~~~~~~e~~dp~~~~~~ 200 (201) T protein:vir:10 168 ------LRAI----STEVKIGEG-------SIQTEVVINESEDPLDVSAN 200 (201) T ss_pred ------HHhc----CCcCCCCCC-------CCCccccccccCCCCCCCCC Confidence 2221 111211111 00111112222223322222 No 227 >protein:vir:105154 Length: 525 # NCBI annotation: conserved phage-related protein # Family: family:all:6660 # MgeID: mge:1466 # MgeName: C-St # Cross-refs: genbank:acc:YP_398597;genbank:gi:80159853;genbank:GeneID:3772992 Probab=93.74 E-value=0.0065 Score=32.56 Aligned_cols=444 Identities=13% Similarity=0.098 Sum_probs=204.9 Q ss_pred ccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHH Q lcl|NC_019524. 37 AERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEF 116 (556) Q Consensus 37 a~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~ 116 (556) -.|+... ..-.++..-.+-++..-...|-..-|+--.-+.....+++-.+.|....|+.-.... T Consensus 1 ~~~~~~~-~~~~~t~~k~~~~~e~~~~~~n~~~~~y~ty~~~~~~f~~gfv~~~~~ng~i~~v~~--------------- 64 (525) T protein:vir:10 1 MTRTKGS-KNKSTTIEKQSLQIEQLQEHINELERQYNTYDDVVDAFIDGFVMDLCNNGKIKTVNL--------------- 64 (525) T ss_pred CCCCcCC-cccccchhhhhhhHHHHHHHHhhhhhhcchhhhHHHHHHHHHHHHhhcCCceeeeeH--------------- Confidence 1111110 000111111112222223333333333333345567777777888888887655432 Q ss_pred HHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCC-- Q lcl|NC_019524. 117 QEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNP-- 194 (556) Q Consensus 117 ~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~-- 194 (556) +.+ +.|..++..+ ...+..|+.--.+.+||+|-.......-. +.-+++.++.-+.-... 
T Consensus 65 -~~l----~~~f~npd~~--------~~~i~~l~~y~yi~~~~v~ql~~li~~lp------~l~y~i~~~~~~k~~~~~~ 125 (525) T protein:vir:10 65 -DTL----QLWFNNPDKY--------INNIVNLLTYYYIIDGNVFQLYDLIFSLP------PLDYQIKVLKRDKDYKEDL 125 (525) T ss_pred -HHH----HhhhcChHHH--------HHHHHHHHHHhhhhcchHHHHHHHHHhcC------CcceeehhhhhccchhhHH Confidence 111 2333333211 12233344444456666654322211100 11223322221110000 Q ss_pred --C-------------------CCCC-C--------------ceEEEEEEE-CCCCCeEEEEEeecCCC-------c--- Q lcl|NC_019524. 195 --N-------------------NVMD-T--------------PNLRSGVQL-DNNGAALGYWLRKAFPG-------D--- 227 (556) Q Consensus 195 --~-------------------~~~~-g--------------~~i~~GIE~-d~~Gr~vaY~i~~~hpg-------d--- 227 (556) . .... | .+|.+-|.+ =.+||+-+=|+|-.+-- + T Consensus 126 s~~n~~l~k~i~hk~ltrdll~q~a~~gtlig~wlg~~~~py~~vf~~~kyvfp~~r~~g~~v~vid~~~f~~~~~~~r~ 205 (525) T protein:vir:10 126 STINLYLEKKIQHKQLTRDLLVQLAHSGTLIGTWLGSKREPYFNVFNNLKYVFPYGRAKGKMVAVIDLQWFDEMSELERK 205 (525) T ss_pred HHHHHHHHHhHHHHHHHHHHHHHhhccCceeEeeecCCCCcchhhhhhhhhhccccccCCceEEEEehHHhhhhhHHHHH Confidence 0 0001 1 112222221 13556666665543210 0 Q ss_pred -------ccc--CCccccce-------eeccccCChhHeEeeeccc-CCCcccCCchhhHHHHHHHHHHHHHHHHHH-HH Q lcl|NC_019524. 228 -------PTD--MEQWKWGY-------EPARFDWGRRRVIHIIEAL-LAGQTRGISEMVSALKQMKMTRNFQEITLQ-NA 289 (556) Q Consensus 228 -------~~~--~~~~~~~r-------v~~~~~v~a~~viH~f~~~-r~gQ~RGvs~la~~l~~l~~l~~~~dael~-~a 289 (556) ++- ..-..|.. --++...|-+.++|+.-.. -..|.-|+||..|.+-.+.++.+|.++|.- +. T Consensus 206 ~~~~~lsp~i~~~~y~~~~~~~~~~~~~~r~i~LP~e~t~~lr~~tl~rnqrlG~s~vtp~l~dI~hk~klrd~EqsIA~ 285 (525) T protein:vir:10 206 LTFENLSPLITENKYKKWKEYNGENEDALRYIMLPISKTLVARIHTLSRNQRLGIPYGTQTLFDIQHKQKLRDLEQSIAD 285 (525) T ss_pred HHHHhhchhhhhhhhhHHhhcccccchhheeeecccceeEEeeecccccCcccCcchhhhHHHHHHHHHHHHHHHHHHHH Confidence 000 00011221 1134568889999998764 456666999999999999999999999964 55 Q ss_pred HHhcceeee-EeccCcccccccccccccccccccccccccccccccccccceecCCce-eeecCCCceeeeecCCCCCcc Q lcl|NC_019524. 
290 VVNATYAAS-VESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAK-IPHLYPGTKLKMQPAGTPGGV 367 (556) Q Consensus 290 ~i~A~~~~f-i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~-i~~L~pGe~i~~~~~~~p~~~ 367 (556) ++.-.|++. |-...+....... ..-..-..+...+. ........|- ++-++.=-+|+|-+-.....+ T Consensus 286 kii~a~avLk~gg~~gn~mk~p~-----~~kqkil~gVk~al------eK~~kdK~Gi~vi~~Pdfa~~efp~ik~~~~g 354 (525) T protein:vir:10 286 KIIKAMAVLKFRGKDDNDSKVKE-----SAKRKVLAGVKRAL------EKGVKDKNGIACIAMPDFATFEFPEIKNGDKT 354 (525) T ss_pred HhhhhheeeeeccccCccccCch-----HHHHHHHHHHHHHH------hcccccccCeEEEeccceeecccccccCcccC Confidence 666555543 2222221111000 00000000000000 0011122232 222333333333222111111 Q ss_pred HH-HHHHHHHHHHHHhcCCCHHHhhchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCC Q lcl|NC_019524. 368 GT-DYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPP 446 (556) Q Consensus 368 f~-~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~ 446 (556) .+ .=.+.+-..|-.++|+|-..++|| +-||+++...|--|.+.+..+-+. |++..+-++.+.|.++- T Consensus 355 lDg~K~d~I~~DI~~A~GlS~sL~nGd--ggNyAtaslnld~fykkigVm~e~-Iee~y~kL~d~Vl~~~k--------- 422 (525) T protein:vir:10 355 LDPKKYDSIDNDITNATGISQVLTNGT--KGNYASAKLNLDVFYKKIGVMLEI-IEEIYNQLIDIILGEEK--------- 422 (525) T ss_pred CCchhhhhhhhhhhhhhccceeeecCC--CCceeeeeeeHHHHHHHHHHHHHH-HHHHHHHHHhhhcCccc--------- Confidence 11 134567778899999999999998 689999999998888877766555 45777777777665411 Q ss_pred CcccccccchhhHHHhhCeeeecC--cccccchhhhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHH--HHH Q lcl|NC_019524. 447 GKNWRMFYDPMMRDALCNAEWIGA--SRGQIDEKKETEAAILRIKNGLSTYEAEISRLGGDFREVFKQRAREEGL--IKS 522 (556) Q Consensus 447 ~~~~~~~~~~~~~~a~~~~~w~~p--~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~--~~~ 522 (556) .|.|+.. .-.-||-.|.+.--++.-.-|.++..-.- -.|.|||+-++|--.|.+. 
++| T Consensus 423 -----------------~~nyifnydkd~pi~~kkk~d~LIkL~d~g~s~k~vld-l~gis~e~y~E~s~yEtE~lkl~E 484 (525) T protein:vir:10 423 -----------------GCNYIFQYNKDTPIEREKKLDTLIKLEAQGYSAKYVLD-ILGISSEEYFEESIYEIEKLKLRE 484 (525) T ss_pred -----------------CcceEEecCCCchhhhhhhhhhhhhhhccchhhhhhhh-hhccCcchHHHHHHHHHHHHHHhh Confidence 1222222 11224555656666777777887776544 6799999999997666544 445 Q ss_pred cCCCCCccccccCCCCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 523 LKLDFTGKMVEGNSTQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 523 ~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 556 (556) .=.++-...+-.-.+...-..+..++++..|.+= T Consensus 485 Ki~pp~~~~v~SGk~~n~iG~P~~dd~~~~dati 518 (525) T protein:vir:10 485 KIMPPLNTNVLSGKDGNDIGSPKLDDSDSSDATI 518 (525) T ss_pred hccccccceeeeccccccccCCccCCCcchhhhh Confidence 4443322222221111222222222222222111 No 228 >protein:vir:98265 Length: 524 # NCBI annotation: gp20 portal vertex of the head # Family: family:all:1036 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239198;genbank:gi:66391673;genbank:GeneID:3416367 Probab=93.31 E-value=0.008 Score=32.07 Aligned_cols=463 Identities=9% Similarity=0.062 Sum_probs=225.2 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCC-----CcccccccCC-CCCHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERT-----TREMFQWNPS-IISPDQQIAQNQDMASARAQDMVQ 74 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~-----~r~~~~w~~~-~~s~~~~i~~~~~~lr~RaRdl~r 74 (556) +++-....-........-.+...++ .-..+||.-- +-..++..-. -.+.+.. ..+-..|..+=|.| . 
T Consensus 13 ~~~~~~~d~~~~~~~~~~~~~s~~~-----p~~~dGa~~i~~~~~~~~~~g~~~~~y~~~e~~-~~~~~eLI~~YR~m-a 85 (524) T protein:vir:98 13 FKNFAREDEIELEQQLKNDTGSVAP-----PKNNDGAYEIETDLNNQKYAGVFQQFYSGQDPA-IQNKEQLINTYRGI-M 85 (524) T ss_pred hhhhhhhhhhhHhhhhcCCcccccC-----CCCCCCceeecCCCCcceecceeeeeccccccc-cchHHHHHHHHHHH-h Confidence 1111111111111111000000000 0112222100 0001111100 1111211 13567788899999 7 Q ss_pred cChHHHHHHHHHHhhhccC--CceeeeeccccccCCCh-hHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHh Q lcl|NC_019524. 75 NDGYAAGVVAVHRDSIVGS--QYKLNAKPNTIVLGAPD-GWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAV 151 (556) Q Consensus 75 Nn~~a~~~v~~~~~nvVG~--Gi~~~~~~~~~~lg~~~-~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~ 151 (556) ++|-+-+||+-|++-+|=. +=.|. .+.++. +..+.+.+.|...|+.-. ..++|..--.-.+ T Consensus 86 ~~pEvd~Av~eIVneaIv~~~~~~pV------~l~L~~~~~s~~iK~kI~eeF~~Il----------~ll~F~~~~~~~f 149 (524) T protein:vir:98 86 SYPEVENAVSEIIDDAIVNEQGKDII------TMDLAKTNFSKAIQDKIVEEFDNVL----------NIYDFDNMGARLF 149 (524) T ss_pred hccchhhHHHhhhcceeEecCCCceE------EEEecccccchHHHHHHHHHHHHHH----------HHhccchhhhHHH Confidence 8999999999998887642 21221 122222 234667777888887644 3466777777889 Q ss_pred hhheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCC----CCCC-CCCceEEEEEEECCCCCeEEEEEeecCCC Q lcl|NC_019524. 152 SGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSN----PNNV-MDTPNLRSGVQLDNNGAALGYWLRKAFPG 226 (556) Q Consensus 152 r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~----~~~~-~~g~~i~~GIE~d~~Gr~vaY~i~~~hpg 226 (556) |.|-+||-+|..++..+.+ ..| . ..|..|+|-.+.. .... .++..++.| -.-||++...-. 
T Consensus 150 R~WYVDgRi~fhkiid~~~---~kG--I-~ELr~lDPr~i~~vr~~~~~~~~~~~~v~~~--------~~e~f~Y~~~~~ 215 (524) T protein:vir:98 150 RDWYVDSRIYFHKIMHKDE---SKG--I-RELRQLDPRCMELIRESITETLDGGVKVFRG--------YREFFVYSAPKA 215 (524) T ss_pred hhhhhcceeEEEEEEcCCC---Ccc--e-eeeeeeCCccceeeeeccccccccchhhccc--------eeeeeeeccCCC Confidence 9999999999888764221 112 1 3677788887742 1111 122233333 556777764222 Q ss_pred ccccCCccccceeeccccCChhHeEeeecccC--CCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCc Q lcl|NC_019524. 227 DPTDMEQWKWGYEPARFDWGRRRVIHIIEALL--AGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELP 304 (556) Q Consensus 227 d~~~~~~~~~~rv~~~~~v~a~~viH~f~~~r--~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~ 304 (556) + +...+. ........++|.+.|+|+-...- .+.. ||.|+++++.+.+|.=.+||.++=...-|-=--++.=+.+ T Consensus 216 ~-~~~~g~-~~~~~~~ikI~~dAIvy~hSGL~d~~~~i--isyLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvG 291 (524) T protein:vir:98 216 G-YTYNGQ-IYQANQKIKIPRSAIVYAHSGLEDCSNNI--IGYLHRAVKPANQLRLLEDAMVIYRITRAPERRVFYIDVG 291 (524) T ss_pred c-cccccc-eecCCCceeechhheeeeccCcccCCCCe--eeehhHhhHhHHhhHHHHhhHHHHhhhccccceEEEEecC Confidence 2 111111 01111224788899999877643 2332 7999999999999999999998876655533221111111 Q ss_pred ccc---ccccc------------ccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHH Q lcl|NC_019524. 305 SDV---VFGQL------------GMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGT 369 (556) Q Consensus 305 ~~~---~~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~ 369 (556) .-. +.+.. -+...+...+....+++..+..-..+. +| .|.+|+.+.....-+. . 
T Consensus 292 nlPk~KAeqYl~~im~k~kNklvYDa~TGevrddrk~msMlEDyWLpRRe----Gg------rgTEItTLpggqnlge-m 360 (524) T protein:vir:98 292 QMGGNKATQYVNNIAQGLKNRVVYDARTGTVKNQQNNLSMTEDYWLMRRD----GK------AITEVSTLPGGQNFSD-M 360 (524) T ss_pred CCCchhHHHHHHHHHHhcCceeEeeccCceeeccccccchhhhhcccccC----CC------CccceeeccccCCcCh-H Confidence 111 00000 011122222233334444444443332 22 2444444443322233 3 Q ss_pred HHHHHHHHHHHHhcCCCHHHhhchhhcccchhHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccC Q lcl|NC_019524. 370 DYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSARAS-----MAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPL 444 (556) Q Consensus 370 ~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R~~-----~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~ 444 (556) +-+....+.+=.+|+||-.-|.-+ +.+|+-.|.+ -+.|.+.+.+.|..|..-|.+++-. ..+|.|.+.. T Consensus 361 ~DV~YF~kkLy~aLnVP~sRl~~~--~~~f~~Gr~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~----qLilKgiit~ 434 (524) T protein:vir:98 361 DDIKWFNRKLYEALRVPLSRMPRD--DGGMQIGGGGEITRDELKFSKFIRTLQIQFSPVLSDPLKT----NLIAKKIITE 434 (524) T ss_pred HHHHHHHHHHHHHhCCCceeccCC--CCccccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHH----hhhhhcCCCH Confidence 445556666777899998777422 3344444433 3567778888888887777666443 2567777642 Q ss_pred CCCcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcC---------CCCH----HHHHHHhCCCHHHHHH Q lcl|NC_019524. 445 PPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNG---------LSTY----EAEISRLGGDFREVFK 511 (556) Q Consensus 445 p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G---------~~s~----~~~~ae~G~D~e~v~~ 511 (556) ..|+ ...-...|.-..-.+.-.+||++--..++..- .-|. +.+++..-.+.++... T Consensus 435 -eew~----------~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~dyi~k~ILr~tDeei~~~~k 503 (524) T protein:vir:98 435 -DEWE----------ENVSKISFVFQQDSYYAEVKDIEILERRLNLMSQVEGVVGKYVSHKYIMKEILRMSDEDIDEQAK 503 (524) T ss_pred -HHHH----------HHhhcceEEEeecchHHHHHHHHHHHHHHHHHHHhccccccccchHHHHHHHhccCHHHHHHHHH Confidence 2111 11123455555666778888877655554321 1111 1222222333333333 Q ss_pred HHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCC Q lcl|NC_019524. 
512 QRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDN 549 (556) Q Consensus 512 q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~ 549 (556) |++.|. +.|+-. +++++.+|- T Consensus 504 ~I~~E~----k~~~~~-------------~p~~e~~~f 524 (524) T protein:vir:98 504 LIEEES----KEERFK-------------NPEAEEENF 524 (524) T ss_pred HHHHHH----hCCCCc-------------CCccccccC Confidence 333332 223211 111111122 No 229 >protein:vir:102668 Length: 547 # NCBI annotation: Hypothetical protein # Family: family:all:481 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024419;genbank:gi:48696640;genbank:GeneID:2948135 Probab=92.97 E-value=0.0093 Score=31.73 Aligned_cols=444 Identities=11% Similarity=-0.013 Sum_probs=187.9 Q ss_pred hhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHH-------H-----HHHHHHhcChHHHHHHHHHHhhh Q lcl|NC_019524. 23 ATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMAS-------A-----RAQDMVQNDGYAAGVVAVHRDSI 90 (556) Q Consensus 23 ~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr-------~-----RaRdl~rNn~~a~~~v~~~~~nv 90 (556) ..+. .....|+... +.| ..|-..-.....-+ ++.+- . +-|.----++.+..+++++++.+ T Consensus 1 ~~~~--~l~~r~~~l~-~~R--~~~e~~w~e~~~~~---lP~~~~~~~~~~~~~~~~~~~~~~i~dst~~~a~~~Las~L 72 (547) T protein:vir:10 1 MENS--KIVKRLDFLK-TDR--KNVEQIWDCIRKYI---MPMRSDFFSDLRSEGSINWNQNREVFDSTAGDGLETLSSSL 72 (547) T ss_pred CCHH--HHHHHHHHHH-HHh--hHHHHHHHHHHHHh---cccccccccCCCCCcccccccccccccchHHHHHHHHHHHH Confidence 1111 1112344332 122 11110000000000 00000 0 00111113478889999998888 Q ss_pred ccCCceeeeeccccccCCCh------hHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEEE Q lcl|NC_019524. 91 VGSQYKLNAKPNTIVLGAPD------GWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLATC 164 (556) Q Consensus 91 VG~Gi~~~~~~~~~~lg~~~------~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~~ 164 (556) .+ ||+|..++=.. |...+ .+.+.|-+.+++.-. 
++=-+.+|+.-...++...++-|-+++-+ T Consensus 73 ~~-~ltPp~~~WF~-l~~~d~~~~~~~~v~~~L~~ve~~i~----------~~l~~snf~~~~~~~~~~L~~~G~a~l~~ 140 (547) T protein:vir:10 73 HG-SLTSPATKWFE-LAFRDKELNSDDECRKWLENATHDVY----------SALQDSNFNLEANETYIDLCGYGNAIMVE 140 (547) T ss_pred HH-hhcCCCCcccc-cccCCccccchHHHHHHHHHHHHHHH----------HHHHhcCcHHHHHHHHHHHHhHCcEeEEe Confidence 86 46664443322 32221 233444444444322 22224578888888888888888886554 Q ss_pred eeccCCCCcCCCcccceEEEEEchhhc--CC-CCC-----------------------------------CCC----Cce Q lcl|NC_019524. 165 EWLNPTGTTMQRRPFGTAIQMISPYRM--SN-PNN-----------------------------------VMD----TPN 202 (556) Q Consensus 165 ~~~~~~~~~~~~~~~~l~lq~ie~drl--~~-~~~-----------------------------------~~~----g~~ 202 (556) ...+. .+-++.++.+....+ .. +.+ .++ --. T Consensus 141 ~~d~~-------~~~~~r~~~~pl~~~~v~~d~~G~v~~i~r~~~~t~~qi~~~fg~~~l~~~v~~~~~~~~~~~~~~~~ 213 (547) T protein:vir:10 141 EEDED-------EEGSVVFQSSPIQDSYFEEDSRGQVVNFYRVFRWTPAQIYDRFGDEGTPEAIIKKAKEASNQAALKQE 213 (547) T ss_pred ccCCC-------CCCceeEEEeecceEEEeeCCCcCeeeeeeeeeccHHHHHHhcCcccCCHHHHHHHhcCCCcccceEE Confidence 22110 011222222222221 10 000 000 123 Q ss_pred EEEEEEECCCC--------------CeEE-EEEeecCCCccccCCccccceeeccccCChhHeEeeecccCCCcccCCch Q lcl|NC_019524. 203 LRSGVQLDNNG--------------AALG-YWLRKAFPGDPTDMEQWKWGYEPARFDWGRRRVIHIIEALLAGQTRGISE 267 (556) Q Consensus 203 i~~GIE~d~~G--------------r~va-Y~i~~~hpgd~~~~~~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~ 267 (556) +++.|..+..+ +|.+ +|+.... ++.....+ .|. .. =.+..--...+|..-|.++ T Consensus 214 v~~~v~~~~~~~~~~~~~~~~~~~~~p~~s~~~e~~~-~~~~l~es-g~~------e~---P~~~~Rw~~~~ge~YGrgp 282 (547) T protein:vir:10 214 VVMCVFTRYDKKQNRNAGTVLAPTERPFGKKWILKEG-AVQLGEEG-GYY------EM---PAYAIRWRKSAGSQWGFGP 282 (547) T ss_pred EEEEEeeccCCCCCccccceeeccccceeEEEEEecC-ceeeeecC-Ccc------cC---CeeeeeeeecCCcccccch Confidence 55666543322 2222 2222111 11000000 000 00 1333333456899999999 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccccccccccccccccccceecCCcee Q lcl|NC_019524. 
268 MVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKI 347 (556) Q Consensus 268 la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i 347 (556) ..-+|-.++.|+.+..+.+..+.+++.....+... +- .....+.||.+ T Consensus 283 ~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~~v~~~-g~-------------------------------~~~~~~~pgg~ 330 (547) T protein:vir:10 283 SHLALPDVLTANRYVELVLRSSEKVIDPAIMVTER-GL-------------------------------ISDIDLGASGL 330 (547) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCceecccc-cc-------------------------------cccceecCCee Confidence 99999999999999999999999988877654421 10 01123557777 Q ss_pred eecCCCceeeeecCCCCCccHHH---HHHHHHHHHHHhcCCCHHHhhchhhcccchhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 348 PHLYPGTKLKMQPAGTPGGVGTD---YEQSLLRNIAASLGMSYEQFSRDYTKTNYSSARASMAETQKYMDSRKKLVADRF 424 (556) Q Consensus 348 ~~L~pGe~i~~~~~~~p~~~f~~---F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~ 424 (556) .....++.++.++.. ++|.. ....+-..|..++=..-. ...|-+.++-.=++.-..|..+.+-..=..|...| T Consensus 331 ~~~~~~~~v~pl~~~---~~~~~~~~~i~~~~~rI~~af~~d~~-~~~~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~ 406 (547) T protein:vir:10 331 TVVRDMESMKPFESR---ARFDVSSIQLTDLRSAVRRIYYVDQL-QMKDSPAMTATEVQVRYELMQRLLGPTLGRLENDF 406 (547) T ss_pred eecCCcccceeeecc---cchHHHHHHHHHHHHHHHHHhhhhhh-hcCCCccccHHHHHHHHHHHHHHhhHHHHHHHHHH Confidence 777888888877633 34432 233333344444322111 22344444444444444444444444444566789 Q ss_pred HHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHHhCC Q lcl|NC_019524. 425 ASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISRLGG 504 (556) Q Consensus 425 ~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~ 504 (556) +.|+.++.+..+...|.||-|+-. ........-...+|.|+--++.... +.+=....+.+..--+. 
T Consensus 407 l~Pli~r~~~il~r~g~lP~~p~~-------------l~~~~~~~~~v~~is~Laraq~~~~-~~~i~~~~~~v~~laq~ 472 (547) T protein:vir:10 407 LSPMIQRTFNIRFRAGKLGELPSK-------------LLESGKAAMDIVYTGPLSRAQKIDQ-AASIERWAGSTAQLAEI 472 (547) T ss_pred HHHHHHHHHHHHHhcCCCCCCchh-------------hhccCcceEEEEeccHHHHHHHHHH-HHHHHHHHHHHHHhhcc Confidence 999999999998899998854310 1111111111234455544332111 11000011111111122 Q ss_pred CHHHHHHHHHHH---HHHHHHcCCCCCcccc----------------------ccCCCCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 505 DFREVFKQRARE---EGLIKSLKLDFTGKMV----------------------EGNSTQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 505 D~e~v~~q~a~E---~~~~~~~Gl~~~~~~~----------------------~~~~~~~~~~~~~~~~~~~~e~~~ 556 (556) +|+ +++.+.-+ ...++.+|++...... ....++.+-+.-+ .....-..+| T Consensus 473 ~P~-vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~~qaa~~~~~g~~m~~~~-~~~a~~~~~~ 547 (547) T protein:vir:10 473 NPE-VLDIPDWDEMVRMLGSLLGAPQTLMRPKAKVTSIRKNRSQTQQKAEQAAIAEAEGNAMEAQG-KGQAALKENQ 547 (547) T ss_pred Chh-hhhcCCHHHHHHHHHHHhCCChhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc-CcccchhccC Confidence 332 11111111 0111233433210000 0000000000011 1112212333 No 230 >protein:vir:81017 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469501;genbank:gi:157311458;genbank:GeneID:5602316 Probab=92.88 E-value=0.0096 Score=31.64 Aligned_cols=468 Identities=10% Similarity=0.060 Sum_probs=227.2 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccC-----CCccc--ccccCCCCCHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAER-----TTREM--FQWNPSIISPDQQIAQNQDMASARAQDMV 73 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~-----~~r~~--~~w~~~~~s~~~~i~~~~~~lr~RaRdl~ 73 (556) |++..+..-...+..+.-.+...++ .-..+||.- .++.. ++....-.+.+. 
...+-..|..+=|.| T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~s~~~-----P~~~dGa~~i~~~~~~~~~~~gg~~~~~~~~e~-~~~~~~eLI~~YR~m- 80 (521) T protein:vir:81 8 LARWADFDNDKYEEQIKDKAESIAA-----PKNNDGATEVEINDNLPASAWNSLTQQFYSTDQ-KISTTKQLVNTYRGL- 80 (521) T ss_pred hHhhcCchhhhHHhhhccCcccccc-----CCCCCCceEecccCCCcceeecceeeeeccccc-chhhHHHHHHHHHHH- Confidence 4444443322222211111111100 011222210 00000 011111111111 123557788999999 Q ss_pred hcChHHHHHHHHHHhhhccC--CceeeeeccccccCCCh-hHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHH Q lcl|NC_019524. 74 QNDGYAAGVVAVHRDSIVGS--QYKLNAKPNTIVLGAPD-GWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLA 150 (556) Q Consensus 74 rNn~~a~~~v~~~~~nvVG~--Gi~~~~~~~~~~lg~~~-~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~ 150 (556) .++|-+-+||+-|++-+|=. +=.|. . +.++. +..+.+.++|...|+.-. ..++|..--.-. T Consensus 81 a~~pEvd~Av~eIVneaiv~d~~~~pV-~-----l~L~~~~~s~~iK~kI~eeF~~Il----------~ll~F~~~~~~~ 144 (521) T protein:vir:81 81 MNNHEVENAVQNIVNDAIVFEEGHEVV-S-----LNLEATGFSESVKERIHEEFKDLL----------NTIQFDRRGQDM 144 (521) T ss_pred hhccchhhHHHHhhcceeEecCCCceE-E-----EEecccccchHHHHHHHHHHHHHH----------HHhccchhhhHH Confidence 78999999999998887742 21221 1 22221 235666777888887644 346677777788 Q ss_pred hhhheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCcccc Q lcl|NC_019524. 151 VSGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTD 230 (556) Q Consensus 151 ~r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~ 230 (556) +|.|-+||-+|..++..+.+ +.|- ..|..|+|-.+..-..-.. .-..|+++ .+.-.-||++. |++... 
T Consensus 145 fR~WYVDgRi~fhkiid~~p---k~GI---~Elr~lDPr~i~~vr~i~k--~~~~~~~v--~~~~~e~f~Y~--~~~~~~ 212 (521) T protein:vir:81 145 FRRWYVDSRIFFHKIIGKNP---KDGI---VELRQLDPRNLEYVREIIT--EDTPEGKI--YKATKEYFIYT--VGNSSY 212 (521) T ss_pred HhhhhhcceEEEEEEEcCCc---cccc---eeeeeeCCcceeeeeeecc--cccCccce--ecceeeeeeee--cCCccc Confidence 99999999999888764321 2222 3677888887753211110 01123333 34466788883 443221 Q ss_pred C-CccccceeeccccCChhHeEeeecccCCCcccC--CchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccc Q lcl|NC_019524. 231 M-EQWKWGYEPARFDWGRRRVIHIIEALLAGQTRG--ISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDV 307 (556) Q Consensus 231 ~-~~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~RG--vs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~ 307 (556) . .++.+.-- ...+++.+-|..+.... ..=..| +|.|+++++.+.+|.=.+||.++=...-|-=--++.=+.+.-. T Consensus 213 ~~~g~~~~~~-~~vkI~~dAI~y~hSGl-~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlp 290 (521) T protein:vir:81 213 CAGGQVFSPN-SRVKIPRSAITYAHSGL-MDCDDKYIIGYLHRAVKPANQLKLLEDAMVVYRITRAPERRVFFIDTGNMN 290 (521) T ss_pred cccceeecCC-cceeechhheeeeeccc-eeCCCCeeeecchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCC Confidence 1 11111100 11345555444444333 333344 7999999999999999999998876655533211111111111 Q ss_pred ---ccccc------------ccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHH Q lcl|NC_019524. 308 ---VFGQL------------GMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYE 372 (556) Q Consensus 308 ---~~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~ 372 (556) +.+.. -+...+...+..-.+++..+..-..+. +| .|.+|+.+.....-+. 
.+-+ T Consensus 291 k~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~msMlEDyWLpRRe----Gg------rgTEItTLpGgqnlge-m~DV 359 (521) T protein:vir:81 291 NRKAAQHMNSVAQSFKNRVVYDASTGKLKNQQANLSMTEDYWLQRRD----GK------AITDVTTLPGASGMSD-IDDI 359 (521) T ss_pred chhHHHHHHHHHHhcCceeEeecccccccccccccchhhhhcccccC----CC------cccceeecccCCCCCh-HHHH Confidence 00000 011222222333334444444443332 22 2444444443322233 3445 Q ss_pred HHHHHHHHHhcCCCHHHhhchhhcccchhHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCC Q lcl|NC_019524. 373 QSLLRNIAASLGMSYEQFSRDYTKTNYSSARAS-----MAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPG 447 (556) Q Consensus 373 ~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R~~-----~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~ 447 (556) ....+.+=.+|+||-.-|..+- +.+|+-.|.+ -+.|.+.+.++|..|..-|.+++-. ..+|.|.+.. .. T Consensus 360 ~YF~kkLy~aLnVP~sRl~~e~-~~~~~~Gr~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~----qLilKgiit~-ee 433 (521) T protein:vir:81 360 RYFNRKLYEALRVPLSRSNLSD-ANMVIGGDGSEITRDELEFSKFIRTRQSQFSEVLRDPLKY----NLILKNVITE-DD 433 (521) T ss_pred HHHHHHHHHHhCCccccccCCC-CcceeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHH----hhhhhcCCCH-HH Confidence 5666667778999988774332 2244433433 4577888888888887777766444 2567777642 21 Q ss_pred cccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHc-----CCC----CHH----HHHHHhCCCHHHHHHHHH Q lcl|NC_019524. 448 KNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKN-----GLS----TYE----AEISRLGGDFREVFKQRA 514 (556) Q Consensus 448 ~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~-----G~~----s~~----~~~ae~G~D~e~v~~q~a 514 (556) |+ ...-...|.-..-.+.-.+||++--..++.. +.. |.+ .+++..-.+.++...|++ T Consensus 434 w~----------~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~dyi~k~ILr~tDeei~~~~k~I~ 503 (521) T protein:vir:81 434 WD----------REINNIKVVFHRDSYYTEVKDAEILERRIGLIERITPYIGKYFSNQTVMRDILKYTDDQMDTEKKQIE 503 (521) T ss_pred HH----------HHhhcceEEEeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHHHHHHHHHHHH Confidence 11 1112345555666677888887755554432 100 111 222222333333333333 Q ss_pred HHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCC Q lcl|NC_019524. 
515 REEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDN 549 (556) Q Consensus 515 ~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~ 549 (556) .|. +.|+-. +++++.++- T Consensus 504 ~E~----~~~~~~-------------~p~~~~~~f 521 (521) T protein:vir:81 504 EEA----NDPRFK-------------QTPDEIEDF 521 (521) T ss_pred HHh----hCCCCC-------------CCcccccCC Confidence 332 122211 111111222 No 231 >protein:vir:94572 Length: 535 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919010;genbank:gi:119637774;genbank:GeneID:5179332 Probab=92.53 E-value=0.011 Score=31.31 Aligned_cols=466 Identities=11% Similarity=0.055 Sum_probs=193.9 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAA 80 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~ 80 (556) ||-.+-..-.+++.+..+...-...|..--...-+-+..+-++.. +.....+ +.+.----++.+. T Consensus 1 ~~~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~---~~~~~~~------------~~~~~~~~dst~~ 65 (535) T protein:vir:94 1 MASSQKREGFAENGAKAVYDALKNDRNSYETRAENCAKYTIPSLF---PKDSDNA------------STDYTTPWQAVGA 65 (535) T ss_pred CCchhhhhhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccC---CCCCCcc------------ccccCCcccccHH Confidence 665555443333333222211111110000000000111111100 0000000 0000113677888 Q ss_pred HHHHHHHhhhccCCceeeeeccccccCCChhHHHHH------HHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhh Q lcl|NC_019524. 81 GVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEF------QEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGF 154 (556) Q Consensus 81 ~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~------~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~ 154 (556) .+++.+++.+.+ ||+|. ++ +-.|..++...+.+ ...++.++..-...- .++=.+.+||.-...++... 
T Consensus 66 ~a~~~Laa~l~~-~ltP~-~~-WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~---~~~~~~snf~~~~~~~~~~L 139 (535) T protein:vir:94 66 RGLNNLASKLML-ALFPM-QT-WMKLTISEFEAKQLVAQPAELAKVEEGLSMVERIL---MNYIESNSYRVTLFETLKQL 139 (535) T ss_pred HHHHHHHHHHHh-hhcCC-CC-ccccccChhhhhccccchhHHHHHHHHHHHHHHHH---HHHHHhcCcHHHHHHHHHHH Confidence 899999999988 67784 65 76666655332111 122333322211111 23334668998888888888 Q ss_pred eecCceEEEEeeccCCCCcCCCcccceEE-----------------EEEchhhcCCCC----------CCCCCceEEEEE Q lcl|NC_019524. 155 LMTGEVLATCEWLNPTGTTMQRRPFGTAI-----------------QMISPYRMSNPN----------NVMDTPNLRSGV 207 (556) Q Consensus 155 ~~dGE~f~~~~~~~~~~~~~~~~~~~l~l-----------------q~ie~drl~~~~----------~~~~g~~i~~GI 207 (556) ++.|-+++-+... .+....-..||+.= +-+..+.|+... +......|++-| T Consensus 140 ~~~G~a~l~~~~~--~~~~~~f~~~pl~~y~v~~d~~G~vd~i~r~~~~~~~~l~~~~~~~~~~~~~~~~~~~v~v~~~v 217 (535) T protein:vir:94 140 VVAGNALLYIPEP--EGTYNPMKLYRLSSYVVQRDAFGTVLQIVTLDKTAYAALPEDVRNSMDSSQEHKGDEMIDVYTHI 217 (535) T ss_pred HhhCcEeEeeccC--cCcccceEEEEcCeEEEeeCCCCCeEEEEeeeeccHHHhhHHHHHHHHhccccCCCceeEEEEEE Confidence 8888876543221 11100001111110 112222222100 001112467777 Q ss_pred EECCCCCeEE-EEEeecCCCccccCCccccceeeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 208 QLDNNGAALG-YWLRKAFPGDPTDMEQWKWGYEPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITL 286 (556) Q Consensus 208 E~d~~Gr~va-Y~i~~~hpgd~~~~~~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael 286 (556) ..++.+.+.+ |+-.+..+.....+. ..+... 
=.+..-....+|..-|.++..-+|-.+|.|+.+..+.+ T Consensus 218 ~~~~~~~~~~~~~e~~g~~~~~~~~~-~g~~~~---------P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l 287 (535) T protein:vir:94 218 YLDEESGEYLKYEEIDGVEVEGTDAS-YPVDAC---------PYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIV 287 (535) T ss_pred EeeCCCCcEEEEEEecCeeecccccc-CccccC---------CceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHH Confidence 7776654433 332222221110000 000000 13333344568999999999999999999999999999 Q ss_pred HHHHHhcceeeeEeccCcccccccccccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCc Q lcl|NC_019524. 287 QNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGG 366 (556) Q Consensus 287 ~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~ 366 (556) ..+.+++.-...+. +++.... ......++|.|+.-.+ ++|.+++-. ..+ T Consensus 288 ~~~~~a~~~~~lv~-p~g~~~~----------------------------~~~~~~~~g~~v~g~~-~~v~~~~~~-~~~ 336 (535) T protein:vir:94 288 KMSMISAKVIGLVN-PAGITQV----------------------------RRLTKAQTGDFVSGRP-EDISFLQLE-KAA 336 (535) T ss_pred HHHHHhccCCcccc-cccccch----------------------------hhcccCCCceeecCCc-ccceeeecc-ccc Confidence 99988876554433 2111100 0001123444433232 334443222 223 Q ss_pred cH---HHHHHHHHHHHHHhcCCCHHHhh-chhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCc Q lcl|NC_019524. 
367 VG---TDYEQSLLRNIAASLGMSYEQFS-RDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNV 442 (556) Q Consensus 367 ~f---~~F~~~~lr~iaaglGi~ye~l~-~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l 442 (556) +| ......+...|..++ -+.+++ .|-++++-.=++.-..|....+-..=..|-..|+.|+.++-+..+...|.| T Consensus 337 ~~~~~~~~i~~~~~rI~~af--~~~~~~~~d~~rvTAtEV~~r~~E~~~~LGpv~~rl~~ElL~Pli~r~~~il~r~g~l 414 (535) T protein:vir:94 337 DFSVARAVSEQIEGRLSYAF--MLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPMVRVLLKQLQATNQI 414 (535) T ss_pred chhHHHHHHHHHHHHHHHHH--hHhhhccCCCCCccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhCCCC Confidence 44 233444455555554 122332 455555555555555555555555455678889999999999999999999 Q ss_pred cCCCCcccccccchhhHHHhhCeeeecC-----------------------cccccchhhhhHHHHHHHHcCC------- Q lcl|NC_019524. 443 PLPPGKNWRMFYDPMMRDALCNAEWIGA-----------------------SRGQIDEKKETEAAILRIKNGL------- 492 (556) Q Consensus 443 ~~p~~~~~~~~~~~~~~~a~~~~~w~~p-----------------------~~~~iDP~Ke~~A~~~~i~~G~------- 492 (556) |-|.. ..++++++.+ +..-+|+.=+.......+...+ T Consensus 415 P~~p~-------------~~v~~~~vs~la~l~r~~~~~~l~~~~~~laq~~P~~ld~~id~d~~~~~~a~~~Gvp~~~i 481 (535) T protein:vir:94 415 PELPK-------------EAVEPTISTGMEALGRGQDLDKLERCIAAWSALAPMQGDPDINIATIKLRIANAIGIDTSGI 481 (535) T ss_pred CCCCh-------------hhccceEeehHHHHHHHHHHHHHHHHHHHHHhhChHHhhhcCCHHHHHHHHHHHhCCChhhh Confidence 75432 1122333221 0011222111222222221111 Q ss_pred -CCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCC Q lcl|NC_019524. 493 -STYEAEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPN 551 (556) Q Consensus 493 -~s~~~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~ 551 (556) +|.+++.+++ +..++.+.....+...|=.... 
....++.......+.-+=.|+ T Consensus 482 ~rs~eev~~~~-----~q~~~~~~~~~~~~~~g~~~~~-~~~~~~~~~~~~~~~~g~~~~ 535 (535) T protein:vir:94 482 LKTPEEKQQEM-----AEAAQGTAMQNAAASAGAGAGT-MATASPENMKAAAAQAGMAPN 535 (535) T ss_pred cCCHHHHHHHH-----HHHHHHHHHHHHHHHHHHhhhc-ccccChHHHHHHHHHhccCCC Confidence 1222222111 0000000011111111100000 000000000000000011111 No 232 >protein:vir:80165 Length: 651 # NCBI annotation: portal protein # Family: family:all:1548 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285799;genbank:gi:148747833;genbank:GeneID:5220441 Probab=91.77 E-value=0.014 Score=30.70 Aligned_cols=480 Identities=11% Similarity=0.006 Sum_probs=210.0 Q ss_pred CCcchhhhHHHH--Hh-hHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHH--HHHH-- Q lcl|NC_019524. 1 MKDVKKTTRTRA--KK-AVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARA--QDMV-- 73 (556) Q Consensus 1 ~sp~~~~~r~~a--~~-a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~Ra--Rdl~-- 73 (556) .+++....-... .+ +.-...+....+.... ..-..|. ........++ ++-+ T Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~-----------~~~~~w~-----------~~~~~~~~~~~~~~y~~~ 62 (651) T protein:vir:80 5 TTTTDKNRQTYDETHDVSSYVKKEYKRFCDARQ-----------VCEETWL-----------EAWGMYLSTPEAQDYLRD 62 (651) T ss_pred ccccchhhhhhhhhHHHHHHHHHHHHHHHHHhh-----------hhhhhHH-----------HHHHhhcccHHHHHhhcc Confidence 333333221111 00 1011111111111000 0000110 0011111111 1111 Q ss_pred ----------------hcChHHHHHHHHHHhhhccCCceeeeeccccccCC-ChhHHHHHHHHHHHHHHHHhccccccee Q lcl|NC_019524. 74 ----------------QNDGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGA-PDGWGEEFQEVVEARFNMAAESPENWFD 136 (556) Q Consensus 74 ----------------rNn~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~-~~~~~~~~~~~ie~~~~~w~~~~~~~cD 136 (556) --++.++..|+.+..+.... |.+..+. ...... +++.++...+.++.+|..+.. 
.| T Consensus 63 ~~~~~~~~~~~~~rs~~~~~~v~~~ve~~~~~l~~~-~~~~~~~-~~~~p~~~~d~a~~~~~~~~~~~~~~l~----~~- 135 (651) T protein:vir:80 63 QVLRSVGDVNADWRHKITTGKAFEAIETIHAYLMSA-TFPNKNW-FDVVPAKPGQDNLLVSRLIKRYVQDKLT----EG- 135 (651) T ss_pred ccccccCCCCCCCCccccChhHHHHHHHHHHHHHHh-hcCCCce-eEeccCCchhHHHHHHHHHHHHHHHHhh----cc- Confidence 12356666666666655552 2222111 111122 455566677778887775543 25 Q ss_pred hhcccCHHHHHHHHhhhheecCceEEEEeeccCCCCcCC----------C------------cccceEEEEEchhhcCCC Q lcl|NC_019524. 137 ARRMCTLTGLTRLAVSGFLMTGEVLATCEWLNPTGTTMQ----------R------------RPFGTAIQMISPYRMSNP 194 (556) Q Consensus 137 ~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~~~~~----------~------------~~~~l~lq~ie~drl~~~ 194 (556) +|.......+...+..|-+|++..|......... + ..-..++..|+|..+-.+ T Consensus 136 -----~~~~~~~~~~~d~l~~G~~i~kv~we~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~i~~v~p~~~~~d 210 (651) T protein:vir:80 136 -----KFRAAYANFLRQLLITGNSVLALPWRVETAEVKKKVQVRTPLFEDEPTFEVVSEEREVKSSPDFEVLDMFDCFYD 210 (651) T ss_pred -----CcHHHHHHHHHhhcccCceEEEEeecceeeeeehheeccccccccccceeeeccceeeeceeEEEEecHHHeeec Confidence 4777777888888999999998877432100000 0 001135666766655422 Q ss_pred CCC---CCCceEE--------------EEEEEC-----------------------------CCC-------CeEEEEEe Q lcl|NC_019524. 195 NNV---MDTPNLR--------------SGVQLD-----------------------------NNG-------AALGYWLR 221 (556) Q Consensus 195 ~~~---~~g~~i~--------------~GIE~d-----------------------------~~G-------r~vaY~i~ 221 (556) ... .+..+|. .|+..| ..| ..+-||+. T Consensus 211 p~a~~~~d~~~v~~~~~t~~~l~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~v~v~E~~~~ 290 (651) T protein:vir:80 211 PNVTDPNRGAFIRKLTKTKADILNLLSEGYYYGVDPLDVVEHKCKDTSDTKQDMLSTFQGVTTSLWSPHQNVELLEYWGD 290 (651) T ss_pred CCCcCccccceeeeeeeeHHHHHHHHhcccccchhhHHHHhhhccccccCCccccccccCCCccccccccceEEEEEEEE Confidence 111 1121211 121100 000 12334443 Q ss_pred ecCCCccccC-----CccccceeeccccC-ChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcce Q lcl|NC_019524. 
222 KAFPGDPTDM-----EQWKWGYEPARFDW-GRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATY 295 (556) Q Consensus 222 ~~hpgd~~~~-----~~~~~~rv~~~~~v-~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~ 295 (556) ..--|+.+.. .+..-.++...... ..+ .+|+--...||...|.+..--++...+-|+.+..+.+..+.+++.- T Consensus 291 ~d~e~~~~~~~~v~~~g~~il~~~~~~~~~~~P-f~~~~~~~~~~~~yG~g~~~~~~~~q~~ln~l~~~~ld~~~~~~~~ 369 (651) T protein:vir:80 291 IHLENKTYHDVVVTIMGNEVLRFEQNPYWCGRP-FVIGTYIPTARQPYAMGALQPNLGMLHELNIITNQRLDNLELAIDQ 369 (651) T ss_pred eeccCCceEEEEEEEcCcEEecccccCCCCCCC-eeeecceecCccccCCChHHHHhHHHHHHHHHHHHHHHHHHHHhCC Confidence 2222221110 00000010000000 112 3343334579999999999999999999999999999999998888 Q ss_pred eeeEeccCcccccccccccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCcc-HHHHHHH Q lcl|NC_019524. 296 AASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGV-GTDYEQS 374 (556) Q Consensus 296 ~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~-f~~F~~~ 374 (556) ..++..+.-. ........||.+++...+.++.++.+..++.. ...-... T Consensus 370 ~~~v~~d~~~------------------------------~~~~l~~~pg~vi~~~~~~~~~~l~~~~~~~~~~~~~l~~ 419 (651) T protein:vir:80 370 MYTLRSDGLL------------------------------QPEDVYTEPGKVFLVSDHGDLQPLANQSSNFSITYQESSF 419 (651) T ss_pred cEEecCCccc------------------------------cHHHhhcCCCceEEecCCCCceeeccCcccchhHHHHHHH Confidence 7776422100 00111246788888888888888876544311 1122344 Q ss_pred HHHHHHHhcCCCHHHhh---chhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccc Q lcl|NC_019524. 375 LLRNIAASLGMSYEQFS---RDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWR 451 (556) Q Consensus 375 ~lr~iaaglGi~ye~l~---~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~ 451 (556) +...+-..+||+-.... .+...++-+.++.-..+........-..|...|+.|++++.+......+..+-..-.. 
T Consensus 420 l~~~~~~~~gv~~~~~g~~~~~~~~~TAteI~~~~~~~~~~l~~v~~~l~~e~l~pl~~r~l~l~~~~~~~~~~~ri~-- 497 (651) T protein:vir:80 420 LESTIDKNFGTGNYVGANAARSGERVTAAEVAAVREAGGNRLSGIHKHIEETSLLVLLEKVMHLVQQFTDQPGMVRVA-- 497 (651) T ss_pred HHHHHHHHhcCChHHhCCCccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcccceeec-- Confidence 44456667788754321 2334456677777777888788777778888899999999999888777654211000 Q ss_pred cccchhhHHHhhCeeeecCcccccc----h-hhhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHH-HHHHH-HHHHHHcC Q lcl|NC_019524. 452 MFYDPMMRDALCNAEWIGASRGQID----E-KKETEAAILRIKNGLSTYEAEISRLGGDFREVFK-QRARE-EGLIKSLK 524 (556) Q Consensus 452 ~~~~~~~~~a~~~~~w~~p~~~~iD----P-~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~-q~a~E-~~~~~~~G 524 (556) ..+. ......|+.+.-..++ + ......... .-+..+..+..-.|.+|.-... ++++- .+.++.+| T Consensus 498 --~~~~---~~~~~~~i~~~dl~~~~~iv~~g~~~~~~r~---~~~~~l~~~~q~~~~~p~~~~~~~~~~~~~~l~~~~g 569 (651) T protein:vir:80 498 --GDEA---GAYEYYELDVEDLQKEVRLVPIGSDHVIERK---QYIEDRLTFIQAVAQVPEMGQLVDYKRILVDLLQHWG 569 (651) T ss_pred --cccc---ccccccccCccceeeeeeeeeccHHHHHHHH---HHHHHHHHHHHhhccCCccchhhhHHHHHHHHHHHcC Confidence 0000 0000111111111000 0 000000000 0111122233333445432211 12222 23556778 Q ss_pred CCCCccccccCCCCCCCCCCCCCC--CCCCcCCC Q lcl|NC_019524. 525 LDFTGKMVEGNSTQSSNSSESTSD--NPNEETTQ 556 (556) Q Consensus 525 l~~~~~~~~~~~~~~~~~~~~~~~--~~~~e~~~ 556 (556) +..+...... +...+...+.+.. 
..+....+ T Consensus 570 ~~~~~~~l~~-~~q~~~~~~~~~~~~q~~~~~~~ 602 (651) T protein:vir:80 570 FEEPEAYLKQ-QDQQAPANPQEALLSQAKDVGGQ 602 (651) T ss_pred CCCcHHhcCC-CccchhhhhhHHHHhhHHHHHHH Confidence 7643321111 1111111111000 00000000 No 233 >protein:vir:106282 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944108;genbank:gi:38640152;genbank:GeneID:2658030 Probab=91.06 E-value=0.018 Score=30.19 Aligned_cols=466 Identities=10% Similarity=0.082 Sum_probs=224.8 Q ss_pred CCcc-hhh-hHH--HHHh--hHhhcccchhhhhhhhcchhccccCC-----Cc-ccccccCCCCCHHHHHHHHHHHHHHH Q lcl|NC_019524. 1 MKDV-KKT-TRT--RAKK--AVDVVAETATATPMAVGGGMEGAERT-----TR-EMFQWNPSIISPDQQIAQNQDMASAR 68 (556) Q Consensus 1 ~sp~-~~~-~r~--~a~~--a~~~~~~~~~~~~~~~~~~y~aa~~~-----~r-~~~~w~~~~~s~~~~i~~~~~~lr~R 68 (556) |++- ..+ .-. .... .-....+. .+...--..+|+.-- +. ...+.....-+.+. -..+-..|..+ T Consensus 1 m~~~~l~lf~f~~k~~e~~~~~~~~~~~---~s~~~p~~~dGa~~I~~~~~~~~~~~~~~~~~~~~~~-~~~n~~eLI~~ 76 (521) T protein:vir:10 1 MNPIFLKLLQPWMKDDEKRVQSDLSDRI---DSFAVPDTADGAIEVDKQIDTTAPKTAIVQSVLGYAP-KIQNTKDLINQ 76 (521) T ss_pred CCcchhHHhhhhhhhhhhHHhhhhccCc---cccccccCCCCceeeccCCCccccccchhhhhhcccc-ccchHHHHHHH Confidence 3321 000 000 0000 00000000 000000112222100 00 00111111111111 12345678888 Q ss_pred HHHHHhcChHHHHHHHHHHhhhccC--CceeeeeccccccCCCh-hHHHHHHHHHHHHHHHHhcccccceehhcccCHHH Q lcl|NC_019524. 69 AQDMVQNDGYAAGVVAVHRDSIVGS--QYKLNAKPNTIVLGAPD-GWGEEFQEVVEARFNMAAESPENWFDARRMCTLTG 145 (556) Q Consensus 69 aRdl~rNn~~a~~~v~~~~~nvVG~--Gi~~~~~~~~~~lg~~~-~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~ 145 (556) =|.| .++|-+-.||+-|++-+|=. +=.|. . +.++. +..+.+.+.|...|+.-. ..++|.. 
T Consensus 77 YR~m-a~~pEvd~Av~eIvneaiv~d~~~~pV-~-----i~Ld~~~~s~~iK~kI~eeF~~Il----------~ll~F~~ 139 (521) T protein:vir:10 77 YRSL-SKYHEVDNAIDEIINDAIVQEDNRDTV-Y-----LDLDKTDWNESVKEMVREEFRTIL----------KLLKFER 139 (521) T ss_pred HHHH-hhccchhhHHHhhhcceEEecCCCceE-E-----EEecCcccchHHHHHHHHHHHHHH----------HHhccch Confidence 8999 78999999999998887742 21221 1 22211 224567778888888644 3466777 Q ss_pred HHHHHhhhheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCC----CCCCCCceEEEEEEECCCCCeEEEEEe Q lcl|NC_019524. 146 LTRLAVSGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNP----NNVMDTPNLRSGVQLDNNGAALGYWLR 221 (556) Q Consensus 146 lq~l~~r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~----~~~~~g~~i~~GIE~d~~Gr~vaY~i~ 221 (556) --.-.+|.|.+||-+|..++..+.. ++.|- ..|..|+|-.+..- ....++.+++.|+ .-|+++ T Consensus 140 ~~~~~fR~WYVDgRi~fHkiid~~~--pk~GI---~Elr~lDPr~i~~vr~i~k~~~~~~~v~~~~--------~e~f~Y 206 (521) T protein:vir:10 140 EGKRHFRRWYVDSRIYFHKMIDPAR--PKDGI---KELRLLDPRNVEYYRVNLKSNENGNDVYKGV--------KEFFTY 206 (521) T ss_pred hhhHHHhhheeeeeEEEEEEeeCCC--ccccc---eeeeeeCCcceeeeeeecCCCCCcchhhccc--------eeeeee Confidence 7778899999999999877653221 11221 36777888877421 1123444455554 367777 Q ss_pred ecCCCccccCCccccceeeccccCChhHeEeeeccc-CCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEe Q lcl|NC_019524. 222 KAFPGDPTDMEQWKWGYEPARFDWGRRRVIHIIEAL-LAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVE 300 (556) Q Consensus 222 ~~hpgd~~~~~~~~~~rv~~~~~v~a~~viH~f~~~-r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~ 300 (556) +.--..++...+.. ....++|.+.|.|.-... ...-.-.+|.|+++++.+.+|.=.+||.++=...-|-=--++. 
T Consensus 207 ~~~~~~~~~~~g~~----~~~vkI~~daI~y~hSGL~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFY 282 (521) T protein:vir:10 207 GATEDNRYNISGNS----NNLVQIPIDAIVYSHSGKVDIDGKTIVGYLHNVIKPANQLKMLEDAMVIYRITRAPERRVFY 282 (521) T ss_pred ccCCCceecCCCCC----CcceeechhheeeecccceeCCCCceeccchhhhHhHHhhHHHHhhHHHHhhhccccceEEE Confidence 53211111111111 123467887777766543 2233667999999999999999999999887665553321111 Q ss_pred ccCcccc---ccccc------------ccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCC Q lcl|NC_019524. 301 SELPSDV---VFGQL------------GMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPG 365 (556) Q Consensus 301 ~~~~~~~---~~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~ 365 (556) =+.+.-. +.++. -....+...+....+++..+..-..+. +| -|.+|+.+.....- T Consensus 283 IDvGnlpk~KAeqYl~~iM~k~kNklVYDa~TGev~ddrk~msMlEDyWLpRRe----Gg------rgTEI~TLpggqnl 352 (521) T protein:vir:10 283 IDVGTMPNKKATQHLNNVMQGLKNRVVYDSSTGKVKNSSNNLAMTEDYWLMRRD----GK------ATTEVSTLPGAQSM 352 (521) T ss_pred EecCCCCchhHHHHHHHHHHhcCceEEEeccCceeccchhhhhhHhhhcccccC----CC------CccceeeccccCCc Confidence 1111111 00000 012222233333444444444444432 22 24444444433222 Q ss_pred ccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccchhHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcC Q lcl|NC_019524. 366 GVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSARAS-----MAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAG 440 (556) Q Consensus 366 ~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R~~-----~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G 440 (556) +. .+-+....+.+=.+|+||-.-|..+ +..|+-.|.+ -+.|.+.+.+.|..|..-|.+++-. 
..+|.| T Consensus 353 ge-m~DV~YF~kkLy~aLnVP~sRl~~e--~~~f~~Gr~~EItRDEikF~KFI~rLR~rFs~~f~~~L~~----qLilKg 425 (521) T protein:vir:10 353 GE-MDDVRWFNRKLYESMKIPLSRLPQE--GAGVTFGAGNDITRDELQFTKYIRGLQQQFEPIFLNPLRT----NLMLKG 425 (521) T ss_pred Ch-HHHHHHHHHHHHHHhCCCccccCCC--CCceecccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHH----hhhhcc Confidence 33 3445566666777899998877555 2334444433 3567777888888887777666443 357778 Q ss_pred CccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHc-----C------CCCH----HHHHHHhCCC Q lcl|NC_019524. 441 NVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKN-----G------LSTY----EAEISRLGGD 505 (556) Q Consensus 441 ~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~-----G------~~s~----~~~~ae~G~D 505 (556) .+.. ..|. ...-...|.-..-.+.-.+||++--..++.. + .-|. +.+++..-.+ T Consensus 426 iit~-eew~----------~i~~~I~~~f~~Dn~f~ElKe~eil~~R~~~l~~~dp~~yvGky~s~dyi~k~ILr~tDee 494 (521) T protein:vir:10 426 KMSV-SEWE----------EQAENIKVVFSKDSYYEEIKDVEILERRVNLVQTLASAEVTGKYLSHEYVMKNILRMSDED 494 (521) T ss_pred CCCH-HHHH----------HHhhcceEEeeecchHHHHHHHHHHHHHHHHHHhhcCccccccccchHHHHHHHhcCCHhH Confidence 7742 1111 1112345555666677888887655544432 1 0111 1112222233 Q ss_pred HHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCC Q lcl|NC_019524. 506 FREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDN 549 (556) Q Consensus 506 ~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~ 549 (556) .++...|++.|. +.|+-. 
+++++.+|- T Consensus 495 ik~~~k~I~~E~----~~~~~~-------------~p~~e~~df 521 (521) T protein:vir:10 495 IKTEREKIDGEL----KDSVYK-------------NPEDPMEEF 521 (521) T ss_pred HHHHHHHHHHhh----hCCCCC-------------CCcchhhcC Confidence 333333333332 122211 111122222 No 234 >protein:vir:80453 Length: 535 # NCBI annotation: BcepGomrgp05 # Family: family:all:584 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210225;genbank:gi:146329917;genbank:GeneID:5123562 Probab=90.52 E-value=0.02 Score=29.85 Aligned_cols=450 Identities=12% Similarity=0.055 Sum_probs=176.0 Q ss_pred hhhhHHHHHhhHhhc--ccchhh-hh-----hh-h---cchhc-------------cccCC-----CcccccccCCCCCH Q lcl|NC_019524. 5 KKTTRTRAKKAVDVV--AETATA-TP-----MA-V---GGGME-------------GAERT-----TREMFQWNPSIISP 54 (556) Q Consensus 5 ~~~~r~~a~~a~~~~--~~~~~~-~~-----~~-~---~~~y~-------------aa~~~-----~r~~~~w~~~~~s~ 54 (556) -+.+|..+++.++-. .++.++ .+ ++ . ...|. ++... .+.+..|.....+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~dV~~~hp~y~a~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~~~~~- 79 (535) T protein:vir:80 1 MARKRTTIRRDVQSKVLIPPQAPPTSGLGPSLPNVGYQRVEFGEMLPKWRKIMDCLSGQEAIKAKREEYLPMPSVDSRD- 79 (535) T ss_pred CCcchhhhhhhhhhhcccCCCCcCCCCCCCCCCCCCcCCHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCcccCC- Confidence 223344444332111 000000 00 00 0 00010 00000 0011111000000 Q ss_pred HHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccc Q lcl|NC_019524. 55 DQQIAQNQDMASARAQDMVQNDGYAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENW 134 (556) Q Consensus 55 ~~~i~~~~~~lr~RaRdl~rNn~~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~ 134 (556) .+.......-.+|| +--.+++..++..||.=|+--+..+ +-..++. 
|+++ T Consensus 80 -~E~~~~Y~~rl~rA--------~~~n~~~~tl~~l~G~vfrk~p~~~-------------~p~~l~~----l~~d---- 129 (535) T protein:vir:80 80 -EEQRRRYETYLQRA--------IFYNVTARTLDGMMGQVFSRDPIRQ-------------LPPALEA----IVED---- 129 (535) T ss_pred -cCCHHHHHHHHhhc--------cCCChhHHHHHHHhchhhcCCccee-------------ccHHHHH----HHhc---- Confidence 01011111111111 1122344445555554333222111 1233333 4433 Q ss_pred eehhcccCHHHHHHHHhhhheecCceEEEEeeccCCCCc-----CCCcccceEEEEEchhhcCC-CCCCCCCc------e Q lcl|NC_019524. 135 FDARRMCTLTGLTRLAVSGFLMTGEVLATCEWLNPTGTT-----MQRRPFGTAIQMISPYRMSN-PNNVMDTP------N 202 (556) Q Consensus 135 cD~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~~~-----~~~~~~~l~lq~ie~drl~~-~~~~~~g~------~ 202 (556) ||-.|. +++++.+.+++..+..|=|+++..+....... .-+...|+ +.+|.|+.|-+ .....+|. . T Consensus 130 ~D~~G~-~L~~f~~~~~~~~l~~G~~~iLVD~P~~~~~~t~ade~~~~~rPy-~~~y~ae~IinW~~~~v~G~~~Lt~v~ 207 (535) T protein:vir:80 130 IDGEGV-SLDQQAKKALGYTMGFGRAAIFTDYPNVGRPVTVLEQKLGLYRPT-ITLVHPTSIINWRTKLVGGKSVISLVV 207 (535) T ss_pred cCCCCC-CHHHHHHHHHHHHHhcCeEEEEEeecCCCCcccHHHHHhcCCCcE-EEEechhhccCccccccCCccceeEEE Confidence 899998 99999999999999999999888764322110 00011232 77888888853 22222221 1 Q ss_pred EEEEEEEC--CCCCeEE--EEEeec------------CCC-ccccC----------CccccceeeccccCChhHeEeeec Q lcl|NC_019524. 203 LRSGVQLD--NNGAALG--YWLRKA------------FPG-DPTDM----------EQWKWGYEPARFDWGRRRVIHIIE 255 (556) Q Consensus 203 i~~GIE~d--~~Gr~va--Y~i~~~------------hpg-d~~~~----------~~~~~~rv~~~~~v~a~~viH~f~ 255 (556) |+.=|..+ .+|.-.- |.+... .++ +.... ....+..|| ++. +. T Consensus 208 lrE~~~~~dd~f~~~~~~q~RvL~~~~~G~y~v~~~~~~~~~~~~~~~~~~~~~~~g~~~l~~IP---------fv~-~~ 277 (535) T protein:vir:80 208 IQENVLAQDDGFETTYVQQWRVLQLNAEGNYQVERWRRETQEEMYYSYSKHVPTDGNGNPFKEIP---------FQF-IG 277 (535) T ss_pred EEEEEEecCCCcccceeEEEEEEEecCCceEEEEEEEeecCCccccccceeecccCCCcccCeeE---------EEE-ee Confidence 22322221 2222111 222211 011 00000 111222222 111 23 Q ss_pred ccCCCcccCCchhhHHHHH-HHHHHHHHHHHHHHHHHhccee-eeEeccCcccccccccccccccccccccccccccccc Q lcl|NC_019524. 
256 ALLAGQTRGISEMVSALKQ-MKMTRNFQEITLQNAVVNATYA-ASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANY 333 (556) Q Consensus 256 ~~r~gQ~RGvs~la~~l~~-l~~l~~~~dael~~a~i~A~~~-~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 333 (556) ..+-+=.-|.|+|..+... +++...- |.+....--+++. .||+ ....... .. T Consensus 278 ~~~~~~~~~~pPLl~LA~lni~Hy~~s--sd~~~il~~~~~P~l~i~-G~~~~~~-----------------------~~ 331 (535) T protein:vir:80 278 PLDNNADIDHPPLLDLCEVNIGHYRNS--ADYEEMAFVAGQPTAFFT-GLTKDWV-----------------------ED 331 (535) T ss_pred cCCCCCCCCccchHHHHHHHHHHhhch--hHHHHHHHHhcCceeeee-cCchhhh-----------------------hc Confidence 4455666777777653322 2222211 1233223333333 4443 1111000 00 Q ss_pred cccccceecCCceeeecCCCceeeeecCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccchhHHHHHHHHHHHH Q lcl|NC_019524. 334 VAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSARASMAETQKYM 413 (556) Q Consensus 334 ~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R~~~~e~~r~~ 413 (556) ......+.+++..+..|+.|-+++++.....+........-.-+++..|..+ +.... .|=++. ++.+++.+.. T Consensus 332 ~~~~~~i~iG~~~~~~lP~~~~~~~~e~~~~~~a~~~l~~~e~qM~~lGa~l----l~~~~--~~~Ta~-~a~~~~~~~~ 404 (535) T protein:vir:80 332 VFKDFKVHLGSRAIIPLPQGATAGILQITPNSVPFEAMTHKESQMIAMGANL----LVKSG--GNRTFG-EAQQEEASEQ 404 (535) T ss_pred CCCCcceEecCcccccCCCCCCcceeeeccchhHHHHHHHHHHHHHHHHHHh----hccCc--ccccHH-HHHHHHHHHh Confidence 1112235688889999999999998887654444433222222222333222 22221 122122 3334444444 Q ss_pred HHHHHH--HHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcC Q lcl|NC_019524. 414 DSRKKL--VADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNG 491 (556) Q Consensus 414 ~~~q~~--lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G 491 (556) ..++.. -++..++-+++++ |...|....+....+. ++-+|+- .-+||. 
++++..+++..| T Consensus 405 S~L~~~a~~le~al~~aL~~~---A~w~G~~~~~~~~~i~-----------~n~dF~~---~~ld~~-~~~all~~~~~G 466 (535) T protein:vir:80 405 SILSACTKNVSMAFRKALRWA---NQFQTGIVNDETVEYN-----------LNTDFPA---ARLTPN-ERAELILEWQQG 466 (535) T ss_pred HHHHHHHHHHHHHHHHHHHHH---HHHcCCccCCCceEEE-----------ecccccc---ccCCHH-HHHHHHHHHhcC Confidence 444332 1333333344422 2233432222211111 1112221 224674 999999999999 Q ss_pred CCCHHHHHHHh---CC-----CHHHHHHHHHHHHHHH-HHcCCCCCccccccCCCCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 492 LSTYEAEISRL---GG-----DFREVFKQRAREEGLI-KSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 492 ~~s~~~~~ae~---G~-----D~e~v~~q~a~E~~~~-~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 556 (556) .-|.+....+. |. ++++....+..|.... ...|++.+.. .+..+..+ .++.+.-.+| T Consensus 467 ~Is~et~~~~L~r~gvl~~~~~~eee~~ri~~E~~~~~~~~g~~~d~~------~~g~~~~~--~~~~~~~~~~ 532 (535) T protein:vir:80 467 AITFKEMRAGLRRAGVASEDDAKAETEGKATVEFIAKTAAAGKVGDAA------SGGTNKAK--LNNGNGGGNQ 532 (535) T ss_pred CCCHHHHHHHHHhCCCCCcccchHHHHHHHHhhhhhccccCCCCCCCC------CCCCCcCc--ccCCcccccc Confidence 99988776553 43 4455444444443221 1223332211 11111111 1112222233 No 235 >protein:vir:95254 Length: 488 # NCBI annotation: Phage conserved protein # Family: family:all:2372 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944885;genbank:gi:158267601;genbank:GeneID:2744039 Probab=90.34 E-value=0.021 Score=29.74 Aligned_cols=443 Identities=11% Similarity=0.025 Sum_probs=168.0 Q ss_pred chhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHH Q lcl|NC_019524. 4 VKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVV 83 (556) Q Consensus 4 ~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v 83 (556) |.....+. .-..+..+ ..-.++|..+.. . 
+.....+.+++ +.....--+++.+ ++++.+++ T Consensus 1 ~~~~~~~~---------~gl~p~rl-~~i~~~~~~~~~---~---~~~~~~~~~Lr--~~~~~~ly~~m~~-D~hi~s~l 61 (488) T protein:vir:95 1 MADITETQ---------ESLPPFRM-GEVGSLGLKVKN---G---RIYEEPRQALR--FPESIKTFQLMMR-DPAVAASV 61 (488) T ss_pred CCCccccC---------CCCCHHHH-HHHHHHhhcccc---c---hhhccchhhhc--ccchHHHHHHHhh-ChHHHHHH Confidence 22211111 11111111 111223322111 0 11112333332 2334455677776 99999999 Q ss_pred HHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEE Q lcl|NC_019524. 84 AVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLAT 163 (556) Q Consensus 84 ~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~ 163 (556) ++...-|.|--+++.+.-+. -.+..+.+..+.++..++. ...+|.++..-++- .+.-|=+++- T Consensus 62 ~~Rk~av~~~~w~v~p~~~~----~~d~~~~~~a~~v~~~l~~------------~~~~~~~~i~~~ld-a~~~G~s~~E 124 (488) T protein:vir:95 62 NIIKMFVRKVNWRFVPPKGK----EQDPKMLERADFFNSLMDD------------MEHDWADFINSVMS-FCTYGFCVNE 124 (488) T ss_pred HHHHHHHhcCCceEecCCCC----chhHHHHHHHHHHHHHHhc------------cCccHHHHHHHHHH-hhcccceeee Confidence 99999999987777653211 0111223333444333332 23467777777664 4666888888 Q ss_pred EeeccCCCCc-------CCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccCCcccc Q lcl|NC_019524. 164 CEWLNPTGTT-------MQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKW 236 (556) Q Consensus 164 ~~~~~~~~~~-------~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~ 236 (556) ++|....+.. .+|.-.+-+|....++ ...-+.||.+|+.+-. ....+++.. 
......| T Consensus 125 ivw~~~~~~~~~~~~~~~dg~~~~~~i~~Rpq~-------------~~~~f~~d~d~~l~~~-~~~~~~~~~-~~~~~~~ 189 (488) T protein:vir:95 125 KVYKKRQGKKGKYQSKFDDGLIGWAKLPIRNQS-------------TLDKWYFDEDFRRVTG-VRQNLRNVS-HIAGAIN 189 (488) T ss_pred eeeeccccccccccccccCCeeeeeeeeecCcc-------------cccceeeccCCCceee-ccccccccc-ccccccc Confidence 8886532210 0111111122111111 1124566666654321 111222211 0000000 Q ss_pred -ceeeccccCChh-HeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHH-HHhccee-eeEeccCccccccccc Q lcl|NC_019524. 237 -GYEPARFDWGRR-RVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNA-VVNATYA-ASVESELPSDVVFGQL 312 (556) Q Consensus 237 -~rv~~~~~v~a~-~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a-~i~A~~~-~fi~~~~~~~~~~~~~ 312 (556) ...+....+|.. -|+|.+..+ .|..-|.+.|.++.-...--.......+.-. |-..-+. +......... T Consensus 190 ~~~~~~~~~lP~~kfi~~~~~~~-~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~Er~g~g~p~~~~p~~~~~~------ 262 (488) T protein:vir:95 190 LGERPLTRKLPRAKFMLFKYDDE-YGNPEGRSPLLNAYVPWKYKVQIEEYEAVGVSRDLVGMPKIGLPPDYLDE------ 262 (488) T ss_pred cccccccccccccceEEEeecCC-CCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeEeeccCCCCC------ Confidence 001111234333 467776654 8888888888776544322211111111111 1111111 1111110000 Q ss_pred ccccccccccccccc-cccccccccccceecCCceeeecCCCce---------eeeecCCCC-CccHHHHHHHHHHHHHH Q lcl|NC_019524. 313 GMGQGGFKEIFNEYM-TGLANYVAQTKNIAIDGAKIPHLYPGTK---------LKMQPAGTP-GGVGTDYEQSLLRNIAA 381 (556) Q Consensus 313 ~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~l~pG~i~~L~pGe~---------i~~~~~~~p-~~~f~~F~~~~lr~iaa 381 (556) .........-... ....+... ....|. .++-|.+ ++....... ...|..+.+.+=++|+. 
T Consensus 263 --~~~~e~~~l~~a~~~i~~~~~~-----~~~ag~--iiP~g~~~~~k~~~~e~~l~~~~~~~~~~~~~li~~~d~~Isk 333 (488) T protein:vir:95 263 --NAEPEKKAFVQYCKTVVNDMIA-----NDRAGL--IWPRYIDPDTKEDIFEFSLVSRQGAKAYDTGSIIDRYSKQIMM 333 (488) T ss_pred --cccHHHHHHHHHHHHHHHHhhc-----cchhhe--eeccccccccchhhhhhhccccccCCchhHHHHHHHHHHHHHH Confidence 0000000000000 00000000 000121 2333333 333333322 23477778888888887 Q ss_pred hcCCCHHHhhc-hhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHH Q lcl|NC_019524. 382 SLGMSYEQFSR-DYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRD 460 (556) Q Consensus 382 glGi~ye~l~~-D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~ 460 (556) .+--. .||. +=.+.|||.+-.-...+....+.....+...+-+-+..+++. ++ .+...+. T Consensus 334 ~iLGq--tLT~~~~~~Gs~Al~~vh~ev~~~i~~aDa~~i~~tln~~li~~l~~---~N----fg~~~~~---------- 394 (488) T protein:vir:95 334 AFMSD--VLAMGQSKYGSFSLADSKTSLLAMSVDILLKQIKNVINRDLVAQTYA---LN----MWDDEEH---------- 394 (488) T ss_pred HHhcc--ccccccCcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---hc----CCCCCCc---------- Confidence 75322 2332 222335544433333333344444445555554445554433 22 1110000 Q ss_pred HhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHH----HHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCccccccCC Q lcl|NC_019524. 461 ALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYE----AEISRLGGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNS 536 (556) Q Consensus 461 a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~----~~~ae~G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~ 536 (556) ..+....-...|-.+-+++....+.+|+.-.. +.++ +++||+.+........ T Consensus 395 ----P~~~~~~~e~~Dl~~~ae~~~~L~~~G~~i~~~~~~~~i~--------------------e~~gip~~~~~e~~~~ 450 (488) T protein:vir:95 395 ----VQITYDDIETPDLEAIGSYIQKTVAVGALEVDKELSNKLR--------------------EHIGLPPADESQPVSE 450 (488) T ss_pred ----cEEEecCcChhhHHHHHHHHHHHHhCCCccccHHHHHHHH--------------------HHhCCCCCCCCccccc Confidence 01111222334545556677777788876652 2233 3455543221110000 Q ss_pred CCC-----------CCCCCCCCCCCCCcCCC Q lcl|NC_019524. 
537 TQS-----------SNSSESTSDNPNEETTQ 556 (556) Q Consensus 537 ~~~-----------~~~~~~~~~~~~~e~~~ 556 (556) ... ..+.......+..+.++ T Consensus 451 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 481 (488) T protein:vir:95 451 KLSPNSQSRSGDGYKTAGEGTAKTPSAKDPS 481 (488) T ss_pred cCCCCCCCCCCcccCCCcccCCcccccccch Confidence 000 00000011111111111 No 236 >protein:vir:6596 Length: 521 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891727;genbank:gi:33620636;genbank:GeneID:1725288 Probab=90.29 E-value=0.022 Score=29.71 Aligned_cols=468 Identities=10% Similarity=0.054 Sum_probs=224.7 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCC-----c--ccccccCCCCCHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTT-----R--EMFQWNPSIISPDQQIAQNQDMASARAQDMV 73 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~-----r--~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~ 73 (556) |++-....-..-...+.-.+...+ ..-..+||.--. . ...+..-...+.+. ...+-..|..+=|.| T Consensus 8 ~~~~~~~d~~~~~e~~~~~~~s~~-----~p~~~dGa~~i~~~~~~~~~~~~g~~~~~~~~e~-~~~~~~eLI~~YR~m- 80 (521) T protein:vir:65 8 LARWADFDNDKYEEQIKDKAESIA-----APKNNDGATEVEINDNSPASSWNSLTQQFYSTDQ-KISTTKQLVNTYRGL- 80 (521) T ss_pred hhhccCchhhHHHhhhccCCCccc-----CCCCCCCceeecccCCccccccccceeeeccccc-hhhhHHHHHHHHHHH- Confidence 333332221111111111111100 011233332110 0 00111111111111 234567889999999 Q ss_pred hcChHHHHHHHHHHhhhccC--CceeeeeccccccCCCh-hHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHH Q lcl|NC_019524. 74 QNDGYAAGVVAVHRDSIVGS--QYKLNAKPNTIVLGAPD-GWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLA 150 (556) Q Consensus 74 rNn~~a~~~v~~~~~nvVG~--Gi~~~~~~~~~~lg~~~-~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~ 150 (556) .++|-+-+||+-|++-+|=. +=.|. .+.++. +..+.+.+.|...|+.-. ..++|..--.-. 
T Consensus 81 a~~pEvd~Av~eIVneaiv~d~~~~pV------~l~L~~~~~s~~iK~kI~eeF~~Il----------~ll~F~~~~~~~ 144 (521) T protein:vir:65 81 MNNHEVENAVQNIVNDAIVFEEGHEVV------SLNLEATGFSESVKERIHEEFKDLL----------NTIQFDRRGQDM 144 (521) T ss_pred hhccchhhHHHHhhcceeEecCCCceE------EEEecccccchHHHHHHHHHHHHHH----------HHhccchhhhHH Confidence 78999999999998887742 21121 122222 235666777888887644 346677777788 Q ss_pred hhhheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccc- Q lcl|NC_019524. 151 VSGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPT- 229 (556) Q Consensus 151 ~r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~- 229 (556) +|.|-+||-+|..++..+. ++.|- ..|..|+|-.+..-..-.. .-..|+++ .+.-.-||++. |++.. T Consensus 145 fR~WYVDgRi~fhkiid~~---pk~GI---~ELr~lDPr~i~~vr~i~k--~~~~~~~v--~~~~~e~f~Y~--~~~~~~ 212 (521) T protein:vir:65 145 FRRWYVDSRIFFHKIIGKN---PKDGI---VELRQLDPRNLEYVREIIT--EDTPEGKI--YKATKEYFIYT--VGNSSY 212 (521) T ss_pred HhhhhhcceeEEEEEEcCC---ccccc---eeeeeeCCcceeeeeeecc--cccCCcce--ecceeeeeeee--cCCcce Confidence 9999999999988876432 12222 3677888887753211110 01123333 44566788884 33322 Q ss_pred cCCccccceeeccccCChhHeEeeecccCCCcccC--CchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccc Q lcl|NC_019524. 230 DMEQWKWGYEPARFDWGRRRVIHIIEALLAGQTRG--ISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDV 307 (556) Q Consensus 230 ~~~~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~RG--vs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~ 307 (556) ...++.+.-- ...+++.+-|..+.... ..=..| +|.|+++++.+.+|.=.+||.++=...-|-=--++.=+.+.-. 
T Consensus 213 ~~~g~~~~~~-~~vkI~~dAI~y~hSGl-~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlP 290 (521) T protein:vir:65 213 CAGGQVFSPN-SRVKIPRSAITYAHSGL-MDCDDKYIIGYLHRAVKPANQLKLLEDAMVVYRITRAPERRVFFIDTGNMN 290 (521) T ss_pred eccceeecCC-cceeechhheeeeeccc-eeCCCCeeeecchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCC Confidence 1111111100 11345555444443333 333444 7999999999999999999998876665533211111111111 Q ss_pred ---ccccc------------ccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHH Q lcl|NC_019524. 308 ---VFGQL------------GMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYE 372 (556) Q Consensus 308 ---~~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~ 372 (556) +.+.. -+...+...+..-.+++..+..-..+. +| .|.+|+.+.....-+. .+-+ T Consensus 291 k~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~msMlEDyWLpRRe----Gg------rgTEItTLpGgqnlge-m~DV 359 (521) T protein:vir:65 291 NRKAAQHMNSVAQSFKNRVVYDASTGKLKNQQANLSMTEDYWLQRRD----GK------AITDVTTLPGASGMSD-IDDI 359 (521) T ss_pred chhHHHHHHHHHHhcCceeEeecccccccccccccchhhhhcccccC----CC------CccceeecccCCCcCh-HHHH Confidence 00000 011222222333334444444443332 22 2444444443322223 3445 Q ss_pred HHHHHHHHHhcCCCHHHhhchhhcccchhHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCC Q lcl|NC_019524. 373 QSLLRNIAASLGMSYEQFSRDYTKTNYSSARAS-----MAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPG 447 (556) Q Consensus 373 ~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R~~-----~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~ 447 (556) ....+.+=.+|+||-.-+.-+- +.+++-.|.+ -+.|.+.+.+.|..|..-|.+++-. ..+|.|.+.. .. 
T Consensus 360 ~YF~kkLy~aLnVP~sRl~~e~-~~~~~~gr~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~----qLilKgiit~-ee 433 (521) T protein:vir:65 360 RYFNRKLYEALRVPLSRSNLSD-ANMVIGGDGSEITRDELEFSKFIRTLQSQFSEVLRDPLKY----NLILKNVITE-DD 433 (521) T ss_pred HHHHHHHHHHhCCCceeccCCC-CcceeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHH----hhhhhcCCCH-HH Confidence 5566666778999977652221 2344433333 4567888888888887777766444 2567777642 21 Q ss_pred cccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHc-----CCC----CHH----HHHHHhCCCHHHHHHHHH Q lcl|NC_019524. 448 KNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKN-----GLS----TYE----AEISRLGGDFREVFKQRA 514 (556) Q Consensus 448 ~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~-----G~~----s~~----~~~ae~G~D~e~v~~q~a 514 (556) |+ ...-...|.-..-.+.-.+||++--..++.. +.. |.+ .+++..-.+.++...|++ T Consensus 434 w~----------~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~S~dyi~k~ILr~tDeei~~~~k~I~ 503 (521) T protein:vir:65 434 WD----------REINNIKVVFHRDSYYTEVKDAEILERRIGLIERITPYIGKYFSNQTVMRDILKYTDDQMDTEKKQIE 503 (521) T ss_pred HH----------HHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHHHHHHHHHHHH Confidence 11 1112345555666677888887755554432 100 221 122222233333333333 Q ss_pred HHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCC Q lcl|NC_019524. 515 REEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDN 549 (556) Q Consensus 515 ~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~ 549 (556) .|. +.|+-. +++++.++- T Consensus 504 ~E~----~~~~~~-------------~p~~~~~~f 521 (521) T protein:vir:65 504 EEA----NDPRFK-------------QTPDEIEDF 521 (521) T ss_pred Hhh----hCCCCC-------------CCcccccCC Confidence 332 122211 111122222 No 237 >protein:vir:101806 Length: 516 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238883;genbank:gi:66391958;genbank:GeneID:3416633 Probab=90.27 E-value=0.022 Score=29.70 Aligned_cols=458 Identities=12% Similarity=0.093 Sum_probs=218.5 Q ss_pred CCcchhhhHHH------HHhhHhhcccchhhhhhhhcchhccccC---------CCcccccccCCCCCHHHHHHHHHHHH Q lcl|NC_019524. 
1 MKDVKKTTRTR------AKKAVDVVAETATATPMAVGGGMEGAER---------TTREMFQWNPSIISPDQQIAQNQDMA 65 (556) Q Consensus 1 ~sp~~~~~r~~------a~~a~~~~~~~~~~~~~~~~~~y~aa~~---------~~r~~~~w~~~~~s~~~~i~~~~~~l 65 (556) |+=.+-..-.. ......-.+...+ ..-..+||.- .+..++.+.. .+.. ..+-..| T Consensus 1 ~~~~~lf~f~~~~d~~~~~~~~~~~~~s~~-----~p~~~dGa~~i~~~~~~~~~~g~~~~~~~----~~~~-~~~~~eL 70 (516) T protein:vir:10 1 MKFLDLFKFWDRVDQNEYDERLKLGHESIA-----TPKKDDGATEIETREGEATYNAVMQQFFG----IDNN-ISGTKDL 70 (516) T ss_pred CCchHhcccccchhhhHHhhhhcCCcCccc-----CCCCCCCceeeecCCCcccccceeeeeec----cccc-cchHHHH Confidence 11110000000 0000000000000 0011222210 0111111111 1111 2245778 Q ss_pred HHHHHHHHhcChHHHHHHHHHHhhhccC--CceeeeeccccccCCCh-hHHHHHHHHHHHHHHHHhcccccceehhcccC Q lcl|NC_019524. 66 SARAQDMVQNDGYAAGVVAVHRDSIVGS--QYKLNAKPNTIVLGAPD-GWGEEFQEVVEARFNMAAESPENWFDARRMCT 142 (556) Q Consensus 66 r~RaRdl~rNn~~a~~~v~~~~~nvVG~--Gi~~~~~~~~~~lg~~~-~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~ 142 (556) ..+=|.| .++|-+-+||+-|++-+|=. +=.|. .+.++. +..+.+.+.|...|+.-. ..++ T Consensus 71 I~~YR~m-a~~pEvd~Av~eIVneaiv~d~~~~pV------~l~L~~~~~s~~ik~kI~eeF~~Il----------~ll~ 133 (516) T protein:vir:10 71 INTYRQL-INNPEVERAVANIVNEAIVYERGHKVV------SLDLDDTDFGSNVKEKILEEFDEVC----------RLLD 133 (516) T ss_pred HHHHHHH-hhccchhhHHHHhhcceeEecCCCceE------EEEecccCcchHHHHHHHHHHHHHH----------HHhc Confidence 8899999 78999999999998887742 21121 122222 234666777888887643 2456 Q ss_pred HHHHHHHHhhhheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCC----CCCCCCceEEEEEEECCCCCeEEE Q lcl|NC_019524. 143 LTGLTRLAVSGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNP----NNVMDTPNLRSGVQLDNNGAALGY 218 (556) Q Consensus 143 f~~lq~l~~r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~----~~~~~g~~i~~GIE~d~~Gr~vaY 218 (556) |..--.-.+|.|.+||-+|..+...++.. 
|- ..|..|+|-.+..= ....+|..+..|+ .-| T Consensus 134 F~~~~~~~fR~WYVDgRi~fhKiid~~k~----GI---~Elr~lDPr~i~~vR~i~~~~~~~~~v~~~~--------~e~ 198 (516) T protein:vir:10 134 ASRKLDTLFRRWYVDSRIFFHKIMPNPKK----GI---AELRRLDPRFMEYYREIVTSDIGGTTIVKGY--------REF 198 (516) T ss_pred cchhhhHHHhhhhhcceEEEEEEecCccc----cc---eeeeeeCCcceeeEeeecccccccchhhhhh--------hhe Confidence 77777788999999999998876543322 21 36777888777421 1112222222222 246 Q ss_pred EEeecCCCccc-cCCccccceeeccccCChhHeEeeeccc---CCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_019524. 219 WLRKAFPGDPT-DMEQWKWGYEPARFDWGRRRVIHIIEAL---LAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNAT 294 (556) Q Consensus 219 ~i~~~hpgd~~-~~~~~~~~rv~~~~~v~a~~viH~f~~~---r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~ 294 (556) |++. ||+.. ...+..+. -....+++.+-|.|.-... .-+.+ +|.|+++++.+.+|.=.+||.++=...-|- T Consensus 199 ~~Y~--~~~~~~~~~g~~~~-~~~~ikI~~dAI~y~hSGL~d~~~~~i--~syLhkAiKp~NQLkm~EDAlVIYRitRAP 273 (516) T protein:vir:10 199 FIYT--TGNEGYSYNGRIFE-PNTRIKIPRSAVVYASSGLMDCSDRGI--IGYLHNAVKPANQLKLLEDAMVIYRITRAP 273 (516) T ss_pred eeec--cCccccccccceeC-CCcceeechhheeeecccceeCCCCce--eeeehhhhHhHHhhHHHHhhHHHHhhhccc Confidence 6665 33321 11111100 0011356677676665432 23333 899999999999999999999887666553 Q ss_pred eeeeEeccCcccc---ccccc------------ccccccccccccccccccccccccccceecCCceeeecCCCceeeee Q lcl|NC_019524. 295 YAASVESELPSDV---VFGQL------------GMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQ 359 (556) Q Consensus 295 ~~~fi~~~~~~~~---~~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~ 359 (556) =--++.=+.+.-. +.+.. -....+...+....+++..+..-..+. 
+| .|.+|+.+ T Consensus 274 eRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~msMlEDyWLpRRe----Gg------rgTEItTL 343 (516) T protein:vir:10 274 ERRVFYIDVGNMNNRKATEYVNGIMQSLKNRVVYDSNTGTVKNQKRNLSMTEDYWLMRRD----GK------SVTEVSSL 343 (516) T ss_pred cceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccchhhhhhHhhhcccccC----CC------Cccceeec Confidence 3211111111111 00000 012222233334445555444444442 22 24444444 Q ss_pred cCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhh-cccc---hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 360 PAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYT-KTNY---SSARASMAETQKYMDSRKKLVADRFASAIYTLWLEE 435 (556) Q Consensus 360 ~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s-~~nY---Ss~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~ 435 (556) .....-+. .+-+....+.+=.+|+||-.-|..|=. +.++ |-+----+.|.+.+.+.|..|..-|.+++-. . T Consensus 344 pGgqnlge-m~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~----q 418 (516) T protein:vir:10 344 PGAQTMGD-MDDVRWFNKKLYEALRIPLSRIPRDDGGMVIGGQDTAITRDELDFRKFVVQLQHDFEEIFLDPLKT----N 418 (516) T ss_pred cccCCcCh-HHHHHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHH----h Confidence 43322233 344555666677789999887754421 1111 2222234678888889999888777776544 3 Q ss_pred HHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHc---------CCCCHHHHHHH----h Q lcl|NC_019524. 436 EVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKN---------GLSTYEAEISR----L 502 (556) Q Consensus 436 a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~---------G~~s~~~~~ae----~ 502 (556) .+|.|.+.. ..|. ...-...|.-..-.+.-.+||++--..++.. ..-|.+-+.++ . 
T Consensus 419 LilKgiit~-eew~----------~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~t 487 (516) T protein:vir:10 419 LIYKRIITE-DEWD----------EQINNIKVNFHQDSYYTELKDIETLRLRVDALSQIEPYVGKYVSHDYVMKNILQMT 487 (516) T ss_pred hhhccCCCH-HHHH----------HHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhcCC Confidence 577787742 1111 1112345555666677888887655544432 11222222222 1 Q ss_pred CCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCC Q lcl|NC_019524. 503 GGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDN 549 (556) Q Consensus 503 G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~ 549 (556) -.+..+...|++.|. +.|+-.+ ++++++- T Consensus 488 Deei~~e~k~I~~E~----~~~~~~~--------------p~~~~~f 516 (516) T protein:vir:10 488 EEQIAQEEKQIEQEA----GIKRFQN--------------PENEDDF 516 (516) T ss_pred HhhHHHHHHHHHHhh----hCCCCCC--------------CCccccC Confidence 222222333333331 2222111 1111111 No 238 >protein:vir:101189 Length: 516 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932511;genbank:gi:37651637;genbank:GeneID:2610682 Probab=90.27 E-value=0.022 Score=29.70 Aligned_cols=458 Identities=12% Similarity=0.093 Sum_probs=218.5 Q ss_pred CCcchhhhHHH------HHhhHhhcccchhhhhhhhcchhccccC---------CCcccccccCCCCCHHHHHHHHHHHH Q lcl|NC_019524. 1 MKDVKKTTRTR------AKKAVDVVAETATATPMAVGGGMEGAER---------TTREMFQWNPSIISPDQQIAQNQDMA 65 (556) Q Consensus 1 ~sp~~~~~r~~------a~~a~~~~~~~~~~~~~~~~~~y~aa~~---------~~r~~~~w~~~~~s~~~~i~~~~~~l 65 (556) |+=.+-..-.. ......-.+...+ ..-..+||.- .+..++.+.. .+.. 
..+-..| T Consensus 1 ~~~~~lf~f~~~~d~~~~~~~~~~~~~s~~-----~p~~~dGa~~i~~~~~~~~~~g~~~~~~~----~~~~-~~~~~eL 70 (516) T protein:vir:10 1 MKFLDLFKFWDRVDQNEYDERLKLGHESIA-----TPKKDDGATEIETREGEATYNAVMQQFFG----IDNN-ISGTKDL 70 (516) T ss_pred CCchHhcccccchhhhHHhhhhcCCcCccc-----CCCCCCCceeeecCCCcccccceeeeeec----cccc-cchHHHH Confidence 11110000000 0000000000000 0011222210 0111111111 1111 2245778 Q ss_pred HHHHHHHHhcChHHHHHHHHHHhhhccC--CceeeeeccccccCCCh-hHHHHHHHHHHHHHHHHhcccccceehhcccC Q lcl|NC_019524. 66 SARAQDMVQNDGYAAGVVAVHRDSIVGS--QYKLNAKPNTIVLGAPD-GWGEEFQEVVEARFNMAAESPENWFDARRMCT 142 (556) Q Consensus 66 r~RaRdl~rNn~~a~~~v~~~~~nvVG~--Gi~~~~~~~~~~lg~~~-~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~ 142 (556) ..+=|.| .++|-+-+||+-|++-+|=. +=.|. .+.++. +..+.+.+.|...|+.-. ..++ T Consensus 71 I~~YR~m-a~~pEvd~Av~eIVneaiv~d~~~~pV------~l~L~~~~~s~~ik~kI~eeF~~Il----------~ll~ 133 (516) T protein:vir:10 71 INTYRQL-INNPEVERAVANIVNEAIVYERGHKVV------SLDLDDTDFGSNVKEKILEEFDEVC----------RLLD 133 (516) T ss_pred HHHHHHH-hhccchhhHHHHhhcceeEecCCCceE------EEEecccCcchHHHHHHHHHHHHHH----------HHhc Confidence 8899999 78999999999998887742 21121 122222 234666777888887643 2456 Q ss_pred HHHHHHHHhhhheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCC----CCCCCCceEEEEEEECCCCCeEEE Q lcl|NC_019524. 143 LTGLTRLAVSGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNP----NNVMDTPNLRSGVQLDNNGAALGY 218 (556) Q Consensus 143 f~~lq~l~~r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~----~~~~~g~~i~~GIE~d~~Gr~vaY 218 (556) |..--.-.+|.|.+||-+|..+...++.. 
|- ..|..|+|-.+..= ....+|..+..|+ .-| T Consensus 134 F~~~~~~~fR~WYVDgRi~fhKiid~~k~----GI---~Elr~lDPr~i~~vR~i~~~~~~~~~v~~~~--------~e~ 198 (516) T protein:vir:10 134 ASRKLDTLFRRWYVDSRIFFHKIMPNPKK----GI---AELRRLDPRFMEYYREIVTSDIGGTTIVKGY--------REF 198 (516) T ss_pred cchhhhHHHhhhhhcceEEEEEEecCccc----cc---eeeeeeCCcceeeEeeecccccccchhhhhh--------hhe Confidence 77777788999999999998876543322 21 36777888777421 1112222222222 246 Q ss_pred EEeecCCCccc-cCCccccceeeccccCChhHeEeeeccc---CCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_019524. 219 WLRKAFPGDPT-DMEQWKWGYEPARFDWGRRRVIHIIEAL---LAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNAT 294 (556) Q Consensus 219 ~i~~~hpgd~~-~~~~~~~~rv~~~~~v~a~~viH~f~~~---r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~ 294 (556) |++. ||+.. ...+..+. -....+++.+-|.|.-... .-+.+ +|.|+++++.+.+|.=.+||.++=...-|- T Consensus 199 ~~Y~--~~~~~~~~~g~~~~-~~~~ikI~~dAI~y~hSGL~d~~~~~i--~syLhkAiKp~NQLkm~EDAlVIYRitRAP 273 (516) T protein:vir:10 199 FIYT--TGNEGYSYNGRIFE-PNTRIKIPRSAVVYASSGLMDCSDRGI--IGYLHNAVKPANQLKLLEDAMVIYRITRAP 273 (516) T ss_pred eeec--cCccccccccceeC-CCcceeechhheeeecccceeCCCCce--eeeehhhhHhHHhhHHHHhhHHHHhhhccc Confidence 6665 33321 11111100 0011356677676665432 23333 899999999999999999999887666553 Q ss_pred eeeeEeccCcccc---ccccc------------ccccccccccccccccccccccccccceecCCceeeecCCCceeeee Q lcl|NC_019524. 295 YAASVESELPSDV---VFGQL------------GMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQ 359 (556) Q Consensus 295 ~~~fi~~~~~~~~---~~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~ 359 (556) =--++.=+.+.-. +.+.. -....+...+....+++..+..-..+. 
+| .|.+|+.+ T Consensus 274 eRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~msMlEDyWLpRRe----Gg------rgTEItTL 343 (516) T protein:vir:10 274 ERRVFYIDVGNMNNRKATEYVNGIMQSLKNRVVYDSNTGTVKNQKRNLSMTEDYWLMRRD----GK------SVTEVSSL 343 (516) T ss_pred cceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccchhhhhhHhhhcccccC----CC------Cccceeec Confidence 3211111111111 00000 012222233334445555444444442 22 24444444 Q ss_pred cCCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhh-cccc---hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 360 PAGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYT-KTNY---SSARASMAETQKYMDSRKKLVADRFASAIYTLWLEE 435 (556) Q Consensus 360 ~~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s-~~nY---Ss~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~ 435 (556) .....-+. .+-+....+.+=.+|+||-.-|..|=. +.++ |-+----+.|.+.+.+.|..|..-|.+++-. . T Consensus 344 pGgqnlge-m~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~----q 418 (516) T protein:vir:10 344 PGAQTMGD-MDDVRWFNKKLYEALRIPLSRIPRDDGGMVIGGQDTAITRDELDFRKFVVQLQHDFEEIFLDPLKT----N 418 (516) T ss_pred cccCCcCh-HHHHHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHH----h Confidence 43322233 344555666677789999887754421 1111 2222234678888889999888777776544 3 Q ss_pred HHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHc---------CCCCHHHHHHH----h Q lcl|NC_019524. 436 EVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKN---------GLSTYEAEISR----L 502 (556) Q Consensus 436 a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~---------G~~s~~~~~ae----~ 502 (556) .+|.|.+.. ..|. ...-...|.-..-.+.-.+||++--..++.. ..-|.+-+.++ . 
T Consensus 419 LilKgiit~-eew~----------~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~t 487 (516) T protein:vir:10 419 LIYKRIITE-DEWD----------EQINNIKVNFHQDSYYTELKDIETLRLRVDALSQIEPYVGKYVSHDYVMKNILQMT 487 (516) T ss_pred hhhccCCCH-HHHH----------HHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhcCC Confidence 577787742 1111 1112345555666677888887655544432 11222222222 1 Q ss_pred CCCHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCC Q lcl|NC_019524. 503 GGDFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDN 549 (556) Q Consensus 503 G~D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~ 549 (556) -.+..+...|++.|. +.|+-.+ ++++++- T Consensus 488 Deei~~e~k~I~~E~----~~~~~~~--------------p~~~~~f 516 (516) T protein:vir:10 488 EEQIAQEEKQIEQEA----GIKRFQN--------------PENEDDF 516 (516) T ss_pred HhhHHHHHHHHHHhh----hCCCCCC--------------CCccccC Confidence 222222333333331 2222111 1111111 No 239 >protein:vir:95315 Length: 559 # NCBI annotation: putative head-to-tail-joining protein # Family: family:all:481 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512261;genbank:gi:89152428;genbank:GeneID:3952984 Probab=88.08 E-value=0.035 Score=28.60 Aligned_cols=480 Identities=12% Similarity=0.034 Sum_probs=180.1 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAA 80 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~ 80 (556) |+ ...+....+..+... ..|..--...-+-+..+-++...+.. .+.+. . ..|.----++.+. 
T Consensus 1 m~---~~~~~~l~~r~~~l~---~~R~~~e~~w~e~~~~~lP~~~~~~~--~~~~~----~------~~~~~~~~dst~~ 62 (559) T protein:vir:95 1 MA---ETTKERLNKQFAQLE---SERQSFEPHWRELSDYINPRGSRFLT--SEVNR----N------DRRNTRIIDSTGT 62 (559) T ss_pred CC---hhhHHHHHHHHHHHH---HHhhHHHHHHHHHHHHhccccCCcCC--CCCCc----c------cccccccccchHH Confidence 33 222221111111110 00100000000111111111111100 00000 0 0011112456677 Q ss_pred HHHHHHHhhhccCCceeeeeccccccCCCh------hHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhh Q lcl|NC_019524. 81 GVVAVHRDSIVGSQYKLNAKPNTIVLGAPD------GWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGF 154 (556) Q Consensus 81 ~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~------~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~ 154 (556) .+++++++.+.+ ||+|..++ +-.|...+ .+.++|-+.+++.-.. +=-+.+|+.-...++... T Consensus 63 ~a~~~Las~l~~-~ltpp~~~-WF~l~~~d~~~~e~~~v~~~L~~ve~~~~~----------~l~~snf~~~~~~~~~~L 130 (559) T protein:vir:95 63 MAARTLASGMMS-GITSPARP-WFRLATPDPEMMDYGPVKLWLEAVQNRMND----------MFNKSNLYQSLPQLYGSL 130 (559) T ss_pred HHHHHHHHHHHH-hhcCCCCc-ccccccCCccccchHHHHHHHHHHHHHHHH----------HHHhcCcHHHHHHHHHHH Confidence 888888887776 45553333 22232221 2333344444443222 222457888888888888 Q ss_pred eecCceEEEEeeccCCCCcCCCcccceEEEEEchh--------------------------hcCCC----CC-CC-C-Cc Q lcl|NC_019524. 155 LMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPY--------------------------RMSNP----NN-VM-D-TP 201 (556) Q Consensus 155 ~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~d--------------------------rl~~~----~~-~~-~-g~ 201 (556) ++.|-+++-+...+.... .-..+|+.==+|+.| .|+.. ++ .+ + .- T Consensus 131 ~~~Gta~l~~~~d~~~~~--r~~~~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v 208 (559) T protein:vir:95 131 GTYSTGAMAVLDDDEDII--RTMPFPIGSYYLANSPRGSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWI 208 (559) T ss_pred HhhCceeeEeecCCCcee--EEEEeecCeEEEeeCCCCCeEEEEEeEecCHHHHHHHcCcccCCHHHHHHHhcCCCCCeE Confidence 888887755432221100 000111111111111 01000 00 00 0 11 Q ss_pred eEEEEEEECCCCCe---------E-EEEEeecCCCccccCCccccceeeccccCChhHeEeeecccCCCcccCCc-hhhH Q lcl|NC_019524. 
202 NLRSGVQLDNNGAA---------L-GYWLRKAFPGDPTDMEQWKWGYEPARFDWGRRRVIHIIEALLAGQTRGIS-EMVS 270 (556) Q Consensus 202 ~i~~GIE~d~~Gr~---------v-aY~i~~~hpgd~~~~~~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs-~la~ 270 (556) .|++-|+-+.++.+ . .||+...-.++..... ..|.. . +-|+--| ..-+|..-|.+ +... T Consensus 209 ~v~~~V~pr~~~~~~~~~~~~~pf~s~~~e~~~~~~~~l~e-sg~~e------~--P~~~~Rw-~~~~ge~YGrg~P~~~ 278 (559) T protein:vir:95 209 EVMHSVYPNIDRDTSKLDSKNKPFKSVYYEVGGDNDKLLRE-SGFDE------F--PIMAPRW-EVNGEDVYGSSCPGML 278 (559) T ss_pred EEEEEEeccccccccccccccceEEEEEEEecCCCceeeec-CCccc------C--Cccceee-eecCCccccccchHHH Confidence 34444443222211 1 1332221111100000 00000 0 0011111 24588899999 6889 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccccccccccccccccccceecCCceeeec Q lcl|NC_019524. 271 ALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHL 350 (556) Q Consensus 271 ~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L 350 (556) +|-.+|.|+.+..+.+..+.+++.-...+-.+ . ......+.||.+... T Consensus 279 al~d~k~L~~l~~~~l~~~~~~~~pp~~v~~~-~-------------------------------~~~~~~l~pgg~~~~ 326 (559) T protein:vir:95 279 ALGPVKALQLLQKRKSQLIDKATNPPMVAPTS-L-------------------------------KNQRASLLPGDITYI 326 (559) T ss_pred hhHHHHHHHHHHHHHHHHHHHHhcCceecccc-c-------------------------------cccceeeeccceeee Confidence 99999999999999999998887765553211 0 011223567766655 Q ss_pred CCCc---eeeeecCCCCCccHHHH-HHHHHHHHHHhcCCCHHH--hhchhhcccchhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 351 YPGT---KLKMQPAGTPGGVGTDY-EQSLLRNIAASLGMSYEQ--FSRDYTKTNYSSARASMAETQKYMDSRKKLVADRF 424 (556) Q Consensus 351 ~pGe---~i~~~~~~~p~~~f~~F-~~~~lr~iaaglGi~ye~--l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~ 424 (556) ..+. .+++.....++-.+..- +..+-..|..++-..... 
...|-.+++-.=++.-..|.-+.+-..=..|-..| T Consensus 327 ~~~~~~~~i~p~~~~~~~~~~~~~~i~~~~~rI~~af~~d~~~~l~~r~~~rvTAtEV~~r~~E~~~~LG~v~~rl~~E~ 406 (559) T protein:vir:95 327 DQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDEC 406 (559) T ss_pred CCCCCcccceeecccccchHHHHHHHHHHHHHHHHHhhhhhHHHhhcCCCCCCCHHHHHHHHHHHHHHhhHHHHHHHHHH Confidence 4433 34443323333222111 233444455554433221 22455555555555555555555555555677889 Q ss_pred HHHHHHHHHHHHHHcCCccCCCCc-ccc----cccchhhHHH--------hhCeeeecCcccccchh----hhhHHHHHH Q lcl|NC_019524. 425 ASAIYTLWLEEEVNAGNVPLPPGK-NWR----MFYDPMMRDA--------LCNAEWIGASRGQIDEK----KETEAAILR 487 (556) Q Consensus 425 ~~pi~~~~l~~a~l~G~l~~p~~~-~~~----~~~~~~~~~a--------~~~~~w~~p~~~~iDP~----Ke~~A~~~~ 487 (556) +.|+.++.+..+...|.||-|+.. ... .+-.|..+.- -.-..++++- ..++|. =+....+.. T Consensus 407 l~Pli~r~~~il~r~g~lP~~p~~l~~~~i~v~~is~La~aqk~~~~~~i~~~~~~~~~l-aq~~Pevld~id~d~~~~~ 485 (559) T protein:vir:95 407 LNPLIDRSFSMMVRKNMLPPPPDVMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQL-AQVKPEALDKLNVDQAIDA 485 (559) T ss_pred HHHHHHHHHHHHHhcCCCCCCcccccCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHH-hccChhhhhcCCHHHHHHH Confidence 999999999999999999866431 100 0111111100 0000111110 011221 122222223 Q ss_pred HHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHH--HcCCCCCccccccCCCCCCCCCC--CCCCC--CCCcCCC Q lcl|NC_019524. 488 IKNGLSTYEAEISRLGGDFREVFKQRAREEGLIK--SLKLDFTGKMVEGNSTQSSNSSE--STSDN--PNEETTQ 556 (556) Q Consensus 488 i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~--~~Gl~~~~~~~~~~~~~~~~~~~--~~~~~--~~~e~~~ 556 (556) +...+.-+..+++.. .+.++.-+|++..++.+. ..++.........+.++.+.++. .-... 
+.....| T Consensus 486 ~a~~~Gvp~~~irs~-~ev~~~rqqr~~~qq~~q~~~~~~~aa~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ 559 (559) T protein:vir:95 486 FADMSGVSPTVIVPQ-EQVEQARQQRAQQQQQQQMMAMGMAAAQGVKTLSEAKTSDPSVLSAMANAVSGQGGQSQ 559 (559) T ss_pred HHHHhCCchhhcCCH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCCChhHHHHHHHhhcCccccCC Confidence 322222222222211 111111122222222111 12211100000000000000000 00000 0000001 No 240 >protein:vir:7208 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049782;genbank:gi:9632594;genbank:GeneID:1258582 Probab=87.77 E-value=0.037 Score=28.46 Aligned_cols=472 Identities=11% Similarity=0.059 Sum_probs=225.7 Q ss_pred CCc--------chhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCC----CCHHHHHHHHHHHHHHH Q lcl|NC_019524. 1 MKD--------VKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSI----ISPDQQIAQNQDMASAR 68 (556) Q Consensus 1 ~sp--------~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~----~s~~~~i~~~~~~lr~R 68 (556) |++ -....=.............. +.+...-++++.....+-...+|.... ++.+.. ..+-..|..+ T Consensus 1 m~~~~L~~~~~w~~~de~~~~~~~~~~~~S~-~~p~~~Dga~e~~~~~~~~a~~~~g~~~~~~g~~e~~-~~~~~eLI~~ 78 (524) T protein:vir:72 1 MKFNVLSLFAPWAKMDERNFKDQEKEDLVSI-TAPKLDDGAREFEVSSNEAASPYNAAFQTIFGSYEPG-MKTTRELIDT 78 (524) T ss_pred CCCchhhHhhccccCcchhhhhhhccCCccc-cCccCCCCceeeeecccccccccceeeeehhcccccc-cchHHHHHHH Confidence 222 11111000000000000000 000011111111110000001222111 111111 2356778889 Q ss_pred HHHHHhcChHHHHHHHHHHhhhccC--CceeeeeccccccCCCh-hHHHHHHHHHHHHHHHHhcccccceehhcccCHHH Q lcl|NC_019524. 69 AQDMVQNDGYAAGVVAVHRDSIVGS--QYKLNAKPNTIVLGAPD-GWGEEFQEVVEARFNMAAESPENWFDARRMCTLTG 145 (556) Q Consensus 69 aRdl~rNn~~a~~~v~~~~~nvVG~--Gi~~~~~~~~~~lg~~~-~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~ 145 (556) =|.| .++|-+-.||+-|++-+|=. +=.|. .+.++. +..+.+.+.|...|+.-. ..++|.. 
T Consensus 79 YR~m-a~~pEvd~Av~eIVneaiv~d~~~~pV------~l~L~~~~~s~~iK~kI~eeF~~Il----------~ll~F~~ 141 (524) T protein:vir:72 79 YRNL-MNNYEVDNAVSEIVSDAIVYEDDTEVV------ALNLDKSKFSPKIKNMMLDEFSDVL----------NHLSFQR 141 (524) T ss_pred HHHH-hhccchhhHHHHhhcceeEecCCCceE------EEEecCcCcchHHHHHHHHHHHHHH----------HHhccch Confidence 9999 78999999999998887742 21121 122211 234566777888887543 3467777 Q ss_pred HHHHHhhhheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCC----CCCCCCceEEEEEEECCCCCeEEEEEe Q lcl|NC_019524. 146 LTRLAVSGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNP----NNVMDTPNLRSGVQLDNNGAALGYWLR 221 (556) Q Consensus 146 lq~l~~r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~----~~~~~g~~i~~GIE~d~~Gr~vaY~i~ 221 (556) --.-.+|.|.+||-+|..++..+.. ++.|- ..|..|+|-.+..- ....++..++.| -.-|+++ T Consensus 142 ~~~~~fR~WYVDgRi~fhKiid~k~--pk~GI---~Elr~lDPr~i~~vr~i~~~~~~~~~vi~~--------~~e~f~Y 208 (524) T protein:vir:72 142 KGSDHFRRWYVDSRIFFHKIIDPKR--PKEGI---KELRRLDPRQVQYVREIITETEAGTKIVKG--------YKEYFIY 208 (524) T ss_pred hhhHHHhhheeeeEEEEEEEEeCCC--ccccc---eeeeeeCCccceeeeeeccCCCccchhhcc--------hhhheee Confidence 7778899999999999887663221 11121 36777888877421 112233333333 3346665 Q ss_pred ecCCCccccCCccccceeeccccCChhHeEeeecccCCCcc-cCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEe Q lcl|NC_019524. 222 KAFPGDPTDMEQWKWGYEPARFDWGRRRVIHIIEALLAGQT-RGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVE 300 (556) Q Consensus 222 ~~hpgd~~~~~~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~-RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~ 300 (556) +.. ...+...+..+ ......++|.+-|.|.-...-+.-. -=+|.|+++++.+.+|.=.+||.++=...-|-=--++. 
T Consensus 209 ~~~-~~~y~~~g~~~-~~~~~ikI~~dAI~y~hSGL~d~~~~~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFY 286 (524) T protein:vir:72 209 DTA-HESYACDGRMY-EAGTKIKIPKAAVVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAVVIYRITRAPDRRVWY 286 (524) T ss_pred ccC-ccccccCcccc-CCCcceecchhheeeeeccceeCCCCceeccchhhhHhHHhhhHHHhhHHHHhhhccccceEEE Confidence 531 22222222110 0112346777777777665433222 33799999999999999999999887665553321111 Q ss_pred ccCcccc---ccccc------------ccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCC Q lcl|NC_019524. 301 SELPSDV---VFGQL------------GMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPG 365 (556) Q Consensus 301 ~~~~~~~---~~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~ 365 (556) =+.+.-. +.+.. -....+...+....+++..+..-..+. +| -|.+|+.+.....- T Consensus 287 IDvGnlPk~KAeqYl~~im~k~KNklvYDa~TGev~ddrk~msMlEDyWLpRRe----Gg------rgTEItTLpGgqnl 356 (524) T protein:vir:72 287 VDTGNMPARKAAEHMQHVMNTMKNRVVYDASTGKIKNQQHNMSMTEDYWLQRRD----GK------AVTEVDTLPGADNT 356 (524) T ss_pred EecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccchhhhhhHhhhcccccC----CC------cccceeeccccCCc Confidence 1111111 00000 012222233334445555444444442 22 24444444433222 Q ss_pred ccHHHHHHHHHHHHHHhcCCCHHHhhchhh-cccc---hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCC Q lcl|NC_019524. 366 GVGTDYEQSLLRNIAASLGMSYEQFSRDYT-KTNY---SSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGN 441 (556) Q Consensus 366 ~~f~~F~~~~lr~iaaglGi~ye~l~~D~s-~~nY---Ss~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~ 441 (556) +. .+-+....+.+=.+|+||-.-|..|-+ +.|+ |.+----+.|.+.+.+.|..|..-|.+++-. ..+|.|. 
T Consensus 357 ge-m~DV~YF~kkLy~aLnVP~sRl~~d~~~~f~~gr~~EItRDEikF~KFI~rLR~rFs~~f~~~Lk~----qLilKgi 431 (524) T protein:vir:72 357 GN-MEDIRWFRQALYMALRVPLSRIPQDQQGGVMFDSGTSITRDELTFAKFIRELQHKFEEVFLDPLKT----NLLLKGI 431 (524) T ss_pred Ch-HHHHHHHHHHHHHHhCCchhhcCCCCCccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHH----hhhhccC Confidence 23 344556666677789999988865532 2333 2222334567788888888887777776544 3577787 Q ss_pred ccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHc-----CCC----CHH----HHHHHhCCCHHH Q lcl|NC_019524. 442 VPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKN-----GLS----TYE----AEISRLGGDFRE 508 (556) Q Consensus 442 l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~-----G~~----s~~----~~~ae~G~D~e~ 508 (556) +.. ..|. ...-...|....-.+.-.+||++--..++.. +.- |.+ .+++..-.+.++ T Consensus 432 it~-eew~----------~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~ 500 (524) T protein:vir:72 432 ITE-DEWN----------DEINNIKIEFHRDSYFAELKEAEILERRINMLTMAEPFIGKYISHRTAMKDILQMTDEEIEQ 500 (524) T ss_pred CCH-HHHH----------HHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHHHHHHhccCHHHHHH Confidence 742 1111 1112345555666677888887755554432 100 111 222222333333 Q ss_pred HHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCC Q lcl|NC_019524. 509 VFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDN 549 (556) Q Consensus 509 v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~ 549 (556) ...|++.|. +.|+-. 
+++++++|- T Consensus 501 ~~k~I~~E~----k~~~~~-------------~~~~~~~~f 524 (524) T protein:vir:72 501 EAKQIEEES----KEARFQ-------------DPDQEQEDF 524 (524) T ss_pred HHHHHHHHh----hcCCCC-------------CCchhhhcC Confidence 333443332 223321 111111111 No 241 >protein:vir:7321 Length: 556 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848212;genbank:gi:30387383;genbank:GeneID:2641872 Probab=87.28 E-value=0.04 Score=28.26 Aligned_cols=465 Identities=12% Similarity=0.060 Sum_probs=186.5 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAA 80 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~ 80 (556) |++-.+. ...+..+... ..|..--...-+-+..+-++...|........ ..|.----++.+. T Consensus 1 m~~~~~~---~l~~r~~~l~---~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~------------~~~~~~~~dst~~ 62 (556) T protein:vir:73 1 MAETEKE---RLLKQLAQLK---NERTSFESHWLDLSDFINPRGSRFLTSDVNRD------------DRRNTKIVDPTGS 62 (556) T ss_pred CChhhHH---HHHHHHHHHH---HHhhHHHHHHHHHHHHhccccCCcCCCCCCcc------------hhhcCccccchHH Confidence 5442211 1111111110 00100000001111111111111111100000 0111122456677 Q ss_pred HHHHHHHhhhccCCceeeeeccccccCCCh------hHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhh Q lcl|NC_019524. 81 GVVAVHRDSIVGSQYKLNAKPNTIVLGAPD------GWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGF 154 (556) Q Consensus 81 ~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~------~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~ 154 (556) .+++++++.+.+ ||+|..++ +-.|...+ .+.+.|-+.+++.-. ++=-+.+|+.....++... 
T Consensus 63 ~a~~~Las~l~~-~ltpp~~~-WF~l~~~d~~~~~~~~v~~~L~~ve~~~~----------~~l~~snf~~~~~~~~~~L 130 (556) T protein:vir:73 63 MAQRILSSGMMS-GITSPARP-WFKLATPDPDMMDYGPVKIWLEVVQRRMN----------EVFNKSNLYQSLPVMYASL 130 (556) T ss_pred HHHHHHHHHHHH-hhcCCCCc-ccccccCcccccchHHHHHHHHHHHHHHH----------HHHHhcCcHHHHHHHHHHH Confidence 888888887776 45553332 22233322 223333333443322 2222457888888888888 Q ss_pred eecCceEEEEeeccCCCCcCCCcccceEEEEEchh--------------------------hcCCC----CC-CC-C-Cc Q lcl|NC_019524. 155 LMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPY--------------------------RMSNP----NN-VM-D-TP 201 (556) Q Consensus 155 ~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~d--------------------------rl~~~----~~-~~-~-g~ 201 (556) ++.|-+++-+...+.... ....+|+.==+|+.| .|+.. +. .+ + .- T Consensus 131 ~~~G~a~l~~~~~~~~~~--r~~~~~l~~~~~~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~~ 208 (556) T protein:vir:73 131 GTFGTGAMAVMEDDQDVI--RTMPFPIGSYYLANSPRGSVDTCIRQFSMTVRQMVQEFGLDNVSTSVKGMWENGTYETWV 208 (556) T ss_pred HhhCceeeeeeecCCceE--EEEEeecceeEEeeCCCCCeEEEEEEEeccHHHHHHHcCcccCCHHHHHHHhcCCccceE Confidence 888887755432211100 000111111111111 11100 00 01 1 12 Q ss_pred eEEEEEEECCCCC---------eEE-EEEeecCCCccccCCccccceeeccccCChhHeEeeecccCCCcccCCc-hhhH Q lcl|NC_019524. 202 NLRSGVQLDNNGA---------ALG-YWLRKAFPGDPTDMEQWKWGYEPARFDWGRRRVIHIIEALLAGQTRGIS-EMVS 270 (556) Q Consensus 202 ~i~~GIE~d~~Gr---------~va-Y~i~~~hpgd~~~~~~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs-~la~ 270 (556) .|++-|+.+..+. |.+ ||+...-.++.....+ .+. .. + .+..--...+|..-|.+ +..- T Consensus 209 ~v~~~V~pr~~~~~~~~~~~~~p~~s~~~~~~~~~~~vl~es-g~~------e~--P-~~~~Rw~~~~ge~YGrg~P~~~ 278 (556) T protein:vir:73 209 EVNHCITPNVNRDSGKMDSKNKPYRSVYFESGGDSDKLLRES-GFD------EF--P-ILAPRWEVNGEDVYASSCPGML 278 (556) T ss_pred EEEEEEeccccccccccCcccceEEEEEEEecCCCceecccC-Ccc------cC--C-ceeeeeeecCCcccccCccHHH Confidence 3555555433222 222 4443221111111000 000 00 0 22222345689999999 6999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccccccccccccccccccceecCCceeeec Q lcl|NC_019524. 
271 ALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHL 350 (556) Q Consensus 271 ~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L 350 (556) +|-.++.|+.+..+.+..+.+++.-...+-.+. ......+.||.+... T Consensus 279 ~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~--------------------------------~~~~~~~~pgg~~~~ 326 (556) T protein:vir:73 279 ALGQVKALQVEQKRKAQLIDKATNPPMVAPTSL--------------------------------KNQRVSLLPGDVTYL 326 (556) T ss_pred hHHHHHHHHHHHHHHHHHHHHHhcCceeccccc--------------------------------cccceeeccCccccc Confidence 999999999999999999999887655533210 011234566665533 Q ss_pred C---CCceeeeecCCCCCccHHHH---HHHHHHHHHHhcCCCHHH--hhchhhcccchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 351 Y---PGTKLKMQPAGTPGGVGTDY---EQSLLRNIAASLGMSYEQ--FSRDYTKTNYSSARASMAETQKYMDSRKKLVAD 422 (556) Q Consensus 351 ~---pGe~i~~~~~~~p~~~f~~F---~~~~lr~iaaglGi~ye~--l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~ 422 (556) . ..+.|+++....+ +|... +..+-..|..++-....+ ...|-++++-.=++.-..|....+-..=..|-. T Consensus 327 ~~~~~~~~i~p~~~~~~--d~~~~~~~i~~~~~rI~~af~~d~~~~l~~~~~~r~TAtEv~~r~~E~~~~LG~v~~rl~~ 404 (556) T protein:vir:73 327 DVISGQDGFKPAYLVNP--NTADLLADIQDTRQTINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLND 404 (556) T ss_pred cCCCCccceeeeccccc--cHHHHHHHHHHHHHHHHHHhhcchhhhhccCCCCCccHHHHHHHHHHHHHHhhHHHHHHHH Confidence 3 2245666543433 34433 344455555555433211 223555555555555555555555555556778 Q ss_pred HHHHHHHHHHHHHHHHcCCccCCCCc-ccc----cccchhhHHH--------hhCeeeecCcccccch----hhhhHHHH Q lcl|NC_019524. 423 RFASAIYTLWLEEEVNAGNVPLPPGK-NWR----MFYDPMMRDA--------LCNAEWIGASRGQIDE----KKETEAAI 485 (556) Q Consensus 423 ~~~~pi~~~~l~~a~l~G~l~~p~~~-~~~----~~~~~~~~~a--------~~~~~w~~p~~~~iDP----~Ke~~A~~ 485 (556) .|+.|+.++.+..+...|.||-|+.. ... .+-.|..+.. 
---..++++- ..++| .=+....+ T Consensus 405 E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aqk~~~~~~i~~~~~~~~~l-aq~~Pe~~d~id~d~~~ 483 (556) T protein:vir:73 405 EALNPLIDRVFSIMARKNMLPEPPDVLQGMPLRIEYISVMAQAQKSIGLTSLSQTVGFIGQL-AQFKPEALDKLDVDQAI 483 (556) T ss_pred HHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeecHHHHHHHHHHHHHHHHHHHHHHHH-hccChhhHhcCCHHHHH Confidence 89999999999999999998876431 000 0001110000 0000111110 00122 11222222 Q ss_pred HHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHH----------------HcCCCCCccccccCCCCCCCCC Q lcl|NC_019524. 486 LRIKNGLSTYEAEISRLGGDFREVFKQRAREEGLIK----------------SLKLDFTGKMVEGNSTQSSNSS 543 (556) Q Consensus 486 ~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E~~~~~----------------~~Gl~~~~~~~~~~~~~~~~~~ 543 (556) ..+...+.-+..+++. ..+.++.-+|++...+.+. +.+++.+........+..++.+ T Consensus 484 ~~~a~~~Gvp~~~irs-~eev~~~rq~r~~~qq~~~~~~~~~~a~~~~~~~~~~~~~~~~~l~~~~~~~g~~~~ 556 (556) T protein:vir:73 484 DAFSEMSGVSPTVIVP-QEQVQGIREERAKQAQAAQAMAMGQAAAQGAKTLSETQTSDPSALTAIANAAGAPQQ 556 (556) T ss_pred HHHHHHcCCChhhcCC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCHHHHHHHHHhhcCCCC Confidence 2222222222222111 1111122222222222221 1111111111111111111111 No 242 >protein:vir:5665 Length: 511 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899604;genbank:gi:34419591;genbank:GeneID:2546036 Probab=86.90 E-value=0.042 Score=28.11 Aligned_cols=466 Identities=10% Similarity=0.066 Sum_probs=227.9 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhcccc-----CCCccccc-ccCCCCCHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAE-----RTTREMFQ-WNPSIISPDQQIAQNQDMASARAQDMVQ 74 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~-----~~~r~~~~-w~~~~~s~~~~i~~~~~~lr~RaRdl~r 74 (556) ++|-....-..........+...++ --..+||. -....+++ ....-...+..+.. 
..|..+=|.|+ T Consensus 1 ~~~w~~~de~~~~~~~~~~~~S~~~-----p~~~DGa~~i~~~~~~~~~~g~~~~~~~~~~~~~~~--~eLI~~YR~ma- 72 (511) T protein:vir:56 1 MKFWTKEEEQDIQKIEKNPVRSFSA-----PDNVDGAKEIHTNLLAPQLGHAIIPSDAQSEGTIPV--KELIKSYRALA- 72 (511) T ss_pred CCCccchhhhhhhhhccCCcccccC-----CCCCCCceEEecccccceecceeccccccccCccch--HHHHHHHHHHh- Confidence 3333322222222211111111111 11233331 00111111 11111111111111 36888888876 Q ss_pred cChHHHHHHHHHHhhhccC--CceeeeeccccccCCCh-hHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHh Q lcl|NC_019524. 75 NDGYAAGVVAVHRDSIVGS--QYKLNAKPNTIVLGAPD-GWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAV 151 (556) Q Consensus 75 Nn~~a~~~v~~~~~nvVG~--Gi~~~~~~~~~~lg~~~-~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~ 151 (556) ++|-+-.||+-|++-+|=. +=.|. .+.++. +..+.+.+.|...|+.-. ..++|..--.-.+ T Consensus 73 ~~pEvd~Av~eIvne~iv~d~~~~pV------~l~ld~~~~s~~iK~kI~eeF~~Il----------~ll~F~~~~~~~f 136 (511) T protein:vir:56 73 EYHEVDDAIQEIVDEAIVYENDKEVV------WLNLDNTDFSENIKAKINEEFDRVV----------SLLQMRKHGYKWF 136 (511) T ss_pred hccchhhHHHHhhcceeEecCCCceE------EEEecccCcchHHHHHHHHHHHHHH----------HHhccchhhhHHH Confidence 5788899988888877642 21121 122222 235667777888887644 3467777777889 Q ss_pred hhheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCCCCCCCCceEEEEEEECCCCCeEEEEEeecCCCccccC Q lcl|NC_019524. 152 SGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNPNNVMDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDM 231 (556) Q Consensus 152 r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~~~~~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~ 231 (556) |.|.+||-+|..++..+..|- ..|..|||-.+..=..... ..+.|++. .+.-.-|++++.-....... 
T Consensus 137 R~WYVDgRi~fHkiid~k~GI--------~eLr~lDPr~i~~vr~i~~--~~~~~~~v--~~~~~ey~~Y~~~~~~~~~~ 204 (511) T protein:vir:56 137 RKWYVDSRIYFHKILDKDNNI--------IELRPLNPMKMELVREIQK--ETIDGVEV--VKGTLEYYVYKQSDYKMPSW 204 (511) T ss_pred hhhhhcceEEEEEEeccccce--------eehhhcCcccchhhhhhhc--cccccccc--ccceeeeeEecCCCcccCcc Confidence 999999999988776543322 3567777777753111100 11334432 22346788876432111100 Q ss_pred CccccceeeccccCChhHeEeeecccC---CCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccc- Q lcl|NC_019524. 232 EQWKWGYEPARFDWGRRRVIHIIEALL---AGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDV- 307 (556) Q Consensus 232 ~~~~~~rv~~~~~v~a~~viH~f~~~r---~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~- 307 (556) .+. ........++|.+.|.|+-...- ..-.-.+|.|+++++.+.+|.=.+||.++=...-|-=--++.=+.+.-. T Consensus 205 ~~~-~~~~~~~vkI~~daI~y~hSGL~d~~~~~g~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk 283 (511) T protein:vir:56 205 MSA-TNRAQTSFRIPKDAIVFAHSGLMRGCADDPYIIGYLDRAIKPANQLKMLEDALVIYRLARAPERRVFYVDVGNLPT 283 (511) T ss_pred ccc-ccccccceeechhheeeecccceeccCCCCeeeccchhhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCc Confidence 000 00112345789999977766542 3333579999999999999999999998876655533211111111111 Q ss_pred --ccccc------------ccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHHHH Q lcl|NC_019524. 308 --VFGQL------------GMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDYEQ 373 (556) Q Consensus 308 --~~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F~~ 373 (556) +.+.. -....+...+....+++..+..-..+. +| -|.+|+.+.....-+. .+-+. 
T Consensus 284 ~KAeqYl~~iM~k~kNklVYDa~TGev~ddrk~msMlEDyWLpRRe----Gg------rgTEItTLpGgqnlge-m~DV~ 352 (511) T protein:vir:56 284 QKAQQYVNGIMQNVKNRVVYDTQTGQVKNTTNAMSMLEDYYLPRRE----GS------KGTEVSTLPGGQSLGD-IEDVL 352 (511) T ss_pred hhHHHHHHHHHHhcCceEEEeccCceeccchhhhhhHhhhcccccC----CC------CccceeeccccCCcCh-HHHHH Confidence 00000 012222233333444554444444442 22 2444444443322223 34455 Q ss_pred HHHHHHHHhcCCCHHHhhchhhcccchhHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCc Q lcl|NC_019524. 374 SLLRNIAASLGMSYEQFSRDYTKTNYSSAR-----ASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGK 448 (556) Q Consensus 374 ~~lr~iaaglGi~ye~l~~D~s~~nYSs~R-----~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~ 448 (556) ...+.+=.+|+||-.-|..|=-+..|+-.| ---+.|.+.+.+.|..|..-|.+++-. ..+|.|.+.. ..| T Consensus 353 YF~kKLy~aLnVP~SRl~~e~q~~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~----qLilKgiit~-eeW 427 (511) T protein:vir:56 353 YFNRKLYKAMRIPTSRAASEDQTGGINFGQGAEITRDELKFTKFVKRLQTKFETVITDPLKH----QLIVNNIITE-EEW 427 (511) T ss_pred HHHHHHHHHhCCCcccccCCCCccccccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH----hhhhccCCCH-HHH Confidence 666667778999988776442112333223 334678888888898887777776544 2577787742 111 Q ss_pred ccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHc-----CCC----CHHH----HHHHhCCCHHHHHHHHHH Q lcl|NC_019524. 449 NWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKN-----GLS----TYEA----EISRLGGDFREVFKQRAR 515 (556) Q Consensus 449 ~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~-----G~~----s~~~----~~ae~G~D~e~v~~q~a~ 515 (556) . ...-...|....-.+.-.+||++--..++.. +.. |.+- +++..-.+.++...|++. T Consensus 428 ~----------~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~~yi~k~ILr~tDeei~~~~k~I~~ 497 (511) T protein:vir:56 428 D----------ANHEKLYVVFNQDSYFEEAKELEILNSRMNAMRDIQDYAGKYYSHKYIQKNILRLSDDQITAMQSEIDE 497 (511) T ss_pred H----------HHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhcchhccccchHHHHHHHhccCHHHHHHHHHHHHH Confidence 1 1112345555666677888887755554432 111 2222 222223333333333333 Q ss_pred HHHHHHHcCCCCCccccccCCCCCCCCCCCCCCC Q lcl|NC_019524. 
516 EEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDN 549 (556) Q Consensus 516 E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~ 549 (556) |.. .++-.. .+++- T Consensus 498 E~k----~~~~~~----------------~e~~f 511 (511) T protein:vir:56 498 EET----NPRFQQ----------------DDQGF 511 (511) T ss_pred hhc----CCCCCC----------------cccCC Confidence 322 111100 00000 No 243 >protein:vir:3361 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523332;genbank:gi:17570823;genbank:GeneID:927409 Probab=86.28 E-value=0.047 Score=27.88 Aligned_cols=450 Identities=12% Similarity=0.071 Sum_probs=187.6 Q ss_pred chhhhHHHHHhhHhhcccchhhh---hhhhcchhccccCCCccccccc-CCCCCHHHHHHHHHHHHHHHHHHHHhcChHH Q lcl|NC_019524. 4 VKKTTRTRAKKAVDVVAETATAT---PMAVGGGMEGAERTTREMFQWN-PSIISPDQQIAQNQDMASARAQDMVQNDGYA 79 (556) Q Consensus 4 ~~~~~r~~a~~a~~~~~~~~~~~---~~~~~~~y~aa~~~~r~~~~w~-~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a 79 (556) |....+.. +.......+ .+.....|+... +..+.|. |..-..+... ... .+.+ . =++.+ T Consensus 1 m~~~~~~~------~~~~~~~~r~~~l~~~R~~~e~~w---~e~~~~~lP~~~~~~~~~--~~~---~~~~-~--~dst~ 63 (535) T protein:vir:33 1 MADSKRTG------LGEDGAKATYDRLTNDRRAYETRA---ENCAQYTIPSLFPKESDN--EST---DYTT-P--WQAVG 63 (535) T ss_pred CChhhhhc------cChhHHHHHHHHHHHHhhHHHHHH---HHHHHHhcccccCCCCCc--ccc---cccc-c--ccccH Confidence 33332211 111111000 001111122110 0111111 1100000000 000 0011 1 27778 Q ss_pred HHHHHHHHhhhccCCceeeeeccccccCCChhHHHHH------HHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhh Q lcl|NC_019524. 80 AGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEF------QEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSG 153 (556) Q Consensus 80 ~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~------~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~ 153 (556) ..+++.+++.+.+ ||+|. ++ +-.|...+....++ ...++.++..-.+.- .++=-+.+||.-...++.. 
T Consensus 64 ~~a~~~Laa~l~~-~ltP~-~~-WF~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~---~~~~~~snf~~~~~~~~~~ 137 (535) T protein:vir:33 64 ARGLNNLASKLML-ALFPM-QS-WMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERII---MNYIESNSYRVTLFECLKQ 137 (535) T ss_pred HHHHHHHHHHHHH-hhcCC-Cc-ccccccChHHHhccccCcchHHHHHHHHHHHHHHH---HHHHHhcCcHHHHHHHHHH Confidence 8999999999998 57784 64 76666554322111 112222222211111 1223356899999999999 Q ss_pred heecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCC-CC--------------------------------CCCCC Q lcl|NC_019524. 154 FLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSN-PN--------------------------------NVMDT 200 (556) Q Consensus 154 ~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~-~~--------------------------------~~~~g 200 (556) .++.|-+++-+... .+. ..+|+.--|..-.|.. +. +..+. T Consensus 138 L~~~G~a~l~~~~~--~~~-----~~~f~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~k~~~~~ 210 (535) T protein:vir:33 138 LIVAGNALLYLPEP--EGS-----YNPMKLYRLSSYVVQRDAYGNVLQIVTRDQIAFGALPEDVRSAVEKSGGEKKMDEM 210 (535) T ss_pred HHhhCceeEEeecC--CCC-----ceeeEEEEcCeeEEeeCCCCCeeEEEeeEeecHHHHHHHhhhhhcccccccccccC Confidence 99999887654322 111 1222222222111110 00 01123 Q ss_pred ceEEEEEEECCC-CCeEEEEEeecCCCccccCCccccceeeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHH Q lcl|NC_019524. 201 PNLRSGVQLDNN-GAALGYWLRKAFPGDPTDMEQWKWGYEPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTR 279 (556) Q Consensus 201 ~~i~~GIE~d~~-Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~ 279 (556) ..|+..|..++. |....|+-....+.....+. .+. 
...| .+..-....+|..-|.++...+|-.++.|+ T Consensus 211 ~~v~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~-----~~~P---~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~ 280 (535) T protein:vir:33 211 VDVYTHVYLDEESGDYLKYEEVEDVEIDGSDAT--YPT-----DAMP---YIPVRMVRIDGESYGRSYCEEYLGDLRSLE 280 (535) T ss_pred CeEEEEEEeeCCCCcEEEEEEEeCccccccccc--ccc-----ccCC---ceeeeeeecCCCccccchHHHHHHHHHHHH Confidence 347777777755 44444443322221111100 000 0011 233333567899999999999999999999 Q ss_pred HHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccccccccccccccccccceecCCceeeecCCCceeeee Q lcl|NC_019524. 280 NFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQ 359 (556) Q Consensus 280 ~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~ 359 (556) .+..+.+..+.+++.-...+... +..... .....++|.|+.-.+ ++++++ T Consensus 281 ~l~~~~l~~~~~~~~p~~lv~~~-g~~~~~----------------------------~~~~~~~g~~v~g~~-~~v~~~ 330 (535) T protein:vir:33 281 NLQEAIVKMSMISAKVIGLVNPA-GITQPR----------------------------RLTKAQTGDFVPGRR-EDIDFL 330 (535) T ss_pred HHHHHHHHHHHHHhcCceeeccc-cccchh----------------------------hcccCCceeeecCCc-ccceee Confidence 99999999999998777665421 100000 000112233333333 334444 Q ss_pred cCCCCCccH---HHHHHHHHHHHHHhcCCCHHHhhchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 360 PAGTPGGVG---TDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEE 436 (556) Q Consensus 360 ~~~~p~~~f---~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a 436 (556) ... ..++| ......+-..|..++=+. 
.+...|-++++-.=++.-..|....+-..=..|-..|+.|+.++.+..+ T Consensus 331 ~~~-~~~~~~~~~~~i~~~~~~I~~af~~~-~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il 408 (535) T protein:vir:33 331 QLE-KQADFTVAKAVSDQIEARLSYAFMLN-SAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQL 408 (535) T ss_pred ecc-cccchhHHHHHHHHHHHHHHHHHhhh-hcccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHH Confidence 322 11233 233344444454444111 1112454445444444444444444444445678889999999999999 Q ss_pred HHcCCccCCCCcccccccchhhHHHhhCeeeecCc-----------------------ccccchhhhhHHHHHHHHcCCC Q lcl|NC_019524. 437 VNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGAS-----------------------RGQIDEKKETEAAILRIKNGLS 493 (556) Q Consensus 437 ~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~-----------------------~~~iDP~Ke~~A~~~~i~~G~~ 493 (556) ...|.||.+.... ++++++.|= .+-+|+.=+....+..+... T Consensus 409 ~r~g~lP~~p~~~-------------v~~~yis~La~aqr~~~~~~l~~~~~~la~~~P~~~d~~id~d~~~~~~a~~-- 473 (535) T protein:vir:33 409 QATSQIPELPKEA-------------VEPTISTGLEAIGRGQDLDKLERCISAWAALAPMQGDPDINLAVIKLRIANA-- 473 (535) T ss_pred HhcCCCCCCCccc-------------eeEEEecHHHHHHHHHHHHHHHHHHHHHHhhChhhhhccCCHHHHHHHHHHH-- Confidence 9999998554311 233333220 01222211222222222211 Q ss_pred CHHHHHHHhCCCHHHHH-------HHHHHH---H---HHHHHcCCCCCccccccCCCCCCCCCCCC--CCCCCCcCC Q lcl|NC_019524. 494 TYEAEISRLGGDFREVF-------KQRARE---E---GLIKSLKLDFTGKMVEGNSTQSSNSSEST--SDNPNEETT 555 (556) Q Consensus 494 s~~~~~ae~G~D~e~v~-------~q~a~E---~---~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~--~~~~~~e~~ 555 (556) .|.|+..++ +.++.. . +.+...|-.. .. 
....+++..+ .+.-=-|++ T Consensus 474 --------~Gvp~~~i~~~~ee~~~~~~q~~~~~~~~~~~~~~g~~~-----~~--~~~~~~~~~~~~~~~~g~~~~ 535 (535) T protein:vir:33 474 --------IGIDTSGILLTDEQKQALMMQDAAQTGVENAAAAGGAGV-----GA--LATSSPEAMQGAAAKAGLNAT 535 (535) T ss_pred --------cCCCHhHhcCCHHHHHHHHHHHHHHHHHHHHHHhhhhhh-----cc--hhhcCChhHHHHHHhccCCCC Confidence 233322221 111111 0 1111111000 00 0000000000 000000111 No 244 >protein:vir:94599 Length: 641 # NCBI annotation: PfWMP4_39 # Family: family:all:1548 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762669;genbank:gi:115304377;genbank:GeneID:5142299 Probab=83.60 E-value=0.067 Score=27.02 Aligned_cols=470 Identities=11% Similarity=0.017 Sum_probs=170.6 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCC--CHHHHHHHHHHHHHHHHHHHHhcChH Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSII--SPDQQIAQNQDMASARAQDMVQNDGY 78 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~--s~~~~i~~~~~~lr~RaRdl~rNn~~ 78 (556) |++.+-..-...+-.--...+...-....-.+.|.......+ ..-.|... ..+. .+-.|+ -.-++. T Consensus 20 ~~~~~~~~~l~~~~~~~~~~R~~~e~~W~e~~~~~~~~~~~~--~~~~~~~~~~~~~~-------~~~~r~---ki~~~~ 87 (641) T protein:vir:94 20 LSTDRIGGVVISKWQESRDKRNTVENNWDETYELYRASAIDR--QNTRARNFQTTGAD-------DADWRH---RINTGH 87 (641) T ss_pred CCchhHHHHHHHHHHHHHHhhcchHHHHHHHHHHhhcchhhh--hhcccccccccccc-------hhcccc---cccchh Confidence 222221111100000000000000000000111111100000 00001000 0000 011122 145777 Q ss_pred HHHHHHHHHhhhccCCceeeeecccccc-CCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheec Q lcl|NC_019524. 79 AAGVVAVHRDSIVGSQYKLNAKPNTIVL-GAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMT 157 (556) Q Consensus 79 a~~~v~~~~~nvVG~Gi~~~~~~~~~~l-g~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~d 157 (556) +..+++.+..+..+. ++|+ .++-.+ +.+.+ +.+..+.+++.|+.-+. +.+|+..-.-.++..+.. 
T Consensus 88 ~~~~~~~l~s~Lm~~-~~p~--~~wf~~~p~~~e-d~~~A~~~~~~~~~~l~----------~~~~~~~~~~~~~d~~~~ 153 (641) T protein:vir:94 88 TFEVVETLVAYFKGA-TFPS--DDWFDLKGMVPE-LADAARVVKQLTKTKLE----------AASIRDIFETYVRNLVLY 153 (641) T ss_pred HHHHHHHHhhHHhhh-hcCC--CceEEEecCCCC-hHHHHHHHHHHHHHHHh----------hcchHHHHHHHHHHHhhc Confidence 888999999988875 5553 222211 11111 22333445555543221 113333333333344444 Q ss_pred CceEEEEeeccC-----------CCCcC--------CCcccceEEEEEchhhcCC------------------------- Q lcl|NC_019524. 158 GEVLATCEWLNP-----------TGTTM--------QRRPFGTAIQMISPYRMSN------------------------- 193 (556) Q Consensus 158 GE~f~~~~~~~~-----------~~~~~--------~~~~~~l~lq~ie~drl~~------------------------- 193 (556) |-++++..|-.. ++..- ......+++..|+|.-|.. T Consensus 154 g~~iv~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~r~~~v~~~di~~dps~~~~~~~f~~~r~t~~t~~~l~ 233 (641) T protein:vir:94 154 GVSTYRLGWDTSMERQFKRTFVETGDIFGGWEDVAVNRQRSELRIEPLSPYDVWLDTSGGKNTGTFVRLRHTREELHELV 233 (641) T ss_pred CceEEEeehhhHHHHhhhhhcccchhhcccccccceecccceeeEEecchhheeecCCCCcccccceehhhhHHHHHHHH Confidence 444433322100 00000 0000011222222221100 Q ss_pred ------------------CCCCCCCceEEE--------EEEE----CCCCCeEE-EEEeecCCCccccCCccccceeecc Q lcl|NC_019524. 194 ------------------PNNVMDTPNLRS--------GVQL----DNNGAALG-YWLRKAFPGDPTDMEQWKWGYEPAR 242 (556) Q Consensus 194 ------------------~~~~~~g~~i~~--------GIE~----d~~Gr~va-Y~i~~~hpgd~~~~~~~~~~rv~~~ 242 (556) ..+.++...-.+ -+|+ +..|.+.+ ||+.-. | ....+...+ T Consensus 234 ~eg~~~~d~v~~~~~~~~~~~~~d~~~d~~~~~~~~~~~~e~~gd~~~d~~~~~~~~~~~~--g-------~~il~~~~~ 304 (641) T protein:vir:94 234 TSGYYDLDLTQVEQYVDYKFADPDTPKDVNGTDTSGWDIIEYYGPLLVEGVQFWCVHAVFY--G-------KQLIRLSDS 304 (641) T ss_pred hcCCCChhhcchhhcccccccccccccccccccccccceeeeeeeeccCCCceeeEEEEEe--C-------CEEeecccc Confidence 000000000001 1111 11222222 111100 0 000000000 Q ss_pred ccC-ChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccc Q lcl|NC_019524. 
243 FDW-GRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKE 321 (556) Q Consensus 243 ~~v-~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~ 321 (556) ..+ ..+ +++.-....+|..-|.|+..-++-.++.|+.+....+.++.+++.....+..+ + T Consensus 305 ~~~d~~P-f~~~r~~~~~~~~YG~gp~~~~l~dqk~ln~l~r~~ld~~~~~~~p~~~~~~~-~----------------- 365 (641) T protein:vir:94 305 KYWCGSP-FVTTTLLPDRDSVYGMSVLHPNLGALHVLNVLTNGRLDNLVLHINKMWTLVED-G----------------- 365 (641) T ss_pred cccCcCC-eEEecceecCCcccCCChHHHHHHHHHHHHHHHHHHHHHHHHHhCCeeeeccc-c----------------- Confidence 000 011 23333335789999999999999999999999999999998887654433211 0 Q ss_pred cccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHHHH---HHHHHHHHHHhcCCCHH-Hhh--chhh Q lcl|NC_019524. 322 IFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGTDY---EQSLLRNIAASLGMSYE-QFS--RDYT 395 (556) Q Consensus 322 ~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~~F---~~~~lr~iaaglGi~ye-~l~--~D~s 395 (556) ......+.+.||.+.+......++++.+... +|..- ...+-..+-..+++..- +.. .|.. T Consensus 366 ------------~~~~~~l~~~PG~ii~~~~~~~v~pl~~~~~--~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~ 431 (641) T protein:vir:94 366 ------------ILKREDVKAKPGAVFKVAQHGSLQPIDMGRQ--DFVVTYQEAQVQESSVYRNTSTGPLIGNAAPRGGE 431 (641) T ss_pred ------------ccccceeeccCCcceeeCCCCcceeecCCcc--ccchhHHHHHHHHHHHHHhhhhhhhhcccccccch Confidence 0111234567888888888888888765443 33221 12221233334554421 111 1223 Q ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCe-eeecCcccc Q lcl|NC_019524. 396 KTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNA-EWIGASRGQ 474 (556) Q Consensus 396 ~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~-~w~~p~~~~ 474 (556) +.+-+.+++-+.+........-.+|...|+.|+++..++.....+..+...-. ....+++ .+.++.... 
T Consensus 432 ~~TAtEV~~~~~e~~~~l~~i~r~l~~e~l~pll~~~~~~~~~~~~~p~i~R~----------~~~~~~~~~~~~~~p~~ 501 (641) T protein:vir:94 432 RVTAAEIQGVRDAGGNRLSSVHTHIEDSSTLPLLNKVFSLLQQFYVTPETIRM----------YVPEEQMDGFFEVSPEY 501 (641) T ss_pred hccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccchhhhhh----------hchhhhcccCCCCCccc Confidence 34556666666666666666556677778888888777765554443321100 0000000 011111111 Q ss_pred c----chhhhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHH---HHHHHHcCCCCCccccccCCCCCCCCCCCCC Q lcl|NC_019524. 475 I----DEKKETEAAILRIKNGLSTYEAEISRLGGDFREVFKQRARE---EGLIKSLKLDFTGKMVEGNSTQSSNSSESTS 547 (556) Q Consensus 475 i----DP~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E---~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~ 547 (556) + |-++=..+..+.-.+.+..+..++.-.|.+|+ +.+.+--+ .+.++..|+..+....... ..+. . T Consensus 502 L~~~~~iv~l~~~q~~~~~~~i~~l~~~~~~~a~~P~-v~d~~d~~~~~~~~~~~~g~~~p~~~ir~~---~~~~----~ 573 (641) T protein:vir:94 502 LHYPYKFLALGANYVVERERMVTDLLQLLDISGRVPQ-IGQSLDYALILEDLLRQMRFTDPMRYIKKA---EAPP----A 573 (641) T ss_pred eeeeeeEeecchhHHHHHHHHHHHHHHHHHHhhcChh-hhhcCCHHHHHHHHHHHhCCCCchhhccCc---cCch----h Confidence 1 00111112222222222333333333344542 22221111 1222445654332211110 0000 0 Q ss_pred CCCCCcCCC Q lcl|NC_019524. 548 DNPNEETTQ 556 (556) Q Consensus 548 ~~~~~e~~~ 556 (556) ..+....++ T Consensus 574 ~~~~~~~~~ 582 (641) T protein:vir:94 574 APPIAPAEP 582 (641) T ss_pred HHHHHHHHH Confidence 000000000 No 245 >protein:vir:94709 Length: 522 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338118;genbank:gi:77118196;genbank:GeneID:3707732 Probab=83.23 E-value=0.07 Score=26.91 Aligned_cols=458 Identities=10% Similarity=0.029 Sum_probs=181.9 Q ss_pred chhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHH Q lcl|NC_019524. 
4 VKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVV 83 (556) Q Consensus 4 ~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v 83 (556) |....-.+++.+..+.......|..--...-+-+..+-+... +...... .. .+.+ + =++.+..++ T Consensus 1 ~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~---~~~~~~~----~~-----~~~~-~--~dst~~~a~ 65 (522) T protein:vir:94 1 MAEREGFAAEGAKAVYDRLKNGRQPYETRAQNCAAVTIPSLF---PKESDNS----ST-----EYTT-P--WQAVGARCL 65 (522) T ss_pred CcccchhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccc---CCCCCcc----cc-----cccc-c--ccccHHHHH Confidence 333222222222211111111110000000000111111100 0000000 00 0111 1 267788899 Q ss_pred HHHHhhhccCCceeeeeccccccCCChhHHH------HHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheec Q lcl|NC_019524. 84 AVHRDSIVGSQYKLNAKPNTIVLGAPDGWGE------EFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMT 157 (556) Q Consensus 84 ~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~------~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~d 157 (556) +.+++.+.+ ||+| +++ +-.|..++.... +-...++.++..-.+. -.++=.+.+||.-...++...++. T Consensus 66 ~~Las~l~~-~ltP-~~~-WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~---~~~~~~~snf~~~~~~~~~~L~~~ 139 (522) T protein:vir:94 66 NNLAAKLML-ALFP-QSP-WMRLTVSEYEAKTLSQDSEAAARVDEGLAMVERV---LMAYMETNSFRVPLFEALKQLIVS 139 (522) T ss_pred HHHHHHHHh-hcCC-CCc-ccccccchhhhhccCcccchhHHHHHHHHHHHHH---HHHHHHhcCcHHHHHHHHHHHHhh Confidence 999999988 5777 344 655554432111 1111122222221111 112333568999999999998888 Q ss_pred CceEEEEeeccCCCCcCCCcccceEEEEEchhhcCC-CCCC------------------------------CCCceEEEE Q lcl|NC_019524. 158 GEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSN-PNNV------------------------------MDTPNLRSG 206 (556) Q Consensus 158 GE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~-~~~~------------------------------~~g~~i~~G 206 (556) |-+++-+.- +. .+.+..|+.-.|..-.|.. +.+. 
.+.-.|++- T Consensus 140 G~a~l~~~~-~~-----~~~~~~~~~~pl~~y~v~~d~~G~vd~i~r~~~~~~~~l~~~~~~~~~~~~~~p~~~v~v~~~ 213 (522) T protein:vir:94 140 GNCLLYIPE-PE-----QGTYSPMRMYRLVSYVVQRDAFGNILQIVTIDKVAFSALPEDVKSQLNADDYEPDTELEVYTH 213 (522) T ss_pred CcEeEeeec-cC-----CCceeeEEEEEcceEEEeeCCCcCeEEEeeeeeccHHhcchHHHHHHhcccCCccceEEEEEE Confidence 987754321 11 1112223222222111111 0000 011235555 Q ss_pred EEECCCCCeEEEEEeecCCCccccCCccccceeeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 207 VQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITL 286 (556) Q Consensus 207 IE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael 286 (556) |+.+.. +-.-|+-....+- .....++. +... + .+..--...+|..-|.++...+|-.++.|+.+..+.+ T Consensus 214 v~~~~~-~~~~~~~~~g~~~---~~~~~~~~----~~e~--P-~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l 282 (522) T protein:vir:94 214 IYRQDD-EYLRYEEVEGIEV---TGTDGSYP----LTAC--P-YIPVRMVRLDGEDYGRSYCEEYLGDLNSLETITEAIT 282 (522) T ss_pred EEeeCC-ceeEEeeccCcee---cccCCCCc----cccC--C-ceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHH Confidence 555433 2222222211110 00000000 0000 1 2333334568999999999999999999999999999 Q ss_pred HHHHHhcceeeeEeccCcccccccccccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCc Q lcl|NC_019524. 287 QNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGG 366 (556) Q Consensus 287 ~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~ 366 (556) ..+.+++.-...+..+ +.... 
.......+|.|+.-.+ ++|+++.-..+ + T Consensus 283 ~~~~~~~~p~~~v~~~-g~~~~----------------------------~~~~~~~~g~~v~g~~-~~v~~~~~~~~-~ 331 (522) T protein:vir:94 283 KMAKVASKVVGLVNPN-GITQP----------------------------RRLNKAATGEFVAGRV-EDINFLQLTKG-Q 331 (522) T ss_pred HHHHHHhCCceeeccc-ccccc----------------------------hheeccCCceeecCCc-ccceeeecccc-c Confidence 9999998887655421 11000 0011123344433222 33444332211 2 Q ss_pred cH---HHHHHHHHHHHHHhcCCCHHHh-hchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCc Q lcl|NC_019524. 367 VG---TDYEQSLLRNIAASLGMSYEQF-SRDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNV 442 (556) Q Consensus 367 ~f---~~F~~~~lr~iaaglGi~ye~l-~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l 442 (556) +| ......+-..|..++-+. .+ ..|-++++-.=++.-..|....+-..=..|-..|+.|+.++.+..+...|.| T Consensus 332 ~~~~~~~~i~~~~~rI~~af~~~--~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~l 409 (522) T protein:vir:94 332 DFTIAKSVADAIEQRLGWAFLLN--SAVQRNAERVTAEEIRYVAGELEATLGGVYSVQSQELQLPIVRVLMNQLQSAGMI 409 (522) T ss_pred chhHHHHHHHHHHHHHHHHHhhh--hhccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCC Confidence 33 234444455566655333 22 2455555444444444444444444445677889999999999999999998 Q ss_pred cCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHH-----------cCCC--CHHHH-HHHhCCCHHH Q lcl|NC_019524. 443 PLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIK-----------NGLS--TYEAE-ISRLGGDFRE 508 (556) Q Consensus 443 ~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~-----------~G~~--s~~~~-~ae~G~D~e~ 508 (556) |.++.. .++++++.|=- .+--..++++-..-+. ..+. ..-+. +...|.|+.. 
T Consensus 410 P~~p~~-------------~v~v~~~s~La-~~qr~~~~~~l~~~~~~ia~l~P~~~~~~id~d~~~~~~a~~~Gv~~~~ 475 (522) T protein:vir:94 410 PDLPKE-------------AVEPTVSTGLE-ALGRGQDLEKLTQAVNMMTGLQPLSQDPDINLPTLKLRLLNALGIDTAG 475 (522) T ss_pred CCCCcc-------------cEEeeEecHHH-HHHHHHHHHHHHHHHHHHHhccchhhhhcCCHHHHHHHHHHHcCCChhh Confidence 765421 23333332200 0000000000000000 0000 00011 1112433322 Q ss_pred -------HHHHHHH--HHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCcCC Q lcl|NC_019524. 509 -------VFKQRAR--EEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEETT 555 (556) Q Consensus 509 -------v~~q~a~--E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~ 555 (556) +.+.++. ..+.+...+.... ....+..+....++. +.. T Consensus 476 ivr~~ee~~~~~~q~~~~~~~~~~~~~~~--~~~~a~~~~~~~~~~-------~~~ 522 (522) T protein:vir:94 476 LLLTQDEKIQRMAEQSSQQAVVQGASAAG--ANMGAAVGQGAGEDM-------AQA 522 (522) T ss_pred ccCCHHHHHHHHHHHHHHHHHHHHHHHHH--HHhhhhhhcccchhh-------hcC Confidence 2222111 1111111110000 000000000000000 000 No 246 >protein:vir:103765 Length: 549 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024925;genbank:gi:48697195;genbank:GeneID:2846089 Probab=82.68 E-value=0.075 Score=26.76 Aligned_cols=459 Identities=10% Similarity=0.010 Sum_probs=176.9 Q ss_pred CCcchhhhHHHHHhhHhhcccchhhhhhhhcchhccccCCCcccccc---cCCCCCHHHHHHHHHHHHHHHHHHHHhcCh Q lcl|NC_019524. 1 MKDVKKTTRTRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQW---NPSIISPDQQIAQNQDMASARAQDMVQNDG 77 (556) Q Consensus 1 ~sp~~~~~r~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w---~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~ 77 (556) ||--+...+....+....... .|..--...-+-+.-+-++...+ .+.... 
..+.|.---=++ T Consensus 1 m~~d~~~~~~~l~~r~~~l~~---~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~------------~~~~~~~~~~ds 65 (549) T protein:vir:10 1 MTNDDAKILQALNADHGRMKE---KRQSYEAVWNDVIDYLMPRLDKFGQLPRPDSE------------KGRERSQKMFDS 65 (549) T ss_pred CCcchHHHHHHHHHHHHHHHH---HhhhHHHHHHHHHHHhccccccccccCCCCCC------------cccccccccccc Confidence 666554333222111111100 00000000000011111111111 110000 011111111245 Q ss_pred HHHHHHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhccccccee---hhcccCHHHHHHHHhhhh Q lcl|NC_019524. 78 YAAGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFD---ARRMCTLTGLTRLAVSGF 154 (556) Q Consensus 78 ~a~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD---~~g~~~f~~lq~l~~r~~ 154 (556) .+..+++.+++-+.+ ||+|..++ +-.|...++...+. ..++ .|.+..+..|. .-...+||.-...++... T Consensus 66 tg~~a~~~LAs~l~~-~ltpp~~~-wF~l~~~~~~~~e~-~~v~----~~l~~ve~~~~~~~~~~~snf~~~~~~~~~~L 138 (549) T protein:vir:10 66 TAPLALRNFVAAMDS-MITPATQL-WHRLKTGNDALNEI-ASVK----AYLQGVVRTLFAARYRWQGGFVTQMGATYQSI 138 (549) T ss_pred hHHHHHHHHHHHHHh-hccCCCCc-cccccCCccchhhh-hHHH----HHHHHHHHHHHHHHhhhhcChHHHHHHHHHHH Confidence 677888888887776 46665554 33344433221111 1111 11111111111 123568888888888888 Q ss_pred eecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCC-CCCC-----------------------------------C Q lcl|NC_019524. 155 LMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSN-PNNV-----------------------------------M 198 (556) Q Consensus 155 ~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~-~~~~-----------------------------------~ 198 (556) ++-|-+++-+.... +..+.|+...|..-.|.. +.+. + T Consensus 139 ~~~Gta~l~~~~~~-------~~~~~f~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~ 211 (549) T protein:vir:10 139 GLFGPGALMIEHDV-------GKGIVYRNVPMQRLWFAENNSGLIDKTHVQWELTLRQAAQRFGRENLSPSMQSTLEKDP 211 (549) T ss_pred HhhcceeeEEeecC-------CCeeEEEEEEcCeEEEeeCCCCCeEEEEEEeecCHHHHHHhcCcccCCHHHHHHhhcCC Confidence 88887765543211 112222222222222211 0000 0 Q ss_pred C-CceEEEEEEECCCC---------CeE-EEEEeecCCCccccCCccccceeeccccCChhHeEeeecccCCCcccCCch Q lcl|NC_019524. 
199 D-TPNLRSGVQLDNNG---------AAL-GYWLRKAFPGDPTDMEQWKWGYEPARFDWGRRRVIHIIEALLAGQTRGISE 267 (556) Q Consensus 199 ~-g~~i~~GIE~d~~G---------r~v-aY~i~~~hpgd~~~~~~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~ 267 (556) + .-.|++-|+-+.++ .|. .||+... ++..... ..|. .. +-|+ .-....+|..-|.++ T Consensus 212 ~~~~~v~~~V~pr~~~~~~~~~~~~~pf~sv~~e~~--~~~il~e-sg~~------e~--P~~~-~Rw~~~~ge~YGrgp 279 (549) T protein:vir:10 212 EKSAIFYHAVEPRADRDPRKLDGRNMQFASYWLDEG--RDRIVQN-SGFR------TF--PFAI-GRFYVGTDDVYGGSP 279 (549) T ss_pred CceEEEEEEeecCCCCCccccccccCceEEEEEEec--CCEeecc-CCcc------cC--Ccce-eeeeecCCCccccch Confidence 0 11234444322111 111 1222211 1100000 0000 00 0011 111346888999999 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccccccccccccccccccceecCCcee Q lcl|NC_019524. 268 MVSALKQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKI 347 (556) Q Consensus 268 la~~l~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i 347 (556) ..-+|-.++.|+.+..+.+..+.+++.-...+... +. .....+.||.+ T Consensus 280 ~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~~v~~~-g~-------------------------------~~~~~l~pgg~ 327 (549) T protein:vir:10 280 AYDAMPDVRMANDMAKTNIRGAQKLVDPPLLANED-GV-------------------------------LDGFDLRSGAL 327 (549) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccc-cc-------------------------------cccceeccCCc Confidence 99999999999999999999999888666654321 10 01112344444 Q ss_pred eec--CCCce--eeeecCCCCCccH---HHHHHHHHHHHHHhcCCCHHHhhchhhcccchhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 348 PHL--YPGTK--LKMQPAGTPGGVG---TDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSARASMAETQKYMDSRKKLV 420 (556) Q Consensus 348 ~~L--~pGe~--i~~~~~~~p~~~f---~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~l 420 (556) ... .+|.+ ++.+... 
++| ......+-..|..++=..-..+..|-.+++-.=++.-..|..+.+-..=..| T Consensus 328 ~~~~~~~~~~~~~~pl~~~---~~~~~~~~~i~~~~~rI~~af~~d~~~~~~~~~~~TAtEV~~r~~E~~~~LGpv~~rl 404 (549) T protein:vir:10 328 NWGGLNDKGEEMVKPLLTG---KQAQIGIEFAQDTRQTINQWFYVTLFQILVDSGDMTATEVLQRAQEKGVLLAPTLGRT 404 (549) T ss_pred cccccCCCCccceeeeccc---cchhHHHHHHHHHHHHHHHHHhhhhhhhhcCCCCccHHHHHHHHHHHHHHhhHHHHHH Confidence 332 22232 4444322 233 2334444555555554332223345555555555555555555554444566 Q ss_pred HHHHHHHHHHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecC---------------------cccccch-- Q lcl|NC_019524. 421 ADRFASAIYTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGA---------------------SRGQIDE-- 477 (556) Q Consensus 421 v~~~~~pi~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p---------------------~~~~iDP-- 477 (556) ...|+.|+.++.+..+...|.||-|...-. + .-.-++++++.| ....+|| T Consensus 405 ~~E~l~Pli~R~~~il~r~g~lP~~p~~l~----~---~~~~~~i~yis~La~aq~~~~~~~i~~~~~~~~~laq~~Pe~ 477 (549) T protein:vir:10 405 QSELLGPMIAREVDILAEAGQLPDMPQELI----D---AGADVDVEYDSPLNKAMRAGEGAAILQWLQQLGIVSQFDPAA 477 (549) T ss_pred HHHHHHHHHHHHHHHHHhcCCCCCCChhhh----c---CCceeEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhH Confidence 778899999999998888999875432100 0 000111222210 0001122 Q ss_pred --hhhhHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHH---HHHHH--HcCCCCCccccccCCCCCCCCCCCCCCC Q lcl|NC_019524. 478 --KKETEAAILRIKNGLSTYEAEISRLGGDFREVFKQRARE---EGLIK--SLKLDFTGKMVEGNSTQSSNSSESTSDN 549 (556) Q Consensus 478 --~Ke~~A~~~~i~~G~~s~~~~~ae~G~D~e~v~~q~a~E---~~~~~--~~Gl~~~~~~~~~~~~~~~~~~~~~~~~ 549 (556) .=+....+..+...+.-+..+++ +.+||.+.++.. ++... 
+.+.......-..+.+..+ ....-- T Consensus 478 ld~id~d~~~~~~a~~~Gvp~~~ir----s~eev~~~r~~~~~qqq~~~~~~~a~~a~~~a~~~~~~~ta---~~~~~~ 549 (549) T protein:vir:10 478 AKVPNGARIARLLADYGGVPVEAMS----TDEELQAQQAAEAQAAQMQQMLAAAPVAAGAIKDLSDAQTA---AQTARV 549 (549) T ss_pred HhcCCHHHHHHHHHHhcCCCccccC----CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCC---CcccCC Confidence 01111222222222111111111 123333222211 11110 1110000000000000000 000000 No 247 >protein:vir:6896 Length: 523 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861872;genbank:gi:32453663;genbank:GeneID:1494298 Probab=81.52 E-value=0.085 Score=26.45 Aligned_cols=469 Identities=10% Similarity=0.045 Sum_probs=222.6 Q ss_pred CCc--chhhhHHH--HHhh----HhhcccchhhhhhhhcchhccccCCC----cccccccCCCCCH--HH-HHHHHHHHH Q lcl|NC_019524. 1 MKD--VKKTTRTR--AKKA----VDVVAETATATPMAVGGGMEGAERTT----REMFQWNPSIISP--DQ-QIAQNQDMA 65 (556) Q Consensus 1 ~sp--~~~~~r~~--a~~a----~~~~~~~~~~~~~~~~~~y~aa~~~~----r~~~~w~~~~~s~--~~-~i~~~~~~l 65 (556) |++ .+-..-.. ..+. ....+...++.. .-+||.--. --...|.....+. +. .-..+-..| T Consensus 1 m~f~~~~lf~f~~~~de~~~~~~~~~~~~S~~~p~-----~dDGa~~i~~~~~~~~~~~~~~~q~~y~~~e~~~~~~~eL 75 (523) T protein:vir:68 1 MKFNILSLFAPWAKMDERDYKDQEKENLESITSPK-----LDDGAKEYEVSENEAQQTYNAMFQRMFGSQEPGLKSTREL 75 (523) T ss_pred CCCchhhhhhhhhhhhhhhhhhhhhccCCCccccC-----CCCcceeeeccccccccccchhhhhhhhccccccchHHHH Confidence 444 11110000 0000 000000000000 011111000 0001111111111 01 112356778 Q ss_pred HHHHHHHHhcChHHHHHHHHHHhhhccC--CceeeeeccccccCCCh-hHHHHHHHHHHHHHHHHhcccccceehhcccC Q lcl|NC_019524. 66 SARAQDMVQNDGYAAGVVAVHRDSIVGS--QYKLNAKPNTIVLGAPD-GWGEEFQEVVEARFNMAAESPENWFDARRMCT 142 (556) Q Consensus 66 r~RaRdl~rNn~~a~~~v~~~~~nvVG~--Gi~~~~~~~~~~lg~~~-~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~ 142 (556) ..+=|.| .++|-+-.||+-|++-+|=. +=.|. . +.++. +..+.+.+.|...|+.-. 
..++ T Consensus 76 I~~YR~m-a~~pEvd~Av~eIVneaiv~d~~~~pV-~-----i~Ld~~~~s~~iK~kI~eeF~~Il----------~ll~ 138 (523) T protein:vir:68 76 IDTYRNL-MTNYEVDNAVSEIVSDAIVYEDDTEVV-S-----INLDNTKFSPNIKSMMLDEFNEVL----------NHLS 138 (523) T ss_pred HHHHHHH-hhccchhhHHHHhhcceeeecCCCceE-E-----EEecccccchHHHHHHHHHHHHHH----------HHhc Confidence 8899999 78999999999998887742 21221 1 22222 245666777888887644 3466 Q ss_pred HHHHHHHHhhhheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCCC----CCCCCCceEEEEEEECCCCCeEEE Q lcl|NC_019524. 143 LTGLTRLAVSGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSNP----NNVMDTPNLRSGVQLDNNGAALGY 218 (556) Q Consensus 143 f~~lq~l~~r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~~----~~~~~g~~i~~GIE~d~~Gr~vaY 218 (556) |..--.-.+|.|.+||-+|..++..+.. ++.|- ..|..|+|-.+..- .....|..++. .-.-| T Consensus 139 F~~~~~~~fR~WYVDgRi~fhKiid~k~--pk~GI---~Elr~lDPr~i~~vr~i~~~~~~g~~vi~--------~~~e~ 205 (523) T protein:vir:68 139 FQRKGSDHFRRWYVDSRIFFHKIIDPKR--PKEGI---KELRRLDPRQVQYVREVITTTEAGVKIVK--------GYKEY 205 (523) T ss_pred cchhhhHHHHhheeeeEEEEEEEeeCCC--ccccc---eeeeeeCCcceeEEEeecCCCCcchhhhh--------hhhhh Confidence 7777778899999999999887664221 11121 36777888777421 11122333333 33446 Q ss_pred EEeecCCCccccCCccccceeeccccCChhHeEeeecccCCCcc-cCCchhhHHHHHHHHHHHHHHHHHHHHHHhcceee Q lcl|NC_019524. 219 WLRKAFPGDPTDMEQWKWGYEPARFDWGRRRVIHIIEALLAGQT-RGISEMVSALKQMKMTRNFQEITLQNAVVNATYAA 297 (556) Q Consensus 219 ~i~~~hpgd~~~~~~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~-RGvs~la~~l~~l~~l~~~~dael~~a~i~A~~~~ 297 (556) ++++..- ..+...+. ........++|.+-|.|.-...-+.-. 
-=+|.|+++++.+.+|.=.+||.++=...-|-=-- T Consensus 206 f~Y~~~~-~~~~~~g~-~~~~~~~ikI~~dAI~y~hSGL~d~~~~~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRR 283 (523) T protein:vir:68 206 FIYDTSH-ESYACDGR-IYEAGTKIKIPKAAIVYAHSGLVDCCGKNIIGYLHRAIKPANQLKLLEDAVVIYRITRAPDRR 283 (523) T ss_pred eeecccc-cccccccc-ccCCCcceecchhheeeeeccceeCCCCceeccchhhhHHHHhhHHHHhhHHHHhhhccccce Confidence 6665331 21111111 000012246777777777665433222 33799999999999999999999887665553321 Q ss_pred eEeccCcccc---ccccc------------ccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCC Q lcl|NC_019524. 298 SVESELPSDV---VFGQL------------GMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAG 362 (556) Q Consensus 298 fi~~~~~~~~---~~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~ 362 (556) ++.=+.+.-. +.+.. -....+...+....+++..+..-..+. +| .|.+|+.+... T Consensus 284 vFYIDvGnlPk~KAeqYl~~im~k~kNKlvYDa~TGev~ddrk~msMlEDyWLpRRe----Gg------rgTEItTLpGg 353 (523) T protein:vir:68 284 VWYVDTGNMPSRKAAEHMQHVMNTMKNRIAYDATTGKIKNQQHIMSMTEDYWLQRRD----GK------AVTEVDTLPGA 353 (523) T ss_pred EEEEecCCCCchhHHHHHHHHHHhhcceeEEeccCCeeccchhhhhhHhhhcccccC----CC------cccceeecccc Confidence 1111111111 00000 011222233333444444444444432 22 24444444433 Q ss_pred CCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccch---hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHc Q lcl|NC_019524. 363 TPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYS---SARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNA 439 (556) Q Consensus 363 ~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYS---s~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~ 439 (556) ..-+. .+-+....+.+=.+|+||-.-|..|-++.|+- .+----+.|.+.+.++|..|..-|.+++-. ..+|. 
T Consensus 354 qnlge-m~DV~YF~kkLy~aLnVP~sRl~~~~~~f~~Gr~~EItRDEikF~KFI~rLR~rFs~lf~~~Lk~----qLilK 428 (523) T protein:vir:68 354 DNTGN-MEDVRWFRNALYMALRIPITRIPSDQGGIQFDAGTSITRDELSFGKFIRELQHKFEEIFLDPLKT----NLILK 428 (523) T ss_pred CCcCh-HHHHHHHHHHHHHHhCCcceeecCCCcceecccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHH----hhhhc Confidence 22223 34455666667778999988786553223321 122223567778888888887777766544 35777 Q ss_pred CCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHc-----CCC----CHH----HHHHHhCCCH Q lcl|NC_019524. 440 GNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKN-----GLS----TYE----AEISRLGGDF 506 (556) Q Consensus 440 G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~-----G~~----s~~----~~~ae~G~D~ 506 (556) |.+.. ..|. ...-...|.-..-.+.-.+||++--..++.. +.- |.+ .+++..-.+. T Consensus 429 giit~-eew~----------~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDeei 497 (523) T protein:vir:68 429 GIITE-DEWN----------DEINNIKIKFHRDSYFSELKDAEILERRINMLQMAEPFIGKYISHRTAMKDILQMSDEEI 497 (523) T ss_pred cCCCH-HHHH----------HHhhcceEeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHHHHHHhccCHHHH Confidence 87742 1111 1112345555666677888887765554432 100 111 2222223333 Q ss_pred HHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCC Q lcl|NC_019524. 507 REVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDN 549 (556) Q Consensus 507 e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~ 549 (556) ++...|++.|. +.|+-. 
+++++++|- T Consensus 498 ~~~~kqI~~E~----k~~~~~-------------~p~~e~~~f 523 (523) T protein:vir:68 498 EQEAKQIEEES----KEARFQ-------------DPDQEQEDF 523 (523) T ss_pred HHHHHHHHHHh----hcCCCC-------------CCchhhhcC Confidence 33333343332 223321 111111111 No 248 >protein:vir:78942 Length: 510 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522822;genbank:gi:158345057;genbank:GeneID:5687432 Probab=71.97 E-value=0.19 Score=24.55 Aligned_cols=450 Identities=12% Similarity=0.045 Sum_probs=172.1 Q ss_pred HHHHhhHhhcccchhhhhhhhcchhccccCCCccccccc-CCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHh Q lcl|NC_019524. 10 TRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWN-PSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVVAVHRD 88 (556) Q Consensus 10 ~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~-~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v~~~~~ 88 (556) ++. ++.++-. +.....|+... +..+.|. |.....+... .... ..|. =++.+..+++++++ T Consensus 1 mk~-~~~~~~~-------~lkr~~~e~~w---~e~a~~tlP~~~~~~~~~--~~~~---~~~~---~dstg~~a~~~LAa 61 (510) T protein:vir:78 1 MKS-TAAMLWE-------KLRDGSVEQRA---IEFAKTTLPYLMVDPMSG--SRGV---VEHD---FQSAGALLVNNLAA 61 (510) T ss_pred Chh-HHHHHHH-------HHhccchHHHH---HHHHHhhccccccCCCCc--cccc---ccCc---ccchHHHHHHHHHH Confidence 111 0000000 00011122110 0112221 1111111100 0000 0111 25677888888888 Q ss_pred hhccCCceeeeeccccccCCChhHHHHHH------HHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEE Q lcl|NC_019524. 89 SIVGSQYKLNAKPNTIVLGAPDGWGEEFQ------EVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLA 162 (556) Q Consensus 89 nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~------~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~ 162 (556) -+.+ ||+|..++ +-.|..+++..+++. 
..++.++..-...- ..+=-+.+||.-...++...+..|-+++ T Consensus 62 ~l~~-~ltpp~~~-WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~---~~~l~~snf~~~~~~~~~~L~~~G~a~l 136 (510) T protein:vir:78 62 KLAR-SLFPTGIP-FFRSELTDAIRREADSRDTDITEVTAALARVDRKA---TQRLFQNASLAVLTQVIKLLIVTGNALL 136 (510) T ss_pred HHHH-hhcCCCCc-ccccCCChHHhhhcccCcchHHHHHHHHHHHHHHH---HHHHHhcCcHHHHHHHHHHHHhhCeEEE Confidence 8776 46665443 333444443322111 11222222111110 1122245889888888888888887754 Q ss_pred EEeeccCCCCcCCCcccceEEEEEchh---hcCCC--------------C-----------CCCCCceEEEEEEECCC-C Q lcl|NC_019524. 163 TCEWLNPTGTTMQRRPFGTAIQMISPY---RMSNP--------------N-----------NVMDTPNLRSGVQLDNN-G 213 (556) Q Consensus 163 ~~~~~~~~~~~~~~~~~~l~lq~ie~d---rl~~~--------------~-----------~~~~g~~i~~GIE~d~~-G 213 (556) -. ++.+ ..-..||+.==++.-| ++++- . +....-.|++-|+-... + T Consensus 137 ~~---~~~~--~~~~~~pl~~y~v~~d~~G~vd~i~rr~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~V~~~~~~~ 211 (510) T protein:vir:78 137 YR---NSDE--ATVVAWSLRSYAVRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTA 211 (510) T ss_pred EE---eCCC--CeEEEEEcceeEEeeCCCcCeeEEEeeeeccHHHHHHHhhHHhhhhhhccCCCceEEEEEEEEeecCCC Confidence 32 1111 0112223221111111 11100 0 00111235555554332 2 Q ss_pred CeEE-EEEeecCCCccccCCccccceeeccccCChhHe--EeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 214 AALG-YWLRKAFPGDPTDMEQWKWGYEPARFDWGRRRV--IHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAV 290 (556) Q Consensus 214 r~va-Y~i~~~hpgd~~~~~~~~~~rv~~~~~v~a~~v--iH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~ 290 (556) +|.+ ||+--+ |.... .....|-.++ +-.--..-+|..-|.++..-+|-.+|.|+.+.++.+..+. 
T Consensus 212 ~~~~sv~~e~d--g~~i~----------~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~a~ 279 (510) T protein:vir:78 212 MDYAEMYHEID--GVRVG----------ETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYEL 279 (510) T ss_pred CcEEEEEEEec--Ceeec----------cccccccccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHH Confidence 2222 222100 00000 0001111111 1111234589999999999999999999999999999988 Q ss_pred HhcceeeeEeccCcccccccccccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHH- Q lcl|NC_019524. 291 VNATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGT- 369 (556) Q Consensus 291 i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~- 369 (556) .++.....+. +.+...+. ....-+.|.|..-.+ ++|+++... +.++|. T Consensus 280 ~a~~~~~lv~-p~g~~~~~----------------------------~l~~~~~g~~v~g~~-~~v~~~~~~-~~~d~~~ 328 (510) T protein:vir:78 280 ESLEVLNLVD-EAKGAVVD----------------------------DYQDAEMGDYVPGGA-EAVRAYERG-DYNKMAA 328 (510) T ss_pred HhhcCCcccC-Cccccchh----------------------------hhccCCCceeecCCc-ccccccccC-cccchHH Confidence 8876664433 22110000 000011122221111 234443322 223443 Q ss_pred --HHHHHHHHHHHHhcCCCHHHhhchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCC Q lcl|NC_019524. 370 --DYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPG 447 (556) Q Consensus 370 --~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~ 447 (556) .....+-..|..++= +.....|-.+++-.=++.-..|....+-..=..|...|+.|+.++.+..+.-.|.+++|.. 
T Consensus 329 ~~~~i~~~~~rI~~aF~--~~l~~~~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~gl~p~p~~ 406 (510) T protein:vir:78 329 IQQSLQAVVVRLNQAFM--YGANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITK 406 (510) T ss_pred HHHHHHHHHHHHHHHHh--hccccCCCCCcCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCcc Confidence 344444445554431 1111223333433334443344444444444467788899999999988888888877765 Q ss_pred cccc---cccchhhHHHh-------hC-eeeecCcccccchhhhhHHHHHHHHcCCC-CHHHHHHHhCCCHHHHHHHHHH Q lcl|NC_019524. 448 KNWR---MFYDPMMRDAL-------CN-AEWIGASRGQIDEKKETEAAILRIKNGLS-TYEAEISRLGGDFREVFKQRAR 515 (556) Q Consensus 448 ~~~~---~~~~~~~~~a~-------~~-~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~-s~~~~~ae~G~D~e~v~~q~a~ 515 (556) .--. .+-.+..+..- .. +.-+++ +..+||.=+....+..+...+. ++..++ ++.|||.+.++. T Consensus 407 ~~~~~~v~~is~Laraq~~~~l~~~~q~l~~~~~-~~q~~~~id~d~~~~~~a~~~Gv~p~~iv----rs~eev~a~~~~ 481 (510) T protein:vir:78 407 QHKPAIETGLPALSRSAAVQSMLNASQVIAGLAP-IAQLDPRISLPKMMDTIWAAFSVDTSQFY----KSADELQAEAEE 481 (510) T ss_pred cccceeeecccHHHHHHHHHHHHHHHHHHHHhcC-hhhhhhcCCHHHHHHHHHHHhCCChhhhc----CCHHHHHHHHHH Confidence 3111 01111111100 00 011111 2333443333333333332222 122222 122333222221 Q ss_pred H-HHHHHH----cC-CCCCccccccCCCCC Q lcl|NC_019524. 516 E-EGLIKS----LK-LDFTGKMVEGNSTQS 539 (556) Q Consensus 516 E-~~~~~~----~G-l~~~~~~~~~~~~~~ 539 (556) . .+..+. .. +.-..+.. ...++- T Consensus 482 ~~~q~~~~~~~~~a~~~~~~~~~-~~~~g~ 510 (510) T protein:vir:78 482 QRRQAAQAQAAQETLLEGASDMT-NALAGV 510 (510) T ss_pred HHHHHHHHHHHHHHHHHhhhhhc-ccCCCC Confidence 1 011111 11 11111110 111111 No 249 >protein:vir:8883 Length: 543 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813772;genbank:gi:29366727;genbank:GeneID:1258836 Probab=71.32 E-value=0.2 Score=24.45 Aligned_cols=464 Identities=10% Similarity=0.030 Sum_probs=190.4 Q ss_pred HHHhhHhhcccchhhhhhhhcchhccccCCCc--------cccccc-CCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHH Q lcl|NC_019524. 
11 RAKKAVDVVAETATATPMAVGGGMEGAERTTR--------EMFQWN-PSIISPDQQIAQNQDMASARAQDMVQNDGYAAG 81 (556) Q Consensus 11 ~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r--------~~~~w~-~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~ 81 (556) =|+.- +...+.... ...|+... +.| ..+.|. |.....+.... .. .+.+ + =++.+.. T Consensus 1 ~~~~~-----~~~~~~~~~-~~r~~~l~-~~R~~~e~~w~e~~~y~lP~~~~~~~~~~--~~---~~~~-~--~dst~~~ 65 (543) T protein:vir:88 1 MAETK-----REGLAEEGA-KAVYERLK-NDRVPYETRAENCAKVTIPSLFPKDSDNS--ST---DYTT-P--WQAVGAR 65 (543) T ss_pred Ccccc-----cCcchHHHH-HHHHHHHH-HHHhHHHHHHHHHHHHhccccCCCCCCcc--cc---cccc-c--ccchHHH Confidence 00100 000011100 11122211 111 111221 21111111100 00 0011 1 3778889 Q ss_pred HHHHHHhhhccCCceeeeeccccccCCChhHHH------HHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhhe Q lcl|NC_019524. 82 VVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGE------EFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFL 155 (556) Q Consensus 82 ~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~------~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~ 155 (556) +++.+++.+.+ ||+|. ++ +-.|..++.... .-...+++++..-.+.- .++=-+.+||.-...++...+ T Consensus 66 a~~~Laa~l~~-~ltP~-~~-WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~---~~~~~~snf~~~~~~~~~~L~ 139 (543) T protein:vir:88 66 GLNNLSAKVML-ALFPL-QS-WMKLKVSEWQAKQLVSDPSQLAVVEQGLGMVERIL---MSYMEANSYRVTLFELIRQLA 139 (543) T ss_pred HHHHHHHHHHH-hhcCC-Cc-ccccccChHHHhcccCChhhHHHHHHHHHHHHHHH---HHHHHhcCcHHHHHHHHHHHH Confidence 99999999998 57785 64 766666553321 11222333332222211 122235689999988999988 Q ss_pred ecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCC-CCCCCC-----------------CceEEEEEEECCCCCeEE Q lcl|NC_019524. 156 MTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSN-PNNVMD-----------------TPNLRSGVQLDNNGAALG 217 (556) Q Consensus 156 ~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~-~~~~~~-----------------g~~i~~GIE~d~~Gr~va 217 (556) +.|-+++-+. ...+ ......++++..|..-.|.. 
+.+.-+ ...++.+-+.|.+.+..= T Consensus 140 ~~G~a~ly~~--~~~~--~~~~~~~~~~~pl~~y~v~~d~~G~v~~i~r~~~~~~~~l~~~~~~~v~~~~~~~p~~~~~v 215 (543) T protein:vir:88 140 LAGTALIYLP--PPDA--SSNSYNPMKLYTLHNHVVQRDAFGNVLQIVTLDKVAYAALPEDVRNSLSGGQEYKPEQELEV 215 (543) T ss_pred hhCceeeeec--cCcc--ccceecceEEeEcceEEEeeCCCCCeeeeeeeeeccHHHHhHHhhHHHHHHhhcCCccceEE Confidence 8888874331 1111 11122233333332211211 111000 001223334455555555 Q ss_pred EEEeecCCC-ccccC-Cccccceeecc-ccCChh--HeEeeecccCCCcccCCchhhHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019524. 218 YWLRKAFPG-DPTDM-EQWKWGYEPAR-FDWGRR--RVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQEITLQNAVVN 292 (556) Q Consensus 218 Y~i~~~hpg-d~~~~-~~~~~~rv~~~-~~v~a~--~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~dael~~a~i~ 292 (556) ||.....+. +.+.. ....-..|+.. ...+-. =.+..-....+|..-|.++..-+|-.++.|+.+..+.+..+..+ T Consensus 216 ~~~V~pr~~~~~~~~~~~~~~~~v~~~~~~~~~~e~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~ 295 (543) T protein:vir:88 216 YTHIYIDDESGDFLSYQEIEGVEVDGSDGQYPQDALPWIAVRWTKRDGEHYGRSHVEEYLGDLNSLESLNEAMIKFAMIS 295 (543) T ss_pred EEEEEeecCCCcccccccccCeeeecCCCccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 654432222 21110 00111112111 111100 12233334678999999999999999999999999999999999 Q ss_pred cceeeeEeccCcccccccccccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCCCCccHH--- Q lcl|NC_019524. 293 ATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGTPGGVGT--- 369 (556) Q Consensus 293 A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~f~--- 369 (556) +.....+..+ +...... ...-++|.|+.-. .+++.++... .+++|. 
T Consensus 296 ~~pp~~v~~~-g~~~~~~----------------------------~~~~~~g~~v~g~-~~~v~~~~~~-~~~~~~~~~ 344 (543) T protein:vir:88 296 SKVVGLVNPN-GITQVRR----------------------------LVKAQTGDFVAGR-KADIEFLQLE-KTADFTVAK 344 (543) T ss_pred hcCceeeccc-cccchhh----------------------------cccCCCceeecCC-CCcceeeecc-cccchhHHH Confidence 8888665422 1100000 0001122222211 2344433322 112332 Q ss_pred HHHHHHHHHHHHhcCCCHHHh-hchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCCc Q lcl|NC_019524. 370 DYEQSLLRNIAASLGMSYEQF-SRDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAGNVPLPPGK 448 (556) Q Consensus 370 ~F~~~~lr~iaaglGi~ye~l-~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G~l~~p~~~ 448 (556) .....+-..|..++=+. .+ ..|-++++-.=++.-..|....+-..=..|-..|+.|+.++.+..+...|.||-+... T Consensus 345 ~~i~~~~~rI~~af~~~--~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~ 422 (543) T protein:vir:88 345 SVADAIEARLSYVFMLN--SAVQRSGERVTAEEIRYVASELEDTLGGVYSILSQELQLPIVRVLLNQLQATQQIPNLPQE 422 (543) T ss_pred HHHHHHHHHHHHHHhhh--hhccCCCCcccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchh Confidence 33344444555444222 22 2454445444444444444444444445677889999999999999999998755431 Q ss_pred ccccccchhhHHHhhCeeeecC-----------------------cccccchhhhhHHHHHHHHcCCCCHHHHHHHhCCC Q lcl|NC_019524. 449 NWRMFYDPMMRDALCNAEWIGA-----------------------SRGQIDEKKETEAAILRIKNGLSTYEAEISRLGGD 505 (556) Q Consensus 449 ~~~~~~~~~~~~a~~~~~w~~p-----------------------~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae~G~D 505 (556) .++++++.+ +.+.+.+.=+....+..+. ...|.| T Consensus 423 -------------~v~~~~vs~l~~l~r~~~~~~l~~~~~~v~~~~~p~vld~id~d~~~~~~a----------~~~Gv~ 479 (543) T protein:vir:88 423 -------------AVEPTVTTGAEALGRGQDLDKLTQFLNAVATVSQLNGDPDLNVNNIKLRLA----------NAIGID 479 (543) T ss_pred -------------ceeeeEEecHHHHHHHHHHHHHHHHHHHHHhccchhhhccCCHHHHHHHHH----------HHhCCC Confidence 123333311 0011111111111111111 112444 Q ss_pred HHHH------HHHHHHHHH-------HHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 
506 FREV------FKQRAREEG-------LIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 506 ~e~v------~~q~a~E~~-------~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 556 (556) +..+ .+++..+++ .+...|.....+...+ +.......+..+-.+--..+| T Consensus 480 ~~~i~r~~~e~~~~~~q~~~q~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~p~~~~ 542 (543) T protein:vir:88 480 TAGLLLTEAEKAQAQSQEMLKQGGLNAAAGIGSGVAAQATAS-PEAMESAMDTAGVQPGPIATQ 542 (543) T ss_pred hhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhchhhhhccC-hHHHHHHhhhcCCCCCCCCCC Confidence 4322 122211111 1112222221111111 111011111112222223333 No 250 >protein:vir:97265 Length: 513 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:584 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294521;genbank:gi:149408242;genbank:GeneID:5237130 Probab=70.88 E-value=0.2 Score=24.38 Aligned_cols=446 Identities=12% Similarity=0.082 Sum_probs=181.9 Q ss_pred HHHhhHhhcccchh---h--hhhhhcchhccccCCCccccc----ccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHH Q lcl|NC_019524. 11 RAKKAVDVVAETAT---A--TPMAVGGGMEGAERTTREMFQ----WNPSIISPDQQIAQNQDMASARAQDMVQNDGYAAG 81 (556) Q Consensus 11 ~a~~a~~~~~~~~~---~--~~~~~~~~y~aa~~~~r~~~~----w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~ 81 (556) -+.+.++.+..+-+ . ......+.-.++ .+.++. +-|... .+. .... .+| ...+.=.++.+. T Consensus 1 m~~~~~~~v~~~h~~y~a~~~~W~~ird~~~G---~~~~r~~g~~YLPk~~--~E~-~~~Y---~~r-l~rA~~~n~~~~ 70 (513) T protein:vir:97 1 MADKDPKSPATTSGAYDQMLPRWHVIETLLGG---TEAMREAGETYLPRHQ--EET-DKGY---QER-LASAVLLNMVEQ 70 (513) T ss_pred CCCCCCCCCCcCCHHHHHHHHHHHHHHHHhcC---hHHHHhhcccCCCCCC--CCC-HHHH---HHH-HhcccCCChHHH Confidence 11122222221111 0 111111111111 222332 222221 110 1111 111 111122334444 Q ss_pred HHHHHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceE Q lcl|NC_019524. 82 VVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVL 161 (556) Q Consensus 82 ~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f 161 (556) + ++..||.=|+--++... ..-..++.. |+++ ||-.|. 
+++++.+.+++..+..|=|+ T Consensus 71 t----l~~l~G~vf~k~p~~~~-----------~~p~~~~~~---l~~d----~D~~G~-~L~~f~~~~~~~~l~~G~~~ 127 (513) T protein:vir:97 71 T----LDTLSGKPFSEPIKLNE-----------DVPKAIEET---ILPD----VDLQGN-NLDVFARQWFREGMAKALCH 127 (513) T ss_pred H----HHHHhhhhhhcCcccCc-----------CchHHHHHH---Hhhc----cCCCCC-CHHHHHHHHHHHHHhcCeEE Confidence 4 44445543332222110 011223322 3332 888887 99999999999999999999 Q ss_pred EEEeeccCCCCcCCCc-----------ccceEEEEEchhhcCC-CCCCCCCce------EEEEE-EECCCCCeEE--EEE Q lcl|NC_019524. 162 ATCEWLNPTGTTMQRR-----------PFGTAIQMISPYRMSN-PNNVMDTPN------LRSGV-QLDNNGAALG--YWL 220 (556) Q Consensus 162 ~~~~~~~~~~~~~~~~-----------~~~l~lq~ie~drl~~-~~~~~~g~~------i~~GI-E~d~~Gr~va--Y~i 220 (556) ++.-+-...+. ..+. ..|+ +.+|.|+.|-+ ....-+|.. |+.=+ +-|..|.-.- |++ T Consensus 128 ilVD~P~~~~~-~~~~~~T~Ade~~~~~rPy-~~~~~~e~IinW~~~~v~G~~~L~~v~l~E~~~~~Dgf~~~~~~q~rv 205 (513) T protein:vir:97 128 VLIDMPRPAPR-EDGQPRTLADDRREGLRPY-WVMIKPECLLFARSEVINGVEVLQHVRIIEHYMEQDGFAEVCKRRIRV 205 (513) T ss_pred EEEecCCCCCc-cchhHHhHHHHHhhccCce-EEEecHhhhcCcceeccCcceeeeeEEEEEEEeecCCCcceEEEEEEE Confidence 88765432211 1111 1232 67799998853 222223321 22222 2233333222 333 Q ss_pred eecCCCcc--------ccCCccccceeeccccCChhHeEeee-cccCCCcccCCchhhHHHH-HHHHHHHHHHHHHHHHH Q lcl|NC_019524. 221 RKAFPGDP--------TDMEQWKWGYEPARFDWGRRRVIHII-EALLAGQTRGISEMVSALK-QMKMTRNFQEITLQNAV 290 (556) Q Consensus 221 ~~~hpgd~--------~~~~~~~~~rv~~~~~v~a~~viH~f-~~~r~gQ~RGvs~la~~l~-~l~~l~~~~dael~~a~ 290 (556) +. ||.+ ......+|..+... ..+-..|=.+| ...+-+=.-|-|+|..... .+++.... |.+.... 
T Consensus 206 L~--~g~~~v~r~~~~~~~~~~e~~~~~~g-~~~l~~IP~v~~~~~~~~~~~~~pPLl~LA~ln~~hy~~~--Sd~~~il 280 (513) T protein:vir:97 206 LE--PGLVQLWEPVKKSNAQKEEWALADEW-ATGLNYVPLVTFYADRQGFMMGKPPLLDLAHLNVAHWQSA--SDQRHIL 280 (513) T ss_pred Ee--CceEEEEEeecCCCccccceEEecCC-CCcCCceeEEEEecCCCCCCCCccchHHHHHHHHHHHhhh--hhHHHHH Confidence 32 2221 11112233322211 12222222222 1345566678888876432 23333222 2233333 Q ss_pred Hhccee-eeEeccCcccccccccccccccccccccccccccccccccccceecCCceeeecC-CCceeeeecCCCCCcc- Q lcl|NC_019524. 291 VNATYA-ASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLY-PGTKLKMQPAGTPGGV- 367 (556) Q Consensus 291 i~A~~~-~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~-pGe~i~~~~~~~p~~~- 367 (556) --+++. .||+ ...++ ....+.+.++.+..|+ +|-++.++.++-.+-. T Consensus 281 ~~~~~P~l~~~-G~~~~-----------------------------~~~~i~iG~~~~~~lpe~~~~~~yie~~g~~i~~ 330 (513) T protein:vir:97 281 TVSRFPILACS-GASGE-----------------------------DSDPVVVGPNKVLYNPDPAGRFYYVEHTGQAIAA 330 (513) T ss_pred Hhcccceeeee-cCCcC-----------------------------CCCceEeeccccccCCCCCCcceeeccCchhHHH Confidence 333333 3433 11100 0113568889999998 5889999998744321 Q ss_pred HHHHHHHHHHHHHHhcCCCHHHhhchhhcccchhHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHcCCccCC Q lcl|NC_019524. 368 GTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSARASMAETQKYMDSRKKL--VADRFASAIYTLWLEEEVNAGNVPLP 445 (556) Q Consensus 368 f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~--lv~~~~~pi~~~~l~~a~l~G~l~~p 445 (556) ....++.+..+| ..+|.. .|.. 
+..| .|+-+..+++.+....++.+ -+...++-+++++- ...|.- + T Consensus 331 ~~~~l~~le~qm-~~~Ga~--ll~~--~~~~-~Ta~a~~~~~~~~~S~L~~~a~~le~al~~~l~~~a---~wlg~~--~ 399 (513) T protein:vir:97 331 GRTDLKDLEEQM-AGYGAE--FLKR--KTGG-QTATARALDSAEATSDLSAMTGLFEDALAQALDITA---DWLRLG--P 399 (513) T ss_pred HHHHHHHHHHHH-HHHHHH--hhcc--CCcc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHhCCC--C Confidence 234444444443 223322 2332 1234 56666666666666665543 23333333333221 112211 1 Q ss_pred CCcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHH---hCC-----CHHHHHHHHHHHH Q lcl|NC_019524. 446 PGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISR---LGG-----DFREVFKQRAREE 517 (556) Q Consensus 446 ~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae---~G~-----D~e~v~~q~a~E~ 517 (556) ....+ .++-+|... .+|+ .++++...++..|..|.+....+ +|. |.++..++++.+. T Consensus 400 ~~~~v-----------~in~dF~~~---~~~~-~~~~al~~a~~~G~is~~t~~~~L~r~gvl~~d~d~~~~~e~~~~~~ 464 (513) T protein:vir:97 400 NGGTV-----------ELVKDYDLE---EMDA-PGLQALQVAREKRDISRKTYLNGLRLRGVLPEDFDEDEDWEELMEEI 464 (513) T ss_pred CccEE-----------EeccccCcc---cCCH-HHHHHHHHHHhCCCCCHHHHHHHHHhccCCCccCCHHHHHHHHHHhh Confidence 10000 001122111 1233 48899999999999988766544 465 5666666665553 Q ss_pred HH-HHHcCCCCCccccccCCCCCCCCCCCCCCCCCC-cCCC Q lcl|NC_019524. 518 GL-IKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNE-ETTQ 556 (556) Q Consensus 518 ~~-~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~-e~~~ 556 (556) .. .-..|+..+... ..+....+. ...++.+. 
|+.+ T Consensus 465 ~~~~~~~~~d~~~~~--~~~~~~~~~--~~~~~~~~~~~~~ 501 (513) T protein:vir:97 465 SEAMGRAGLDLDPAQ--KNPPEGGEG--EGEGEGEGGEGGE 501 (513) T ss_pred hhccCCCCccccccC--CCCCCCCCC--CCCCCCCCCCCCC Confidence 21 112233322111 111000000 00001111 1111 No 251 >protein:vir:100598 Length: 516 # NCBI annotation: gp20 head portal vertex protein # Family: family:all:1036 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656382;genbank:gi:109290133;genbank:GeneID:4156576 Probab=64.98 E-value=0.29 Score=23.52 Aligned_cols=460 Identities=12% Similarity=0.092 Sum_probs=213.8 Q ss_pred CCcchhhhHHH------HHhhHhhcccchhhhhhhhcchhcccc------CC---CcccccccCCCCCHHHHHHHHHHHH Q lcl|NC_019524. 1 MKDVKKTTRTR------AKKAVDVVAETATATPMAVGGGMEGAE------RT---TREMFQWNPSIISPDQQIAQNQDMA 65 (556) Q Consensus 1 ~sp~~~~~r~~------a~~a~~~~~~~~~~~~~~~~~~y~aa~------~~---~r~~~~w~~~~~s~~~~i~~~~~~l 65 (556) |+=.+-..-.+ ......-.+...+ ..-..+||. .+ +..++.+.. .+..+ ..-..| T Consensus 1 ~~~~~lf~f~~~~d~~~~~~~~~~~~~s~~-----~p~~~DGa~~i~~~~~~~~~~g~~~~~~d----~~~~~-~~~~~L 70 (516) T protein:vir:10 1 MKFLDLFKFWDRVDQNEYDERLKQGHESIA-----TPKKDDGATEIEAREGESSYNALMQQFFG----IDNNI-SGTKDL 70 (516) T ss_pred CCchHhcccccchhhHHHHhhhcCCCCccc-----CCCCccCceeeecCcccccccceeeeeec----ccCcc-ccHHHH Confidence 11111110000 0000000000000 001122221 00 111111111 11111 233567 Q ss_pred HHHHHHHHhcChHHHHHHHHHHhhhccC--CceeeeeccccccCCCh-hHHHHHHHHHHHHHHHHhcccccceehhcccC Q lcl|NC_019524. 66 SARAQDMVQNDGYAAGVVAVHRDSIVGS--QYKLNAKPNTIVLGAPD-GWGEEFQEVVEARFNMAAESPENWFDARRMCT 142 (556) Q Consensus 66 r~RaRdl~rNn~~a~~~v~~~~~nvVG~--Gi~~~~~~~~~~lg~~~-~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~ 142 (556) ..+=|.|+ ++|=+-+||+-|++-+|=. +=.|. .+.++. +..+.+.+.|...|+.-. 
..++ T Consensus 71 I~~YR~ma-~~pEvd~Av~eIvneaiv~d~~~~pV------~l~l~~~e~s~sik~kI~eeF~~Il----------~ll~ 133 (516) T protein:vir:10 71 INTYRQLT-NNPEVERAVANIVNEAVVYEKGHKVV------SLDLDDTEFSSSIKDKILEEFDEIC----------RLLD 133 (516) T ss_pred HHHHHHhh-hccchhHHHHHhhcceeEecCCCceE------EEEecccccchHHHHHHHHHHHHHH----------HHhc Confidence 77778776 5677888888888877642 21221 122222 245667777888887643 2456 Q ss_pred HHHHHHHHhhhheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCC----CCCCCCCceEEEEEEECCCCCeEEE Q lcl|NC_019524. 143 LTGLTRLAVSGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSN----PNNVMDTPNLRSGVQLDNNGAALGY 218 (556) Q Consensus 143 f~~lq~l~~r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~----~~~~~~g~~i~~GIE~d~~Gr~vaY 218 (556) |..--.-.+|.|.+||-+|..+...++.. |- ..|..|+|-.|.. +....++..++.|+. -| T Consensus 134 F~~~~~~~fR~WYVDgRi~fhKiid~~k~----GI---~elr~lDPr~i~~vR~i~~~~~~~~~v~~~~~--------e~ 198 (516) T protein:vir:10 134 ASRKLDTLFRRWYIDSRIFFHKIMPNPKE----GI---VELRRLDPRHVEYYREIVTSDVGGTSVVKGYR--------EF 198 (516) T ss_pred cchhhhHHHHhhhhcceEEEEEEecCccc----ce---eeeeeeCCcceeeEEeeecccCcchhhhhcee--------ee Confidence 77777788999999999998876543322 21 3677788887752 111233333444332 45 Q ss_pred EEeecCCCccc-cCCccccceeeccccCChhHeEeeecccCCCcccC-CchhhHHHHHHHHHHHHHHHHHHHHHHhccee Q lcl|NC_019524. 219 WLRKAFPGDPT-DMEQWKWGYEPARFDWGRRRVIHIIEALLAGQTRG-ISEMVSALKQMKMTRNFQEITLQNAVVNATYA 296 (556) Q Consensus 219 ~i~~~hpgd~~-~~~~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~RG-vs~la~~l~~l~~l~~~~dael~~a~i~A~~~ 296 (556) |++. +|+.. 
...+..+.- ....+++.+-|..+....-+.-..+ +|.|+++++.+.+|.=.+||.++=...-|-=- T Consensus 199 ~~Y~--~~~~~~~~~g~~~~~-~~~ikI~~daI~y~hSGl~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeR 275 (516) T protein:vir:10 199 FVYT--TGNEGYAYNGRLFEP-NTRIKIPRSAIVYAHSGLQDCSDRGIVGYLHNAVKPANQLKLLEDALVIYRITRAPER 275 (516) T ss_pred eeee--cCccceeccccccCC-CCceecchhheeeeecCcccCCCCceeceehhhhHhHHhhHHHHhhHHHHhhhccccc Confidence 5554 33311 111111100 0113455555544443332222111 79999999999999999999988766555332 Q ss_pred eeEeccCcccc---ccccc------------ccccccccccccccccccccccccccceecCCceeeecCCCceeeeecC Q lcl|NC_019524. 297 ASVESELPSDV---VFGQL------------GMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPA 361 (556) Q Consensus 297 ~fi~~~~~~~~---~~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~ 361 (556) -++.=+.+.-. +.+.. -....+...+....+++..+..-..+. +| .|.+|+.+.. T Consensus 276 RvFYIDVGnLPk~KAeqYl~~iM~k~KNklvYDa~TGev~ddrk~msMlEDyWLpRRe----Gg------rgTEItTLpG 345 (516) T protein:vir:10 276 RVFYIDVGNMPNRKATEYVNGIMQSLKNRVVYDSNTGTVKNQKRNLSMTEDYWLMRRD----GK------SVTEVTSLPG 345 (516) T ss_pred eEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccchhhhhhHhhhcccccC----CC------cccceeeccc Confidence 21111111111 00000 012222233334445555444444442 22 2444444443 Q ss_pred CCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhh-cccc---hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 362 GTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYT-KTNY---SSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEV 437 (556) Q Consensus 362 ~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s-~~nY---Ss~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~ 437 (556) ...-+. .+-+....+.+=.+|+||-.-|..|-. +.++ |-+----+.|.+.+.+.|..|..-|.+++-. 
..+ T Consensus 346 gqnlge-m~DV~YF~kkLy~aLnVP~SRl~~e~~~~~~~Gr~~EItRDEiKF~KFI~rLR~rFs~lF~~~L~~----qLi 420 (516) T protein:vir:10 346 AQTMGE-MDDVRWFNKKLYEALRIPLSRMPRDDGGMVIGGQDMAITRDELDFRKFIVQLQHNFEEIFLDPLKT----NLI 420 (516) T ss_pred cCCcCh-HHHHHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHH----Hhh Confidence 322233 344555666677789999887754421 1111 2222234678888889998887777666443 357 Q ss_pred HcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHc---------CCCCHHHHHHH----hCC Q lcl|NC_019524. 438 NAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKN---------GLSTYEAEISR----LGG 504 (556) Q Consensus 438 l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~---------G~~s~~~~~ae----~G~ 504 (556) |.|.+.. ..|. ...-...|.-..-.+.-.+||++--..++.. ..-|.+-+.++ .-. T Consensus 421 lKgIit~-eeW~----------~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~s~~yi~k~ILr~tDe 489 (516) T protein:vir:10 421 YKKIILE-SEWE----------EQINNIKVNFHQDSYYTELKDIETLRQRVDALSQIEPYVGKYVSHDYVMKNILQMTDE 489 (516) T ss_pred hcCCCCH-HHHH----------HHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhcCCHh Confidence 7777642 2111 1112345555666677888887755544432 11222222222 122 Q ss_pred CHHHHHHHHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCC Q lcl|NC_019524. 505 DFREVFKQRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDN 549 (556) Q Consensus 505 D~e~v~~q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~ 549 (556) +..+...|++.|. 
+.|+-.+ ++++.+- T Consensus 490 ei~~~~k~I~~E~----~~~~~~~--------------p~~e~~f 516 (516) T protein:vir:10 490 QIAQEEKQIEKEA----NVKRFQN--------------PENEDDF 516 (516) T ss_pred HHHHHHHHHHHhh----hCCCCCC--------------CCccccC Confidence 2233333333331 2232111 1111111 No 252 >protein:vir:1538 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052106;swissprot:trembl:q9t110;genbank:gi:9634032;uniprot:Q9T110;genbank:GeneID:1262384 Probab=63.76 E-value=0.31 Score=23.36 Aligned_cols=458 Identities=11% Similarity=0.064 Sum_probs=184.1 Q ss_pred chhhhHHHHHhhHhhcccchhh---hhhhhcchhccccCCCcccccccCCCCCHHHHHHHHHHHHHHHHHHHHhcChHHH Q lcl|NC_019524. 4 VKKTTRTRAKKAVDVVAETATA---TPMAVGGGMEGAERTTREMFQWNPSIISPDQQIAQNQDMASARAQDMVQNDGYAA 80 (556) Q Consensus 4 ~~~~~r~~a~~a~~~~~~~~~~---~~~~~~~~y~aa~~~~r~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~ 80 (556) |....+. ++....... ........|+... +..+.|.-+...++.. .... ..+.+ . =++.+. T Consensus 1 m~~~~~~------~~~~~~~k~r~~~l~~~R~~~e~~w---~e~~~~~lP~~~~~~~-~~~~---~~~~~-~--~dst~~ 64 (535) T protein:vir:15 1 MADSKRT------GLGEDGAKATYDRLTNDRRAYETRA---ENCAQYTIPSLFPKES-DNES---TDYTT-P--WQAVGA 64 (535) T ss_pred CCccchh------ccchHHHHHHHHHHHHHhhHHHHHH---HHHHHHhcccccCCCC-Cccc---ccccc-c--ccccHH Confidence 2222211 011111100 0011111122110 0111111111100000 0000 00011 1 267788 Q ss_pred HHHHHHHhhhccCCceeeeeccccccCCChhHHHHH------HHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhh Q lcl|NC_019524. 81 GVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEF------QEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGF 154 (556) Q Consensus 81 ~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~------~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~ 154 (556) .+++.+++.+.+ ||+|. ++ +-.|...+....++ ...++.++..-.+.- .++=-+.+||.-...++... 
T Consensus 65 ~a~~~Laa~l~~-~ltP~-~~-WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~---~~~l~~snf~~~~~~~~~~L 138 (535) T protein:vir:15 65 RGLNNLASKLML-ALFPM-QS-WMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERII---MNYIESNSYRVTLFECLKQL 138 (535) T ss_pred HHHHHHHHHHHH-hhcCC-Cc-ccccccChHHHhccCCCcchHHHHHHHHHHHHHHH---HHHHHhcCcHHHHHHHHHHH Confidence 999999999988 67784 64 76666554322111 112222222211111 12223568999999999998 Q ss_pred eecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCC-CCC--------------------------------CCCCc Q lcl|NC_019524. 155 LMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSN-PNN--------------------------------VMDTP 201 (556) Q Consensus 155 ~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~-~~~--------------------------------~~~g~ 201 (556) ++.|-+++-+... .+ ...+|+.--|.--.|.. +.+ ..+.. T Consensus 139 ~~~G~a~l~~~~~--~~-----~~~~f~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v 211 (535) T protein:vir:15 139 IVAGNALLYLPEP--EG-----SYNPMKLYRLSSYVVQRDAYGNVLQIVTRDQIAFGALPEDVRSAVEKAGGEKKMDEMV 211 (535) T ss_pred HhhCceeEEeecC--CC-----CceeeEEEEcCeeEEeeCCCCCeeEEEEeEeecHHHHHHHHhHhhhccccccCCCCce Confidence 8888876544221 11 11222222222111110 000 01112 Q ss_pred eEEEEEEECCCC-CeEEEEEeecCCCccccCCccccceeeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHH Q lcl|NC_019524. 202 NLRSGVQLDNNG-AALGYWLRKAFPGDPTDMEQWKWGYEPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRN 280 (556) Q Consensus 202 ~i~~GIE~d~~G-r~vaY~i~~~hpgd~~~~~~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~ 280 (556) .|+.=|..++.+ ....|+-....+.....+ ++ .+...| .+..-....+|..-|.++...+|-.++.|+. 
T Consensus 212 ~v~~~v~~~~~~~~~~~~~e~~g~~~~~~~~---~~----~~~~~P---~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~ 281 (535) T protein:vir:15 212 DVYTHVYLDEESGDYLKYEEVEDVEIDGSDA---TY----PTDAMP---YIPVRMVRIDGESYGRSYCEEYLGDLRSLEN 281 (535) T ss_pred eEEEEEEEecCCCcEEEEEEeeCcccccccc---cc----ccccCC---ceeeeeeecCCCccccchHHHHHHHHHHHHH Confidence 456666666543 333333222111110000 00 000011 2333345678999999999999999999999 Q ss_pred HHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccccccccccccccccccceecCCceeeecCCCceeeeec Q lcl|NC_019524. 281 FQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQP 360 (556) Q Consensus 281 ~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~ 360 (556) +..+.+..+..++.-...+... +.... ......++|.|+.-.+ ++++++. T Consensus 282 l~~~~l~~~~~~~~p~~lv~~~-g~~~~----------------------------~~l~~~~~g~~v~g~~-~~v~~~~ 331 (535) T protein:vir:15 282 LQEAIVKMSMISAKVIGLVNPA-GITQP----------------------------RRLTKAQTGDFVPGRR-EDIDFLQ 331 (535) T ss_pred HHHHHHHHHHHHhcCceeeccc-ccccc----------------------------hhcccCCceeeecCCc-ccceeee Confidence 9999999999998777665421 10000 0000112333333333 3344443 Q ss_pred CCCCCccH---HHHHHHHHHHHHHhcCCCHHHhhchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 361 AGTPGGVG---TDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEV 437 (556) Q Consensus 361 ~~~p~~~f---~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~ 437 (556) .. ..++| ......+-..|..++=+. .+...|-++++-.=++.-..|....+-..=..|-..|+.|+.++.+..+. 
T Consensus 332 ~~-~~~~~~~~~~~i~~~~~~I~~af~~~-~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~ 409 (535) T protein:vir:15 332 LE-KQADFTVAKAVSDQIEARLSYAFMLN-SAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQ 409 (535) T ss_pred cc-cccchhHHHHHHHHHHHHHHHHHhhh-hcccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHH Confidence 22 11233 233344444454444111 11124544454444444444444444444456788899999999999999 Q ss_pred HcCCccCCCCcccccccchhhHHHhhCeeeecC-----------------------cccccchhhhhHHHHHHHHcCCCC Q lcl|NC_019524. 438 NAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGA-----------------------SRGQIDEKKETEAAILRIKNGLST 494 (556) Q Consensus 438 l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p-----------------------~~~~iDP~Ke~~A~~~~i~~G~~s 494 (556) ..|.|+.+.... ++++++.| ..+-+|+.=+....+..+...+.- T Consensus 410 r~g~lP~~p~~~-------------v~~~yis~La~aqr~~~~~~l~~~~~~la~~~P~~ld~~id~d~~~~~~a~~~Gv 476 (535) T protein:vir:15 410 ATSQIPELPKEA-------------VEPTISTGLEAIGRGQDLDKLERCISAWAALAPMQGDPDINLAVIKLRIANAIGI 476 (535) T ss_pred hcCCCCCCCccc-------------eeEEEecHHHHHHHHHHHHHHHHHHHHHHhcChhhhhccCCHHHHHHHHHHHcCC Confidence 999998554311 22333322 011223211222222222222221 Q ss_pred HHHHHHHhCCCHHHHHHHHHHH------HHHHHHcCCCCCccccccCCCCCCCCCCCC--CCCCCCcCC Q lcl|NC_019524. 495 YEAEISRLGGDFREVFKQRARE------EGLIKSLKLDFTGKMVEGNSTQSSNSSEST--SDNPNEETT 555 (556) Q Consensus 495 ~~~~~ae~G~D~e~v~~q~a~E------~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~--~~~~~~e~~ 555 (556) +...+.. +.+++.+.++.. .+.+...|=.. ...+..+++... 
.+..=-+++ T Consensus 477 p~~~i~~---~~eev~~~~~q~~~~~~~~~~a~~~g~~~-------~~~~~~~p~~~~~~~~~~g~~~~ 535 (535) T protein:vir:15 477 DTSGILL---TDEQKQALMMQDAAQTGIENAAATGGAGV-------GALATSSPEAMQGAAAQAGLDAT 535 (535) T ss_pred ChhhhcC---CHHHHHHHHHHHHHHHHHHHHHHHHHhhc-------cchhccChHHHHHHHhccCCCCC Confidence 1111100 111211111111 11111111000 000000000000 000000111 No 253 >protein:vir:2198 Length: 536 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041995;swissprot:sw:p03728;genbank:gi:9627467;goa:P03728;uniprot:P03728;genbank:GeneID:1261033 Probab=56.43 E-value=0.46 Score=22.45 Aligned_cols=446 Identities=12% Similarity=0.073 Sum_probs=182.5 Q ss_pred chhhhHHHHHhhHhhcccchhhh---hhhhcchhccccCCCccccccc-CCCCCHHHHHHHHHHHHHHHHHHHHhcChHH Q lcl|NC_019524. 4 VKKTTRTRAKKAVDVVAETATAT---PMAVGGGMEGAERTTREMFQWN-PSIISPDQQIAQNQDMASARAQDMVQNDGYA 79 (556) Q Consensus 4 ~~~~~r~~a~~a~~~~~~~~~~~---~~~~~~~y~aa~~~~r~~~~w~-~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a 79 (556) |.. .|+ ...+.....+ .+.....|+... +..+.|. |.....+...... +... -=++.+ T Consensus 1 m~~-~~~------~~~~~~~~~r~~~lk~~R~~~e~~w---~e~~~~~lP~~~~~~~~~~~~------~~~~--~~dst~ 62 (536) T protein:vir:21 1 MAE-KRT------GLAEDGAKSVYERLKNDRAPYETRA---QNCAQYTIPSLFPKDSDNAST------DYQT--PWQAVG 62 (536) T ss_pred Ccc-hhh------chhHHHHHHHHHHHHHHhhHHHHHH---HHHHHHhcccccCCCCCcccc------cccc--cccccH Confidence 222 111 1111111111 000111122110 0111111 1111111000000 0011 235678 Q ss_pred HHHHHHHHhhhccCCceeeeeccccccCCChhH-------------HHHHHHHHHHHHHHHhcccccceehhcccCHHHH Q lcl|NC_019524. 80 AGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGW-------------GEEFQEVVEARFNMAAESPENWFDARRMCTLTGL 146 (556) Q Consensus 80 ~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~-------------~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~l 146 (556) ..+++.+++.+.+ ||+|. ++ +-.|..++.. .++|-+.+++... 
++=-+.+||.- T Consensus 63 ~~a~~~Laa~l~~-~ltP~-~~-WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~----------~~l~~snf~~~ 129 (536) T protein:vir:21 63 ARGLNNLASKLML-ALFPM-QT-WMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIM----------NYIESNSYRVT 129 (536) T ss_pred HHHHHHHHHHHHH-hhcCC-Cc-ccccccChhhhhccccchhhHHHHHHHHHHHHHHHH----------HHHHhcCcHHH Confidence 8899999999988 67784 65 6666554422 2223333333322 22235688888 Q ss_pred HHHHhhhheecCceEEEEeeccCCCCcCCCcccceEEEEEc----------------------hhhcCCCCC-------- Q lcl|NC_019524. 147 TRLAVSGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMIS----------------------PYRMSNPNN-------- 196 (556) Q Consensus 147 q~l~~r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie----------------------~drl~~~~~-------- 196 (556) ...++...++.|-+++-+.-.. ++.+.+|+.-.|. ...|....+ T Consensus 130 ~~~~~~~L~~~G~a~ly~~e~~------~~~~~~f~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~~l~~~fg~~~~~~~~ 203 (536) T protein:vir:21 130 LFEALKQLVVAGNVLLYLPEPE------GSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGG 203 (536) T ss_pred HHHHHHHHHhHCcEeEEEeeCC------CCceeeEEEEEcCeEEEeeCCCCCeeEEeeeeeccHHHHHHhhhhhhccccc Confidence 8888988888887664432111 1111222222221 111110000 Q ss_pred --C-CCCceEEEEEEECCCCCeEEEEEeecCCCccccCCccccceeeccccCChhHeEeeecccCCCcccCCchhhHHHH Q lcl|NC_019524. 197 --V-MDTPNLRSGVQLDNNGAALGYWLRKAFPGDPTDMEQWKWGYEPARFDWGRRRVIHIIEALLAGQTRGISEMVSALK 273 (556) Q Consensus 197 --~-~~g~~i~~GIE~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~ 273 (556) . .+.-.|++.|..+..+.+..+|.- .-|.........+ ....-=.+..-....+|..-|.++..-+|- T Consensus 204 ~~~~~~~v~v~~~v~~~~~~~~~~~~~e--~~g~~v~~~~g~~-------~f~~~P~i~~Rw~~~~ge~YGrgp~~~~l~ 274 (536) T protein:vir:21 204 EKKADETIDVYTHIYLDEDSGEYLRYEE--VEGMEVQGSDGTY-------PKEACPYIPIRMVRLDGESYGRSYIEEYLG 274 (536) T ss_pred ccccccceeEEEEEEEecCCCcEEEEec--cCCeeeccccCcc-------ccccCCeeeeeeeecCCCccccchHHHHHH Confidence 0 112247777777766544433321 1111111111100 000001233333456899999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccccccccccccccccccceecCCceeeecCCC Q lcl|NC_019524. 
274 QMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPG 353 (556) Q Consensus 274 ~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pG 353 (556) .++.|+.+..+.+..+.+++.-...+. +.+... .......++|.|+.-.+ T Consensus 275 D~k~L~~l~~~~l~~~~~a~~~~~lv~-p~g~~~----------------------------~~~~~~~~~g~~v~g~~- 324 (536) T protein:vir:21 275 DLRSLENLQEAIVKMSMISSKVIGLVN-PAGITQ----------------------------PRRLTKAQTGDFVTGRP- 324 (536) T ss_pred HHHHHHHHHHHHHHHHHHHhcCCcccC-cccccc----------------------------hhhhccCCCcceecCCc- Confidence 999999999999998888876554443 111100 00011123444433233 Q ss_pred ceeeeecCCCCCccH---HHHHHHHHHHHHHhcCCCHHHhh-chhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 354 TKLKMQPAGTPGGVG---TDYEQSLLRNIAASLGMSYEQFS-RDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIY 429 (556) Q Consensus 354 e~i~~~~~~~p~~~f---~~F~~~~lr~iaaglGi~ye~l~-~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~ 429 (556) +++.+..-. .+++| ......+-..|..++=+ ..++ .|-++++-.=++.-..|....+-..=..|-..|+.|+. T Consensus 325 ~~v~~~~~~-~~~~~~~~~~~i~~~~~rI~~af~~--~~l~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli 401 (536) T protein:vir:21 325 EDISFLQLE-KQADFTVAKAVSDAIEARLSFAFML--NSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLV 401 (536) T ss_pred ccceeeecc-ccccchHHHHHHHHHHHHHHHHHhh--hhcccCCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHH Confidence 333332211 23344 23344455555555522 1222 45555544444444444444444444467788899999 Q ss_pred HHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecC-----------------------cccccchhhhhHHHHH Q lcl|NC_019524. 430 TLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGA-----------------------SRGQIDEKKETEAAIL 486 (556) Q Consensus 430 ~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p-----------------------~~~~iDP~Ke~~A~~~ 486 (556) ++-+..+...|.|+-|... .++++++.+ +..-+|+.=+....+. 
T Consensus 402 ~r~~~il~r~g~lP~~p~~-------------~v~~~~vs~l~~l~r~~~~~~l~~~~~~la~~~Pe~ld~~id~d~~~~ 468 (536) T protein:vir:21 402 RVLLKQLQATQQIPELPKE-------------AVEPTISTGLEAIGRGQDLDKLERCVTAWAALAPMRDDPDINLAMIKL 468 (536) T ss_pred HHHHHHHHhCCCCCCCChh-------------hccceEEecHHHHHHHHHHHHHHHHHHHHHhhchhhhcccCCHHHHHH Confidence 9999999999998754321 122333221 0011122111222222 Q ss_pred HHHcCCCCHHHHHHHhCCCHHHHH-------HHHHHHH------HHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCC Q lcl|NC_019524. 487 RIKNGLSTYEAEISRLGGDFREVF-------KQRAREE------GLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNE 552 (556) Q Consensus 487 ~i~~G~~s~~~~~ae~G~D~e~v~-------~q~a~E~------~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~ 552 (556) .+. ...|.||..++ +.++... +.+..+|-..... ...++...+......+-+|-- T Consensus 469 ~~a----------~~~Gv~p~~~irt~eev~~~r~q~~~~~~~~~~a~~~~~~~~~~-~~~~~~~~~~~~~~~g~~~~~ 536 (536) T protein:vir:21 469 RIA----------NAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQ-ATASPEAMAAAADSVGLQPGI 536 (536) T ss_pred HHH----------HHcCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HhcChhhHHhhhhccccCCCC Confidence 221 12244443322 2221111 1111111000000 000000000000000000000 No 254 >protein:vir:10447 Length: 536 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848294;genbank:gi:30387485;genbank:GeneID:1733984 Probab=44.34 E-value=0.81 Score=21.09 Aligned_cols=442 Identities=12% Similarity=0.074 Sum_probs=180.8 Q ss_pred chhhhHHHHHhhHhhcccchhhh---hhhhcchhccccCCCccccccc-CCCCCHHHHHHHHHHHHHHHHHHHHhcChHH Q lcl|NC_019524. 4 VKKTTRTRAKKAVDVVAETATAT---PMAVGGGMEGAERTTREMFQWN-PSIISPDQQIAQNQDMASARAQDMVQNDGYA 79 (556) Q Consensus 4 ~~~~~r~~a~~a~~~~~~~~~~~---~~~~~~~y~aa~~~~r~~~~w~-~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a 79 (556) |.. .|+ ...+.....+ .+.....|+... +..+.|. |.....+..... . +... 
.=++.+ T Consensus 1 m~~-~~~------~~~~~~~~~r~~~l~~~R~~~e~~w---~e~~~~~lP~~~~~~~~~~~---~---~~~~--~~dst~ 62 (536) T protein:vir:10 1 MAE-KRT------GLAEDGAKSVYERLKNDRAPYETRA---QNCAQYTIPSLFPKDSDNAS---T---DYQT--PWQAVG 62 (536) T ss_pred Ccc-hhh------chhHHHHHHHHHHHHHHhhHHHHHH---HHHHHHhcccccCCCCCccc---c---cccc--cccccH Confidence 222 111 1111111111 001111122110 0111111 111111100000 0 0011 235578 Q ss_pred HHHHHHHHhhhccCCceeeeeccccccCCChhH-------------HHHHHHHHHHHHHHHhcccccceehhcccCHHHH Q lcl|NC_019524. 80 AGVVAVHRDSIVGSQYKLNAKPNTIVLGAPDGW-------------GEEFQEVVEARFNMAAESPENWFDARRMCTLTGL 146 (556) Q Consensus 80 ~~~v~~~~~nvVG~Gi~~~~~~~~~~lg~~~~~-------------~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~l 146 (556) ..+++.+++.+.+ ||+|. ++ +-.|..++.. .++|-+.+++... ++=-+.+||.- T Consensus 63 ~~a~~~Laa~l~~-~ltP~-~~-WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~----------~~l~~snf~~~ 129 (536) T protein:vir:10 63 ARGLNNLASKLML-ALFPM-QT-WMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIM----------NYIESNSYRVT 129 (536) T ss_pred HHHHHHHHHHHHh-hhcCC-Cc-ccccccChhhhhccccchhhHHHHHHHHHHHHHHHH----------HHHHhcCcHHH Confidence 8899999999988 67784 65 6666554422 2223333333322 22235688988 Q ss_pred HHHHhhhheecCceEEEEeeccCCCCcCCCcccceEEEEEchhhcCC-C---------------------CC-------- Q lcl|NC_019524. 147 TRLAVSGFLMTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPYRMSN-P---------------------NN-------- 196 (556) Q Consensus 147 q~l~~r~~~~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~drl~~-~---------------------~~-------- 196 (556) ...++...++.|-+++-+.-. . ++.+.+|+.-.|..-.|.. + .+ T Consensus 130 ~~~~~~~L~~~G~a~ly~~e~--~----~~~~~~~~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~~l~~~fg~~~~~~~~ 203 (536) T protein:vir:10 130 LFEALKQLVVAGNVLLYLPEP--E----GSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGG 203 (536) T ss_pred HHHHHHHHHhHCcEeEEEeeC--C----CCceeeEEEEEcCeEEEeeCCCCCeeEEeeeeeccHHHHHHhhhhhhccccc Confidence 888898988888776443211 1 1111222222221111110 0 00 Q ss_pred --C-CCCceEEEEEEECCCCCeEEEE-EeecCCCccccCCccccceeeccccCChhHeEeeecccCCCcccCCchhhHHH Q lcl|NC_019524. 
197 --V-MDTPNLRSGVQLDNNGAALGYW-LRKAFPGDPTDMEQWKWGYEPARFDWGRRRVIHIIEALLAGQTRGISEMVSAL 272 (556) Q Consensus 197 --~-~~g~~i~~GIE~d~~Gr~vaY~-i~~~hpgd~~~~~~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l 272 (556) . .+.-.|++.|..+..+.+..+| -....+.....+ ...+... + .+..-....+|..-|.++..-+| T Consensus 204 ~~~~~~~v~v~~~V~~~~~~~~~~~~~e~~g~~v~~~~g-~~~f~~~--------P-~i~~Rw~~~~ge~YGrgp~~~~l 273 (536) T protein:vir:10 204 EKKADETIDVYTHIYLDEASGEYLRYEEVEGMEVQGSDG-TYPKEAC--------P-YIPIRMVRLDGESYGRSYIEEYL 273 (536) T ss_pred ccCcccceEEEEEEEEecCCCcEEEEEeecCcccccccc-ccccccC--------C-ceeeeeeecCCCccccchHHHHH Confidence 0 1122477777776543333332 221111100000 0000000 1 23333345689999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccccccccccccccccccceecCCceeeecCC Q lcl|NC_019524. 273 KQMKMTRNFQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYP 352 (556) Q Consensus 273 ~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~p 352 (556) -.++.|+.+..+.+..+.+++.-...+. +.+... .......++|.|+.-.+ T Consensus 274 ~D~k~L~~l~~~~l~~~~~a~~~~~lv~-p~g~~~----------------------------~~~~~~~~~g~~v~g~~ 324 (536) T protein:vir:10 274 GDLRSLENLQEAIVKMSMISSKVIGLVN-PAGITQ----------------------------PRRLTKAQTGDFVTGRP 324 (536) T ss_pred HHHHHHHHHHHHHHHHHHHHhcCCcccC-cccccc----------------------------hhhhccCCCcceecCCc Confidence 9999999999999998888876554443 111100 00011123444433233 Q ss_pred CceeeeecCCCCCccH---HHHHHHHHHHHHHhcCCCHHHhh-chhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 353 GTKLKMQPAGTPGGVG---TDYEQSLLRNIAASLGMSYEQFS-RDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAI 428 (556) Q Consensus 353 Ge~i~~~~~~~p~~~f---~~F~~~~lr~iaaglGi~ye~l~-~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi 428 (556) +++.+..-. 
.+++| ......+-..|..++=+ ..++ .|-++++-.=++.-..|....+-..=..|-..|+.|+ T Consensus 325 -~~v~~~~~~-~~~~~~~~~~~i~~~~~rI~~af~~--~~l~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pl 400 (536) T protein:vir:10 325 -EDISFLQLE-KQADFTVAKAVSDAIEARLSFAFML--NSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPL 400 (536) T ss_pred -ccceeeecc-ccccchHHHHHHHHHHHHHHHHHhh--hhcccCCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHH Confidence 333332211 23344 23344455555555522 1222 4555554444444444444444444456778889999 Q ss_pred HHHHHHHHHHcCCccCCCCcccccccchhhHHHhhCeeeecCc-----------------------ccccchhhhhHHHH Q lcl|NC_019524. 429 YTLWLEEEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGAS-----------------------RGQIDEKKETEAAI 485 (556) Q Consensus 429 ~~~~l~~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~-----------------------~~~iDP~Ke~~A~~ 485 (556) .++-+..+...|.|+-|.. ..++++++.+= ..-+|+.=+....+ T Consensus 401 i~r~~~il~r~g~lP~~p~-------------~~v~~~~vs~l~~l~r~~~~~~l~~~~~~la~~~P~~ld~~id~d~~~ 467 (536) T protein:vir:10 401 VRVLLKQLQATQQIPELPK-------------EAVEPTISTGLEAIGRGQDLDKLERCVTAWAALAPMRDDPDINLAMIK 467 (536) T ss_pred HHHHHHHHHhCCCCCCCCh-------------hhccceEEecHHHHHHHHHHHHHHHHHHHHHhhchhhhcccCCHHHHH Confidence 9999999999999875432 11233332210 01112111111111 Q ss_pred HHHHcCCCCHHHHHHHhCCCHHHHH-------HHHHHHH------HHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCC Q lcl|NC_019524. 486 LRIKNGLSTYEAEISRLGGDFREVF-------KQRAREE------GLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNE 552 (556) Q Consensus 486 ~~i~~G~~s~~~~~ae~G~D~e~v~-------~q~a~E~------~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~ 552 (556) ..+. ...|.||..++ +.++... +.+..+|-..... ...++ +.-...++. T Consensus 468 ~~~a----------~~~Gv~p~~~irt~eev~~~r~q~~~~~~~~~~a~~~~~~~~~~-~~~~~-------~~~~~~~~~ 529 (536) T protein:vir:10 468 LRIA----------NAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQ-ATASP-------EAMAAAADS 529 (536) T ss_pred HHHH----------HHcCCCchhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HhcCc-------hhHHhhhhc Confidence 1111 12244443322 2221111 1111111000000 00000 000000000 Q ss_pred cCCC Q lcl|NC_019524. 
553 ETTQ 556 (556) Q Consensus 553 e~~~ 556 (556) -..| T Consensus 530 ~g~~ 533 (536) T protein:vir:10 530 VGLQ 533 (536) T ss_pred cccC Confidence 0001 No 255 >protein:vir:107404 Length: 555 # NCBI annotation: Bbp21 # Family: family:all:481 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958690;genbank:gi:41179382;genbank:GeneID:2717198 Probab=39.52 E-value=1 Score=20.55 Aligned_cols=456 Identities=10% Similarity=0.005 Sum_probs=188.8 Q ss_pred hcccchhhhhhhhcchhccccCCCc--ccccccCCCCCHHHHHHHHHH------HHHHHHHHHHhcChHHHHHHHHHHhh Q lcl|NC_019524. 18 VVAETATATPMAVGGGMEGAERTTR--EMFQWNPSIISPDQQIAQNQD------MASARAQDMVQNDGYAAGVVAVHRDS 89 (556) Q Consensus 18 ~~~~~~~~~~~~~~~~y~aa~~~~r--~~~~w~~~~~s~~~~i~~~~~------~lr~RaRdl~rNn~~a~~~v~~~~~n 89 (556) +..+..+ . ..-..|+... +.| ....|. ....-+.+.+. .-..+.|.---=++.+..+++.+++- T Consensus 1 M~~~~~~--~-~l~~r~~~l~-~~R~~~e~~w~----e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~LAa~ 72 (555) T protein:vir:10 1 MAEQTER--K-LLLSRWGQLR-TERESWMSHWK----EISDYLLPRAGRFFVQDRNRGEKRHNNILDNTGTRALRVLAAG 72 (555) T ss_pred CCCcccH--H-HHHHHHHHHH-HHhhHHHHHHH----HHHHHhCcccccccCCCCCcchhcccccccccHHHHHHHHHHH Confidence 2222211 0 1112233321 111 111110 00000000000 00111111112356778888888888 Q ss_pred hccCCceeeeeccccccCCC-hh-----HHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEE Q lcl|NC_019524. 90 IVGSQYKLNAKPNTIVLGAP-DG-----WGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLAT 163 (556) Q Consensus 90 vVG~Gi~~~~~~~~~~lg~~-~~-----~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~ 163 (556) +.+ ||+|..++=.+ |... .+ +.++|-+.+++.-. 
++=-+.+||.-...++...++-|-+++- T Consensus 73 L~~-~ltpp~~~WF~-l~~~d~~l~e~~~v~~~L~~ve~~~~----------~~l~~snf~~~~~~~~~~Lv~~G~a~l~ 140 (555) T protein:vir:10 73 MMA-GMTSPARPWFR-LTTSIPELDESAAVKAWLANVTRLML----------MIFAKSNTYRALHSMYEELGAFGTASSI 140 (555) T ss_pred HHH-hhcCCCCcccc-cccCcccccchHHHHHHHHHHHHHHH----------HHHHhcCcHHHHHHHHHHHHhhCceEEE Confidence 776 45564443222 3322 11 22333333333211 2222468888888888888888887754 Q ss_pred EeeccCCCCcCCCcccceEEEEEc-----------------h---------hhcCCC----CC-CC-C-CceEEEEEEEC Q lcl|NC_019524. 164 CEWLNPTGTTMQRRPFGTAIQMIS-----------------P---------YRMSNP----NN-VM-D-TPNLRSGVQLD 210 (556) Q Consensus 164 ~~~~~~~~~~~~~~~~~l~lq~ie-----------------~---------drl~~~----~~-~~-~-g~~i~~GIE~d 210 (556) +.....+.. .-..+|+.==+|+ . +.|+.. ++ .+ + .-.|++=|+.. T Consensus 141 ~~~d~~~~~--rf~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr 218 (555) T protein:vir:10 141 VLPDFDAVV--YHHSLTAGEYAIAADNQGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHAIEPR 218 (555) T ss_pred EecCCCceE--EEEEeecceeEEeeCCCCCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeec Confidence 322111100 0001111111111 1 011100 00 01 1 12345555532 Q ss_pred CCC---------CeEE-EEEeecCCCccccCCccccceeeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHH Q lcl|NC_019524. 211 NNG---------AALG-YWLRKAFPGDPTDMEQWKWGYEPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRN 280 (556) Q Consensus 211 ~~G---------r~va-Y~i~~~hpgd~~~~~~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~ 280 (556) ..+ .|.+ ||+...-.+......+ .+...| .+..--...+|..-|.++..-+|-.++.|+. 
T Consensus 219 ~~~~~~~~~~~~~p~~s~~~~~~~d~~~vl~es-gy~e~P---------~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~ 288 (555) T protein:vir:10 219 ADRDPSKRDDRNMAWKSVYFEPGADETRTLRES-GYRSFR---------ALCPRWALVGGDIYGNSPAMEALGDVRQLQH 288 (555) T ss_pred cCcCcCCCCccccceEEEEEEeccCCccccccC-CcccCC---------ceeeeeeecCCCccccchHHHHHHHHHHHHH Confidence 222 3333 4444322111111000 010001 1222234558999999999999999999999 Q ss_pred HHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccccccccccccccccccceecCCceeeecCCCceeeeec Q lcl|NC_019524. 281 FQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQP 360 (556) Q Consensus 281 ~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~ 360 (556) +..+.+..+.+++.....+-... ......+.||.+....+|..-..+. T Consensus 289 l~~~~l~~~~~~~~pp~~v~~~~--------------------------------~~~~~~~~pgg~~~v~~g~~~d~~~ 336 (555) T protein:vir:10 289 EQLRKAQAIDYKSNPPLQLPVSA--------------------------------KNQDISTVPGGLSYVDAAAPNGGIR 336 (555) T ss_pred HHHHHHHHHHHHhcCceeecccc--------------------------------ccccceeccccccccccCCCCccee Confidence 99999999988776544433211 0112345677666555433322222 Q ss_pred CC-CCCccHHHH---HHHHHHHHHHhcCCCHHHhh--chhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 361 AG-TPGGVGTDY---EQSLLRNIAASLGMSYEQFS--RDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLE 434 (556) Q Consensus 361 ~~-~p~~~f~~F---~~~~lr~iaaglGi~ye~l~--~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~ 434 (556) |. .++.+|... ...+...|..++=.++.+.. .|-+.++-.=++.-..|....+-..=..|...|+.|+.++-+. 
T Consensus 337 ~~~~~~~d~~~~~~~i~~~~~rI~~af~~dlf~~l~~~~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~ 416 (555) T protein:vir:10 337 TAFEVNLDLSHLLADIVDVRERIKASFYADLFLMLANGTNPQMTATEVAERHEEKLLMLGPVLERMHNEILDPLIELTFQ 416 (555) T ss_pred cccccccchHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHH Confidence 22 233455333 34444555555433322111 3555555555555555555555555556778899999999999 Q ss_pred HHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHH-hCCCHHHHHHHH Q lcl|NC_019524. 435 EEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISR-LGGDFREVFKQR 513 (556) Q Consensus 435 ~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae-~G~D~e~v~~q~ 513 (556) .+...|.||-|+..-. ..-+++ .+|-|+--++.... +. ++...-+.+.. .+.+|+- ++.+ T Consensus 417 il~r~g~lP~~P~~l~---------~~~i~v-------~yis~La~aq~~~~-~~-~i~~~l~~i~~laq~~P~v-ld~i 477 (555) T protein:vir:10 417 RMVEANILPPPPQEMQ---------GVDLNV-------EFVSMLAQAQRAIA-TN-SVDRFVGNLGAVAGIKPEV-LDKF 477 (555) T ss_pred HHHhcCCCCCCchhhc---------CceeEE-------EeccHHHHHHHHHH-HH-HHHHHHHHHHHHhcCChhh-hhcC Confidence 9999999986642100 000112 24555533332211 11 11111111211 2344432 2222 Q ss_pred HHH---HHHHHHcCCCCCcccccc---------------------CCCCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 514 ARE---EGLIKSLKLDFTGKMVEG---------------------NSTQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 514 a~E---~~~~~~~Gl~~~~~~~~~---------------------~~~~~~~~~~~~~~~~~~e~~~ 556 (556) .-+ ...++.+|++........ ...+......-.+-++..++.. 
T Consensus 478 d~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~~~a~~~~q~~~~~~~~~~~~~~~~~~~ 544 (555) T protein:vir:10 478 DADRWADTYADMLGIDPELIVPGNQVALIRKQRADQQQAAQQAALLNQGADTAAKLGSVDTSKQNAL 544 (555) T ss_pred CHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccCcchhH Confidence 222 122234555431110000 0000000000001111111111 No 256 >protein:vir:107822 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996631;genbank:gi:45580765;genbank:GeneID:2767898 Probab=39.52 E-value=1 Score=20.55 Aligned_cols=456 Identities=10% Similarity=0.005 Sum_probs=188.8 Q ss_pred hcccchhhhhhhhcchhccccCCCc--ccccccCCCCCHHHHHHHHHH------HHHHHHHHHHhcChHHHHHHHHHHhh Q lcl|NC_019524. 18 VVAETATATPMAVGGGMEGAERTTR--EMFQWNPSIISPDQQIAQNQD------MASARAQDMVQNDGYAAGVVAVHRDS 89 (556) Q Consensus 18 ~~~~~~~~~~~~~~~~y~aa~~~~r--~~~~w~~~~~s~~~~i~~~~~------~lr~RaRdl~rNn~~a~~~v~~~~~n 89 (556) +..+..+ . ..-..|+... +.| ....|. ....-+.+.+. .-..+.|.---=++.+..+++.+++- T Consensus 1 M~~~~~~--~-~l~~r~~~l~-~~R~~~e~~w~----e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~LAa~ 72 (555) T protein:vir:10 1 MAEQTER--K-LLLSRWGQLR-TERESWMSHWK----EISDYLLPRAGRFFVQDRNRGEKRHNNILDNTGTRALRVLAAG 72 (555) T ss_pred CCCcccH--H-HHHHHHHHHH-HHhhHHHHHHH----HHHHHhCcccccccCCCCCcchhcccccccccHHHHHHHHHHH Confidence 2222211 0 1112233321 111 111110 00000000000 00111111112356778888888888 Q ss_pred hccCCceeeeeccccccCCC-hh-----HHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEE Q lcl|NC_019524. 90 IVGSQYKLNAKPNTIVLGAP-DG-----WGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLAT 163 (556) Q Consensus 90 vVG~Gi~~~~~~~~~~lg~~-~~-----~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~ 163 (556) +.+ ||+|..++=.+ |... .+ +.++|-+.+++.-. 
++=-+.+||.-...++...++-|-+++- T Consensus 73 L~~-~ltpp~~~WF~-l~~~d~~l~e~~~v~~~L~~ve~~~~----------~~l~~snf~~~~~~~~~~Lv~~G~a~l~ 140 (555) T protein:vir:10 73 MMA-GMTSPARPWFR-LTTSIPELDESAAVKAWLANVTRLML----------MIFAKSNTYRALHSMYEELGAFGTASSI 140 (555) T ss_pred HHH-hhcCCCCcccc-cccCcccccchHHHHHHHHHHHHHHH----------HHHHhcCcHHHHHHHHHHHHhhCceEEE Confidence 776 45564443222 3322 11 22333333333211 2222468888888888888888887754 Q ss_pred EeeccCCCCcCCCcccceEEEEEc-----------------h---------hhcCCC----CC-CC-C-CceEEEEEEEC Q lcl|NC_019524. 164 CEWLNPTGTTMQRRPFGTAIQMIS-----------------P---------YRMSNP----NN-VM-D-TPNLRSGVQLD 210 (556) Q Consensus 164 ~~~~~~~~~~~~~~~~~l~lq~ie-----------------~---------drl~~~----~~-~~-~-g~~i~~GIE~d 210 (556) +.....+.. .-..+|+.==+|+ . +.|+.. ++ .+ + .-.|++=|+.. T Consensus 141 ~~~d~~~~~--rf~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr 218 (555) T protein:vir:10 141 VLPDFDAVV--YHHSLTAGEYAIAADNQGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHAIEPR 218 (555) T ss_pred EecCCCceE--EEEEeecceeEEeeCCCCCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeec Confidence 322111100 0001111111111 1 011100 00 01 1 12345555532 Q ss_pred CCC---------CeEE-EEEeecCCCccccCCccccceeeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHH Q lcl|NC_019524. 211 NNG---------AALG-YWLRKAFPGDPTDMEQWKWGYEPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRN 280 (556) Q Consensus 211 ~~G---------r~va-Y~i~~~hpgd~~~~~~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~ 280 (556) ..+ .|.+ ||+...-.+......+ .+...| .+..--...+|..-|.++..-+|-.++.|+. 
T Consensus 219 ~~~~~~~~~~~~~p~~s~~~~~~~d~~~vl~es-gy~e~P---------~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~ 288 (555) T protein:vir:10 219 ADRDPSKRDDRNMAWKSVYFEPGADETRTLRES-GYRSFR---------ALCPRWALVGGDIYGNSPAMEALGDVRQLQH 288 (555) T ss_pred cCcCcCCCCccccceEEEEEEeccCCccccccC-CcccCC---------ceeeeeeecCCCccccchHHHHHHHHHHHHH Confidence 222 3333 4444322111111000 010001 1222234558999999999999999999999 Q ss_pred HHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccccccccccccccccccceecCCceeeecCCCceeeeec Q lcl|NC_019524. 281 FQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQP 360 (556) Q Consensus 281 ~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~ 360 (556) +..+.+..+.+++.....+-... ......+.||.+....+|..-..+. T Consensus 289 l~~~~l~~~~~~~~pp~~v~~~~--------------------------------~~~~~~~~pgg~~~v~~g~~~d~~~ 336 (555) T protein:vir:10 289 EQLRKAQAIDYKSNPPLQLPVSA--------------------------------KNQDISTVPGGLSYVDAAAPNGGIR 336 (555) T ss_pred HHHHHHHHHHHHhcCceeecccc--------------------------------ccccceeccccccccccCCCCccee Confidence 99999999988776544433211 0112345677666555433322222 Q ss_pred CC-CCCccHHHH---HHHHHHHHHHhcCCCHHHhh--chhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 361 AG-TPGGVGTDY---EQSLLRNIAASLGMSYEQFS--RDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLE 434 (556) Q Consensus 361 ~~-~p~~~f~~F---~~~~lr~iaaglGi~ye~l~--~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~ 434 (556) |. .++.+|... ...+...|..++=.++.+.. .|-+.++-.=++.-..|....+-..=..|...|+.|+.++-+. 
T Consensus 337 ~~~~~~~d~~~~~~~i~~~~~rI~~af~~dlf~~l~~~~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~ 416 (555) T protein:vir:10 337 TAFEVNLDLSHLLADIVDVRERIKASFYADLFLMLANGTNPQMTATEVAERHEEKLLMLGPVLERMHNEILDPLIELTFQ 416 (555) T ss_pred cccccccchHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHH Confidence 22 233455333 34444555555433322111 3555555555555555555555555556778899999999999 Q ss_pred HHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHH-hCCCHHHHHHHH Q lcl|NC_019524. 435 EEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISR-LGGDFREVFKQR 513 (556) Q Consensus 435 ~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae-~G~D~e~v~~q~ 513 (556) .+...|.||-|+..-. ..-+++ .+|-|+--++.... +. ++...-+.+.. .+.+|+- ++.+ T Consensus 417 il~r~g~lP~~P~~l~---------~~~i~v-------~yis~La~aq~~~~-~~-~i~~~l~~i~~laq~~P~v-ld~i 477 (555) T protein:vir:10 417 RMVEANILPPPPQEMQ---------GVDLNV-------EFVSMLAQAQRAIA-TN-SVDRFVGNLGAVAGIKPEV-LDKF 477 (555) T ss_pred HHHhcCCCCCCchhhc---------CceeEE-------EeccHHHHHHHHHH-HH-HHHHHHHHHHHHhcCChhh-hhcC Confidence 9999999986642100 000112 24555533332211 11 11111111211 2344432 2222 Q ss_pred HHH---HHHHHHcCCCCCcccccc---------------------CCCCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 514 ARE---EGLIKSLKLDFTGKMVEG---------------------NSTQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 514 a~E---~~~~~~~Gl~~~~~~~~~---------------------~~~~~~~~~~~~~~~~~~e~~~ 556 (556) .-+ ...++.+|++........ ...+......-.+-++..++.. 
T Consensus 478 d~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~~~a~~~~q~~~~~~~~~~~~~~~~~~~ 544 (555) T protein:vir:10 478 DADRWADTYADMLGIDPELIVPGNQVALIRKQRADQQQAAQQAALLNQGADTAAKLGSVDTSKQNAL 544 (555) T ss_pred CHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccCcchhH Confidence 222 122234555431110000 0000000000001111111111 No 257 >protein:vir:98506 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996583;genbank:gi:45569514;genbank:GeneID:2767834 Probab=39.52 E-value=1 Score=20.55 Aligned_cols=456 Identities=10% Similarity=0.005 Sum_probs=188.8 Q ss_pred hcccchhhhhhhhcchhccccCCCc--ccccccCCCCCHHHHHHHHHH------HHHHHHHHHHhcChHHHHHHHHHHhh Q lcl|NC_019524. 18 VVAETATATPMAVGGGMEGAERTTR--EMFQWNPSIISPDQQIAQNQD------MASARAQDMVQNDGYAAGVVAVHRDS 89 (556) Q Consensus 18 ~~~~~~~~~~~~~~~~y~aa~~~~r--~~~~w~~~~~s~~~~i~~~~~------~lr~RaRdl~rNn~~a~~~v~~~~~n 89 (556) +..+..+ . ..-..|+... +.| ....|. ....-+.+.+. .-..+.|.---=++.+..+++.+++- T Consensus 1 M~~~~~~--~-~l~~r~~~l~-~~R~~~e~~w~----e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~LAa~ 72 (555) T protein:vir:98 1 MAEQTER--K-LLLSRWGQLR-TERESWMSHWK----EISDYLLPRAGRFFVQDRNRGEKRHNNILDNTGTRALRVLAAG 72 (555) T ss_pred CCCcccH--H-HHHHHHHHHH-HHhhHHHHHHH----HHHHHhCcccccccCCCCCcchhcccccccccHHHHHHHHHHH Confidence 2222211 0 1112233321 111 111110 00000000000 00111111112356778888888888 Q ss_pred hccCCceeeeeccccccCCC-hh-----HHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEE Q lcl|NC_019524. 90 IVGSQYKLNAKPNTIVLGAP-DG-----WGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLAT 163 (556) Q Consensus 90 vVG~Gi~~~~~~~~~~lg~~-~~-----~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~ 163 (556) +.+ ||+|..++=.+ |... .+ +.++|-+.+++.-. 
++=-+.+||.-...++...++-|-+++- T Consensus 73 L~~-~ltpp~~~WF~-l~~~d~~l~e~~~v~~~L~~ve~~~~----------~~l~~snf~~~~~~~~~~Lv~~G~a~l~ 140 (555) T protein:vir:98 73 MMA-GMTSPARPWFR-LTTSIPELDESAAVKAWLANVTRLML----------MIFAKSNTYRALHSMYEELGAFGTASSI 140 (555) T ss_pred HHH-hhcCCCCcccc-cccCcccccchHHHHHHHHHHHHHHH----------HHHHhcCcHHHHHHHHHHHHhhCceEEE Confidence 776 45564443222 3322 11 22333333333211 2222468888888888888888887754 Q ss_pred EeeccCCCCcCCCcccceEEEEEc-----------------h---------hhcCCC----CC-CC-C-CceEEEEEEEC Q lcl|NC_019524. 164 CEWLNPTGTTMQRRPFGTAIQMIS-----------------P---------YRMSNP----NN-VM-D-TPNLRSGVQLD 210 (556) Q Consensus 164 ~~~~~~~~~~~~~~~~~l~lq~ie-----------------~---------drl~~~----~~-~~-~-g~~i~~GIE~d 210 (556) +.....+.. .-..+|+.==+|+ . +.|+.. ++ .+ + .-.|++=|+.. T Consensus 141 ~~~d~~~~~--rf~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr 218 (555) T protein:vir:98 141 VLPDFDAVV--YHHSLTAGEYAIAADNQGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHAIEPR 218 (555) T ss_pred EecCCCceE--EEEEeecceeEEeeCCCCCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeec Confidence 322111100 0001111111111 1 011100 00 01 1 12345555532 Q ss_pred CCC---------CeEE-EEEeecCCCccccCCccccceeeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHH Q lcl|NC_019524. 211 NNG---------AALG-YWLRKAFPGDPTDMEQWKWGYEPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRN 280 (556) Q Consensus 211 ~~G---------r~va-Y~i~~~hpgd~~~~~~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~ 280 (556) ..+ .|.+ ||+...-.+......+ .+...| .+..--...+|..-|.++..-+|-.++.|+. 
T Consensus 219 ~~~~~~~~~~~~~p~~s~~~~~~~d~~~vl~es-gy~e~P---------~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~ 288 (555) T protein:vir:98 219 ADRDPSKRDDRNMAWKSVYFEPGADETRTLRES-GYRSFR---------ALCPRWALVGGDIYGNSPAMEALGDVRQLQH 288 (555) T ss_pred cCcCcCCCCccccceEEEEEEeccCCccccccC-CcccCC---------ceeeeeeecCCCccccchHHHHHHHHHHHHH Confidence 222 3333 4444322111111000 010001 1222234558999999999999999999999 Q ss_pred HHHHHHHHHHHhcceeeeEeccCcccccccccccccccccccccccccccccccccccceecCCceeeecCCCceeeeec Q lcl|NC_019524. 281 FQEITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQP 360 (556) Q Consensus 281 ~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~ 360 (556) +..+.+..+.+++.....+-... ......+.||.+....+|..-..+. T Consensus 289 l~~~~l~~~~~~~~pp~~v~~~~--------------------------------~~~~~~~~pgg~~~v~~g~~~d~~~ 336 (555) T protein:vir:98 289 EQLRKAQAIDYKSNPPLQLPVSA--------------------------------KNQDISTVPGGLSYVDAAAPNGGIR 336 (555) T ss_pred HHHHHHHHHHHHhcCceeecccc--------------------------------ccccceeccccccccccCCCCccee Confidence 99999999988776544433211 0112345677666555433322222 Q ss_pred CC-CCCccHHHH---HHHHHHHHHHhcCCCHHHhh--chhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 361 AG-TPGGVGTDY---EQSLLRNIAASLGMSYEQFS--RDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLE 434 (556) Q Consensus 361 ~~-~p~~~f~~F---~~~~lr~iaaglGi~ye~l~--~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~ 434 (556) |. .++.+|... ...+...|..++=.++.+.. .|-+.++-.=++.-..|....+-..=..|...|+.|+.++-+. 
T Consensus 337 ~~~~~~~d~~~~~~~i~~~~~rI~~af~~dlf~~l~~~~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~ 416 (555) T protein:vir:98 337 TAFEVNLDLSHLLADIVDVRERIKASFYADLFLMLANGTNPQMTATEVAERHEEKLLMLGPVLERMHNEILDPLIELTFQ 416 (555) T ss_pred cccccccchHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHH Confidence 22 233455333 34444555555433322111 3555555555555555555555555556778899999999999 Q ss_pred HHHHcCCccCCCCcccccccchhhHHHhhCeeeecCcccccchhhhhHHHHHHHHcCCCCHHHHHHH-hCCCHHHHHHHH Q lcl|NC_019524. 435 EEVNAGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQIDEKKETEAAILRIKNGLSTYEAEISR-LGGDFREVFKQR 513 (556) Q Consensus 435 ~a~l~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~s~~~~~ae-~G~D~e~v~~q~ 513 (556) .+...|.||-|+..-. ..-+++ .+|-|+--++.... +. ++...-+.+.. .+.+|+- ++.+ T Consensus 417 il~r~g~lP~~P~~l~---------~~~i~v-------~yis~La~aq~~~~-~~-~i~~~l~~i~~laq~~P~v-ld~i 477 (555) T protein:vir:98 417 RMVEANILPPPPQEMQ---------GVDLNV-------EFVSMLAQAQRAIA-TN-SVDRFVGNLGAVAGIKPEV-LDKF 477 (555) T ss_pred HHHhcCCCCCCchhhc---------CceeEE-------EeccHHHHHHHHHH-HH-HHHHHHHHHHHHhcCChhh-hhcC Confidence 9999999986642100 000112 24555533332211 11 11111111211 2344432 2222 Q ss_pred HHH---HHHHHHcCCCCCcccccc---------------------CCCCCCCCCCCCCCCCCCcCCC Q lcl|NC_019524. 514 ARE---EGLIKSLKLDFTGKMVEG---------------------NSTQSSNSSESTSDNPNEETTQ 556 (556) Q Consensus 514 a~E---~~~~~~~Gl~~~~~~~~~---------------------~~~~~~~~~~~~~~~~~~e~~~ 556 (556) .-+ ...++.+|++........ ...+......-.+-++..++.. 
T Consensus 478 d~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~~~a~~~~q~~~~~~~~~~~~~~~~~~~ 544 (555) T protein:vir:98 478 DADRWADTYADMLGIDPELIVPGNQVALIRKQRADQQQAAQQAALLNQGADTAAKLGSVDTSKQNAL 544 (555) T ss_pred CHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccCcchhH Confidence 222 122234555431110000 0000000000001111111111 No 258 >protein:vir:6322 Length: 510 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877469;genbank:gi:33300841;uniprot:Q7Y2D5;genbank:GeneID:1482611 Probab=34.63 E-value=1.3 Score=20.00 Aligned_cols=444 Identities=11% Similarity=0.019 Sum_probs=174.7 Q ss_pred HHHHhhHhhcccchhhhhhhhcchhccccCCCccccccc-CCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHh Q lcl|NC_019524. 10 TRAKKAVDVVAETATATPMAVGGGMEGAERTTREMFQWN-PSIISPDQQIAQNQDMASARAQDMVQNDGYAAGVVAVHRD 88 (556) Q Consensus 10 ~~a~~a~~~~~~~~~~~~~~~~~~y~aa~~~~r~~~~w~-~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v~~~~~ 88 (556) ++. ++.++-. +.....|+... +..+.|. |.....+... ... ...|+ =++.+..+++++++ T Consensus 1 mk~-~~~~~~~-------~lkR~~~e~~w---~e~a~~tlP~~~~~~~~~--~~~---~~~~~---~dstg~~a~~~LAa 61 (510) T protein:vir:63 1 MKT-TAAMLWE-------KLRDGSVEQRA---IEFAKTTLPYLMVDPMSG--SRG---VVEHD---FQSAGALLVNNLAA 61 (510) T ss_pred Chh-HHHHHHH-------HHhccchHHHH---HHHHHhhccccCCCCCCc--ccc---ccCCC---ccchHHHHHHHHHH Confidence 111 0001100 00011121110 0111121 1111111110 000 01111 36778889998888 Q ss_pred hhccCCceeeeeccccccCCChhH-------------HHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhhe Q lcl|NC_019524. 89 SIVGSQYKLNAKPNTIVLGAPDGW-------------GEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFL 155 (556) Q Consensus 89 nvVG~Gi~~~~~~~~~~lg~~~~~-------------~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~ 155 (556) -+.+ ||+|..++ +-.|..+++. .+.|-+.+++.-. 
++=-+.+||.-...++...+ T Consensus 62 ~l~~-~ltpp~~~-WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~----------~~l~~snf~~~~~~~~~~Li 129 (510) T protein:vir:63 62 KLAR-SLFPTGIP-FFRSELTDAIRREADSRDTDITEVTAALARVDRKAT----------QRLFQNASLAVLTQVIKLLI 129 (510) T ss_pred HHHh-hhcCCCCc-ccccCCChHHhhcccccchhHHHHHHHHHHHHHHHH----------HHHHhcCcHHHHHHHHHHHH Confidence 8876 46665443 3334444332 2333333433221 12224588888888888888 Q ss_pred ecCceEEEEeeccCCCCcCCCcccceEEEEEchh-----------------hcCCC------C-----CCCCCceEEEEE Q lcl|NC_019524. 156 MTGEVLATCEWLNPTGTTMQRRPFGTAIQMISPY-----------------RMSNP------N-----NVMDTPNLRSGV 207 (556) Q Consensus 156 ~dGE~f~~~~~~~~~~~~~~~~~~~l~lq~ie~d-----------------rl~~~------~-----~~~~g~~i~~GI 207 (556) .-|-+++.. .+.+ .....||+.==++.-| .|.-. . +....-.|++-| T Consensus 130 ~~G~a~l~~---~~~~--~~~~~~pl~~y~v~~d~~G~vd~i~rr~~~t~~~l~e~~~~~~~~~~~~~~~~~~v~v~~~V 204 (510) T protein:vir:63 130 VTGNALLYR---DSDA--ATVVAWSLRSYAVRRDATGRWMDIVLKQRYKSKDLDEEYKQDLMRAGRNLSGSGSVDLYTHV 204 (510) T ss_pred hhCeEEEEE---cCCC--cEEEEEEcceeEEeeCCCcCeeEEEeeeeccHHHHhHHhhhhhhccccccCCCcceEEEEEE Confidence 888765443 1111 0111222211111111 00000 0 000112356666 Q ss_pred EECCC-CCeEE-EEEe-e-cCCCccccCCccccceeeccccCChhHeEeeecccCCCcccCCchhhHHHHHHHHHHHHHH Q lcl|NC_019524. 208 QLDNN-GAALG-YWLR-K-AFPGDPTDMEQWKWGYEPARFDWGRRRVIHIIEALLAGQTRGISEMVSALKQMKMTRNFQE 283 (556) Q Consensus 208 E~d~~-Gr~va-Y~i~-~-~hpgd~~~~~~~~~~rv~~~~~v~a~~viH~f~~~r~gQ~RGvs~la~~l~~l~~l~~~~d 283 (556) +-+.. +.|.+ ||+- . .|++. .+..+... . + .+-.--..-+|-.-|.++-.-+|-.+|.|+.+.. 
T Consensus 205 ~~~~~~~~~~~sv~~e~dg~~~~~----~~~~~~~e-----~--P-~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~ 272 (510) T protein:vir:63 205 QRKKGTAMEYAELYHEIDGVRVGK----EGRWPIHL-----C--P-YIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSE 272 (510) T ss_pred EeecCCCceEEEEEEEecCceecc----cccccccc-----C--c-eeeeeeeecCCCccccchHHHHHHHHHHHHHHHH Confidence 65432 23322 2221 1 11110 00000000 0 0 1111123458899999999999999999999999 Q ss_pred HHHHHHHHhcceeeeEeccCcccccccccccccccccccccccccccccccccccceecCCceeeecCCCceeeeecCCC Q lcl|NC_019524. 284 ITLQNAVVNATYAASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQPAGT 363 (556) Q Consensus 284 ael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~ 363 (556) +.+..+..++.....+. +.+...+. .....++|.|+.-.+ ++|++++.. T Consensus 273 ~~l~~a~~a~~~~~lv~-p~g~~~~~----------------------------~~~~~~~g~~v~g~~-~~v~~~~~~- 321 (510) T protein:vir:63 273 KLGLYELESLEVLNLVD-EAKGAVVD----------------------------DYQDAEMGDYVPGGA-EAVRAYERG- 321 (510) T ss_pred HHHHHHHHhccCCcccC-cccccchh----------------------------hhccCCCceeecCCc-ccceeeecC- Confidence 99999888876654433 22111000 000011222321111 334444322 Q ss_pred CCccHH---HHHHHHHHHHHHhcCCCHHHhhchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcC Q lcl|NC_019524. 364 PGGVGT---DYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSARASMAETQKYMDSRKKLVADRFASAIYTLWLEEEVNAG 440 (556) Q Consensus 364 p~~~f~---~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~lv~~~~~pi~~~~l~~a~l~G 440 (556) +.++|. 
.....+...|..++= +.....|-.+++-.=++.-..|....+-..=..|...|+.|+.++.+..+.-.| T Consensus 322 ~~~d~~~~~~~i~~~~~rI~~af~--~~l~~~~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g 399 (510) T protein:vir:63 322 DYNKMAAIQQSLQAVVVRLNQAFM--YGANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDAL 399 (510) T ss_pred cccchHHHHHHHHHHHHHHHHHHH--hhcccCCCCCcCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 234444 444455555555531 111223434444444444444444444444456778889999999998888888 Q ss_pred CccCCCCccccc---ccchhhHHH-------hhC-eeeecCcccccchhhhhHHHHHHHHcCCC-CHHHHHHHhCCCHHH Q lcl|NC_019524. 441 NVPLPPGKNWRM---FYDPMMRDA-------LCN-AEWIGASRGQIDEKKETEAAILRIKNGLS-TYEAEISRLGGDFRE 508 (556) Q Consensus 441 ~l~~p~~~~~~~---~~~~~~~~a-------~~~-~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~-s~~~~~ae~G~D~e~ 508 (556) .+++|...--.. +-.+..+.. +.. +.-+++ ...++|.=+....+..+...+. ++..++ ++.++ T Consensus 400 l~p~p~~~~~~~~v~~is~Laraq~~~~l~~~~q~l~~~~~-~aq~~~~id~d~~~~~~a~~~Gv~p~~iv----rs~ee 474 (510) T protein:vir:63 400 LQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAP-IAQLDPRISLPKMMDTIWAAFSVDTSQFY----KSADE 474 (510) T ss_pred CCCCCchhcccceecchhHHHHHHHHHHHHHHHHHHHHhcC-chhhhccCCHHHHHHHHHHHhCCChhHhc----CCHHH Confidence 887776532110 111111110 000 111111 3334444333333333332222 122222 11222 Q ss_pred HHHHHHHH-HHHHHH----cCCCCCccccccCCCCC Q lcl|NC_019524. 509 VFKQRARE-EGLIKS----LKLDFTGKMVEGNSTQS 539 (556) Q Consensus 509 v~~q~a~E-~~~~~~----~Gl~~~~~~~~~~~~~~ 539 (556) +-+.++.. .+.+.. 
..|.........+.++- T Consensus 475 v~a~~~~~~qq~~~~~~~~~~~~~~a~~~~~~~~g~ 510 (510) T protein:vir:63 475 LQAEAEQQRQQAAQAQAAQETLLEGASDMTNALAGV 510 (510) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCC Confidence 22211110 000000 00100000001111111 No 259 >protein:vir:95149 Length: 501 # NCBI annotation: hypothetical protein ORF007 # Family: family:all:584 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293414;genbank:gi:148912835;genbank:GeneID:5228224 Probab=26.00 E-value=2 Score=18.94 Aligned_cols=443 Identities=11% Similarity=0.028 Sum_probs=171.8 Q ss_pred HHHHhhHhhcccchhhh--hhhhcchhccccCCCcc-cccccCCCC--CHHHHHHHHHHHHHHHHHHHHhcChHHHHHHH Q lcl|NC_019524. 10 TRAKKAVDVVAETATAT--PMAVGGGMEGAERTTRE-MFQWNPSII--SPDQQIAQNQDMASARAQDMVQNDGYAAGVVA 84 (556) Q Consensus 10 ~~a~~a~~~~~~~~~~~--~~~~~~~y~aa~~~~r~-~~~w~~~~~--s~~~~i~~~~~~lr~RaRdl~rNn~~a~~~v~ 84 (556) +. .+....+..... .....+.--++...=|. -..+-|... .-..+.......-.+| .+--.++. T Consensus 1 m~---~V~~~hp~y~~~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~e~~~~e~~~~Y~~rl~r--------A~~~n~~~ 69 (501) T protein:vir:95 1 MP---NVSFIRPELGKLLPLYYLIRDAIAGEPTVKGARTTYLPMPNAEDQSKENKARYEAYLKR--------AVFYNVAR 69 (501) T ss_pred CC---CCCCCCHHHHHHHHHHHHHHHHhcChHHHHhcccccCcCCCCCCCcccchHHHHHHhhc--------cccCchHH Confidence 00 011111111111 11111111111111111 012333211 1011111111111111 12223444 Q ss_pred HHHhhhccCCceeeeeccccccCCChhHHHHHHHHHHHHHHHHhcccccceehhcccCHHHHHHHHhhhheecCceEEEE Q lcl|NC_019524. 85 VHRDSIVGSQYKLNAKPNTIVLGAPDGWGEEFQEVVEARFNMAAESPENWFDARRMCTLTGLTRLAVSGFLMTGEVLATC 164 (556) Q Consensus 85 ~~~~nvVG~Gi~~~~~~~~~~lg~~~~~~~~~~~~ie~~~~~w~~~~~~~cD~~g~~~f~~lq~l~~r~~~~dGE~f~~~ 164 (556) ..++..||.-|+--+..+ +-..++. |+++ ||-.|. +++++.+.+++..+..|=|+++. 
T Consensus 70 ~t~~~l~G~vf~k~p~~~-------------~p~~l~~----l~~d----~D~~G~-~L~~f~~~~~~~~l~~G~~~ilV 127 (501) T protein:vir:95 70 RTLFGLVGQVFMRDPVVK-------------VPALLNP----LVAN----ATGSGI-NLTQLAKRAVSLNLAYSRAGLLV 127 (501) T ss_pred HHHHHHhhhhhcCCccee-------------CcHHHHH----HHhc----cCCCCC-CHHHHHHHHHHHHHhcCeEEEEE Confidence 455666665444323221 1233333 4433 899998 99999999999999999999887 Q ss_pred eeccCCCC--cC-----CCcccceEEEEEchhhcCC-CCCCCCCc------eEEE------------------EEEECCC Q lcl|NC_019524. 165 EWLNPTGT--TM-----QRRPFGTAIQMISPYRMSN-PNNVMDTP------NLRS------------------GVQLDNN 212 (556) Q Consensus 165 ~~~~~~~~--~~-----~~~~~~l~lq~ie~drl~~-~~~~~~g~------~i~~------------------GIE~d~~ 212 (556) -+...... .. .+...|+ +.+|.|+.|-+ .....+|. .|+. -++.|+. T Consensus 128 D~P~~~~~~~~t~a~~~~~~~rPy-~~~~~~~~IinW~~~~v~g~~~l~~v~l~E~~~~~d~~f~~~~~~q~RvL~~~~~ 206 (501) T protein:vir:95 128 DYPTTEAEGGASIADLEAGRIRPT-LYVYSPTEIINWRTTDRGAEEVLSLVVLFETWCAADDGFEMKTSGQFRVLRLDEE 206 (501) T ss_pred eecCCCCcccccHHHHHhccCCcE-EEEecHhhhcCcceeccCCceeeeEEEEEEEEeecCCCcccceeEEEEEEeeCCC Confidence 65321110 00 0000132 77888877742 11112221 1221 1122222 Q ss_pred CCeEEEEEeecCCCc---------cccCCccccceeeccccCChhHeEee-ecccCCCcccCCchhhHHHHHHHHHHHHH Q lcl|NC_019524. 213 GAALGYWLRKAFPGD---------PTDMEQWKWGYEPARFDWGRRRVIHI-IEALLAGQTRGISEMVSALKQMKMTRNFQ 282 (556) Q Consensus 213 Gr~vaY~i~~~hpgd---------~~~~~~~~~~rv~~~~~v~a~~viH~-f~~~r~gQ~RGvs~la~~l~~l~~l~~~~ 282 (556) |. ..|.|+...... .......+|.-+. ....+-..|=.+ +...+-+=.-|.|+|..+... .+.-|. 
T Consensus 207 g~-~~~~v~r~~~~~~~~~~~~~~~~~~~~~~~~~~~-~g~~~l~~IPfv~~~~~~~~~~~~~pPLl~lA~l--ni~hy~ 282 (501) T protein:vir:95 207 GY-YVHEIWREPQPTKADGSKIPKGNYQQYVVYKPTD-AQGKRLTEIPFMFIGSENNDSNPDNPNFYDLASL--NMAHYR 282 (501) T ss_pred ce-EEEEEEEecCCcccCcceecCCcccccceeeeec-cCCCcCCeeeEEEEecCCCCCCCCccchHHHHHH--HHHHHh Confidence 22 112222221100 0000000111000 000111111111 233444555567777754321 333344 Q ss_pred H-HHHHHHHHhccee-eeEeccCcccccccccccccccccccccccccccccccccccceecCCceeeecCCCceeeeec Q lcl|NC_019524. 283 E-ITLQNAVVNATYA-ASVESELPSDVVFGQLGMGQGGFKEIFNEYMTGLANYVAQTKNIAIDGAKIPHLYPGTKLKMQP 360 (556) Q Consensus 283 d-ael~~a~i~A~~~-~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~ 360 (556) . |.+....--+++. .+|+ ...++. ........+.+++..+..|++|-+++++. T Consensus 283 ~ssd~~~~l~~~~~P~l~i~-G~~~~~------------------------~~~~~~~~i~~G~~~~~~lP~~~~~~~ie 337 (501) T protein:vir:95 283 NSADYEESCYIVGQPTPVLI-GLTEEW------------------------VTNVLKGSVNFGSRGGIPLPVGADAKLLQ 337 (501) T ss_pred hhhHHHHHHHHcccceeeee-CCcccc------------------------cccCCCCceeecccccccCCCCCceeEEe Confidence 3 3344444444444 4444 211110 00111234678899999999999999999 Q ss_pred CCCCCccHHHHHHHHHHHHHHhcCCCHHHhhchhhcccchhHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHH Q lcl|NC_019524. 361 AGTPGGVGTDYEQSLLRNIAASLGMSYEQFSRDYTKTNYSSARASMAETQKYMDSRKKL--VADRFASAIYTLWLEEEVN 438 (556) Q Consensus 361 ~~~p~~~f~~F~~~~lr~iaaglGi~ye~l~~D~s~~nYSs~R~~~~e~~r~~~~~q~~--lv~~~~~pi~~~~l~~a~l 438 (556) ++ +++=...-++....++.. +|- ..+.... .| .|+=+.-+++.+....++.. -++..++-+++++- .. 
T Consensus 338 ~~-~~~i~~~~l~~l~~~m~~-~Ga--~ll~~~~--~~-~Ta~~~~~~~~~~~S~L~~~a~~le~al~~~l~~~a---~w 407 (501) T protein:vir:95 338 AS-ENTMLKEAMDTKERQMVA-LGA--KLVEQKE--VQ-RTATEAELEAASEGSTLSSATKNVSAAFEWALKWAA---RW 407 (501) T ss_pred cC-hhhHHHHHHHHHHHHHHH-HHH--hhccCCc--cc-hhHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHH---HH Confidence 75 333223333333333221 231 2232221 12 34444445555544444432 23344444454322 22 Q ss_pred cCCccCCCCcccccccchhhHHHhhCeeeecCccccc-chhhhhHHHHHHHHcCCCCHHHHH---HHhCC---CHHHHHH Q lcl|NC_019524. 439 AGNVPLPPGKNWRMFYDPMMRDALCNAEWIGASRGQI-DEKKETEAAILRIKNGLSTYEAEI---SRLGG---DFREVFK 511 (556) Q Consensus 439 ~G~l~~p~~~~~~~~~~~~~~~a~~~~~w~~p~~~~i-DP~Ke~~A~~~~i~~G~~s~~~~~---ae~G~---D~e~v~~ 511 (556) .|.. +....+. +.+-.... ....+++|..+++..|..|.+... ..+|. |+++..+ T Consensus 408 ~g~~--~~~~~v~----------------i~~df~~~~~~~~~~~al~~~~~~G~is~~t~~~~L~~~~v~~~~~~~e~e 469 (501) T protein:vir:95 408 VGQA--DSGVKFE----------------LNTDFDIARMTPDERRSLVEEWQKGAITFEEMRTGLRKAGVATEDDSKAKE 469 (501) T ss_pred cCCC--CCceEEE----------------EecccccccCCHHHHHHHHHHHhCCCCcHHHHHHHHHhCCCCChhHHHHHH Confidence 2321 1111110 01111111 134579999999999999999874 44453 3333344 Q ss_pred HHHHHHHHHHHcCCCCCccccccCCCCCCCCCCCCCCCCCCc Q lcl|NC_019524. 512 QRAREEGLIKSLKLDFTGKMVEGNSTQSSNSSESTSDNPNEE 553 (556) Q Consensus 512 q~a~E~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~e 553 (556) +++.|.+-.+ ... .......+...++ +-.+.| T Consensus 470 ~i~~~~~~~~-------~~~--~~~~~~~~~~gg~-~~~~~~ 501 (501) T protein:vir:95 470 KIAKDTAEAM-------ALA--TPANVPGDGSGGD-NVGNSE 501 (501) T ss_pred HHHhhhcCcc-------ccc--ccCCCCCCCcccc-cccCCC Confidence 4444322110 000 0000111111111 111222 Done!