Query lcl|NC_015158.1_cdsid_YP_004251190.1 [gene=19] [protein=hypothetical protein] [protein_id=YP_004251190.1] [location=complement(16994..18739)] Match_columns 581 No_of_seqs 66 out of 73 Neff 7.3 Searched_HMMs 1612 Date Thu Nov 7 13:26:33 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_19 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_19_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:95449 Length: 584 100.0 1E-175 9E-179 979.2 50.2 578 1-581 1-581 (584) 2 protein:vir:3139 Length: 599 # 100.0 6E-170 3E-173 948.5 46.1 578 1-581 1-590 (599) 3 protein:vir:80165 Length: 651 100.0 2E-110 1E-113 621.9 52.7 566 1-581 3-621 (651) 4 protein:vir:94599 Length: 641 100.0 4E-101 2E-104 571.1 46.9 554 1-581 5-598 (641) 5 protein:vir:95821 Length: 763 100.0 3.4E-94 2.1E-97 533.0 52.3 553 1-581 1-652 (763) 6 protein:vir:8846 Length: 705 # 100.0 1E-94 6.3E-98 535.9 49.3 550 1-581 1-624 (705) 7 protein:vir:93630 Length: 776 100.0 4.6E-61 2.9E-64 351.4 43.2 546 1-581 22-669 (776) 8 protein:vir:108295 Length: 711 100.0 1E-53 6.2E-57 311.2 46.9 533 1-581 1-646 (711) 9 protein:vir:7321 Length: 556 # 100.0 6.6E-50 4.1E-53 290.2 43.6 498 11-581 1-533 (556) 10 protein:vir:102668 Length: 547 100.0 1.8E-49 1.1E-52 287.8 44.4 496 12-581 1-533 (547) 11 protein:vir:104437 Length: 714 100.0 3E-49 1.8E-52 286.6 42.9 542 1-581 1-664 (714) 12 protein:vir:95315 Length: 559 100.0 1.2E-48 7.1E-52 283.4 43.8 498 11-581 1-533 (559) 13 protein:vir:817 Length: 714 # 100.0 1.1E-47 7E-51 278.0 44.8 538 1-581 8-641 (714) 14 protein:vir:10117 Length: 714 100.0 1.1E-47 7E-51 278.0 44.8 538 1-581 8-641 (714) 15 protein:vir:9950 Length: 714 # 100.0 1.1E-47 7E-51 278.0 44.8 538 1-581 8-641 (714) 16 protein:vir:2764 Length: 714 # 100.0 1.1E-47 7E-51 278.0 44.8 538 1-581 8-641 (714) 17 protein:vir:3296 Length: 714 # 100.0 1.1E-47 7E-51 278.0 44.8 538 1-581 8-641 (714) 18 protein:vir:3361 Length: 535 # 100.0 9.1E-47 5.6E-50 273.0 44.3 484 11-581 1-508 (535) 19 protein:vir:1538 Length: 535 # 100.0 4.1E-46 2.6E-49 269.4 44.3 488 11-581 1-512 (535) 20 protein:vir:1785 Length: 555 # 100.0 7E-46 4.4E-49 268.1 44.1 494 19-581 1-532 (555) 21 protein:vir:8883 Length: 543 # 100.0 1.1E-45 6.5E-49 267.2 44.1 492 11-581 1-516 (543) 22 protein:vir:10447 Length: 536 100.0 1.2E-45 7.3E-49 266.9 43.8 489 11-581 1-517 (536) 23 protein:vir:2198 Length: 536 # 100.0 1.3E-45 8.2E-49 266.6 43.8 489 11-581 1-517 (536) 24 protein:vir:94709 Length: 522 100.0 1.2E-45 7.5E-49 266.8 43.5 485 11-581 1-511 (522) 25 protein:vir:98506 Length: 555 100.0 1.1E-45 6.9E-49 267.0 41.8 499 11-581 1-533 (555) 26 protein:vir:107404 Length: 555 100.0 1.1E-45 6.9E-49 267.0 41.8 499 11-581 1-533 (555) 27 protein:vir:107822 Length: 555 100.0 1.1E-45 6.9E-49 267.0 41.8 499 11-581 1-533 (555) 28 protein:vir:94572 Length: 535 100.0 1.7E-44 1.1E-47 260.6 45.2 487 1-581 1-512 (535) 29 protein:vir:105619 Length: 772 100.0 4.9E-44 3E-47 258.0 44.3 547 1-581 1-644 (772) 30 protein:vir:103765 Length: 549 100.0 1.9E-42 1.1E-45 249.4 44.9 497 11-581 1-537 (549) 31 protein:vir:77597 Length: 725 100.0 3.3E-42 2.1E-45 248.0 43.4 521 11-581 1-630 (725) 32 protein:vir:78696 Length: 542 100.0 4.1E-42 2.5E-45 247.5 43.2 478 19-581 1-513 (542) 33 protein:vir:99672 Length: 532 100.0 6.2E-42 3.8E-45 246.5 43.0 489 11-581 1-515 (532) 34 protein:vir:100920 Length: 725 100.0 6E-42 3.7E-45 246.6 41.4 521 11-581 1-637 (725) 35 protein:vir:105520 Length: 706 100.0 5.9E-42 3.7E-45 246.6 41.0 531 11-581 1-620 (706) 36 protein:vir:9263 Length: 725 # 100.0 1.1E-41 6.9E-45 245.1 40.2 520 11-581 1-630 (725) 37 protein:vir:172 Length: 708 # 100.0 3.7E-41 2.3E-44 242.2 40.4 524 11-581 1-628 (708) 38 protein:vir:100039 Length: 522 100.0 1.7E-40 1.1E-43 238.6 42.8 475 21-581 1-503 (522) 39 protein:vir:105429 Length: 708 100.0 2.6E-40 1.6E-43 237.6 39.4 521 11-581 1-628 (708) 40 protein:vir:78942 Length: 510 100.0 1.9E-39 1.2E-42 232.9 43.5 469 19-581 1-497 (510) 41 protein:vir:6322 Length: 510 # 100.0 4.2E-39 2.6E-42 231.0 43.3 469 19-581 1-497 (510) 42 protein:vir:103330 Length: 517 100.0 9.5E-38 5.9E-41 223.5 43.7 477 12-581 1-511 (517) 43 protein:vir:96988 Length: 516 100.0 2.2E-38 1.3E-41 227.1 39.3 480 1-581 1-508 (516) 44 protein:vir:3520 Length: 720 # 100.0 3.3E-37 2E-40 220.6 42.4 518 11-581 1-626 (720) 45 protein:vir:7017 Length: 515 # 100.0 2E-36 1.2E-39 216.3 42.7 479 1-581 1-504 (515) 46 protein:vir:80211 Length: 514 100.0 3.1E-36 1.9E-39 215.2 43.3 476 19-581 1-512 (514) 47 protein:vir:105641 Length: 516 100.0 3.7E-36 2.3E-39 214.8 42.3 480 1-581 1-509 (516) 48 protein:vir:345 Length: 663 # 99.9 2.5E-23 1.6E-26 144.5 34.2 517 1-581 1-616 (663) 49 protein:vir:93747 Length: 472 99.8 3.6E-16 2.3E-19 105.2 38.6 430 1-581 5-453 (472) 50 protein:vir:2732 Length: 501 # 99.7 1.6E-15 1E-18 101.7 40.7 436 1-581 1-478 (501) 51 protein:vir:2341 Length: 488 # 99.7 8.1E-16 5E-19 103.3 38.9 443 11-581 1-472 (488) 52 protein:vir:96494 Length: 501 99.7 5.9E-16 3.6E-19 104.1 37.5 432 1-581 11-474 (501) 53 protein:vir:4898 Length: 502 # 99.7 1.8E-15 1.1E-18 101.4 39.9 434 1-581 1-478 (502) 54 protein:vir:101494 Length: 527 99.7 3.7E-17 2.3E-20 110.7 30.0 480 1-581 1-504 (527) 55 protein:vir:1236 Length: 483 # 99.7 1.1E-15 6.9E-19 102.6 38.0 432 1-581 15-466 (483) 56 protein:vir:102239 Length: 527 99.7 4.7E-17 2.9E-20 110.1 30.0 480 1-581 1-504 (527) 57 protein:vir:3964 Length: 453 # 99.7 4.2E-15 2.6E-18 99.4 40.0 424 3-581 1-441 (453) 58 protein:vir:95806 Length: 440 99.7 1.6E-15 1E-18 101.7 36.9 406 11-581 1-432 (440) 59 protein:vir:106639 Length: 481 99.7 6.9E-15 4.3E-18 98.2 39.7 426 1-581 16-466 (481) 60 protein:vir:80680 Length: 441 99.7 2.9E-15 1.8E-18 100.3 37.1 423 12-580 1-441 (441) 61 protein:vir:104082 Length: 485 99.7 5E-15 3.1E-18 99.0 38.2 437 1-581 1-463 (485) 62 protein:vir:5961 Length: 503 # 99.7 5.4E-15 3.4E-18 98.8 38.3 448 1-581 1-475 (503) 63 protein:vir:97336 Length: 492 99.7 5.3E-15 3.3E-18 98.8 38.2 432 1-581 24-475 (492) 64 protein:vir:96240 Length: 511 99.7 1.2E-14 7.7E-18 96.8 39.6 435 1-581 31-489 (511) 65 protein:vir:733 Length: 453 # 99.7 1.4E-14 8.8E-18 96.5 39.8 423 4-581 1-444 (453) 66 protein:vir:103951 Length: 511 99.7 1.3E-14 8E-18 96.7 39.6 437 1-581 13-489 (511) 67 protein:vir:94805 Length: 492 99.7 8.3E-15 5.1E-18 97.8 38.3 429 1-581 24-472 (492) 68 protein:vir:97171 Length: 512 99.7 2.2E-14 1.4E-17 95.4 40.6 439 1-581 24-490 (512) 69 protein:vir:3609 Length: 452 # 99.7 2E-14 1.2E-17 95.7 40.1 424 1-581 1-440 (452) 70 protein:vir:105292 Length: 478 99.7 5.5E-15 3.4E-18 98.7 37.1 431 1-581 1-461 (478) 71 protein:vir:96179 Length: 468 99.7 7E-15 4.3E-18 98.2 37.6 434 1-581 1-460 (468) 72 protein:vir:4223 Length: 486 # 99.7 8.2E-15 5.1E-18 97.8 37.5 432 8-581 1-469 (486) 73 protein:vir:2427 Length: 485 # 99.7 1.6E-14 9.7E-18 96.3 38.1 437 8-581 1-463 (485) 74 protein:vir:79043 Length: 479 99.7 2.1E-15 1.3E-18 101.0 33.2 446 1-581 7-471 (479) 75 protein:vir:78537 Length: 480 99.7 1.1E-14 6.6E-18 97.2 37.0 431 1-581 1-460 (480) 76 protein:vir:78227 Length: 480 99.7 9.7E-15 6E-18 97.4 36.6 428 1-581 1-448 (480) 77 protein:vir:94101 Length: 474 99.7 1.3E-15 7.9E-19 102.2 31.4 437 6-581 1-459 (474) 78 protein:vir:105889 Length: 474 99.7 1.3E-15 7.9E-19 102.2 31.4 437 6-581 1-459 (474) 79 protein:vir:106571 Length: 499 99.7 3.2E-14 2E-17 94.6 39.0 450 1-581 1-471 (499) 80 protein:vir:9871 Length: 429 # 99.7 5.6E-14 3.5E-17 93.2 40.3 408 11-581 1-420 (429) 81 protein:vir:99781 Length: 511 99.7 4E-14 2.5E-17 94.0 38.0 436 1-581 13-488 (511) 82 protein:vir:7430 Length: 563 # 99.7 3.7E-15 2.3E-18 99.7 32.2 483 11-581 1-522 (563) 83 protein:vir:97447 Length: 474 99.7 3.3E-14 2.1E-17 94.5 37.2 427 1-581 13-454 (474) 84 protein:vir:94498 Length: 474 99.7 3.3E-14 2.1E-17 94.5 37.2 427 1-581 13-454 (474) 85 protein:vir:96366 Length: 511 99.7 7.1E-14 4.4E-17 92.7 38.7 437 1-581 13-489 (511) 86 protein:vir:78805 Length: 511 99.7 7.1E-14 4.4E-17 92.7 38.7 437 1-581 13-489 (511) 87 protein:vir:99522 Length: 470 99.6 9.5E-14 5.9E-17 92.0 39.3 432 1-581 4-458 (470) 88 protein:vir:1587 Length: 508 # 99.6 2.1E-14 1.3E-17 95.5 35.7 460 1-581 3-499 (508) 89 protein:vir:107112 Length: 478 99.6 5.9E-14 3.6E-17 93.1 36.8 432 1-581 8-461 (478) 90 protein:vir:7768 Length: 484 # 99.6 7.9E-14 4.9E-17 92.4 36.9 437 1-581 1-458 (484) 91 protein:vir:38 Length: 496 # N 99.6 2E-14 1.3E-17 95.6 33.7 452 1-581 1-486 (496) 92 protein:vir:9815 Length: 500 # 99.6 2.8E-14 1.7E-17 94.9 34.0 451 1-581 3-498 (500) 93 protein:vir:3028 Length: 500 # 99.6 2.8E-14 1.7E-17 94.9 34.0 451 1-581 3-498 (500) 94 protein:vir:9306 Length: 511 # 99.6 2.5E-13 1.5E-16 89.7 39.9 433 1-581 31-489 (511) 95 protein:vir:2500 Length: 501 # 99.6 4.1E-14 2.5E-17 94.0 34.0 446 1-581 1-479 (501) 96 protein:vir:95113 Length: 474 99.6 1.6E-13 9.7E-17 90.8 36.6 427 1-581 7-454 (474) 97 protein:vir:98883 Length: 517 99.6 1.7E-13 1.1E-16 90.6 36.6 457 1-581 18-513 (517) 98 protein:vir:96266 Length: 474 99.6 7.5E-14 4.6E-17 92.5 34.5 430 1-581 1-454 (474) 99 protein:vir:95899 Length: 474 99.6 7.5E-14 4.6E-17 92.5 34.5 430 1-581 1-454 (474) 100 protein:vir:96839 Length: 474 99.6 4.1E-13 2.5E-16 88.5 38.4 433 1-581 1-459 (474) 101 protein:vir:99916 Length: 504 99.6 1.1E-13 7.1E-17 91.5 35.2 434 1-581 8-474 (504) 102 protein:vir:80959 Length: 499 99.6 1.7E-13 1E-16 90.6 36.0 447 1-581 1-489 (499) 103 protein:vir:94742 Length: 409 99.6 5.7E-14 3.5E-17 93.2 33.0 387 11-547 1-409 (409) 104 protein:vir:105461 Length: 470 99.6 4.1E-13 2.6E-16 88.5 37.6 430 12-581 1-461 (470) 105 protein:vir:102950 Length: 471 99.6 9.7E-14 6E-17 91.9 33.8 424 12-581 1-461 (471) 106 protein:vir:9922 Length: 489 # 99.6 6.9E-13 4.3E-16 87.3 38.4 438 1-581 1-475 (489) 107 protein:vir:102330 Length: 451 99.6 6E-13 3.7E-16 87.6 36.7 426 12-581 1-449 (451) 108 protein:vir:7987 Length: 456 # 99.6 4.4E-13 2.8E-16 88.3 35.1 429 11-581 1-452 (456) 109 protein:vir:9568 Length: 410 # 99.6 9.6E-14 6E-17 91.9 31.4 388 31-579 1-410 (410) 110 protein:vir:4782 Length: 522 # 99.6 5.6E-13 3.4E-16 87.8 35.1 461 1-581 18-504 (522) 111 protein:vir:94546 Length: 506 99.6 5.8E-13 3.6E-16 87.7 34.9 427 1-581 12-488 (506) 112 protein:vir:1634 Length: 409 # 99.5 4.3E-13 2.7E-16 88.4 33.5 388 11-547 1-409 (409) 113 protein:vir:102602 Length: 456 99.5 2.4E-12 1.5E-15 84.3 37.2 429 11-581 1-452 (456) 114 protein:vir:105819 Length: 456 99.5 2.4E-12 1.5E-15 84.3 37.2 429 11-581 1-452 (456) 115 protein:vir:79703 Length: 505 99.5 2.4E-12 1.5E-15 84.3 36.1 456 1-581 14-499 (505) 116 protein:vir:9751 Length: 422 # 99.5 1.1E-12 6.6E-16 86.2 33.8 403 11-578 1-422 (422) 117 protein:vir:8184 Length: 474 # 99.5 1.3E-12 8.2E-16 85.7 33.2 434 1-581 2-471 (474) 118 protein:vir:78907 Length: 518 99.5 2.1E-12 1.3E-15 84.6 33.9 471 1-581 4-513 (518) 119 protein:vir:78083 Length: 537 99.5 3.6E-12 2.2E-15 83.3 34.0 453 1-581 1-502 (537) 120 protein:vir:98444 Length: 434 99.4 6.5E-12 4E-15 81.9 32.0 396 47-581 1-416 (434) 121 protein:vir:99072 Length: 479 99.3 1.7E-10 1.1E-13 74.1 35.9 419 11-581 1-455 (479) 122 protein:vir:96403 Length: 666 99.2 1.8E-10 1.1E-13 74.0 25.9 565 1-581 3-660 (666) 123 protein:vir:103385 Length: 666 99.1 6.4E-10 3.9E-13 71.0 25.0 565 1-581 3-660 (666) 124 protein:vir:80453 Length: 535 98.6 1.8E-07 1.1E-10 57.6 25.6 468 1-581 1-512 (535) 125 protein:vir:94956 Length: 452 98.6 1.9E-07 1.2E-10 57.5 30.3 428 11-581 1-441 (452) 126 protein:vir:97265 Length: 513 98.4 1.1E-06 6.5E-10 53.4 27.9 454 1-581 1-479 (513) 127 protein:vir:95149 Length: 501 97.4 7.7E-05 4.8E-08 43.1 28.2 461 11-581 1-490 (501) 128 protein:vir:95014 Length: 491 97.0 0.00023 1.4E-07 40.5 26.3 455 1-574 1-491 (491) 129 protein:vir:78393 Length: 489 96.4 0.00065 4E-07 38.1 24.3 447 1-581 1-473 (489) 130 protein:vir:96783 Length: 488 93.2 0.0085 5.3E-06 31.9 25.6 453 1-580 1-488 (488) 131 protein:vir:98853 Length: 219 81.3 0.087 5.4E-05 26.4 15.0 194 259-489 1-219 (219) 132 protein:vir:78641 Length: 278 76.3 0.14 8.5E-05 25.3 16.9 255 185-497 1-278 (278) 133 protein:vir:80040 Length: 461 70.4 0.21 0.00013 24.3 28.3 428 11-578 1-461 (461) 134 protein:vir:80644 Length: 551 70.0 0.21 0.00013 24.2 22.8 417 1-581 1-498 (551) 135 protein:vir:107662 Length: 427 56.2 0.46 0.00029 22.4 19.1 388 33-581 1-419 (427) 136 protein:vir:99853 Length: 488 53.8 0.52 0.00032 22.1 30.1 394 14-581 1-427 (488) 137 protein:vir:104338 Length: 422 50.5 0.61 0.00038 21.8 20.4 388 11-581 1-417 (422) 138 protein:vir:63755 Length: 547 39.6 1 0.00063 20.6 24.5 428 1-581 7-507 (547) 139 protein:vir:108215 Length: 469 39.4 1 0.00063 20.5 22.4 413 1-581 1-467 (469) 140 protein:vir:80796 Length: 574 36.9 1.1 0.00071 20.3 25.6 431 1-581 27-502 (574) 141 protein:vir:96579 Length: 576 33.4 1.4 0.00084 19.9 22.2 434 1-581 1-512 (576) 142 protein:vir:4828 Length: 382 # 31.5 1.5 0.00093 19.6 16.0 350 139-560 1-382 (382) 143 protein:vir:77981 Length: 448 30.0 1.6 0.001 19.5 24.2 391 29-581 1-442 (448) 144 protein:vir:1326 Length: 457 # 24.0 2.2 0.0014 18.7 20.7 406 20-581 1-439 (457) 145 protein:vir:93943 Length: 409 21.8 2.5 0.0016 18.4 13.2 381 118-578 1-409 (409) 146 protein:vir:79647 Length: 435 20.9 2.7 0.0017 18.2 22.0 393 1-562 5-435 (435) No 1 >protein:vir:95449 Length: 584 # NCBI annotation: hypothetical protein ORF047 # Family: family:all:1548 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294640;genbank:gi:149408206;genbank:GeneID:5237016 Probab=100.00 E-value=1.4e-175 Score=979.15 Aligned_cols=578 Identities=56% Similarity=0.927 Sum_probs=549.1 Q ss_pred CccchhhhhhhccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhcccccccccccccccccccccchHHHHHHHH Q lcl|NC_015158. 1 MTGKVLELQQMLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTTTTTNSTLPWKNKTTLPKLCQIRDNLH 80 (581) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~~~~~~~~~~k~~~~~pki~~~~d~~~ 80 (581) ||||++++++|++ +|++|++|+++|++|+++||+|+++|+|++||++++++++++++++|||||+|+|||++++++|| T Consensus 1 ~~~~~~~~~~~~~--~~~~~~~v~~~~~~~~~~r~~~~~~w~el~~y~~a~~~~~~~~~~~~~r~~~~~~k~~~~~~~i~ 78 (584) T protein:vir:95 1 MSVKVAELNSLLV--RDSSAQWVAYLWDRFNNQRRQKIEEWKELRNYVFATDTTTTSNQGLPWKNSTTLPKLCQIRDNLH 78 (584) T ss_pred CCcchhhhhhhcc--ccchHHHHHHHHHHHHhhhchhhccCHHHHHHHHhhhhhhhhhcccccccccchhHHHHHHHHHH Confidence 9999999999998 89999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHhhcCCccEEEeecCChhHHHH--HHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeeee Q lcl|NC_015158. 81 SNYISALFPNERWLKWEGKSLQDEAK--RDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESGA 158 (581) Q Consensus 81 ~~l~~~~f~~~~~~~~~~~~~~d~~~--ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~~ 158 (581) ++||++||||++||+|+|+.++|+++ +++|+.||.|||++|+|+++|+.+|+|+++||+||+||||.++++++.+. . T Consensus 79 ~~l~~~~Fp~~~w~~~v~~~~~~~~~~~~~ai~~~i~dkl~e~~~~~~~~~~i~d~~~~G~~~~k~~~~~~~~e~~e~-~ 157 (584) T protein:vir:95 79 SNYFSSLFPNDDWLRWVGYGKGDSTKTKAKAIQAYMSNKCRESHFRTEVSKLIYDYIDYGNAFATVSFEAKYKEMTDG-T 157 (584) T ss_pred HHHHHhhcCccceeeeecCCCchhhHHHHHHHHHHHhhhhhhccHHHHHHHHHHhhccCCceEEEEeEeecceeeecc-c Confidence 99999999999999999999999876 99999999999999999999999999999999999999999999888864 4 Q ss_pred EeeeeccceEEecchhheeecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchhhhh Q lcl|NC_015158. 159 TRDTYFGPRAVRIDPKDIVFNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDCEK 238 (581) Q Consensus 159 ~~~~~~~p~ie~V~p~df~~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~ 238 (581) ....+++|++++||||||||||+|++++|+.||+|+++|+++|++|+...+...|..+++......++....+...+.++ T Consensus 158 ~v~~~~~prieriSP~d~~~Dpsa~~i~d~~fivrs~~T~~~L~~l~~~~~~~~y~~d~v~~~~~~~~~~~~~~~~~~~~ 237 (584) T protein:vir:95 158 LVPDYIGPRLVRISPLDIVFNPLATSISDTFKIVRSVKTKGELMRLAQDEPEQSYWLEALKRREEICRHLGGYSVEDFDK 237 (584) T ss_pred cccccccceEEeeChhheeecCCCCCccchhhhhhhhhhHHHHHHHHhhcCccccchHHHHHHHHhccCCCCCccccccc Confidence 55578899999999999999999999999999999999999999998665444555555444444444566667788999 Q ss_pred ccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccccCCccc Q lcl|NC_015158. 239 AVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIRQDNLY 318 (581) Q Consensus 239 ~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p~~~~ 318 (581) .++++.+++.++++|+.+++|+||||||++|+..+|+..++++|+|++|+++||++.||||+|+.||++.+|.|+++++| T Consensus 238 ~~~~~~d~~~~~~ey~~~~~V~vl~~~g~~~~~~~~e~~~~~iv~v~~g~~iIR~~~np~~~~~~PF~~~~~~p~~~s~y 317 (584) T protein:vir:95 238 AAGFDVDGFGNLYEYYMSDWVEILEFYGDYHDKETGELQTNRIITVVDRSTEVRNESIPTWFGSAPIYHVGWRFRPDNLW 317 (584) T ss_pred ccccccccccccccccCCceeEEEeecccccccccCCCcccceEEEEeccEEEEeeecCCCCCCCCEEEEcceeeecccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEeccccccccCCceeEEeCCCCCcccccCCC-ccchhHHHHHHHH Q lcl|NC_015158. 319 AMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGDVEEFVWGPMEQIYINGDGDVEMMAPNT-QALQADMQIQILE 397 (581) Q Consensus 319 G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d~~~i~~~pG~vi~~~~~~~i~~~~~p~-~~~~~~~~lq~~~ 397 (581) |+|+++++.|+|+++|+++|+|+||+++++||.+....++++..++||++|++..+++++++++|+ ...+++++||+++ T Consensus 318 G~gi~~ll~d~Q~~lna~~r~~iDnl~l~~~pv~k~~~~~~~~~~~pg~~~~~~~~~~~q~~~p~a~~~~s~~~~lq~~e 397 (584) T protein:vir:95 318 AMGPLDNLVGMQYRIDHLENAKADAVDLIIQPPLKIIGEVEEFVWGPGAEIHLDQGGDVQEIAKNVNYIINADNQIQMLE 397 (584) T ss_pred CCCchhhhhhHHHHHhHHHHHHHHHHHHhcCcceeeccccchhcccCCceeecCCCCCcceecCchhhhhHHHHHHHHHH Confidence 999999999999999999999999999999999888888999999999999999999999999885 4567889999999 Q ss_pred HHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceeeecCchhcc Q lcl|NC_015158. 398 AKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADTIRVFDSDDKV 477 (581) Q Consensus 398 ~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~iR~~~~~~~~ 477 (581) +.++++||||++++|+++.+++||||+|||||||++++++++++|++.+++||+.+|++|+++|||.+++||++|+|+|+ T Consensus 398 ~~me~~sGvp~~~~G~~~~~~~TAtg~s~l~naa~~~~r~~~~~f~~~ll~~l~~ll~~~~~~nmd~~~~vr~~n~e~~~ 477 (584) T protein:vir:95 398 DRMELYAGAPREAMGIRTPGEKTAFEVQQLGNAAGRIFQEKVTTFEVELLEPVLNAMLETATRNMDGSDVIRVMDTDLGV 477 (584) T ss_pred HHHHhhhCCChhhcccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCceeeecccccc Confidence 99999999999999999889999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHHHHHHHHhcCCCcccccCCCCcHH Q lcl|NC_015158. 478 ATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLAKMLEHNLSLGGWDIFKPNVAVME 557 (581) Q Consensus 478 ~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~~~e~~~l~~~~~~~~~~~~~~ 557 (581) ++|++|+|+||+|+|++||+||+++++|+|..|.|.++++.+++++++||.+.++|.+.+++.+++++|++|++++++.+ T Consensus 478 ~~f~~i~r~Dl~g~~~~va~Ga~~~~~keq~~q~l~~ilq~~~~~~i~p~~~~~~l~~~ladl~~~p~~~~~~~~~~~~~ 557 (584) T protein:vir:95 478 KEFMSVTREDITANGKIRPIGARHFGKQAQDLQNLVGIFNSQIGQMILPHTSGKALATFVDDVTGLQGYEIFRPNVAVAE 557 (584) T ss_pred ccccccChhhhccCeeEEeehhhHHHHHHHHHHHHHHHHHhhhhhhccccchHHHHHHHHHHHhCCCcccccCCCcccch Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 558 AQTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 558 ~~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) |++.|.++.+||+.++.++|+|-= T Consensus 558 Q~~~q~~~~~~q~~~~~~~~~~~~ 581 (584) T protein:vir:95 558 QAETQSLVAQAQEDLQLQAQMPAE 581 (584) T ss_pred hHHHHhhhHHHHHHHHHHHhhhhc Confidence 999999888999999999998876 No 2 >protein:vir:3139 Length: 599 # NCBI annotation: hypothetical protein # Family: family:all:1548 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640321;genbank:gi:21234402;genbank:GeneID:956054 Probab=100.00 E-value=5.5e-170 Score=948.51 Aligned_cols=578 Identities=30% Similarity=0.503 Sum_probs=538.0 Q ss_pred Cccchhhhhhhcc--chhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhcccccccccccccccccccccchHHHHHH Q lcl|NC_015158. 1 MTGKVLELQQMLD--DTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTTTTTNSTLPWKNKTTLPKLCQIRDN 78 (581) Q Consensus 1 ~~~~~~~~~~~~~--~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~~~~~~~~~~k~~~~~pki~~~~d~ 78 (581) ||+++++|+||++ +..+++|++|+++|++|+++|++|+++|+|++|||+++|+|+|||+++|||||+|+||+++++++ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~r~~~~~~w~e~~~yi~~~~tr~t~~~~~~w~~s~t~~k~~~~~~~ 80 (599) T protein:vir:31 1 MSTDIKTLQKMLEGRDDDRAFIDELVVLFTNMENARAQKDREDKELMDYIDATDTRKTSNSKLPFKNSTTINKLAHLHLM 80 (599) T ss_pred CccchHHHHHHhhccCchHHHHHHHHHHHHhhhhhhhhhhcccHHHHHHHhhhcccccccCCCCcccccchHHHHHHHHH Confidence 9999999999998 66778889999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHhhcCCccEEEeecCChhHH--HHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeee Q lcl|NC_015158. 79 LHSNYISALFPNERWLKWEGKSLQDE--AKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEES 156 (581) Q Consensus 79 ~~~~l~~~~f~~~~~~~~~~~~~~d~--~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~ 156 (581) ||++||+++|||++||+|+|..++++ ++++++++||++||+||+|+.+++.+|+|+++||+||++++|+++.++.. + T Consensus 81 l~a~~~~~~fp~~~w~d~~~~~~~~~~~~~~~~i~~yi~~Kl~e~~~~~~~~~~v~d~i~~G~~vat~~~er~~~~~~-d 159 (599) T protein:vir:31 81 ITTSYMEHLLPNRNWVDFVGFDNDSVNAEKREIARSYVRGKVEASNLEGVIERMVDDFAVRGFCVAHTRHVKRMTVTA-E 159 (599) T ss_pred HHHHHHhhhcCCccceEeeecCCchhHHHHHHHHHHHhhhhhhhcchHHHHHHHHhhhcccCceeEeeeEEEcceeec-c Confidence 99999999999999999999999964 67999999999999999999999999999999999999999999988864 6 Q ss_pred eeEeeeeccceEEecchhheeecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccc--- Q lcl|NC_015158. 157 GATRDTYFGPRAVRIDPKDIVFNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTR--- 233 (581) Q Consensus 157 ~~~~~~~~~p~ie~V~p~df~~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~--- 233 (581) +.+..++++|++++||||||||||+|++++||.||+|.++|+++|++|......+.| ..+..+..++.+...+..+ T Consensus 160 ~~v~~~~~~P~~ervsP~Di~~Dp~A~si~d~~fivRs~~Tk~~L~~l~~~~~~~~y-~~d~~~~~~~~~~~~~~~~~d~ 238 (599) T protein:vir:31 160 NQVIKNYSGTVTERLSPSDVFWDVTADSLPKAAKCIRQLYTLGSLKREIEEGTFPLM-SMEDFQKLREERRTIREALADG 238 (599) T ss_pred cccccccccceEEeecccceeeCCCCCCCCcceeeeehhhhHHHHHHHhccCCcccc-chHHHHHHHhhccCCCccccch Confidence 788999999999999999999999999999999999999999999999876655555 4555555555554333322 Q ss_pred -hhhhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccc Q lcl|NC_015158. 234 -EDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRI 312 (581) Q Consensus 234 -~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~ 312 (581) .++.+..+...++++++++||++++|++|||||++|+.++|+..++.++||++++++||++.||||||+.||++.+|.| T Consensus 239 ~~~~~g~D~~~~d~~~~~~eY~~~~~VevLeywGd~ydee~d~~~~~~ViTi~g~~~liR~e~np~~~g~~Pyvv~~~~P 318 (599) T protein:vir:31 239 YNGRRKFDSLHKKGYGSMMNYINEGVVEVLTFMGDFYDEENDELWNNYEITVIDRKIIGRKQSKDTWDGSQNLHIAVYEF 318 (599) T ss_pred hhhhhhccccccccccchhhhcccchhhhhhhhhhhhcccCCccccceEEEEecCcEEeecccCCCCCCCCCeEEEEeee Confidence 3356666677799999999999999999999999999999999999999999989999999999999999999999999 Q ss_pred cCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc--ccccccCCceeEEeCCCCCcccccCCCccchhH Q lcl|NC_015158. 313 RQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD--VEEFVWGPMEQIYINGDGDVEMMAPNTQALQAD 390 (581) Q Consensus 313 ~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d--~~~i~~~pG~vi~~~~~~~i~~~~~p~~~~~~~ 390 (581) +++++||+|+++++.++|.++|+++|+++||+.++..|++..+++ +.++.|.||++|++.+.+++++++||+...+++ T Consensus 319 ~~~~~yG~G~l~~~~gaQ~~lN~~~Ng~iD~~~~~l~p~l~~~~dl~~eD~~~~P~~v~~~~d~~~vq~~~p~s~~~~a~ 398 (599) T protein:vir:31 319 QKDTLCPIGPLHRLTGMQYKLDKRENFREDLHDRFLHPSLKKVGDVREKGMRGGPNHVFEVEETGDVQYMTPPAEVLQPD 398 (599) T ss_pred eccccCCCCCchhcchHHHHHHHHHHHhhhhhhhhhcccccccccccccCccCCCCcceeecCCCccccccCchhhhhHH Confidence 999999999999999999999999999999999999998887776 556889999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceeee Q lcl|NC_015158. 391 MQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADTIRV 470 (581) Q Consensus 391 ~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~iR~ 470 (581) +++++++..+++.||+|++++|+++.+++||+|+++||+||++|+++++|+|++++++||++.+++..++++|.++|||+ T Consensus 399 ~~is~~e~~mee~sGvp~~~~G~~~ag~~TA~~is~l~naa~~~~~~~vr~~e~~~lepll~~l~e~~~~f~D~~~tiri 478 (599) T protein:vir:31 399 NQLSITLQLMEDLSGAPKESIGQRTAGEKTKFEVQLLDQGQNKVFRRKVKKFERELLTPVLNDYLEQGRNHLDASDTIKT 478 (599) T ss_pred HHHHHHHHHHHHhhccchhhcCCcccchhhHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccceee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHHHHHHHHhcCCCccccc Q lcl|NC_015158. 471 FDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLAKMLEHNLSLGGWDIFK 550 (581) Q Consensus 471 ~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~~~e~~~l~~~~~~~ 550 (581) +|+|.|+++|++|+|+||++++++|++||+++++|++..|+|+++++.++++.+.||+++++|++++++..+|.+|.+|+ T Consensus 479 ~~~e~~~~~f~~i~redl~~~~~~v~~Ga~~v~ere~~~q~l~~il~~~~~q~~~P~~~~k~l~~~l~~~~~l~~~~~~~ 558 (599) T protein:vir:31 479 FNSELGTATFLDITADDLNLNGQMVAQGATLFAEKANTLQNLNAILGGPLGAALAPHMSRTKLFNAVEYLGDLDAYGIFT 558 (599) T ss_pred ecccccceeeEEeehhhhhCCeeeeechhhHHHHHHHHHHHHHHHhcccCCCccchhhHHHHHHHHHHHHHhccccccCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCcHHHHHHHHHHHHHHHHHHHHhcccC--C Q lcl|NC_015158. 551 PNVAVMEAQTTSALVNQSQAQIEEEAQVPL--V 581 (581) Q Consensus 551 ~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~--~ 581 (581) |+.+++++|.+.+| +|+|.+.++++|+-- + T Consensus 559 ~~va~~eqq~~~~m-~Q~~lq~~~~~~~~~~~~ 590 (599) T protein:vir:31 559 FGIGVQEDQQLARM-AQKSTQQTEETALTQEEV 590 (599) T ss_pred CchhHHHHHHHHHH-HHHHHHHhHhhhhhhhhc Confidence 99999888766654 447777777776532 2 No 3 >protein:vir:80165 Length: 651 # NCBI annotation: portal protein # Family: family:all:1548 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285799;genbank:gi:148747833;genbank:GeneID:5220441 Probab=100.00 E-value=2.1e-110 Score=621.87 Aligned_cols=566 Identities=17% Similarity=0.256 Sum_probs=445.4 Q ss_pred Cccchhhh--hhhccchhhhHHHHHHHHHhhHHhhhhhHHHHHH----------HHHHHhhccccccccccccccccccc Q lcl|NC_015158. 1 MTGKVLEL--QQMLDDTRDGLAEQIANTWQNWNSQRQEWLSQKS----------ELRNYIFATDTTTTTNSTLPWKNKTT 68 (581) Q Consensus 1 ~~~~~~~~--~~~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~----------~~~~y~~~~~~~~~~~~~~~~k~~~~ 68 (581) |++++++. ++|+| .|++|++|.++|++++++|++++++|. ++++|++.++.+.+++++.+|||+++ T Consensus 3 ~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~r~~~~~~w~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~rs~~~ 80 (651) T protein:vir:80 3 LATTTTDKNRQTYDE--THDVSSYVKKEYKRFCDARQVCEETWLEAWGMYLSTPEAQDYLRDQVLRSVGDVNADWRHKIT 80 (651) T ss_pred ccccccchhhhhhhh--hHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHhhcccHHHHHhhccccccccCCCCCCCCcccc Confidence 99999998 67776 699999999999999999999999995 56799999999999999999999999 Q ss_pred ccchHHHHHHHHHHHHHhhcCCccEEEeecCChhHHH--HHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEee Q lcl|NC_015158. 69 LPKLCQIRDNLHSNYISALFPNERWLKWEGKSLQDEA--KRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEY 146 (581) Q Consensus 69 ~pki~~~~d~~~~~l~~~~f~~~~~~~~~~~~~~d~~--~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~ 146 (581) .|+++.++++++++||+.|||+++||+|+|...++++ .+++++.++.++|.+|+|..+++.+++|++++|+||+|++| T Consensus 81 ~~~v~~~ve~~~~~l~~~~~~~~~~~~~~p~~~~d~a~~~~~~~~~~~~~~l~~~~~~~~~~~~~~d~l~~G~~i~kv~w 160 (651) T protein:vir:80 81 TGKAFEAIETIHAYLMSATFPNKNWFDVVPAKPGQDNLLVSRLIKRYVQDKLTEGKFRAAYANFLRQLLITGNSVLALPW 160 (651) T ss_pred ChhHHHHHHHHHHHHHHhhcCCCceeEeccCCchhHHHHHHHHHHHHHHHHhhccCcHHHHHHHHHhhcccCceEEEEee Confidence 9999999999999999999999999999999887754 46778999999999999999999999999999999999999 Q ss_pred ecceeeeeeee---------------e--EeeeeccceEEecchhheeecCCCCCcccCceEEEEEecHHHHHHHhhccC Q lcl|NC_015158. 147 VKETTKDEESG---------------A--TRDTYFGPRAVRIDPKDIVFNPVAVDFAHSPKIIRTVLNEGELLQMEQDQP 209 (581) Q Consensus 147 ~~~~~~~~~~~---------------~--~~~~~~~p~ie~V~p~df~~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~ 209 (581) ..+.....+.. . ......+|++++|+|++|||||+|+++.||.||+|.++|+++|.++.. T Consensus 161 e~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~i~~v~p~~~~~dp~a~~~~d~~~v~~~~~t~~~l~~l~~--- 237 (651) T protein:vir:80 161 RVETAEVKKKVQVRTPLFEDEPTFEVVSEEREVKSSPDFEVLDMFDCFYDPNVTDPNRGAFIRKLTKTKADILNLLS--- 237 (651) T ss_pred cceeeeeehheeccccccccccceeeeccceeeeceeEEEEecHHHeeecCCCcCccccceeeeeeeeHHHHHHHHh--- Confidence 75543332211 1 111234689999999999999999999999999999999999988752 Q ss_pred ccchhHHHHHHHHhhhccCCc-ccchhhhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCC Q lcl|NC_015158. 210 ENASLASAIARRREFRRGLGT-YTREDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRM 288 (581) Q Consensus 210 ~~~~~~~d~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~ 288 (581) ++.|...+............+ +.....+..++.+.++ +.....|+|+|||+.+ +.+++++. .++++++|+ T Consensus 238 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~------~~~~~~v~v~E~~~~~-d~e~~~~~--~~~v~~~g~ 308 (651) T protein:vir:80 238 EGYYYGVDPLDVVEHKCKDTSDTKQDMLSTFQGVTTSL------WSPHQNVELLEYWGDI-HLENKTYH--DVVVTIMGN 308 (651) T ss_pred cccccchhhHHHHhhhccccccCCccccccccCCCccc------cccccceEEEEEEEEe-eccCCceE--EEEEEEcCc Confidence 222323333333332222211 1111122222222111 1223468999999965 45566653 344466789 Q ss_pred EEEEeecCCCccCCCCeeEecccccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc----ccccccC Q lcl|NC_015158. 289 FVIEEKENPSWFAQAPIFHCGWRIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD----VEEFVWG 364 (581) Q Consensus 289 ~iir~~~nP~~~g~~Pf~~~~~~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d----~~~i~~~ 364 (581) +|||.++||||++. ||++++|.++||++||+|+++++.+.|+.+|+++|+++||+++++||+|.+..+ ++++.++ T Consensus 309 ~il~~~~~~~~~~~-Pf~~~~~~~~~~~~yG~g~~~~~~~~q~~ln~l~~~~ld~~~~~~~~~~~v~~d~~~~~~~l~~~ 387 (651) T protein:vir:80 309 EVLRFEQNPYWCGR-PFVIGTYIPTARQPYAMGALQPNLGMLHELNIITNQRLDNLELAIDQMYTLRSDGLLQPEDVYTE 387 (651) T ss_pred EEecccccCCCCCC-CeeeecceecCccccCCChHHHHhHHHHHHHHHHHHHHHHHHHHhCCcEEecCCccccHHHhhcC Confidence 99999999999765 999999999999999999999999999999999999999999999999998764 6678899 Q ss_pred CceeEEeCCCCCcccccCCC-ccchhHHHHHHHHHHHHHhcCCchHhcCCCCc--ccccHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015158. 365 PMEQIYINGDGDVEMMAPNT-QALQADMQIQILEAKMEEFAGAPREAMGIRTP--GEKTAFEVQQLQNAAGRIFQEKIMN 441 (581) Q Consensus 365 pG~vi~~~~~~~i~~~~~p~-~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~--~~~TAtgv~~l~~aa~~~~~~i~r~ 441 (581) ||++|+++.+++++++++++ .+..++.+++++++.++++|||+++++|..+. .+.||||+++++++++.++..++++ T Consensus 388 pg~vi~~~~~~~~~~l~~~~~~~~~~~~~l~~l~~~~~~~~gv~~~~~g~~~~~~~~~TAteI~~~~~~~~~~l~~v~~~ 467 (651) T protein:vir:80 388 PGKVFLVSDHGDLQPLANQSSNFSITYQESSFLESTIDKNFGTGNYVGANAARSGERVTAAEVAAVREAGGNRLSGIHKH 467 (651) T ss_pred CCceEEecCCCCceeeccCcccchhHHHHHHHHHHHHHHHhcCChHHhCCCccchhhccHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999998754 45567788999999999999999999998553 5789999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhhcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccc- Q lcl|NC_015158. 442 FEVMLMEKVLNAMLEISRRNLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPV- 520 (581) Q Consensus 442 f~~~~~~~li~~~~~f~~~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~- 520 (581) |+++|++|||+.++.++.+|++.++++|++|++.++++|++++++|++++++||++|+.++.+|.+.++.|.++.+... T Consensus 468 l~~e~l~pl~~r~l~l~~~~~~~~~~~ri~~~~~~~~~~~~i~~~dl~~~~~iv~~g~~~~~~r~~~~~~l~~~~q~~~~ 547 (651) T protein:vir:80 468 IEETSLLVLLEKVMHLVQQFTDQPGMVRVAGDEAGAYEYYELDVEDLQKEVRLVPIGSDHVIERKQYIEDRLTFIQAVAQ 547 (651) T ss_pred HHHHHHHHHHHHHHHHHHHhcCcccceeecccccccccccccCccceeeeeeeeeccHHHHHHHHHHHHHHHHHHHhhcc Confidence 9999999999999999999999999999999999999999999999999999999999999999998888887765321 Q ss_pred cccccchhHHHHHHHHHHHHhcCCCcccc-cCCCCcHHHHHHHH------------HHHHHHHHHHHHhcccCC Q lcl|NC_015158. 521 WQDIKPHVSTENLAKMLEHNLSLGGWDIF-KPNVAVMEAQTTSA------------LVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 521 ~~~i~p~~~~~~l~~~~~e~~~l~~~~~~-~~~~~~~~~~~~q~------------~~q~aq~~~~~~~~~~~~ 581 (581) .+.+...+....+++-+.+..|++..+.+ .+..+...++.+++ +.+++++++++.+|.-.. T Consensus 548 ~p~~~~~~~~~~~~~~l~~~~g~~~~~~~l~~~~q~~~~~~~~~~~~q~~~~~~~a~~~~~~~~~~~~~~~~~~ 621 (651) T protein:vir:80 548 VPEMGQLVDYKRILVDLLQHWGFEEPEAYLKQQDQQAPANPQEALLSQAKDVGGQAMSNMLQNQLQADGGTQMM 621 (651) T ss_pred CCccchhhhHHHHHHHHHHHcCCCCcHHhcCCCccchhhhhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 23344445545555556677888776543 22111111111111 111111222111111111 No 4 >protein:vir:94599 Length: 641 # NCBI annotation: PfWMP4_39 # Family: family:all:1548 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762669;genbank:gi:115304377;genbank:GeneID:5142299 Probab=100.00 E-value=3.8e-101 Score=571.14 Aligned_cols=554 Identities=14% Similarity=0.184 Sum_probs=426.7 Q ss_pred Cccchhhh-----hhhccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHH----------HHhhcccccccccccccccc Q lcl|NC_015158. 1 MTGKVLEL-----QQMLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELR----------NYIFATDTTTTTNSTLPWKN 65 (581) Q Consensus 1 ~~~~~~~~-----~~~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~----------~y~~~~~~~~~~~~~~~~k~ 65 (581) |.+-++|= .+|- .|++|++|.++|++++++||+|+.+|+||. +|+++++++++++.+.+|+| T Consensus 5 ~~~~~~~~~~~~~~~~~---~~~~~~~l~~~~~~~~~~R~~~e~~W~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~ 81 (641) T protein:vir:94 5 MPTPIIEDKESAKRKLS---TDRIGGVVISKWQESRDKRNTVENNWDETYELYRASAIDRQNTRARNFQTTGADDADWRH 81 (641) T ss_pred CCcccccCCcchhhcCC---chhHHHHHHHHHHHHHHhhcchHHHHHHHHHHhhcchhhhhhcccccccccccchhcccc Confidence 66666662 2333 488999999999999999999999999985 77889999999999999999 Q ss_pred cccccchHHHHHHHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEe Q lcl|NC_015158. 66 KTTLPKLCQIRDNLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVE 145 (581) Q Consensus 66 ~~~~pki~~~~d~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~ 145 (581) |++.|++++++++|+++||+.|||+++||+|.|.+++|+++|+.++++++++|.+++|+.+++.+++|++.+||||+|++ T Consensus 82 ki~~~~~~~~~~~l~s~Lm~~~~p~~~wf~~~p~~~ed~~~A~~~~~~~~~~l~~~~~~~~~~~~~~d~~~~g~~iv~~~ 161 (641) T protein:vir:94 82 RINTGHTFEVVETLVAYFKGATFPSDDWFDLKGMVPELADAARVVKQLTKTKLEAASIRDIFETYVRNLVLYGVSTYRLG 161 (641) T ss_pred cccchhHHHHHHHHhhHHhhhhcCCCceEEEecCCCChHHHHHHHHHHHHHHHhhcchHHHHHHHHHHHhhcCceEEEee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eecceeeee--------------eeeeEeeeeccceEEecchhheeecCCCCCcccCceEEEEEecHHHHHHHhhccCcc Q lcl|NC_015158. 146 YVKETTKDE--------------ESGATRDTYFGPRAVRIDPKDIVFNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPEN 211 (581) Q Consensus 146 ~~~~~~~~~--------------~~~~~~~~~~~p~ie~V~p~df~~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~ 211 (581) |........ +.+.+....+++++++|+|+|||+||++...++ .|| +.++|++++.+++.. T Consensus 162 w~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~r~~~v~~~di~~dps~~~~~~-~f~-~~r~t~~t~~~l~~e---- 235 (641) T protein:vir:94 162 WDTSMERQFKRTFVETGDIFGGWEDVAVNRQRSELRIEPLSPYDVWLDTSGGKNTG-TFV-RLRHTREELHELVTS---- 235 (641) T ss_pred hhhHHHHhhhhhcccchhhcccccccceecccceeeEEecchhheeecCCCCcccc-cce-ehhhhHHHHHHHHhc---- Confidence 954322111 122334456678999999999999999865444 444 456688888777422 Q ss_pred chhHHHHHHHHhhhccCCcccchhhhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEE Q lcl|NC_015158. 212 ASLASAIARRREFRRGLGTYTREDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVI 291 (581) Q Consensus 212 ~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~ii 291 (581) +|+..+...... .. .+..++.+... +++ +......+++||||++++. +.....++++++|++|| T Consensus 236 g~~~~d~v~~~~-~~---~~~~~~~d~~~--d~~-------~~~~~~~~~~e~~gd~~~d---~~~~~~~~~~~~g~~il 299 (641) T protein:vir:94 236 GYYDLDLTQVEQ-YV---DYKFADPDTPK--DVN-------GTDTSGWDIIEYYGPLLVE---GVQFWCVHAVFYGKQLI 299 (641) T ss_pred CCCChhhcchhh-cc---ccccccccccc--ccc-------cccccccceeeeeeeeccC---CCceeeEEEEEeCCEEe Confidence 222222221111 00 01111111111 111 1112235789999987543 22233455567899999 Q ss_pred EeecCCCccCCCCeeEecccccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc----ccccccCCce Q lcl|NC_015158. 292 EEKENPSWFAQAPIFHCGWRIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD----VEEFVWGPME 367 (581) Q Consensus 292 r~~~nP~~~g~~Pf~~~~~~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d----~~~i~~~pG~ 367 (581) |.++|+||++ .||++++|.++||++||+|+++++++.|+++|+++|+++||+.+++||++.+..+ +++++.+||+ T Consensus 300 ~~~~~~~~d~-~Pf~~~r~~~~~~~~YG~gp~~~~l~dqk~ln~l~r~~ld~~~~~~~p~~~~~~~~~~~~~~l~~~PG~ 378 (641) T protein:vir:94 300 RLSDSKYWCG-SPFVTTTLLPDRDSVYGMSVLHPNLGALHVLNVLTNGRLDNLVLHINKMWTLVEDGILKREDVKAKPGA 378 (641) T ss_pred ecccccccCc-CCeEEecceecCCcccCCChHHHHHHHHHHHHHHHHHHHHHHHHHhCCeeeeccccccccceeeccCCc Confidence 9999998775 4999999999999999999999999999999999999999999999999987654 5668889999 Q ss_pred eEEeCCCCCcccccCCCccc-hhHHHHHHHHHHHHHhcCCchHhcCCCCc-c-cccHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015158. 368 QIYINGDGDVEMMAPNTQAL-QADMQIQILEAKMEEFAGAPREAMGIRTP-G-EKTAFEVQQLQNAAGRIFQEKIMNFEV 444 (581) Q Consensus 368 vi~~~~~~~i~~~~~p~~~~-~~~~~lq~~~~~~ee~TGv~~~~~G~~~~-~-~~TAtgv~~l~~aa~~~~~~i~r~f~~ 444 (581) +|+++..++++|+++++..+ ....++++++..+++.+|+..+.+|.++. + +.|||||++++++++.++..++|+|++ T Consensus 379 ii~~~~~~~v~pl~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~TAtEV~~~~~e~~~~l~~i~r~l~~ 458 (641) T protein:vir:94 379 VFKVAQHGSLQPIDMGRQDFVVTYQEAQVQESSVYRNTSTGPLIGNAAPRGGERVTAAEIQGVRDAGGNRLSSVHTHIED 458 (641) T ss_pred ceeeCCCCcceeecCCccccchhHHHHHHHHHHHHHhhhhhhhhcccccccchhccHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999998776433 34566899999999999999999987653 2 579999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHhhcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhccc-cccc Q lcl|NC_015158. 445 MLMEKVLNAMLEISRRNLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTP-VWQD 523 (581) Q Consensus 445 ~~~~~li~~~~~f~~~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~-~~~~ 523 (581) +|+.|||+.+++.+.++++.|.++|++|.+...++|++++|++|+|++++++.|..+..++++.++.|.++++.. ..+. T Consensus 459 e~l~pll~~~~~~~~~~~~~p~i~R~~~~~~~~~~~~~~~p~~L~~~~~iv~l~~~q~~~~~~~i~~l~~~~~~~a~~P~ 538 (641) T protein:vir:94 459 SSTLPLLNKVFSLLQQFYVTPETIRMYVPEEQMDGFFEVSPEYLHYPYKFLALGANYVVERERMVTDLLQLLDISGRVPQ 538 (641) T ss_pred HHHHHHHHHHHHHHHHhccchhhhhhhchhhhcccCCCCCccceeeeeeEeecchhHHHHHHHHHHHHHHHHHHhhcChh Confidence 999999999999999999999999999999999999999999999999999999999999999999888887542 1234 Q ss_pred ccchhHHHHHHHHHHHHhcCCCcccccCCCCcHHHHHHHHHHHHHHHHHHHHhc---ccCC Q lcl|NC_015158. 524 IKPHVSTENLAKMLEHNLSLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQ---VPLV 581 (581) Q Consensus 524 i~p~~~~~~l~~~~~e~~~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~---~~~~ 581 (581) ++..+....+++.++++.|++.+..+.-..+..+++ .+..++++|+.+-+.+| .... T Consensus 539 v~d~~d~~~~~~~~~~~~g~~~p~~~ir~~~~~~~~-~~~~~~~~q~~~~~~a~~~~~~~~ 598 (641) T protein:vir:94 539 IGQSLDYALILEDLLRQMRFTDPMRYIKKAEAPPAA-PPIAPAEPGALPPEMMNSVGGGLN 598 (641) T ss_pred hhhcCCHHHHHHHHHHHhCCCCchhhccCccCchhH-HHHHHHHHHHHHHHHHHHHHhhhH Confidence 555555666667777888887664331111111111 11111111111111111 0000 No 5 >protein:vir:95821 Length: 763 # NCBI annotation: 94 kDa protein # Family: family:all:1548 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950537;genbank:gi:119952228;genbank:GeneID:5075648 Probab=100.00 E-value=3.4e-94 Score=532.98 Aligned_cols=553 Identities=10% Similarity=0.081 Sum_probs=381.7 Q ss_pred Cccchh---------hhhhhccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhccccc-cccccccccccccccc Q lcl|NC_015158. 1 MTGKVL---------ELQQMLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTT-TTTNSTLPWKNKTTLP 70 (581) Q Consensus 1 ~~~~~~---------~~~~~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~-~~~~~~~~~k~~~~~p 70 (581) |--.+- -.-+..+=-.+. .++.+-.....+++.-..+..+...|+.++... ..-..+.++||+++.| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~grs~vv~~ 77 (763) T protein:vir:95 1 MEQNTDSMVPLPDPSQATKLTSWKNEL---SLQALKADLDAAKPSHTAMMIKVKEWNDLMRIEGKAKPPKVKGRSQVQPK 77 (763) T ss_pred CCcCccCcCCCccccchhcCCCCCChH---HHHHHHHHHHhhhcchhHHHHHHHHHHHhhhccccCcccccCCCccccCH Confidence 111000 000111100122 333344444455555555444443333332211 2234566789999999 Q ss_pred chHHHHHHHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHH-HhcchHHHHHHHHHHHhhcCceEEEEeeecc Q lcl|NC_015158. 71 KLCQIRDNLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKV-KESDFRTIMSQLLLDYIDYGNCFATVEYVKE 149 (581) Q Consensus 71 ki~~~~d~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l-~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~ 149 (581) .++..+++++++||+.|+++++||.|+|.+.+|+++|++.+.|++++| ++++.+.++++||+|+++.|+||+|++|..+ T Consensus 78 ~v~~~ve~~~~~l~~~f~~~~~~~~~~P~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~gv~k~~W~~~ 157 (763) T protein:vir:95 78 LVRRQAEWRYSALTEPFLGSNKLFKVTPVTWEDVQGARQNELVLNYQFRTKLNRVSFIDNYVRSVVDDGTGIVRVGWNRE 157 (763) T ss_pred HHHHHHHHHHHHHHHhhcCCCcEEEEecCCcchHHHHHHHHHHHHHHHhhcCchhhHHHHHHHHHhhcCcceEEEeeeee Confidence 999999999999999999999999999999999999999999999988 6788899999999999999999999999644 Q ss_pred eeeeee----------------------------------------------------------------eeeEeeeecc Q lcl|NC_015158. 150 TTKDEE----------------------------------------------------------------SGATRDTYFG 165 (581) Q Consensus 150 ~~~~~~----------------------------------------------------------------~~~~~~~~~~ 165 (581) +..+.+ ....+..+.+ T Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~ 237 (763) T protein:vir:95 158 IRKEKQEVPVFSLFPIQTQEQADALQQALQLRTDNPRGYEENVDEAIKESVRFFDETGQATYAVQTGTTTTEVEVPLANH 237 (763) T ss_pred eeeeeeeehhhhhccccchhHHHHHHHHHHhhhhhhccccccccchhhhhhhhccccCcceeeecccceeEEEEEEecCc Confidence 322110 0112334568 Q ss_pred ceEEecchhheeecCCCCC-cccCceE-EEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchhhhhccccc Q lcl|NC_015158. 166 PRAVRIDPKDIVFNPVAVD-FAHSPKI-IRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDCEKAVGFS 243 (581) Q Consensus 166 p~ie~V~p~df~~DP~a~~-~~d~~~i-~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~ 243 (581) |+|++|+|+||||||+|++ ++||.|| ++.++|++||.+|... |...+-........ ...+-.+..... T Consensus 238 p~ie~V~p~d~~iDp~a~sD~~Da~~~~~~~~~t~~dL~~~~~~-----y~~~~~~~~~~~~~-----~~~~~~~~~~~~ 307 (763) T protein:vir:95 238 PTVEMLNPENIIIDPSCQGDINKAMFAIVSFETCKADLLKEKDR-----YHNLNKIDWQSSAP-----VNEPDHATTTPQ 307 (763) T ss_pred eEEEeecHHHheecCCCCCchhhCceEeeEEeccHHHHHhccCC-----ccccchhcchhccc-----cccccccccchh Confidence 9999999999999999985 9999984 6778899999887321 21111111111000 000000000000 Q ss_pred cccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccccCCcccCCCcH Q lcl|NC_015158. 244 MDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIRQDNLYAMGPL 323 (581) Q Consensus 244 ~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p~~~~G~s~~ 323 (581) .+++.+ .....|+++|||+.+ +..++++.+.++| ++.|+++||.+.|||++|++||+++++.|+||++||+|++ T Consensus 308 ~~~~~d----~~~~~V~v~E~y~~~-d~~gdg~~~~~~v-~~~g~~iL~~~~~p~~~~~~PFv~~~~~p~~~~~~G~gi~ 381 (763) T protein:vir:95 308 EFQISD----PMRKRVVAYEYWGFW-DIEGNGVLEPIVA-TWIGSTLIRLEKNPYPDGKLPFVLIPYMPVKRDMYGEPDA 381 (763) T ss_pred hccCCC----cccceEEEEEeeeee-ccCCcceeEEEEE-EEEcCeeeecccccccCCCcCEEEecceeecCcccCCchH Confidence 011111 112479999999974 5667777776555 5678999999999999999999999999999999999999 Q ss_pred HhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc----ccccccCCceeEEeCCCCC----cccccCCCccchhHHHHHH Q lcl|NC_015158. 324 DNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD----VEEFVWGPMEQIYINGDGD----VEMMAPNTQALQADMQIQI 395 (581) Q Consensus 324 ~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d----~~~i~~~pG~vi~~~~~~~----i~~~~~p~~~~~~~~~lq~ 395 (581) ++++|+|+.+|+++|+++||+++++||+|.+.++ .+..+++||++++++.++. +++..+|+.+..+..++++ T Consensus 382 ~~~~d~Qr~~N~~~~~~~d~l~~~~~~~~~v~~gav~~~d~~~~~pg~v~~v~~g~~~~~~~~~~~~p~~~~~~~~~l~~ 461 (763) T protein:vir:95 382 ELLGDNQAVLGAVMRGMIDLLGRSANGQRGMPKGMLDALNSRRYREGEDYEYNPTQNPAQMIIEHKFPELPQSALTMATL 461 (763) T ss_pred HHhhHHHHHHHHHHHHHHHHHHhhcCCcEEeecccccchhhhcccCCceEEeeCCCChhhhcccccCCCCcchHHHHHHH Confidence 9999999999999999999999999999988654 3346789999999997655 4566677788889999999 Q ss_pred HHHHHHHhcCCchHhcCCCCcc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceeeecCch Q lcl|NC_015158. 396 LEAKMEEFAGAPREAMGIRTPG-EKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADTIRVFDSD 474 (581) Q Consensus 396 ~~~~~ee~TGv~~~~~G~~~~~-~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~iR~~~~~ 474 (581) ++++++++|||+++++|.++.+ +.||+|+++++++++.++..++|+|++ +++++|+.++.++.++++.+.++|+.|++ T Consensus 462 ~~~~~e~~TGv~~~~~G~~~~~~~~tat~v~~l~qa~~~~~~~~~r~~~~-~~k~l~~~~l~Li~q~~d~~rviRI~g~e 540 (763) T protein:vir:95 462 QNQEAESLTGVKAFAGGVTGESYGDVAAGIRGVLDAASKREMAILRRLAK-GMSEIGNKIIAMNAVFLAEHEVVRITNEE 540 (763) T ss_pred HHHHHHHhhCcchhhcCcCcccccchhHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhCCCCcEEEEeCCc Confidence 9999999999999999987654 899999999999999999999999998 67999999999999999999999999986 Q ss_pred hcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHHHHHHHHhcCCCc----cccc Q lcl|NC_015158. 475 DKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLAKMLEHNLSLGGW----DIFK 550 (581) Q Consensus 475 ~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~~~e~~~l~~~----~~~~ 550 (581) ++.|+++++.++|+|++..+.+ ..+.++++.|.++++. +++.+.| .....++.-+.+...++.. .-.. T Consensus 541 -----~v~v~~~~~~~~~DV~V~~~~a-s~~~q~~~~l~~ll~~-l~~~~~~-~~~~~il~~~~d~~~~~~~~~~lr~~q 612 (763) T protein:vir:95 541 -----FVTIKREDLKGNFDLEVDISTA-EVDNQKSQDLGFMLQT-IGPNVDQ-QITLNILAEIADLKRMPKLAHDLRTWQ 612 (763) T ss_pred -----cccccHHHhcCCcceEEecccc-hHHHHHHHHHHHHHHH-hccccCh-HHHHHHHHHHHhhhchhhhHHHHHhcC Confidence 8999999999999986655443 2233344555544433 1111222 2222333333333333332 1123 Q ss_pred CCCCcHHHHHHHHHHHHHHHHHH---HHhcc------cCC Q lcl|NC_015158. 551 PNVAVMEAQTTSALVNQSQAQIE---EEAQV------PLV 581 (581) Q Consensus 551 ~~~~~~~~~~~q~~~q~aq~~~~---~~~~~------~~~ 581 (581) |.+++.++++.|+.+++++++.+ .++|+ -+. T Consensus 613 ~~~d~~~q~qaqle~~~~q~e~~~~~akaq~~qaqa~~~~ 652 (763) T protein:vir:95 613 PQPDPVQEQLKQLAVEKAQLENEELRSKIRLNDAQAQKAM 652 (763) T ss_pred CCccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 44444444332222222221111 01110 011 No 6 >protein:vir:8846 Length: 705 # NCBI annotation: putative portal protein # Family: family:all:1548 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775254;genbank:gi:27476052;genbank:GeneID:2700600 Probab=100.00 E-value=1e-94 Score=535.90 Aligned_cols=550 Identities=13% Similarity=0.088 Sum_probs=389.8 Q ss_pred CccchhhhhhhccchhhhHHHHHHHHHhhHHhhhhhHH-HHHHHHHHHhhcccccccccccccccccccccchHHHHHHH Q lcl|NC_015158. 1 MTGKVLELQQMLDDTRDGLAEQIANTWQNWNSQRQEWL-SQKSELRNYIFATDTTTTTNSTLPWKNKTTLPKLCQIRDNL 79 (581) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~a~~i~~~~~~~~~~r~~~~-~~~~~~~~y~~~~~~~~~~~~~~~~k~~~~~pki~~~~d~~ 79 (581) |+..-+ ...|.+ +.+.+.|..+++++++.+-... .++.+..+|++. +.. ....+|+|+++.|+++.+++++ T Consensus 1 ~~k~~~-~~~~~~---~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~y~g---~~~-~~~~~~~s~~~~~~v~~~v~~~ 72 (705) T protein:vir:88 1 MAKRRK-IKPMDD---EQVLRHLDQLVNDALDFNSSELSKQRSEALKYYFG---EPF-GNERPGKSGIVSRDVQETVDWI 72 (705) T ss_pred CCcccc-cccCCH---HHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHhC---CCC-CcccCCCCccccHHHHHHHHHH Confidence 765432 233333 5567777777765555444332 233443344443 222 3456899999999999999999 Q ss_pred HHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHH-HhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeee-- Q lcl|NC_015158. 80 HSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKV-KESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEES-- 156 (581) Q Consensus 80 ~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l-~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~-- 156 (581) +++||+.||++++||+|+|++++|+++|++.+.+++++| ++++.+.++++||+|++++|+||+|++|......+.+. T Consensus 73 ~~~l~~~~~~~~~~~~~~p~~~~D~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~dal~~g~gi~kv~we~~~~~~~e~~~ 152 (705) T protein:vir:88 73 MPSLMKVFTSGGQVVKYEPDTAEDVEQAEQETEYVNYLFMRKNEGFKVMFDWFQDTLMMKTGVVKVYVEEVLKPTFERFS 152 (705) T ss_pred HHHHHHhhcCCCceEEEeeCChhHHHHHHHHHHHHhHHHhhccchhHHHHHHHHHHhhcCCeEEEeccccccchhhhhhc Confidence 999999999999999999999999999999999999987 55677899999999999999999999996554433221 Q ss_pred ------------------------------ee--EeeeeccceEEecchhheeecCCCCCcccCceE-EEEEecHHHHHH Q lcl|NC_015158. 157 ------------------------------GA--TRDTYFGPRAVRIDPKDIVFNPVAVDFAHSPKI-IRTVLNEGELLQ 203 (581) Q Consensus 157 ------------------------------~~--~~~~~~~p~ie~V~p~df~~DP~a~~~~d~~~i-~r~~~T~~el~~ 203 (581) +. ....+.++++++|+|+||||||+|+++.||.|| ++.++|+++|++ T Consensus 153 ~~~~~~l~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~i~~V~p~d~~~dp~a~~~~d~~~~~~~~~~t~~dl~~ 232 (705) T protein:vir:88 153 GLSEDMVADILSDPDTSILAQSVDDDGTYTIKIRKDKKKREIKVLCVKPENFLVDRLATCIDDARFLCHREKYTVSDLRL 232 (705) T ss_pred cCChhhhhhhhhhhhhhcccccccccceeeeEEeeeeecCceeeeeccHHHceecCCCCCcccCcEEEEEEeccHHHHHh Confidence 11 111235678999999999999999999999885 677899999988 Q ss_pred HhhccCccchhHHHHHHHHhhhccCCcccchhhhhcccccccccc--cccc-ccC--CceEEEEEEeeeeecccCCceee Q lcl|NC_015158. 204 MEQDQPENASLASAIARRREFRRGLGTYTREDCEKAVGFSMDGFG--NLYD-YFQ--SPYVEVLTFYGDYHDTQSGTFKR 278 (581) Q Consensus 204 m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~-~~~--~~~vevlE~~g~~~d~~~d~~~e 278 (581) +... .+.+.+.... .........+.......+... ...+ ... ...|+++|||+.+ +..+++..+ T Consensus 233 ~g~~-------~~~~~~~~~~---~~~~~~~~~e~~~~~~~d~~~~~~~~~~~~~~~~r~v~~~E~y~~~-d~~~d~~~~ 301 (705) T protein:vir:88 233 LGVP-------EDVIEELPYD---EYEFSDSQPERLVRDNFDMTGQLQYNSGDDAEANREVWASECYTLL-DVDGDGISE 301 (705) T ss_pred hcCC-------hhHhhhhhcc---cccchhhhhhhccccccccccccccccccccCCceeEEEEEeeeEe-cccCCccee Confidence 7422 1111111110 000000111110000001100 0000 011 1248899999864 555667777 Q ss_pred eeEEEEEeCCEEEEeecCCCccCCCCeeEecccccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc- Q lcl|NC_015158. 279 NMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD- 357 (581) Q Consensus 279 ~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d- 357 (581) .+++ ++.|++||+.+ ++|++||+++++.|+|+++||+|+++++.|+|+.+|+++|+++||+++++||++++.++ T Consensus 302 ~~~~-~~~g~~il~~~----~~~~~PF~~~~~~p~~~~~~G~g~~~~~~d~Q~~~n~~~~~~~d~~~~~~~~~~~~~~g~ 376 (705) T protein:vir:88 302 LRRI-LYVGDYIISNE----PWDCRPFADLNAYRIAHKFHGMSVYDKIRDIQEIRSVLMRNIMDNIYRTNQGRSVVLDGQ 376 (705) T ss_pred eEEE-EEeCccccccc----cCCCCCEEEecceeecCccccCChHHHHhHHHHHHHHHHHHHHHHHHhccCCceeccccc Confidence 6666 45789999875 46789999999999999999999999999999999999999999999999999988654 Q ss_pred ---ccccccCCceeEEeCCCCCcccccCCCccchhHHHHHHHHHHHHHhcCCchHhcCCCC---cccccHHHHHHHHHHH Q lcl|NC_015158. 358 ---VEEFVWGPMEQIYINGDGDVEMMAPNTQALQADMQIQILEAKMEEFAGAPREAMGIRT---PGEKTAFEVQQLQNAA 431 (581) Q Consensus 358 ---~~~i~~~pG~vi~~~~~~~i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~---~~~~TAtgv~~l~~aa 431 (581) .+.++++||++|+++.+++++++++|+.+..++.+++++.+.++++|||+++++|.++ ++++||++++++++++ T Consensus 377 v~~~d~~~~~pg~vv~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGi~~~~~G~~~~~~~~~~Ta~~i~~~~~~~ 456 (705) T protein:vir:88 377 VNLEDLLTNEAAGIVRVKSMNSITPLETPQLSGEVYGMLDRLEADRGKRTGITDRTRGLDQNTLHSNQAAMSVNQLMTAA 456 (705) T ss_pred cCcccccccCCCeeEEecCCCccccccCCcCcHHHHHHHHHHHHHHHHhhCCchHHcCCCcccccchhhHHHHHHHHHHH Confidence 4557789999999999999999999999999999999999999999999999999765 3579999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHH Q lcl|NC_015158. 432 GRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQS 511 (581) Q Consensus 432 ~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~ 511 (581) +.+++.++|+|+++++++||+.++.++.++++.+.++|++|. ++.++++++.++++|++..+.+..++++..+. T Consensus 457 ~~r~~~~~r~~a~~~~~~l~~~~~~li~~~~~~~~~~ri~g~------~v~v~~~~~~~~~~v~v~v~~~~~~~eq~~a~ 530 (705) T protein:vir:88 457 EQQIDLIARMFAETGVKRLFQLLHDHAIKYQNQEEVFQLRGK------WVAVNPANWRERSDLTVTVGIGNMNKDQQMLH 530 (705) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCceEEeeccc------hhccchHhhccCCceEEeeccccchHHHHHHH Confidence 999999999999999999999999999999999999999996 68899999999999876665666666655544 Q ss_pred HHHhhc---c-cccccccchhHHHHHHHHHH---HHhcCCCcccccCCCCcHHHHHHHH------------------HHH Q lcl|NC_015158. 512 LMGIAN---T-PVWQDIKPHVSTENLAKMLE---HNLSLGGWDIFKPNVAVMEAQTTSA------------------LVN 566 (581) Q Consensus 512 L~~~~~---~-~~~~~i~p~~~~~~l~~~~~---e~~~l~~~~~~~~~~~~~~~~~~q~------------------~~q 566 (581) |..+++ . ...+.+.|.+....+.++++ +..++...+.+...+...++++.++ .++ T Consensus 531 l~~ll~~~q~l~~~~~~~~~~~~~~~~~~~~el~e~~~~k~~~~~~~~~~~~e~~~~~~~~~q~e~~~~~~~~~~q~e~~ 610 (705) T protein:vir:88 531 LMRIWEMAQAVVGGGGLGVLVSEQNLYNILKEVTENAGYKDPDRFWTNPNSPEALQAKAIREQKEAQPKPEDIKAQADAQ 610 (705) T ss_pred HHHHHHHHHHhhcccchhhhcChHHHHHHHHHHHHhhhhhhHHHHhhhhhhHHHHHHHHhhhhhhhhHHHHHHHHHHHHH Confidence 444332 1 11223344444444444443 4445544433333332222221111 011 Q ss_pred HHHHHHHHHhcccCC Q lcl|NC_015158. 567 QSQAQIEEEAQVPLV 581 (581) Q Consensus 567 ~aq~~~~~~~~~~~~ 581 (581) ++|++.+. +|..+- T Consensus 611 k~q~e~~~-~q~e~q 624 (705) T protein:vir:88 611 RAQSDALA-KQAEAQ 624 (705) T ss_pred HHHHHHHH-HHHHHH Confidence 11111100 001110 No 7 >protein:vir:93630 Length: 776 # NCBI annotation: Bcep22gp51 # Family: family:all:487 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944280;genbank:gi:38640357;genbank:GeneID:2658279 Probab=100.00 E-value=4.6e-61 Score=351.36 Aligned_cols=546 Identities=12% Similarity=0.064 Sum_probs=345.5 Q ss_pred Cccchhhhhhhc------cchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhcccccccc----ccccccccccccc Q lcl|NC_015158. 1 MTGKVLELQQML------DDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTTTTT----NSTLPWKNKTTLP 70 (581) Q Consensus 1 ~~~~~~~~~~~~------~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~~~~----~~~~~~k~~~~~p 70 (581) |+.+.-...+=. .+..-++...|...|....+.-+.|-++..+-.+|+ ....+.. ..+..++..+|.+ T Consensus 22 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~r~~a~~d~~fy--~G~Qw~~~~~~~l~~~g~p~~~~N 99 (776) T protein:vir:93 22 LSPGEDAAQREKPANPLDSEQAVELHSRLLSYYRQELSRQQDNRAEMAVDEDYY--DNIQWSQDEIDELKERGQAPTVYN 99 (776) T ss_pred CCCCCcccchhcccCCCCCHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHh--CCCCCCHHHHHHHHhcCCceEEec Confidence 544443332222 122222334444445544444445555554444454 3333322 2234667779999 Q ss_pred chHHHHHHHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecce Q lcl|NC_015158. 71 KLCQIRDNLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKET 150 (581) Q Consensus 71 ki~~~~d~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~ 150 (581) +|..+++++.++.. .|+-.++|.|...+|.+.|++++.+|++.+.+|++...+++.|+|+++.|+|++++.|.... T Consensus 100 ~i~~~i~~v~g~~~----~nr~~~~~~p~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~af~d~~~~G~G~~~v~~d~~~ 175 (776) T protein:vir:93 100 VISQSVNWIIGSEK----RGRSDFKVLPRRKDGGKAAERKTALLKYLSDVNHTPFERSMAFEETTKAGIGWLESQVQDEN 175 (776) T ss_pred chHHHHHHHHHHHH----hCCcceEEecCChhHHHHHHHHHHHHHHHHHhhcHHHHHHHHHHHhhhcCcceEEEEeeccC Confidence 99999999998887 57778999999999999999999999999999999999999999999999999999875321 Q ss_pred eeeeeeeeEeeeeccce-EEecchhheeecCCCC--CcccCceE-EEEEecHHHHHHHhhccCccchhHHHHHHHHhhhc Q lcl|NC_015158. 151 TKDEESGATRDTYFGPR-AVRIDPKDIVFNPVAV--DFAHSPKI-IRTVLNEGELLQMEQDQPENASLASAIARRREFRR 226 (581) Q Consensus 151 ~~~~~~~~~~~~~~~p~-ie~V~p~df~~DP~a~--~~~d~~~i-~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~ 226 (581) ..++. +..|+|.+|||||+|+ +++||.|| ++.++|++++.+|.... .+.+.+...... T Consensus 176 ------------~~~~~~~~~~~p~~i~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~~p~~------~~~~~~~~~~~~ 237 (776) T protein:vir:93 176 ------------DGEPIYAGAESWRNILWDSTYRRLDMDDCRYIFRVKWVDLDVMLAIFPER------AAQLRAAAVDNF 237 (776) T ss_pred ------------CCCceEeeccChhheeeccccccCCHHHHhhhhhhccCCHHHHHHhcCCc------hHHHHHhhhhcc Confidence 11233 3568999999999987 57799885 56688999999884221 111111111100 Q ss_pred cCCcccchhhhhcccc-------ccccccccccccCCceEEEEEEeeeeec------------------c---------c Q lcl|NC_015158. 227 GLGTYTREDCEKAVGF-------SMDGFGNLYDYFQSPYVEVLTFYGDYHD------------------T---------Q 272 (581) Q Consensus 227 ~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~vevlE~~g~~~d------------------~---------~ 272 (581) ..+...+....... ..+...+.+......+|+|+|||-..+. . . T Consensus 238 --~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~r~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~ 315 (776) T protein:vir:93 238 --ETWGTDDIDGDDAMDSPEYERSMNSVTAGAVAYARKRVRMIEAWFRMPVRVQRLKGRNSDFRGEVFDPNDERHVLEVE 315 (776) T ss_pred --cccchhcccccccccccccccccccccccccccCCCeEEEEEEEEeeeeehhhcccccccccceeecccchHHHHHhh Confidence 00000010000000 0011112222233357999999942110 0 0 Q ss_pred CCcee------eeeEEEEEeCCEEEEeecCCCccCCCCeeEecccccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHH Q lcl|NC_015158. 273 SGTFK------RNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDL 346 (581) Q Consensus 273 ~d~~~------e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~ 346 (581) .|... ....+.++.|+++|+...+||+++++||+.++..++++++||.|++++++|+|+.+|+.+++++|+++ T Consensus 316 ~g~~~~~~~~~~~v~~~~~~g~~~l~~~~~p~~~~~~Pfv~~~~~~~~~~~~~~G~v~~~~d~Q~~~N~~~s~~~~~l~- 394 (776) T protein:vir:93 316 SGRAVLAVSPMMRMHCAIMTTRDLMWAGPSPYRHNRYPFTPIWGFRRARDGMPYGVIRFMRGMQDDVNKRLSKALYILS- 394 (776) T ss_pred cCceeehheeeeeeEEEEEecchhhhccCCCCCCCccceEEecCceecccccccchHHhhhHHHHHHHHHHHHHHHhhc- Confidence 11110 11234457789999999999999999999999999999999999999999999999999999999874 Q ss_pred hcCCeEEEecc----cccc---ccCCceeEEeCCCC--CcccccCCCccchhHHHHHHHHHHHHHhcCCchHhcCCCCcc Q lcl|NC_015158. 347 IAFPPMKVKGD----VEEF---VWGPMEQIYINGDG--DVEMMAPNTQALQADMQIQILEAKMEEFAGAPREAMGIRTPG 417 (581) Q Consensus 347 s~np~~~v~~d----~~~i---~~~pG~vi~~~~~~--~i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~ 417 (581) +.++.+.++ .+++ .++||+||+++.++ .+...+.++.+.....+++++.+.++++|||++.++|..++ T Consensus 395 --~~~~~~~~gav~~~d~~~~~~~rp~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~tGi~~~~~G~~~n- 471 (776) T protein:vir:93 395 --TNKVLMEEGAVDDIDEFRREAARPDAVMTVKNGKLGAVKMDVDRDLAPAHLELASRSIQMIQQVGGVTDEMLGRTTN- 471 (776) T ss_pred --CCceeeccccccchHHHHHhcccCCceeeeCCccccccccccCcCccHHHHHHHHHHHHHHHHhhCcChHHhCCCcc- Confidence 445655432 3322 36899999998765 45555556667778888999999999999999999997653 Q ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceeeecCchhcccCCCccC----HHHh-cCCc Q lcl|NC_015158. 418 EKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADTIRVFDSDDKVATFMNVN----KDDI-TAKG 492 (581) Q Consensus 418 ~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~iR~~~~~~~~~~~~~v~----r~di-~~~~ 492 (581) +.++.++++++++++.++..+.++|++ +++.+++.++.++.++++.+.++||+|++ +...++.|+ ..|+ .|+| T Consensus 472 ~~Sg~ai~~~~~~~~~~~~~~~dn~~~-~~~~~~~~~l~li~~~~~~~r~~ri~~~~-~~~~~v~in~~~~~nd~~~~~~ 549 (776) T protein:vir:93 472 AVSGVAIQARQEQGSVATNKLFDNLRL-AFQQHGEKELSLIEQYMTEEKQFRITNSR-GNPEYVTVNDGLPENDITRTKA 549 (776) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCcceEEEEeecC-CCcceEEecccchhhhhcccee Confidence 466777999999999999999999997 67889999999999999999999999986 445566664 3344 4677 Q ss_pred eEE-EecchhHHHHHHHHHHHHHhhccc---cccc-------ccchhHHHHHHHHHHHHhcCCCcccccCCCCcHHHHHH Q lcl|NC_015158. 493 RLR-PVGARHFAEQAQVVQSLMGIANTP---VWQD-------IKPHVSTENLAKMLEHNLSLGGWDIFKPNVAVMEAQTT 561 (581) Q Consensus 493 ~vv-a~ga~~~~~r~q~~q~L~~~~~~~---~~~~-------i~p~~~~~~l~~~~~e~~~l~~~~~~~~~~~~~~~~~~ 561 (581) +|+ ..|...-..|++..+.|.++++.. +... ..+.-+..++.+.+.+..+.+..+ ...... +++.+ T Consensus 550 dv~v~~~~~~~s~r~~~~~~l~ql~~~~~p~~~~~~~~~~~e~~d~p~~~e~~~~l~~~~~~~~p~--q~~~~~-e~~~~ 626 (776) T protein:vir:93 550 DFIIDEAEWRATMRQAAVAELMEVIGKMPPEIALTMLDLLVENMDIPNRDELVKRIRAVNGQKDPD--QDEPTP-EEIAR 626 (776) T ss_pred eEEEeecccchhHHHHHHHHHHHHHhhcChhhHHHHHHHHHHhcCccchHHHHHHHHHhhcccccc--hhhcch-hHHHH Confidence 764 444444455777666666665321 0000 011111223334443333332211 111111 11111 Q ss_pred HHHHHHHHHHHHHHhcccC-----------------------C Q lcl|NC_015158. 562 SALVNQSQAQIEEEAQVPL-----------------------V 581 (581) Q Consensus 562 q~~~q~aq~~~~~~~~~~~-----------------------~ 581 (581) +.++++.++..++.++..+ . T Consensus 627 qq~q~~~~q~q~~~~~a~~~~~qa~a~~~~aea~~~~aqa~~~ 669 (776) T protein:vir:93 627 EQAQQQQQQYNDALAIATLEEQQAKARKAAAEAQVAEAKAKHI 669 (776) T ss_pred HHHhhHHHHHHHHHhhhhhhHhhHHHHHHHHHHHHHhhhhhhh Confidence 1111000000000000000 0 No 8 >protein:vir:108295 Length: 711 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552284;genbank:gi:160700609;genbank:GeneID:5758811 Probab=100.00 E-value=1e-53 Score=311.15 Aligned_cols=533 Identities=12% Similarity=0.076 Sum_probs=339.2 Q ss_pred Cccchh---------hhhhhccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHH-HHHhhccccccccc----cccccccc Q lcl|NC_015158. 1 MTGKVL---------ELQQMLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSEL-RNYIFATDTTTTTN----STLPWKNK 66 (581) Q Consensus 1 ~~~~~~---------~~~~~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~-~~y~~~~~~~~~~~----~~~~~k~~ 66 (581) |+.|-. -..+-+-...++.-..+..+.+.|+....-+.+-+.+. .++-|+....++.. .+..+... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~ 80 (711) T protein:vir:10 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRPC 80 (711) T ss_pred CCcccccccccchhHHHHHhcccCcchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHhCCCCCCHHHHHHHHhcCCCc Confidence 544321 11111111123333445555555554333333333333 24444444333322 34455556 Q ss_pred ccccchHHHHHHHHHHHHHhhcCCccEEEeecCC----------------------hhHHHHHHHHHHHHHHHHHhcchH Q lcl|NC_015158. 67 TTLPKLCQIRDNLHSNYISALFPNERWLKWEGKS----------------------LQDEAKRDAIQQYMDNKVKESDFR 124 (581) Q Consensus 67 ~~~pki~~~~d~~~~~l~~~~f~~~~~~~~~~~~----------------------~~d~~~ae~~~~~i~~~l~e~n~~ 124 (581) +|.++|...++.+.+... .|+--++|.|+. .+|.+.|++++.++++...+|++. T Consensus 81 ~~~N~i~~~v~~v~g~~~----~nr~~~~v~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~ 156 (711) T protein:vir:10 81 LVNNVLPTFVDQVLGDQR----QNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAE 156 (711) T ss_pred EEEcchHHHHHHHhhhHh----hCCcceEEecccccchhhhhhhhccccccccCCChhHHHHHHHHHHHHHHHHHhcChh Confidence 889999999999999888 688888999974 677889999999999999999999 Q ss_pred HHHHHHHHHHhhcCceEEEEeeecceeeeeeeeeEeeeeccceEEec-chhheeecCCCC--CcccCceE-EEEEecHHH Q lcl|NC_015158. 125 TIMSQLLLDYIDYGNCFATVEYVKETTKDEESGATRDTYFGPRAVRI-DPKDIVFNPVAV--DFAHSPKI-IRTVLNEGE 200 (581) Q Consensus 125 ~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~~~~~~~~~p~ie~V-~p~df~~DP~a~--~~~d~~~i-~r~~~T~~e 200 (581) .++++.|.|.++.|.|++.+-+.... ......++++.+| +|.+|||||.++ ++.||+|| ++.++|+++ T Consensus 157 ~~~s~af~d~~~~G~G~~ev~~d~~~--------~d~~~~e~~i~~v~~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~ 228 (711) T protein:vir:10 157 TEYDIAFQGAVESGMGYLRVRSDYLA--------DDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEK 228 (711) T ss_pred HHHHHHHHHhhhcCcceEEEEecccC--------CCCCCCCeEEeeecChhheeeCccccccChhhhcceeeeecCCHHH Confidence 99999999999999999988653211 0111234578888 799999999986 66899885 566789999 Q ss_pred HHHHhhccCccchhHHHHHHHHhhhccCCcccchhhhhccccccccccccccccCCceEEEEEEeee------------- Q lcl|NC_015158. 201 LLQMEQDQPENASLASAIARRREFRRGLGTYTREDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGD------------- 267 (581) Q Consensus 201 l~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~------------- 267 (581) +..+. +. .. .+++ .. .+.. . + ..++..+.|++.|||-. T Consensus 229 ~~~~y--p~-~a--~~~~----~~----~~~~---~----------~---~~~~~~~~vrv~E~~~r~~~~~~~~~~~~~ 279 (711) T protein:vir:10 229 FKALY--PD-AT--AEPV----YE----DSVA---D----------Y---DTWFTEKSVRVSEYFTREPVIREIALLSDG 279 (711) T ss_pred HHHhC--Cc-hh--hhhh----hc----cccc---c----------c---CcccCcceeeEEEEEeeeeeeeEEEeecCC Confidence 99883 21 11 0000 00 0000 0 0 01123355666677621 Q ss_pred ---eecc---------cCCcee-------e-eeEEEEEeCCEEEEeecCCCccCCCCeeEe-cc-cccCCcccCCCcHHh Q lcl|NC_015158. 268 ---YHDT---------QSGTFK-------R-NMKVTIIDRMFVIEEKENPSWFAQAPIFHC-GW-RIRQDNLYAMGPLDN 325 (581) Q Consensus 268 ---~~d~---------~~d~~~-------e-~~~itv~~g~~iir~~~nP~~~g~~Pf~~~-~~-~~~p~~~~G~s~~~~ 325 (581) .++. .+|... . ...+.++.|.++| ...+||+++.+||+.+ ++ .+++++.++.|+++. T Consensus 280 ~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~G~~~L-~~~~p~~~~~~P~vp~~g~r~~~d~~~~~~G~vr~ 358 (711) T protein:vir:10 280 RSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVL-EGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRH 358 (711) T ss_pred ceeccCcchhHHHHHHhcCchhhhhhhhceeeEEEEEEecceee-cCCCCCCCCcccEEEEeeeeeccccccccchhhhh Confidence 0000 111110 0 1112345788888 6678999999999854 33 366788889999999 Q ss_pred hhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc----cccc----ccCCceeEEeCCCC----CcccccCCCccchhHHHH Q lcl|NC_015158. 326 LVGMQYRIDHLENLKADVFDLIAFPPMKVKGD----VEEF----VWGPMEQIYINGDG----DVEMMAPNTQALQADMQI 393 (581) Q Consensus 326 l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d----~~~i----~~~pG~vi~~~~~~----~i~~~~~p~~~~~~~~~l 393 (581) ++|+|+.+|+++++++|++++++++++.+.++ .++. ..+||++++++.++ .+++.++|+.+.....++ T Consensus 359 ~~d~Qr~~N~~~s~~~~~l~~~~~~~~~~~~gai~~~~~~~~e~~~~~~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~ll 438 (711) T protein:vir:10 359 SKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGREDEWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLG 438 (711) T ss_pred hhhhHHHHHHHHHHHHHHHHhcCCCceeecCcccCChHHHHHhccccCCCeeEecccccCcCCccccCCCCCCHHHHHHH Confidence 99999999999999999999999999987543 2222 25799999999664 478888898999999999 Q ss_pred HHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceeeecCc Q lcl|NC_015158. 394 QILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADTIRVFDS 473 (581) Q Consensus 394 q~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~iR~~~~ 473 (581) ++..+.++++|||++.++|..++ +.++.++++++++++.++..+.++|+. .++.+++.++.++.++++.+.++||+|+ T Consensus 439 ~~~~~~i~~~tGi~~~~~G~~~n-~~Sg~ai~~~q~qg~~~l~~~~dn~~~-~~~~~g~~ll~li~~~~~~er~~rI~ge 516 (711) T protein:vir:10 439 QNSVEKIKSTMGMYDASLGAMGN-ETSGRAIIARQRQGDRGSFAFIDNLTK-SIRRVGKILVEMIPHIYDTERVVRLKFP 516 (711) T ss_pred HHHHHHHHHHhCCChHHcCCCcc-chHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHcCCCeEEEEecC Confidence 99999999999999999999765 467888999999999999999999997 6788999999999999999999999998 Q ss_pred hhcccCCCccCHH-------------Hh-cCCceEEEecchhH-HHHHHHHHHHHHhhccc-ccc-ccc-------chhH Q lcl|NC_015158. 474 DDKVATFMNVNKD-------------DI-TAKGRLRPVGARHF-AEQAQVVQSLMGIANTP-VWQ-DIK-------PHVS 529 (581) Q Consensus 474 ~~~~~~~~~v~r~-------------di-~~~~~vva~ga~~~-~~r~q~~q~L~~~~~~~-~~~-~i~-------p~~~ 529 (581) + +...++.+++. || .|+++|++.-+.+. ..|.+.++.|.++++.. ..+ .+. +.-+ T Consensus 517 d-~~~~~v~ln~~~~~~~~G~~~~~nDi~~g~~Dv~i~~~p~~~s~r~~~~~~l~ql~~~~p~~~~~~~~~il~~~d~p~ 595 (711) T protein:vir:10 517 D-ETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDWPG 595 (711) T ss_pred C-CCcceEEecccccccccccceeeeccceeeeEEEEeeccCchhHHHHHHHHHHHHHhhcchhhhHHHHHHHHhcCCCC Confidence 7 44556777654 33 46677755543333 34555566666665432 100 000 1111 Q ss_pred HHHHHHHHHHHhcCCCcccccCCCCcHHHHHHHHHHHH-HHHHHH-HHhcccCC Q lcl|NC_015158. 530 TENLAKMLEHNLSLGGWDIFKPNVAVMEAQTTSALVNQ-SQAQIE-EEAQVPLV 581 (581) Q Consensus 530 ~~~l~~~~~e~~~l~~~~~~~~~~~~~~~~~~q~~~q~-aq~~~~-~~~~~~~~ 581 (581) ..++.+.+....+.++. .+.....++++++.++|+ ++++.+ +++|+-+. T Consensus 596 ~~el~e~lr~~~~~~~~---~~~~~~~~qq~~~e~qq~~~~~q~~~~~~q~~~~ 646 (711) T protein:vir:10 596 ADVIAERLKKIVPPNVL---SKDEREAIEEDMPEQTEPTPEQQVEMAKSQADMA 646 (711) T ss_pred HHHHHHHHHhhcCcccC---cchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22333444333332221 011111111111110100 000000 01111111 No 9 >protein:vir:7321 Length: 556 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848212;genbank:gi:30387383;genbank:GeneID:2641872 Probab=100.00 E-value=6.6e-50 Score=290.22 Aligned_cols=498 Identities=12% Similarity=0.111 Sum_probs=329.4 Q ss_pred hccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhccccccccccccc---ccccccccchHHHHHHHHHHHHHhh Q lcl|NC_015158. 11 MLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTTTTTNSTLP---WKNKTTLPKLCQIRDNLHSNYISAL 87 (581) Q Consensus 11 ~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~~~~~~~~~---~k~~~~~pki~~~~d~~~~~l~~~~ 87 (581) |. +..++.|+++|+.++..|++|+..|.+|.+|..|+.....+.+... ...+++-+.....+++|.+.||+.+ T Consensus 1 m~----~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~Las~l~~~l 76 (556) T protein:vir:73 1 MA----ETEKERLLKQLAQLKNERTSFESHWLDLSDFINPRGSRFLTSDVNRDDRRNTKIVDPTGSMAQRILSSGMMSGI 76 (556) T ss_pred CC----hhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCcCCCCCCcchhhcCccccchHHHHHHHHHHHHHHhh Confidence 55 4568899999999999999999999999999988765433322222 2345677888899999999999999 Q ss_pred cC-CccEEEeecCChhHHHH------HHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeeeeEe Q lcl|NC_015158. 88 FP-NERWLKWEGKSLQDEAK------RDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESGATR 160 (581) Q Consensus 88 f~-~~~~~~~~~~~~~d~~~------ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~~~~ 160 (581) || +..||++.+...+..+. -+.+++.|...|.+|||+..+++++.|++.+|||++-+....+. T Consensus 77 tpp~~~WF~l~~~d~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~---------- 146 (556) T protein:vir:73 77 TSPARPWFKLATPDPDMMDYGPVKIWLEVVQRRMNEVFNKSNLYQSLPVMYASLGTFGTGAMAVMEDDQD---------- 146 (556) T ss_pred cCCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeeeeecCCc---------- Confidence 99 89999998876544332 24578889999999999999999999999999999866542110 Q ss_pred eeeccceEEecchhheeecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchhhhhcc Q lcl|NC_015158. 161 DTYFGPRAVRIDPKDIVFNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDCEKAV 240 (581) Q Consensus 161 ~~~~~p~ie~V~p~df~~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~ 240 (581) ..++..+++.+||++.++..--|+ ++.+..+|..++.+.....+ ..+++.+.+.. ++++ T Consensus 147 ----~~r~~~~~l~~~~~~~d~~G~vd~-i~r~~~~t~~ql~~~fg~~~----l~~~v~~~~~~----~~~~-------- 205 (556) T protein:vir:73 147 ----VIRTMPFPIGSYYLANSPRGSVDT-CIRQFSMTVRQMVQEFGLDN----VSTSVKGMWEN----GTYE-------- 205 (556) T ss_pred ----eEEEEEeecceeEEeeCCCCCeEE-EEEEEeccHHHHHHHcCccc----CCHHHHHHHhc----CCcc-------- Confidence 125678999999999997654343 34455778888765432111 11222222211 0000 Q ss_pred ccccccccccccccCCceEEEEEE-eeeeecc------cCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEeccccc Q lcl|NC_015158. 241 GFSMDGFGNLYDYFQSPYVEVLTF-YGDYHDT------QSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIR 313 (581) Q Consensus 241 ~~~~~~~~~~~~~~~~~~vevlE~-~g~~~d~------~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~ 313 (581) .+++++.+ +-+.... .+-++..+++..-.+++++++.. .| ...||.+..|..+ T Consensus 206 ----------------~~~~v~~~V~pr~~~~~~~~~~~~~p~~s~~~~~~~~~~~vl~es--g~--~e~P~~~~Rw~~~ 265 (556) T protein:vir:73 206 ----------------TWVEVNHCITPNVNRDSGKMDSKNKPYRSVYFESGGDSDKLLRES--GF--DEFPILAPRWEVN 265 (556) T ss_pred ----------------ceEEEEEEEeccccccccccCcccceEEEEEEEecCCCceecccC--Cc--ccCCceeeeeeec Confidence 11222211 1000000 00112222222223456676654 33 5689999999999 Q ss_pred CCcccCCC-cHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEeccc--cccccCCceeEEeCCCC---CcccccCCC-cc Q lcl|NC_015158. 314 QDNLYAMG-PLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGDV--EEFVWGPMEQIYINGDG---DVEMMAPNT-QA 386 (581) Q Consensus 314 p~~~~G~s-~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d~--~~i~~~pG~vi~~~~~~---~i~~~~~p~-~~ 386 (581) ++..||.| +++..++.++.+|.+.++++++..++++|.+.+..+- ..++..||+++....++ +++|+...+ .. T Consensus 266 ~ge~YGrg~P~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~~~~~~~pgg~~~~~~~~~~~~i~p~~~~~~d~ 345 (556) T protein:vir:73 266 GEDVYASSCPGMLALGQVKALQVEQKRKAQLIDKATNPPMVAPTSLKNQRVSLLPGDVTYLDVISGQDGFKPAYLVNPNT 345 (556) T ss_pred CCcccccCccHHHhHHHHHHHHHHHHHHHHHHHHHhcCceeccccccccceeeccCccccccCCCCccceeeeccccccH Confidence 99999999 7999999999999999999999999999999987663 45778899987666444 355554332 12 Q ss_pred chhHHHHHHHHHHHHHhcCCchH-hcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcc Q lcl|NC_015158. 387 LQADMQIQILEAKMEEFAGAPRE-AMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVA 465 (581) Q Consensus 387 ~~~~~~lq~~~~~~ee~TGv~~~-~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~ 465 (581) ..+...++.+.+.+.+..-+.-+ +.+..+....||++|.+..+.....+.-+..++..+++.|||...+.++.+....| T Consensus 346 ~~~~~~i~~~~~rI~~af~~d~~~~l~~~~~~r~TAtEv~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP 425 (556) T protein:vir:73 346 ADLLADIQDTRQTINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDEALNPLIDRVFSIMARKNMLP 425 (556) T ss_pred HHHHHHHHHHHHHHHHHhhcchhhhhccCCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCC Confidence 33444456666655544322222 23444555789999999999999999999999999999999998888876643222 Q ss_pred ceeeecCchhcccCCCccCHHHhcC-CceEEEecchhHHHHHHHHHHHHHhhcc-----cccccccchhHHHHHHHHHHH Q lcl|NC_015158. 466 DTIRVFDSDDKVATFMNVNKDDITA-KGRLRPVGARHFAEQAQVVQSLMGIANT-----PVWQDIKPHVSTENLAKMLEH 539 (581) Q Consensus 466 ~~iR~~~~~~~~~~~~~v~r~di~~-~~~vva~ga~~~~~r~q~~q~L~~~~~~-----~~~~~i~p~~~~~~l~~~~~e 539 (581) + .++++.+ .++|...+.-+-++|...++.+.++++. ++++.++.+++..++++.+++ T Consensus 426 ~-----------------~P~~l~~~~i~v~yis~La~aqk~~~~~~i~~~~~~~~~laq~~Pe~~d~id~d~~~~~~a~ 488 (556) T protein:vir:73 426 E-----------------PPDVLQGMPLRIEYISVMAQAQKSIGLTSLSQTVGFIGQLAQFKPEALDKLDVDQAIDAFSE 488 (556) T ss_pred C-----------------CchhhcCceeEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHH Confidence 2 2344433 3455555444444444333333332222 124455667899999999999 Q ss_pred HhcCCCcccccCCCCcHHHHHHHHHHHHHHHHHHHHhccc----CC Q lcl|NC_015158. 540 NLSLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQVP----LV 581 (581) Q Consensus 540 ~~~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~----~~ 581 (581) .+|++. .+++++.++.+..++++.+|++++..+..+|.- .+ T Consensus 489 ~~Gvp~-~~irs~eev~~~rq~r~~~qq~~~~~~~~~~a~~~~~~~ 533 (556) T protein:vir:73 489 MSGVSP-TVIVPQEQVQGIREERAKQAQAAQAMAMGQAAAQGAKTL 533 (556) T ss_pred HcCCCh-hhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 999997 788887777554444443443333333322221 01 No 10 >protein:vir:102668 Length: 547 # NCBI annotation: Hypothetical protein # Family: family:all:481 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024419;genbank:gi:48696640;genbank:GeneID:2948135 Probab=100.00 E-value=1.8e-49 Score=287.79 Aligned_cols=496 Identities=12% Similarity=0.123 Sum_probs=334.7 Q ss_pred ccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhccccccccc-----ccc-cccccccccchHHHHHHHHHHHHH Q lcl|NC_015158. 12 LDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTTTTTN-----STL-PWKNKTTLPKLCQIRDNLHSNYIS 85 (581) Q Consensus 12 ~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~~~~~-----~~~-~~k~~~~~pki~~~~d~~~~~l~~ 85 (581) ++ ++.|.++|+.+++.|++|+..|.+|.+|.+|+..+-.+. ... .+.++++-+.....+++|.+.||+ T Consensus 1 ~~------~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~~i~dst~~~a~~~Las~L~~ 74 (547) T protein:vir:10 1 ME------NSKIVKRLDFLKTDRKNVEQIWDCIRKYIMPMRSDFFSDLRSEGSINWNQNREVFDSTAGDGLETLSSSLHG 74 (547) T ss_pred CC------HHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccccccCCCCCcccccccccccccchHHHHHHHHHHHHHH Confidence 32 678999999999999999999999999999986442222 222 345567888889999999999999 Q ss_pred hhcC-CccEEEeecCChhHHH------HHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeeee Q lcl|NC_015158. 86 ALFP-NERWLKWEGKSLQDEA------KRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESGA 158 (581) Q Consensus 86 ~~f~-~~~~~~~~~~~~~d~~------~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~~ 158 (581) .+|| +..||++++.+.+..+ --+..++.|...|.+|||+..+++.+.|++.+|||++.++.... T Consensus 75 ~ltPp~~~WF~l~~~d~~~~~~~~v~~~L~~ve~~i~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~d~~--------- 145 (547) T protein:vir:10 75 SLTSPATKWFELAFRDKELNSDDECRKWLENATHDVYSALQDSNFNLEANETYIDLCGYGNAIMVEEEDED--------- 145 (547) T ss_pred hhcCCCCcccccccCCccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEeccCCC--------- Confidence 9999 8999999876654322 22556788889999999999999999999999999998874211 Q ss_pred EeeeeccceEEecchhheeecCCCCC-cccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchhhh Q lcl|NC_015158. 159 TRDTYFGPRAVRIDPKDIVFNPVAVD-FAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDCE 237 (581) Q Consensus 159 ~~~~~~~p~ie~V~p~df~~DP~a~~-~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~ 237 (581) ...+.+++.+++.+|++..++.. +.. ++.+..+|..+|.+...... ..+++.+..... .+.+ T Consensus 146 ---~~~~~r~~~~pl~~~~v~~d~~G~v~~--i~r~~~~t~~qi~~~fg~~~----l~~~v~~~~~~~--~~~~------ 208 (547) T protein:vir:10 146 ---EEGSVVFQSSPIQDSYFEEDSRGQVVN--FYRVFRWTPAQIYDRFGDEG----TPEAIIKKAKEA--SNQA------ 208 (547) T ss_pred ---CCCceeEEEeecceEEEeeCCCcCeee--eeeeeeccHHHHHHhcCccc----CCHHHHHHHhcC--CCcc------ Confidence 11235688999999999998764 333 34556779988877643221 112222222110 0000 Q ss_pred hccccccccccccccccCCceEEEEEE-eeeee-----------cccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCe Q lcl|NC_015158. 238 KAVGFSMDGFGNLYDYFQSPYVEVLTF-YGDYH-----------DTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPI 305 (581) Q Consensus 238 ~~~~~~~~~~~~~~~~~~~~~vevlE~-~g~~~-----------d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf 305 (581) ...++++.+ |.+.. +..+.++...++- +-+++++++... | ...|| T Consensus 209 ------------------~~~~~v~~~v~~~~~~~~~~~~~~~~~~~~~p~~s~~~e-~~~~~~~l~esg--~--~e~P~ 265 (547) T protein:vir:10 209 ------------------ALKQEVVMCVFTRYDKKQNRNAGTVLAPTERPFGKKWIL-KEGAVQLGEEGG--Y--YEMPA 265 (547) T ss_pred ------------------cceEEEEEEEeeccCCCCCccccceeeccccceeEEEEE-ecCceeeeecCC--c--ccCCe Confidence 011222222 11100 0011111111111 112356665553 3 46899 Q ss_pred eEecccccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc--ccccccCCceeEEeCCCCCcccccCC Q lcl|NC_015158. 306 FHCGWRIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD--VEEFVWGPMEQIYINGDGDVEMMAPN 383 (581) Q Consensus 306 ~~~~~~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d--~~~i~~~pG~vi~~~~~~~i~~~~~p 383 (581) .+..|...++..||.|+++..++.++.+|.+.+++++++.++++|++.+..+ ...++..||+++...+..+++|+..+ T Consensus 266 ~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~~v~~~g~~~~~~~~pgg~~~~~~~~~v~pl~~~ 345 (547) T protein:vir:10 266 YAIRWRKSAGSQWGFGPSHLALPDVLTANRYVELVLRSSEKVIDPAIMVTERGLISDIDLGASGLTVVRDMESMKPFESR 345 (547) T ss_pred eeeeeeecCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceecccccccccceecCCeeeecCCcccceeeecc Confidence 9999999999999999999999999999999999999999999999988644 44567789999999999999999888 Q ss_pred CccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Q lcl|NC_015158. 384 TQALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLD 463 (581) Q Consensus 384 ~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d 463 (581) .....+...++.+.+.+.+.-=+..+. ..+....|||+|.+..+.....+.-+..++..+++.|||...+.++.+..- T Consensus 346 ~~~~~~~~~i~~~~~rI~~af~~d~~~--~~~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~ 423 (547) T protein:vir:10 346 ARFDVSSIQLTDLRSAVRRIYYVDQLQ--MKDSPAMTATEVQVRYELMQRLLGPTLGRLENDFLSPMIQRTFNIRFRAGK 423 (547) T ss_pred cchHHHHHHHHHHHHHHHHHhhhhhhh--cCCCccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCC Confidence 766667777888888776643122222 234457999999999999999999999999999999999988887665322 Q ss_pred ccceeeecCchhcccCCCccCHHHhcC---CceEEEecchhHHHHHHHHHHHHHhhccc-----ccccccchhHHHHHHH Q lcl|NC_015158. 464 VADTIRVFDSDDKVATFMNVNKDDITA---KGRLRPVGARHFAEQAQVVQSLMGIANTP-----VWQDIKPHVSTENLAK 535 (581) Q Consensus 464 ~~~~iR~~~~~~~~~~~~~v~r~di~~---~~~vva~ga~~~~~r~q~~q~L~~~~~~~-----~~~~i~p~~~~~~l~~ 535 (581) +..++.+-+.+ +++|....+-+-+++...++.+.++++.. +++.++.+++..++++ T Consensus 424 ----------------lP~~p~~l~~~~~~~~~v~~is~Laraq~~~~~~~i~~~~~~v~~laq~~P~vld~id~d~~~~ 487 (547) T protein:vir:10 424 ----------------LGELPSKLLESGKAAMDIVYTGPLSRAQKIDQAASIERWAGSTAQLAEINPEVLDIPDWDEMVR 487 (547) T ss_pred ----------------CCCCchhhhccCcceEEEEeccHHHHHHHHHHHHHHHHHHHHHHHhhccChhhhhcCCHHHHHH Confidence 22222332333 34555555544455544444444443321 2345556789999999 Q ss_pred HHHHHhcCCCcccccCCCCcHHHHHHHHHHHHHH-HHHHHHhcccCC Q lcl|NC_015158. 536 MLEHNLSLGGWDIFKPNVAVMEAQTTSALVNQSQ-AQIEEEAQVPLV 581 (581) Q Consensus 536 ~~~e~~~l~~~~~~~~~~~~~~~~~~q~~~q~aq-~~~~~~~~~~~~ 581 (581) .+++.+|++. .++++..++.+-.+++.++|+.+ +...+++.+-.. T Consensus 488 ~~a~~~Gvp~-~~irs~eev~~~r~qr~~~~q~~~qaa~~~~~g~~m 533 (547) T protein:vir:10 488 MLGSLLGAPQ-TLMRPKAKVTSIRKNRSQTQQKAEQAAIAEAEGNAM 533 (547) T ss_pred HHHHHhCCCh-hccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 9999999996 67787666543222222222121 112223333222 No 11 >protein:vir:104437 Length: 714 # NCBI annotation: putative phage portal protein # Family: family:all:487 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794061;genbank:gi:116222006;genbank:GeneID:4397502 Probab=100.00 E-value=3e-49 Score=286.63 Aligned_cols=542 Identities=13% Similarity=0.079 Sum_probs=328.8 Q ss_pred CccchhhhhhhccchhhhHHHHHHHHHhhHHhhhhhHHHHHH-H-HHHHhhccccccc----ccccccccccccccchHH Q lcl|NC_015158. 1 MTGKVLELQQMLDDTRDGLAEQIANTWQNWNSQRQEWLSQKS-E-LRNYIFATDTTTT----TNSTLPWKNKTTLPKLCQ 74 (581) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~-~-~~~y~~~~~~~~~----~~~~~~~k~~~~~pki~~ 74 (581) |+..+.+-.--.++ .+-+.....++..|...+. +...|. + ..++=|+....+. ...+..+.-.+|.++|.. T Consensus 1 ~~~~~~~~~~~~~~--~~~~~~~~~~l~~~~~~~~-~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i~~ 77 (714) T protein:vir:10 1 MKNEINTTAMKNDH--GSTPRFSQRQLLSLCSDID-SQPLWRDAANKACAYYDGDQLAPEVIQVLKDRGQPMTIHNLIAP 77 (714) T ss_pred CCcCcCcccCCCcc--hhhhhhhHHHHHHHHHHHh-hhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEeccHHH Confidence 66655544333332 2222333344555554433 334553 2 2244344333332 233345566688999999 Q ss_pred HHHHHHHHHHHhhcCCccEEEeecCChhHH--HHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceee Q lcl|NC_015158. 75 IRDNLHSNYISALFPNERWLKWEGKSLQDE--AKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTK 152 (581) Q Consensus 75 ~~d~~~~~l~~~~f~~~~~~~~~~~~~~d~--~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~ 152 (581) .++.+.+... .|+--++|.|++.+++ +.|+++..++.+-...|++..+.++.|.+.++.|.|++.+.+... T Consensus 78 ~v~~v~g~~~----~nr~~~~v~pr~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~d~d--- 150 (714) T protein:vir:10 78 TVDGVLGMEA----KTRTDLIVMSDDPNDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSE--- 150 (714) T ss_pred HHHHHHHHHH----hCCcceEEecCCCChhhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcccceEEeeeccC--- Confidence 9999999988 5777789999987664 579999999999999999999999999999999999887765421 Q ss_pred eeeeeeEeeeeccceEEecchhheeecCCCC--CcccCceE-EEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCC Q lcl|NC_015158. 153 DEESGATRDTYFGPRAVRIDPKDIVFNPVAV--DFAHSPKI-IRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLG 229 (581) Q Consensus 153 ~~~~~~~~~~~~~p~ie~V~p~df~~DP~a~--~~~d~~~i-~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~ 229 (581) .....+++++|+|.+|||||.|+ +++||.|| ++.++|++++++|.....+ ....+ .....+.. T Consensus 151 --------~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fp~~a~--~i~~~----~~~~~~~~ 216 (714) T protein:vir:10 151 --------PFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQ--VIDYA----IDDWRGFV 216 (714) T ss_pred --------CCCCCeEEEecChhheeeccccccCChhhhhhhhhhccCCHHHHHHhcCCchh--hhhcc----chhhcCcc Confidence 11224579999999999999876 56899885 5668899999998432211 11110 00000000 Q ss_pred cccch---------hhhhccccccccccccccccC--CceEEEEEEeee--------------e--ecccC--------- Q lcl|NC_015158. 230 TYTRE---------DCEKAVGFSMDGFGNLYDYFQ--SPYVEVLTFYGD--------------Y--HDTQS--------- 273 (581) Q Consensus 230 ~~~~~---------~~~~~~~~~~~~~~~~~~~~~--~~~vevlE~~g~--------------~--~d~~~--------- 273 (581) ..... ..+..++.+. .-..++. ...|+++|||-. + |+..+ T Consensus 217 ~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~rV~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~ 292 (714) T protein:vir:10 217 DTTVTEGQPSPLMSAWEEYQSWDR----QQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVAS 292 (714) T ss_pred cchhhhhhcccccccchhhccccc----ccccccccCcceEEEEEEEEeEEEEEEeecCCCCCeeeeCccCHHHHHHHHh Confidence 00000 0000111110 0011122 236888899822 1 11111 Q ss_pred Cc-------eeeeeEEEEEeCCEEEEeecCCCccCCCCee-Eecccc-cCCcccCCCcHHhhhhHHHHHHHHHHHHHHHH Q lcl|NC_015158. 274 GT-------FKRNMKVTIIDRMFVIEEKENPSWFAQAPIF-HCGWRI-RQDNLYAMGPLDNLVGMQYRIDHLENLKADVF 344 (581) Q Consensus 274 d~-------~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~-~~~~~~-~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~ 344 (581) |. .. -..++++.|.++|....+||+++.+||+ ++++.. +.+. ..|+.+.++|+|+.+|...+..+.. T Consensus 293 g~~~~~~~~~~-rv~~~~~~g~~~L~~~~~p~p~~~fp~vP~~g~~~~~~g~--~~G~vr~~~d~Qr~~N~~~s~~~~~- 368 (714) T protein:vir:10 293 GRVQVKVGRVS-RIREAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGE--PYGLISRAIPAQDEVNFRRIKLTWL- 368 (714) T ss_pred ccceeccccee-eEEEEEEecchhhhcCCCCCCCCceeeEEecceeeeccCc--cceehhhhhhHHHHHHHHHHHHHHH- Confidence 11 11 1234678899999999999999999986 333322 1233 3478999999999999887777664 Q ss_pred HHhcCCeEEEeccccc----c---ccCCceeEEeCCC--------CCcccccCCCccchhHHHHHHHHHHHHHhcCCchH Q lcl|NC_015158. 345 DLIAFPPMKVKGDVEE----F---VWGPMEQIYINGD--------GDVEMMAPNTQALQADMQIQILEAKMEEFAGAPRE 409 (581) Q Consensus 345 ~~s~np~~~v~~d~~~----i---~~~pG~vi~~~~~--------~~i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~ 409 (581) ++++..+...+...+ + -++||+++.++.. +.+++.++++.+.....++++..+.++++|||++. T Consensus 369 -l~~~~~~~~~gav~~~d~~~~e~~~rp~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~ 447 (714) T protein:vir:10 369 -LQAKRVIMDEDATQLSDNDLMEQLERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSA 447 (714) T ss_pred -HhCCceeeccccccccHHHHHHhccCCCCeEEecccccccCCccccccccCCCCCcHHHHHHHHHHHHHHHHhhCCCHH Confidence 456654433332211 2 2579999998642 23667777777888889999999999999999999 Q ss_pred hcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceeeecCchhcc--cCCCccCHH- Q lcl|NC_015158. 410 AMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADTIRVFDSDDKV--ATFMNVNKD- 486 (581) Q Consensus 410 ~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~iR~~~~~~~~--~~~~~v~r~- 486 (581) ++|..+++ .+...+++++++++..+.++.++|.. .++.+.+.++.++.++++.+.++||++++... ..++.++++ T Consensus 448 ~lG~~~na-~SGvAI~~r~~qg~~~l~~~~dnl~~-~~~~~g~~ll~li~~~~~~~rv~RI~~e~~~~~~~~~~~~n~~~ 525 (714) T protein:vir:10 448 FLGQDSGA-TSGVAISNLVEQGATTLAEINDNYQF-ACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEG 525 (714) T ss_pred HcCCCcch-hHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcccceeEeecccc Confidence 99986543 45555889999999999999999996 57888888888888889999999999875432 334444433 Q ss_pred -------Hh-cCCceEEE-ecchhHHHHHHHHHHHHHhhcc--cccccccch--------hHHHHHHHHHHHHhcCCCcc Q lcl|NC_015158. 487 -------DI-TAKGRLRP-VGARHFAEQAQVVQSLMGIANT--PVWQDIKPH--------VSTENLAKMLEHNLSLGGWD 547 (581) Q Consensus 487 -------di-~~~~~vva-~ga~~~~~r~q~~q~L~~~~~~--~~~~~i~p~--------~~~~~l~~~~~e~~~l~~~~ 547 (581) || .+.++|+. .|......|.+.++.|.++++. |..+.+.+. -+..++++.+.++++.+.. T Consensus 526 ~~~~~~nDi~~~~~dv~i~~~p~~~s~r~~~~~~l~ql~~~~~p~~~~~~~~~~le~~d~p~~~ei~~~ir~~~~~~~~- 604 (714) T protein:vir:10 526 DNGELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKS- 604 (714) T ss_pred CCccccccceeeeEEEEEeeccCcHHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCcCHHHHHHHHHHHcCCCCC- Confidence 33 35566644 3443444577777777666542 111111111 1123455555555555442 Q ss_pred cccCCCCcHHHHHHHHHHH---HHHHHHHHH-hcc-------------------------cCC Q lcl|NC_015158. 548 IFKPNVAVMEAQTTSALVN---QSQAQIEEE-AQV-------------------------PLV 581 (581) Q Consensus 548 ~~~~~~~~~~~~~~q~~~q---~aq~~~~~~-~~~-------------------------~~~ 581 (581) +.+..++++++|..++ ++|++++.+ .+. .|. T Consensus 605 ---~~~~~~e~q~~q~~~~~~~~~q~~l~~~e~~a~~~k~eaea~~~~aqa~~~~~~a~~~~~ 664 (714) T protein:vir:10 605 ---PDEMTPEEQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVA 664 (714) T ss_pred ---ccccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1111112222111111 111111110 000 000 No 12 >protein:vir:95315 Length: 559 # NCBI annotation: putative head-to-tail-joining protein # Family: family:all:481 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512261;genbank:gi:89152428;genbank:GeneID:3952984 Probab=100.00 E-value=1.2e-48 Score=283.40 Aligned_cols=498 Identities=13% Similarity=0.110 Sum_probs=324.3 Q ss_pred hccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhcccccccccc---cccccccccccchHHHHHHHHHHHHHhh Q lcl|NC_015158. 11 MLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTTTTTNS---TLPWKNKTTLPKLCQIRDNLHSNYISAL 87 (581) Q Consensus 11 ~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~~~~~~---~~~~k~~~~~pki~~~~d~~~~~l~~~~ 87 (581) |. +.+++.|.++|+.++..|++|+..|.+|.+|+.|+..+..+.. ...+..+++-+.....+++|.+.||+.+ T Consensus 1 m~----~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~Las~l~~~l 76 (559) T protein:vir:95 1 MA----ETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGI 76 (559) T ss_pred CC----hhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCcCCCCCCcccccccccccchHHHHHHHHHHHHHHhh Confidence 66 4567889999999999999999999999999998875533222 2234456778888899999999999999 Q ss_pred cC-CccEEEeecCChhHHHH------HHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeeeeEe Q lcl|NC_015158. 88 FP-NERWLKWEGKSLQDEAK------RDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESGATR 160 (581) Q Consensus 88 f~-~~~~~~~~~~~~~d~~~------ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~~~~ 160 (581) || +..||++.+.+.+..+. -+.+++.|...|.+|||+..+++.+.|++.+|||++-+....+ T Consensus 77 tpp~~~WF~l~~~d~~~~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~Gta~l~~~~d~~----------- 145 (559) T protein:vir:95 77 TSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLDDDE----------- 145 (559) T ss_pred cCCCCcccccccCCccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeEeecCCC----------- Confidence 99 89999998876543222 2556778889999999999999999999999999987654211 Q ss_pred eeeccceEEecchhheeecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchhhhhcc Q lcl|NC_015158. 161 DTYFGPRAVRIDPKDIVFNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDCEKAV 240 (581) Q Consensus 161 ~~~~~p~ie~V~p~df~~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~ 240 (581) ...++..+++.+||+..++..--++ ++.+..+|..++.+...... ..+++.+.... +++ T Consensus 146 ---~~~r~~~~~l~~~~v~~d~~G~vd~-i~r~~~~t~~ql~~~fg~~~----l~~~~~~~~~~----~~~--------- 204 (559) T protein:vir:95 146 ---DIIRTMPFPIGSYYLANSPRGSVDT-CFRKFSMTVRQLVQEFGLNN----VSESVKSMWES----GTY--------- 204 (559) T ss_pred ---ceeEEEEeecCeEEEeeCCCCCeEE-EEEeEecCHHHHHHHcCccc----CCHHHHHHHhc----CCC--------- Confidence 1235778999999999987643332 34566779888875532211 11222222210 000 Q ss_pred ccccccccccccccCCceEEEEEEeeeeeccc-------CCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEeccccc Q lcl|NC_015158. 241 GFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQ-------SGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIR 313 (581) Q Consensus 241 ~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~-------~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~ 313 (581) ..+++++.+-..-.+.+ +-++..+++..-.+++++++... | ...||.+..|..+ T Consensus 205 ---------------~~~v~v~~~V~pr~~~~~~~~~~~~~pf~s~~~e~~~~~~~~l~esg--~--~e~P~~~~Rw~~~ 265 (559) T protein:vir:95 205 ---------------EKWIEVMHSVYPNIDRDTSKLDSKNKPFKSVYYEVGGDNDKLLRESG--F--DEFPIMAPRWEVN 265 (559) T ss_pred ---------------CCeEEEEEEEeccccccccccccccceEEEEEEEecCCCceeeecCC--c--ccCCccceeeeec Confidence 01233333211000111 11122222222223456766543 3 5689999999999 Q ss_pred CCcccCCC-cHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEeccc--cccccCCceeEEeCCCCC---cccccCCC-cc Q lcl|NC_015158. 314 QDNLYAMG-PLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGDV--EEFVWGPMEQIYINGDGD---VEMMAPNT-QA 386 (581) Q Consensus 314 p~~~~G~s-~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d~--~~i~~~pG~vi~~~~~~~---i~~~~~p~-~~ 386 (581) ++..||.| +++.+++.++.+|.+.+.+++++.++++|.+.+.++- ..++..||+++.+.+.+. ++++.... .. T Consensus 266 ~ge~YGrg~P~~~al~d~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~~~~~l~pgg~~~~~~~~~~~~i~p~~~~~~~~ 345 (559) T protein:vir:95 266 GEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSLKNQRASLLPGDITYIDQITGQDGFRPAYLVNPST 345 (559) T ss_pred CCccccccchHHHhhHHHHHHHHHHHHHHHHHHHHhcCceeccccccccceeeeccceeeeCCCCCcccceeecccccch Confidence 99999999 7999999999999999999999999999999887653 346778999998876543 44443221 12 Q ss_pred chhHHHHHHHHHHHHHhcCCchH-hcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcc Q lcl|NC_015158. 387 LQADMQIQILEAKMEEFAGAPRE-AMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVA 465 (581) Q Consensus 387 ~~~~~~lq~~~~~~ee~TGv~~~-~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~ 465 (581) ..+...++.+.+.+.+..-+.-+ +.+..+....|||+|.+..+.....+.-+..++..+++.|||...+.++.+..-.| T Consensus 346 ~~~~~~i~~~~~rI~~af~~d~~~~l~~r~~~rvTAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP 425 (559) T protein:vir:95 346 ADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRSFSMMVRKNMLP 425 (559) T ss_pred HHHHHHHHHHHHHHHHHhhhhhHHHhhcCCCCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCC Confidence 22333455555555444322211 22334445789999999999999999999999999999999999888876643222 Q ss_pred ceeeecCchhcccCCCccCHHHhc-CCceEEEecchhHHHHHHHHHHHHHhhcc-----cccccccchhHHHHHHHHHHH Q lcl|NC_015158. 466 DTIRVFDSDDKVATFMNVNKDDIT-AKGRLRPVGARHFAEQAQVVQSLMGIANT-----PVWQDIKPHVSTENLAKMLEH 539 (581) Q Consensus 466 ~~iR~~~~~~~~~~~~~v~r~di~-~~~~vva~ga~~~~~r~q~~q~L~~~~~~-----~~~~~i~p~~~~~~l~~~~~e 539 (581) + .++.+. .+++|...+.-+-++|...++.+.++++. ++++.++.+++..++++.+++ T Consensus 426 ~-----------------~p~~l~~~~i~v~~is~La~aqk~~~~~~i~~~~~~~~~laq~~Pevld~id~d~~~~~~a~ 488 (559) T protein:vir:95 426 P-----------------PPDVMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLAQVKPEALDKLNVDQAIDAFAD 488 (559) T ss_pred C-----------------CcccccCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhhhcCCHHHHHHHHHH Confidence 2 233343 23455555544444444333333332222 134455667999999999999 Q ss_pred HhcCCCcccccCCCCcHHHHH----HHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 540 NLSLGGWDIFKPNVAVMEAQT----TSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 540 ~~~l~~~~~~~~~~~~~~~~~----~q~~~q~aq~~~~~~~~~~~~ 581 (581) .+|++. +++++..++.+..+ +|.+||++|+..++.+-...+ T Consensus 489 ~~Gvp~-~~irs~~ev~~~rqqr~~~qq~~q~~~~~~~aa~~~~~~ 533 (559) T protein:vir:95 489 MSGVSP-TVIVPQEQVEQARQQRAQQQQQQQMMAMGMAAAQGVKTL 533 (559) T ss_pred HhCCch-hhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcc Confidence 999996 78888766543111 111122222222222222233 No 13 >protein:vir:817 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050550;genbank:gi:9633447;genbank:GeneID:1262279 Probab=100.00 E-value=1.1e-47 Score=277.98 Aligned_cols=538 Identities=13% Similarity=0.070 Sum_probs=323.8 Q ss_pred CccchhhhhhhccchhhhHHHHHHHHHhhHHhhhhhHHHHHHH--HHHHhhcccccccc----cccccccccccccchHH Q lcl|NC_015158. 1 MTGKVLELQQMLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSE--LRNYIFATDTTTTT----NSTLPWKNKTTLPKLCQ 74 (581) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~--~~~y~~~~~~~~~~----~~~~~~k~~~~~pki~~ 74 (581) |+.|.-+ +.....-..++..|...+.. ...|.+ .++|=|+....++. ..+..+.-.+|.++|.. T Consensus 8 ~~~~~~~---------~~~~~~~~~~~~~~~~~~~~-~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i~~ 77 (714) T protein:vir:81 8 MATKNDN---------GATPRFSQRQLQALCSDIDS-QPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQPMTIHNLIAP 77 (714) T ss_pred ccCCCCc---------chhHHHHHHHHHHHHHHHHh-hHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEeccHHH Confidence 3333211 11111222333333333222 223432 22343333433322 22334566688999999 Q ss_pred HHHHHHHHHHHhhcCCccEEEeecCChhHH--HHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceee Q lcl|NC_015158. 75 IRDNLHSNYISALFPNERWLKWEGKSLQDE--AKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTK 152 (581) Q Consensus 75 ~~d~~~~~l~~~~f~~~~~~~~~~~~~~d~--~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~ 152 (581) .++.+.+... .|+--++|.|+..+++ +.|+++..++.+....|++....++.|.|.++.|.|++.+.+... T Consensus 78 ~v~~v~g~~~----~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d--- 150 (714) T protein:vir:81 78 TVDGVLGMEA----KTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSD--- 150 (714) T ss_pred HHHHHHhHHH----hCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccC--- Confidence 9999999888 6888899999876554 689999999999999999999999999999999999877765311 Q ss_pred eeeeeeEeeeeccceEEecchhheeecCCCC--CcccCceE-EEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCC Q lcl|NC_015158. 153 DEESGATRDTYFGPRAVRIDPKDIVFNPVAV--DFAHSPKI-IRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLG 229 (581) Q Consensus 153 ~~~~~~~~~~~~~p~ie~V~p~df~~DP~a~--~~~d~~~i-~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~ 229 (581) ......++++|+|.+|||||.|+ +++||.|| ++.++|++++++|..... +.+..+. ..-.+.. T Consensus 151 --------~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a--~~i~~~~----~~~~~~~ 216 (714) T protein:vir:81 151 --------PFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMA--QVIDYAI----DDWRGFV 216 (714) T ss_pred --------CCCCCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCch--hhhhhhh----hhhcccc Confidence 11123578999999999999876 68899885 566889999999843211 1111110 0000100 Q ss_pred cccchhh---------hhccccccccccccccccCCceEEEEEEeeeee------cccCCcee----------------- Q lcl|NC_015158. 230 TYTREDC---------EKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYH------DTQSGTFK----------------- 277 (581) Q Consensus 230 ~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~------d~~~d~~~----------------- 277 (581) .....+. +..++. +...+.+......+|+|+|||-..+ +..+|... T Consensus 217 d~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~ 294 (714) T protein:vir:81 217 DTTVTEGQPSPLMSAWEEYQSW--DRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGR 294 (714) T ss_pred ccccccccccccccchhhhccc--cccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcc Confidence 0000000 000000 0001111112234688899995321 11111111 Q ss_pred --------eeeEEEEEeCCEEEEeecCCCccCCCCeeEe-cccc-cCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHh Q lcl|NC_015158. 278 --------RNMKVTIIDRMFVIEEKENPSWFAQAPIFHC-GWRI-RQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLI 347 (581) Q Consensus 278 --------e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~-~~~~-~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s 347 (581) ....+.++.|.++|+...+||+++.+||+-+ ++.. +.+.+| |+.+.++|+|+.+|+....++.. ++ T Consensus 295 ~~~~~~~~~rv~~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~--G~vr~~~d~Qr~~N~~~s~~~~~--l~ 370 (714) T protein:vir:81 295 VQVKVGRVSRIREAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPY--GLISRAIPAQDEVNFRRIKLTWL--LQ 370 (714) T ss_pred hhhhccccceEEEEEEecCcccccCCCCCCCCceeEEEEeeeeeeccCcee--ehhhhchhHHHHHHHHHHHHHHh--hc Confidence 1234556889999999999999999998733 3222 234444 68999999999999877776664 46 Q ss_pred cCCeEEEecccccc-------ccCCceeEEeCCC--------CCcccccCCCccchhHHHHHHHHHHHHHhcCCchHhcC Q lcl|NC_015158. 348 AFPPMKVKGDVEEF-------VWGPMEQIYINGD--------GDVEMMAPNTQALQADMQIQILEAKMEEFAGAPREAMG 412 (581) Q Consensus 348 ~np~~~v~~d~~~i-------~~~pG~vi~~~~~--------~~i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G 412 (581) +|..+...+...+. -++||+++.++.+ ..+++.+++..+.....++++..+.++++|||++.++| T Consensus 371 ~~~~~~~~~a~~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG 450 (714) T protein:vir:81 371 AKRVIMDEDATQLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLG 450 (714) T ss_pred CCceeeecCcccccHHHHHHhccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcC Confidence 77766444332221 2689999998743 12566667777888889999999999999999999999 Q ss_pred CCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceeeecCchhc--ccCCCccCHHHh-- Q lcl|NC_015158. 413 IRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADTIRVFDSDDK--VATFMNVNKDDI-- 488 (581) Q Consensus 413 ~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~iR~~~~~~~--~~~~~~v~r~di-- 488 (581) ..+++ .+...+++++++++..+..+.++++.. ++.+.+.++.++.++++.+.++||+|++.+ ..-++.++++.- T Consensus 451 ~~~na-~SGvAi~~rq~qg~~~l~~~~Dnl~~~-~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~ 528 (714) T protein:vir:81 451 QDSGA-TSGVAISNLVEQGATTLAEINDNYQFA-CQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNG 528 (714) T ss_pred CCccc-hhHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcc Confidence 86643 344458999999999999999999964 677888888888888999999999987533 223566665542 Q ss_pred -------cCCceEEEe-cchhHHHHHHHHHHHHHhhcc--ccccccc--------chhHHHHHHHHHHHHhcCCCccccc Q lcl|NC_015158. 489 -------TAKGRLRPV-GARHFAEQAQVVQSLMGIANT--PVWQDIK--------PHVSTENLAKMLEHNLSLGGWDIFK 550 (581) Q Consensus 489 -------~~~~~vva~-ga~~~~~r~q~~q~L~~~~~~--~~~~~i~--------p~~~~~~l~~~~~e~~~l~~~~~~~ 550 (581) .+.++|+.. |...-..|.+.++.|.++++. |..+.+. ..-+..+|++.+.++++.+.. T Consensus 529 ~~~nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~---- 604 (714) T protein:vir:81 529 ELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKS---- 604 (714) T ss_pred eecccceeeeEEEEEeeccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCC---- Confidence 345666443 333444567777776666542 1111111 111233455666566555432 Q ss_pred CCCCcHHHHHHHHHHHHHH---HHHHHH-hcccC--C Q lcl|NC_015158. 551 PNVAVMEAQTTSALVNQSQ---AQIEEE-AQVPL--V 581 (581) Q Consensus 551 ~~~~~~~~~~~q~~~q~aq---~~~~~~-~~~~~--~ 581 (581) +.+.-+++++.|..+|+++ ++++.+ .+..+ . T Consensus 605 ~~~~~~e~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~ 641 (714) T protein:vir:81 605 PDEMTPEEQEVAAQQQALQQQQAELQMREMAGRVAKL 641 (714) T ss_pred ccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1111122222222221111 111100 00000 0 No 14 >protein:vir:10117 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859247;genbank:gi:32171003;genbank:GeneID:2653343 Probab=100.00 E-value=1.1e-47 Score=277.98 Aligned_cols=538 Identities=13% Similarity=0.070 Sum_probs=323.8 Q ss_pred CccchhhhhhhccchhhhHHHHHHHHHhhHHhhhhhHHHHHHH--HHHHhhcccccccc----cccccccccccccchHH Q lcl|NC_015158. 1 MTGKVLELQQMLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSE--LRNYIFATDTTTTT----NSTLPWKNKTTLPKLCQ 74 (581) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~--~~~y~~~~~~~~~~----~~~~~~k~~~~~pki~~ 74 (581) |+.|.-+ +.....-..++..|...+.. ...|.+ .++|=|+....++. ..+..+.-.+|.++|.. T Consensus 8 ~~~~~~~---------~~~~~~~~~~~~~~~~~~~~-~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i~~ 77 (714) T protein:vir:10 8 MATKNDN---------GATPRFSQRQLQALCSDIDS-QPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQPMTIHNLIAP 77 (714) T ss_pred ccCCCCc---------chhHHHHHHHHHHHHHHHHh-hHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEeccHHH Confidence 3333211 11111222333333333222 223432 22343333433322 22334566688999999 Q ss_pred HHHHHHHHHHHhhcCCccEEEeecCChhHH--HHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceee Q lcl|NC_015158. 75 IRDNLHSNYISALFPNERWLKWEGKSLQDE--AKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTK 152 (581) Q Consensus 75 ~~d~~~~~l~~~~f~~~~~~~~~~~~~~d~--~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~ 152 (581) .++.+.+... .|+--++|.|+..+++ +.|+++..++.+....|++....++.|.|.++.|.|++.+.+... T Consensus 78 ~v~~v~g~~~----~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d--- 150 (714) T protein:vir:10 78 TVDGVLGMEA----KTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSD--- 150 (714) T ss_pred HHHHHHhHHH----hCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccC--- Confidence 9999999888 6888899999876554 689999999999999999999999999999999999877765311 Q ss_pred eeeeeeEeeeeccceEEecchhheeecCCCC--CcccCceE-EEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCC Q lcl|NC_015158. 153 DEESGATRDTYFGPRAVRIDPKDIVFNPVAV--DFAHSPKI-IRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLG 229 (581) Q Consensus 153 ~~~~~~~~~~~~~p~ie~V~p~df~~DP~a~--~~~d~~~i-~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~ 229 (581) ......++++|+|.+|||||.|+ +++||.|| ++.++|++++++|..... +.+..+. ..-.+.. T Consensus 151 --------~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a--~~i~~~~----~~~~~~~ 216 (714) T protein:vir:10 151 --------PFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMA--QVIDYAI----DDWRGFV 216 (714) T ss_pred --------CCCCCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCch--hhhhhhh----hhhcccc Confidence 11123578999999999999876 68899885 566889999999843211 1111110 0000100 Q ss_pred cccchhh---------hhccccccccccccccccCCceEEEEEEeeeee------cccCCcee----------------- Q lcl|NC_015158. 230 TYTREDC---------EKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYH------DTQSGTFK----------------- 277 (581) Q Consensus 230 ~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~------d~~~d~~~----------------- 277 (581) .....+. +..++. +...+.+......+|+|+|||-..+ +..+|... T Consensus 217 d~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~ 294 (714) T protein:vir:10 217 DTTVTEGQPSPLMSAWEEYQSW--DRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGR 294 (714) T ss_pred ccccccccccccccchhhhccc--cccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcc Confidence 0000000 000000 0001111112234688899995321 11111111 Q ss_pred --------eeeEEEEEeCCEEEEeecCCCccCCCCeeEe-cccc-cCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHh Q lcl|NC_015158. 278 --------RNMKVTIIDRMFVIEEKENPSWFAQAPIFHC-GWRI-RQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLI 347 (581) Q Consensus 278 --------e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~-~~~~-~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s 347 (581) ....+.++.|.++|+...+||+++.+||+-+ ++.. +.+.+| |+.+.++|+|+.+|+....++.. ++ T Consensus 295 ~~~~~~~~~rv~~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~--G~vr~~~d~Qr~~N~~~s~~~~~--l~ 370 (714) T protein:vir:10 295 VQVKVGRVSRIREAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPY--GLISRAIPAQDEVNFRRIKLTWL--LQ 370 (714) T ss_pred hhhhccccceEEEEEEecCcccccCCCCCCCCceeEEEEeeeeeeccCcee--ehhhhchhHHHHHHHHHHHHHHh--hc Confidence 1234556889999999999999999998733 3222 234444 68999999999999877776664 46 Q ss_pred cCCeEEEecccccc-------ccCCceeEEeCCC--------CCcccccCCCccchhHHHHHHHHHHHHHhcCCchHhcC Q lcl|NC_015158. 348 AFPPMKVKGDVEEF-------VWGPMEQIYINGD--------GDVEMMAPNTQALQADMQIQILEAKMEEFAGAPREAMG 412 (581) Q Consensus 348 ~np~~~v~~d~~~i-------~~~pG~vi~~~~~--------~~i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G 412 (581) +|..+...+...+. -++||+++.++.+ ..+++.+++..+.....++++..+.++++|||++.++| T Consensus 371 ~~~~~~~~~a~~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG 450 (714) T protein:vir:10 371 AKRVIMDEDATQLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLG 450 (714) T ss_pred CCceeeecCcccccHHHHHHhccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcC Confidence 77766444332221 2689999998743 12566667777888889999999999999999999999 Q ss_pred CCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceeeecCchhc--ccCCCccCHHHh-- Q lcl|NC_015158. 413 IRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADTIRVFDSDDK--VATFMNVNKDDI-- 488 (581) Q Consensus 413 ~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~iR~~~~~~~--~~~~~~v~r~di-- 488 (581) ..+++ .+...+++++++++..+..+.++++.. ++.+.+.++.++.++++.+.++||+|++.+ ..-++.++++.- T Consensus 451 ~~~na-~SGvAi~~rq~qg~~~l~~~~Dnl~~~-~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~ 528 (714) T protein:vir:10 451 QDSGA-TSGVAISNLVEQGATTLAEINDNYQFA-CQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNG 528 (714) T ss_pred CCccc-hhHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcc Confidence 86643 344458999999999999999999964 677888888888888999999999987533 223566665542 Q ss_pred -------cCCceEEEe-cchhHHHHHHHHHHHHHhhcc--ccccccc--------chhHHHHHHHHHHHHhcCCCccccc Q lcl|NC_015158. 489 -------TAKGRLRPV-GARHFAEQAQVVQSLMGIANT--PVWQDIK--------PHVSTENLAKMLEHNLSLGGWDIFK 550 (581) Q Consensus 489 -------~~~~~vva~-ga~~~~~r~q~~q~L~~~~~~--~~~~~i~--------p~~~~~~l~~~~~e~~~l~~~~~~~ 550 (581) .+.++|+.. |...-..|.+.++.|.++++. |..+.+. ..-+..+|++.+.++++.+.. T Consensus 529 ~~~nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~---- 604 (714) T protein:vir:10 529 ELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKS---- 604 (714) T ss_pred eecccceeeeEEEEEeeccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCC---- Confidence 345666443 333444567777776666542 1111111 111233455666566555432 Q ss_pred CCCCcHHHHHHHHHHHHHH---HHHHHH-hcccC--C Q lcl|NC_015158. 551 PNVAVMEAQTTSALVNQSQ---AQIEEE-AQVPL--V 581 (581) Q Consensus 551 ~~~~~~~~~~~q~~~q~aq---~~~~~~-~~~~~--~ 581 (581) +.+.-+++++.|..+|+++ ++++.+ .+..+ . T Consensus 605 ~~~~~~e~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~ 641 (714) T protein:vir:10 605 PDEMTPEEQEVAAQQQALQQQQAELQMREMAGRVAKL 641 (714) T ss_pred ccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1111122222222221111 111100 00000 0 No 15 >protein:vir:9950 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:179 # MgeName: Stx1 converting bacteriophage # Cross-refs: genbank:acc:NP_859080;genbank:gi:32170835;genbank:GeneID:2653184 Probab=100.00 E-value=1.1e-47 Score=277.98 Aligned_cols=538 Identities=13% Similarity=0.070 Sum_probs=323.8 Q ss_pred CccchhhhhhhccchhhhHHHHHHHHHhhHHhhhhhHHHHHHH--HHHHhhcccccccc----cccccccccccccchHH Q lcl|NC_015158. 1 MTGKVLELQQMLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSE--LRNYIFATDTTTTT----NSTLPWKNKTTLPKLCQ 74 (581) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~--~~~y~~~~~~~~~~----~~~~~~k~~~~~pki~~ 74 (581) |+.|.-+ +.....-..++..|...+.. ...|.+ .++|=|+....++. ..+..+.-.+|.++|.. T Consensus 8 ~~~~~~~---------~~~~~~~~~~~~~~~~~~~~-~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i~~ 77 (714) T protein:vir:99 8 MATKNDN---------GATPRFSQRQLQALCSDIDS-QPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQPMTIHNLIAP 77 (714) T ss_pred ccCCCCc---------chhHHHHHHHHHHHHHHHHh-hHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEeccHHH Confidence 3333211 11111222333333333222 223432 22343333433322 22334566688999999 Q ss_pred HHHHHHHHHHHhhcCCccEEEeecCChhHH--HHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceee Q lcl|NC_015158. 75 IRDNLHSNYISALFPNERWLKWEGKSLQDE--AKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTK 152 (581) Q Consensus 75 ~~d~~~~~l~~~~f~~~~~~~~~~~~~~d~--~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~ 152 (581) .++.+.+... .|+--++|.|+..+++ +.|+++..++.+....|++....++.|.|.++.|.|++.+.+... T Consensus 78 ~v~~v~g~~~----~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d--- 150 (714) T protein:vir:99 78 TVDGVLGMEA----KTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSD--- 150 (714) T ss_pred HHHHHHhHHH----hCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccC--- Confidence 9999999888 6888899999876554 689999999999999999999999999999999999877765311 Q ss_pred eeeeeeEeeeeccceEEecchhheeecCCCC--CcccCceE-EEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCC Q lcl|NC_015158. 153 DEESGATRDTYFGPRAVRIDPKDIVFNPVAV--DFAHSPKI-IRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLG 229 (581) Q Consensus 153 ~~~~~~~~~~~~~p~ie~V~p~df~~DP~a~--~~~d~~~i-~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~ 229 (581) ......++++|+|.+|||||.|+ +++||.|| ++.++|++++++|..... +.+..+. ..-.+.. T Consensus 151 --------~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a--~~i~~~~----~~~~~~~ 216 (714) T protein:vir:99 151 --------PFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMA--QVIDYAI----DDWRGFV 216 (714) T ss_pred --------CCCCCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCch--hhhhhhh----hhhcccc Confidence 11123578999999999999876 68899885 566889999999843211 1111110 0000100 Q ss_pred cccchhh---------hhccccccccccccccccCCceEEEEEEeeeee------cccCCcee----------------- Q lcl|NC_015158. 230 TYTREDC---------EKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYH------DTQSGTFK----------------- 277 (581) Q Consensus 230 ~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~------d~~~d~~~----------------- 277 (581) .....+. +..++. +...+.+......+|+|+|||-..+ +..+|... T Consensus 217 d~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~ 294 (714) T protein:vir:99 217 DTTVTEGQPSPLMSAWEEYQSW--DRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGR 294 (714) T ss_pred ccccccccccccccchhhhccc--cccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcc Confidence 0000000 000000 0001111112234688899995321 11111111 Q ss_pred --------eeeEEEEEeCCEEEEeecCCCccCCCCeeEe-cccc-cCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHh Q lcl|NC_015158. 278 --------RNMKVTIIDRMFVIEEKENPSWFAQAPIFHC-GWRI-RQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLI 347 (581) Q Consensus 278 --------e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~-~~~~-~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s 347 (581) ....+.++.|.++|+...+||+++.+||+-+ ++.. +.+.+| |+.+.++|+|+.+|+....++.. ++ T Consensus 295 ~~~~~~~~~rv~~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~--G~vr~~~d~Qr~~N~~~s~~~~~--l~ 370 (714) T protein:vir:99 295 VQVKVGRVSRIREAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPY--GLISRAIPAQDEVNFRRIKLTWL--LQ 370 (714) T ss_pred hhhhccccceEEEEEEecCcccccCCCCCCCCceeEEEEeeeeeeccCcee--ehhhhchhHHHHHHHHHHHHHHh--hc Confidence 1234556889999999999999999998733 3222 234444 68999999999999877776664 46 Q ss_pred cCCeEEEecccccc-------ccCCceeEEeCCC--------CCcccccCCCccchhHHHHHHHHHHHHHhcCCchHhcC Q lcl|NC_015158. 348 AFPPMKVKGDVEEF-------VWGPMEQIYINGD--------GDVEMMAPNTQALQADMQIQILEAKMEEFAGAPREAMG 412 (581) Q Consensus 348 ~np~~~v~~d~~~i-------~~~pG~vi~~~~~--------~~i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G 412 (581) +|..+...+...+. -++||+++.++.+ ..+++.+++..+.....++++..+.++++|||++.++| T Consensus 371 ~~~~~~~~~a~~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG 450 (714) T protein:vir:99 371 AKRVIMDEDATQLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLG 450 (714) T ss_pred CCceeeecCcccccHHHHHHhccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcC Confidence 77766444332221 2689999998743 12566667777888889999999999999999999999 Q ss_pred CCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceeeecCchhc--ccCCCccCHHHh-- Q lcl|NC_015158. 413 IRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADTIRVFDSDDK--VATFMNVNKDDI-- 488 (581) Q Consensus 413 ~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~iR~~~~~~~--~~~~~~v~r~di-- 488 (581) ..+++ .+...+++++++++..+..+.++++.. ++.+.+.++.++.++++.+.++||+|++.+ ..-++.++++.- T Consensus 451 ~~~na-~SGvAi~~rq~qg~~~l~~~~Dnl~~~-~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~ 528 (714) T protein:vir:99 451 QDSGA-TSGVAISNLVEQGATTLAEINDNYQFA-CQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNG 528 (714) T ss_pred CCccc-hhHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcc Confidence 86643 344458999999999999999999964 677888888888888999999999987533 223566665542 Q ss_pred -------cCCceEEEe-cchhHHHHHHHHHHHHHhhcc--ccccccc--------chhHHHHHHHHHHHHhcCCCccccc Q lcl|NC_015158. 489 -------TAKGRLRPV-GARHFAEQAQVVQSLMGIANT--PVWQDIK--------PHVSTENLAKMLEHNLSLGGWDIFK 550 (581) Q Consensus 489 -------~~~~~vva~-ga~~~~~r~q~~q~L~~~~~~--~~~~~i~--------p~~~~~~l~~~~~e~~~l~~~~~~~ 550 (581) .+.++|+.. |...-..|.+.++.|.++++. |..+.+. ..-+..+|++.+.++++.+.. T Consensus 529 ~~~nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~---- 604 (714) T protein:vir:99 529 ELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKS---- 604 (714) T ss_pred eecccceeeeEEEEEeeccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCC---- Confidence 345666443 333444567777776666542 1111111 111233455666566555432 Q ss_pred CCCCcHHHHHHHHHHHHHH---HHHHHH-hcccC--C Q lcl|NC_015158. 551 PNVAVMEAQTTSALVNQSQ---AQIEEE-AQVPL--V 581 (581) Q Consensus 551 ~~~~~~~~~~~q~~~q~aq---~~~~~~-~~~~~--~ 581 (581) +.+.-+++++.|..+|+++ ++++.+ .+..+ . T Consensus 605 ~~~~~~e~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~ 641 (714) T protein:vir:99 605 PDEMTPEEQEVAAQQQALQQQQAELQMREMAGRVAKL 641 (714) T ss_pred ccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1111122222222221111 111100 00000 0 No 16 >protein:vir:2764 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612881;genbank:gi:20065798;genbank:GeneID:935623 Probab=100.00 E-value=1.1e-47 Score=277.98 Aligned_cols=538 Identities=13% Similarity=0.070 Sum_probs=323.8 Q ss_pred CccchhhhhhhccchhhhHHHHHHHHHhhHHhhhhhHHHHHHH--HHHHhhcccccccc----cccccccccccccchHH Q lcl|NC_015158. 1 MTGKVLELQQMLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSE--LRNYIFATDTTTTT----NSTLPWKNKTTLPKLCQ 74 (581) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~--~~~y~~~~~~~~~~----~~~~~~k~~~~~pki~~ 74 (581) |+.|.-+ +.....-..++..|...+.. ...|.+ .++|=|+....++. ..+..+.-.+|.++|.. T Consensus 8 ~~~~~~~---------~~~~~~~~~~~~~~~~~~~~-~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i~~ 77 (714) T protein:vir:27 8 MATKNDN---------GATPRFSQRQLQALCSDIDS-QPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQPMTIHNLIAP 77 (714) T ss_pred ccCCCCc---------chhHHHHHHHHHHHHHHHHh-hHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEeccHHH Confidence 3333211 11111222333333333222 223432 22343333433322 22334566688999999 Q ss_pred HHHHHHHHHHHhhcCCccEEEeecCChhHH--HHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceee Q lcl|NC_015158. 75 IRDNLHSNYISALFPNERWLKWEGKSLQDE--AKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTK 152 (581) Q Consensus 75 ~~d~~~~~l~~~~f~~~~~~~~~~~~~~d~--~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~ 152 (581) .++.+.+... .|+--++|.|+..+++ +.|+++..++.+....|++....++.|.|.++.|.|++.+.+... T Consensus 78 ~v~~v~g~~~----~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d--- 150 (714) T protein:vir:27 78 TVDGVLGMEA----KTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSD--- 150 (714) T ss_pred HHHHHHhHHH----hCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccC--- Confidence 9999999888 6888899999876554 689999999999999999999999999999999999877765311 Q ss_pred eeeeeeEeeeeccceEEecchhheeecCCCC--CcccCceE-EEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCC Q lcl|NC_015158. 153 DEESGATRDTYFGPRAVRIDPKDIVFNPVAV--DFAHSPKI-IRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLG 229 (581) Q Consensus 153 ~~~~~~~~~~~~~p~ie~V~p~df~~DP~a~--~~~d~~~i-~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~ 229 (581) ......++++|+|.+|||||.|+ +++||.|| ++.++|++++++|..... +.+..+. ..-.+.. T Consensus 151 --------~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a--~~i~~~~----~~~~~~~ 216 (714) T protein:vir:27 151 --------PFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMA--QVIDYAI----DDWRGFV 216 (714) T ss_pred --------CCCCCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCch--hhhhhhh----hhhcccc Confidence 11123578999999999999876 68899885 566889999999843211 1111110 0000100 Q ss_pred cccchhh---------hhccccccccccccccccCCceEEEEEEeeeee------cccCCcee----------------- Q lcl|NC_015158. 230 TYTREDC---------EKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYH------DTQSGTFK----------------- 277 (581) Q Consensus 230 ~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~------d~~~d~~~----------------- 277 (581) .....+. +..++. +...+.+......+|+|+|||-..+ +..+|... T Consensus 217 d~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~ 294 (714) T protein:vir:27 217 DTTVTEGQPSPLMSAWEEYQSW--DRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGR 294 (714) T ss_pred ccccccccccccccchhhhccc--cccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcc Confidence 0000000 000000 0001111112234688899995321 11111111 Q ss_pred --------eeeEEEEEeCCEEEEeecCCCccCCCCeeEe-cccc-cCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHh Q lcl|NC_015158. 278 --------RNMKVTIIDRMFVIEEKENPSWFAQAPIFHC-GWRI-RQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLI 347 (581) Q Consensus 278 --------e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~-~~~~-~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s 347 (581) ....+.++.|.++|+...+||+++.+||+-+ ++.. +.+.+| |+.+.++|+|+.+|+....++.. ++ T Consensus 295 ~~~~~~~~~rv~~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~--G~vr~~~d~Qr~~N~~~s~~~~~--l~ 370 (714) T protein:vir:27 295 VQVKVGRVSRIREAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPY--GLISRAIPAQDEVNFRRIKLTWL--LQ 370 (714) T ss_pred hhhhccccceEEEEEEecCcccccCCCCCCCCceeEEEEeeeeeeccCcee--ehhhhchhHHHHHHHHHHHHHHh--hc Confidence 1234556889999999999999999998733 3222 234444 68999999999999877776664 46 Q ss_pred cCCeEEEecccccc-------ccCCceeEEeCCC--------CCcccccCCCccchhHHHHHHHHHHHHHhcCCchHhcC Q lcl|NC_015158. 348 AFPPMKVKGDVEEF-------VWGPMEQIYINGD--------GDVEMMAPNTQALQADMQIQILEAKMEEFAGAPREAMG 412 (581) Q Consensus 348 ~np~~~v~~d~~~i-------~~~pG~vi~~~~~--------~~i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G 412 (581) +|..+...+...+. -++||+++.++.+ ..+++.+++..+.....++++..+.++++|||++.++| T Consensus 371 ~~~~~~~~~a~~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG 450 (714) T protein:vir:27 371 AKRVIMDEDATQLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLG 450 (714) T ss_pred CCceeeecCcccccHHHHHHhccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcC Confidence 77766444332221 2689999998743 12566667777888889999999999999999999999 Q ss_pred CCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceeeecCchhc--ccCCCccCHHHh-- Q lcl|NC_015158. 413 IRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADTIRVFDSDDK--VATFMNVNKDDI-- 488 (581) Q Consensus 413 ~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~iR~~~~~~~--~~~~~~v~r~di-- 488 (581) ..+++ .+...+++++++++..+..+.++++.. ++.+.+.++.++.++++.+.++||+|++.+ ..-++.++++.- T Consensus 451 ~~~na-~SGvAi~~rq~qg~~~l~~~~Dnl~~~-~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~ 528 (714) T protein:vir:27 451 QDSGA-TSGVAISNLVEQGATTLAEINDNYQFA-CQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNG 528 (714) T ss_pred CCccc-hhHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcc Confidence 86643 344458999999999999999999964 677888888888888999999999987533 223566665542 Q ss_pred -------cCCceEEEe-cchhHHHHHHHHHHHHHhhcc--ccccccc--------chhHHHHHHHHHHHHhcCCCccccc Q lcl|NC_015158. 489 -------TAKGRLRPV-GARHFAEQAQVVQSLMGIANT--PVWQDIK--------PHVSTENLAKMLEHNLSLGGWDIFK 550 (581) Q Consensus 489 -------~~~~~vva~-ga~~~~~r~q~~q~L~~~~~~--~~~~~i~--------p~~~~~~l~~~~~e~~~l~~~~~~~ 550 (581) .+.++|+.. |...-..|.+.++.|.++++. |..+.+. ..-+..+|++.+.++++.+.. T Consensus 529 ~~~nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~---- 604 (714) T protein:vir:27 529 ELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKS---- 604 (714) T ss_pred eecccceeeeEEEEEeeccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCC---- Confidence 345666443 333444567777776666542 1111111 111233455666566555432 Q ss_pred CCCCcHHHHHHHHHHHHHH---HHHHHH-hcccC--C Q lcl|NC_015158. 551 PNVAVMEAQTTSALVNQSQ---AQIEEE-AQVPL--V 581 (581) Q Consensus 551 ~~~~~~~~~~~q~~~q~aq---~~~~~~-~~~~~--~ 581 (581) +.+.-+++++.|..+|+++ ++++.+ .+..+ . T Consensus 605 ~~~~~~e~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~ 641 (714) T protein:vir:27 605 PDEMTPEEQEVAAQQQALQQQQAELQMREMAGRVAKL 641 (714) T ss_pred ccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1111122222222221111 111100 00000 0 No 17 >protein:vir:3296 Length: 714 # NCBI annotation: putative portal protein # Family: family:all:487 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049512;genbank:gi:9632518;genbank:GeneID:1262005 Probab=100.00 E-value=1.1e-47 Score=277.98 Aligned_cols=538 Identities=13% Similarity=0.070 Sum_probs=323.8 Q ss_pred CccchhhhhhhccchhhhHHHHHHHHHhhHHhhhhhHHHHHHH--HHHHhhcccccccc----cccccccccccccchHH Q lcl|NC_015158. 1 MTGKVLELQQMLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSE--LRNYIFATDTTTTT----NSTLPWKNKTTLPKLCQ 74 (581) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~--~~~y~~~~~~~~~~----~~~~~~k~~~~~pki~~ 74 (581) |+.|.-+ +.....-..++..|...+.. ...|.+ .++|=|+....++. ..+..+.-.+|.++|.. T Consensus 8 ~~~~~~~---------~~~~~~~~~~~~~~~~~~~~-~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i~~ 77 (714) T protein:vir:32 8 MATKNDN---------GATPRFSQRQLQALCSDIDS-QPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQPMTIHNLIAP 77 (714) T ss_pred ccCCCCc---------chhHHHHHHHHHHHHHHHHh-hHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEeccHHH Confidence 3333211 11111222333333333222 223432 22343333433322 22334566688999999 Q ss_pred HHHHHHHHHHHhhcCCccEEEeecCChhHH--HHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceee Q lcl|NC_015158. 75 IRDNLHSNYISALFPNERWLKWEGKSLQDE--AKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTK 152 (581) Q Consensus 75 ~~d~~~~~l~~~~f~~~~~~~~~~~~~~d~--~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~ 152 (581) .++.+.+... .|+--++|.|+..+++ +.|+++..++.+....|++....++.|.|.++.|.|++.+.+... T Consensus 78 ~v~~v~g~~~----~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d--- 150 (714) T protein:vir:32 78 TVDGVLGMEA----KTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSD--- 150 (714) T ss_pred HHHHHHhHHH----hCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccC--- Confidence 9999999888 6888899999876554 689999999999999999999999999999999999877765311 Q ss_pred eeeeeeEeeeeccceEEecchhheeecCCCC--CcccCceE-EEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCC Q lcl|NC_015158. 153 DEESGATRDTYFGPRAVRIDPKDIVFNPVAV--DFAHSPKI-IRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLG 229 (581) Q Consensus 153 ~~~~~~~~~~~~~p~ie~V~p~df~~DP~a~--~~~d~~~i-~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~ 229 (581) ......++++|+|.+|||||.|+ +++||.|| ++.++|++++++|..... +.+..+. ..-.+.. T Consensus 151 --------~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a--~~i~~~~----~~~~~~~ 216 (714) T protein:vir:32 151 --------PFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMA--QVIDYAI----DDWRGFV 216 (714) T ss_pred --------CCCCCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCch--hhhhhhh----hhhcccc Confidence 11123578999999999999876 68899885 566889999999843211 1111110 0000100 Q ss_pred cccchhh---------hhccccccccccccccccCCceEEEEEEeeeee------cccCCcee----------------- Q lcl|NC_015158. 230 TYTREDC---------EKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYH------DTQSGTFK----------------- 277 (581) Q Consensus 230 ~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~------d~~~d~~~----------------- 277 (581) .....+. +..++. +...+.+......+|+|+|||-..+ +..+|... T Consensus 217 d~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~ 294 (714) T protein:vir:32 217 DTTVTEGQPSPLMSAWEEYQSW--DRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGR 294 (714) T ss_pred ccccccccccccccchhhhccc--cccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcc Confidence 0000000 000000 0001111112234688899995321 11111111 Q ss_pred --------eeeEEEEEeCCEEEEeecCCCccCCCCeeEe-cccc-cCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHh Q lcl|NC_015158. 278 --------RNMKVTIIDRMFVIEEKENPSWFAQAPIFHC-GWRI-RQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLI 347 (581) Q Consensus 278 --------e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~-~~~~-~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s 347 (581) ....+.++.|.++|+...+||+++.+||+-+ ++.. +.+.+| |+.+.++|+|+.+|+....++.. ++ T Consensus 295 ~~~~~~~~~rv~~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~--G~vr~~~d~Qr~~N~~~s~~~~~--l~ 370 (714) T protein:vir:32 295 VQVKVGRVSRIREAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPY--GLISRAIPAQDEVNFRRIKLTWL--LQ 370 (714) T ss_pred hhhhccccceEEEEEEecCcccccCCCCCCCCceeEEEEeeeeeeccCcee--ehhhhchhHHHHHHHHHHHHHHh--hc Confidence 1234556889999999999999999998733 3222 234444 68999999999999877776664 46 Q ss_pred cCCeEEEecccccc-------ccCCceeEEeCCC--------CCcccccCCCccchhHHHHHHHHHHHHHhcCCchHhcC Q lcl|NC_015158. 348 AFPPMKVKGDVEEF-------VWGPMEQIYINGD--------GDVEMMAPNTQALQADMQIQILEAKMEEFAGAPREAMG 412 (581) Q Consensus 348 ~np~~~v~~d~~~i-------~~~pG~vi~~~~~--------~~i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G 412 (581) +|..+...+...+. -++||+++.++.+ ..+++.+++..+.....++++..+.++++|||++.++| T Consensus 371 ~~~~~~~~~a~~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG 450 (714) T protein:vir:32 371 AKRVIMDEDATQLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLG 450 (714) T ss_pred CCceeeecCcccccHHHHHHhccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcC Confidence 77766444332221 2689999998743 12566667777888889999999999999999999999 Q ss_pred CCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceeeecCchhc--ccCCCccCHHHh-- Q lcl|NC_015158. 413 IRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADTIRVFDSDDK--VATFMNVNKDDI-- 488 (581) Q Consensus 413 ~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~iR~~~~~~~--~~~~~~v~r~di-- 488 (581) ..+++ .+...+++++++++..+..+.++++.. ++.+.+.++.++.++++.+.++||+|++.+ ..-++.++++.- T Consensus 451 ~~~na-~SGvAi~~rq~qg~~~l~~~~Dnl~~~-~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~ 528 (714) T protein:vir:32 451 QDSGA-TSGVAISNLVEQGATTLAEINDNYQFA-CQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNG 528 (714) T ss_pred CCccc-hhHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcc Confidence 86643 344458999999999999999999964 677888888888888999999999987533 223566665542 Q ss_pred -------cCCceEEEe-cchhHHHHHHHHHHHHHhhcc--ccccccc--------chhHHHHHHHHHHHHhcCCCccccc Q lcl|NC_015158. 489 -------TAKGRLRPV-GARHFAEQAQVVQSLMGIANT--PVWQDIK--------PHVSTENLAKMLEHNLSLGGWDIFK 550 (581) Q Consensus 489 -------~~~~~vva~-ga~~~~~r~q~~q~L~~~~~~--~~~~~i~--------p~~~~~~l~~~~~e~~~l~~~~~~~ 550 (581) .+.++|+.. |...-..|.+.++.|.++++. |..+.+. ..-+..+|++.+.++++.+.. T Consensus 529 ~~~nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~---- 604 (714) T protein:vir:32 529 ELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKS---- 604 (714) T ss_pred eecccceeeeEEEEEeeccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCC---- Confidence 345666443 333444567777776666542 1111111 111233455666566555432 Q ss_pred CCCCcHHHHHHHHHHHHHH---HHHHHH-hcccC--C Q lcl|NC_015158. 551 PNVAVMEAQTTSALVNQSQ---AQIEEE-AQVPL--V 581 (581) Q Consensus 551 ~~~~~~~~~~~q~~~q~aq---~~~~~~-~~~~~--~ 581 (581) +.+.-+++++.|..+|+++ ++++.+ .+..+ . T Consensus 605 ~~~~~~e~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~ 641 (714) T protein:vir:32 605 PDEMTPEEQEVAAQQQALQQQQAELQMREMAGRVAKL 641 (714) T ss_pred ccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1111122222222221111 111100 00000 0 No 18 >protein:vir:3361 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523332;genbank:gi:17570823;genbank:GeneID:927409 Probab=100.00 E-value=9.1e-47 Score=273.00 Aligned_cols=484 Identities=14% Similarity=0.123 Sum_probs=321.7 Q ss_pred hccchhhhHH-HHHHHHHhhHHhhhhhHHHHHHHHHHHhhcccccccccccccccccccccchHHHHHHHHHHHHHhhcC Q lcl|NC_015158. 11 MLDDTRDGLA-EQIANTWQNWNSQRQEWLSQKSELRNYIFATDTTTTTNSTLPWKNKTTLPKLCQIRDNLHSNYISALFP 89 (581) Q Consensus 11 ~~~~~~~~~a-~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~~~~~~~~~~k~~~~~pki~~~~d~~~~~l~~~~f~ 89 (581) |.+..++.++ ..++++|+.+++.|++|+..|.+|.+|+.|+.....+........+++-+.-...+++|.+.||+.+|| T Consensus 1 m~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP 80 (535) T protein:vir:33 1 MADSKRTGLGEDGAKATYDRLTNDRRAYETRAENCAQYTIPSLFPKESDNESTDYTTPWQAVGARGLNNLASKLMLALFP 80 (535) T ss_pred CChhhhhccChhHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHHHHHhhcC Confidence 6665566666 566789999999999999999999999999864443333333334556666789999999999999999 Q ss_pred CccEEEeecCChhHH----------HH---HHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeee Q lcl|NC_015158. 90 NERWLKWEGKSLQDE----------AK---RDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEES 156 (581) Q Consensus 90 ~~~~~~~~~~~~~d~----------~~---ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~ 156 (581) +..||++.+...+.+ +. -+..++.|...|..|||+..+++.+.|++.+|||++.++...+. T Consensus 81 ~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~------ 154 (535) T protein:vir:33 81 MQSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFECLKQLIVAGNALLYLPEPEGS------ 154 (535) T ss_pred CCcccccccChHHHhccccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeEEeecCCCC------ Confidence 999999987664321 11 24567788899999999999999999999999999998753221 Q ss_pred eeEeeeeccceEEecchhheeecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchhh Q lcl|NC_015158. 157 GATRDTYFGPRAVRIDPKDIVFNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDC 236 (581) Q Consensus 157 ~~~~~~~~~p~ie~V~p~df~~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~ 236 (581) ...+..+++-++++..++..--++ .+.+..+|..+|-+-... +.. + ... T Consensus 155 --------~~~f~~~pl~~~~v~~d~~G~vd~-i~r~~~~t~~ql~~~~~~---------~~~----------~---~~~ 203 (535) T protein:vir:33 155 --------YNPMKLYRLSSYVVQRDAYGNVLQ-IVTRDQIAFGALPEDVRS---------AVE----------K---SGG 203 (535) T ss_pred --------ceeeEEEEcCeeEEeeCCCCCeeE-EEeeEeecHHHHHHHhhh---------hhc----------c---ccc Confidence 124667888999999886643232 345667787776332110 000 0 000 Q ss_pred hhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccccCCc Q lcl|NC_015158. 237 EKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIRQDN 316 (581) Q Consensus 237 ~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p~~ 316 (581) ++. ....++++++. +.+..++.+. .++...|.. +..+...++++..||.+..|...++. T Consensus 204 ~k~---------------~~~~~~v~~~v--~~~~~~~~~~---~~~~~~~~~-~~~~~~~~~~~~~P~i~~Rw~~~~ge 262 (535) T protein:vir:33 204 EKK---------------MDEMVDVYTHV--YLDEESGDYL---KYEEVEDVE-IDGSDATYPTDAMPYIPVRMVRIDGE 262 (535) T ss_pred ccc---------------cccCCeEEEEE--EeeCCCCcEE---EEEEEeCcc-ccccccccccccCCceeeeeeecCCC Confidence 000 00123344332 1234455542 233345553 43344556778899999999999999 Q ss_pred ccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc----cccccc-CCceeEEeCCCCCcccccC--CCccchh Q lcl|NC_015158. 317 LYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD----VEEFVW-GPMEQIYINGDGDVEMMAP--NTQALQA 389 (581) Q Consensus 317 ~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d----~~~i~~-~pG~vi~~~~~~~i~~~~~--p~~~~~~ 389 (581) .||+|+++...+..+.+|.+.+.++.+..++++|++.+..+ +.++.. .+|. |....++++++++. +..+..+ T Consensus 263 ~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~lv~~~g~~~~~~~~~~~~g~-~v~g~~~~v~~~~~~~~~~~~~~ 341 (535) T protein:vir:33 263 SYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVNPAGITQPRRLTKAQTGD-FVPGRREDIDFLQLEKQADFTVA 341 (535) T ss_pred ccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccchhhcccCCcee-eecCCcccceeeecccccchhHH Confidence 99999999999999999999999999999999999988643 444332 3333 44566677777653 3345556 Q ss_pred HHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceee Q lcl|NC_015158. 390 DMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADTIR 469 (581) Q Consensus 390 ~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~iR 469 (581) ...++.+.+.+.+.. ... +.+..+....|||+|.+..+.....+.-+..++..+++.|||...+..+.+..- T Consensus 342 ~~~i~~~~~~I~~af-~~~-~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~------ 413 (535) T protein:vir:33 342 KAVSDQIEARLSYAF-MLN-SAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATSQ------ 413 (535) T ss_pred HHHHHHHHHHHHHHH-hhh-hcccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCC------ Confidence 666777777776643 111 233344456899999999999999999999999999999999988777655322 Q ss_pred ecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhccc--ccc-cccchhHHHHHHHHHHHHhcCCCc Q lcl|NC_015158. 470 VFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTP--VWQ-DIKPHVSTENLAKMLEHNLSLGGW 546 (581) Q Consensus 470 ~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~--~~~-~i~p~~~~~~l~~~~~e~~~l~~~ 546 (581) +..++.++++.. ....-+-++|.+.++.|.+.++.. +++ .+.+.++..++++.+++..|++.. T Consensus 414 ----------lP~~p~~~v~~~----yis~La~aqr~~~~~~l~~~~~~la~~~P~~~d~~id~d~~~~~~a~~~Gvp~~ 479 (535) T protein:vir:33 414 ----------IPELPKEAVEPT----ISTGLEAIGRGQDLDKLERCISAWAALAPMQGDPDINLAVIKLRIANAIGIDTS 479 (535) T ss_pred ----------CCCCCccceeEE----EecHHHHHHHHHHHHHHHHHHHHHHhhChhhhhccCCHHHHHHHHHHHcCCCHh Confidence 223334444333 232223345555555555554332 233 233568888999999999999976 Q ss_pred ccccCCCCcHHHHHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 547 DIFKPNVAVMEAQTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 547 ~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) .++++.-++ +| ++++++++.+..+|+..+ T Consensus 480 ~i~~~~ee~-----~~-~~~q~~~~~~~~~~~~~~ 508 (535) T protein:vir:33 480 GILLTDEQK-----QA-LMMQDAAQTGVENAAAAG 508 (535) T ss_pred HhcCCHHHH-----HH-HHHHHHHHHHHHHHHHhh Confidence 677773322 22 222233333333333333 No 19 >protein:vir:1538 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052106;swissprot:trembl:q9t110;genbank:gi:9634032;uniprot:Q9T110;genbank:GeneID:1262384 Probab=100.00 E-value=4.1e-46 Score=269.39 Aligned_cols=488 Identities=13% Similarity=0.098 Sum_probs=318.9 Q ss_pred hccchhhhHH-HHHHHHHhhHHhhhhhHHHHHHHHHHHhhcccccccccccccccccccccchHHHHHHHHHHHHHhhcC Q lcl|NC_015158. 11 MLDDTRDGLA-EQIANTWQNWNSQRQEWLSQKSELRNYIFATDTTTTTNSTLPWKNKTTLPKLCQIRDNLHSNYISALFP 89 (581) Q Consensus 11 ~~~~~~~~~a-~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~~~~~~~~~~k~~~~~pki~~~~d~~~~~l~~~~f~ 89 (581) |.+..+..|| ..++++|+.+++.|++|+..|.+|.+|+.|+.....+........+++-+.-...+++|.+.||+.+|| T Consensus 1 m~~~~~~~~~~~~~k~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP 80 (535) T protein:vir:15 1 MADSKRTGLGEDGAKATYDRLTNDRRAYETRAENCAQYTIPSLFPKESDNESTDYTTPWQAVGARGLNNLASKLMLALFP 80 (535) T ss_pred CCccchhccchHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHHHHHhhcC Confidence 7766666666 455579999999999999999999999999864333333323334566677789999999999999999 Q ss_pred CccEEEeecCChhH----------HHH---HHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeee Q lcl|NC_015158. 90 NERWLKWEGKSLQD----------EAK---RDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEES 156 (581) Q Consensus 90 ~~~~~~~~~~~~~d----------~~~---ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~ 156 (581) +..||++.+...+. .+. -+..++.|...|..|||+..+++.+.|++.+|||++.++...+. T Consensus 81 ~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~------ 154 (535) T protein:vir:15 81 MQSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFECLKQLIVAGNALLYLPEPEGS------ 154 (535) T ss_pred CCcccccccChHHHhccCCCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeEEeecCCCC------ Confidence 99999998765321 111 24567888899999999999999999999999999988753221 Q ss_pred eeEeeeeccceEEecchhheeecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchhh Q lcl|NC_015158. 157 GATRDTYFGPRAVRIDPKDIVFNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDC 236 (581) Q Consensus 157 ~~~~~~~~~p~ie~V~p~df~~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~ 236 (581) ...+..+++-++++..++..--|+ .+.+..+|..+|-+-. ..++. ++. +.+ T Consensus 155 --------~~~f~~~pl~~~~v~~d~~G~vd~-i~r~~~~t~~~l~~~~---------~~~~~------~~~--~~~--- 205 (535) T protein:vir:15 155 --------YNPMKLYRLSSYVVQRDAYGNVLQ-IVTRDQIAFGALPEDV---------RSAVE------KAG--GEK--- 205 (535) T ss_pred --------ceeeEEEEcCeeEEeeCCCCCeeE-EEEeEeecHHHHHHHH---------hHhhh------ccc--ccc--- Confidence 124567888999999887642222 2345677877663211 00000 000 000 Q ss_pred hhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccccCCc Q lcl|NC_015158. 237 EKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIRQDN 316 (581) Q Consensus 237 ~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p~~ 316 (581) -....++++++.- .+.+++.+ .++....|..+ ...+..++++..||.+..|...++. T Consensus 206 -----------------~~~~~v~v~~~v~--~~~~~~~~---~~~~e~~g~~~-~~~~~~~~~~~~P~i~~Rw~~~~ge 262 (535) T protein:vir:15 206 -----------------KMDEMVDVYTHVY--LDEESGDY---LKYEEVEDVEI-DGSDATYPTDAMPYIPVRMVRIDGE 262 (535) T ss_pred -----------------CCCCceeEEEEEE--EecCCCcE---EEEEEeeCccc-cccccccccccCCceeeeeeecCCC Confidence 0112455665541 23344444 22333445433 3333445678899999999999999 Q ss_pred ccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc----ccccc-cCCceeEEeCCCCCcccccC--CCccchh Q lcl|NC_015158. 317 LYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD----VEEFV-WGPMEQIYINGDGDVEMMAP--NTQALQA 389 (581) Q Consensus 317 ~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d----~~~i~-~~pG~vi~~~~~~~i~~~~~--p~~~~~~ 389 (581) .||+|+++...+..+.+|.+.+.++.+..++++|++.+..+ +.++. ..+|. |....++++++++. +..+..+ T Consensus 263 ~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~lv~~~g~~~~~~l~~~~~g~-~v~g~~~~v~~~~~~~~~~~~~~ 341 (535) T protein:vir:15 263 SYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVNPAGITQPRRLTKAQTGD-FVPGRREDIDFLQLEKQADFTVA 341 (535) T ss_pred ccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecccccccchhcccCCcee-eecCCcccceeeecccccchhHH Confidence 99999999999999999999999999999999999988643 44443 23344 44566677777653 3345556 Q ss_pred HHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceee Q lcl|NC_015158. 390 DMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADTIR 469 (581) Q Consensus 390 ~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~iR 469 (581) ...++.+.+.+.+.. ... +.+..+....|||+|.+..+.....+.-+..++..+++.|||...+..+.+..- T Consensus 342 ~~~i~~~~~~I~~af-~~~-~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~------ 413 (535) T protein:vir:15 342 KAVSDQIEARLSYAF-MLN-SAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATSQ------ 413 (535) T ss_pred HHHHHHHHHHHHHHH-hhh-hcccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCC------ Confidence 666777777776643 111 233344456899999999999999999999999999999999988777655322 Q ss_pred ecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhccc--ccc-cccchhHHHHHHHHHHHHhcCCCc Q lcl|NC_015158. 470 VFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTP--VWQ-DIKPHVSTENLAKMLEHNLSLGGW 546 (581) Q Consensus 470 ~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~--~~~-~i~p~~~~~~l~~~~~e~~~l~~~ 546 (581) +..++.++++.. ....-+-++|.+.++.|.+.++.. +++ .+.+.++..++++.+++..|++.. T Consensus 414 ----------lP~~p~~~v~~~----yis~La~aqr~~~~~~l~~~~~~la~~~P~~ld~~id~d~~~~~~a~~~Gvp~~ 479 (535) T protein:vir:15 414 ----------IPELPKEAVEPT----ISTGLEAIGRGQDLDKLERCISAWAALAPMQGDPDINLAVIKLRIANAIGIDTS 479 (535) T ss_pred ----------CCCCCccceeEE----EecHHHHHHHHHHHHHHHHHHHHHHhcChhhhhccCCHHHHHHHHHHHcCCChh Confidence 223333444333 232223345555555555554332 233 234568888999999999999976 Q ss_pred ccccCCCCcHHHHHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 547 DIFKPNVAVMEAQTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 547 ~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) .++++.-++.+.+++| |+++++..++.+.+-.+ T Consensus 480 ~i~~~~eev~~~~~q~--~~~~~~~~~a~~~g~~~ 512 (535) T protein:vir:15 480 GILLTDEQKQALMMQD--AAQTGIENAAATGGAGV 512 (535) T ss_pred hhcCCHHHHHHHHHHH--HHHHHHHHHHHHHHhhc Confidence 6777733332222111 11111222222222223 No 20 >protein:vir:1785 Length: 555 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570351;genbank:gi:18640510;genbank:GeneID:932723 Probab=100.00 E-value=7e-46 Score=268.13 Aligned_cols=494 Identities=14% Similarity=0.085 Sum_probs=317.4 Q ss_pred HHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhcccccccccccccccccccccchHHHHHHHHHHHHHhhcC-CccEEEee Q lcl|NC_015158. 19 LAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTTTTTNSTLPWKNKTTLPKLCQIRDNLHSNYISALFP-NERWLKWE 97 (581) Q Consensus 19 ~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~~~~~~~~~~k~~~~~pki~~~~d~~~~~l~~~~f~-~~~~~~~~ 97 (581) .=..+++.|+.+++.|++|+..|.+|.+|+.|+.....++.......+++-+.-...+++|.+.||+.+|| +..||++. T Consensus 1 m~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltpp~~~WF~l~ 80 (555) T protein:vir:17 1 MKHSAQAKYMMLRADREDYLDSGRQSARLTLPYILTDEGHVQGGYLPTPWQSVGSKGVNVLASKLMLSLFPVNTSFFKLQ 80 (555) T ss_pred ChhHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHHHHHhhcCCCCcccccc Confidence 33447789999999999999999999999999876655554444445677777789999999999999999 89999999 Q ss_pred cCChhHHH-------------HHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeeeeEeeeec Q lcl|NC_015158. 98 GKSLQDEA-------------KRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESGATRDTYF 164 (581) Q Consensus 98 ~~~~~d~~-------------~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~~~~~~~~ 164 (581) +...+.++ .-+.+++.+...|..|||+..+++.+.|++.+|||++-++ +. T Consensus 81 ~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~---~~-------------- 143 (555) T protein:vir:17 81 INDAEIDNLGMDEQARSEIDLSLSRIERIVTQDIAESSDRVHLEMAMKHLIVTGNALLYQG---KK-------------- 143 (555) T ss_pred cCHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEEec---CC-------------- Confidence 87644211 1234677888999999999999999999999999986332 11 Q ss_pred cceEEecchhheeecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchhhhhcccccc Q lcl|NC_015158. 165 GPRAVRIDPKDIVFNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDCEKAVGFSM 244 (581) Q Consensus 165 ~p~ie~V~p~df~~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~ 244 (581) .+..+++-++++..++..--++ ++.+..+|..+|.+..+...- .+-.+.....+.+..... ++- T Consensus 144 --~~~~~pl~~y~v~~d~~G~vd~-v~rk~~~t~~ql~~~fg~~~l-----~~~~~~~~~~~~d~~~~~--~~~------ 207 (555) T protein:vir:17 144 --NLKLYPLDRFVVSRDGEGNVME-IVTEEQIDRSLLPEEFQKVGG-----LEGAPDSNAVGEDGPKMG--VTA------ 207 (555) T ss_pred --ceeEEEcCeEEEeeCCCcCeeE-EEeeeeecHHHHHHHhhhccc-----cchhhhhhhccccchhhh--hhh------ Confidence 1344667788888876532222 345667798888654321110 000000011111110000 000 Q ss_pred ccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEE-EeecCCCccCCCCeeEecccccCCcccCCCcH Q lcl|NC_015158. 245 DGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVI-EEKENPSWFAQAPIFHCGWRIRQDNLYAMGPL 323 (581) Q Consensus 245 ~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~ii-r~~~nP~~~g~~Pf~~~~~~~~p~~~~G~s~~ 323 (581) ....+......++++.++ ....+.+ .+..-.+|..+. ...+.|| ...||.+..|..+++..||.|++ T Consensus 208 ---~~~~~~~~~~~~~v~t~~----~~~~~~~---~~~~e~~~~~v~~~l~e~g~--~e~P~i~~Rw~~~~ge~YGrgp~ 275 (555) T protein:vir:17 208 ---PGGRDKGKSNDALVYTYV----CRKDGQV---KWHQECDGKVIPGSNSSAPY--THNPWIPLRFNIVDGEAYGRGRV 275 (555) T ss_pred ---hcccccCCCcceeEeecc----cccCCee---EEEEecCceeccccccccCc--ccCCeeeeeeeecCCCccccchH Confidence 001111222334444433 1222221 111122344331 1345555 47899999999999999999999 Q ss_pred HhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc----ccccccCCceeEEeCCCCCcccccCCC--ccchhHHHHHHHH Q lcl|NC_015158. 324 DNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD----VEEFVWGPMEQIYINGDGDVEMMAPNT--QALQADMQIQILE 397 (581) Q Consensus 324 ~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d----~~~i~~~pG~vi~~~~~~~i~~~~~p~--~~~~~~~~lq~~~ 397 (581) +...+.++.+|.+.+.+++++.++++|++.+..+ +.++...+++.|..+.++++++++... ....+...++.+. T Consensus 276 ~~~l~D~k~L~~l~~~~l~~~~~~~~pp~lv~~~g~~~~~~l~~~~~g~v~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~ 355 (555) T protein:vir:17 276 EEFMGDLKSLEALSQAMVEGSAASAKVVFMVSPSATTKPQNLALAANGAIIQGRPDDVSVVQANKAADFRTVLEMIQKLE 355 (555) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccCcceeecCCCceeecCCcccceeeeccccchhhHHHHHHHHHH Confidence 9999999999999999999999999999988643 455555555667677778888887543 3445667778887 Q ss_pred HHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceeeecCchhcc Q lcl|NC_015158. 398 AKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADTIRVFDSDDKV 477 (581) Q Consensus 398 ~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~iR~~~~~~~~ 477 (581) +.+.+..-. ....+....|||+|....+.....+.-+..++..+++.|||...+.++.+... T Consensus 356 ~~I~~aFm~----~~~~d~~r~TAtEV~~r~~E~~~~LGpv~~rl~~E~L~Pli~R~~~il~r~g~-------------- 417 (555) T protein:vir:17 356 QRISDAFLM----LQVRQSERTTATEVQATVQELNEQIGGIYSNLTTELLQPYLARKLHLLQKQRK-------------- 417 (555) T ss_pred HHHHHHHhh----cCCCCcccchHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhCCC-------------- Confidence 777665321 22334456899999999999999999999999999999999999888766543 Q ss_pred cCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhc----ccccccccchhHHHHHHHHHHHHhcCCCcccccCCC Q lcl|NC_015158. 478 ATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIAN----TPVWQDIKPHVSTENLAKMLEHNLSLGGWDIFKPNV 553 (581) Q Consensus 478 ~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~----~~~~~~i~p~~~~~~l~~~~~e~~~l~~~~~~~~~~ 553 (581) +..++.+.++.+.. +|- .-+.|.+.++.+.+.++ ....+.+...++..++++.+++.+|++--.++++.- T Consensus 418 --lP~~p~~~v~~~i~---~~l-~~l~r~~~~~~l~~~~~~laq~~~~p~~~d~id~d~~~~~~a~~~Gv~p~~ivrs~e 491 (555) T protein:vir:17 418 --LPQLPKDLVQPTVV---AGL-WGVGRGQDKQQLMEFITTLAQTMGPEIAMKYINPTEFIKRLAAAQGIDTLQLINSPE 491 (555) T ss_pred --CCCCCHhhhcccee---ehH-HHHHHHHHHHHHHHHHHHHHhhcCchhHhhcCCHHHHHHHHHHHcCCChhhhcCCHH Confidence 33455555544432 222 22334444444444333 211234556788889999999999996557777755 Q ss_pred CcHHHHHHHHHHHHHHHHHHHHhcc-------------cCC Q lcl|NC_015158. 554 AVMEAQTTSALVNQSQAQIEEEAQV-------------PLV 581 (581) Q Consensus 554 ~~~~~~~~q~~~q~aq~~~~~~~~~-------------~~~ 581 (581) ++.+..++|..+++.|+.++..+|+ ++- T Consensus 492 ev~~~rq~~~~~~~q~~~~~qa~~~~~~~~~~~~~~~~~~~ 532 (555) T protein:vir:17 492 TMKQLGDQQKQDMVQASLINQAGQLAKTPMAEQAMQLIQQQ 532 (555) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhHHhccccc Confidence 5543322222222222222221111 111 No 21 >protein:vir:8883 Length: 543 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813772;genbank:gi:29366727;genbank:GeneID:1258836 Probab=100.00 E-value=1.1e-45 Score=267.17 Aligned_cols=492 Identities=13% Similarity=0.142 Sum_probs=325.7 Q ss_pred hccchhhhHH-HHHHHHHhhHHhhhhhHHHHHHHHHHHhhcccccccccccccccccccccchHHHHHHHHHHHHHhhcC Q lcl|NC_015158. 11 MLDDTRDGLA-EQIANTWQNWNSQRQEWLSQKSELRNYIFATDTTTTTNSTLPWKNKTTLPKLCQIRDNLHSNYISALFP 89 (581) Q Consensus 11 ~~~~~~~~~a-~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~~~~~~~~~~k~~~~~pki~~~~d~~~~~l~~~~f~ 89 (581) |.+-.++++| ..++++|+.+++.|++|+..|.+|.+|++|+..+..+.........++-+.-...+++|.+.||+.+|| T Consensus 1 ~~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP 80 (543) T protein:vir:88 1 MAETKREGLAEEGAKAVYERLKNDRVPYETRAENCAKVTIPSLFPKDSDNSSTDYTTPWQAVGARGLNNLSAKVMLALFP 80 (543) T ss_pred CcccccCcchHHHHHHHHHHHHHHHhHHHHHHHHHHHHhccccCCCCCCcccccccccccchHHHHHHHHHHHHHHhhcC Confidence 7775566666 677889999999999999999999999999854433333333334566677789999999999999999 Q ss_pred CccEEEeecCChhHH----------H---HHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeee Q lcl|NC_015158. 90 NERWLKWEGKSLQDE----------A---KRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEES 156 (581) Q Consensus 90 ~~~~~~~~~~~~~d~----------~---~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~ 156 (581) +..||++........ + --+..++.|...|..|||+..+++.+.|++.+|||++-++=..+ + T Consensus 81 ~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~ly~~~~~~------~ 154 (543) T protein:vir:88 81 LQSWMKLKVSEWQAKQLVSDPSQLAVVEQGLGMVERILMSYMEANSYRVTLFELIRQLALAGTALIYLPPPDA------S 154 (543) T ss_pred CCcccccccChHHHhcccCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeeeccCcc------c Confidence 999999987653321 1 12445677888999999999999999999999999875531100 0 Q ss_pred eeEeeeeccceEEecchhheeecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchhh Q lcl|NC_015158. 157 GATRDTYFGPRAVRIDPKDIVFNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDC 236 (581) Q Consensus 157 ~~~~~~~~~p~ie~V~p~df~~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~ 236 (581) +. .+ -.+..+++.++++..++..-=+ ..+.|..+|..+|-.. ..+++..... .++ T Consensus 155 ~~---~~--~~~~~~pl~~y~v~~d~~G~v~-~i~r~~~~~~~~l~~~---------~~~~v~~~~~----~~p------ 209 (543) T protein:vir:88 155 SN---SY--NPMKLYTLHNHVVQRDAFGNVL-QIVTLDKVAYAALPED---------VRNSLSGGQE----YKP------ 209 (543) T ss_pred cc---ee--cceEEeEcceEEEeeCCCCCee-eeeeeeeccHHHHhHH---------hhHHHHHHhh----cCC------ Confidence 00 00 0133466677888866543111 1233556677766321 1111111110 000 Q ss_pred hhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccccCCc Q lcl|NC_015158. 237 EKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIRQDN 316 (581) Q Consensus 237 ~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p~~ 316 (581) ...++|+++- +...+.+.+..+. -+. +..+...++.|+.+..||.+..|...++. T Consensus 210 -------------------~~~~~v~~~V--~pr~~~~~~~~~~---~~~-~~~v~~~~~~~~~~e~P~i~~Rw~~~~ge 264 (543) T protein:vir:88 210 -------------------EQELEVYTHI--YIDDESGDFLSYQ---EIE-GVEVDGSDGQYPQDALPWIAVRWTKRDGE 264 (543) T ss_pred -------------------ccceEEEEEE--EeecCCCcccccc---ccc-CeeeecCCCccccccCCceeeeeeecCCC Confidence 0124444321 1122223332111 122 34555556677778899999999999999 Q ss_pred ccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc----cccccc-CCceeEEeCCCCCcccccC--CCccchh Q lcl|NC_015158. 317 LYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD----VEEFVW-GPMEQIYINGDGDVEMMAP--NTQALQA 389 (581) Q Consensus 317 ~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d----~~~i~~-~pG~vi~~~~~~~i~~~~~--p~~~~~~ 389 (581) .||.|+++...+..+.+|.+.+.++++..++++|++.+..+ +.++.. .+|. |-.+.++++.+++. +.....+ T Consensus 265 ~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~~v~~~g~~~~~~~~~~~~g~-~v~g~~~~v~~~~~~~~~~~~~~ 343 (543) T protein:vir:88 265 HYGRSHVEEYLGDLNSLESLNEAMIKFAMISSKVVGLVNPNGITQVRRLVKAQTGD-FVAGRKADIEFLQLEKTADFTVA 343 (543) T ss_pred ccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccchhhcccCCCce-eecCCCCcceeeecccccchhHH Confidence 99999999999999999999999999999999999988654 444433 2333 44456677776543 3456667 Q ss_pred HHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceee Q lcl|NC_015158. 390 DMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADTIR 469 (581) Q Consensus 390 ~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~iR 469 (581) ...|+.+.+.+.+..=+. +.+..+....|||+|....+.....+.-+..++..+++.|||...+.++.+... T Consensus 344 ~~~i~~~~~rI~~af~~~--~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~------ 415 (543) T protein:vir:88 344 KSVADAIEARLSYVFMLN--SAVQRSGERVTAEEIRYVASELEDTLGGVYSILSQELQLPIVRVLLNQLQATQQ------ 415 (543) T ss_pred HHHHHHHHHHHHHHHhhh--hhccCCCCcccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCC------ Confidence 778888888887754221 233344456899999999999999999999999999999999988877665332 Q ss_pred ecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhccc--cc-ccccchhHHHHHHHHHHHHhcCCCc Q lcl|NC_015158. 470 VFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTP--VW-QDIKPHVSTENLAKMLEHNLSLGGW 546 (581) Q Consensus 470 ~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~--~~-~~i~p~~~~~~l~~~~~e~~~l~~~ 546 (581) +..++.++++..+ + + ..+-+.|.+.++.|.++++.. +. ..+.+.++..++++.+++..|++-. T Consensus 416 ----------lP~~p~~~v~~~~--v-s-~l~~l~r~~~~~~l~~~~~~v~~~~~p~vld~id~d~~~~~~a~~~Gv~~~ 481 (543) T protein:vir:88 416 ----------IPNLPQEAVEPTV--T-T-GAEALGRGQDLDKLTQFLNAVATVSQLNGDPDLNVNNIKLRLANAIGIDTA 481 (543) T ss_pred ----------CCCCchhceeeeE--E-e-cHHHHHHHHHHHHHHHHHHHHHhccchhhhccCCHHHHHHHHHHHhCCChh Confidence 3445556664443 2 1 223366777666666665432 22 2466778899999999999999766 Q ss_pred ccccCCCCcHHHHHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 547 DIFKPNVAVMEAQTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 547 ~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) .++++.-++.+.+++|++||+++++.+++.....- T Consensus 482 ~i~r~~~e~~~~~~q~~~q~~~~~~~~~~~~~~~~ 516 (543) T protein:vir:88 482 GLLLTEAEKAQAQSQEMLKQGGLNAAAGIGSGVAA 516 (543) T ss_pred hhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhchhh Confidence 77888666655555555444444433332222211 No 22 >protein:vir:10447 Length: 536 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848294;genbank:gi:30387485;genbank:GeneID:1733984 Probab=100.00 E-value=1.2e-45 Score=266.92 Aligned_cols=489 Identities=11% Similarity=0.083 Sum_probs=335.9 Q ss_pred hccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhcccccccccccccccccccccchHHHHHHHHHHHHHhhcCC Q lcl|NC_015158. 11 MLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTTTTTNSTLPWKNKTTLPKLCQIRDNLHSNYISALFPN 90 (581) Q Consensus 11 ~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~~~~~~~~~~k~~~~~pki~~~~d~~~~~l~~~~f~~ 90 (581) |.|.++..+|..++++|+.+++.|++|+..|.+|.+|+.|+.....++.......+++-+.....+.+|.+.||+.+||+ T Consensus 1 m~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~ 80 (536) T protein:vir:10 1 MAEKRTGLAEDGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYQTPWQAVGARGLNNLASKLMLALFPM 80 (536) T ss_pred CcchhhchhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHHHHhhhcCC Confidence 88888888999999999999999999999999999999998766666655555566777878899999999999999999 Q ss_pred ccEEEeecCChhHH-------H------HHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeee Q lcl|NC_015158. 91 ERWLKWEGKSLQDE-------A------KRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESG 157 (581) Q Consensus 91 ~~~~~~~~~~~~d~-------~------~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~ 157 (581) ..||++.....+.+ . --+..++.+...|..|||+..+++.+.|++.+|||++-++-... T Consensus 81 ~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~e~~~-------- 152 (536) T protein:vir:10 81 QTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEG-------- 152 (536) T ss_pred CcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEEeeCCC-------- Confidence 99999977664421 1 13456778889999999999999999999999999986642110 Q ss_pred eEeeeeccc-eEEecchhheeecCCCCC-cccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchh Q lcl|NC_015158. 158 ATRDTYFGP-RAVRIDPKDIVFNPVAVD-FAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTRED 235 (581) Q Consensus 158 ~~~~~~~~p-~ie~V~p~df~~DP~a~~-~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 235 (581) .++ .++.+++.++++..++.. +.. .+.+..+|..+|-+-.. .+. . ....+ T Consensus 153 ------~~~~~~~~~pl~~~~v~~d~~G~vd~--i~r~~~~t~~~l~~~fg---------~~~----~-----~~~~~-- 204 (536) T protein:vir:10 153 ------SNYNPMKLYRLSSYVVQRDAFGNVLQ--MVTRDQIAFGALPEDIR---------KAV----E-----GQGGE-- 204 (536) T ss_pred ------CceeeEEEEEcCeEEEeeCCCCCeeE--EeeeeeccHHHHHHhhh---------hhh----c-----ccccc-- Confidence 012 356778899999988553 332 34566778777643211 000 0 00000 Q ss_pred hhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccccCC Q lcl|NC_015158. 236 CEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIRQD 315 (581) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p~ 315 (581) +.+ ...++++++- +.+..++++. +....+|..+++.+. -+++...||.+..|...++ T Consensus 205 ---------------~~~--~~~v~v~~~V--~~~~~~~~~~---~~~e~~g~~v~~~~g-~~~f~~~P~i~~Rw~~~~g 261 (536) T protein:vir:10 205 ---------------KKA--DETIDVYTHI--YLDEASGEYL---RYEEVEGMEVQGSDG-TYPKEACPYIPIRMVRLDG 261 (536) T ss_pred ---------------cCc--ccceEEEEEE--EEecCCCcEE---EEEeecCcccccccc-ccccccCCceeeeeeecCC Confidence 000 1245555543 1233445543 122346776666543 4456789999999999999 Q ss_pred cccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc----cccc-ccCCceeEEeCCCCCcccc--cCCCccch Q lcl|NC_015158. 316 NLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD----VEEF-VWGPMEQIYINGDGDVEMM--APNTQALQ 388 (581) Q Consensus 316 ~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d----~~~i-~~~pG~vi~~~~~~~i~~~--~~p~~~~~ 388 (581) ..||.|+++...+..+.+|.+.+.++.+..+++.|++.|+.+ ++.+ ...||.++. ..++++++. ..+..... T Consensus 262 e~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~~~~~g~~v~-g~~~~v~~~~~~~~~~~~~ 340 (536) T protein:vir:10 262 ESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQTGDFVT-GRPEDISFLQLEKQADFTV 340 (536) T ss_pred CccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccCcccccchhhhccCCCcceec-CCcccceeeeccccccchH Confidence 999999999999999999999999999999999999888643 4443 456777654 444666544 34445556 Q ss_pred hHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccee Q lcl|NC_015158. 389 ADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADTI 468 (581) Q Consensus 389 ~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~i 468 (581) +...++.+++.+.+..=+. +.+..+....||++|....+.....+.-+..++..+++.|||...+..+.+..- T Consensus 341 ~~~~i~~~~~rI~~af~~~--~l~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~----- 413 (536) T protein:vir:10 341 AKAVSDAIEARLSFAFMLN--SAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQ----- 413 (536) T ss_pred HHHHHHHHHHHHHHHHhhh--hcccCCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhCCC----- Confidence 7777888888887755221 233344446899999999999999999999999999999999988777654322 Q ss_pred eecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhccc--cccc-ccchhHHHHHHHHHHHHhcCCC Q lcl|NC_015158. 469 RVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTP--VWQD-IKPHVSTENLAKMLEHNLSLGG 545 (581) Q Consensus 469 R~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~--~~~~-i~p~~~~~~l~~~~~e~~~l~~ 545 (581) +..++.+.++.++ +.. .+-++|.+.++.|.++++.. +++. +.+.++..++++.+++..|+.- T Consensus 414 -----------lP~~p~~~v~~~~-vs~---l~~l~r~~~~~~l~~~~~~la~~~P~~ld~~id~d~~~~~~a~~~Gv~p 478 (536) T protein:vir:10 414 -----------IPELPKEAVEPTI-STG---LEAIGRGQDLDKLERCVTAWAALAPMRDDPDINLAMIKLRIANAIGIDT 478 (536) T ss_pred -----------CCCCChhhccceE-Eec---HHHHHHHHHHHHHHHHHHHHHhhchhhhcccCCHHHHHHHHHHHcCCCc Confidence 3344555555553 222 23466777777777766542 2232 2356888899999999999955 Q ss_pred cccccCCCCcHHHHHHHHHHHHHHHHHHHHh---cccCC Q lcl|NC_015158. 546 WDIFKPNVAVMEAQTTSALVNQSQAQIEEEA---QVPLV 581 (581) Q Consensus 546 ~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~---~~~~~ 581 (581) .+++++.-++.+.++++.++++.++..++.. +..+- T Consensus 479 ~~~irt~eev~~~r~q~~~~~~~~~~a~~~~~~~~~~~~ 517 (536) T protein:vir:10 479 SGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQAT 517 (536) T ss_pred hhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 5678875444433333333222222222211 10110 No 23 >protein:vir:2198 Length: 536 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041995;swissprot:sw:p03728;genbank:gi:9627467;goa:P03728;uniprot:P03728;genbank:GeneID:1261033 Probab=100.00 E-value=1.3e-45 Score=266.62 Aligned_cols=489 Identities=11% Similarity=0.084 Sum_probs=334.5 Q ss_pred hccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhcccccccccccccccccccccchHHHHHHHHHHHHHhhcCC Q lcl|NC_015158. 11 MLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTTTTTNSTLPWKNKTTLPKLCQIRDNLHSNYISALFPN 90 (581) Q Consensus 11 ~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~~~~~~~~~~k~~~~~pki~~~~d~~~~~l~~~~f~~ 90 (581) |.|.++..+|..++++|+.+++.|++|+..|.+|.+|+.|+.....++.......+++-+.....+.+|.+.||+.+||+ T Consensus 1 m~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~ 80 (536) T protein:vir:21 1 MAEKRTGLAEDGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYQTPWQAVGARGLNNLASKLMLALFPM 80 (536) T ss_pred CcchhhchhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHHHHHhhcCC Confidence 88888888999999999999999999999999999999998766666555555566777888899999999999999999 Q ss_pred ccEEEeecCChhHH-------H------HHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeee Q lcl|NC_015158. 91 ERWLKWEGKSLQDE-------A------KRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESG 157 (581) Q Consensus 91 ~~~~~~~~~~~~d~-------~------~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~ 157 (581) ..||++.....+.+ . --+..++.+...|..|||+..+++.+.|++.+|||++-++-... T Consensus 81 ~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~e~~~-------- 152 (536) T protein:vir:21 81 QTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEG-------- 152 (536) T ss_pred CcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEEeeCCC-------- Confidence 99999977664421 1 13456778889999999999999999999999999986642110 Q ss_pred eEeeeeccc-eEEecchhheeecCCCCC-cccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchh Q lcl|NC_015158. 158 ATRDTYFGP-RAVRIDPKDIVFNPVAVD-FAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTRED 235 (581) Q Consensus 158 ~~~~~~~~p-~ie~V~p~df~~DP~a~~-~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 235 (581) .++ .++.+++.++++..++.. +.. .+.+..+|..+|.+.... +.. ....+. T Consensus 153 ------~~~~~f~~~pl~~~~v~~d~~G~vd~--i~r~~~~t~~~l~~~fg~---------~~~---------~~~~~~- 205 (536) T protein:vir:21 153 ------SNYNPMKLYRLSSYVVQRDAFGNVLQ--MVTRDQIAFGALPEDIRK---------AVE---------GQGGEK- 205 (536) T ss_pred ------CceeeEEEEEcCeEEEeeCCCCCeeE--EeeeeeccHHHHHHhhhh---------hhc---------cccccc- Confidence 012 356778899999988553 222 345667787776443110 000 000000 Q ss_pred hhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccccCC Q lcl|NC_015158. 236 CEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIRQD 315 (581) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p~ 315 (581) . -...++++++- +.+.+++.+.- ..-.+|..+++.+. -+++...||.+..|...++ T Consensus 206 ----------------~--~~~~v~v~~~v--~~~~~~~~~~~---~~e~~g~~v~~~~g-~~~f~~~P~i~~Rw~~~~g 261 (536) T protein:vir:21 206 ----------------K--ADETIDVYTHI--YLDEDSGEYLR---YEEVEGMEVQGSDG-TYPKEACPYIPIRMVRLDG 261 (536) T ss_pred ----------------c--cccceeEEEEE--EEecCCCcEEE---EeccCCeeeccccC-ccccccCCeeeeeeeecCC Confidence 0 01234554332 22334555421 11246666655543 4456789999999999999 Q ss_pred cccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc----cccc-ccCCceeEEeCCCCCcccc--cCCCccch Q lcl|NC_015158. 316 NLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD----VEEF-VWGPMEQIYINGDGDVEMM--APNTQALQ 388 (581) Q Consensus 316 ~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d----~~~i-~~~pG~vi~~~~~~~i~~~--~~p~~~~~ 388 (581) ..||.|+++...+..+.+|.+.+.++.+..+++.|++.|+.+ ++.+ ...||.++. ..++++++. ..+..... T Consensus 262 e~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~~~~~g~~v~-g~~~~v~~~~~~~~~~~~~ 340 (536) T protein:vir:21 262 ESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQTGDFVT-GRPEDISFLQLEKQADFTV 340 (536) T ss_pred CccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccCcccccchhhhccCCCcceec-CCcccceeeeccccccchH Confidence 999999999999999999999999999999999999888643 4443 456777654 444666544 34445666 Q ss_pred hHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccee Q lcl|NC_015158. 389 ADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADTI 468 (581) Q Consensus 389 ~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~i 468 (581) +...++.+++.+.+..=+. +.+..+....||++|....+.....+.-+..++..+++.|||...+..+.+..- T Consensus 341 ~~~~i~~~~~rI~~af~~~--~l~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~----- 413 (536) T protein:vir:21 341 AKAVSDAIEARLSFAFMLN--SAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQ----- 413 (536) T ss_pred HHHHHHHHHHHHHHHHhhh--hcccCCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhCCC----- Confidence 7778888888887755221 233344446899999999999999999999999999999999988877654322 Q ss_pred eecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhccc--cccc-ccchhHHHHHHHHHHHHhcCCC Q lcl|NC_015158. 469 RVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTP--VWQD-IKPHVSTENLAKMLEHNLSLGG 545 (581) Q Consensus 469 R~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~--~~~~-i~p~~~~~~l~~~~~e~~~l~~ 545 (581) +..++.+.++.++ +.. .+-++|.+.++.|.+.++.. +++. +.+.++...+++.+++..|+.- T Consensus 414 -----------lP~~p~~~v~~~~-vs~---l~~l~r~~~~~~l~~~~~~la~~~Pe~ld~~id~d~~~~~~a~~~Gv~p 478 (536) T protein:vir:21 414 -----------IPELPKEAVEPTI-STG---LEAIGRGQDLDKLERCVTAWAALAPMRDDPDINLAMIKLRIANAIGIDT 478 (536) T ss_pred -----------CCCCChhhccceE-Eec---HHHHHHHHHHHHHHHHHHHHHhhchhhhcccCCHHHHHHHHHHHcCCCh Confidence 3344455555553 222 23466777777777766542 2222 3356888899999999999955 Q ss_pred cccccCCCCcHHHHHHHHHHHHHHHHHHHH---hcccCC Q lcl|NC_015158. 546 WDIFKPNVAVMEAQTTSALVNQSQAQIEEE---AQVPLV 581 (581) Q Consensus 546 ~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~---~~~~~~ 581 (581) .+++++.-++.+..++++++++.++..++. ++..+- T Consensus 479 ~~~irt~eev~~~r~q~~~~~~~~~~a~~~~~~~~~~~~ 517 (536) T protein:vir:21 479 SGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQAT 517 (536) T ss_pred hhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 567887544443333333222222222211 111111 No 24 >protein:vir:94709 Length: 522 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338118;genbank:gi:77118196;genbank:GeneID:3707732 Probab=100.00 E-value=1.2e-45 Score=266.82 Aligned_cols=485 Identities=11% Similarity=0.074 Sum_probs=322.8 Q ss_pred hccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhcccccccccccccccccccccchHHHHHHHHHHHHHhhcCC Q lcl|NC_015158. 11 MLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTTTTTNSTLPWKNKTTLPKLCQIRDNLHSNYISALFPN 90 (581) Q Consensus 11 ~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~~~~~~~~~~k~~~~~pki~~~~d~~~~~l~~~~f~~ 90 (581) |.+ |+-..+..++++|+.+++.|++|+..|.+|.+|+.|+.....+........+++-+.....+++|.+.||+.+||+ T Consensus 1 ~~~-~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Las~l~~~ltP~ 79 (522) T protein:vir:94 1 MAE-REGFAAEGAKAVYDRLKNGRQPYETRAQNCAAVTIPSLFPKESDNSSTEYTTPWQAVGARCLNNLAAKLMLALFPQ 79 (522) T ss_pred Ccc-cchhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHHHHhhcCCC Confidence 555 5666689999999999999999999999999999998765555554444455777777899999999999999999 Q ss_pred ccEEEeecCChhH-------------HHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeee Q lcl|NC_015158. 91 ERWLKWEGKSLQD-------------EAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESG 157 (581) Q Consensus 91 ~~~~~~~~~~~~d-------------~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~ 157 (581) ..||++.+..... ++.-+..++.|...|..|||+..+++.+.|++.+|||++-+.=... + T Consensus 80 ~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~-------~ 152 (522) T protein:vir:94 80 SPWMRLTVSEYEAKTLSQDSEAAARVDEGLAMVERVLMAYMETNSFRVPLFEALKQLIVSGNCLLYIPEPEQ-------G 152 (522) T ss_pred CcccccccchhhhhccCcccchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcEeEeeeccCC-------C Confidence 9999998664221 1223556778889999999999999999999999999976531100 0 Q ss_pred eEeeeeccceEEecchhheeecCCCCC-cccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchhh Q lcl|NC_015158. 158 ATRDTYFGPRAVRIDPKDIVFNPVAVD-FAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDC 236 (581) Q Consensus 158 ~~~~~~~~p~ie~V~p~df~~DP~a~~-~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~ 236 (581) ....++.+++.++++..++.. +. ..+.+..+|.+.|- +++.+.... + T Consensus 153 ------~~~~~~~~pl~~y~v~~d~~G~vd--~i~r~~~~~~~~l~-------------~~~~~~~~~----~------- 200 (522) T protein:vir:94 153 ------TYSPMRMYRLVSYVVQRDAFGNIL--QIVTIDKVAFSALP-------------EDVKSQLNA----D------- 200 (522) T ss_pred ------ceeeEEEEEcceEEEeeCCCcCeE--EEeeeeeccHHhcc-------------hHHHHHHhc----c------- Confidence 001356677888998877542 22 22345566655431 111111100 0 Q ss_pred hhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccccCCc Q lcl|NC_015158. 237 EKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIRQDN 316 (581) Q Consensus 237 ~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p~~ 316 (581) .+.....++++++.- ...+++. +..-..|..+.. .+.-|++...||.+..|.+.++. T Consensus 201 ---------------~~~p~~~v~v~~~v~----~~~~~~~---~~~~~~g~~~~~-~~~~~~~~e~P~~~~Rw~~~~ge 257 (522) T protein:vir:94 201 ---------------DYEPDTELEVYTHIY----RQDDEYL---RYEEVEGIEVTG-TDGSYPLTACPYIPVRMVRLDGE 257 (522) T ss_pred ---------------cCCccceEEEEEEEE----eeCCcee---EEeeccCceecc-cCCCCccccCCceeeeeeecCCC Confidence 001113456665441 2233332 111223443322 33445677899999999999999 Q ss_pred ccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc----cccc-ccCCceeEEeCCCCCccccc--CCCccchh Q lcl|NC_015158. 317 LYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD----VEEF-VWGPMEQIYINGDGDVEMMA--PNTQALQA 389 (581) Q Consensus 317 ~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d----~~~i-~~~pG~vi~~~~~~~i~~~~--~p~~~~~~ 389 (581) .||.|+++.+++.++.+|.+.+.++.+..++++|++.+..+ +.++ ...+|.+ ....++++++++ .+.....+ T Consensus 258 ~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~~v~~~g~~~~~~~~~~~~g~~-v~g~~~~v~~~~~~~~~~~~~~ 336 (522) T protein:vir:94 258 DYGRSYCEEYLGDLNSLETITEAITKMAKVASKVVGLVNPNGITQPRRLNKAATGEF-VAGRVEDINFLQLTKGQDFTIA 336 (522) T ss_pred ccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeecccccccchheeccCCcee-ecCCcccceeeecccccchhHH Confidence 99999999999999999999999999999999999988643 4444 3445554 446667777554 34445567 Q ss_pred HHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceee Q lcl|NC_015158. 390 DMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADTIR 469 (581) Q Consensus 390 ~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~iR 469 (581) ...|+.+++.+.+..-+. +.+..+....|||+|.+..+.....+.-+..++..+++.|||...+..+.+... T Consensus 337 ~~~i~~~~~rI~~af~~~--~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~------ 408 (522) T protein:vir:94 337 KSVADAIEQRLGWAFLLN--SAVQRNAERVTAEEIRYVAGELEATLGGVYSVQSQELQLPIVRVLMNQLQSAGM------ 408 (522) T ss_pred HHHHHHHHHHHHHHHhhh--hhccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCC------ Confidence 778888888887765332 333344457899999999999999999999999999999999988777655432 Q ss_pred ecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhccc--ccccc-cchhHHHHHHHHHHHHhcCCCc Q lcl|NC_015158. 470 VFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTP--VWQDI-KPHVSTENLAKMLEHNLSLGGW 546 (581) Q Consensus 470 ~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~--~~~~i-~p~~~~~~l~~~~~e~~~l~~~ 546 (581) +..++.++++.. .+..-+-++|.+.++.|.++++.. +++.. .++++..++++.+++..|++.. T Consensus 409 ----------lP~~p~~~v~v~----~~s~La~~qr~~~~~~l~~~~~~ia~l~P~~~~~~id~d~~~~~~a~~~Gv~~~ 474 (522) T protein:vir:94 409 ----------IPDLPKEAVEPT----VSTGLEALGRGQDLEKLTQAVNMMTGLQPLSQDPDINLPTLKLRLLNALGIDTA 474 (522) T ss_pred ----------CCCCCcccEEee----EecHHHHHHHHHHHHHHHHHHHHHHhccchhhhhcCCHHHHHHHHHHHcCCChh Confidence 223344444333 232333355666666666655432 23332 3567888999999999999766 Q ss_pred ccccCCCCcHHHHHHHHHHHHHH--HHHHHHhcccCC Q lcl|NC_015158. 547 DIFKPNVAVMEAQTTSALVNQSQ--AQIEEEAQVPLV 581 (581) Q Consensus 547 ~~~~~~~~~~~~~~~q~~~q~aq--~~~~~~~~~~~~ 581 (581) .+++++-++.+..++|+++++.| +..+-..+.-.+ T Consensus 475 ~ivr~~ee~~~~~~q~~~~~~~~~~~~~~~~~~~a~~ 511 (522) T protein:vir:94 475 GLLLTQDEKIQRMAEQSSQQAVVQGASAAGANMGAAV 511 (522) T ss_pred hccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhh Confidence 77777544433333333222111 111112222222 No 25 >protein:vir:98506 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996583;genbank:gi:45569514;genbank:GeneID:2767834 Probab=100.00 E-value=1.1e-45 Score=267.03 Aligned_cols=499 Identities=11% Similarity=0.103 Sum_probs=322.5 Q ss_pred hccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhcccccc---cccccccccccccccchHHHHHHHHHHHHHhh Q lcl|NC_015158. 11 MLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTTT---TTNSTLPWKNKTTLPKLCQIRDNLHSNYISAL 87 (581) Q Consensus 11 ~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~~---~~~~~~~~k~~~~~pki~~~~d~~~~~l~~~~ 87 (581) |-. ...++.|.+.|+..+..|++|+..|.+|.+|..|+..+- .++....+..+++-+.-...+++|.+.||..+ T Consensus 1 M~~---~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~LAa~L~~~l 77 (555) T protein:vir:98 1 MAE---QTERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRHNNILDNTGTRALRVLAAGMMAGM 77 (555) T ss_pred CCC---cccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhcccccccccHHHHHHHHHHHHHHhh Confidence 554 346688999999999999999999999999999995432 23333333455777888899999999999999 Q ss_pred cC-CccEEEeecCChhHHHH------HHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeeeeEe Q lcl|NC_015158. 88 FP-NERWLKWEGKSLQDEAK------RDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESGATR 160 (581) Q Consensus 88 f~-~~~~~~~~~~~~~d~~~------ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~~~~ 160 (581) || +..||++.+...+..+. -+..++.|...|..|||+..+++++.|++.+|||++-+....+ T Consensus 78 tpp~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~----------- 146 (555) T protein:vir:98 78 TSPARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDFD----------- 146 (555) T ss_pred cCCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCCC----------- Confidence 99 89999999876654322 2446778889999999999999999999999999986543211 Q ss_pred eeeccceEEecchhheeecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchhhhhcc Q lcl|NC_015158. 161 DTYFGPRAVRIDPKDIVFNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDCEKAV 240 (581) Q Consensus 161 ~~~~~p~ie~V~p~df~~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~ 240 (581) ...++..+++.++++..++..--|+ ++.+..+|..+|.+...... ..+.+.+.+.. +++ T Consensus 147 ---~~~rf~~~pl~~~~v~~d~~G~vd~-i~r~~~~t~~ql~~~fg~~~----l~~~~~~~~~~----~~~--------- 205 (555) T protein:vir:98 147 ---AVVYHHSLTAGEYAIAADNQGRVNT-LYREFQITVAQMVREFGKDK----CSTTVQSLFDR----GAL--------- 205 (555) T ss_pred ---ceEEEEEeecceeEEeeCCCCCEEE-EEEEEeccHHHHHHhcCccc----CCHHHHHHHhc----CCC--------- Confidence 0124677999999999877643332 23456779888876532211 11111111110 000 Q ss_pred ccccccccccccccCCceEEEEEEeeeeeccc-------CCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEeccccc Q lcl|NC_015158. 241 GFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQ-------SGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIR 313 (581) Q Consensus 241 ~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~-------~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~ 313 (581) ..+++|+.+.-.-.+.+ +-++..+++..-.+|+++++... | ...||.+..|.+. T Consensus 206 ---------------~~~v~v~~~V~pr~~~~~~~~~~~~~p~~s~~~~~~~d~~~vl~esg--y--~e~P~i~~Rw~~~ 266 (555) T protein:vir:98 206 ---------------EQWVTVIHAIEPRADRDPSKRDDRNMAWKSVYFEPGADETRTLRESG--Y--RSFRALCPRWALV 266 (555) T ss_pred ---------------CceEEEEEEEeeccCcCcCCCCccccceEEEEEEeccCCccccccCC--c--ccCCceeeeeeec Confidence 01244444431100100 11122222222235566765553 3 5689999999999 Q ss_pred CCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc--ccccccCCceeEEeCCCC---Ccccc-cCCCccc Q lcl|NC_015158. 314 QDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD--VEEFVWGPMEQIYINGDG---DVEMM-APNTQAL 387 (581) Q Consensus 314 p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d--~~~i~~~pG~vi~~~~~~---~i~~~-~~p~~~~ 387 (581) ++..||.|+++...+..+.+|.+.+.++.++.+.++|.+.+..+ ...++..||++..+..+. .+.++ .+..... T Consensus 267 ~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~~~~~~~pgg~~~v~~g~~~d~~~~~~~~~~d~~ 346 (555) T protein:vir:98 267 GGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQLPVSAKNQDISTVPGGLSYVDAAAPNGGIRTAFEVNLDLS 346 (555) T ss_pred CCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccccceeccccccccccCCCCcceecccccccchH Confidence 99999999999999999999999999999999999999988765 345788899987665432 23332 2222344 Q ss_pred hhHHHHHHHHHHHHHhcCCchHhcCC-CCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccc Q lcl|NC_015158. 388 QADMQIQILEAKMEEFAGAPREAMGI-RTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVAD 466 (581) Q Consensus 388 ~~~~~lq~~~~~~ee~TGv~~~~~G~-~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~ 466 (581) .+...|+.+.+.+.+..=+.-+.+.. .+....|||+|.+..+.....+.-+.-++..+++.||+...+.++.+..-.| T Consensus 347 ~~~~~i~~~~~rI~~af~~dlf~~l~~~~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP- 425 (555) T protein:vir:98 347 HLLADIVDVRERIKASFYADLFLMLANGTNPQMTATEVAERHEEKLLMLGPVLERMHNEILDPLIELTFQRMVEANILP- 425 (555) T ss_pred HHHHHHHHHHHHHHHHhhcchhhhccCCCCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCC- Confidence 45555677777665543122223333 3334799999999999999999999999999999999998888876643222 Q ss_pred eeeecCchhcccCCCccCHHHhcCC-ceEEEecchhHHHHHHHHHHHHHhhccc-----ccccccchhHHHHHHHHHHHH Q lcl|NC_015158. 467 TIRVFDSDDKVATFMNVNKDDITAK-GRLRPVGARHFAEQAQVVQSLMGIANTP-----VWQDIKPHVSTENLAKMLEHN 540 (581) Q Consensus 467 ~iR~~~~~~~~~~~~~v~r~di~~~-~~vva~ga~~~~~r~q~~q~L~~~~~~~-----~~~~i~p~~~~~~l~~~~~e~ 540 (581) ..++.+.+. ++|.....-+-++|...++.+.++++.. +++.++.+++..++++.+++. T Consensus 426 ----------------~~P~~l~~~~i~v~yis~La~aq~~~~~~~i~~~l~~i~~laq~~P~vld~id~d~~~~~~a~~ 489 (555) T protein:vir:98 426 ----------------PPPQEMQGVDLNVEFVSMLAQAQRAIATNSVDRFVGNLGAVAGIKPEVLDKFDADRWADTYADM 489 (555) T ss_pred ----------------CCchhhcCceeEEEeccHHHHHHHHHHHHHHHHHHHHHHHHhcCChhhhhcCCHHHHHHHHHHH Confidence 223445443 3443333323233333332333332221 234455678899999999999 Q ss_pred hcCCCcccccCCCCcHHHHHHHHHHHHHHHHHHHHhcc----cCC Q lcl|NC_015158. 541 LSLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQV----PLV 581 (581) Q Consensus 541 ~~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~----~~~ 581 (581) .|++. .++++..++.+-.+++..+|++|++.+.++|+ -.| T Consensus 490 ~Gvp~-~~irs~eev~~~r~qr~~~~q~~~~a~~~~q~~~~~~~~ 533 (555) T protein:vir:98 490 LGIDP-ELIVPGNQVALIRKQRADQQQAAQQAALLNQGADTAAKL 533 (555) T ss_pred hCCCc-cccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 99997 78888766543211111111111111112221 111 No 26 >protein:vir:107404 Length: 555 # NCBI annotation: Bbp21 # Family: family:all:481 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958690;genbank:gi:41179382;genbank:GeneID:2717198 Probab=100.00 E-value=1.1e-45 Score=267.03 Aligned_cols=499 Identities=11% Similarity=0.103 Sum_probs=322.5 Q ss_pred hccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhcccccc---cccccccccccccccchHHHHHHHHHHHHHhh Q lcl|NC_015158. 11 MLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTTT---TTNSTLPWKNKTTLPKLCQIRDNLHSNYISAL 87 (581) Q Consensus 11 ~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~~---~~~~~~~~k~~~~~pki~~~~d~~~~~l~~~~ 87 (581) |-. ...++.|.+.|+..+..|++|+..|.+|.+|..|+..+- .++....+..+++-+.-...+++|.+.||..+ T Consensus 1 M~~---~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~LAa~L~~~l 77 (555) T protein:vir:10 1 MAE---QTERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRHNNILDNTGTRALRVLAAGMMAGM 77 (555) T ss_pred CCC---cccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhcccccccccHHHHHHHHHHHHHHhh Confidence 554 346688999999999999999999999999999995432 23333333455777888899999999999999 Q ss_pred cC-CccEEEeecCChhHHHH------HHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeeeeEe Q lcl|NC_015158. 88 FP-NERWLKWEGKSLQDEAK------RDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESGATR 160 (581) Q Consensus 88 f~-~~~~~~~~~~~~~d~~~------ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~~~~ 160 (581) || +..||++.+...+..+. -+..++.|...|..|||+..+++++.|++.+|||++-+....+ T Consensus 78 tpp~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~----------- 146 (555) T protein:vir:10 78 TSPARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDFD----------- 146 (555) T ss_pred cCCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCCC----------- Confidence 99 89999999876654322 2446778889999999999999999999999999986543211 Q ss_pred eeeccceEEecchhheeecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchhhhhcc Q lcl|NC_015158. 161 DTYFGPRAVRIDPKDIVFNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDCEKAV 240 (581) Q Consensus 161 ~~~~~p~ie~V~p~df~~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~ 240 (581) ...++..+++.++++..++..--|+ ++.+..+|..+|.+...... ..+.+.+.+.. +++ T Consensus 147 ---~~~rf~~~pl~~~~v~~d~~G~vd~-i~r~~~~t~~ql~~~fg~~~----l~~~~~~~~~~----~~~--------- 205 (555) T protein:vir:10 147 ---AVVYHHSLTAGEYAIAADNQGRVNT-LYREFQITVAQMVREFGKDK----CSTTVQSLFDR----GAL--------- 205 (555) T ss_pred ---ceEEEEEeecceeEEeeCCCCCEEE-EEEEEeccHHHHHHhcCccc----CCHHHHHHHhc----CCC--------- Confidence 0124677999999999877643332 23456779888876532211 11111111110 000 Q ss_pred ccccccccccccccCCceEEEEEEeeeeeccc-------CCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEeccccc Q lcl|NC_015158. 241 GFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQ-------SGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIR 313 (581) Q Consensus 241 ~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~-------~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~ 313 (581) ..+++|+.+.-.-.+.+ +-++..+++..-.+|+++++... | ...||.+..|.+. T Consensus 206 ---------------~~~v~v~~~V~pr~~~~~~~~~~~~~p~~s~~~~~~~d~~~vl~esg--y--~e~P~i~~Rw~~~ 266 (555) T protein:vir:10 206 ---------------EQWVTVIHAIEPRADRDPSKRDDRNMAWKSVYFEPGADETRTLRESG--Y--RSFRALCPRWALV 266 (555) T ss_pred ---------------CceEEEEEEEeeccCcCcCCCCccccceEEEEEEeccCCccccccCC--c--ccCCceeeeeeec Confidence 01244444431100100 11122222222235566765553 3 5689999999999 Q ss_pred CCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc--ccccccCCceeEEeCCCC---Ccccc-cCCCccc Q lcl|NC_015158. 314 QDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD--VEEFVWGPMEQIYINGDG---DVEMM-APNTQAL 387 (581) Q Consensus 314 p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d--~~~i~~~pG~vi~~~~~~---~i~~~-~~p~~~~ 387 (581) ++..||.|+++...+..+.+|.+.+.++.++.+.++|.+.+..+ ...++..||++..+..+. .+.++ .+..... T Consensus 267 ~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~~~~~~~pgg~~~v~~g~~~d~~~~~~~~~~d~~ 346 (555) T protein:vir:10 267 GGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQLPVSAKNQDISTVPGGLSYVDAAAPNGGIRTAFEVNLDLS 346 (555) T ss_pred CCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccccceeccccccccccCCCCcceecccccccchH Confidence 99999999999999999999999999999999999999988765 345788899987665432 23332 2222344 Q ss_pred hhHHHHHHHHHHHHHhcCCchHhcCC-CCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccc Q lcl|NC_015158. 388 QADMQIQILEAKMEEFAGAPREAMGI-RTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVAD 466 (581) Q Consensus 388 ~~~~~lq~~~~~~ee~TGv~~~~~G~-~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~ 466 (581) .+...|+.+.+.+.+..=+.-+.+.. .+....|||+|.+..+.....+.-+.-++..+++.||+...+.++.+..-.| T Consensus 347 ~~~~~i~~~~~rI~~af~~dlf~~l~~~~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP- 425 (555) T protein:vir:10 347 HLLADIVDVRERIKASFYADLFLMLANGTNPQMTATEVAERHEEKLLMLGPVLERMHNEILDPLIELTFQRMVEANILP- 425 (555) T ss_pred HHHHHHHHHHHHHHHHhhcchhhhccCCCCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCC- Confidence 45555677777665543122223333 3334799999999999999999999999999999999998888876643222 Q ss_pred eeeecCchhcccCCCccCHHHhcCC-ceEEEecchhHHHHHHHHHHHHHhhccc-----ccccccchhHHHHHHHHHHHH Q lcl|NC_015158. 467 TIRVFDSDDKVATFMNVNKDDITAK-GRLRPVGARHFAEQAQVVQSLMGIANTP-----VWQDIKPHVSTENLAKMLEHN 540 (581) Q Consensus 467 ~iR~~~~~~~~~~~~~v~r~di~~~-~~vva~ga~~~~~r~q~~q~L~~~~~~~-----~~~~i~p~~~~~~l~~~~~e~ 540 (581) ..++.+.+. ++|.....-+-++|...++.+.++++.. +++.++.+++..++++.+++. T Consensus 426 ----------------~~P~~l~~~~i~v~yis~La~aq~~~~~~~i~~~l~~i~~laq~~P~vld~id~d~~~~~~a~~ 489 (555) T protein:vir:10 426 ----------------PPPQEMQGVDLNVEFVSMLAQAQRAIATNSVDRFVGNLGAVAGIKPEVLDKFDADRWADTYADM 489 (555) T ss_pred ----------------CCchhhcCceeEEEeccHHHHHHHHHHHHHHHHHHHHHHHHhcCChhhhhcCCHHHHHHHHHHH Confidence 223445443 3443333323233333332333332221 234455678899999999999 Q ss_pred hcCCCcccccCCCCcHHHHHHHHHHHHHHHHHHHHhcc----cCC Q lcl|NC_015158. 541 LSLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQV----PLV 581 (581) Q Consensus 541 ~~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~----~~~ 581 (581) .|++. .++++..++.+-.+++..+|++|++.+.++|+ -.| T Consensus 490 ~Gvp~-~~irs~eev~~~r~qr~~~~q~~~~a~~~~q~~~~~~~~ 533 (555) T protein:vir:10 490 LGIDP-ELIVPGNQVALIRKQRADQQQAAQQAALLNQGADTAAKL 533 (555) T ss_pred hCCCc-cccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 99997 78888766543211111111111111112221 111 No 27 >protein:vir:107822 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996631;genbank:gi:45580765;genbank:GeneID:2767898 Probab=100.00 E-value=1.1e-45 Score=267.03 Aligned_cols=499 Identities=11% Similarity=0.103 Sum_probs=322.5 Q ss_pred hccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhcccccc---cccccccccccccccchHHHHHHHHHHHHHhh Q lcl|NC_015158. 11 MLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTTT---TTNSTLPWKNKTTLPKLCQIRDNLHSNYISAL 87 (581) Q Consensus 11 ~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~~---~~~~~~~~k~~~~~pki~~~~d~~~~~l~~~~ 87 (581) |-. ...++.|.+.|+..+..|++|+..|.+|.+|..|+..+- .++....+..+++-+.-...+++|.+.||..+ T Consensus 1 M~~---~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~LAa~L~~~l 77 (555) T protein:vir:10 1 MAE---QTERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRHNNILDNTGTRALRVLAAGMMAGM 77 (555) T ss_pred CCC---cccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhcccccccccHHHHHHHHHHHHHHhh Confidence 554 346688999999999999999999999999999995432 23333333455777888899999999999999 Q ss_pred cC-CccEEEeecCChhHHHH------HHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeeeeEe Q lcl|NC_015158. 88 FP-NERWLKWEGKSLQDEAK------RDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESGATR 160 (581) Q Consensus 88 f~-~~~~~~~~~~~~~d~~~------ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~~~~ 160 (581) || +..||++.+...+..+. -+..++.|...|..|||+..+++++.|++.+|||++-+....+ T Consensus 78 tpp~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~----------- 146 (555) T protein:vir:10 78 TSPARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDFD----------- 146 (555) T ss_pred cCCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCCC----------- Confidence 99 89999999876654322 2446778889999999999999999999999999986543211 Q ss_pred eeeccceEEecchhheeecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchhhhhcc Q lcl|NC_015158. 161 DTYFGPRAVRIDPKDIVFNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDCEKAV 240 (581) Q Consensus 161 ~~~~~p~ie~V~p~df~~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~ 240 (581) ...++..+++.++++..++..--|+ ++.+..+|..+|.+...... ..+.+.+.+.. +++ T Consensus 147 ---~~~rf~~~pl~~~~v~~d~~G~vd~-i~r~~~~t~~ql~~~fg~~~----l~~~~~~~~~~----~~~--------- 205 (555) T protein:vir:10 147 ---AVVYHHSLTAGEYAIAADNQGRVNT-LYREFQITVAQMVREFGKDK----CSTTVQSLFDR----GAL--------- 205 (555) T ss_pred ---ceEEEEEeecceeEEeeCCCCCEEE-EEEEEeccHHHHHHhcCccc----CCHHHHHHHhc----CCC--------- Confidence 0124677999999999877643332 23456779888876532211 11111111110 000 Q ss_pred ccccccccccccccCCceEEEEEEeeeeeccc-------CCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEeccccc Q lcl|NC_015158. 241 GFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQ-------SGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIR 313 (581) Q Consensus 241 ~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~-------~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~ 313 (581) ..+++|+.+.-.-.+.+ +-++..+++..-.+|+++++... | ...||.+..|.+. T Consensus 206 ---------------~~~v~v~~~V~pr~~~~~~~~~~~~~p~~s~~~~~~~d~~~vl~esg--y--~e~P~i~~Rw~~~ 266 (555) T protein:vir:10 206 ---------------EQWVTVIHAIEPRADRDPSKRDDRNMAWKSVYFEPGADETRTLRESG--Y--RSFRALCPRWALV 266 (555) T ss_pred ---------------CceEEEEEEEeeccCcCcCCCCccccceEEEEEEeccCCccccccCC--c--ccCCceeeeeeec Confidence 01244444431100100 11122222222235566765553 3 5689999999999 Q ss_pred CCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc--ccccccCCceeEEeCCCC---Ccccc-cCCCccc Q lcl|NC_015158. 314 QDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD--VEEFVWGPMEQIYINGDG---DVEMM-APNTQAL 387 (581) Q Consensus 314 p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d--~~~i~~~pG~vi~~~~~~---~i~~~-~~p~~~~ 387 (581) ++..||.|+++...+..+.+|.+.+.++.++.+.++|.+.+..+ ...++..||++..+..+. .+.++ .+..... T Consensus 267 ~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~~~~~~~pgg~~~v~~g~~~d~~~~~~~~~~d~~ 346 (555) T protein:vir:10 267 GGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQLPVSAKNQDISTVPGGLSYVDAAAPNGGIRTAFEVNLDLS 346 (555) T ss_pred CCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccccceeccccccccccCCCCcceecccccccchH Confidence 99999999999999999999999999999999999999988765 345788899987665432 23332 2222344 Q ss_pred hhHHHHHHHHHHHHHhcCCchHhcCC-CCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccc Q lcl|NC_015158. 388 QADMQIQILEAKMEEFAGAPREAMGI-RTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVAD 466 (581) Q Consensus 388 ~~~~~lq~~~~~~ee~TGv~~~~~G~-~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~ 466 (581) .+...|+.+.+.+.+..=+.-+.+.. .+....|||+|.+..+.....+.-+.-++..+++.||+...+.++.+..-.| T Consensus 347 ~~~~~i~~~~~rI~~af~~dlf~~l~~~~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP- 425 (555) T protein:vir:10 347 HLLADIVDVRERIKASFYADLFLMLANGTNPQMTATEVAERHEEKLLMLGPVLERMHNEILDPLIELTFQRMVEANILP- 425 (555) T ss_pred HHHHHHHHHHHHHHHHhhcchhhhccCCCCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCC- Confidence 45555677777665543122223333 3334799999999999999999999999999999999998888876643222 Q ss_pred eeeecCchhcccCCCccCHHHhcCC-ceEEEecchhHHHHHHHHHHHHHhhccc-----ccccccchhHHHHHHHHHHHH Q lcl|NC_015158. 467 TIRVFDSDDKVATFMNVNKDDITAK-GRLRPVGARHFAEQAQVVQSLMGIANTP-----VWQDIKPHVSTENLAKMLEHN 540 (581) Q Consensus 467 ~iR~~~~~~~~~~~~~v~r~di~~~-~~vva~ga~~~~~r~q~~q~L~~~~~~~-----~~~~i~p~~~~~~l~~~~~e~ 540 (581) ..++.+.+. ++|.....-+-++|...++.+.++++.. +++.++.+++..++++.+++. T Consensus 426 ----------------~~P~~l~~~~i~v~yis~La~aq~~~~~~~i~~~l~~i~~laq~~P~vld~id~d~~~~~~a~~ 489 (555) T protein:vir:10 426 ----------------PPPQEMQGVDLNVEFVSMLAQAQRAIATNSVDRFVGNLGAVAGIKPEVLDKFDADRWADTYADM 489 (555) T ss_pred ----------------CCchhhcCceeEEEeccHHHHHHHHHHHHHHHHHHHHHHHHhcCChhhhhcCCHHHHHHHHHHH Confidence 223445443 3443333323233333332333332221 234455678899999999999 Q ss_pred hcCCCcccccCCCCcHHHHHHHHHHHHHHHHHHHHhcc----cCC Q lcl|NC_015158. 541 LSLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQV----PLV 581 (581) Q Consensus 541 ~~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~----~~~ 581 (581) .|++. .++++..++.+-.+++..+|++|++.+.++|+ -.| T Consensus 490 ~Gvp~-~~irs~eev~~~r~qr~~~~q~~~~a~~~~q~~~~~~~~ 533 (555) T protein:vir:10 490 LGIDP-ELIVPGNQVALIRKQRADQQQAAQQAALLNQGADTAAKL 533 (555) T ss_pred hCCCc-cccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 99997 78888766543211111111111111112221 111 No 28 >protein:vir:94572 Length: 535 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919010;genbank:gi:119637774;genbank:GeneID:5179332 Probab=100.00 E-value=1.7e-44 Score=260.55 Aligned_cols=487 Identities=12% Similarity=0.100 Sum_probs=321.3 Q ss_pred CccchhhhhhhccchhhhHH-HHHHHHHhhHHhhhhhHHHHHHHHHHHhhcccccccccccccccccccccchHHHHHHH Q lcl|NC_015158. 1 MTGKVLELQQMLDDTRDGLA-EQIANTWQNWNSQRQEWLSQKSELRNYIFATDTTTTTNSTLPWKNKTTLPKLCQIRDNL 79 (581) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~a-~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~~~~~~~~~~k~~~~~pki~~~~d~~ 79 (581) ||.-++ ++.+| ..++++|+.+++.|++|+..|.+|.+|.+|+.....+....-...+++-+.....+.+| T Consensus 1 ~~~~~~---------~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~L 71 (535) T protein:vir:94 1 MASSQK---------REGFAENGAKAVYDALKNDRNSYETRAENCAKYTIPSLFPKDSDNASTDYTTPWQAVGARGLNNL 71 (535) T ss_pred CCchhh---------hhhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCccccccCCcccccHHHHHHHH Confidence 443332 23444 55889999999999999999999999999986555554444444557777778999999 Q ss_pred HHHHHHhhcCCccEEEeecCChhH-------------HHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEee Q lcl|NC_015158. 80 HSNYISALFPNERWLKWEGKSLQD-------------EAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEY 146 (581) Q Consensus 80 ~~~l~~~~f~~~~~~~~~~~~~~d-------------~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~ 146 (581) .+.||+.+||+..||++......- ++.-+..++.|...|..|||+..+++++.|++.+|||++.++. T Consensus 72 aa~l~~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~ 151 (535) T protein:vir:94 72 ASKLMLALFPMQTWMKLTISEFEAKQLVAQPAELAKVEEGLSMVERILMNYIESNSYRVTLFETLKQLVVAGNALLYIPE 151 (535) T ss_pred HHHHHhhhcCCCCccccccChhhhhccccchhHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcEeEeecc Confidence 999999999999999987765321 1123445677888999999999999999999999999998865 Q ss_pred ecceeeeeeeeeEeeeeccceEEecchhheeecCCCCC-cccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhh Q lcl|NC_015158. 147 VKETTKDEESGATRDTYFGPRAVRIDPKDIVFNPVAVD-FAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFR 225 (581) Q Consensus 147 ~~~~~~~~~~~~~~~~~~~p~ie~V~p~df~~DP~a~~-~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~ 225 (581) ..+. ...+..+++-++++..++.. +.. .+.|..+|.+.|-+- +-+.... T Consensus 152 ~~~~--------------~~~f~~~pl~~y~v~~d~~G~vd~--i~r~~~~~~~~l~~~-------------~~~~~~~- 201 (535) T protein:vir:94 152 PEGT--------------YNPMKLYRLSSYVVQRDAFGTVLQ--IVTLDKTAYAALPED-------------VRNSMDS- 201 (535) T ss_pred CcCc--------------ccceEEEEcCeEEEeeCCCCCeEE--EEeeeeccHHHhhHH-------------HHHHHHh- Confidence 3221 11345677788888877543 322 234556676665221 1111100 Q ss_pred ccCCcccchhhhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCe Q lcl|NC_015158. 226 RGLGTYTREDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPI 305 (581) Q Consensus 226 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf 305 (581) .. ++.....++++++- +.+..++.+. +.....|..+ ...+..+++...|| T Consensus 202 --~~----------------------~~~~~~~v~v~~~v--~~~~~~~~~~---~~~e~~g~~~-~~~~~~~g~~~~P~ 251 (535) T protein:vir:94 202 --SQ----------------------EHKGDEMIDVYTHI--YLDEESGEYL---KYEEIDGVEV-EGTDASYPVDACPY 251 (535) T ss_pred --cc----------------------ccCCCceeEEEEEE--EeeCCCCcEE---EEEEecCeee-ccccccCccccCCc Confidence 00 00111235555542 2234455543 2223445544 22234455678999 Q ss_pred eEecccccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc----cccccc-CCceeEEeCCCCCcccc Q lcl|NC_015158. 306 FHCGWRIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD----VEEFVW-GPMEQIYINGDGDVEMM 380 (581) Q Consensus 306 ~~~~~~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d----~~~i~~-~pG~vi~~~~~~~i~~~ 380 (581) .+..|...++..||.|+++...+..+.+|.+.+.++.+..++++|++.++.+ ++.+.. .||.++ ...++++++. T Consensus 252 ~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~~~~~g~~v-~g~~~~v~~~ 330 (535) T protein:vir:94 252 IPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVNPAGITQVRRLTKAQTGDFV-SGRPEDISFL 330 (535) T ss_pred eeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccccchhhcccCCCceee-cCCcccceee Confidence 9999999999999999999999999999999999999999999999988643 444433 455554 4556777655 Q ss_pred cCC--CccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015158. 381 APN--TQALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEIS 458 (581) Q Consensus 381 ~~p--~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~ 458 (581) +.. ..+..+...++.+.+.+.+.. --.+.+..+....||++|.+..+.....+.-+.-++..+++.|||...+.++ T Consensus 331 ~~~~~~~~~~~~~~i~~~~~rI~~af--~~~~~~~~d~~rvTAtEV~~r~~E~~~~LGpv~~rl~~ElL~Pli~r~~~il 408 (535) T protein:vir:94 331 QLEKAADFSVARAVSEQIEGRLSYAF--MLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPMVRVLLKQL 408 (535) T ss_pred ecccccchhHHHHHHHHHHHHHHHHH--hHhhhccCCCCCccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHH Confidence 433 334455666777777666543 1112333444568999999999999999999999999999999999988887 Q ss_pred HhhcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhccc--cccccc-chhHHHHHHH Q lcl|NC_015158. 459 RRNLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTP--VWQDIK-PHVSTENLAK 535 (581) Q Consensus 459 ~~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~--~~~~i~-p~~~~~~l~~ 535 (581) .+..- +..++.+.++.++ +. ..+-++|.+.++.|.+.++.. +++.++ ++++..++++ T Consensus 409 ~r~g~----------------lP~~p~~~v~~~~--vs--~la~l~r~~~~~~l~~~~~~laq~~P~~ld~~id~d~~~~ 468 (535) T protein:vir:94 409 QATNQ----------------IPELPKEAVEPTI--ST--GMEALGRGQDLDKLERCIAAWSALAPMQGDPDINIATIKL 468 (535) T ss_pred HhCCC----------------CCCCChhhccceE--ee--hHHHHHHHHHHHHHHHHHHHHHhhChHHhhhcCCHHHHHH Confidence 65432 2334445554443 22 223356666666666665532 233333 4678889999 Q ss_pred HHHHHhcCCCcccccCCCCcHHHHHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 536 MLEHNLSLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 536 ~~~e~~~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) .+++..|++...++++.-++.+.+++| ++++|++.+.++.+-.. T Consensus 469 ~~a~~~Gvp~~~i~rs~eev~~~~~q~--~~~~~~~~~~~~~g~~~ 512 (535) T protein:vir:94 469 RIANAIGIDTSGILKTPEEKQQEMAEA--AQGTAMQNAAASAGAGA 512 (535) T ss_pred HHHHHhCCChhhhcCCHHHHHHHHHHH--HHHHHHHHHHHHHHHhh Confidence 999999999767788755543222222 22222223333322222 No 29 >protein:vir:105619 Length: 772 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164304;genbank:gi:56692922;genbank:GeneID:3197230 Probab=100.00 E-value=4.9e-44 Score=258.05 Aligned_cols=547 Identities=10% Similarity=0.043 Sum_probs=321.4 Q ss_pred Cccchhhhhhhc--cchhhhHHHHHHHHHhhHHhhhhhHHHHHHH--HHHHhhccccccccc----ccccccccccccch Q lcl|NC_015158. 1 MTGKVLELQQML--DDTRDGLAEQIANTWQNWNSQRQEWLSQKSE--LRNYIFATDTTTTTN----STLPWKNKTTLPKL 72 (581) Q Consensus 1 ~~~~~~~~~~~~--~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~--~~~y~~~~~~~~~~~----~~~~~k~~~~~pki 72 (581) |--|--+=+++. ++..+ -.....+|..|...+.. ...|.+ .++|=|+....+... .+..+.-.+|.++| T Consensus 1 ~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~-q~~~r~~a~~d~~fy~G~QW~~~~~~~l~~~g~p~~~~N~i 77 (772) T protein:vir:10 1 MQITENDRQYLNGLPPAGD--TPLTVDEYADINYEIED-QPAWRAVADKEMDYADGNQLDTELLRRQQALGIPPAVEDLI 77 (772) T ss_pred CCcchhhHHhhccCCcccc--cccCHHHHHHHHHHHhc-cHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEEcch Confidence 333222222222 11000 01111222222222222 112322 234444444343322 23345556889999 Q ss_pred HHHHHHHHHHHHHhhcCCccEEEeecCC-hhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeeccee Q lcl|NC_015158. 73 CQIRDNLHSNYISALFPNERWLKWEGKS-LQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETT 151 (581) Q Consensus 73 ~~~~d~~~~~l~~~~f~~~~~~~~~~~~-~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~ 151 (581) ...++.+.+... .|+--++|.|+. .+|.+.|+++..++.+-..+|++....++.|.+.++.|.|++.+.+... T Consensus 78 ~~~v~~v~g~~~----~nr~d~~v~Pr~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~Gw~e~~~~~d-- 151 (772) T protein:vir:10 78 GPALLSLQGYEA----VTRTDWRVTPNGDVGGQEVADALNYRLNTAERQSGADRACSEAFRPQIACGIGWVEVSRESD-- 151 (772) T ss_pred HHHHHHHHHHHH----hcCcceEEecCCCchHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHhhhcCceeEEeccccC-- Confidence 999999999888 688889999985 4777889999999999999999999999999999999999766543211 Q ss_pred eeeeeeeEeeeec-cceEEecchhheeecCCCC-CcccCceE-EEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccC Q lcl|NC_015158. 152 KDEESGATRDTYF-GPRAVRIDPKDIVFNPVAV-DFAHSPKI-IRTVLNEGELLQMEQDQPENASLASAIARRREFRRGL 228 (581) Q Consensus 152 ~~~~~~~~~~~~~-~p~ie~V~p~df~~DP~a~-~~~d~~~i-~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~ 228 (581) +.. ..++.+|+|.+|||||+|. ++.||.|| +..++++++++++..... ...... .+....-.+. T Consensus 152 ----------~~~~~i~i~~v~p~~v~~Dp~a~~D~sDar~~~~~~~~~~d~~~~~fp~~a--~~~~~~-~~~~~~~~~~ 218 (772) T protein:vir:10 152 ----------PFKFPYRCRPIRRDEIHWDMKCGDDWEACRFLRRQRWLSPDRIALVFPEHA--ELIGMV-GKYGSTWWGQ 218 (772) T ss_pred ----------CCCCCeEEEeeCcccceecCCCCCCHHHhhhhhhhccCCHHHHHHhCCCch--hHHHhh-hhhcccccCc Confidence 111 2378999999999999985 78999886 566789999998843221 111100 0000000011 Q ss_pred Ccccchhhhhcc----ccccccccc--cccccC--CceEEEEEEeeee------ecccCCcee----------------- Q lcl|NC_015158. 229 GTYTREDCEKAV----GFSMDGFGN--LYDYFQ--SPYVEVLTFYGDY------HDTQSGTFK----------------- 277 (581) Q Consensus 229 ~~~~~~~~~~~~----~~~~~~~~~--~~~~~~--~~~vevlE~~g~~------~d~~~d~~~----------------- 277 (581) ...+..+...+. .......++ ...|+. ...|+|+|||-.. .+..+|... T Consensus 219 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rVrv~E~w~r~~~~~~~~~~~~g~~~~~~~~~~~~~~~l~~g~ 298 (772) T protein:vir:10 219 PDLGMMEGGTSTGLHNAWNEARAWTVQEDHWYNPTSKEICLVELWYRRWVQVHVLKSPDGRVVEYDPNNLAHNIALASGR 298 (772) T ss_pred ccccccccccccccccccchhhccccccccccccCCceEEEEEEeeeeeeeeeeeccCCCceEeeCcccHHHHHHHhhcc Confidence 111111111000 000000111 112222 3469999998321 111222111 Q ss_pred --------eeeEEEEEeCCEEEEeecCCCccCCCCeeEecccccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_015158. 278 --------RNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAF 349 (581) Q Consensus 278 --------e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~n 349 (581) .-+.+.++.|.++|+...+||+++.+||+-+-.......-...|+.+.++|+|+.+|+..+.+++.++... T Consensus 299 ~~~~~~~~~rv~~~~~~g~~~L~~~~~p~~~~~fP~vP~~g~r~~~~g~~~G~vr~~kd~Qr~~N~~~S~~~~~l~~~~- 377 (772) T protein:vir:10 299 ISPKKVTVSRVRRSYWLGPHCLHDGPTPYTHRHFPYVPFFGFREDATGIPYGYVRGMKYAQDSLNSGVSKLRWGMSVAR- 377 (772) T ss_pred cchheeeeeEEEEEEEecceeeccCCCCCCCCccceEEEeeeEeccCCcccchhhhhhhHHHHHHHHHHHHHHHHhccc- Confidence 11234567899999999999999999988432222222222337999999999999999999999876553 Q ss_pred CeEEEecc-ccc-------cccCCceeEEeCCC------CCcccccCCCccchhHHHHHHHHHHHHHhcCCchHhcCCCC Q lcl|NC_015158. 350 PPMKVKGD-VEE-------FVWGPMEQIYINGD------GDVEMMAPNTQALQADMQIQILEAKMEEFAGAPREAMGIRT 415 (581) Q Consensus 350 p~~~v~~d-~~~-------i~~~pG~vi~~~~~------~~i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~ 415 (581) +...++ +++ --.+|++++.++.+ ..+++.++|..+.....+++...+.++++|||...++|..+ T Consensus 378 --~~~~~gav~~~d~~~~e~~arp~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~vsGv~~~~lG~~~ 455 (772) T protein:vir:10 378 --VERTKGAVAMTDAQFRRQIARPDADIVLDENHMAKPGARFDVKRDYTLTDQHFQMLQDNRATIERVSNITAGFQGRKG 455 (772) T ss_pred --ccccCCCccchhHHHHHhccCCCCeEEeCCccccCCCCCccccCCccccHHHHHHHHHHHHHHHHHhCCCHHHcCCCc Confidence 222221 111 22579999998864 23455556667778889999999999999999999999754 Q ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceeeecCchh-cccCCCccCH--------- Q lcl|NC_015158. 416 PGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADTIRVFDSDD-KVATFMNVNK--------- 485 (581) Q Consensus 416 ~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~iR~~~~~~-~~~~~~~v~r--------- 485 (581) . +.+...++++..+++..+..+.+++.. .++.+.+.++.++.++++.+.++||+|+|. +..-++.++. T Consensus 456 n-a~SGvAi~~rq~qg~~~l~~~~Dnl~~-~~~~~g~~lL~li~~~y~~er~~RI~~~d~~~~~~~v~in~~~~d~~tg~ 533 (772) T protein:vir:10 456 T-ATSGIQEQQQIEQSNQSIGRIMDNFRA-GRTLVGELLLAMIVEDIGQERTEVVIEGDAVTADRVVVLNEPQRDPQTGA 533 (772) T ss_pred c-hhhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHcCCCcEEEEecCCCCCCCceEEeccceecccccc Confidence 3 455566888999999999999999995 467778888888888899999999998752 2333343332 Q ss_pred ----HHh-cCCceEEEecchhHH-HHHHHHHHHHHhhcccccccccchhHHHHHHHHHHHHhcCCCcc-cc------cCC Q lcl|NC_015158. 486 ----DDI-TAKGRLRPVGARHFA-EQAQVVQSLMGIANTPVWQDIKPHVSTENLAKMLEHNLSLGGWD-IF------KPN 552 (581) Q Consensus 486 ----~di-~~~~~vva~ga~~~~-~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~~~e~~~l~~~~-~~------~~~ 552 (581) .|| .|.++|++.-+.+.. .|.+.++.|.+++.. +.|.+.. .+...+.++.++++-+ +. .+. T Consensus 534 ~~~~NDi~~g~yDv~i~~~p~~~t~r~~~~~~m~ql~~~-----~~P~~~~-~~~~~~le~~D~p~~~ei~~~ir~~~~~ 607 (772) T protein:vir:10 534 AYLSNDLLRTRIKVALEDVPSTNSYRGQQLNAMSEAVKS-----MPPQYQA-AVLPFLVSLMDVPFKRDVVEAIRAVDQQ 607 (772) T ss_pred cceeccceeeeEEEEeeccccchHHHHHHHHHHHHHHhc-----cChhHHH-HHHHHHHhhcCCCChHHHHHHHHHHhcc Confidence 243 455676555544444 466667777777642 2344332 3334555555555431 11 111 Q ss_pred CCcHHHHHHHHH------HHHHHHHHHH---HhcccCC Q lcl|NC_015158. 553 VAVMEAQTTSAL------VNQSQAQIEE---EAQVPLV 581 (581) Q Consensus 553 ~~~~~~~~~q~~------~q~aq~~~~~---~~~~~~~ 581 (581) .++ ++++++.+ +++++++++. +++..+. T Consensus 608 ~~p-eq~~~~~~q~~qq~~~~~~~el~~~q~~a~~~~~ 644 (772) T protein:vir:10 608 QTP-EQIQQQIDQAVQDALAKAGNDIKLRELEIKERKA 644 (772) T ss_pred CCh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 122 11111111 1111111110 0000011 No 30 >protein:vir:103765 Length: 549 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024925;genbank:gi:48697195;genbank:GeneID:2846089 Probab=100.00 E-value=1.9e-42 Score=249.38 Aligned_cols=497 Identities=11% Similarity=0.048 Sum_probs=321.4 Q ss_pred hccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhcccccccc------cccccccccccccchHHHHHHHHHHHH Q lcl|NC_015158. 11 MLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTTTTT------NSTLPWKNKTTLPKLCQIRDNLHSNYI 84 (581) Q Consensus 11 ~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~~~~------~~~~~~k~~~~~pki~~~~d~~~~~l~ 84 (581) |.. +.+.++..|.+.|+..+..|++|+..|.+|.+|..|+..+-.+ ......-.+++-+--...+++|.+.|| T Consensus 1 m~~-d~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~~~~dstg~~a~~~LAs~l~ 79 (549) T protein:vir:10 1 MTN-DDAKILQALNADHGRMKEKRQSYEAVWNDVIDYLMPRLDKFGQLPRPDSEKGRERSQKMFDSTAPLALRNFVAAMD 79 (549) T ss_pred CCc-chHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccccccccCCCCCCcccccccccccchHHHHHHHHHHHHH Confidence 554 3688889999999999999999999999999999998533221 111111223555556788899999999 Q ss_pred HhhcC-CccEEEeecCChhHHHH------HHHHHHHHHHHH--HhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeee Q lcl|NC_015158. 85 SALFP-NERWLKWEGKSLQDEAK------RDAIQQYMDNKV--KESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEE 155 (581) Q Consensus 85 ~~~f~-~~~~~~~~~~~~~d~~~------ae~~~~~i~~~l--~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~ 155 (581) ..+|| +..||++.....+..+. -+..++.+...+ ..|||+.++++.+.|++.+|||++.+....+ T Consensus 80 ~~ltpp~~~wF~l~~~~~~~~e~~~v~~~l~~ve~~~~~~~~~~~snf~~~~~~~~~~L~~~Gta~l~~~~~~~------ 153 (549) T protein:vir:10 80 SMITPATQLWHRLKTGNDALNEIASVKAYLQGVVRTLFAARYRWQGGFVTQMGATYQSIGLFGPGALMIEHDVG------ 153 (549) T ss_pred hhccCCCCccccccCCccchhhhhHHHHHHHHHHHHHHHHHhhhhcChHHHHHHHHHHHHhhcceeeEEeecCC------ Confidence 99999 89999999877554332 244556666654 4799999999999999999999988743111 Q ss_pred eeeEeeeeccceEEecchhheeecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchh Q lcl|NC_015158. 156 SGATRDTYFGPRAVRIDPKDIVFNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTRED 235 (581) Q Consensus 156 ~~~~~~~~~~p~ie~V~p~df~~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 235 (581) ...++..+++-++++..++..--|+ .+.+..+|..+|.+...... ..+++.+.... +++ T Consensus 154 --------~~~~f~~~pl~~~~v~~d~~G~vd~-i~r~~~~t~~ql~~~fg~~~----l~~~v~~~~~~----~~~---- 212 (549) T protein:vir:10 154 --------KGIVYRNVPMQRLWFAENNSGLIDK-THVQWELTLRQAAQRFGREN----LSPSMQSTLEK----DPE---- 212 (549) T ss_pred --------CeeEEEEEEcCeEEEeeCCCCCeEE-EEEEeecCHHHHHHhcCccc----CCHHHHHHhhc----CCC---- Confidence 0124677899999999987643343 34566779888876532211 11121111110 000 Q ss_pred hhhccccccccccccccccCCceEEEEEEe--eeeec-----ccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEe Q lcl|NC_015158. 236 CEKAVGFSMDGFGNLYDYFQSPYVEVLTFY--GDYHD-----TQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHC 308 (581) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~~~~vevlE~~--g~~~d-----~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~ 308 (581) ..++++.+= ....+ -.+-++..+++. .+++++++... +...||.+. T Consensus 213 ---------------------~~~~v~~~V~pr~~~~~~~~~~~~~pf~sv~~e--~~~~~il~esg----~~e~P~~~~ 265 (549) T protein:vir:10 213 ---------------------KSAIFYHAVEPRADRDPRKLDGRNMQFASYWLD--EGRDRIVQNSG----FRTFPFAIG 265 (549) T ss_pred ---------------------ceEEEEEEeecCCCCCccccccccCceEEEEEE--ecCCEeeccCC----cccCCccee Confidence 123333220 00000 011122222222 35677777553 346899999 Q ss_pred cccccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc--ccccccCCceeEEeCC--C--CCcccccC Q lcl|NC_015158. 309 GWRIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD--VEEFVWGPMEQIYING--D--GDVEMMAP 382 (581) Q Consensus 309 ~~~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d--~~~i~~~pG~vi~~~~--~--~~i~~~~~ 382 (581) .|...++..||.|+++...+..+.+|.+.+.++.++.++++|++.+..+ ....+..||++..+.. + ..+.|+.. T Consensus 266 Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~~v~~~g~~~~~~l~pgg~~~~~~~~~~~~~~~pl~~ 345 (549) T protein:vir:10 266 RFYVGTDDVYGGSPAYDAMPDVRMANDMAKTNIRGAQKLVDPPLLANEDGVLDGFDLRSGALNWGGLNDKGEEMVKPLLT 345 (549) T ss_pred eeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccccceeccCCccccccCCCCccceeeecc Confidence 9999999999999999999999999999999999999999999988643 3445667888865532 2 24667666 Q ss_pred CCccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_015158. 383 NTQALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNL 462 (581) Q Consensus 383 p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~ 462 (581) +.....+...++.+.+.+.+.-=+..+.+- ......|||+|.+..+.....+.-+.-++..+++.|||...+.++.+.. T Consensus 346 ~~~~~~~~~~i~~~~~rI~~af~~d~~~~~-~~~~~~TAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~R~~~il~r~g 424 (549) T protein:vir:10 346 GKQAQIGIEFAQDTRQTINQWFYVTLFQIL-VDSGDMTATEVLQRAQEKGVLLAPTLGRTQSELLGPMIAREVDILAEAG 424 (549) T ss_pred ccchhHHHHHHHHHHHHHHHHHhhhhhhhh-cCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 655555667777777777665433333322 2334689999999999999999999999999999999998887766533 Q ss_pred CccceeeecCchhcccCCCccCHHHhc---CCceEEEecchhHHHHHHHHHHHHHhhccc-----ccccccchhHHHHHH Q lcl|NC_015158. 463 DVADTIRVFDSDDKVATFMNVNKDDIT---AKGRLRPVGARHFAEQAQVVQSLMGIANTP-----VWQDIKPHVSTENLA 534 (581) Q Consensus 463 d~~~~iR~~~~~~~~~~~~~v~r~di~---~~~~vva~ga~~~~~r~q~~q~L~~~~~~~-----~~~~i~p~~~~~~l~ 534 (581) . +..+ ++++. .+.+|..+++-+-++|.+.++.+.++++.. +++.+..+++..+++ T Consensus 425 ~----------------lP~~-p~~l~~~~~~~~i~yis~La~aq~~~~~~~i~~~~~~~~~laq~~Pe~ld~id~d~~~ 487 (549) T protein:vir:10 425 Q----------------LPDM-PQELIDAGADVDVEYDSPLNKAMRAGEGAAILQWLQQLGIVSQFDPAAAKVPNGARIA 487 (549) T ss_pred C----------------CCCC-ChhhhcCCceeEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhHHhcCCHHHHH Confidence 2 2222 33332 234455554444444444444444444321 223333468888999 Q ss_pred HHHHHHhcCCCcccccCCCCcHHHHHHHHHHHHHHHHHHH----HhcccCC Q lcl|NC_015158. 535 KMLEHNLSLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEE----EAQVPLV 581 (581) Q Consensus 535 ~~~~e~~~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~----~~~~~~~ 581 (581) +.+++.+|++. .++++..++.+..++.++||++++..+. .+-...+ T Consensus 488 ~~~a~~~Gvp~-~~irs~eev~~~r~~~~~qqq~~~~~~~a~~a~~~a~~~ 537 (549) T protein:vir:10 488 RLLADYGGVPV-EAMSTDEELQAQQAAEAQAAQMQQMLAAAPVAAGAIKDL 537 (549) T ss_pred HHHHHhcCCCc-cccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 99999999997 6888876664422211112211111111 1111122 No 31 >protein:vir:77597 Length: 725 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:YP_063735;genbank:gi:51236726;genbank:GeneID:2944241 Probab=100.00 E-value=3.3e-42 Score=247.99 Aligned_cols=521 Identities=13% Similarity=0.072 Sum_probs=309.8 Q ss_pred hccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhcccccccccccc--cccccccccchHHHHHHHHHHHHHhhc Q lcl|NC_015158. 11 MLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTTTTTNSTL--PWKNKTTLPKLCQIRDNLHSNYISALF 88 (581) Q Consensus 11 ~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~~~~~~~~--~~k~~~~~pki~~~~d~~~~~l~~~~f 88 (581) |.|. ...-..+...|+...+....|-++- ..++-|+....+...... .-..+.+.++|...++++.++.. T Consensus 1 m~d~--~~~~~~~~~~~~~~~~~~~~~r~~a--~~d~~fy~G~Qw~~~~~~~l~~q~rp~~N~i~~~i~~v~g~~~---- 72 (725) T protein:vir:77 1 MADN--ENRLESILSRFDADWTASDEARREA--KNDLFFSRVSQWDDWLSQYTTLQYRGQFDVVRPVVRKLVSEMR---- 72 (725) T ss_pred CCch--HHHHHHHHHHHHHHHHhhHHHHHHH--HHHHHhhCCCCCCHHHHHHHHhcCCCccccHHHHHHHHHhhHH---- Confidence 6553 2222223333332222222221111 334544544444332211 11223467888888888888777 Q ss_pred CCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeeeeEeeeeccceE Q lcl|NC_015158. 89 PNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESGATRDTYFGPRA 168 (581) Q Consensus 89 ~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~~~~~~~~~p~i 168 (581) .|+.-++|.|+..+|.+.|+++..++.+-...|++....++.|.|.++-|.|.+.+-+..... ...++. .+| T Consensus 73 ~nr~d~~v~P~~~~d~~~Ae~l~~~~~~~~~~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~-d~~~~~-------~~i 144 (725) T protein:vir:77 73 QNPIDVLYRPKDGARPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQ-SPTSNN-------QVI 144 (725) T ss_pred hCCcceEEecCCccHHHHHHHHHHHHHHHHHhhCchhHHHHHHHHHhhcCcceeeeeecccCC-CCCCCc-------eee Confidence 588889999999999999999999999999999999999999999999999977664321110 001111 133 Q ss_pred Eec----chhheeecCCCC--CcccCceEE-EEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchhhhhccc Q lcl|NC_015158. 169 VRI----DPKDIVFNPVAV--DFAHSPKII-RTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDCEKAVG 241 (581) Q Consensus 169 e~V----~p~df~~DP~a~--~~~d~~~i~-r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~ 241 (581) .++ +|.++||||.++ +..||+||. +.+++++++..+.... +-... ++....+ T Consensus 145 ~~~~~~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~~~~---~~~~~------------------~~~~~~~ 203 (725) T protein:vir:77 145 RREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKY---DLDAD------------------DIPSFQN 203 (725) T ss_pred EEeecccChhhceeCchhhccChhhHHHHHHHhcCCHHHHHHHHhhC---Ccchh------------------hcccccc Confidence 333 466799999987 678998855 5566777666553211 10000 0000000 Q ss_pred cccccccccccccCCceEEEEEEeeee---------ecccCCceee---------------------------eeEE--E Q lcl|NC_015158. 242 FSMDGFGNLYDYFQSPYVEVLTFYGDY---------HDTQSGTFKR---------------------------NMKV--T 283 (581) Q Consensus 242 ~~~~~~~~~~~~~~~~~vevlE~~g~~---------~d~~~d~~~e---------------------------~~~i--t 283 (581) .. .....++..+.|+++|||-.. .+...|...+ -+.| . T Consensus 204 --~~--~~~~~~~~~d~vrv~E~~~r~~~~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~rv~~~ 279 (725) T protein:vir:77 204 --PN--DWVFPWLTQDTIQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKS 279 (725) T ss_pred --cc--cccccccCCCeeEEEEEEEEEEEeeEEEEecCCCCcceeecChhhHHHHHHHhhhcCchhhhhcccceeeeeEe Confidence 00 001223455678888888521 1111221100 0122 2 Q ss_pred EEeCCEEEEeecCCCccCCCCee-Eeccc-ccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc-ccc Q lcl|NC_015158. 284 IIDRMFVIEEKENPSWFAQAPIF-HCGWR-IRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD-VEE 360 (581) Q Consensus 284 v~~g~~iir~~~nP~~~g~~Pf~-~~~~~-~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d-~~~ 360 (581) ++.|..++. ...||+++.+||+ +++++ ++.+..|+.|+++.++|+|+.+|...+.++++++.+.+.++.+..+ .+. T Consensus 280 ~~~g~~~l~-~~~~~~~~~~P~vP~~g~r~~~~g~~~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i~~ 358 (725) T protein:vir:77 280 IITCTAVLK-DKQLIAGEHIPIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAG 358 (725) T ss_pred eecCceeec-cCCcCCCCccceEEEeeeeeccCCcccccchhhhhhhHHHHHHHHHHHHHHHHHhccccccccchhhhhH Confidence 245666664 4568899999987 45554 5788999999999999999999999999999999999988766432 111 Q ss_pred ---cccCC-------ceeEEeCCC----CCcccccCCCccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHH Q lcl|NC_015158. 361 ---FVWGP-------MEQIYINGD----GDVEMMAPNTQALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQ 426 (581) Q Consensus 361 ---i~~~p-------G~vi~~~~~----~~i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~ 426 (581) ...+| +..+..+++ +.+.+.++|..+.....+++.....++++|||...++|..+++ .++-.+++ T Consensus 359 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~i~~~~~~~lp~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~-~SG~ai~~ 437 (725) T protein:vir:77 359 FEHMYDGNDDYPYYLLNRTDENSGDLPTQPLAYYENPEVPQANAYMLEAATSAVKEVATLGVDTEAVNGGQ-VAFDTVNQ 437 (725) T ss_pred HHHHHHhccCCceecccccccCCCcccccCccccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhCCCchh-hHHHHHHH Confidence 11112 222333433 2456677788888889999999999999999999999987654 33444788 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceeeecCchhcccCCCccCH-------------HHhcCCce Q lcl|NC_015158. 427 LQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADTIRVFDSDDKVATFMNVNK-------------DDITAKGR 493 (581) Q Consensus 427 l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~iR~~~~~~~~~~~~~v~r-------------~di~~~~~ 493 (581) +.+++...+..+.++|.. .++.+.+.++.++.++++.+.++||+|++ +...++.++. .||.|.++ T Consensus 438 rq~qg~~~~~~~~Dnl~~-~~~~~g~~lL~lI~~~~~~~rv~RI~~ed-~~~~~v~in~~~~~~~~G~~~~~NDi~g~~D 515 (725) T protein:vir:77 438 LNMRADLETYVFQDNLAT-AMRRDGEIYQSIVNDIYDVPRNVTITLED-GSEKDVQLMAEVVDLATGEKQVLNDIRGRYE 515 (725) T ss_pred HHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHcCCCcEEEEecCC-CCcceeeecccccccccchhHhhhhhcccee Confidence 899999999999999995 45777778888888889999999999996 3445555543 57888898 Q ss_pred EEEecchhHH-HHHHHHHHHHHhhcccccccccchh-----------H---HHHHHHHHHHHhcCCCcccccCCCCcHHH Q lcl|NC_015158. 494 LRPVGARHFA-EQAQVVQSLMGIANTPVWQDIKPHV-----------S---TENLAKMLEHNLSLGGWDIFKPNVAVMEA 558 (581) Q Consensus 494 vva~ga~~~~-~r~q~~q~L~~~~~~~~~~~i~p~~-----------~---~~~l~~~~~e~~~l~~~~~~~~~~~~~~~ 558 (581) |++.-+.+.. .|++.++.|.++++.. +...|.. . ..++.+.+........ ...|. .+.++ T Consensus 516 v~v~~~p~~~s~r~~~~~~l~qll~~~--~~~~~~~~~~l~~~~~l~d~~~~~e~~erirkq~~~~~--~~q~~-~~~e~ 590 (725) T protein:vir:77 516 CYTDVGPSFQSMKQQNRAEILELLGKT--PQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMG--VKKPE-TPEEQ 590 (725) T ss_pred eEEeeccchHHHHHHHHHHHHHHHHhc--cccchhHHHHHHHhhccccchHHHHHHHHHHhhhhhhh--ccCCC-ChhhH Confidence 8666544444 4677677666665321 1111111 1 1122222222211111 11121 11111 Q ss_pred H-HHHHHHH-HHHHHHH-HHhc--------------ccCC Q lcl|NC_015158. 559 Q-TTSALVN-QSQAQIE-EEAQ--------------VPLV 581 (581) Q Consensus 559 ~-~~q~~~q-~aq~~~~-~~~~--------------~~~~ 581 (581) + .++.+++ ++|+..+ .++| ...+ T Consensus 591 q~~~~~qq~~~~q~~~e~~q~q~~~~~~qa~~~kaq~e~~ 630 (725) T protein:vir:77 591 QWLVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQTL 630 (725) T ss_pred HHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1 1111111 1111111 0011 1111 No 32 >protein:vir:78696 Length: 542 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285446;genbank:gi:148724480;genbank:GeneID:5220167 Probab=100.00 E-value=4.1e-42 Score=247.50 Aligned_cols=478 Identities=13% Similarity=0.096 Sum_probs=305.8 Q ss_pred HHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhcccccccccccccccccccccchHHHHHHHHHHHHHhhcC-CccEEEee Q lcl|NC_015158. 19 LAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTTTTTNSTLPWKNKTTLPKLCQIRDNLHSNYISALFP-NERWLKWE 97 (581) Q Consensus 19 ~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~~~~~~~~~~k~~~~~pki~~~~d~~~~~l~~~~f~-~~~~~~~~ 97 (581) .=...+++|+.+++.|++|+..|.+|.+|+.|+.....+........+++-+.-...+++|.+.||+.+|| +..||++. T Consensus 1 mk~~a~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF~l~ 80 (542) T protein:vir:78 1 MKGLAQARYSAMRADREDFLDMARRCAALTLPYLLTEDGHASGGRLQQPYQSLGSKGVNALSSKLMLSLFPIQTSFFKLQ 80 (542) T ss_pred ChhHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCCCCcccccc Confidence 22334578999999999999999999999999876555544444445666677789999999999999999 89999998 Q ss_pred cCChhHHH--------------HHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeeeeEeeee Q lcl|NC_015158. 98 GKSLQDEA--------------KRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESGATRDTY 163 (581) Q Consensus 98 ~~~~~d~~--------------~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~~~~~~~ 163 (581) +...+..+ --+.+++.+...|..|||+..+++.+.|++.+|+|++-++ + T Consensus 81 ~~d~~l~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~---~-------------- 143 (542) T protein:vir:78 81 INDAEIASVPELTPEVRSEIDMNLSKMEKMVMQQIAESSDRVQLTAAMKHLIVTGNVLVFAG---K-------------- 143 (542) T ss_pred CCHHHHHhhccCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEec---C-------------- Confidence 76533211 1244678899999999999999999999999999987442 1 Q ss_pred ccceEEecchhheeecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchhhhhccccc Q lcl|NC_015158. 164 FGPRAVRIDPKDIVFNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDCEKAVGFS 243 (581) Q Consensus 164 ~~p~ie~V~p~df~~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~ 243 (581) ..++.+++.++++..++..-=|+ .+.+..+|..+|.+..+... ....+.+.. ....+. T Consensus 144 --~~~~~~pl~~y~v~~d~~G~vd~-v~r~~~~t~~ql~~~fg~~~----l~~~~~~~~----~~~~~~----------- 201 (542) T protein:vir:78 144 --KTLKVYPLDRYVIERDGDGNVIE-IITRELVDRSLLPAEFQKQS----LLEGKDSNA----VGEDGP----------- 201 (542) T ss_pred --CCceEEecceeEEeeCCCCCeEE-EeeeeecCHHHHHHhhcccc----CchHHHhhc----cccCCC----------- Confidence 12456778889999886643332 34566779888876532111 111111000 000000 Q ss_pred cccccccccccCCceEEEEEEee-----eee--cccCCceeeeeEEEEEeCCEE-EEeecCCCccCCCCeeEecccccCC Q lcl|NC_015158. 244 MDGFGNLYDYFQSPYVEVLTFYG-----DYH--DTQSGTFKRNMKVTIIDRMFV-IEEKENPSWFAQAPIFHCGWRIRQD 315 (581) Q Consensus 244 ~~~~~~~~~~~~~~~vevlE~~g-----~~~--d~~~d~~~e~~~itv~~g~~i-ir~~~nP~~~g~~Pf~~~~~~~~p~ 315 (581) ++++++... +.| ..-+++.. .+..-++|..+ ....+.| +...||.+..|...++ T Consensus 202 --------------~~~v~~~v~pr~~~~~~~~~~~~~~~~--s~~~e~~g~~v~~~~~e~g--~~~~P~i~~Rw~~~~g 263 (542) T protein:vir:78 202 --------------KFGVAQGKGGRNDAEVFTCCKLVDGQH--RWHQECDGKEIKGSRSSSP--LKHSPWLPLRFNVVDG 263 (542) T ss_pred --------------eEEEEEEeecccCCccccccccCCCeE--EEEEEeccccccccccccc--cccCCceeeeeeecCC Confidence 011111110 000 00111111 11112344433 1233444 4678999999999999 Q ss_pred cccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc----cccc-ccCCceeEEeCCCCCccccc--CCCccch Q lcl|NC_015158. 316 NLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD----VEEF-VWGPMEQIYINGDGDVEMMA--PNTQALQ 388 (581) Q Consensus 316 ~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d----~~~i-~~~pG~vi~~~~~~~i~~~~--~p~~~~~ 388 (581) ..||+|+++...+..+.+|.+.+.++.+..++++|++.+..+ +..+ ...+|.+ ....++++++++ .+..... T Consensus 264 e~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~pp~lv~~~g~~~~~~~~~~~~g~i-v~g~~~~v~~~~~~~~~~~~~ 342 (542) T protein:vir:78 264 ESYGRGRVEEFFGDLSSLDALTRSLIEGSAAAAKVVFMVSPSATTKPQSLARAGTGAI-IQGRAEDVSVVQANKGADFRT 342 (542) T ss_pred CccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccchhhcccCCCcee-ecCCccceeeeecccccchhH Confidence 999999999999999999999999999999999999988653 4444 3445554 456777887665 3334555 Q ss_pred hHHHHHHHHHHHHHhcCCchHhcC-CCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccce Q lcl|NC_015158. 389 ADMQIQILEAKMEEFAGAPREAMG-IRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADT 467 (581) Q Consensus 389 ~~~~lq~~~~~~ee~TGv~~~~~G-~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~ 467 (581) +...|+.+++.+.+.. .+. ..+....|||+|.+..+.....+.-+..++..+++.|||...+.++.+....| T Consensus 343 ~~~~i~~~~~rI~~aF-----l~~~~~d~~rvTAtEV~~r~~E~~~~LG~v~~rl~~E~L~Pli~R~~~il~r~g~lP-- 415 (542) T protein:vir:78 343 VQEMIRDLSQRISDAF-----LILNVRQSERTTATEVREVQMELDRQLSGIYGSLTVELLTPYLNRKLHLMQRSKQLP-- 415 (542) T ss_pred HHHHHHHHHHHHHHHh-----cccccCCcccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCC-- Confidence 6777888888887653 222 22333579999999999999999999999999999999999888876654333 Q ss_pred eeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhccc---c-cccccchhHHHHHHHHHHHHhcC Q lcl|NC_015158. 468 IRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTP---V-WQDIKPHVSTENLAKMLEHNLSL 543 (581) Q Consensus 468 iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~---~-~~~i~p~~~~~~l~~~~~e~~~l 543 (581) .++.+.+ ++..+..-+-++|.+.++.|.+.++.. + .+.+...++..++++.+++..|+ T Consensus 416 --------------~~p~~lv----~~~~~s~La~~~r~~~~~~l~~~~~~i~~~~~p~~l~~~id~d~~~~~~a~~~Gv 477 (542) T protein:vir:78 416 --------------SLPKGLV----MPTVVAGLGGVGRGEDRAALIEFMQTVGQAMGPEALQQFIDPTEFLKRLAAASGI 477 (542) T ss_pred --------------CCchhce----eeeeechHHHHHHHHHHHHHHHHHHHHHHhcCChhHHhcCCHHHHHHHHHHHcCC Confidence 2223333 232232223345555555544433321 1 12334457888999999999999 Q ss_pred CCcccccCCCCcHHHHHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 544 GGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 544 ~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) +...+++..-.+..++++ ..+|+.|+.+ .++...+- T Consensus 478 p~~~i~~s~e~~~~~~~q-~q~~~~~~al-~~~a~~~a 513 (542) T protein:vir:78 478 DTLNLVKSPETMANEAQQ-AQQQQMTASL-MGQAGQLA 513 (542) T ss_pred CHhhccCCHHHHHHHHHH-HHHHHHHHHH-HHhhhhcc Confidence 976667663332222211 1111122222 11222222 No 33 >protein:vir:99672 Length: 532 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249587;genbank:gi:68299738;genbank:GeneID:3799987 Probab=100.00 E-value=6.2e-42 Score=246.52 Aligned_cols=489 Identities=11% Similarity=0.082 Sum_probs=318.1 Q ss_pred hccchhhhH-HHHHHHHHhhHHhhhhhHHHHHHHHHHHhhcccccccccccccccccccccchHHHHHHHHHHHHHhhcC Q lcl|NC_015158. 11 MLDDTRDGL-AEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTTTTTNSTLPWKNKTTLPKLCQIRDNLHSNYISALFP 89 (581) Q Consensus 11 ~~~~~~~~~-a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~~~~~~~~~~k~~~~~pki~~~~d~~~~~l~~~~f~ 89 (581) |.+..++.+ +..++++|+.+++.|++|+..|.+|.+|+.|+.....+.....+-.+++-+.-...+++|.+.||+.+|| T Consensus 1 m~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~LAa~L~~~ltp 80 (532) T protein:vir:99 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTSYTTPWQSIGARGLNNLASKLMLALFP 80 (532) T ss_pred CcchhhccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhhcccCCCCCcchhhccccccchHHHHHHHHHHHHHHhhcC Confidence 666444444 5888999999999999999999999999999975544444444445677777789999999999999999 Q ss_pred -CccEEEeecCChhHH-------------HHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeee Q lcl|NC_015158. 90 -NERWLKWEGKSLQDE-------------AKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEE 155 (581) Q Consensus 90 -~~~~~~~~~~~~~d~-------------~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~ 155 (581) +..||++.+...+.. ..-+..++.+...|..|||+..+++.+.|++.+|||++-+++..... T Consensus 81 p~~~WF~l~~~d~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~---- 156 (532) T protein:vir:99 81 VGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVE---- 156 (532) T ss_pred CCCccccccCCHHHHhccCCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEeccccccc---- Confidence 799999988764321 11245677888999999999999999999999999999877632210 Q ss_pred eeeEeeeeccceEEecchhheeecCCCCC-cccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccch Q lcl|NC_015158. 156 SGATRDTYFGPRAVRIDPKDIVFNPVAVD-FAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTRE 234 (581) Q Consensus 156 ~~~~~~~~~~p~ie~V~p~df~~DP~a~~-~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~ 234 (581) -....++.+++.++++..++.. +.. .+.|..++.+.|-+ ++.+... + +.++ T Consensus 157 -------~~~~~f~~~pl~~y~v~~d~~G~v~~--ivrr~~~~~~~l~e-------------~~~~~~~--~--~~~~-- 208 (532) T protein:vir:99 157 -------GQSNAPKLYKLHNFVVERDAYDNVLQ--IVTEDKIARAALPE-------------DVRKSLE--D--AQGD-- 208 (532) T ss_pred -------CcccceEEEEcCeEEEeeCCCCCeee--EeeeeeecHHhcCh-------------HHHHHhh--c--cccc-- Confidence 0123467788889999987653 332 24455666555411 1110000 0 0000 Q ss_pred hhhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccccC Q lcl|NC_015158. 235 DCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIRQ 314 (581) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p 314 (581) . .-...++|+++.- .+.++..+. ++....|..++.. ++-|+....||.+..|...+ T Consensus 209 -------------~-----~p~~~v~v~~~v~--~~~~~~~~~---~~~~~~g~~~~~~-~~~~~~~e~P~~~~Rw~~~~ 264 (532) T protein:vir:99 209 -------------Q-----NPSEEVTIYTHVY--RDPEAMVFR---SYQEIDGEIVAGT-EGEYPLDSCPWIPVRLIKMP 264 (532) T ss_pred -------------c-----CCCcceEEEEEEE--ecCCCCeeE---EEEeecCceeccc-ccccccccCCceeeeeeecC Confidence 0 0012355555441 122222222 2223345543332 34455667899999999999 Q ss_pred CcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc----cccc-ccCCceeEEeCCCCCcccccC--CCccc Q lcl|NC_015158. 315 DNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD----VEEF-VWGPMEQIYINGDGDVEMMAP--NTQAL 387 (581) Q Consensus 315 ~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d----~~~i-~~~pG~vi~~~~~~~i~~~~~--p~~~~ 387 (581) +..||.|+++...+..+.+|.+.+.++.+..++++|++.+..+ +..+ ...||.++. ..++++++++. +.... T Consensus 265 ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~~~~~g~~v~-g~~~~i~~~~~~~~~~~~ 343 (532) T protein:vir:99 265 NEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRVAKANTGDFVA-GRKQDVEVFQLEKYNDFQ 343 (532) T ss_pred CCccccchHHHHHHHHHHHHHHHHHHHHHHHHHcCCCceeccccccchhhhccCCCcceec-CCcccceeeecccccchh Confidence 9999999999999999999999999999999999999988643 4443 345666543 45567776653 33455 Q ss_pred hhHHHHHHHHHHHHHhcCCchH-hcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccc Q lcl|NC_015158. 388 QADMQIQILEAKMEEFAGAPRE-AMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVAD 466 (581) Q Consensus 388 ~~~~~lq~~~~~~ee~TGv~~~-~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~ 466 (581) .+...++.+.+.+.+.. -+ +....+....|||+|....+.....+.-+..++..+++.|||...+.++.+..-. T Consensus 344 ~~~~~i~~~~~rI~~af---~~~~~~~~d~~r~TAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~l-- 418 (532) T protein:vir:99 344 VAKATADDIEKRLSYAF---MLNSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKI-- 418 (532) T ss_pred HHHHHHHHHHHHHHHHH---hhhhcccCCCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCC-- Confidence 56677777777776643 11 1222333457999999999999999999999999999999999888876553221 Q ss_pred eeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhccc--ccccccchhHHHHHHHHHHHHhcCC Q lcl|NC_015158. 467 TIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTP--VWQDIKPHVSTENLAKMLEHNLSLG 544 (581) Q Consensus 467 ~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~--~~~~i~p~~~~~~l~~~~~e~~~l~ 544 (581) ..+ ++++.+...++ + .+.+.|+|+++.|.+.++.. +.+.+...++..++.+.+++..|++ T Consensus 419 --------------P~~-p~~~~~~~iv~--~-is~Laraq~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~GV~ 480 (532) T protein:vir:99 419 --------------PNL-PKEAVEPAIAT--G-LEALGRGHDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMD 480 (532) T ss_pred --------------CCC-Chhhcccceee--c-chHHHHHHHHHHHHHHHHHHHhhcchhhhhCCHHHHHHHHHHHhCCC Confidence 222 33343322222 2 23466777776665554432 2344556788889999999999996 Q ss_pred CcccccCCCCcHHHHHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 545 GWDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 545 ~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) -..++++.-++....+++ |++++++.+.++..... T Consensus 481 ~~~i~r~~ee~~~~~~q~--~~~~~~~~a~~~~~~~~ 515 (532) T protein:vir:99 481 TTGLILTQQDKQAKMAEA--STAAGMVTAGQQMGAAG 515 (532) T ss_pred hhhccCCHHHHHHHHHHH--HHHHHHHHHHHHHHHHH Confidence 656666633332222111 11111222222222222 No 34 >protein:vir:100920 Length: 725 # NCBI annotation: Gp1 # Family: family:all:487 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006406;genbank:gi:46358698;genbank:GeneID:2777070 Probab=100.00 E-value=6e-42 Score=246.58 Aligned_cols=521 Identities=12% Similarity=0.069 Sum_probs=307.7 Q ss_pred hccchhhhHHHHHHHHHhhHHhhhhhHHHHHHH-HHHHhhcccccccccccc--cccccccccchHHHHHHHHHHHHHhh Q lcl|NC_015158. 11 MLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSE-LRNYIFATDTTTTTNSTL--PWKNKTTLPKLCQIRDNLHSNYISAL 87 (581) Q Consensus 11 ~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~-~~~y~~~~~~~~~~~~~~--~~k~~~~~pki~~~~d~~~~~l~~~~ 87 (581) |.|. ... +..+...|+.+.....+-+.+ ..++-|+....+...... .-..+.+.++|...++++.++.. T Consensus 1 m~d~--~~~---~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~QW~~~~~~~l~~q~rp~~N~i~~~v~~v~g~e~--- 72 (725) T protein:vir:10 1 MADN--ENR---LESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQYRGQFDVVRPVVRKLVSEMR--- 72 (725) T ss_pred CCch--HHH---HHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcccchHHHHHHHHhhHH--- Confidence 6653 222 222233333332222221122 334545544444322211 11223467888888888888877 Q ss_pred cCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeeeeEeeeeccce Q lcl|NC_015158. 88 FPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESGATRDTYFGPR 167 (581) Q Consensus 88 f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~~~~~~~~~p~ 167 (581) .|+.-++|.|+..+|.+.|+++..++.+-...|++....++.|.+.++-|.|++.+-+...- ....++.+ . T Consensus 73 -~nr~d~~v~p~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~G~~ev~~d~~~-~d~~~~~~-------~ 143 (725) T protein:vir:10 73 -QNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYED-QSPTSNNQ-------V 143 (725) T ss_pred -hCCcceEEecCCcchHHHHHHHHHHHHHHHHhcCcchHHhHHHHHHhhcCcceeeeeccccC-CCCCCCce-------e Confidence 58888999999999999999999999999999999999999999999999998877543111 01111111 2 Q ss_pred EEec----chhheeecCCCC--CcccCceEE-EEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchhhhhcc Q lcl|NC_015158. 168 AVRI----DPKDIVFNPVAV--DFAHSPKII-RTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDCEKAV 240 (581) Q Consensus 168 ie~V----~p~df~~DP~a~--~~~d~~~i~-r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~ 240 (581) |.++ ++.+|||||.++ +..||+||. +.++++..+..+... + +....+...++... T Consensus 144 i~~~~i~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~~~~--------------~----~~~a~~~~~~~~~~ 205 (725) T protein:vir:10 144 IRREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWDDFAEK--------------Y----DLDADNIPSFQNPN 205 (725) T ss_pred eeeeecccCHhHcccCchhhccChhhhhhhhhhccCCHHHHHHHHHh--------------C----CCcccccccccccc Confidence 3332 456699999987 668998854 556676544322110 0 00010111111100 Q ss_pred ccccccccccccccCCceEEEEEEeeee---------ecccCCceee---------------------------eeEEE- Q lcl|NC_015158. 241 GFSMDGFGNLYDYFQSPYVEVLTFYGDY---------HDTQSGTFKR---------------------------NMKVT- 283 (581) Q Consensus 241 ~~~~~~~~~~~~~~~~~~vevlE~~g~~---------~d~~~d~~~e---------------------------~~~it- 283 (581) . ....++.++.|+|+|||-.. .+..+|...+ -+.|. T Consensus 206 ------~-~~~~~~~~~~vrv~E~~~r~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~ 278 (725) T protein:vir:10 206 ------D-WVFPWLTQDTIQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYK 278 (725) T ss_pred ------c-ccccccCCCeEEEEEEEEEEEEeeEEEEeccCCCCceeecchhhhHHHHHHhhcccchhhhhccceeeEEEE Confidence 0 01234456678888888522 1222232111 01222 Q ss_pred -EEeCCEEEEeecCCCccCCCCee-Eecc-cccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc-cc Q lcl|NC_015158. 284 -IIDRMFVIEEKENPSWFAQAPIF-HCGW-RIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD-VE 359 (581) Q Consensus 284 -v~~g~~iir~~~nP~~~g~~Pf~-~~~~-~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d-~~ 359 (581) ++.|.++|.. ..|++++.+||+ ++++ .++.+..|+.|+.+.++|+|+.+|..++.++++++++.+.++.+..+ ++ T Consensus 279 ~~~~g~~~l~~-~~~~~~~~fP~vP~~g~r~~~~g~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~ 357 (725) T protein:vir:10 279 SIITCTAVLKD-KQLIAGEHIPIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIA 357 (725) T ss_pred EeecchhhhcC-CCCCCCCceeEEEEEeeeeccCCcceeeeeeccchhHHHHHHHHHHHHHHHHHhcCCccccccHhhhh Confidence 2457766633 457788888887 3444 35788999999999999999999999999999999999988766432 11 Q ss_pred ---ccccCCceeEEe-------CCC----CCcccccCCCccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHH Q lcl|NC_015158. 360 ---EFVWGPMEQIYI-------NGD----GDVEMMAPNTQALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQ 425 (581) Q Consensus 360 ---~i~~~pG~vi~~-------~~~----~~i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~ 425 (581) ....+|..+..+ +++ +.+++.++|..+.....+++...+.++++|||...++|..+++ .++-.++ T Consensus 358 ~~e~~~~~~~~~~~~~~~~~~~~~g~~~~~~i~~~~~~~~p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~-~SG~ai~ 436 (725) T protein:vir:10 358 GFEHMYDGNDDYPYYLLNRTDENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQ-VAYDTVN 436 (725) T ss_pred HHHHHHhccCCceeeecccccccCcccccccCcccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhCcCchh-hHHHHHH Confidence 112233333222 222 2456677777788899999999999999999999999986643 3344488 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceeeecCchhcccCCCccCH-------------HHhcCCc Q lcl|NC_015158. 426 QLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADTIRVFDSDDKVATFMNVNK-------------DDITAKG 492 (581) Q Consensus 426 ~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~iR~~~~~~~~~~~~~v~r-------------~di~~~~ 492 (581) ++++++...+..+.+++.. .++.+.+.++.++.++++.+.++||+|++ |...++.++. .||.|++ T Consensus 437 ~rq~qg~~~l~~~~Dnl~~-~~~~~g~~lL~lI~~~~~~er~~RI~~ed-g~~~~v~in~~~~d~~~G~~v~~Ndi~g~~ 514 (725) T protein:vir:10 437 QLNMRADLETYVFQDNLAT-AMRRDGEIYQSIVNDIYDVPRNVTITLED-GSEKEVQLMAEVVDLATGERQVLNDIRGRY 514 (725) T ss_pred HHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHcCCCcEEEEecCC-CCcceeEeccccccccccchhhhhccccce Confidence 8999999999999999996 46777888888888889999999999996 4556666654 5788889 Q ss_pred eEEEecchhHHH-HHHHHHHHHHhhcccccccccchh--------------HHHHHHHHHHHHhcCCCcccccCCCCcHH Q lcl|NC_015158. 493 RLRPVGARHFAE-QAQVVQSLMGIANTPVWQDIKPHV--------------STENLAKMLEHNLSLGGWDIFKPNVAVME 557 (581) Q Consensus 493 ~vva~ga~~~~~-r~q~~q~L~~~~~~~~~~~i~p~~--------------~~~~l~~~~~e~~~l~~~~~~~~~~~~~~ 557 (581) +|++.-+.+..+ |.+.++.|.+++... +...|.. ...++.+.+.....-.+. -.|.....+ T Consensus 515 Dv~v~~~p~~~s~r~~~~~~l~qll~~~--~~~~~~~~~~l~~~~~~~d~~~~~e~~erirkq~~~~~~--~~~~~~e~~ 590 (725) T protein:vir:10 515 ECYTDVGPSFQSMKQQNRSEILELLGKT--PQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGV--KKPETPEEQ 590 (725) T ss_pred eEEEeeccCcHHHHHHHHHHHHHHHHhc--cccchhHHHHHHHHhhcCCchhHHHHHHHHHhhhhhhcc--CCccccchh Confidence 886665544444 677777666665421 1111211 111233333222111110 011111111 Q ss_pred HHHHHHHHH-HHHHHHHHH--------h-------ccc-------CC Q lcl|NC_015158. 558 AQTTSALVN-QSQAQIEEE--------A-------QVP-------LV 581 (581) Q Consensus 558 ~~~~q~~~q-~aq~~~~~~--------~-------~~~-------~~ 581 (581) ++.+++.++ ++|+..+.. . +.. ++ T Consensus 591 q~~~e~qq~~~~q~~~e~~q~~~~~~~~qae~~ka~aE~~k~~~~a~ 637 (725) T protein:vir:10 591 QWLVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQTLSLQIDAA 637 (725) T ss_pred HHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 101111000 000000000 0 000 00 No 35 >protein:vir:105520 Length: 706 # NCBI annotation: phage portal protein # Family: family:all:487 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516189;genbank:gi:89885992;genbank:GeneID:3964380 Probab=100.00 E-value=5.9e-42 Score=246.61 Aligned_cols=531 Identities=13% Similarity=0.082 Sum_probs=305.1 Q ss_pred hccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhccccccccccc--------ccccccccccchHHHHHHHHHH Q lcl|NC_015158. 11 MLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTTTTTNST--------LPWKNKTTLPKLCQIRDNLHSN 82 (581) Q Consensus 11 ~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~~~~~~~--------~~~k~~~~~pki~~~~d~~~~~ 82 (581) |.|..++ +-..+...|+...+..+.|-++..+=.+|++.....+..... ..++-.++.++|...++++.+. T Consensus 1 m~e~~~~-~~~~~~~~~~~~~~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~l~~~~q~~grP~~~~N~i~~~v~~v~g~ 79 (706) T protein:vir:10 1 MAESRQK-QHERVMLRFDRAWSPQQVVREKCIEATRFVRVPGGQWEGATVAGTKLDEQFEKYPKFEINKVATELNRIISE 79 (706) T ss_pred CCcchHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccCCHHHHHHHHhhhhhcCCCceEecchHHHHHHHhhH Confidence 6553222 222333333333333333333333333444333333333211 1234568899999999999999 Q ss_pred HHHhhcCCccEEEeecC-ChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeeeeEee Q lcl|NC_015158. 83 YISALFPNERWLKWEGK-SLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESGATRD 161 (581) Q Consensus 83 l~~~~f~~~~~~~~~~~-~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~~~~~ 161 (581) .. .|+--++|.|. ...|.+.|+++..++.+-..+|++....++.|.|+++.|.|++.+..... ...+... T Consensus 80 ~~----~nr~~~~v~P~~~~~d~~~Ae~l~~l~~~~~~~~~~~~a~s~Af~d~i~~G~G~~ev~~d~~-----~~~d~~~ 150 (706) T protein:vir:10 80 YR----NNRISVKFRPGDNAASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTTSFV-----NEYDPMD 150 (706) T ss_pred HH----hCCCceEEecCCCCchHHHHHHHHHHHHHHHHhcCchHHHHHHHHHHhhcCcceEEeeeccc-----cccCCCC Confidence 88 57777899995 45577889999999999999999999999999999999999777643211 1111111 Q ss_pred eeccceEEec-chh-heeecCCCC--CcccCceEE-EEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchhh Q lcl|NC_015158. 162 TYFGPRAVRI-DPK-DIVFNPVAV--DFAHSPKII-RTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDC 236 (581) Q Consensus 162 ~~~~p~ie~V-~p~-df~~DP~a~--~~~d~~~i~-r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~ 236 (581) ..-+..+++| +|+ +|||||.++ +..||.||. +.+++++++.++....+ ..... ..+.+...++ T Consensus 151 ~~~~i~i~~v~~p~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~fp~~~--~~~~~----------~~~~~~~~d~ 218 (706) T protein:vir:10 151 ERQRIAVEPIYDPARSVWFDPDAKKYDKSDALWAFCMYSVSLEKYQSEYDKAP--TSLDR----------VGSVSWQYDW 218 (706) T ss_pred CCccceeeeeccchhceecCchhcccChhhcceEeeeecCCHHHHHHhcCCCh--hhhhh----------hccccccccc Confidence 1123456666 687 689999975 789998865 56779999998843221 10000 0111111111 Q ss_pred hhccccccccccccccccCCceEEEEEEeeee--------------e---c-ccCCceee---------eeEEEEEeCCE Q lcl|NC_015158. 237 EKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDY--------------H---D-TQSGTFKR---------NMKVTIIDRMF 289 (581) Q Consensus 237 ~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~--------------~---d-~~~d~~~e---------~~~itv~~g~~ 289 (581) ... ....+.+||....+.++.+|... . + ...++... .....++.|.. T Consensus 219 ~~~------d~~~~~eyy~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~v~~~~~~g~~ 292 (706) T protein:vir:10 219 FTP------DVVYIAKYYEVRKESVDVISYRQPLTQEIATYDSEQIADIQDELEQAGFEEIGRRSVKRRRIYVAVVDGDG 292 (706) T ss_pred cCC------CcceecccccccceeEEEEEeeccccCCceeeccchhhhhHHHHhhCCchhhhhcccceeeEEEEeecccc Confidence 111 11122333333322233333210 0 0 11112111 01223356777 Q ss_pred EEEeecCCCccCCCCeeEe-ccc-ccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEe-cccc------- Q lcl|NC_015158. 290 VIEEKENPSWFAQAPIFHC-GWR-IRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVK-GDVE------- 359 (581) Q Consensus 290 iir~~~nP~~~g~~Pf~~~-~~~-~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~-~d~~------- 359 (581) ++ ...+||+++.+||+-+ ++. .+.++..+.|+.+.++|+|+.+|+.++.++++++++.+-...+. ++++ T Consensus 293 ~l-~~~~p~~~~~~P~vP~~g~r~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~ 371 (706) T protein:vir:10 293 FL-EKPRRIPGEHIPLIPVYGKRWFIDDVERVEGHIAKAMDPQRLYNLQVSMLADAAAQDPGQTPIVDMEQIRGLEQHWE 371 (706) T ss_pred cc-ccCCCCCCCccceEEEeeccccccccCcccceeccchhhHHHHHHHHHHHHHHHHhcCCcccccchhHHHHHHHHhh Confidence 77 4678999999998833 222 23566778899999999999999999999999988877544331 2211 Q ss_pred -------------ccccCCceeEEeCCCCCcccccCCCccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHH Q lcl|NC_015158. 360 -------------EFVWGPMEQIYINGDGDVEMMAPNTQALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQ 426 (581) Q Consensus 360 -------------~i~~~pG~vi~~~~~~~i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~ 426 (581) ++...+|.++.... .+....+|..+.....+++.....++++|||...++|..+ |.++-.+++ T Consensus 372 ~~~~~~~~~l~~~~~~~~~g~i~~~~~--~~~~~~~~~~~~~~~~l~~~~~~~i~~vsGi~~~~lG~~s--n~SG~Ai~~ 447 (706) T protein:vir:10 372 GRNRKRPAFLPLRTVTDKTGNVVAPAN--VAGYTQAPVLNQALAALLQQTSADIQEVTGSSQAMQQMPS--NVARETVNS 447 (706) T ss_pred hcccccccchhcccccCCCCccccccc--ccccCCCcchHHHHHHHHHHHHHHHHHHhCCCHHHcCCcc--chHHHHHHH Confidence 11122344443222 2234445555666778899999999999999999999744 334455888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceeeecCchhcccCCCccCH-------------HHh-cCCc Q lcl|NC_015158. 427 LQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADTIRVFDSDDKVATFMNVNK-------------DDI-TAKG 492 (581) Q Consensus 427 l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~iR~~~~~~~~~~~~~v~r-------------~di-~~~~ 492 (581) +..+++..+.++.++|... ++.+.+.++.++.++++.+.++||+|++ +...++.++. .|| .|.+ T Consensus 448 rq~qg~~~~~~~~Dnl~~~-~~~~g~~lL~li~~~y~~~R~~RI~~ed-~~~~~v~in~~~~d~~~G~~~~~nDi~~g~y 525 (706) T protein:vir:10 448 LLNRSDMASFIYLDNMAKS-LKRAGEIWLSMAREIYGSDREVRIVHED-GTDDIALMNAAVLDNQTGRVVALNDLSTGRY 525 (706) T ss_pred HHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHcCCCcEEEEecCC-CCccceeeccceeccccCceeeeecceeeeE Confidence 9999999999999999964 5777778888888889999999999987 4445555542 255 5667 Q ss_pred eEEEecchhHH-HHHHHHHHHHHhhcccccccccchhHHHHHHHHHHHHhcCCCcccc----------cCCCCcHHHHHH Q lcl|NC_015158. 493 RLRPVGARHFA-EQAQVVQSLMGIANTPVWQDIKPHVSTENLAKMLEHNLSLGGWDIF----------KPNVAVMEAQTT 561 (581) Q Consensus 493 ~vva~ga~~~~-~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~~~e~~~l~~~~~~----------~~~~~~~~~~~~ 561 (581) +|++.-+.+.. .|.+..+.|.++++... ...|.. ..+...+.++.++++-+-+ ...+.+..++++ T Consensus 526 Dv~i~~~p~~~t~r~~~~~~m~el~~~~~--p~~~~~--~~l~~~~~~~~d~p~~~e~~e~irk~~~~q~~~~~~~~~eq 601 (706) T protein:vir:10 526 DVSVDVGPSYSARRDATVNALTQLLQGML--PQDPMR--PALMGIIIDNMEGEGLDDFKAFNRRQLLTQGIVKPRNQQEQ 601 (706) T ss_pred EEEEecccCcchHHHHHHHHHHHHHHhcC--Ccchhh--HHHHHHHHhhcCccchHHHHHHHHHhhcccCCccccchhHH Confidence 77666444444 46766677777765310 011100 1223334444444442111 000111111112 Q ss_pred HHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 562 SALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 562 q~~~q~aq~~~~~~~~~~~~ 581 (581) ++++| +|+.-+.+++..+. T Consensus 602 ~~~~q-~qq~q~~q~~~~~~ 620 (706) T protein:vir:10 602 AIVQQ-AQQAQATQPDPNML 620 (706) T ss_pred HHHHH-HHHHHHHHHHHHHH Confidence 22111 11100111111111 No 36 >protein:vir:9263 Length: 725 # NCBI annotation: 1 # Family: family:all:487 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720327;genbank:gi:24371585;genbank:GeneID:955785 Probab=100.00 E-value=1.1e-41 Score=245.10 Aligned_cols=520 Identities=13% Similarity=0.063 Sum_probs=308.6 Q ss_pred hccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhccccccccccccc--ccccccccchHHHHHHHHHHHHHhhc Q lcl|NC_015158. 11 MLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTTTTTNSTLP--WKNKTTLPKLCQIRDNLHSNYISALF 88 (581) Q Consensus 11 ~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~~~~~~~~~--~k~~~~~pki~~~~d~~~~~l~~~~f 88 (581) |.|. ++.-..+-..|+...+....|-++ ...++-|+....+....... -....+.++|...++++.++.. T Consensus 1 m~d~--~~~~~~~~~~~~~~~~~~~~~r~~--a~~d~~fy~G~Qw~~~~~~~l~~q~rp~~N~i~~~i~~v~g~e~---- 72 (725) T protein:vir:92 1 MADN--ENRLESILSRFDADWTASDEARRE--AKNDLFFSRISQWDDWLSQYTTLQYRGQFDVVRPVVRKLVSEMR---- 72 (725) T ss_pred CCch--HHHHHHHHHHHHHHHHhhHHHHHH--HHHHHHhhcCCCCCHHHHHHHHhcCCCcccchHHHHHHHHhhHH---- Confidence 6663 233333333333222222222111 13455555444443222111 1223457888888888888777 Q ss_pred CCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeeeeEeeeeccceE Q lcl|NC_015158. 89 PNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESGATRDTYFGPRA 168 (581) Q Consensus 89 ~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~~~~~~~~~p~i 168 (581) .|+.-++|.|+..+|.+.|+++..++.+-...|++....++.|.|+++-|.|.+.+-+..... ...++ ...| T Consensus 73 ~nr~d~~v~P~~~~d~~~Ae~l~~~~~~~~~~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~-d~~~~-------~~~i 144 (725) T protein:vir:92 73 QNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNVAVREQIESGVGAWRLVTDYEDQ-SPTSN-------NQVI 144 (725) T ss_pred hCCcceEEecCCccHHHHHHHHHHHHHHHHHhhCchHHHHHHHHHHhhcCcceeeeeecccCC-CCCCC-------ceee Confidence 588889999999999999999999999999999999999999999999999976654321100 00111 1133 Q ss_pred Eec---chh-heeecCCCC--CcccCceE-EEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchhhhhccc Q lcl|NC_015158. 169 VRI---DPK-DIVFNPVAV--DFAHSPKI-IRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDCEKAVG 241 (581) Q Consensus 169 e~V---~p~-df~~DP~a~--~~~d~~~i-~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~ 241 (581) .++ +|+ ++||||.++ +..||+|| ++.+++++++..+....+ -...+ +....+ T Consensus 145 ~~~~i~~~~~~V~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~---~~~~~------------------~~~~~~ 203 (725) T protein:vir:92 145 RREPIHSACSHVIWDSNSKLMDKSDSRHCTVIHSMSQNGWEDFAEKYD---LDADD------------------IPSFQN 203 (725) T ss_pred EEeeccCChhhcccCchhhccChhhHHHHHHHhcCCHHHHHHHHhhcC---cchhh------------------hhhccc Confidence 332 455 599999987 67899884 566778776655432111 00011 100000 Q ss_pred cccccccccccccCCceEEEEEEeeee---------ecccCCceee---------------------------eeEEE-- Q lcl|NC_015158. 242 FSMDGFGNLYDYFQSPYVEVLTFYGDY---------HDTQSGTFKR---------------------------NMKVT-- 283 (581) Q Consensus 242 ~~~~~~~~~~~~~~~~~vevlE~~g~~---------~d~~~d~~~e---------------------------~~~it-- 283 (581) . .. ....++..+.|+|+|||-.. .+..+|...+ -+.|. T Consensus 204 ~--~~--~~~~~~~~d~vrv~e~~~r~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~ 279 (725) T protein:vir:92 204 P--ND--WVFPWLTQDTIQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKS 279 (725) T ss_pred C--Cc--ccccccCCCeEEEEEEEEEEEEeeeEEeecCCCCCceeecChhhHHHHHHHHhccCchhhhhccceeeeEeee Confidence 0 00 01223455678888887521 1222222111 01222 Q ss_pred EEeCCEEEEeecCCCccCCCCee-Eecc-cccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc-ccc Q lcl|NC_015158. 284 IIDRMFVIEEKENPSWFAQAPIF-HCGW-RIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD-VEE 360 (581) Q Consensus 284 v~~g~~iir~~~nP~~~g~~Pf~-~~~~-~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d-~~~ 360 (581) ++.|.++|.. ..|++++.+||+ ++++ .++.+..|+.|+.+.++|+|+.+|..++.++++++.+.+.++.+..+ +++ T Consensus 280 ~~~g~~~l~~-~~~~~~~~~P~vP~~g~r~~~~g~~~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i~~ 358 (725) T protein:vir:92 280 IITCTAVLKD-KQLIAGEHIPIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAG 358 (725) T ss_pred eecchhhhcC-CCCCCCCceeeEEEEeeeeccCCcccccceeccchhHHHHHHHHHHHHHHHHHhccCcccccchhhhhH Confidence 2456766644 457888888987 3333 35788999999999999999999999999999999999988766432 111 Q ss_pred ---cccCCceeE-------EeCCC----CCcccccCCCccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHH Q lcl|NC_015158. 361 ---FVWGPMEQI-------YINGD----GDVEMMAPNTQALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQ 426 (581) Q Consensus 361 ---i~~~pG~vi-------~~~~~----~~i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~ 426 (581) ...+|..+. ..+++ +.+.+.++|..+.....+++...+.++++|||...++|..+++ .++-.+++ T Consensus 359 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~i~~~~~~~~p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~-~SG~ai~~ 437 (725) T protein:vir:92 359 FEHMYDGNDDYPYYLLNRTDENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQ-VAYDTVNQ 437 (725) T ss_pred HHHHHhccCccceeeccccccccccccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhccCchh-hHHHHHHH Confidence 111222221 11221 2456666777788889999999999999999999999986643 33444888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceeeecCchhcccCCCccCH-------------HHhcCCce Q lcl|NC_015158. 427 LQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADTIRVFDSDDKVATFMNVNK-------------DDITAKGR 493 (581) Q Consensus 427 l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~iR~~~~~~~~~~~~~v~r-------------~di~~~~~ 493 (581) ++++++..+..+.++|.. ..+.+.+.++.++.++++.+.++||+|++ |...++.++. .||.|+++ T Consensus 438 rq~qg~~~l~~~~Dnl~~-~~~~~g~~lL~lI~~~~~~~r~~RI~~ed-g~~~~v~in~~~~~~~~G~~~~~Ndi~g~~D 515 (725) T protein:vir:92 438 LNMRADLETYVFQDNLAT-AMRRDGEIYQSIVNDIYDVPRNVTITLED-GSEKEVQLMAEVVDLATGERQVLNDIRGRYE 515 (725) T ss_pred HHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCCCcEEEEecCC-CCcceEEeccccccccccchhhhhcccccee Confidence 999999999999999996 46777888888888889999999999986 5666776664 58888898 Q ss_pred EEEecchhHH-HHHHHHHHHHHhhcccccccccch-----------hHH---HHHHHHHHHHhcCCCcccccCCCCcHHH Q lcl|NC_015158. 494 LRPVGARHFA-EQAQVVQSLMGIANTPVWQDIKPH-----------VST---ENLAKMLEHNLSLGGWDIFKPNVAVMEA 558 (581) Q Consensus 494 vva~ga~~~~-~r~q~~q~L~~~~~~~~~~~i~p~-----------~~~---~~l~~~~~e~~~l~~~~~~~~~~~~~~~ 558 (581) |++.-+.+.. .|++..+.|.+++... +++.|. ... .++.+.+.....-.. ...+...++ T Consensus 516 v~v~~~p~~~s~r~~~~~~l~ql~~~~--~~~~~~~~~~l~~~~~~~d~~~~~e~~erirkq~~~~~----~~~~~~~e~ 589 (725) T protein:vir:92 516 CYTDVGPSFQSMKQQNRAEILELLGKT--PQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMG----VKKPETPEE 589 (725) T ss_pred eEEeeccChHHHHHHHHHHHHHHHHhc--ccchhHHHHHHHHHhhcccchHHHHHHHHHHhhhchhc----cCCccchhh Confidence 8666554444 4666667666665321 111111 111 122222222211111 122222221 Q ss_pred HH--HHHHHH---HHHHHHH------HHhcccC-------C Q lcl|NC_015158. 559 QT--TSALVN---QSQAQIE------EEAQVPL-------V 581 (581) Q Consensus 559 ~~--~q~~~q---~aq~~~~------~~~~~~~-------~ 581 (581) ++ .++.++ ++++++. .+.|..| . T Consensus 590 ~q~~~~~qqa~~~q~~~e~~~~qa~~~~~qae~~kaqaE~~ 630 (725) T protein:vir:92 590 QQWLVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQTL 630 (725) T ss_pred hHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11 111111 0000000 0001111 1 No 37 >protein:vir:172 Length: 708 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112077;genbank:gi:13559867;genbank:GeneID:920970 Probab=100.00 E-value=3.7e-41 Score=242.24 Aligned_cols=524 Identities=14% Similarity=0.105 Sum_probs=303.1 Q ss_pred hccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhcccccccccccc--------cccccccccchHHHHHHHHHH Q lcl|NC_015158. 11 MLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTTTTTNSTL--------PWKNKTTLPKLCQIRDNLHSN 82 (581) Q Consensus 11 ~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~~~~~~~~--------~~k~~~~~pki~~~~d~~~~~ 82 (581) |.|. .+++-..+-..|+.+.+.-+.+-++|.+-..|-++-...+...... .++-.+|.++|...++.+.+. T Consensus 1 ma~~-~~~~~~~~~~r~~~~~~~~~~~r~~~~~d~~f~~y~G~Qw~~~~~~~l~~~~q~~~rP~~~~N~i~~~i~~v~g~ 79 (708) T protein:vir:17 1 MAET-LEKKHERIMLRFDRAYSPQQEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAE 79 (708) T ss_pred Cchh-HHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhccCCCCCCHHHHHHHHhhhhhcCCCceEEcchHHHHHHHHhh Confidence 5542 2233333333444444444555566655333333333344332111 122357889999999999998 Q ss_pred HHHhhcCCccEEEeecCChh-HHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeeeeEee Q lcl|NC_015158. 83 YISALFPNERWLKWEGKSLQ-DEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESGATRD 161 (581) Q Consensus 83 l~~~~f~~~~~~~~~~~~~~-d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~~~~~ 161 (581) .. .|+--++|.|+.++ |.+.|+++..++.+-..+|++..+.++.|.++++-|.|++.+-. ...+.-+... T Consensus 80 e~----~nr~d~~v~p~~~~~d~~~Ae~l~~l~~~~~~~~~~~~~~s~Af~~~i~~G~G~~~~~~-----d~~~e~d~~~ 150 (708) T protein:vir:17 80 YR----NNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTS-----MLVNEYDPMD 150 (708) T ss_pred Hh----hCCcceEEecCCCcchHHHHHHHHHHHHHHHHhcCchhHHhHHHHHhhhcccceeeeee-----cccccCCCCC Confidence 88 68888999999754 77889999999999999999999999999999999999665532 1111111011 Q ss_pred eeccceEEec--chhheeecCCCC--CcccCceE-EEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchhh Q lcl|NC_015158. 162 TYFGPRAVRI--DPKDIVFNPVAV--DFAHSPKI-IRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDC 236 (581) Q Consensus 162 ~~~~p~ie~V--~p~df~~DP~a~--~~~d~~~i-~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~ 236 (581) ...+..|+++ ++-+|||||.++ +..||+|| ++.++++++++++...... .+ .+.. ...+ T Consensus 151 ~~~~i~i~~~~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~yp~~a~-----~~-~~~~---------~~~~- 214 (708) T protein:vir:17 151 DRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPP-----AS-LDVT---------SMTS- 214 (708) T ss_pred CccccceEeeccchhheecCccccccChhhhhhhhhhccCCHHHHHHhCccccc-----hh-hhhh---------hhcc- Confidence 1122344443 446899999986 67899885 5667799999887422110 00 0000 0000 Q ss_pred hhccccccccccccccccCCceEEEEEEeeeee---------cccCCcee---------------------------eee Q lcl|NC_015158. 237 EKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYH---------DTQSGTFK---------------------------RNM 280 (581) Q Consensus 237 ~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~---------d~~~d~~~---------------------------e~~ 280 (581) ..+.++.++.|+|.|||.... +..+|.+. .-+ T Consensus 215 ------------~~~~~~~~d~vrv~e~~~r~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~ 282 (708) T protein:vir:17 215 ------------WEYDWFDADVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFQEVARRSVKRR 282 (708) T ss_pred ------------ccccccCCCeEEEEEEEEEeeeeeEEEEEecCccCceeeeCccchhhHHHHHHhcccccceeeeeeEE Confidence 012345567788888874211 11112110 011 Q ss_pred EE--EEEeCCEEEEeecCCCccCCCCeeEe-ccc-ccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEec Q lcl|NC_015158. 281 KV--TIIDRMFVIEEKENPSWFAQAPIFHC-GWR-IRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKG 356 (581) Q Consensus 281 ~i--tv~~g~~iir~~~nP~~~g~~Pf~~~-~~~-~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~ 356 (581) .| ..+.|..++ ...+|+|++.+||+-+ ++. .+.+.....|+.+.++|+|+.+|..++.++++++++.+.+++++. T Consensus 283 ~v~~~~~~g~~~l-~~~~~~p~~~fP~vP~~g~r~~~d~~~~~yG~vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~ 361 (708) T protein:vir:17 283 RVYVSVVDGDGFL-EKPRRIPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGM 361 (708) T ss_pred EEEEEeecccccc-cCCCCCCCCccceEEEecccccccCCCcccchhhhchhHHHHHHHHHHHHHHHHHhcCCcceeech Confidence 11 223466555 4567888888888643 222 234555456889999999999999999999999999998886642 Q ss_pred c----c--------cc----c-cc-CCceeEEe-CCCCCcccccCCCccchhHHHHHHHHHHHHHhcCCchHhcCCCCcc Q lcl|NC_015158. 357 D----V--------EE----F-VW-GPMEQIYI-NGDGDVEMMAPNTQALQADMQIQILEAKMEEFAGAPREAMGIRTPG 417 (581) Q Consensus 357 d----~--------~~----i-~~-~pG~vi~~-~~~~~i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~ 417 (581) . . ++ . .+ .++.+-.+ +.+..+..+++|..+.....+++.....+++.|||...++|..+ T Consensus 362 ~a~~g~~~~~~~~~~~~~~~~~~~~~~~~~g~v~~~a~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGi~d~~~G~~s-- 439 (708) T protein:vir:17 362 EQIRGLEKHWEARNKKRPAFLPLREVRDKYGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPS-- 439 (708) T ss_pred hhhhhhHHhhhhcccchhhhhhhhccCCcccccccccCCcccCCCccccHHHHHHHHHHHHHHHHhcCCChHHccCcc-- Confidence 1 0 00 0 01 12332222 22334556677777878888899999999999999999999633 Q ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceeeecCchhcccCCCccCH------------ Q lcl|NC_015158. 418 EKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADTIRVFDSDDKVATFMNVNK------------ 485 (581) Q Consensus 418 ~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~iR~~~~~~~~~~~~~v~r------------ 485 (581) |.++-.++++.++++..+..+.+++... .+.+.+.++.++.++++.+.++||+|+| |...++.++. T Consensus 440 n~SG~Ai~~rq~qg~~~~~~~~Dnl~~~-~~~~g~~lL~lI~~~y~~~R~~RI~~ed-g~~~~v~in~~~~d~~~g~~~~ 517 (708) T protein:vir:17 440 NIAQETVNNLMNRADMASFIYLDNMAKS-LKRAGEVWLSMAREVYGSEREVRIVNED-GSDDIAVLSAQVVDRQTGAVVA 517 (708) T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHcCCCcEEEEecCC-CCcceeeecceeccCCCcccee Confidence 4334447888999999999999999953 5667777777778888999999999986 3333443321 Q ss_pred -HHh-cCCceEEEecchhH-HHHHHHHHHHHHhhccc-----ccccccch-------hHHHHHHHHHHHHhcCCCccccc Q lcl|NC_015158. 486 -DDI-TAKGRLRPVGARHF-AEQAQVVQSLMGIANTP-----VWQDIKPH-------VSTENLAKMLEHNLSLGGWDIFK 550 (581) Q Consensus 486 -~di-~~~~~vva~ga~~~-~~r~q~~q~L~~~~~~~-----~~~~i~p~-------~~~~~l~~~~~e~~~l~~~~~~~ 550 (581) .|| .|.++|+..-+.+. ..|++.++.|.++++.. +.+.+.+. -+..++++.+.+.....+. .. T Consensus 518 ~nDi~~g~~Dv~v~~~p~~~t~r~~~~~~l~qll~~~~~~~~~~~~~~~l~l~~~D~p~~~ei~e~ir~~~~~~~~--~~ 595 (708) T protein:vir:17 518 LNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSMLPADPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGI--AK 595 (708) T ss_pred eccceeeeeeEEEecccCchhHHHHHHHHHHHHHHhcCCccchhHHHHHHHHHhcCCCChHHHHHHHHHHhhcccc--cc Confidence 344 35667655443333 44666666665554321 11111111 1112333333333222221 12 Q ss_pred CCCCcHHHHHHHHHHH----HHHHHHHHHhcccCC Q lcl|NC_015158. 551 PNVAVMEAQTTSALVN----QSQAQIEEEAQVPLV 581 (581) Q Consensus 551 ~~~~~~~~~~~q~~~q----~aq~~~~~~~~~~~~ 581 (581) |..+ .++++.+.++| ++++ .+.++|.-+. T Consensus 596 ~~~~-e~~q~~~q~qq~~q~q~~~-~~~eaqa~~~ 628 (708) T protein:vir:17 596 PRNE-KEQQIVQQAQMAAQSQPNP-EMVLAQAQMV 628 (708) T ss_pred Ccch-hhHHHHHHHHHHHHHHHHH-HHHHHHHHHH Confidence 2111 11111111111 1111 1111111111 No 38 >protein:vir:100039 Length: 522 # NCBI annotation: T7-like head-to-tail connector # Family: family:all:481 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214201;genbank:gi:61806424;genbank:GeneID:3294719 Probab=100.00 E-value=1.7e-40 Score=238.59 Aligned_cols=475 Identities=13% Similarity=0.072 Sum_probs=311.0 Q ss_pred HHHHHHHhhHHhhhhhHHHHHHHHHHHhhcccccccccccc--cccccccccchHHHHHHHHHHHHHhhcC-CccEEEee Q lcl|NC_015158. 21 EQIANTWQNWNSQRQEWLSQKSELRNYIFATDTTTTTNSTL--PWKNKTTLPKLCQIRDNLHSNYISALFP-NERWLKWE 97 (581) Q Consensus 21 ~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~~~~~~~~--~~k~~~~~pki~~~~d~~~~~l~~~~f~-~~~~~~~~ 97 (581) =.++++|+..+..|++|+..|.+|.+|..|+.....+.... ....+++-+--...+++|.+.||..+|| +..||++. T Consensus 1 m~~~~r~~~L~~~R~~~e~~w~e~~~~tlP~~~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~ 80 (522) T protein:vir:10 1 MKARERYNQLTTARQMFLDKAVECSELTLPYLIDDDISSRPNHKSLTVPWQSVGAKCCVTLAAKLMLAVLPPQTSFFKLQ 80 (522) T ss_pred CchHHHHHHHHHHhhHHHHHHHHHHHHhhhcccCCCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCCCCcccccc Confidence 44788999999999999999999999999986554332221 1122355566678889999999999999 79999998 Q ss_pred cCChhHH------------HHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeeeeEeeeecc Q lcl|NC_015158. 98 GKSLQDE------------AKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESGATRDTYFG 165 (581) Q Consensus 98 ~~~~~d~------------~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~~~~~~~~~ 165 (581) +...+.. +.-+.+++.+...|..|||+..+++.+.|++.+|||++-++ +. T Consensus 81 ~~d~~l~~~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~---~~--------------- 142 (522) T protein:vir:10 81 VRDDKLGEELDPQIRSELDLSFSKMERMIMDYIAASNDRVAVHQALKHLIVGGNALIFMG---KD--------------- 142 (522) T ss_pred CChHHHhhhcChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCceeEEEc---CC--------------- Confidence 7664321 12355777888999999999999999999999999996432 11 Q ss_pred ceEEecchhheeecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchhhhhccccccc Q lcl|NC_015158. 166 PRAVRIDPKDIVFNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDCEKAVGFSMD 245 (581) Q Consensus 166 p~ie~V~p~df~~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 245 (581) .+..+++-++++..++..-=++ ++.|..+|..+|.+-..... +. ..... T Consensus 143 -~~~~~pl~~y~v~~d~~G~vd~-i~r~~~~t~~ql~~~fg~~~--------~~--------------~~~~~------- 191 (522) T protein:vir:10 143 -GLKTFPLTRYVINRDGDGNVLE-IVTKELISRKVLDIELPEPK--------PN--------------TGIDE------- 191 (522) T ss_pred -CceEEEcceEEEeeCCCCCeeE-EEeeeeccHHHHHHhcchhc--------cc--------------hhhhc------- Confidence 1345677899999876542222 34566779888764321100 00 00000 Q ss_pred cccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccccCCcccCCCcHHh Q lcl|NC_015158. 246 GFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIRQDNLYAMGPLDN 325 (581) Q Consensus 246 ~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p~~~~G~s~~~~ 325 (581) . + -....++++++.- ...+.+.+ .+..-..|+ ++...+.-+++...||.+..|...++..||.|+++. T Consensus 192 ~----~--~~~~~v~v~~~v~--p~~~~~~~---~~~~~~~~~-~~~~~~s~~g~~~~P~~~~Rw~~~~ge~YGrgp~~~ 259 (522) T protein:vir:10 192 S----S--TTNDDVTIYTYVK--LDKSSGRW---VWHQEAFDK-IIPDSRSTAPKNASPWLPLRFNTVDGEDYGRGRVEE 259 (522) T ss_pred c----c--CCCCceEEEEEEE--eeccCCce---EEEEccCCc-cccccccccccccCCceeeeeeecCCCccccchHHH Confidence 0 0 0012355655431 12222332 111112333 333323334567789999999999999999999999 Q ss_pred hhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc----ccccccCCceeEEeCCCCCcccccCC--CccchhHHHHHHHHHH Q lcl|NC_015158. 326 LVGMQYRIDHLENLKADVFDLIAFPPMKVKGD----VEEFVWGPMEQIYINGDGDVEMMAPN--TQALQADMQIQILEAK 399 (581) Q Consensus 326 l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d----~~~i~~~pG~vi~~~~~~~i~~~~~p--~~~~~~~~~lq~~~~~ 399 (581) ..+..+.+|.+.+.++.+..++++|.+.+..+ +.++...+++.+...+++++.+++.. .....+...++.+.+. T Consensus 260 ~l~D~k~L~~l~~~~~~~~~~a~~p~~lv~~~~~~~~~~l~~~~~~~~v~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~r 339 (522) T protein:vir:10 260 FLGDLKSLDGLSQSLIEGAAAASKVVFLVSPSSTTKPATIAKAGNGAIVQGRPEDVAVIQVGKTADFSTAANMATAIEKR 339 (522) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCceeeccccccccccccCCCCcceecCCCccceeecccccccchHHHHHHHHHHHH Confidence 99999999999999999999999999988543 44443344444556667777766533 3344566677777777 Q ss_pred HHHhcCCchHhcCCC-CcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceeeecCchhccc Q lcl|NC_015158. 400 MEEFAGAPREAMGIR-TPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADTIRVFDSDDKVA 478 (581) Q Consensus 400 ~ee~TGv~~~~~G~~-~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~iR~~~~~~~~~ 478 (581) +.+. |.++.. .....|||+|....+.....+.-+..++..+++.|||...+.++.+..-.| T Consensus 340 i~~a-----Fl~~~~~d~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lP------------- 401 (522) T protein:vir:10 340 LLEA-----FLVMNVRNAERVTAEEVRLTQLELEQQLGGIFSLLVIEFLIPYLNRTLLVLQRSNQIP------------- 401 (522) T ss_pred HHHH-----HhhccCCCCCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCC------------- Confidence 7654 444443 334679999999999999999999999999999999999888765533222 Q ss_pred CCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhccc---c-cccccchhHHHHHHHHHHHHhcCCCcccccCCCC Q lcl|NC_015158. 479 TFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTP---V-WQDIKPHVSTENLAKMLEHNLSLGGWDIFKPNVA 554 (581) Q Consensus 479 ~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~---~-~~~i~p~~~~~~l~~~~~e~~~l~~~~~~~~~~~ 554 (581) .-++++-....| .+- +-+.|+|++++|.+.++.. . -..+...++..++++.+++..|++.-.++++.-+ T Consensus 402 ----~~p~~~~~~~~v--~~i-s~Laraq~~~~l~~~~~~i~~~~~p~~~~~~id~d~~~~~~a~~~Gvp~~~ivrt~ee 474 (522) T protein:vir:10 402 ----KLPKDIVRPTIV--AGV-NALGRGQDRESLTAFVGTIAQTLGPEALMQYLNPLEAIKRLAAAQGIDVLNLVKTEQQ 474 (522) T ss_pred ----CCCccccccccc--cch-hHHHHHHHHHHHHHHHHHHHHhhCchhhhhcCCHHHHHHHHHHHhCCChhhhcCCHHH Confidence 122334222222 222 3356777777766665542 1 1234456778889999999999986567776555 Q ss_pred cHHHH--HHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 555 VMEAQ--TTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 555 ~~~~~--~~q~~~q~aq~~~~~~~~~~~~ 581 (581) +.+.+ .+|+++++++++.+.......+ T Consensus 475 v~~~~q~~q~~~~~~~~~~~a~~~~~~~~ 503 (522) T protein:vir:10 475 LAEEQQAAQQQAAQQSLVDQAGQMTGSPL 503 (522) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcccc Confidence 43322 2222222333333322222222 No 39 >protein:vir:105429 Length: 708 # NCBI annotation: gene 3 protein # Family: family:all:487 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958179;genbank:gi:41057281;genbank:GeneID:2716676 Probab=100.00 E-value=2.6e-40 Score=237.56 Aligned_cols=521 Identities=14% Similarity=0.122 Sum_probs=301.2 Q ss_pred hccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhccccccccccc--------ccccccccccchHHHHHHHHHH Q lcl|NC_015158. 11 MLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTTTTTNST--------LPWKNKTTLPKLCQIRDNLHSN 82 (581) Q Consensus 11 ~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~~~~~~~--------~~~k~~~~~pki~~~~d~~~~~ 82 (581) |.| .++.+-..+...|+...+..+.|-++..+=.+|++.....+....+ ..++-.+|.++|...++.+.+. T Consensus 1 m~~-~~~~~~~~~~~~~~~~~~~~~~~r~~~~~D~~f~~~~G~QW~~~~~~~l~~~~q~~grP~~~~N~i~~~v~~v~g~ 79 (708) T protein:vir:10 1 MAE-TLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAE 79 (708) T ss_pred Cch-hHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCCCHHHHHHHHHhhhhcCCCceEEcchHHHHHHHHHH Confidence 443 2334444444444444344444444443333454333334433211 1233467889999999999998 Q ss_pred HHHhhcCCccEEEeecCChh-HHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeeeeEee Q lcl|NC_015158. 83 YISALFPNERWLKWEGKSLQ-DEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESGATRD 161 (581) Q Consensus 83 l~~~~f~~~~~~~~~~~~~~-d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~~~~~ 161 (581) .. .|+--++|.|...+ |.+.|+++..++.+-..+|++..+.++.|.|.++.|.|.+.+-.... ....... T Consensus 80 ~~----~nr~d~~v~P~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~d~i~~G~Gw~~~~~d~~-----~e~d~~~ 150 (708) T protein:vir:10 80 YR----NNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLV-----NEYDPMD 150 (708) T ss_pred HH----hCCcceEEEcCCCCchHHHHHHHHHHHHHHHHhcCchHHHHHHHHhhhhcccceeeeeeccc-----cccCCCC Confidence 88 58888999999755 67889999999999999999999999999999999999776533110 0000001 Q ss_pred eeccceE-Eecchh-heeecCCCC--CcccCceEE-EEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchhh Q lcl|NC_015158. 162 TYFGPRA-VRIDPK-DIVFNPVAV--DFAHSPKII-RTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDC 236 (581) Q Consensus 162 ~~~~p~i-e~V~p~-df~~DP~a~--~~~d~~~i~-r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~ 236 (581) ...+..+ .+.+|+ ++||||.|+ ++.||.||. +.++++++++++........+ + . ...+.+ T Consensus 151 ~~~~i~i~~~~~p~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~p~~a~~~~---d---~----~~~~~~----- 215 (708) T protein:vir:10 151 DRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSL---D---V----TSMTSW----- 215 (708) T ss_pred CccccceEEeecchhhcccCccccccChhhhhhhhhccCCCHHHHHHhCCCCccccc---c---c----ccCCCc----- Confidence 1112233 344564 799999986 789998865 667799999888432211000 0 0 000000 Q ss_pred hhccccccccccccccccCCceEEEEEEeee---------eecccCCceee---------------------------e- Q lcl|NC_015158. 237 EKAVGFSMDGFGNLYDYFQSPYVEVLTFYGD---------YHDTQSGTFKR---------------------------N- 279 (581) Q Consensus 237 ~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~---------~~d~~~d~~~e---------------------------~- 279 (581) ..++.+.+.+++.|||.. +.+..+|.+.+ - T Consensus 216 -------------~~~~~~~d~v~v~ey~~r~~~~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~ 282 (708) T protein:vir:10 216 -------------EYNWFGADVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRR 282 (708) T ss_pred -------------cccccCCCceEEEEeeeEEEEEEEEEEEecCCCCceeeecchhhhhHHHHHHhcccchhheeeeeeE Confidence 011122233444444321 01111111110 0 Q ss_pred -eEEEEEeCCEEEEeecCCCccCCCCeeEe-ccc-ccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEec Q lcl|NC_015158. 280 -MKVTIIDRMFVIEEKENPSWFAQAPIFHC-GWR-IRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKG 356 (581) Q Consensus 280 -~~itv~~g~~iir~~~nP~~~g~~Pf~~~-~~~-~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~ 356 (581) +.+.++.|..++ ...+|+|++.+||+-+ ++. .+.+...+.|+.+.++|+|+.+|..++.+.++++++...++++.. T Consensus 283 ~v~~~~~~g~~~l-e~~~~~p~~~fP~vP~~g~r~~~d~~~~~yG~vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~ 361 (708) T protein:vir:10 283 RVYVSVVDGDGFL-EKPRRIPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGM 361 (708) T ss_pred EEEEEeecchhhh-ccCCCCCCCceeeEEEeeeeeccCCCcccceeecccchhHHHHHHHHHHHHHHHHhcCCcccccCh Confidence 112234566666 5568899998887733 222 345677778999999999999999999999999999887765532 Q ss_pred c-----------c----------cccccCCceeEEeCCCCCcccccCCCccchhHHHHHHHHHHHHHhcCCchHhcCCCC Q lcl|NC_015158. 357 D-----------V----------EEFVWGPMEQIYINGDGDVEMMAPNTQALQADMQIQILEAKMEEFAGAPREAMGIRT 415 (581) Q Consensus 357 d-----------~----------~~i~~~pG~vi~~~~~~~i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~ 415 (581) + . ..+...+|.++. .+..+..+++|..+.....+++.....++++||+...++|..+ T Consensus 362 ~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~~~--~~~~~~~~q~~~~~~~~~~l~q~~~~~i~~vsG~~~~~lG~~s 439 (708) T protein:vir:10 362 EQIRGLEKHWEARNKKRPAFLPLREVRDKSGNIIA--GATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPS 439 (708) T ss_pred hhhhhHHHHHhhccccchhhhcccccccccccccc--ccCCccccCCccchHHHHHHHHHHHHHHHHHhCcChhHccCcc Confidence 1 0 001222344432 2223344566666777888899999999999999999999532 Q ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceeeecCchhcccCCCccCH---------- Q lcl|NC_015158. 416 PGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADTIRVFDSDDKVATFMNVNK---------- 485 (581) Q Consensus 416 ~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~iR~~~~~~~~~~~~~v~r---------- 485 (581) |.++..+++++.+++..+..+.+++... ++.+.+.++.++.++++.+.++||+|++ |..-++.++. T Consensus 440 --n~SG~aI~~rq~qg~~~l~~~~Dnl~~~-~~~~g~~lL~li~~~y~~er~~RI~~ed-g~~~~v~in~~~~d~~~g~~ 515 (708) T protein:vir:10 440 --NIAQETVNNLMNRADMASFIYLDNMAKS-LKRAGEVWLSMAREVYGSEREVRIVNED-GSDDIAVLSAQVVDRQTGAV 515 (708) T ss_pred --chHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHcCCCcEEEEecCC-CCcceEEecceeccCCCcce Confidence 4455568899999999999999999953 5666677777777778999999999986 3333333321 Q ss_pred ---HHh-cCCceEEEecchhHH-HHHHHHHHHHHhhcccccccccchhHHHHHHHHHHHHhcCCCcccc----------- Q lcl|NC_015158. 486 ---DDI-TAKGRLRPVGARHFA-EQAQVVQSLMGIANTPVWQDIKPHVSTENLAKMLEHNLSLGGWDIF----------- 549 (581) Q Consensus 486 ---~di-~~~~~vva~ga~~~~-~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~~~e~~~l~~~~~~----------- 549 (581) .|| .|.++|++.-+.+.. .|++.++.|.++++... ...|... .+.-++.++.++++-+-+ T Consensus 516 ~~~nDi~~g~yDv~i~~~p~~~s~r~~~~~~l~qll~~~~--p~~~~~~--~~~~~~l~~~D~p~~~ei~erir~~~~~~ 591 (708) T protein:vir:10 516 VALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSML--PTDPMRP--AIQGIILDNIDGEGLDDFKEYNRNQLLIS 591 (708) T ss_pred eeeeccceeeEEEEEecccCchhHHHHHHHHHHHHHHhcC--CCchhhH--HHHHHHHHhcCCcChHHHHHHHHHhhccc Confidence 233 456777666544444 46666677766654310 0111111 122333344444432111 Q ss_pred --cCCCCcHHHHHHHHHHH----HHHHHHHHHhcccCC Q lcl|NC_015158. 550 --KPNVAVMEAQTTSALVN----QSQAQIEEEAQVPLV 581 (581) Q Consensus 550 --~~~~~~~~~~~~q~~~q----~aq~~~~~~~~~~~~ 581 (581) .......++++.+.+++ ++++.. .+.|.-+. T Consensus 592 ~~~~~~~~ee~q~~~~~q~~~q~q~~~~~-~e~qa~~~ 628 (708) T protein:vir:10 592 GIAKPRNEKEQQIVQQAQMAAQSQPNPEM-VLAQAQMV 628 (708) T ss_pred ccccccchhhHHHHHHHHHHHHHHHHHHH-HHHHHHHH Confidence 01111111111111111 111111 11111111 No 40 >protein:vir:78942 Length: 510 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522822;genbank:gi:158345057;genbank:GeneID:5687432 Probab=100.00 E-value=1.9e-39 Score=232.92 Aligned_cols=469 Identities=12% Similarity=0.106 Sum_probs=305.9 Q ss_pred HHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhcccccccccccccccccccccchHHHHHHHHHHHHHhhcC-CccEEEee Q lcl|NC_015158. 19 LAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTTTTTNSTLPWKNKTTLPKLCQIRDNLHSNYISALFP-NERWLKWE 97 (581) Q Consensus 19 ~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~~~~~~~~~~k~~~~~pki~~~~d~~~~~l~~~~f~-~~~~~~~~ 97 (581) +=+.++++|+.++ |++|+..|.+|.+|..|+..+..+........+.+-+--...+.+|.+.||..+|| +..||++. T Consensus 1 mk~~~~~~~~~lk--r~~~e~~w~e~a~~tlP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~ 78 (510) T protein:vir:78 1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSE 78 (510) T ss_pred ChhHHHHHHHHHh--ccchHHHHHHHHHhhccccccCCCCcccccccCcccchHHHHHHHHHHHHHHhhcCCCCcccccC Confidence 5567788898775 99999999999999999765434433322223344555568889999999999999 78999998 Q ss_pred cCChhH-------------HHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeeeeEeeeec Q lcl|NC_015158. 98 GKSLQD-------------EAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESGATRDTYF 164 (581) Q Consensus 98 ~~~~~d-------------~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~~~~~~~~ 164 (581) ...... ++.-+..++.+...|..|||+..+++.+.|++.+|+|++-++ +. T Consensus 79 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~---~~-------------- 141 (510) T protein:vir:78 79 LTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRN---SD-------------- 141 (510) T ss_pred CChHHhhhcccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEEe---CC-------------- Confidence 765432 112345677888999999999999999999999999876443 10 Q ss_pred cceEEecchhheeecCCCCC-cccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchhhhhccccc Q lcl|NC_015158. 165 GPRAVRIDPKDIVFNPVAVD-FAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDCEKAVGFS 243 (581) Q Consensus 165 ~p~ie~V~p~df~~DP~a~~-~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~ 243 (581) +-++..+++-++++..++.. +.. .+.|..+|..+|-+... .+.. +... ++ T Consensus 142 ~~~~~~~pl~~y~v~~d~~G~vd~--i~rr~~~t~~~l~~~~~---------~~~~------~~~~--~~---------- 192 (510) T protein:vir:78 142 EATVVAWSLRSYAVRRDATGRWMD--IVLKQRYKSKDLDDVYK---------QDLM------RAGR--NL---------- 192 (510) T ss_pred CCeEEEEEcceeEEeeCCCcCeeE--EEeeeeccHHHHHHHhh---------HHhh------hhhh--cc---------- Confidence 11356677888999887653 333 34566778877754211 0000 0000 00 Q ss_pred cccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEE-EeCCEEEEeecCCCccCCCCeeEecccccCCcccCCCc Q lcl|NC_015158. 244 MDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTI-IDRMFVIEEKENPSWFAQAPIFHCGWRIRQDNLYAMGP 322 (581) Q Consensus 244 ~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv-~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p~~~~G~s~ 322 (581) -....++++++.. -..++.. -.+.+.+ ++|.++.+. .-|++...||.+..|...++..||.|+ T Consensus 193 ----------~~~~~v~v~~~V~---~~~~~~~-~~~sv~~e~dg~~i~~~--~~~~~~e~P~~~~Rw~~~~ge~YGrgp 256 (510) T protein:vir:78 193 ----------SGSGSVDLYTHVQ---RRKGTAM-DYAEMYHEIDGVRVGET--GRWPIHLCPYIVPTWNLAPGEHYGRGH 256 (510) T ss_pred ----------CCCceEEEEEEEE---eecCCCC-cEEEEEEEecCeeeccc--cccccccCCeeeeeeeecCCCccccch Confidence 0012345555442 2222211 1223333 467766544 445667899999999999999999999 Q ss_pred HHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc----ccccccCCceeEEeCCCCCcccccCC--CccchhHHHHHHH Q lcl|NC_015158. 323 LDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD----VEEFVWGPMEQIYINGDGDVEMMAPN--TQALQADMQIQIL 396 (581) Q Consensus 323 ~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d----~~~i~~~pG~vi~~~~~~~i~~~~~p--~~~~~~~~~lq~~ 396 (581) ++...+..+.+|.+.+..+.+..+++.|.+.|+.+ ++.+...+.+.|.-++.+++.+++.. ..+..+...++.+ T Consensus 257 ~~~~l~D~k~L~~l~~~~l~~a~~a~~~~~lv~p~g~~~~~~l~~~~~g~~v~g~~~~v~~~~~~~~~d~~~~~~~i~~~ 336 (510) T protein:vir:78 257 VEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAV 336 (510) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccCCccccchhhhccCCCceeecCCcccccccccCcccchHHHHHHHHHH Confidence 99999999999999999999999999999988653 44444444344445566778876633 3345566777888 Q ss_pred HHHHHHhcCCchHhc-CCCCcc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceeeecCch Q lcl|NC_015158. 397 EAKMEEFAGAPREAM-GIRTPG-EKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADTIRVFDSD 474 (581) Q Consensus 397 ~~~~ee~TGv~~~~~-G~~~~~-~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~iR~~~~~ 474 (581) .+.+.+. |.. ....++ ..|||+|....+.....+.-+.-++..+++.||+...+..+.+.+ T Consensus 337 ~~rI~~a-----F~~~l~~~~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g------------ 399 (510) T protein:vir:78 337 VVRLNQA-----FMYGANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDAL------------ 399 (510) T ss_pred HHHHHHH-----HhhccccCCCCCcCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcc------------ Confidence 7777764 222 222233 579999999999999999999999999999999998877765432 Q ss_pred hcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcc----cccccccchhHHHHHHHHHHHHhcCCCccccc Q lcl|NC_015158. 475 DKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANT----PVWQDIKPHVSTENLAKMLEHNLSLGGWDIFK 550 (581) Q Consensus 475 ~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~----~~~~~i~p~~~~~~l~~~~~e~~~l~~~~~~~ 550 (581) .+.+.+++.+.. ++.+- +-+.|+|+++.+.++.+. ...+++.|.++..++++.+++..|++--.+++ T Consensus 400 -----l~p~p~~~~~~~---~v~~i-s~Laraq~~~~l~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gv~p~~ivr 470 (510) T protein:vir:78 400 -----LQGLITKQHKPA---IETGL-PALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYK 470 (510) T ss_pred -----CCCCCcccccce---eeecc-cHHHHHHHHHHHHHHHHHHHHhcChhhhhhcCCHHHHHHHHHHHhCCChhhhcC Confidence 222333443322 22222 335566665555444322 12356789999999999999999985445676 Q ss_pred CCCCcHHHHHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 551 PNVAVMEAQTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 551 ~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) +.-++.+.. + +|+.|+..++.+|.-|+ T Consensus 471 s~eev~a~~-~---~~~~q~~~~~~~~~a~~ 497 (510) T protein:vir:78 471 SADELQAEA-E---EQRRQAAQAQAAQETLL 497 (510) T ss_pred CHHHHHHHH-H---HHHHHHHHHHHHHHHHH Confidence 633332211 1 11222222223333333 No 41 >protein:vir:6322 Length: 510 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877469;genbank:gi:33300841;uniprot:Q7Y2D5;genbank:GeneID:1482611 Probab=100.00 E-value=4.2e-39 Score=230.97 Aligned_cols=469 Identities=12% Similarity=0.090 Sum_probs=303.9 Q ss_pred HHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhcccccccccccccccccccccchHHHHHHHHHHHHHhhcC-CccEEEee Q lcl|NC_015158. 19 LAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTTTTTNSTLPWKNKTTLPKLCQIRDNLHSNYISALFP-NERWLKWE 97 (581) Q Consensus 19 ~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~~~~~~~~~~k~~~~~pki~~~~d~~~~~l~~~~f~-~~~~~~~~ 97 (581) +=+.++++|+..+ |++|+..|.+|.+|..|+..+..+....-...+++-+--...+.+|.+.||..+|| +..||++. T Consensus 1 mk~~~~~~~~~lk--R~~~e~~w~e~a~~tlP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~ 78 (510) T protein:vir:63 1 MKTTAAMLWEKLR--DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSE 78 (510) T ss_pred ChhHHHHHHHHHh--ccchHHHHHHHHHhhccccCCCCCCccccccCCCccchHHHHHHHHHHHHHhhhcCCCCcccccC Confidence 5577888999775 99999999999999999765444433221122345555568889999999999999 78999998 Q ss_pred cCChhH-------------HHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeeeeEeeeec Q lcl|NC_015158. 98 GKSLQD-------------EAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESGATRDTYF 164 (581) Q Consensus 98 ~~~~~d-------------~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~~~~~~~~ 164 (581) ...... ++.-+.+++.+...|..|||+..+++.+.|++.+|||++-++ + + T Consensus 79 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Li~~G~a~l~~~---~------~-------- 141 (510) T protein:vir:63 79 LTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRD---S------D-------- 141 (510) T ss_pred CChHHhhcccccchhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEEc---C------C-------- Confidence 765322 122345678888999999999999999999999999877654 1 1 Q ss_pred cceEEecchhheeecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchhhhhcccccc Q lcl|NC_015158. 165 GPRAVRIDPKDIVFNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDCEKAVGFSM 244 (581) Q Consensus 165 ~p~ie~V~p~df~~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~ 244 (581) +-++..+++-++++..++..-=++ .+.|..+|.++|-+-. ..+ .. +.. .++ T Consensus 142 ~~~~~~~pl~~y~v~~d~~G~vd~-i~rr~~~t~~~l~e~~---------~~~----~~--~~~--~~~----------- 192 (510) T protein:vir:63 142 AATVVAWSLRSYAVRRDATGRWMD-IVLKQRYKSKDLDEEY---------KQD----LM--RAG--RNL----------- 192 (510) T ss_pred CcEEEEEEcceeEEeeCCCcCeeE-EEeeeeccHHHHhHHh---------hhh----hh--ccc--ccc----------- Confidence 113566788889998876643232 3456677877663210 000 00 000 000 Q ss_pred ccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEE-EeCCEEEEeecCCCccCCCCeeEecccccCCcccCCCcH Q lcl|NC_015158. 245 DGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTI-IDRMFVIEEKENPSWFAQAPIFHCGWRIRQDNLYAMGPL 323 (581) Q Consensus 245 ~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv-~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p~~~~G~s~~ 323 (581) + ....++++.+- +-..++.. ..+.+.+ ++|.++.+.. -|++...||.+..|...++..||.|++ T Consensus 193 ~---------~~~~v~v~~~V---~~~~~~~~-~~~sv~~e~dg~~~~~~~--~~~~~e~P~~~~Rw~~~~ge~YGrgp~ 257 (510) T protein:vir:63 193 S---------GSGSVDLYTHV---QRKKGTAM-EYAELYHEIDGVRVGKEG--RWPIHLCPYIVPTWNLAPGEHYGRGHV 257 (510) T ss_pred C---------CCcceEEEEEE---EeecCCCc-eEEEEEEEecCceecccc--ccccccCceeeeeeeecCCCccccchH Confidence 0 00124444432 11122221 1233333 4677665444 455678999999999999999999999 Q ss_pred HhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc----cccccc-CCceeEEeCCCCCcccccCC--CccchhHHHHHHH Q lcl|NC_015158. 324 DNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD----VEEFVW-GPMEQIYINGDGDVEMMAPN--TQALQADMQIQIL 396 (581) Q Consensus 324 ~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d----~~~i~~-~pG~vi~~~~~~~i~~~~~p--~~~~~~~~~lq~~ 396 (581) +...+..+.+|.+.+..+.+..++++|.+.|+.+ ++.+.. .+|.++ -++++++.+++.. .....+...++.+ T Consensus 258 ~~~l~D~k~L~~l~~~~l~~a~~a~~~~~lv~p~g~~~~~~~~~~~~g~~v-~g~~~~v~~~~~~~~~d~~~~~~~i~~~ 336 (510) T protein:vir:63 258 EDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYV-PGGAEAVRAYERGDYNKMAAIQQSLQAV 336 (510) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCcccCcccccchhhhccCCCceee-cCCcccceeeecCcccchHHHHHHHHHH Confidence 9999999999999999999999999999988653 333333 345553 4456677776633 3345566777777 Q ss_pred HHHHHHhcCCchHhcCCCCcc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceeeecCchh Q lcl|NC_015158. 397 EAKMEEFAGAPREAMGIRTPG-EKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADTIRVFDSDD 475 (581) Q Consensus 397 ~~~~ee~TGv~~~~~G~~~~~-~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~iR~~~~~~ 475 (581) .+.+.+.- . .+....++ ..|||+|....+.....+--+.-++..+++.||+...+..+.+.+ T Consensus 337 ~~rI~~af---~-~~l~~~~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g------------- 399 (510) T protein:vir:63 337 VVRLNQAF---M-YGANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDAL------------- 399 (510) T ss_pred HHHHHHHH---H-hhcccCCCCCcCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcc------------- Confidence 77776642 1 12222233 579999999999999999999999999999999998877765432 Q ss_pred cccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcc----cccccccchhHHHHHHHHHHHHhcCCCcccccC Q lcl|NC_015158. 476 KVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANT----PVWQDIKPHVSTENLAKMLEHNLSLGGWDIFKP 551 (581) Q Consensus 476 ~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~----~~~~~i~p~~~~~~l~~~~~e~~~l~~~~~~~~ 551 (581) .+.+++++++.. +..+ .+.+.|+|+++.+.++.+. ...+++.|.++..++++.+++..|++--.++++ T Consensus 400 ----l~p~p~~~~~~~---~v~~-is~Laraq~~~~l~~~~q~l~~~~~~aq~~~~id~d~~~~~~a~~~Gv~p~~ivrs 471 (510) T protein:vir:63 400 ----LQGLITKQHKPA---IETG-LPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKS 471 (510) T ss_pred ----CCCCCchhcccc---eecc-hhHHHHHHHHHHHHHHHHHHHHhcCchhhhccCCHHHHHHHHHHHhCCChhHhcCC Confidence 233344555432 1222 2345566665554443322 123567899999999999999999854356666 Q ss_pred CCCcHHHHHHHHHHHHHH-HHHHHHhcccCC Q lcl|NC_015158. 552 NVAVMEAQTTSALVNQSQ-AQIEEEAQVPLV 581 (581) Q Consensus 552 ~~~~~~~~~~q~~~q~aq-~~~~~~~~~~~~ 581 (581) .-++ +|..+|+.| +..+++++..|+ T Consensus 472 ~eev-----~a~~~~~~qq~~~~~~~~~~~~ 497 (510) T protein:vir:63 472 ADEL-----QAEAEQQRQQAAQAQAAQETLL 497 (510) T ss_pred HHHH-----HHHHHHHHHHHHHHHHHHHHHH Confidence 3332 222222222 222222222333 No 42 >protein:vir:103330 Length: 517 # NCBI annotation: head portal-like protein # Family: family:all:481 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039666;genbank:gi:125999995;genbank:GeneID:4818406 Probab=100.00 E-value=9.5e-38 Score=223.55 Aligned_cols=477 Identities=12% Similarity=0.100 Sum_probs=305.8 Q ss_pred ccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhcccccccccccccccccccccchHHHHHHHHHHHHHhhcC-C Q lcl|NC_015158. 12 LDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTTTTTNSTLPWKNKTTLPKLCQIRDNLHSNYISALFP-N 90 (581) Q Consensus 12 ~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~~~~~~~~~~k~~~~~pki~~~~d~~~~~l~~~~f~-~ 90 (581) +|=+-..=-+.|+++|+..+..|++|+..|.+|.+|+.|+..+..++... ..+++-+--...+.+|.+.||..+|| + T Consensus 1 ~~~~~~~e~~~l~~r~~~Lk~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~--~~~~~dstg~~a~~~LAa~l~~~ltpp~ 78 (517) T protein:vir:10 1 MDMRFAGNKSKIPKLYEQLVGKRSPFLSRAENYSRFTLPYLMADVNDDLS--SQNAWQDDGASATNFLSNKLSQVLFPAQ 78 (517) T ss_pred CcccccccHHHHHHHHHHHHHhhhHHHHHHHHHHHHhccccccCCCCCcc--ccccccchHHHHHHHHHHHHHHhhcCCC Confidence 22000111158899999999999999999999999999986443332221 12355555578889999999999999 7 Q ss_pred ccEEEeecCChhHH-------------HHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeee Q lcl|NC_015158. 91 ERWLKWEGKSLQDE-------------AKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESG 157 (581) Q Consensus 91 ~~~~~~~~~~~~d~-------------~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~ 157 (581) ..||++.+...+.+ +.-+..++.+...|..|||+..+++.+.|++.+|||++-.+ +. T Consensus 79 ~~WF~l~~~~~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~--~~-------- 148 (517) T protein:vir:10 79 RSFFRIDLTPEGIKQLDNEAMTQSTAQKLLSDVEKAAMLYGESLQFRPAVVEAFKHLIVTGNVMMYHP--DK-------- 148 (517) T ss_pred CccccccCCHHHHHhhccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEEEe--CC-------- Confidence 89999987664322 22355678888999999999999999999999999986542 11 Q ss_pred eEeeeeccceEEecchhheeecCCCC-CcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchhh Q lcl|NC_015158. 158 ATRDTYFGPRAVRIDPKDIVFNPVAV-DFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDC 236 (581) Q Consensus 158 ~~~~~~~~p~ie~V~p~df~~DP~a~-~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~ 236 (581) ...+..+++-++++..++. .+.+ .+.|..+|..+|-+..+... ...+....+ T Consensus 149 -------~~~~~~~pl~~y~v~~d~~G~v~~--ivrr~~~~~~~l~~~~~~~~-------------~~~~~~~~~----- 201 (517) T protein:vir:10 149 -------TSPIQAVPLHHYCVRRDNNGTVLD--IVFLQEKALETFEPSIRMAI-------------QASRKGKQY----- 201 (517) T ss_pred -------CCcEEEEEcCeEEEeeCCCcCeEE--EEeeeeccHHHHHHHhhhhc-------------chhhhhhcc----- Confidence 1235667788899988875 3444 35566778877754321100 000000000 Q ss_pred hhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccccCCc Q lcl|NC_015158. 237 EKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIRQDN 316 (581) Q Consensus 237 ~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p~~ 316 (581) .....++++++ ++-..+|.+. +..-.+|..+.+ +.-||++..||.+..|...++. T Consensus 202 -----------------~~~~~v~v~~~---v~~~~~~~~~---~~~~~d~~~~~~--~s~y~~~e~P~~~~Rw~~~~ge 256 (517) T protein:vir:10 202 -----------------KDKDNVKLYTH---AKRTKDGKYL---IRQSADDVPVGK--ESTVTEDKSPFLILTWKRSYGE 256 (517) T ss_pred -----------------CCcCceEEEEE---EEEeCCCceE---EEEEeCceeecc--ccccccccCCeeeeeeeecCCC Confidence 00123444432 2222334321 222235555533 3456678899999999999999 Q ss_pred ccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc----ccccc-cCCceeEEeCCCCCcccccCC--Cccchh Q lcl|NC_015158. 317 LYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD----VEEFV-WGPMEQIYINGDGDVEMMAPN--TQALQA 389 (581) Q Consensus 317 ~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d----~~~i~-~~pG~vi~~~~~~~i~~~~~p--~~~~~~ 389 (581) .||.|+++..++..+.+|.+.+.++.+..+++.|++.+..+ +.++. ..+|.+ .-.+.+++.+++.. ..+..+ T Consensus 257 ~YGrgp~~~~L~D~k~L~~l~~~~~~~~~~a~~~~~lv~~~~~~~~~~l~~~~~g~~-~~g~~~~v~~~~~~~~~d~~~~ 335 (517) T protein:vir:10 257 DYGRGMAEDHAGAFFVIQFLSEALARGMALMADVKYLVKPGSYTDINQFVEGGSGAV-LHGVEGDIHIVQLGKYADYTPI 335 (517) T ss_pred CcccchHHHhHHHHHHHHHHHHHHHHHHHHhccCCcccCcccccchhhccCCCcccc-ccCCcccceeeecccccchhHH Confidence 99999999999999999999999999999999999988643 33332 223333 33455677776532 234556 Q ss_pred HHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceee Q lcl|NC_015158. 390 DMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADTIR 469 (581) Q Consensus 390 ~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~iR 469 (581) ...++.+.+.+.+..=+. ..+.......|||+|....+.....+--+.-++..+++.||+...+..+...+ T Consensus 336 ~~~i~~~~~rI~~af~~~--~l~~~~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~~l~~~l------- 406 (517) T protein:vir:10 336 QAVLNDYRQRIGRVFMME--AMTRRDAERVTAYEIQRDAMLVEQSLGGVYSLFATTFQGPLARWFMNGISSIL------- 406 (517) T ss_pred HHHHHHHHHHHHHHHhhh--hhhccCCccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHhhhhc------- Confidence 677888887777654111 12333334689999999999888889999999999999999987765432211 Q ss_pred ecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhccc-----ccccccchhHHHHHHHHHHHHhcCC Q lcl|NC_015158. 470 VFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTP-----VWQDIKPHVSTENLAKMLEHNLSLG 544 (581) Q Consensus 470 ~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~-----~~~~i~p~~~~~~l~~~~~e~~~l~ 544 (581) .. ++++.+ + .+| .+-++|.+.++.|.++++.. ..+.+...++..++++++++.+|++ T Consensus 407 -~~-------------~~v~~~--~-~s~-la~l~r~~~~~~i~~~~~~i~~~a~~~~~~~~~id~d~~~~~~a~~~Gvp 468 (517) T protein:vir:10 407 -TS-------------KNVSPT--I-LTG-IEALGRMAELDKLGTFNGYVSMTAQWPEPLQQAIKWPDFTDWVQGQISAN 468 (517) T ss_pred -CC-------------CCccce--e-ecc-HHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHHhCCC Confidence 11 112222 1 122 23456666665555554332 1223444678889999999999999 Q ss_pred CcccccCCCCcHHHHHHHHHHHHHHHHHHHH-------hcccCC Q lcl|NC_015158. 545 GWDIFKPNVAVMEAQTTSALVNQSQAQIEEE-------AQVPLV 581 (581) Q Consensus 545 ~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~-------~~~~~~ 581 (581) . .++++..++.+.++++.++|++++..++. ++-|.. T Consensus 469 ~-~~irs~~ev~~~~~~~~~~~~~~~~~~~ag~~~~~~~~~~~~ 511 (517) T protein:vir:10 469 F-PFFKTQDELNAEAQAQQEQEATKYAAEQAGKAIPDMVKNGQI 511 (517) T ss_pred h-hhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCC Confidence 6 78888777755544444333332222111 111111 No 43 >protein:vir:96988 Length: 516 # NCBI annotation: 29 # Family: family:all:481 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654130;genbank:gi:108862014;genbank:GeneID:5075937 Probab=100.00 E-value=2.2e-38 Score=227.08 Aligned_cols=480 Identities=13% Similarity=0.099 Sum_probs=303.9 Q ss_pred CccchhhhhhhccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhcccccccccccccccccccccchHHHHHHHH Q lcl|NC_015158. 1 MTGKVLELQQMLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTTTTTNSTLPWKNKTTLPKLCQIRDNLH 80 (581) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~~~~~~~~~~k~~~~~pki~~~~d~~~ 80 (581) |+ ++|. .+.+.-.+.|+++|+..++.|++|+..|.+|.+|+.|+.-...++.. ...+++-+--...+.+|. T Consensus 1 ~~------~~~~-~~~~~~~~~l~~r~~~L~~~R~~~e~~w~e~a~~~lP~~~~~~~~~~--~~~~~~dstg~~a~~~LA 71 (516) T protein:vir:96 1 MK------QSID-LEYGGKRSKIPKLWEKFSNKRSSFLDRAKHYSKLTLPYLMNDKGDNE--TSQNGWQGVGAQATNHLA 71 (516) T ss_pred Cc------chhh-hhhhhhHHHHHHHHHHHHHHhhHHHHHHHHHHHhhcccccCCCCCcc--ccCCcccchHHHHHHHHH Confidence 33 2333 35677779999999999999999999999999999998533222211 112345555568889999 Q ss_pred HHHHHhhcC-CccEEEeecCChhH-------------HHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEee Q lcl|NC_015158. 81 SNYISALFP-NERWLKWEGKSLQD-------------EAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEY 146 (581) Q Consensus 81 ~~l~~~~f~-~~~~~~~~~~~~~d-------------~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~ 146 (581) +.||+.+|| +..||++.+.+... ++--+..++.+...|..|||+..+++.+.+++.+|||++-.+ T Consensus 72 a~l~~~ltpp~~~WF~L~~~~~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~d- 150 (516) T protein:vir:96 72 NKLAQVLFPAQRSFFRVDLTAQGEKVLNQRGLKKTELATIFAQVETRAMKELEQRQFRPAVVEAFKHLIVAGSCMLYKP- 150 (516) T ss_pred HHHHhhhcCCCCcccccccChhHHhhccccCchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEeEEec- Confidence 999999999 78999998765422 111344677888999999999999999999999999987543 Q ss_pred ecceeeeeeeeeEeeeeccceEEecchhheeecCCCC-CcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhh Q lcl|NC_015158. 147 VKETTKDEESGATRDTYFGPRAVRIDPKDIVFNPVAV-DFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFR 225 (581) Q Consensus 147 ~~~~~~~~~~~~~~~~~~~p~ie~V~p~df~~DP~a~-~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~ 225 (581) +. ..+..+++-++++..++. ++.+ .+.|..+|..+|-+.... . .... T Consensus 151 --~~---------------~~~~~~pl~~y~v~~d~~G~v~~--i~rr~~~~~~~l~~~~~~------~-------~~~~ 198 (516) T protein:vir:96 151 --SK---------------GAISAIPMHHYVVNRDTNGDLLD--IILLQEKALRTFDPATRA------V-------VEVG 198 (516) T ss_pred --CC---------------CCEEEEEcCeEEEeeCCCCCeee--ehhhhHhhHHHHHHhhhh------h-------hhhh Confidence 11 124566778888887766 3443 123444455554322100 0 0000 Q ss_pred ccCCcccchhhhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCe Q lcl|NC_015158. 226 RGLGTYTREDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPI 305 (581) Q Consensus 226 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf 305 (581) + ....+ .. ...++++.+- +- ..++.. .+..-.+|.++++....| +...|| T Consensus 199 ~-------~~~~~------~~---------~~~v~v~~~v---~~-~~~~~~--~~~~~~d~~~~~~es~~~--~~e~P~ 248 (516) T protein:vir:96 199 L-------KGKKC------KE---------DDSVKLYTHA---KY-LGDGFW--ELKQSADDIPVGKVSKIK--SEKLPF 248 (516) T ss_pred h-------hhhhc------CC---------CCceEEEEee---ee-eCCcee--EEEEEeCceeeccccccc--cccCCe Confidence 0 00000 00 0123333221 11 112221 112224566776655544 457899 Q ss_pred eEecccccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc----ccccccCCceeEEeCCCCCccccc Q lcl|NC_015158. 306 FHCGWRIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD----VEEFVWGPMEQIYINGDGDVEMMA 381 (581) Q Consensus 306 ~~~~~~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d----~~~i~~~pG~vi~~~~~~~i~~~~ 381 (581) .+..|...++..||.|+++..++..+.+|.+.+.++.+..++++|.+.+..+ +..+...+.+.|..++++++.+++ T Consensus 249 ~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~l~~~~~g~i~~g~~~~v~~~q 328 (516) T protein:vir:96 249 IPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGAQTDVDHFVNSGTGEVVTGVEEDIHIVQ 328 (516) T ss_pred eeeeeeecCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCccccCcccccchhhhccCCCceeecCCcccceeee Confidence 9999999999999999999999999999999999999999999999988643 444444443445566677788776 Q ss_pred CCC--ccchhHHHHHHHHHHHHHhcCCchHhcC-CCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015158. 382 PNT--QALQADMQIQILEAKMEEFAGAPREAMG-IRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEIS 458 (581) Q Consensus 382 ~p~--~~~~~~~~lq~~~~~~ee~TGv~~~~~G-~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~ 458 (581) ..+ ....+...++.+.+.+.+.. -.... .......|||+|....+.....+.-+.-++..+++.||+..++... T Consensus 329 ~~~~~d~~~~~~~i~~~~~rI~~af---~~~~l~~r~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r~l~~~ 405 (516) T protein:vir:96 329 LGKYADLTPISAVLEVYTRRIGVVF---MMETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFATTMQSPVAMWGLLEA 405 (516) T ss_pred cCcccchhHHHHHHHHHHHHHHHHH---hhhhhccCCCccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhc Confidence 443 34455666777777766632 11112 2233458999999999888888899999999999999998764221 Q ss_pred HhhcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhccc-----ccccccchhHHHHH Q lcl|NC_015158. 459 RRNLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTP-----VWQDIKPHVSTENL 533 (581) Q Consensus 459 ~~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~-----~~~~i~p~~~~~~l 533 (581) ++ .++..++ ++......+-+.|.+.++.|.++++.. ..+.+...++..++ T Consensus 406 -------------~p--------~lp~~~v----~~~~vs~l~~l~r~~~~~~i~~~~~~i~~~~~~~p~v~d~id~d~~ 460 (516) T protein:vir:96 406 -------------GE--------SFTSDLV----DPVIITGIEALGRMAELDKLANFAQYMSLPLQWPEPVLAAVKWPDY 460 (516) T ss_pred -------------CC--------CCccccc----cceeechHHHHHHHHHHHHHHHHHHHHHHHhcCChhHHhcCCHHHH Confidence 11 1111222 332333345566776665555554432 23345566888899 Q ss_pred HHHHHHHhcCCCcccccCCCCcHHHHHHHHHHHHHHHHHHHH-hcccCC Q lcl|NC_015158. 534 AKMLEHNLSLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEE-AQVPLV 581 (581) Q Consensus 534 ~~~~~e~~~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~-~~~~~~ 581 (581) ++.+++.+|++. .++++.-++.+..++|..+|+.++..+.- +-++-+ T Consensus 461 ~~~~a~~~Gvp~-~~irs~eev~~~~~~~~~~q~~~~~a~~~~~~~~~~ 508 (516) T protein:vir:96 461 MDWVRGQISAEL-PFLKSAEEMAQEQEAQMQAQQAQMLEEGVAKAVPGV 508 (516) T ss_pred HHHHHHHhCCCc-cccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHH Confidence 999999999996 67777555533333333222222211111 111111 No 44 >protein:vir:3520 Length: 720 # NCBI annotation: P19 # Family: family:all:487 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050980;genbank:gi:9633566;genbank:GeneID:1262313 Probab=100.00 E-value=3.3e-37 Score=220.61 Aligned_cols=518 Identities=14% Similarity=0.099 Sum_probs=297.0 Q ss_pred hccchhhhHHHHHHHHHhhHHhhhhhHHHHHHH-HHHHhhcc--ccccccc--------ccccccccccccchHHHHHHH Q lcl|NC_015158. 11 MLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSE-LRNYIFAT--DTTTTTN--------STLPWKNKTTLPKLCQIRDNL 79 (581) Q Consensus 11 ~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~-~~~y~~~~--~~~~~~~--------~~~~~k~~~~~pki~~~~d~~ 79 (581) |.| .+...+...+.+|+.+.....+-+.+ +.++-|++ ...+... .+..++-.++.++|...++.+ T Consensus 1 ma~----~~~~~l~~~~~~~~~~~~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~~~~~l~~~~~P~~~~N~i~~~v~~v 76 (720) T protein:vir:35 1 MAE----TLQKRHEQIMRKFDRAHSPQEAVREKCLEATRFARVPGGQWEGATAAGSELGKHFEKYPKFEINKISTELNRI 76 (720) T ss_pred Cch----HHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhhccCCCCCCHHHHHHHHHHHhhCCCCeEEEccHHHHHHHH Confidence 543 34444555555554444333222222 22444432 3333221 112233346679999999999 Q ss_pred HHHHHHhhcCCccEEEeecCChh-HHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeeee Q lcl|NC_015158. 80 HSNYISALFPNERWLKWEGKSLQ-DEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESGA 158 (581) Q Consensus 80 ~~~l~~~~f~~~~~~~~~~~~~~-d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~~ 158 (581) .+... .|+.-++|.|+..+ |.+.|+++..++.+-..+|++..+.++.|.+.++.|.|++.+-+.....- +.. T Consensus 77 ~g~~~----~nr~d~~v~P~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~G~~~v~~d~~~~~---d~~ 149 (720) T protein:vir:35 77 ISEYR----HNRITVKFRPGDKTASEALANKLNGLFRADYEETDGGEACDNAFDDGSTGGFGCFRLTTNLVNAL---DPM 149 (720) T ss_pred HhHHH----hCCCceEEEcCCCcchHHHHHHHHHHHHHHHHhcCchHHHhHHHHHhhhccceeEEeeecccccC---CCC Confidence 98887 58888999999665 77889999999999999999999999999999999999998865321100 000 Q ss_pred EeeeeccceEEe--cchhheeecCCCC--CcccCceEE-EEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccc Q lcl|NC_015158. 159 TRDTYFGPRAVR--IDPKDIVFNPVAV--DFAHSPKII-RTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTR 233 (581) Q Consensus 159 ~~~~~~~p~ie~--V~p~df~~DP~a~--~~~d~~~i~-r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~ 233 (581) . ..-..++++ +++.++||||.|+ +.+||.||. +.+++++++.++..... ... .. ... T Consensus 150 ~--~~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~yp~~a--~~~--------~~--~~~---- 211 (720) T protein:vir:35 150 D--ERQRICLEPIYDPARSVWFDPDAKKYDKSDAEWAFCMYSLSAEKYKAEYNKDP--ATL--------MS--GIE---- 211 (720) T ss_pred c--ccceeeEecccCchhheeecccccccChhhhhhhhhhcCCCHHHHHHhCCCcc--ccc--------cc--ccc---- Confidence 0 000113444 3567999999997 567998855 55679999988732211 000 00 000 Q ss_pred hhhhhccccccccccccccccCCceEEEEEEeeee-----------------ecccCCce-------------------e Q lcl|NC_015158. 234 EDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDY-----------------HDTQSGTF-------------------K 277 (581) Q Consensus 234 ~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~-----------------~d~~~d~~-------------------~ 277 (581) .....+++..+.|++.|||-.- ....++.+ . T Consensus 212 -------------~~~~~d~~~~~~v~i~E~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~ 278 (720) T protein:vir:35 212 -------------RSWDYDWYDVDVVYIAKYYEVKKESVDVVSFQNPLTSETVTYDSDQLELVEDELADIGFIEAARRTI 278 (720) T ss_pred -------------ccccccccCCCceEEEEeeEEEEEEEEEEEeecCCCCCeeecCCccHHHHHHHHhhhccccccccce Confidence 0001122334455555555211 00111100 0 Q ss_pred eeeEEEE-EeCCEEEEeecCCCccCCCCeeEe-c-ccccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEE Q lcl|NC_015158. 278 RNMKVTI-IDRMFVIEEKENPSWFAQAPIFHC-G-WRIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKV 354 (581) Q Consensus 278 e~~~itv-~~g~~iir~~~nP~~~g~~Pf~~~-~-~~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v 354 (581) ..+.+.+ ..+.+.+-...+|+|++.+||+-+ + ...+.++....|+++.++|+|+.+|..++.+++.++.+..-+... T Consensus 279 ~~~~v~~~~~~g~~~l~~~~~~p~~~fP~vP~~g~r~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~ 358 (720) T protein:vir:35 279 KRRRVYVSVVDGEGFLEKAQRIPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDAQRLYNLQVSMLADSATQDTGSIPIV 358 (720) T ss_pred eEEEEEEEeeccchhcccCCCCCCCccceEEEEeeeeccCCCcccceeeecchhHHHHHHHHHHHHHHHHHcCCcccccc Confidence 0111211 123444445678888888887532 2 234467788889999999999999999999999997654432211 Q ss_pred -eccccc--------------------cccCCceeEEeCCCCCcccccCCCccchhHHHHHHHHHHHHHhcCCchHhcCC Q lcl|NC_015158. 355 -KGDVEE--------------------FVWGPMEQIYINGDGDVEMMAPNTQALQADMQIQILEAKMEEFAGAPREAMGI 413 (581) Q Consensus 355 -~~d~~~--------------------i~~~pG~vi~~~~~~~i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~ 413 (581) +++.+. +...+|.++.. ++.+.+.+++..+.....+++.....+++.|||...+.|. T Consensus 359 a~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~G~~~~~--~~~~~~~~~~~~~~~~~~llq~~~~~i~~vsGi~~~~lG~ 436 (720) T protein:vir:35 359 GKSQIKTLEKYWANRNKNRPAFLPLNEIVDKQGNIIAP--PTPVGYTQPQPLNQAMAALLQQTGADIQEVTGSSQAMQPM 436 (720) T ss_pred CcchHHHHHHHhhccccccccccccccccccCcccccC--CCcccccCCCCCchHHHHHHHHHHHHHHHHhCCChHHcCc Confidence 111111 11123333322 2345566667777777888999999999999999999996 Q ss_pred CCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceeeecCchhcccCCCccCH-------- Q lcl|NC_015158. 414 RTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADTIRVFDSDDKVATFMNVNK-------- 485 (581) Q Consensus 414 ~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~iR~~~~~~~~~~~~~v~r-------- 485 (581) .+ |.++-.+++++.+++..+..+.+++... .+.+.+.++.+..+.++.+.++||+|++ +...++.++. T Consensus 437 ~s--n~SG~Ai~~rq~qg~~~~~~~~Dnl~~~-~~~~g~~lL~lI~~~y~~er~~RI~~ed-~~~~~v~~n~~~~d~~~g 512 (720) T protein:vir:35 437 PS--NIAKETVNHLMHRSDMSSFIYLDNMAKS-LKRAGEVWLSMAREVYGSDRQVRIVNAD-GTDDIALMSVVINDNQTG 512 (720) T ss_pred cc--chHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHcCCCcEEEEecCC-CCcceEeechhhhccCCC Confidence 44 3333348889999999999999999965 4667777777778888999999999986 4444544432 Q ss_pred -----HHh-cCCceEEEecchhHH-HHHHHHHHHHHhhcccccccccchh-HHHHHHHHHHHHhcCCCcccc-------- Q lcl|NC_015158. 486 -----DDI-TAKGRLRPVGARHFA-EQAQVVQSLMGIANTPVWQDIKPHV-STENLAKMLEHNLSLGGWDIF-------- 549 (581) Q Consensus 486 -----~di-~~~~~vva~ga~~~~-~r~q~~q~L~~~~~~~~~~~i~p~~-~~~~l~~~~~e~~~l~~~~~~-------- 549 (581) .|| .|.++|+..-+.+.. .|++..+.|.++++. +.|.. ....+...+.++.++++-+-+ T Consensus 513 ~~v~~NDi~~g~yDv~v~~~p~~~s~req~~~~m~qll~~-----~~p~~~~~~~~~~~ile~~d~p~~~e~~erirk~~ 587 (720) T protein:vir:35 513 QVVAMNDLSSGRYDVTVDVGPSYTARRDATVSVLTNLLAG-----MLPQDPMRQVLQGIILDNMEGEGLDEFKEYNRKQL 587 (720) T ss_pred ceeeeecceeeeeEEEEecccCcccHHHHHHHHHHHHHHh-----cCCCchhHHHHHHHHHHhcCchhHHHHHHHHHhhc Confidence 344 467787666544444 467666666666542 11110 011122233333333331000 Q ss_pred ----cCC-CCcHHHHHHHHHHHHHHHHH--HHHhcccCC Q lcl|NC_015158. 550 ----KPN-VAVMEAQTTSALVNQSQAQI--EEEAQVPLV 581 (581) Q Consensus 550 ----~~~-~~~~~~~~~q~~~q~aq~~~--~~~~~~~~~ 581 (581) ... ..+.+++..+.++|++|+.. .++.|..|- T Consensus 588 ~~~~~~~~~~~e~qq~~a~~qq~~qq~~~e~~~aqa~l~ 626 (720) T protein:vir:35 588 LTQGVVKPRNTEEEQMVAQMIQQAQQPNAELVAAQGVLM 626 (720) T ss_pred chhcccCccChhHHHHHHHHHHHHHhHhHHHHHHHHHHH Confidence 011 11112222121111111110 001111111 No 45 >protein:vir:7017 Length: 515 # NCBI annotation: head portal protein # Family: family:all:481 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853590;genbank:gi:31711672;genbank:GeneID:1481798 Probab=100.00 E-value=2e-36 Score=216.31 Aligned_cols=479 Identities=12% Similarity=0.095 Sum_probs=295.2 Q ss_pred CccchhhhhhhccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhcccccccccccccccccccccchHHHHHHHH Q lcl|NC_015158. 1 MTGKVLELQQMLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTTTTTNSTLPWKNKTTLPKLCQIRDNLH 80 (581) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~~~~~~~~~~k~~~~~pki~~~~d~~~ 80 (581) |-.++.+.. .=-+.|++.|+.++..|++|+..|.+|.+|+.|+.....+... ...+++-+--...+.+|. T Consensus 1 ~~~~~~~~~--------~~~~~l~~r~~~Lk~~R~~~e~~w~e~~~~tlP~~~~~~~~~~--~~~~~~dstg~~a~~~LA 70 (515) T protein:vir:70 1 MQDTILEYG--------GQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNKGDNE--TSQNGWQGVGAQATNHLA 70 (515) T ss_pred Ccchhhhhc--------CCHHHHHHHHHHHHHhhhHHHHHHHHHHHHhcccccCCCCCcc--cccccccchHHHHHHHHH Confidence 222211111 1127899999999999999999999999999997533222211 111244455568889999 Q ss_pred HHHHHhhcC-CccEEEeecCChhH-------HH------HHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEee Q lcl|NC_015158. 81 SNYISALFP-NERWLKWEGKSLQD-------EA------KRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEY 146 (581) Q Consensus 81 ~~l~~~~f~-~~~~~~~~~~~~~d-------~~------~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~ 146 (581) +.||..+|| +..||++....... .+ .-+.+++.+...|..|||+..+++.+.+++.+|||++-++ T Consensus 71 a~l~~~ltpp~~~WF~l~~~d~~~~~l~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~d- 149 (515) T protein:vir:70 71 NKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYKP- 149 (515) T ss_pred HHHHHhhcCCCCcccccccChhhhhccccchhHHHHHHHHHHHHHHHHHHHHHhcCchHHHHHHHHHHHhHCeEEEEEe- Confidence 999999999 78999998654322 11 2245678888999999999999999999999999987653 Q ss_pred ecceeeeeeeeeEeeeeccceEEecchhheeecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhc Q lcl|NC_015158. 147 VKETTKDEESGATRDTYFGPRAVRIDPKDIVFNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRR 226 (581) Q Consensus 147 ~~~~~~~~~~~~~~~~~~~p~ie~V~p~df~~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~ 226 (581) +. ..+..+++-++++..++..-=++ ++.|..+|..+|-+...... ..... T Consensus 150 --~~---------------~~~~~~pl~~y~v~~d~~G~v~~-i~rr~~~t~~~l~~~f~~~~---------~~~~~--- 199 (515) T protein:vir:70 150 --SK---------------GAMSAVPMHHYVVNRDTNGDLMD-VILLQEKALRTFDPATRMAI---------EVGMK--- 199 (515) T ss_pred --CC---------------CCeEEEEcCeEEEeeCCCcCeeE-EEeeeeccHHHHHHhhhhhh---------hhhhh--- Confidence 11 01455777889998877643332 34566778877754321100 00000 Q ss_pred cCCcccchhhhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCee Q lcl|NC_015158. 227 GLGTYTREDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIF 306 (581) Q Consensus 227 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~ 306 (581) ..+ .+. ...+++|.+ ++ ...++...++. -.+|.++++ +.-||+...||. T Consensus 200 ---~~~-----------~~~---------~~~v~i~~~---v~-~~~~~~~~~~~--e~d~~~~~~--es~y~~~e~P~~ 248 (515) T protein:vir:70 200 ---GKK-----------CKE---------DDNVKLYTH---AQ-YAGEGFWKINQ--SADDIPVGK--ESRIKSEKLPFI 248 (515) T ss_pred ---hhh-----------cCC---------CCceEEEEE---EE-ecCCCceEEEE--ecCceeecc--ccccccccCCce Confidence 000 000 012333322 11 12223221111 234555544 445667889999 Q ss_pred EecccccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc----ccccccCCceeEEeCCCCCcccccC Q lcl|NC_015158. 307 HCGWRIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD----VEEFVWGPMEQIYINGDGDVEMMAP 382 (581) Q Consensus 307 ~~~~~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d----~~~i~~~pG~vi~~~~~~~i~~~~~ 382 (581) +..|...++..||.|+++...+..+.+|.+.+.++.+..++++|.+.+..+ +..+...+.+.|..++.+++.+++. T Consensus 249 ~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~l~~~~~g~iv~g~~~~v~~~~~ 328 (515) T protein:vir:70 249 PLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSEAMARGAALMADIKYLIRPGSQTDVDHFVNSGTGEVITGVAEDIHIVQL 328 (515) T ss_pred eeeeeecCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCCeeeCcccccchhhccccCCceeecCCcccceeeec Confidence 999999999999999999999999999999999999999999999988643 4444444434455566778877764 Q ss_pred CC--ccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_015158. 383 NT--QALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRR 460 (581) Q Consensus 383 p~--~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~ 460 (581) .+ ....+...|+.+.+.+.+..=+.. .....+...|||+|....+.....+--+.-++..+++.||+.+.. . T Consensus 329 ~~~~d~~~~~~~i~~~~~rI~~af~~~~--l~~rd~~rvTAtEV~~r~~E~~~~LGpv~srL~~Ell~Pli~r~~----~ 402 (515) T protein:vir:70 329 GKYADLTPISAVLEVYTRRIGVIFMMET--MTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAMTMQTPIAMWGL----Q 402 (515) T ss_pred CcccchhHHHHHHHHHHHHHHHHHhhhh--hhccCCccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHH----H Confidence 32 344456667777776655321111 111223368999999988888888888888899999999876532 1 Q ss_pred hcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccc-ccc----ccchhHHHHHHH Q lcl|NC_015158. 461 NLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPV-WQD----IKPHVSTENLAK 535 (581) Q Consensus 461 n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~-~~~----i~p~~~~~~l~~ 535 (581) .. +..++.++++..+ .. ..+-+.|.|.++.|.++++... .++ +.-.++..++++ T Consensus 403 ~~-----------------~p~~P~~~v~~~~---vs-~l~~L~r~q~~~~i~~~~q~i~~~~~~~p~~~~~id~d~~~~ 461 (515) T protein:vir:70 403 EA-----------------GDSFTSELVDPVI---VT-GIEALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMD 461 (515) T ss_pred hh-----------------CCCCChhhcccce---eh-hHHHHHHHHHHHHHHHHHHHHHHHhccChhHHhhCCHHHHHH Confidence 11 2233444443332 22 3344567776666655554421 223 333466778888 Q ss_pred HHHHHhcCCCcccccCCCCcHHHHHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 536 MLEHNLSLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 536 ~~~e~~~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) ++++..|.+. .++++.-++.+..+ |+.+++. ++...++=+... T Consensus 462 ~~a~~~g~p~-~~~rs~eev~~~r~-q~~~~~~-~~~~~~~~~~a~ 504 (515) T protein:vir:70 462 WVRGQISAEL-PFLKSEEEMQQEMA-QQAQAQQ-EAMLNEGVAKAV 504 (515) T ss_pred HHHHHhCCCc-cccCCHHHHHHHHH-HHHHHHH-HHHHHHhhhhhc Confidence 9988888776 35555433322211 1111111 111111111111 No 46 >protein:vir:80211 Length: 514 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522882;genbank:gi:158345175;genbank:GeneID:5687474 Probab=100.00 E-value=3.1e-36 Score=215.22 Aligned_cols=476 Identities=11% Similarity=0.085 Sum_probs=295.9 Q ss_pred HHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhccccccccccccccc--ccccccchHHHHHHHHHHHHHhhcC-CccEEE Q lcl|NC_015158. 19 LAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTTTTTNSTLPWK--NKTTLPKLCQIRDNLHSNYISALFP-NERWLK 95 (581) Q Consensus 19 ~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~~~~~~~~~~k--~~~~~pki~~~~d~~~~~l~~~~f~-~~~~~~ 95 (581) .-+..+.+|. +..|++|+..|.+|.+|+.|+..+..+++..-+. .+.+-+--...+++|.+.||..+|| +..||+ T Consensus 1 m~~~~~~l~~--k~~R~~~e~~w~e~a~~~lP~~~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~ 78 (514) T protein:vir:80 1 MRQQASAMWA--EYRDSTAIRKAEDFAKFTIASLMVDPLDKTHQAEVVEYDFQSAGAFLVNNLTAKLALTLFPPGRPSFQ 78 (514) T ss_pred CccchHHHHH--HhhcchHHHHHHHHHHHhcccccCCCCCCcccccccccccchhHHHHHHHHHHHHHhhhcCCCCcccc Confidence 2222344455 4569999999999999999975433222211111 1122333457789999999999999 789999 Q ss_pred eecCCh-------hHHHH------HHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeeeeEeee Q lcl|NC_015158. 96 WEGKSL-------QDEAK------RDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESGATRDT 162 (581) Q Consensus 96 ~~~~~~-------~d~~~------ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~~~~~~ 162 (581) +.+.+. ++.+. -+..++.+...|..|||+..+++.+.|++.+|||++.++= +. T Consensus 79 l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~---------~~----- 144 (514) T protein:vir:80 79 IELDDTLQELAAANGIDQSELHSRTADLERRATRRLFVNASLSKLHRILKLLVVTGNALFYREP---------GT----- 144 (514) T ss_pred cccCchhhhhccccchhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEEEec---------CC----- Confidence 987532 11122 2446777889999999999999999999999999877631 00 Q ss_pred eccceEEecchhheeecCCCCC-cccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchhhhhccc Q lcl|NC_015158. 163 YFGPRAVRIDPKDIVFNPVAVD-FAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDCEKAVG 241 (581) Q Consensus 163 ~~~p~ie~V~p~df~~DP~a~~-~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~ 241 (581) ..+..++.-++++..++.. +.. ++.|..+|.++|-+-.. .+.. +....+ T Consensus 145 ---~~~~~~pl~~y~v~~d~~G~v~~--i~rr~~~~~~~l~~~~~---------~~~~-------------~~~~~~--- 194 (514) T protein:vir:80 145 ---GKMLVWTMQSYTVRRTSHGDPAV--VVLRQQMPFRELTPEIQ---------ADAQ-------------AKQIAK--- 194 (514) T ss_pred ---CcEEEEEcCeEEEeeCCCcCeEE--EEeeeeecHHHhhhhhh---------hhhh-------------hhhccC--- Confidence 1245677788999887663 333 35566778777632110 0000 000000 Q ss_pred cccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccccCCcccCCC Q lcl|NC_015158. 242 FSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIRQDNLYAMG 321 (581) Q Consensus 242 ~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p~~~~G~s 321 (581) -....++++.+.. ..+..+++...++ .-.+|.++++. .-|++...||.+..|...++..||.| T Consensus 195 ------------~~~~~v~v~~~v~-~~~~~~~~~~sv~--~e~~g~~i~~e--s~y~~~e~P~i~~Rw~~~~ge~YGrg 257 (514) T protein:vir:80 195 ------------RDSDKCDLYTVIE-WQPTPNGKRCAVW--HELEGKRVGPE--SSYPAHLCPYVPVAWNVPDGEHYGRG 257 (514) T ss_pred ------------CCCCceEEEEEEE-eecCCCCeEEEEE--Eeccceeeccc--CccccccCCeeeeeeEecCCCCcccc Confidence 0011244444431 1112223322111 12466776554 44556788999999999999999999 Q ss_pred cHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc----ccccccCCceeEEeCCCCCcccccCCC--ccchhHHHHHH Q lcl|NC_015158. 322 PLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD----VEEFVWGPMEQIYINGDGDVEMMAPNT--QALQADMQIQI 395 (581) Q Consensus 322 ~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d----~~~i~~~pG~vi~~~~~~~i~~~~~p~--~~~~~~~~lq~ 395 (581) +++...+..+.+|.+.+.++.+..+++.|.+.+..+ ++.+...+.+.+..++++++.+++... ....+...|+. T Consensus 258 p~~~al~D~k~L~~l~~~~l~~~~~a~~~~~~v~~~g~~~~~~l~~~~~g~~v~g~~~~v~~~~~~~~~d~~~~~~~i~~ 337 (514) T protein:vir:80 258 YVEEYSGDFARLSILSERLGLYEFEALSLLNLVDEAKGGAVDDYRDAETGDFVPGQVGSVASYERGDYNKIAQASASVES 337 (514) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceeCcccccchhhhcccCCceeecCCCccceeeecCcccchHHHHHHHHH Confidence 999999999999999999999999999999988653 454544444445566677788766432 34445666777 Q ss_pred HHHHHHHhcCCchHhcCC--CCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceeeecCc Q lcl|NC_015158. 396 LEAKMEEFAGAPREAMGI--RTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADTIRVFDS 473 (581) Q Consensus 396 ~~~~~ee~TGv~~~~~G~--~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~iR~~~~ 473 (581) +.+.+.+. |.+.. +.....|||+|....+.....+--+..++..+++.||+...+..+.+... + T Consensus 338 ~~~rI~~a-----Fml~~~~rd~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~il~r~~~--------g- 403 (514) T protein:vir:80 338 IVMRLNRA-----FMYTGQVRDAERVTVEEIRTVAEEAENLLGGVYSLLAETLQAPLAYLTMYEASRGNG--------G- 403 (514) T ss_pred HHHHHHHH-----HhhhccCCCCCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhcc--------C- Confidence 77776653 22222 22234799999999988888889999999999999999887766543210 1 Q ss_pred hhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhccc-----ccccccchhHHHHHHHHHHHHhcCCCccc Q lcl|NC_015158. 474 DDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTP-----VWQDIKPHVSTENLAKMLEHNLSLGGWDI 548 (581) Q Consensus 474 ~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~-----~~~~i~p~~~~~~l~~~~~e~~~l~~~~~ 548 (581) .+..++.+-++.++ + ++ .+-+.|.+.++.|.++++.. ..+.+...++..++++.+++..|++.-.+ T Consensus 404 -----~lP~~p~~l~~~~~--v-s~-la~l~r~~~~~~l~~~~~~i~~l~~~~p~v~d~id~d~~~~~~a~~~Gvp~~~i 474 (514) T protein:vir:80 404 -----MLLGIAQGVYRPSI--I-TG-IPALTRNIETANILRATQEASAIVPALVQLSKRFDPEKLVERIFANNSVDLSTL 474 (514) T ss_pred -----CCCCCCchhhccee--e-ec-HHHHHHHHHHHHHHHHHHHHHHHhccchhhhhcCCHHHHHHHHHHHhCCCHhhc Confidence 12333344444332 2 22 23345554444444433321 23445666888899999999999996333 Q ss_pred ccCCCCcHHHHH------HHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 549 FKPNVAVMEAQT------TSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 549 ~~~~~~~~~~~~------~q~~~q~aq~~~~~~~~~~~~ 581 (581) ++. +++.+... +|-+|+.+|.+..++++.-+| T Consensus 475 ~~~-~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 512 (514) T protein:vir:80 475 SKD-PDVVAAEAEQEAALAQQQLDVASGALAAETSAGVL 512 (514) T ss_pred cCC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Confidence 333 22211111 112222344445556666666 No 47 >protein:vir:105641 Length: 516 # NCBI annotation: putative head-tail connector # Family: family:all:481 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425006;genbank:gi:83571754;uniprot:Q2WC46;genbank:GeneID:3837282 Probab=100.00 E-value=3.7e-36 Score=214.85 Aligned_cols=480 Identities=12% Similarity=0.099 Sum_probs=304.5 Q ss_pred CccchhhhhhhccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhcccccccccccccccccccccchHHHHHHHH Q lcl|NC_015158. 1 MTGKVLELQQMLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTTTTTNSTLPWKNKTTLPKLCQIRDNLH 80 (581) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~~~~~~~~~~k~~~~~pki~~~~d~~~ 80 (581) |. ++| +.+.+.-.+.|+++|+..+..|++|+..|.+|.+|+.|+.-...++.. ...+++-+--...+.+|. T Consensus 1 ~~------~~~-~~~~~~~~~~l~~r~~~L~~~R~~~e~~w~e~a~~~lP~~~~~~~~~~--~~~~~~dstg~~a~~~LA 71 (516) T protein:vir:10 1 MK------QST-DLEYGGKRSKIPKLWEKFSTKRSSFLDRAKHYSKLTLPYLMNDKGDNE--TSQNGWQGVGAQATNHLA 71 (516) T ss_pred CC------chh-hHhhhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhcccccCCCCCcc--cccccccchHHHHHHHHH Confidence 32 223 335666679999999999999999999999999999997533222221 112345555568889999 Q ss_pred HHHHHhhcC-CccEEEeecCChhH-------------HHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEee Q lcl|NC_015158. 81 SNYISALFP-NERWLKWEGKSLQD-------------EAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEY 146 (581) Q Consensus 81 ~~l~~~~f~-~~~~~~~~~~~~~d-------------~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~ 146 (581) +.||+.+|| +..||++....... ++-.+.+++.+...|..|||+..+++.+.+++.+|||++-.+ T Consensus 72 a~l~~~ltpp~~~WF~L~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~d- 150 (516) T protein:vir:10 72 NKLAQVLFPAQRSFFRVDLTAQGEKVLNQRGLKKTELATIFAQVETRAMKELEQRQFRPAVVEAFKHLIVAGSCMLYKP- 150 (516) T ss_pred HHHHhhhcCCCCccccccCChhhHhhhhccCchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEeEEec- Confidence 999999999 78999998765432 112355777888999999999999999999999999976442 Q ss_pred ecceeeeeeeeeEeeeeccceEEecchhheeecCCCCC-cccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhh Q lcl|NC_015158. 147 VKETTKDEESGATRDTYFGPRAVRIDPKDIVFNPVAVD-FAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFR 225 (581) Q Consensus 147 ~~~~~~~~~~~~~~~~~~~p~ie~V~p~df~~DP~a~~-~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~ 225 (581) +. ..+..+++-++++..++.. +.+ .+.|..+|..+|-+... + ...... T Consensus 151 --~~---------------~~~~~~pl~~y~v~~d~~G~v~~--ivrr~~~~~~~l~e~~~--~----~~~~~~------ 199 (516) T protein:vir:10 151 --SK---------------GAISAIPMHHYVVNRDTNGDLLD--IILLQEKSLRTFDPATR--A----VVEVGL------ 199 (516) T ss_pred --CC---------------CCeEEEEcCeEEEeeCCCCCeEE--EeeeecccHHHHHHHhh--h----hhhhhh------ Confidence 11 1245667788999887653 433 34566668766643210 0 000000 Q ss_pred ccCCcccchhhhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEE-EeCCEEEEeecCCCccCCCC Q lcl|NC_015158. 226 RGLGTYTREDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTI-IDRMFVIEEKENPSWFAQAP 304 (581) Q Consensus 226 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv-~~g~~iir~~~nP~~~g~~P 304 (581) ....+ +. ...++++.+ ++-..++. +.+.. .+|..+.+... |+....| T Consensus 200 --------~~~~~------~~---------~~~~~i~t~---v~~~~~~~----~~~~~~~d~~~~~~~s~--~~~~e~P 247 (516) T protein:vir:10 200 --------KGKKC------KE---------DDSIKLYTH---AKYLGEGF----WELKQSADDIPVGKVSK--IKSEKLP 247 (516) T ss_pred --------hhhcc------CC---------CCceEEEEE---EEecCCCc----eEEEEeeCceeeccccc--cccccCC Confidence 00000 00 011233221 11111222 22222 34455544444 4456789 Q ss_pred eeEecccccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc----ccccccCCceeEEeCCCCCcccc Q lcl|NC_015158. 305 IFHCGWRIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD----VEEFVWGPMEQIYINGDGDVEMM 380 (581) Q Consensus 305 f~~~~~~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d----~~~i~~~pG~vi~~~~~~~i~~~ 380 (581) |.+..|...++..||.|+++..++..+.+|.+.+.++.++.++++|.+.+..+ +..+...+.+.|..++++++.++ T Consensus 248 ~~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~l~~~~~g~~~~g~~~~v~~~ 327 (516) T protein:vir:10 248 FIPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGAQTDVDHFVNSGTGEVVTGVEEDIHIV 327 (516) T ss_pred eeeeeeeecCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCCcccCcccccchhhhccCCCceeecCCcccceee Confidence 99999999999999999999999999999999999999999999999988643 34443333234445666777776 Q ss_pred cCCC--ccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015158. 381 APNT--QALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEIS 458 (581) Q Consensus 381 ~~p~--~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~ 458 (581) +..+ ....+...|+.+.+.+.+..=+. +.....+...|||+|....+.....+.-+.-.+..+++.|||...+.- T Consensus 328 q~~~~~d~~~~~~~i~~~~~rI~~af~~~--~l~~rd~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~~- 404 (516) T protein:vir:10 328 QLGKYADLTPISAVLEVYTRRIGVVFMME--TMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFATTMQSPVAMWGLLE- 404 (516) T ss_pred ecCcccchHHHHHHHHHHHHHHHHHHhhh--hhhccCCccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHh- Confidence 6443 24445566777776665532111 112223346899999998888888888899999999999998765311 Q ss_pred HhhcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhccc-----ccccccchhHHHHH Q lcl|NC_015158. 459 RRNLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTP-----VWQDIKPHVSTENL 533 (581) Q Consensus 459 ~~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~-----~~~~i~p~~~~~~l 533 (581) + .+ .++.+.+..+ + .. ..+-+.|+|.++.|.++++.. ..+.+...++..++ T Consensus 405 ---~-~p----------------~~P~~lv~~~--~-v~-~i~~L~raq~~~~i~~~~q~i~~~~q~~p~v~d~id~d~~ 460 (516) T protein:vir:10 405 ---A-GD----------------SFTSDLVDPV--I-IT-GIEALGRMAELDKLANFAQYMSLPLQWPEPVLAAVKWPDY 460 (516) T ss_pred ---h-CC----------------CCChhhcCcc--e-eh-hHHHHHHHHHHHHHHHHHHHHHHHhcCChHHHhhcCHHHH Confidence 1 11 1223333222 2 22 234566777766665554432 12334455677788 Q ss_pred HHHHHHHhcCCCcccccCCCCcHHHHHHHHHHHHHHHH--HHHHhcccCC Q lcl|NC_015158. 534 AKMLEHNLSLGGWDIFKPNVAVMEAQTTSALVNQSQAQ--IEEEAQVPLV 581 (581) Q Consensus 534 ~~~~~e~~~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~--~~~~~~~~~~ 581 (581) .+.++++.|++. .++++.-++.+..++++.+|+.++. -...++.-++ T Consensus 461 ~~~~a~~~gvp~-~~irs~eev~~~r~~~~~~q~~~~~~~~~~~~~~~~~ 509 (516) T protein:vir:10 461 MDWVRGQISAEL-PFLKSAEEMEQEQEAQMQAQQAQMLEEGVAKAVPGVI 509 (516) T ss_pred HHHHHHHhCCCh-hccCCHHHHHHHHHHHHHHHHHHHHHHHhhhcccchh Confidence 999999999996 6788877775555555444433222 1122222222 No 48 >protein:vir:345 Length: 663 # NCBI annotation: virion structural protein # Family: family:all:3199 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203459;genbank:gi:15320615;genbank:GeneID:921720 Probab=99.90 E-value=2.5e-23 Score=144.49 Aligned_cols=517 Identities=13% Similarity=0.107 Sum_probs=279.4 Q ss_pred CccchhhhhhhccchhhhHHHHHHHHHh-hHHhhhhhHHHHHHH----HHH-HhhcccccccccccccccccccccchHH Q lcl|NC_015158. 1 MTGKVLELQQMLDDTRDGLAEQIANTWQ-NWNSQRQEWLSQKSE----LRN-YIFATDTTTTTNSTLPWKNKTTLPKLCQ 74 (581) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~a~~i~~~~~-~~~~~r~~~~~~~~~----~~~-y~~~~~~~~~~~~~~~~k~~~~~pki~~ 74 (581) |+-.-.. -..++-+. +++.|. .-..+|+.|+ .|.+ +++ |..-...+. . +.+ -.+=|-. T Consensus 1 m~~~~~~---~~~~tpe~----la~~W~~~I~~a~~~~~-~~h~r~~~~~k~y~~~~~~~~---~----~~~-r~nl~~s 64 (663) T protein:vir:34 1 MNESQPT---DFADTPQG----WAQRWQEEMSAAREPLE-KWHTQGKEIVKRYRDERDSAH---D----AET-RWNLFST 64 (663) T ss_pred CCccccc---cchhcchh----HHHHHHHHHHHHHhccc-hHHHHHHHHHHHhhccccCCC---c----ccc-ccchhhh Confidence 3321110 01122222 555665 4455565433 3322 222 322111111 1 111 1243445 Q ss_pred HHHHHHHHHHHhhcCCccEEEeecCChhH-----HHHHHHHHHHHHHHHH--hcchHHHHHHHHHHHhhcCceEEEEeee Q lcl|NC_015158. 75 IRDNLHSNYISALFPNERWLKWEGKSLQD-----EAKRDAIQQYMDNKVK--ESDFRTIMSQLLLDYIDYGNCFATVEYV 147 (581) Q Consensus 75 ~~d~~~~~l~~~~f~~~~~~~~~~~~~~d-----~~~ae~~~~~i~~~l~--e~n~~~~~~~~~~d~~~~G~~i~k~~~~ 147 (581) .+.++.+++. +......+.++=.+. ...+|.++++++.-+. +.+|...+...++|+++.|.|++++-|. T Consensus 65 ni~~i~P~iY----ar~P~p~V~~rf~d~d~~~~r~ase~leR~~~~~~~~D~~~l~~~~~~~v~d~ll~~rG~~~v~Ye 140 (663) T protein:vir:34 65 NIQTQMASLY----GQTPKVSVSRRFADADDDVARVASELLERLLNTDIEKDSDTFQQALEYALQDRLLPGFGLCRIRYE 140 (663) T ss_pred hHHHHhhhhh----cCCCcceeeecccCcccchhhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhhccccceEEEEee Confidence 6666666666 333344555554442 3456888888876663 3569999999999999999999999996 Q ss_pred cceeeeeee-----------------eeEeeeeccceEEecchhheeecCCCCCcccCceE-EEEEecHHHHHHHhhccC Q lcl|NC_015158. 148 KETTKDEES-----------------GATRDTYFGPRAVRIDPKDIVFNPVAVDFAHSPKI-IRTVLNEGELLQMEQDQP 209 (581) Q Consensus 148 ~~~~~~~~~-----------------~~~~~~~~~p~ie~V~p~df~~DP~a~~~~d~~~i-~r~~~T~~el~~m~~~~~ 209 (581) .+...+.-. ......+.+-.|++|.=.||..|| |+...+..++ .|..+|++++...- + T Consensus 141 ~~~~~~~~~~~~~D~~~~~~~a~~~~~~e~~a~E~v~id~v~~~dfl~~p-Ar~W~ev~wva~r~~mtk~e~~~rf---~ 216 (663) T protein:vir:34 141 VEWEEVAGVDAILDEATGAELAAAVPPTQRKAYECVETDYLHWQDVLWSP-ARVWHEVRWLAFRNLLDMREFNARF---D 216 (663) T ss_pred cccchhccccccCCCccccchhcccccchhhcccceeeeeechhhcccch-hhccccccceeeeccCCHHHHHHhh---c Confidence 543321110 111222445678999999999999 6777787664 57788998876542 1 Q ss_pred ccchhHHHHHHHHh-hhccCCcccchhhhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeC- Q lcl|NC_015158. 210 ENASLASAIARRRE-FRRGLGTYTREDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDR- 287 (581) Q Consensus 210 ~~~~~~~d~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g- 287 (581) ...| .... .....+. +.++ .++-. ..-....+|+|.|... ...|++++.| T Consensus 217 ~~~~------~~~~a~~~~~~~----~~~~-----~~~~~----~~~~~~a~VwEIWdK~---------~~~V~w~~eg~ 268 (663) T protein:vir:34 217 ADGS------RNLWASVPKVGK----PKDG-----KDGQS----CHPWDRAEVWEIWDKG---------GRKVDWYVEGY 268 (663) T ss_pred CChh------hhhhhhccCcCC----cccc-----CCCCC----cchhcCcceeEEEecC---------CcEEEEEEcCc Confidence 1111 0000 0000000 0000 01111 1112367899999421 2355566555 Q ss_pred CEEEEeecCCCccCCC-----CeeEecccccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEec----cc Q lcl|NC_015158. 288 MFVIEEKENPSWFAQA-----PIFHCGWRIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKG----DV 358 (581) Q Consensus 288 ~~iir~~~nP~~~g~~-----Pf~~~~~~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~----d~ 358 (581) +.+|+..+.| .|.. ||+.+++.. ++++++.+..-+..+.|.++| ..+..|..+....-+.+.... ++ T Consensus 269 ~~~L~~~~p~--lgl~~ffPcPrpl~~~~~-~ds~ipvpd~~~y~~~~~E~n-~~t~Rin~l~d~ikv~gvy~~~~g~~i 344 (663) T protein:vir:34 269 SAVLDTQPDP--LGLESFFPCPKPLLANWT-TDKVVPRPDFVLAQDLYKEID-LVSTRITLLERAIRVVGVYDKSSGLTI 344 (663) T ss_pred ceecccCCCC--CCCCCCCCCcccccceec-CCCeecCCcHHHHHHHHHHHH-HHHHHHHHHHhhhhhceeeccccchhH Confidence 4777754443 4432 556555555 368888888889999999999 667788888888888887542 11 Q ss_pred cc-c-ccCCceeEEeCC-------C---CCcccccCCCccchhHHH---HHHHHHHHHHhcCCchHhcCCCCcccccHHH Q lcl|NC_015158. 359 EE-F-VWGPMEQIYING-------D---GDVEMMAPNTQALQADMQ---IQILEAKMEEFAGAPREAMGIRTPGEKTAFE 423 (581) Q Consensus 359 ~~-i-~~~pG~vi~~~~-------~---~~i~~~~~p~~~~~~~~~---lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtg 423 (581) -+ + ...-+..+.+.+ + +.|.+++.++....++.. -..+....+++||+.+.+-|. ..+++|||+ T Consensus 345 ~~~l~~a~~n~lvpV~~~~~~~~~gg~~k~I~~~pi~~~~~aI~~l~~~r~qir~d~~qITGiaDi~Rga-~~a~ETatA 423 (663) T protein:vir:34 345 GRLLSEAAQNDLIPVENWLTFADKGGLRGVVDWFPLEPVVAALTSLRDYRRELVDALHQVTGMADIMRGA-SDPRETAMA 423 (663) T ss_pred HHHHHHhhCCCceecchhhhhhhhcCccchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHhHHHHhhcc-cCcchhhHH Confidence 11 1 111223444431 1 346667666555444443 355667789999999998885 567999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceeeecCchhcccCCCccC-------HHHhcCCceEEE Q lcl|NC_015158. 424 VQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADTIRVFDSDDKVATFMNVN-------KDDITAKGRLRP 496 (581) Q Consensus 424 v~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~iR~~~~~~~~~~~~~v~-------r~di~~~~~vva 496 (581) -++..+.+|.|++..-+...+ +.+.++++..+.+..+.+.+.+-+++|.++-. -.++. .+.+ -.|+|-+ T Consensus 424 Q~IKsq~gS~RIqe~qdevqR-~arDi~ql~AEIl~~~~~~etl~~m~~~elp~--~~ei~~~~~~L~n~~~-r~~~ldI 499 (663) T protein:vir:34 424 QGVKAKFGSIRLQRLQDEVAR-FASDIQRLKAEVIAEHYDVASILAQANAEFTF--DKELAPKAAELIKSRF-SMYRVEV 499 (663) T ss_pred HHHHHHHHhHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCHHHHHHHhcCCCCc--ccchhHHHHHHhcCCC-cceeeee Confidence 999999999999999988885 67889999999999998877777777765321 11221 1112 1123311 Q ss_pred -ecch----hHHHHHHHHHHHHHhhccc--------ccccccchhHHHHHHHHHHHHhcCCCccc------c-------- Q lcl|NC_015158. 497 -VGAR----HFAEQAQVVQSLMGIANTP--------VWQDIKPHVSTENLAKMLEHNLSLGGWDI------F-------- 549 (581) Q Consensus 497 -~ga~----~~~~r~q~~q~L~~~~~~~--------~~~~i~p~~~~~~l~~~~~e~~~l~~~~~------~-------- 549 (581) .+++ ...+|+..+..|..+.+.. +.++..|.+. +|+|+..-.+.... ++ | T Consensus 500 e~dsT~~~D~~~eK~~~~E~l~~i~~~~qq~~pl~~q~p~~~p~l~--Ellk~~~~~f~~~~-qie~ai~~~~~~~e~aa 576 (663) T protein:vir:34 500 KPEAVSLQDFAALRNEKMEVLSGIASFMQGVAPLAQQVPGSAPFLL--QMLKWSVSGLRGSS-TIEGVLDKAIAAAEEAQ 576 (663) T ss_pred ccCCCCcCChHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHH--HHHHHHhhcCChhh-hHHHHHHHHHhhhHHHh Confidence 1222 2234444333333332221 2334455333 56665433222211 11 1 Q ss_pred ----cCCCCcHHHHHH----HHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 550 ----KPNVAVMEAQTT----SALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 550 ----~~~~~~~~~~~~----q~~~q~aq~~~~~~~~~~~~ 581 (581) .|.|+....+.+ |+-.|.+.++.+.++|+... T Consensus 577 ~~~~~~~pa~~~~~~k~~~~q~k~q~~~aeAq~e~q~~~~ 616 (663) T protein:vir:34 577 KQAAQQSPAPQQPDPKVVAQAMKGQQEMAKVQAEVQGDLL 616 (663) T ss_pred hccCCCCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 222222222211 11111111233333343333 No 49 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=99.75 E-value=3.6e-16 Score=105.23 Aligned_cols=430 Identities=12% Similarity=0.043 Sum_probs=236.6 Q ss_pred Cccchhhhhhhccchh--hhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhcccc---cc--c--ccccccc--cccccc Q lcl|NC_015158. 1 MTGKVLELQQMLDDTR--DGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDT---TT--T--TNSTLPW--KNKTTL 69 (581) Q Consensus 1 ~~~~~~~~~~~~~~~~--~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~---~~--~--~~~~~~~--k~~~~~ 69 (581) |.|.+-.++...--+. +.....|.++.+.+.. ....+.++++|+..-+. +. . .....+. .+++.+ T Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~----~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri~~ 80 (472) T protein:vir:93 5 QPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHLE----KLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMIT 80 (472) T ss_pred CCcchhhhhceeeecCchhhHHHHHHHHHHHHHH----HHHHHHHHHHHhccccccccccchhhcccccccccccccccc Confidence 7777766666654211 2334555555554432 33566667777766431 11 0 0111122 235677 Q ss_pred cchHHHHHHHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecc Q lcl|NC_015158. 70 PKLCQIRDNLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKE 149 (581) Q Consensus 70 pki~~~~d~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~ 149 (581) |-...+++...++|+ +++ +++.+ ++++..+. +++.+. .++.....++.+++.+||.|+..+.+... T Consensus 81 n~~~~ivd~~~~~l~----g~~--~~~~~---~d~~~~~~----l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~d 146 (472) T protein:vir:93 81 NFHANLVDQKVSYIV----GKP--IAFKH---TDDEVVKR----IDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEE 146 (472) T ss_pred chHHHHHHHHhhhhc----ccC--eeecc---CChHHHHH----HHHHHh-ccHHHHHHHHHHHHhhcCeEEEEEEECCC Confidence 888888888888775 333 23322 33333344 444443 47888889999999999999888765211 Q ss_pred eeeeeeeeeEeeeeccceEEecchhheee--cCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhcc Q lcl|NC_015158. 150 TTKDEESGATRDTYFGPRAVRIDPKDIVF--NPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRG 227 (581) Q Consensus 150 ~~~~~~~~~~~~~~~~p~ie~V~p~df~~--DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~ 227 (581) .+|++..++|.++++ |+... .+--+++|.+.+..+ . T Consensus 147 --------------~~~~i~~~~p~~~~~i~d~~~~--~~~~~~ir~~~~~~~-----------~--------------- 184 (472) T protein:vir:93 147 --------------GEFKLFRVPAEQGIPIWTDKEH--EELEAFIRMYKLENE-----------T--------------- 184 (472) T ss_pred --------------CceEEEEEcccceEEEEcCCCC--CceEEEEEEEEeecc-----------e--------------- Confidence 236788899999754 54322 222223343332100 0 Q ss_pred CCcccchhhhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeE Q lcl|NC_015158. 228 LGTYTREDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFH 307 (581) Q Consensus 228 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~ 307 (581) ..+++...++..+.+-+ +.......- . .+...+. ..|.+.|..|++. T Consensus 185 ----------------------~~~~~~~~~~~~~~~~~-------~~~~~~~~~-~-~~~~~~~--~~~~~~~~vPvv~ 231 (472) T protein:vir:93 185 ----------------------KVEYWDKVTVNYYVYEN-------GSLIPDYSN-N-LENSKTH--FSTGSWGKIPFIP 231 (472) T ss_pred ----------------------eEEEEecCeEEEEEEec-------Ceeeecccc-c-ccccccc--cccCCCCCcceEE Confidence 00112222222222111 111000000 0 0111111 2234467788875 Q ss_pred ecccccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc-cc---cc--ccCCceeEEeCCCCCccccc Q lcl|NC_015158. 308 CGWRIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD-VE---EF--VWGPMEQIYINGDGDVEMMA 381 (581) Q Consensus 308 ~~~~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d-~~---~i--~~~pG~vi~~~~~~~i~~~~ 381 (581) +.. +-+|.|..+.++++++.+|.+...+.+++...++|.+.+.+. .+ +. ..+.++++.+.+++++.++. T Consensus 232 ~~n-----n~~g~s~~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~ 306 (472) T protein:vir:93 232 FKN-----NDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRLLRYYGAIKVSDNGGVDTIQ 306 (472) T ss_pred ecC-----CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCcccchhhHHHHhhccccccCCCCcceeEe Confidence 543 458999999999999999999999999999999998877652 11 11 12356788888889999888 Q ss_pred CCCccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_015158. 382 PNTQALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRN 461 (581) Q Consensus 382 ~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n 461 (581) .+.......+.++.+...+-+.+++|..+.+.- +++.+|.++...+...........+.|.. +++++++++.+++... T Consensus 307 ~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~n~Sg~Al~~~~~~l~~ka~~~~~~~~~-~l~~~~~li~~~~~~~ 384 (472) T protein:vir:93 307 VEVPVENSKKYLDELYQKIMLFGQAVDFSSDKF-GSAPSGVALEFLYTNLNLKADKLARKAKV-AIQELLWFVFEHFDIK 384 (472) T ss_pred ecCCHHHHHHHHHHHHHHHHHHhCCCCCCcccc-ccCchHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhCCC Confidence 776666777788999999999999998776542 24567777777777788888888888885 6788888887764321 Q ss_pred cCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHHHHHHHHh Q lcl|NC_015158. 462 LDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLAKMLEHNL 541 (581) Q Consensus 462 ~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~~~e~~ 541 (581) .+. .++...| ...-.....+.++.++.| . .+ ++.+.++ +. T Consensus 385 ~~~---------------------~~i~v~f--~~~~p~~~~~~~~~~~k~---~------gi---is~et~l----~~- 424 (472) T protein:vir:93 385 GEH---------------------KDVDISF--NYNKVANTELQVQTAQQS---M------GI---VSHETVL----EN- 424 (472) T ss_pred ccc---------------------ceeeEEe--CCCCCCCHHHHHHHHHHH---h------cc---CchHHHH----Hh- Confidence 110 1122111 111122233333322222 1 12 2322222 22 Q ss_pred cCCCcccccCCCCcHHHHHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 542 SLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 542 ~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) ++. .+ + .++|.++++++-++.. +++.++- T Consensus 425 -l~~----~~--d--~~~E~~ri~~E~~~~~--~~~~~~~ 453 (472) T protein:vir:93 425 -HPF----VE--D--LQAELERIEQEQMEYN--KQLPNLD 453 (472) T ss_pred -CCC----CC--C--HHHHHHHHHHHHHHHH--HhccCcC Confidence 121 12 1 2334444444322211 2222222 No 50 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=99.74 E-value=1.6e-15 Score=101.68 Aligned_cols=436 Identities=12% Similarity=0.047 Sum_probs=230.9 Q ss_pred Ccc--------chhhhhh-------------hccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhc-ccc-cccc Q lcl|NC_015158. 1 MTG--------KVLELQQ-------------MLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFA-TDT-TTTT 57 (581) Q Consensus 1 ~~~--------~~~~~~~-------------~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~-~~~-~~~~ 57 (581) |.. -+..++- -+++........|.+.-+.++..+.+ +|.++.+|+.. .|. .+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~---r~~~l~~yY~g~~~~i~~~~ 77 (501) T protein:vir:27 1 MEQTLFTDSTGQDLVLNLRFHRESRIRYRADNLEELMVNNWELLKNFINHHKLRQAP---RIQELLDYARGENHDVLQFG 77 (501) T ss_pred CCceeEEeccchhhhhhcccChhHHHhhccccccccccccHHHHHHHHHHHHHHHHH---HHHHHHHHhcCCCccccccC Confidence 111 0100000 01111111223455555544333333 45566667654 332 2222 Q ss_pred ccccccc--ccccccchHHHHHHHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHh Q lcl|NC_015158. 58 NSTLPWK--NKTTLPKLCQIRDNLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYI 135 (581) Q Consensus 58 ~~~~~~k--~~~~~pki~~~~d~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~ 135 (581) ..+-..+ +++.+|-..-+++...++++ +++ +++.....+ ..+..+.++.+.+..++|...+.++.+++. T Consensus 78 ~~~~~~~~~~ki~~n~~k~Ivd~~~~yl~----g~p--~~~~~~d~~---~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~ 148 (501) T protein:vir:27 78 RRKDREMADKRAVHNYGRMISKFKTGYLA----GNP--IRVEYDDND---NNSQNDDTIKRIGRINDIDSHNRTLIRDLS 148 (501) T ss_pred ccCccccccceeccchHHHHHHHHhhhhc----ccC--eeEecCCcc---chHHHHHHHHHHHHhcChhHHHHHHHHHHh Confidence 2222222 35677888888888888776 333 233333222 234456678888888999999999999999 Q ss_pred hcCceEEEEeeecceeeeeeeeeEeeeeccceEEecchhheee--cCCCCCcccCceEEEEEecHHHHHHHhhccCccch Q lcl|NC_015158. 136 DYGNCFATVEYVKETTKDEESGATRDTYFGPRAVRIDPKDIVF--NPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENAS 213 (581) Q Consensus 136 ~~G~~i~k~~~~~~~~~~~~~~~~~~~~~~p~ie~V~p~df~~--DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~ 213 (581) +||.|+..+++... .+|++..++|.++|+ |++.. .+.-+.+|.+.+..+ T Consensus 149 ~~G~a~~~vy~ded--------------~~~~i~~~~p~~~~~v~d~~~~--~~~~~~ir~~~~~~~------------- 199 (501) T protein:vir:27 149 QTGRAYEVIYRNEY--------------DETRIKRLNPLETFVIYDNSLE--DNSIAAVRYYNRGTL------------- 199 (501) T ss_pred hCCeEEEEEEeCCC--------------CceEEEEEccceeEEEecCCCC--CceEEEEEEEEeeec------------- Confidence 99999988875321 246788899999854 54432 122223333322100 Q ss_pred hHHHHHHHHhhhccCCcccchhhhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEe Q lcl|NC_015158. 214 LASAIARRREFRRGLGTYTREDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEE 293 (581) Q Consensus 214 ~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~ 293 (581) + +.+..+++|-+ +. +.....++..... T Consensus 200 -------------------------------~-----------~~~~~~~vyt~------~~-----v~~~~~~~~~~~~ 226 (501) T protein:vir:27 200 -------------------------------Q-----------NAKDVVEIYTN------EH-----IYTLDASDDFNEI 226 (501) T ss_pred -------------------------------C-----------CcEEEEEEEeC------Ce-----EEEEEeCCceeec Confidence 0 00011122210 00 0000001111111 Q ss_pred ecCCCccCCCCeeEecccccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEeccccc----c--ccCCce Q lcl|NC_015158. 294 KENPSWFAQAPIFHCGWRIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGDVEE----F--VWGPME 367 (581) Q Consensus 294 ~~nP~~~g~~Pf~~~~~~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d~~~----i--~~~pG~ 367 (581) ...|.+.|+.|++.+. .+-+|.|..+.++++++.++.+...+.+.+....+|.+.+.+...+ . .....+ T Consensus 227 ~~~~~~~g~vPvv~~~-----nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~~~~~~~~~~~~ 301 (501) T protein:vir:27 227 SVTTHAFGTVPITEFL-----NNVDGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPKGMQASDMKRTR 301 (501) T ss_pred cccccCCCcccEEEec-----CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccCCcccchhhhhhcC Confidence 2223346788877543 3457999999999999999999999999999999999887653111 1 111233 Q ss_pred eEEeCCC---------CCcccccCCCccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHH Q lcl|NC_015158. 368 QIYINGD---------GDVEMMAPNTQALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEK 438 (581) Q Consensus 368 vi~~~~~---------~~i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i 438 (581) .+.+..+ ++++++..+.........+..+...+-..|++|..+.|.. .++.++.++......+....... T Consensus 302 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~-~~n~Sg~Al~~~~~~l~~ka~~~ 380 (501) T protein:vir:27 302 LMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNIPDMSDTNF-SGNTSGEALKYKLFGLDQDRVDT 380 (501) T ss_pred ceeecccccccCCCCCcceeeeeccCCHHHHHHHHHHHHHHHHHHhCCcccCcccc-ccCchHHHHHHHHHHHHHHHHHH Confidence 4444332 2456666655555666778999999999999998777643 24567777777777777777888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcc Q lcl|NC_015158. 439 IMNFEVMLMEKVLNAMLEISRRNLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANT 518 (581) Q Consensus 439 ~r~f~~~~~~~li~~~~~f~~~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~ 518 (581) .+.|.. +++++++++++++.....- . ...-.++...| .........+.++ .+..+. T Consensus 381 ~~~~~~-~l~~~~~li~~~~~~~~~~--------~--------~~d~~~i~v~f--~~~~p~n~~e~ad---~~~kl~-- 436 (501) T protein:vir:27 381 QSQFTQ-GLKRRYRLAARIGSLVNEF--------K--------DFDESLLKITF--TPNLPKSLNEQVS---ILTGLG-- 436 (501) T ss_pred HHHHHH-HHHHHHHHHHHHHhhcccc--------c--------ccccccceEEe--CCCCCcCHHHHHH---HHHHHh-- Confidence 888885 5788889888775432110 0 00111222222 1222223344343 332222 Q ss_pred cccccccchhHHHHHHHHHHHHhcCCCcccccCCCCcHHHHHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 519 PVWQDIKPHVSTENLAKMLEHNLSLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 519 ~~~~~i~p~~~~~~l~~~~~e~~~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) .+ ++.+.++ +. ++. .+ + .++|.+++.++.+..-...++..+- T Consensus 437 ----g~---iS~et~l----~~--l~~----v~--D--~~~E~eri~~E~~e~~~~~~~~~~~ 478 (501) T protein:vir:27 437 ----GQ---VSQETAL----SL--SGL----VE--S--PNEELDKINKEVSEIDFKGYSNDFN 478 (501) T ss_pred ----cc---CcHHHHH----Hh--CCC----CC--C--HHHHHHHHHHHHHhhhHhhhcCccc Confidence 12 3332232 22 222 12 1 3445666555433222223333343 No 51 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=99.74 E-value=8.1e-16 Score=103.33 Aligned_cols=443 Identities=12% Similarity=0.088 Sum_probs=221.6 Q ss_pred hccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhccc-ccccccccc-cc-cccccccchHHHHHHHHHHHHH-h Q lcl|NC_015158. 11 MLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATD-TTTTTNSTL-PW-KNKTTLPKLCQIRDNLHSNYIS-A 86 (581) Q Consensus 11 ~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~-~~~~~~~~~-~~-k~~~~~pki~~~~d~~~~~l~~-~ 86 (581) |-.+..-+-+..|.++...+...+. .+..+..|+...+ ....+...- .. .+++.++-+.-+++.+..+|.- - T Consensus 1 ~~~~~~~d~~~~i~~L~~~~~~~~~----r~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~a~~l~~~G 76 (488) T protein:vir:23 1 MAETESIDPEKLRDQLLDAFENKQN----ELKSSKAYYDAERRPDAIGLAVPLDMRKYLAHVGYPRTYVDAIAERQELEG 76 (488) T ss_pred CCcccCCCHHHHHHHHHHHHHHHHH----HHHHHHHHHhcccchhhcCcccchhhhhhhhhcchHHHHHHHHHHhhhccc Confidence 5554344444556666665555543 3344445543332 221111111 11 2234455555566666655531 1 Q ss_pred hc-CCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeeeeEeeeecc Q lcl|NC_015158. 87 LF-PNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESGATRDTYFG 165 (581) Q Consensus 87 ~f-~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~~~~~~~~~ 165 (581) |+ +... .......++++..+ .+++.+.++++......+.+++.+||.|++.+....... ........ T Consensus 77 f~~~~~~--~~~~~~~~d~~~~~----~l~~i~~~N~~~~~~~~~~~~a~i~G~a~~~v~~~~~~~------~~~~~~~~ 144 (488) T protein:vir:23 77 FRIPSAN--GEEPESGGENDPAS----ELWDWWQANNLDIEATLGHTDALIYGTAYITISMPDPEV------DFDVDPEV 144 (488) T ss_pred eeccCCc--ccccccccchhHHH----HHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCccc------ccCCCCCc Confidence 11 1111 11112223333333 355667889999999999999999999999986543211 11122233 Q ss_pred ceEEecchhhe--eecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchhhhhccccc Q lcl|NC_015158. 166 PRAVRIDPKDI--VFNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDCEKAVGFS 243 (581) Q Consensus 166 p~ie~V~p~df--~~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~ 243 (581) |.|..+||.++ ++||....+ .+.++.+.+. ++ + T Consensus 145 ~~i~~~~p~~~~~~~d~~~~~~---~~~~~~~~~~-----------~~-----------------~-------------- 179 (488) T protein:vir:23 145 PLIRVEPPTALYAEVDPRTRKV---LYAIRAIYGA-----------DG-----------------N-------------- 179 (488) T ss_pred ceEEEeccceeEEEEecCCCce---EEEEEEEEec-----------CC-----------------C-------------- Confidence 57788899996 466653321 1112221110 00 0 Q ss_pred cccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccccCCcccCCCcH Q lcl|NC_015158. 244 MDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIRQDNLYAMGPL 323 (581) Q Consensus 244 ~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p~~~~G~s~~ 323 (581) .... ..+|.++. ++.| ...+|+ ..+ ...-|.+.|+.|++.+...+..+..+|+|.. T Consensus 180 --~~~~-~~~y~~~~--~~~~-----~~~~~~-------------~~~-~~~~~h~~g~vPvv~f~n~~~~~~~~G~s~i 235 (488) T protein:vir:23 180 --EIVS-ATLYLPDT--TMTW-----LRAEGE-------------WEA-PTSTPHGLEMVPVIPISNRTRLSDLYGTSEI 235 (488) T ss_pred --cEEE-EEEEecCc--EEEE-----EecCCc-------------eEe-ccccccCCCCcceEEeccccccCCcCCccch Confidence 0000 00011111 0010 011111 111 1123445788999988888889999999987 Q ss_pred H-hhhhHHHHHHHHHHHHHHHHHHhcCCeEEEec-cccc-----------cccCCceeEEeCCCCCcccccCCCccchhH Q lcl|NC_015158. 324 D-NLVGMQYRIDHLENLKADVFDLIAFPPMKVKG-DVEE-----------FVWGPMEQIYINGDGDVEMMAPNTQALQAD 390 (581) Q Consensus 324 ~-~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~-d~~~-----------i~~~pG~vi~~~~~~~i~~~~~p~~~~~~~ 390 (581) . .++++|+.+|.....+.+.+...++|+..+.+ ++++ +...+|++|....+..+...+.+.. ++. T Consensus 236 ~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~v~~~~~g~~~~~~q~~~~--~~~ 313 (488) T protein:vir:23 236 SPELRSVTDAAAQILMNMQGTANLMAIPQRLIFGAKPEELGINAETGQRMFDAYMARILAFEGGEGAHAEQFSAA--ELR 313 (488) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHhhhHHHHHhCCCcccccccccccchhhhhhhhhhccCCCCCCceeEecCCC--ChH Confidence 5 57899999999999999999999999865533 1111 1234678887766666555554432 344 Q ss_pred HHHHHHHHHHHH---hcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccce Q lcl|NC_015158. 391 MQIQILEAKMEE---FAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADT 467 (581) Q Consensus 391 ~~lq~~~~~~ee---~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~ 467 (581) +.+..++..+.. .|++|....|.....+-+|-++...+...........+.|.. .+++++++++.+... .+ T Consensus 314 ~~~~~l~~~i~~~~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~f~~-~l~~~~~l~~~~~~~-~~---- 387 (488) T protein:vir:23 314 NFVDALDALDRKAASYSGLPPQYLSSSSDNPASAEAIKAAESRLVKKVERKNKIFGG-AWEQAMRLAYKMVKG-GD---- 387 (488) T ss_pred HHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhcC-CC---- Confidence 456666666555 577888887764443445666777777777777888888885 678889988776321 10 Q ss_pred eeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHHHHHHHHhcCCCcc Q lcl|NC_015158. 468 IRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLAKMLEHNLSLGGWD 547 (581) Q Consensus 468 iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~~~e~~~l~~~~ 547 (581) .+.+..+..+.-...-...+.+.+. .+..+.+. +..+.|. ..+ .+.+ ++ T Consensus 388 ---------------~~~~~~~i~v~f~~~~~~s~~~~ad---a~~kl~~~--g~~~~s~---et~----~~~l---~~- 436 (488) T protein:vir:23 388 ---------------IPTEYYRMETVWRDPSTPTYAAKAD---AAAKLFAN--GAGLIPR---ERG----WVDM---GY- 436 (488) T ss_pred ---------------cchhhccceEEecCCCCCCHHHHHH---HHHHHHhc--ccccCCH---HHH----HHhC---CC- Confidence 0011111111111111223344443 33333321 2223333 222 2322 21 Q ss_pred cccCCCCcHHHHHHHHHHHHHHHHHHHHhcc------cCC Q lcl|NC_015158. 548 IFKPNVAVMEAQTTSALVNQSQAQIEEEAQV------PLV 581 (581) Q Consensus 548 ~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~------~~~ 581 (581) .+ ++.++.++.+.++++|..-+.++.. ... T Consensus 437 --~~--d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 472 (488) T protein:vir:23 437 --TI--VEREQMRQWLEQDQKQGLGLIGSLYGASTPEGKP 472 (488) T ss_pred --Cc--hHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccC Confidence 11 1122212111122222222111111 011 No 52 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=99.74 E-value=5.9e-16 Score=104.08 Aligned_cols=432 Identities=13% Similarity=0.078 Sum_probs=230.1 Q ss_pred Cccchhhh----hh-------hccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhcc-c-ccccccccccc--cc Q lcl|NC_015158. 1 MTGKVLEL----QQ-------MLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFAT-D-TTTTTNSTLPW--KN 65 (581) Q Consensus 1 ~~~~~~~~----~~-------~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~-~-~~~~~~~~~~~--k~ 65 (581) ..+.++++ +. -+++........|.+..+.++..|.+ .+.++++|+... + ........-.+ .+ T Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~---r~~~~~~yY~g~~~~i~~~~~~~~~~~~~~ 87 (501) T protein:vir:96 11 GQERVLNLRFHRESRIRYRADNLEELMVNNWELLKNFINHHKLRQAP---RIQELLDYARGENHDVLKSGRRKDNEMADK 87 (501) T ss_pred cceeccccccchhHHhhhcccccccccCChHHHHHHHHHHHHHHHHH---HHHHHHHHhcCCCCcccCccccCccccccc Confidence 11212111 00 11111122234566666655555444 455556666542 2 22222222222 23 Q ss_pred cccccchHHHHHHHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEe Q lcl|NC_015158. 66 KTTLPKLCQIRDNLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVE 145 (581) Q Consensus 66 ~~~~pki~~~~d~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~ 145 (581) ++.+|-..-+++...++++ .++ +++.+... +..+..++.|++.+..++|...+.++.+++.+||.|+..++ T Consensus 88 ri~~n~~k~Ivd~~~~yl~----g~p--~~~~~~~~---~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~ 158 (501) T protein:vir:96 88 RAVHNYGRMISKFKTGYLA----GNP--IRVEYDDN---DDNSQNDDAIKRIGRINDLDSLNRTLIRDLSQTGRAYEVIY 158 (501) T ss_pred eeecchHHHHHHHHhhhhc----ccC--eeEeeCCc---cchhHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEE Confidence 5677777788888877776 332 23433332 23345666788888889999999999999999999998886 Q ss_pred eecceeeeeeeeeEeeeeccceEEecchhhee--ecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHh Q lcl|NC_015158. 146 YVKETTKDEESGATRDTYFGPRAVRIDPKDIV--FNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRRE 223 (581) Q Consensus 146 ~~~~~~~~~~~~~~~~~~~~p~ie~V~p~df~--~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~ 223 (581) +... ..+++..++|.++| +|++.. .+.-+.+|.+....+ T Consensus 159 ~ded--------------g~~~i~~~~p~~~~~v~d~~~~--~~~~~~v~~~~~~~~----------------------- 199 (501) T protein:vir:96 159 RSEY--------------DETRIKRLSPLETFVIYDNSLE--DNSIAAVRYYNRGTL----------------------- 199 (501) T ss_pred EcCC--------------CceEEEEEccceeEEEEcCCCC--CceEEEEEEEEeecC----------------------- Confidence 5321 24678889999974 444322 111122222211000 Q ss_pred hhccCCcccchhhhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCC Q lcl|NC_015158. 224 FRRGLGTYTREDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQA 303 (581) Q Consensus 224 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~ 303 (581) .+.+..+++|-+ +. +...-.++........|.+.|+. T Consensus 200 --------------------------------~~~~~~~~vyt~------~~-----i~~~~~~~~~~~~~~~~~~~g~v 236 (501) T protein:vir:96 200 --------------------------------QSAKDVVEIYTD------EH-----IYTLDASDDFNEISVTTHAFGTV 236 (501) T ss_pred --------------------------------CCcEEEEEEEcC------Cc-----EEEEeeCCCceeccccccCCCcc Confidence 000111222210 00 01010111111112234456788 Q ss_pred CeeEecccccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecccc-----cc-ccCCceeEEeCCCC-- Q lcl|NC_015158. 304 PIFHCGWRIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGDVE-----EF-VWGPMEQIYINGDG-- 375 (581) Q Consensus 304 Pf~~~~~~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d~~-----~i-~~~pG~vi~~~~~~-- 375 (581) |++.+. .+.+|.|..+.++++++.+|.+...+.+++...++|++.+.+... .. .....+.+.+..++ T Consensus 237 Pvv~~~-----nn~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~~ 311 (501) T protein:vir:96 237 PITEYL-----NNIDGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPKGMQASDMKRTRLMQLKPPKSA 311 (501) T ss_pred ceEEec-----CCccCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecccccCcccchhhhhhcCeeeecccccc Confidence 876443 345799999999999999999999999999999999987765311 11 12234555554332 Q ss_pred -------CcccccCCCccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015158. 376 -------DVEMMAPNTQALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLME 448 (581) Q Consensus 376 -------~i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~ 448 (581) +++++..+....++...+..+...+-..|++|..+.|..+ ++.++.++...+...........+.|.. +++ T Consensus 312 ~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~-~n~Sg~Al~~~~~~l~~ka~~~~~~~~~-~l~ 389 (501) T protein:vir:96 312 DGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNTPDMSDTNFS-GNTSGEALKYKLFGLDQDRVDTQSQFTK-GLK 389 (501) T ss_pred cccccCcceeeEeccCCHHHHHHHHHHHHHHHHHHhCCcccCccccc-ccchHHHHHHHHHHHHHHHHHHHHHHHH-HHH Confidence 3455555544555667788888999999999988877432 4567777777777777777888888885 568 Q ss_pred HHHHHHHHHHHhhcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchh Q lcl|NC_015158. 449 KVLNAMLEISRRNLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHV 528 (581) Q Consensus 449 ~li~~~~~f~~~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~ 528 (581) ++++++..++.....- ....-.++...| ...-.....+.++ .+..+.+ + + T Consensus 390 ~~~~li~~~~~~~~~~----------------~~~d~~~i~i~f--~~~~p~n~~e~ad---~~~kl~g------~---i 439 (501) T protein:vir:96 390 RRYRLAARIGSLVNEF----------------KDFDESLLKITF--TPNLPKSLNEQVS---ILTGLGG------Q---V 439 (501) T ss_pred HHHHHHHHHHHhcccc----------------cccccccceEEe--CCCCCcCHHHHHH---HHHHHhc------c---C Confidence 8888887775432110 000111232222 1222223334343 3333221 2 3 Q ss_pred HHHHHHHHHHHHhcCCCcccccCCCCcHHHHHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 529 STENLAKMLEHNLSLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 529 ~~~~l~~~~~e~~~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) +.+.++ +. ++. .+ + .++|.+++..+.+ +..+-+.. T Consensus 440 S~et~~----~~--l~~----v~--D--~~~E~~ri~~E~~----~~~~~~~~ 474 (501) T protein:vir:96 440 SQETAL----SL--SGL----VE--S--PNEELDKINKEMS----EIDFKGYS 474 (501) T ss_pred chHHHH----Hh--CCC----CC--C--HHHHHHHHHHHHH----Hhhccccc Confidence 332222 22 221 12 1 2334444433222 21111111 No 53 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=99.73 E-value=1.8e-15 Score=101.42 Aligned_cols=434 Identities=13% Similarity=0.089 Sum_probs=227.9 Q ss_pred Cc---------cchhhhh-------------hhccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhcc-c-cccc Q lcl|NC_015158. 1 MT---------GKVLELQ-------------QMLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFAT-D-TTTT 56 (581) Q Consensus 1 ~~---------~~~~~~~-------------~~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~-~-~~~~ 56 (581) |. |.+..++ .-+++.....+..|.+.-+.++..+.+ .++++.+|+... + .... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~h~~~~~~---rl~~l~~yY~g~~~~i~~~ 77 (502) T protein:vir:48 1 MMEQTLFTDSTGQDLVLNLRFHRESRIRYRADNLEELMVNNWELLKNFINHHKLRQAP---RIQELLDYARGENHDVLKS 77 (502) T ss_pred CceeEEEEecchhHHHhhcccChhHHhhhcccchhhhccccHHHHHHHHHHHHHHHHH---HHHHHHHHhcCCCcccccc Confidence 10 1110000 011111112234455555544433333 445556666552 2 2222 Q ss_pred cccccccc--ccccccchHHHHHHHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHH Q lcl|NC_015158. 57 TNSTLPWK--NKTTLPKLCQIRDNLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDY 134 (581) Q Consensus 57 ~~~~~~~k--~~~~~pki~~~~d~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~ 134 (581) ...+-.++ +++.+|-..-+++...++|+ +++- ++.+... +..+...+++++.+..++|...+.++.+++ T Consensus 78 ~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl~----g~p~--~~~~~d~---~~~~~~~~~l~~~~~~N~~~~~~~~~~~~~ 148 (502) T protein:vir:48 78 GRRKDNEMADKRAVHNYGRMISKFKTGYLA----GNPI--RVEYDDN---EDNSQNDDAIKRIGRINDIDTHNRNLIRDL 148 (502) T ss_pred ccccccccccceeecchHHHHHHHHhhhhc----ccCe--eEecCCc---cchhHHHHHHHHHHhhcCHhHHHHHHHHHH Confidence 22222222 46667777778888777766 3333 3333332 223455667888888899999999999999 Q ss_pred hhcCceEEEEeeecceeeeeeeeeEeeeeccceEEecchhheee--cCCCCCcccCceEEEEEecHHHHHHHhhccCccc Q lcl|NC_015158. 135 IDYGNCFATVEYVKETTKDEESGATRDTYFGPRAVRIDPKDIVF--NPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENA 212 (581) Q Consensus 135 ~~~G~~i~k~~~~~~~~~~~~~~~~~~~~~~p~ie~V~p~df~~--DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~ 212 (581) .+||.|++.+++.. ..++++..++|.++|+ |+... .+.-+.+|.+.+..+ . T Consensus 149 ~~~G~a~~~v~~de--------------dg~~~i~~~~p~~~~~vydd~~~--~~~~~~ir~~~~~~~--------~--- 201 (502) T protein:vir:48 149 SQTGRAYEVIYRSE--------------YDETRIKRLSPLETFVIYDNSLE--DNSIAAVRYYNRGTL--------Q--- 201 (502) T ss_pred hhcCeEEEEEEeCC--------------CCceEEEEEcccceEEEEcCCCC--CceEEEEEEEEEeec--------C--- Confidence 99999988876531 1246788899999754 43321 122222333322100 0 Q ss_pred hhHHHHHHHHhhhccCCcccchhhhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCC-EEE Q lcl|NC_015158. 213 SLASAIARRREFRRGLGTYTREDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRM-FVI 291 (581) Q Consensus 213 ~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~-~ii 291 (581) .+. ...+.|+...+..+ ...|+ ..+ T Consensus 202 --------------------------------~~~-~~~~iyt~~~i~~~---------------------~~~~~~~~~ 227 (502) T protein:vir:48 202 --------------------------------NAK-DVVEIYTNQHIYTL---------------------DASDSFNEI 227 (502) T ss_pred --------------------------------CcE-EEEEEEeCCeEEEE---------------------EeCCceeec Confidence 000 00011111111100 00111 112 Q ss_pred EeecCCCccCCCCeeEecccccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEeccccc-----cc-cCC Q lcl|NC_015158. 292 EEKENPSWFAQAPIFHCGWRIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGDVEE-----FV-WGP 365 (581) Q Consensus 292 r~~~nP~~~g~~Pf~~~~~~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d~~~-----i~-~~p 365 (581) ... |.+.|+.|++.+. ..-.|.|..+.++++++.++.+...+.+.+...++|++.+.+.... .. .+. T Consensus 228 ~~~--~~~~g~vPvv~~~-----nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~ 300 (502) T protein:vir:48 228 SVT--PHAFGTVPITEFL-----NNADGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPQGMQASDMKR 300 (502) T ss_pred cce--ecCCCccceEEec-----CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCcccccccchhhhhh Confidence 222 3345778877543 3457999999999999999999999999999999999887653211 11 112 Q ss_pred ceeEEeC---------CCCCcccccCCCccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHH Q lcl|NC_015158. 366 MEQIYIN---------GDGDVEMMAPNTQALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQ 436 (581) Q Consensus 366 G~vi~~~---------~~~~i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~ 436 (581) .+.+... .++++.++..+.........+..+...+-..|++|..+.+.. .++.++.++...+........ T Consensus 301 ~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~-~~n~Sg~Alk~~~~~l~~k~~ 379 (502) T protein:vir:48 301 TRLMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNKDIHVFTNTPDMSDNHF-SGNASGEALKYKLFGLDQDRV 379 (502) T ss_pred cceeeccccccccccccCcceeEeeecCCHHHHHHHHHHHHHHHHHHhCCCCcCcccc-ccCchHHHHHHHHHHHHHHHH Confidence 3333333 223566666665555566778888999999999998776643 245677777777778888878 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhh Q lcl|NC_015158. 437 EKIMNFEVMLMEKVLNAMLEISRRNLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIA 516 (581) Q Consensus 437 ~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~ 516 (581) ...+.|.+ +++++++++.+++....... .....++...| ...-.....+.++ .+..+. T Consensus 380 ~~~~~~~~-~l~~~~~li~~~~~~~~~~~----------------~~d~~~i~i~f--~~~~p~d~~e~a~---~~~kl~ 437 (502) T protein:vir:48 380 DTQSQFTQ-GLKRRYRLAARIGSLVNEFK----------------DFDESRLKITF--TPNLPKSLYEQVS---ILNDLG 437 (502) T ss_pred HHHHHHHH-HHHHHHHHHHHHHhhccccc----------------ccccccceEEe--CCCCCcCHHHHHH---HHHHHh Confidence 88888885 57888888887754321100 00111222222 1122223334343 332222 Q ss_pred cccccccccchhHHHHHHHHHHHHhcCCCcccccCCCCcHHHHHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 517 NTPVWQDIKPHVSTENLAKMLEHNLSLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 517 ~~~~~~~i~p~~~~~~l~~~~~e~~~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) .+ ++.+.++ +. ++. .+ + .++|.+++..+.+ +.+.+...+.. T Consensus 438 ------g~---iS~et~l----~~--l~~----v~--D--~~~E~~ri~~E~~-~~~~~~~~~~~ 478 (502) T protein:vir:48 438 ------GQ---VSQETAL----SL--SGL----VE--N--PTEELDKINEESS-KIDFKGYPSYF 478 (502) T ss_pred ------cc---CcHHHHH----Hh--CCC----CC--C--HHHHHHHHHHHHH-hhhhhcccccc Confidence 12 3332232 22 121 12 2 2345555544322 12222222222 No 54 >protein:vir:101494 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655388;genbank:gi:109522576;genbank:GeneID:4157566 Probab=99.73 E-value=3.7e-17 Score=110.66 Aligned_cols=480 Identities=11% Similarity=0.088 Sum_probs=247.4 Q ss_pred CccchhhhhhhccchhhhHHHH--HHHHHhhHHhhhhhHHHHHHHHHHHhhccccccccccccc---ccccccccchHHH Q lcl|NC_015158. 1 MTGKVLELQQMLDDTRDGLAEQ--IANTWQNWNSQRQEWLSQKSELRNYIFATDTTTTTNSTLP---WKNKTTLPKLCQI 75 (581) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~a~~--i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~~~~~~~~~---~k~~~~~pki~~~ 75 (581) |.-. .+.+++..-..+.. +.+.-..++..| .+.|+.|.+|+......-.....-. -...+..| T Consensus 1 ~~~~----~~~~~~~~~~~~g~~~~p~~v~~~d~~R---l~aY~l~~~~y~n~~~~~~~~lrg~~~~~~r~~~~p----- 68 (527) T protein:vir:10 1 MGQD----KRQYGSTQQLRAGEANFPNAVTDFDKAR---LASYRLYEDMYLTNTSDYQVILRGGDEGDQRPIYVP----- 68 (527) T ss_pred CCcc----ccccCCCcCcCCccccCcccCCHHHHHH---HHHHHHHHHHhcCchhheeeecCCccccccceeeeh----- Confidence 2111 12222222222211 112223333444 3456666677665432221111000 01123333 Q ss_pred HHHHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeee Q lcl|NC_015158. 76 RDNLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEE 155 (581) Q Consensus 76 ~d~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~ 155 (581) .++.++....-|.+.|-...+....+..+.++++.+.+.|+...+.+.-+++++.|=|++++-|.... T Consensus 69 -------s~~~~~~~~~~~~~~g~~~~~~~~~e~v~~~lr~~~~~e~l~~~~~~~~r~~~vlGDg~f~l~wD~~k----- 136 (527) T protein:vir:10 69 -------NGEKLIEAKMRFLGQGLKWEFSKKDAKVDDAIKVLFDRENWEQKFESLKRWTEIRGDYVLLLIGDDEK----- 136 (527) T ss_pred -------hhHHhhCCcceeeccCccccccchhHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccCC----- Confidence 23666777777777777777777888899999999999999999999999999999999999995321 Q ss_pred eeeEeeeeccceEEecchhheeecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchh Q lcl|NC_015158. 156 SGATRDTYFGPRAVRIDPKDIVFNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTRED 235 (581) Q Consensus 156 ~~~~~~~~~~p~ie~V~p~df~~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 235 (581) ..-.+|++..|+|--+|+= .+.++..++.+.++-. ..++-..+ ..++ .++-.+.+.. ...+ T Consensus 137 -----~~~~R~~v~~~DP~~~f~~---ed~d~~~~v~~v~~~~--~~~~P~d~-~~~~-~~ar~~~~~~-------~l~~ 197 (527) T protein:vir:10 137 -----DEGSRLSLHEVDPSTYFPY---EDPRYPGQVLGVYLVD--EYPHPDSE-KKNE-KCARVQKYMK-------TLDD 197 (527) T ss_pred -----CcCCCceEeecCcceeeee---ecCCCCCceeeEEEee--eccCCccc-cccc-eehhhhhhhh-------hcCc Confidence 0011366777887776554 4455556665554321 00110000 0000 0000000100 0000 Q ss_pred hhhccccccccccccccccCCceEEEE-EEe--eeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccc Q lcl|NC_015158. 236 CEKAVGFSMDGFGNLYDYFQSPYVEVL-TFY--GDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRI 312 (581) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~~~~vevl-E~~--g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~ 312 (581) .| .+..++.++.. ++| |+|.|..+=.+....+- ...+..++.-..||+ +-+|+++++-.| T Consensus 198 ---------~g-----~~~~~G~~~yt~~~w~lg~w~d~~e~p~~~~~~~-~~~~~~~l~~lp~pi--~fiPvV~~~t~p 260 (527) T protein:vir:10 198 ---------DG-----KPVPGGAIKYTEELYEPGKWDDRPESPLEPDDIK-KLSTLTEEEPLPEQI--TTLPVFHFRGHP 260 (527) T ss_pred ---------cc-----ccccCcceeeeeceeeccccccccccccchhhhh-hhcCceeeecccCCC--CccceEeecCCC Confidence 00 00111222222 233 23433222221111111 122456666666764 568999999999 Q ss_pred cCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEec--------cccccccCCceeEEeCCCCCcccccCCC Q lcl|NC_015158. 313 RQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKG--------DVEEFVWGPMEQIYINGDGDVEMMAPNT 384 (581) Q Consensus 313 ~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~--------d~~~i~~~pG~vi~~~~~~~i~~~~~p~ 384 (581) .|++.||+|.+..++.+++++|+.....-..+..+.+|++.+.+ ...++.-.||.+|+..+++.+..+...+ T Consensus 261 ~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~Pi~~~tg~~~vd~~G~~~~~~VgPG~iweL~e~ak~~~v~~~~ 340 (527) T protein:vir:10 261 IMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFGGLGFYATDSAPPRDSRGNMVPWTISPLGMVEHGQNNKIYRVNGVA 340 (527) T ss_pred ccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCCceeeecccccccccCCcCccccCCceeEecCCCcceeeccchh Confidence 99999999999999999999999999999999999999997753 2344555699999999999877666543 Q ss_pred ccchhHHHHHHHHHHHHHhcCCchHhcCC-CCcccccHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015158. 385 QALQADMQIQILEAKMEEFAGAPREAMGI-RTPGEKTAFEVQQLQNAAGRIF-------QEKIMNFEVMLMEKVLNAMLE 456 (581) Q Consensus 385 ~~~~~~~~lq~~~~~~ee~TGv~~~~~G~-~~~~~~TAtgv~~l~~aa~~~~-------~~i~r~f~~~~~~~li~~~~~ 456 (581) .-..+..-+..+..-+.+.+|+|+.+.|. +.+.+.|...+...+...-++. +.+.|.|-.+.+...+. T Consensus 341 ~la~~~~h~~~L~~~l~~vA~~PavA~G~vD~s~~~SG~ALeL~L~PLlar~~rk~L~~~~vqrq~~~~~~~~~L~---- 416 (527) T protein:vir:10 341 SLEPSQTHMTKAEEAMQQTKGIPDIAVGVVDAAVAESGIALDLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLP---- 416 (527) T ss_pred hhHHHHHHHHHHHHHHHHhhcCCeeeeccccCCcCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHH---- Confidence 33345555778888899999999999994 3344555555443333332211 22222222111111111 Q ss_pred HHHhhcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHHHH Q lcl|NC_015158. 457 ISRRNLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLAKM 536 (581) Q Consensus 457 f~~~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~ 536 (581) .+.. +...+++ =....+++ -|----..+++.++++..+-+. ..+|++...++ T Consensus 417 aye~---------v~~~d~~-----------~~~~v~iv-f~p~lP~D~~avie~v~tL~~a-------Gi~S~~tAv~~ 468 (527) T protein:vir:10 417 AYEG---------VGIDDAD-----------KKLTVTIT-FRDPKPVNSEKRFNQLLQLWEA-------GLIPAKKLTEE 468 (527) T ss_pred Hhhh---------cccCCCc-----------cccceEEE-ecccCCCCHHHHHHHHHHHHHc-------CchhHHHHHHH Confidence 1110 1111100 00011111 1100112345566666555432 44565555555 Q ss_pred HHHHhcCCCcccccCCCCcHHHHHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 537 LEHNLSLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 537 ~~e~~~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) +.+. +++ ..|..+..+-.+..++|..| ++++..|+= T Consensus 469 L~~~---~g~--eD~E~E~~~I~~era~~a~a----~a~A~~~~~ 504 (527) T protein:vir:10 469 LSKI---MGF--ELTEEDFKQATEDKKTQGIA----QAEAADPFG 504 (527) T ss_pred HHhc---cCC--CChHHHHHHHHHHHHHHhHH----hhhhcCchh Confidence 5443 321 12322222222222222222 233333332 No 55 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=99.73 E-value=1.1e-15 Score=102.57 Aligned_cols=432 Identities=12% Similarity=0.041 Sum_probs=234.3 Q ss_pred Cccch-hhhhhhcc--chhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhccccc-----c--ccccccccc--cccc Q lcl|NC_015158. 1 MTGKV-LELQQMLD--DTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTT-----T--TTNSTLPWK--NKTT 68 (581) Q Consensus 1 ~~~~~-~~~~~~~~--~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~-----~--~~~~~~~~k--~~~~ 68 (581) .+-++ .+++.+.. ...+.....|.++.+.+.. ....+.++++|+...+.- . ......+++ +++. T Consensus 15 ~~~~~~~~~~~~~~~~~~~e~~~~~i~~~i~~~~~----~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~ 90 (483) T protein:vir:12 15 SQPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHLE----KLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMI 90 (483) T ss_pred CcchhhhhhhcccccCCchhhHHHHHHHHHHHHHH----HHHHHHHHHHHhccccccccccccccccccccccccccccc Confidence 22222 33455543 1122334555555554432 334566677777655311 0 011111222 4577 Q ss_pred ccchHHHHHHHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeec Q lcl|NC_015158. 69 LPKLCQIRDNLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVK 148 (581) Q Consensus 69 ~pki~~~~d~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~ 148 (581) +|-..-+++...++|+ +++- ++.+ ++++..+.++ +.+. .++.....++.+++.+||.|+..+.+.. T Consensus 91 ~n~~k~Ivd~~~~~l~----G~p~--~~~~---~d~~~~~~l~----~~~~-n~~~~~~~~~~~~~~~~G~~y~~v~~d~ 156 (483) T protein:vir:12 91 TNFHANLVDQKVSYIV----GKPI--AFKH---TDDEVVKRID----EVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDE 156 (483) T ss_pred cchHHHHHHHHhhhhc----ccCc--eecc---CChHHHHHHH----HHHh-ccHHHHHHHHHHHHhhCCeEEEEEEEcC Confidence 8888888888887775 3332 2322 3333344443 3333 4678888999999999999988876531 Q ss_pred ceeeeeeeeeEeeeeccceEEecchhheee--cCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhc Q lcl|NC_015158. 149 ETTKDEESGATRDTYFGPRAVRIDPKDIVF--NPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRR 226 (581) Q Consensus 149 ~~~~~~~~~~~~~~~~~p~ie~V~p~df~~--DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~ 226 (581) . ..|++..++|.++|+ |++.. .+--+++|.+.+..+ T Consensus 157 d--------------~~~~i~~~~p~~~~~v~d~~~~--~~~~~~ir~~~~~~~-------------------------- 194 (483) T protein:vir:12 157 E--------------GEFKLFRVPAEQGIPIWTDKEH--EELEAFIRMYKLENE-------------------------- 194 (483) T ss_pred C--------------CceEEEEEcccceEEEEcCCCC--CceEEEEEEEEeecc-------------------------- Confidence 1 236788999999754 54322 111223333322100 Q ss_pred cCCcccchhhhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCee Q lcl|NC_015158. 227 GLGTYTREDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIF 306 (581) Q Consensus 227 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~ 306 (581) . ..+++...++..+.+.+ +.......- . .+...+....| +.|..|++ T Consensus 195 ------------------~----~~~~y~~~~v~~~~~~~-------~~~~~~~~~-~-~~~~~~~~~~~--~~g~vPvv 241 (483) T protein:vir:12 195 ------------------T----KVEYWDKVTVNYYVYEN-------GSLIPDYSN-N-LENSKTHFSTG--SWGKIPFI 241 (483) T ss_pred ------------------e----EEEEEecCeEEEEEEeC-------Ceeeecccc-c-ccccccccccC--CCCccceE Confidence 0 01122223333322221 111000000 0 01112222233 45677776 Q ss_pred EecccccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc-ccc---c--ccCCceeEEeCCCCCcccc Q lcl|NC_015158. 307 HCGWRIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD-VEE---F--VWGPMEQIYINGDGDVEMM 380 (581) Q Consensus 307 ~~~~~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d-~~~---i--~~~pG~vi~~~~~~~i~~~ 380 (581) .+. .+-+|.|..+.+.++++.+|.+...+.+.+...++|.+.+.+. .++ . ..+.++++.+.+++++.++ T Consensus 242 ~~~-----nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l 316 (483) T protein:vir:12 242 PFK-----NNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRLLRYYGAIKVSDNGGVDTI 316 (483) T ss_pred Eec-----CCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhHHHhhhhccccccCCCCcceEE Confidence 443 3557999999999999999999999999999999999877542 121 1 1235678888889999998 Q ss_pred cCCCccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_015158. 381 APNTQALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRR 460 (581) Q Consensus 381 ~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~ 460 (581) ..+.....+...++.+...+-+.|++|..+.+.- +++.||.++..++..+........+.|.. +++++++++.+++.. T Consensus 317 ~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~-~~n~Sg~Al~~~~~~l~~k~~~~~~~f~~-~l~~~~~li~~~~~~ 394 (483) T protein:vir:12 317 QVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKF-GSAPSGVALEFLYTNLNLKADKLARKAKV-AIQELLWFVFEHFDI 394 (483) T ss_pred eecCCHHHHHHHHHHHHHHHHHHhCCCCCCcccc-ccCcHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhcC Confidence 8776666777888999999999999998776532 24567777777788888888888888885 678888888776432 Q ss_pred hcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHHHHHHHH Q lcl|NC_015158. 461 NLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLAKMLEHN 540 (581) Q Consensus 461 n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~~~e~ 540 (581) ..+. .++...| ...-.....+.++.++.| . .+ +|.+.++ +. T Consensus 395 ~~~~---------------------~~i~v~f--~~~~p~~~~~~a~~~~kl---~------Gi---iS~et~~----~~ 435 (483) T protein:vir:12 395 KGEH---------------------KDVDISF--NYNKVANTELQVQTAQQS---M------GI---VSHETVL----EN 435 (483) T ss_pred CCcc---------------------ceeeEEe--CCCCCCCHHHHHHHHHHH---h------cc---CchHHHH----Hh Confidence 1110 1222222 111222334444333332 1 12 3332222 22 Q ss_pred hcCCCcccccCCCCcHHHHHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 541 LSLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 541 ~~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) ++. .+ ++ ++|.+++.++-++..+..+..+-. T Consensus 436 --~~~----v~--d~--~~E~~ri~~E~~~~~~~~~~~~~~ 466 (483) T protein:vir:12 436 --HPF----VE--DL--QAELERIEQEQMEYNKQLPNLDDG 466 (483) T ss_pred --CCC----CC--CH--HHHHHHHHHHHHHHHhhccccccc Confidence 121 12 22 334455444332222211111111 No 56 >protein:vir:102239 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655205;genbank:gi:109522785;genbank:GeneID:4157478 Probab=99.72 E-value=4.7e-17 Score=110.12 Aligned_cols=480 Identities=11% Similarity=0.087 Sum_probs=247.0 Q ss_pred CccchhhhhhhccchhhhHHHH--HHHHHhhHHhhhhhHHHHHHHHHHHhhccccccccccccc---ccccccccchHHH Q lcl|NC_015158. 1 MTGKVLELQQMLDDTRDGLAEQ--IANTWQNWNSQRQEWLSQKSELRNYIFATDTTTTTNSTLP---WKNKTTLPKLCQI 75 (581) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~a~~--i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~~~~~~~~~---~k~~~~~pki~~~ 75 (581) |.-. .+.+++..-..+.. +.+.-..++..| .+.|+.|.+|+......-.....-. -...+..| T Consensus 1 ~~~~----~~~~~~~~~~~~g~~~~p~~v~~~d~~R---l~aY~l~~~~y~n~~~~~~~~lrg~~~~~~r~~~~p----- 68 (527) T protein:vir:10 1 MGQD----KRQYGSTQQLRAGEANFPNAVTDFDKAR---LASYRLYEDMYLTNTSDYQVILRGGDEGDQRPIYVP----- 68 (527) T ss_pred CCcc----ccccCCCcCcCCccccCcccCCHHHHHH---HHHHHHHHHHhcCchhheeeecCCccccccceeeeh----- Confidence 2111 12222222222211 112223333443 3456666677665432221111000 01123333 Q ss_pred HHHHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeee Q lcl|NC_015158. 76 RDNLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEE 155 (581) Q Consensus 76 ~d~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~ 155 (581) .++.++....-|.+.|-...+....+..+.++.+.+.+.|+...+.+.-+++++.|=|++++-|.... T Consensus 69 -------s~~~~~~~~~~~~~~g~~~~~~~~~e~v~~~lr~~~~~e~l~~~~~~~~r~~~vlGDg~f~l~wD~~k----- 136 (527) T protein:vir:10 69 -------NGEKLIEAKMRFLGQGLKWEFSKKDAKVDDAIRVLFDRENWEQKFESLKRWTEIRGDYVLLLIGDDEK----- 136 (527) T ss_pred -------hhHHhhCCcceeeccCccccccchhHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccCC----- Confidence 23666777777777777777777888899999999999999999999999999999999999995321 Q ss_pred eeeEeeeeccceEEecchhheeecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchh Q lcl|NC_015158. 156 SGATRDTYFGPRAVRIDPKDIVFNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTRED 235 (581) Q Consensus 156 ~~~~~~~~~~p~ie~V~p~df~~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 235 (581) ..-.+|++..|+|--+|+= .+.++..++.+.++-. ..++-..+ ..++ ..+-.+.+.. ...+ T Consensus 137 -----~~~~R~~v~~~DP~~~f~~---ed~d~~~~v~~v~~~~--~~~~P~d~-~~~~-~~ar~~~~~~-------~l~~ 197 (527) T protein:vir:10 137 -----DEGSRLSLHEVDPSTYFPY---EDPRYPGQVLGVYLVD--EYPHPDSE-KKNE-KCARVQKYMK-------TLDD 197 (527) T ss_pred -----CcCCCceEeecCcceeeee---ecCCCCCceeeEEEee--eccCCccc-cccc-eehhhhhhhh-------hcCc Confidence 0011366777887776554 4455556665554321 00110000 0000 0000000100 0000 Q ss_pred hhhccccccccccccccccCCceEEEE-EEe--eeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccc Q lcl|NC_015158. 236 CEKAVGFSMDGFGNLYDYFQSPYVEVL-TFY--GDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRI 312 (581) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~~~~vevl-E~~--g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~ 312 (581) .| .+..++.++.. ++| |+|.|..+=.+....+- ...+..++.-..||+ +-+|+++++-.| T Consensus 198 ---------~g-----~~~~~G~~~yt~~~w~lg~w~d~~e~p~~~~~~~-~~~~~~~l~~lp~pi--~fiPvV~~~t~p 260 (527) T protein:vir:10 198 ---------DG-----KPVPGGAIKYTEELYEPGKWDDRPESPLEPDDIK-KLSTLTEEEPLPEQI--TTLPVFHFRGHP 260 (527) T ss_pred ---------cc-----ccccCcceeeeeceeeccccccccccccchhhhh-hhcCceeeecccCCC--CccceEeecCCC Confidence 00 00111222222 233 23433222221111111 122456666666764 568999999999 Q ss_pred cCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEec--------cccccccCCceeEEeCCCCCcccccCCC Q lcl|NC_015158. 313 RQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKG--------DVEEFVWGPMEQIYINGDGDVEMMAPNT 384 (581) Q Consensus 313 ~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~--------d~~~i~~~pG~vi~~~~~~~i~~~~~p~ 384 (581) .|++.||+|.+..++.+++++|+.....-..+..+.+|++.+.+ ...++.-.||.+|+..+++.+..+...+ T Consensus 261 ~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~Pi~~~tg~~~vd~~G~~~~~~VgPG~iweL~e~ak~~~v~~~~ 340 (527) T protein:vir:10 261 IMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFGGLGFYATDSAPPRDSRGNMVPWTISPLGMVEHGQNNKIYRVNGVA 340 (527) T ss_pred ccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCCceeeecccccccccCCcCccccCCceeEecCCCcceeeccchh Confidence 99999999999999999999999999999999999999997753 2344555699999999999877666543 Q ss_pred ccchhHHHHHHHHHHHHHhcCCchHhcCC-CCcccccHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015158. 385 QALQADMQIQILEAKMEEFAGAPREAMGI-RTPGEKTAFEVQQLQNAAGRIF-------QEKIMNFEVMLMEKVLNAMLE 456 (581) Q Consensus 385 ~~~~~~~~lq~~~~~~ee~TGv~~~~~G~-~~~~~~TAtgv~~l~~aa~~~~-------~~i~r~f~~~~~~~li~~~~~ 456 (581) .-..+..-+..+..-+.+.+|+|+.+.|. +.+.+.|...+...+...-++. +.+.|.|-.+.+...+. T Consensus 341 ~la~~~~h~~~L~~~l~~vA~~PavA~G~vD~s~~~SG~ALeL~L~PLlar~~rk~L~~~~Vqrq~~~~~~~~~L~---- 416 (527) T protein:vir:10 341 SLEPSQTHMNKAEEAMQQTKGIPDIAVGVVDAAVAESGIALDLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLP---- 416 (527) T ss_pred hhHHHHHHHHHHHHHHHHhhcCCeeeeccccCCcCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHH---- Confidence 33345555788888899999999999994 3344555555443333332211 22222222111111111 Q ss_pred HHHhhcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHHHH Q lcl|NC_015158. 457 ISRRNLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLAKM 536 (581) Q Consensus 457 f~~~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~ 536 (581) .+.. +...+++ =....+++ -|----..+++.++++..+-+ +..+|++...++ T Consensus 417 aye~---------v~~~d~~-----------~~~~v~iv-f~p~lP~D~~avie~v~tL~~-------aGiiS~etAv~~ 468 (527) T protein:vir:10 417 AYEG---------VGIDDAD-----------KKLTVTIT-FRDPKPVNNEKRFAQLLELWE-------AGLIPAKKLTEE 468 (527) T ss_pred Hhhh---------cccCCCc-----------cccceEEE-ecccCCCCHHHHHHHHHHHHH-------cCchhHHHHHHH Confidence 1110 1111100 00011111 111011234556666655543 244565555555 Q ss_pred HHHHhcCCCcccccCCCCcHHHHHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 537 LEHNLSLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 537 ~~e~~~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) +.+. +++ ..|..+..+=.+..++|..| ++++..|+= T Consensus 469 L~~~---~g~--eD~E~E~~~I~~era~~a~a----~a~a~~~~~ 504 (527) T protein:vir:10 469 LSKI---MGF--ELTEEDFRQATEDKKTQGIA----QAEAADPFG 504 (527) T ss_pred HHhc---cCC--CchHHHHHHHHHHHHHHhHH----hhhhcCchh Confidence 5443 321 12222211111222222222 233333332 No 57 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=99.72 E-value=4.2e-15 Score=99.38 Aligned_cols=424 Identities=13% Similarity=0.092 Sum_probs=228.0 Q ss_pred cchhhhhhhccchhhhHH-HHHHHHHhhHHhhhhhHHHHHHHHHHHhhcccccccccccccc--cccccccchHHHHHHH Q lcl|NC_015158. 3 GKVLELQQMLDDTRDGLA-EQIANTWQNWNSQRQEWLSQKSELRNYIFATDTTTTTNSTLPW--KNKTTLPKLCQIRDNL 79 (581) Q Consensus 3 ~~~~~~~~~~~~~~~~~a-~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~~~~~~~~~~--k~~~~~pki~~~~d~~ 79 (581) -|...-+-|.-+....+. ..|.++.+.++. ....+.++.+|+...+.--....+.++ .+++.+|-..-+++.. T Consensus 1 ~~~~~~~~~~~p~d~~~~~~~l~~~i~~~~~----~~~r~~~~~~yy~g~~~i~~~~~~~~~~~~~ki~~n~~~~ivd~~ 76 (453) T protein:vir:39 1 MKYKPPKLMTFPKDEPITNEVVTKFMEKHRL----EVARYEYLKNMYRGIMAIDAEPTKDLWKPDNRLTVNFTKYIVDTF 76 (453) T ss_pred CeecCCcceEcCCCCCCCHHHHHHHHHHHHH----HHHHHHHHHHHhhccCchhcCCCccccCccceeecchHHHHHHHH Confidence 233333333334333333 334444443332 223456666676544311111112222 2567778778888888 Q ss_pred HHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeeeeE Q lcl|NC_015158. 80 HSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESGAT 159 (581) Q Consensus 80 ~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~~~ 159 (581) .++|+ +++- .+.+. ++ ..++.|.+.+..++|...+.++.+++.+||.|+..+.+... T Consensus 77 ~~~l~----g~~~--~~~~~---d~----~~~~~l~~i~~~N~~~~~~~~~~~~~~~~G~~~~~v~~d~~---------- 133 (453) T protein:vir:39 77 TGYFN----GIPV--KKSHS---DK----ETLSKLQEFDNLNDMEDEESELAKMACIYGRAFELLYQNEE---------- 133 (453) T ss_pred hhhhc----ccCc--eeccC---Ch----HHHHHHHHHHHhcChhHHHHHHHHHHhhcCeEEEEEEecCC---------- Confidence 88775 3332 33322 21 12345777788899999999999999999999999876321 Q ss_pred eeeeccceEEecchhhe--eecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchhhh Q lcl|NC_015158. 160 RDTYFGPRAVRIDPKDI--VFNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDCE 237 (581) Q Consensus 160 ~~~~~~p~ie~V~p~df--~~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~ 237 (581) ..|++..++|.++ ++|+.... ..-+.+|..... + T Consensus 134 ----g~~~i~~~~p~~~~~v~d~~~~~--~~~~~ir~~~~~-------------~------------------------- 169 (453) T protein:vir:39 134 ----TQTNVIYNTPENMFMVYDDTIKQ--EPLFAVRYGYDD-------------D------------------------- 169 (453) T ss_pred ----CceEEEEEcccceEEEecCCCCC--eEEEEEEEEEeC-------------C------------------------- Confidence 2467888999986 44544321 112222222110 0 Q ss_pred hccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccccCCcc Q lcl|NC_015158. 238 KAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIRQDNL 317 (581) Q Consensus 238 ~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p~~~ 317 (581) .....++|+.+.+. .|- ..++++ .+.. ..|-+.|..|++.+. ..- T Consensus 170 ---------~~~~~~~yt~~~i~--~~~-----~~~~~~------------~~~~--~~~~~~g~vPvv~~~-----n~~ 214 (453) T protein:vir:39 170 ---------YKLYGEVYTKETTY--ALN-----GTMGFY------------NMTE--QAPNPFDDLPVVEFY-----FNE 214 (453) T ss_pred ---------eEEEEEEEeCCeEE--EEE-----ecCCce------------eeec--ccccCCCceeEEEec-----CCC Confidence 00001112222111 111 111111 1111 223345677776443 345 Q ss_pred cCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEec---ccccc-ccCCceeEEeC------CCCCcccccCCCccc Q lcl|NC_015158. 318 YAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKG---DVEEF-VWGPMEQIYIN------GDGDVEMMAPNTQAL 387 (581) Q Consensus 318 ~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~---d~~~i-~~~pG~vi~~~------~~~~i~~~~~p~~~~ 387 (581) +|.|..+.++++++.+|.+...+.+.+...++|++.+.+ +.+++ ..+.++++... .++++.+++.+.... T Consensus 215 ~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~p~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~~~~~ 294 (453) T protein:vir:39 215 ERMSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFLGAAVEEEDLKNIRSNRVINYYGESSEAKNVDVKFLEKPDSDS 294 (453) T ss_pred CCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCCCchhhhhhhhcceeeecCCCCCCCCCceeEEeecCCHH Confidence 799999999999999999999999999999999987754 12222 23345565543 345567777665555 Q ss_pred hhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccce Q lcl|NC_015158. 388 QADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADT 467 (581) Q Consensus 388 ~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~ 467 (581) .+...+..+...+-..|++|.++.+. .++.|+.++...............+.|.. +++++++++.++...... T Consensus 295 ~~~~~~~~l~~~I~~~s~~p~~~~~~--~gn~Sg~Al~~~~~~l~~ka~~~~~~~~~-~l~~~~~li~~~~~~~~~---- 367 (453) T protein:vir:39 295 QTENLLDRLTKLIFQTTMVANISDES--FGSSSGVSLAYKLQAMSNLALSFQRKFQS-SLNSRYKLYCELSTNVSN---- 367 (453) T ss_pred HHHHHHHHHHHHHHHHhCCccccccc--ccCChHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhccCC---- Confidence 66667888888888999999876543 34667777777777777777888888885 678899988877543211 Q ss_pred eeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHHHHHHHHhcCCCcc Q lcl|NC_015158. 468 IRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLAKMLEHNLSLGGWD 547 (581) Q Consensus 468 iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~~~e~~~l~~~~ 547 (581) . ..-.+|+..| .........+.++.++.| . ++.|. +.+ .+. ++. T Consensus 368 -----~---------~~~~~i~v~f--~~~~p~~~~~~a~~~~kl---~------g~is~---et~----l~~--l~~-- 411 (453) T protein:vir:39 368 -----K---------EAWKDIEYTF--TRNEPKDIKEQAETANIL---M------GITSQ---ETA----LSV--ISV-- 411 (453) T ss_pred -----c---------cccccceEEe--CCCCCcCHHHHHHHHHHH---h------ccCCh---HHH----HHh--CCC-- Confidence 0 0112333333 122233344444433332 2 12232 222 232 221 Q ss_pred cccCCCCcHHHHHHHHHHHHHHHHHHHH--hcccCC Q lcl|NC_015158. 548 IFKPNVAVMEAQTTSALVNQSQAQIEEE--AQVPLV 581 (581) Q Consensus 548 ~~~~~~~~~~~~~~q~~~q~aq~~~~~~--~~~~~~ 581 (581) .+ ++ +.|.+++.++-....+.. ...+-- T Consensus 412 --v~--D~--~~E~~ri~~E~~~~~~~~~~~~~~~~ 441 (453) T protein:vir:39 412 --IP--DV--QAEMEKIKKEEASTAIFDKDKQPSEK 441 (453) T ss_pred --CC--CH--HHHHHHHHHHHHHHHHHHHhccCCCC Confidence 12 22 334444443222222211 111111 No 58 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=99.71 E-value=1.6e-15 Score=101.69 Aligned_cols=406 Identities=12% Similarity=0.113 Sum_probs=223.0 Q ss_pred hccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhccc-c-----cccccccccccccccccchHHHHHHHHHHHH Q lcl|NC_015158. 11 MLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATD-T-----TTTTNSTLPWKNKTTLPKLCQIRDNLHSNYI 84 (581) Q Consensus 11 ~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~-~-----~~~~~~~~~~k~~~~~pki~~~~d~~~~~l~ 84 (581) |+ .++...|.+ .|.++++|+.-.+ . +.....+. .+++.+|-...+++....+|+ T Consensus 1 ~~---------------~~~~~~~~~---r~~~l~~yy~g~~~~~~~~~~~~~~~~~--~~ki~~n~~~~ivd~~~~~l~ 60 (440) T protein:vir:95 1 ML---------------AAFLGSQKQ---RLAILASYAQGDNFSILSGHRRLDDEKA--DYRVRHKWGGYISSFATGYVI 60 (440) T ss_pred Ch---------------hhHHHHHHH---HHHHHHHHhccCCcccccccccccccCC--cceeecchHHHHHHhhhhhee Confidence 22 223333433 3455555553322 1 11111221 235667877888888777764 Q ss_pred HhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeeeeEeeeec Q lcl|NC_015158. 85 SALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESGATRDTYF 164 (581) Q Consensus 85 ~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~~~~~~~~ 164 (581) .++- ++......+.+. .+.+++.+..+++...+.++.+++.+||.|+..+.+... . T Consensus 61 ----g~~~--~~~~~~~~~~~~----~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~--------------~ 116 (440) T protein:vir:95 61 ----GNPV--SIGVMEGGSADQ----LSTIKDIEWQNDINALNSDLAFDASVYGRAYEYHFRDKD--------------K 116 (440) T ss_pred ----ccCc--eEeeCCCccHHH----HHHHHHHHHhcCHhHHHHHHHHHHhhcCeEEEEEEecCC--------------C Confidence 4432 333333333332 335677788889999999999999999999998865321 1 Q ss_pred cceEEecchhheee--cCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchhhhhcccc Q lcl|NC_015158. 165 GPRAVRIDPKDIVF--NPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDCEKAVGF 242 (581) Q Consensus 165 ~p~ie~V~p~df~~--DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~ 242 (581) .|++..++|.++++ ||.... ..-+.+|.+... +. T Consensus 117 ~~~i~~~~p~~~~~~~d~~~~~--~~~~~i~~~~~~-----------~~------------------------------- 152 (440) T protein:vir:95 117 VDRVVLISPLEMFVIRDLTVEQ--NIIAAVHLPIYA-----------DK------------------------------- 152 (440) T ss_pred ceEEEEEcccceEEEEcCCCCC--ceEEEEEEEEec-----------Cc------------------------------- Confidence 36788899999754 554321 111122222110 00 Q ss_pred ccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccccCCcccCCCc Q lcl|NC_015158. 243 SMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIRQDNLYAMGP 322 (581) Q Consensus 243 ~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p~~~~G~s~ 322 (581) .+ .+.|+...+..+++++. ..+++ .++....|| .|..|++.+. ..-+|.|. T Consensus 153 ---~~---~~vyt~~~~~~~~~~~~----~~~~~------------~~~~~~~~~--~g~vPvv~~~-----n~~~g~sd 203 (440) T protein:vir:95 153 ---VN---MTVYTKDKVITYKPYSN----NSVRL------------VVDDVKKHS--YNDVPVVEWW-----NNRFRMGD 203 (440) T ss_pred ---eE---EEEEeCCeEEEEEEecC----Cccce------------eecceeecc--CceeeEEEee-----CCCCCCCc Confidence 00 01123333333343321 01110 111222343 5677877543 34579999 Q ss_pred HHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc-------ccccc-cCCceeEEe---------CCCCCcccccCCCc Q lcl|NC_015158. 323 LDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD-------VEEFV-WGPMEQIYI---------NGDGDVEMMAPNTQ 385 (581) Q Consensus 323 ~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d-------~~~i~-~~pG~vi~~---------~~~~~i~~~~~p~~ 385 (581) .+.+.++++.+|.+...+.+++...++|++.+.+. .+... .+..+.+.. ..+++++++..+.. T Consensus 204 ~e~v~~lida~~~~~s~~~~~~~~~~~~~~v~~g~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~~~ 283 (440) T protein:vir:95 204 YESEISLIDAYDAGQSDTANYMSDLNDAMLLVKGDLDGIKLSPEDAAKMKDANMLFLKTGISTTGQQTTADASYIYKQYD 283 (440) T ss_pred hhhhHHHHHHHHHHHHHHHHHHHHhhcceeeeecccccCCCCccchhhhhhccceecccccccccCCCCcceeEEeecCC Confidence 99999999999999999999999999999877652 11111 111222221 23345777776655 Q ss_pred cchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcc Q lcl|NC_015158. 386 ALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVA 465 (581) Q Consensus 386 ~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~ 465 (581) ...+...++.+...+-..|++|..+.+.- .++.+|.++..++...........+.|.. +++++++++.+++... +. T Consensus 284 ~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~-~l~~~~~li~~~~~~~-~~- 359 (440) T protein:vir:95 284 VNGTEAYKNRLANDIHRFSRIPNLDDDRF-NSTSSGIALLYKMIGLEQVRKDKETYFTK-ALRRRYELISNIHKAI-NG- 359 (440) T ss_pred HHHHHHHHHHHHHHHHHHhCCcccccccc-cccchHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhhc-CC- Confidence 55677778999999999999998777643 24667888888888888888888888886 5688999888775421 10 Q ss_pred ceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHHHHHHHHhcCCC Q lcl|NC_015158. 466 DTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLAKMLEHNLSLGG 545 (581) Q Consensus 466 ~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~~~e~~~l~~ 545 (581) .+ ....++...| .-.......+.++.++.| . .+ ++.+.+. +. ++. T Consensus 360 -------~~--------~~~~~v~i~f--~~~~p~~~~~~ad~~~kl---~------g~---iS~et~~----~~--l~~ 404 (440) T protein:vir:95 360 -------PV--------IEANKLTFTF--HPNIPQDVWTEIKAYIEA---G------GE---ISQETLM----EN--ASF 404 (440) T ss_pred -------cc--------cccccceEEe--CCCCCCCHHHHHHHHHHH---h------cc---CcHHHHH----Hh--CCC Confidence 00 0111222222 222233445545443333 1 12 2322222 22 222 Q ss_pred cccccCCCCcHHHHHHHHHHH-HHHHHHHHHhcccCC Q lcl|NC_015158. 546 WDIFKPNVAVMEAQTTSALVN-QSQAQIEEEAQVPLV 581 (581) Q Consensus 546 ~~~~~~~~~~~~~~~~q~~~q-~aq~~~~~~~~~~~~ 581 (581) .++. +|.++++. +.....+..++.+-. T Consensus 405 -------~d~~--~E~~ri~~E~~~~~~~~~~~~~~~ 432 (440) T protein:vir:95 405 -------TDYK--TEHSRILKQGGSSDLEIGQIVGDA 432 (440) T ss_pred -------CCcH--HHHHHHHHHHHHhhhhHHhhccCC Confidence 2222 23333333 333334444444544 No 59 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=99.71 E-value=6.9e-15 Score=98.23 Aligned_cols=426 Identities=12% Similarity=0.113 Sum_probs=220.5 Q ss_pred Cccch---hhhhhhccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhccccc------ccccccccccccccccc Q lcl|NC_015158. 1 MTGKV---LELQQMLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTT------TTTNSTLPWKNKTTLPK 71 (581) Q Consensus 1 ~~~~~---~~~~~~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~------~~~~~~~~~k~~~~~pk 71 (581) +..++ -+++.|+++ +. |.++-+.+. ......|.++.+|+...+.. .....+....+++.+|- T Consensus 16 ~~~~~~~~~~~~~~~~~--~~----i~~~i~~~~---~~~~~~~~~~~~yY~g~~~~i~~~~~~~~~~~~~~~~ki~~n~ 86 (481) T protein:vir:10 16 LANDDFVVSDLAELLKE--EN----LRNFISRHQ---TEQVPRLEMLESYYLNRNTDILAGERRLQKYGDKADHRAVHNY 86 (481) T ss_pred ccCceeeeecchhhcCH--HH----HHHHHHHHH---HHHHHHHHHHHHHhcCCCcccccCccccccccccccceeecch Confidence 11111 133445542 22 333333222 23344577777777554321 11111111233566677 Q ss_pred hHHHHHHHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeeccee Q lcl|NC_015158. 72 LCQIRDNLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETT 151 (581) Q Consensus 72 i~~~~d~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~ 151 (581) +..+++....++. +.+- ++.+. ++ ..++.+.+.+..++|...+.++.+++.++|.|++.+.+.+. T Consensus 87 ~~~ivd~~~~~l~----g~~~--~~~~~---d~----~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~~~~d~d-- 151 (481) T protein:vir:10 87 AKYVSRFIVGYLT----GNPI--TITHQ---DN----QTNDKIIELNDLNDADEVNSDLALNLSIYGRAYEIVYRDFE-- 151 (481) T ss_pred HHHHHHHHHhhhc----cCCc--eEecC---Ch----hHHHHHHHHHHhcChhHHHHHHHHHHHhcCeEEEEEEeCCC-- Confidence 7777777776664 3332 23222 21 12235666677789999999999999999999988865211 Q ss_pred eeeeeeeEeeeeccceEEecchhhee--ecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCC Q lcl|NC_015158. 152 KDEESGATRDTYFGPRAVRIDPKDIV--FNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLG 229 (581) Q Consensus 152 ~~~~~~~~~~~~~~p~ie~V~p~df~--~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~ 229 (581) ..|++..++|.++| +|+... ...-+.+|.+.+..+ + T Consensus 152 ------------g~~~i~~~~p~~~~~v~d~~~~--~~~~~~i~~~~~~~~---------~------------------- 189 (481) T protein:vir:10 152 ------------DRDTFKVLDPKSTFVVYDQTLD--KKVVAGVRYFEKQDK---------D------------------- 189 (481) T ss_pred ------------CeEEEEEEcccceEEEEcCCCC--CceEEEEEEEEEeeC---------C------------------- Confidence 13678889999975 343321 112222233222100 0 Q ss_pred cccchhhhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEec Q lcl|NC_015158. 230 TYTREDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCG 309 (581) Q Consensus 230 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~ 309 (581) ...+...++|-. +.+ +.. ...++.+...+.-|.+.|..|++.+. T Consensus 190 --------------------------~~~~~~~~~y~~------~~i---~~~-~~~~~~~~~~~~~~~~~g~vPvv~~~ 233 (481) T protein:vir:10 190 --------------------------KVPVQHVEVYTT------DKI---YYI-EIKGGTYHRVEEVEHYYNDVPIIEYL 233 (481) T ss_pred --------------------------CceEEEEEEEec------CeE---EEE-EecCCceeecccccccCCceeEEEee Confidence 001111222210 000 000 01112121222334446778876433 Q ss_pred ccccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc----ccc-cccCCceeEEeC---------CCC Q lcl|NC_015158. 310 WRIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD----VEE-FVWGPMEQIYIN---------GDG 375 (581) Q Consensus 310 ~~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d----~~~-i~~~pG~vi~~~---------~~~ 375 (581) ..-+|.|..+.++++++.++.+...+.+++....+|++.+.+. .+. ...+.++.+... .++ T Consensus 234 -----n~~~g~~~~~~v~~lida~~~~~s~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 308 (481) T protein:vir:10 234 -----NDQFKQGDFENVIALIDLYDSAQSDTANYMTDLNDAMLAIIGNVDLDSEDAKAFRDANMIHLEPGTNANGSEGKA 308 (481) T ss_pred -----cCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCcCCCccchhhhhhccceeccccccccCCCCCc Confidence 3457999999999999999999999999999999999877642 111 122334443332 234 Q ss_pred CcccccCCCccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015158. 376 DVEMMAPNTQALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAML 455 (581) Q Consensus 376 ~i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~ 455 (581) ++.++..+.....+...+..+...+-..|++|..+.|.. .++.+|.++...+.+.........+.|.. ++++++++++ T Consensus 309 ~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~-~l~~~~~li~ 386 (481) T protein:vir:10 309 EVKYVYKQYDVAGVEAYKKRLQNDIHKYTNTPDLNDEQF-SGVQSGESMKYKLFGLEQVRAIKERLFKK-GLMKRYKLLL 386 (481) T ss_pred ceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccc-ccccHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHH Confidence 566666665555666778888889999999998877742 34556666666666777777777788885 5677888887 Q ss_pred HHHHhhcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHHH Q lcl|NC_015158. 456 EISRRNLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLAK 535 (581) Q Consensus 456 ~f~~~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~~ 535 (581) .++....- . .....++...| .........+.++.++.| . .+ ++.+.+++ T Consensus 387 ~~~~~~~~---------~--------~~~~~~i~v~f--~~~~~~~~~~~a~~~~kl---~------g~---is~et~~~ 435 (481) T protein:vir:10 387 NNVNLTGL---------K--------QHNYAELTITF--TPNLPKSMMESINAFNAL---S------GG---VSESTRLS 435 (481) T ss_pred HHHhccCC---------C--------ccccceeeEEe--CCCCCcCHHHHHHHHHHH---h------cc---CChHHHHH Confidence 76432110 0 00111222222 122233344444433333 1 12 33322322 Q ss_pred HHHHHhcCCCcccccCCCCcHHHHHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 536 MLEHNLSLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 536 ~~~e~~~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) . ++. .+ + .++|.+++..+-.+. +...+...+ T Consensus 436 ----~--l~~----i~--d--~~~E~~ri~~E~~~~-~~~~~~~~~ 466 (481) T protein:vir:10 436 ----L--LDF----ID--N--PKEELEKMQEEEAQR-EKQADKRGY 466 (481) T ss_pred ----h--CCC----CC--C--HHHHHHHHHHHHHHH-HhhhhhccC Confidence 2 221 11 2 234455544432222 222333333 No 60 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=99.70 E-value=2.9e-15 Score=100.30 Aligned_cols=423 Identities=13% Similarity=0.053 Sum_probs=211.6 Q ss_pred ccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhcccc-ccccccccc--ccccccccchHHHHHHHHHHHHHhhc Q lcl|NC_015158. 12 LDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDT-TTTTNSTLP--WKNKTTLPKLCQIRDNLHSNYISALF 88 (581) Q Consensus 12 ~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~-~~~~~~~~~--~k~~~~~pki~~~~d~~~~~l~~~~f 88 (581) ++++-.++...|.+.|. . +++ .+.+++.|+.-.+. +..+.+.-+ ..++++++-+.-+++.+..++. T Consensus 1 ~~~~~~~~i~~l~~~~~---~-~~~---r~~~l~~Yy~G~~~i~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~---- 69 (441) T protein:vir:80 1 MNSDELALIEGMYDRIQ---R-LSS---WHCCIEGYYEGSNRVRDLGVAIPPELQRVQTVVSWPGIAVDALEERLD---- 69 (441) T ss_pred CCccHHHHHHHHHHHHH---H-HHH---HHHHHHHHHhcCCcchhcCcccchhhhhhhhhcchHHHHHHHHHhhhc---- Confidence 44333333444444443 2 222 35556666654432 111111111 1344555655566665555542 Q ss_pred CCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeeeeEeeeeccceE Q lcl|NC_015158. 89 PNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESGATRDTYFGPRA 168 (581) Q Consensus 89 ~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~~~~~~~~~p~i 168 (581) +. |....+.+ .++..+.+++|.....+.++++.+||.|+..+... ....|++ T Consensus 70 ~~-------g~~~~d~~-------~l~~i~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d--------------~~g~~~i 121 (441) T protein:vir:80 70 WL-------GWTNGDGY-------GLDGVYAANRLATASCDVHLDALIFGLSFVAIIPH--------------GDGTVSV 121 (441) T ss_pred cc-------cccCCChH-------HHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEEeC--------------CCCceEE Confidence 21 22222221 24444567899999999999999999998887431 1123678 Q ss_pred Eecchhhee--ecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchhhhhcccccccc Q lcl|NC_015158. 169 VRIDPKDIV--FNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDCEKAVGFSMDG 246 (581) Q Consensus 169 e~V~p~df~--~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 246 (581) ..++|.+++ +||....+. .++.+..... . . T Consensus 122 ~~~~p~~~~~i~d~~~~~~~--~~~~~~~~~~----------~------------------------------------~ 153 (441) T protein:vir:80 122 RPQSPKNCTGKFSADGSRLD--AGLVVQQTCD----------P------------------------------------E 153 (441) T ss_pred EEEccceEEEEEeCCCCcee--EEEEEEEEec----------C------------------------------------c Confidence 899999974 576543222 1121111100 0 0 Q ss_pred ccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccccCCcccCCCcH-Hh Q lcl|NC_015158. 247 FGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIRQDNLYAMGPL-DN 325 (581) Q Consensus 247 ~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p~~~~G~s~~-~~ 325 (581) ......| .++ .++.|+.+ .++ .+...+.-|.++|+.|++.+...+.++.+||.|.. +. T Consensus 154 ~~~~~vy-~~~--~~~~~~~~----~~~--------------~~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~l~~~ 212 (441) T protein:vir:80 154 VVEAELL-LPD--VIVQVERR----GSR--------------EWVEVDRIPNVLGAVPLVPIVNRRRTSRIDGRSEITRS 212 (441) T ss_pred eEEEEEE-ecC--eEEEEEEc----CCc--------------ceeeccccccCCCceeEEEeeccccCCccCCcccchhh Confidence 0000001 111 11222211 111 12222333445688999988888999999999965 56 Q ss_pred hhhHHHHHHHHHHHHHHHHHHhcCCeEEEec-ccc-----ccccCCceeEEeCCCCC---cccccCCCccchhHHHHHHH Q lcl|NC_015158. 326 LVGMQYRIDHLENLKADVFDLIAFPPMKVKG-DVE-----EFVWGPMEQIYINGDGD---VEMMAPNTQALQADMQIQIL 396 (581) Q Consensus 326 l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~-d~~-----~i~~~pG~vi~~~~~~~---i~~~~~p~~~~~~~~~lq~~ 396 (581) ++++++.+|.+...+.+++..+++|+..+.+ +.+ .....+|++|.+..+.+ +...+.+. .++.+.+..+ T Consensus 213 v~~liDa~~~~~s~~~~~~~~~~~~~~~i~G~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~--~~~~~~~~~l 290 (441) T protein:vir:80 213 IRAYTDEAVRTLLGQSVNRDFYAYPQRWVTGVSADEFSQPGWVLSMASVWAVDKDDDGDTPNVGSFPV--NSPTPYSDQM 290 (441) T ss_pred HHHHHHHHHHHHHHHHHHHHhhcCceeeeecCCccccccchhhhcccccccCCCCCCCCcceeEecCc--cchHHHHHHH Confidence 8999999999999999999999999887754 221 12345788887664432 23222222 2344455666 Q ss_pred HHHHHH---hcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceeeecCc Q lcl|NC_015158. 397 EAKMEE---FAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADTIRVFDS 473 (581) Q Consensus 397 ~~~~ee---~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~iR~~~~ 473 (581) +..+.. .|++|....|..+....+|.++...+...........+.|.. .+++++++++.+.....+.+. . T Consensus 291 ~~~i~~~~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~f~~-~l~~~~~l~~~~~~~~~~~~~------~ 363 (441) T protein:vir:80 291 RLLAQLTAGEAAVPERYFGFITSNPPSGEALAAEESRLVKRAERRQTSFGQ-GWLSVGFLAAKALDSRVDEAD------F 363 (441) T ss_pred HHHHHHHhcccCCCHHHhccCCCcchHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhcCCCcccc------c Confidence 655555 478888888865544456666777777777777888888886 567788877666322111000 0 Q ss_pred hhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHHHHHHHHhcCCCcccccCCC Q lcl|NC_015158. 474 DDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLAKMLEHNLSLGGWDIFKPNV 553 (581) Q Consensus 474 ~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~~~e~~~l~~~~~~~~~~ 553 (581) -.+++..| .........+.+ +.+.++.+. .+ +..+...+ .+.++...- +.. T Consensus 364 -----------~~~i~~~f--~~~~~~~~~e~a---d~~~kl~~~----g~-~~~s~~~~----~~~l~~~~~----e~~ 414 (441) T protein:vir:80 364 -----------FGDVGLRW--RDASTPTRAATA---DAVTKLVGA----GI-LPADSRTV----LEMLGLDDV----QVE 414 (441) T ss_pred -----------ceeeeEEe--CCCCCcCHHHHH---HHHHHHHhc----Cc-ccccHHHH----HHhCCCCHH----HHH Confidence 01111111 111122333433 333333321 12 22232222 233222110 000 Q ss_pred CcHHHHHHHHHHHHHHHHHHHHhcccC Q lcl|NC_015158. 554 AVMEAQTTSALVNQSQAQIEEEAQVPL 580 (581) Q Consensus 554 ~~~~~~~~q~~~q~aq~~~~~~~~~~~ 580 (581) ...++++++.-++.+.....+.+.-.| T Consensus 415 ~~~~e~~e~~~~~~~~~~~~~~~~~~~ 441 (441) T protein:vir:80 415 AVMRHRAESSDPLAVLAGAISRQTNEV 441 (441) T ss_pred HHHHHHHHHHHHHHHHhhhhhcccccC Confidence 000000000000111122222222223 No 61 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=99.70 E-value=5e-15 Score=99.00 Aligned_cols=437 Identities=12% Similarity=0.084 Sum_probs=215.9 Q ss_pred CccchhhhhhhccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhcccc-cccccccccc--cccccccchHHHHH Q lcl|NC_015158. 1 MTGKVLELQQMLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDT-TTTTNSTLPW--KNKTTLPKLCQIRD 77 (581) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~-~~~~~~~~~~--k~~~~~pki~~~~d 77 (581) |||-+-- |.+. +..+..|..+...|...+ ..+.++++|+.--+. ...+...-+. ..+..++-+.-+++ T Consensus 1 ~~~~i~~---~~~~--~~~~~~~~~l~~~~~~~~----~r~~~~~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd 71 (485) T protein:vir:10 1 MTAPLPG---QEEI--EDPAIARDEMVSAFEDST----QNLKTNTSYYEAERRPEAIGVTVPIQMQSLLAHVGYPRLYVD 71 (485) T ss_pred CCCCCCC---CCCC--CCHHHHHHHHHHHHHHHH----HHHHHHHHHHhcCCcchhcCCCCChhhhhhhhhcCcHHHHHH Confidence 8876543 3331 222333444444443333 235556666655442 2222222111 22344566667777 Q ss_pred HHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeee Q lcl|NC_015158. 78 NLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESG 157 (581) Q Consensus 78 ~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~ 157 (581) .+..+|. ++. |.. .++++.. +.+.+.+.+++|.....++.+++.+||.|++.+...+.... T Consensus 72 ~~~~~l~----~~g--~~~----~~~~~~~----~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~e~~~~----- 132 (485) T protein:vir:10 72 SIAERQA----VEG--FRF----GDADEAD----EELWQWWQANNLDIEAPLGYTDAYVHGRSYITISRPDPQID----- 132 (485) T ss_pred HHHhhhc----ccc--eec----CCCchhH----HHHHHHHHhcCHhHHHHHHHHHHhhcCceEEEEeeCCcccc----- Confidence 7766652 221 111 1222222 23445567789999999999999999999998865432111 Q ss_pred eEeeeeccceEEecchhhe--eecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchh Q lcl|NC_015158. 158 ATRDTYFGPRAVRIDPKDI--VFNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTRED 235 (581) Q Consensus 158 ~~~~~~~~p~ie~V~p~df--~~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 235 (581) .......|.|..+||.++ ++||....+. .++ +..... + T Consensus 133 -~~~~~~~~~i~~~~p~~~~~~~D~~~~~~~--~~~-~~~~~~-----------~------------------------- 172 (485) T protein:vir:10 133 -LGWDPNTPIIRVEPPTRMYAEIDPRIGRVS--KAI-RVAYDA-----------E------------------------- 172 (485) T ss_pred -cccCCCeeEEEEEccceeEEEEcCCCCcee--EEE-EEEEee-----------C------------------------- Confidence 111223467888999996 4666432211 111 111110 0 Q ss_pred hhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccccCC Q lcl|NC_015158. 236 CEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIRQD 315 (581) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p~ 315 (581) .+......+|..+.+ ++|. ..++++ .-....|.+.|..|++.+...+.++ T Consensus 173 ---------~~~~~~~~~y~~~~~--~~~~-----~~~~~~--------------~~~~~~~~~~g~vPvv~~~n~~~~~ 222 (485) T protein:vir:10 173 ---------GNEIQAATLYTPNDI--FGWY-----RVENEW--------------QEWFNNPHGLGVVPVVPIPNRTRLS 222 (485) T ss_pred ---------CCeEEEEEEEeCCeE--EEEE-----EcCCce--------------EEeccccCCCCcccEEEeccccccC Confidence 000000011111111 1111 111221 1111234456889999999999999 Q ss_pred cccCCCcHH-hhhhHHHHHHHHHHHHHHHHHHhcCCeEEEec-cccc-----------cccCCceeEEeCCCCCcccccC Q lcl|NC_015158. 316 NLYAMGPLD-NLVGMQYRIDHLENLKADVFDLIAFPPMKVKG-DVEE-----------FVWGPMEQIYINGDGDVEMMAP 382 (581) Q Consensus 316 ~~~G~s~~~-~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~-d~~~-----------i~~~pG~vi~~~~~~~i~~~~~ 382 (581) ..||+|... .++++|+.+|.....+.+.....++|+..+.+ +.++ +...+|.+|.... +++...+. T Consensus 223 ~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~i~~~~~-~d~k~~q~ 301 (485) T protein:vir:10 223 DLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPEEIGVDPETGQTLFDAYLARILAFED-AEGKIQQF 301 (485) T ss_pred CCCCccchhHHHHHHHHHHHHHHHHHHHHHHhhcchHHHHhcCCcccccccccccchhhhhcccceeccCC-CCceEEee Confidence 999999876 58899999999999999999999999865533 1111 1224677776543 33343333 Q ss_pred CCccchhHHHHHHHHHHHHHh---cCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015158. 383 NTQALQADMQIQILEAKMEEF---AGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISR 459 (581) Q Consensus 383 p~~~~~~~~~lq~~~~~~ee~---TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~ 459 (581) +. .++.+.++.++..+..+ |++|....|..+..+-+|.++...+...........+.|.. .+++++++++.+.. T Consensus 302 ~~--~~~~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~k~~~k~~~f~~-~l~~~~~l~~~~~~ 378 (485) T protein:vir:10 302 SA--AELANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNSIFGG-AWEEAMRLAYRMMK 378 (485) T ss_pred cc--cchHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhC Confidence 32 23445567777777666 66777777754433345666777777777777778888884 56788888766531 Q ss_pred hhcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHHHHHHH Q lcl|NC_015158. 460 RNLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLAKMLEH 539 (581) Q Consensus 460 ~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~~~e 539 (581) . ... ..+..+..+.-.-.-...+.+.++ .+..+.+. +..+ ++...+. + T Consensus 379 ~-~~~-------------------~~~~~~i~v~w~~~~~~~~~~~ad---a~~kl~~a--g~~~---~s~et~~----~ 426 (485) T protein:vir:10 379 G-GDV-------------------PPDMLRMETVWRDPSTPTYAAKAD---AASKLYNG--GTGV---IPRERAR----K 426 (485) T ss_pred C-CCC-------------------cccceeeeEEecCCCCCCHHHHHH---HHHHHHhc--cccC---CCHHHHH----H Confidence 1 000 001111111111111223344333 33333221 1123 3332232 2 Q ss_pred HhcCCCcccccCCCCcHHHHHHHHHHH--HHHHHHHHHhc---ccCC Q lcl|NC_015158. 540 NLSLGGWDIFKPNVAVMEAQTTSALVN--QSQAQIEEEAQ---VPLV 581 (581) Q Consensus 540 ~~~l~~~~~~~~~~~~~~~~~~q~~~q--~aq~~~~~~~~---~~~~ 581 (581) .+ ++ .+ +..++ .+++.+ .++......++ .+=+ T Consensus 427 ~l---g~---~~--~~~~~--~~~~~ee~~~~~~~~~~~~~~~~~~~ 463 (485) T protein:vir:10 427 DM---GY---SI--AEREE--MRRWDEEEAAMGLGLIGTMVDPNPTV 463 (485) T ss_pred hC---CC---CH--hHHHH--HHHHHHHHHHHHHHHHHHhhccCCCC Confidence 22 32 12 11122 222211 11111111111 1111 No 62 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=99.70 E-value=5.4e-15 Score=98.80 Aligned_cols=448 Identities=12% Similarity=0.053 Sum_probs=233.3 Q ss_pred Cc-------cchhhhhhhccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhccc-cccc---------ccccccc Q lcl|NC_015158. 1 MT-------GKVLELQQMLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATD-TTTT---------TNSTLPW 63 (581) Q Consensus 1 ~~-------~~~~~~~~~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~-~~~~---------~~~~~~~ 63 (581) |+ +.+-.+.....+..+.++.....+..++-+.. ....+.++.+|+.-.+ ..+. ....... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~--~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~ 78 (503) T protein:vir:59 1 MADIYPLGKTHTEELNEIIVESAKEIAEPDTTMIQKLIDEH--NPEPLLKGVRYYMCENDIEKKRRTYYDAAGQQLVDDT 78 (503) T ss_pred CcccccCChhhHHhHHHhhhhhhhhccchhHHHHHHHHHhh--cHHHHHHHHHHhccccchhhccchhcccccccccccc Confidence 21 22223333332222223322222222221111 1245667777765433 1100 0000011 Q ss_pred --cccccccchHHHHHHHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceE Q lcl|NC_015158. 64 --KNKTTLPKLCQIRDNLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCF 141 (581) Q Consensus 64 --k~~~~~pki~~~~d~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i 141 (581) .+++.+|-...+++...++++ +.+- ++.+ ++++..+. ++..+ ..+|.....++.+++.+||.|+ T Consensus 79 ~~~~ri~~n~~~~ivd~~~~yl~----g~~~--~~~~---~d~~~~~~----l~~~~-~n~~~~~~~~~~~~~~~~G~~~ 144 (503) T protein:vir:59 79 KTNNRTSHAWHKLFVDQKTQYLV----GEPV--TFTS---DNKTLLEY----VNELA-DDDFDDILNETVKNMSNKGIEY 144 (503) T ss_pred cccceeecchHHHHHHHHHhhhh----cCCe--eecc---CcHHHHHH----HHHHH-hcCHHHHHHHHHHHHhhCCeEE Confidence 134556766677777777765 3332 3322 33333333 33333 3688889999999999999999 Q ss_pred EEEeeecceeeeeeeeeEeeeeccceEEecchhhee--ecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHH Q lcl|NC_015158. 142 ATVEYVKETTKDEESGATRDTYFGPRAVRIDPKDIV--FNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIA 219 (581) Q Consensus 142 ~k~~~~~~~~~~~~~~~~~~~~~~p~ie~V~p~df~--~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~ 219 (581) +.+++... .++++..|+|.++| +|+.. ..+..+++|.+.+... .+ .. T Consensus 145 ~~v~~d~d--------------g~~~i~~~~p~~~~~i~d~~~--~~~~~~~ir~~~~~~~-------~~--~~------ 193 (503) T protein:vir:59 145 WHPFVDEE--------------GEFDYVIFPAEEMIVVYKDNT--RRDILFALRYYSYKGI-------MG--EE------ 193 (503) T ss_pred EEEeecCC--------------CceEEEEEccceeEEEEeCCC--CCceEEEEEEEEEecC-------CC--ce------ Confidence 99876321 24678899999975 44432 1233334444443200 00 00 Q ss_pred HHHhhhccCCcccchhhhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCc Q lcl|NC_015158. 220 RRREFRRGLGTYTREDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSW 299 (581) Q Consensus 220 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~ 299 (581) ....++|++..+..+++-+.... .+....+.... ..+....-|.+ T Consensus 194 ----------------------------~~~~evy~~~~i~~~~~~~~~~~-~~~~~~~~~~~------~~~~~~~~~~~ 238 (503) T protein:vir:59 194 ----------------------------TQKAELYTDTHVYYYEKIDGVYQ-MDYSYGENNPR------PHMTKGGQAIG 238 (503) T ss_pred ----------------------------EEEEEEEeCCcEEEEEEcCCccc-ccccccccccc------cceeecceecc Confidence 00011233333322221111000 00000000000 01111223455 Q ss_pred cCCCCeeEecccccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc-c---cc--cccCCceeEEeCC Q lcl|NC_015158. 300 FAQAPIFHCGWRIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD-V---EE--FVWGPMEQIYING 373 (581) Q Consensus 300 ~g~~Pf~~~~~~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d-~---~~--i~~~pG~vi~~~~ 373 (581) .++.|++.+. .+-+|.|..+.++++++.+|.+...+.+++..+++|++.+.+. . .+ .....++++.+.+ T Consensus 239 ~~~vPiv~~~-----nn~~~~sd~~~~~~liDa~d~~~s~~~~~~~~~~~~~~v~~g~~~~~~~~~~~~~~~~~~~~~~~ 313 (503) T protein:vir:59 239 WGRVPIIPFK-----NNEEMVSDLKFYKDLIDNYDSITSSTMDSFSDFQQIVYVLKNYDGENPKEFTANLRYHSVIKVSG 313 (503) T ss_pred CCccceEEec-----CCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhcCCeeEeecCCccccchhhhhhhcccceeccC Confidence 6778877553 3557999999999999999999999999999999999877642 1 12 1234567888888 Q ss_pred CCCcccccCCCccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015158. 374 DGDVEMMAPNTQALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNA 453 (581) Q Consensus 374 ~~~i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~ 453 (581) ++++.++........+...++.+...+-+.+++|..+.+. .+++.|+.++...+...........+.|. .++++++++ T Consensus 314 ~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~-~~l~~~~~~ 391 (503) T protein:vir:59 314 DGGVDTLRAEIPVDSAAKELERIQDELYKSAQAVDNSPET-IGGGATGPALENLYALLDLKANMAERKIR-AGLRLFFWF 391 (503) T ss_pred CCcceeEeccCCHHHHHHHHHHHHHHHHHHhcccCCCccc-ccccccHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHH Confidence 8999988877666667777899999999999998766543 23466777777777777777788888887 467888888 Q ss_pred HHHHHHhhcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHH Q lcl|NC_015158. 454 MLEISRRNLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENL 533 (581) Q Consensus 454 ~~~f~~~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l 533 (581) ++.++....... ..+ -.++...|. -.......+.+ +.+..+.+ +.+.|+ +.+ T Consensus 392 i~~~~~~~~~~~-----~~~-----------~~~i~i~f~--~~~p~d~~~~~---~~~~kl~~----~GiiS~---et~ 443 (503) T protein:vir:59 392 FAEYLRNTGKGD-----FNP-----------DKELTMTFT--RTRIQNDSEIV---QSLVQGVT----GGIMSK---ETA 443 (503) T ss_pred HHHHHHhccCcc-----ccc-----------ccceeEEeC--CCCCCCHHHHH---HHHHHHHh----CCCCch---HHH Confidence 877754321100 000 012222221 11122333333 33333322 123333 222 Q ss_pred HHHHHHHhcCCCcccccCCCCcHHHHHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 534 AKMLEHNLSLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 534 ~~~~~e~~~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) + +. ++. .+ ++ ++|.+++.+ +.+.+.+++.++. T Consensus 444 l----~~--l~~----v~--d~--~~E~~ri~~--E~~~~~~~~~~~~ 475 (503) T protein:vir:59 444 V----AR--NPF----VQ--DP--EEELARIEE--EMNQYAEMQGNLL 475 (503) T ss_pred H----Hh--CCC----CC--CH--HHHHHHHHH--HHHHHHhhhcccc Confidence 2 22 222 12 22 344555443 2233344555555 No 63 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=99.70 E-value=5.3e-15 Score=98.83 Aligned_cols=432 Identities=12% Similarity=0.048 Sum_probs=234.5 Q ss_pred Cccchh-hhhhhcc--chhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhccccc-----cc--cccccccc--cccc Q lcl|NC_015158. 1 MTGKVL-ELQQMLD--DTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTT-----TT--TNSTLPWK--NKTT 68 (581) Q Consensus 1 ~~~~~~-~~~~~~~--~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~-----~~--~~~~~~~k--~~~~ 68 (581) .+.+++ +++.|+. ...+.....|.++.+.+.. | ...+.++++|+...+.- .. +....+++ +++. T Consensus 24 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~-~---~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri~ 99 (492) T protein:vir:97 24 SQPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHLE-K---LPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMI 99 (492) T ss_pred cchhhhhHhhhcccCCCchhhHHHHHHHHHHHHHH-H---HHHHHHHHHHhcccCccccccccccccccccccccccccc Confidence 443333 3455553 2234444566666665543 2 34556666776655421 00 01111222 3566 Q ss_pred ccchHHHHHHHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeec Q lcl|NC_015158. 69 LPKLCQIRDNLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVK 148 (581) Q Consensus 69 ~pki~~~~d~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~ 148 (581) +|-..-+++...++|+ .++- ++.+ ++++..+. +++.+. .++.....++.+++.+||.|+..+...+ T Consensus 100 ~n~~k~Ivd~~~~yl~----g~p~--~~~~---~d~~~~~~----l~~~~~-n~~~~~~~~~~~~~~~~G~a~~~v~~d~ 165 (492) T protein:vir:97 100 TNFHANLVDQKVSYIV----GKPI--AFKH---TDDEVVKR----IDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDE 165 (492) T ss_pred cchHHHHHHHHhhhhc----ccCc--eecc---CchHHHHH----HHHHHh-ccHHHHHHHHHHHHhhcCeEEEEEEecC Confidence 7777788888887765 3332 3322 33333343 444443 5788888999999999999988775421 Q ss_pred ceeeeeeeeeEeeeeccceEEecchhheee--cCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhc Q lcl|NC_015158. 149 ETTKDEESGATRDTYFGPRAVRIDPKDIVF--NPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRR 226 (581) Q Consensus 149 ~~~~~~~~~~~~~~~~~p~ie~V~p~df~~--DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~ 226 (581) . ..+++..++|.++|+ |++.. .+--+++|.+.+..+ T Consensus 166 d--------------g~~~~~~~~p~~~~~i~d~~~~--~~~~~~vr~~~~~~~-------------------------- 203 (492) T protein:vir:97 166 E--------------GEFKLFRVPAEQGIPIWTDKEH--EELEAFIRMYKLENE-------------------------- 203 (492) T ss_pred C--------------CceEEEEEcccceEEEEcCCCC--CceEEEEEEEeeccc-------------------------- Confidence 1 236788899999754 43321 222223333222100 Q ss_pred cCCcccchhhhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCee Q lcl|NC_015158. 227 GLGTYTREDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIF 306 (581) Q Consensus 227 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~ 306 (581) ...+++..+++..+.+.+ +.....+.. ..+...+....| +.|..|++ T Consensus 204 ----------------------~~~~~y~~~~v~~~~~~~-------~~~~~~~~~--~~~~~~~~~~~~--~~g~vPvv 250 (492) T protein:vir:97 204 ----------------------TKVEYWDKVTVNYYVYEN-------GSLIPDYSN--NLENSKTHFSTG--SWGKIPFI 250 (492) T ss_pred ----------------------eeEEEEecCeEEEEEEec-------Ceeeecccc--cccccccccccC--CCCCcceE Confidence 001123333333333322 111000000 001112222233 45777877 Q ss_pred EecccccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc-ccc---c--ccCCceeEEeCCCCCcccc Q lcl|NC_015158. 307 HCGWRIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD-VEE---F--VWGPMEQIYINGDGDVEMM 380 (581) Q Consensus 307 ~~~~~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d-~~~---i--~~~pG~vi~~~~~~~i~~~ 380 (581) .+. .+-+|.|..+.+.++++.+|.+...+.+.+...++|.+.+.+. .++ . ..+..+++.+.+++++.++ T Consensus 251 ~~~-----nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l 325 (492) T protein:vir:97 251 PFK-----NNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLKNYDDQELPEFKRLLRYYGAIKVSDNGGVDTI 325 (492) T ss_pred Eec-----CCCCCCCchHhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccchhHHHHHhhccceecCCCCcceeE Confidence 443 3457999999999999999999999999999999999877652 221 1 1245678888899999988 Q ss_pred cCCCccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_015158. 381 APNTQALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRR 460 (581) Q Consensus 381 ~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~ 460 (581) ..+.....+...++.+...+-+.|++|..+.+.- +++.+|.++..++..+........+.|.. +++++++++.+++.. T Consensus 326 ~~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~-~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~-~l~~~~~li~~~~~~ 403 (492) T protein:vir:97 326 QVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKF-GSAPSGVALEFLYTNLNLKADKLARKAKV-AIQELLWFVFEHFDI 403 (492) T ss_pred eccCCHHHHHHHHHHHHHHHHHHhCCCCCCcccc-ccCcHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhcC Confidence 8776666677788999999999999998765432 24567777777888888888888999985 678889988776532 Q ss_pred hcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHHHHHHHH Q lcl|NC_015158. 461 NLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLAKMLEHN 540 (581) Q Consensus 461 n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~~~e~ 540 (581) ..+. .+|...| ...-.....+.++.++.| . .+ +|.+.+++++ T Consensus 404 ~~~~---------------------~~i~v~f--~~~~p~~~~e~a~~~~kl---~------G~---iS~et~l~~l--- 445 (492) T protein:vir:97 404 KGEH---------------------KDVDISF--NYNKVANTELQVQTAQQS---M------GI---VSHETVLENH--- 445 (492) T ss_pred Cccc---------------------ceeeEEe--cCCCCCCHHHHHHHHHHH---h------cc---CchHHHHHhC--- Confidence 2111 1122222 111122333434333333 1 12 3332222221 Q ss_pred hcCCCcccccCCCCcHHHHHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 541 LSLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 541 ~~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) +. .+ ++ ++|.+++.++-+...+..+...-. T Consensus 446 ---~~----v~--d~--~~Eleri~~E~~~~~~~~~~~~~~ 475 (492) T protein:vir:97 446 ---PF----VE--DL--QAELERIEQEQTEYNKQLPNLDDG 475 (492) T ss_pred ---CC----CC--CH--HHHHHHHHHHHHHHHHhhhccccC Confidence 21 12 12 234444433222111111111111 No 64 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=99.70 E-value=1.2e-14 Score=96.81 Aligned_cols=435 Identities=11% Similarity=0.090 Sum_probs=227.7 Q ss_pred CccchhhhhhhccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhccccc--ccccccccc--cccccccchHHHH Q lcl|NC_015158. 1 MTGKVLELQQMLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTT--TTTNSTLPW--KNKTTLPKLCQIR 76 (581) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~--~~~~~~~~~--k~~~~~pki~~~~ 76 (581) |++.. .....+ .+ .|.+.-+.....|. ..++++++|+...+.. .....+-.+ .+++.+|-..-++ T Consensus 31 ~~~~e--~~~~~~--~~----~i~~~i~~~~~~~~---~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Iv 99 (511) T protein:vir:96 31 YDGTE--SDLLQN--VN----EVSKYIEHHMDYQR---PRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYIS 99 (511) T ss_pred cchhh--hhhhcc--HH----HHHHHHHHHHHhhH---HHHHHHHHHhcccCccccccCcCcccccCcceeecchHHHHH Confidence 22211 111111 12 23333232223333 3566677777654421 111111111 2456677777888 Q ss_pred HHHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeee Q lcl|NC_015158. 77 DNLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEES 156 (581) Q Consensus 77 d~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~ 156 (581) +...++++ +++- ++.+. ++ ...+.|++.+..++|.....++.+++.+||.|+..+++.+. T Consensus 100 ~~~~~yl~----g~p~--~~~~~---~~----~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~a~~~vy~ded------- 159 (511) T protein:vir:96 100 DFINGYFL----GNPI--QYQDD---DK----DVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQD------- 159 (511) T ss_pred HHHHhhhc----cCCc--eeecC---ch----HHHHHHHHHHhhcCHHHHHHHHHHHHHhcCeeEEEEEeCCC------- Confidence 87777665 3322 23221 21 23456888888899999999999999999999888765311 Q ss_pred eeEeeeeccceEEecchhheee--cCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccch Q lcl|NC_015158. 157 GATRDTYFGPRAVRIDPKDIVF--NPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTRE 234 (581) Q Consensus 157 ~~~~~~~~~p~ie~V~p~df~~--DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~ 234 (581) .++++..++|.++|+ |.+.. ...-+.+|.+.+... +.. .. T Consensus 160 -------~~~~i~~~~p~~~~~vydd~~~--~~~~~~vr~~~~~~~---------d~~-------------------~~- 201 (511) T protein:vir:96 160 -------DETRLYKSDAMSTFVIYDNTIE--RNSIAGVRYLRTKPI---------DKT-------------------DE- 201 (511) T ss_pred -------CceEEEEEccceeEEEEcCCCC--CceEEEEEEEEeeec---------ccc-------------------cc- Confidence 136788899999764 54422 112223444333100 000 00 Q ss_pred hhhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccccC Q lcl|NC_015158. 235 DCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIRQ 314 (581) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p 314 (581) .. ....+.|++..+. .|.. .+++.. .. +.... ...|.+.|..|++.+. T Consensus 202 ----------~~-~~~~~iyt~~~i~--~~~~-----~~~~~~---~~----~~~~~--~~~~~~~~~vPvv~~~----- 249 (511) T protein:vir:96 202 ----------DE-VFTVDLFTSHGVY--RYLT-----SRTNGL---KL----TPREN--GFESHSFERMPITEFS----- 249 (511) T ss_pred ----------ce-EEEEEEEeCCcEE--EEEe-----cCCCcc---cc----ccccc--ccccccCCceeeEEec----- Confidence 00 0000111221111 1110 011100 00 01111 1223345677776543 Q ss_pred CcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc----cccc-ccCCceeEEe-------------CCCCC Q lcl|NC_015158. 315 DNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD----VEEF-VWGPMEQIYI-------------NGDGD 376 (581) Q Consensus 315 ~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d----~~~i-~~~pG~vi~~-------------~~~~~ 376 (581) ..-+|.|..+.++++++.+|.+...+.+.+...++|.+.+.+. ..++ ....+.++.. ..+++ T Consensus 250 nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 329 (511) T protein:vir:96 250 NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVD 329 (511) T ss_pred CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCccCCchhhcccccccceecccccccccccccCCCCcc Confidence 3457999999999999999999999999999999999877652 1111 1223444433 23445 Q ss_pred cccccCCCccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015158. 377 VEMMAPNTQALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLE 456 (581) Q Consensus 377 i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~ 456 (581) +.++..+.........+..+...+-..|++|..+.+.-+ ++.||.++..++..+........+.|.. +++++++++.. T Consensus 330 ~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~-~n~Sg~Al~~~~~~l~~k~~~k~~~~~~-~l~~~~~li~~ 407 (511) T protein:vir:96 330 GGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-GTQSGEAMKYKLFGLEQRTKTKEGLFTK-GLRRRAKLLET 407 (511) T ss_pred eeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccc-ccchHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHH Confidence 667776655566677788999999999999998766432 5668888888888888888889999995 57888888877 Q ss_pred HHHhhcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHHHH Q lcl|NC_015158. 457 ISRRNLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLAKM 536 (581) Q Consensus 457 f~~~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~ 536 (581) ++....... .+. ...+++..|. ........+.++.++.| . .+.|. +.++ T Consensus 408 ~~~~~~~~~-----~~~----------d~~~i~~~f~--~~~p~n~~e~~~~~~kl---~------G~iS~---et~l-- 456 (511) T protein:vir:96 408 ILKNTWSID-----ANK----------DFNTVRYVYN--RNLPKSLIEELKAYIDS---G------GKISQ---TTLM-- 456 (511) T ss_pred HHHhhcCcc-----ccc----------ccccceEEeC--CCCCCCHHHHHHHHHHH---h------ccCCh---HHHH-- Confidence 654321100 000 1123333331 22222334444333322 1 12233 2222 Q ss_pred HHHHhcCCCcccccCCCCcHHHHHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 537 LEHNLSLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 537 ~~e~~~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) +. ++. .+ + .++|.+++..+.+..++..+..+-. T Consensus 457 --~~--l~~----v~--D--~~~E~~ri~~E~~~~~~~~~~~~~~ 489 (511) T protein:vir:96 457 --SL--FSF----FQ--D--PELEVKKIEEDEKESIKKAQKGIYK 489 (511) T ss_pred --Hh--CCC----CC--C--HHHHHHHHHHHHHHHHHHHhhcccc Confidence 22 221 12 2 2345555555443333333322222 No 65 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=99.69 E-value=1.4e-14 Score=96.50 Aligned_cols=423 Identities=14% Similarity=0.110 Sum_probs=227.0 Q ss_pred chhhhhhhcc-chhhhH-HHHHHHHHhhHHhhhhhHHHHHHHHHHHhhcccccccccccccc--cccccccchHHHHHHH Q lcl|NC_015158. 4 KVLELQQMLD-DTRDGL-AEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTTTTTNSTLPW--KNKTTLPKLCQIRDNL 79 (581) Q Consensus 4 ~~~~~~~~~~-~~~~~~-a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~~~~~~~~~~--k~~~~~pki~~~~d~~ 79 (581) --++-.+++. ++.+.+ ...|.+..+.+. .|.+ ...++.+|+...+.-.....+.++ .+++.+|-..-+++.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~-~~~~---r~~~~~~yy~g~~~i~~~~~~~~~~~~~ki~~n~~~~ivd~~ 76 (453) T protein:vir:73 1 MNLKPIKLMTYSRDEEITDKVVNDFMKKHQ-EEVE---RYEYLGNMYKGIMEISSQKAKDSWKPDNRLTNNFAKYIVDTF 76 (453) T ss_pred CccccceeeeccccccCCHHHHHHHHHHHH-HHHH---HHHHHHHHhccccchhcCCCCCccCccceeecchHHHHHHHh Confidence 1111122222 212222 233444444332 3333 334445565544322222233333 3467778777788877 Q ss_pred HHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeeeeE Q lcl|NC_015158. 80 HSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESGAT 159 (581) Q Consensus 80 ~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~~~ 159 (581) ..+++ +++ +++.+. ++. .++.+++.+..++|.....++.+++.+||.|+..+++.+. T Consensus 77 ~~~l~----g~~--~~~~~~---d~~----~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~---------- 133 (453) T protein:vir:73 77 VGYFN----GIP--IKKTHD---DKS----VLEAMQLFDNLNDMEDEESELAKIACVYGRAYELMYQNES---------- 133 (453) T ss_pred hhhhc----ccC--ceeecC---ChH----HHHHHHHHHHhcChhHHHHHHHHHHHhcCeEEEEEEeCCC---------- Confidence 77764 433 233332 222 2335667777889999999999999999999988876321 Q ss_pred eeeeccceEEecchhhe--eecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchhhh Q lcl|NC_015158. 160 RDTYFGPRAVRIDPKDI--VFNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDCE 237 (581) Q Consensus 160 ~~~~~~p~ie~V~p~df--~~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~ 237 (581) ..|++..++|.++ ++|+..... --+.++..... T Consensus 134 ----~~~~i~~~~p~~~~~v~dd~~~~~--~~~~i~~~~~~--------------------------------------- 168 (453) T protein:vir:73 134 ----TESEVIYCSPLNVFMVYDDSIKQK--PLFAVYYGFDE--------------------------------------- 168 (453) T ss_pred ----CceEEEEEcccceEEEEeCCCCce--eEEEEEEEEec--------------------------------------- Confidence 2367788899886 334332211 11122222110 Q ss_pred hccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccccCCcc Q lcl|NC_015158. 238 KAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIRQDNL 317 (581) Q Consensus 238 ~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p~~~ 317 (581) ++.. ..+.|... ++++|.+. .+.+ .++....| +.|+.|++.+. ..- T Consensus 169 -------~~~~-~~~vyt~~--~i~~~~~~-----~~~~------------~~~~~~~~--~~g~vPvv~~~-----n~~ 214 (453) T protein:vir:73 169 -------EGNL-SGTVYTLL--ETISITGK-----AGEV------------KFGESTYN--VYSDLPIVEYN-----FNE 214 (453) T ss_pred -------CceE-EEEEEeCC--eEEEEEec-----CCce------------EEccceec--cCCceeEEEec-----CCC Confidence 0000 00111111 12222110 1110 11222234 35777876443 345 Q ss_pred cCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEec---cccccc-cCCceeEE-----------eCCCCCcccccC Q lcl|NC_015158. 318 YAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKG---DVEEFV-WGPMEQIY-----------INGDGDVEMMAP 382 (581) Q Consensus 318 ~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~---d~~~i~-~~pG~vi~-----------~~~~~~i~~~~~ 382 (581) +|.|..+.++++++.+|.+...+.+++...++|++.+.+ +.+... ..+++++. ...+++++++.. T Consensus 215 ~g~s~~~~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~ 294 (453) T protein:vir:73 215 ERQSIFEPVHSLINSYNKVTSEKANDVEYFSDQYLVFLGAEVDEEDAKNIKDNRLINFFDKNSNGQGTNAAKVDVKFLDK 294 (453) T ss_pred CCCcchhhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhhhcccccccccccccccccccccccCceeEEeee Confidence 799999999999999999999999999999999987654 111111 11222221 122345667766 Q ss_pred CCccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_015158. 383 NTQALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNL 462 (581) Q Consensus 383 p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~ 462 (581) +.....+.+.++.+...+-..|++|.++.+. .++.|+.++...............+.|.. +++++++++..+..... T Consensus 295 ~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~--~gn~Sg~Al~~~~~~l~~ka~~~~~~~~~-~l~~~~~li~~~~~~~~ 371 (453) T protein:vir:73 295 PDSDVQTENLLNRLERSIFQFTMAANISDEN--FGNSSGVALAYKLQAMSNLALSFQRKFQS-ALNRRYSLWSSLSTNAS 371 (453) T ss_pred cCCHHHHHHHHHHHHHHHHHHhCCcccCccc--ccCccHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhccC Confidence 6555566677888888899999999876653 35667777877788888888888888885 57888888877643211 Q ss_pred CccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHHHHHHHHhc Q lcl|NC_015158. 463 DVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLAKMLEHNLS 542 (581) Q Consensus 463 d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~~~e~~~ 542 (581) . + ....+++..| ...-.....+.++.++.+ . .+.|+ +. +.+. T Consensus 372 ~-~-----------------~~~~~i~v~f--~~~~p~~~~~~a~~~~k~---~------giis~---et----~~~~-- 413 (453) T protein:vir:73 372 N-K-----------------DAWKDIEYTF--TRNEPKDIKEQAETANIL---K------GITSE---ET----ALSV-- 413 (453) T ss_pred C-c-----------------cccccceEEe--CCCCCCCHHHHHHHHHHH---h------ccCcH---HH----HHHh-- Confidence 0 0 0112233333 122233344444433333 1 13333 22 2222 Q ss_pred CCCcccccCCCCcHHHHHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 543 LGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 543 l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) ++. .+ + .++|.++++++.++.++.++...+. T Consensus 414 ~~~----~~--d--~~~E~~ri~~E~~~~~~~~~~~~~~ 444 (453) T protein:vir:73 414 ISV----IP--D--VQAEMEKIKKKKLLQLSLTRTSNLV 444 (453) T ss_pred CCC----CC--C--HHHHHHHHHHHHHHHHHHHHhccCC Confidence 222 12 2 3455666666666666666665555 No 66 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=99.69 E-value=1.3e-14 Score=96.73 Aligned_cols=437 Identities=12% Similarity=0.104 Sum_probs=227.3 Q ss_pred Cccch----------------hhhhhhccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhccccc--cccccccc Q lcl|NC_015158. 1 MTGKV----------------LELQQMLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTT--TTTNSTLP 62 (581) Q Consensus 1 ~~~~~----------------~~~~~~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~--~~~~~~~~ 62 (581) .+|.+ .|...+.+ ...|.+..+.....+. ..+.++++|+...+.- .....+-. T Consensus 13 ~~~~~~~~~~~~~n~~~~~~~~~~~~~~~------~~~i~~~i~~~~~~~~---~r~~~l~~Yy~g~~~i~~~~~~~~~~ 83 (511) T protein:vir:10 13 LRGNINYLFNDEANVVYTYDGTESDLLQN------VNEVSKCIEHHMDYQR---PRLKVLSDYYEGKTKNLVELTRRKEE 83 (511) T ss_pred hhhhhhhhhhhhhcCCccCchhhhhcccC------HHHHHHHHHHHHHhhH---HHHHHHHHHhcccCccccccCccccc Confidence 11110 11111111 1233344433333333 3455566676554321 11111111 Q ss_pred c--cccccccchHHHHHHHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCce Q lcl|NC_015158. 63 W--KNKTTLPKLCQIRDNLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNC 140 (581) Q Consensus 63 ~--k~~~~~pki~~~~d~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~ 140 (581) + .+++.+|-..-+++...++|+ +++- ++.+. +++ ..+.+++.+..++|.....++.+++.+||.| T Consensus 84 ~~~~~ki~~n~~k~Iv~~~~~yl~----g~p~--~~~~~---d~~----~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~a 150 (511) T protein:vir:10 84 YMADNRVAHDYASYISDFINGYFL----GNPI--QYQDD---DKD----VLEAIEAFNDLNDVESHNRSLGLDLSIYGKA 150 (511) T ss_pred ccCcceeecchHHHHHHHHhhhhc----ccCc--eeecC---chH----HHHHHHHHHhhcCHHHHHHHHHHHHHhcCee Confidence 1 256667877888888777775 3322 23222 221 3356888888899999999999999999999 Q ss_pred EEEEeeecceeeeeeeeeEeeeeccceEEecchhhee--ecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHH Q lcl|NC_015158. 141 FATVEYVKETTKDEESGATRDTYFGPRAVRIDPKDIV--FNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAI 218 (581) Q Consensus 141 i~k~~~~~~~~~~~~~~~~~~~~~~p~ie~V~p~df~--~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~ 218 (581) +..+++.. ..++++..++|.++| +|++... ..-+.+|.+.+... + T Consensus 151 y~~vy~de--------------dg~~~i~~~~p~~~~~vydd~~~~--~~~~~vr~~~~~~~---------d-------- 197 (511) T protein:vir:10 151 YEIMIRNQ--------------DDETRLYKSDAMSTFVIYDNTIER--NSIAGVRYLRTKPI---------D-------- 197 (511) T ss_pred EEEEEeCC--------------CCceEEEEEccceeEEEEcCCCCC--ceEEEEEEEEeeec---------c-------- Confidence 88776521 123678889999975 3544321 11223344333100 0 Q ss_pred HHHHhhhccCCcccchhhhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCC Q lcl|NC_015158. 219 ARRREFRRGLGTYTREDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPS 298 (581) Q Consensus 219 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~ 298 (581) +. .. .. ....+.|+...+ +.|.. .+++.. .. +.... ...|. T Consensus 198 ----------~~-~~-----------~~-~~~~~iyt~~~i--~~~~~-----~~~~~~---~~----~~~~~--~~~~~ 238 (511) T protein:vir:10 198 ----------KT-DE-----------DE-VFTVDLFTSHGV--YRYLT-----SRTNGL---KL----TPREN--GFESH 238 (511) T ss_pred ----------cC-cc-----------ce-EEEEEEEeCCcE--EEEEe-----cCCCcc---cc----ccccc--ccccc Confidence 00 00 00 000011111111 11110 011100 00 00111 12233 Q ss_pred ccCCCCeeEecccccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc----cccc-ccCCceeEEeC- Q lcl|NC_015158. 299 WFAQAPIFHCGWRIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD----VEEF-VWGPMEQIYIN- 372 (581) Q Consensus 299 ~~g~~Pf~~~~~~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d----~~~i-~~~pG~vi~~~- 372 (581) +.|..|++.+. ..-+|.|..+.++++++.+|.+...+.+.+...++|.+.+.+. ..++ ....++++..+ T Consensus 239 ~~~~vPvv~f~-----nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~ 313 (511) T protein:vir:10 239 SFERMPITEFS-----NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEP 313 (511) T ss_pred cCcceeEEEec-----CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeeccccCCchhhccchhccceeccc Confidence 45677776543 3457999999999999999999999999999999999877652 2221 22334444432 Q ss_pred ------------CCCCcccccCCCccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015158. 373 ------------GDGDVEMMAPNTQALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIM 440 (581) Q Consensus 373 ------------~~~~i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r 440 (581) .++++.++..+.......+.+..+...+-..|++|..+.+.-+ ++.+|.++..+...+........+ T Consensus 314 ~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~-~n~Sg~Al~~~~~~l~~k~~~k~~ 392 (511) T protein:vir:10 314 TVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-GTQSGEAMKYKLFGLEQRTKTKEG 392 (511) T ss_pred ccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccc-ccchHHHHHHHHHHHHHHHHHHHH Confidence 3455667776655566677788899999999999998765432 466777788888888888888899 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccc Q lcl|NC_015158. 441 NFEVMLMEKVLNAMLEISRRNLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPV 520 (581) Q Consensus 441 ~f~~~~~~~li~~~~~f~~~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~ 520 (581) .|.. +++++++++.+++....... .. ....+++..| .........+.++.+..| . T Consensus 393 ~f~~-~l~~~~~li~~~~~~~~~~~------~~---------~d~~~i~i~f--~~~~p~d~~~~~~~~~kl---~---- 447 (511) T protein:vir:10 393 LFTK-GLRRRAKLLETILKNTRSID------AN---------KDFNTVRYVY--NRNLPKSLIEELKAYIDS---G---- 447 (511) T ss_pred HHHH-HHHHHHHHHHHHHHhhCCcc------cc---------cccceeeEEe--CCCCCcCHHHHHHHHHHH---h---- Confidence 9985 57888888877754321100 00 0111233332 122233444444433333 1 Q ss_pred cccccchhHHHHHHHHHHHHhcCCCcccccCCCCcHHHHHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 521 WQDIKPHVSTENLAKMLEHNLSLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 521 ~~~i~p~~~~~~l~~~~~e~~~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) .+ +|.+.++ +. ++. .+ + .++|.+++..+-+..++..+..+-. T Consensus 448 --G~---iS~et~~----~~--l~~----v~--d--~~~E~~ri~~E~~~~~~~~~~~~~~ 489 (511) T protein:vir:10 448 --GK---ISQTTLM----SL--FSF----FQ--D--PELEVKKIEEDEKESIKKAQKGIYK 489 (511) T ss_pred --cc---CcHHHHH----Hh--CCC----CC--C--HHHHHHHHHHHHHHHHHHHhhhccc Confidence 12 2322222 22 221 12 2 2344555544333333322221111 No 67 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=99.69 E-value=8.3e-15 Score=97.79 Aligned_cols=429 Identities=12% Similarity=0.066 Sum_probs=232.1 Q ss_pred Cccch-hhhhhhcc--chhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhccccc-----cc--cccccccc--cccc Q lcl|NC_015158. 1 MTGKV-LELQQMLD--DTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTT-----TT--TNSTLPWK--NKTT 68 (581) Q Consensus 1 ~~~~~-~~~~~~~~--~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~-----~~--~~~~~~~k--~~~~ 68 (581) .+.+. -+++.|+- ...+.....|.++.+.+.. | ...+.++++|+..-+.- .. ...+-+++ +++. T Consensus 24 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~-~---~~r~~~l~~YY~g~~~I~~~~~~~~~~~~~~~~~~~~ri~ 99 (492) T protein:vir:94 24 SQPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHLE-K---LPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMI 99 (492) T ss_pred CccchhhhhhcccccCCchhhHHHHHHHHHHHHHH-H---HHHHHHHHHHhccccccccccccccccccccccccccccc Confidence 33333 34445542 3334555666666665543 3 23456666776554311 00 01111222 4567 Q ss_pred ccchHHHHHHHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeec Q lcl|NC_015158. 69 LPKLCQIRDNLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVK 148 (581) Q Consensus 69 ~pki~~~~d~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~ 148 (581) +|-..-+++....+++ .++- ++. .++++..+.++.+ +. .++.....++.+++.+||.|+..+.+.+ T Consensus 100 ~n~~k~Ivd~~~~yl~----G~p~--~~~---~~d~~~~~~l~~~----~~-n~~~~~~~~~~~~a~~~G~a~~~v~~d~ 165 (492) T protein:vir:94 100 TNFHANLVDQKVSYIV----GKPI--AFK---HTDDEVVKRIDEV----LG-NRFDDKLHSVLTGASNKGIEWLHPYLDE 165 (492) T ss_pred cchHHHHHHHHHhhhc----ccCc--eec---cCchHHHHHHHHH----Hh-ccHHHHHHHHHHHHhhCCeEEEEEEecC Confidence 7888888888887764 3332 222 2333444444443 32 4778888999999999999998886532 Q ss_pred ceeeeeeeeeEeeeeccceEEecchhhee--ecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhc Q lcl|NC_015158. 149 ETTKDEESGATRDTYFGPRAVRIDPKDIV--FNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRR 226 (581) Q Consensus 149 ~~~~~~~~~~~~~~~~~p~ie~V~p~df~--~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~ 226 (581) . .+|++..++|.++| +|++... +--+.+|.+.+... . T Consensus 166 d--------------g~~~~~~~~p~~~~~v~d~~~~~--~~~a~ir~~~~~~~-----------~-------------- 204 (492) T protein:vir:94 166 E--------------GEFKLFRVPAEQGIPIWTDKEHE--ELEAFIRMYKLENE-----------T-------------- 204 (492) T ss_pred C--------------CceEEEEEcccceEEEEcCCCCC--ceEEEEEEEeeccc-----------e-------------- Confidence 1 24678889999964 4543221 11222333222100 0 Q ss_pred cCCcccchhhhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCee Q lcl|NC_015158. 227 GLGTYTREDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIF 306 (581) Q Consensus 227 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~ 306 (581) ..+++....+..+.+.+ +..... +-.. .+...+....| +.|..|++ T Consensus 205 -----------------------~~~~y~~~~v~~~~~~~-------~~~~~~-~~~~-~~~~~~~~~~~--~~g~vPvv 250 (492) T protein:vir:94 205 -----------------------KVEYWDKVTVNYYVYEN-------GSLIPD-YSNN-LENSKTHFSTG--SWGKIPFI 250 (492) T ss_pred -----------------------eEEEEecCeEEEEEEec-------Ceeeec-cccc-ccccccccccc--CCCccceE Confidence 01112222222222211 111000 0000 01112222333 45777876 Q ss_pred EecccccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc-cc---cc--ccCCceeEEeCCCCCcccc Q lcl|NC_015158. 307 HCGWRIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD-VE---EF--VWGPMEQIYINGDGDVEMM 380 (581) Q Consensus 307 ~~~~~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d-~~---~i--~~~pG~vi~~~~~~~i~~~ 380 (581) .+. .+-+|.|..+.+.++++.+|.+...+.+.+...++|.+.+.+- .+ +. ..+.++++.+.+++++.++ T Consensus 251 ~~~-----nn~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l 325 (492) T protein:vir:94 251 PFK-----NNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLKNYDDQELPEFKRLLRYYGAIKVSDNGGVDTI 325 (492) T ss_pred Eec-----CCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhHHHHhhccceecCCCCcceeE Confidence 443 3457999999999999999999999999999999999877552 11 11 1234678888888999888 Q ss_pred cCCCccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_015158. 381 APNTQALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRR 460 (581) Q Consensus 381 ~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~ 460 (581) ..+.....+...++.+...+-+.|++|..+.+.- +++.+|.++...............+.|.. +++++++++.+++.. T Consensus 326 ~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~-~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~-~l~~~~~li~~~~~~ 403 (492) T protein:vir:94 326 QVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKF-GSAPSGVALEFLYTNLNLKADKLARKAKV-AIQELLWFVFEHFDI 403 (492) T ss_pred eccCCHHHHHHHHHHHHHHHHHHhCCcCCCcccc-ccCchHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhcC Confidence 8776666677778999999999999998776532 23556666766777777777888888885 678888888776532 Q ss_pred hcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHHHHHHHH Q lcl|NC_015158. 461 NLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLAKMLEHN 540 (581) Q Consensus 461 n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~~~e~ 540 (581) ..+. .++...| ...-.....+.++.++.| . .+.|. +.++ +. T Consensus 404 ~~~~---------------------~~i~v~f--~~~~p~~~~e~~~~~~kl---~------giiS~---et~~----~~ 444 (492) T protein:vir:94 404 KGEH---------------------KDVDISF--NYNKVANTELQVQTAQQS---M------GIVSH---ETVL----EN 444 (492) T ss_pred Cccc---------------------ceeeEEe--cCCCCCCHHHHHHHHHHH---h------ccCch---HHHH----Hh Confidence 2111 1122222 111112233333333332 1 12232 2222 22 Q ss_pred hcCCCcccccCCCCcHHHHHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 541 LSLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 541 ~~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) ++. .+ + .++|.+++.++-++..+ +++=+ T Consensus 445 --l~~----v~--d--~~~E~eri~~E~~~~~~---~~~~~ 472 (492) T protein:vir:94 445 --HPF----VE--D--LQAELERIEQEQMEYNK---QLPNL 472 (492) T ss_pred --CCC----CC--C--HHHHHHHHHHHHHHHHh---hcccc Confidence 121 12 2 23345554443222221 22222 No 68 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=99.69 E-value=2.2e-14 Score=95.42 Aligned_cols=439 Identities=11% Similarity=0.085 Sum_probs=227.4 Q ss_pred Cccch---hhhhhhccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhccccc--ccccccccc--cccccccchH Q lcl|NC_015158. 1 MTGKV---LELQQMLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTT--TTTNSTLPW--KNKTTLPKLC 73 (581) Q Consensus 1 ~~~~~---~~~~~~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~--~~~~~~~~~--k~~~~~pki~ 73 (581) .+..+ ..++.-+....+.+...|. .+...|.+ .++++++|+...+.- .....+..+ .+++.+|-.. T Consensus 24 ~~~~~~~~~~~e~~~~~~~~~i~~~i~----~~~~~~~~---r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~ki~~n~~k 96 (512) T protein:vir:97 24 EANVVYTYDGTESDLLQNINEVSKYIE----HHMDYQRP---RLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYAS 96 (512) T ss_pred ccccccccCchhhhhhhhHHHHHHHHH----HHHHhhHH---HHHHHHHHhcccCccccccCcccccccCcceeecchHH Confidence 11111 0111111111222333333 23333333 455666676554421 111111122 2556677777 Q ss_pred HHHHHHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeee Q lcl|NC_015158. 74 QIRDNLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKD 153 (581) Q Consensus 74 ~~~d~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~ 153 (581) -+++...++++. ++- ++.+ +++ ..++.|++.+.+++|.....++.+++.+||.|+..+++... T Consensus 97 ~Ivd~~~~yl~g----~p~--~~~~---~d~----~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~ay~~vy~ded---- 159 (512) T protein:vir:97 97 YISDFINGYFLG----NPI--QCQD---DDK----DVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQD---- 159 (512) T ss_pred HHHHHHhhhhcc----cCc--eecc---CCh----HHHHHHHHHHhhcCHHHHHHHHHHHHHhcCeEEEEEEeCCC---- Confidence 888887777753 222 2322 221 13456888888899999999999999999999888765311 Q ss_pred eeeeeEeeeeccceEEecchhhee--ecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcc Q lcl|NC_015158. 154 EESGATRDTYFGPRAVRIDPKDIV--FNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTY 231 (581) Q Consensus 154 ~~~~~~~~~~~~p~ie~V~p~df~--~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~ 231 (581) .++++..++|.++| +|+.... ..-+.+|.+.+... +.. T Consensus 160 ----------~~~~i~~~~p~~~~~iyd~~~~~--~~~~~vr~~~~~~~---------~~~------------------- 199 (512) T protein:vir:97 160 ----------DETRLYKSDAMSTFVIYDNTIER--NSIAGVRYLRTKPI---------DKT------------------- 199 (512) T ss_pred ----------CceEEEEEcccceEEEEcCCCCC--ceEEEEEEEEeeec---------ccc------------------- Confidence 24678889999974 4655421 22233444433100 000 Q ss_pred cchhhhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEeccc Q lcl|NC_015158. 232 TREDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWR 311 (581) Q Consensus 232 ~~~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~ 311 (581) ... .. ...+.|+...+ +.|.. .+++... . .... ...-|.++|..|++.+. T Consensus 200 ~~~-----------~~-~~~~vyt~~~i--~~~~~-----~~~~~~~---~----~~~~--~~~~~~~~g~vPvv~~~-- 249 (512) T protein:vir:97 200 DED-----------EV-FTVDLFTSHGV--YRYLT-----SRTNGLK---L----TPRE--NGFESHSFERMPITEFS-- 249 (512) T ss_pred ccc-----------eE-EEEEEEeCCcE--EEEEe-----cCCCccc---c----cccc--cccccccCcccceEeec-- Confidence 000 00 00011122111 11110 0111000 0 0011 11223456777876443 Q ss_pred ccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc----cccc-ccCCceeEE--------------eC Q lcl|NC_015158. 312 IRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD----VEEF-VWGPMEQIY--------------IN 372 (581) Q Consensus 312 ~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d----~~~i-~~~pG~vi~--------------~~ 372 (581) ..-+|.|..+.++++++.+|.+...+.+.+...++|++.+.+. ..++ ....+.++. .. T Consensus 250 ---nn~~~~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 326 (512) T protein:vir:97 250 ---NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYENRDTGIETE 326 (512) T ss_pred ---CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccCCchhhhhhhhcccccccccchhhcccccCCC Confidence 3457999999999999999999999999999999999877552 1211 111222222 23 Q ss_pred CCCCcccccCCCccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015158. 373 GDGDVEMMAPNTQALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLN 452 (581) Q Consensus 373 ~~~~i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~ 452 (581) .++++.++..+.......+.+..+...+-..|++|..+.|.-+ ++.||.++...+..+........+.|.. +++++++ T Consensus 327 ~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~-gn~Sg~Al~~~~~~l~~ka~~k~~~f~~-~l~~~~~ 404 (512) T protein:vir:97 327 GSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-GTQSGEAMKYKLFGLEQRTKTKEGLFTK-GLRRRAK 404 (512) T ss_pred CCcceEEEeecCCHHHHHHHHHHHHHHHHHHhCCcccCccccc-ccchHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHH Confidence 3455667776655556667788999999999999998776432 4667777888888888888888999985 5788889 Q ss_pred HHHHHHHhhcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHH Q lcl|NC_015158. 453 AMLEISRRNLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTEN 532 (581) Q Consensus 453 ~~~~f~~~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~ 532 (581) ++..++....... ......+++..|. ..-.....+.++.++.| . .+.|. +. T Consensus 405 li~~~~~~~~~~~---------------~~~d~~~i~~~f~--~~~p~~~~e~~~~~~kl---~------giiS~---et 455 (512) T protein:vir:97 405 LLETILKNTRSID---------------ANKDFNTVRYVYN--RNLPKSLIEELKAYIDS---G------GKISQ---TT 455 (512) T ss_pred HHHHHHHhcCCcc---------------cccccccceEEeC--CCCCcCHHHHHHHHHHH---h------ccCch---HH Confidence 8887754321100 0001113333331 22223344444433333 1 12233 22 Q ss_pred HHHHHHHHhcCCCcccccCCCCcHHHHHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 533 LAKMLEHNLSLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 533 l~~~~~e~~~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) ++ +. |+. .+ ++ ++|.+++.++.+...+..++.+-. T Consensus 456 ~~----~~--l~~----v~--d~--~~E~eri~~E~~~~~~~~~~~~~~ 490 (512) T protein:vir:97 456 LM----SL--FSF----FQ--DP--ELEVKKIEEDEKESIKKAQKGIYK 490 (512) T ss_pred HH----Hh--CCC----CC--CH--HHHHHHHHHHHHHHHHHHhhcccC Confidence 22 22 222 12 22 344555444433333322221111 No 69 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=99.69 E-value=2e-14 Score=95.70 Aligned_cols=424 Identities=13% Similarity=0.087 Sum_probs=225.1 Q ss_pred CccchhhhhhhccchhhhH-HHHHHHHHhhHHhhhhhHHHHHHHHHHHhhcccccccccccccc--cccccccchHHHHH Q lcl|NC_015158. 1 MTGKVLELQQMLDDTRDGL-AEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTTTTTNSTLPW--KNKTTLPKLCQIRD 77 (581) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~-a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~~~~~~~~~~--k~~~~~pki~~~~d 77 (581) |.-+-- +-+.......+ ...|.++.+.++. | ...+.++++|+...+.-..-..+.++ .+++.+|-..-+++ T Consensus 1 ~~~~~~--~~~~~~~~~~~~~~~i~~~i~~~~~-~---~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~ki~~n~~~~ivd 74 (452) T protein:vir:36 1 MKYKPP--KLMTFSKDEPITVEVVTKFMEKHKL-E---VARYEYLKNMYLGIMAIDDEPAKDSWKPDNRLAVNFTKYIVD 74 (452) T ss_pred CcccCc--eeEEcCCccCCCHHHHHHHHHHHHH-H---HHHHHHHHHHhccccccccCccccccCccceeecchHHHHHH Confidence 211000 00000101111 1334444443322 2 23445566666654422111112222 24566777777888 Q ss_pred HHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeee Q lcl|NC_015158. 78 NLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESG 157 (581) Q Consensus 78 ~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~ 157 (581) ...++|+ +++- ++.+. ++ ..++.+++.+..++|.....++.+++.+||.|+..+.+.. T Consensus 75 ~~~~~l~----g~~~--~~~~~---d~----~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~--------- 132 (452) T protein:vir:36 75 TFTGYFN----GIPV--KKSHS---DK----EILTKLQEFDNLNDMEDEESELAKMACIYGRAFEFLYQDE--------- 132 (452) T ss_pred HHhhhhc----ccCc--eeecC---Ch----hHHHHHHHHHhhcChhHHHHHHHHHHHhcCeEEEEEEecC--------- Confidence 8777775 4433 33332 21 1234577777888999999999999999999998886532 Q ss_pred eEeeeeccceEEecchhheee--cCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchh Q lcl|NC_015158. 158 ATRDTYFGPRAVRIDPKDIVF--NPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTRED 235 (581) Q Consensus 158 ~~~~~~~~p~ie~V~p~df~~--DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 235 (581) ...|++..++|.++++ |+.... ..-+++|.+... T Consensus 133 -----~g~~~i~~~~p~~~~~v~d~~~~~--~~~~~i~~~~~~------------------------------------- 168 (452) T protein:vir:36 133 -----DTQTNVVYNSPENMFMVYDDTVKQ--EPLFAVRYGVDE------------------------------------- 168 (452) T ss_pred -----CCeeEEEEEcccceEEEEcCCCCC--ceEEEEEEEEec------------------------------------- Confidence 1246788899998743 543211 111222222110 Q ss_pred hhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccccCC Q lcl|NC_015158. 236 CEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIRQD 315 (581) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p~ 315 (581) ++.. ..+.|+...+ +.|.+ .+++ +......|.+.|..|++.+. . T Consensus 169 ---------~~~~-~~~vyt~~~i--~~~~~-----~~~~--------------~~~~~~~~~~~g~iPvv~~~-----n 212 (452) T protein:vir:36 169 ---------DKKL-QGEVYTLLET--IKISG-----ENDE--------------ISFGEGTYNPYPDLPVVEFY-----F 212 (452) T ss_pred ---------CceE-EEEEEecCeE--EEEEE-----cCCc--------------eEEecceeccCCcccEEEec-----C Confidence 0000 0001111111 11111 0111 11112223346778877543 3 Q ss_pred cccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc---ccc-cccCCceeEEeCCCC-----CcccccCCCcc Q lcl|NC_015158. 316 NLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD---VEE-FVWGPMEQIYINGDG-----DVEMMAPNTQA 386 (581) Q Consensus 316 ~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d---~~~-i~~~pG~vi~~~~~~-----~i~~~~~p~~~ 386 (581) .-.|.|..+.+.++++.+|.+...+.+++...++|++.+.+. .+. ...++++++.+..++ ++.++..+... T Consensus 213 ~~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~p~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~ 292 (452) T protein:vir:36 213 NEERMSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFLGAAVEEEDLKNIRSNRVINYYADGEGKNVDVKFLEKPDSD 292 (452) T ss_pred CCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCcCchhhhhhhhcceEEecCCCCccCCcceeEeecCCH Confidence 446999999999999999999999999999999999877552 122 234467777776543 46666666555 Q ss_pred chhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccc Q lcl|NC_015158. 387 LQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVAD 466 (581) Q Consensus 387 ~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~ 466 (581) ....+.+..+...+-..|++|.++.+.. ++.|+.++..+............+.|.. ++++++++++.+..... T Consensus 293 ~~~~~~~~~l~~~I~~~s~~p~~~~~~~--gn~Sg~Al~~~~~~l~~k~~~~~~~~~~-~l~~~~~li~~~~~~~~---- 365 (452) T protein:vir:36 293 SQTENLLDRLTKLIFQTTMVANISDESF--GSSSGVSLAYKLQAMSNLALSFQRKFQS-SLNSRYKLFCELSTNVS---- 365 (452) T ss_pred HHHHHHHHHHHHHHHHHhCccccCcccc--cCCcHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhccC---- Confidence 5566778888899999999998766543 4667777777777777777888888885 67889998888754311 Q ss_pred eeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHHHHHHHHhcCCCc Q lcl|NC_015158. 467 TIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLAKMLEHNLSLGGW 546 (581) Q Consensus 467 ~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~~~e~~~l~~~ 546 (581) .. ....+|+..|. ........+.++.++.+ . .+ +|.+.+ .+. ++. T Consensus 366 ------~~--------~~~~~i~i~f~--~~~p~d~~~~a~~~~k~---~------g~---iS~et~----~~~--~~~- 410 (452) T protein:vir:36 366 ------NK--------DSWKDIEYTFT--RNEPKDIKEQAETANIL---M------GI---TSQETA----LSV--ISV- 410 (452) T ss_pred ------Cc--------cccccceEEeC--CCCCcCHHHHHHHHHHH---h------cc---CChHHH----HHh--CCC- Confidence 10 01123443331 22233444544433332 1 12 232222 232 221 Q ss_pred ccccCCCCcHHHHHHHHHHHHHHHHHHHHhcc--cCC Q lcl|NC_015158. 547 DIFKPNVAVMEAQTTSALVNQSQAQIEEEAQV--PLV 581 (581) Q Consensus 547 ~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~--~~~ 581 (581) .+ ++ ++|.+++.++-.+..+..+.. +.= T Consensus 411 ---~~--d~--~~E~~ri~~E~~~~~~~~~~~~~~~~ 440 (452) T protein:vir:36 411 ---IP--DV--QAEMEKIKKEEASTAIFDKDKQPSEK 440 (452) T ss_pred ---CC--CH--HHHHHHHHHHHHHHHHHHhhccCCCC Confidence 12 22 344444444322222222221 111 No 70 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=99.69 E-value=5.5e-15 Score=98.75 Aligned_cols=431 Identities=11% Similarity=0.061 Sum_probs=223.2 Q ss_pred Cc-------cchh-hhhhhccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhcccccc----cccc---cccc-- Q lcl|NC_015158. 1 MT-------GKVL-ELQQMLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTTT----TTNS---TLPW-- 63 (581) Q Consensus 1 ~~-------~~~~-~~~~~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~~----~~~~---~~~~-- 63 (581) |. .+.+ |+-+-++..-+.....|.++.+.+. .....+.++++|+..-+.-. ..+. .-+. T Consensus 1 ~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~i~~~i~~~~----~~~~~~~~~~~yY~g~~~i~~~~~~~~~~~~~~~~~~ 76 (478) T protein:vir:10 1 MISINWPWDKPYHEQVVEQIKPKYETQEEMILRLVREHK----ENIDNITMGERYYNHHPDILDAPPKRDVNGDYDETKP 76 (478) T ss_pred CccccCCCCchhHHHHHHHHhhccCCcHHHHHHHHHHHH----HHHHHHHHHHHHhcCCCchhccccccccccccccccc Confidence 11 1111 1111111111112234444444333 23345677777776543210 0000 0011 Q ss_pred cccccccchHHHHHHHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEE Q lcl|NC_015158. 64 KNKTTLPKLCQIRDNLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFAT 143 (581) Q Consensus 64 k~~~~~pki~~~~d~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k 143 (581) .+++.+|-...+++...++++ .++- ++.+ ++++..+.+++ .+. .++.....++.+++.+||.|++. T Consensus 77 ~~ki~~n~~~~ivd~~~~~l~----g~~~--~~~~---~~d~~~~~l~~----~~~-n~~~~~~~~~~~~~~~~G~~~~~ 142 (478) T protein:vir:10 77 DWRMYTNYHQNLVDQKVAYAV----ANPV--TFGV---DNDKALKQIQH----TLN-HKWDDKLVDILTAASNKGIEWVQ 142 (478) T ss_pred cceeccchHHHHHHHHHhhhc----cCCe--eeec---CChHHHHHHHH----HHh-cCHHHHHHHHHHHHHhcCeEEEE Confidence 134667777778888887766 3322 2322 23333333333 333 47888999999999999999988 Q ss_pred EeeecceeeeeeeeeEeeeeccceEEecchhheee--cCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHH Q lcl|NC_015158. 144 VEYVKETTKDEESGATRDTYFGPRAVRIDPKDIVF--NPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARR 221 (581) Q Consensus 144 ~~~~~~~~~~~~~~~~~~~~~~p~ie~V~p~df~~--DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~ 221 (581) +.+... ..+++..++|.++|+ |++.. .+--+++|.+.+... .+ T Consensus 143 ~~~d~~--------------g~~~~~~~~p~~~~~i~d~~~~--~~~~~~v~~~~~~~~-----------~~-------- 187 (478) T protein:vir:10 143 PYVDEE--------------GEFKTFRVPAEQAVPIWTNKER--DELQAFIRVYELDGA-----------ER-------- 187 (478) T ss_pred EEecCC--------------CeeEEEEEcccceEEEEcCCCC--CceEEEEEEEEecCc-----------eE-------- Confidence 865321 136788899999764 54432 222223333322100 00 Q ss_pred HhhhccCCcccchhhhhccccccccccccccccCCceEEEEEEeeeeecccCCceeee--eEEEEEeCCEEEEeecCCCc Q lcl|NC_015158. 222 REFRRGLGTYTREDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRN--MKVTIIDRMFVIEEKENPSW 299 (581) Q Consensus 222 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~--~~itv~~g~~iir~~~nP~~ 299 (581) .+.|....+..+++-+ +..... .......+-.. ...-|.+ T Consensus 188 -----------------------------~~~y~~~~i~~~~~~~-------~~~~~~~~~~~~~~~~~~~--~~~~~~~ 229 (478) T protein:vir:10 188 -----------------------------VEYWTKDDVTYYELKE-------GQLIPDFYRSDDHIQPHYY--QGNKLMS 229 (478) T ss_pred -----------------------------EEEEeCCeEEEEEEcC-------Ceeecccccccccccccee--ccccccc Confidence 0111112222111100 000000 00000000011 1122445 Q ss_pred cCCCCeeEecccccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc-c---cccc--cCCceeEEeC- Q lcl|NC_015158. 300 FAQAPIFHCGWRIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD-V---EEFV--WGPMEQIYIN- 372 (581) Q Consensus 300 ~g~~Pf~~~~~~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d-~---~~i~--~~pG~vi~~~- 372 (581) .|..|++.+. .+-+|.|..+.+.++++.+|.+...+.+++...++|++.+.+. . .+.. ...++++.+. T Consensus 230 ~~~vPvv~~~-----n~~~g~sd~~~v~~liDa~~~~~S~~~~~~~~~~~p~~~~~g~~~~~~~~~~~~~~~~~~~~~~~ 304 (478) T protein:vir:10 230 WGRVPFIPFK-----NNPQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILKGYEGEDMKDFMHNLKYYKAISVAG 304 (478) T ss_pred CCccceEEec-----cCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccchhhhhhhhcceEEecC Confidence 6788876553 3558999999999999999999999999999999999877652 1 2221 2346666665 Q ss_pred -CCCCcccccCCCccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015158. 373 -GDGDVEMMAPNTQALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVL 451 (581) Q Consensus 373 -~~~~i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li 451 (581) +++++.++..+....+....++.+...+-+.|++|..+.+.. .++.||.++...+.++........+.|.. ++++++ T Consensus 305 ~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~-~l~~~~ 382 (478) T protein:vir:10 305 ESGSGVDTIKVEVPIDSVKEYTKMLRDYIIEFGQGVDFQQDKF-GNSPSGIALKFMYSNLDLKANKLKNKTLT-ALQELL 382 (478) T ss_pred CCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCccccCcccc-ccccHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHH Confidence 557788887665566677778999999999999998776542 24667777878888888888888888885 578888 Q ss_pred HHHHHHHHhhcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHH Q lcl|NC_015158. 452 NAMLEISRRNLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTE 531 (581) Q Consensus 452 ~~~~~f~~~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~ 531 (581) ++++.+...+.+ ..+++..|. ..-.....+.++.++.| . .+ +|.+ T Consensus 383 ~li~~~~g~~~~---------------------~~~i~i~f~--~~~p~d~~e~a~~~~kl---~------g~---iS~e 427 (478) T protein:vir:10 383 QYIIDFYRLDVK---------------------VQDIEITFN--FNVMVNELENSQIAMNS---T------GL---LSKE 427 (478) T ss_pred HHHHHHhCCCcc---------------------cccceEEec--CCCCCCHHHHHHHHHHH---h------CC---CChH Confidence 888776422111 123333331 11122344444433333 1 12 3333 Q ss_pred HHHHHHHHHhcCCCcccccCCCCcHHHHHHHHHHHHHHHHHHHHhcccC-C Q lcl|NC_015158. 532 NLAKMLEHNLSLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQVPL-V 581 (581) Q Consensus 532 ~l~~~~~e~~~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~-~ 581 (581) .+++++ +. .+ + .++|.+++.++ +. +..++++= . T Consensus 428 t~~~~l------~~----v~--D--~~~E~~ri~~E-~~--~~~~~~~~~~ 461 (478) T protein:vir:10 428 TILSNH------AW----VE--D--PVAEMERIEQE-NI--ELNQQLPDIE 461 (478) T ss_pred HHHHhC------CC----CC--C--HHHHHHHHHHH-HH--HHHhhccccc Confidence 333222 21 12 2 23445554432 11 11111111 1 No 71 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=99.69 E-value=7e-15 Score=98.19 Aligned_cols=434 Identities=12% Similarity=0.054 Sum_probs=228.0 Q ss_pred Cccchhhhh---------hhccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhccccc-c------cccccccc- Q lcl|NC_015158. 1 MTGKVLELQ---------QMLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTT-T------TTNSTLPW- 63 (581) Q Consensus 1 ~~~~~~~~~---------~~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~-~------~~~~~~~~- 63 (581) |.-..-+.+ ++.+ ...-....|.++.+.... |.+. ..++++|+...+.- . ......++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~i~~~i~~~~~-~~~~---~~~~~~yY~g~~~i~~~~~~~~~~~~~~~~~ 75 (468) T protein:vir:96 1 MIDIFWPNEKPYHERVVEQIKP-QYETQEEMILRLITKHKE-NVED---ITVGERYYNHQPDVLFNAPKRNVKGEIDPFK 75 (468) T ss_pred CccccCCcCceeehheeecccc-cccCcHHHHHHHHHHHHH-HHHH---HHHHHHHhcCCCccccccccccccccccccc Confidence 333322211 1111 111222344444444332 3333 34455565544311 0 00001112 Q ss_pred -cccccccchHHHHHHHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEE Q lcl|NC_015158. 64 -KNKTTLPKLCQIRDNLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFA 142 (581) Q Consensus 64 -k~~~~~pki~~~~d~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~ 142 (581) .+++.+|-...+++...++++ +.+- ++. .++++..+.++++ +. .|+.....++..++.+||.|+. T Consensus 76 ~~~ki~~n~~~~Iv~~~~~~l~----g~p~--~~~---~~d~~~~~~l~~~----~~-n~~~~~~~~~~~~~~~~G~~~~ 141 (468) T protein:vir:96 76 PDWRMYTNYHQNLVDQKVAYAV----ANPV--TYG---TEDEKSLKTIQEV----LN-HKWDDKLVDILTAASNKGVEWI 141 (468) T ss_pred cccccccchHHHHHHHHHhhhc----cCCc--eec---cCChHHHHHHHHH----Hh-cCHHHHHHHHHHHHhhcCeEEE Confidence 346777877778888777775 3332 222 2333333444444 32 5788888999999999999999 Q ss_pred EEeeecceeeeeeeeeEeeeeccceEEecchhheeecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHH Q lcl|NC_015158. 143 TVEYVKETTKDEESGATRDTYFGPRAVRIDPKDIVFNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRR 222 (581) Q Consensus 143 k~~~~~~~~~~~~~~~~~~~~~~p~ie~V~p~df~~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~ 222 (581) .+.+... ..+++..++|.++|+=-.-....+.-+++|.+...+. .+ T Consensus 142 ~v~~d~~--------------~~~~i~~~~p~~~~~v~~~~~~~~~~~~ir~~~~~~~-----------~~--------- 187 (468) T protein:vir:96 142 QPYVDEQ--------------GEFKTFRVPAEQAIPIWTNKERDELKAFIRLYELDGG-----------ER--------- 187 (468) T ss_pred EEEEcCC--------------CceEEEEEcccceEEEEcCCCCCceEEEEEEEEecCc-----------eE--------- Confidence 8876321 1367888999997643222222333223333321100 00 Q ss_pred hhhccCCcccchhhhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCC Q lcl|NC_015158. 223 EFRRGLGTYTREDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQ 302 (581) Q Consensus 223 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~ 302 (581) .+++..+++..+.+++ +......................|.+.|+ T Consensus 188 ----------------------------~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 232 (468) T protein:vir:96 188 ----------------------------VEYWTANDVTFYELKD-------GQLIPDYYQGEEHVQAHYYVGNKSMSWNR 232 (468) T ss_pred ----------------------------EEEEeCCeEEEEEEcC-------CceeecccccccccccceeeccccccCCc Confidence 0112222333322221 11000000000000011112223455788 Q ss_pred CCeeEecccccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc----ccccc--cCCceeEEeCC--C Q lcl|NC_015158. 303 APIFHCGWRIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD----VEEFV--WGPMEQIYING--D 374 (581) Q Consensus 303 ~Pf~~~~~~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d----~~~i~--~~pG~vi~~~~--~ 374 (581) .|++.+.. .-+|.|..+.+.++++.+|.+...+.+++...++|.+.+.+. ..... .+.++++.+.. + T Consensus 233 iPvv~~~n-----~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~~~~~~~~i~~~~d~~ 307 (468) T protein:vir:96 233 VPFIPFKN-----NPQEVSDLFMYKTIIDAMDKRLSDTQNTFDEATELIYVLKGYEGEDLEEFMYNLKYYKAINVDGDGS 307 (468) T ss_pred ccEEEecC-----CCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCccccchhhhhhhcCceEEecCCCC Confidence 88875543 456999999999999999999999999999999999877541 12221 23467777653 3 Q ss_pred CCcccccCCCccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015158. 375 GDVEMMAPNTQALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAM 454 (581) Q Consensus 375 ~~i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~ 454 (581) ++++++..+.........++.+...+-..|++|.++.+.. +++.||.++...+..+........+.|.+ +++++++++ T Consensus 308 ~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~-~~n~Sg~Alk~~~~~l~~k~~~k~~~~~~-~l~~~~~li 385 (468) T protein:vir:96 308 GGVDTIQIDVPVQSAKEYLDMLRDYVIEFGQGVDFQQDKF-GNSPSGIALKFMYSNLDLKANKLKNKTLT-ALQELLQYI 385 (468) T ss_pred CcceEEeecCChHHHHHHHHHHHHHHHHHhCccccccccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHH Confidence 5678888776566677778999999999999998765432 34667777777788888777888888884 678889888 Q ss_pred HHHHHhhcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHH Q lcl|NC_015158. 455 LEISRRNLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLA 534 (581) Q Consensus 455 ~~f~~~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~ 534 (581) ++++..+.+ ..++...|. -.......+.++ .+.+ . +.+|.+.++ T Consensus 386 ~~~~g~~~d---------------------~~~i~i~f~--~~~p~d~~e~a~---~~~~-------~---g~iS~et~i 429 (468) T protein:vir:96 386 IDFYKLSIK---------------------VQDVEITFN--FNVMVNELEQSQ---IGVN-------S---QYLSKETVV 429 (468) T ss_pred HHHhCCCcc---------------------cceeeEEec--CCCCcCHHHHHH---HHHh-------c---CCCchHHHH Confidence 877532222 122322221 111222333333 2211 1 223433332 Q ss_pred HHHHHHhcCCCcccccCCCCcHHHHHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 535 KMLEHNLSLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 535 ~~~~e~~~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) +. ++. .. + .++|.+++.++-++ ..++|..+. T Consensus 430 ----~~--l~~----v~--D--~~~E~~ri~~E~~~--~~~~~~~~~ 460 (468) T protein:vir:96 430 ----TN--HPW----VD--D--PVAEMERIDQEELA--LPSIEEGLN 460 (468) T ss_pred ----Hh--CCC----CC--C--HHHHHHHHHHHHHH--HHHHhhccC Confidence 22 232 12 2 24455555543322 233444555 No 72 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=99.69 E-value=8.2e-15 Score=97.80 Aligned_cols=432 Identities=13% Similarity=0.114 Sum_probs=212.7 Q ss_pred hhhhcc--chhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhcccc-cccccccccccc---cccccchHHHHHHHHH Q lcl|NC_015158. 8 LQQMLD--DTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDT-TTTTNSTLPWKN---KTTLPKLCQIRDNLHS 81 (581) Q Consensus 8 ~~~~~~--~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~-~~~~~~~~~~k~---~~~~pki~~~~d~~~~ 81 (581) |.+.++ +..++....|..+.+.|... . ..+.++..|+...+. ...+.. .|.+. +..++-+.-+++.+.. T Consensus 1 ~~~~~~~~~e~~~~~~~~~~l~~~~~~~-~---~r~~~l~~YY~G~~~i~~~~~~-~~~~~~~~~~v~n~~~~iVd~~~~ 75 (486) T protein:vir:42 1 MTAPLPGMEEIEDPAVVREEMISAFEDA-S---KDLASNTSYYDAERRPEAIGVT-VPREMQQLLAHVGYPRLYVDSVAE 75 (486) T ss_pred CCCCCCCCCCcccHHHHHHHHHHHHHHH-H---HHHHHHHHHhcccCcchhcccc-cchhHhhhhhccchHHHHHHHHHh Confidence 333222 33444455566666655442 2 334445556554442 222221 22211 2234444555555555 Q ss_pred HHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeeeeEee Q lcl|NC_015158. 82 NYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESGATRD 161 (581) Q Consensus 82 ~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~~~~~ 161 (581) +|. +.. |.+ .+++... +.++..+.++++.....++++++.+||.|++.+........ ... T Consensus 76 ~l~----~~g--~~~----~~~~~~~----~~~~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~e~~~~------~~~ 135 (486) T protein:vir:42 76 RQA----VEG--FRL----GDADEAD----EELWQWWQANNLDIEAPLGYTDAYVHGRSFITISKPDPQLD------LGW 135 (486) T ss_pred hhc----ccc--eec----CCCchhH----HHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccc------ccc Confidence 441 211 111 1121221 23455567788999999999999999999999854322111 011 Q ss_pred eeccceEEecchhhe--eecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchhhhhc Q lcl|NC_015158. 162 TYFGPRAVRIDPKDI--VFNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDCEKA 239 (581) Q Consensus 162 ~~~~p~ie~V~p~df--~~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~ 239 (581) ....|.+..++|.++ ++||.... -..++ |...+. ++.. T Consensus 136 ~~~~~~i~~~~p~~~~~i~d~~~~~--~~~~~-~~~~~~-----------~~~~-------------------------- 175 (486) T protein:vir:42 136 DQNVPIIRVEPPTRMHAEIDPRINR--VSKAI-RVAYDK-----------EGNE-------------------------- 175 (486) T ss_pred CCCeeEEEEecccceEEEEeCCCCC--eEEEE-EEEEec-----------CCCe-------------------------- Confidence 122367888999996 56765322 11112 222110 0000 Q ss_pred cccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccccCCcccC Q lcl|NC_015158. 240 VGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIRQDNLYA 319 (581) Q Consensus 240 ~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p~~~~G 319 (581) ....++|..+. ++.|. ..+|+ ..+. ...|.+.|+.|++.+...+..+..|| T Consensus 176 --------~~~~~~y~~~~--~~~~~-----~~~~~-------------~~~~-~~~~h~~g~vPvv~~~n~~~~~~~~G 226 (486) T protein:vir:42 176 --------IQAATLYTPME--TIGWF-----RADGE-------------WAEW-FNVPHGLGVVPVVPLPNRTRLSDLYG 226 (486) T ss_pred --------EEEEEEEcCCc--EEEEE-----ecCCc-------------EEee-cceecCCCCceEEEeccccccCCCCC Confidence 00001111111 11111 01111 1111 12234568899998999999999999 Q ss_pred CCcHH-hhhhHHHHHHHHHHHHHHHHHHhcCCeEEEec-cccc-----------cccCCceeEEeCCCCCcccccCCCcc Q lcl|NC_015158. 320 MGPLD-NLVGMQYRIDHLENLKADVFDLIAFPPMKVKG-DVEE-----------FVWGPMEQIYINGDGDVEMMAPNTQA 386 (581) Q Consensus 320 ~s~~~-~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~-d~~~-----------i~~~pG~vi~~~~~~~i~~~~~p~~~ 386 (581) +|..+ .++++|+.+|.....+.+++...++|+..+.+ +++. +...+|++|.... +++...+.+. T Consensus 227 ~s~i~~~v~~liDa~~~~~s~~~~~~e~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~q~~~-- 303 (486) T protein:vir:42 227 TSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPEEIGVDSETGQTLFDAYLARILAFED-AEGKIQQFSA-- 303 (486) T ss_pred cccchhhHHHHHHHHHHHHHHHHHHHHhhcchHHHhhcCCccccccccccccchhhhhhchhcccCC-CCceEEeecc-- Confidence 99887 48899999999999999999999999876543 2111 1123566665432 3344444332 Q ss_pred chhHHHHHHHHHHHHHh---cCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Q lcl|NC_015158. 387 LQADMQIQILEAKMEEF---AGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLD 463 (581) Q Consensus 387 ~~~~~~lq~~~~~~ee~---TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d 463 (581) .+..+.++.++..+..+ +++|....|..+..+-+|.++...+...........+.|.. .+++++++++.+.. +.+ T Consensus 304 ~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~~~~~f~~-~l~~~~~l~~~~~~-~~~ 381 (486) T protein:vir:42 304 AELANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNLMFGG-AWEEAMRIAYRIMK-GGD 381 (486) T ss_pred cCHHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhc-CCC Confidence 23455677777777766 66777777754433345666777777777777788888886 56888888876531 111 Q ss_pred ccceeeecCchhcccCCCccCHH--HhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHHHHHHHHh Q lcl|NC_015158. 464 VADTIRVFDSDDKVATFMNVNKD--DITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLAKMLEHNL 541 (581) Q Consensus 464 ~~~~iR~~~~~~~~~~~~~v~r~--di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~~~e~~ 541 (581) . +.+ +|+..| .........+.++-+..|.+. +..+.+. ..+ .+.+ T Consensus 382 ~-------------------~~d~~~i~v~w--~~~~~~s~~~~ad~~~kl~~~-----~~g~~s~---et~----~~~l 428 (486) T protein:vir:42 382 V-------------------PPDMLRMETVW--RDPSTPTYAAKADAATKLYGN-----GQGVIPR---ERA----RIDM 428 (486) T ss_pred c-------------------cccceeeeEEe--cCCCCCCHHHHHHHHHHHHhc-----ccCCCCH---HHH----HhcC Confidence 0 011 122222 111223444444433333221 1123332 222 2222 Q ss_pred cCCCcccccCCCCcHHHHHHHHHHH-HHHHHHHHHhcc----------cCC Q lcl|NC_015158. 542 SLGGWDIFKPNVAVMEAQTTSALVN-QSQAQIEEEAQV----------PLV 581 (581) Q Consensus 542 ~l~~~~~~~~~~~~~~~~~~q~~~q-~aq~~~~~~~~~----------~~~ 581 (581) ++ .+ ++.+ +.+++.. +.........++ +.- T Consensus 429 ---g~---~~--d~~~--e~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~ 469 (486) T protein:vir:42 429 ---GY---SV--KERE--EMRRWDEEEAAMGLGLLGTMVDADPTVPGSPSP 469 (486) T ss_pred ---CC---Ch--hHHH--HHHHHHHHHHHHHHHHHHHhhcCCCCCCCCCCC Confidence 32 11 1112 2222211 111111111110 000 No 73 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=99.68 E-value=1.6e-14 Score=96.27 Aligned_cols=437 Identities=12% Similarity=0.075 Sum_probs=213.5 Q ss_pred hhhhcc--chhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhccc-cccccccc--ccccccccccchHHHHHHHHHH Q lcl|NC_015158. 8 LQQMLD--DTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATD-TTTTTNST--LPWKNKTTLPKLCQIRDNLHSN 82 (581) Q Consensus 8 ~~~~~~--~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~-~~~~~~~~--~~~k~~~~~pki~~~~d~~~~~ 82 (581) |...+. ++.+.-+..+..+.+.|.+.+ ..+.+++.|+...+ ....+... .-...++.++-+.-+++.+..+ T Consensus 1 ~~~~i~~~~~~~~~~~~~~~L~~~~~~~~----~r~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~ 76 (485) T protein:vir:24 1 MTAPLPGQEEIADPAIARDEMVSAFEDQN----QNLRSNTSYYEAERRPEAIGVTVPVQMQSLLAHVGYPRLYVDSIAER 76 (485) T ss_pred CCCCCCCCCcccchHHHHHHHHHHHHHHH----HHHHHHHHHHhccCchhhcCcccchhhhhhhhccchHHHHHHHHhhh Confidence 222222 222333333333334332322 23334455655443 22222111 1112334456666677776666 Q ss_pred HHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeeeeEeee Q lcl|NC_015158. 83 YISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESGATRDT 162 (581) Q Consensus 83 l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~~~~~~ 162 (581) |. ++. |. ..+.+. ..+.+++.+.++++.....++.+++.+||.|++.+...+.... .... T Consensus 77 l~----~~g--~~-----~~~~~~---~~~~l~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~------~~~~ 136 (485) T protein:vir:24 77 QA----VEG--FR-----LGDADE---ADEELWQWWQANNLDIEAPLGYTDAYVHGRSYITISRPDPQID------LGWD 136 (485) T ss_pred hc----cCc--ee-----cCCCch---hHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccc------cccC Confidence 52 221 11 111111 1223555667789999999999999999999999866432211 1112 Q ss_pred eccceEEecchhhe--eecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchhhhhcc Q lcl|NC_015158. 163 YFGPRAVRIDPKDI--VFNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDCEKAV 240 (581) Q Consensus 163 ~~~p~ie~V~p~df--~~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~ 240 (581) ...|+|..+||.++ ++||+...+ ..++.+. .+. +. T Consensus 137 ~~~~~i~~~~p~~~~~i~D~~~~~~--~~~~~~~-~~~-----------~~----------------------------- 173 (485) T protein:vir:24 137 PNVPLIRVEPPTRMYAEIDPRIGRP--AKAIRVA-YDA-----------EG----------------------------- 173 (485) T ss_pred CCcceEEEeccceeEEEeeCCcCce--eEEEEEE-Eee-----------cC----------------------------- Confidence 23467889999997 556653221 1112111 110 00 Q ss_pred ccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccccCCcccCC Q lcl|NC_015158. 241 GFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIRQDNLYAM 320 (581) Q Consensus 241 ~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p~~~~G~ 320 (581) +......+|..+ ++++|. ..+| +... ....|-+.|..|++.+...+..+..||+ T Consensus 174 -----~~~~~~~~y~~~--~~~~~~-----~~~~-------------~~~~-~~~~~h~~g~vPvv~f~n~~~~~~~~G~ 227 (485) T protein:vir:24 174 -----NEIQAATLYTPN--ETFGWF-----RAEG-------------EWVE-WFSDPHGLGAVPVVPLPNRTRLSDLYGT 227 (485) T ss_pred -----CeEEEEEEEcCC--cEEEEE-----ecCC-------------ceEe-ecccccCCCcccEEEeccCcccCCcCCc Confidence 000000001111 011111 1111 1111 1122334688999999988999999999 Q ss_pred CcHH-hhhhHHHHHHHHHHHHHHHHHHhcCCeEEEec-cccc-----------cccCCceeEEeCCCCCcccccCCCccc Q lcl|NC_015158. 321 GPLD-NLVGMQYRIDHLENLKADVFDLIAFPPMKVKG-DVEE-----------FVWGPMEQIYINGDGDVEMMAPNTQAL 387 (581) Q Consensus 321 s~~~-~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~-d~~~-----------i~~~pG~vi~~~~~~~i~~~~~p~~~~ 387 (581) |... .++++++.+|.++..+.+++...++|+..+.+ +++. +...+|.+|.... +++...+.+ .. T Consensus 228 s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~i~~~~~-~~~~~~q~~--~~ 304 (485) T protein:vir:24 228 SEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPEEIGVDPETGQTLFDAYLARILAFED-AEGKIQQFS--AA 304 (485) T ss_pred ccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhccCCccccccccccccchhhhcccceeccCC-CCceEEeec--cc Confidence 9876 58999999999999999999999999876543 1111 1234677776643 233333322 22 Q ss_pred hhHHHHHHHHHHHHHh---cCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCc Q lcl|NC_015158. 388 QADMQIQILEAKMEEF---AGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDV 464 (581) Q Consensus 388 ~~~~~lq~~~~~~ee~---TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~ 464 (581) ++.+.++.++..+..+ +++|....|..+..+-++.++...+...........+.|.. .+++++++++.+... ... T Consensus 305 ~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~~~~~f~~-~l~~~~~l~~~~~~~-~~~ 382 (485) T protein:vir:24 305 ELANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNAIFGG-AWEEAMRLAYRLMKG-GDV 382 (485) T ss_pred chHHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhcC-CCC Confidence 3445567777766665 67777777755443446666777777777888888888885 578888888765321 110 Q ss_pred cceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHHHHHHHHhcCC Q lcl|NC_015158. 465 ADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLAKMLEHNLSLG 544 (581) Q Consensus 465 ~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~~~e~~~l~ 544 (581) + ..-.+|...|. -.-.....+.+. .+..+.+. +. +.++.+.++ +++ T Consensus 383 ~-----------------~d~~~i~v~f~--~~~~~s~~~~ad---~~~kl~~~--g~---~~~s~et~~----~~l--- 428 (485) T protein:vir:24 383 P-----------------PDMLRMETVWR--DPSTPTYAAKAD---AATKLYGN--GQ---GVIPRERAR----KDM--- 428 (485) T ss_pred c-----------------cccceeeEEec--CCCCCCHHHHHH---HHHHHHhc--cc---ccCCHHHHH----hhC--- Confidence 0 00011222221 111223344333 33333221 11 223332232 322 Q ss_pred CcccccCCCCcHHHHHHHHHHHHHHHH---HHHHhcccCC Q lcl|NC_015158. 545 GWDIFKPNVAVMEAQTTSALVNQSQAQ---IEEEAQVPLV 581 (581) Q Consensus 545 ~~~~~~~~~~~~~~~~~q~~~q~aq~~---~~~~~~~~~~ 581 (581) ++ .+ ++.++.++.+-.+.++.. .....+.+-. T Consensus 429 ~~---~~--d~~~e~~~~~ee~~~~~~~~~~~~~~~~~~~ 463 (485) T protein:vir:24 429 GY---SI--AEREEMRRWDEEEAAMGLGLLGTMVDADPTV 463 (485) T ss_pred CC---CH--hHHHHHHHHHHHHhhhhhhHHHhhcccCCCC Confidence 22 11 111221111111111111 1111223333 No 74 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=99.68 E-value=2.1e-15 Score=101.03 Aligned_cols=446 Identities=13% Similarity=0.084 Sum_probs=226.2 Q ss_pred CccchhhhhhhccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhccc-ccc-----cc---ccccc--ccccccc Q lcl|NC_015158. 1 MTGKVLELQQMLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATD-TTT-----TT---NSTLP--WKNKTTL 69 (581) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~-~~~-----~~---~~~~~--~k~~~~~ 69 (581) |-...++..---+ ...++-..|.+... ..|-+ .+.++.+|+.-.+ .-. .+ ..+.. -.+++.+ T Consensus 7 ~~~~~~~~~~~~~-~~~~~~~~i~~~~~---~~~~~---~~~~~~~yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~ki~~ 79 (479) T protein:vir:79 7 SETDLIKVQLKKE-STINLVKVIEHYIL---KHRPE---KYKQGEEYYYGNTDVNNKRRYYLLDGAKVDDFTKVNNKAIN 79 (479) T ss_pred cccceEeeccccC-ChhHHHHHHHHHHh---hhhHH---HHHHHHHHhccCCcccccccccccccccccccccCcceeec Confidence 4444443322222 22333344444333 22333 3445555653322 100 00 00101 1235667 Q ss_pred cchHHHHHHHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecc Q lcl|NC_015158. 70 PKLCQIRDNLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKE 149 (581) Q Consensus 70 pki~~~~d~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~ 149 (581) |-...+++...++++ +++- ++.+ +++.. +++++..+ ..+|...+.++.+++.++|.|+..+.+... T Consensus 80 ~~~~~Ivd~~~~~l~----g~p~--~~~~---~~~~~----~~~~~~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~ 145 (479) T protein:vir:79 80 NYHKLLVDQKVGYSV----GNPI--VFNA---DDDNL----TKLLNDLL-GEEFDDTITELYLNASNKGVEWLHPYINRK 145 (479) T ss_pred chHHHHHHHHHhhhh----cCCc--eecc---CCHHH----HHHHHHHH-hcCHHHHHHHHHHHHHhcCeEEEEEEeCCC Confidence 767777787777765 3332 3333 22222 23443333 368999999999999999999988865322 Q ss_pred eeeeeeeeeEeeeeccceEEecchhhee--ecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhcc Q lcl|NC_015158. 150 TTKDEESGATRDTYFGPRAVRIDPKDIV--FNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRG 227 (581) Q Consensus 150 ~~~~~~~~~~~~~~~~p~ie~V~p~df~--~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~ 227 (581) ..+++..++|.++| +|+... ...-+++|.+.+... .+ . T Consensus 146 --------------~~~~i~~~~p~~~~~v~d~~~~--~~~~~~ir~y~~~~~-------~~--~--------------- 185 (479) T protein:vir:79 146 --------------GEFKYVIIPAEEAIPIWDSKRQ--RELVAFIRFYYIEDI-------DG--N--------------- 185 (479) T ss_pred --------------CceEEEEEccceeEEEEeCCCC--CceEEEEEEEEEeec-------CC--c--------------- Confidence 13678889999974 444422 111222333322100 00 0 Q ss_pred CCcccchhhhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeE Q lcl|NC_015158. 228 LGTYTREDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFH 307 (581) Q Consensus 228 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~ 307 (581) . ....+.|..+.+..+.+.+.. ................-.........|.+.|..|++. T Consensus 186 ------------------~-~~~~e~y~~~~i~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~ 244 (479) T protein:vir:79 186 ------------------K-IKRVEYYTENDITYFIERGNS--FIQEFLYDEYGKMTDIQEGHFRINNKEQGWGKVPFIP 244 (479) T ss_pred ------------------e-EEEEEEEeCCcEEEEEecCCc--ccccccccccccccccccccccccccccCCCcccEEE Confidence 0 000111111222111111000 0000000000000000011112223344457778775 Q ss_pred ecccccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEeccc----cc-c-ccCCceeEEeCCCCCccccc Q lcl|NC_015158. 308 CGWRIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGDV----EE-F-VWGPMEQIYINGDGDVEMMA 381 (581) Q Consensus 308 ~~~~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d~----~~-i-~~~pG~vi~~~~~~~i~~~~ 381 (581) +. .+-+|.|..+.+.++++.+|.+...+.+++...++|.+.+.+.. .+ . ..+.++++.+++++++.+++ T Consensus 245 ~~-----nn~~g~sd~~~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~l~ 319 (479) T protein:vir:79 245 FK-----NNEKCVSDLTFYKSLIDIYDNNISTLADNLDEIQEVIYVLKEYPGTSLQEFIDNIRYYKSIKVDGGGGVDKLE 319 (479) T ss_pred ec-----CCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccccchhhhhhccceecCCCCcceEEe Confidence 43 45679999999999999999999999999999999998776521 11 1 22467788889999999888 Q ss_pred CCCccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_015158. 382 PNTQALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRN 461 (581) Q Consensus 382 ~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n 461 (581) .+.......+.++.+...+-..|++|.++.+. .++.|+.++...+..+........+.|.. +++++++++++++... T Consensus 320 ~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~--~gn~Sg~Ai~~~~~~l~~k~~~~~~~~~~-~l~~~~~li~~~~~~~ 396 (479) T protein:vir:79 320 INIPVEAKKELLDRLEKNIIIFGQGVNPESQN--TGDKSGVALKFLYSLLDLKCSKTEKKFKK-AIRELLWFVCEYLKIS 396 (479) T ss_pred ccCCHHHHHHHHHHHHHHHHHHhCcccccccc--ccchhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhcc Confidence 77666666777888888999999999887764 35667777877777777777888888885 6788888887775321 Q ss_pred cCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHHHHHHHHh Q lcl|NC_015158. 462 LDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLAKMLEHNL 541 (581) Q Consensus 462 ~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~~~e~~ 541 (581) . + ..+...++...|. -.-.....+.++.++.| . .+ +|.+.++ +. T Consensus 397 ~---------~--------~~~~~~~i~i~f~--~~~p~~~~~~a~~~~kl---~------g~---iS~et~l----~~- 440 (479) T protein:vir:79 397 G---------N--------KSYDYKTVQITFN--HSMIINEAEKIDMAAKS---T------GI---VSDETIV----SN- 440 (479) T ss_pred C---------C--------CccccccceEEeC--CCCCcCHHHHHHHHHHH---h------cc---CcHHHHH----Hh- Confidence 1 0 1111223333331 11122334444433332 1 12 2332222 22 Q ss_pred cCCCcccccCCCCcHHHHHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 542 SLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 542 ~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) ++. .+ ++ ++|.+++..+-....+.....+=. T Consensus 441 -l~~----v~--d~--~~E~~ri~~E~~~~~~~~~~~~~~ 471 (479) T protein:vir:79 441 -HPW----VE--DV--NDELERLKKQEDTQKEYDDLIPNN 471 (479) T ss_pred -CCC----CC--CH--HHHHHHHHHHHHHHHHHHhccCcc Confidence 221 12 22 334444333211111111111111 No 75 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=99.68 E-value=1.1e-14 Score=97.18 Aligned_cols=431 Identities=12% Similarity=0.073 Sum_probs=205.4 Q ss_pred CccchhhhhhhccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhccc-cccccccccc--ccccccccchHHHHH Q lcl|NC_015158. 1 MTGKVLELQQMLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATD-TTTTTNSTLP--WKNKTTLPKLCQIRD 77 (581) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~-~~~~~~~~~~--~k~~~~~pki~~~~d 77 (581) |+|- +++...|.+.+. . ...++.++..|+...+ .+..+...-+ ...++.++-+.-+++ T Consensus 1 ~~t~------------~d~i~~L~~~~~---~----~~~r~~~~~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd 61 (480) T protein:vir:78 1 MTTY------------HEHVERLQGLLA---R----DLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLR 61 (480) T ss_pred CCCH------------HHHHHHHHHHHH---H----HHHHHHHHHHHHhccccchhcccccchhhhhhhhhcchHHHHHH Confidence 3332 333333333332 2 2234445555654443 2222222111 123344565556666 Q ss_pred HHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeee Q lcl|NC_015158. 78 NLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESG 157 (581) Q Consensus 78 ~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~ 157 (581) .+..+|. +.. |. ..++++. .+.++..+.++++.....+.++++.+||.|++.+.-.+. T Consensus 62 ~~~~~l~----~~g--~~----~~~d~~~----~~~l~~i~~~N~~~~~~~~~~~~a~~~G~ay~~v~~~~~-------- 119 (480) T protein:vir:78 62 TLSDRLD----IEG--FR----ISEDSEG----LEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDV-------- 119 (480) T ss_pred HHHhhhc----cCc--ee----cCCCchh----HHHHHHHHHhcCHHHHHHHHHHHHhhcCceEEEeecCcc-------- Confidence 6666552 221 11 1122222 234555667889999999999999999999988742100 Q ss_pred eEeeeeccceEEecchhhe--eecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchh Q lcl|NC_015158. 158 ATRDTYFGPRAVRIDPKDI--VFNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTRED 235 (581) Q Consensus 158 ~~~~~~~~p~ie~V~p~df--~~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 235 (581) ........|++..+||.++ ++||.... ..-+.+|......+ T Consensus 120 ~~~d~~~~~~i~~~~p~~~~~i~D~~~~~--~~~~~i~~~~~~d~----------------------------------- 162 (480) T protein:vir:78 120 ESGDPAGIPLIRVESPLYMYAELDPRNTR--RVTRAVRLYTTRDD----------------------------------- 162 (480) T ss_pred ccCCCCCeeEEEEEcccceEEEEcCCCcc--ceEEEEEEEEeecC----------------------------------- Confidence 0111223477889999996 55665331 11222333222100 Q ss_pred hhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccccCC Q lcl|NC_015158. 236 CEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIRQD 315 (581) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p~ 315 (581) .+.....++|..+.+. .|. ..++... +.....+..|.+.|..|++.+...+..+ T Consensus 163 ---------~~~~~~~~~y~~~~~~--~~~-----~~~~~~~----------~~~~~~~~~~~~~g~vPvv~f~n~~~~~ 216 (480) T protein:vir:78 163 ---------VAVPDRATLYLPDETV--PLR-----RNGGLND----------QWVVDGDVIKHGLGVVPVVPLTNDPRLG 216 (480) T ss_pred ---------CcceEEEEEEeCCeEE--EEE-----ecCCCcc----------cccccccccccCCCCcceEEeecccccC Confidence 0000001112222111 111 0111100 1111112234456889999999999999 Q ss_pred cccCCCcHHh-hhhHHHHHHHHHHHHHHHHHHhcCCeEEEec-ccccc---------ccCCceeEEeCCCCCcccccCCC Q lcl|NC_015158. 316 NLYAMGPLDN-LVGMQYRIDHLENLKADVFDLIAFPPMKVKG-DVEEF---------VWGPMEQIYINGDGDVEMMAPNT 384 (581) Q Consensus 316 ~~~G~s~~~~-l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~-d~~~i---------~~~pG~vi~~~~~~~i~~~~~p~ 384 (581) ..||.|.... ++++|+.+|.++..+.+.+...++|+..+.+ +.++. ...+|.++.. .++++...+.+. T Consensus 217 ~~~G~sdi~~~i~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~ 295 (480) T protein:vir:78 217 NRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTL-ASEAAKISEFKA 295 (480) T ss_pred CccCccchhHHHHHHHHHHHHHHHHHHHHHHhhcchhhhhhCCCccccccccccchhhhhhhhhccC-CCCCceEEecCc Confidence 9999998875 8999999999999999999999999865543 22211 1123444433 334444444333 Q ss_pred ccchhHHHHHHHHHHHHH---hcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_015158. 385 QALQADMQIQILEAKMEE---FAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRN 461 (581) Q Consensus 385 ~~~~~~~~lq~~~~~~ee---~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n 461 (581) . ++.+.++.++..+.. .|++|....|..+..+-+|-++...+...........+.|.. .+++++++++.+.... T Consensus 296 ~--~~~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~f~~-~l~~~~rl~~~~~~~~ 372 (480) T protein:vir:78 296 A--ELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGG-AWERAMRIAMQIMGRE 372 (480) T ss_pred c--CHHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHcCCC Confidence 2 233344555555444 577777787754332335555666777777777888888875 5777888887663210 Q ss_pred cCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHHHHHHHHh Q lcl|NC_015158. 462 LDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLAKMLEHNL 541 (581) Q Consensus 462 ~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~~~e~~ 541 (581) .. . .+ ..++..| .........+.+ ..+.++.+. ..+.++. +.+.+.+ T Consensus 373 ~~-~-------~~-----------~~i~v~w--~~~~~~s~~~~a---d~~~kl~~~-----g~~~~s~----et~~~~l 419 (480) T protein:vir:78 373 VT-E-------EY-----------TRLETVW--RDPSTPTVAAKA---DAVSKLYAN-----GQGPIPK----EQARIDL 419 (480) T ss_pred cc-c-------cc-----------eeeeEEe--cCCCCCCHHHHH---HHHHHHHHh-----cccCCCH----HHHHhcC Confidence 00 0 00 0111111 111122333333 333333321 1122332 2222322 Q ss_pred cCCCcccccCCCCcHHHHHHHHHHHHHHHHHHHH----------hcccCC Q lcl|NC_015158. 542 SLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEE----------AQVPLV 581 (581) Q Consensus 542 ~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~----------~~~~~~ 581 (581) ++ .+ ++.++. .++..+++++.+... +.-|-. T Consensus 420 ---g~---~~--d~~~e~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 460 (480) T protein:vir:78 420 ---GY---TA--TQREQM-RDWDKQETEDMIDTLYSTTKAQADATPKPTV 460 (480) T ss_pred ---CC---CH--hHHHHH-HHHHHHHHHHHHHHhhccccCCCccccCCCC Confidence 32 11 111111 111111111111110 001111 No 76 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=99.67 E-value=9.7e-15 Score=97.41 Aligned_cols=428 Identities=12% Similarity=0.078 Sum_probs=204.6 Q ss_pred CccchhhhhhhccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhccc-ccccccccccc---cccccccchHHHH Q lcl|NC_015158. 1 MTGKVLELQQMLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATD-TTTTTNSTLPW---KNKTTLPKLCQIR 76 (581) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~-~~~~~~~~~~~---k~~~~~pki~~~~ 76 (581) |+| -++....|.+.+. +. ...+.++.+|+...+ .+..+.. .|. ..++.++-+.-++ T Consensus 1 ~~t------------~~~~i~~L~~~~~---~~----~~r~~~l~~Yy~G~~~i~~~~~~-~~~~~~~~~~~~n~~~~iv 60 (480) T protein:vir:78 1 MTT------------YHEHVERLQGLLA---RD----LPNLLEAEAYRNGTRRLKTIGIG-APPELAYLDVQPGWVATYL 60 (480) T ss_pred CCC------------HHHHHHHHHHHHH---HH----HHHHHHHHHHHhccccccccccc-cchhHhhhhhhcchHHHHH Confidence 332 2333333333332 22 233445555655443 2222222 221 2234455555666 Q ss_pred HHHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeee Q lcl|NC_015158. 77 DNLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEES 156 (581) Q Consensus 77 d~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~ 156 (581) +.+..++. +.+ |. ..++++.. +.++..+.++++...+.++++++.+||.|+..+.-.+.. T Consensus 61 d~~~~~l~---~~g---~~----~~~d~~~~----~~l~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~~~~------ 120 (480) T protein:vir:78 61 RTLSDRLD---IEG---FR----ISEDSEGL----EELWNWWQANDLDEESVLGHDDSLTFGRSYITVSHPDVE------ 120 (480) T ss_pred HHHHhhhc---cCc---ee----cCCCchhH----HHHHHHHHhcCHHHHHHHHHHHHhhcCceEEEEecCccc------ Confidence 66665552 111 11 11222222 245556678899999999999999999998887421110 Q ss_pred eeEeeeeccceEEecchhhe--eecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccch Q lcl|NC_015158. 157 GATRDTYFGPRAVRIDPKDI--VFNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTRE 234 (581) Q Consensus 157 ~~~~~~~~~p~ie~V~p~df--~~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~ 234 (581) .......|++..+||.++ ++||.... ..-+.+|...+..+ T Consensus 121 --~~d~~g~~~i~~~~p~~~~~~~D~~~~~--~~~~~i~~~~~~~~---------------------------------- 162 (480) T protein:vir:78 121 --SGDPAGIPLIRVESPLYMYAELDPRNTR--RVTRAVRLYTTRDD---------------------------------- 162 (480) T ss_pred --cCCCCCeeEEEEEcccceEEEEcCCCcc--ceEEEEEEEEeecC---------------------------------- Confidence 011223477889999997 55665321 11112222222100 Q ss_pred hhhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccccC Q lcl|NC_015158. 235 DCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIRQ 314 (581) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p 314 (581) .+.....++|+.+.+..+.+ .++... ..+...+..|...|+.|++.+...+.. T Consensus 163 ----------~~~~~~~~~y~~~~~~~~~~-------~~~~~~----------~~~~~~~~~~~~~g~vPvv~f~n~~~~ 215 (480) T protein:vir:78 163 ----------VAVPDRATLYLPDETVPLRR-------NGGLND----------QWVVDGDVIKHGLGVVPVVPLTNDPRL 215 (480) T ss_pred ----------CCceEEEEEEeCCeEEEEEe-------cCCCcc----------ccccccccccCCCCCcceEEeeccccc Confidence 00000011122222211111 111110 001111222345688999999999999 Q ss_pred CcccCCCcHHh-hhhHHHHHHHHHHHHHHHHHHhcCCeEEEec-ccccc---------ccCCceeEEeCCCCCcccccCC Q lcl|NC_015158. 315 DNLYAMGPLDN-LVGMQYRIDHLENLKADVFDLIAFPPMKVKG-DVEEF---------VWGPMEQIYINGDGDVEMMAPN 383 (581) Q Consensus 315 ~~~~G~s~~~~-l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~-d~~~i---------~~~pG~vi~~~~~~~i~~~~~p 383 (581) +..||.|.... ++++++.+|.++..+.+++...++|+..+.+ ++++. ...+|.++.. .++++...+.+ T Consensus 216 ~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~ 294 (480) T protein:vir:78 216 GNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTL-ASEAAKISEFK 294 (480) T ss_pred CCccCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhhcCCccccccccccchhhhhhhhhccC-CCCCceEEecC Confidence 99999998875 8999999999999999999999999865543 22221 1123444433 33444444444 Q ss_pred CccchhHHHHHHHHHHHHH---hcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_015158. 384 TQALQADMQIQILEAKMEE---FAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRR 460 (581) Q Consensus 384 ~~~~~~~~~lq~~~~~~ee---~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~ 460 (581) .. ++.+.++.++..+.. .||+|....|..+..+-+|.++...+...........+.|.. .++++++++..+... T Consensus 295 ~~--~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~n~~Sg~Alk~~~~~l~~ka~~~~~~f~~-~l~~~~~l~~~~~g~ 371 (480) T protein:vir:78 295 AA--ELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGG-AWERAMRIAMQIMGR 371 (480) T ss_pred cc--CHHHHHHHHHHHHHHHhcccCCChHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHcCC Confidence 32 233345555555554 577888888864433345555666677777777778888885 567788877665321 Q ss_pred hcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHHHHHHHH Q lcl|NC_015158. 461 NLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLAKMLEHN 540 (581) Q Consensus 461 n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~~~e~ 540 (581) ... .++ ..+...|. -.....+.+.+ ..+.++.+. +. +.++.+. +.+. T Consensus 372 ~~~--------~~~-----------~~i~v~f~--~~~~~s~~~~a---d~~~kl~~~--g~---~~~s~et----~~~~ 418 (480) T protein:vir:78 372 EVT--------EEY-----------TRLETVWR--DPSTPTVAAKA---DAVSKLYAN--GQ---GPIPKEQ----ARID 418 (480) T ss_pred Ccc--------ccc-----------eeeeEEec--CCCCCCHHHHH---HHHHHHHHh--cc---ccCCHHH----HHhc Confidence 000 000 01111111 11122333333 333333321 11 2233322 2232 Q ss_pred hcCCCcccccCCCCcHHHHHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 541 LSLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 541 ~~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) + ++ .+ +..++ ...+ ++++++-...++.... T Consensus 419 l---g~---~~--d~~~~--~~~~-~~e~~~~~~~~~~~~~ 448 (480) T protein:vir:78 419 L---GY---TA--TQREQ--MRDW-DKQETEDMIDTLYSTT 448 (480) T ss_pred C---CC---CH--hHHHH--HHHH-HHHHHHHHHHHhhccc Confidence 2 32 11 11111 1111 1111111111111111 No 77 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=99.67 E-value=1.3e-15 Score=102.24 Aligned_cols=437 Identities=13% Similarity=0.068 Sum_probs=227.2 Q ss_pred hhhhhhccchhh--hHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhc---cc--------ccccc--cccccc--ccccc Q lcl|NC_015158. 6 LELQQMLDDTRD--GLAEQIANTWQNWNSQRQEWLSQKSELRNYIFA---TD--------TTTTT--NSTLPW--KNKTT 68 (581) Q Consensus 6 ~~~~~~~~~~~~--~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~---~~--------~~~~~--~~~~~~--k~~~~ 68 (581) ..|.+.+++-.+ -.+..|.+.-+.++..|....+.+..++.+... .+ ....+ ..+.+. .+++. T Consensus 1 ~~~~~~~~~~~~~~~~~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~ 80 (474) T protein:vir:94 1 MTLYKLIDDIEAQGILPKHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNNKLN 80 (474) T ss_pred CchHHHHhhccccCCCHHHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcccccc Confidence 233333331111 112334444333332222222222222211110 00 00000 011112 23677 Q ss_pred ccchHHHHHHHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeec Q lcl|NC_015158. 69 LPKLCQIRDNLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVK 148 (581) Q Consensus 69 ~pki~~~~d~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~ 148 (581) +|-...+++...++++ .++- ++.. .++.+..+..++.|++.+.++++.....++.+++.+||.|+..++... T Consensus 81 ~n~~~~ivd~~~~yl~----g~pv--~~~~--~~~~~~~e~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~ 152 (474) T protein:vir:94 81 NSFDSEIVDTRVGYLH----GVPV--TYDL--DENAEKNEKLKKFITNFAIRNSVDDEDSEIGKMAAICGYGARLAYIDT 152 (474) T ss_pred cchHHHHHHhHhhhee----ccce--eEee--CCCCcchHHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCC Confidence 7877788888777765 3333 2222 112234466677888888889999999999999999999988764321 Q ss_pred ceeeeeeeeeEeeeeccceEEecchhheeecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccC Q lcl|NC_015158. 149 ETTKDEESGATRDTYFGPRAVRIDPKDIVFNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGL 228 (581) Q Consensus 149 ~~~~~~~~~~~~~~~~~p~ie~V~p~df~~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~ 228 (581) ...+++..++|.++|+=-. .+.+. -+.+|.+.+..+ . T Consensus 153 --------------~~~~~~~~i~p~~~~~v~d-~~~~~-~~~i~~~~~~~~--------~------------------- 189 (474) T protein:vir:94 153 --------------NGDIRIKNIDPYNVIFVGD-NILEP-TYSLRYFYEKDD--------D------------------- 189 (474) T ss_pred --------------CCeeEEEEEcccceEEEEc-CCCce-EEEEEEEEEeeC--------C------------------- Confidence 1136788899999643211 11111 123333322100 0 Q ss_pred CcccchhhhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEe Q lcl|NC_015158. 229 GTYTREDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHC 308 (581) Q Consensus 229 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~ 308 (581) + + ..+...++|.. ..+ +...--..+........|-+.|..|++.+ T Consensus 190 ~--~------------------------~~~~~~~~y~~------~~~---~~~~~~~~~~~~~~~~~~~~~g~vPvv~~ 234 (474) T protein:vir:94 190 N--G------------------------TDYVYAEFYDN------AYY---YVFRGEGIDALQEVGRYEHLFDYNPLFGV 234 (474) T ss_pred C--c------------------------eEEEEEEEEcC------ceE---EEEeecCCCcccccccccCCCCccceEEe Confidence 0 0 00001112210 000 00000000111111122334577887744 Q ss_pred cccccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEec---ccccc-ccCCcee-EEeCCCCCcccccCC Q lcl|NC_015158. 309 GWRIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKG---DVEEF-VWGPMEQ-IYINGDGDVEMMAPN 383 (581) Q Consensus 309 ~~~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~---d~~~i-~~~pG~v-i~~~~~~~i~~~~~p 383 (581) . ..-+|.|..+.++++++.+|.+...+.+++...++|.+.+.+ +.+.. ....++. +-...++++.++..+ T Consensus 235 ~-----n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g~~~~~~~~~~~~~~~~i~~~~~~~~~~~l~~~ 309 (474) T protein:vir:94 235 P-----NNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLRGMGMSEEMIQETQKSGAFELFDKDMDVKYLTKD 309 (474) T ss_pred c-----CCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCchhhhhhhhcceeEecCCCCceeEEecc Confidence 3 456799999999999999999999999999999999987654 12222 2333444 444667788888877 Q ss_pred CccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Q lcl|NC_015158. 384 TQALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLD 463 (581) Q Consensus 384 ~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d 463 (581) .........+..+...+-..|++|..+.+.-+ ++.||.++...+..+........+.|.. +++++++++..++..... T Consensus 310 ~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~-~n~Sg~Al~~~~~~l~~k~~~~~~~~~~-~l~~~~~li~~~l~~~~~ 387 (474) T protein:vir:94 310 VNDTMIENHLDRIEKNIMRFAKSVNFNSDEFN-GNVPIIGMKLKLMALENKCMTFERKMTA-MLRYQFKVILSALKRKGY 387 (474) T ss_pred CCHHHHHHHHHHHHHHHHHHhCCccccccccc-ccchHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhhccC Confidence 66666777789999999999999988765432 4677777888888888888888888885 578899988877543110 Q ss_pred ccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHHHHHHHHhcC Q lcl|NC_015158. 464 VADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLAKMLEHNLSL 543 (581) Q Consensus 464 ~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~~~e~~~l 543 (581) . .......+++..| .........+.++.++.|. .+ +|.+.+++++ T Consensus 388 ----------~-----~~~~~~~~i~~~f--~~~~p~d~~e~a~~~~kl~---------g~---iS~et~~~~l------ 432 (474) T protein:vir:94 388 ----------N-----LDDDSYLNLIFKF--TRNIPVNKLEESQVLINLK---------GQ---VSERTRLGQS------ 432 (474) T ss_pred ----------C-----CCccccccceEEe--CCCCCCCHHHHHHHHHHHh---------cc---CchHHHHHhC------ Confidence 0 0000112333333 1222334455554444331 12 3333333322 Q ss_pred CCcccccCCCCcHHHHHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 544 GGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 544 ~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) +. .+ ++ ++|.+++..+- .+...+.+-+ T Consensus 433 ~~----v~--d~--~~E~eri~~E~---~e~~~~~~~~ 459 (474) T protein:vir:94 433 QL----VD--DV--DYELDEMEKES---LEFNDKLPDI 459 (474) T ss_pred CC----CC--CH--HHHHHHHHHHH---HHHHhhcccc Confidence 21 12 22 33444443321 1222233333 No 78 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=99.67 E-value=1.3e-15 Score=102.24 Aligned_cols=437 Identities=13% Similarity=0.068 Sum_probs=227.2 Q ss_pred hhhhhhccchhh--hHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhc---cc--------ccccc--cccccc--ccccc Q lcl|NC_015158. 6 LELQQMLDDTRD--GLAEQIANTWQNWNSQRQEWLSQKSELRNYIFA---TD--------TTTTT--NSTLPW--KNKTT 68 (581) Q Consensus 6 ~~~~~~~~~~~~--~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~---~~--------~~~~~--~~~~~~--k~~~~ 68 (581) ..|.+.+++-.+ -.+..|.+.-+.++..|....+.+..++.+... .+ ....+ ..+.+. .+++. T Consensus 1 ~~~~~~~~~~~~~~~~~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~ 80 (474) T protein:vir:10 1 MTLYKLIDDIEAQGILPKHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNNKLN 80 (474) T ss_pred CchHHHHhhccccCCCHHHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcccccc Confidence 233333331111 112334444333332222222222222211110 00 00000 011112 23677 Q ss_pred ccchHHHHHHHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeec Q lcl|NC_015158. 69 LPKLCQIRDNLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVK 148 (581) Q Consensus 69 ~pki~~~~d~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~ 148 (581) +|-...+++...++++ .++- ++.. .++.+..+..++.|++.+.++++.....++.+++.+||.|+..++... T Consensus 81 ~n~~~~ivd~~~~yl~----g~pv--~~~~--~~~~~~~e~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~ 152 (474) T protein:vir:10 81 NSFDSEIVDTRVGYLH----GVPV--TYDL--DENAEKNEKLKKFITNFAIRNSVDDEDSEIGKMAAICGYGARLAYIDT 152 (474) T ss_pred cchHHHHHHhHhhhee----ccce--eEee--CCCCcchHHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCC Confidence 7877788888777765 3333 2222 112234466677888888889999999999999999999988764321 Q ss_pred ceeeeeeeeeEeeeeccceEEecchhheeecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccC Q lcl|NC_015158. 149 ETTKDEESGATRDTYFGPRAVRIDPKDIVFNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGL 228 (581) Q Consensus 149 ~~~~~~~~~~~~~~~~~p~ie~V~p~df~~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~ 228 (581) ...+++..++|.++|+=-. .+.+. -+.+|.+.+..+ . T Consensus 153 --------------~~~~~~~~i~p~~~~~v~d-~~~~~-~~~i~~~~~~~~--------~------------------- 189 (474) T protein:vir:10 153 --------------NGDIRIKNIDPYNVIFVGD-NILEP-TYSLRYFYEKDD--------D------------------- 189 (474) T ss_pred --------------CCeeEEEEEcccceEEEEc-CCCce-EEEEEEEEEeeC--------C------------------- Confidence 1136788899999643211 11111 123333322100 0 Q ss_pred CcccchhhhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEe Q lcl|NC_015158. 229 GTYTREDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHC 308 (581) Q Consensus 229 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~ 308 (581) + + ..+...++|.. ..+ +...--..+........|-+.|..|++.+ T Consensus 190 ~--~------------------------~~~~~~~~y~~------~~~---~~~~~~~~~~~~~~~~~~~~~g~vPvv~~ 234 (474) T protein:vir:10 190 N--G------------------------TDYVYAEFYDN------AYY---YVFRGEGIDALQEVGRYEHLFDYNPLFGV 234 (474) T ss_pred C--c------------------------eEEEEEEEEcC------ceE---EEEeecCCCcccccccccCCCCccceEEe Confidence 0 0 00001112210 000 00000000111111122334577887744 Q ss_pred cccccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEec---ccccc-ccCCcee-EEeCCCCCcccccCC Q lcl|NC_015158. 309 GWRIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKG---DVEEF-VWGPMEQ-IYINGDGDVEMMAPN 383 (581) Q Consensus 309 ~~~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~---d~~~i-~~~pG~v-i~~~~~~~i~~~~~p 383 (581) . ..-+|.|..+.++++++.+|.+...+.+++...++|.+.+.+ +.+.. ....++. +-...++++.++..+ T Consensus 235 ~-----n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g~~~~~~~~~~~~~~~~i~~~~~~~~~~~l~~~ 309 (474) T protein:vir:10 235 P-----NNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLRGMGMSEEMIQETQKSGAFELFDKDMDVKYLTKD 309 (474) T ss_pred c-----CCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCchhhhhhhhcceeEecCCCCceeEEecc Confidence 3 456799999999999999999999999999999999987654 12222 2333444 444667788888877 Q ss_pred CccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Q lcl|NC_015158. 384 TQALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLD 463 (581) Q Consensus 384 ~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d 463 (581) .........+..+...+-..|++|..+.+.-+ ++.||.++...+..+........+.|.. +++++++++..++..... T Consensus 310 ~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~-~n~Sg~Al~~~~~~l~~k~~~~~~~~~~-~l~~~~~li~~~l~~~~~ 387 (474) T protein:vir:10 310 VNDTMIENHLDRIEKNIMRFAKSVNFNSDEFN-GNVPIIGMKLKLMALENKCMTFERKMTA-MLRYQFKVILSALKRKGY 387 (474) T ss_pred CCHHHHHHHHHHHHHHHHHHhCCccccccccc-ccchHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhhccC Confidence 66666777789999999999999988765432 4677777888888888888888888885 578899988877543110 Q ss_pred ccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHHHHHHHHhcC Q lcl|NC_015158. 464 VADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLAKMLEHNLSL 543 (581) Q Consensus 464 ~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~~~e~~~l 543 (581) . .......+++..| .........+.++.++.|. .+ +|.+.+++++ T Consensus 388 ----------~-----~~~~~~~~i~~~f--~~~~p~d~~e~a~~~~kl~---------g~---iS~et~~~~l------ 432 (474) T protein:vir:10 388 ----------N-----LDDDSYLNLIFKF--TRNIPVNKLEESQVLINLK---------GQ---VSERTRLGQS------ 432 (474) T ss_pred ----------C-----CCccccccceEEe--CCCCCCCHHHHHHHHHHHh---------cc---CchHHHHHhC------ Confidence 0 0000112333333 1222334455554444331 12 3333333322 Q ss_pred CCcccccCCCCcHHHHHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 544 GGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 544 ~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) +. .+ ++ ++|.+++..+- .+...+.+-+ T Consensus 433 ~~----v~--d~--~~E~eri~~E~---~e~~~~~~~~ 459 (474) T protein:vir:10 433 QL----VD--DV--DYELDEMEKES---LEFNDKLPDI 459 (474) T ss_pred CC----CC--CH--HHHHHHHHHHH---HHHHhhcccc Confidence 21 12 22 33444443321 1222233333 No 79 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=99.67 E-value=3.2e-14 Score=94.57 Aligned_cols=450 Identities=12% Similarity=0.050 Sum_probs=237.4 Q ss_pred Cccchhhhh-hhccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhccccc--ccccccccccccccccchHHHHH Q lcl|NC_015158. 1 MTGKVLELQ-QMLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTT--TTTNSTLPWKNKTTLPKLCQIRD 77 (581) Q Consensus 1 ~~~~~~~~~-~~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~--~~~~~~~~~k~~~~~pki~~~~d 77 (581) |+- .++ +++++-.+.-...|.+..+.+.. ....+.++.+|+...+.- ..-..+..-.+++.+|...-+++ T Consensus 1 ~~~---~~~~~~~~~~~~~~~~~i~~~i~~~~~----~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~ki~~n~~~~Iv~ 73 (499) T protein:vir:10 1 MAV---VIDKDLLDDVNEPNIEAINYAIRELQN----RKKRLDKLSDYYNGKQEIEKHEFDNATVEAANVMVNHAKYITD 73 (499) T ss_pred Ccc---chhhhHHhhhhcCCHHHHHHHHHHHHH----HHHHHHHHHHHhccccchhcCCcCcCCCCcceeecchHHHHHH Confidence 321 111 12222122223445554444422 244566777787766521 11111111245566777677777 Q ss_pred HHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeee--- Q lcl|NC_015158. 78 NLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDE--- 154 (581) Q Consensus 78 ~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~--- 154 (581) ....+|+ .++- .+.+. +++..+ .+.+.+...+|.....++.+++.+||.|+..+.+........ T Consensus 74 ~~~~~l~----g~p~--~~~~~---~~~~~~----~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~g~~~~~~~ 140 (499) T protein:vir:10 74 MNVGFMT----GNPV--KYVAE---KGKNID----DILEVFNQIDIHKHDIELEKDLSVFGYGYELLYLKKTDPISVRDE 140 (499) T ss_pred HHhhhhc----ccCc--eeecC---ChhHHH----HHHHHHhhcCHhHHHHHHHHHHHhcCceEEEEEeccccccccccc Confidence 7776665 4332 33332 222222 355666777899999999999999999988886643322111 Q ss_pred eeeeEeeeeccceEEecchhheeecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccch Q lcl|NC_015158. 155 ESGATRDTYFGPRAVRIDPKDIVFNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTRE 234 (581) Q Consensus 155 ~~~~~~~~~~~p~ie~V~p~df~~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~ 234 (581) ........+.+.++..|+|.+.|+=-.-..-..--+++|.+.+... + + . T Consensus 141 ~~~~~~~~~~~~~~~~v~p~~~~~v~~d~~~~~~~~~i~~~~~~~~---------~------------------~--~-- 189 (499) T protein:vir:10 141 LGNEKLTPNTELKIEVIDPRATVVVCDDTVEHDPLFAVFTQEKKDL---------E------------------G--N-- 189 (499) T ss_pred ccccccccccceEEEEEcccceEEEecCCCCcceEEEEEEEEEeec---------C------------------C--C-- Confidence 1112223344567888999986432111111111223333333100 0 0 0 Q ss_pred hhhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccccC Q lcl|NC_015158. 235 DCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIRQ 314 (581) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p 314 (581) ......+.|+++.+..+.+.+. +. ...+..++....|| .|..|++.+. T Consensus 190 -----------~~~~~~~iyt~~~i~~~~~~~~------~~--------~~~~~~~~~~~~~~--~g~vPvv~~~----- 237 (499) T protein:vir:10 190 -----------TNGYSITVYMPQRIVEYRTKTT------ME--------VSANDPIVYDGENL--FGAVPIIEFR----- 237 (499) T ss_pred -----------ceEEEEEEEeCCeEEEEEecCC------cc--------ccCcceecccccCC--CCccceEEec----- Confidence 0000011223333333222211 00 00112233334454 5778877544 Q ss_pred CcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc-ccc-----cccCCceeEEeC--CCCCcccccCCCcc Q lcl|NC_015158. 315 DNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD-VEE-----FVWGPMEQIYIN--GDGDVEMMAPNTQA 386 (581) Q Consensus 315 ~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d-~~~-----i~~~pG~vi~~~--~~~~i~~~~~p~~~ 386 (581) .+-+|.|..+.+.++++.+|.+...+.+++...++|++.+.+. .++ .....|.++.+. +++++.++..+... T Consensus 238 n~~~~~~d~e~v~~liD~~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~ 317 (499) T protein:vir:10 238 NNEERQGDFEQLISLIDAYNLLQTDRISDKEAFVDALLVTFGFGLGDDKDDIQRLKRGAIEAPPREEGADIEWLTKSFDE 317 (499) T ss_pred CCCCCCCchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccccccchhhhhhhcceeccCCCCCCcceEEeccCCH Confidence 3567999999999999999999999999999999999887652 111 122457777654 45667888877666 Q ss_pred chhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccc Q lcl|NC_015158. 387 LQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVAD 466 (581) Q Consensus 387 ~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~ 466 (581) ..+...+..+...+-+.|++|..+-+.- .++.++.++..++..+........+.|.. +++++++++..++... T Consensus 318 ~~~~~~~~~l~~~I~~~s~~p~~~~~~~-~gn~Sg~Al~~~~~~l~~k~~~k~~~~~~-~l~~~~~li~~~~~~~----- 390 (499) T protein:vir:10 318 TQVNLLSQSIENDIHKISYVPNMNDEKF-MGNVSGEAMKFKLFGLENLLSIKQRYFFD-GLRRRLKLIQTIVNIK----- 390 (499) T ss_pred HHHHHHHHHHHHHHHHHhCcccCCchhh-cccchHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhcc----- Confidence 6677778999999999999998765432 34667777888888888888888999995 5788999998875321 Q ss_pred eeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHHHHHHHHhcCCCc Q lcl|NC_015158. 467 TIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLAKMLEHNLSLGGW 546 (581) Q Consensus 467 ~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~~~e~~~l~~~ 546 (581) |.. ....+++..| ...-.....+.++.++.| . .+ +|.+.+++ . ++. T Consensus 391 -----~~~--------~d~~~i~i~f--~~~~p~n~~e~~~~~~kl---~------g~---iS~et~~~----~--l~~- 436 (499) T protein:vir:10 391 -----GAN--------DDASGCKISL--VANIPSNLSDVVNNVKNA---D------GI---IPRKYTYS----W--LPD- 436 (499) T ss_pred -----CCc--------cccccceEEe--CCCCCCCHHHHHHHHHHH---h------cc---CChHHHHH----h--CCC- Confidence 110 0011233333 111123344555444443 1 12 33322322 2 222 Q ss_pred ccccCCCCcHHHHHHHHHHHHHHHHHHHHhc-------ccCC Q lcl|NC_015158. 547 DIFKPNVAVMEAQTTSALVNQSQAQIEEEAQ-------VPLV 581 (581) Q Consensus 547 ~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~-------~~~~ 581 (581) .+ ++ +.|.+++.++.+..++..++ .+.- T Consensus 437 ---v~--d~--~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~ 471 (499) T protein:vir:10 437 ---VD--NP--QDVIDEMNQQDAETIKKNQEALRGQDPDRLE 471 (499) T ss_pred ---CC--CH--HHHHHHHHHHHHHHHHHHHhhhccCCCCCCC Confidence 12 12 33344443332222211111 1111 No 80 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=99.67 E-value=5.6e-14 Score=93.23 Aligned_cols=408 Identities=13% Similarity=0.086 Sum_probs=219.0 Q ss_pred hccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhccccccccc--ccccccccccccchHHHHHHHHHHHHHhhc Q lcl|NC_015158. 11 MLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTTTTTN--STLPWKNKTTLPKLCQIRDNLHSNYISALF 88 (581) Q Consensus 11 ~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~~~~~--~~~~~k~~~~~pki~~~~d~~~~~l~~~~f 88 (581) |.. + .|.++.+.++ .|+ ..+.++++|+...+.-.... .+..-.+++.+|-..-+++...++|+ T Consensus 1 l~~---~----~l~~~i~~~~-~~~---~r~~~l~~yy~g~~~il~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~---- 65 (429) T protein:vir:98 1 MTK---D----LLSELIQKHR-SFN---LSYSAYKQLYEGDHAILQQKQKEQYKPDNRLVVNFAKYIVDTFNGYFI---- 65 (429) T ss_pred CCH---H----HHHHHHHHHH-HHH---HHHHHHHHHhccccccccccccccCCCcceeecchHHHHHHHHhhhhc---- Confidence 221 2 2333333332 233 33444555655443111111 11112346777777788888877775 Q ss_pred CCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeeeeEeeeeccceE Q lcl|NC_015158. 89 PNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESGATRDTYFGPRA 168 (581) Q Consensus 89 ~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~~~~~~~~~p~i 168 (581) .++- .+.+ +++ ..++.+++.+..+++...+.++.+++.+||.|+..+.+.+. ..|++ T Consensus 66 g~~~--~~~~---~~~----~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~--------------g~~~~ 122 (429) T protein:vir:98 66 GVPV--QTSH---ENK----QVSNYLELLDGYNDQDDNNAELSKICSIYGHGYELVFNDEN--------------AEAGI 122 (429) T ss_pred ccCc--eeec---CCh----HHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEecCC--------------CcEEE Confidence 4432 2332 222 23335666677788999999999999999999988865311 24678 Q ss_pred Eecchhhee--ecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchhhhhcccccccc Q lcl|NC_015158. 169 VRIDPKDIV--FNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDCEKAVGFSMDG 246 (581) Q Consensus 169 e~V~p~df~--~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 246 (581) ..++|.+++ +|..... ..-+.+|.+.+.+. T Consensus 123 ~~~~p~~~~~v~dd~~~~--~~~~~i~~~~~~~~---------------------------------------------- 154 (429) T protein:vir:98 123 TYLTPLEAFIVYDDSIRQ--KPLFAVRYFYNKGG---------------------------------------------- 154 (429) T ss_pred EEEcccceEEEEeCCCCC--ceEEEEEEEEecCc---------------------------------------------- Confidence 889999974 3322111 11112222211000 Q ss_pred ccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccccCCcccCCCcHHhh Q lcl|NC_015158. 247 FGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIRQDNLYAMGPLDNL 326 (581) Q Consensus 247 ~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p~~~~G~s~~~~l 326 (581) .....++.... ..+|. ..+++ -.++... |.+.|..|++.+. ..-+|.|..+.+ T Consensus 155 -~~~~~~~~~~~---~~~~~----~~~~~------------~~~~~~~--~~~~g~vPvv~~~-----n~~~g~sd~e~v 207 (429) T protein:vir:98 155 -VLEGSYSDASN---ITYFK----DGEKG------------IEIGESE--PHPFDGVPMIEYV-----ENEERQSLLASV 207 (429) T ss_pred -eEEEEEEeCce---EEEEE----ecCCc------------eEecccc--cccCCccceEEec-----CCCCCCCcHHHH Confidence 00001111111 11121 11111 1122222 3345777876543 355799999999 Q ss_pred hhHHHHHHHHHHHHHHHHHHhcCCeEEEecc---cccc-ccCCceeEEeCCCC----CcccccCCCccchhHHHHHHHHH Q lcl|NC_015158. 327 VGMQYRIDHLENLKADVFDLIAFPPMKVKGD---VEEF-VWGPMEQIYINGDG----DVEMMAPNTQALQADMQIQILEA 398 (581) Q Consensus 327 ~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d---~~~i-~~~pG~vi~~~~~~----~i~~~~~p~~~~~~~~~lq~~~~ 398 (581) .++++.++.+...+.+++...++|++.+.+- .+.. ....++++.+.+++ ++.++..+.....+...++.+.. T Consensus 208 ~~liD~~d~~~s~~~~~~~~~~~p~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~ 287 (429) T protein:vir:98 208 VTLINAFNKAISEKANDVEYFADAYLKILGAELDDETLKSLRDTRIINLKDTDAQQLTVEFLQKPDADATQEHLLDRLEN 287 (429) T ss_pred HHHHHHHHHHHHHHHHHHHHhcCceeeeecCCCCcchhhhHhhCceeeccCCCCCCcceeEEeecCCHHHHHHHHHHHHH Confidence 9999999999999999999999999876542 1112 22356777776443 46677766555556677889999 Q ss_pred HHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceeeecCchhccc Q lcl|NC_015158. 399 KMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADTIRVFDSDDKVA 478 (581) Q Consensus 399 ~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~iR~~~~~~~~~ 478 (581) .+-+.|++|..+-+. .++.|+.++...............+.|.. +++++++++..+...... . T Consensus 288 ~i~~~s~~p~~~~~~--~gn~Sg~Al~~~~~~l~~k~~~~~~~~~~-~l~~~~~li~~~~~~~~~---------~----- 350 (429) T protein:vir:98 288 LIFRTAMVANISDES--FGTASGIALRYRLQAMDNLAKTKERKFMS-GMNRRYKLIASYPTSKIG---------P----- 350 (429) T ss_pred HHHHHhCccccCccc--cccchHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhccCCC---------c----- Confidence 999999999765543 35667777877777887888888888885 578888888877432110 0 Q ss_pred CCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHHHHHHHHhcCCCcccccCCCCcHHH Q lcl|NC_015158. 479 TFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLAKMLEHNLSLGGWDIFKPNVAVMEA 558 (581) Q Consensus 479 ~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~~~e~~~l~~~~~~~~~~~~~~~ 558 (581) ....+++..| .........+.++.++.| . .+.|. +.+ .+.+ +. .+ ++ + T Consensus 351 ----~d~~~i~v~f--~~~~p~~~~~~a~~~~kl---~------g~is~---et~----~~~l--~~----v~--d~--~ 398 (429) T protein:vir:98 351 ----KDWIGIKYKF--TRNLPANLLEESQIAGNL---A------GIVSE---ETQ----VGVL--SI----VE--NP--Q 398 (429) T ss_pred ----cccccceEEe--CCCCCcCHHHHHHHHHHH---h------ccCch---HHH----HHhC--CC----CC--CH--H Confidence 0111233232 122223344444333322 1 13333 222 2321 21 12 22 3 Q ss_pred HHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 559 QTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 559 ~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) +|.+++.++.... .+.+|.-+- T Consensus 399 ~E~~ri~~E~~~~-~~~~~~~~~ 420 (429) T protein:vir:98 399 KEIERKNSDKSTL-ISRQAGGLN 420 (429) T ss_pred HHHHHHHHHHHHH-HHHHHhhhc Confidence 4444444432222 223333333 No 81 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=99.66 E-value=4e-14 Score=94.01 Aligned_cols=436 Identities=12% Similarity=0.104 Sum_probs=227.3 Q ss_pred Cccch----------------hhhhhhccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhccccc--cccccc-- Q lcl|NC_015158. 1 MTGKV----------------LELQQMLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTT--TTTNST-- 60 (581) Q Consensus 1 ~~~~~----------------~~~~~~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~--~~~~~~-- 60 (581) ++|.+ .|...+.+ .+ .|.+.-+.....|.+ .+.++++|+...+.- .....+ T Consensus 13 ~~~~~~~~~~~~~n~~~~~~~~e~~~~~~--~~----~i~~~i~~~~~~~~~---r~~~l~~Yy~g~~~i~~~~~~~~~~ 83 (511) T protein:vir:99 13 LRGNINYLFNDEANVVYTYDGTESDLLQN--VN----EVSKYIEHHMDYQRP---RLKVLSDYYEGKTKNLVELTRRKEE 83 (511) T ss_pred hhhhhhhhhhhhhCCccccchhhhhhhcc--HH----HHHHHHHHHHHhhHH---HHHHHHHHhcccCccccccCccccc Confidence 22211 11111111 12 233333333333333 556667776554421 111111 Q ss_pred ccccccccccchHHHHHHHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCce Q lcl|NC_015158. 61 LPWKNKTTLPKLCQIRDNLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNC 140 (581) Q Consensus 61 ~~~k~~~~~pki~~~~d~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~ 140 (581) ..-.+++.+|-..-+++...++|+ +++- ++.+. ++ ..++.+++.+.+++|.....++.+++.+||.| T Consensus 84 ~~~~~ki~~n~~k~Iv~~~~~yl~----g~p~--~~~~~---d~----~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~a 150 (511) T protein:vir:99 84 YMADNRVAHDYASYISDFINGYFL----GNPI--QYQDD---DK----DVLEAIEAFNDLNDVESHNRSLGLDLSIYGKA 150 (511) T ss_pred ccCcceeecchHHHHHHHHHhhhc----ccCc--eeecC---ch----HHHHHHHHHHhhcCHhHHHHHHHHHHHhcCee Confidence 111245667777788888777765 3322 33222 21 23457888888899999999999999999999 Q ss_pred EEEEeeecceeeeeeeeeEeeeeccceEEecchhhee--ecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHH Q lcl|NC_015158. 141 FATVEYVKETTKDEESGATRDTYFGPRAVRIDPKDIV--FNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAI 218 (581) Q Consensus 141 i~k~~~~~~~~~~~~~~~~~~~~~~p~ie~V~p~df~--~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~ 218 (581) +..+++.+. .++++..++|.++| +|++.. ...-+++|.+.+... +.. T Consensus 151 ~~~vy~ded--------------~~~~i~~~~p~~~~~vyd~~~~--~~~~~~vr~~~~~~~---------~~~------ 199 (511) T protein:vir:99 151 YELMIRNQD--------------DETRLYKSDAMSTFVIYDNTIE--RNSIAGVRYLRTKPI---------DKT------ 199 (511) T ss_pred EEEEEeCCC--------------CceEEEEEccceeEEEEcCCCC--CceEEEEEEEEeeec---------ccC------ Confidence 988865321 23678889999975 465432 112223444433100 000 Q ss_pred HHHHhhhccCCcccchhhhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCC Q lcl|NC_015158. 219 ARRREFRRGLGTYTREDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPS 298 (581) Q Consensus 219 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~ 298 (581) . ..... ..+.|+++.+..+..- .++... .+....... |. T Consensus 200 -------------~-----------~~~~~-~~~vyt~~~i~~~~~~------~~~~~~--------~~~~~~~~~--~~ 238 (511) T protein:vir:99 200 -------------D-----------EDEVF-TVDLFTSHGVYRYLTS------RTNGLK--------LTPRENGFE--SH 238 (511) T ss_pred -------------c-----------cceEE-EEEEEeCCcEEEEEec------CCcccc--------ccccccccc--cC Confidence 0 00000 0011222211111100 001100 001111122 33 Q ss_pred ccCCCCeeEecccccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc----cccc-ccCCceeEEe-- Q lcl|NC_015158. 299 WFAQAPIFHCGWRIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD----VEEF-VWGPMEQIYI-- 371 (581) Q Consensus 299 ~~g~~Pf~~~~~~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d----~~~i-~~~pG~vi~~-- 371 (581) +.|..|++.++ ..-+|.|..+.++++++.++.+...+.+.+...++|.+.+.+. ...+ ....++++.. T Consensus 239 ~~g~vPvv~~~-----nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~ 313 (511) T protein:vir:99 239 SFERMPITEFS-----NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEP 313 (511) T ss_pred CCCccceEEec-----CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhchhhhhccCcccCchhhcccccccceeccc Confidence 45677876554 3457999999999999999999999999999999998776542 1111 1122333322 Q ss_pred -----------CCCCCcccccCCCccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015158. 372 -----------NGDGDVEMMAPNTQALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIM 440 (581) Q Consensus 372 -----------~~~~~i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r 440 (581) ..++++.++..+.......+.+..+...+-..|++|..+.+.-+ ++.++.++..++..+........+ T Consensus 314 ~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~-gn~Sg~Alk~~~~~l~~ka~~k~~ 392 (511) T protein:vir:99 314 TVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-GTQSGEAMKYKLFGLEQRTKTKEG 392 (511) T ss_pred ccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccc-ccchHHHHHHHHHHHHHHHHHHHH Confidence 33456777776655556677788899999999999998765432 566777788888888888888899 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccc Q lcl|NC_015158. 441 NFEVMLMEKVLNAMLEISRRNLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPV 520 (581) Q Consensus 441 ~f~~~~~~~li~~~~~f~~~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~ 520 (581) .|.+ +++++++++..++..+.... ......+++..| .........+.++.++.| . T Consensus 393 ~~~~-~l~~~~~li~~~~~~~~~~~---------------~~~~~~~i~i~f--~~~~p~n~~e~~~~~~kl---~---- 447 (511) T protein:vir:99 393 LFTK-GLRRRAKLLETILKNTRSID---------------VSKDFNTVRYVY--NRNLPKSLIEELKAYIDS---G---- 447 (511) T ss_pred HHHH-HHHHHHHHHHHHHHhcCCcc---------------cccccccceEEe--CCCCCcCHHHHHHHHHHH---h---- Confidence 9985 56888898888765432100 000111233233 122223344444433333 1 Q ss_pred cccccchhHHHHHHHHHHHHhcCCCcccccCCCCcHHHHHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 521 WQDIKPHVSTENLAKMLEHNLSLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 521 ~~~i~p~~~~~~l~~~~~e~~~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) .+ +|.+.++ +. ++. .+ + .++|.+++.++-+...+ .++.+.- T Consensus 448 --Gi---iS~et~l----~~--l~~----v~--D--~~~E~~ri~~E~~~~~~-~~~~~~~ 488 (511) T protein:vir:99 448 --GK---ISQTTLM----SL--FSF----FQ--D--PELEVKKIEEDEKESIK-KAQKNMY 488 (511) T ss_pred --cc---CCHHHHH----Hh--CCC----CC--C--HHHHHHHHHHHHHHHHH-HHhhccc Confidence 12 3332222 22 222 12 2 24455555543333222 2222222 No 82 >protein:vir:7430 Length: 563 # NCBI annotation: gp7 # Family: family:all:6920 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818545;genbank:gi:29566982;genbank:GeneID:1260216 Probab=99.65 E-value=3.7e-15 Score=99.71 Aligned_cols=483 Identities=14% Similarity=0.136 Sum_probs=234.7 Q ss_pred hccchhh----hHH-HHHHHHHhhHHhhhhhHHHHHHHHHHHhhcccccccccc-ccccccccc--ccchHHHHHHHHHH Q lcl|NC_015158. 11 MLDDTRD----GLA-EQIANTWQNWNSQRQEWLSQKSELRNYIFATDTTTTTNS-TLPWKNKTT--LPKLCQIRDNLHSN 82 (581) Q Consensus 11 ~~~~~~~----~~a-~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~~~~~~-~~~~k~~~~--~pki~~~~d~~~~~ 82 (581) |-++.+- ..| ..-...|=...+ .+.|.+++.|+..+-+++-.-+ -++++.+.+ .|.-.-.+++.+ T Consensus 1 m~~~~~q~~p~~~~fp~~~a~wV~~~D-----~~RlaaY~ly~d~y~n~~~el~~il~G~dr~~~~~ps~r~~V~~~~-- 73 (563) T protein:vir:74 1 MPYNHKQYDPAKPFLRGGDDNIVDEND-----KNRVRAYDLYENIYLNSAETLKLVLRGDDSVPILMPSGRKIVEAVH-- 73 (563) T ss_pred CCccccccCCCcccccccccccCCHHH-----HHHHHHHHHHHHhhcCchhhhhhhcCCCceeeeccchHHHHHHHHH-- Confidence 3332110 000 111222222222 2356667777666654443322 245555544 444446667643 Q ss_pred HHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeeeeEeee Q lcl|NC_015158. 83 YISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESGATRDT 162 (581) Q Consensus 83 l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~~~~~~ 162 (581) .|++.+--+.++|...++ -..+..|.|+.+..++.++...+.+.-+++++.|=|++++-|.-. +.. T Consensus 74 ---~~Lg~~~~~~Ve~~~~de-~~~~avq~~Lr~~~~~e~l~~~~~~~~r~a~vlGDgvf~l~wDp~----------K~~ 139 (563) T protein:vir:74 74 ---RFLGVGFDYLVEPDMGDE-GIRQSLNAYFRTTFKREAIKAKFTSNKRWGLIRGDAHFYIHADPN----------KKA 139 (563) T ss_pred ---HhcCCCcEEecCccccCc-chHHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccc----------ccc Confidence 334666667778877544 333568999999999999999999999999999999999999421 111 Q ss_pred eccceEEecchhheeecCCCCCcccC--ceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchhhhhcc Q lcl|NC_015158. 163 YFGPRAVRIDPKDIVFNPVAVDFAHS--PKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDCEKAV 240 (581) Q Consensus 163 ~~~p~ie~V~p~df~~DP~a~~~~d~--~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~ 240 (581) -.++++..|+|.-+|+ +.+.+.. .|.++...-. .... +..+.+ ..+..-.+.+++ T Consensus 140 g~R~rv~~vDP~~~fp---~~dpd~v~g~~~v~v~~~~----------~~pd----d~~~~~-~r~~~~~~~lnd----- 196 (563) T protein:vir:74 140 GERISVDEVDPRQIFL---IEDGSTVVGFHMVDIVQDF----------RSPD----DPSKKL-ARRRTFRRVRND----- 196 (563) T ss_pred CCCceEeecCCceeee---ccCCCCcccceeeecccCC----------CCCc----chhccc-eeeeeeeeeeCC----- Confidence 1245666777776655 2222222 2222211110 0000 000000 000000000000 Q ss_pred ccccccccccccccCCceEEEEEEe--eeeecccCC--ceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccccCCc Q lcl|NC_015158. 241 GFSMDGFGNLYDYFQSPYVEVLTFY--GDYHDTQSG--TFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIRQDN 316 (581) Q Consensus 241 ~~~~~~~~~~~~~~~~~~vevlE~~--g~~~d~~~d--~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p~~ 316 (581) .+ .+..-.....|+| |++.+..-+ ......--++.. .+-...+.-|-+.+.+||++++-.|.+++ T Consensus 197 ----eg------~~~~~~~~dae~w~lg~wd~r~~~~~~~~~~~~~~~~~-~~d~e~~~LP~pi~~iPiv~~~tip~~~s 265 (563) T protein:vir:74 197 ----EG------MFTGRISSELTHWTLGNWDDRGAISDEQARRKEQVRSA-QHDEEEEELPEPISQLPLYRWRNKPPQNS 265 (563) T ss_pred ----CC------CccceeeeccchhccccccccCccchhhhcccchhhhh-hhhchhhhccccccCccEEEcCCCCCccc Confidence 00 0111111223444 333222111 111000111110 11112222255667899999999999999 Q ss_pred ccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEec---------cccccccCCceeEEeCCCCCcccccCCCccc Q lcl|NC_015158. 317 LYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKG---------DVEEFVWGPMEQIYINGDGDVEMMAPNTQAL 387 (581) Q Consensus 317 ~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~---------d~~~i~~~pG~vi~~~~~~~i~~~~~p~~~~ 387 (581) +||.|.+..++.+++++|......--.+..+.||++.+.. +..+.+-+||.+|+..++...+.|..-+..+ T Consensus 266 ~WG~S~La~ll~~~~eLn~~~Td~s~i~~~tG~pi~vl~~~~p~d~~~g~~~~w~vgpG~i~El~~~~~~g~l~~v~g~~ 345 (563) T protein:vir:74 266 SWGTSQLEGMETLAYALNQSLTDEDATIVFQGLGMYVTNASAPVDPNTGELTDWNIGPMQIVEIAGNRNDNYFERVSGVQ 345 (563) T ss_pred ccchhhHHHHHHHHHHHhhhhhHHHHHHHhcCCCeEEeccccccccccccccccccCCceeEeccCCccccceeeecchh Confidence 9999999999999999999999888899999999998753 2333445799999999876544444333322 Q ss_pred h---hHHHHHHHHH-HHHHhcCCchHhcCC-CCcccccHHHHHH----HHHHHHH-H--HHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015158. 388 Q---ADMQIQILEA-KMEEFAGAPREAMGI-RTPGEKTAFEVQQ----LQNAAGR-I--FQEKIMNFEVMLMEKVLNAML 455 (581) Q Consensus 388 ~---~~~~lq~~~~-~~ee~TGv~~~~~G~-~~~~~~TAtgv~~----l~~aa~~-~--~~~i~r~f~~~~~~~li~~~~ 455 (581) + +..-+..++. -+...+|+|+.+.|. +.+...+...+.. |.++-+. + +....|.|....+..++.++. T Consensus 346 ~l~~~q~Hm~~l~eral~~~s~tPavA~G~vD~~~~~SGiALeL~L~PL~a~~~ek~l~l~~~mr~~r~~~~~~lL~~~e 425 (563) T protein:vir:74 346 DVSPFQDHMKWIDEKGIAEGSGTPEVAIGRVDVTSAESGISLELQLKPLLAANEEKELEMIVVMDQFLHDWMTMWLPAYE 425 (563) T ss_pred hhHHHHHHHHHHHHHHHHhhccCcceeecccccccccchhhhhhhhhHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2 2333445554 347789999999994 2222333322221 3332222 2 223345554444444443332 Q ss_pred HHHHhhcCccceeeecCchhcccCCCccCHHHhcCCceEE-EecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHH Q lcl|NC_015158. 456 EISRRNLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLR-PVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLA 534 (581) Q Consensus 456 ~f~~~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vv-a~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~ 534 (581) ....- .+-++ +.| -+++-+.-.|. .=|.---..+++.+++...+-+ +..++++... T Consensus 426 rl~~~-g~~~~-------~~g--------~~~~~~~~~v~ivf~p~~P~d~~~vv~~~~tl~~-------aGiiSretAv 482 (563) T protein:vir:74 426 SDFQE-QDGSR-------PFA--------SADLLNECSVVCIFADPMPVNKTQVTQDTLLLQQ-------AHLILRKMAV 482 (563) T ss_pred hHhhh-hcccc-------ccc--------ccccCCceEEEEEeCCCCCccHHHHHHHHHHHHH-------cCchhHHHHH Confidence 21110 11110 111 12232222121 1121112345666666655543 3445655454 Q ss_pred HHHHHHhcCCCcccccCCCCcHHHHHHHHHHH-HHHH--HHHHHhcccCC Q lcl|NC_015158. 535 KMLEHNLSLGGWDIFKPNVAVMEAQTTSALVN-QSQA--QIEEEAQVPLV 581 (581) Q Consensus 535 ~~~~e~~~l~~~~~~~~~~~~~~~~~~q~~~q-~aq~--~~~~~~~~~~~ 581 (581) +++.+. +| +.+++ +++.+++.- +.+. -.|+++--|+= T Consensus 483 ~~L~~~----g~----~~pda--e~e~~~ie~~~i~~~~~a~a~ad~~~~ 522 (563) T protein:vir:74 483 AKLRSI----GW----EYPEV--DDQGNALTDDDIADMLLAEAEADASLG 522 (563) T ss_pred HHHHhC----CC----CCCcH--HHHHhhcCHHHHHHHHHHHhhccCccc Confidence 555443 54 22332 222333211 1111 01112222222 No 83 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=99.65 E-value=3.3e-14 Score=94.47 Aligned_cols=427 Identities=13% Similarity=0.089 Sum_probs=224.6 Q ss_pred CccchhhhhhhccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhccc-ccc----c--ccccccc--cccccccc Q lcl|NC_015158. 1 MTGKVLELQQMLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATD-TTT----T--TNSTLPW--KNKTTLPK 71 (581) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~-~~~----~--~~~~~~~--k~~~~~pk 71 (581) |++. .++++.+ ..+.--..|.++.+.+. .|.+....+ .+|+.-.+ +.. . +...-.. .+++.+|- T Consensus 13 ~~~~--~~~~~~~-~~~~~~~~i~~~i~~~~-~~~~~~~~~---~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~ 85 (474) T protein:vir:97 13 YGEE--VVEQLKP-QFETQEEMIVRLIDDHR-KQLDKITVG---QRYYDKDNDIVKQMKKVDVHGNIDYDKPDWRITTNF 85 (474) T ss_pred hhhH--HHHhhhh-cccCHHHHHHHHHHHHH-HHHHHHHHH---HHHhccccchhcccchhccccccccccCcceeecch Confidence 3222 2333333 33333355555555543 344444444 44543322 100 0 0001111 23567787 Q ss_pred hHHHHHHHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeeccee Q lcl|NC_015158. 72 LCQIRDNLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETT 151 (581) Q Consensus 72 i~~~~d~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~ 151 (581) ...+++...++|+ +.+- ++. .++++..+.++. .+ ..||...+.++.+++.+||.|+..+++... T Consensus 86 ~k~Ivd~~~~~l~----g~p~--~~~---~~d~~~~~~l~~----~~-~n~~~~~~~e~~~~~~~~G~~~~~~~~d~~-- 149 (474) T protein:vir:97 86 HQNLVDQKVSYVA----SKPV--TYS---CEDENVLKVIHD----VL-DTRWDNKLIDILTATSNKGIDWLQVYINEN-- 149 (474) T ss_pred HHHHHHHHHhhhh----cCCc--eec---cCcHHHHHHHHH----HH-hccHHHHHHHHHHHHhhcCceEEEEEecCC-- Confidence 7888888887775 3332 332 233333444443 33 357888899999999999999888765211 Q ss_pred eeeeeeeEeeeeccceEEecchhheeecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcc Q lcl|NC_015158. 152 KDEESGATRDTYFGPRAVRIDPKDIVFNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTY 231 (581) Q Consensus 152 ~~~~~~~~~~~~~~p~ie~V~p~df~~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~ 231 (581) ..+++..++|.++|+--.-....+--+++|.+...+. . T Consensus 150 ------------~~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~-----------~------------------- 187 (474) T protein:vir:97 150 ------------GEMKLFRVPAEQAIPIWVDKEREELKSFIRYYKFNNE-----------E------------------- 187 (474) T ss_pred ------------CeeEEEEEcccceEEEEcCCCCCceEEEEEEEEecCe-----------E------------------- Confidence 1367888999997544221222222223333221100 0 Q ss_pred cchhhhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEeccc Q lcl|NC_015158. 232 TREDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWR 311 (581) Q Consensus 232 ~~~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~ 311 (581) ..+.|+...+..+.+. +++...... .....+....-|-+.|+.|++.+. T Consensus 188 ------------------~~~~yt~~~~~~y~~~-------~~~~~~~~~----~~~~~~~~~~~~~~~g~vPvv~~~-- 236 (474) T protein:vir:97 188 ------------------KVEFWTDTTVTYYVLE-------NGGLIPDYY----YGANHVQSHFSNGNWGRVPFIAFK-- 236 (474) T ss_pred ------------------EEEEEeCCeEEEEEEc-------CCccccccc----cCcCcccccccccCCCccceEEec-- Confidence 0011222222211111 111110000 000011111123346777877543 Q ss_pred ccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc-c---ccc-c-cCCceeEEeCCCCCcccccCCCc Q lcl|NC_015158. 312 IRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD-V---EEF-V-WGPMEQIYINGDGDVEMMAPNTQ 385 (581) Q Consensus 312 ~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d-~---~~i-~-~~pG~vi~~~~~~~i~~~~~p~~ 385 (581) .+-+|.|..+.++++++.+|.+...+.+++...++|++.+.+- . ... . ...++++.+.+++++.++..+.. T Consensus 237 ---nn~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~l~~~~~ 313 (474) T protein:vir:97 237 ---NNPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKGYEGEDLEEFMRGLKYYKAINVDGDGGVETIQVEVP 313 (474) T ss_pred ---CCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhhhhhccceeeccCCCceeEEeecCC Confidence 3457999999999999999999999999999999999877652 1 221 1 13567888899999999887766 Q ss_pred cchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcc Q lcl|NC_015158. 386 ALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVA 465 (581) Q Consensus 386 ~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~ 465 (581) ...+.+.++.+...+-+.|++|..+.+.- .++.||.++..++...........+.|.. +++++++++.+++....+. T Consensus 314 ~~~~~~~~~~l~~~I~~~s~~p~~~~~~~-~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~-~l~~~~~li~~~~~~~~d~- 390 (474) T protein:vir:97 314 VSSTKEYIDLMRVYIMEFGQGVDFQTDKF-GSAPSGIALKFLYGNLDLKANKLKNKATV-AIQELISFIIDFNNLKTDV- 390 (474) T ss_pred HHHHHHHHHHHHHHHHHHhCccccCcccc-ccccHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhCCCccc- Confidence 66677788999999999999998765432 24567777777777777777888888885 6788888888775322111 Q ss_pred ceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHHHHHHHHhcCCC Q lcl|NC_015158. 466 DTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLAKMLEHNLSLGG 545 (581) Q Consensus 466 ~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~~~e~~~l~~ 545 (581) .+|...|. ..-.....+.++ .+.+. .+ ++.+.++++ ++. T Consensus 391 --------------------~~i~v~f~--~~~p~~~~e~a~---~~~~~-------g~---iS~et~l~~------l~~ 429 (474) T protein:vir:97 391 --------------------KDIEISFN--FNRMMNDAEQSQ---IIAQS-------QY---LSRETLVKS------SPL 429 (474) T ss_pred --------------------ceeeEEec--cCcccCHHHHHH---HHHHc-------CC---CCHHHHHHh------CCC Confidence 12322221 111112333333 22221 12 333333222 222 Q ss_pred cccccCCCCcHHHHHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 546 WDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 546 ~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) .+ ++ ++|.+++..+-++. .++.+=+ T Consensus 430 ----v~--D~--~~E~eri~~E~~~~---~~~~~~~ 454 (474) T protein:vir:97 430 ----VD--DY--KAELERIEQEQMEY---NKQLPNL 454 (474) T ss_pred ----CC--CH--HHHHHHHHHHHHHH---Hhhcccc Confidence 12 22 23344433322111 1122222 No 84 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=99.65 E-value=3.3e-14 Score=94.47 Aligned_cols=427 Identities=13% Similarity=0.089 Sum_probs=224.6 Q ss_pred CccchhhhhhhccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhccc-ccc----c--ccccccc--cccccccc Q lcl|NC_015158. 1 MTGKVLELQQMLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATD-TTT----T--TNSTLPW--KNKTTLPK 71 (581) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~-~~~----~--~~~~~~~--k~~~~~pk 71 (581) |++. .++++.+ ..+.--..|.++.+.+. .|.+....+ .+|+.-.+ +.. . +...-.. .+++.+|- T Consensus 13 ~~~~--~~~~~~~-~~~~~~~~i~~~i~~~~-~~~~~~~~~---~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~ 85 (474) T protein:vir:94 13 YGEE--VVEQLKP-QFETQEEMIVRLIDDHR-KQLDKITVG---QRYYDKDNDIVKQMKKVDVHGNIDYDKPDWRITTNF 85 (474) T ss_pred hhhH--HHHhhhh-cccCHHHHHHHHHHHHH-HHHHHHHHH---HHHhccccchhcccchhccccccccccCcceeecch Confidence 3222 2333333 33333355555555543 344444444 44543322 100 0 0001111 23567787 Q ss_pred hHHHHHHHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeeccee Q lcl|NC_015158. 72 LCQIRDNLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETT 151 (581) Q Consensus 72 i~~~~d~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~ 151 (581) ...+++...++|+ +.+- ++. .++++..+.++. .+ ..||...+.++.+++.+||.|+..+++... T Consensus 86 ~k~Ivd~~~~~l~----g~p~--~~~---~~d~~~~~~l~~----~~-~n~~~~~~~e~~~~~~~~G~~~~~~~~d~~-- 149 (474) T protein:vir:94 86 HQNLVDQKVSYVA----SKPV--TYS---CEDENVLKVIHD----VL-DTRWDNKLIDILTATSNKGIDWLQVYINEN-- 149 (474) T ss_pred HHHHHHHHHhhhh----cCCc--eec---cCcHHHHHHHHH----HH-hccHHHHHHHHHHHHhhcCceEEEEEecCC-- Confidence 7888888887775 3332 332 233333444443 33 357888899999999999999888765211 Q ss_pred eeeeeeeEeeeeccceEEecchhheeecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcc Q lcl|NC_015158. 152 KDEESGATRDTYFGPRAVRIDPKDIVFNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTY 231 (581) Q Consensus 152 ~~~~~~~~~~~~~~p~ie~V~p~df~~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~ 231 (581) ..+++..++|.++|+--.-....+--+++|.+...+. . T Consensus 150 ------------~~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~-----------~------------------- 187 (474) T protein:vir:94 150 ------------GEMKLFRVPAEQAIPIWVDKEREELKSFIRYYKFNNE-----------E------------------- 187 (474) T ss_pred ------------CeeEEEEEcccceEEEEcCCCCCceEEEEEEEEecCe-----------E------------------- Confidence 1367888999997544221222222223333221100 0 Q ss_pred cchhhhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEeccc Q lcl|NC_015158. 232 TREDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWR 311 (581) Q Consensus 232 ~~~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~ 311 (581) ..+.|+...+..+.+. +++...... .....+....-|-+.|+.|++.+. T Consensus 188 ------------------~~~~yt~~~~~~y~~~-------~~~~~~~~~----~~~~~~~~~~~~~~~g~vPvv~~~-- 236 (474) T protein:vir:94 188 ------------------KVEFWTDTTVTYYVLE-------NGGLIPDYY----YGANHVQSHFSNGNWGRVPFIAFK-- 236 (474) T ss_pred ------------------EEEEEeCCeEEEEEEc-------CCccccccc----cCcCcccccccccCCCccceEEec-- Confidence 0011222222211111 111110000 000011111123346777877543 Q ss_pred ccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc-c---ccc-c-cCCceeEEeCCCCCcccccCCCc Q lcl|NC_015158. 312 IRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD-V---EEF-V-WGPMEQIYINGDGDVEMMAPNTQ 385 (581) Q Consensus 312 ~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d-~---~~i-~-~~pG~vi~~~~~~~i~~~~~p~~ 385 (581) .+-+|.|..+.++++++.+|.+...+.+++...++|++.+.+- . ... . ...++++.+.+++++.++..+.. T Consensus 237 ---nn~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~l~~~~~ 313 (474) T protein:vir:94 237 ---NNPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKGYEGEDLEEFMRGLKYYKAINVDGDGGVETIQVEVP 313 (474) T ss_pred ---CCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhhhhhccceeeccCCCceeEEeecCC Confidence 3457999999999999999999999999999999999877652 1 221 1 13567888899999999887766 Q ss_pred cchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcc Q lcl|NC_015158. 386 ALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVA 465 (581) Q Consensus 386 ~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~ 465 (581) ...+.+.++.+...+-+.|++|..+.+.- .++.||.++..++...........+.|.. +++++++++.+++....+. T Consensus 314 ~~~~~~~~~~l~~~I~~~s~~p~~~~~~~-~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~-~l~~~~~li~~~~~~~~d~- 390 (474) T protein:vir:94 314 VSSTKEYIDLMRVYIMEFGQGVDFQTDKF-GSAPSGIALKFLYGNLDLKANKLKNKATV-AIQELISFIIDFNNLKTDV- 390 (474) T ss_pred HHHHHHHHHHHHHHHHHHhCccccCcccc-ccccHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhCCCccc- Confidence 66677788999999999999998765432 24567777777777777777888888885 6788888888775322111 Q ss_pred ceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHHHHHHHHhcCCC Q lcl|NC_015158. 466 DTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLAKMLEHNLSLGG 545 (581) Q Consensus 466 ~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~~~e~~~l~~ 545 (581) .+|...|. ..-.....+.++ .+.+. .+ ++.+.++++ ++. T Consensus 391 --------------------~~i~v~f~--~~~p~~~~e~a~---~~~~~-------g~---iS~et~l~~------l~~ 429 (474) T protein:vir:94 391 --------------------KDIEISFN--FNRMMNDAEQSQ---IIAQS-------QY---LSRETLVKS------SPL 429 (474) T ss_pred --------------------ceeeEEec--cCcccCHHHHHH---HHHHc-------CC---CCHHHHHHh------CCC Confidence 12322221 111112333333 22221 12 333333222 222 Q ss_pred cccccCCCCcHHHHHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 546 WDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 546 ~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) .+ ++ ++|.+++..+-++. .++.+=+ T Consensus 430 ----v~--D~--~~E~eri~~E~~~~---~~~~~~~ 454 (474) T protein:vir:94 430 ----VD--DY--KAELERIEQEQMEY---NKQLPNL 454 (474) T ss_pred ----CC--CH--HHHHHHHHHHHHHH---Hhhcccc Confidence 12 22 23344433322111 1122222 No 85 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=99.65 E-value=7.1e-14 Score=92.67 Aligned_cols=437 Identities=12% Similarity=0.104 Sum_probs=224.4 Q ss_pred Cccch----------------hhhhhhccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhccccc--cccccccc Q lcl|NC_015158. 1 MTGKV----------------LELQQMLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTT--TTTNSTLP 62 (581) Q Consensus 1 ~~~~~----------------~~~~~~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~--~~~~~~~~ 62 (581) .+|++ .|...+.+ .+.+...|.+ ....+. ..++++++|+...+.. .....+-. T Consensus 13 ~~~~~~~~~~~~~n~~~~~~~~e~~~~~~--~~~i~~~i~~----~~~~~~---~r~~~l~~Yy~g~~~il~~~~~~~~~ 83 (511) T protein:vir:96 13 LRGNINYLFNDEANVVYTYDGTESDLLQN--VNEVSKYIEH----HMDYQR---PRLKVLSDYYEGKTKNLVELTRRKEE 83 (511) T ss_pred hhhhhhhhhhhhhCCcccccchhhhhhcC--HHHHHHHHHH----HHHhhh---HHHHHHHHHhhccCccccccCccccc Confidence 11110 11111111 1223333332 222333 3456667777554421 11111111 Q ss_pred c--cccccccchHHHHHHHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCce Q lcl|NC_015158. 63 W--KNKTTLPKLCQIRDNLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNC 140 (581) Q Consensus 63 ~--k~~~~~pki~~~~d~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~ 140 (581) + .+++.+|-..-+++...++|+ .++- ++.+ +++ ...+.+++.+..+++.....++.+++.+||.| T Consensus 84 ~~~~~ki~~n~~k~Iv~~~~~yl~----g~p~--~~~~---~d~----~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a 150 (511) T protein:vir:96 84 YMADNRVAHDYASYISDFINGYFL----GNPI--QYQD---DDK----DVLEAIEAFNDLNDVESHNRSLGLDLSIYGKA 150 (511) T ss_pred ccCcceeecchHHHHHHHHhhhhc----ccCc--eeec---Cch----HHHHHHHHHHhhcChhHHHHHHHHHHHhcCee Confidence 1 245667777788888877765 3322 2222 221 23346778888889999999999999999999 Q ss_pred EEEEeeecceeeeeeeeeEeeeeccceEEecchhhee--ecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHH Q lcl|NC_015158. 141 FATVEYVKETTKDEESGATRDTYFGPRAVRIDPKDIV--FNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAI 218 (581) Q Consensus 141 i~k~~~~~~~~~~~~~~~~~~~~~~p~ie~V~p~df~--~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~ 218 (581) +..+++... ..+++..++|.++| +|+... ...-+.+|.+.+... + T Consensus 151 ~~~vy~d~d--------------g~~~i~~~~p~~~~~v~dd~~~--~~~~~~vr~~~~~~~---------~-------- 197 (511) T protein:vir:96 151 YELMIRNQD--------------DETRLYKSDAMSTFIIYDNTVE--RNSIAGVRYLRTKPI---------D-------- 197 (511) T ss_pred EEEEEeCCC--------------CceEEEEEcccceEEEEcCCCC--CceEEEEEEEEeeec---------c-------- Confidence 888765211 13678889999975 454432 112223344333100 0 Q ss_pred HHHHhhhccCCcccchhhhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCC Q lcl|NC_015158. 219 ARRREFRRGLGTYTREDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPS 298 (581) Q Consensus 219 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~ 298 (581) +. . ..... ..+.|+...+..+. ..+++.. .. +... ....|. T Consensus 198 ----------~~-~-----------~~~~~-~~~vyt~~~i~~~~-------~~~~~~~---~~----~~~~--~~~~~~ 238 (511) T protein:vir:96 198 ----------KT-D-----------EDEVF-TVDLFTSHGVYRYL-------TNRTNGL---KL----TPRE--NSFESH 238 (511) T ss_pred ----------cc-c-----------cceEE-EEEEEeCCcEEEEE-------ecCCCcc---cc----cccc--cccccC Confidence 00 0 00000 00112222111110 0111100 00 0011 112233 Q ss_pred ccCCCCeeEecccccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc----ccccc-cCCceeEEe-- Q lcl|NC_015158. 299 WFAQAPIFHCGWRIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD----VEEFV-WGPMEQIYI-- 371 (581) Q Consensus 299 ~~g~~Pf~~~~~~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d----~~~i~-~~pG~vi~~-- 371 (581) ++|..|++.++ ..-+|.|..+.+.++++.+|.+...+.+.+...++|.+.+.+. .+++. ...++++.. T Consensus 239 ~~g~vPvv~~~-----n~~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~ 313 (511) T protein:vir:96 239 SFERMPITEFS-----NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEP 313 (511) T ss_pred cCcccceEEec-----CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecCccCCchhhcccccccceeccc Confidence 45677776543 3457999999999999999999999999999999998876552 22221 122333322 Q ss_pred -----------CCCCCcccccCCCccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015158. 372 -----------NGDGDVEMMAPNTQALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIM 440 (581) Q Consensus 372 -----------~~~~~i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r 440 (581) ..++++.++..+.........+..+...+-..|++|..+.+.-+ ++.+|.++...+..+........+ T Consensus 314 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~-~n~Sg~Al~~~~~~l~~ka~~~~~ 392 (511) T protein:vir:96 314 TVYVDAEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-GTQSGEAMKYKLFGLEQRTKTKEG 392 (511) T ss_pred cceeccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccc-cccHHHHHHHHHHHHHHHHHHHHH Confidence 22345666766655555667788888899999999988766433 466777787778888888888888 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccc Q lcl|NC_015158. 441 NFEVMLMEKVLNAMLEISRRNLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPV 520 (581) Q Consensus 441 ~f~~~~~~~li~~~~~f~~~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~ 520 (581) .|.. +++++++++..++....... . .....+++..| .........+.++.++.| . T Consensus 393 ~f~~-~l~~~~~li~~~~~~~~~~~-------~--------~~~~~~i~~~f--~~~~p~n~~e~~d~~~kl---~---- 447 (511) T protein:vir:96 393 LFTK-GLRRRAKLLETILKNTRSID-------A--------NKDFNTVRYVY--NRNLPKSLIEELKAYIDS---G---- 447 (511) T ss_pred HHHH-HHHHHHHHHHHHHHhcCCCc-------c--------ccccccceEEe--CCCCCcCHHHHHHHHHHH---h---- Confidence 8885 56888888887754321100 0 00111233333 122223334444333333 1 Q ss_pred cccccchhHHHHHHHHHHHHhcCCCcccccCCCCcHHHHHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 521 WQDIKPHVSTENLAKMLEHNLSLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 521 ~~~i~p~~~~~~l~~~~~e~~~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) .+.|. +.++ +. ++. .+ + .++|.+++..+.+.+.+..+..+-. T Consensus 448 --G~iS~---et~l----~~--l~~----v~--d--~~~El~ri~~E~~~~~~~~~~~~~~ 489 (511) T protein:vir:96 448 --GKISQ---TTLM----SL--FSF----FQ--D--PELEVKKIEEDEKESIKKAQKGIYK 489 (511) T ss_pred --ccCCh---HHHH----Hh--CCC----CC--C--HHHHHHHHHHHHHHHHHHHhhcccc Confidence 12233 2222 22 221 22 2 2445555555433333332222221 No 86 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=99.65 E-value=7.1e-14 Score=92.67 Aligned_cols=437 Identities=12% Similarity=0.104 Sum_probs=224.4 Q ss_pred Cccch----------------hhhhhhccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhccccc--cccccccc Q lcl|NC_015158. 1 MTGKV----------------LELQQMLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTT--TTTNSTLP 62 (581) Q Consensus 1 ~~~~~----------------~~~~~~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~--~~~~~~~~ 62 (581) .+|++ .|...+.+ .+.+...|.+ ....+. ..++++++|+...+.. .....+-. T Consensus 13 ~~~~~~~~~~~~~n~~~~~~~~e~~~~~~--~~~i~~~i~~----~~~~~~---~r~~~l~~Yy~g~~~il~~~~~~~~~ 83 (511) T protein:vir:78 13 LRGNINYLFNDEANVVYTYDGTESDLLQN--VNEVSKYIEH----HMDYQR---PRLKVLSDYYEGKTKNLVELTRRKEE 83 (511) T ss_pred hhhhhhhhhhhhhCCcccccchhhhhhcC--HHHHHHHHHH----HHHhhh---HHHHHHHHHhhccCccccccCccccc Confidence 11110 11111111 1223333332 222333 3456667777554421 11111111 Q ss_pred c--cccccccchHHHHHHHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCce Q lcl|NC_015158. 63 W--KNKTTLPKLCQIRDNLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNC 140 (581) Q Consensus 63 ~--k~~~~~pki~~~~d~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~ 140 (581) + .+++.+|-..-+++...++|+ .++- ++.+ +++ ...+.+++.+..+++.....++.+++.+||.| T Consensus 84 ~~~~~ki~~n~~k~Iv~~~~~yl~----g~p~--~~~~---~d~----~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a 150 (511) T protein:vir:78 84 YMADNRVAHDYASYISDFINGYFL----GNPI--QYQD---DDK----DVLEAIEAFNDLNDVESHNRSLGLDLSIYGKA 150 (511) T ss_pred ccCcceeecchHHHHHHHHhhhhc----ccCc--eeec---Cch----HHHHHHHHHHhhcChhHHHHHHHHHHHhcCee Confidence 1 245667777788888877765 3322 2222 221 23346778888889999999999999999999 Q ss_pred EEEEeeecceeeeeeeeeEeeeeccceEEecchhhee--ecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHH Q lcl|NC_015158. 141 FATVEYVKETTKDEESGATRDTYFGPRAVRIDPKDIV--FNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAI 218 (581) Q Consensus 141 i~k~~~~~~~~~~~~~~~~~~~~~~p~ie~V~p~df~--~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~ 218 (581) +..+++... ..+++..++|.++| +|+... ...-+.+|.+.+... + T Consensus 151 ~~~vy~d~d--------------g~~~i~~~~p~~~~~v~dd~~~--~~~~~~vr~~~~~~~---------~-------- 197 (511) T protein:vir:78 151 YELMIRNQD--------------DETRLYKSDAMSTFIIYDNTVE--RNSIAGVRYLRTKPI---------D-------- 197 (511) T ss_pred EEEEEeCCC--------------CceEEEEEcccceEEEEcCCCC--CceEEEEEEEEeeec---------c-------- Confidence 888765211 13678889999975 454432 112223344333100 0 Q ss_pred HHHHhhhccCCcccchhhhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCC Q lcl|NC_015158. 219 ARRREFRRGLGTYTREDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPS 298 (581) Q Consensus 219 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~ 298 (581) +. . ..... ..+.|+...+..+. ..+++.. .. +... ....|. T Consensus 198 ----------~~-~-----------~~~~~-~~~vyt~~~i~~~~-------~~~~~~~---~~----~~~~--~~~~~~ 238 (511) T protein:vir:78 198 ----------KT-D-----------EDEVF-TVDLFTSHGVYRYL-------TNRTNGL---KL----TPRE--NSFESH 238 (511) T ss_pred ----------cc-c-----------cceEE-EEEEEeCCcEEEEE-------ecCCCcc---cc----cccc--cccccC Confidence 00 0 00000 00112222111110 0111100 00 0011 112233 Q ss_pred ccCCCCeeEecccccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc----ccccc-cCCceeEEe-- Q lcl|NC_015158. 299 WFAQAPIFHCGWRIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD----VEEFV-WGPMEQIYI-- 371 (581) Q Consensus 299 ~~g~~Pf~~~~~~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d----~~~i~-~~pG~vi~~-- 371 (581) ++|..|++.++ ..-+|.|..+.+.++++.+|.+...+.+.+...++|.+.+.+. .+++. ...++++.. T Consensus 239 ~~g~vPvv~~~-----n~~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~ 313 (511) T protein:vir:78 239 SFERMPITEFS-----NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEP 313 (511) T ss_pred cCcccceEEec-----CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecCccCCchhhcccccccceeccc Confidence 45677776543 3457999999999999999999999999999999998876552 22221 122333322 Q ss_pred -----------CCCCCcccccCCCccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015158. 372 -----------NGDGDVEMMAPNTQALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIM 440 (581) Q Consensus 372 -----------~~~~~i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r 440 (581) ..++++.++..+.........+..+...+-..|++|..+.+.-+ ++.+|.++...+..+........+ T Consensus 314 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~-~n~Sg~Al~~~~~~l~~ka~~~~~ 392 (511) T protein:vir:78 314 TVYVDAEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-GTQSGEAMKYKLFGLEQRTKTKEG 392 (511) T ss_pred cceeccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccc-cccHHHHHHHHHHHHHHHHHHHHH Confidence 22345666766655555667788888899999999988766433 466777787778888888888888 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccc Q lcl|NC_015158. 441 NFEVMLMEKVLNAMLEISRRNLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPV 520 (581) Q Consensus 441 ~f~~~~~~~li~~~~~f~~~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~ 520 (581) .|.. +++++++++..++....... . .....+++..| .........+.++.++.| . T Consensus 393 ~f~~-~l~~~~~li~~~~~~~~~~~-------~--------~~~~~~i~~~f--~~~~p~n~~e~~d~~~kl---~---- 447 (511) T protein:vir:78 393 LFTK-GLRRRAKLLETILKNTRSID-------A--------NKDFNTVRYVY--NRNLPKSLIEELKAYIDS---G---- 447 (511) T ss_pred HHHH-HHHHHHHHHHHHHHhcCCCc-------c--------ccccccceEEe--CCCCCcCHHHHHHHHHHH---h---- Confidence 8885 56888888887754321100 0 00111233333 122223334444333333 1 Q ss_pred cccccchhHHHHHHHHHHHHhcCCCcccccCCCCcHHHHHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 521 WQDIKPHVSTENLAKMLEHNLSLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 521 ~~~i~p~~~~~~l~~~~~e~~~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) .+.|. +.++ +. ++. .+ + .++|.+++..+.+.+.+..+..+-. T Consensus 448 --G~iS~---et~l----~~--l~~----v~--d--~~~El~ri~~E~~~~~~~~~~~~~~ 489 (511) T protein:vir:78 448 --GKISQ---TTLM----SL--FSF----FQ--D--PELEVKKIEEDEKESIKKAQKGIYK 489 (511) T ss_pred --ccCCh---HHHH----Hh--CCC----CC--C--HHHHHHHHHHHHHHHHHHHhhcccc Confidence 12233 2222 22 221 22 2 2445555555433333332222221 No 87 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=99.65 E-value=9.5e-14 Score=91.99 Aligned_cols=432 Identities=11% Similarity=0.085 Sum_probs=219.8 Q ss_pred Cccch--hhhhh---hccchhhhH-HHHHHHHHhhHHhhhhhHHHHHHHHHHHhhcccc-cccccccccccccccccchH Q lcl|NC_015158. 1 MTGKV--LELQQ---MLDDTRDGL-AEQIANTWQNWNSQRQEWLSQKSELRNYIFATDT-TTTTNSTLPWKNKTTLPKLC 73 (581) Q Consensus 1 ~~~~~--~~~~~---~~~~~~~~~-a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~-~~~~~~~~~~k~~~~~pki~ 73 (581) +++.- .|=++ |-. ...+ ...|.+.-+.+...+. ..+.++++|+.--+. .+....+..-.+++.+|-.. T Consensus 4 ~~~~~~~~~~~~~~~~~~--~~~~~~~~i~~~i~~~~~~~~---~~~~~l~~Yy~g~~~i~~~~~~~~~~~~ki~~n~~~ 78 (470) T protein:vir:99 4 INYGRDKVTGNSSFIFPK--GEKLTSNELLGFIAYNETVLK---PRYRENMKLYLGKHKILTAPEKETGADNRIVVNSAK 78 (470) T ss_pred ccCCcccccCCceEEeCC--CCCcCHHHHHHHHHHHHHhhH---HHHHHHHHHhccccccccCcccccCCcceeecchHH Confidence 11100 00000 000 0011 1123333232222222 245556666654331 11111111113456677777 Q ss_pred HHHHHHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeee Q lcl|NC_015158. 74 QIRDNLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKD 153 (581) Q Consensus 74 ~~~d~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~ 153 (581) .+++...++|+ +++- +|... +|.+. .+.+.+.+..++|...+.++++++.+||.|+..+++... T Consensus 79 ~Ivd~~~~~l~----g~p~--~~~~~--~d~~~----~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~d---- 142 (470) T protein:vir:99 79 YVVDVYNGYFC----GIEP--KLALL--NDSSK----IDEIARWNRQENFFDTINEISKQCDIFGRSIASIYQGED---- 142 (470) T ss_pred HHHHHHhhhhc----cCCe--eEeeC--CchhH----HHHHHHHHHhcCHhHHHHHHHHHHHhcCeeEEEEEeCCC---- Confidence 78887777765 3332 22221 12122 224556667789999999999999999999888765321 Q ss_pred eeeeeEeeeeccceEEecchhhee--ecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcc Q lcl|NC_015158. 154 EESGATRDTYFGPRAVRIDPKDIV--FNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTY 231 (581) Q Consensus 154 ~~~~~~~~~~~~p~ie~V~p~df~--~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~ 231 (581) ..|++..++|.++| +|+.... ..-+++|.+....+ T Consensus 143 ----------g~~~i~~~~p~~~~~i~d~~~~~--~~~~~vr~~~~~~~------------------------------- 179 (470) T protein:vir:99 143 ----------ARPHLMYSSPNHAFIIYDDTVQR--QPLAFVHYQIDNSN------------------------------- 179 (470) T ss_pred ----------CeEEEEEEccceeEEEEcCCCCc--ceEEEEEEEEEecC------------------------------- Confidence 13678889999974 4443221 11112232221000 Q ss_pred cchhhhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEeccc Q lcl|NC_015158. 232 TREDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWR 311 (581) Q Consensus 232 ~~~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~ 311 (581) +... -+. .. +... +++.+.+. +++. .........| +.|..|++.+. T Consensus 180 ~~~~----------~~~--~~-~~~~--~~~~~~~~-----~~~~----------~~~~~~~~~~--~~g~vPvv~~~-- 225 (470) T protein:vir:99 180 NWTD----------AYG--VI-QYAD--KFYKFKGY-----DIEE----------DTNAAGYAIN--PYGLVPAVEFF-- 225 (470) T ss_pred CeeE----------EEE--EE-EecC--eEEEEEec-----cccc----------cccccccccc--CCCccceEeec-- Confidence 0000 000 00 0001 11111110 0000 0011112234 45777876443 Q ss_pred ccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc--c--cc---c-ccCCceeEEeC-----CCCCcc Q lcl|NC_015158. 312 IRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD--V--EE---F-VWGPMEQIYIN-----GDGDVE 378 (581) Q Consensus 312 ~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d--~--~~---i-~~~pG~vi~~~-----~~~~i~ 378 (581) ..-+|.|..+.++++++.+|.+...+.+++...++|++.+.+. . ++ + ....++++... .++++. T Consensus 226 ---n~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 302 (470) T protein:vir:99 226 ---ENEERQGIFDSIKTLINALDKVISQKANQVEYFDNAYMYMIGFKLPEDDEGNPKFDFKNNRVLYVSQLDPDTNPQIG 302 (470) T ss_pred ---CCCCCCcchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccccccchhhhhhhcceeeecCCCCCCCCcce Confidence 4558999999999999999999999999999999999887642 1 11 1 12344555443 345677 Q ss_pred cccCCCccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015158. 379 MMAPNTQALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEIS 458 (581) Q Consensus 379 ~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~ 458 (581) ++..+.......+.++.+...+-..||+|..+.+.. .++.||.++...............+.|.. +++++++++..++ T Consensus 303 ~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~-~l~~~~~li~~~~ 380 (470) T protein:vir:99 303 FIAKPDADQMQENLIQHLTDFIFMMAMVPNIQDKNF-AGNSSGVALQYKLFAMKNKADSKERKFDK-SLMQLYRIVLATL 380 (470) T ss_pred EEeecCChHHHHHHHHHHHHHHHHHhCCcccccccc-ccCchHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHH Confidence 887776666667778889999999999998765543 24567777777777777777888888885 5688888887775 Q ss_pred HhhcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHHHHHH Q lcl|NC_015158. 459 RRNLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLAKMLE 538 (581) Q Consensus 459 ~~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~~~ 538 (581) ...... .+ ...++...| ...-.....+.++.++.|. .+.|. +.++ T Consensus 381 ~~~~~~--------~~---------~~~~i~v~f--~~~~p~~~~e~a~~~~kl~---------giis~---et~l---- 425 (470) T protein:vir:99 381 FNNKQD--------QE---------LWSELDFKF--TRNLPEDMASAIDNAKNAE---------GIVSK---KTQL---- 425 (470) T ss_pred hccCCc--------cc---------ccccceEEe--CCCCCcCHHHHHHHHHHHh---------ccCCH---HHHH---- Confidence 332110 00 011222222 1111223455554444331 12232 2222 Q ss_pred HHhcCCCcccccCCCCcHHHHHHHHHHHHHHHHHHHHhccc-CC Q lcl|NC_015158. 539 HNLSLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQVP-LV 581 (581) Q Consensus 539 e~~~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~-~~ 581 (581) +. ++ +.++ +.|.+++.++.....+..++.. -. T Consensus 426 ~~--l~-------~vd~--~~E~eri~~E~~~~~~~~~~~~~~~ 458 (470) T protein:vir:99 426 GM--IP-------DIEP--DAEMKQIAKEKADAIKQTQQLSMPI 458 (470) T ss_pred Hh--CC-------CCCH--HHHHHHHHHHHHHHHHHHHhhcCCC Confidence 22 22 2222 3445555544333333333222 12 No 88 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=99.65 E-value=2.1e-14 Score=95.53 Aligned_cols=460 Identities=11% Similarity=0.083 Sum_probs=216.1 Q ss_pred Cccchh--------------hhhhhccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhcccccc--ccccccccc Q lcl|NC_015158. 1 MTGKVL--------------ELQQMLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTTT--TTNSTLPWK 64 (581) Q Consensus 1 ~~~~~~--------------~~~~~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~~--~~~~~~~~k 64 (581) |-.+++ .|++.++. .+.+ ....+....+.|.++ |-...+..+ +.......+ T Consensus 3 ~~~~~k~~~~~~~~~~~~~~~~~~~~~~--~~i~---------~~~~~~~ri~~~~~~--y~g~~~~~~~~~~~~~~~~~ 69 (508) T protein:vir:15 3 LIQRIKDLFWKGAAATGVTGSLSKITDD--PRIS---------IDPDEYVRIQTDLDY--YSDKLQYIHYQASDGIKKKR 69 (508) T ss_pred hHHHHHHHHHHHHHHhccccchHHhhcc--cccc---------cCHHHHHHHHHHHHH--hcCCCcccccccCCCCcccc Confidence 111221 23333321 0010 112222333444332 222211111 111111123 Q ss_pred ccccccchHHHHHHHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEE Q lcl|NC_015158. 65 NKTTLPKLCQIRDNLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATV 144 (581) Q Consensus 65 ~~~~~pki~~~~d~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~ 144 (581) .++++|-...+++.+...++ +-..=+++. ++ +..++.|+..|.+.+|...+.+.+.++..+|.+++|. T Consensus 70 ~~~sln~~~~i~~~~A~lv~----~e~~~i~v~----~~----~~~~e~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~ 137 (508) T protein:vir:15 70 LKNTINMAKTAARRIASVVF----NEKAEIHVK----DN----NEADKFLNDVLEDNDFKNKFEEALEKGVALGGFAMRP 137 (508) T ss_pred ceeecchHHHHHHHHHhhhh----CCCceEEeC----Cc----hHHHHHHHHHHHhccHHHHHHHHHHHHhhcCceEEEE Confidence 33445544555555554443 322222221 11 1234578888899999999999999999999999999 Q ss_pred eeecceeeeeeeeeEeeeeccceEEecchhheee-cCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHh Q lcl|NC_015158. 145 EYVKETTKDEESGATRDTYFGPRAVRIDPKDIVF-NPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRRE 223 (581) Q Consensus 145 ~~~~~~~~~~~~~~~~~~~~~p~ie~V~p~df~~-DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~ 223 (581) .|... +++|+.|+|..||| .=....+..|.|+.+...+... +...|... T Consensus 138 ~~d~~---------------~~~i~~v~ad~~~P~~~d~~~~~~~af~~~~~~~~~~--------~~~~yt~l------- 187 (508) T protein:vir:15 138 YIDGN---------------HIKIAWVRADQFYPLQSNTNDISEAAIASRTQRTESN--------QTKYYTLL------- 187 (508) T ss_pred EEeCC---------------eeEEEEEcCCeeEEEEEcCCCeEEEEEEEEEEeecCC--------CceEEEEE------- Confidence 88421 46789999999986 2223446666666555443100 00001000 Q ss_pred hhccCCcccchhhhhccccccccccccccccCCceEEEEEEeeeeecccC-CceeeeeEEEEEeCCEEEEeecCC--Ccc Q lcl|NC_015158. 224 FRRGLGTYTREDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQS-GTFKRNMKVTIIDRMFVIEEKENP--SWF 300 (581) Q Consensus 224 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~-d~~~e~~~itv~~g~~iir~~~nP--~~~ 300 (581) ..+...+ + +.++|+ ++.|-. ....+ |.. ...-++=.-..+ ++.- ... T Consensus 188 -----E~h~~~~---------~---------~~~~I~-n~ly~~-~~~~~lG~~--v~l~~~~e~~~l---~~~~~~~g~ 237 (508) T protein:vir:15 188 -----EFHQWQD---------N---------GSYQIT-NELYKS-DSPDIVGNQ--VPLSTLPVYKEL---APQVTISGL 237 (508) T ss_pred -----EEEEEec---------C---------cceEEE-EEEEec-CCchhcCcc--cchhhcccccCC---CcceEecCC Confidence 0000000 0 001111 111100 00000 000 000000000000 0000 013 Q ss_pred CCCCeeEecc----cccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc-----cccc-ccCCc-eeE Q lcl|NC_015158. 301 AQAPIFHCGW----RIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD-----VEEF-VWGPM-EQI 369 (581) Q Consensus 301 g~~Pf~~~~~----~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d-----~~~i-~~~pG-~vi 369 (581) .++||.++.. ...++|.+|.|+.+.++++++.+|....+..+.+. .+.+++.|.++ .+.. ...++ .++ T Consensus 238 ~~p~f~y~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~-~~~~~i~v~~~~l~~d~~~~~~~~~~~~~~ 316 (508) T protein:vir:15 238 QRPLFAYFKTPGANNINIESPLGLGVVDNAKHVLDDINDTHDQFIWEIR-LGQKHIAVQPGMLRFDDEHKPTFDTEQNVY 316 (508) T ss_pred CcceeEEecCCccccccCCCCcCCchHhhhHHHHHHHHHHHHHHHHHHH-hcccceeechHHhcCCCCCccccCCCCeeE Confidence 3566665542 12237899999999999999999999999999995 54555555332 1110 01122 223 Q ss_pred E-eC--CC--CCcccccCCCccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015158. 370 Y-IN--GD--GDVEMMAPNTQALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEV 444 (581) Q Consensus 370 ~-~~--~~--~~i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~ 444 (581) + ++ .+ ..++.+++.-...+....++.+...++...|++....|.++.+.+|||++....++.-.....+.+.|.. T Consensus 317 ~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~~~~~~gls~~~f~~~~~~~~TAtei~s~~~~~~~t~~~~~~~~~~ 396 (508) T protein:vir:15 317 VGVLSDDNNGLGVKDMTTPIRTVQYKDAIDHFIKEFEVQIGLSTGTFSYSNDGVKTATEVVSNNSMTYQTRSSYLTMVEK 396 (508) T ss_pred EeccCCCCCCCceeEeecccChHHHHHHHHHHHHHHHHHhCCCchhcccccCccccHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2 11 11 2355555554445566678888889999999999999988888899999988888888888888888885 Q ss_pred HHHHHHHHHHHHHHHhhcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccc Q lcl|NC_015158. 445 MLMEKVLNAMLEISRRNLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDI 524 (581) Q Consensus 445 ~~~~~li~~~~~f~~~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i 524 (581) .+++|++.|+.++..+.-...-+-.+.. -..+..-++..+|. .+. ...+++.++.+.++-. +++ T Consensus 397 -al~~lv~~il~l~~~~~~~~~g~~~~~~------~~~~~~~~v~v~f~---D~i--~~d~~~~~~~~~~~v~----aGi 460 (508) T protein:vir:15 397 -AIDELCQSIFELANAGALFDDGKPLFTL------DSASQPLDIECHFD---DGV--FVNKDKQLEEDAKVLA----IGA 460 (508) T ss_pred -HHHHHHHHHHHHHHHhcccccccccccc------ccccCCcceEEEeC---CCC--CCCHHHHHHHHHHHHh----cCC Confidence 5788999998887654211100000000 00001112222221 110 1122333333333321 123 Q ss_pred cchhHHHHHHHHHHHHhcCCCcccccCCCCcHHHHHHHHHHHHHHHHHH-HHhcccCC Q lcl|NC_015158. 525 KPHVSTENLAKMLEHNLSLGGWDIFKPNVAVMEAQTTSALVNQSQAQIE-EEAQVPLV 581 (581) Q Consensus 525 ~p~~~~~~l~~~~~e~~~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~-~~~~~~~~ 581 (581) .|.. .+ +.+ ++++ ++++.++. +++.+++.. ..-+.+.. T Consensus 461 ~s~e---~~---i~~---~~g~---------~deea~~e-l~ri~~E~~~~~~~~~~~ 499 (508) T protein:vir:15 461 LSKQ---TF---LQR---NYGM---------TDEQAAEE-LAKIQSEAPTDTFEGGRS 499 (508) T ss_pred CCHH---HH---HHh---cCCC---------ChHHHHHH-HHHHHHhccccCcccccc Confidence 3331 11 112 2332 23333333 222222211 11111212 No 89 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=99.64 E-value=5.9e-14 Score=93.12 Aligned_cols=432 Identities=10% Similarity=0.044 Sum_probs=228.1 Q ss_pred Cccchhh--hhhhccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhcccc---cc--cccc--cccc--cccccc Q lcl|NC_015158. 1 MTGKVLE--LQQMLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDT---TT--TTNS--TLPW--KNKTTL 69 (581) Q Consensus 1 ~~~~~~~--~~~~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~---~~--~~~~--~~~~--k~~~~~ 69 (581) +..+..+ ++++-+ .-+-....|.++.+.+.. ....+.++++|+.-.+. +. .... ...+ .+++.+ T Consensus 8 ~~~~~~~~~~~~~~~-~~~~~~~~i~~~i~~~~~----~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~ 82 (478) T protein:vir:10 8 WDKPYHEQVVEQIKP-KYETQEEMILRLVREHKE----NIDNITMGERYYNHHPDILDAPFKRDVNGDYDETKPDWRMYT 82 (478) T ss_pred CCchhhhHHHHHhhh-ccCChHHHHHHHHHHHHH----HHHHHHHHHHHhcccccccccchhhhcccccccccccceecc Confidence 3333332 333332 112234456666664432 23456777777765431 10 0000 0011 235667 Q ss_pred cchHHHHHHHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecc Q lcl|NC_015158. 70 PKLCQIRDNLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKE 149 (581) Q Consensus 70 pki~~~~d~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~ 149 (581) |-...+++...++++ .++- ++.+ ++++..+.++. .+. .+|.....++.+++.+||.|+..+.+... T Consensus 83 n~~k~ivd~~~~yl~----g~p~--~~~~---~~~~~~~~l~~----~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~ 148 (478) T protein:vir:10 83 NYHQNLVDQKVAYAV----ANPV--TFGV---DNDKALKQIQH----TLN-HKWDDKLVDILTAASNKGIEWVQPYVDEE 148 (478) T ss_pred chHHHHHHHHhhhhc----ccCc--eeec---CChHHHHHHHH----HHh-ccHHHHHHHHHHHHhhCCeEEEEEEecCC Confidence 777778888877776 3332 3322 33333344333 332 57888999999999999999988865321 Q ss_pred eeeeeeeeeEeeeeccceEEecchhheee--cCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhcc Q lcl|NC_015158. 150 TTKDEESGATRDTYFGPRAVRIDPKDIVF--NPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRG 227 (581) Q Consensus 150 ~~~~~~~~~~~~~~~~p~ie~V~p~df~~--DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~ 227 (581) .++++..++|.++++ |+... .+--+++|.+.+... . T Consensus 149 --------------~~~~~~~~~p~~~~~v~d~~~~--~~~~~~ir~~~~~~~-----------~--------------- 186 (478) T protein:vir:10 149 --------------GEFKTFRVPAEQAVPIWTNKER--DELQAFIRVYELDGA-----------E--------------- 186 (478) T ss_pred --------------CceEEEEEcccceEEEEcCCCC--CceEEEEEEEeeeCc-----------e--------------- Confidence 246788899999754 43322 222223333322100 0 Q ss_pred CCcccchhhhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeE Q lcl|NC_015158. 228 LGTYTREDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFH 307 (581) Q Consensus 228 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~ 307 (581) -.++|+.+.+..+.+.+ +.+..........-.........|.+.|+.|++. T Consensus 187 ----------------------~~~~y~~~~i~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~ 237 (478) T protein:vir:10 187 ----------------------RVEYWTKDDVTFYELKE-------GQLIPDFYRSEDHIQPHYYQGNKLMSWGRVPFIP 237 (478) T ss_pred ----------------------EEEEEeCCcEEEEEecC-------CeeeccccccccccccceecccccccCCcceEEE Confidence 00112222222222211 1100000000000001111122355678888775 Q ss_pred ecccccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEec-ccc---ccc--cCCceeEEeC--CCCCccc Q lcl|NC_015158. 308 CGWRIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKG-DVE---EFV--WGPMEQIYIN--GDGDVEM 379 (581) Q Consensus 308 ~~~~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~-d~~---~i~--~~pG~vi~~~--~~~~i~~ 379 (581) +.. +-.|.|..+.+.++++.+|.+...+.+++....+|++.+.+ +.+ +.. ...++++.+. .++++.+ T Consensus 238 ~~n-----~~~g~sd~e~v~~liDa~~~~~S~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 312 (478) T protein:vir:10 238 FKN-----NPQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILKGYEGEDMKDFMHNLKYYKAISVAGESGSGVDT 312 (478) T ss_pred ecc-----CCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCcceeeecCCcccccchhhhhhhCceeEecCCCCCcceE Confidence 554 44699999999999999999999999999999999987754 212 111 1234555553 5577888 Q ss_pred ccCCCccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015158. 380 MAPNTQALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISR 459 (581) Q Consensus 380 ~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~ 459 (581) +..+.....+...++.+.+.+-+.|++|.++.+.. +++.|+.++..++..+........+.|.. +++++++++++++. T Consensus 313 l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~-~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~-~l~~~~~li~~~~~ 390 (478) T protein:vir:10 313 IKVEVPIDSVKEYTKMLRDYIIEFGQGVDFQQDKF-GNSPSGIALKFMYSNLDLKANKLKNKTLT-ALQELLQYIIDFYR 390 (478) T ss_pred EeecCCHHHHHHHHHHHHHHHHHHhCCcCcCcccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhC Confidence 88776666677778999999999999998766533 24567777777788888888888888885 67888888887753 Q ss_pred hhcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHHHHHHH Q lcl|NC_015158. 460 RNLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLAKMLEH 539 (581) Q Consensus 460 ~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~~~e 539 (581) ...+ ..++...|. ..-.....+.++.++.+ . .+.|. +.++ + T Consensus 391 ~~~d---------------------~~~i~i~f~--~~~p~~~~e~~~~~~~~---~------g~iS~---et~i----~ 431 (478) T protein:vir:10 391 LDVR---------------------VQDIEITFN--FNVMVNELENSQIAMNS---T------GLLSK---ETIL----G 431 (478) T ss_pred CCcc---------------------cccceEEeC--CCCCCCHHHHHHHHHHH---h------CCCCh---HHHH----H Confidence 2211 122332321 11112233333322222 1 13232 2222 2 Q ss_pred HhcCCCcccccCCCCcHHHHHHHHHHHHHHHHHHHHhccc-CC Q lcl|NC_015158. 540 NLSLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQVP-LV 581 (581) Q Consensus 540 ~~~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~-~~ 581 (581) . ++. .. + .++|.++++++... ...+.+ +. T Consensus 432 ~--~~~----v~--d--~~~E~~ri~~E~~~---~~~~~~~~~ 461 (478) T protein:vir:10 432 N--HSW----VQ--D--PVAEMERIEQENIE---LNQQLPDIE 461 (478) T ss_pred h--CCC----CC--C--HHHHHHHHHHHHHH---HHHhccccC Confidence 2 222 11 1 23344443332211 122222 22 No 90 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=99.63 E-value=7.9e-14 Score=92.42 Aligned_cols=437 Identities=12% Similarity=0.082 Sum_probs=215.5 Q ss_pred CccchhhhhhhccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhccc-ccccccccccc--cccccccchHHHHH Q lcl|NC_015158. 1 MTGKVLELQQMLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATD-TTTTTNSTLPW--KNKTTLPKLCQIRD 77 (581) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~-~~~~~~~~~~~--k~~~~~pki~~~~d 77 (581) ||+-+--+.- .+-++++..|.+.++.+ ++. +.+++.|+...+ .+..+...-+. ..++.++-+.-+++ T Consensus 1 ~~~~~~~~~~---~~~~~~~~~l~~~~~~~----~~r---l~~l~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd 70 (484) T protein:vir:77 1 MTSPLQKQEN---VDPEKAREEMLNLFTER----TQD---LGDNTAYYESERRPDAVGVTVPQQMQKLLAHVGYPRLYID 70 (484) T ss_pred CCCcccccCC---CCHHHHHHHHHHHHHHH----HHH---HHHHHHHHhccccchhcccccchhHHhhhhhcCcHHHHHH Confidence 6654444333 23367777787777643 222 334455554432 22222221111 11233454455566 Q ss_pred HHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeee Q lcl|NC_015158. 78 NLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESG 157 (581) Q Consensus 78 ~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~ 157 (581) .+..++. ++. |.+ .++ +. ..+.+++...+++|.....++++++.+||.|++.+.+.... . T Consensus 71 ~~~~~l~----~~g--~~~----~~~-~~---~~~~l~~i~~~N~~d~~~~~~~~~a~~~G~a~~~v~~~~~~------~ 130 (484) T protein:vir:77 71 AIAARQE----LEG--FRL----GGA-DK---ADEQLWDWWQANDLDIESTLGHTDSLVHGRSYITISKPDPN------I 130 (484) T ss_pred HHHhhhc----cCc--eec----CCc-ch---hHHHHHHHHHhcCHhHHHHHHHHHHhhcCceEEEEecCCCC------c Confidence 6655543 221 111 122 11 13356666788999999999999999999999998764221 0 Q ss_pred eEeeeeccceEEecchhhe--eecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchh Q lcl|NC_015158. 158 ATRDTYFGPRAVRIDPKDI--VFNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTRED 235 (581) Q Consensus 158 ~~~~~~~~p~ie~V~p~df--~~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 235 (581) ........|.|..+||.++ ++||.... .-+.+|...+.. . T Consensus 131 ~~~~~~~~~~i~~~~p~~~~~~~D~~~~~---~~~a~~~~~~~~-----------~------------------------ 172 (484) T protein:vir:77 131 DPGVDPEVPIIRVEPPTNLYAQIDPRTRQ---VMRAIRAIEDEE-----------G------------------------ 172 (484) T ss_pred ccccccccceEEEeccceeEEEecCCCCc---eEEEEEEEEeec-----------C------------------------ Confidence 1111223467888899997 46665322 222223222200 0 Q ss_pred hhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccccCC Q lcl|NC_015158. 236 CEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIRQD 315 (581) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p~ 315 (581) +.......|..+. ++.|. ..+|++ ..+....| ++|+.|++.+...+..+ T Consensus 173 ----------~~~~~~~~y~~~~--~~~~~-----~~~~~~------------~~~~~~~~--~~g~vPvv~f~N~~~~~ 221 (484) T protein:vir:77 173 ----------NEVIGATLYLPNN--TVIWN-----REDGQW------------VQVANVAH--NLEMVPVIPIPNRTRLS 221 (484) T ss_pred ----------CcEEEEEEEecCe--EEEEE-----ecCCce------------EeeccccC--CCCCcceEEeccccccC Confidence 0000000111111 11111 111221 01111234 46889999888888899 Q ss_pred cccCCCcHH-hhhhHHHHHHHHHHHHHHHHHHhcCCeEEEec-cccc-----------cccCCceeEEeCCCCCcccccC Q lcl|NC_015158. 316 NLYAMGPLD-NLVGMQYRIDHLENLKADVFDLIAFPPMKVKG-DVEE-----------FVWGPMEQIYINGDGDVEMMAP 382 (581) Q Consensus 316 ~~~G~s~~~-~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~-d~~~-----------i~~~pG~vi~~~~~~~i~~~~~ 382 (581) .+||+|..+ .++++|+.+|.+...+.+++...++|+..+.+ +.++ +...+|.+|....+ ++...+. T Consensus 222 ~~~G~s~i~~~v~~L~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~q~ 300 (484) T protein:vir:77 222 DLYGTTEITPELRSVTDAAARTLMLMQATAELMGVPQRLLFGVKGEELGVDPETGQTLFDAYLARILAFEDH-ESKAQQF 300 (484) T ss_pred ccCCcccchHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHhCCCcchhcccccccchhhhhhhhhhcccCCC-CceeEee Confidence 999999886 58899999999999999999999999875533 1111 11235566554432 3333333 Q ss_pred CCccchhHHHHHHHHHHHHHh---cCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015158. 383 NTQALQADMQIQILEAKMEEF---AGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISR 459 (581) Q Consensus 383 p~~~~~~~~~lq~~~~~~ee~---TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~ 459 (581) + ..+..+.+..++..+..+ +++|....|..+..+-+|.++...+...........+.|.. .+++++++++.+.. T Consensus 301 ~--~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~k~~~f~~-~l~~~~~l~~~~~~ 377 (484) T protein:vir:77 301 S--AAELRNFVDALDALDRKAAAYTGLPPYYLSFSSENPASAEAIRSSESRLVKTVERKNKIFGG-AWEQAMRVAYKVMN 377 (484) T ss_pred c--CCChHHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhC Confidence 3 223344567777766665 67777787764443345666666666666777777888885 46788888766521 Q ss_pred hhcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHHHHHHH Q lcl|NC_015158. 460 RNLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLAKMLEH 539 (581) Q Consensus 460 ~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~~~e 539 (581) +.+ .+.+..+..+.-.........+.+ +.+.++.+. +..+.+. ..+ .+ T Consensus 378 -~~~-------------------~~~~~~~i~v~w~~~~~~s~~~~a---d~~~kl~~~--g~gi~s~---et~----~~ 425 (484) T protein:vir:77 378 -GGD-------------------IPPEYYRMESIWRDPSTPTYAAKA---DAATKLYNN--GQGVIPK---ERA----RI 425 (484) T ss_pred -CCC-------------------cccccccceEEecCCCCCCHHHHH---HHHHHHHhc--cCCCCCH---HHH----Hh Confidence 110 001111111111112222333333 334343321 2223332 222 22 Q ss_pred HhcCCCcccccCCCCcHHHHHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 540 NLSLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 540 ~~~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) .+ ++ .+ ++.++.++.+..++++.. +.-.++.-. T Consensus 426 ~l---~~---~~--~~~~e~~~~~~ee~~~~~-~~~~~~~~~ 458 (484) T protein:vir:77 426 DM---GY---SI--TEREEMRKWDEEEQAQGL-GLMGTMFGT 458 (484) T ss_pred cC---CC---Ch--hHHHHHHHHHHHHHHHHH-HHHhhhccc Confidence 22 32 11 112221111111122211 011111111 No 91 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=99.63 E-value=2e-14 Score=95.64 Aligned_cols=452 Identities=12% Similarity=0.049 Sum_probs=209.6 Q ss_pred Cccchhh-----hhhhccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhcccc----cc-ccccccccccccccc Q lcl|NC_015158. 1 MTGKVLE-----LQQMLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDT----TT-TTNSTLPWKNKTTLP 70 (581) Q Consensus 1 ~~~~~~~-----~~~~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~----~~-~~~~~~~~k~~~~~p 70 (581) |-.+++. +++|.. .-.+-..+.+.+-.+...+....+.|.. |+.-.+. +. ..+.+..-++++++| T Consensus 1 m~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~i~~~~~---yy~g~~~~~~~~~~~~~~~~~~~~~~~~n 75 (496) T protein:vir:38 1 MINQIIAGVKGVMRRMGL--LKALKDVKDHKKVNANDEDYKYIDMWKR---LYQGHYAEWHNLNYEHNGNPVNRRQLSMN 75 (496) T ss_pred ChhHHHHHHHHHHHHhcc--chhhHHHHhcCCCcCCHHHHHHHHHHHH---HhcCCCchhhcchhccCCCccccceeecc Confidence 3333322 122211 1111111111111222333344445533 3322111 11 111111112334455 Q ss_pred chHHHHHHHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecce Q lcl|NC_015158. 71 KLCQIRDNLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKET 150 (581) Q Consensus 71 ki~~~~d~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~ 150 (581) -...+++.+...| |+..-=+++ ++ +..+++|++.+.+.+|...+.+.+.++..+|.+++++.|... T Consensus 76 ~~k~i~~~~a~~l----~~~p~~i~~-----~d----~~~~e~l~~~~~~n~f~~~~~~~~~~a~~~G~~~~~~~~D~~- 141 (496) T protein:vir:38 76 LPKVTAKYMSKLL----FNEKVKINI-----DD----KAAEEFVLNVLKTNGFTKNMERYIEYGEAMGGFVIKVYHDGN- 141 (496) T ss_pred hHHHHHHHHhhhh----hCCcceEee-----CC----hHHHHHHHHHHhccCHHHHHHHHHHHHhhhCcEEEEEEEcCC- Confidence 4444555555444 344333333 22 345668888889999999999999999999999999987421 Q ss_pred eeeeeeeeEeeeeccceEEecchhheeec-CCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCC Q lcl|NC_015158. 151 TKDEESGATRDTYFGPRAVRIDPKDIVFN-PVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLG 229 (581) Q Consensus 151 ~~~~~~~~~~~~~~~p~ie~V~p~df~~D-P~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~ 229 (581) .+++++.|+|..|||= -....+..+.|+.+.... ...|... . T Consensus 142 -------------~~~~i~~v~~~~~~P~~~~~~~~~~~~f~~~~~~~------------~~~y~~l------------e 184 (496) T protein:vir:38 142 -------------KNVKVSFATADCMYPLSNDSENVDECVIANSFHKN------------NKYYTLL------------E 184 (496) T ss_pred -------------CcEEEEEEcccceEEEEecCCcEEEEEEEEEEEeC------------CeEEEEE------------E Confidence 2468999999999851 112345555555433211 0011000 0 Q ss_pred cccchhhhhccccccccccccccccCCceEEEEEEeeeeecccCCc-e-eeeeEEEEEeCCEEEEeecCCCccCCCCeeE Q lcl|NC_015158. 230 TYTREDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGT-F-KRNMKVTIIDRMFVIEEKENPSWFAQAPIFH 307 (581) Q Consensus 230 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~-~-~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~ 307 (581) .+... ++ ..+|+ +++|- ..++. + .......+.. + +....-.....++||.+ T Consensus 185 ~h~~~----------~~---------~~~I~-~~~y~----~~~~~~~g~~v~~~~~~~-~--~~~~~~~~~~~~~~f~~ 237 (496) T protein:vir:38 185 WNEWQ----------GD---------VYTVT-TELYQ----SDDPNELGTKVSLTLLFD-D--IEPVVPLPDFTRPTFIY 237 (496) T ss_pred EEEEe----------Cc---------eEEEE-EEEEe----cCCccccCcccccccccc-c--cccceeecCCCcceEEE Confidence 00000 00 00000 11110 00000 0 0000000000 0 00000001134566665 Q ss_pred ecc----cccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecccc-ccccC----------CceeEEe- Q lcl|NC_015158. 308 CGW----RIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGDVE-EFVWG----------PMEQIYI- 371 (581) Q Consensus 308 ~~~----~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d~~-~i~~~----------pG~vi~~- 371 (581) +.. ...+++.+|.|+.+.++++++.+|.......+.+..+ -+++.+..+.- ..... +-.++.. T Consensus 238 ~~~~~~N~~~~~~p~G~Sd~~~~~~lid~ld~~~s~~~~~~~~~-~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~~~ 316 (496) T protein:vir:38 238 IKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKLG-KKKVLVPSSFVKTAVNLDGSTTQYFDSTDEAFFLY 316 (496) T ss_pred ecCCcccccccCCcCCCchHhhHHHHHHHHHHHHHHHHHHHhhc-ccceecchHHhhccCCCCCccccCCCCccceEEEe Confidence 543 2245789999999999999999999999999998864 44444422110 00000 1122211 Q ss_pred --CCCC---CcccccCCCccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015158. 372 --NGDG---DVEMMAPNTQALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVML 446 (581) Q Consensus 372 --~~~~---~i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~ 446 (581) .+.+ .++.+++.-...+....++.+...+...+|++..+-|..+++.+||+++....+..-.....+.+.|.. . T Consensus 317 ~~~~~~~~~~i~~~~~~i~~e~~~~~l~~~l~~i~~~~g~~~~~f~~~~~g~~tAtei~~~~~~l~~~~~~~~~~~~~-~ 395 (496) T protein:vir:38 317 QGDQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENGLKTATEVVSEKSETYQTKNSHSQLIEQ-G 395 (496) T ss_pred ecCCCcccccceeeccccCHHHHHHHHHHHHHHHHHhhCCChhhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHH-H Confidence 1222 233333332223344557777788888899999999988888899999987777777777778888884 5 Q ss_pred HHHHHHHHHHHHHhhcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccc Q lcl|NC_015158. 447 MEKVLNAMLEISRRNLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKP 526 (581) Q Consensus 447 ~~~li~~~~~f~~~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p 526 (581) ++++++.++++...+.- .+. ..++..++...|.- . .... .+..++++.++-+ +++.| T Consensus 396 l~~l~~~il~~~~~~~~-------~~g-------~~~~~~~i~v~f~d-~-i~~d---~~~~~~~~~~~~~----~GiiS 452 (496) T protein:vir:38 396 IKEMIVSILEVGKFIEA-------YSG-------EVVELDTITVDFDD-S-IAQD---EDTTINRYTNAKN----QGMIP 452 (496) T ss_pred HHHHHHHHHHHHHHHHh-------hcC-------CCCCccceEEEeCC-C-CCCC---HHHHHHHHHHHHh----cCCCC Confidence 78899999887654321 000 01122334333320 0 0111 1222333333321 12333 Q ss_pred hhHHHHHHHHHHHHhcCCCcccccCCCCcHHHHHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 527 HVSTENLAKMLEHNLSLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 527 ~~~~~~l~~~~~e~~~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) . ..+ +.+ +++ +++++..+. +++.+.+. ++++|-- T Consensus 453 ~---et~---l~~---~~~---------~~d~ea~~e-l~ri~~E~--~~~~~~~ 486 (496) T protein:vir:38 453 L---KIA---LQR---AWN---------ITEAEADEW-AEMLAKEK--QAEMPNN 486 (496) T ss_pred H---HHH---HHh---cCC---------CChHHHHHH-HHHHHHhh--hccCccc Confidence 2 112 111 222 122222222 22222222 2233322 No 92 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=99.63 E-value=2.8e-14 Score=94.87 Aligned_cols=451 Identities=13% Similarity=0.087 Sum_probs=213.3 Q ss_pred Cccc-------------hhhhhhhccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhccccc--ccccccccccc Q lcl|NC_015158. 1 MTGK-------------VLELQQMLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTT--TTTNSTLPWKN 65 (581) Q Consensus 1 ~~~~-------------~~~~~~~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~--~~~~~~~~~k~ 65 (581) |-.+ ..+|++..+.-+-.+. ..+....+.|..+ |-...+.. ........-++ T Consensus 3 ~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~-----------~~~~~~i~~~~~~--Y~g~~~~~~~~~~~~~~~~~~ 69 (500) T protein:vir:98 3 VIQKIKNLVTRSKYVMTTQSLTNITDHPKIAIS-----------KLEYDRITTNLKY--YKSDWDSVLYLNTDGETKKRD 69 (500) T ss_pred hHHHHHHHHHHHHHHhhcchhhhhhccccccCC-----------HHHHHHHHHHHHH--hcCCCCCcccccCCCCcccCc Confidence 1111 2334444442111111 1222223333222 22221111 11111112233 Q ss_pred cccccchHHHHHHHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEe Q lcl|NC_015158. 66 KTTLPKLCQIRDNLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVE 145 (581) Q Consensus 66 ~~~~pki~~~~d~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~ 145 (581) +.++|-...+++.+...+ |+-..-+++. + +..+++++..|.+.+|...+.+.+.++..+|.+++|.+ T Consensus 70 ~~slnl~~~i~~~~A~lv----~~e~~~i~~~-----d----~~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~ 136 (500) T protein:vir:98 70 LNHLPIARTAAKKIASLV----FNEQAEIKVD-----D----DAANEFISETLKNDRFNKNFERYLESCLALGGLAMRPY 136 (500) T ss_pred eeecchHHHHHHHHhhhh----cCCcceEecC-----C----hHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEE Confidence 344453344455444433 3433333332 2 34556788888889999999999999999999999998 Q ss_pred eecceeeeeeeeeEeeeeccceEEecchhheeec-CCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhh Q lcl|NC_015158. 146 YVKETTKDEESGATRDTYFGPRAVRIDPKDIVFN-PVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREF 224 (581) Q Consensus 146 ~~~~~~~~~~~~~~~~~~~~p~ie~V~p~df~~D-P~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~ 224 (581) |... +|+|+.|++..|||= -.......|.++.+...+... +...|...+ .+ T Consensus 137 ~d~~---------------~~~I~~v~ad~~~P~~~d~~~~~~~a~~~~~~~~~~~--------~~~~yt~lE-----~h 188 (500) T protein:vir:98 137 VDGD---------------KVRVAFVQAPVFLPLQSNTQDVSSAAVVIKSVKTING--------KEVYYTLIE-----FH 188 (500) T ss_pred EeCC---------------ceEEEEEcCCeeEEEEEcCCCeEEEEEEEEEeeeecC--------CceEEEEEE-----EE Confidence 8421 367899999998862 222334445555444444210 000010000 00 Q ss_pred hccCC-cccchhhhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCC Q lcl|NC_015158. 225 RRGLG-TYTREDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQA 303 (581) Q Consensus 225 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~ 303 (581) .-..+ .+. .....+++.+ .+..+..|.+.++|.++ . .... .....++ T Consensus 189 ~~~~~~~~~-I~n~ly~~~~--------~~~lG~~v~l~~~~~~l--------~---------~~~~------~~~~~~p 236 (500) T protein:vir:98 189 EWQSSDDYV-ISNELYRSDD--------KAKVGSRVPLSEVYKDL--------K---------DEAK------VTDVTRP 236 (500) T ss_pred EEeCCceeE-EEEEEEeccc--------ccccCcccccccccCCc--------C---------cceE------eccCCCc Confidence 00000 000 0000000000 00000011111222100 0 0000 0123445 Q ss_pred CeeEecc----cccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEec-----cccccccCCc-------- Q lcl|NC_015158. 304 PIFHCGW----RIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKG-----DVEEFVWGPM-------- 366 (581) Q Consensus 304 Pf~~~~~----~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~-----d~~~i~~~pG-------- 366 (581) ||.++.. ....+|.+|.|+.+.+.++++.+|....++.+.+..+ ..++.|.. +.+...+.++ T Consensus 237 ~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~g-~~~i~v~~~~l~~~~~~~~g~~~~~~~~d~~ 315 (500) T protein:vir:98 237 IFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKMG-QRRVAVPESLTALTVRTTDGDVVPRPRFESD 315 (500) T ss_pred cEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhC-cceeeechHHhcccCCCCCccccCCcccCCC Confidence 5555432 2234789999999999999999999999999999764 44444422 1211122111 Q ss_pred -eeEE-eCC--C--CCcccccCCCccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015158. 367 -EQIY-ING--D--GDVEMMAPNTQALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIM 440 (581) Q Consensus 367 -~vi~-~~~--~--~~i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r 440 (581) .+++ ++. + ..++.+++.-...+....++.+..+++...|++.-..|.++.+.+|||++....++.-.....+.+ T Consensus 316 ~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~~TAtei~s~~~~~~~t~~~~~~ 395 (500) T protein:vir:98 316 QNVYIRMGGRDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSMKTATEIVSENSDTYQMRNSIVA 395 (500) T ss_pred cceEEEcCCCCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCccccHHHHHHHHHHHHHHHHHHHH Confidence 1222 121 1 235555544334456666888888888899999999998888889999998888888788888888 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccc Q lcl|NC_015158. 441 NFEVMLMEKVLNAMLEISRRNLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPV 520 (581) Q Consensus 441 ~f~~~~~~~li~~~~~f~~~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~ 520 (581) .|.. .+++|++.++++..-+ + +.+... ....++..+|. .+. ...++..++.+.++-. T Consensus 396 ~~~~-al~~lv~~il~~~~~~-~------~~~~~~-------~~~~~v~v~f~---d~i--~~d~~~~~~~~~~~v~--- 452 (500) T protein:vir:98 396 LVEQ-SLKELVISIFEIAKAY-D------LYQSEV-------PSMDNISISLD---DGV--FTDRDAELDYWIKVVN--- 452 (500) T ss_pred HHHH-HHHHHHHHHHHHHHHH-h------hcCCCC-------CCCcceEEEeC---CCC--CCCHHHHHHHHHHHHH--- Confidence 8875 5788999998775432 1 111110 00112333331 000 1112223333333322 Q ss_pred cccccchhHHHHHHHHHHHHhcCCCcccccCCCCcHHHHHHHHHHHHHHHHHHHHh-----cccCC Q lcl|NC_015158. 521 WQDIKPHVSTENLAKMLEHNLSLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEA-----QVPLV 581 (581) Q Consensus 521 ~~~i~p~~~~~~l~~~~~e~~~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~-----~~~~~ 581 (581) +++.|. +.. +.+. .++ ++++ .++++++.+++...+- +.-++ T Consensus 453 -aGi~s~---~~~---i~~~---~g~---------~eee-a~~~l~~i~~E~~~~~~~~~~~~~~~ 498 (500) T protein:vir:98 453 -AGFGTR---EMA---IQKV---LNV---------TEEK-AQEIAAEINTGIVDEINQQRTDTHLY 498 (500) T ss_pred -cCCCCH---HHH---HHhc---CCC---------CHHH-HHHHHHHHHHhccccCCCCCcccccc Confidence 123333 221 2222 222 2333 3333444444321111 00111 No 93 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=99.63 E-value=2.8e-14 Score=94.87 Aligned_cols=451 Identities=13% Similarity=0.087 Sum_probs=213.3 Q ss_pred Cccc-------------hhhhhhhccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhccccc--ccccccccccc Q lcl|NC_015158. 1 MTGK-------------VLELQQMLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTT--TTTNSTLPWKN 65 (581) Q Consensus 1 ~~~~-------------~~~~~~~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~--~~~~~~~~~k~ 65 (581) |-.+ ..+|++..+.-+-.+. ..+....+.|..+ |-...+.. ........-++ T Consensus 3 ~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~-----------~~~~~~i~~~~~~--Y~g~~~~~~~~~~~~~~~~~~ 69 (500) T protein:vir:30 3 VIQKIKNLVTRSKYVMTTQSLTNITDHPKIAIS-----------KLEYDRITTNLKY--YKSDWDSVLYLNTDGETKKRD 69 (500) T ss_pred hHHHHHHHHHHHHHHhhcchhhhhhccccccCC-----------HHHHHHHHHHHHH--hcCCCCCcccccCCCCcccCc Confidence 1111 2334444442111111 1222223333222 22221111 11111112233 Q ss_pred cccccchHHHHHHHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEe Q lcl|NC_015158. 66 KTTLPKLCQIRDNLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVE 145 (581) Q Consensus 66 ~~~~pki~~~~d~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~ 145 (581) +.++|-...+++.+...+ |+-..-+++. + +..+++++..|.+.+|...+.+.+.++..+|.+++|.+ T Consensus 70 ~~slnl~~~i~~~~A~lv----~~e~~~i~~~-----d----~~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~ 136 (500) T protein:vir:30 70 LNHLPIARTAAKKIASLV----FNEQAEIKVD-----D----DAANEFISETLKNDRFNKNFERYLESCLALGGLAMRPY 136 (500) T ss_pred eeecchHHHHHHHHhhhh----cCCcceEecC-----C----hHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEE Confidence 344453344455444433 3433333332 2 34556788888889999999999999999999999998 Q ss_pred eecceeeeeeeeeEeeeeccceEEecchhheeec-CCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhh Q lcl|NC_015158. 146 YVKETTKDEESGATRDTYFGPRAVRIDPKDIVFN-PVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREF 224 (581) Q Consensus 146 ~~~~~~~~~~~~~~~~~~~~p~ie~V~p~df~~D-P~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~ 224 (581) |... +|+|+.|++..|||= -.......|.++.+...+... +...|...+ .+ T Consensus 137 ~d~~---------------~~~I~~v~ad~~~P~~~d~~~~~~~a~~~~~~~~~~~--------~~~~yt~lE-----~h 188 (500) T protein:vir:30 137 VDGD---------------KVRVAFVQAPVFLPLQSNTQDVSSAAVVIKSVKTING--------KEVYYTLIE-----FH 188 (500) T ss_pred EeCC---------------ceEEEEEcCCeeEEEEEcCCCeEEEEEEEEEeeeecC--------CceEEEEEE-----EE Confidence 8421 367899999998862 222334445555444444210 000010000 00 Q ss_pred hccCC-cccchhhhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCC Q lcl|NC_015158. 225 RRGLG-TYTREDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQA 303 (581) Q Consensus 225 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~ 303 (581) .-..+ .+. .....+++.+ .+..+..|.+.++|.++ . .... .....++ T Consensus 189 ~~~~~~~~~-I~n~ly~~~~--------~~~lG~~v~l~~~~~~l--------~---------~~~~------~~~~~~p 236 (500) T protein:vir:30 189 EWQSSDDYV-ISNELYRSDD--------KAKVGSRVPLSEVYKDL--------K---------DEAK------VTDVTRP 236 (500) T ss_pred EEeCCceeE-EEEEEEeccc--------ccccCcccccccccCCc--------C---------cceE------eccCCCc Confidence 00000 000 0000000000 00000011111222100 0 0000 0123445 Q ss_pred CeeEecc----cccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEec-----cccccccCCc-------- Q lcl|NC_015158. 304 PIFHCGW----RIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKG-----DVEEFVWGPM-------- 366 (581) Q Consensus 304 Pf~~~~~----~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~-----d~~~i~~~pG-------- 366 (581) ||.++.. ....+|.+|.|+.+.+.++++.+|....++.+.+..+ ..++.|.. +.+...+.++ T Consensus 237 ~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~g-~~~i~v~~~~l~~~~~~~~g~~~~~~~~d~~ 315 (500) T protein:vir:30 237 IFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKMG-QRRVAVPESLTALTVRTTDGDVVPRPRFESD 315 (500) T ss_pred cEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhC-cceeeechHHhcccCCCCCccccCCcccCCC Confidence 5555432 2234789999999999999999999999999999764 44444422 1211122111 Q ss_pred -eeEE-eCC--C--CCcccccCCCccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015158. 367 -EQIY-ING--D--GDVEMMAPNTQALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIM 440 (581) Q Consensus 367 -~vi~-~~~--~--~~i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r 440 (581) .+++ ++. + ..++.+++.-...+....++.+..+++...|++.-..|.++.+.+|||++....++.-.....+.+ T Consensus 316 ~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~~TAtei~s~~~~~~~t~~~~~~ 395 (500) T protein:vir:30 316 QNVYIRMGGRDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSMKTATEIVSENSDTYQMRNSIVA 395 (500) T ss_pred cceEEEcCCCCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCccccHHHHHHHHHHHHHHHHHHHH Confidence 1222 121 1 235555544334456666888888888899999999998888889999998888888788888888 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccc Q lcl|NC_015158. 441 NFEVMLMEKVLNAMLEISRRNLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPV 520 (581) Q Consensus 441 ~f~~~~~~~li~~~~~f~~~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~ 520 (581) .|.. .+++|++.++++..-+ + +.+... ....++..+|. .+. ...++..++.+.++-. T Consensus 396 ~~~~-al~~lv~~il~~~~~~-~------~~~~~~-------~~~~~v~v~f~---d~i--~~d~~~~~~~~~~~v~--- 452 (500) T protein:vir:30 396 LVEQ-SLKELVISIFEIAKAY-D------LYQSEV-------PSMDNISISLD---DGV--FTDRDAELDYWIKVVN--- 452 (500) T ss_pred HHHH-HHHHHHHHHHHHHHHH-h------hcCCCC-------CCCcceEEEeC---CCC--CCCHHHHHHHHHHHHH--- Confidence 8875 5788999998775432 1 111110 00112333331 000 1112223333333322 Q ss_pred cccccchhHHHHHHHHHHHHhcCCCcccccCCCCcHHHHHHHHHHHHHHHHHHHHh-----cccCC Q lcl|NC_015158. 521 WQDIKPHVSTENLAKMLEHNLSLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEA-----QVPLV 581 (581) Q Consensus 521 ~~~i~p~~~~~~l~~~~~e~~~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~-----~~~~~ 581 (581) +++.|. +.. +.+. .++ ++++ .++++++.+++...+- +.-++ T Consensus 453 -aGi~s~---~~~---i~~~---~g~---------~eee-a~~~l~~i~~E~~~~~~~~~~~~~~~ 498 (500) T protein:vir:30 453 -AGFGTR---EMA---IQKV---LNV---------TEEK-AQEIAAEINTGIVDEINQQRTDTHLY 498 (500) T ss_pred -cCCCCH---HHH---HHhc---CCC---------CHHH-HHHHHHHHHHhccccCCCCCcccccc Confidence 123333 221 2222 222 2333 3333444444321111 00111 No 94 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=99.62 E-value=2.5e-13 Score=89.68 Aligned_cols=433 Identities=11% Similarity=0.095 Sum_probs=224.6 Q ss_pred CccchhhhhhhccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhcccc------cccccccccccccccccchHH Q lcl|NC_015158. 1 MTGKVLELQQMLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDT------TTTTNSTLPWKNKTTLPKLCQ 74 (581) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~------~~~~~~~~~~k~~~~~pki~~ 74 (581) +++...+ ...+ ...|.+.-+.+...|.+ .+.++++|+...+. +.....+. .+++.+|-..- T Consensus 31 ~~~~e~~--~~~~------~~~i~~~i~~~~~~~~~---r~~~l~~Yy~g~~~il~~~~~~~~~~~~--~~ki~~n~~k~ 97 (511) T protein:vir:93 31 YDGTESD--LLQN------VNEVSKYIEHHMDYQRP---RLKVLSDYYEGKTKNLVELTRRKEEYMA--DNRVAHDYASY 97 (511) T ss_pred ccchhhh--hhcc------HHHHHHHHHHHHHhhHH---HHHHHHHHhcccCccccccCcCcccccC--cceeecchHHH Confidence 2221111 1110 12233333333333433 45556667655442 11111222 24577787778 Q ss_pred HHHHHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeee Q lcl|NC_015158. 75 IRDNLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDE 154 (581) Q Consensus 75 ~~d~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~ 154 (581) +++...++|+ .++- ++.+ +++ ..++.+++.+.+++|.....++.+++.+||.|+..+++... T Consensus 98 Iv~~~~~yl~----g~p~--~~~~---~d~----~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~ay~~vy~de~----- 159 (511) T protein:vir:93 98 ISDFINGYFL----GNPI--QYQD---DDK----DVLEVIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQD----- 159 (511) T ss_pred HHHHHhhhhc----ccCe--eecc---CCh----HHHHHHHHHHhhcCHhHHHHHHHHHHHhcCeeEEEEEeCCC----- Confidence 8888887775 3322 2322 221 23456777788899999999999999999999988865311 Q ss_pred eeeeEeeeeccceEEecchhhee--ecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCccc Q lcl|NC_015158. 155 ESGATRDTYFGPRAVRIDPKDIV--FNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYT 232 (581) Q Consensus 155 ~~~~~~~~~~~p~ie~V~p~df~--~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~ 232 (581) .++++..++|.++| +|+.... ..-+.+|.+.+... +.. . T Consensus 160 ---------~~~~i~~~~p~~~~~vydd~~~~--~~~~~vr~~~~~~~---------~~~-------------------~ 200 (511) T protein:vir:93 160 ---------DETRLYKSDAMSTFVIYDNTIER--NSIAGVRYLRTKPI---------DKT-------------------D 200 (511) T ss_pred ---------CceEEEEEccceeEEEEcCCCCC--ceEEEEEEEEeeec---------ccc-------------------c Confidence 23678889999974 4544321 11223344333100 000 0 Q ss_pred chhhhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccc Q lcl|NC_015158. 233 REDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRI 312 (581) Q Consensus 233 ~~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~ 312 (581) .... ...+.|+...+..+. ..+++... . +....... |-+.|..|++.+. T Consensus 201 -----------~~~~-~~~~iyt~~~i~~~~-------~~~~~~~~---~----~~~~~~~~--~~~~g~vPvv~~~--- 249 (511) T protein:vir:93 201 -----------EDEV-FTVDLFTSHGVYRYL-------TSRTNGLK---L----TPRENGFE--SHSFERMPITEFS--- 249 (511) T ss_pred -----------cceE-EEEEEEeCCcEEEEE-------ecCCCccc---c----cccccccc--ccCCCccceEEec--- Confidence 0000 000112222111110 00111000 0 00111112 3345777876543 Q ss_pred cCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc----cccc-ccCCceeEEe-------------CCC Q lcl|NC_015158. 313 RQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD----VEEF-VWGPMEQIYI-------------NGD 374 (581) Q Consensus 313 ~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d----~~~i-~~~pG~vi~~-------------~~~ 374 (581) ..-+|.|..+.++++++.+|.+...+.+.+...++|.+.+.+. ..++ ....++++.. ..+ T Consensus 250 --nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 327 (511) T protein:vir:93 250 --NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGS 327 (511) T ss_pred --CCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhhCcceeeecCcccCchhhcccccccceecccccccccccccCCCC Confidence 3457999999999999999999999999999999998877552 1111 1222333322 234 Q ss_pred CCcccccCCCccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015158. 375 GDVEMMAPNTQALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAM 454 (581) Q Consensus 375 ~~i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~ 454 (581) +++.++..+.........+..+...+-+.|++|..+.+..+ ++.++.++..++..+........+.|.. +++++++++ T Consensus 328 ~~~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~P~~~~~~~~-~n~Sg~Al~~~~~~l~~k~~~k~~~f~~-~l~~~~~li 405 (511) T protein:vir:93 328 VDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-GTQSGEAMKYKLFGLEQRTKTKEGLFTK-GLRRRAKLL 405 (511) T ss_pred cceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccc-ccchHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHH Confidence 55667766655555667788888999999999988765432 5667777888888888888888999995 678888888 Q ss_pred HHHHHhhcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHH Q lcl|NC_015158. 455 LEISRRNLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLA 534 (581) Q Consensus 455 ~~f~~~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~ 534 (581) ..++........ +. .-.+++..| ...-.....+.++ .+..+. .+.|. +.++ T Consensus 406 ~~~l~~~~~~~~-----~~----------d~~~i~~~f--~~~~p~n~~e~~~---~~~kl~------g~iS~---et~~ 456 (511) T protein:vir:93 406 ETILKNTWSIDA-----NK----------DFNTVRYVY--NRNLPKSLIEELK---AYIDSG------GKISQ---TTLM 456 (511) T ss_pred HHHHHhccCccc-----cc----------ccccceEEe--CCCCCCCHHHHHH---HHHHHh------ccCch---HHHH Confidence 776543211000 00 011232223 1222223344443 333322 12233 2222 Q ss_pred HHHHHHhcCCCcccccCCCCcHHHHHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 535 KMLEHNLSLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 535 ~~~~e~~~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) +. ++. .+ ++ ++|.+++..+.+.+++..+...-. T Consensus 457 ----~~--l~~----v~--d~--~~E~~ri~~E~~~~~~~~~~~~~~ 489 (511) T protein:vir:93 457 ----SL--FSF----FQ--DP--ELEVKKIEEDEKESIKKAQKGIYK 489 (511) T ss_pred ----Hh--CCC----CC--CH--HHHHHHHHHHHHHHHHHHhhhccc Confidence 22 222 12 22 344555555433333222211111 No 95 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=99.62 E-value=4.1e-14 Score=94.00 Aligned_cols=446 Identities=16% Similarity=0.141 Sum_probs=206.0 Q ss_pred Cccc-----------hhhhhhhccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhcccc-cccccccccccc--- Q lcl|NC_015158. 1 MTGK-----------VLELQQMLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDT-TTTTNSTLPWKN--- 65 (581) Q Consensus 1 ~~~~-----------~~~~~~~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~-~~~~~~~~~~k~--- 65 (581) ||.- +.--..+++ .+.+...+.+++..+...+ ..+.++.+|+.-.+. ...+ .+++.+. T Consensus 1 ~~~~~~~~~~~~~~~~~~p~~~~~--~~~~~~l~~~l~~~~~~~~----~rl~~l~~YY~G~~~~~~~~-~~~~~~~~~~ 73 (501) T protein:vir:25 1 MTVPVDVIADAPAADVEFPEDSMS--REQLGALVADMWRLHISER----QWLDRIYEYTKGLRGRPEVP-EGASDEVKEL 73 (501) T ss_pred CcccchhhhccCcccccCCcccCC--hHHHHHHHHHHHHHHHHHH----HHHHHHHHHHhcCCCchhcc-ccCChhhhhh Confidence 3332 222222222 3444444555555444322 355556666554432 2222 2222221 Q ss_pred --cccccchHHHHHHHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEE Q lcl|NC_015158. 66 --KTTLPKLCQIRDNLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFAT 143 (581) Q Consensus 66 --~~~~pki~~~~d~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k 143 (581) +..++-+.-+++.+...+. +. |+.-.+.+..+ .+.....++++.....++.+++.+||.|++. T Consensus 74 ~~~~v~n~~~~ivd~~a~~l~----~~-------gf~~~d~~~~~----~l~~i~~~N~~d~~~~~~~~~a~i~G~ay~~ 138 (501) T protein:vir:25 74 AKLSVKNVLSLVRDSFAQNLS----VV-------GYRNALAKEND----PAWEMWQRNRMDARQAEVHRPALTYGASYVT 138 (501) T ss_pred HhhhhcChHHHHHHHHHhhhc----cc-------ceecCCccchH----HHHHHHHhcChhHHHHHHHHHHhhcCceEEE Confidence 1223444445554444332 21 22222222112 2334456788999999999999999999988 Q ss_pred EeeecceeeeeeeeeEeeeeccceEEecchhhee--e-cCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHH Q lcl|NC_015158. 144 VEYVKETTKDEESGATRDTYFGPRAVRIDPKDIV--F-NPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIAR 220 (581) Q Consensus 144 ~~~~~~~~~~~~~~~~~~~~~~p~ie~V~p~df~--~-DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~ 220 (581) +...+ .++.|..+||.+.+ + ||...... .+++|......+- .. T Consensus 139 v~~de---------------~~~~i~~~sp~~~~~iy~D~~~~~~~--~~ai~~~~~~~~~-------~~---------- 184 (501) T protein:vir:25 139 VTPTD---------------EGPVFRTRSPRQILAVYADPSVDAWP--QYALETWVAQKDA-------KP---------- 184 (501) T ss_pred EecCC---------------CCCeEEEeccccEEEEEecCCCCcce--eEEEEEEeecccc-------Cc---------- Confidence 85421 13567888999974 2 66644221 1223322221000 00 Q ss_pred HHhhhccCCcccchhhhhccccccccccccccccCCceEEEEEEeeee-ecccCCceeeeeEEEEEeCCEEEEeecCCCc Q lcl|NC_015158. 221 RREFRRGLGTYTREDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDY-HDTQSGTFKRNMKVTIIDRMFVIEEKENPSW 299 (581) Q Consensus 221 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~-~d~~~d~~~e~~~itv~~g~~iir~~~nP~~ 299 (581) ..+ ..+|....+..+...+.+ .....+............++ .......|.+ T Consensus 185 ------------------------~~~---~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~ 236 (501) T protein:vir:25 185 ------------------------HRR---GVLYDDTYMYELDLGEVVLGDAGGGQATQQPVNVREVTD-VIEHGATFEG 236 (501) T ss_pred ------------------------cee---EEEecCeeEEEEecCceeeeecccccccccccccccccc-ccccccccCC Confidence 000 000111111100000000 00001111111111111111 1122223445 Q ss_pred cCCCCeeEecccccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEec---c-ccccccCCceeEEeCCCC Q lcl|NC_015158. 300 FAQAPIFHCGWRIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKG---D-VEEFVWGPMEQIYINGDG 375 (581) Q Consensus 300 ~g~~Pf~~~~~~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~---d-~~~i~~~pG~vi~~~~~~ 375 (581) ++..|++.+...+..+ -+|+|..+.++++|+.+|..+..+...+...++|+..+.+ + .+.+...+|++|...+ + T Consensus 237 ~~~vPiv~f~N~~~~~-~~g~sdie~v~~l~Da~~~~~s~~~~~~e~~a~p~~~i~G~~~~~~~~~~~~~~~i~~~~~-~ 314 (501) T protein:vir:25 237 KPVCPVVRFVNGRDAD-DMIVGEVAPLILLQQAINSVNFDRLIVSRFGANPQRVISGWTGSKAEVLKASALRVWTFED-P 314 (501) T ss_pred ccceeeEeccCccccC-ccccchhhhhHHHHHHHHHHHHHHHHHHHhhccHHHHHhCCCCCccchhhhcccceeccCC-C Confidence 6778888887777654 4699999999999999999999999999999999754433 2 2223445788877653 2 Q ss_pred CcccccCCCccchhHHHHHHHHHH---HHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015158. 376 DVEMMAPNTQALQADMQIQILEAK---MEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLN 452 (581) Q Consensus 376 ~i~~~~~p~~~~~~~~~lq~~~~~---~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~ 452 (581) ++...+-+.. ++.+.+++++.. +-..|++|...-|... .|.+|.++...+...........+.|.. .++++++ T Consensus 315 ~~~~~q~~~~--~~~~~~~~l~~~i~~i~~~s~~P~~~~~~~~-~N~Sg~Al~~~~~~l~~ka~~k~~~f~~-~l~~~~r 390 (501) T protein:vir:25 315 EVKAQAFPPA--SVEPYNLILEEMLQHVAMVAQISPAQVTGKM-INVSAEALAAAEANQQRKLAAKRESFGE-SWEQLLR 390 (501) T ss_pred CceEEEeccc--ChHHHHHHHHHHHHHHHhhcCCChhhhcccc-CChHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHH Confidence 3333333322 233344555544 4556888887777322 3456667777777777777888888885 4677888 Q ss_pred HHHHHHHhhcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHH Q lcl|NC_015158. 453 AMLEISRRNLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTEN 532 (581) Q Consensus 453 ~~~~f~~~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~ 532 (581) +++.+.... +.. .+ -++...| .-......++.++-+..|.++ .+ |. +. T Consensus 391 l~~~~~~~~-~~~-------~~-----------~~i~v~w--~~~~~~s~~~~ada~~kl~~~-------gi-s~---et 438 (501) T protein:vir:25 391 LAAEMDDDP-DTA-------AD-----------SGAEVLW--RDTEARSFGAVVDGITKLASA-------GI-PI---EH 438 (501) T ss_pred HHHHHhCCC-ccc-------cc-----------eeeeEEe--cCCCCCCHHHHHHHHHHHHhc-------CC-CH---HH Confidence 776553211 000 00 0121111 112233444544433333222 12 22 22 Q ss_pred HHHHHHHHhcCCCcccccCCCCcHHHHHHHHHHH--HHHHHHHHH---hcccCC Q lcl|NC_015158. 533 LAKMLEHNLSLGGWDIFKPNVAVMEAQTTSALVN--QSQAQIEEE---AQVPLV 581 (581) Q Consensus 533 l~~~~~e~~~l~~~~~~~~~~~~~~~~~~q~~~q--~aq~~~~~~---~~~~~~ 581 (581) + +. .+++++ + .+.++....++ +++..+..- +..+-. T Consensus 439 ~---~~---~~~g~~---~----~~ie~~~~~~~e~~~~~~~~~~~~~~~~~~~ 479 (501) T protein:vir:25 439 L---LS---MVPGMT---Q----QTIQAIKDSLRGGEVKSLVDKLLSNEPAPVP 479 (501) T ss_pred H---HH---HcCCCC---H----HHHHHHHHHHHHHhHHHHHHHhhccCcCCCC Confidence 2 11 234432 1 11111111111 111111111 001111 No 96 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=99.61 E-value=1.6e-13 Score=90.79 Aligned_cols=427 Identities=14% Similarity=0.104 Sum_probs=223.8 Q ss_pred Cccc-hhh---hhhhccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhcccc-----cc---ccc-ccccccccc Q lcl|NC_015158. 1 MTGK-VLE---LQQMLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDT-----TT---TTN-STLPWKNKT 67 (581) Q Consensus 1 ~~~~-~~~---~~~~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~-----~~---~~~-~~~~~k~~~ 67 (581) |-.+ ... .+.+ ++..+.....|.++-+.+.. |.+ .+.++++|+.--+. +. .+. .+..-.+++ T Consensus 7 ~~~~~~~~~~~~~~~-~~~~~~~~~~i~~~i~~~~~-~~~---~~~~~~~Yy~g~~~i~~r~~~~~~~~~~~~~~~~~ki 81 (474) T protein:vir:95 7 MPWDKPYGEEVVEQL-KPQFETQEEMIIRLIDDHRK-QLD---KITVGQRYYDKDNDIVKQMKKVDVYGNIDYDKPDWRI 81 (474) T ss_pred cCCCCchhhHHHHhh-hhccCChHHHHHHHHHHHHH-HHH---HHHHHHHHhcccCchhcccccccccccccccccccee Confidence 2222 111 2222 22222333344444444432 222 33444555543321 00 000 001112466 Q ss_pred cccchHHHHHHHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeee Q lcl|NC_015158. 68 TLPKLCQIRDNLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYV 147 (581) Q Consensus 68 ~~pki~~~~d~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~ 147 (581) .+|-...+++...++|+ +++- ++.+ ++++..+.++.+ + +.+|.....++.+++.+||.|+..+++. T Consensus 82 ~~n~~~~Ivd~~~~~l~----g~p~--~~~~---~d~~~~~~l~~~----~-~n~~~~~~~e~~~~~~~~G~~~~~v~~d 147 (474) T protein:vir:95 82 TTNFHQNLVDQKVSYVA----SKPV--TYSC---EDESVLKIIHDV----L-DTRWDNKLIDILTATSNKGIDWLQVYIN 147 (474) T ss_pred ccchHHHHHHHHHhhhc----cCCc--eecc---CchHHHHHHHHH----H-hccHHHHHHHHHHHHhhcCcEEEEEEec Confidence 77777778888777765 4332 2322 333333444433 3 2578888999999999999999888653 Q ss_pred cceeeeeeeeeEeeeeccceEEecchhheee--cCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhh Q lcl|NC_015158. 148 KETTKDEESGATRDTYFGPRAVRIDPKDIVF--NPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFR 225 (581) Q Consensus 148 ~~~~~~~~~~~~~~~~~~p~ie~V~p~df~~--DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~ 225 (581) .. .++++..++|.++|+ |+... .+.-+++|.+...++ . T Consensus 148 ~~--------------~~~~i~~~~p~~~~~v~d~~~~--~~~~~~i~~~~~~~~-----------~------------- 187 (474) T protein:vir:95 148 EN--------------GEMKLFRVPAEQAIPIWVDKER--EELKSFIRYYKFNNE-----------E------------- 187 (474) T ss_pred CC--------------CceEEEEEcccceEEEEcCCCC--CceEEEEEEEEEcCe-----------e------------- Confidence 21 136788899999753 43321 222222333221100 0 Q ss_pred ccCCcccchhhhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCe Q lcl|NC_015158. 226 RGLGTYTREDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPI 305 (581) Q Consensus 226 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf 305 (581) ..+.|....+..+.+. .++....... +...+....-|.+.|..|+ T Consensus 188 ------------------------~~~~y~~~~~~~~~~~-------~~~~~~~~~~----~~~~~~~~~~~~~~g~iPv 232 (474) T protein:vir:95 188 ------------------------KVEFWTDTTVTYYVLE-------NGGLIPDYYY----GANHIQSHFSNGNWGRVPF 232 (474) T ss_pred ------------------------EEEEEeCCeEEEEEEc-------CCcccccccc----CcccccccccccCCCccce Confidence 0011222222111111 1111110000 0111111122334677787 Q ss_pred eEecccccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc----cccc--ccCCceeEEeCCCCCccc Q lcl|NC_015158. 306 FHCGWRIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD----VEEF--VWGPMEQIYINGDGDVEM 379 (581) Q Consensus 306 ~~~~~~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d----~~~i--~~~pG~vi~~~~~~~i~~ 379 (581) +.+. .+..|.|..+.++++++.+|.+...+.+++...++|.+.+.+. .... ....++++.+.+++++.+ T Consensus 233 v~~~-----nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~ 307 (474) T protein:vir:95 233 IAFK-----NNPEEVSDIWMYKSLIDAIDKRLSDAQNMFDESVELIYILKGYEGQDLEEFMRGLKYYKAINVDGDGGVET 307 (474) T ss_pred Eeec-----CCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhhhhhccceeeccCCCceeE Confidence 7543 3457999999999999999999999999999999999877541 2222 223577888899999998 Q ss_pred ccCCCccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015158. 380 MAPNTQALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISR 459 (581) Q Consensus 380 ~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~ 459 (581) +..+.....+...+..+...+-..+++|..+.+. ..++.||.++..++...........+.|.. ++++++++++++.. T Consensus 308 l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~~~n~Sg~Alk~~~~~l~~k~~~k~~~~~~-~l~~~~~li~~~~g 385 (474) T protein:vir:95 308 IQVEVPVSSTKEYIDLMRAYIMEFGQGVDFQTDK-FGSAPSGIALKFLYGNLDLKANKLKNKATV-AIQELIGFIIDFNN 385 (474) T ss_pred EeecCCHHHHHHHHHHHHHHHHHHhCCccccccc-ccccchHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhC Confidence 8877666667777889999999999999876553 234667777888888888888888888885 67889998887753 Q ss_pred hhcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHHHHHHH Q lcl|NC_015158. 460 RNLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLAKMLEH 539 (581) Q Consensus 460 ~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~~~e 539 (581) ...+. .+|...|. -.-.....+.++ .|.+. ++ +|.+.++ . T Consensus 386 ~~~d~---------------------~~i~v~f~--~~~p~d~~e~a~---~~~~~-------g~---iS~et~i----~ 425 (474) T protein:vir:95 386 LKMDV---------------------KDIEISFN--FNRMMNDAEQSQ---IIAQS-------QY---LSRETLV----K 425 (474) T ss_pred CCccc---------------------ceeeEEec--cCCCcCHHHHHH---HHHhc-------CC---CchHHHH----H Confidence 22221 12222221 111112333333 22211 12 3433332 2 Q ss_pred HhcCCCcccccCCCCcHHHHHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 540 NLSLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 540 ~~~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) . ++. .++ + ++|.+++..+-+ ++.++++.. T Consensus 426 ~--l~~----v~d--~--~~E~~ri~~E~~---~~~~~~~~~ 454 (474) T protein:vir:95 426 S--SPL----VDD--Y--KAELERIEQEQM---EYNKQLPNL 454 (474) T ss_pred h--CCC----CCC--H--HHHHHHHHHHHH---HHHhccccc Confidence 2 222 122 2 233444333211 112233333 No 97 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=99.61 E-value=1.7e-13 Score=90.59 Aligned_cols=457 Identities=10% Similarity=0.003 Sum_probs=215.2 Q ss_pred Cccchhhhhhhccc-hh---hhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhcccccccccccccccccccccchHHHH Q lcl|NC_015158. 1 MTGKVLELQQMLDD-TR---DGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTTTTTNSTLPWKNKTTLPKLCQIR 76 (581) Q Consensus 1 ~~~~~~~~~~~~~~-~~---~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~~~~~~~~~~k~~~~~pki~~~~ 76 (581) |.+ +.++++++. +. ++-...|.. |..|-....+|.. +. ...+..+...+.++.+| ..+. T Consensus 18 ~~~--~~~~~~~~~~~i~~~~~~~~~I~~-w~~~Y~g~~~~~~----------~~--~~~~~~~~~~~~sl~~~--~~i~ 80 (517) T protein:vir:98 18 LSG--QTLKSINDHEKINIDPNELARIER-NLRQYEGDYPQVE----------YI--NSQGKIQERDYMTLNLR--KLSA 80 (517) T ss_pred hcc--cchhHhhcCCceecCHHHHHHHHH-HHHHhcCCCcccc----------cc--cccccccccceeecCcH--HHHH Confidence 221 334444441 11 111112221 3333222222110 00 11122222223333344 2333 Q ss_pred HHHHHHHHHhhcCCccEEEeecCChh--HHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeee Q lcl|NC_015158. 77 DNLHSNYISALFPNERWLKWEGKSLQ--DEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDE 154 (581) Q Consensus 77 d~~~~~l~~~~f~~~~~~~~~~~~~~--d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~ 154 (581) ..+. +| +|+-..-+.+.+.... ........+++|+..+.+++|...+.+.+.+++-.|.+++|..|... T Consensus 81 ~~~A-~L---l~~e~~~i~v~d~~~~~~~~~~~~~~~e~l~~i~~~n~f~~~~~~~~e~a~a~G~~a~k~~~d~~----- 151 (517) T protein:vir:98 81 DVLS-GL---VFNEQCEVYVSDAKDEEKKDNSFKTAHEFIQHVFQHNKFIKNLSDYLEPTFALGGLTVRPYVDNG----- 151 (517) T ss_pred HHhh-hh---hcCCcceEEecccccccccccchhHHHHHHHHHHHhccHHHHHHHHHHHHhhhCCEEEEEEEeCC----- Confidence 3332 22 3454444444432211 12233446778988899999999999999999999999999988522 Q ss_pred eeeeEeeeeccceEEecchhheee-cCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccc Q lcl|NC_015158. 155 ESGATRDTYFGPRAVRIDPKDIVF-NPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTR 233 (581) Q Consensus 155 ~~~~~~~~~~~p~ie~V~p~df~~-DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~ 233 (581) +++|+.|++..||| .-+...+-.|.++....++.++ +...|.- . T Consensus 152 ----------~~~I~~v~ad~~~Pl~~~~~~v~~~ai~~~~~~~~~~--------~~~~Yt~-----------------l 196 (517) T protein:vir:98 152 ----------EIEFSWALANAFYPLRSNSNGISEGVMKSVTTKVIGN--------KTVYYTL-----------------L 196 (517) T ss_pred ----------eeEEEEEcCCeeEEEEecCCCeEEEEEEEEEEEeecC--------CceEEEE-----------------E Confidence 35688999999987 2223344445544444444211 0001100 0 Q ss_pred hhhhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCC--C--ccCCCCeeEec Q lcl|NC_015158. 234 EDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENP--S--WFAQAPIFHCG 309 (581) Q Consensus 234 ~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP--~--~~g~~Pf~~~~ 309 (581) +.++.......++ ..+|+ .+.|....+..-|.. ...-.++. ...| + ...+++|.++. T Consensus 197 E~H~~~~~~~~~~---------~y~I~-n~ly~s~~~~~lG~~--v~L~~~~e-------~l~~~~~~~g~~~Plf~y~~ 257 (517) T protein:vir:98 197 EFHEWEKTEEGES---------LYVIT-NELYKSDNEGEIGKR--IPLEELYE-------GMQEKTYIQGLSRPLFNYLK 257 (517) T ss_pred EEEecCceeccCC---------cEEEE-EEEEecCCCcccccc--cccccccc-------CCCcceeECCCCcceEEEec Confidence 0000000000000 00111 122210000000000 00000000 0001 0 12233343332 Q ss_pred c----cccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc-----ccccccCCc-------eeEEe-C Q lcl|NC_015158. 310 W----RIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD-----VEEFVWGPM-------EQIYI-N 372 (581) Q Consensus 310 ~----~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d-----~~~i~~~pG-------~vi~~-~ 372 (581) . ....+|.+|.|+.+.+.+..+.+|....++++-+.++-. ++.|.++ .+.....++ .+++. + T Consensus 258 ~p~~N~~~~~splG~S~~~~a~~~~d~lD~~~s~~~~e~~~g~~-~i~vp~~~l~~~~~~~g~~~~~~~d~~~~~y~~~~ 336 (517) T protein:vir:98 258 PSGFNNINPHSPLGLGITDNSVSTLKKINDTYDQFWWEIKMGQR-TVFVSDVMLRTVPDESGMPPPQVFDPDVNVYKSIR 336 (517) T ss_pred CCcccccccCCCCCCchhhhhHHHHHHHHHHHHHHHHHHHhCCc-ceecChhhhccccCCCCcccCCCCCcccceeeecc Confidence 1 122378899999999999999999999999998888544 3434322 222222222 22221 1 Q ss_pred C---CCCcccccCCCccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015158. 373 G---DGDVEMMAPNTQALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEK 449 (581) Q Consensus 373 ~---~~~i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~ 449 (581) . ...++.+++.-...+...-++.+.++++...|++.-..|.++.+.+|||++..-.++.-..++.+.+.+.. .+++ T Consensus 337 ~~~~~~~i~~~~~~iR~e~~~~~~~~~L~~i~~~~Gls~~t~~~~~~~~kTATEi~s~~~~~~~t~~~~~~~~~~-aL~~ 415 (517) T protein:vir:98 337 MGTDEEFVKDVTHDIRTEQYKEAINQALRTLEMELKLSVGTFSFDGRSMKTATEIVSENDLTYRTRNDHVYEVEQ-FIKG 415 (517) T ss_pred CCCCCCceeeeccccchHHHHHHHHHHHHHHHHHhCCCcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHH-HHHH Confidence 1 12344444443344566678889999999999999999998888899999988777777777888888885 5788 Q ss_pred HHHHHHHHHHhhcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhH Q lcl|NC_015158. 450 VLNAMLEISRRNLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVS 529 (581) Q Consensus 450 li~~~~~f~~~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~ 529 (581) |++.|+.+..-+. ++|..+ ....++..+|. .+ -...+++.++.+.++-. +++.|. T Consensus 416 lv~~i~~l~~~~~-------~~~~~~-------~~~~~v~v~f~---D~--i~~D~~~~~~~~~~~v~----aG~ms~-- 470 (517) T protein:vir:98 416 LVISVLELAKTYK-------LFGGEI-------PSAEHIGVDFD---DG--VFQDRSALLRFYGQAKT----FGFIPT-- 470 (517) T ss_pred HHHHHHHHHHHHh-------hcCCCC-------CCCcceEEEcC---CC--CCCCHHHHHHHHHHHHh----cCCCCH-- Confidence 9998877654211 122210 01112333331 11 01122333333333321 123332 Q ss_pred HHHHHHHHHHHhcCCCcccccCCCCcHHHHHHHHHHHHHHHHHHHHh--------cccCC Q lcl|NC_015158. 530 TENLAKMLEHNLSLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEA--------QVPLV 581 (581) Q Consensus 530 ~~~l~~~~~e~~~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~--------~~~~~ 581 (581) .. -+.+..|+ ++++. ++++++++.+....- +.++- T Consensus 471 -~~---~i~~~~g~------------~eeeA-~~e~~~i~~E~~~~~~~~~~~~~~~~~~ 513 (517) T protein:vir:98 471 -VE---AIQRIFKV------------PKKTA-EQWLEEIRKDQIELDPVTISQRAQKRMF 513 (517) T ss_pred -HH---HHHHhCCC------------ChHHH-HHHHHHHHHhccccCCCCccccccCCCC Confidence 11 12233232 22322 333444554443222 11222 No 98 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=99.61 E-value=7.5e-14 Score=92.54 Aligned_cols=430 Identities=13% Similarity=0.080 Sum_probs=232.8 Q ss_pred Cccc---------hhhhhhhccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhcccc---c--cccccccc---- Q lcl|NC_015158. 1 MTGK---------VLELQQMLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDT---T--TTTNSTLP---- 62 (581) Q Consensus 1 ~~~~---------~~~~~~~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~---~--~~~~~~~~---- 62 (581) |+.+ +-|+=+.++...+.....|.++.+.+.. | .....++++|+...|. + .+...+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~-~---~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~ 76 (474) T protein:vir:96 1 MINIIRMPWDKPYGEEVVEQMKPKVETQEEMIIRLINNHKQ-K---LKDINVGQKYYDKDNDINYQAYKQDLHGNIDYTK 76 (474) T ss_pred CcccccCCCCCCCCcchhhhccccccchHHHHHHHHHHHHH-H---HHHHHHHHHHhcccCccccccchhhhcccccccc Confidence 3332 2233333443344455566666665543 2 2345566677665431 1 01111111 Q ss_pred ccccccccchHHHHHHHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEE Q lcl|NC_015158. 63 WKNKTTLPKLCQIRDNLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFA 142 (581) Q Consensus 63 ~k~~~~~pki~~~~d~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~ 142 (581) -.+++++|-..-+++...++|+ .++- ++.+ ++++..+. +++.+ ..+|.....++.+++.+||.|+. T Consensus 77 ~~~ki~~n~~k~Iv~~~~~yl~----g~p~--~~~~---~~~~~~~~----l~~~~-~n~~~~~~~~l~~~~~~~G~~~~ 142 (474) T protein:vir:96 77 PDWRITTNFHQNLVDQKVSYVA----GKPV--TYAH---DDDKVLDV----IHQVL-DTRWDNKLIDILTAASNKGIDWL 142 (474) T ss_pred cccccccchHHHHHHhhhhhhc----ccCc--eecc---CChHHHHH----HHHHH-hccHHHHHHHHHHHHhhCCeEEE Confidence 1235677777778888877765 3332 3332 23233333 33333 36788899999999999999998 Q ss_pred EEeeecceeeeeeeeeEeeeeccceEEecchhheeecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHH Q lcl|NC_015158. 143 TVEYVKETTKDEESGATRDTYFGPRAVRIDPKDIVFNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRR 222 (581) Q Consensus 143 k~~~~~~~~~~~~~~~~~~~~~~p~ie~V~p~df~~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~ 222 (581) .+++... ..+++..++|.++|+=..-....+.-+++|.+...+. T Consensus 143 ~~~~d~~--------------~~~~i~~~~p~~~~~v~d~~~~~~~~a~ir~~~~~~~---------------------- 186 (474) T protein:vir:96 143 QVYINED--------------GELKLFRVPAEQAIPIWTDKEREQLNAFIRIFTFNGE---------------------- 186 (474) T ss_pred EeeeCCC--------------CceEEEEEcccceEEEEcCCCCCceEEEEEEEeecCe---------------------- Confidence 8865311 1367888999998643322222222222332211000 Q ss_pred hhhccCCcccchhhhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCC Q lcl|NC_015158. 223 EFRRGLGTYTREDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQ 302 (581) Q Consensus 223 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~ 302 (581) ...++|....+..+.+ .+++.... ...+........-|...|. T Consensus 187 --------------------------~~~~vy~~~~i~~~~~-------~~~~~~~~----~~~~~~~~~~~~~~~~~~~ 229 (474) T protein:vir:96 187 --------------------------TKVEYWTAETVTYYVY-------ENGGLIPD----FYYGDEHIQTHFSTGSWER 229 (474) T ss_pred --------------------------eEEEEEeCCeEEEEEE-------cCCceeec----cccccccccCcccccCCCc Confidence 0011122222221111 11111100 0001111111122334566 Q ss_pred CCeeEecccccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc-c---cc--cccCCceeEEeCCCCC Q lcl|NC_015158. 303 APIFHCGWRIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD-V---EE--FVWGPMEQIYINGDGD 376 (581) Q Consensus 303 ~Pf~~~~~~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d-~---~~--i~~~pG~vi~~~~~~~ 376 (581) .|++.+. .+-.|.|..+.++++++.+|.+...+.+.+...++|.+.+.+. . .+ ...+.++++.+.++++ T Consensus 230 vPvv~~~-----nn~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~~ 304 (474) T protein:vir:96 230 VPFIAFK-----NNPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILRGYEGEDLSEFMEGLKYYKAINVSSDGG 304 (474) T ss_pred cceEEec-----CCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcCCCcccccchhhhhhccceeeccCCCc Confidence 7776443 3557999999999999999999999999999999998876541 1 12 1234567888999999 Q ss_pred cccccCCCccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015158. 377 VEMMAPNTQALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLE 456 (581) Q Consensus 377 i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~ 456 (581) +.+++.+.......+.++.+...+-..|++|..+.+.. +++.++.++..++..+........+.|.. +++++++++.+ T Consensus 305 ~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~-~~n~Sg~Alk~~~~~l~~k~~~~~~~~~~-~l~~~~~~i~~ 382 (474) T protein:vir:96 305 VETIQVEVPVASTKEYLDMMRAYIVEFGQGVDFQTDKF-GSATSGIALKFLYTNLNLKANKLKNKANV-ALQELMQFILD 382 (474) T ss_pred eeEEeccCCHHHHHHHHHHHHHHHHHHhCCcCcccccc-ccccHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHH Confidence 99999887777777889999999999999998765432 24566666777888888888888899985 67888988877 Q ss_pred HHHhhcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHHHH Q lcl|NC_015158. 457 ISRRNLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLAKM 536 (581) Q Consensus 457 f~~~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~ 536 (581) +.....+ ..+|...|. ..-.....+.++.+ .+ .++ +|.+.++ T Consensus 383 ~~g~~~d---------------------~~~i~i~f~--~~~p~~~~e~a~~~---~~-------~gi---iS~et~~-- 424 (474) T protein:vir:96 383 FNKIKLD---------------------AKEIEITFN--FNVMVNDLEQSQIG---AQ-------SQY---LSKETLV-- 424 (474) T ss_pred HhCCCcc---------------------cceeeEEec--CCCccCHHHHHHHH---HH-------cCC---CChHHHH-- Confidence 6432211 122333331 11122334444322 11 122 3332232 Q ss_pred HHHHhcCCCcccccCCCCcHHHHHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 537 LEHNLSLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 537 ~~e~~~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) .. ++. .+ ++ ++|.+++..+-.. +..+++-+ T Consensus 425 --~~--lp~----v~--D~--~~E~eri~~E~~~---~~~~~~~~ 454 (474) T protein:vir:96 425 --RH--HPW----VD--DP--KAELERLDEEQLE---LNKQLPNL 454 (474) T ss_pred --Hh--CCC----CC--CH--HHHHHHHHHHHHH---HHhhcccc Confidence 22 121 12 22 3344444332221 12222222 No 99 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=99.61 E-value=7.5e-14 Score=92.54 Aligned_cols=430 Identities=13% Similarity=0.080 Sum_probs=232.8 Q ss_pred Cccc---------hhhhhhhccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhcccc---c--cccccccc---- Q lcl|NC_015158. 1 MTGK---------VLELQQMLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDT---T--TTTNSTLP---- 62 (581) Q Consensus 1 ~~~~---------~~~~~~~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~---~--~~~~~~~~---- 62 (581) |+.+ +-|+=+.++...+.....|.++.+.+.. | .....++++|+...|. + .+...+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~-~---~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~ 76 (474) T protein:vir:95 1 MINIIRMPWDKPYGEEVVEQMKPKVETQEEMIIRLINNHKQ-K---LKDINVGQKYYDKDNDINYQAYKQDLHGNIDYTK 76 (474) T ss_pred CcccccCCCCCCCCcchhhhccccccchHHHHHHHHHHHHH-H---HHHHHHHHHHhcccCccccccchhhhcccccccc Confidence 3332 2233333443344455566666665543 2 2345566677665431 1 01111111 Q ss_pred ccccccccchHHHHHHHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEE Q lcl|NC_015158. 63 WKNKTTLPKLCQIRDNLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFA 142 (581) Q Consensus 63 ~k~~~~~pki~~~~d~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~ 142 (581) -.+++++|-..-+++...++|+ .++- ++.+ ++++..+. +++.+ ..+|.....++.+++.+||.|+. T Consensus 77 ~~~ki~~n~~k~Iv~~~~~yl~----g~p~--~~~~---~~~~~~~~----l~~~~-~n~~~~~~~~l~~~~~~~G~~~~ 142 (474) T protein:vir:95 77 PDWRITTNFHQNLVDQKVSYVA----GKPV--TYAH---DDDKVLDV----IHQVL-DTRWDNKLIDILTAASNKGIDWL 142 (474) T ss_pred cccccccchHHHHHHhhhhhhc----ccCc--eecc---CChHHHHH----HHHHH-hccHHHHHHHHHHHHhhCCeEEE Confidence 1235677777778888877765 3332 3332 23233333 33333 36788899999999999999998 Q ss_pred EEeeecceeeeeeeeeEeeeeccceEEecchhheeecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHH Q lcl|NC_015158. 143 TVEYVKETTKDEESGATRDTYFGPRAVRIDPKDIVFNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRR 222 (581) Q Consensus 143 k~~~~~~~~~~~~~~~~~~~~~~p~ie~V~p~df~~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~ 222 (581) .+++... ..+++..++|.++|+=..-....+.-+++|.+...+. T Consensus 143 ~~~~d~~--------------~~~~i~~~~p~~~~~v~d~~~~~~~~a~ir~~~~~~~---------------------- 186 (474) T protein:vir:95 143 QVYINED--------------GELKLFRVPAEQAIPIWTDKEREQLNAFIRIFTFNGE---------------------- 186 (474) T ss_pred EeeeCCC--------------CceEEEEEcccceEEEEcCCCCCceEEEEEEEeecCe---------------------- Confidence 8865311 1367888999998643322222222222332211000 Q ss_pred hhhccCCcccchhhhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCC Q lcl|NC_015158. 223 EFRRGLGTYTREDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQ 302 (581) Q Consensus 223 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~ 302 (581) ...++|....+..+.+ .+++.... ...+........-|...|. T Consensus 187 --------------------------~~~~vy~~~~i~~~~~-------~~~~~~~~----~~~~~~~~~~~~~~~~~~~ 229 (474) T protein:vir:95 187 --------------------------TKVEYWTAETVTYYVY-------ENGGLIPD----FYYGDEHIQTHFSTGSWER 229 (474) T ss_pred --------------------------eEEEEEeCCeEEEEEE-------cCCceeec----cccccccccCcccccCCCc Confidence 0011122222221111 11111100 0001111111122334566 Q ss_pred CCeeEecccccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc-c---cc--cccCCceeEEeCCCCC Q lcl|NC_015158. 303 APIFHCGWRIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD-V---EE--FVWGPMEQIYINGDGD 376 (581) Q Consensus 303 ~Pf~~~~~~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d-~---~~--i~~~pG~vi~~~~~~~ 376 (581) .|++.+. .+-.|.|..+.++++++.+|.+...+.+.+...++|.+.+.+. . .+ ...+.++++.+.++++ T Consensus 230 vPvv~~~-----nn~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~~ 304 (474) T protein:vir:95 230 VPFIAFK-----NNPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILRGYEGEDLSEFMEGLKYYKAINVSSDGG 304 (474) T ss_pred cceEEec-----CCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcCCCcccccchhhhhhccceeeccCCCc Confidence 7776443 3557999999999999999999999999999999998876541 1 12 1234567888999999 Q ss_pred cccccCCCccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015158. 377 VEMMAPNTQALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLE 456 (581) Q Consensus 377 i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~ 456 (581) +.+++.+.......+.++.+...+-..|++|..+.+.. +++.++.++..++..+........+.|.. +++++++++.+ T Consensus 305 ~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~-~~n~Sg~Alk~~~~~l~~k~~~~~~~~~~-~l~~~~~~i~~ 382 (474) T protein:vir:95 305 VETIQVEVPVASTKEYLDMMRAYIVEFGQGVDFQTDKF-GSATSGIALKFLYTNLNLKANKLKNKANV-ALQELMQFILD 382 (474) T ss_pred eeEEeccCCHHHHHHHHHHHHHHHHHHhCCcCcccccc-ccccHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHH Confidence 99999887777777889999999999999998765432 24566666777888888888888899985 67888988877 Q ss_pred HHHhhcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHHHH Q lcl|NC_015158. 457 ISRRNLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLAKM 536 (581) Q Consensus 457 f~~~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~ 536 (581) +.....+ ..+|...|. ..-.....+.++.+ .+ .++ +|.+.++ T Consensus 383 ~~g~~~d---------------------~~~i~i~f~--~~~p~~~~e~a~~~---~~-------~gi---iS~et~~-- 424 (474) T protein:vir:95 383 FNKIKLD---------------------AKEIEITFN--FNVMVNDLEQSQIG---AQ-------SQY---LSKETLV-- 424 (474) T ss_pred HhCCCcc---------------------cceeeEEec--CCCccCHHHHHHHH---HH-------cCC---CChHHHH-- Confidence 6432211 122333331 11122334444322 11 122 3332232 Q ss_pred HHHHhcCCCcccccCCCCcHHHHHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 537 LEHNLSLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 537 ~~e~~~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) .. ++. .+ ++ ++|.+++..+-.. +..+++-+ T Consensus 425 --~~--lp~----v~--D~--~~E~eri~~E~~~---~~~~~~~~ 454 (474) T protein:vir:95 425 --RH--HPW----VD--DP--KAELERLDEEQLE---LNKQLPNL 454 (474) T ss_pred --Hh--CCC----CC--CH--HHHHHHHHHHHHH---HHhhcccc Confidence 22 121 12 22 3344444332221 12222222 No 100 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=99.60 E-value=4.1e-13 Score=88.50 Aligned_cols=433 Identities=11% Similarity=0.066 Sum_probs=228.4 Q ss_pred Ccc--------chhhhhhhccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhccccc-----cc--cccccccc- Q lcl|NC_015158. 1 MTG--------KVLELQQMLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTT-----TT--TNSTLPWK- 64 (581) Q Consensus 1 ~~~--------~~~~~~~~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~-----~~--~~~~~~~k- 64 (581) |+- ..-++-..++..-+.....|.+..+.+.. | .....++.+|+..-+.- .. .....+++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~-~---~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~ 76 (474) T protein:vir:96 1 MIVIFWPNEKPYHERVVEQIKPKYETQEEMIIRLINDHKP-K---IDDITVGERYYNHDPDVLRLAPKLDNKGEIDPLKP 76 (474) T ss_pred CeeeccCCCchhhhhHHHHhhhccCChHHHHHHHHHHHHH-H---HHHHHHHHHHhccCCcchhccchhccccccccccc Confidence 211 11111111222223344555566654432 2 33444555565443310 00 00111222 Q ss_pred -ccccccchHHHHHHHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEE Q lcl|NC_015158. 65 -NKTTLPKLCQIRDNLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFAT 143 (581) Q Consensus 65 -~~~~~pki~~~~d~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k 143 (581) +++.+|-...+++...++|+ +++- ++.+ ++++..+.++++++ .++.....+..+++.++|.|+.. T Consensus 77 ~~ki~~n~~~~Ivd~~~~~l~----g~p~--~~~~---~d~~~~~~l~~~~~-----n~~~~~~~~~~~~~~~~G~~~~~ 142 (474) T protein:vir:96 77 DWRMFTNYHQNLVDQKVAYAV----ANPV--TFSS---DDDKSLKTIQEVLN-----HKWDDKLVDILTAASNKGIEWLQ 142 (474) T ss_pred chhcccchHHHHHHhhhhhhc----ccCc--eeec---CchHHHHHHHHHHh-----cCHHHHHHHHHHHHHhcCeeEEE Confidence 35667777777777777765 4433 2322 23333444444442 47778888999999999999988 Q ss_pred EeeecceeeeeeeeeEeeeeccceEEecchhheeecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHh Q lcl|NC_015158. 144 VEYVKETTKDEESGATRDTYFGPRAVRIDPKDIVFNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRRE 223 (581) Q Consensus 144 ~~~~~~~~~~~~~~~~~~~~~~p~ie~V~p~df~~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~ 223 (581) +++... ..+++..|+|.++|+--......+.-+++|.+.+.+. . T Consensus 143 ~y~d~~--------------~~~~i~~~~p~~~~~v~d~~~~~~~~~~vr~~~~~~~-----------~----------- 186 (474) T protein:vir:96 143 PYIDEN--------------GEFKTFRVPAEQAIPIWTNKERDTLKAFIRYYRLDGA-----------E----------- 186 (474) T ss_pred EEecCC--------------CceEEEEEcccceEEEEcCCCCCceEEEEEEEeecCc-----------e----------- Confidence 876321 1367888999998643221222233333444322100 0 Q ss_pred hhccCCcccchhhhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeE--EEEEeCCEEEEeecCCCccC Q lcl|NC_015158. 224 FRRGLGTYTREDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMK--VTIIDRMFVIEEKENPSWFA 301 (581) Q Consensus 224 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~--itv~~g~~iir~~~nP~~~g 301 (581) -.++|+.+.+..+.+- ++....... ........++ ..-|.++| T Consensus 187 --------------------------~~~~yt~~~v~~~~~~-------~~~~~~~~~~~~~~~~~~~~~--~~~~~~~g 231 (474) T protein:vir:96 187 --------------------------RVEYWTDSDVTYYEYQ-------DGILIPDYYHGEEHIQSHYYV--GNKRVSWG 231 (474) T ss_pred --------------------------EEEEEeCCeEEEEEec-------CCceeeccccccccccccccc--cccccCCC Confidence 0011122222211110 011000000 0000001111 12244567 Q ss_pred CCCeeEecccccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc----ccc--cccCCceeEEeCC-C Q lcl|NC_015158. 302 QAPIFHCGWRIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD----VEE--FVWGPMEQIYING-D 374 (581) Q Consensus 302 ~~Pf~~~~~~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d----~~~--i~~~pG~vi~~~~-~ 374 (581) +.|++.+.. .-+|.|..+.+.++++.+|.+...+.+.+....+|.+.+.+. ..+ ...+.++++.+.. + T Consensus 232 ~iPvv~~~n-----n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~ 306 (474) T protein:vir:96 232 RVPFIPFKN-----NPQEMSDLFMYKTIIDAMDKRLSDTQNTFDESTELIYILKGYEGQDLDEFMRNLKYYKAINVDGDG 306 (474) T ss_pred ceeEEEecc-----CCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccccchhhhhhcCceEEecCCC Confidence 888876543 457999999999999999999999999999999999877542 122 1334678888874 5 Q ss_pred CCcccccCCCccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015158. 375 GDVEMMAPNTQALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAM 454 (581) Q Consensus 375 ~~i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~ 454 (581) +++++++.+.........++.+...+-+.|++|..+.+.. +++.||.++...+.++-.......+.|.. +++++++++ T Consensus 307 ~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~-~l~~~~~~i 384 (474) T protein:vir:96 307 SGVDTIQIEVPVQSSKEYLDMLRDYVIEFGQGVDFQQDKF-GNSPSGIALKFMYSNLDLKANKLKNKTLT-ALQELLQYI 384 (474) T ss_pred CceeEEeecCChHHHHHHHHHHHHHHHHHhCCcccccccc-ccccHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHH Confidence 6788888776666677778999999999999998765432 34667777777788887777888888985 678899998 Q ss_pred HHHHHhhcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHH Q lcl|NC_015158. 455 LEISRRNLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLA 534 (581) Q Consensus 455 ~~f~~~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~ 534 (581) ++++..+.+.. ++...|. -.......+.++ .+.+ ++.+|.+.++ T Consensus 385 ~~~~~~~~~~~---------------------~i~i~f~--~~~p~~~~e~~~---~~~~----------ag~iS~et~~ 428 (474) T protein:vir:96 385 IDFYKLNIKVQ---------------------DVEITFN--FNVMVNELEQSQ---IGVQ----------SQYLSKETVV 428 (474) T ss_pred HHHhCCCcccc---------------------eeeEEec--cCCCcCHHHHHH---HHHh----------cCCCchHHHH Confidence 88764332211 1222221 111112233332 2211 1234443333 Q ss_pred HHHHHHhcCCCcccccCCCCcHHHHHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 535 KMLEHNLSLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 535 ~~~~e~~~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) +. ++. .+ + .++|.+++.++..... +...++- T Consensus 429 ~~------~~~----v~--d--~~~E~~ri~~E~~e~~--~~~~~~~ 459 (474) T protein:vir:96 429 TN------HPW----VD--D--PVAELERIEQDNIDFN--KQLPPLE 459 (474) T ss_pred Hh------CCC----CC--C--HHHHHHHHHHHHHHHH--hcccccc Confidence 32 222 12 1 2345555544322111 1222222 No 101 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=99.60 E-value=1.1e-13 Score=91.53 Aligned_cols=434 Identities=13% Similarity=0.088 Sum_probs=206.6 Q ss_pred CccchhhhhhhccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhccc-ccccccccccc--cccccccchHHHHH Q lcl|NC_015158. 1 MTGKVLELQQMLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATD-TTTTTNSTLPW--KNKTTLPKLCQIRD 77 (581) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~-~~~~~~~~~~~--k~~~~~pki~~~~d 77 (581) -|.-+.++..|.++. ...|..++..|.+.+ ..+.++..|+.--+ .+..+.+.-+. +...+++-+.-+++ T Consensus 8 ~~~~~~~~~~l~~~e----~~~i~~L~~~~~~~~----~r~~~l~~YY~G~~~i~~~~~~~p~~~~~~~~v~n~~~~iVd 79 (504) T protein:vir:99 8 ASKFTFRIPELNDDV----VDKVNGLYQQLVDRT----PRNLLRASFYDGKYAIRQIGNLIPPEYLRTATVLGWSAKAVD 79 (504) T ss_pred ccccccccCCCCHHH----HHHHHHHHHHHHHHh----HHHHHHHHHHhccccchhccccccHHHHHHhhccCcHHHHHH Confidence 233334444554432 233455555444433 34445555654332 22222211110 11223343344445 Q ss_pred HHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeee Q lcl|NC_015158. 78 NLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESG 157 (581) Q Consensus 78 ~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~ 157 (581) .+..++. +.|+...+.+.. ...+.....+++|.....+..+++.+||.|++.+.-. + ++ T Consensus 80 ~~a~rl~-----------~~Gf~~~d~~~~---~~~l~~i~~~N~ld~~~~~~~~~a~iyG~af~~v~~~-~------d~ 138 (504) T protein:vir:99 80 TLARRCN-----------LESFVWPDGDYG---SIGGPDVWDENFFATKANNAMVSSLIHGPAFLINTEG-G------AG 138 (504) T ss_pred HHHhhhc-----------cceeeCCCCChh---hHHHHHHHHhcChhhHHHHHHHHHHhhCceeEEEecC-C------CC Confidence 4443332 112222121111 1245556678899999999999999999999888421 1 11 Q ss_pred eEeeeeccceEEecchhhe--eecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchh Q lcl|NC_015158. 158 ATRDTYFGPRAVRIDPKDI--VFNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTRED 235 (581) Q Consensus 158 ~~~~~~~~p~ie~V~p~df--~~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 235 (581) ...|.|..+||.+. ++||....+.-+- +.+.. + . T Consensus 139 -----~~~~~I~~~sP~~~~~iyD~~~~~~~~a~---~~~~~--d--------~-------------------------- 174 (504) T protein:vir:99 139 -----EPDSLIHVKSAMQATGEWNSRRNAMDSLL---SITSR--D--------A-------------------------- 174 (504) T ss_pred -----CceeEEEEeccceeEEEEeCCCCceeEEE---EEEEe--c--------C-------------------------- Confidence 11356788999997 5787643322211 11100 0 0 Q ss_pred hhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccccCC Q lcl|NC_015158. 236 CEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIRQD 315 (581) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p~ 315 (581) ++.....++|..+.+ +.+.. .+++ ........|| .| .|++.++..+..+ T Consensus 175 ---------~g~~~~~~~y~~~~~--~~~~~-----~~~~------------~~~~~~~~~~--~g-vPvV~~~n~~~~~ 223 (504) T protein:vir:99 175 ---------EGHPTGIALYEDGVT--VTADM-----DDDG------------DWHADVRTHK--LG-VPVEVLPYKPRED 223 (504) T ss_pred ---------CCeEEEEEEEcCCcE--EEEEE-----cCCc------------eeeeccccCC--CC-cceEEecccccCc Confidence 000000001111111 11111 1111 1111223455 45 6888888888889 Q ss_pred cccCCCcH-HhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEec-cccc-----------cccCCceeEEeCCCCC------ Q lcl|NC_015158. 316 NLYAMGPL-DNLVGMQYRIDHLENLKADVFDLIAFPPMKVKG-DVEE-----------FVWGPMEQIYINGDGD------ 376 (581) Q Consensus 316 ~~~G~s~~-~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~-d~~~-----------i~~~pG~vi~~~~~~~------ 376 (581) ..||.|.. +.++++|+.+|..+..++......++|+..+-+ ++++ .....+++|.+..+.. T Consensus 224 ~~~G~sei~~~v~~l~Da~~~~~~~~~~~~e~~a~p~r~i~G~~~~~~~~~d~~~~~~~~~~~~~i~~~~~~~~~~~~~~ 303 (504) T protein:vir:99 224 RPLGSSRITRPVMSLQQRALKGCIRMDGHADVYSFPQLILLGADAKNFRNKDGSMKPAWQIALARVFALPDDEDEPDAAR 303 (504) T ss_pred cccCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhccCCccccccccccccchhhhhhhhhhcCCCccccccccC Confidence 99999955 589999999999999999999999999865533 1111 1223466776654322 Q ss_pred --cccccCCCccchhHHHHHHHHHHHHH---hcCCchHhcCCCCcc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015158. 377 --VEMMAPNTQALQADMQIQILEAKMEE---FAGAPREAMGIRTPG-EKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKV 450 (581) Q Consensus 377 --i~~~~~p~~~~~~~~~lq~~~~~~ee---~TGv~~~~~G~~~~~-~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~l 450 (581) +...+-+ ..+..+.++.++..+.. .||+|....|..+.. +.+|-++..............-|.|.. .++++ T Consensus 304 ~~~~~~q~~--~~~l~~~~~~l~~~i~~~a~~t~~P~~~lG~~~~~n~sSa~Ai~~~~~~L~~ka~~k~~~f~~-~l~~~ 380 (504) T protein:vir:99 304 ARADVKQFP--ASSPQPHIEMLEQIAMMFSGETSIPVESLGFSNRANPTSADAYIASREDLIAEAEGATDDWSP-AFRRS 380 (504) T ss_pred ccceeeecC--CCChHHHHHHHHHHHHHHHhhhCCCHHHhcccccccccHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHH Confidence 1111111 11233345555555444 599999999976654 567777877777777788888888885 46888 Q ss_pred HHHHHHHHHhhcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHH Q lcl|NC_015158. 451 LNAMLEISRRNLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVST 530 (581) Q Consensus 451 i~~~~~f~~~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~ 530 (581) +++++.+.. +.+. ++.++.+..+.-.-.-....++.+.-+..|.+... ..+.+ T Consensus 381 ~rla~~~~~-~~~~------------------~~~~~~~~~v~w~d~~~~s~a~~aDa~~Kl~~ag~----~l~~~---- 433 (504) T protein:vir:99 381 MIRALAIKN-GLDR------------------IPPEWKTIDSKFRSPLYLSKAAQADAGAKMLGAGP----EWLKE---- 433 (504) T ss_pred HHHHHHHhc-CCCc------------------cccccccceeEecCCCccCHHHHHHHHHHHHhhcc----ccccc---- Confidence 888766532 1110 00111111111111112234444433333322211 01111 Q ss_pred HHHHHHHHHHhcCCCcccccCCCCcHHHHHHHHHHHHHHH--HHHH-HhcccCC Q lcl|NC_015158. 531 ENLAKMLEHNLSLGGWDIFKPNVAVMEAQTTSALVNQSQA--QIEE-EAQVPLV 581 (581) Q Consensus 531 ~~l~~~~~e~~~l~~~~~~~~~~~~~~~~~~q~~~q~aq~--~~~~-~~~~~~~ 581 (581) .+.+.+.+|+.. .+.+....++++++. -+.. ..+.+-- T Consensus 434 ---~~~l~~~lg~~~----------~ei~r~~~e~~~~~~~~~~~~l~~~~~~~ 474 (504) T protein:vir:99 434 ---TEVGLELLGLTP----------QQAKRALAERRRASSVSIIEALNRRQQEA 474 (504) T ss_pred ---hHHHHhhcCCCH----------HHHHHHHHHHHHHhhHHHHHHHhcccCCC Confidence 122223333221 111111111111111 1110 0111111 No 102 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=99.60 E-value=1.7e-13 Score=90.65 Aligned_cols=447 Identities=12% Similarity=0.061 Sum_probs=208.1 Q ss_pred Cccc--------------hhhhhhhccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhccc---cccccc-cccc Q lcl|NC_015158. 1 MTGK--------------VLELQQMLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATD---TTTTTN-STLP 62 (581) Q Consensus 1 ~~~~--------------~~~~~~~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~---~~~~~~-~~~~ 62 (581) |-.+ ..+++.+++... . .+.+......+.|..+ |-...+ .+.... .+.. T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--i---------~~~~~~~~~i~~~~~~--Y~g~~~~~~~~~~~~~~~~~ 67 (499) T protein:vir:80 1 MINQIIAGVKGVMRRMGLLKSLKDVTDHKK--V---------NANDEDYKYIDMWKRL--YQGNYAEWHNLNYEHNGNPV 67 (499) T ss_pred ChhHHHHHHHHHHHHhccccchhhhhcCCC--C---------cCCHHHHHHHHHHHHH--hcCCcchhhccccccCCCcc Confidence 1111 122222222100 0 0111111223334332 211111 111111 1111 Q ss_pred ccccccccchHHHHHHHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEE Q lcl|NC_015158. 63 WKNKTTLPKLCQIRDNLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFA 142 (581) Q Consensus 63 ~k~~~~~pki~~~~d~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~ 142 (581) -++++.++-...+++.+...|+ +..--+++. + +..++++++.|.+.+|...+.+.+.++..+|.+++ T Consensus 68 ~~~~~s~n~~~~iv~~~a~~l~----~ep~~i~~~-----d----~~~~e~l~~~~~~n~f~~~~~~~~~~a~~~G~~~~ 134 (499) T protein:vir:80 68 NRRQLSMNLPKVTAKYMSKLLF----NEKVKINID-----D----ETAEEFVLNVLKTNGFTKNMERYIEYGEAMGGFVI 134 (499) T ss_pred ccceeecchHHHHHHHHHHhhh----CCcceEeeC-----C----HHHHHHHHHHHhhccHHHHHHHHHHHHhhcCcEEE Confidence 1334445544556666665554 333323332 2 34556788888888999999999999999999999 Q ss_pred EEeeecceeeeeeeeeEeeeeccceEEecchhheeec-CCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHH Q lcl|NC_015158. 143 TVEYVKETTKDEESGATRDTYFGPRAVRIDPKDIVFN-PVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARR 221 (581) Q Consensus 143 k~~~~~~~~~~~~~~~~~~~~~~p~ie~V~p~df~~D-P~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~ 221 (581) |+.|... .++++..|+|..|||= -....+..|.|+.+.... ...|... T Consensus 135 ~~~~D~~--------------~~~~i~~v~a~~~~Pi~~d~~~~~~~~f~~~~~~~------------~~~y~~l----- 183 (499) T protein:vir:80 135 KVYHDGN--------------KNVKVSFATADCMYPLSNDSENVDECLIANSFHKN------------NKYYKLL----- 183 (499) T ss_pred EEEECCC--------------CcEEEEEEcCCceEEEEecCCCeEEEEEEEEEeec------------CeEEEEE----- Confidence 9988421 2468899999999862 122455666665433221 0000000 Q ss_pred HhhhccCCcccchhhhhccccccccccccccccCCceEEEEEEeeeee-cccCCceeeeeEEEEEeCCEEEEeecCCC-c Q lcl|NC_015158. 222 REFRRGLGTYTREDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYH-DTQSGTFKRNMKVTIIDRMFVIEEKENPS-W 299 (581) Q Consensus 222 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~-d~~~d~~~e~~~itv~~g~~iir~~~nP~-~ 299 (581) +.++.. ....+ ..+|+ .+.|-... +..+..+ ....+. .+ + . +.-++ . T Consensus 184 ------------E~h~~~--~~~~~---------~y~I~-n~~~~~~~~~~lG~~v---~l~~~~-~~-~-~-~~~~~~~ 232 (499) T protein:vir:80 184 ------------EWNEWK--GEKEE---------VYTVT-TELYQSDDPNELGGKV---SLKLLF-ND-I-E-PVVPLPS 232 (499) T ss_pred ------------EEEEec--cccee---------eEEEE-EEEEeccCccccCccc---chhhhc-cC-c-C-CceeecC Confidence 000000 00000 00011 01110000 0000000 000000 00 0 0 00001 1 Q ss_pred cCCCCeeEeccc----ccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEeccc-c---ccc-------cC Q lcl|NC_015158. 300 FAQAPIFHCGWR----IRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGDV-E---EFV-------WG 364 (581) Q Consensus 300 ~g~~Pf~~~~~~----~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d~-~---~i~-------~~ 364 (581) .+++||.++... ..+++.+|.|+.+.++++++.+|....++.+.+..+-+..+ |..+. . +.. .. T Consensus 233 ~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~~~~~~i~-v~~~~l~~~~~~~g~~~~~~~~ 311 (499) T protein:vir:80 233 LTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKLGKKKVL-VPSSFVKTAVNLDGSTTQYFDS 311 (499) T ss_pred CCccceEeecCCccccccCCCccCCchHhhHHHHHHHHHHHHHHHHHHHHhccccee-cchhhhhccCCCCCCcccCCCc Confidence 356677666431 23578899999999999999999999999999987543333 32110 0 000 01 Q ss_pred CceeEE---eCCCC---CcccccCCCccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHH Q lcl|NC_015158. 365 PMEQIY---INGDG---DVEMMAPNTQALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEK 438 (581) Q Consensus 365 pG~vi~---~~~~~---~i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i 438 (581) .-.+++ ...++ .++.+++.-...+....++.+...++...|++..+-|.++.+.+||+++....+..-.....+ T Consensus 312 ~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~fg~~~~g~~TAtei~s~~~~l~~~~~~~ 391 (499) T protein:vir:80 312 TDEAFFLYQGEQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENGLKTATEVVSEKSETYQTKNSH 391 (499) T ss_pred ccceeeEeeccCCCCcCceeEecCcCChHHHHHHHHHHHHHHHHhcCCChhhcCCCcccchhHHHHHHHHHHHHHHHHHH Confidence 122222 22222 244444443333444567777788888999999999988888899999987777777777778 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcc Q lcl|NC_015158. 439 IMNFEVMLMEKVLNAMLEISRRNLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANT 518 (581) Q Consensus 439 ~r~f~~~~~~~li~~~~~f~~~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~ 518 (581) .+.|. ..+++|++.++.+..-+.- ++. ..+...++...|. .+.. ...+..++.+.++-. T Consensus 392 ~~~~~-~~l~~l~~~il~~~~~~~~-------~~~-------~~~~~~~v~v~f~---d~i~--~d~~~~~~~~~~~~~- 450 (499) T protein:vir:80 392 SQLIE-QGIKEMIVSILEVGKLIKA-------YDG-------DTVELDTITVDFD---DSIA--QDEDTTINRYTTAKN- 450 (499) T ss_pred HHHHH-HHHHHHHHHHHHHHHHhcc-------ccC-------CCCCccceEEEeC---CCCC--CCHHHHHHHHHHHHH- Confidence 88887 4578899998887653221 111 0111233433332 0000 112223333333321 Q ss_pred cccccccchhHHHHHHHHHHHHhcCCCcccccCCCCcHHHHHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 519 PVWQDIKPHVSTENLAKMLEHNLSLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 519 ~~~~~i~p~~~~~~l~~~~~e~~~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) +++.+.. .+ +.+ ..+ +++++.. .++++.+++.+ .++|=- T Consensus 451 ---~Gi~S~e---t~---l~~---~~~---------~~d~ea~-~el~~i~~E~~--~~~~~~ 489 (499) T protein:vir:80 451 ---QGMIPLK---IA---LQR---AWN---------ITEAEAD-EWAEMLAKEKQ--AEIPNN 489 (499) T ss_pred ---cCCCCHH---HH---Hhh---cCC---------CChHHHH-HHHHHHHHHhh--cCCCCC Confidence 1233321 11 212 112 1222222 22222222221 122211 No 103 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=99.60 E-value=5.7e-14 Score=93.20 Aligned_cols=387 Identities=10% Similarity=0.057 Sum_probs=205.6 Q ss_pred hccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhccc-cccccccccc---ccccccccchHHHHHHHHHHHHHh Q lcl|NC_015158. 11 MLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATD-TTTTTNSTLP---WKNKTTLPKLCQIRDNLHSNYISA 86 (581) Q Consensus 11 ~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~-~~~~~~~~~~---~k~~~~~pki~~~~d~~~~~l~~~ 86 (581) |- .++-.+|.++|.. |++ .+.+++.|+.-.+ ....+...-+ -..+..+|=+.-+++.+..++ T Consensus 1 ~~----~~~i~~L~~~~~~----~~~---r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~iVds~a~rl--- 66 (409) T protein:vir:94 1 MT----EKGIGYLRFKLSV----HKR---RAEMRYDQYAMKYVDRFKGITIPQALSQQYRSILGWCAKGVDSLADRL--- 66 (409) T ss_pred CC----HHHHHHHHHHHHH----HhH---HHHHHHHHhcccCchhhcChhhhHHHHHHHhhhcchhHHHHHHhHhhc--- Confidence 33 3344444444441 223 2344445544332 2222221111 112233344444444443322 Q ss_pred hcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeeeeEeeeeccc Q lcl|NC_015158. 87 LFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESGATRDTYFGP 166 (581) Q Consensus 87 ~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~~~~~~~~~p 166 (581) .|.|...+|.+ +.+...++++.....+..+++++||.|++.+.=. + ..+| T Consensus 67 --------~~~Gf~~~d~~--------l~~i~~~N~ld~~~~~~~~~aliyG~sf~~v~~~-~-------------dg~~ 116 (409) T protein:vir:94 67 --------VFREFENDDFT--------VNEIFEENNPDIFFDSAVLSSLIASCSFTYISKG-E-------------NDAV 116 (409) T ss_pred --------ccCcccCCchH--------HHHHHHhcChhHHHHHHHHHHHHhcceeEEEecC-C-------------CCce Confidence 23344444422 4556678899999999999999999999988421 1 1246 Q ss_pred eEEecchhhe--eecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchhhhhcccccc Q lcl|NC_015158. 167 RAVRIDPKDI--VFNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDCEKAVGFSM 244 (581) Q Consensus 167 ~ie~V~p~df--~~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~ 244 (581) .|..+||.++ ++||....+.-+ .+ ++..+ +. T Consensus 117 ~i~~~sp~~~~~i~D~~~~~~~~a---~~--~~~~d---------~~--------------------------------- 149 (409) T protein:vir:94 117 RLQVIEAVNATGIIDPITGLLTEG---YA--VLERD---------EN--------------------------------- 149 (409) T ss_pred EEEEeccceEEEEEecCCCceeee---EE--EEEec---------CC--------------------------------- Confidence 7888999985 678754332211 11 11000 00 Q ss_pred ccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccccCCcccCCCcH- Q lcl|NC_015158. 245 DGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIRQDNLYAMGPL- 323 (581) Q Consensus 245 ~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p~~~~G~s~~- 323 (581) +......++.++ +++.++. .+|. .....|| .|..|.+.+...+..+.+||+|.. T Consensus 150 -~~~~~~~~~~~~--~~~~~~~-----~~~~---------------~~~~~n~--~g~vPvV~f~n~~~~~~~~G~s~I~ 204 (409) T protein:vir:94 150 -NNVVLEAHFLPD--RTDYYYR-----DSRN---------------NISIANP--TGHPLLVPIIHRPDAVRPFGRSRIT 204 (409) T ss_pred -CceEEEEEEecC--cEEEEEe-----cCce---------------eEeeeCC--CCCcceEEeccccccccccCccccc Confidence 000000011111 1112221 1111 1123565 578999999999999999999965 Q ss_pred HhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEec---cccc---cccCCceeEEeCCCC-----CcccccCCCccchhHHH Q lcl|NC_015158. 324 DNLVGMQYRIDHLENLKADVFDLIAFPPMKVKG---DVEE---FVWGPMEQIYINGDG-----DVEMMAPNTQALQADMQ 392 (581) Q Consensus 324 ~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~---d~~~---i~~~pG~vi~~~~~~-----~i~~~~~p~~~~~~~~~ 392 (581) +.++++|+.+|..+-.++......++|+..+.+ |... ....+|++|.+..+. .+..++..+.. +.... T Consensus 205 e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~~~~~~~~~~~~i~~~~~d~dg~~~~v~q~~~~~l~-~~~~~ 283 (409) T protein:vir:94 205 RSGMYWQSNAKRTLERADVTAEFYSFPQKYVTGLSDDAEPMETWKATVSSMLQFTKDEDGDKPTLGQFTQPSMS-PFTEQ 283 (409) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHhcChhheeEecCCCCcccchhhhhHHHhhcCCCCCCCCCceEEecCCCChh-HHHHH Confidence 889999999999999999999999999865543 2222 333468888875332 24444444322 22233 Q ss_pred HHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceeeecC Q lcl|NC_015158. 393 IQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADTIRVFD 472 (581) Q Consensus 393 lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~iR~~~ 472 (581) +..+-..+-..||+|....|.....+-+|-++...+..........-+.|.. .+++++++.+.+... .+.. T Consensus 284 l~~~~~~~a~~t~lP~~~lg~~~~NpsSa~Al~a~~~~L~~~a~~k~~~fg~-~~~~~~rla~~i~~~-~~~~------- 354 (409) T protein:vir:94 284 LRTAAAGFAGETGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGA-GLLNVAYLAACLRDD-APYL------- 354 (409) T ss_pred HHHHHHHHhhhcCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhCC-CCcc------- Confidence 4444455556789999999976643456777776666666676777777875 467888877666321 1100 Q ss_pred chhcccCCCccCHHH-hcCCceEE---EecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHHHHHHHHhcCCCcc Q lcl|NC_015158. 473 SDDKVATFMNVNKDD-ITAKGRLR---PVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLAKMLEHNLSLGGWD 547 (581) Q Consensus 473 ~~~~~~~~~~v~r~d-i~~~~~vv---a~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~~~e~~~l~~~~ 547 (581) +++ ....++-. ..-+... ++.+..+..+.+. +..+ .. ...+.+.+|+..-| T Consensus 355 ------------~~~~~~~~v~W~p~~~~~~~~~---a~~aDa~~Kl~~a--g~~~---~~----~~~~~~~lG~~~~d 409 (409) T protein:vir:94 355 ------------REQFRKTKPKWEPLFEADASML---SLIGDGAIKLNQA--IPEF---IN----KDTIRDLTGIEGGE 409 (409) T ss_pred ------------ccccccceEEeccCCCcchHHH---HHHHHHHHHHHHh--cccc---cc----hhHHHHHcCCCCCC Confidence 011 11111111 1112223 3333344444332 1111 11 13455667777654 No 104 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=99.60 E-value=4.1e-13 Score=88.48 Aligned_cols=430 Identities=14% Similarity=0.112 Sum_probs=224.7 Q ss_pred ccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhcccc---ccc-----------ccccccccccccccchHHHHH Q lcl|NC_015158. 12 LDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDT---TTT-----------TNSTLPWKNKTTLPKLCQIRD 77 (581) Q Consensus 12 ~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~---~~~-----------~~~~~~~k~~~~~pki~~~~d 77 (581) ++ .+.+-..|.+... .++.....+.++++|+.--|. +.. .....+ -+++..|-...+++ T Consensus 1 ~~--~~~~~~~i~~~~~----~~~~~~~~~~~~~~Yy~g~~~I~~~~~~~~~~~~~~~~~~~~~~-~~ki~~n~~k~Iv~ 73 (470) T protein:vir:10 1 ME--LDALKKLIQNTST----SRNDLINNYKQAVNYYENKTDITTRNNGKAKLNKEGKKDPLRSA-DNRIPSNFYQLLVD 73 (470) T ss_pred Cc--hHHHHHHHHHHHH----HHHHHHHHHHHHHHHhccccchhccccchhcccccccccccccC-CcccccchHHHHHH Confidence 33 3444444444444 333445666677777665441 110 000001 13566676667777 Q ss_pred HHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeee Q lcl|NC_015158. 78 NLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESG 157 (581) Q Consensus 78 ~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~ 157 (581) ...++++ +++- ++.. ++++..+.+++++ . .+|.....+..+++.++|.|+..+++.+. T Consensus 74 ~~~~yl~----G~p~--~~~~---~d~~~~~~l~~~~----~-~~~~~~~~~l~~~~~~~G~a~~~~y~d~~-------- 131 (470) T protein:vir:10 74 QEAGYVA----SVFP--DIDV---GKDADNKKIIDVL----G-DDRALTLNGLLVDSSNAGRAWLHYWIDED-------- 131 (470) T ss_pred hhhhhee----ccce--eeec---CchHHHHHHHHHH----h-hhHHHHHHHHHHHHhhcCeeEEEEEecCC-------- Confidence 7777665 4443 2322 2333334444444 3 25677778889999999999998876322 Q ss_pred eEeeeeccceEEecchhheee--cCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchh Q lcl|NC_015158. 158 ATRDTYFGPRAVRIDPKDIVF--NPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTRED 235 (581) Q Consensus 158 ~~~~~~~~p~ie~V~p~df~~--DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 235 (581) ..+++..++|.+.|| |++. ..+.-+++|.+.+..+ . + . T Consensus 132 ------~~~~~~~~~p~~~~~v~d~~~--~~~~~a~ir~y~~~~~-------~--------------------~--~--- 171 (470) T protein:vir:10 132 ------GNFRYGIIQPDQITPIYATTL--DNKLLGILRSYKQLDP-------D--------------------S--G--- 171 (470) T ss_pred ------CceEEEEEcccceEEEEcCCC--CCceEEEEEEEEeeec-------C--------------------C--c--- Confidence 135678899998744 4331 1222223344333110 0 0 0 Q ss_pred hhhccccccccccccccccCCceEEEEEEeeeeecccCCceeee-eEEEEEeCC---EEEEeecCCCccCCCCeeEeccc Q lcl|NC_015158. 236 CEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRN-MKVTIIDRM---FVIEEKENPSWFAQAPIFHCGWR 311 (581) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~-~~itv~~g~---~iir~~~nP~~~g~~Pf~~~~~~ 311 (581) ......+.|+.+.+..+..-+ ......+. ..++...+. .......-|..+|+.|++.+. T Consensus 172 ----------~~~~~~e~yt~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-- 234 (470) T protein:vir:10 172 ----------KYFTVHEYWTDKEAQFFRTNA-----TDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFS-- 234 (470) T ss_pred ----------eEEEEEEEEcCCcEEEEEeec-----CcceeccccccccccccccccccccccccccCCCeeeEEEee-- Confidence 000001112222222111100 00000000 000000000 001111223345677776544 Q ss_pred ccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc----ccc-cc-cCCceeEEeCC-----CCCcccc Q lcl|NC_015158. 312 IRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD----VEE-FV-WGPMEQIYING-----DGDVEMM 380 (581) Q Consensus 312 ~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d----~~~-i~-~~pG~vi~~~~-----~~~i~~~ 380 (581) .+-+|.|..+.+.++++.+|.+...+.|++...++|.+.+.+. ..+ .. ...++.+.+.. ++++.++ T Consensus 235 ---nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lvl~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~l 311 (470) T protein:vir:10 235 ---KNKYRLPELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGADLHQFMNDLRKYKSIKINNTGNGDNSGVDKL 311 (470) T ss_pred ---cCCCCCCchhHHHHHHHHHHHHHHHHHHHHHHhcCcceeeecCCccccchhhhhhhhcCeEeccCCCCCcCceeEEE Confidence 3457999999999999999999999999999999999987652 111 22 22445555553 3457788 Q ss_pred cCCCccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_015158. 381 APNTQALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRR 460 (581) Q Consensus 381 ~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~ 460 (581) ..+.........++.+...+-+.+++|..+.+. .++.|+.++..++.++........+.|.. +++++++++.+++.. T Consensus 312 t~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~--~gn~Sg~Alk~~~~~l~~k~~~~~~~~~~-~l~~~~~~i~~~l~~ 388 (470) T protein:vir:10 312 QIDIPVEARDDALKITRKNIFLFGQGIDPANFE--SSNASGVAIKMLYSHLELKAAKTQTYFEH-AINELVRAIMRYLNF 388 (470) T ss_pred eecCChHHHHHHHHHHHHHHHHHhCCCCCCccc--cccchHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhcc Confidence 877666667788899999999999999876543 35777777888899999888999999996 578889988876521 Q ss_pred hcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHHHHHHHH Q lcl|NC_015158. 461 NLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLAKMLEHN 540 (581) Q Consensus 461 n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~~~e~ 540 (581) . +. .-.++...| ...-.....+.++.++.+ . ++ +|.+.++++ T Consensus 389 ~----------~~----------d~~~i~i~f--~~~~p~d~~e~~~~~~~~---~------g~---iS~et~l~~---- 430 (470) T protein:vir:10 389 S----------DA----------DKRHISQHW--TRTKVEDSLTKAQIVSTV---A------NY---SSKEAVAKA---- 430 (470) T ss_pred c----------Cc----------ccceeeEEe--ccCCCCCHHHHHHHHHHH---h------cc---CcHHHHHHh---- Confidence 1 00 012333333 122222344444433332 1 12 343333222 Q ss_pred hcCCCcccccCCCCcHHHHHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 541 LSLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 541 ~~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) ++. .. ++ ++|.+++..+-++.....++.+=+ T Consensus 431 --~p~----v~--D~--~~E~eri~~E~~e~~~~~~~~~~~ 461 (470) T protein:vir:10 431 --NPI----VD--DW--QQELKDLAKDKEENDPYSNQADEL 461 (470) T ss_pred --CCC----CC--CH--HHHHHHHHHHHHHHHHhhcccccc Confidence 121 12 22 334454444322222211111111 No 105 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=99.59 E-value=9.7e-14 Score=91.93 Aligned_cols=424 Identities=13% Similarity=0.078 Sum_probs=223.5 Q ss_pred ccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhcccccc--------cccccccc---------cccccccchHH Q lcl|NC_015158. 12 LDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTTT--------TTNSTLPW---------KNKTTLPKLCQ 74 (581) Q Consensus 12 ~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~~--------~~~~~~~~---------k~~~~~pki~~ 74 (581) ++ ++.+...|..+...... | ...+.++++|+.--|..- .+.....+ -+++.+|-... T Consensus 1 ~~--~e~~~~~i~~~~~~~~~-~---~~~~~~~~~Yy~g~hdi~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~ 74 (471) T protein:vir:10 1 ME--IEVIKKIISSQMVKHGK-F---VSQAAEAEKYYRNENDIKRKRKPADKKGAENEAKAEDNAFRNADNRISHNWHQL 74 (471) T ss_pred CC--HHHHHHHHHHHHHHHHH-H---HHHHHHHHHHhccccccccccchhhhhcccccccccccccccccceeccchhHH Confidence 43 56666666666654432 2 335666777775543210 00000000 12455666666 Q ss_pred HHHHHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeee Q lcl|NC_015158. 75 IRDNLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDE 154 (581) Q Consensus 75 ~~d~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~ 154 (581) +++..+++++ +++- ++.+ ++++..+ .++..+. .+|.....++.+++.++|.|+..+.+..+ T Consensus 75 Ivd~~~~yl~----G~p~--~~~~---~~~~~~~----~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~----- 135 (471) T protein:vir:10 75 LLDQKKAYAL----TYPP--TFDV---DDKKVND----MIVDVLG-DDYERISKQLCVNAGNAGIAWLHVWKDAS----- 135 (471) T ss_pred HHHhhhhhhc----ccCc--eecc---CChHHHH----HHHHHHh-cCHHHHHHHHHHHHhhCCeEEEEEEeeCC----- Confidence 7777676664 3332 2222 2222233 3444443 67888899999999999999988865321 Q ss_pred eeeeEeeeeccceEEecchhhe--eecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCccc Q lcl|NC_015158. 155 ESGATRDTYFGPRAVRIDPKDI--VFNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYT 232 (581) Q Consensus 155 ~~~~~~~~~~~p~ie~V~p~df--~~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~ 232 (581) ...+++..++|.++ ++|+.... ..-+++|.+.+..+.. .... T Consensus 136 --------~g~~~~~~~~p~~~~~i~d~~~~~--~~~~~ir~~~~~~~~~------~~~~-------------------- 179 (471) T protein:vir:10 136 --------DNSFRYACVDSKEVIPIYSKSLDK--KSIGVLRVYSSIDETD------GKNY-------------------- 179 (471) T ss_pred --------CCeeEEEEEcccceEEEEcCCCCC--ceEEEEEEEEeeccCC------Ccee-------------------- Confidence 11467888999996 44544211 1222334433321100 0000 Q ss_pred chhhhhccccccccccccccccCCceEEEEEEeeeeecccCCcee-------eeeEEEEEeCCEEEEeecCCCccCCCCe Q lcl|NC_015158. 233 REDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFK-------RNMKVTIIDRMFVIEEKENPSWFAQAPI 305 (581) Q Consensus 233 ~~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~-------e~~~itv~~g~~iir~~~nP~~~g~~Pf 305 (581) . ..+.|+...+. .|.+ .+++.. .........|. ......-|...|..|+ T Consensus 180 -~---------------~~~vy~~~~~~--~y~~-----~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~g~iPv 235 (471) T protein:vir:10 180 -T---------------VYEYWNDKECS--FYRH-----EKEKPLEELETFQAISLIDTMNGD-RSSDNSFKHDFGLVPF 235 (471) T ss_pred -E---------------EEEEEeCCcEE--EEEe-----cCCccccccccccccccccccccc-ccccccccCCCCceeE Confidence 0 00001111111 1110 000000 00001111111 1222222334577787 Q ss_pred eEecccccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc----ccc-c-ccCCceeEEeCCC----- Q lcl|NC_015158. 306 FHCGWRIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD----VEE-F-VWGPMEQIYINGD----- 374 (581) Q Consensus 306 ~~~~~~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d----~~~-i-~~~pG~vi~~~~~----- 374 (581) +.+. .+-.|.|..+.+.++++.+|.+...+.|++...++|.+.+.+. ..+ . ....++.+.+... T Consensus 236 v~~~-----n~~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~ 310 (471) T protein:vir:10 236 IPFK-----NNEIETNDLKPIKDLVDVYDKVFSGFVNDTDDVQEVIFVLTNYGGQDKQEFLEDLKRYKMIKMDNDGMGDQ 310 (471) T ss_pred EEec-----cCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccchhHHHhhcCCeEEecCCCCccC Confidence 6553 3557899999999999999999999999999999999887652 111 1 1124566666533 Q ss_pred CCcccccCCCccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015158. 375 GDVEMMAPNTQALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAM 454 (581) Q Consensus 375 ~~i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~ 454 (581) +++.++..+....++...++.+.+.+-+.+++|..+.+. .++.|+.++..++..+........+.|.+ +++++++++ T Consensus 311 ~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~tp~~~~~~--~gn~Sg~Alk~~~~~l~~k~~~~~~~~~~-~l~~~~~li 387 (471) T protein:vir:10 311 SGVTTIAIDIPTEARNLILERTKKQIFISGQGVNPETDK--LGNSSGVALKFLYSLLELKAGNMETQFRS-GYATLVKMI 387 (471) T ss_pred ccceEEeecCChHHHHHHHHHHHHHHHHHhCCcCCCccc--ccCccHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHH Confidence 367778777666667778899999999999999876653 34667777888888888888888899986 568888888 Q ss_pred HHHHHhhcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHH Q lcl|NC_015158. 455 LEISRRNLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLA 534 (581) Q Consensus 455 ~~f~~~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~ 534 (581) ..++.. .+. .++...|. -.-.....+.++.++.| . .+ +|.+.++ T Consensus 388 ~~~~~~-~d~---------------------~~i~i~f~--~~~p~n~~e~~~~~~kl---~------g~---iS~et~~ 431 (471) T protein:vir:10 388 LKHLGL-SDK---------------------LKIKQTWT--RNSINNDTEMAQVVSTL---A------TI---TSRENVA 431 (471) T ss_pred HHHhcc-CCC---------------------ceeEEEeC--CCCCCCHHHHHHHHHHH---h------cc---CchHHHH Confidence 877532 111 12222221 11122334444333332 1 12 3433332 Q ss_pred HHHHHHhcCCCcccccCCCCcHHHHHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 535 KMLEHNLSLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 535 ~~~~e~~~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) .+ ++. .. ++ ++|.+++..+.++. .++.+-+ T Consensus 432 ~~------~p~----v~--D~--~~E~eri~~E~~~~---~~~~~~~ 461 (471) T protein:vir:10 432 KS------NPI----VE--DW--QDELRLQKAEQEGR---SEKLYDM 461 (471) T ss_pred Hh------CCC----CC--CH--HHHHHHHHHHHHHH---Hhccccc Confidence 22 222 12 22 34455544433222 2223333 No 106 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=99.58 E-value=6.9e-13 Score=87.25 Aligned_cols=438 Identities=11% Similarity=0.079 Sum_probs=219.9 Q ss_pred CccchhhhhhhccchhhhH-HHHHHHHHhhHHhhhhhHHHHHHHHHHHhhccc-cc----ccccccccccccccccchHH Q lcl|NC_015158. 1 MTGKVLELQQMLDDTRDGL-AEQIANTWQNWNSQRQEWLSQKSELRNYIFATD-TT----TTTNSTLPWKNKTTLPKLCQ 74 (581) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~-a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~-~~----~~~~~~~~~k~~~~~pki~~ 74 (581) |+..=. ..+ +....+ ...|.++-+++...|. ..+.++++|+.-.+ .. +....+. -+++.+|-..- T Consensus 1 ~~~~~~--~~~--~~~~~~~~~~~~~~i~~~~~~~~---~r~~~~~~yy~g~~~i~~~~~~~~~~~~--~~ki~~n~~~~ 71 (489) T protein:vir:99 1 MLQEDF--EAI--DYESKLWIDQLKNYISRFKAEQL---ERLKELKRYYLGDNNIKYRPAKTDKYAA--DNRIASDFAKY 71 (489) T ss_pred CCccce--eee--CCCCCCCHHHHHHHHHHHHHHHH---HHHHHHHHHhcccCccccccccccccCC--cceeecchHHH Confidence 222100 000 001111 2334444444444433 34566666665333 11 1111111 23577787788 Q ss_pred HHHHHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeee Q lcl|NC_015158. 75 IRDNLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDE 154 (581) Q Consensus 75 ~~d~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~ 154 (581) +++...++++ +++- ++.+ +++ ..++.+++.+..++|...+..+.+++.+||.|+..+.+.... T Consensus 72 iv~~~~~~l~----g~~~--~~~~---~d~----~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~~---- 134 (489) T protein:vir:99 72 ITVFEQGYML----GVPV--EYKN---ENK----DLQAAIDLMSVRNNEDYHNVKIKTDLSIYGRAYELLTVEKID---- 134 (489) T ss_pred HHHHHhhhhc----cCCc--eeec---CCh----hHHHHHHHHHhhcChhHHHHHHHHHHhhCCeEEEEEeeccCc---- Confidence 8888877775 3332 2332 222 234567777777899999999999999999998888653211 Q ss_pred eeeeEeeeeccceEEecchhhee--ecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCccc Q lcl|NC_015158. 155 ESGATRDTYFGPRAVRIDPKDIV--FNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYT 232 (581) Q Consensus 155 ~~~~~~~~~~~p~ie~V~p~df~--~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~ 232 (581) .....+++..|+|.++| +|+... .+.-+.+|.+.+.. .+ + T Consensus 135 ------d~~~~~~i~~~~p~~~~~v~dd~~~--~~~~~~i~~~~~~~---------~~------------------~--- 176 (489) T protein:vir:99 135 ------DKKTEVKLYQLPAEQTFVIYDDTYQ--RNSLMAVHFYDIDY---------GS------------------G--- 176 (489) T ss_pred ------CCCcceEEEEEcccceEEEEcCCCC--CceEEEEEEEEEec---------CC------------------C--- Confidence 12234678899999974 333321 22222333332210 00 0 Q ss_pred chhhhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccc Q lcl|NC_015158. 233 REDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRI 312 (581) Q Consensus 233 ~~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~ 312 (581) +.....+.|..+.+..+++.+ .+. .+-.+.....| +.|..|++.+.. T Consensus 177 -------------~~~~~~~~y~~~~i~~~~~~~-------~~~---------~~~~~~~~~~~--~~g~vPvv~~~n-- 223 (489) T protein:vir:99 177 -------------KRKQIIKAYTSDTIYTYEDYN-------LET---------KGMRLKDYEGH--FFKGVPVNEYAN-- 223 (489) T ss_pred -------------ceEEEEEEEeCCcEEEEEecC-------CCc---------ccceecccccc--cCCceeEEEeec-- Confidence 000011112222222211110 000 01112222233 457788775543 Q ss_pred cCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecccc------c----cc------------cCCceeEE Q lcl|NC_015158. 313 RQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGDVE------E----FV------------WGPMEQIY 370 (581) Q Consensus 313 ~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d~~------~----i~------------~~pG~vi~ 370 (581) .-+|.|..+.+.++++.+|.+...+.+++...++|++.+.+... . .. ...++++. T Consensus 224 ---~~~~~s~~~~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 300 (489) T protein:vir:99 224 ---NEERTGAYESVLDNIDAYDLSQSELANFQQDSVNALLVIAGNAYTGADENDYLDDGRLNPNGRLAISIGFKKAQVLI 300 (489) T ss_pred ---CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhhccCCcccccchhhhhhcccccccccccccccccceeee Confidence 34799999999999999999999999999999999887654210 0 01 11233333 Q ss_pred eCCCC-------CcccccCCCccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015158. 371 INGDG-------DVEMMAPNTQALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFE 443 (581) Q Consensus 371 ~~~~~-------~i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~ 443 (581) +.+.+ ++.++..+.........+..+...+-+.||+|..+.+. ..++.|+.++...+.+.........+.|. T Consensus 301 ~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~ 379 (489) T protein:vir:99 301 LDDNPNPNGVKPQAYFLKKEYDTAGSEAYKNRLVADILRFTFTPDTQDMK-FSGVQSGESMKYKLMASDNYREKQERLFK 379 (489) T ss_pred eccccCccccccceeeeeecCChHHHHHHHHHHHHHHHHHhCCccccccc-ccccchHHHHHHHHHHHHHHHHHHHHHHH Confidence 33322 23445544444455566788888899999999765432 23456776676677777777788888888 Q ss_pred HHHHHHHHHHHHHHHHhhcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhccccccc Q lcl|NC_015158. 444 VMLMEKVLNAMLEISRRNLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQD 523 (581) Q Consensus 444 ~~~~~~li~~~~~f~~~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~ 523 (581) .+++++++++.+++..... +. .......++...| ...-.....+.++ .+.++. . T Consensus 380 -~~l~~~~~li~~~~~~~~~---------~~-----~~~~~~~~i~v~f--~~~~p~d~~~~~~---~~~kl~------g 433 (489) T protein:vir:99 380 -KGLMRRLRLAANIWAIKGN---------EA-----TTYSLVNDTSIVF--TPNLPQNDNEIVT---AAQNLY------G 433 (489) T ss_pred -HHHHHHHHHHHHHHhhcCC---------cc-----ccccccccceEEe--CCCCCcCHHHHHH---HHHHHh------c Confidence 4678888988877533210 00 0000111233333 1111223344443 333322 1 Q ss_pred ccchhHHHHHHHHHHHHhcCCCcccccCCCCcHHHHHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 524 IKPHVSTENLAKMLEHNLSLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 524 i~p~~~~~~l~~~~~e~~~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) +.|+ +.++++ +++. ...++ ++|.+++..+. .+.+...|.+.. T Consensus 434 iis~---et~~~~------l~~v----~~~d~--~~E~~ri~~E~-~~~~~~~~~~~~ 475 (489) T protein:vir:99 434 IVSD---QTIFEI------LNTV----TGVDA--EAELKRLKEEA-DKKQSLPEPRLV 475 (489) T ss_pred cCCH---HHHHHh------cCCC----CchhH--HHHHHHHHHHH-HHHhcccccccc Confidence 3333 223222 2332 22233 33344433321 122222344444 No 107 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=99.58 E-value=6e-13 Score=87.58 Aligned_cols=426 Identities=11% Similarity=0.085 Sum_probs=220.3 Q ss_pred ccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhcccc-cc--------cccccccccccccccchHHHHHHHHHH Q lcl|NC_015158. 12 LDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDT-TT--------TTNSTLPWKNKTTLPKLCQIRDNLHSN 82 (581) Q Consensus 12 ~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~-~~--------~~~~~~~~k~~~~~pki~~~~d~~~~~ 82 (581) |+ +..|.++.+.+.. |.+.. .++++|+..-+. -. ..+.+..-.+++..|....+++...++ T Consensus 1 l~------~~~i~~~i~~~~~-~~~r~---~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~y 70 (451) T protein:vir:10 1 ME------LEKIRAIISADAA-RRQEI---LQAKSYYYNKNDILKKGVVVQNRDENPLRNADNRISHNFHEILVDEKASY 70 (451) T ss_pred CC------HHHHHHHHHHHHH-HHHHH---HHHHHHhcccCccccccccccccccccccccccccccchHHHHHHhhhhh Confidence 32 2344444444443 33333 334445443321 00 011111112356677777778877776 Q ss_pred HHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeeeeEeee Q lcl|NC_015158. 83 YISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESGATRDT 162 (581) Q Consensus 83 l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~~~~~~ 162 (581) ++ .++- ++.. .++++..+.++ ..+ ..++.....++.+++.+||.|+..+++.+.... .... T Consensus 71 l~----G~p~--~~~~--~~~~~~~~~~~----~~~-~n~~~~~~~~~~~~~~~~G~a~~~~y~de~~~~------~~~~ 131 (451) T protein:vir:10 71 MF----TYPV--LFDI--DNNKELNEKVT----DVL-GNEFTRKAKNLAIEASNCGSAWLHYWIDEEYSG------EQVT 131 (451) T ss_pred ee----cccc--eeec--CCcHHHHHHHH----HHh-ccCHHHHHHHHHHHHhhcCeEEEEEeecCCccc------cccc Confidence 64 3332 2221 22333334433 333 357888899999999999999988766432211 1111 Q ss_pred eccceEEecchhheee--cCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchhhhhcc Q lcl|NC_015158. 163 YFGPRAVRIDPKDIVF--NPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDCEKAV 240 (581) Q Consensus 163 ~~~p~ie~V~p~df~~--DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~ 240 (581) ....++..++|.++|+ |.+.. .+..+++|.+....+.. + ...... T Consensus 132 ~~~~~~~~i~p~~~~~vydd~~~--~~~~~~ir~~~~~~~~~---------~-----------------~~~~~~----- 178 (451) T protein:vir:10 132 NQTFKYGVVNTEEIIPIYRNGIE--RELEAVIRYYIQLEDVK---------G-----------------QIQKQA----- 178 (451) T ss_pred ccceeEEEEcccceEEEEcCCCC--CceEEEEEEEEeeeccc---------c-----------------cccceE----- Confidence 1235678899999754 43221 12233444443211100 0 000000 Q ss_pred ccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccccCCcccCC Q lcl|NC_015158. 241 GFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIRQDNLYAM 320 (581) Q Consensus 241 ~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p~~~~G~ 320 (581) ....+.++...+..+.+. +++ ..++.+.. ..-|-..|..|++.+. .+-.|. T Consensus 179 -------~~~~e~yt~~~~~~~~~~-------~~~---------~~~~~~~~-~~~~~~~g~vPvv~~~-----nn~~~~ 229 (451) T protein:vir:10 179 -------YTYVEFWTDKILDKYKFF-------GVS---------CCGSQIEH-ITVQHRFNSVPFVEFS-----NNIKKQ 229 (451) T ss_pred -------EEEEEEEeCCeEEEEEec-------ccC---------cccccccc-ccccCCCCeeeEEEec-----cCCCCC Confidence 000011111111111110 000 01121111 1112235667766443 345689 Q ss_pred CcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc----cccc--ccCCceeEEeC-----CCCCcccccCCCccchh Q lcl|NC_015158. 321 GPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD----VEEF--VWGPMEQIYIN-----GDGDVEMMAPNTQALQA 389 (581) Q Consensus 321 s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d----~~~i--~~~pG~vi~~~-----~~~~i~~~~~p~~~~~~ 389 (581) |..+.+.++++.+|.+...+.+++.-.++|.+.+.+- ..+. ..+.++++... .++++.++..+....++ T Consensus 230 ~d~e~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~l~~~~~~~~~ 309 (451) T protein:vir:10 230 SDLSKYKKILDLYDRVMSGFANDLEDIQQIIYILENFGGEDTSEFLKELKRYKTIKTETDSEGDSGGLKTMQIEIPTEAR 309 (451) T ss_pred CchhhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccchhhHHHHhhCCeEEecCcCCccCCcceEEeecCCHHHH Confidence 9999999999999999999999999999999877541 1111 12344555554 34678888877667777 Q ss_pred HHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceee Q lcl|NC_015158. 390 DMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADTIR 469 (581) Q Consensus 390 ~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~iR 469 (581) ...+..+...+-+.|++|..+.+. .++.|+.++..++...........+.|.. +++++++++.+++... + T Consensus 310 ~~~~~~l~~~I~~~s~~p~~~~~~--~gn~Sg~Alk~~~~~l~~k~~~k~~~f~~-~l~~~~~li~~~~~~~-d------ 379 (451) T protein:vir:10 310 KIILEILKKQIYESGQGLQQDTEN--FGNASGVALKFFYRKLELKSGLLETEFRT-SFDKLIKAILYFLGVT-D------ 379 (451) T ss_pred HHHHHHHHHHHHHHhCcccccccc--cccccHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhCCC-C------ Confidence 888999999999999999765542 34667777777888888888888899996 5688888888775321 1 Q ss_pred ecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHHHHHHHHhcCCCcccc Q lcl|NC_015158. 470 VFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLAKMLEHNLSLGGWDIF 549 (581) Q Consensus 470 ~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~~~e~~~l~~~~~~ 549 (581) ..+++..| .-.-.....+.++.++.| . .+ +|.+.++++ ++ T Consensus 380 ---------------~~~i~i~f--~~~~p~n~~e~~~~~~kl---~------g~---iS~et~~~~------~p----- 419 (451) T protein:vir:10 380 ---------------YKKIQQTY--TRNMMSNDLEDADIATKS---V------GI---IPTKIILRH------HP----- 419 (451) T ss_pred ---------------ccceeEEe--cCCCCCCHHHHHHHHHHH---h------cc---CchHHHHHh------CC----- Confidence 12233232 111122334444433333 1 12 333333322 22 Q ss_pred cCCCCcHHHHHHHHH-HHHHHHHHHHHhcccCC Q lcl|NC_015158. 550 KPNVAVMEAQTTSAL-VNQSQAQIEEEAQVPLV 581 (581) Q Consensus 550 ~~~~~~~~~~~~q~~-~q~aq~~~~~~~~~~~~ 581 (581) +.+-.++ +.+++ .++..+..+....++=+ T Consensus 420 --~v~d~~~-e~~~~~ee~~~~~~~~~~~~~~~ 449 (451) T protein:vir:10 420 --WVDDVEE-AEKLYLEEKKIQASKVSDDYNNF 449 (451) T ss_pred --CCCCHHH-HHHHHHHHHHHHHHHHHhhcCCC Confidence 2221222 33333 33333333344444444 No 108 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=99.57 E-value=4.4e-13 Score=88.31 Aligned_cols=429 Identities=10% Similarity=0.058 Sum_probs=208.4 Q ss_pred hccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhccc-ccccccc-cccccc---cccccchHHHHHHHHHHHHH Q lcl|NC_015158. 11 MLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATD-TTTTTNS-TLPWKN---KTTLPKLCQIRDNLHSNYIS 85 (581) Q Consensus 11 ~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~-~~~~~~~-~~~~k~---~~~~pki~~~~d~~~~~l~~ 85 (581) |...+-++.+++|.+.++ . +.+ .+.+++.|+...+ ....+.+ +...|+ ++.++-..-+++.+..+++. T Consensus 1 ~~~~t~~~~~~~l~~~~~---~-~~~---r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~ 73 (456) T protein:vir:79 1 MTASTPAEWLPVLTKRID---D-GMS---RVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIP 73 (456) T ss_pred CCCCCHHHHHHHHHHHHH---H-HHH---HHHHHHHHHhccCChhhcCcccChhhchhhhhhhcchHHHHHHHHHhhhcc Confidence 655666777777777766 2 222 3445566665543 3222221 222222 23455556667777666642 Q ss_pred hhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeeeeEeeeecc Q lcl|NC_015158. 86 ALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESGATRDTYFG 165 (581) Q Consensus 86 ~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~~~~~~~~~ 165 (581) +. |++.. ..|.+..+. +...+.+++|.....+.++++.+||.|+..++.. + ... T Consensus 74 ----~g--~~~~~--~~d~~~~~~----~~~~~~~n~~d~~~~~~~~~a~~~G~a~~~~~~~-e-------------dg~ 127 (456) T protein:vir:79 74 ----NG--ITVGG--SADSDLALR----ARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRR-D-------------DGT 127 (456) T ss_pred ----CC--eecCC--CCCccHHHH----HHHHHHhcChhHHHHHHHHHHhhcCeeEEEEeeC-C-------------CCc Confidence 21 12211 122222222 3344566889999999999999999998766432 1 123 Q ss_pred ceEEecchhhe--eecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchhhhhccccc Q lcl|NC_015158. 166 PRAVRIDPKDI--VFNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDCEKAVGFS 243 (581) Q Consensus 166 p~ie~V~p~df--~~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~ 243 (581) |++..++|+++ .+||.... .....+|...+.++ ... T Consensus 128 ~~i~~~~p~~~~~i~d~~~~~--~~~~~~~~~~~~d~----------~~~------------------------------ 165 (456) T protein:vir:79 128 ATITADSPETMVVSVDPLQPW--RIRSAMRWWRDLDA----------ESD------------------------------ 165 (456) T ss_pred eEEEEeccceeEEEEcCCCCC--ceEEEEEEEEecCC----------cee------------------------------ Confidence 67889999996 44554322 11122222222100 000 Q ss_pred cccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccccCCcccCCCcH Q lcl|NC_015158. 244 MDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIRQDNLYAMGPL 323 (581) Q Consensus 244 ~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p~~~~G~s~~ 323 (581) ....|.....+..+.+| ..+... . ....+..++......+-|..++.+|+..+ .+..|.|.. T Consensus 166 -----~~~~~~~~~~~~~~~~~-~~~~~~-~-----~~~~~~~~~~~~~~~~~~~~~~~~pvv~~------~N~~~~gd~ 227 (456) T protein:vir:79 166 -----FAIVWSGDGWQKFARPC-FVQSSS-R-----RRLVTRISDSWVPVGDAVVTGSPPPVVVY------QNPDGMGEV 227 (456) T ss_pred -----EEEEEcCCceEEEEEEE-Eeeccc-c-----ceeeeccCCceeecccccCCCCceeEEEe------cCCCCCchh Confidence 00000111111111111 111100 0 01111112222222223434677776533 246789999 Q ss_pred HhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEeccc----------------cccccCCceeEEeCCCCCcccccCCCccc Q lcl|NC_015158. 324 DNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGDV----------------EEFVWGPMEQIYINGDGDVEMMAPNTQAL 387 (581) Q Consensus 324 ~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d~----------------~~i~~~pG~vi~~~~~~~i~~~~~p~~~~ 387 (581) +.++++++.+|.....+++.+...+.|+..+.+.. ..+...+|.+|..+.+..+..++..+. . T Consensus 228 e~v~~liD~~~~~~s~~~~~~~~~a~~~~~~~G~~~~~~~~d~~g~~i~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~-~ 306 (456) T protein:vir:79 228 EPHIDIINRINRAELQLLSTMAIQAFRQRALKSSEHRLPKVDENGNAIDYASIFEAAPGALWELPPGVDIWESQTNDF-T 306 (456) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcCCcccccccccccccchhhhhhhhccccccCCCCcceeeecccCh-H Confidence 99999999999999999999999999876553310 012234788888777776665554322 2 Q ss_pred hhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccce Q lcl|NC_015158. 388 QADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADT 467 (581) Q Consensus 388 ~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~ 467 (581) .....+..+-..+-..||+|...-|... +|.++.++...............+.|.. .+++++++++.+.. + ++ T Consensus 307 ~~~~~l~~~i~~i~~~t~~p~~~~~~~~-~N~Sg~Al~~~~~~l~~k~~~~~~~f~~-~l~~~~~l~~~~~g---~-~~- 379 (456) T protein:vir:79 307 PMLSAIKEHIRQLSSATKTPLPMLMPDS-ANQSAEGAHNIEKGFLFKCEDRLSIAKI-GLEAILVKALQIEG---E-SV- 379 (456) T ss_pred HHHHHHHHHHHHHHhhcCCChhHhcccc-cCcHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhcC---C-Cc- Confidence 2333455555566677889988877422 3557777777777777777788888885 57888888876521 1 10 Q ss_pred eeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHHHHHHHHhcCCCcc Q lcl|NC_015158. 468 IRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLAKMLEHNLSLGGWD 547 (581) Q Consensus 468 iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~~~e~~~l~~~~ 547 (581) ...++..+. ..-....++.++-+..|.+ +++.+.. .+ ..+.++. T Consensus 380 -----------------~~~i~v~w~--~~~~~s~~~~ada~~kl~~-------~G~~~~~-------~~---~~~lg~~ 423 (456) T protein:vir:79 380 -----------------EDTVDVSFE--SPDRVTLGEKYSAASLAKA-------AGESWAS-------IR---RNILNYN 423 (456) T ss_pred -----------------cccceEEeC--CCCCcCHHHHHHHHHHHHh-------cCCChHH-------HH---HhcCCCC Confidence 011222221 1111233444433333321 1232211 11 1233431 Q ss_pred cccCCCCcHHHHHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 548 IFKPNVAVMEAQTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 548 ~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) + .++ ++.+.++..++..+....-+|.|=- T Consensus 424 ---~-~~i-~~~e~~r~~~e~~~~~~~~~~~~~~ 452 (456) T protein:vir:79 424 ---A-DQI-KQDDLDRAREQITLFAGNPVQRPQE 452 (456) T ss_pred ---H-HHH-HHHHHHHHHHHHHHHhhhHhhcCCC Confidence 1 111 1112222111111111111111111 No 109 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=99.56 E-value=9.6e-14 Score=91.94 Aligned_cols=388 Identities=10% Similarity=0.053 Sum_probs=194.5 Q ss_pred HhhhhhHHHHHHHHHHHhhccc-cccccccccc--c--cccccccchHHHHHHHHHHHHHhhcCCccEEEeecCChhHHH Q lcl|NC_015158. 31 NSQRQEWLSQKSELRNYIFATD-TTTTTNSTLP--W--KNKTTLPKLCQIRDNLHSNYISALFPNERWLKWEGKSLQDEA 105 (581) Q Consensus 31 ~~~r~~~~~~~~~~~~y~~~~~-~~~~~~~~~~--~--k~~~~~pki~~~~d~~~~~l~~~~f~~~~~~~~~~~~~~d~~ 105 (581) =+.-++. ...+.+|+...+ ....+. .+| + ..+..++-+.-++|.+...+ .|.|+..+|.+ T Consensus 1 l~~~~~r---~~~~~~yY~g~~~~~~~~~-~~p~~~~~~~~~v~nw~~~~Vds~a~rl-----------~~~Gf~~~d~~ 65 (410) T protein:vir:95 1 MNLYQSR---VNLRYKHYAMQHYEAPTGI-TIPAHIRAKYQAVLGWAAKGVDSLADRL-----------IFRAFANDDFN 65 (410) T ss_pred CCcchhh---HHHHHHHhcCCCCccccch-hccHHHHhHHHhhcchhHHHHHHhHhhh-----------ccccccCCCch Confidence 0111121 222334443332 111111 111 1 12233344444444443322 23344444432 Q ss_pred HHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeeeeEeeeeccceEEecchhhe--eecCCCC Q lcl|NC_015158. 106 KRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESGATRDTYFGPRAVRIDPKDI--VFNPVAV 183 (581) Q Consensus 106 ~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~~~~~~~~~p~ie~V~p~df--~~DP~a~ 183 (581) +.....++++.....+..+++++||.|++.+.= .+ ..+|.|..+||.+. ++||... T Consensus 66 --------l~~i~~~N~ld~~~~~~~~~al~~G~sf~~v~~-~~-------------d~~~~i~~~sP~~~~~i~Dp~~~ 123 (410) T protein:vir:95 66 --------VTEIFDRNNPDIFFDSAILSALIGSCSFVYISK-GE-------------DDEVRLQVIESSNATGVIDPITG 123 (410) T ss_pred --------HHHHHhhcChHHHHHHHHHHHHHhCceeEEEec-CC-------------CCceEEEEEcccceEEEEeCCCC Confidence 445567789999999999999999999998841 11 12477889999996 6787532 Q ss_pred CcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchhhhhccccccccccccccccCCceEEEEE Q lcl|NC_015158. 184 DFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDCEKAVGFSMDGFGNLYDYFQSPYVEVLT 263 (581) Q Consensus 184 ~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vevlE 263 (581) .+.-+ .+...+. ++ +......++.++.+. . T Consensus 124 ~~~~a---l~~~~~~-----------~~----------------------------------~~~~~~~~~~~~~~~--~ 153 (410) T protein:vir:95 124 LLVEG---YAVLARD-----------DY----------------------------------NRPTLEAYFEPNATH--F 153 (410) T ss_pred ceEEE---EEEEEec-----------CC----------------------------------CeEEEEEEEeCCcEE--E Confidence 22111 1111000 00 000000011111100 0 Q ss_pred EeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccccCCcccCCC-cHHhhhhHHHHHHHHHHHHHH Q lcl|NC_015158. 264 FYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIRQDNLYAMG-PLDNLVGMQYRIDHLENLKAD 342 (581) Q Consensus 264 ~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p~~~~G~s-~~~~l~d~Q~~~n~~~R~~iD 342 (581) +- .+++. ...+|| .|..|++.++..+..+..||.| +.+.++++|+.+|..+-.+.. T Consensus 154 ~~------~~~~~---------------~~~~~~--~g~vPvV~f~n~~~l~~~~G~s~I~~~v~~l~da~~r~~~~~~~ 210 (410) T protein:vir:95 154 IP------KDGEP---------------YSVTNE--TGIPLLVPVIHRPDAVRPFGRSRITRAGMYYQKYAKRTLERADI 210 (410) T ss_pred Ee------eCCcc---------------ccccCC--CCCcceEEecccccCCccCCccccchhHHHHHHHHHHHHHHHHH Confidence 00 01110 112455 5789999999999999999999 558999999999999999999 Q ss_pred HHHHhcCCeEEEec---c---ccccccCCceeEEeCCCC-----CcccccCCCccchhHHHHHHHHHHHHHhcCCchHhc Q lcl|NC_015158. 343 VFDLIAFPPMKVKG---D---VEEFVWGPMEQIYINGDG-----DVEMMAPNTQALQADMQIQILEAKMEEFAGAPREAM 411 (581) Q Consensus 343 n~~~s~np~~~v~~---d---~~~i~~~pG~vi~~~~~~-----~i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~ 411 (581) .....++|+..+-+ + .+.....+|++|.+.++. .+..++..+.. +....+..+-..+-..||+|.... T Consensus 211 ~~e~~a~pqr~i~G~d~d~~~~~~~~~~~~~i~~~~~~~~~~~~~v~q~~~~~l~-~~~~~l~~l~~~~a~~s~lP~~~l 289 (410) T protein:vir:95 211 TAEFYSWPQKYILGLDPDAEPMEKWKATVSSLLTISSSDKGVKPSVGQFTTASMS-PFTEQLRTAAAGFAGEMGLTLDDL 289 (410) T ss_pred HHHHhcchhheeeccCCCCCcCchhhhhhhhheeccCCCCCCcceEEecCCCChH-HHHHHHHHHHHHHhhhcCCCHHHh Confidence 99999999865533 1 112334478899887542 23344444332 223344444555666789999999 Q ss_pred CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceeeecCchhcccCCCccCHHHhcCC Q lcl|NC_015158. 412 GIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADTIRVFDSDDKVATFMNVNKDDITAK 491 (581) Q Consensus 412 G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~ 491 (581) |.....+-+|-++..............-+.|.. .++.++++.+.+... .+. .+.++.+.. T Consensus 290 g~~~~NpsSa~Al~a~~~~L~~ka~~k~~~fg~-~l~~~~rla~~i~~~-~~~------------------~~~~~~~~~ 349 (410) T protein:vir:95 290 GFVSDNPSSVEAIKASHENLRLAGRKAQRSLGA-GLLNVAYVAACLRDE-FRY------------------TRSQFVRTA 349 (410) T ss_pred ccccCchhHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhcC-CCC------------------cccccceee Confidence 965543345666666666666666677777775 467788887666321 110 001111111 Q ss_pred ceEEE---ecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHHHHHHHHhcCCCcccccCCCCcHHHHHHHHHHHHH Q lcl|NC_015158. 492 GRLRP---VGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLAKMLEHNLSLGGWDIFKPNVAVMEAQTTSALVNQS 568 (581) Q Consensus 492 ~~vva---~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~~~e~~~l~~~~~~~~~~~~~~~~~~q~~~q~a 568 (581) .+=.. .-+.+.++.+--+..|.+.. ..+.+. ..+.+.+|+.. ++...++ .++ T Consensus 350 v~W~p~~d~~~~s~a~~aDa~~Kl~~a~-----~g~~~~-------~~~~~~lg~~~-----------~~~~~~~-~~e- 404 (410) T protein:vir:95 350 VKWEPLFEADANTMTMIGDGVVKLNQAL-----PGYINA-------ETIRDLTGIAG-----------DMSAKPV-VSE- 404 (410) T ss_pred EEeeecCCcchhhHHHHHHHHHHHHHhc-----cCCccH-------HHHHHhcCCCh-----------HHHHHHH-HHH- Confidence 11110 11223333332223332221 122221 22334444331 2222222 111 Q ss_pred HHHHHHHhccc Q lcl|NC_015158. 569 QAQIEEEAQVP 579 (581) Q Consensus 569 q~~~~~~~~~~ 579 (581) .++++. T Consensus 405 -----~~~~g~ 410 (410) T protein:vir:95 405 -----GGSNGE 410 (410) T ss_pred -----HHhCCC Confidence 122222 No 110 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=99.56 E-value=5.6e-13 Score=87.77 Aligned_cols=461 Identities=11% Similarity=0.072 Sum_probs=216.7 Q ss_pred CccchhhhhhhccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhccccccccccccccc--ccccccchHHHHHH Q lcl|NC_015158. 1 MTGKVLELQQMLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTTTTTNSTLPWK--NKTTLPKLCQIRDN 78 (581) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~~~~~~~~~~k--~~~~~pki~~~~d~ 78 (581) |+++ .++++.+.-.=.+ ...+....+.|..+ |....+....-+..-+|. +.+++|-...+++. T Consensus 18 ~~~~--~~~~i~~~~~i~~-----------~~~~~~~i~~~~~~--y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~~~ 82 (522) T protein:vir:47 18 MQTS--NLNSILEHPKIAV-----------TQEEYDRIKRNLVY--YQSKWDDVQYKNTDGDIKSRPMNHLPIARTASKK 82 (522) T ss_pred hhcc--cchhccccCCCCC-----------CHHHHHHHHHHHHH--hcCCcccccccccCcchhcccceecchHHHHHHH Confidence 2222 2333332100000 11111222333322 333333222222222222 23444433445555 Q ss_pred HHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeeee Q lcl|NC_015158. 79 LHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESGA 158 (581) Q Consensus 79 ~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~~ 158 (581) +...++ +-..=+++ ++ +..+++++..|.+.+|...+.+.+.++.-.|.+++|..|... T Consensus 83 ~A~lv~----~e~~~i~v-----~d----~~~~~~l~~~l~~n~f~~~~~~~~e~a~a~G~~a~k~~~d~~--------- 140 (522) T protein:vir:47 83 IASLVY----NEQATITT-----KN----EILQKFLDDMLTNDRFNKNFERYLESCLALGGLAMRPYIDGD--------- 140 (522) T ss_pred Hhhhhc----CCcceeec-----CC----hHHHHHHHHHHhhcchHHHHHHHHHHhhccCCEEEEEEEcCC--------- Confidence 444433 33222222 12 345557778888899999999999999999999999887422 Q ss_pred EeeeeccceEEecchhheeec-CCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchhhh Q lcl|NC_015158. 159 TRDTYFGPRAVRIDPKDIVFN-PVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDCE 237 (581) Q Consensus 159 ~~~~~~~p~ie~V~p~df~~D-P~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~ 237 (581) +++|..|++..|||= =.......|.++.+...+..+- ...|... +.++ T Consensus 141 ------~~~i~~v~ad~~~P~~~~~~~~~e~a~~~~~~~~~~~~--------~~~yt~l-----------------E~he 189 (522) T protein:vir:47 141 ------KVRVAFIQAPVFFPLESNTQDVSSAAILTKTIKSEGRK--------NVYYTLV-----------------EFHE 189 (522) T ss_pred ------ceEEEEEcCCceEEEEEcCCceEEEEEEEEEEeecccc--------eeEEEEE-----------------EEee Confidence 467889999999873 2233455566666665542110 0001000 0000 Q ss_pred hccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEec----cccc Q lcl|NC_015158. 238 KAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCG----WRIR 313 (581) Q Consensus 238 ~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~----~~~~ 313 (581) ...+... ... ..+.+.+.+|+- +.|- ...++.+-..+-.+.+..-.-|..+..-....+++|.++. -... T Consensus 190 ~~~~~~~-~~~-~~~~~~~~~I~n-~ly~---~~~~~~lG~~v~l~~~~e~~~l~~~~~~~~~~~Plf~y~~~~~~N~~~ 263 (522) T protein:vir:47 190 WVTADGQ-ETG-STNDKKYYRITN-ELYR---SDVNDVLGQRVNLSELDKYKNLEPVTVFENLSRPLFTYLKTPGMNNKD 263 (522) T ss_pred ecccccc-ccc-ccccCCceEEEE-EEee---cCCCcccCccccccccccccCCCCceEeCCCCcceEEEecCCcccccc Confidence 0000000 000 000011111211 1110 0000000000000000000000000000012344444332 1223 Q ss_pred CCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEeccc-----c--------ccccCCc-eeEE-eC----CC Q lcl|NC_015158. 314 QDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGDV-----E--------EFVWGPM-EQIY-IN----GD 374 (581) Q Consensus 314 p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d~-----~--------~i~~~pG-~vi~-~~----~~ 374 (581) .+|.+|.|+.+.+++..+.+|....+.++-+.++-...+ |.++. . ...+.++ .+++ ++ .+ T Consensus 264 ~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~~i~-v~~~~l~~~~~~~~g~~~~~~~fd~~~~~f~~~~~~~~~~ 342 (522) T protein:vir:47 264 INSPLGLSIFDNAKTTIDFINRSYDEFMWEVRMGQRRVI-VPEHLTQRQYQRPDGTIDFRPRFDVEQNVYMQIGGSSMDA 342 (522) T ss_pred cCCCcCCchhhhhHHHHHHHHHHHHHHHHHHHhccceee-cchHHhccCCCCCCcccccccccCcccceEeecCCCCCCC Confidence 478999999999999999999999999999887765443 32110 0 0011111 2222 11 22 Q ss_pred CCcccccCCCccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015158. 375 GDVEMMAPNTQALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAM 454 (581) Q Consensus 375 ~~i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~ 454 (581) +.++.+++.-...+....++.+...++...|++...-|.++.+.+|||++....++.-...+.+.+.|.. .+++|+..| T Consensus 343 ~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~gls~~tf~~~~~~~kTAtEi~s~~~~~~~t~~~~~~~~~~-al~~lv~~i 421 (522) T protein:vir:47 343 GGITDLTSPIRANDYILAISEGLKLFEMQIGVSSGMFTFDGQGMKTATEIVSENSDTYQMRSSIVALVEQ-SIKELCVSM 421 (522) T ss_pred CcceeeccccChHHHHHHHHHHHHHHHHHhCCCccccCccccccccHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHH Confidence 3466666554455666778888899999999999999988888999999988888888888888888885 468899988 Q ss_pred HHHHHhhcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHH Q lcl|NC_015158. 455 LEISRRNLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLA 534 (581) Q Consensus 455 ~~f~~~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~ 534 (581) +++..-. + +.|.. .+...++..+|. .+ -...++..++.+.++-. +++.+. ... T Consensus 422 ~~l~~~~-~------~~~~~-------~~~~~~i~v~f~---D~--i~~D~~~~~~~~~~~v~----aG~~s~---e~~- 474 (522) T protein:vir:47 422 CELGKAV-G------VYSGE-------IPELDDISVNLD---DG--VFTDRHAELDYWAKMVA----AGFSTK---KRA- 474 (522) T ss_pred HHHHhhh-h------hccCC-------CCCcceeEEEcC---CC--CCCCHHHHHHHHHHHHh----cCCCCH---HHH- Confidence 8775321 1 11110 011122333332 11 01122333334333321 123332 111 Q ss_pred HHHHHHhcCCCcccccCCCCcHHHHHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 535 KMLEHNLSLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 535 ~~~~e~~~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) +.+ +.++ ++++. ++++++++++.. .+.|.- T Consensus 475 --i~~---~~g~---------~eeea-~~el~ri~~E~~--~~~~~~ 504 (522) T protein:vir:47 475 --IGK---TLNI---------SGVEA-EKELNAINSELL--PMNDAE 504 (522) T ss_pred --HHh---cCCC---------ChHHH-HHHHHHHHHhhc--cCCCCC Confidence 222 2222 23332 333444544422 233322 No 111 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=99.56 E-value=5.8e-13 Score=87.69 Aligned_cols=427 Identities=12% Similarity=0.075 Sum_probs=224.5 Q ss_pred CccchhhhhhhccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhcccc---ccccccccc--ccccccccchHHH Q lcl|NC_015158. 1 MTGKVLELQQMLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDT---TTTTNSTLP--WKNKTTLPKLCQI 75 (581) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~---~~~~~~~~~--~k~~~~~pki~~~ 75 (581) .-..-.+++.|.. ..|.++-+.+...|.+ .|.++++|+...+. ++....... -.+++.+|....+ T Consensus 12 ~~~~~~~~~~l~~-------~~i~~li~~~~~~~~~---r~~~l~~YY~g~~~~i~~~~~~~~~~~~~~~ki~~n~~~~I 81 (506) T protein:vir:94 12 NLIYQESLENLTP-------NKIMKFITHHFNYQRP---RLEMLDDYYQGYNLKILDKQSRRHEDGKADHRATHSFAKYI 81 (506) T ss_pred eeecccchhcCCH-------HHHHHHHHHHHHHHHH---HHHHHHHHhcCCCccccccccccccccCCcceeecchHHHH Confidence 1111123333332 2333333333344433 46667777655442 111111111 1356777888888 Q ss_pred HHHHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeee Q lcl|NC_015158. 76 RDNLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEE 155 (581) Q Consensus 76 ~d~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~ 155 (581) ++...++|+ +++- ++.+.. + ..++.|++.+..+++...+.++.+++.++|.|+..+++.+. T Consensus 82 v~~~~~~l~----G~p~--~~~~~d---~----~~~~~l~~~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~ded------ 142 (506) T protein:vir:94 82 ADFQTSYSV----GNPI--NVKLPD---D----GSNSGFDTFNKANDVDAENYDLFLDMSRYGRAYEYVYRGED------ 142 (506) T ss_pred HHHhhhhhc----ccCc--eeecCc---c----hHHHHHHHHHhccCHhHHHHHHHHHHHhcCeEEEEEEecCC------ Confidence 888888775 3332 333321 1 12456778888899999999999999999999998876321 Q ss_pred eeeEeeeeccceEEecchhhee--ecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccc Q lcl|NC_015158. 156 SGATRDTYFGPRAVRIDPKDIV--FNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTR 233 (581) Q Consensus 156 ~~~~~~~~~~p~ie~V~p~df~--~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~ 233 (581) ..+++..++|.++| +|+.... ..-+++|.+.+... . + + T Consensus 143 --------~~~~i~~~~p~~~~~v~dd~~~~--~~~~~v~~~~~~~~--------~-------------------~--~- 182 (506) T protein:vir:94 143 --------NEEHLAKLDPLDTFVIYSTDVDP--KPIMAVRYHQIELV--------D-------------------D--N- 182 (506) T ss_pred --------CeeEEEEEcccceEEEecCCCCC--ceEEEEEEEeeeec--------c-------------------C--C- Confidence 14678889999974 3543221 11223333332100 0 0 0 Q ss_pred hhhhhccccccccccccccccCCceEEEEEEeee----eecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEec Q lcl|NC_015158. 234 EDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGD----YHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCG 309 (581) Q Consensus 234 ~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~----~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~ 309 (581) +.. ......+.|.. ++...++ +..++....+| .|..|++.++ T Consensus 183 ------------~~~--------~~~~~~~~yt~~~~~~~~~~~~------------~~~~~~~~~~~--~g~vPvv~~~ 228 (506) T protein:vir:94 183 ------------QVS--------TINYVPETWTADTYTLYNPTPI------------MGKMQVDTTKP--ITTFPVVEFK 228 (506) T ss_pred ------------cee--------EEEEEEEEEeCceEEEeccccC------------ccceecccccc--CCccceEEec Confidence 000 00011111210 0000000 11222223343 5777877554 Q ss_pred ccccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEeccccc-c--------------------------- Q lcl|NC_015158. 310 WRIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGDVEE-F--------------------------- 361 (581) Q Consensus 310 ~~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d~~~-i--------------------------- 361 (581) - .-.|.|..+.++++++.+|.+...+.+++.-.++|.+.+.+.... . T Consensus 229 n-----~~~~~sd~e~~~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 303 (506) T protein:vir:94 229 N-----SNFRLGDFENVLPLIDLYDAAQSDTANYMTDLNEAMLIIQGDIDTLFEGSDMMNTIDPNDEDAMAKLAKDKLEL 303 (506) T ss_pred C-----CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhHHHHHhcCccccccchhccccccccccccccccccchhHH Confidence 3 335899999999999999999999999999999987665432100 0 Q ss_pred --ccCCceeEEeCCC---------CCcccccCCCccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHH Q lcl|NC_015158. 362 --VWGPMEQIYINGD---------GDVEMMAPNTQALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNA 430 (581) Q Consensus 362 --~~~pG~vi~~~~~---------~~i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~a 430 (581) ..+-++.+.+.++ ++++++..+....++.+.+..+...+-..|++|.++.+. ..++.||.++..++.+ T Consensus 304 ~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~-~~~n~Sg~Aik~~~~~ 382 (506) T protein:vir:94 304 IKEMKDANMLLLKSGMTVNGTQTSVDAKYINKTYDVVGSEAYKKRVAGDIHKFSHTPDLTDEN-FASNSSGVAMQYKVLG 382 (506) T ss_pred HhhhhhcCeeeecccccccCccccccceeeeecCCHHHHHHHHHHHHHHHHHHhCcccccccc-ccccchHHHHHHHHHH Confidence 0001233333332 245556555556667777899999999999999876443 2356677788888888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHH Q lcl|NC_015158. 431 AGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQ 510 (581) Q Consensus 431 a~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q 510 (581) +........+.|.+ +++++++++++++...... ....-.+++..| .........+.++.++ T Consensus 383 l~~k~~~k~~~~~~-~l~~~~~li~~~~~~~~~~----------------~~~d~~~i~i~f--~~~~p~d~~e~a~~~~ 443 (506) T protein:vir:94 383 TVELASTKRRMFER-GLYARYQIISDIENSIHGD----------------WTFDPQELTFTF--RDNLPADNISQIKALV 443 (506) T ss_pred HHHHHHHHHHHHHH-HHHHHHHHHHHHHHhcCCc----------------cccccccceEEe--CCCCCcCHHHHHHHHH Confidence 88888888888985 6788999988876431100 011112233333 1222233444444333 Q ss_pred HHHHhhcccccccccchhHHHHHHHHHHHHhcCCCcccccCCCCcHHHHHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 511 SLMGIANTPVWQDIKPHVSTENLAKMLEHNLSLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 511 ~L~~~~~~~~~~~i~p~~~~~~l~~~~~e~~~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) .| . .+ +|.+.++.+ ++.. + ++ ++|.+++..+.+...+...+.... T Consensus 444 kl---~------g~---iS~et~~~~------lp~v----~--d~--~~E~~ri~~E~~~~~~~~~~~~~~ 488 (506) T protein:vir:94 444 QA---G------AT---LPQKYLYQQ------LPGV----T--NP--QDIVDMMKEQSANGDYSFDQNGVI 488 (506) T ss_pred HH---h------cc---CChHHHHHh------CCCC----C--CH--HHHHHHHHHHHHHHhhcchhhcCC Confidence 33 1 12 333333222 2321 2 12 233444433322222211111111 No 112 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=99.55 E-value=4.3e-13 Score=88.36 Aligned_cols=388 Identities=9% Similarity=0.043 Sum_probs=201.6 Q ss_pred hccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhccc-cccccccccc-c--cccccccchHHHHHHHHHHHHHh Q lcl|NC_015158. 11 MLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATD-TTTTTNSTLP-W--KNKTTLPKLCQIRDNLHSNYISA 86 (581) Q Consensus 11 ~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~-~~~~~~~~~~-~--k~~~~~pki~~~~d~~~~~l~~~ 86 (581) |- .++-.+|.+.+. + +++ .+.++..|+.-.+ .+..+.+--+ + ..+..+|-+.-+++.+..++ T Consensus 1 ~~----~~~i~~L~~~~~---~-~~~---r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~iVds~a~rl--- 66 (409) T protein:vir:16 1 MT----EKGIGYLRFKLS---V-HKR---RAEMRYEQYAMKHVDRFKGITIPQALSQQYRSILGWCAKGVDSLADRL--- 66 (409) T ss_pred CC----HHHHHHHHHHHH---H-HhH---HHHHHHHHHhccCchhhcchhhhHHHHHHHhhhcChhHHHHHHhHhhc--- Confidence 33 233344444443 2 222 2333444544322 2122221111 1 12233344444444443322 Q ss_pred hcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeeeeEeeeeccc Q lcl|NC_015158. 87 LFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESGATRDTYFGP 166 (581) Q Consensus 87 ~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~~~~~~~~~p 166 (581) .|.|...+|.+ +...+.++++.....+..+++++||.|++.+.=. + ..+| T Consensus 67 --------~~~Gf~~~d~~--------l~~i~~~N~ld~~~~~~~~~al~yG~sf~~v~~~-~-------------dg~~ 116 (409) T protein:vir:16 67 --------VFREFENDDFT--------VNEIFEENNPDIFFDSTVLSALIASCSFTYISKG-E-------------NDAV 116 (409) T ss_pred --------ccccccCcchH--------HHHHHHhcChhHHHHHHHHHHHHhCceeEEEecC-C-------------CCce Confidence 23344444422 4455678899999999999999999999987421 1 1246 Q ss_pred eEEecchhhe--eecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchhhhhcccccc Q lcl|NC_015158. 167 RAVRIDPKDI--VFNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDCEKAVGFSM 244 (581) Q Consensus 167 ~ie~V~p~df--~~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~ 244 (581) .|..+||.+. ++||....+.-+ +. +.... + T Consensus 117 ~i~~~sP~~~~~i~D~~~~~~~~a-~~-~~~~d------------~---------------------------------- 148 (409) T protein:vir:16 117 RLQVIEATNATGIIDPITGLLTEG-YA-VLERD------------E---------------------------------- 148 (409) T ss_pred EEEEEcccceEEEeecccccceee-eE-EEEec------------C---------------------------------- Confidence 7888999985 678764443221 10 00000 0 Q ss_pred ccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccccCCcccCCCcH- Q lcl|NC_015158. 245 DGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIRQDNLYAMGPL- 323 (581) Q Consensus 245 ~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p~~~~G~s~~- 323 (581) .+......++.++ +++.++. .++. -....|| .|..|++.+...++.+.+||.|-. T Consensus 149 ~~~~~~~~~~~~~--~~~~~~~-----~~~~---------------~~~~~~~--~g~vPvV~f~n~~~~~~~~G~seI~ 204 (409) T protein:vir:16 149 NNNVVLEAHFLPD--RTDYYYR-----DSRN---------------NISIANP--TGNPLLVPIIHRPDAVRPFGRSRIT 204 (409) T ss_pred CCceEEEEEEecC--cEEEEEe-----cCcc---------------ccceecC--CCCcceEEecccccccccCCccccc Confidence 0000000111111 1111111 1111 1122465 588999999999999999999954 Q ss_pred HhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEec---cccc---cccCCceeEEeCCCC-----CcccccCCCccchhHHH Q lcl|NC_015158. 324 DNLVGMQYRIDHLENLKADVFDLIAFPPMKVKG---DVEE---FVWGPMEQIYINGDG-----DVEMMAPNTQALQADMQ 392 (581) Q Consensus 324 ~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~---d~~~---i~~~pG~vi~~~~~~-----~i~~~~~p~~~~~~~~~ 392 (581) +.++++|+.+|..+-.++-.....++|+..+.+ +.+. ....+|++|.+..+. .++.++..+.. +.... T Consensus 205 ~~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~~~~~~~~~~~~i~~~~~d~~g~~~~v~q~~~~~l~-~~~~~ 283 (409) T protein:vir:16 205 RSGMYWQSNAKRTLERADVTAEFYSFPQKYVTGLSDDAEPMETWKATVSSMLQFTKDEDGDKPTLGQFTQPSMS-PFTEQ 283 (409) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHhcChhheeEecCCCCCccchhhhhhhHhhccCCCCCCCCceEEecCCCChh-HHHHH Confidence 889999999999999999999999999876543 2222 334478898875331 24445444332 22334 Q ss_pred HHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceeeecC Q lcl|NC_015158. 393 IQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADTIRVFD 472 (581) Q Consensus 393 lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~iR~~~ 472 (581) +..+-..+-..||+|....|......-+|.++..............-+.|.. .+++++++.+.+... .+. T Consensus 284 l~~~~~~~a~~s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~~~fg~-~l~~~~rla~~~~~~-~~~-------- 353 (409) T protein:vir:16 284 LRTAAAGFAGETGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGA-GLLNVAYLAACLRDD-VPY-------- 353 (409) T ss_pred HHHHHHHHhhhcCCCHHHcccccCchhHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhcC-CCc-------- Confidence 4445555666789999999975543345666666666666666666777775 467788887766322 110 Q ss_pred chhcccCCCccCHHHhcCCceEE---EecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHHHHHHHHhcCCCcc Q lcl|NC_015158. 473 SDDKVATFMNVNKDDITAKGRLR---PVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLAKMLEHNLSLGGWD 547 (581) Q Consensus 473 ~~~~~~~~~~v~r~di~~~~~vv---a~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~~~e~~~l~~~~ 547 (581) .+.+..+..++-. -..+..+++.+ .-+..+.+. +..++ . ...+.+.+|+..-| T Consensus 354 ----------~~~~~~~~~v~W~~~~~~~~~s~a~~a---Da~~Kl~~a--~~~~~----~---~~v~~~~~g~~~~d 409 (409) T protein:vir:16 354 ----------LREQFSKTKPKWEPLFEADASMLSLIG---DGAIKLNQA--IPEFI----N---KDTIRDLTGIKGAE 409 (409) T ss_pred ----------cchhhccceEEecCCCCcchhhHHHHH---HHHHHHHhh--ccccc----c---hhHHHHhccCCCCC Confidence 0001111111111 00122333333 333333321 11111 1 13344666776644 No 113 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=99.53 E-value=2.4e-12 Score=84.30 Aligned_cols=429 Identities=11% Similarity=0.068 Sum_probs=208.2 Q ss_pred hccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhccccc-cccccc-ccc---cccccccchHHHHHHHHHHHHH Q lcl|NC_015158. 11 MLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTT-TTTNST-LPW---KNKTTLPKLCQIRDNLHSNYIS 85 (581) Q Consensus 11 ~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~-~~~~~~-~~~---k~~~~~pki~~~~d~~~~~l~~ 85 (581) |.-.+-+.++++|...++ .. ...+.+++.|+...+.- ..+.+. ... -+++.++-+.-+++.+..+++. T Consensus 1 ~~~~t~~~~~~~l~~~~~---~~----~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~ 73 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRID---DG----MSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIP 73 (456) T ss_pred CCCCCHHHHHHHHHHHHH---HH----HHHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhcc Confidence 555556677777766654 22 23344556666554321 111111 111 1345567777777777777652 Q ss_pred hhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeeeeEeeeecc Q lcl|NC_015158. 86 ALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESGATRDTYFG 165 (581) Q Consensus 86 ~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~~~~~~~~~ 165 (581) + . |.+.... |.+..+. +...+.++++.....++++++.+||.|+..+.-. + ... T Consensus 74 ----~-~-~~~~~~~--d~~~~~~----~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d-~-------------~g~ 127 (456) T protein:vir:10 74 ----N-G-ITVGGSA--DSDLALR----ARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRR-D-------------DGT 127 (456) T ss_pred ----C-C-eecCCCC--CcchHHH----HHHHHHhcChhhHHHHHHHHHhhcCeeEEEEeeC-C-------------CCc Confidence 1 1 1221111 1111122 3334556788899999999999999998766421 1 124 Q ss_pred ceEEecchhhe--eecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchhhhhccccc Q lcl|NC_015158. 166 PRAVRIDPKDI--VFNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDCEKAVGFS 243 (581) Q Consensus 166 p~ie~V~p~df--~~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~ 243 (581) |++..++|.+. ++||.... .....+|.+.+.++ ...+ T Consensus 128 ~~i~~~~p~~~~~i~d~~~~~--~~~~~i~~~~~~d~---------~~~~------------------------------ 166 (456) T protein:vir:10 128 ATITADSPETMVVSVDPLQPW--RIRAAMRWWRDLDA---------ESDF------------------------------ 166 (456) T ss_pred eEEEEEccceeEEEEcCCCCc--ceEEEEEEEEecCC---------ceeE------------------------------ Confidence 67889999996 45554332 22222333322100 0000 Q ss_pred cccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccccCCcccCCCcH Q lcl|NC_015158. 244 MDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIRQDNLYAMGPL 323 (581) Q Consensus 244 ~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p~~~~G~s~~ 323 (581) ...|+....+..+..+ ..+...+ .......++.......-|...|.+|+..+ .+.+|.|.. T Consensus 167 ------~~~~~~~~~~~~~~~~-~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~pvv~~------~N~~g~gd~ 227 (456) T protein:vir:10 167 ------AIVWSGDGWQKFARPC-FVQSSSR------RRLVTRISDSWVPVGDAVVTGSPPPVVVY------QNPDGMGEV 227 (456) T ss_pred ------EEEEeccceeEEEEEE-EEeeccc------ceeeeecCCceeeccccCCCCCceeEEEe------cCCCCCchh Confidence 0000000011111100 0000000 01111112222221222333466665432 245799999 Q ss_pred HhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc-cc---------------ccccCCceeEEeCCCCCcccccCCCccc Q lcl|NC_015158. 324 DNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD-VE---------------EFVWGPMEQIYINGDGDVEMMAPNTQAL 387 (581) Q Consensus 324 ~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d-~~---------------~i~~~pG~vi~~~~~~~i~~~~~p~~~~ 387 (581) +.++++++.+|.....++..+...+.|+..+.+. .. .+...+|.+|..+.+..+..++..+ .. T Consensus 228 e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~-~~ 306 (456) T protein:vir:10 228 EPHIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGLPNVDENGNAIDYASIFEAAPGALWELPPGVDIWESQAND-FT 306 (456) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCcccccccccccccchhhhhhhhccccccCCCCcceEEecccC-hh Confidence 9999999999999999999999999987655331 00 1223477888777777766655432 12 Q ss_pred hhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccce Q lcl|NC_015158. 388 QADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADT 467 (581) Q Consensus 388 ~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~ 467 (581) .....+..+-..+-..||+|....|-.. +|.+|.++...............+.|.. .+++++++++.+. + + + T Consensus 307 ~~~~~l~~~i~~~~~~s~~p~~~~~~~~-~N~Sg~Ai~~~~~~l~~k~~~~~~~f~~-~l~~~~rl~~~~~--g-~-~-- 378 (456) T protein:vir:10 307 PMLSAIKEHIRQLSSATKTPLPMLMPDS-ANQSAEGAHNIEKGFLFKCEDRLSIAKI-GLEAILVKALQIE--G-E-S-- 378 (456) T ss_pred HHHHHHHHHHHHHHhccCCChHHhcccc-cChHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhc--C-C-C-- Confidence 2333344444555567789988877422 3557777777777777777888888885 5678888876541 1 1 0 Q ss_pred eeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHHHHHHHHhcCCCcc Q lcl|NC_015158. 468 IRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLAKMLEHNLSLGGWD 547 (581) Q Consensus 468 iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~~~e~~~l~~~~ 547 (581) + ...++..++ -.-....++.++-+..| .+ +++.+.. .+.+ +.|+. T Consensus 379 ------~----------~~~~~v~w~--~~~~~~~~~~ada~~kl---~~----~gi~~~~-------~~~~---~lg~~ 423 (456) T protein:vir:10 379 ------V----------EDTVDVSFE--SPDRVTLGEKYSAASLA---KA----AGESWAS-------IRRN---ILNYN 423 (456) T ss_pred ------c----------ccceeEEec--CCCCcCHHHHHHHHHHH---HH----cCCChHH-------HHHh---hCCCC Confidence 0 011222221 11122334433323332 21 1232221 1112 33431 Q ss_pred cccCCCCcHHHHHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 548 IFKPNVAVMEAQTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 548 ~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) + .++ ++.+++++-.+..+....-.+.|=- T Consensus 424 ---~-~~i-~~~e~er~~~e~~~~~~~~~~~~~~ 452 (456) T protein:vir:10 424 ---A-DQI-KQDDLDRAREQITLFAGNPVQRPQE 452 (456) T ss_pred ---H-HHH-HHHHHHHHHHHHHHHhhhhhhcCCC Confidence 1 111 2222222222222222222333322 No 114 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=99.53 E-value=2.4e-12 Score=84.30 Aligned_cols=429 Identities=11% Similarity=0.068 Sum_probs=208.2 Q ss_pred hccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhccccc-cccccc-ccc---cccccccchHHHHHHHHHHHHH Q lcl|NC_015158. 11 MLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTT-TTTNST-LPW---KNKTTLPKLCQIRDNLHSNYIS 85 (581) Q Consensus 11 ~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~-~~~~~~-~~~---k~~~~~pki~~~~d~~~~~l~~ 85 (581) |.-.+-+.++++|...++ .. ...+.+++.|+...+.- ..+.+. ... -+++.++-+.-+++.+..+++. T Consensus 1 ~~~~t~~~~~~~l~~~~~---~~----~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~ 73 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRID---DG----MSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIP 73 (456) T ss_pred CCCCCHHHHHHHHHHHHH---HH----HHHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhcc Confidence 555556677777766654 22 23344556666554321 111111 111 1345567777777777777652 Q ss_pred hhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeeeeEeeeecc Q lcl|NC_015158. 86 ALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESGATRDTYFG 165 (581) Q Consensus 86 ~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~~~~~~~~~ 165 (581) + . |.+.... |.+..+. +...+.++++.....++++++.+||.|+..+.-. + ... T Consensus 74 ----~-~-~~~~~~~--d~~~~~~----~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d-~-------------~g~ 127 (456) T protein:vir:10 74 ----N-G-ITVGGSA--DSDLALR----ARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRR-D-------------DGT 127 (456) T ss_pred ----C-C-eecCCCC--CcchHHH----HHHHHHhcChhhHHHHHHHHHhhcCeeEEEEeeC-C-------------CCc Confidence 1 1 1221111 1111122 3334556788899999999999999998766421 1 124 Q ss_pred ceEEecchhhe--eecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchhhhhccccc Q lcl|NC_015158. 166 PRAVRIDPKDI--VFNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDCEKAVGFS 243 (581) Q Consensus 166 p~ie~V~p~df--~~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~ 243 (581) |++..++|.+. ++||.... .....+|.+.+.++ ...+ T Consensus 128 ~~i~~~~p~~~~~i~d~~~~~--~~~~~i~~~~~~d~---------~~~~------------------------------ 166 (456) T protein:vir:10 128 ATITADSPETMVVSVDPLQPW--RIRAAMRWWRDLDA---------ESDF------------------------------ 166 (456) T ss_pred eEEEEEccceeEEEEcCCCCc--ceEEEEEEEEecCC---------ceeE------------------------------ Confidence 67889999996 45554332 22222333322100 0000 Q ss_pred cccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccccCCcccCCCcH Q lcl|NC_015158. 244 MDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIRQDNLYAMGPL 323 (581) Q Consensus 244 ~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p~~~~G~s~~ 323 (581) ...|+....+..+..+ ..+...+ .......++.......-|...|.+|+..+ .+.+|.|.. T Consensus 167 ------~~~~~~~~~~~~~~~~-~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~pvv~~------~N~~g~gd~ 227 (456) T protein:vir:10 167 ------AIVWSGDGWQKFARPC-FVQSSSR------RRLVTRISDSWVPVGDAVVTGSPPPVVVY------QNPDGMGEV 227 (456) T ss_pred ------EEEEeccceeEEEEEE-EEeeccc------ceeeeecCCceeeccccCCCCCceeEEEe------cCCCCCchh Confidence 0000000011111100 0000000 01111112222221222333466665432 245799999 Q ss_pred HhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc-cc---------------ccccCCceeEEeCCCCCcccccCCCccc Q lcl|NC_015158. 324 DNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD-VE---------------EFVWGPMEQIYINGDGDVEMMAPNTQAL 387 (581) Q Consensus 324 ~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d-~~---------------~i~~~pG~vi~~~~~~~i~~~~~p~~~~ 387 (581) +.++++++.+|.....++..+...+.|+..+.+. .. .+...+|.+|..+.+..+..++..+ .. T Consensus 228 e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~-~~ 306 (456) T protein:vir:10 228 EPHIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGLPNVDENGNAIDYASIFEAAPGALWELPPGVDIWESQAND-FT 306 (456) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCcccccccccccccchhhhhhhhccccccCCCCcceEEecccC-hh Confidence 9999999999999999999999999987655331 00 1223477888777777766655432 12 Q ss_pred hhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccce Q lcl|NC_015158. 388 QADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADT 467 (581) Q Consensus 388 ~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~ 467 (581) .....+..+-..+-..||+|....|-.. +|.+|.++...............+.|.. .+++++++++.+. + + + T Consensus 307 ~~~~~l~~~i~~~~~~s~~p~~~~~~~~-~N~Sg~Ai~~~~~~l~~k~~~~~~~f~~-~l~~~~rl~~~~~--g-~-~-- 378 (456) T protein:vir:10 307 PMLSAIKEHIRQLSSATKTPLPMLMPDS-ANQSAEGAHNIEKGFLFKCEDRLSIAKI-GLEAILVKALQIE--G-E-S-- 378 (456) T ss_pred HHHHHHHHHHHHHHhccCCChHHhcccc-cChHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhc--C-C-C-- Confidence 2333344444555567789988877422 3557777777777777777888888885 5678888876541 1 1 0 Q ss_pred eeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHHHHHHHHhcCCCcc Q lcl|NC_015158. 468 IRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLAKMLEHNLSLGGWD 547 (581) Q Consensus 468 iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~~~e~~~l~~~~ 547 (581) + ...++..++ -.-....++.++-+..| .+ +++.+.. .+.+ +.|+. T Consensus 379 ------~----------~~~~~v~w~--~~~~~~~~~~ada~~kl---~~----~gi~~~~-------~~~~---~lg~~ 423 (456) T protein:vir:10 379 ------V----------EDTVDVSFE--SPDRVTLGEKYSAASLA---KA----AGESWAS-------IRRN---ILNYN 423 (456) T ss_pred ------c----------ccceeEEec--CCCCcCHHHHHHHHHHH---HH----cCCChHH-------HHHh---hCCCC Confidence 0 011222221 11122334433323332 21 1232221 1112 33431 Q ss_pred cccCCCCcHHHHHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 548 IFKPNVAVMEAQTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 548 ~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) + .++ ++.+++++-.+..+....-.+.|=- T Consensus 424 ---~-~~i-~~~e~er~~~e~~~~~~~~~~~~~~ 452 (456) T protein:vir:10 424 ---A-DQI-KQDDLDRAREQITLFAGNPVQRPQE 452 (456) T ss_pred ---H-HHH-HHHHHHHHHHHHHHHhhhhhhcCCC Confidence 1 111 2222222222222222222333322 No 115 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=99.53 E-value=2.4e-12 Score=84.26 Aligned_cols=456 Identities=14% Similarity=0.094 Sum_probs=215.3 Q ss_pred Cccc---hhhhhhhccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhcccc--cccccccccccccccccchHHH Q lcl|NC_015158. 1 MTGK---VLELQQMLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDT--TTTTNSTLPWKNKTTLPKLCQI 75 (581) Q Consensus 1 ~~~~---~~~~~~~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~--~~~~~~~~~~k~~~~~pki~~~ 75 (581) |..+ ...|+++.|.-+=.+. .......+.|.++ |-...+. ....+.+..-++.+.+|--..+ T Consensus 14 ~~~~~~~~~~~~~i~d~~~i~~~-----------~~~~~~i~~~~~~--Y~g~~~~l~~~~~~~~~~~~~~~slnl~~~i 80 (505) T protein:vir:79 14 GSAAVGMTKSLGQIIDDPRINLP-----------ADEVERIARDKRY--YMDDFKQVTHKNSYGDTQKHELQSVNVTKLA 80 (505) T ss_pred hhhhhcchhhhhhhhcccCCCCC-----------HHHHHHHHHHHHH--hcCCCccccccccCCCccccceeecchHHHH Confidence 2111 2234443331111111 1111222334332 2222110 1111112222333444433444 Q ss_pred HHHHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeee Q lcl|NC_015158. 76 RDNLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEE 155 (581) Q Consensus 76 ~d~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~ 155 (581) ++.+...+ |+-..=+++. + +..+++|+..+.+.+|...+.+.+.++..+|.+++|.+|.. T Consensus 81 ~~~~A~ll----~~e~~~i~~~-----d----~~~~e~l~~i~~~n~f~~~~~~~~e~a~a~G~~~~k~~~D~------- 140 (505) T protein:vir:79 81 SAKLASLI----FNEQCQVTVS-----D----ETANDFLDDVFQQNDFYTTFEEKLEEWIALGSGCVRPYVDS------- 140 (505) T ss_pred HHHHHhhh----cCCCceeecC-----C----hHHHHHHHHHHHhccHHHHHHHHHHHHhhcCCeEEEEEEeC------- Confidence 44444433 3433333332 2 34466788888899999999999999999999999988742 Q ss_pred eeeEeeeeccceEEecchhheeec-CCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccch Q lcl|NC_015158. 156 SGATRDTYFGPRAVRIDPKDIVFN-PVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTRE 234 (581) Q Consensus 156 ~~~~~~~~~~p~ie~V~p~df~~D-P~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~ 234 (581) .+++|+.|+|..|||= =....+.+|.|+.+...+.+. ...|+. ....+.+. T Consensus 141 --------~~~~i~~v~ad~~~P~~~d~~~~~~~a~~~~~~~~~~~---------~~~~yt-----------~lE~h~~~ 192 (505) T protein:vir:79 141 --------GKIKLAWATADQVYPLQADTNQVNELAIASRTTEVENH---------RTIYYT-----------LLEFHQWD 192 (505) T ss_pred --------CceEEEEEcCCeeEEEEEcCCCeEEEEEEEEEEEecCC---------cceEEE-----------EEEEEEec Confidence 1357899999998862 123456677777665544210 000100 00000000 Q ss_pred hhhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeE-EEEEeCCEEEEeecCCCccCCCCeeEecc--- Q lcl|NC_015158. 235 DCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMK-VTIIDRMFVIEEKENPSWFAQAPIFHCGW--- 310 (581) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~-itv~~g~~iir~~~nP~~~g~~Pf~~~~~--- 310 (581) + +..+|+ ++.|. ....+.+- ..+ .+.+..-.-|.-+.......+++|.++.. T Consensus 193 ~-------------------~~~~I~-n~ly~---~~~~~~lG-~~v~l~~~~~~~~l~~~~~~~g~~~p~f~~~~~~~~ 248 (505) T protein:vir:79 193 H-------------------GDYVIT-NELYR---SEAAETVG-INVPLNSLEQYEGLEPQVKITGLKHPLFAFYRNKGA 248 (505) T ss_pred C-------------------ceEEEE-EEEEe---cCCCCccC-cccchhhcccccccCcceeecCCCcceEEEecCCcc Confidence 0 000111 01110 00000000 000 00000000000000001234556665542 Q ss_pred -cccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc-----c----c--cc---ccCCc-eeEEe-CC Q lcl|NC_015158. 311 -RIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD-----V----E--EF---VWGPM-EQIYI-NG 373 (581) Q Consensus 311 -~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d-----~----~--~i---~~~pG-~vi~~-~~ 373 (581) ...++|.+|.|+.+.++++++.+|....+..+.+.++-...+. ..+ . + .. ...+. .+++. .. T Consensus 249 N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~~i~v-~~~~l~~~~~~~~~~~~~~~~~fd~~~~~y~~~~~ 327 (505) T protein:vir:79 249 NNKNFTSPMGMSLIDNSYTVIDAINRTHDQFVDEVKKGQRRLIV-PAEWLKTGSSYGGQASETHPPMFDPDETVYQAMYG 327 (505) T ss_pred cccccCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhcccceee-chHHhcccCCCCcccccccccCCCccceeeeeccC Confidence 2234778999999999999999999999999999876554443 111 0 0 00 01111 12211 11 Q ss_pred ---CCCcccccCCCccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015158. 374 ---DGDVEMMAPNTQALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKV 450 (581) Q Consensus 374 ---~~~i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~l 450 (581) ++.++.+++.-...+....++.+...++..+|++.-..|.++.+.+|||++....++.-.....+.+.|.. .+++| T Consensus 328 ~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~TAtei~s~~~~l~~t~~~~~~~~~~-al~~l 406 (505) T protein:vir:79 328 DASEVGFHDATSPIRVADYQATMDFFLREFENQTGLSQGTFTTSPSGIQTATEVVTNNSQTYQTRSSYITQVEK-TIKAL 406 (505) T ss_pred CCCCCceEEecccCCHHHHHHHHHHHHHHHHHHhCCChhhcCCCccccchHHHHHHHHhHHHHHHHHHHHHHHH-HHHHH Confidence 12355555543344455667888888899999999999988888899999988777777777888888875 56889 Q ss_pred HHHHHHHHHhhcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHH Q lcl|NC_015158. 451 LNAMLEISRRNLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVST 530 (581) Q Consensus 451 i~~~~~f~~~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~ 530 (581) ++.++.+..-+.-.+. -.... -..+...++..+|. .+. ...+++.++...++-+. ++.+.. T Consensus 407 i~~i~~~~~~~~~~~~-g~~~~-------~~~~~~~~i~v~f~---d~i--~~d~~~~~~~~~~~v~~----Gi~s~e-- 467 (505) T protein:vir:79 407 TYAILELASVPSFYAD-GQARW-------TGDVDSLDITINFN---DGV--FVDQESKRAADLQAVQA----QVMPKK-- 467 (505) T ss_pred HHHHHHHHHHhccccc-ccccc-------cCCCCceeEEEEeC---CCC--CCCHHHHHHHHHHHHHc----CCCCHH-- Confidence 9999887654421110 00000 00111112333331 000 01122233333333221 233321 Q ss_pred HHHHHHHHHHhcCCCcccccCCCCcHHHHHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 531 ENLAKMLEHNLSLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 531 ~~l~~~~~e~~~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) .+ +.+ .+++ ++++. ++++++++++.. .+.|=. T Consensus 468 -~~---l~~---~~~~---------~eeea-~~el~ri~~E~~--~~~p~~ 499 (505) T protein:vir:79 468 -QF---LMR---NYGL---------DEEEA-DEWLAQIDAENS--TAEPEF 499 (505) T ss_pred -HH---HHh---cCCC---------ChHHH-HHHHHHHHHhcc--ccCCCc Confidence 11 112 2232 23332 333444554432 233444 No 116 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=99.52 E-value=1.1e-12 Score=86.21 Aligned_cols=403 Identities=11% Similarity=0.079 Sum_probs=201.8 Q ss_pred hccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhcccc-cccccccccccccccccchHHHHHHHHHHHHHhhcC Q lcl|NC_015158. 11 MLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDT-TTTTNSTLPWKNKTTLPKLCQIRDNLHSNYISALFP 89 (581) Q Consensus 11 ~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~-~~~~~~~~~~k~~~~~pki~~~~d~~~~~l~~~~f~ 89 (581) |-. .+-.+|.+.|. +. ++ .+..++.|+...+. ...+. .+|-+ +..++..+-+|-..+..++-. T Consensus 1 m~~----~~i~~L~~~~~---~~-~~---r~~~~~~yy~g~~~~~~~~~-~~p~~----~~~~~~~v~nw~~~~Vd~~a~ 64 (422) T protein:vir:97 1 MNY----MGMGYLRRKLA---LF-KT---GVDKRYRYYAMDDRDDTRSI-VMPNN----VREMYRSVLEWTAKGVDSLAD 64 (422) T ss_pred CCh----HHHHHHHHHHH---HH-HH---HHHHHHHHHhcCCChhhcCc-cccHH----HHHHHHhhcchhHHHHHHHHh Confidence 222 22223333333 22 22 23334455544331 11111 11100 011122233444444443322 Q ss_pred CccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeeeeEeeeeccceEE Q lcl|NC_015158. 90 NERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESGATRDTYFGPRAV 169 (581) Q Consensus 90 ~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~~~~~~~~~p~ie 169 (581) -..|.|++..|.+ +.+.+.++++.....+..+++++||.|++.+...+.. ..|.|. T Consensus 65 ---rl~~~Gf~~~d~~--------l~~~w~~N~ld~~~~~~~~~al~~G~sf~~v~~~~~~-------------~~p~i~ 120 (422) T protein:vir:97 65 ---RIIFREFTNDDFN--------AWEIFKANNPDIFFDTAIQSALIASCCFVYIMPGAED-------------GLPKMQ 120 (422) T ss_pred ---ccccceeeCCchh--------HHHHHHhcChHHHHHHHHHHHHHhcceeEEEeeCCCC-------------CeeEEE Confidence 1223344444432 3445567889999999999999999999998542111 136677 Q ss_pred ecchhhe--eecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchhhhhccccccccc Q lcl|NC_015158. 170 RIDPKDI--VFNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDCEKAVGFSMDGF 247 (581) Q Consensus 170 ~V~p~df--~~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 247 (581) .+||.+. ++||....+..+..+ .... + .+. T Consensus 121 ~~sp~~~~~i~D~~~~~~~~a~~~--~~~~------------~----------------------------------~~~ 152 (422) T protein:vir:97 121 VIEASKATGILDPTTFLLTEGYAI--LESD------------S----------------------------------NGN 152 (422) T ss_pred EechhhEEEEEeCCCCcceeeEEE--EEec------------C----------------------------------CCc Confidence 8899986 678864433221101 0000 0 000 Q ss_pred cccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccccCCcccCCCcH-Hhh Q lcl|NC_015158. 248 GNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIRQDNLYAMGPL-DNL 326 (581) Q Consensus 248 ~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p~~~~G~s~~-~~l 326 (581) .....|+.... + | +...++.. ...+|| .|..|++.+...+..+.+||.|.. +.+ T Consensus 153 ~~~~~~~~~~~--~---~---~~~~~~~~---------------~~~~~~--~g~vPvv~~~n~~~~~~~~G~s~I~e~v 207 (422) T protein:vir:97 153 PTLEAYFTDKD--I---W---YYPKKGKP---------------YNIKNP--TGHPLLVPIIHRPDAVRPFGRSRITKAG 207 (422) T ss_pred EEEEEEEcCce--E---E---EEcCCCcc---------------ccccCC--CCCcceEEecccCCCccccCccccchhH Confidence 00000111100 0 0 00111110 012455 477899999999999999999965 889 Q ss_pred hhHHHHHHHHHHHHHHHHHHhcCCeEEEec-c-----ccccccCCceeEEeCCCC-----CcccccCCCccchhHHHHHH Q lcl|NC_015158. 327 VGMQYRIDHLENLKADVFDLIAFPPMKVKG-D-----VEEFVWGPMEQIYINGDG-----DVEMMAPNTQALQADMQIQI 395 (581) Q Consensus 327 ~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~-d-----~~~i~~~pG~vi~~~~~~-----~i~~~~~p~~~~~~~~~lq~ 395 (581) +++|+.+|..+..++.+....++|+..+-+ + .+.....+|++|.+..+. .++.++..+.. .....+.. T Consensus 208 ~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~~~~~~~~~~~~i~~~~~de~~~~~~v~q~~~~~l~-~~~~~l~~ 286 (422) T protein:vir:97 208 MYHQKAAKRTLERAEVTAEFYSFPQKYVLGMDPDAKPMEKWRATVSTLLEISKDEDGDKPTVGQFTTASMA-PFMEHLKM 286 (422) T ss_pred HHHHHHHHHHHHHHHHHHHHhcchhhhhcccCcccccCchhhhhhhhhhccCCCCCCCcceeeecCCCChh-HHHHHHHH Confidence 999999999999999999999999865533 1 122334467888876432 23334333322 22233444 Q ss_pred HHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceeeecCchh Q lcl|NC_015158. 396 LEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADTIRVFDSDD 475 (581) Q Consensus 396 ~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~iR~~~~~~ 475 (581) +-..+-..||+|....|..+..+.+|-++...+.......+..-+.|.. .+++++++++.+....-. T Consensus 287 ~~~~~a~~s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~~~fg~-~l~~~~rla~~~~~~~~~------------ 353 (422) T protein:vir:97 287 YASLFAGGSGLTLDDLGFPSDNPSSVESIKAAHENLRAAGRKAQRSFSS-GFLNVAYIAVCLRDEFPY------------ 353 (422) T ss_pred HHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhcCCcc------------ Confidence 5555566689999999976654456777777777777777778888885 467788887665321100 Q ss_pred cccCCCccCHHH---hcCCce-EEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHHHHHHHHhcCCCcccccC Q lcl|NC_015158. 476 KVATFMNVNKDD---ITAKGR-LRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLAKMLEHNLSLGGWDIFKP 551 (581) Q Consensus 476 ~~~~~~~v~r~d---i~~~~~-vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~~~e~~~l~~~~~~~~ 551 (581) .+++ +...++ +...-+... +|.+..+..+.+. +..+. +. ..+.+.+|+.. T Consensus 354 --------~~~~~~~~~~~w~p~~~~~~~s~---a~~aDa~~Kl~~a--~~~~~---~~----~~~~~~lg~~~------ 407 (422) T protein:vir:97 354 --------LRNQFMDTVIKWEPLFEADANML---TLVGDGAIKLNQA--IPGFM---DA----DVIRDLTGVKG------ 407 (422) T ss_pred --------cchhhccceEEEccCCCCChHHH---HHHHHHHHHHHhh--ccccc---cH----HHHHHHcCCCc------ Confidence 0111 111111 001112222 3333333333321 11222 21 22334444432 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_015158. 552 NVAVMEAQTTSALVNQSQAQIEEEAQV 578 (581) Q Consensus 552 ~~~~~~~~~~q~~~q~aq~~~~~~~~~ 578 (581) .+-+.+++.+. ++++ T Consensus 408 -----~~~~~~~~~~~-------~~d~ 422 (422) T protein:vir:97 408 -----ADKPIPAITEV-------TTDG 422 (422) T ss_pred -----hhHHHHHHHhh-------hccC Confidence 12222332221 1112 No 117 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=99.51 E-value=1.3e-12 Score=85.72 Aligned_cols=434 Identities=12% Similarity=0.086 Sum_probs=209.3 Q ss_pred CccchhhhhhhccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhcccc-ccccccccccc---ccccccchHHHH Q lcl|NC_015158. 1 MTGKVLELQQMLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDT-TTTTNSTLPWK---NKTTLPKLCQIR 76 (581) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~-~~~~~~~~~~k---~~~~~pki~~~~ 76 (581) +-+++..+.-|-++. ..+..+|.+.|. ..+. .+..++.|+.-.+. ...+. .+|-. .+..+|-+.-++ T Consensus 2 ~~~~~~~~~gl~~~~-~~~~~~L~~~~~---~~~~----~~~~~~~Yy~G~~~~~~~~~-~~p~~~r~~~~v~nw~~~~V 72 (474) T protein:vir:81 2 IQQQTVRIPSLSNDE-NALINGLLAQIE---NLRW----KNLLRTSYYENKRTIQYVGT-LIPPQYFNLGLVLGWTGKAV 72 (474) T ss_pred cCCCcCcCCCCChhH-HHHHHHHHHHHH---HHhh----HHHHHHHHhccCCChhhccc-cccHHHHHHHhhcChHHHHH Confidence 666777777777643 334445555444 3332 23444456544332 12111 11100 012223333333 Q ss_pred HHHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeee Q lcl|NC_015158. 77 DNLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEES 156 (581) Q Consensus 77 d~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~ 156 (581) |.+..++. |.+ |.. |...++. ..+.....++++.......++++++||.|++.+...+. T Consensus 73 d~~a~rl~---~~G---f~~-~d~~~~~-------~~l~~iw~~N~ld~~~~~~~~~al~~G~sf~~V~~~~d------- 131 (474) T protein:vir:81 73 DALARRCN---LEG---FVW-PDGDLDS-------LGGTEVVDDNHLLSEIDSAIVAAMQHGPAFLINTVGED------- 131 (474) T ss_pred HHHHhhhc---ccc---eEC-CCCCccc-------hHHHHHHHhcChhHHHHHHHHHHHhhCceeEEEecCCC------- Confidence 33322222 111 111 1111111 12345567889999999999999999999988864321 Q ss_pred eeEeeeeccceEEecchhhe--eecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccch Q lcl|NC_015158. 157 GATRDTYFGPRAVRIDPKDI--VFNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTRE 234 (581) Q Consensus 157 ~~~~~~~~~p~ie~V~p~df--~~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~ 234 (581) + ...|.|..+||.+. .+||....+..+-.+ .... . T Consensus 132 ~-----~~~~~i~~~sp~~~~~~~D~~~~~~~~al~~--~~~~-----------~------------------------- 168 (474) T protein:vir:81 132 D-----EPEALIHVKDASEATGEWNRRRRGLNNLLSI--IDKD-----------K------------------------- 168 (474) T ss_pred C-----CceeEEEEeccceEEEEEeCCCCcceeeeEE--EEEc-----------C------------------------- Confidence 1 11256788999997 488865443322111 1000 0 Q ss_pred hhhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccccC Q lcl|NC_015158. 235 DCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIRQ 314 (581) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p 314 (581) ++......+|.++.+ +.+.. .+++.. -.....+||+ | .|++.++..+.. T Consensus 169 ----------~g~~~~~~ly~~~~~--~~~~~-----~~~~~~-----------w~~~~~~~~~--g-vPvV~~~n~~~~ 217 (474) T protein:vir:81 169 ----------EGKVLSLALYLDNET--VTAQR-----DKATLK-----------WQVDRDEHVY--G-VPAQVLPYKPAP 217 (474) T ss_pred ----------CCcEEEEEEEeCCcE--EEEEE-----cCccce-----------eeeccCCCCC--C-cceEEecccccc Confidence 000000011111111 11111 111100 0111234554 5 588888888888 Q ss_pred CcccCCCcH-HhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEec-cccc-----------cccCCceeEEeCCCCC----- Q lcl|NC_015158. 315 DNLYAMGPL-DNLVGMQYRIDHLENLKADVFDLIAFPPMKVKG-DVEE-----------FVWGPMEQIYINGDGD----- 376 (581) Q Consensus 315 ~~~~G~s~~-~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~-d~~~-----------i~~~pG~vi~~~~~~~----- 376 (581) ...+|+|-. +.++++|+.+|..+-.++......++|+..+-+ ++++ .....|++|-+..+.+ T Consensus 218 ~~~~G~s~i~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~~~~~~~d~d~~~~~~~~~~~~~i~~~~~d~d~~~~~ 297 (474) T protein:vir:81 218 KRPFGQSRITKPMMGLQDAGVRELARREGHMDVFSYPEFWLLGADESALKNADGTIKSVWEARLGRIKGLPDDADADIPQ 297 (474) T ss_pred cCcCCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhheeecCChhhcccccccccchhhhhHHHHhcCCCcccccccc Confidence 899999854 799999999999999999999999999865543 1111 1223566776665432 Q ss_pred -----cccccCCCccchhHHHHHHHHHHHHHhcCCchHhcCCCCcc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015158. 377 -----VEMMAPNTQALQADMQIQILEAKMEEFAGAPREAMGIRTPG-EKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKV 450 (581) Q Consensus 377 -----i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~-~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~l 450 (581) +..++..+.. +....+..+-..+-..||+|....|..... +.+|-++..............-+.|.. .++++ T Consensus 298 ~~~~~~~q~~~a~l~-~~~~~l~~~~~~~a~~t~iP~~~lG~~~~~np~SaeAi~a~~~~l~~kae~k~~~fg~-~l~~~ 375 (474) T protein:vir:81 298 LARADVKQFPAASPD-AHWSDINGLAKLFAREASLPDTAVAISGLSNPTSAESYDASQYELIAEAEGAVDDFTP-ALRKA 375 (474) T ss_pred cccccccccCCCChh-HHHHHHHHHHHHHHhhhCCCHHHhcccccccccHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHH Confidence 2222222211 122223444444555689999999976544 466777877777777777778888885 46888 Q ss_pred HHHHHHHHHhhcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHH Q lcl|NC_015158. 451 LNAMLEISRRNLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVST 530 (581) Q Consensus 451 i~~~~~f~~~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~ 530 (581) +++.+.+... ... -.++.++.+...+-.-.-..++++++.-+..|.+.. ..+.+. T Consensus 376 ~rla~~i~~~-~~~----------------~~~~~~~~~~~v~W~d~~~~s~a~~aDa~~Kl~~a~-----~~~~~~--- 430 (474) T protein:vir:81 376 FIRALAMKNK-VAI----------------DEIPDEWKSIDAKWRDPRYLSKSAQADAGMKQLAAV-----PWLAET--- 430 (474) T ss_pred HHHHHHHhCC-CCc----------------cccchhhccceeEecCCCccCHHHHHHHHHHHHhcc-----cCCCcH--- Confidence 8888765311 000 011122222222111122334445443333333221 122221 Q ss_pred HHHHHHHHHHhcCCCcccccCCCCcHHHHHHHHH--H--HHHHHHHHHHhcc--cCC Q lcl|NC_015158. 531 ENLAKMLEHNLSLGGWDIFKPNVAVMEAQTTSAL--V--NQSQAQIEEEAQV--PLV 581 (581) Q Consensus 531 ~~l~~~~~e~~~l~~~~~~~~~~~~~~~~~~q~~--~--q~aq~~~~~~~~~--~~~ 581 (581) ++ +.+ +.|+. +++...++ + ++++..+.+-..- +== T Consensus 431 -~~---~~~---~lg~t---------~~~i~~~~~~~~~~~~~~~~~~l~~~~~~~~ 471 (474) T protein:vir:81 431 -EV---GLE---LIGLT---------PQQARRAMADKRRVQGRGTLQALIDRSNNGA 471 (474) T ss_pred -HH---HHh---hcCCC---------HHHHHHHHHHHHHHhHHHHHHHHHhcCCCCC Confidence 11 112 23431 22211111 1 1222222221000 000 No 118 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=99.50 E-value=2.1e-12 Score=84.56 Aligned_cols=471 Identities=7% Similarity=-0.027 Sum_probs=208.5 Q ss_pred CccchhhhhhhccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhccccccccccccccccc--ccccchHHHHHH Q lcl|NC_015158. 1 MTGKVLELQQMLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTTTTTNSTLPWKNK--TTLPKLCQIRDN 78 (581) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~~~~~~~~~~k~~--~~~pki~~~~d~ 78 (581) .+.-..=++.-+....++..-.+...|. +.-+....+|... .|+...-.+- ..+|.++ +.+|-...+++. T Consensus 4 ~~~~~~~i~~w~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~-~~~~~~w~~~----~~~~~~~~~~~~~l~~~i~~~ 75 (518) T protein:vir:78 4 WSVMTRFIKGWLNGKPNGSEPELIPKYL---PLVPDNQKEWSKD-SYLTSLWAQG----YVPTVHDKLMNSGTGNEIVVV 75 (518) T ss_pred hhhHHHHHHHhhcCCCCccchhccHHHh---hhcccchhhhhhh-hhhhhhcccC----CCCccccccccCChHHHHHHH Confidence 0100011112221111211111111111 1111112222111 1111110011 1123333 333323344444 Q ss_pred HHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeeee Q lcl|NC_015158. 79 LHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESGA 158 (581) Q Consensus 79 ~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~~ 158 (581) +...+ |+-..=+++.+....+. ++.++.|+..|++.+|...+.+.+.+++..|.+++|..|... T Consensus 76 ~A~ll----~~e~~~i~v~~~~~~d~---e~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~~--------- 139 (518) T protein:vir:78 76 AAEYI----SGKPLSIDVTGVNGSKD---ENLTKQLKEALRIDNFDSKSVKIVELAGGSGVSAVKINILNG--------- 139 (518) T ss_pred HHHhh----cCCCceEEecCccccCc---HHHHHHHHHHHHhccHHHHHHHHHHHhhccCceEEEEEEECC--------- Confidence 44333 45544466655443333 345678888889999999999999999999999999988422 Q ss_pred EeeeeccceEEecchhheeecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchhhhh Q lcl|NC_015158. 159 TRDTYFGPRAVRIDPKDIVFNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDCEK 238 (581) Q Consensus 159 ~~~~~~~p~ie~V~p~df~~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~ 238 (581) +++|+.|++..|||.-.-.++-.|-|+.+.... + +...|.. +..+. ..++.. T Consensus 140 ------~~~i~~v~ad~~~P~~~~g~~~~~~f~~~~~~~--~--------k~~~y~~-----lE~he-------~~~~~~ 191 (518) T protein:vir:78 140 ------RPSISVHSSSQFWIDFKNNEPFRFNFFEEIPTS--N--------KADIYYL-----VESRE-------IKQWDK 191 (518) T ss_pred ------eeEEEEEcCCeeEEEeecCcEEEEEEEEEeecC--C--------cceeEEE-----EEeec-------cccccc Confidence 367889999999885322233344444322111 0 0000100 00000 000000 Q ss_pred ccccccccccccccccCCceEEEEEEeee-eecccC---CceeeeeE-EEEEeCCEEEEeecCCCc-cCCCCeeEecccc Q lcl|NC_015158. 239 AVGFSMDGFGNLYDYFQSPYVEVLTFYGD-YHDTQS---GTFKRNMK-VTIIDRMFVIEEKENPSW-FAQAPIFHCGWRI 312 (581) Q Consensus 239 ~~~~~~~~~~~~~~~~~~~~vevlE~~g~-~~d~~~---d~~~e~~~-itv~~g~~iir~~~nP~~-~g~~Pf~~~~~~~ 312 (581) . .+. +++.+|+ ++.|-. ...... .+..+... +.-..|... +. -+. ..++||+.+-+.+ T Consensus 192 --------~--~~~-~~~~~I~-n~ly~~~~~~~v~~~~~~~~~~l~~~~~~~~~~e---~~-~~~tg~~~~~~~~~~n~ 255 (518) T protein:vir:78 192 --------E--GKK-LSGGFVT-YSVIKIDGDKTTPISAERLPEQITSYLHTNDIQL---NH-SVSIGLKSMGAYLINNS 255 (518) T ss_pred --------e--eec-ccceeEE-EEEeeecCcccccccccccccccccccccccCcc---ce-eeccCCccceEEeeccc Confidence 0 000 0011111 111100 000000 00000000 000001000 00 011 2345666654443 Q ss_pred c-----CCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEeccccc---cccCCc---------eeEE-eC-- Q lcl|NC_015158. 313 R-----QDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGDVEE---FVWGPM---------EQIY-IN-- 372 (581) Q Consensus 313 ~-----p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d~~~---i~~~pG---------~vi~-~~-- 372 (581) . ++|.+|+|+.+.+.+.++.+|....+..+.+.. +.+++.|.++.-. ....++ .+++ ++ T Consensus 256 ~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~-g~~~i~v~~~~l~~~~~~~~~~~~~~fd~~~~~y~~i~~~ 334 (518) T protein:vir:78 256 PSNTRYPHLNLGESDLSQCTNYLFAVDYFFTVYMREGEK-TKTKIAASERMFRKKVNKSTDKEEWSMNVDEDYFMQFKGT 334 (518) T ss_pred cccccccCCCcCcchHhhhhHHHHHHHHHHHHHHHHHHh-CCceeeechhHhccCCCCCCCccccccCCCCceEEEecCc Confidence 3 478899999999999999999999999999976 4555555432110 011111 2222 21 Q ss_pred C--CCC----cccccCCCccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015158. 373 G--DGD----VEMMAPNTQALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVML 446 (581) Q Consensus 373 ~--~~~----i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~ 446 (581) . ++. ++.+++.-...+....++.+...++.-+|++..+.|.+ .+.+|||++....++.-+.+..+.+.+.. . T Consensus 335 ~~~~~~~~~~i~~~~~~Ir~e~~~~~~~~~l~~~~~~~G~s~~tfg~~-~~~~TATei~s~~~~~~~t~~~~~~~~e~-a 412 (518) T protein:vir:78 335 LDAGAKLNDMIQFMQGDFRDGSYRETMEYFAQKAVSKSGYNPATFNLG-NREVKATEIWSLQDATVRKIEKKKRLIQN-V 412 (518) T ss_pred CCCCCccccceeeeecccChHHHHHHHHHHHHHHHHhhCCChhhcCcc-cccccHHHHHHHHHHHHHHHHHHHHHHHH-H Confidence 1 111 44444443344556668888888999999999998875 45799999998888877777888888874 5 Q ss_pred HHHHHHHHHHHHHhhcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccc Q lcl|NC_015158. 447 MEKVLNAMLEISRRNLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKP 526 (581) Q Consensus 447 ~~~li~~~~~f~~~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p 526 (581) ++.|+..+++++....... .+ ...+...++..+|. -+.... +...++.++++-. +++.+ T Consensus 413 l~~l~~~i~~l~~~~~~~~-----~~-------~~~~~~~~v~i~f~--D~i~~D---~~~~~~~~~~~v~----aGimS 471 (518) T protein:vir:78 413 YEQMLWDFLYLLTGGTNNK-----EK-------AIMRDEIRVIIEFP--DPMSVN---LNELSSTLNNMNS----ALAMS 471 (518) T ss_pred HHHHHHHHHHHHHhhcCcc-----cc-------ccCCCceeEEEEeC--CCCCCC---HHHHHHHHHHHHh----cCCCC Confidence 6888888887765431100 00 00001112222221 000111 2222333333221 12322 Q ss_pred hhHHHHHHHHHHHHhcCCCcccccCCCCcHHHHHHHHHHHHHHHHHH-HHhcccC----C Q lcl|NC_015158. 527 HVSTENLAKMLEHNLSLGGWDIFKPNVAVMEAQTTSALVNQSQAQIE-EEAQVPL----V 581 (581) Q Consensus 527 ~~~~~~l~~~~~e~~~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~-~~~~~~~----~ 581 (581) . +..++ +. .++ .++++..+. .++.+++.. ...+-|= . T Consensus 472 ~---e~~i~---~~--~~~---------~~deea~~e-~~ri~~E~~~~~~~~p~~~~g~ 513 (518) T protein:vir:78 472 V---EEKVK---LI--HPK---------WEDEEIQAE-VKRIYLENAIGEVPDPEAIGGM 513 (518) T ss_pred H---HHHHH---Hh--CCC---------CCHHHHHHH-HHHHHHHhcccCCCCCccccCC Confidence 2 11112 11 111 123333332 222332211 1222121 1 No 119 >protein:vir:78083 Length: 537 # NCBI annotation: gp3 # Family: family:all:125 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468787;genbank:gi:157325368;genbank:GeneID:5601845 Probab=99.49 E-value=3.6e-12 Score=83.31 Aligned_cols=453 Identities=11% Similarity=0.035 Sum_probs=225.3 Q ss_pred CccchhhhhhhccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhccccc------ccccccccc------ccccc Q lcl|NC_015158. 1 MTGKVLELQQMLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTT------TTTNSTLPW------KNKTT 68 (581) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~------~~~~~~~~~------k~~~~ 68 (581) ||-+.+.+.. +.+.+.|..++.++...++ .....++++|+.--|.- ..+...... -+++. T Consensus 1 ~~~~~~~~~~------~~~~~~~~~~i~~~~~~~~--~~~~~~~~~YY~g~h~Il~r~~~~~~~~~~~~~d~~~~nnki~ 72 (537) T protein:vir:78 1 MTSPLLNKPI------DQLGGLLNTEITTYMASNH--IKWAHIGENYYNQENDIEKSRIFYMNDKGQLREDNYASNVKIS 72 (537) T ss_pred CCcccccccH------HHHHHHHHHHHHHHHHHHH--HHHHHHHHHHhcccchhhhcccccccccccccccccccccccc Confidence 6665554433 3445556666665554333 23455677776655411 111111111 13577 Q ss_pred ccchHHHHHHHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeec Q lcl|NC_015158. 69 LPKLCQIRDNLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVK 148 (581) Q Consensus 69 ~pki~~~~d~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~ 148 (581) .+....+++...++|+ +++- +|.+...++.+ ..+.++..+ ..+|.....+..+++.+||.|...+++.. T Consensus 73 ~nf~k~Ivd~~~~yl~----G~Pv--~~~~~d~~~~e----~~~~l~~~~-~~~~~~~~~el~~~~s~~G~ay~~~y~de 141 (537) T protein:vir:78 73 HGFFTELVDQLAQYLL----SNGV--EVKVKDEDNTQ----LDEILQEYF-DEDFQATIDTLVTNASKKGFEGIFARTTS 141 (537) T ss_pred cchHHHHHHHHhhhhc----ccCc--eeecCcchhHH----HHHHHHHHh-hccHHHHHHHHHHHHhhcCeeEEEeeecC Confidence 7777778888888876 4443 33333333323 334454444 35777788999999999999988876532 Q ss_pred ceeeeeeeeeEeeeeccceEEecchhhee--ecCCCCCcccCceEEEEEec-HHHHHHHhhccCccchhHHHHHHHHhhh Q lcl|NC_015158. 149 ETTKDEESGATRDTYFGPRAVRIDPKDIV--FNPVAVDFAHSPKIIRTVLN-EGELLQMEQDQPENASLASAIARRREFR 225 (581) Q Consensus 149 ~~~~~~~~~~~~~~~~~p~ie~V~p~df~--~DP~a~~~~d~~~i~r~~~T-~~el~~m~~~~~~~~~~~~d~~~~~~~~ 225 (581) . ..+++..++|.++| +|.+ .+-..++ |.+.. ..+.... T Consensus 142 ~--------------~~~~~~~i~p~~~~pv~d~~---~~~~~~~-~~y~~~~~~~~~~--------------------- 182 (537) T protein:vir:78 142 E--------------GKLKFQTVDGLTLIPVFDDY---GVLKMII-RWYSEIRYSTKQQ--------------------- 182 (537) T ss_pred C--------------CceEEEEEccceeEEEEcCC---CCceeEE-EEEeeeecccccc--------------------- Confidence 2 13577889999974 4532 2222223 33222 1110000 Q ss_pred ccCCcccchhhhhccccccccccccccccCCceEEEEEEeee-------eecccCCceeeeeEEEEEe----CCEEE--E Q lcl|NC_015158. 226 RGLGTYTREDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGD-------YHDTQSGTFKRNMKVTIID----RMFVI--E 292 (581) Q Consensus 226 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~-------~~d~~~d~~~e~~~itv~~----g~~ii--r 292 (581) +... ....++|+...+..+.+-+. .....++... ..+.... .+... . T Consensus 183 ------~~~~------------~~~~evyt~~~i~~y~~~~~~~~~~~~~~~~~~~~~i--~~~~~~~~~~~~~~~~~~~ 242 (537) T protein:vir:78 183 ------STET------------IWHADVWNEEAVCYYIQDDEGVSTTYKLDEAYNPNPA--PHVLAIEESTDADFEDTDG 242 (537) T ss_pred ------Ccce------------EEEEEEEcCCcEEEEEecCCccccccccccccccccc--ceeeecccccccccccccc Confidence 0000 00011112221111110000 0000000000 0000000 00000 1 Q ss_pred eecCCCccCCCCeeEecccccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc-cc---ccc--cCCc Q lcl|NC_015158. 293 EKENPSWFAQAPIFHCGWRIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD-VE---EFV--WGPM 366 (581) Q Consensus 293 ~~~nP~~~g~~Pf~~~~~~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d-~~---~i~--~~pG 366 (581) ...-|.++|+.|++.+. ..-+|.|..+.++++++.++.+.+.+.|++...++|.+.+.+. .+ +.. -+-. T Consensus 243 ~~~~~~~~g~iPvv~f~-----nn~~~~sd~e~v~~LiDayd~~~S~~an~~~~~~~~ilvi~g~~~~~~~~~~~~l~~~ 317 (537) T protein:vir:78 243 YQVLGRSYSKFPFQLLY-----NNKDGMSDVKRVKSIIDDYDVMNCFLSNNLQDFSEAIYVVKGFSGDSTDKLRQNIKAK 317 (537) T ss_pred ccccccCCcceeEEEec-----cCccCCCchhhhHHHHHHHHHHHHhhhhHHHHhcCceeeeecCCCccchhHHHHHhhc Confidence 11123345777776444 3457999999999999999999999999999999999887652 11 111 1123 Q ss_pred eeEEeC-CCCCcccccCCCccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015158. 367 EQIYIN-GDGDVEMMAPNTQALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVM 445 (581) Q Consensus 367 ~vi~~~-~~~~i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~ 445 (581) +++.+. .++++.++..+.......+.+..+.+.+-+.|.++..+ ....++.|+.++..++..+........+.|.. T Consensus 318 ~~i~v~~d~~~v~~l~~~~~~~~~e~~ld~L~~~I~~~s~~~~~~--~~~~gn~SGvAlk~~~~~l~~ka~~ke~~f~~- 394 (537) T protein:vir:78 318 KMIGVNGDNAGMEIQTVSIPYEARKAKMDIDVENIYRSGMGFNST--AVGDGNVTNVVIKSRYTLLAMKARKMETSLRK- 394 (537) T ss_pred CceeecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHhcCCCCCc--cccccCCcHHHHHHHHhhHHHHHHHHHHHHHH- Confidence 556666 45778888888766677778899999888888776643 33445666666777888887777888888985 Q ss_pred HHHHHHHHHHHHHHhhcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhccccccccc Q lcl|NC_015158. 446 LMEKVLNAMLEISRRNLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIK 525 (581) Q Consensus 446 ~~~~li~~~~~f~~~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~ 525 (581) +++++++++.+++.... +. .....+|+..| ...-.....+.++.+..|. + . T Consensus 395 ~l~~~~~~i~~~~~~~~---------~~--------~~d~~~i~i~f--~~~~P~n~~e~a~~~~~l~---~-------~ 445 (537) T protein:vir:78 395 VLRWCADMVVSDIALRG---------LG--------EYDSNDICFEI--EPHVLANELDIATTRKTEA---E-------T 445 (537) T ss_pred HHHHHHHHHHHHHhhcC---------Cc--------ccccceeeEEe--ccCCCCCHHHHHHHHHHHH---h-------c Confidence 67889998888764321 00 01112333333 1222223334343333321 1 1 Q ss_pred chhHHHHHHHHHHHHhcCCCcccccCCCCcHHHHHHHHHHH-----HHHHHHHHH-hc--------ccCC Q lcl|NC_015158. 526 PHVSTENLAKMLEHNLSLGGWDIFKPNVAVMEAQTTSALVN-----QSQAQIEEE-AQ--------VPLV 581 (581) Q Consensus 526 p~~~~~~l~~~~~e~~~l~~~~~~~~~~~~~~~~~~q~~~q-----~aq~~~~~~-~~--------~~~~ 581 (581) +.+|.+.++.++ + +..++ +++++...+. ......+++ .| .+++ T Consensus 446 giiS~eT~l~~~-------p---~vdd~---e~ek~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 502 (537) T protein:vir:78 446 EALKIGNIMTVA-------P---RIGDD---ETLKLIAEELDLDYNELKDALAEQDAQSLDVSPDVQAML 502 (537) T ss_pred CcchHHHHHHhC-------C---CCCCH---HHHHHHHHHHHhhhhhhhhhhhhhcccccCcCcchhhhc Confidence 222322222111 1 11111 1111000000 000000000 00 1222 No 120 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=99.44 E-value=6.5e-12 Score=81.92 Aligned_cols=396 Identities=9% Similarity=0.021 Sum_probs=192.5 Q ss_pred HhhcccccccccccccccccccccchHHHHHHHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHH Q lcl|NC_015158. 47 YIFATDTTTTTNSTLPWKNKTTLPKLCQIRDNLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTI 126 (581) Q Consensus 47 y~~~~~~~~~~~~~~~~k~~~~~pki~~~~d~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~ 126 (581) ||-+. .....+ +...+..++-+.-++|.+...+. |.+ |+ ..|.+.-+ .+...+.+++|... T Consensus 1 ~l~~~---~~~~~~-~~~~~~v~n~~~~ivd~~~~~l~---~~g---f~-----~~d~~~~~----~~~~i~~~N~~d~~ 61 (434) T protein:vir:98 1 MLPKN---AEQAFL-DFQRKARTNFCGLIANASVHRLL---ALG---VT-----GPDGEPDT----RASRWWQANRLDSR 61 (434) T ss_pred CCCCC---ccHHHH-HhhhhhhccchHHHHHHHHhhhc---cCc---ee-----cCCCchHH----HHHHHHHhcChhHH Confidence 22111 101111 11122334555667777666553 222 22 11211111 23345667899999 Q ss_pred HHHHHHHHhhcCceEEEEeeecceeeeeeeeeEeeeeccceEEecchhhe--eecCCCCCcccCceEEEEEecHHHHHHH Q lcl|NC_015158. 127 MSQLLLDYIDYGNCFATVEYVKETTKDEESGATRDTYFGPRAVRIDPKDI--VFNPVAVDFAHSPKIIRTVLNEGELLQM 204 (581) Q Consensus 127 ~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~~~~~~~~~p~ie~V~p~df--~~DP~a~~~~d~~~i~r~~~T~~el~~m 204 (581) ..++.+++.+||.|++.+..... +........|.|..+||.++ ++||....+. +.++.+.+..+ T Consensus 62 ~~~~~~~a~i~G~ay~~v~~~~~-------~~~~~~~~~~~I~~~~p~~~~~i~D~~~~~~~---~ai~~~~~~~~---- 127 (434) T protein:vir:98 62 QKLVWRMAMAQSAGYMLVGAHPT-------RTEDNGRPSPLITMEHPSECIVEYDPETGEPL---VGLKVWHNDID---- 127 (434) T ss_pred HHHHHHHHhhcCceEEEEecCCC-------cccccCCceeEEEEeccceeEEEEeCCCCceE---EEEEEEEeccC---- Confidence 99999999999999999965321 11111223567888999996 5676543322 22222211000 Q ss_pred hhccCccchhHHHHHHHHhhhccCCcccchhhhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEE Q lcl|NC_015158. 205 EQDQPENASLASAIARRREFRRGLGTYTREDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTI 284 (581) Q Consensus 205 ~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv 284 (581) +. .+..++ + .+.+..+.+. ....+.........+ T Consensus 128 -------~~--------------------------------~~~~~~--~-~~~~~~~~~~----~~~~~~~~~~~~~~~ 161 (434) T protein:vir:98 128 -------GF--------------------------------GYARVF--F-DDTSFPYRTR----ERTGARLPWGPDSWV 161 (434) T ss_pred -------Cc--------------------------------eEEEEE--E-eCcEEEEEEe----eccccccccccccce Confidence 00 000000 0 0001111110 000000000000000 Q ss_pred EeCCEEEEeecCCCccCCCCeeEecccccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEec-ccccc-- Q lcl|NC_015158. 285 IDRMFVIEEKENPSWFAQAPIFHCGWRIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKG-DVEEF-- 361 (581) Q Consensus 285 ~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~-d~~~i-- 361 (581) . .........| +.|+.|++.+...+..+. +|.|..+.++++++.+|..+..+++.+...++|+..+.+ ++.+. T Consensus 162 ~-~~~~~~~~~h--~~g~vPvv~f~N~~~~~~-~g~sd~e~vi~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~ 237 (434) T protein:vir:98 162 Y-TGTADSGDVH--DLGGMQLVEFARMPDLGE-DPEPEFAGVLDIQDRVNLGILNRMAASRFSGFRQKWIKGHKFAKRTD 237 (434) T ss_pred e-cccccccccC--CCCccceEEeccCCCcCc-CCcchhhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCcccccc Confidence 0 0111112233 467889988888888766 699999999999999999999999999999999876643 12111 Q ss_pred ------------ccCCceeEEeCCCCCcccccCCCccchhHHHHHHHHHHHH---HhcCCchHhcCCCCcccccHHHHHH Q lcl|NC_015158. 362 ------------VWGPMEQIYINGDGDVEMMAPNTQALQADMQIQILEAKME---EFAGAPREAMGIRTPGEKTAFEVQQ 426 (581) Q Consensus 362 ------------~~~pG~vi~~~~~~~i~~~~~p~~~~~~~~~lq~~~~~~e---e~TGv~~~~~G~~~~~~~TAtgv~~ 426 (581) ...+|++|.... +++...+.+. .++.+.+..++..+. ..|++|....|. ...|.+|.++.. T Consensus 238 ~~~~~~~~~~~~~~~~~~i~~~~~-~~~~~~q~~~--~~~~~~~~~l~~~i~~~~~~~~~p~~~~~~-~~~n~Sg~Al~~ 313 (434) T protein:vir:98 238 PATGMTVVDQPFVPSPSAVWASEG-ENTQFGQLDA--TDLSGFLKEHASDVRDMLTISQTPTYLYAT-DLVNISADTIGA 313 (434) T ss_pred cccccchhhhhhhccccccccCCC-CCceEEEecC--cchHHHHHHHHHHHHHHhcccCCCHHHhcc-ccCChHHHHHHH Confidence 123455554432 2333323221 223344555555555 457788777773 234667777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHH Q lcl|NC_015158. 427 LQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQA 506 (581) Q Consensus 427 l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~ 506 (581) ...........+.+.|.. .+++++++++.+. ....+.. +++..| .-.-...+.+.+ T Consensus 314 ~~~~l~~k~~~k~~~f~~-~l~~~~rl~~~~~---g~~~~~~------------------~~~v~w--~~~~~~s~~~~a 369 (434) T protein:vir:98 314 LDILHVAKVREHIASFSE-GLESVLALAAAQA---GVPEDYT------------------EAEVRW--ANPAHVTMAVKA 369 (434) T ss_pred HHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhc---CCChhhe------------------eeeEEe--cCCCCCCHHHHH Confidence 777777777888888885 5678888776541 1111100 111111 112233445555 Q ss_pred HHHHHHHHhhcccccccccchhHHHHHHHHHHHHhcCCCcccccCCCCcHHHHHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 507 QVVQSLMGIANTPVWQDIKPHVSTENLAKMLEHNLSLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 507 q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~~~e~~~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) +.+..|.+.. + |. ..+.+.+++. ++ +.++++++..++....++.+.. T Consensus 370 da~~kl~~~g-------~-~~-------e~~~~~lg~~-----------~~--e~~r~~~e~~~~~~~~~~~~~~ 416 (434) T protein:vir:98 370 DAATKLKSIG-------Y-PL-------DVIAEELDES-----------PA--RVRRIVAGAASQALLAASLLPA 416 (434) T ss_pred HHHHHHHhcC-------C-cH-------HHHHHhCCCC-----------HH--HHHHHHHHHHHHHHHHHhhhcc Confidence 5444443321 2 22 1222332221 11 2223333222222222222222 No 121 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=99.29 E-value=1.7e-10 Score=74.09 Aligned_cols=419 Identities=13% Similarity=0.126 Sum_probs=193.5 Q ss_pred hcc-c----hhhhHHHHHH-HHHhhHHhhhhhHHHHHHHHHHHhhcccccc-c--ccccccccc---cccccchHHHHHH Q lcl|NC_015158. 11 MLD-D----TRDGLAEQIA-NTWQNWNSQRQEWLSQKSELRNYIFATDTTT-T--TNSTLPWKN---KTTLPKLCQIRDN 78 (581) Q Consensus 11 ~~~-~----~~~~~a~~i~-~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~~-~--~~~~~~~k~---~~~~pki~~~~d~ 78 (581) |++ + +.+.+..+|. .+.+.+.. ....+.+++.|+.--+.-. . .+...+++. .+.++-+.-+++. T Consensus 1 ~~~~p~~~l~~~~~~~~~~~~l~~~~~~----~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~n~~~~iVd~ 76 (479) T protein:vir:99 1 MIDLPDEDLSSEGLAKYLETKVFPKMNT----ECERLDDFEAWTKNGQEVPDLATRHKNKEREVLQQLSRKPWMGLMVNS 76 (479) T ss_pred CccCCcccCChhHHHHHHHHHHHHHHHH----HhHHHHHHHHHHhcCCcccccccccCChhHHHHHHHhhcCcHHHHHHH Confidence 443 1 1234444444 34444433 2235555666765543211 0 111111110 1123433444454 Q ss_pred HHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeeee Q lcl|NC_015158. 79 LHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESGA 158 (581) Q Consensus 79 ~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~~ 158 (581) +..++. +.+.+..+.+..+. +...+.++++.....+++.++.+||.|++.+..... T Consensus 77 ~~~~l~-----------~~gf~~~d~~~~~~----~~~i~~~N~~d~~~~~~~~~a~~~G~af~~v~~~~~--------- 132 (479) T protein:vir:99 77 FAQQLI-----------VDGYRKTGTNENAK----GWDTWRLNQMDKQQFWLNRAVLTFGYAFIKVTSGIS--------- 132 (479) T ss_pred HHhhcc-----------cccccCCCchhhHH----HHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCC--------- Confidence 444331 22233333333333 334456688999999999999999999887753211 Q ss_pred EeeeeccceEEecchhheee--cCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchhh Q lcl|NC_015158. 159 TRDTYFGPRAVRIDPKDIVF--NPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDC 236 (581) Q Consensus 159 ~~~~~~~p~ie~V~p~df~~--DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~ 236 (581) .......|.+..++|.++++ |...... ..+. .++. .. T Consensus 133 ~~d~~g~~~i~~~~p~~~~~iydd~~~~~---~~~~--~~~~------------------------------~~------ 171 (479) T protein:vir:99 133 PLDGTTVARIKCIDPRDAFAIWEDPYWDE---WPKY--LLER------------------------------QP------ 171 (479) T ss_pred CcCCCCceEEEEechhheEEEecCCcccc---eeeE--EEee------------------------------cC------ Confidence 01112236778889998743 3221110 0000 0000 00 Q ss_pred hhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccccCCc Q lcl|NC_015158. 237 EKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIRQDN 316 (581) Q Consensus 237 ~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p~~ 316 (581) .+ ...++....+.. | ...+|.+ .+.....| ..|+.|++.+...+..+. T Consensus 172 --------~~---~~~~~~~~~~~~---~----~~~~~~~------------~~~~~~~h--~~g~vPvv~f~n~~~~~~ 219 (479) T protein:vir:99 172 --------NG---QYWWWTEEDYSI---F----EFKQGKF------------IYRETVSH--DYGHIPFVRYVNVMDLRG 219 (479) T ss_pred --------ce---eEEEEecceEEE---E----EecCCce------------eecccccc--CCCCcceEEeecCCCcCc Confidence 00 000111111111 1 1111111 11111233 468899988888887754 Q ss_pred ccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc--ccc-------cccCCceeEEeCCCCCcccccCCCccc Q lcl|NC_015158. 317 LYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD--VEE-------FVWGPMEQIYINGDGDVEMMAPNTQAL 387 (581) Q Consensus 317 ~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d--~~~-------i~~~pG~vi~~~~~~~i~~~~~p~~~~ 387 (581) +|.|..+.++++++.+|.....+.+.+...++|+..+.+. .++ .....++++-.. ++++...+-+. . T Consensus 220 -~g~sd~e~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~i~~~~-~~~~~~~q~~~--~ 295 (479) T protein:vir:99 220 -VCYGDVEPLVTVAKAIDKTGLDILLVQHHQSFQIRWATGLMLPEGANADQEKMRFAQESMLISQ-NEKASFGAIPA--A 295 (479) T ss_pred -CCcchhHHHHHHHHHHHHHHHHHHHHHHHhhchhhhhcCCCcccccccchhccccccccceeec-CCCceEEEecc--c Confidence 7999999999999999999999999999999998655431 111 111234455433 33334333332 2 Q ss_pred hhHHHHHHHHH---HHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCc Q lcl|NC_015158. 388 QADMQIQILEA---KMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDV 464 (581) Q Consensus 388 ~~~~~lq~~~~---~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~ 464 (581) ++.+.+..++. .+-..||+|....|.. +|.+|.++...............+.|.. .+++++++++.+.....+ T Consensus 296 ~~~~~~~~l~~~i~~i~~~t~~p~~~~g~~--~n~Sg~Al~~~~~~l~~ka~~~~~~f~~-al~~~~~l~~~~~~~~~~- 371 (479) T protein:vir:99 296 PLDGLLNAYKESLLEFLALAQLPPHIAGQI--VNVAADALAAGTRQTMQKLFEKQATWKA-SHNQTMRLVNKIEGRTEE- 371 (479) T ss_pred chHHHHHHHHHHHHHHhccCCCCHHHcccc--cchHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHcCCCcc- Confidence 23334444554 4445578888888853 3456666666666666777777888885 578888888765311000 Q ss_pred cceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHHHHHHHHhcCC Q lcl|NC_015158. 465 ADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLAKMLEHNLSLG 544 (581) Q Consensus 465 ~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~~~e~~~l~ 544 (581) .+-. +|...++ -.......+.+ +.+.++.+. .+ ++.+.+ .+. ++ T Consensus 372 ~~~~------------------~i~~~w~--~~~~~s~~~~a---d~~~kl~~a----g~---is~et~----l~~--l~ 415 (479) T protein:vir:99 372 ATDL------------------DFTITWQ--DVTIQSLAQFA---DAWAKMVES----LK---IPAEGV----WDM--IP 415 (479) T ss_pred ccce------------------eeeEEec--CCCCCCHHHHH---HHHHHHHhc----CC---CCHHHH----HHh--cC Confidence 0000 1111111 11122333333 333333321 12 332222 222 34 Q ss_pred CcccccCCCCcHHHHHHHHHHHHHHH---HHH--HHhcccC-----C Q lcl|NC_015158. 545 GWDIFKPNVAVMEAQTTSALVNQSQA---QIE--EEAQVPL-----V 581 (581) Q Consensus 545 ~~~~~~~~~~~~~~~~~q~~~q~aq~---~~~--~~~~~~~-----~ 581 (581) +++ ++++ +...+..++.++ ..+ ...+.|. . T Consensus 416 gv~----~~~~---e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 455 (479) T protein:vir:99 416 NLD----QSTV---NGWKEIYDREGDFGKYMRKLQNGPDPAEQRGGP 455 (479) T ss_pred CCC----HHHH---HHHHHHHHHHHHHHHHHHHHhcccCcccccCCC Confidence 432 1111 111111111100 000 0111111 1 No 122 >protein:vir:96403 Length: 666 # NCBI annotation: hypothetical protein # Family: family:all:11276 # MgeID: mge:1616 # MgeName: 119X # Cross-refs: genbank:acc:YP_001218810;genbank:gi:147917327;genbank:GeneID:5142606 Probab=99.17 E-value=1.8e-10 Score=73.95 Aligned_cols=565 Identities=13% Similarity=0.093 Sum_probs=269.4 Q ss_pred Cccchhhhhhhcc-chhhhHH-HHHHHHHhh----HHhhhhhHHHHHHHHHHHhhccccc--------cccccccccccc Q lcl|NC_015158. 1 MTGKVLELQQMLD-DTRDGLA-EQIANTWQN----WNSQRQEWLSQKSELRNYIFATDTT--------TTTNSTLPWKNK 66 (581) Q Consensus 1 ~~~~~~~~~~~~~-~~~~~~a-~~i~~~~~~----~~~~r~~~~~~~~~~~~y~~~~~~~--------~~~~~~~~~k~~ 66 (581) +|-.--.+++..+ .|-|.|- .-|++..+- --++-|.-+..-.++.-|+-++... -.....-+=-|+ T Consensus 3 ispsepninsfvytqrvdellkahlkkildfsktnkanyiqKMD~ID~AYARY~~~~~N~~LlG~~~~~~A~V~C~V~~~ 82 (666) T protein:vir:96 3 ISPSEPNINSFVYTQRVDELLKAHLKKILDFSKTNKANYIQKMDLIDKAYARYITAQENNELLGYNQNIAAKVRCQVVNK 82 (666) T ss_pred cCCCCCcchhhhhHHHHHHHHHHHHHHHhhhhccchhhHHHHhhHHHHhHHhhhhccCCCceeeecccccccccceeecc Confidence 4444444555554 2223322 233333221 1222233333334444555444321 111111112244 Q ss_pred ccccch-HHHHHHHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEe Q lcl|NC_015158. 67 TTLPKL-CQIRDNLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVE 145 (581) Q Consensus 67 ~~~pki-~~~~d~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~ 145 (581) .++|-| ..-++-.+++|.++|.++-..|-+.- +.+-++.||+.+-+|+|...-..+..++--+++|.+.|-..-+... T Consensus 83 ~~V~PIViSQV~S~~~YLT~VF~SG~Pi~PVVS-~P~~K~~AE~LE~ii~DH~t~~~~~~~LiL~L~D~~KYN~~~~ET~ 161 (666) T protein:vir:96 83 ATVNPIVISQVQSMTAYLTEVFASGYPILPVVS-TPDKKEQAEALEGIIQDHMTMTSSIPELILCLQDAAKYNLVGWETE 161 (666) T ss_pred ccCCchhhhhHHHHHHHHHHHHhcCCccceeec-CCchhHHHHHHHHHHHhhhhhhhhHHHHHHHHhhhhhcceeeeeec Confidence 555544 47789999999999999988775443 3455678999999999999888888889999999999988777777 Q ss_pred eecceeeeee---------eeeEeeeecc-ceEEecchhheeecCCCC--Cccc-CceEE-EEEec----HHHHHHHhhc Q lcl|NC_015158. 146 YVKETTKDEE---------SGATRDTYFG-PRAVRIDPKDIVFNPVAV--DFAH-SPKII-RTVLN----EGELLQMEQD 207 (581) Q Consensus 146 ~~~~~~~~~~---------~~~~~~~~~~-p~ie~V~p~df~~DP~a~--~~~d-~~~i~-r~~~T----~~el~~m~~~ 207 (581) |-.-.+...- +..-+.++.+ -+|+|++|-|.++||+.. +... +.|.- -.... +++|..+.. T Consensus 162 Ws~IE~~~~~~~i~~~~~~K~TlrR~~r~~~KIrRLN~RN~~~D~~~~~~~VA~~G~~~G~~~L~~R~~LKK~LN~LT~- 240 (666) T protein:vir:96 162 WSNIETYDPQKEITDLEPGKTTLRRNYRHVNKIRRLNLRNVHWDPIPDIPNVATEGSFLGETTLLNRIQLKKYLNYLTN- 240 (666) T ss_pred cccccccchhhhhhcCCCceeeeccchhhhhhhhccccccccccCCCCCCchhhhhhhhhhHHHHHHHHHHHHHhhhhc- Confidence 7322221111 1222334444 379999999999999843 2211 22321 11122 233332211 Q ss_pred cCccch---hHHHHHHHHhhhccCCcccch------hhhhccccccccccccc--cccCCc--------eEEEEEEeeee Q lcl|NC_015158. 208 QPENAS---LASAIARRREFRRGLGTYTRE------DCEKAVGFSMDGFGNLY--DYFQSP--------YVEVLTFYGDY 268 (581) Q Consensus 208 ~~~~~~---~~~d~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~--~~~~~~--------~vevlE~~g~~ 268 (581) .+...| +.+++++.... .+.-+++. +.+...+.+-|-++... ...... -++.+..|-.| T Consensus 241 EKkltykkvV~~Al~~s~~~--sD~T~~P~IS~vY~~~~~~SDi~WD~~G~~~T~~sS~~~rvpvneqg~Y~k~~mY~RI 318 (666) T protein:vir:96 241 EKKLTYKKVVNEALKSSFQG--SDWTDNPQISPVYQEMEMASDINWDRFGGFETETSSTNRRVPVNEQGVYCKHTMYLRI 318 (666) T ss_pred chhhhHHHHHHHHHhhhccc--cccccCCcccccccccchhhccchhhcCcccccccccccccccccccceeeeeeeeee Confidence 111111 12222222110 00001100 00111111111111110 000001 11222222221 Q ss_pred ec---ccCCc---eeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccccCCcccCCCcHHhhhhHHHHHHHHHHHHHH Q lcl|NC_015158. 269 HD---TQSGT---FKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIRQDNLYAMGPLDNLVGMQYRIDHLENLKAD 342 (581) Q Consensus 269 ~d---~~~d~---~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iD 342 (581) .- ..+-+ -.-.|...+..|+.+|.++..---.|..|.-..-+..+--..--.|.++-..+.|...+.+++..+- T Consensus 319 ~PSDF~~~~P~~N~~QIWK~v~IN~~~iIS~~~~I~AY~~~~~~~~~~LEDGmG~QTQ~~~E~~~P~Q~A~t~L~N~~~~ 398 (666) T protein:vir:96 319 IPSDFEMNVPNRNQVQIWKAVMINRDAIISFEPYIGAYGSFGMGLAFALEDGMGLQTQGYGEMAAPLQSATTELWNAYIQ 398 (666) T ss_pred ccccceecCCCCCcceeeeeeeeccceeEeeehhhcccchhhhhhhhhhhhccccccccccccccchhhhhhHHhhhhhh Confidence 10 01111 1123444566889999988432223444422222222211233456778888999999999999888 Q ss_pred HHHHhcCCeEEEec------cccccccCCceeEEeC----CCCCc----ccccCCCccc-hhHHHHHHHHHHHHHhcCCc Q lcl|NC_015158. 343 VFDLIAFPPMKVKG------DVEEFVWGPMEQIYIN----GDGDV----EMMAPNTQAL-QADMQIQILEAKMEEFAGAP 407 (581) Q Consensus 343 n~~~s~np~~~v~~------d~~~i~~~pG~vi~~~----~~~~i----~~~~~p~~~~-~~~~~lq~~~~~~ee~TGv~ 407 (581) ...+.+..+-..++ ++...+. .-.|.++ .++.+ .+++-.+... ......+.+-+--.+++|.. T Consensus 399 ~aRRAV~DRAl~~~S~i~a~~iNSP~~--~~KIP~~~~sL~N~~m~~~Y~~IPFD~RG~E~~~Q~A~~l~~~~r~L~GMN 476 (666) T protein:vir:96 399 GARRAVMDRALYNPSMIRANDINSPIP--QIKIPVVPQSLVNGTMDQAYRQIPFDSRGMETVMQNALMLTDWQRELSGMN 476 (666) T ss_pred hhhhhhhhhhhcchhhhhhhcccCCCC--CcccceeehhhhccchhhhhccCCccccchhHHHhhhHHHhhhHHHhhccC Confidence 88887766654432 1111111 1111111 11211 1111111111 11222455556677888888 Q ss_pred hHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHhhcCccceeeecCchhcccCCCccCHH Q lcl|NC_015158. 408 REAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNA-MLEISRRNLDVADTIRVFDSDDKVATFMNVNKD 486 (581) Q Consensus 408 ~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~-~~~f~~~n~d~~~~iR~~~~~~~~~~~~~v~r~ 486 (581) ...+|+=--+|+|-.+-.-.|..+..|++.=+-..+...+.||=+- -+.++...-|.+-+-.-+|+ -+.|+-+ T Consensus 477 ~~~~GQFQKGNKt~~E~~~~MG~a~NRmRLPALiLEH~~F~~iK~~L~LNl~~YG~DT~ViS~RtG~------~~~vDi~ 550 (666) T protein:vir:96 477 SATRGQFQKGNKTRAEFDTIMGNAENRMRLPALILEHRMFTKIKEQLKLNLLMYGEDTEVISPRTGK------GVRVDIK 550 (666) T ss_pred CcccccccccCcceeehhhhcCCcccceehhhHHHhhhhhhhHHHHHhhhhhhccccchhcccccCc------eeeeeHH Confidence 8888875557888878887888888888877666665555555432 23444433343322222333 3445555 Q ss_pred Hhc---CCceEEEecchh--HHHHHHHHHHHHHhh-ccc-ccccccchhHHHHHHHHHHHHhcCCCcccccCCCCcH--- Q lcl|NC_015158. 487 DIT---AKGRLRPVGARH--FAEQAQVVQSLMGIA-NTP-VWQDIKPHVSTENLAKMLEHNLSLGGWDIFKPNVAVM--- 556 (581) Q Consensus 487 di~---~~~~vva~ga~~--~~~r~q~~q~L~~~~-~~~-~~~~i~p~~~~~~l~~~~~e~~~l~~~~~~~~~~~~~--- 556 (581) .++ ..|+| +.|-+. .+.--..+..+++++ ... ..+..+|++-. |+--++.+-|+.|++-+ |+.+++ T Consensus 551 ~L~~~~L~F~~-~DGlTP~SKlASs~~lT~~LQMI~sS~~~~~A~G~~~P~--M~AHl~QLGGVRG~E~Y-~~~ALPqwq 626 (666) T protein:vir:96 551 ELQDLGLKFEL-GDGLTPASKLASSDFLTALLQMIMSSETTLQAFGTQVPG--MIAHLAQLGGVRGFEKY-ANAALPQWQ 626 (666) T ss_pred HHhhhhheeee-ccCCCchhhhhhhHHHHHHHHHHhcchhhHhhhcccchH--HHHHHHHhccccchhhc-ccccCcchh Confidence 443 34553 555332 222122344444442 222 23345555442 44444455566665433 666666 Q ss_pred -----HHHHHHHHHHH---HHHHHHH-HhcccCC Q lcl|NC_015158. 557 -----EAQTTSALVNQ---SQAQIEE-EAQVPLV 581 (581) Q Consensus 557 -----~~~~~q~~~q~---aq~~~~~-~~~~~~~ 581 (581) .|+.+|+..|. +-.+.|+ +.|+|+- T Consensus 627 itygm~Q~LQ~~~LQ~~~QSA~Q~~A~Q~~L~~~ 660 (666) T protein:vir:96 627 ITYGMQQQLQQMLLQLQQQSAMQLQARQGELSND 660 (666) T ss_pred hhhhhhHHHHHHHHHHhhhhccccccccccCccc Confidence 45544444331 2122232 3344444 No 123 >protein:vir:103385 Length: 666 # NCBI annotation: hypothetical protein # Family: family:all:11276 # MgeID: mge:1621 # MgeName: PaP2 # Cross-refs: genbank:acc:YP_024736;genbank:gi:48697078;genbank:GeneID:2846053 Probab=99.08 E-value=6.4e-10 Score=71.00 Aligned_cols=565 Identities=13% Similarity=0.095 Sum_probs=265.5 Q ss_pred Cccchhhhhhhcc-chhhhHH-HHHHHHHh----hHHhhhhhHHHHHHHHHHHhhccccc--------cccccccccccc Q lcl|NC_015158. 1 MTGKVLELQQMLD-DTRDGLA-EQIANTWQ----NWNSQRQEWLSQKSELRNYIFATDTT--------TTTNSTLPWKNK 66 (581) Q Consensus 1 ~~~~~~~~~~~~~-~~~~~~a-~~i~~~~~----~~~~~r~~~~~~~~~~~~y~~~~~~~--------~~~~~~~~~k~~ 66 (581) +|-.--.+++..+ .|-|.|- .-|++..+ +--++-|.-+..-.++.-|+-++... -.....-+=-|+ T Consensus 3 ispsepninsfvytqrvdellkahlkkildfsktnkanyiqKMD~ID~AYARY~~~~~N~~LlG~~~~~~A~V~C~V~~~ 82 (666) T protein:vir:10 3 ISPSEPNINSFVYTQRVDELLKAHLKKILDFSKTNKANYIQKMDLIDKAYARYITAQENNELLGYNQNIAAKVRCQVVNK 82 (666) T ss_pred cCCCCCcchhhhhHHHHHHHHHHHHHHHhhhhccchhhHHHHhhhHHHhHHhhhhccCCCceeeecccccccCcceeecc Confidence 4444444555554 2223322 23333322 11222233333334444555444321 111111112244 Q ss_pred ccccch-HHHHHHHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEe Q lcl|NC_015158. 67 TTLPKL-CQIRDNLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVE 145 (581) Q Consensus 67 ~~~pki-~~~~d~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~ 145 (581) .++|-| ..-++-.+++|.++|.++-..|-+.- +.+-++.||+.+-+|+|...-..+..++--+++|.+.|-..-+... T Consensus 83 ~~V~PIViSQV~S~~~YLT~VF~SG~Pi~PVVS-~P~~K~~AE~LE~ii~DH~t~~~~~~~LiL~L~D~~KYN~~~~ET~ 161 (666) T protein:vir:10 83 ATVNPIVISQVQSMTAYLTEVFASGYPILPVVS-TPDKKEQAEALEGIIQDHMTMTSSIPELILCLQDAAKYNLVGWETE 161 (666) T ss_pred ccCCchhhhhHHHHHHHHHHHHhcCCccceeec-CCchhHHHHHHHHHHHhhhhhhhhHHHHHHHHhhhhhcceeeeeec Confidence 555544 47789999999999999988775443 3455678999999999999888888889999999999988777776 Q ss_pred eecceeeee---------eeeeEeeeecc-ceEEecchhheeecCCCC--Cccc-CceEE-EEEec----HHHHHHHhhc Q lcl|NC_015158. 146 YVKETTKDE---------ESGATRDTYFG-PRAVRIDPKDIVFNPVAV--DFAH-SPKII-RTVLN----EGELLQMEQD 207 (581) Q Consensus 146 ~~~~~~~~~---------~~~~~~~~~~~-p~ie~V~p~df~~DP~a~--~~~d-~~~i~-r~~~T----~~el~~m~~~ 207 (581) |---.+... -+..-+.++.+ -+|+|++|-|.++||+.. +... +.|.- -.... +++|..+.. T Consensus 162 Ws~IE~~~~~~~i~~~~~~K~TlrR~~r~~~KIrRLN~RN~~~D~~~~~~~VA~~G~~~G~~~L~~R~~LKK~LN~LT~- 240 (666) T protein:vir:10 162 WSHIETYDPQKEITDLEPGKTTLRRNYRHVNKIRRLNLRNVHWDPIPDIPNVATEGSFLGETTLLNRIQLKKYLNYLTN- 240 (666) T ss_pred cccccccchhhhhhcCCCceeecccchhhhhhhhccccccccccCCCCCCchhhhhhhhhHHHHHHHHHHHHHHhhhhc- Confidence 621111111 01222334443 379999999999999843 2211 22221 11112 233322211 Q ss_pred cCccch---hHHHHHHHHhhhccCCcccch------hhhhccccccccccccc--cccCCc--------eEEEEEEeeee Q lcl|NC_015158. 208 QPENAS---LASAIARRREFRRGLGTYTRE------DCEKAVGFSMDGFGNLY--DYFQSP--------YVEVLTFYGDY 268 (581) Q Consensus 208 ~~~~~~---~~~d~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~--~~~~~~--------~vevlE~~g~~ 268 (581) .+...| +.+++++.... .+.-+++. +.+...+.+-|-++... ...... -++.+..|-.| T Consensus 241 EKkltykkvV~~Al~~s~~~--sD~T~~P~IS~vY~~~~~~SDi~WD~~G~~~T~~sS~~~rvpvneqg~Y~k~~~Y~RI 318 (666) T protein:vir:10 241 EKKLTYKKVVNEALKSSFQG--SDWTDNPQISPVYQEMEMASDINWDRFGGFETETSSTNRRVPVNEQGVYCKHTMYLRI 318 (666) T ss_pred chhhhHHHHHHHHHhhhccc--cccccCCccCccccccchhhccchhhcCcccccccccccccccccccceeeeeeeeee Confidence 111111 12222222110 00000100 00111111111111110 000011 11222222211 Q ss_pred ec---ccCCc---eeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccccCCcccCCCcHHhhhhHHHHHHHHHHHHHH Q lcl|NC_015158. 269 HD---TQSGT---FKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIRQDNLYAMGPLDNLVGMQYRIDHLENLKAD 342 (581) Q Consensus 269 ~d---~~~d~---~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iD 342 (581) .- ..+-+ -.-.|...+..|+.+|.++..---.|..|.-..-+..+--..--.|.++-..+.|...+.+++..+- T Consensus 319 ~PSDF~~~~P~~N~~QIWK~v~IN~~~iIS~~~~I~AY~~~~~~~~~~LEDG~G~QTQ~~~E~~~P~Q~A~t~L~N~~~~ 398 (666) T protein:vir:10 319 IPSDFEMNVPNRNQVQIWKAVMINRDAIISFEPYIGAYGSFGMGLAFALEDGMGLQTQGYGEMAAPLQSATTELWNAYIQ 398 (666) T ss_pred ccccceecCCCCCcceeeeeeeeccceeEeeehhhhccchhhhhhhhhhhhccccccccccccccchhhhhhHHhhhhhh Confidence 10 01111 1123444566889999988432223444422222222211233456778888999999999999888 Q ss_pred HHHHhcCCeEEEec------cccccccCCceeEEeC----CCCCc----ccccCCCccc-hhHHHHHHHHHHHHHhcCCc Q lcl|NC_015158. 343 VFDLIAFPPMKVKG------DVEEFVWGPMEQIYIN----GDGDV----EMMAPNTQAL-QADMQIQILEAKMEEFAGAP 407 (581) Q Consensus 343 n~~~s~np~~~v~~------d~~~i~~~pG~vi~~~----~~~~i----~~~~~p~~~~-~~~~~lq~~~~~~ee~TGv~ 407 (581) ...+.+..+-..++ ++...+.. -.|.++ .++.+ .+++-.+... ......+.+-+--.+++|.. T Consensus 399 ~aRRAV~DRAl~~~S~i~a~~iNSP~~~--~KIP~~~~sL~N~~~~~~Y~~IPFD~RG~E~~~Q~A~~l~~~~r~L~GMN 476 (666) T protein:vir:10 399 GARRAVMDRALYNPSMIRANDINSPIPQ--IKIPVVPQSLVNGTMDQAYRQIPFDSRGMETVMQNALMLTDWQRELSGMN 476 (666) T ss_pred hhhhhhhhhhccChhhhhhhcccCCCCC--cccceeehhhcccchhhhhccCCccccchhHHHhhhHHHHhhHHHhhccC Confidence 88877766654432 11111111 111111 11211 1111111111 11222455556677888888 Q ss_pred hHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHhhcCccceeeecCchhcccCCCccCHH Q lcl|NC_015158. 408 REAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNA-MLEISRRNLDVADTIRVFDSDDKVATFMNVNKD 486 (581) Q Consensus 408 ~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~-~~~f~~~n~d~~~~iR~~~~~~~~~~~~~v~r~ 486 (581) ...+|+=--+|+|-.+-.-.|..+..|++.=+-..+...+.||=+- -+.++...-|.+-+-.-+|+ -+.|+-+ T Consensus 477 ~~~~GQFQKGNKt~~E~~~~MG~a~NR~RLPALiLEH~~F~~iK~~L~LNl~~YG~DT~ViS~RtG~------~~~vDi~ 550 (666) T protein:vir:10 477 SATRGQFQKGNKTRAEFDTIMGNAENRMRLPALILEHRMFTKIKEQLKLNLLMYGEDTEVISPRTGK------GVRVDIK 550 (666) T ss_pred CcccccccccCcceeehhhhcCCcccceehhhHHhhhhhhhhHHHHHhhhhhhccccchhcccccCc------eeeeeHH Confidence 8888875557888778777888888888876666665555555432 23444433343322222333 3445555 Q ss_pred Hhc---CCceEEEecchh--HHHHHHHHHHHHHhh-ccc-ccccccchhHHHHHHHHHHHHhcCCCcccc--------cC Q lcl|NC_015158. 487 DIT---AKGRLRPVGARH--FAEQAQVVQSLMGIA-NTP-VWQDIKPHVSTENLAKMLEHNLSLGGWDIF--------KP 551 (581) Q Consensus 487 di~---~~~~vva~ga~~--~~~r~q~~q~L~~~~-~~~-~~~~i~p~~~~~~l~~~~~e~~~l~~~~~~--------~~ 551 (581) .++ ..|+| +.|-+. .+.--..+..+++++ ... ..+..+|++-. |+--++.+-|+.|++-+ +| T Consensus 551 ~L~~~~L~F~~-~DG~TP~SK~ASs~~lT~~LQMI~sS~~~~~A~G~~~P~--M~AH~~QLGGVRG~E~Y~daalP~~~~ 627 (666) T protein:vir:10 551 ELQDLGLKFEL-GDGLTPASKLASSDFLTALLQMIMSSETTLQAFGTQVPG--MIAHLAQLGGVRGFEKYADAALPQWQI 627 (666) T ss_pred HHhhhhheeee-ccCCCchhhhhhhHHHHHHHHHHhhhhhhHhhhcccchH--HHHHHHHhccccchhhhhhccCCcccc Confidence 443 34553 555332 122122333444432 221 22344555432 33344455555554332 23 Q ss_pred CCCcHHHHHHHHHHH---HHHHHHH-HHhcccCC Q lcl|NC_015158. 552 NVAVMEAQTTSALVN---QSQAQIE-EEAQVPLV 581 (581) Q Consensus 552 ~~~~~~~~~~q~~~q---~aq~~~~-~~~~~~~~ 581 (581) .-.. +|+.+|+..| |+-.+.| .+.|+|+- T Consensus 628 ~~~~-~Q~LQ~~~LQ~~~QSA~Q~~A~Q~~L~~~ 660 (666) T protein:vir:10 628 TYGM-QQQLQQMLLQLQQQSAMQLQARQGELSND 660 (666) T ss_pred ccch-hHHHHHHHHHHhhhhhcccccccccCccc Confidence 3333 3555444433 2222233 33444544 No 124 >protein:vir:80453 Length: 535 # NCBI annotation: BcepGomrgp05 # Family: family:all:584 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210225;genbank:gi:146329917;genbank:GeneID:5123562 Probab=98.63 E-value=1.8e-07 Score=57.62 Aligned_cols=468 Identities=11% Similarity=0.077 Sum_probs=214.9 Q ss_pred Cccchhhhhh---------------------hcc-chhhhHHHHHHHHHhhHHhhhhhH--HHHHHHHH-HHhhcccccc Q lcl|NC_015158. 1 MTGKVLELQQ---------------------MLD-DTRDGLAEQIANTWQNWNSQRQEW--LSQKSELR-NYIFATDTTT 55 (581) Q Consensus 1 ~~~~~~~~~~---------------------~~~-~~~~~~a~~i~~~~~~~~~~r~~~--~~~~~~~~-~y~~~~~~~~ 55 (581) |+-|-++.+| |-+ +++|-.=......|+ .-|.-+ ++.+++.. -||-.-.... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~dV~~~hp~y~a~~~~W~---~ird~~~G~~~~r~~g~~YLP~~~~~~ 77 (535) T protein:vir:80 1 MARKRTTIRRDVQSKVLIPPQAPPTSGLGPSLPNVGYQRVEFGEMLPKWR---KIMDCLSGQEAIKAKREEYLPMPSVDS 77 (535) T ss_pred CCcchhhhhhhhhhhcccCCCCcCCCCCCCCCCCCCcCCHHHHHHHHHHH---HHHHHhcChHHHHhcccccCCCCCccc Confidence 5555544443 222 223322223333333 333333 34443321 1221111000 Q ss_pred c-cc----ccccccccccccchHHHHHHHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhc-chHHHHHH Q lcl|NC_015158. 56 T-TN----STLPWKNKTTLPKLCQIRDNLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKES-DFRTIMSQ 129 (581) Q Consensus 56 ~-~~----~~~~~k~~~~~pki~~~~d~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~-n~~~~~~~ 129 (581) . .. -+..-...++.|.+..+++.+.+.++. .+..+++ -+.++.++.|-=.++ ++...+++ T Consensus 78 ~~~E~~~~Y~~rl~rA~~~n~~~~tl~~l~G~vfr----k~p~~~~----------p~~l~~l~~d~D~~G~~L~~f~~~ 143 (535) T protein:vir:80 78 RDEEQRRRYETYLQRAIFYNVTARTLDGMMGQVFS----RDPIRQL----------PPALEAIVEDIDGEGVSLDQQAKK 143 (535) T ss_pred CCcCCHHHHHHHHhhccCCChhHHHHHHHhchhhc----CCcceec----------cHHHHHHHhccCCCCCCHHHHHHH Confidence 0 00 111112335567667777777666653 2222221 133455555444443 57888999 Q ss_pred HHHHHhhcCceEEEEeeecceeeeeeeeeEeeeeccceEEecchhhe-eecCCCCCc-ccC-ceEEEEEecHHHHHHHhh Q lcl|NC_015158. 130 LLLDYIDYGNCFATVEYVKETTKDEESGATRDTYFGPRAVRIDPKDI-VFNPVAVDF-AHS-PKIIRTVLNEGELLQMEQ 206 (581) Q Consensus 130 ~~~d~~~~G~~i~k~~~~~~~~~~~~~~~~~~~~~~p~ie~V~p~df-~~DP~a~~~-~d~-~~i~r~~~T~~el~~m~~ 206 (581) +++.++.||.|.+-|-+... .......+.+..-..|++..++|.++ =|+-...+. ... ..+.|...+.++ T Consensus 144 ~~~~~l~~G~~~iLVD~P~~-~~~~t~ade~~~~~rPy~~~y~ae~IinW~~~~v~G~~~Lt~v~lrE~~~~~d------ 216 (535) T protein:vir:80 144 ALGYTMGFGRAAIFTDYPNV-GRPVTVLEQKLGLYRPTITLVHPTSIINWRTKLVGGKSVISLVVIQENVLAQD------ 216 (535) T ss_pred HHHHHHhcCeEEEEEeecCC-CCcccHHHHHhcCCCcEEEEechhhccCccccccCCccceeEEEEEEEEEecC------ Confidence 99999999999988876532 11111122233344688999999987 344221111 111 112222222100 Q ss_pred ccCccchhHHHHHHHHhhhccCCcccchhhhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEe Q lcl|NC_015158. 207 DQPENASLASAIARRREFRRGLGTYTREDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIID 286 (581) Q Consensus 207 ~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~ 286 (581) +.+..+.++.+| +-+....+.+++ +.|- .+-.++....+... T Consensus 217 ----------------------d~f~~~~~~q~R---------vL~~~~~G~y~v-~~~~--~~~~~~~~~~~~~~---- 258 (535) T protein:vir:80 217 ----------------------DGFETTYVQQWR---------VLQLNAEGNYQV-ERWR--RETQEEMYYSYSKH---- 258 (535) T ss_pred ----------------------CCcccceeEEEE---------EEEecCCceEEE-EEEE--eecCCcccccccee---- Confidence 011111111100 000000011111 1111 00111111000000 Q ss_pred CCEEEEeecCCCccCCCCeeEecccccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEeccc-------- Q lcl|NC_015158. 287 RMFVIEEKENPSWFAQAPIFHCGWRIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGDV-------- 358 (581) Q Consensus 287 g~~iir~~~nP~~~g~~Pf~~~~~~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d~-------- 358 (581) +......+ ..+..||+.+.... -+...|.++...|-.++..+=......-+.+..+..|+..+.+.. T Consensus 259 --~~~~~g~~--~l~~IPfv~~~~~~-~~~~~~~pPLl~LA~lni~Hy~~ssd~~~il~~~~~P~l~i~G~~~~~~~~~~ 333 (535) T protein:vir:80 259 --VPTDGNGN--PFKEIPFQFIGPLD-NNADIDHPPLLDLCEVNIGHYRNSADYEEMAFVAGQPTAFFTGLTKDWVEDVF 333 (535) T ss_pred --ecccCCCc--ccCeeEEEEeecCC-CCCCCCccchHHHHHHHHHHhhchhHHHHHHHHhcCceeeeecCchhhhhcCC Confidence 01111122 35678888665333 356678888888888888877677777778899999987765421 Q ss_pred --cccccCCceeEEeCCCCCcccccCCCccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHH Q lcl|NC_015158. 359 --EEFVWGPMEQIYINGDGDVEMMAPNTQALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQ 436 (581) Q Consensus 359 --~~i~~~pG~vi~~~~~~~i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~ 436 (581) ..+.-+++..|....+++..++...+...... .++.+++.|..+ |.... ....+++||++.+.-..+.+..|. T Consensus 334 ~~~~i~iG~~~~~~lP~~~~~~~~e~~~~~~a~~-~l~~~e~qM~~l-Ga~ll---~~~~~~~Ta~~a~~~~~~~~S~L~ 408 (535) T protein:vir:80 334 KDFKVHLGSRAIIPLPQGATAGILQITPNSVPFE-AMTHKESQMIAM-GANLL---VKSGGNRTFGEAQQEEASEQSILS 408 (535) T ss_pred CCcceEecCcccccCCCCCCcceeeeccchhHHH-HHHHHHHHHHHH-HHHhh---ccCcccccHHHHHHHHHHHhHHHH Confidence 12445577777777777776665543332222 233334433332 22111 233568999999887777788999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhh Q lcl|NC_015158. 437 EKIMNFEVMLMEKVLNAMLEISRRNLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIA 516 (581) Q Consensus 437 ~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~ 516 (581) .++.+.++ .++.+++++.+|.-...+... +. +.+++ +|-. .+- ..+.++.|+++. T Consensus 409 ~~a~~le~-al~~aL~~~A~w~G~~~~~~~-~~-----------i~~n~-----dF~~--~~l-----d~~~~~all~~~ 463 (535) T protein:vir:80 409 ACTKNVSM-AFRKALRWANQFQTGIVNDET-VE-----------YNLNT-----DFPA--ARL-----TPNERAELILEW 463 (535) T ss_pred HHHHHHHH-HHHHHHHHHHHHcCCccCCCc-eE-----------EEecc-----cccc--ccC-----CHHHHHHHHHHH Confidence 99999996 468888888877432111111 00 11111 1110 000 122334343333 Q ss_pred cccccccccchhHHHHHHHHHHHHhcCCCcccccCCCCcHHHHHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 517 NTPVWQDIKPHVSTENLAKMLEHNLSLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 517 ~~~~~~~i~p~~~~~~l~~~~~e~~~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) + ...|+...+++.+... |+-+ | +...+.++.+.+.+.+... ..+-.++- T Consensus 464 ~-------~G~Is~et~~~~L~r~-gvl~-----~--~~~~eee~~ri~~E~~~~~-~~~g~~~d 512 (535) T protein:vir:80 464 Q-------QGAITFKEMRAGLRRA-GVAS-----E--DDAKAETEGKATVEFIAKT-AAAGKVGD 512 (535) T ss_pred h-------cCCCCHHHHHHHHHhC-CCCC-----c--ccchHHHHHHHHhhhhhcc-ccCCCCCC Confidence 2 2346666777766433 3332 2 2223334444333222111 11111111 No 125 >protein:vir:94956 Length: 452 # NCBI annotation: putative phage structural protein # Family: family:all:584 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239276;genbank:gi:66392058;genbank:GeneID:5076601 Probab=98.62 E-value=1.9e-07 Score=57.46 Aligned_cols=428 Identities=12% Similarity=0.094 Sum_probs=208.3 Q ss_pred hccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHH-HHhhcccccccccccccccccccccchHHHHHHHHHHHHHhhcC Q lcl|NC_015158. 11 MLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELR-NYIFATDTTTTTNSTLPWKNKTTLPKLCQIRDNLHSNYISALFP 89 (581) Q Consensus 11 ~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~-~y~~~~~~~~~~~~~~~~k~~~~~pki~~~~d~~~~~l~~~~f~ 89 (581) |.=+++|-.=......|+...+.-.. ++.+++.. .||-.........-+..-...++.|.+..+++.+.+.++. T Consensus 1 m~V~~~hp~y~a~~~~W~~~rd~~~G-~~~~r~~g~~YLpk~~~E~~~~Y~~rl~rA~~~n~~~~t~~~~~G~vf~---- 75 (452) T protein:vir:94 1 MPIETKHPEYLAYENDWIDCRVASLG-QREVKKKGVRFLPKLSGQTDDMYNAYKQRALFYSITSKTLSALSGMVLD---- 75 (452) T ss_pred CCCCCcCHHHHHHHHHHHHHHHHhcC-hHHHHcCCcccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHHhchhhc---- Confidence 55445555545555555533332211 34444322 2443332222111122223335577777777777666653 Q ss_pred CccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeeeeEeeeeccceEE Q lcl|NC_015158. 90 NERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESGATRDTYFGPRAV 169 (581) Q Consensus 90 ~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~~~~~~~~~p~ie 169 (581) .+.-+++ -+.++.+..| ..-.+....+++++..++.||.|.+.|-|... -.+|++. T Consensus 76 k~p~~~~----------p~~l~~~~~D-~~G~~L~~~~~~~~~~~l~~G~~~ilVD~p~~-------------g~rPy~~ 131 (452) T protein:vir:94 76 QPPVITH----------PDAMSKYFED-QSGIQFYEVFTRAVEETLLMGRVGVFIDRPLT-------------GGDPYIS 131 (452) T ss_pred CCceecc----------cHHHHHHHhc-ccCCCHHHHHHHHHHHHHhcCeEEEEEeeccC-------------CCceEEE Confidence 3322221 1233333323 33346888999999999999999999977532 1258899 Q ss_pred ecchhhee-ecCCCCCcccCce-EEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchhhhhccccccccc Q lcl|NC_015158. 170 RIDPKDIV-FNPVAVDFAHSPK-IIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDCEKAVGFSMDGF 247 (581) Q Consensus 170 ~V~p~df~-~DP~a~~~~d~~~-i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 247 (581) .++|.++. |+-. ....-.+ +.|......| ..+.+..+.. T Consensus 132 ~~~~~~Ii~W~~~--~~g~l~~v~lre~~~~~d--------------------------~~d~f~~~~~----------- 172 (452) T protein:vir:94 132 VYTTENILNWEED--EDGRLLMVVLREFYTVRD--------------------------TADRYVQNIR----------- 172 (452) T ss_pred EechhhhcCcccc--ccCCeeEEEEEEEEEEec--------------------------CCCcccceeE----------- Confidence 99999874 2211 1110011 1121111000 0000000000 Q ss_pred cccccccCCceEEEEEEe-e----eeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccccCCcccCCCc Q lcl|NC_015158. 248 GNLYDYFQSPYVEVLTFY-G----DYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIRQDNLYAMGP 322 (581) Q Consensus 248 ~~~~~~~~~~~vevlE~~-g----~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p~~~~G~s~ 322 (581) ..+++|+.. | ..+...+++..+. + -..+.....+ ..+..||+.+.-... +...|.++ T Consensus 173 ---------~~yRvL~l~~g~~~v~~~~~~~~~~~~~--~----~~~~~~~~~~--~l~~IP~v~~~~~~~-~~~~~~pP 234 (452) T protein:vir:94 173 ---------VRYRCLELVDGLLQITVHETQDGKVWEL--A----KTSTIQNVGV--TMDYIPFFCITPSGL-SMTPAKPP 234 (452) T ss_pred ---------EEEEEEEEeCCeEEEEEEEccCCceeee--c----cceeecCCCc--ccceeEEEEEcCCCC-CCCCCccc Confidence 012332211 0 0011222221110 0 0012222223 457789887754443 45678899 Q ss_pred HHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc--ccccccCCceeEEeCC-CCCcccccCCCccchh-HHHHHHHHH Q lcl|NC_015158. 323 LDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD--VEEFVWGPMEQIYING-DGDVEMMAPNTQALQA-DMQIQILEA 398 (581) Q Consensus 323 ~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d--~~~i~~~pG~vi~~~~-~~~i~~~~~p~~~~~~-~~~lq~~~~ 398 (581) ...|-+++..+-......-+.+..+.+|+..+.+. ...+.-+|+.+|...+ ++...++.+......+ ...++.+++ T Consensus 235 Ll~LA~ln~~hy~~~sd~~~~l~~~~~P~l~~~g~~~~~~i~iG~~~~~~lpe~~~~~~yie~~g~~i~~~~~~l~~le~ 314 (452) T protein:vir:94 235 MIDIVDINYSHYRTSADLEHGRHFTGLPTPWITGAESQSTMHIGSTKAWVIPEVAAKVGFLEFTGQGLQSLEKALSEKQA 314 (452) T ss_pred hHHHHHHHHHHhcchhHHHHHHHHcccceeEeecCcCCCceEecccccccCCCCCCcceEEccCchhHHHHHHHHHHHHH Confidence 99999999999999999999999999998876653 3456778899999886 6678877755333221 122333444 Q ss_pred HHHHhcCCchHhcCCCCcccccHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceeeecCchhcc Q lcl|NC_015158. 399 KMEEFAGAPREAMGIRTPGEKTAFEV-QQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADTIRVFDSDDKV 477 (581) Q Consensus 399 ~~ee~TGv~~~~~G~~~~~~~TAtgv-~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~iR~~~~~~~~ 477 (581) .|..+ |. +...+ .+.++|++.. ..-....+..+..++.+.++ .++.+++++.+|.-. +. + + T Consensus 315 ~m~~~-Ga-~ll~~--~~~~~~s~ea~~~~~~~~~s~L~~~a~~~e~-al~~~l~~~a~w~g~--~~-~-~--------- 376 (452) T protein:vir:94 315 QLASL-SA-RLIDN--STRGSEATETVKLRYMSETASLKSVTRAVEA-LLNKAYSCIMDMESM--GG-T-L--------- 376 (452) T ss_pred HHHHH-HH-Hhhcc--CCCcchHHHHHHHHHHHhhHHHHHHHHHHHH-HHHHHHHHHHHHcCC--CC-c-e--------- Confidence 44332 22 11111 2233444444 34344446889999999985 468888888777421 10 0 0 Q ss_pred cCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHHHHHHHHhcCCCcccccCCCCcHH Q lcl|NC_015158. 478 ATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLAKMLEHNLSLGGWDIFKPNVAVME 557 (581) Q Consensus 478 ~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~~~e~~~l~~~~~~~~~~~~~~ 557 (581) -+.++++=+... . ..+.++.|.++. ....++...+++.+.+ .|+..+ .. T Consensus 377 --~v~~n~dF~~~~-----------~-~~~~~~al~~~~-------~~G~is~~t~~~~L~~-~gvl~~---------~~ 425 (452) T protein:vir:94 377 --NIKLNSAFLDSK-----------L-TAAELKAWVEAY-------LSGGISKEIYIHALKV-GKVLPP---------PG 425 (452) T ss_pred --EEEecccccccc-----------C-CHHHHHHHHHHH-------hcCCCcHHHHHHHHHh-CCCCCC---------cc Confidence 111111111000 0 023333333332 2234565566666644 344322 12 Q ss_pred HHHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 558 AQTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 558 ~~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) +++. + ..|.+.|.|.- T Consensus 426 e~~~------i--~~E~~~~~~~~ 441 (452) T protein:vir:94 426 ESMG------V--IPDPPAPEPSP 441 (452) T ss_pred CHHH------H--HHHhhccCccc Confidence 2211 1 12233344433 No 126 >protein:vir:97265 Length: 513 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:584 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294521;genbank:gi:149408242;genbank:GeneID:5237130 Probab=98.36 E-value=1.1e-06 Score=53.35 Aligned_cols=454 Identities=15% Similarity=0.131 Sum_probs=214.9 Q ss_pred CccchhhhhhhccchhhhHHHHHHHHHhhHHhhhhhH--HHHHHHHH-HHhhcccccccccccccccccccccchHHHHH Q lcl|NC_015158. 1 MTGKVLELQQMLDDTRDGLAEQIANTWQNWNSQRQEW--LSQKSELR-NYIFATDTTTTTNSTLPWKNKTTLPKLCQIRD 77 (581) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~a~~i~~~~~~~~~~r~~~--~~~~~~~~-~y~~~~~~~~~~~~~~~~k~~~~~pki~~~~d 77 (581) |+-+.+.. - +++|-.=......|+ .-|.-. ++..++.. -||-.........-+..-+..++.|.+..+++ T Consensus 1 m~~~~~~~---v-~~~h~~y~a~~~~W~---~ird~~~G~~~~r~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n~~~~tl~ 73 (513) T protein:vir:97 1 MADKDPKS---P-ATTSGAYDQMLPRWH---VIETLLGGTEAMREAGETYLPRHQEETDKGYQERLASAVLLNMVEQTLD 73 (513) T ss_pred CCCCCCCC---C-CcCCHHHHHHHHHHH---HHHHHhcChHHHHhhcccCCCCCCCCCHHHHHHHHhcccCCChHHHHHH Confidence 33222110 0 111222222333333 222222 23332221 23333321111111222223455676677777 Q ss_pred HHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHH-HHHHHhc-chHHHHHHHHHHHhhcCceEEEEeeecceee--- Q lcl|NC_015158. 78 NLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYM-DNKVKES-DFRTIMSQLLLDYIDYGNCFATVEYVKETTK--- 152 (581) Q Consensus 78 ~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i-~~~l~e~-n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~--- 152 (581) .+...++. -.|.... .....+.+++ .|-=.++ ++...++.+++.++.||.|.+-|-+...... T Consensus 74 ~l~G~vf~----------k~p~~~~--~~p~~~~~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~~~ 141 (513) T protein:vir:97 74 TLSGKPFS----------EPIKLNE--DVPKAIEETILPDVDLQGNNLDVFARQWFREGMAKALCHVLIDMPRPAPREDG 141 (513) T ss_pred HHhhhhhh----------cCcccCc--CchHHHHHHHhhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEecCCCCCccch Confidence 77666653 1222221 1223334333 3333333 5788899999999999999888876432110 Q ss_pred -eeeeeeEeeeeccceEEecchhhe-eecCCCCCc-ccC-ceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccC Q lcl|NC_015158. 153 -DEESGATRDTYFGPRAVRIDPKDI-VFNPVAVDF-AHS-PKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGL 228 (581) Q Consensus 153 -~~~~~~~~~~~~~p~ie~V~p~df-~~DP~a~~~-~d~-~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~ 228 (581) .....+.+..-..|++..++|.++ =|+-.-.+- ... ..+.|...+. .+ T Consensus 142 ~~~T~Ade~~~~~rPy~~~~~~e~IinW~~~~v~G~~~L~~v~l~E~~~~----------------------------~D 193 (513) T protein:vir:97 142 QPRTLADDRREGLRPYWVMIKPECLLFARSEVINGVEVLQHVRIIEHYME----------------------------QD 193 (513) T ss_pred hHHhHHHHHhhccCceEEEecHhhhcCcceeccCcceeeeeEEEEEEEee----------------------------cC Confidence 011112222233588888888886 333111110 001 1111111110 00 Q ss_pred CcccchhhhhccccccccccccccccCCceEEEEEEeeeeecccCCcee-eeeEEEEEeCCEEEEeecCCCccCCCCeeE Q lcl|NC_015158. 229 GTYTREDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFK-RNMKVTIIDRMFVIEEKENPSWFAQAPIFH 307 (581) Q Consensus 229 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~-e~~~itv~~g~~iir~~~nP~~~g~~Pf~~ 307 (581) + +.....+.++ + +..+.++|++.- ..+... ..+++ + ..+.-..+..||+. T Consensus 194 g-f~~~~~~q~r---------v---L~~g~~~v~r~~------~~~~~~~~e~~~-~---------~~g~~~l~~IP~v~ 244 (513) T protein:vir:97 194 G-FAEVCKRRIR---------V---LEPGLVQLWEPV------KKSNAQKEEWAL-A---------DEWATGLNYVPLVT 244 (513) T ss_pred C-CcceEEEEEE---------E---EeCceEEEEEee------cCCCccccceEE-e---------cCCCCcCCceeEEE Confidence 0 0000000000 0 111222332211 111110 00111 1 11111356788876 Q ss_pred ecccccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc----ccccccCCceeEEeCC-CCCcccccC Q lcl|NC_015158. 308 CGWRIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD----VEEFVWGPMEQIYING-DGDVEMMAP 382 (581) Q Consensus 308 ~~~~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d----~~~i~~~pG~vi~~~~-~~~i~~~~~ 382 (581) +.-.. -+...|.++...|-.+...+=......-+.+..+..|+..+.+. .+.+.-.|+.+|...+ ++...++.+ T Consensus 245 ~~~~~-~~~~~~~pPLl~LA~ln~~hy~~~Sd~~~il~~~~~P~l~~~G~~~~~~~~i~iG~~~~~~lpe~~~~~~yie~ 323 (513) T protein:vir:97 245 FYADR-QGFMMGKPPLLDLAHLNVAHWQSASDQRHILTVSRFPILACSGASGEDSDPVVVGPNKVLYNPDPAGRFYYVEH 323 (513) T ss_pred EecCC-CCCCCCccchHHHHHHHHHHHhhhhhHHHHHHhcccceeeeecCCcCCCCceEeeccccccCCCCCCcceeecc Confidence 65443 35566888888888888888888888889999999999877642 3456677888888875 567887777 Q ss_pred CCccchh-HHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_015158. 383 NTQALQA-DMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRN 461 (581) Q Consensus 383 p~~~~~~-~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n 461 (581) ......+ -..+..+++.|... |. +.... ..+++||++++.-..+....+..++.++++ .++.+++++.+|+-.+ T Consensus 324 ~g~~i~~~~~~l~~le~qm~~~-Ga-~ll~~--~~~~~Ta~a~~~~~~~~~S~L~~~a~~le~-al~~~l~~~a~wlg~~ 398 (513) T protein:vir:97 324 TGQAIAAGRTDLKDLEEQMAGY-GA-EFLKR--KTGGQTATARALDSAEATSDLSAMTGLFED-ALAQALDITADWLRLG 398 (513) T ss_pred CchhHHHHHHHHHHHHHHHHHH-HH-Hhhcc--CCccccHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhCCC Confidence 6444332 23355566666333 33 23332 335899999999999999999999999995 5788999998885322 Q ss_pred cCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHHHHHHHHh Q lcl|NC_015158. 462 LDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLAKMLEHNL 541 (581) Q Consensus 462 ~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~~~e~~ 541 (581) .+. . -+. |.-+|..... .++.++.|+++. ..+.|+...+++.+.+. T Consensus 399 ~~~---~-----------~v~-----in~dF~~~~~-------~~~~~~al~~a~-------~~G~is~~t~~~~L~r~- 444 (513) T protein:vir:97 399 PNG---G-----------TVE-----LVKDYDLEEM-------DAPGLQALQVAR-------EKRDISRKTYLNGLRLR- 444 (513) T ss_pred CCc---c-----------EEE-----eccccCcccC-------CHHHHHHHHHHH-------hCCCCCHHHHHHHHHhc- Confidence 111 0 111 1223321111 022333333332 23445666666666442 Q ss_pred cCCCcccccCCCCcHHHHHHHHHHHHHHHHHHHHhcc------cCC Q lcl|NC_015158. 542 SLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQV------PLV 581 (581) Q Consensus 542 ~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~------~~~ 581 (581) |+ +.|+.+. +.....++ .++++.... |+- T Consensus 445 gv-----l~~d~d~--~~~~e~~~----~~~~~~~~~~~~d~~~~~ 479 (513) T protein:vir:97 445 GV-----LPEDFDE--DEDWEELM----EEISEAMGRAGLDLDPAQ 479 (513) T ss_pred cC-----CCccCCH--HHHHHHHH----HhhhhccCCCCccccccC Confidence 22 2233322 22222211 222222111 111 No 127 >protein:vir:95149 Length: 501 # NCBI annotation: hypothetical protein ORF007 # Family: family:all:584 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293414;genbank:gi:148912835;genbank:GeneID:5228224 Probab=97.39 E-value=7.7e-05 Score=43.12 Aligned_cols=461 Identities=10% Similarity=0.070 Sum_probs=205.3 Q ss_pred hcc-chhhhHHHHHHHHHhhHHhhhhhH--HHHHHHHH-HHhhcccc-ccccc----ccccccccccccchHHHHHHHHH Q lcl|NC_015158. 11 MLD-DTRDGLAEQIANTWQNWNSQRQEW--LSQKSELR-NYIFATDT-TTTTN----STLPWKNKTTLPKLCQIRDNLHS 81 (581) Q Consensus 11 ~~~-~~~~~~a~~i~~~~~~~~~~r~~~--~~~~~~~~-~y~~~~~~-~~~~~----~~~~~k~~~~~pki~~~~d~~~~ 81 (581) |=+ +++|-.=......|+ .-|.-+ ++.|++.. -||-.-.- ..... -+..-...++.|.+..+++.+.+ T Consensus 1 m~~V~~~hp~y~~~~~~W~---~ird~~~G~~~~r~~g~~YLP~~~~e~~~~e~~~~Y~~rl~rA~~~n~~~~t~~~l~G 77 (501) T protein:vir:95 1 MPNVSFIRPELGKLLPLYY---LIRDAIAGEPTVKGARTTYLPMPNAEDQSKENKARYEAYLKRAVFYNVARRTLFGLVG 77 (501) T ss_pred CCCCCCCCHHHHHHHHHHH---HHHHHhcChHHHHhcccccCcCCCCCCCcccchHHHHHHhhccccCchHHHHHHHHhh Confidence 443 334433333334444 444444 35554422 24321100 11111 11122334667777788887777 Q ss_pred HHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhc-chHHHHHHHHHHHhhcCceEEEEeeecce-eeeeeeeeE Q lcl|NC_015158. 82 NYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKES-DFRTIMSQLLLDYIDYGNCFATVEYVKET-TKDEESGAT 159 (581) Q Consensus 82 ~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~-n~~~~~~~~~~d~~~~G~~i~k~~~~~~~-~~~~~~~~~ 159 (581) .++. .+.-++ .-..++.++.|-=.++ ++...++++++.++.||.|.+-+-+.... .......+. T Consensus 78 ~vf~----k~p~~~----------~p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~~~t~a~~ 143 (501) T protein:vir:95 78 QVFM----RDPVVK----------VPALLNPLVANATGSGINLTQLAKRAVSLNLAYSRAGLLVDYPTTEAEGGASIADL 143 (501) T ss_pred hhhc----CCccee----------CcHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeecCCCCcccccHHHH Confidence 7763 222111 1233455555554443 57888999999999999998888765321 111111122 Q ss_pred eeeeccceEEecchhhe-eecCCCCCc-ccC-ceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchhh Q lcl|NC_015158. 160 RDTYFGPRAVRIDPKDI-VFNPVAVDF-AHS-PKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDC 236 (581) Q Consensus 160 ~~~~~~p~ie~V~p~df-~~DP~a~~~-~d~-~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~ 236 (581) +.....|++..++|.++ =|+-.-.+. ... ..+.|...+..+ +.+..... T Consensus 144 ~~~~~rPy~~~~~~~~IinW~~~~v~g~~~l~~v~l~E~~~~~d----------------------------~~f~~~~~ 195 (501) T protein:vir:95 144 EAGRIRPTLYVYSPTEIINWRTTDRGAEEVLSLVVLFETWCAAD----------------------------DGFEMKTS 195 (501) T ss_pred HhccCCcEEEEecHhhhcCcceeccCCceeeeEEEEEEEEeecC----------------------------CCccccee Confidence 23344688999999887 344211111 111 112222222100 11111111 Q ss_pred hhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccccCCc Q lcl|NC_015158. 237 EKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIRQDN 316 (581) Q Consensus 237 ~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p~~ 316 (581) +.+| +-+....+.+ .++.|-+-..-..++....... ............+--..+..||+.+.-. .-+. T Consensus 196 ~q~R---------vL~~~~~g~~-~~~v~r~~~~~~~~~~~~~~~~-~~~~~~~~~~~~g~~~l~~IPfv~~~~~-~~~~ 263 (501) T protein:vir:95 196 GQFR---------VLRLDEEGYY-VHEIWREPQPTKADGSKIPKGN-YQQYVVYKPTDAQGKRLTEIPFMFIGSE-NNDS 263 (501) T ss_pred EEEE---------EEeeCCCceE-EEEEEEecCCcccCcceecCCc-ccccceeeeeccCCCcCCeeeEEEEecC-CCCC Confidence 1110 0000001111 1233321000000000000000 0000111111111123567887755332 2233 Q ss_pred ccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc-cc--------ccccCCceeEEeCCCCCcccccCCCccc Q lcl|NC_015158. 317 LYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD-VE--------EFVWGPMEQIYINGDGDVEMMAPNTQAL 387 (581) Q Consensus 317 ~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d-~~--------~i~~~pG~vi~~~~~~~i~~~~~p~~~~ 387 (581) ..|.++...|-++...+=......-+.+..+..|...+.+- .+ .+.-.++..|....+++..++.+..... T Consensus 264 ~~~~pPLl~lA~lni~hy~~ssd~~~~l~~~~~P~l~i~G~~~~~~~~~~~~~i~~G~~~~~~lP~~~~~~~ie~~~~~i 343 (501) T protein:vir:95 264 NPDNPNFYDLASLNMAHYRNSADYEESCYIVGQPTPVLIGLTEEWVTNVLKGSVNFGSRGGIPLPVGADAKLLQASENTM 343 (501) T ss_pred CCCccchHHHHHHHHHHHhhhhHHHHHHHHcccceeeeeCCcccccccCCCCceeecccccccCCCCCceeEEecChhhH Confidence 44566666666665555444555677788888998766542 11 2333455556555666766666543222 Q ss_pred hhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccce Q lcl|NC_015158. 388 QADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADT 467 (581) Q Consensus 388 ~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~ 467 (581) +-..++.+.+.|... |. +... ...+++||++.+.-..+.+..+..++.++++ .++.+++++.+|...+.+ .-. T Consensus 344 -~~~~l~~l~~~m~~~-Ga-~ll~--~~~~~~Ta~~~~~~~~~~~S~L~~~a~~le~-al~~~l~~~a~w~g~~~~-~~~ 416 (501) T protein:vir:95 344 -LKEAMDTKERQMVAL-GA-KLVE--QKEVQRTATEAELEAASEGSTLSSATKNVSA-AFEWALKWAARWVGQADS-GVK 416 (501) T ss_pred -HHHHHHHHHHHHHHH-HH-hhcc--CCccchhHHHHHHHHHHHhHHHHHHHHHHHH-HHHHHHHHHHHHcCCCCC-ceE Confidence 122244444444333 32 2222 2346899999999889999999999999996 468889988888543211 111 Q ss_pred eeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHHHHHHHHhcCCCcc Q lcl|NC_015158. 468 IRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLAKMLEHNLSLGGWD 547 (581) Q Consensus 468 iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~~~e~~~l~~~~ 547 (581) |++ |+ +|-. .+- ..+.++.|..+. ..+.++...+++.+... ++... T Consensus 417 v~i-~~-----------------df~~--~~~-----~~~~~~al~~~~-------~~G~is~~t~~~~L~~~-~v~~~- 462 (501) T protein:vir:95 417 FEL-NT-----------------DFDI--ARM-----TPDERRSLVEEW-------QKGAITFEEMRTGLRKA-GVATE- 462 (501) T ss_pred EEE-ec-----------------cccc--ccC-----CHHHHHHHHHHH-------hCCCCcHHHHHHHHHhC-CCCCh- Confidence 211 11 2210 000 122333343333 23446666676666442 44331 Q ss_pred cccCCCCcHHHHHHHHHHHHHHHHHH------HHhcccCC Q lcl|NC_015158. 548 IFKPNVAVMEAQTTSALVNQSQAQIE------EEAQVPLV 581 (581) Q Consensus 548 ~~~~~~~~~~~~~~q~~~q~aq~~~~------~~~~~~~~ 581 (581) +...+.+ ++..++. +-+-.|.- T Consensus 463 ------~~~~e~e------~i~~~~~~~~~~~~~~~~~~~ 490 (501) T protein:vir:95 463 ------DDSKAKE------KIAKDTAEAMALATPANVPGD 490 (501) T ss_pred ------hHHHHHH------HHHhhhcCcccccccCCCCCC Confidence 1111111 0111100 00011111 No 128 >protein:vir:95014 Length: 491 # NCBI annotation: structural protein # Family: family:all:584 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224035;genbank:gi:62327322;genbank:GeneID:5176842 Probab=96.97 E-value=0.00023 Score=40.49 Aligned_cols=455 Identities=13% Similarity=0.067 Sum_probs=183.5 Q ss_pred Cccchhhhhhhcc-chhhhHHHHHHHHHhhHHhhhhhHHHHHHHHH-HHhhccccccccc--ccccccccccccchHHHH Q lcl|NC_015158. 1 MTGKVLELQQMLD-DTRDGLAEQIANTWQNWNSQRQEWLSQKSELR-NYIFATDTTTTTN--STLPWKNKTTLPKLCQIR 76 (581) Q Consensus 1 ~~~~~~~~~~~~~-~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~-~y~~~~~~~~~~~--~~~~~k~~~~~pki~~~~ 76 (581) |-+.- ++=.+ +++|-.=......|+...+.-.. +..+.. + -|+-..+ +..+. -+..-...++.|.+..++ T Consensus 1 ~~~~~---~~~~~V~~~hp~y~a~~~~W~~ird~~~G-~~~~~~-r~~yl~~~~-~~~~e~~Y~~rl~rA~~~n~~~~tl 74 (491) T protein:vir:95 1 MLTAN---GQGSGVKTKHREWLHYAPKWQKVRHALAG-DLVGYL-RNVGLNEPD-KAYGEARQAEYEAGGIVYNFTRRTL 74 (491) T ss_pred CcccC---CccCCCCccCHHHHHHHHHHHHHHHHhcC-cchhhc-ccCCCcCCC-CCCCHHHHHHHHhcccCCChHHHHH Confidence 11000 00011 12222222222223322111111 111100 1 1111110 00000 000011224455555555 Q ss_pred HHHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhc-chHHHHHHHHHHHhhcCceEEEEeeecceeeeee Q lcl|NC_015158. 77 DNLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKES-DFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEE 155 (581) Q Consensus 77 d~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~-n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~ 155 (581) +.+.+.++. .+..++ .-+.++.+..|-=.++ ++...++++++.++.||.|.+-|-+...... . T Consensus 75 ~~l~G~vfr----k~p~~~----------~p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~--T 138 (491) T protein:vir:95 75 SGMVGSVMR----KEPEIN----------IPKELEYLLKNADGSGVGLIQHAQDTLMEIDSVGRGGLLVDAPETAAA--T 138 (491) T ss_pred HHHhchhhc----CCceee----------ccHHHHHHHhccCCCCCCHHHHHHHHHHHHHHcCeEEEEEecCCCccc--C Confidence 555555442 222221 1223444555554443 5788899999999999999998887543211 1 Q ss_pred eeeEeeeeccceEEecchhhe-eecCCCCCcccCceE-EEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccc Q lcl|NC_015158. 156 SGATRDTYFGPRAVRIDPKDI-VFNPVAVDFAHSPKI-IRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTR 233 (581) Q Consensus 156 ~~~~~~~~~~p~ie~V~p~df-~~DP~a~~~~d~~~i-~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~ 233 (581) ..+.+..-..|++..++|.++ =|+-.-.+ ...... ++.+.|.. +. + ..+.+.. T Consensus 139 ~Ade~~~~~rPy~~~~~~~~IinW~~~~v~-g~~~L~~v~l~E~~~-------~~-d----------------~~~~f~~ 193 (491) T protein:vir:95 139 AAEQNAGLLNPTIAFYTTENIVNWRLTRVG-SVNRVTMVVLRETWE-------YH-E----------------PGNEFET 193 (491) T ss_pred HHHHHHhcCCcEEEEechhhhcCceeeeeC-CceeeeEEEEEEeEE-------ee-c----------------CCCCccc Confidence 112222333688999998887 33311001 101110 11111100 00 0 0000011 Q ss_pred hhhhhccccccccccccccccCCceEEEEEE--eeeee-----cccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCee Q lcl|NC_015158. 234 EDCEKAVGFSMDGFGNLYDYFQSPYVEVLTF--YGDYH-----DTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIF 306 (581) Q Consensus 234 ~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~--~g~~~-----d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~ 306 (581) +.. .+++||+. ||.+. ...+|... ..+ ..++....+ -..+..||+ T Consensus 194 ~~~--------------------~qyRvL~l~~~g~~~~~v~r~~~~g~~~--~~~-----~~~~~~~g~-~~l~~IPfv 245 (491) T protein:vir:95 194 KYG--------------------EQYRVLDIDTDGNYRQRLFRFDAEGGAQ--EEV-----VEIYPDLGE-SLRGVIPFT 245 (491) T ss_pred ceE--------------------EEEEEEeecCCCceEEEEEEEcCCCcce--eee-----eeeeecCCC-cccCeeEEE Confidence 111 13444443 22110 01111110 000 001111111 124667887 Q ss_pred EecccccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEeccc----cc--------cccCCceeEEeCCC Q lcl|NC_015158. 307 HCGWRIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGDV----EE--------FVWGPMEQIYINGD 374 (581) Q Consensus 307 ~~~~~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d~----~~--------i~~~pG~vi~~~~~ 374 (581) .+.... -+...|.++...|-.+...+=......-+.+..+..|+..+.+.. +. ++..++..+....+ T Consensus 246 ~~~~~~-~~~~~~~pPLl~LA~lni~Hy~~ssd~~~~l~~~~~P~l~~~G~d~~~~~~~~~~~~~~i~~g~~~~~~lP~~ 324 (491) T protein:vir:95 246 FIGATN-NDATIDDAPLLPLAELNIGHYRNSADNEESSFVVGQPTLFIYPGDNLTPQSFKEANPNGIKFGSRCGHNLGYG 324 (491) T ss_pred EEecCC-CCCCCCcCchHHHHHHHHHHhhhhhHHHHHHHHcccceeeeecCcccCcchhhccCcceeEecCcCCcCCCCC Confidence 666433 344556777777777766666666667777888888987665421 11 22223333444444 Q ss_pred CCcccccCCCccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015158. 375 GDVEMMAPNTQALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAM 454 (581) Q Consensus 375 ~~i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~ 454 (581) ++..++++..... +-..+..++..|... |.. ... .++++||++++.-..+.+..+..++.+.++ .++.+++++ T Consensus 325 ~~~~~ie~~~~~~-~~~~l~~~e~qm~~~-Ga~---l~~-~~~~~Ta~~~~~~~~~~~S~L~~~a~~~e~-al~~~l~~~ 397 (491) T protein:vir:95 325 GSAQLIQAGENNL-ARQNMLDKEQQAIQI-GAQ---LIT-PSQQITAESARIQRGADTSVMATIARNVSQ-AYTDALRWV 397 (491) T ss_pred CccceeecCcchH-HHHHHHHHHHHHHHH-HHH---hcc-CCcchhHHHHHHHHHHhhHHHHHHHHHHHH-HHHHHHHHH Confidence 5555555543322 122233333333322 221 112 234799999999999999999999999996 468888888 Q ss_pred HHHHHhhcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHH Q lcl|NC_015158. 455 LEISRRNLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLA 534 (581) Q Consensus 455 ~~f~~~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~ 534 (581) ..|.-.+.+.+-.| .+++ +|-....- .+.++.|+.+.+ ...++...++ T Consensus 398 a~w~G~~~~~~v~i-------------~~n~-----dF~~~~~~-------~~~~~all~~~~-------~G~is~~t~~ 445 (491) T protein:vir:95 398 AMMLGKPEDSEVEF-------------QLNM-----DFFLQPMT-------AQDRAAWMADIN-------AGLLPATAYY 445 (491) T ss_pred HHHcCCCCCCceEE-------------Eeec-----ccccccCC-------HHHHHHHHHHHh-------cCCCCHHHHH Confidence 87743221111011 1111 12111110 122333333322 1223333344 Q ss_pred HHHHHHhcCCCcccc----------cCCCCcHHHHHHHHHHHHHHHHHHH Q lcl|NC_015158. 535 KMLEHNLSLGGWDIF----------KPNVAVMEAQTTSALVNQSQAQIEE 574 (581) Q Consensus 535 ~~~~e~~~l~~~~~~----------~~~~~~~~~~~~q~~~q~aq~~~~~ 574 (581) ..+.. .++...+.. .+.+.+.+. .=.++|-|| ++++ T Consensus 446 ~~L~~-~~vl~~~~e~~~~~ie~~~~~~~~~~~~--~~~~~~~~~-~~~~ 491 (491) T protein:vir:95 446 AALRK-AGVTDWTDEDILNAIEDAPLPSGAVTQV--AGEIPQAAQ-QQQE 491 (491) T ss_pred HHHHh-CCCCCccHHHHHHHHHhcCCCCCccccc--cccchhhhh-hccC Confidence 43322 233221000 000000000 000111111 1111 No 129 >protein:vir:78393 Length: 489 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110831;genbank:gi:134288592;genbank:GeneID:5179656 Probab=96.41 E-value=0.00065 Score=38.07 Aligned_cols=447 Identities=12% Similarity=0.072 Sum_probs=188.8 Q ss_pred Cccchhhhhhhcc-chhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhccccccccc--ccccccccccccchHHHHH Q lcl|NC_015158. 1 MTGKVLELQQMLD-DTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTTTTTN--STLPWKNKTTLPKLCQIRD 77 (581) Q Consensus 1 ~~~~~~~~~~~~~-~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~~~~~--~~~~~k~~~~~pki~~~~d 77 (581) |-+. -++=.+ +++|-.=......|+...+.-.. ++.|.+=.-|+-..+ +..+. -+..-...++.|-+..+++ T Consensus 1 ~~~~---~~~~~~V~~~hp~y~a~~~~W~~ird~~~G-~~~~~~r~~yl~~~~-~~~~e~~Y~~rl~rA~~~n~~~~tl~ 75 (489) T protein:vir:78 1 MLTE---NGQGSGVKTKHREWLHYAPKWQKVRHALAG-ELVSYLRNVGLNEPD-KAYGEARQAEYEAGGIVYNFTRRTLS 75 (489) T ss_pred CccC---CCccCCCCccCHHHHHHHHHHHHHHHHhcC-cccccccCCCCCCCC-CCCChHHHHHHHhccccCChHHHHHH Confidence 1100 011111 22222222233333322222111 111111000111111 11000 0111112244555555555 Q ss_pred HHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhc-chHHHHHHHHHHHhhcCceEEEEeeecceeeeeee Q lcl|NC_015158. 78 NLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKES-DFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEES 156 (581) Q Consensus 78 ~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~-n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~ 156 (581) .+.+.++ ..+-+++ .-+.++.+..|-=.++ ++...++++++.++.||.|.+-|-+.... .... T Consensus 76 ~l~G~vf----rk~p~~~----------~p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~--~~T~ 139 (489) T protein:vir:78 76 GMVGSVM----RKEPEIN----------IPKELEYLLKNADGSGVGLIQHAQDTLMEIDSVGRGGLLVDAPETG--AATA 139 (489) T ss_pred HHhchhh----cCCccee----------ccHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeeCCCC--CcCH Confidence 5555544 2333332 1233455555555454 57888999999999999999888876331 1111 Q ss_pred eeEeeeeccceEEecchhhe-eecCCCCCcc-cCce-EEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccc Q lcl|NC_015158. 157 GATRDTYFGPRAVRIDPKDI-VFNPVAVDFA-HSPK-IIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTR 233 (581) Q Consensus 157 ~~~~~~~~~p~ie~V~p~df-~~DP~a~~~~-d~~~-i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~ 233 (581) .+.+..-..|++..++|.++ =|+-.-.+.. ...+ +.|.....+| T Consensus 140 ade~~~~~rPy~~~~~~~~IinW~~~~v~G~~~Lt~v~lrE~~~~~d--------------------------------- 186 (489) T protein:vir:78 140 AEQNAGLLNPTIAFYTTENIVNWRLTRVGSVNRVTMVVLRETWEYNE--------------------------------- 186 (489) T ss_pred HHHHHhcCCcEEEEechhhhcCceeeeeCCccceeEEEEEEeEEeec--------------------------------- Confidence 22222334688999999887 3331111110 1111 1111111000 Q ss_pred hhhhhccccccccccccccccCCceEEEEEE--eeee-----ecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCee Q lcl|NC_015158. 234 EDCEKAVGFSMDGFGNLYDYFQSPYVEVLTF--YGDY-----HDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIF 306 (581) Q Consensus 234 ~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~--~g~~-----~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~ 306 (581) ..++|.. -...++++|+. ||.+ ....+|...... + .+....+--..+..||+ T Consensus 187 ---------~~~~f~~----~~~~q~RvL~~~~~g~~~~~~~r~~~~g~~~~~~-~-------~~~~~~g~~~l~~IPfv 245 (489) T protein:vir:78 187 ---------PGNEFET----KYGEQYRVLDIDSDGNYRQRLFRFDAEGGAQEDV-V-------EIYPDLGESLRGVIPFT 245 (489) T ss_pred ---------CCCCccc----eeEEEEEEEecCCCcceEEEEEEeecCCccccee-e-------EEeccCCCCccCeeeEE Confidence 0001110 00012334332 2211 111222211110 0 01111111235677877 Q ss_pred EecccccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEeccc--cc----------cccCCceeEEeCCC Q lcl|NC_015158. 307 HCGWRIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGDV--EE----------FVWGPMEQIYINGD 374 (581) Q Consensus 307 ~~~~~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d~--~~----------i~~~pG~vi~~~~~ 374 (581) .+.-.. -+...|.++...|-.+...+=......-+.+..+..|+..+.+.. .+ ++..++..|....+ T Consensus 246 ~~~~~~-~~~~~~~pPLl~LA~lni~Hy~~ssd~~~~l~~~~~P~l~i~G~d~~~~~~~~~~~~~~i~~g~~~~~~lp~~ 324 (489) T protein:vir:78 246 FIGATN-NDATIDDAPLLPLAELNIGHYRNSADNEESSFVVGQPTLFIYPGENLTPQAFKEANPNGIKFGSRRGHNLGYG 324 (489) T ss_pred EEecCC-CCCCCCcCchHHHHHHHHHHhhhhhHHHHHHHHcccceeeeecCccCCcccccccCccceeeCCcccccCCCC Confidence 666433 244556777777777776666666777788888999988765421 11 22234444444445 Q ss_pred CCcccccCCCccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015158. 375 GDVEMMAPNTQALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAM 454 (581) Q Consensus 375 ~~i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~ 454 (581) ++..++++...... -..+.-+++.|.. .|..-. ..++++||++++.-..+.+..+..++.+.++ .++.+|+++ T Consensus 325 ~~~~~ie~~~~~~~-r~~l~~le~qm~~-lGa~l~----~~~~~~Ta~~~~~~~~~~~S~L~~~a~~~e~-al~~~l~~~ 397 (489) T protein:vir:78 325 GSAQLIQAGENNLA-RQNMLDKEQQAIQ-IGAQLI----TPTQQITAQSARIQRGADTSVMATIARNVSQ-AYTDALRWV 397 (489) T ss_pred CCcceeccCcchHH-HHHHHHHHHHHHH-Hhhhhc----cCCcchhHHHHHHHHHHhhHHHHHHHHHHHH-HHHHHHHHH Confidence 55555555432211 1112223333332 222111 1234799999999999999999999999996 468888888 Q ss_pred HHHHHhhcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHH Q lcl|NC_015158. 455 LEISRRNLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLA 534 (581) Q Consensus 455 ~~f~~~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~ 534 (581) ..|.-.+.+.+-.| .+++ +|.+...- .+.++.|..+.+ ..-|+...++ T Consensus 398 a~w~G~~~~~~~~i-------------~~n~-----dF~~~~~d-------~~~~~al~~~~~-------~G~is~~t~~ 445 (489) T protein:vir:78 398 AVMLGKPEDTEVEF-------------RLNM-----DFFLEPMT-------AQDRAAWMADIN-------AGLLPATAYY 445 (489) T ss_pred HHHcCCCCCCceEE-------------Eeec-----ccCcccCC-------HHHHHHHHHHHh-------cCCCCHHHHH Confidence 87743321111111 1111 12111110 222333333221 1223444444 Q ss_pred HHHHHHhcCCCcccccCCCCcHHHHHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 535 KMLEHNLSLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 535 ~~~~e~~~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) ..+.. -|+.. ++.+ +++.+++.+ ..|+- T Consensus 446 ~~L~~-~gv~d---------~~~e--------~~~~ei~~~-~~~~~ 473 (489) T protein:vir:78 446 AALRK-AGVTD---------WTDA--------DIKDAVADQ-PLPVA 473 (489) T ss_pred HHHHh-CCCCC---------ccHH--------HHHHHHhhc-CCCcc Confidence 44422 12211 1111 111122211 11211 No 130 >protein:vir:96783 Length: 488 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224240;genbank:gi:62362375;genbank:GeneID:3345722 Probab=93.19 E-value=0.0085 Score=31.95 Aligned_cols=453 Identities=11% Similarity=0.032 Sum_probs=182.5 Q ss_pred Cccchhhhhh---hccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHH-HHhhccc----cccccccccc---------- Q lcl|NC_015158. 1 MTGKVLELQQ---MLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELR-NYIFATD----TTTTTNSTLP---------- 62 (581) Q Consensus 1 ~~~~~~~~~~---~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~-~y~~~~~----~~~~~~~~~~---------- 62 (581) |-.-.-=+-| |.=++.|-.=......|+. -|-.....=.+.. -||-... ...+.....+ T Consensus 1 ~~~~~~~~~~~~~m~V~~~hp~y~a~~~~W~~---~~d~g~~~~k~~g~~YLPk~~~~~~~~~~d~~y~~~~~~~~~~y~ 77 (488) T protein:vir:96 1 MLKCLYIKHRGFFMLTPIYHPDYLVNAPQWLR---NLDCVMDNIKRKKQTYLPNLGAIPPEAKTDPKVTALAAKIEKDWE 77 (488) T ss_pred CceeEEEeecceeecccccCHHHHHHhhhhhH---hhhhhhHHHHHhhhhcCCCCCCccccccCcchhhhhhccchhhhH Confidence 1000000000 1111122222222233332 1211111111111 1322110 0011111111 Q ss_pred ---ccccccccchHHHHHHHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHHHhc-chHHHHHHHHHHHhhcC Q lcl|NC_015158. 63 ---WKNKTTLPKLCQIRDNLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKVKES-DFRTIMSQLLLDYIDYG 138 (581) Q Consensus 63 ---~k~~~~~pki~~~~d~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~-n~~~~~~~~~~d~~~~G 138 (581) ++..++.|-...+++.+...++. -.|.-..++. ..++.+..|-=.++ ++...++++++.++.|| T Consensus 78 ~~~~~rA~~~n~~~~tl~~l~G~vfr----------k~p~~~~~~~--~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G 145 (488) T protein:vir:96 78 DLTWRLANYVNIVNPTMNAITGAVMR----------REPEFDTMDN--PVLIGLRDNIDGKGNGIDQECKQALNALQWGS 145 (488) T ss_pred hhhhhccccCchhHHHHHHhcchhhc----------cCceeccCCc--HHHHHHHhccCCCCCCHHHHHHHHHHHHHhcC Confidence 12344556666666665555542 2222221111 12455665555554 57888999999999999 Q ss_pred ceEEEEeeecceeeeeeeeeEeeeeccceEEecchhhe-eecCCCCCcc-cC-ceEEEEEecHHHHHHHhhccCccchhH Q lcl|NC_015158. 139 NCFATVEYVKETTKDEESGATRDTYFGPRAVRIDPKDI-VFNPVAVDFA-HS-PKIIRTVLNEGELLQMEQDQPENASLA 215 (581) Q Consensus 139 ~~i~k~~~~~~~~~~~~~~~~~~~~~~p~ie~V~p~df-~~DP~a~~~~-d~-~~i~r~~~T~~el~~m~~~~~~~~~~~ 215 (581) .|.+-|-+..... ...+.+..-..|++..++|.++ =|+-...+.. .. ..+.|...+..| .. T Consensus 146 ~~~ilVD~P~~~~---T~ade~~~~~rPy~~~~~a~~IinW~~~~v~G~~~L~~v~lrE~~~~~D----------~~--- 209 (488) T protein:vir:96 146 RCGWLVRSHPESA---TMADWNKGKKLPTAAFYDALHIIDWEVEYIDGEEKLTYLSLLEDYQERD----------GG--- 209 (488) T ss_pred eEEEEEecCCCcC---CHHHHHHhcCCcEEEEechhhhcCcceeccCCceeeEEEEEEEEEEecc----------CC--- Confidence 9999888763321 1112222334688999999887 3442111111 11 111222221100 00 Q ss_pred HHHHHHHhhhccCCcccchhhhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeec Q lcl|NC_015158. 216 SAIARRREFRRGLGTYTREDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKE 295 (581) Q Consensus 216 ~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~ 295 (581) +...+..+ .... ...+. ++.|-. ..++.. ..++.+ .. T Consensus 210 -------------~~~~~~~~------------~~~~-l~~g~---~~v~~~---~~~~~~-~e~~~~----------~~ 246 (488) T protein:vir:96 210 -------------TYVSKQRL------------INHR-LVDGL---CEFQEV---TDDEYS-DEWTPV----------LI 246 (488) T ss_pred -------------CcccceEE------------EEEE-EECcE---EEEEEE---ecCCcc-cceEee----------cC Confidence 00000000 0000 00111 122210 000110 001111 11 Q ss_pred CCCccCCCCeeEecccccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEeccc-ccc---ccCCcee--- Q lcl|NC_015158. 296 NPSWFAQAPIFHCGWRIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGDV-EEF---VWGPMEQ--- 368 (581) Q Consensus 296 nP~~~g~~Pf~~~~~~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d~-~~i---~~~pG~v--- 368 (581) +--..+..||+.+.... -+...|.++...+-.++..+=......-+.+-.+.-|.+.+..+. +.. ...++++ T Consensus 247 g~~~l~~IP~v~~~~~~-~~~~~~~pPLldLA~lnl~Hy~~ssd~~~il~~~~~p~lv~~~~~~~~~~~~~~~~~g~~~~ 325 (488) T protein:vir:96 247 NSKQSDTIPFFLASSQS-NEWCIDSTPLTSLAEISLSIYVMNAYSNKAMILANEAKWMVDMGDMNKTMASEMNPLGFTLA 325 (488) T ss_pred CCcccCeeEEEEEecCC-CCCCCCCCchHHHHHHHHHHHhhhhHHHHHHHhcCCceeeeccCCCCcccccccccceeeec Confidence 11134667888665433 344557777777777766665555555555555555666543111 110 0012222 Q ss_pred EEeC---CCCCcccccCCCccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015158. 369 IYIN---GDGDVEMMAPNTQALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVM 445 (581) Q Consensus 369 i~~~---~~~~i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~ 445 (581) ++.- ..|+..+.++.+... +...++.+++.|.. .|..-. . .++++||++++.-..+.+..+..++.+.++ T Consensus 326 ~~~~~~~~~g~~~~~e~~~~~l-~~~~l~~l~~qm~~-~Ga~l~---~-~~~~~Ta~~~~~~~~~~~S~L~~~a~~le~- 398 (488) T protein:vir:96 326 GRMPYYVKNGDVKVIQAQFSPE-TENKVEKLFEQAVK-VGASLF---T-QQSNETATGAAIRSGSSTASMATLGNNVED- 398 (488) T ss_pred ccccccccCCceeecCCchhHH-HHHHHHHHHHHHHH-HhHhhc---c-CCCcchHHHHHHHHHHhhHHHHHHHHHHHH- Confidence 1111 223344333332211 12224444444422 232111 1 224689999999999999999999999996 Q ss_pred HHHHHHHHHHHHHHhhcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhccccccccc Q lcl|NC_015158. 446 LMEKVLNAMLEISRRNLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIK 525 (581) Q Consensus 446 ~~~~li~~~~~f~~~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~ 525 (581) .++.+|+++.+|.--+.+.. ++ ..+-+. |.-+|-.... -.|.++.|..+.+ . T Consensus 399 al~~~l~~~A~w~g~~~~~~------~~---~~~~~~-----in~dF~~~~l-------d~~~~~al~~~~~-------~ 450 (488) T protein:vir:96 399 TVRNMLRFIMRYFEGTNLYV------NP---DELVFK-----LNRDYFDVEV-------NPQMLQVAYAAMM-------E 450 (488) T ss_pred HHHHHHHHHHHHcCCCCCCc------Cc---cceEEE-----eccCCCCccC-------CHHHHHHHHHHHh-------c Confidence 56889998888853221100 00 000111 1112211000 0223333333332 2 Q ss_pred chhHHHHHHHHHHHHhcCCCcccccCCCCcHHHHHHHHHHHHHHHHHHHHhcccC Q lcl|NC_015158. 526 PHVSTENLAKMLEHNLSLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQVPL 580 (581) Q Consensus 526 p~~~~~~l~~~~~e~~~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~ 580 (581) ..|+...+++.+... |+- .|+. +-++++.+. + +.=+.| T Consensus 451 G~Is~~t~~~~L~~~-gvl-----~~d~--~~e~~~~~i--------e-~~g~~~ 488 (488) T protein:vir:96 451 GNLPQVSWFELLKRA-RVV-----RGDM--SKEEFDEHI--------A-ELGFGM 488 (488) T ss_pred CCCCHHHHHHHHHhC-CcC-----CccC--CHHHHHHHH--------h-hcCCCC Confidence 345555666665442 222 2322 223332222 2 122223 No 131 >protein:vir:98853 Length: 219 # NCBI annotation: hypothetical protein # Family: family:all:196 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654729;genbank:gi:109302914;genbank:GeneID:4156058 Probab=81.32 E-value=0.087 Score=26.40 Aligned_cols=194 Identities=10% Similarity=0.021 Sum_probs=76.8 Q ss_pred EEEEEEeeeeecccCCceeeeeEEEE--E-eCCEEEEeecCCCccCCCCeeEecccccCCcccCCCcHHhhhhHHHHHHH Q lcl|NC_015158. 259 VEVLTFYGDYHDTQSGTFKRNMKVTI--I-DRMFVIEEKENPSWFAQAPIFHCGWRIRQDNLYAMGPLDNLVGMQYRIDH 335 (581) Q Consensus 259 vevlE~~g~~~d~~~d~~~e~~~itv--~-~g~~iir~~~nP~~~g~~Pf~~~~~~~~p~~~~G~s~~~~l~d~Q~~~n~ 335 (581) +++.+ +|.+. ++... . .++...... -++. .++......+..||.|+...+...-....+ T Consensus 1 ~r~~~---------dg~~~--y~~~~~~~~~~g~~~~~~-----~~ei--lH~r~~~~~~~~~Glspi~~a~~~i~~~~a 62 (219) T protein:vir:98 1 MRVCK---------DGNYK--YLMKKSLYDTKSEIYEYN-----KNDV--IFIKLYDPMQQVYGSPDYVGGITSALLNSD 62 (219) T ss_pred Cceee---------cCeEE--EEEecceecCCceeEEec-----cccE--EEecCCCCCCCcceecHHHHHHHHHHHHHH Confidence 33211 11110 00000 0 001111111 1111 222222223567899987665544333333 Q ss_pred HHHHHHHHHHHhcCCeEEE-ecc--ccc-----cc--c------CC-ceeEEeCC-----CCCcccccCCCccchhHHHH Q lcl|NC_015158. 336 LENLKADVFDLIAFPPMKV-KGD--VEE-----FV--W------GP-MEQIYING-----DGDVEMMAPNTQALQADMQI 393 (581) Q Consensus 336 ~~R~~iDn~~~s~np~~~v-~~d--~~~-----i~--~------~p-G~vi~~~~-----~~~i~~~~~p~~~~~~~~~l 393 (581) ..+-...-....+.|..++ ..+ .++ +. | .. +.++-... +-...++...+...+....- T Consensus 63 a~~~~~~~f~Ng~~p~gil~~~~~~l~~e~~~~~~~~~~~~~g~~n~~~~~l~~~gg~~~G~~~~~~~~~~~d~qfle~r 142 (219) T protein:vir:98 63 ATIFRRRYYSNGAHMGFILYSTDPDMTEEMEDEIAERIRDSKGVGNFRSMFVNIAGGHPDGLKVIPIGDTGQKDEFANIK 142 (219) T ss_pred HHHHHHHHHhcCCCCceEEEeCCCCCCHHHHHHHHHHHHHhcCcccccceeEecCCCCccceeEEEccCCHHHHHHHHHH Confidence 3222222222334454433 221 111 10 0 01 22222211 11233333333333333334 Q ss_pred HHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceeeecCc Q lcl|NC_015158. 394 QILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADTIRVFDS 473 (581) Q Consensus 394 q~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~iR~~~~ 473 (581) ++...++-..-|||+...|+...+.-|-+.+ ....+.|...-+.|++..+-+.+.+....+...|+. T Consensus 143 k~~~~eIa~~fgVPp~~lG~~~~~~~~~sn~-----------eq~~~~f~~~tL~P~~~~ie~~ln~~~~~~~~~~~~-- 209 (219) T protein:vir:98 143 NISAQDVLTSHRFPPGLSGIIPVNTAGLGDP-----------LKIREAYQADEVLPLQEIIAESINSDYEIKSALKVN-- 209 (219) T ss_pred HhhHHHHHHHhCCCHHHcccccCCCCCccCH-----------HHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCccEEe-- Confidence 5555667778999999999754322222112 223335666677888877765553333333333331 Q ss_pred hhcccCCCccCHHHhc Q lcl|NC_015158. 474 DDKVATFMNVNKDDIT 489 (581) Q Consensus 474 ~~~~~~~~~v~r~di~ 489 (581) |-..+..|.. T Consensus 210 ------F~~~~~~d~~ 219 (219) T protein:vir:98 210 ------FKQPEKRDKN 219 (219) T ss_pred ------ecCcccccCC Confidence 3333344443 No 132 >protein:vir:78641 Length: 278 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429941;genbank:gi:156603995;genbank:GeneID:5525387 Probab=76.34 E-value=0.14 Score=25.32 Aligned_cols=255 Identities=10% Similarity=0.060 Sum_probs=105.0 Q ss_pred cccCce-EEEE-EecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchhhhhccccccccccccccc--cCCceEE Q lcl|NC_015158. 185 FAHSPK-IIRT-VLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDCEKAVGFSMDGFGNLYDY--FQSPYVE 260 (581) Q Consensus 185 ~~d~~~-i~r~-~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~ve 260 (581) +..+++ +.+. ..+...|..+-...|-..-...++............ +. -+...+ +..+...++ ..+..|+ T Consensus 1 ia~l~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~ll~~G-na-~~~i~r----~~~G~~~~l~~l~~~~v~ 74 (278) T protein:vir:78 1 MASLPLKMYEDYKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKG-NA-YVLIER----DIYHQPSKLFLLNPDVVE 74 (278) T ss_pred CccceeEEEecCcccccHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcC-CE-EEEEEE----CCCCcEEEEEEECCceeE Confidence 555555 2222 223334433322211111112222222221110000 00 000001 000111110 1122333 Q ss_pred EEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccccCCcccCCCcHHhhhhHHHHHHHHHHHH Q lcl|NC_015158. 261 VLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIRQDNLYAMGPLDNLVGMQYRIDHLENLK 340 (581) Q Consensus 261 vlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~ 340 (581) +. ..+++....+.+....|. .... +... ..++......+.+||.|+...+...-....+..+.. T Consensus 75 v~--------~~~~~~~~~y~~~~~~g~-~~~~-----~~~e--vih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~ 138 (278) T protein:vir:78 75 ML--------IENQSRELYYSIHAATGN-KLIV-----HNMD--MLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFN 138 (278) T ss_pred EE--------EcCCCceEEEEEEcCCce-EEEE-----cccc--EEEECCCCCCCCeeeccHHHHHHHHHHHHHHHHHHH Confidence 31 111111112223222222 1111 1111 233333333567899999999988888777776665 Q ss_pred HHHHHHhcCCeEEEecc----cc----------ccccCCceeEEeCCCCCcccccCCCccchhHHHHHHHHHHHHHhcCC Q lcl|NC_015158. 341 ADVFDLIAFPPMKVKGD----VE----------EFVWGPMEQIYINGDGDVEMMAPNTQALQADMQIQILEAKMEEFAGA 406 (581) Q Consensus 341 iDn~~~s~np~~~v~~d----~~----------~i~~~pG~vi~~~~~~~i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv 406 (581) +.+... .|...+..+ .+ ......|++.-+..+-.++++...+...+.....++....+-..-|| T Consensus 139 ~~~~~~--~~~~i~~~~~~l~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgV 216 (278) T protein:vir:78 139 LTEMQK--PDSFMLKYGSNVGKEKRQQVLEDFKQYYEENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQL 216 (278) T ss_pred HHHhcC--CCcEEEEeCCCCCHHHHHHHHHHHHHHhccCCCceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCC Confidence 443333 344443221 11 11123677776766667777765544444444456777778888999 Q ss_pred chHhcCCCCccc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceeeecCchhcccCCCccCH Q lcl|NC_015158. 407 PREAMGIRTPGE-KTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADTIRVFDSDDKVATFMNVNK 485 (581) Q Consensus 407 ~~~~~G~~~~~~-~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~iR~~~~~~~~~~~~~v~r 485 (581) |+...|....++ .++ +...+.|...-+.|++..+-+-+.+. ++ ++ T Consensus 217 pp~~lg~~~~~~~sn~--------------~~~~~~~~~~~l~P~~~~i~~~ln~~--------L~------------~~ 262 (278) T protein:vir:78 217 PSVFLNARSNTNFAKN--------------EELNRFYLQHTLLPIVKQYEEEFNRK--------LL------------TK 262 (278) T ss_pred CHHHhCCCCCCCcccH--------------HHHHHHHHHHHHHHHHHHHHHHHHhh--------cC------------Ch Confidence 999999754432 222 11223455555666666553322211 11 11 Q ss_pred HHh----cCCceEEEe Q lcl|NC_015158. 486 DDI----TAKGRLRPV 497 (581) Q Consensus 486 ~di----~~~~~vva~ 497 (581) .++ ...|++-+. T Consensus 263 ~e~~~g~~~~f~~~~l 278 (278) T protein:vir:78 263 TDREKIGILNLTLNLI 278 (278) T ss_pred hHhcCCceEEEecccC Confidence 221 112222222 No 133 >protein:vir:80040 Length: 461 # NCBI annotation: gp3 # Family: family:all:297 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468707;genbank:gi:157325287;genbank:GeneID:5601731 Probab=70.37 E-value=0.21 Score=24.30 Aligned_cols=428 Identities=11% Similarity=0.052 Sum_probs=164.6 Q ss_pred hccchhhhHHHHHHHHHhhHHhhhhhHHHHH-HHHHHHhhcccccccccccccccccccccch--HHHHHHHHHH----- Q lcl|NC_015158. 11 MLDDTRDGLAEQIANTWQNWNSQRQEWLSQK-SELRNYIFATDTTTTTNSTLPWKNKTTLPKL--CQIRDNLHSN----- 82 (581) Q Consensus 11 ~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~-~~~~~y~~~~~~~~~~~~~~~~k~~~~~pki--~~~~d~~~~~----- 82 (581) |.. =+.++|.+...- ..+.||+-.+. ..+...++.|..-- .|+. ......++.+ T Consensus 1 ~~~----------------~~~a~~~~~~~~a~~~~~~~~~~g-~~~~~d~~~~~~~~-~~~~~~~~~l~~lY~~~~l~r 62 (461) T protein:vir:80 1 MYS----------------IDKAKQAKIDSKIVNRNDFMVGHG-KANSRDKLTRQTPG-NGQKLDLKACENLYASNSIAM 62 (461) T ss_pred Ccc----------------chhhhhhhhhhhhhhhhHHHhhcC-CcchhhhhhccccC-cccccCHHHHHHHHHhCCccc Confidence 221 112222221111 11334442221 01111111111100 1110 0111111110 Q ss_pred -HHHhhcC--CccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeeeeE Q lcl|NC_015158. 83 -YISALFP--NERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESGAT 159 (581) Q Consensus 83 -l~~~~f~--~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~~~ 159 (581) ..+..-. -.+|+.+.+.. ++.+++++ ..+++-+....+.+.++..-+||.|.+-+-..+... |. T Consensus 63 ~iVd~~a~d~~r~g~~i~~~~---~~~~~~~~----~~~~~l~~~~~l~~~~~~~rl~G~a~i~i~v~d~~~---~~--- 129 (461) T protein:vir:80 63 NIVDIISEDMVRAGWSLKTDN---KEMKKNIE----SKWRKLKTKDRFQKLYADKRLYGDGFLSIGVVSSNR---EQ--- 129 (461) T ss_pred hhhccchHHhhcCCeeeecCC---HHHHHHHH----HHHHHhhHHHHHHHHHHhhcccccEEEEEEeecCCc---cc--- Confidence 1110000 24578887643 23333333 334444677788999999999999876664321110 00 Q ss_pred eeeeccceEEecchhheeecCCC-CCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchhhhh Q lcl|NC_015158. 160 RDTYFGPRAVRIDPKDIVFNPVA-VDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDCEK 238 (581) Q Consensus 160 ~~~~~~p~ie~V~p~df~~DP~a-~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~ 238 (581) | ..++|-- .++....+|.- +....+ .+ ... T Consensus 130 ------~---------~~~~pl~~~~~~~~~~l~~--~~~~~i------~~--------------------------~~~ 160 (461) T protein:vir:80 130 ------A---------DLSTAIDPKTIKSIPYINT--FNTQKV------TQ--------------------------LYL 160 (461) T ss_pred ------c---------CccCCcccccccceeEEEe--cccccc------ch--------------------------hhh Confidence 0 0111110 01111111110 000000 00 000 Q ss_pred ccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEE-EEeCCEEEEeecCCCccCCCCeeEecccccCCcc Q lcl|NC_015158. 239 AVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVT-IIDRMFVIEEKENPSWFAQAPIFHCGWRIRQDNL 317 (581) Q Consensus 239 ~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~it-v~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p~~~ 317 (581) .++.....| +.+ +.+.+.+ ...++. ..+.. .......|+ .-+.+++...+.|... T Consensus 161 ~~dp~sp~f------g~P---~~y~i~~----~~~~~~--~~~~~~~~~~~~~iH---------~SRii~~~~~~~~~~~ 216 (461) T protein:vir:80 161 NQDMFSEHF------GEV---EFFEVNR----VSQLGE--EILSGTTASTSEQIH---------RSRIIHEQGLRFEGET 216 (461) T ss_pred cccCcCccc------ccc---eEEEEec----cccccc--cccccccCccceEEc---------cccEEEecCCCCCccc Confidence 000000011 111 1111111 000000 00000 000001111 1123334444556778 Q ss_pred cCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc------c-cc------cccCCceeEEeCCCCCcccccCCC Q lcl|NC_015158. 318 YAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD------V-EE------FVWGPMEQIYINGDGDVEMMAPNT 384 (581) Q Consensus 318 ~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d------~-~~------i~~~pG~vi~~~~~~~i~~~~~p~ 384 (581) ||.|+++.+.+.-.....++....+.+.....+.+.+.+. . .. .....-++.-++....+..+..+ T Consensus 217 ~G~S~le~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~~~~~~~~g~~~~d~~e~~e~~~~~- 295 (461) T protein:vir:80 217 KGRSIFESLYDIITVMDTSLWSVGQILYDFAFKVYKTDDIDALNKDDKANLTAMLDFMFRTEALAIIKGDEQLTKESTN- 295 (461) T ss_pred cCcchHHHHHHHHHHHHHHHHHHHHHHHHhCCCceecchHHhhhchHHHHHHHHHHHhcCCceEEEEcCCcceEEEecC- Confidence 9999999999999999999998888888877777665421 0 00 01112234445555666555544 Q ss_pred ccchhHHHHHHHHHHHHHhcCCchHh-cCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Q lcl|NC_015158. 385 QALQADMQIQILEAKMEEFAGAPREA-MGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLD 463 (581) Q Consensus 385 ~~~~~~~~lq~~~~~~ee~TGv~~~~-~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d 463 (581) ...+...+..+.+.+-..+|+|... .|.+++.+ |||=+.+.+=.. +++.+- +..++|++..+++++.+.+- T Consensus 296 -lsgl~~~l~~~~~~iaa~s~iP~t~L~G~s~g~~--asge~D~~~yyd-~i~~~q----e~~l~p~le~l~~~i~~s~~ 367 (461) T protein:vir:80 296 -VSGMKDLLDYGWDYLAGAVRMPKTVLKGQEAGTL--TGAQYDVMNYYA-RVSSIQ----ENRLRPQLEYLTRLLMWASD 367 (461) T ss_pred -cCCHHHHHHHHHHHHhhhhcCCeeeeecccCCcc--ccchHHHHHHHH-HHHHHH----HHHHHHHHHHHHHHHHHHhc Confidence 3456667888888999999999975 56665334 444333222222 222222 23334444444443333211 Q ss_pred ccceeeecCchhcccCCCccCH--HHhcCCceEEEecchhHHHHHHHH----HHHHHhhcccccccccchhHHHHHHHHH Q lcl|NC_015158. 464 VADTIRVFDSDDKVATFMNVNK--DDITAKGRLRPVGARHFAEQAQVV----QSLMGIANTPVWQDIKPHVSTENLAKML 537 (581) Q Consensus 464 ~~~~iR~~~~~~~~~~~~~v~r--~di~~~~~vva~ga~~~~~r~q~~----q~L~~~~~~~~~~~i~p~~~~~~l~~~~ 537 (581) . + ...+.+ .+++..|. +.-.....+++... +.+..+.+. +.|.|.. +.+.+ T Consensus 368 ~------~--------~~~~~p~~~~~~i~f~--~L~~~s~kekAe~~~~~a~a~~~~~~~---g~is~~e----~r~~l 424 (461) T protein:vir:80 368 D------C--------GPSIDPDSFEWAIEFN--PLWNLDSKTDAEVRKLTAEADQIYIVN---GVLDPDE----VKETR 424 (461) T ss_pred c------c--------ccccCccccceEEEeC--CCCCCCHHHHHHHHHHHHHHHHHHHhc---CCCCHHH----HHHHH Confidence 0 0 001111 23333332 22222333443322 223333322 2344444 44444 Q ss_pred HHHhcCCCcccccCCCCcHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_015158. 538 EHNLSLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQV 578 (581) Q Consensus 538 ~e~~~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~ 578 (581) ....++..-..+ +...+ +..+.+.+ .-+.+.+++.++ T Consensus 425 ~~~~~~~~~~~~-~~~~~-~~~~~~~~--~~~~~~~e~~~g 461 (461) T protein:vir:80 425 FGRFGLENSSKF-SGDSA-EIDKLAKL--VYDAYAKKNADG 461 (461) T ss_pred HHhcCCCCCccC-CCCCc-hhhhhhhh--ccccccccCCCC Confidence 444444322111 11111 11111111 223444566666 No 134 >protein:vir:80644 Length: 551 # NCBI annotation: gp23 # Family: family:all:2446 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468463;genbank:gi:157325038;genbank:GeneID:5601615 Probab=70.04 E-value=0.21 Score=24.25 Aligned_cols=417 Identities=16% Similarity=0.139 Sum_probs=157.7 Q ss_pred Cccc---------------------------hhhhhhhccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhcccc Q lcl|NC_015158. 1 MTGK---------------------------VLELQQMLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDT 53 (581) Q Consensus 1 ~~~~---------------------------~~~~~~~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~ 53 (581) |-.+ .++++....+...+. .+....+...- .+..+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~-----------~~~~~~a~~~~-----~~~~~~- 63 (551) T protein:vir:80 1 MKNKLGLFESIRLVGVNKSDAVKHIEVDDNYSIAIQQREQEQISKA-----------MNNKEVAYSQP-----VIGSMS- 63 (551) T ss_pred CchhhhhHHHhhhccCChhhcccccccccceeeecccccHHHHHHh-----------hccCcceeecc-----ccccee- Confidence 2111 122222222111111 11111000000 000000 Q ss_pred ccccccccc--------------ccccccccchHHHHHHHHHHHHHhhcCCcc--EEEeecCCh------hHHHHHHHHH Q lcl|NC_015158. 54 TTTTNSTLP--------------WKNKTTLPKLCQIRDNLHSNYISALFPNER--WLKWEGKSL------QDEAKRDAIQ 111 (581) Q Consensus 54 ~~~~~~~~~--------------~k~~~~~pki~~~~d~~~~~l~~~~f~~~~--~~~~~~~~~------~d~~~ae~~~ 111 (581) ...+....+ -.++.++..|..++.+..+.+-...+...+ -+++..+.. .+....+.++ T Consensus 64 ~~~~~~~r~~~~~~~~l~~~~~~~~~npiv~~~I~~ia~~IA~~~~~~~~~~~g~~~~i~~kd~~~~~~~~~~~~~~~i~ 143 (551) T protein:vir:80 64 ANPGFKTKPSIRNNQDLHGVLKKFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIKRIE 143 (551) T ss_pred cCcccccCccccChhHHHHHHHHhhcCHHHHHHHHHHHHHHhhhhhhhhhhcCCCCceEEecccCcccChhHHHHHHHHH Confidence 000000000 011233444444444444444333333222 233433322 2222333444 Q ss_pred HHHHHHHHh-----cchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeeeeEeeeeccceEEecchhheeecCCCCCcc Q lcl|NC_015158. 112 QYMDNKVKE-----SDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESGATRDTYFGPRAVRIDPKDIVFNPVAVDFA 186 (581) Q Consensus 112 ~~i~~~l~e-----~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~~~~~~~~~p~ie~V~p~df~~DP~a~~~~ 186 (581) ++|....-. .++...++.++.|++++|+|++-+-+... |.. -.+.+|+|..+.+... T Consensus 144 ~~l~~pn~~~~p~~~s~~~f~~~lv~dlll~Gnay~~i~rd~~-------G~~------~~L~~l~p~~V~v~~~----- 205 (551) T protein:vir:80 144 SFIEKTGVDNDINRDSFSSFVKKIVRDTYMYDQVNFEKVFNRN-------QSM------VRFVAKDPTTIFFATT----- 205 (551) T ss_pred HHHHhcCCCCCCccchHHHHHHHHHHHHHhcCCEEEEEEECCC-------CcE------EEEEEeCCceeEEEEC----- Confidence 444322211 14556788889999999999876544211 110 0233343333221100 Q ss_pred cCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchhhhhccccccccccccccccCCceEEEEEEee Q lcl|NC_015158. 187 HSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYG 266 (581) Q Consensus 187 d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g 266 (581) .+ +..... .+. +. T Consensus 206 ----------------------~~-----------------------------------g~~~~~------~~~-y~--- 218 (551) T protein:vir:80 206 ----------------------AD-----------------------------------GKIPDN------GNR-FV--- 218 (551) T ss_pred ----------------------Cc-----------------------------------cccccC------ceE-EE--- Confidence 00 000000 000 00 Q ss_pred eeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHH Q lcl|NC_015158. 267 DYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDL 346 (581) Q Consensus 267 ~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~ 346 (581) ...+|+. ...+.-+.||....||.. ...+..||.|+...+.+.-....+..+.....+.. T Consensus 219 ---~~~~g~~-----~~~~~~~eiiH~~~n~~~------------~~~~~~~G~spi~~a~~~i~~~~a~~~~~~~~f~N 278 (551) T protein:vir:80 219 ---QVIDQKI-----VATFNAREMAFAVRNPRS------------DIYATGYGYPELEIALKQFIAHENTEAFNDRFFSH 278 (551) T ss_pred ---EEeCCcE-----EEEEcccceEEecccCCC------------CcccccccccHHHHHHHHHHHHHHHHHHHHHHHHc Confidence 0001110 111112234444444321 12346799999999998888888888777777777 Q ss_pred hcCCeEEE--ecc--ccc-----c--------c--cCCceeEEe-CCCCCcccccCCCccchhHHHHHHHHHHHHHhcCC Q lcl|NC_015158. 347 IAFPPMKV--KGD--VEE-----F--------V--WGPMEQIYI-NGDGDVEMMAPNTQALQADMQIQILEAKMEEFAGA 406 (581) Q Consensus 347 s~np~~~v--~~d--~~~-----i--------~--~~pG~vi~~-~~~~~i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv 406 (581) .+.|..++ .++ .++ + . ...|++.-+ ..+-.+.++...+...+.....++....+-..-|| T Consensus 279 g~~p~giL~~~~~~~lt~e~~~~lk~~~~~~~~G~~nag~~~vl~~~g~~~~~l~~~~~D~qfle~~~~~~~~Ia~aFgV 358 (551) T protein:vir:80 279 GGTTRGILQIKAAQQQSQHALEIFKREWKNSLSGINGSWQIPVVSAEDVKFVNMTPSARDMEFEKWLNYLINVISALYGI 358 (551) T ss_pred CCCcceEEEEcCCCCCCHHHHHHHHHHHHHHhcCccccCccccccCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhcC Confidence 77777554 221 111 1 0 113444323 23335556654444434444456666778888999 Q ss_pred chHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceeeecCchhcccCCCccCHH Q lcl|NC_015158. 407 PREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADTIRVFDSDDKVATFMNVNKD 486 (581) Q Consensus 407 ~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~iR~~~~~~~~~~~~~v~r~ 486 (581) |+...|....+..++.+.+. ++.++ +....+.|-...+.|++..+-+-+.+.+- +-++ . T Consensus 359 Pp~~lG~~~~~~~~~~~~~s-~t~sn--~e~~~~~f~~~tL~P~~~~ie~~ln~~L~-----~~~~-------------~ 417 (551) T protein:vir:80 359 DPAEINIPNNGGATGSKGGS-LNEGN--SAEKNQASKNKGLQPLLGFIEDFINKHIV-----AEFG-------------D 417 (551) T ss_pred CHHHcCcccccccccccccc-cchhh--HHHHHHHHHHHHHHHHHHHHHHHHHhhhc-----cccC-------------C Confidence 99999975443222222211 11122 23445566667788888877544333221 0011 1 Q ss_pred HhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHHHHHHHHhcCCC----cccc-cCCCCcHHHHHH Q lcl|NC_015158. 487 DITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLAKMLEHNLSLGG----WDIF-KPNVAVMEAQTT 561 (581) Q Consensus 487 di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~~~e~~~l~~----~~~~-~~~~~~~~~~~~ 561 (581) ++ .|++...-.....+++ .+..+... +.+.|.. ..+ .+|++. =|.+ .|..-... . T Consensus 418 ~~--~f~f~~~~~~~~~~~~----~~~~~~~~---g~lT~NE----~R~----~~gl~P~~egGD~~~~~~~~~~~---~ 477 (551) T protein:vir:80 418 KY--TFQFVGGDIKSELESV----KILAEKAK---VAMTVNE----VRK----ELNLPGDVIGGDIPLNGVIVQRI---G 477 (551) T ss_pred ce--EEEeeccChhhHHHHH----HHHHHHhc---CCcCHHH----HHH----HhCCCCCCCCCceeecccccccc---c Confidence 12 2222222111112222 12222211 2233333 322 334422 1222 11110000 0 Q ss_pred HHHHHHHHHHHHHHhcc--cCC Q lcl|NC_015158. 562 SALVNQSQAQIEEEAQV--PLV 581 (581) Q Consensus 562 q~~~q~aq~~~~~~~~~--~~~ 581 (581) +. .++.+.+.+.++.- .+. T Consensus 478 ~~-~~~~~~~~~~~~~~~~~~~ 498 (551) T protein:vir:80 478 QL-MQQEQFEHEKQQSNLQMLQ 498 (551) T ss_pred cc-ccccCcchhhhhhcccccc Confidence 11 11111111111000 000 No 135 >protein:vir:107662 Length: 427 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003893;genbank:gi:45686310;genbank:GeneID:2773002 Probab=56.22 E-value=0.46 Score=22.43 Aligned_cols=388 Identities=11% Similarity=0.058 Sum_probs=149.1 Q ss_pred hhhhHHHHHHHHHHHhhccccccccccccccccccccc--chHHHH-HHH-HHHHHHhhcC--CccEEEeecCChhHHHH Q lcl|NC_015158. 33 QRQEWLSQKSELRNYIFATDTTTTTNSTLPWKNKTTLP--KLCQIR-DNL-HSNYISALFP--NERWLKWEGKSLQDEAK 106 (581) Q Consensus 33 ~r~~~~~~~~~~~~y~~~~~~~~~~~~~~~~k~~~~~p--ki~~~~-d~~-~~~l~~~~f~--~~~~~~~~~~~~~d~~~ 106 (581) -|+-..+-+ .|++.- +..+.......+.+ .++... .+| .....+..-- .++|+++++-. ++ T Consensus 1 ~~~~~~d~~---~~~~~~------~~~~~~~~~~~~~~~~~l~a~Y~~~~l~~~~Vd~~aed~~r~g~~i~g~~--~~-- 67 (427) T protein:vir:10 1 MKIVKHDGY---NDIFNG------GADGSPKPFFMSDASYHVGSFYNDNATAKRIVDVIPEEMVTAGFKMSGVK--DE-- 67 (427) T ss_pred CCccccchH---HHHhhc------CCCCcccCccccCchHHHHHHHHcCchhhhhhccchHHhhcCCccccCcc--HH-- Confidence 222222222 223211 11111111111111 111111 111 1111111000 25788887632 21 Q ss_pred HHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeeeeEeeeeccceEEecchhheeecCCCCCcc Q lcl|NC_015158. 107 RDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESGATRDTYFGPRAVRIDPKDIVFNPVAVDFA 186 (581) Q Consensus 107 ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~~~~~~~~~p~ie~V~p~df~~DP~a~~~~ 186 (581) ++ +...+++-+....+.+.++-.-+||.|++-+-..... .+..|.+ +. .++. T Consensus 68 -~~----~~~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~v~d~~-----------~l~~p~~-----------~~-g~l~ 119 (427) T protein:vir:10 68 -KE----FKSLWDSYKLDSSLVDLLCWARLYGGAAMVAIIKDNR-----------MLTSQAK-----------PG-AKLE 119 (427) T ss_pred -HH----HHHHHHHhhHHHHHHHHHHhccccceeEEEEEecCCC-----------ccccccC-----------CC-ccee Confidence 12 3333344466778888888888999887766432110 1111110 00 0111 Q ss_pred cCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchhhhhccccccccccccccccCCceEEEEEEee Q lcl|NC_015158. 187 HSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYG 266 (581) Q Consensus 187 d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g 266 (581) ....+-+...|. +. ...+.....|+ .+ ++|- T Consensus 120 ~l~v~d~~~~~~--------------------------------~~-----~~~dp~s~~fg------~P------~~y~ 150 (427) T protein:vir:10 120 GVRVYDRFAITV--------------------------------EK-----RVTNARSPRYG------EP------EIYK 150 (427) T ss_pred EEEEechhcccc--------------------------------cc-----cccCccccccC------cc------eEEE Confidence 000000000010 00 00000011111 11 1221 Q ss_pred eeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccccCCcccCCCcHH-hhhhHHHHHHHHHHHHHHHHH Q lcl|NC_015158. 267 DYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIRQDNLYAMGPLD-NLVGMQYRIDHLENLKADVFD 345 (581) Q Consensus 267 ~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p~~~~G~s~~~-~l~d~Q~~~n~~~R~~iDn~~ 345 (581) + . .+++... +.| --..+|++...|. |+. ...-..+||.|++. .+.+.-....+.....-..+. T Consensus 151 -v-~-~~~~~~~---~~i-H~SRli~~~g~~~-----p~~----~~~~~~~~G~S~l~~~~~~~i~~~~~~~~~~~~l~~ 214 (427) T protein:vir:10 151 -V-S-PGDNMQP---YLI-HHSRVFIADGERV-----AQQ----ARKQNQGWGASVLNKSLIDAICDYDYCESLATQILR 214 (427) T ss_pred -E-e-cCCCCcc---eEE-ccccEEEecCCCc-----hhh----hcccCCcccchhhhHHHHHHHHHHHHHHHHHHHHHH Confidence 0 0 0111100 001 1234555544443 221 22346799999875 455655556666666666666 Q ss_pred HhcCCeEEEec--------cc-----------cccccCCceeEEeCCCCCcccccCCCccchhHHHHHHHHHHHHHhcCC Q lcl|NC_015158. 346 LIAFPPMKVKG--------DV-----------EEFVWGPMEQIYINGDGDVEMMAPNTQALQADMQIQILEAKMEEFAGA 406 (581) Q Consensus 346 ~s~np~~~v~~--------d~-----------~~i~~~pG~vi~~~~~~~i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv 406 (581) ...-..+.+.+ +. ....+.-|.+.-.+.+..+..+..+ ...+...+..+.+.+-..+|| T Consensus 215 k~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~l~~~~e~~e~~~~~--lsgl~~~~~~~~~~iaaa~~I 292 (427) T protein:vir:10 215 RKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVLNSD--ISGVPEFLSSKMDRIVSLSGI 292 (427) T ss_pred HhccccccchhHHHHhcCccchHHHHHHHHHHHHhcCcccceeeecCCCceeEEecc--cCChHHHHHHHHHHHHhhhCC Confidence 65555554422 10 0011123444444444555555544 234555677777788888999 Q ss_pred chHh-cCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceeeecCchhcccCCCccCH Q lcl|NC_015158. 407 PREA-MGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADTIRVFDSDDKVATFMNVNK 485 (581) Q Consensus 407 ~~~~-~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~iR~~~~~~~~~~~~~v~r 485 (581) |..- .|.++.+ -.|||=+. .+..-..++..-+..++|++..+++++.+..+ T Consensus 293 P~t~L~G~sp~G-lnstgd~D-----~~nyyd~i~~~Qe~~l~p~l~~l~~~i~~s~~---------------------- 344 (427) T protein:vir:10 293 HEIIIKNKNVGG-VSASQNTA-----LETFYKLVDRKREEDYRPLLEFLLPFIVDEEE---------------------- 344 (427) T ss_pred CeeeeccCCccc-cccchhHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCC---------------------- Confidence 8864 4655433 22332221 12222223333344566666666666544322 Q ss_pred HHhcCCceEEEecchhHHHHH----HHHHHHHHhhcccccccccchhHHHHHHHHHHHHhcCCCcccccCCCCcHHHHHH Q lcl|NC_015158. 486 DDITAKGRLRPVGARHFAEQA----QVVQSLMGIANTPVWQDIKPHVSTENLAKMLEHNLSLGGWDIFKPNVAVMEAQTT 561 (581) Q Consensus 486 ~di~~~~~vva~ga~~~~~r~----q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~~~e~~~l~~~~~~~~~~~~~~~~~~ 561 (581) +...|. +.-.....+++ ..++....+.+. +.+.|...+..|.... ...++..+..+.. ++.+. T Consensus 345 --~~~~f~--pL~~~s~kEkaei~~~~a~a~~~~~~~---gvi~~~e~r~~L~~~~-~~~~~~~~~~~~~-----e~~~~ 411 (427) T protein:vir:10 345 --WSIEFE--PLSVPSKKEESEITKNNVESVTKAITE---QIIDLEEARDTLRSIA-PEFKLKDGNNINI-----REPEE 411 (427) T ss_pred --cEEEeC--CCCCCCHHHHHHHHHHHHHHHHHHHhc---CCCCHHHHHHHHHhhh-ccccCCCCccccc-----cccch Confidence 222221 11111122222 122333333332 3466766665554432 4445544432211 11000 Q ss_pred HHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 562 SALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 562 q~~~q~aq~~~~~~~~~~~~ 581 (581) + ++ +-|.. T Consensus 412 ~-----------~e-~~p~~ 419 (427) T protein:vir:10 412 T-----------TE-PEPGL 419 (427) T ss_pred h-----------cC-CCCCC Confidence 0 00 11222 No 136 >protein:vir:99853 Length: 488 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164068;genbank:gi:56692600;genbank:GeneID:3192581 Probab=53.79 E-value=0.52 Score=22.14 Aligned_cols=394 Identities=11% Similarity=0.049 Sum_probs=147.3 Q ss_pred chhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhcccccccccccccccccccccchHHHH--HHHHHHHHHhhcC-- Q lcl|NC_015158. 14 DTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTTTTTNSTLPWKNKTTLPKLCQIR--DNLHSNYISALFP-- 89 (581) Q Consensus 14 ~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~~~~~~~~~~k~~~~~pki~~~~--d~~~~~l~~~~f~-- 89 (581) =.+-.|..+|+.... ....| +.|....-.............+ .++++.. |.-+++.++..-. T Consensus 1 v~~~~l~~e~at~~~--------~~d~~---~~~~~~l~~~~~~il~~a~~g~---~~~y~~l~~D~~i~s~l~~rk~av 66 (488) T protein:vir:99 1 MEKPALGREIATSGD--------GRDIT---RPFISGLQVPNDSILQRRGGND---LRVYEEILSDAQVKTVWGQRQLAV 66 (488) T ss_pred CCccchhHHHHHHHh--------hhhhh---ccccCCCCCCChHHHHhhccCC---HHHHHHHhhChHHHHHHHHHHHHH Confidence 012234444443221 00111 1111100000000000000000 0111100 1111111111111 Q ss_pred -CccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeeeeEeeeeccc-e Q lcl|NC_015158. 90 -NERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESGATRDTYFGP-R 167 (581) Q Consensus 90 -~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~~~~~~~~~p-~ 167 (581) +.+ +.++|-... .+.....+.|..+|...+|...+..++ |++.||.+++-+-|... .+ ...| + T Consensus 67 ~~~~-w~i~p~~~~--~~~~~~ae~v~~~l~~~~~~~~l~~~l-da~~~G~s~~Ei~w~~~------~g-----~~~~~~ 131 (488) T protein:vir:99 67 VSRE-WKVEAGGDR--PIDQAAAEHLEQQLQRVGWDRVTSKML-FGVFYGYAVSELIYGRD------DR-----YITLEA 131 (488) T ss_pred hcCC-ceEEcCCCC--hHHHHHHHHHHHHHhCCCHHHHHHHHH-hhhhhcceeEEEEEeec------CC-----eeeEee Confidence 233 356664432 222333445555666678877777775 78999999999988421 11 1123 4 Q ss_pred EEecchhheeecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchhhhhccccccccc Q lcl|NC_015158. 168 AVRIDPKDIVFNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDCEKAVGFSMDGF 247 (581) Q Consensus 168 ie~V~p~df~~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 247 (581) +..++|..|.+|+.... . ..+. .+ T Consensus 132 l~~r~~~~f~~d~~~~l------~---~~~~-----------------------------~~------------------ 155 (488) T protein:vir:99 132 IKVRNRRRFRYDQDGGL------R---LLTP-----------------------------NN------------------ 155 (488) T ss_pred eeeecccceeecCCCce------E---Eecc-----------------------------CC------------------ Confidence 66666666666643110 0 0000 00 Q ss_pred cccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccccCCcccCCCcHHhhh Q lcl|NC_015158. 248 GNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIRQDNLYAMGPLDNLV 327 (581) Q Consensus 248 ~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p~~~~G~s~~~~l~ 327 (581) ..+ | .|++.. .-|.+..+.+..++.||.|.+..+- T Consensus 156 -----------------------~~~-------------g--------~~lp~~-~~~i~~~~~~~~g~p~g~gLl~~~~ 190 (488) T protein:vir:99 156 -----------------------MFE-------------G--------EPCPAP-YFWHFSTGADNDDEPYGLGLAHWLY 190 (488) T ss_pred -----------------------CCC-------------c--------cccccC-ceEEEEeecCCCCCcccchHHHHHH Confidence 000 0 011000 0123334455567778888888887 Q ss_pred hHHHHHHHHHHHHHHHHHHhcCCeEEEec-----cccc----------cccCCceeEEeCCCCCcccccCCCccch-hHH Q lcl|NC_015158. 328 GMQYRIDHLENLKADVFDLIAFPPMKVKG-----DVEE----------FVWGPMEQIYINGDGDVEMMAPNTQALQ-ADM 391 (581) Q Consensus 328 d~Q~~~n~~~R~~iDn~~~s~np~~~v~~-----d~~~----------i~~~pG~vi~~~~~~~i~~~~~p~~~~~-~~~ 391 (581) .+=..++-..+...--+.+-+-|.....- +.++ +.+..+.++ ..+..|..+........ +.. T Consensus 191 w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~~a~~~ek~~l~~av~~~~~~~~~vi--P~~~~ie~~ea~~~~~~~~~~ 268 (488) T protein:vir:99 191 WPVFFKRNGIKFWLIFLDKFGMPTAVGRYDDKTATPEDKAKLLAALHAIQTDSAIIM--PAGMQAELLEAGRSGTADYKT 268 (488) T ss_pred HHHHHHHhhHHHHHHHHHHcCCceeeeecCCCCCCHHHHHHHHHHHHHHhcCcEEEe--cCCceeEEeecCCCChHHHHH Confidence 77777777777777777776666543221 1111 122233333 33344555544333222 455 Q ss_pred HHHHHHHHHHHh-cCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceeee Q lcl|NC_015158. 392 QIQILEAKMEEF-AGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADTIRV 470 (581) Q Consensus 392 ~lq~~~~~~ee~-TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~iR~ 470 (581) ++++.+..+.+. .|- ...+.++++...++.+.. +-...+++..++.+.+++-+.|+..++.+ |.......++ T Consensus 269 li~~~d~~Isk~iLGq--tlts~~~~Gs~a~~~vh~--~v~~d~~~aDa~~i~~tln~~li~~l~~~---N~~~~~~p~~ 341 (488) T protein:vir:99 269 LHDTMDATIAKVGLGQ--VASTQGTPGRLGNDDLQA--DVRLDLVKADADLICESFNLGPARWLTEW---NFPGAQPPRV 341 (488) T ss_pred HHHHHHHHHHHHHhhh--hhcccccccchhhHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHh---CcCCcCCcee Confidence 688888877664 232 222222221121222221 11222333344444444444555555444 2211111111 Q ss_pred cCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHHHHHHHHhcCCCccccc Q lcl|NC_015158. 471 FDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLAKMLEHNLSLGGWDIFK 550 (581) Q Consensus 471 ~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~~~e~~~l~~~~~~~ 550 (581) . |....++|+ .+++..++.|..+.+. .+ + ...+.+.+|++..+.-. T Consensus 342 ~--------~~~~e~edl--------------~~~a~~~~~l~~~~G~----~i----~----~~~i~e~~Gip~~~~~~ 387 (488) T protein:vir:99 342 Y--------RVIEEPEDI--------------TAKAERDEKVFRMSGF----RP----T----RGYVQETYGVEVESTQA 387 (488) T ss_pred E--------ecCCCcccH--------------HHHHHHHHHHHhhcCC----CC----C----HHHHHHHcCCCCccccc Confidence 0 111122222 2233333444332211 11 1 23445666666543211 Q ss_pred CCCCcHHH----------HHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 551 PNVAVMEA----------QTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 551 ~~~~~~~~----------~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) +.+.+... ...+.+..++... .+.+-.+++ T Consensus 388 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~ 427 (488) T protein:vir:99 388 EATAPTPSTEFAEGDQPSDPAAAMAPQLAEA-MQPVVGNWT 427 (488) T ss_pred ccccCCCcccCCCCCCCCCchHHHHHHHHHH-HHHHHHHHH Confidence 11100000 0001111101000 011111222 No 137 >protein:vir:104338 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398967;genbank:gi:81343951;genbank:GeneID:3778870 Probab=50.51 E-value=0.61 Score=21.77 Aligned_cols=388 Identities=11% Similarity=0.068 Sum_probs=146.0 Q ss_pred hccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhcccccccccccccccccccccchHH-HH-HHHHHHHHHhhc Q lcl|NC_015158. 11 MLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTTTTTNSTLPWKNKTTLPKLCQ-IR-DNLHSNYISALF 88 (581) Q Consensus 11 ~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~~~~~~~~~~k~~~~~pki~~-~~-d~~~~~l~~~~f 88 (581) |.- .|+++.-+.. . ...+.. ..+..+.+--.++. -. .++.....+..- T Consensus 1 ~~~--~D~~~n~~~g-------------------------g--~~~~~~-~~~~~~~~~~~l~a~Y~~~~l~~~~Vd~~a 50 (422) T protein:vir:10 1 MVK--TDSYANIFLG-------------------------G--SDGSEI-YGSLQNQAPTILASLYADNALVRRIIDTIP 50 (422) T ss_pred Ccc--chhhHHHHcC-------------------------C--CCCccc-cCcccccCHHHHHHHHHhChhhHHHHhhhh Confidence 331 2332222111 0 000000 00111110000000 00 111111111100 Q ss_pred C--CccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeeeeEeeeeccc Q lcl|NC_015158. 89 P--NERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESGATRDTYFGP 166 (581) Q Consensus 89 ~--~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~~~~~~~~~p 166 (581) . -++|+++++...+ . + +...+++-+....+.+.++-.-+||.|.+-+-..+.. .+..| T Consensus 51 ed~~r~g~~i~~~~~~--~---~----~~~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~v~d~~-----------~~~~P 110 (422) T protein:vir:10 51 ETALAAGFHIDGIDDE--P---A----FWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNR-----------ALTSP 110 (422) T ss_pred HHHhcCCccccCCCHH--H---H----HHHHHHHhhHHHHHHHHHHhhccccceEEEEEecCCC-----------Ccccc Confidence 0 2678888754321 1 1 2334444467788888888888999986665431110 11111 Q ss_pred eEEecchhheeecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchhhhhcccccccc Q lcl|NC_015158. 167 RAVRIDPKDIVFNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDCEKAVGFSMDG 246 (581) Q Consensus 167 ~ie~V~p~df~~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 246 (581) +.+ . .++.....+-+...|. .. ...+..... T Consensus 111 ----l~~-------~-g~~~~l~v~d~~~i~~--------------------------------~~-----~~~dp~s~~ 141 (422) T protein:vir:10 111 ----VRE-------G-AELETVRVYDRTQVKV--------------------------------QT-----REENPRNAR 141 (422) T ss_pred ----ccc-------c-CceeeEEeeccccccc--------------------------------hh-----cccCccccc Confidence 110 0 0111100000111110 00 000000111 Q ss_pred ccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccccCCcccCCCcHHh- Q lcl|NC_015158. 247 FGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIRQDNLYAMGPLDN- 325 (581) Q Consensus 247 ~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p~~~~G~s~~~~- 325 (581) |+ .+ ++|- +.....+.. ..+ --..+|++...|. |+. ......+||.|++.. T Consensus 142 fg------~P------~~y~-v~~~~~~~~--~~i----H~SRli~~~g~~~-----p~~----~~~~~~~~G~S~l~~~ 193 (422) T protein:vir:10 142 FG------EP------LTYR-ITTNESDMF--YDV----HYSRIHIIDGERI-----PNV----MRRQNDGWGRSVLSSD 193 (422) T ss_pred cC------cc------eEEE-EecCCCCcc--eee----ccceeEEeCCCCc-----hhh----hcccCCcccchhHHHH Confidence 11 11 1110 000001100 001 1223444443332 221 223467899998875 Q ss_pred hhhHHHHHHHHHHHHHHHHHHhcCCeEEEecc------cc-------------ccccCCceeEEeCCCCCcccccCCCcc Q lcl|NC_015158. 326 LVGMQYRIDHLENLKADVFDLIAFPPMKVKGD------VE-------------EFVWGPMEQIYINGDGDVEMMAPNTQA 386 (581) Q Consensus 326 l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~d------~~-------------~i~~~pG~vi~~~~~~~i~~~~~p~~~ 386 (581) +.+.-..........-..+....-..+.+.+. .. ...+..|.+.-.+.+..++.+..+ . T Consensus 194 ~~~~i~~~~~~~~~~~~l~~~~~~~v~~~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~l~~~~e~~e~~~~~--l 271 (422) T protein:vir:10 194 ILDSIKDYTNCERLATQLLKRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSD--I 271 (422) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccccccchhHHHhcCCccchHHHHHHHHHHHHhcCCccceeEecCCcceEEEecc--c Confidence 55766666667777666666665555554321 00 011223444444444556655544 3 Q ss_pred chhHHHHHHHHHHHHHhcCCchHhc-CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcc Q lcl|NC_015158. 387 LQADMQIQILEAKMEEFAGAPREAM-GIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVA 465 (581) Q Consensus 387 ~~~~~~lq~~~~~~ee~TGv~~~~~-G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~ 465 (581) ..+...+..+.+.+-..+|||.... |.++ ++-.|||=+. .+..-..++..-+..++|++..++.++.+..+ T Consensus 272 sgl~~~~~~~~~~iaaa~~IP~t~L~G~s~-~Glnatgd~d-----~~~yyd~i~~~Qe~~l~p~l~~l~~~i~~s~~-- 343 (422) T protein:vir:10 272 GGIDAFLDKKFDRIVALSGIHEIILKNKNV-GGVSSSQNTA-----LETFHKLVDRKRNAELLPILEFLIPFIVNAEE-- 343 (422) T ss_pred CChHHHHHHHHHHHHhhhCCCeeeeccCCc-ccccccchHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHhcccCC-- Confidence 3455667778888888899998744 6653 3333333221 12222223333344566666666666543211 Q ss_pred ceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHH----HHHHHHHhhcccccccccchhHHHHHHHHHHHHh Q lcl|NC_015158. 466 DTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQ----VVQSLMGIANTPVWQDIKPHVSTENLAKMLEHNL 541 (581) Q Consensus 466 ~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q----~~q~L~~~~~~~~~~~i~p~~~~~~l~~~~~e~~ 541 (581) +...|. +.-.....+|+. .++....+.+ .+.+.|...++.|.... ... T Consensus 344 ----------------------~~~~f~--pL~~~sekekaei~~~~a~a~~~~~~---~g~i~~~e~r~~L~~~~-~~~ 395 (422) T protein:vir:10 344 ----------------------WSVEFN--PLAQESSKDKAEILEKNVNSIAALIA---AGAMDIDEARDTLRTIA-PEV 395 (422) T ss_pred ----------------------cEEEeC--CCCCCCHHHHHHHHHHHHHHHHHHHh---cCCCCHHHHHHHhhhhc-ccc Confidence 222221 111111122221 2223333332 24566666665654332 222 Q ss_pred cCCCcccccCCCCcHHHHHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 542 SLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 542 ~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) ++.+ ++.. .++.++. . .--|.. T Consensus 396 ~~~~-~~~~--~~~~~~~-~--------------~~~~~~ 417 (422) T protein:vir:10 396 KIND-GSVE--TEVTISE-T--------------SNDPLE 417 (422) T ss_pred cCCC-CCCc--cccchhh-c--------------CCCCCC Confidence 3322 1100 0000000 0 000000 No 138 >protein:vir:63755 Length: 547 # NCBI annotation: gp14 # Family: family:all:2446 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547619;genbank:GeneID:3783506 Probab=39.64 E-value=1 Score=20.56 Aligned_cols=428 Identities=14% Similarity=0.115 Sum_probs=154.6 Q ss_pred Cccchhhhhhhcc--chhhhHHHHHHHH----HhhHHhhhhhHHHHHHHHHHHhhcccccccccc-ccc----------- Q lcl|NC_015158. 1 MTGKVLELQQMLD--DTRDGLAEQIANT----WQNWNSQRQEWLSQKSELRNYIFATDTTTTTNS-TLP----------- 62 (581) Q Consensus 1 ~~~~~~~~~~~~~--~~~~~~a~~i~~~----~~~~~~~r~~~~~~~~~~~~y~~~~~~~~~~~~-~~~----------- 62 (581) +..+.....-..+ +-++.++-.+.+. =..+.+....... .- .++-.. ...|-. +-- T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~---~~--~~~~~~-~~~g~~~~~~~~~~~~l~~l~ 80 (547) T protein:vir:63 7 IRLAGVNKSDAVKHIEVDDNYSIAIQQREQEQISKAMNNKEVAYS---QP--VIGSMS-ANPGFKTKPSIRNNQDLHGVL 80 (547) T ss_pred hhhhcCCccccccccccccccchhhhhhhHHHHHHhhcccchhhh---ch--hhheee-cccccccCCccCChhHHHHHH Confidence 0000000000000 0011111111111 0011111111110 00 010000 011100 000 Q ss_pred --ccccccccchHHHHHHHHHHHHHhhcCCccE--EEeecCC------hhHHHHHHHHHHHHHHHHHh-----cchHHHH Q lcl|NC_015158. 63 --WKNKTTLPKLCQIRDNLHSNYISALFPNERW--LKWEGKS------LQDEAKRDAIQQYMDNKVKE-----SDFRTIM 127 (581) Q Consensus 63 --~k~~~~~pki~~~~d~~~~~l~~~~f~~~~~--~~~~~~~------~~d~~~ae~~~~~i~~~l~e-----~n~~~~~ 127 (581) ..++..+..|..++.+-++.+-...+-..+. |++..+. ..+..+...++++|....-. .++...+ T Consensus 81 ~~~~~npiv~~~I~~~a~~ia~~~~~~~~~~~~~~~~ir~k~~~~~~~~~~~~~~~~l~~~l~~pn~~~~p~~~s~~~f~ 160 (547) T protein:vir:63 81 KKFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIKRIESFIEKTGVDNDINRDSFSSFV 160 (547) T ss_pred HHhhcCHHHHHHHHHHHHHHhhhhhhhhhhccCCCceeEecccccccChhhHHHHHHHHHHHHhhCCCCCCccchHHHHH Confidence 0112233444444443344333222233222 3333332 22233344445555432211 1355678 Q ss_pred HHHHHHHhhcCceEEEEeeecceeeeeeeeeEeeeeccceEEecchhheeecCCCCCcccCceEEEEEecHHHHHHHhhc Q lcl|NC_015158. 128 SQLLLDYIDYGNCFATVEYVKETTKDEESGATRDTYFGPRAVRIDPKDIVFNPVAVDFAHSPKIIRTVLNEGELLQMEQD 207 (581) Q Consensus 128 ~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~~~~~~~~~p~ie~V~p~df~~DP~a~~~~d~~~i~r~~~T~~el~~m~~~ 207 (581) +.++.|++++|+|++-+-+... |.. -.+.+|+|..+.+... T Consensus 161 ~~lv~d~ll~Gn~~~~i~rd~~-------G~~------~~L~~l~p~~V~~~~~-------------------------- 201 (547) T protein:vir:63 161 KKIVRDTYMYDQVNFEKVFNRN-------QSM------VRFVAKDPTTIFFATT-------------------------- 201 (547) T ss_pred HHHHHHHHhhCCEEEEEEECCC-------CcE------EEEEEecCceeEEEEC-------------------------- Confidence 8899999999999876654211 110 0233443332211100 Q ss_pred cCccchhHHHHHHHHhhhccCCcccchhhhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeC Q lcl|NC_015158. 208 QPENASLASAIARRREFRRGLGTYTREDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDR 287 (581) Q Consensus 208 ~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g 287 (581) .+ ++.... .+. +. ...+|.. ...+.- T Consensus 202 -~~-----------------------------------g~~~~~------~~~-y~------~~~~~~~-----~~~~~~ 227 (547) T protein:vir:63 202 -AD-----------------------------------GKIPDN------GNR-FV------QVIDQKI-----VATFNA 227 (547) T ss_pred -Cc-----------------------------------cccccC------ceE-EE------EEcCCcE-----EEEecc Confidence 00 000000 000 00 0001111 111112 Q ss_pred CEEEEeecCCCccCCCCeeEecccccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEE--ecc--ccc--- Q lcl|NC_015158. 288 MFVIEEKENPSWFAQAPIFHCGWRIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKV--KGD--VEE--- 360 (581) Q Consensus 288 ~~iir~~~nP~~~g~~Pf~~~~~~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v--~~d--~~~--- 360 (581) +.||....||.. ......||.|+...+...-....+..+.....+...+.|..++ .++ .++ T Consensus 228 ~eiih~r~n~~~------------~~~~~~~G~Spi~~~~~~i~~~~~a~~~~~~~f~Ng~~p~giL~~~~~~~ls~e~~ 295 (547) T protein:vir:63 228 REMAFAVRNPRS------------DIYATGYGYPELEIALKQFIAHENTEAFNDRFFSHGGTTRGILQIKAAQQQSQHAL 295 (547) T ss_pred ccEEEecccCCC------------CcccccccccHHHHHHHHHHHHHHHHHHHHHHHHcCCCcceEEEecCCCCCCHHHH Confidence 334444444321 1234678999999888887777777777766666666676443 221 111 Q ss_pred --c--------c--cCCceeEEe-CCCCCcccccCCCccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHH Q lcl|NC_015158. 361 --F--------V--WGPMEQIYI-NGDGDVEMMAPNTQALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQL 427 (581) Q Consensus 361 --i--------~--~~pG~vi~~-~~~~~i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l 427 (581) + . ...|++.-+ ..+..+.++...+...+.....++....+-..-|||+...|....+..++.+.+. T Consensus 296 ~~lk~~~~~~~~G~~nagk~~vl~~~g~~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPP~~lG~~~~~~~~~~~~~s- 374 (547) T protein:vir:63 296 EIFKREWKNSLSGINGSWQIPVVSAEDVKFVNMTPSARDMEFEKWLNYLINVISALYGIDPAEINIPNNGGATGSKGGS- 374 (547) T ss_pred HHHHHHHHHHhcCcccccccccccCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCcccccccccccccc- Confidence 1 0 113444222 2333455555444433444445666677888899999999974433222211111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHH Q lcl|NC_015158. 428 QNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQ 507 (581) Q Consensus 428 ~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q 507 (581) ++.++ +....+.|-...+.|++..+-+-+...+- +-++ ..+ .|++...-.....++++ T Consensus 375 ~t~sn--~e~~~~~~~~~tL~P~~~~ie~~ln~~L~-----~~~~-------------~~~--~~~f~~~~~~~~~~~~~ 432 (547) T protein:vir:63 375 LNEGN--SAEKNQASKNKGLQPLLGFIEDFINKHIV-----AEFG-------------DKY--TFQFVGGDIKSELESVK 432 (547) T ss_pred cchhh--HHHHHHHHHHHHHHHHHHHHHHHHHhhcc-----cccC-------------Cce--EEEeeccccccHHHHHH Confidence 11111 24444566666788888877544332210 0011 112 22222221212222222 Q ss_pred HHHHHHHhhcccccccccchhHHHHHHHHHHHHhcCCC----ccc-ccCCCCcHHHHHHHHHHHHHHHHHHHH------- Q lcl|NC_015158. 508 VVQSLMGIANTPVWQDIKPHVSTENLAKMLEHNLSLGG----WDI-FKPNVAVMEAQTTSALVNQSQAQIEEE------- 575 (581) Q Consensus 508 ~~q~L~~~~~~~~~~~i~p~~~~~~l~~~~~e~~~l~~----~~~-~~~~~~~~~~~~~q~~~q~aq~~~~~~------- 575 (581) +..+... +.+.|. +..+ .+|++. =|. +.|..-... -+. +++.+.+.+.. T Consensus 433 ----~~~~~~~---g~lT~N----E~R~----~~gl~P~~egGD~~~~~~~~~~~---~~~-~~~~~~~~~~~~~~~~~~ 493 (547) T protein:vir:63 433 ----ILAEKAK---VAMTVN----EVRK----ELNLPGDVIGGDIPLNGVIVQRI---GQL-MQQEQFEHEKQQSNLQML 493 (547) T ss_pred ----HHHHHhC---CCcCHH----HHHH----HhCCCCCCCCCceeecccccccc---ccc-ccccCCccccchhhcccc Confidence 2222211 123333 3332 334432 122 111110000 000 01111111100 Q ss_pred --------hcccCC Q lcl|NC_015158. 576 --------AQVPLV 581 (581) Q Consensus 576 --------~~~~~~ 581 (581) .+-|-. T Consensus 494 ~~~~~~~~~~~~~~ 507 (547) T protein:vir:63 494 QEQTGNRVSTDVED 507 (547) T ss_pred ccccCCCCCCCCCC Confidence 000001 No 139 >protein:vir:108215 Length: 469 # NCBI annotation: gp6 # Family: family:all:2372 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552335;genbank:gi:160700655;genbank:GeneID:5758935 Probab=39.42 E-value=1 Score=20.54 Aligned_cols=413 Identities=11% Similarity=0.078 Sum_probs=157.7 Q ss_pred Cccchh------hhhhhccchhhhHHHHHHHHHhhHHhhhhhHHHHHHH---HHHHhhcccccccccccccccccccccc Q lcl|NC_015158. 1 MTGKVL------ELQQMLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSE---LRNYIFATDTTTTTNSTLPWKNKTTLPK 71 (581) Q Consensus 1 ~~~~~~------~~~~~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~---~~~y~~~~~~~~~~~~~~~~k~~~~~pk 71 (581) |+-.|. ++.++.. .-+..-+.-|+...+.-+=.|.+ +..-+-..| ++ T Consensus 1 ~~~~~~~~~p~~~~g~~~~-------~~~~~~~~~~~~~e~~~~lr~~~~~~ly~~m~e~D-----------------~~ 56 (469) T protein:vir:10 1 MTERVKTAAPVSEAGYVFG-------SGVVDGWTVWDPFEQTPELQWPQSVAVYSRMDNED-----------------SR 56 (469) T ss_pred CCCcccCCCCccchhhhhh-------cccccchhhccccccccccccccchHHHHHHHhhC-----------------hH Confidence 333222 2222221 00111122222111111111111 100001111 33 Q ss_pred hHHHHHHHHHHHHHhhcCCccEEEeecCChhHHHHHHHHHHHHHHHH-------------HhcchHHHHHHHHHHHhhcC Q lcl|NC_015158. 72 LCQIRDNLHSNYISALFPNERWLKWEGKSLQDEAKRDAIQQYMDNKV-------------KESDFRTIMSQLLLDYIDYG 138 (581) Q Consensus 72 i~~~~d~~~~~l~~~~f~~~~~~~~~~~~~~d~~~ae~~~~~i~~~l-------------~e~n~~~~~~~~~~d~~~~G 138 (581) |....+.....+. +.+| +++|-..++ ++++...+.|...+ .+..|+..+.+.+.+++.|| T Consensus 57 i~s~l~~rk~av~-----~~~w-~v~p~~~~~-e~~~~~~~~L~~~~~~~~~~~~~~~~~~~~~w~~~l~~~l~~a~~~G 129 (469) T protein:vir:10 57 VTSLLEAISLPIR-----STPW-RIRANGASD-EVTEFVSRNLMVPIDGEDDVRNPGRSRGRFSWAEHLEEVTSPTLQFG 129 (469) T ss_pred HHHHHHHHHHHHh-----cCCc-eEecCCCCH-HHHHHHHHHHHhhhhhhhhhhhhhhhhccccHHHHHHHHHHHhhhhC Confidence 3333333332222 3343 466655443 44444444433322 23468888999999999999 Q ss_pred ceEEEEeeecceeeeeeeeeEeeeeccceEEecchh---heeecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhH Q lcl|NC_015158. 139 NCFATVEYVKETTKDEESGATRDTYFGPRAVRIDPK---DIVFNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLA 215 (581) Q Consensus 139 ~~i~k~~~~~~~~~~~~~~~~~~~~~~p~ie~V~p~---df~~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~ 215 (581) +++.-+-|....+. .+|... -.++..+++. .|++++.-.. + ..+ T Consensus 130 ~s~~Eivw~~~~~~--~dG~~~----~~~l~~rp~~~i~~~~~~~~~~l------~---~~~------------------ 176 (469) T protein:vir:10 130 HAVFEQVYRPRNQS--PDGRFW----LRKLAPRPQWTISKFNVAPDGGL------E---SIE------------------ 176 (469) T ss_pred ceeeeeeeeccccc--CCCcee----eeeeeecCcccceeeeeccCCce------e---eee------------------ Confidence 99998888533110 011111 0123333332 2333322000 0 000 Q ss_pred HHHHHHHhhhccCCcccchhhhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeec Q lcl|NC_015158. 216 SAIARRREFRRGLGTYTREDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKE 295 (581) Q Consensus 216 ~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~ 295 (581) .....+.++...+. ..+. . T Consensus 177 ------------------------------------~~~~~~~~~~~~~~-------------------~~~~------~ 195 (469) T protein:vir:10 177 ------------------------------------QIAPPARTRGSLYV-------------------ANIA------P 195 (469) T ss_pred ------------------------------------ecCccccccccccc-------------------CCCC------c Confidence 00000000000000 0000 0 Q ss_pred CCCccCCCCeeEecccccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEec----c----------cccc Q lcl|NC_015158. 296 NPSWFAQAPIFHCGWRIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKG----D----------VEEF 361 (581) Q Consensus 296 nP~~~g~~Pf~~~~~~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~----d----------~~~i 361 (581) .|.|..+ |.+..+.+..++.||.|++..+-..=.-++...+...-=+.+-..|.....- + ...+ T Consensus 196 ~~lp~~k--~i~~~~~~~~g~p~g~gLlr~~~~~~~fK~~~~~~w~~f~EryG~P~~vgky~~~a~~~ek~~l~~a~~~~ 273 (469) T protein:vir:10 196 PEIPVNR--LVVYTRNKRPGQWQGKSILRSAYKHWLLKDKLLRIEAATAERNGMGIPVGTASSATDEDEVRKMAALARSV 273 (469) T ss_pred cccccCc--EEEEEecCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCcceEEecCCCCCHHHHHHHHHHHHHH Confidence 0111112 4666677778889999999999999888888888888888887777543321 1 1112 Q ss_pred ccCCceeEEeCCCCCcccccCCCccchhHHHHHHHHHHHHHh-cCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015158. 362 VWGPMEQIYINGDGDVEMMAPNTQALQADMQIQILEAKMEEF-AGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIM 440 (581) Q Consensus 362 ~~~pG~vi~~~~~~~i~~~~~p~~~~~~~~~lq~~~~~~ee~-TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r 440 (581) .+.....+-+..+..|..+...........++++.+..+.+. .|-+-.+.|. ++...++.+. .+-...+++..++ T Consensus 274 ~~g~~a~~iip~~~~ie~~ea~g~~~~~~~li~~~d~~Isk~iLG~tlTs~~~--gGS~a~~~vh--~ev~~d~~~sDa~ 349 (469) T protein:vir:10 274 RGGINAGVGLAQGQILELLGVSGNLPDIRRAIEGHDRSIALSGLAHFLNLDGK--GGSYALASVL--EDPFTQAVHAYAT 349 (469) T ss_pred hcCCceEEEccCCceEEEeecCCCchHHHHHHHHHHHHHHHHHhcccccccCc--cchhhHHHHH--HHHHHHHHHHHHH Confidence 221111222344555666555444444666788888776553 3322222211 1111111111 1111223333444 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccc Q lcl|NC_015158. 441 NFEVMLMEKVLNAMLEISRRNLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPV 520 (581) Q Consensus 441 ~f~~~~~~~li~~~~~f~~~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~ 520 (581) .+.+++-+.||..++.+ |.. .. ...|.+.. +++ .. .....+..++.|..+ T Consensus 350 ~i~~tln~~li~~l~~l---N~g---------~~-~~~P~~~~--~~~---------e~-~~~~~a~~i~~l~~~----- 399 (469) T protein:vir:10 350 SICRIANQHIIEDLVDI---NFG---------VD-TPAPVLTF--DPI---------GS-RQDLTAAAVKLLYDA----- 399 (469) T ss_pred HHHHHHHHHHHHHHHHh---cCC---------CC-CCccEEEe--cCC---------CC-cHHHHHHHHHHHHhc----- Confidence 44444444555555444 221 10 00011100 000 00 001122223333221 Q ss_pred cccccchhHHHHHHHHHHHHhcCCCcccccCCCCcHHHHHHHHHHHHHHHH--------------HHHHhcccCC Q lcl|NC_015158. 521 WQDIKPHVSTENLAKMLEHNLSLGGWDIFKPNVAVMEAQTTSALVNQSQAQ--------------IEEEAQVPLV 581 (581) Q Consensus 521 ~~~i~p~~~~~~l~~~~~e~~~l~~~~~~~~~~~~~~~~~~q~~~q~aq~~--------------~~~~~~~~~~ 581 (581) |..+ +. +.....+.+.+||+....-.+...+ ++ .++...+..++ ..+-....|. T Consensus 400 G~~~-~~---~~~~~~~~e~~gip~~~~~~~~~~~--~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~ 467 (469) T protein:vir:10 400 GVFD-DD---PAVKRAIRQRFNLPSELNDTPSAEP--EE-PAAVPNQSAAPARTRSSGNADARARAPKADQGVLF 467 (469) T ss_pred CCcc-Cc---cccHHHHHHHhCCCCCCCCcccccc--hh-cccCCCCCccccccCCCCCcccccccCCChHHhhc Confidence 1111 10 1234567788888865322221111 10 00000000000 0000000000 No 140 >protein:vir:80796 Length: 574 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504121;genbank:gi:158079308;genbank:GeneID:5666445 Probab=36.87 E-value=1.1 Score=20.25 Aligned_cols=431 Identities=13% Similarity=0.087 Sum_probs=166.9 Q ss_pred CccchhhhhhhccchhhhHHHHHHHHHhhHHhhhhhHH-HHHHHHHHHhhccccc----ccccc---cccccccccccch Q lcl|NC_015158. 1 MTGKVLELQQMLDDTRDGLAEQIANTWQNWNSQRQEWL-SQKSELRNYIFATDTT----TTTNS---TLPWKNKTTLPKL 72 (581) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~a~~i~~~~~~~~~~r~~~~-~~~~~~~~y~~~~~~~----~~~~~---~~~~k~~~~~pki 72 (581) |..+.++.....-+ -+.-..|.+..+ .+.+.. +.-....++--...++ .+.+. .....++.+++.| T Consensus 27 ~~~~~~~~~~~~~~--~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~iv~~~ 100 (574) T protein:vir:80 27 MHLREIDTNVVNNE--PYSMESIEKGMN----GKTTAYMQPIIGEMSVNPGYKTKPSIRNSQDLHKTLKKFGNNIILNAI 100 (574) T ss_pred cccchhhhhhhhcc--CCCHHHHHHhHh----hhcccccchhhhhccccccccCcCccCCcccHHHHHHhhccChhHHHH Confidence 33333332222100 000011222211 010110 0000011111111111 00000 0011233445666 Q ss_pred HHHHHHHHHHHHHhhcCCccE--EEeecCChh--H--HHHHH--HHHHHHHHHHHh-----cchHHHHHHHHHHHhhcCc Q lcl|NC_015158. 73 CQIRDNLHSNYISALFPNERW--LKWEGKSLQ--D--EAKRD--AIQQYMDNKVKE-----SDFRTIMSQLLLDYIDYGN 139 (581) Q Consensus 73 ~~~~d~~~~~l~~~~f~~~~~--~~~~~~~~~--d--~~~ae--~~~~~i~~~l~e-----~n~~~~~~~~~~d~~~~G~ 139 (581) ..++..-++.+...+..+..- +++..+..+ + .+.++ -+..++.+.... ..+...++.++.+++++|+ T Consensus 101 i~~~~~~V~~~~~~i~~~ia~lp~~i~~kd~~~~~~~~~~~~~~~l~~ll~~~~~~~nP~~~s~~ef~~~lv~~lll~Gn 180 (574) T protein:vir:80 101 INTRSNQVSMYCKPARNSETGVGYEIRLKDIEAEPTSHDIANIKRIESFLENTAQFRDPNRDNFTTFCKKLVRATYMYDQ 180 (574) T ss_pred HHHHHHHHHHHHHHHHhhhccCceEEEEeccCCCccchhhhhhhHHHHHHhccCCCCCCccccHHHHHHHHHHHHHhcCC Confidence 655555555555555444322 223322221 1 11221 233444433221 2456778889999999999 Q ss_pred eEEEEeeecceeeeeeeeeEeeeeccceEEecchhheeecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHH Q lcl|NC_015158. 140 CFATVEYVKETTKDEESGATRDTYFGPRAVRIDPKDIVFNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIA 219 (581) Q Consensus 140 ~i~k~~~~~~~~~~~~~~~~~~~~~~p~ie~V~p~df~~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~ 219 (581) |++-+.+... |.. -.+..|+|..+.+..... ++.. T Consensus 181 ayi~i~r~~~-------G~~------~~L~pl~p~~V~v~~d~~----------------------------~~~~---- 215 (574) T protein:vir:80 181 VNFEKVFDKD-------GNF------IKFDTVDPTTIFLATNGE----------------------------GKLI---- 215 (574) T ss_pred eEEEEEECCC-------CcE------EEEEEEcCceeEEEEcCc----------------------------cccc---- Confidence 9876544211 111 023444444443321100 0000 Q ss_pred HHHhhhccCCcccchhhhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCc Q lcl|NC_015158. 220 RRREFRRGLGTYTREDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSW 299 (581) Q Consensus 220 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~ 299 (581) . ++ .++ +...+|+. ...+..+.+|....||.+ T Consensus 216 -------------~-----------~~-------------~~y------~~~~~g~~-----~~~~~~~eiih~~~~~~~ 247 (574) T protein:vir:80 216 -------------K-----------NG-------------ERF------VQVIDNRI-----VAKFNERELAFAVRNPRA 247 (574) T ss_pred -------------c-----------Cc-------------eEE------EEEeCCce-----EEEEccccEEEEeccCCC Confidence 0 00 000 00001110 111122234333333311 Q ss_pred cCCCCeeEecccccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEe--cc----ccc---c--------c Q lcl|NC_015158. 300 FAQAPIFHCGWRIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVK--GD----VEE---F--------V 362 (581) Q Consensus 300 ~g~~Pf~~~~~~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~--~d----~~~---i--------~ 362 (581) ......||.|+.+.+...-....++.+....-+...+.|..++. .+ .+. + . T Consensus 248 ------------~~~~~~~G~spi~~a~~~i~~~~~a~~~~~~~f~ng~~p~gil~~~~~~~ls~e~~~~lk~~~~~~~~ 315 (574) T protein:vir:80 248 ------------DIEVGQYGYPELEIALKQFIAHENTEVFNDRFFSHGGTTRGILHVKTGQQQSQQALDIFRREWRSSLA 315 (574) T ss_pred ------------CcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhc Confidence 11235799999999998888888888877777777777875442 21 111 1 1 Q ss_pred --cCCceeEEe-CCCCCcccccCCCccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHH Q lcl|NC_015158. 363 --WGPMEQIYI-NGDGDVEMMAPNTQALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKI 439 (581) Q Consensus 363 --~~pG~vi~~-~~~~~i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~ 439 (581) ...|++.-+ ..+..+.++...+...+.....++...++-..-|||+...|....+.-++++...+ +.+ ...... T Consensus 316 G~~n~g~~~vl~~~G~~~~~l~~s~~D~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~t~~gs~~~~~-n~s--n~E~~~ 392 (574) T protein:vir:80 316 GINGSWQIPVVSAEDVKFVNMTPSANDMQFEKWLNYLINVISALYGIDPAEINFPNNGGATGSKGGSL-NEG--NSKEKM 392 (574) T ss_pred cccccccceeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhcccccccccccccccc-cch--hHHHHH Confidence 123554333 44556667765544444445567777788889999999999765544444433321 112 224445 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhccc Q lcl|NC_015158. 440 MNFEVMLMEKVLNAMLEISRRNLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTP 519 (581) Q Consensus 440 r~f~~~~~~~li~~~~~f~~~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~ 519 (581) +.|-...+.|++..+-+-+.+.+- +-++. .+ .+++..... ..+.+..+.. .+.. T Consensus 393 ~~f~~~tL~P~~~~ie~~ln~~Ll-----~~~~~-------------~~--~~~f~~~d~---~~~~~~~~~~-~~~~-- 446 (574) T protein:vir:80 393 QASQNKGLQPLLRFIEDTVNTYIV-----AEFGE-------------KY--QFQFRGGDL---SAQLDKLKII-EQEG-- 446 (574) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhh-----hhcCC-------------ce--EEEecccch---hhHHHHHHHH-HHHh-- Confidence 566666778888777544433220 00110 11 112221111 1122211111 1111 Q ss_pred ccccccchhHHHHHHHHHHHHhcCCCc---ccc-cCCCCcHHHHHHHHHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 520 VWQDIKPHVSTENLAKMLEHNLSLGGW---DIF-KPNVAVMEAQTTSALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 520 ~~~~i~p~~~~~~l~~~~~e~~~l~~~---~~~-~~~~~~~~~~~~q~~~q~aq~~~~~~~~~~~~ 581 (581) .+-+.|...+ . .+|++.. +.+ .|--..+..+..+.-.... +..++....++- T Consensus 447 -~G~lT~NE~R----~----~lgl~Pi~gGD~~~~~~n~~~~~~~~~~~~~~~-~~~~~~~~~~~~ 502 (574) T protein:vir:80 447 -KVFRTVNEIR----H----DKGLEPIKGGDVILNGVHIQAIGQALQEEQLEY-QRSQDRLNRLLE 502 (574) T ss_pred -CCccCHHHHH----H----HhCCCCCCCCCEeeeccceeecccccccccCCc-cchhcccccccc Confidence 1223333222 2 2344332 222 1100000000000000000 000011111111 No 141 >protein:vir:96579 Length: 576 # NCBI annotation: ORF012 # Family: family:all:2446 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238542;genbank:gi:66391267;genbank:GeneID:5130361 Probab=33.43 E-value=1.4 Score=19.86 Aligned_cols=434 Identities=12% Similarity=0.109 Sum_probs=164.0 Q ss_pred Cccchhhhhhhcc-----ch---hhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhccc-----------ccccccccc Q lcl|NC_015158. 1 MTGKVLELQQMLD-----DT---RDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATD-----------TTTTTNSTL 61 (581) Q Consensus 1 ~~~~~~~~~~~~~-----~~---~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~-----------~~~~~~~~~ 61 (581) |-+....|-+=+- +. +-..+..|...|.+-+. +.. .+.+-+.... .-..+.... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~---~~~~~~~~~~~a~~~p~~~~~~~~~~~~~~ 73 (576) T protein:vir:96 1 MVTRLADIFKRLRLGRDYEDIIDTVPIDDGLQANIRNIEE----KSK---ELNKSLYGKQQAYAEPFLEVMDTNPEFRTK 73 (576) T ss_pred ChhhHHHHHHHHhccCccccchhhhhcccChhHHHHHhhh----hhh---hhccccCCccchhhcceeeeeecCCCcccc Confidence 4444433322110 00 00001112222221110 000 0111110000 000000000 Q ss_pred c---------------ccccccccchHHHHHHHHHHHHHhhcCCccE--EEeecCChh----HHHHHH--HHHHHHHHHH Q lcl|NC_015158. 62 P---------------WKNKTTLPKLCQIRDNLHSNYISALFPNERW--LKWEGKSLQ----DEAKRD--AIQQYMDNKV 118 (581) Q Consensus 62 ~---------------~k~~~~~pki~~~~d~~~~~l~~~~f~~~~~--~~~~~~~~~----d~~~ae--~~~~~i~~~l 118 (581) | +..+..+..|..++.+-++.+--..+-..+- +.+..+... +.++++ ....++.+.+ T Consensus 74 p~~~~~~~~~~~~l~~~~~npiv~~~I~~ia~~vA~~~~~~~~~~~~~~~~i~lk~~~~~~~~~~~~~~~~l~~~l~~~~ 153 (576) T protein:vir:96 74 RSYMKNSDNLHDVLKQFGNNPILNAIILTRSNQVAMYCQPSRYNERGLGFEVRMRDLDAEPGKKEKEEIKRIENFILNTG 153 (576) T ss_pred CcchhhhhhhHHHHHHhhcCHHHHHHHHHHHHHHHhhhhhhhhccccccceeEEecCcCccchhhhHhhhhHHhhHhhcc Confidence 0 0011223334444444445544444433332 233333322 223332 2344454444 Q ss_pred Hh-----cchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeeeeEeeeeccceEEecchhheeecCCCCCcccCceEEE Q lcl|NC_015158. 119 KE-----SDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESGATRDTYFGPRAVRIDPKDIVFNPVAVDFAHSPKIIR 193 (581) Q Consensus 119 ~e-----~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~~~~~~~~~p~ie~V~p~df~~DP~a~~~~d~~~i~r 193 (581) .. .++...++.++.|++++|+|++-+-+.+... +.. -.+..|+|..+-+... T Consensus 154 ~~~~p~~~t~~~f~~~lv~dlll~Gna~~~i~~~rd~~-----g~~------~~L~pl~p~~V~v~~~------------ 210 (576) T protein:vir:96 154 RDKDIDRDSFQSFCRKIVRDTYTYDQVNFEKVFNKKNA-----TTM------DKFIAVDPSTIFYATD------------ 210 (576) T ss_pred CCCCCccccHHHHHHHHHHHHHhcCCeEEEEEEecCCC-----Cce------EEEEEeCCceeEEEEC------------ Confidence 32 2466778889999999999988765432100 000 0123333322211100 Q ss_pred EEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchhhhhccccccccccccccccCCceEEEEEEeeeeecccC Q lcl|NC_015158. 194 TVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQS 273 (581) Q Consensus 194 ~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~ 273 (581) . ++.. +. .. ..| +...+ T Consensus 211 ---------------~-----------------------------------dg~~----~~-----~~-~~~---~~~~~ 227 (576) T protein:vir:96 211 ---------------K-----------------------------------NGKI----IK-----GG-KRF---VQVIN 227 (576) T ss_pred ---------------C-----------------------------------CCce----ee-----ee-eEE---EEecC Confidence 0 0000 00 00 000 01111 Q ss_pred CceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEE Q lcl|NC_015158. 274 GTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMK 353 (581) Q Consensus 274 d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~ 353 (581) +. ....+..+.+|....||.+ ...+..||.|+...+...-....++.+.....+...+.|..+ T Consensus 228 ~~-----~~~~~~~~dii~~~~~~~~------------d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~Ng~~p~gi 290 (576) T protein:vir:96 228 KK-----VVASFTSREMAMGIRNPRT------------ELSSSGYGLSEVEIAMKQFIAYNNTETFNDRFFSHGGTTRGI 290 (576) T ss_pred Cc-----eEEEecccceEEEeecCCC------------CcccCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceE Confidence 11 1111111222222222211 112357999999999888888888777777777777777644 Q ss_pred Ee--cc----ccc---c--------c--cCCcee-EEeCCCCCcccccCCCccchhHHHHHHHHHHHHHhcCCchHhcCC Q lcl|NC_015158. 354 VK--GD----VEE---F--------V--WGPMEQ-IYINGDGDVEMMAPNTQALQADMQIQILEAKMEEFAGAPREAMGI 413 (581) Q Consensus 354 v~--~d----~~~---i--------~--~~pG~v-i~~~~~~~i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~ 413 (581) +. ++ .+. + . ...|++ +-...+..+.++...+...+.....++....+-..-|||+...|. T Consensus 291 L~~~~~~~ls~e~~~~lr~~~~~~~~G~~nag~~p~vl~~G~~~~~ls~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~ 370 (576) T protein:vir:96 291 LQIKSEQQQSQRALENFKREWKSSFSGINGSWQVPVVMADDIKFVNMTPTANDMQFEKWLTYLINIISALYGIDPAEIGF 370 (576) T ss_pred EEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeecCCCceEEeccCChhhHHHHHHHHHhHHHHHHHhCCCHHHccc Confidence 32 21 111 1 1 123554 334555566777655555555555677778888899999999998 Q ss_pred CCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceeeecCchhcccCCCccCHHHhcCCce Q lcl|NC_015158. 414 RTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADTIRVFDSDDKVATFMNVNKDDITAKGR 493 (581) Q Consensus 414 ~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~ 493 (581) ...+..+++......+.++ +....+.|-...+.|++..+-+-+.+.+- +-++. ++ .++ T Consensus 371 ~~~~~~~g~~~~~s~t~sn--~e~~~~~f~~~tL~P~~~~ie~~ln~~Ll-----~~~~~-------------~~--~~~ 428 (576) T protein:vir:96 371 PNRGGATGGKGGNTLNEAD--PGKKQQQSQNKGLQPLLRFIEDLINTHII-----SEYSD-------------KY--VFQ 428 (576) T ss_pred ccccccccccccccccccc--HHHHHHHHHHHHHHHHHHHHHHHHHhhhc-----hhccC-------------ce--EEE Confidence 6555444332222222222 24445566666678888777543332220 00111 01 111 Q ss_pred EEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHHHHHHHHhcCCCc---cc-ccCCCCcHH-------HHHHH Q lcl|NC_015158. 494 LRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLAKMLEHNLSLGGW---DI-FKPNVAVME-------AQTTS 562 (581) Q Consensus 494 vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~~~e~~~l~~~---~~-~~~~~~~~~-------~~~~q 562 (581) +... ....++...+.+.++.. +.+.|...+ +.+|++-. |. +.|...... ..+.+ T Consensus 429 f~r~---d~~~~~e~~~~~~~~~~----G~lT~NE~R--------~~~gl~piegGD~~~~~~~~~~~~~~~~~~~~e~~ 493 (576) T protein:vir:96 429 FVGG---DTKSELDKIKILQEEVK----TYKTVNEAR--------KEKGLKPIEGGDVLLDGSFIQSMSLNTQKEQYEDT 493 (576) T ss_pred eccC---CHHHHHHHHHHHHHHhc----CccCHHHHH--------HHhCCCCCCCcceeccccccccccccccCCCCCCc Confidence 1111 11223322222222211 123333222 23344432 11 111000000 00000 Q ss_pred HHHHHHHHHHHHHhcccCC Q lcl|NC_015158. 563 ALVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 563 ~~~q~aq~~~~~~~~~~~~ 581 (581) ..+...++..+..++.+-- T Consensus 494 ~~~~~~~~~~~~~~~~~~~ 512 (576) T protein:vir:96 494 KQKERFDMIQQFLNSPDDE 512 (576) T ss_pred cccccccccccccCCCCCC Confidence 0000000000000000000 No 142 >protein:vir:4828 Length: 382 # NCBI annotation: ORF24 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038325;genbank:gi:9634651;genbank:GeneID:1262630 Probab=31.46 E-value=1.5 Score=19.62 Aligned_cols=350 Identities=11% Similarity=0.036 Sum_probs=117.9 Q ss_pred ceEEEEeeecceeeeeeeeeEe-eeeccce--EEecchhhe--------eecCCCCCcccCceEEEEEecHHHHHHHhhc Q lcl|NC_015158. 139 NCFATVEYVKETTKDEESGATR-DTYFGPR--AVRIDPKDI--------VFNPVAVDFAHSPKIIRTVLNEGELLQMEQD 207 (581) Q Consensus 139 ~~i~k~~~~~~~~~~~~~~~~~-~~~~~p~--ie~V~p~df--------~~DP~a~~~~d~~~i~r~~~T~~el~~m~~~ 207 (581) +|+++=-...+.....+..... ..+.+.. =..|++... .++--|.++..+++.+ ...+...|.. . T Consensus 1 Mg~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~l~~~~v~~~i~~ia~~ia~~~~~~-~~~~~~~L~~---~ 76 (382) T protein:vir:48 1 MPIFNLATESPPDNQGGFFDVVDSDFLASLKGNEWVSAETALRNSDLFSIINQLSNDLATVKLIT-SRKKLQGIVD---N 76 (382) T ss_pred CccccccccCCcccccccccchhhhccccccCCcccchHhhhccHHHHHHHHHHHHhhccCceee-ecchhhhhhh---h Confidence 5555321111100000000000 0000000 011222211 1122223344444321 1112222211 0 Q ss_pred cCccchhHHHHHHHHhhhcc--CCcccchhhhhcccccccccccccc--ccCCceEEEEEEeeeeecccCCceeeeeEEE Q lcl|NC_015158. 208 QPENASLASAIARRREFRRG--LGTYTREDCEKAVGFSMDGFGNLYD--YFQSPYVEVLTFYGDYHDTQSGTFKRNMKVT 283 (581) Q Consensus 208 ~~~~~~~~~d~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~vevlE~~g~~~d~~~d~~~e~~~it 283 (581) |.. ....-++.+....... ...+ +...+ +..+.... +.-+..|++.. ..+++.. .+.++ T Consensus 77 PN~-~~t~~~f~~~l~~~l~l~Gna~----~~i~r----d~~G~~~~l~~i~~~~v~v~~-------~~~~~~~-~y~~~ 139 (382) T protein:vir:48 77 PSN-NANRFNFYQSIFAQMLLGGEAF----AYRWR----NENGRDMKWEYLRPSQVSFNR-------LDNKDGI-YYNIT 139 (382) T ss_pred cCC-CCCHHHHHHHHHHHhhhcCCEE----EEEEE----CCCCcEEEEEEEcCceeEEEE-------cCCCCeE-EEEEE Confidence 111 0011111111110000 0000 00000 00000000 00111122110 0111111 11221 Q ss_pred EEeCCEEEEeecCCCccCCCCeeEecccccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEec--c--cc Q lcl|NC_015158. 284 IIDRMFVIEEKENPSWFAQAPIFHCGWRIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKG--D--VE 359 (581) Q Consensus 284 v~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~--d--~~ 359 (581) + .+...-... .|+.. -..|+.+....+.+||.|+...+.+.-....+..+.....+...+.|..++.- . .+ T Consensus 140 ~-~~~~~~~~~--~~~~~--evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e 214 (382) T protein:vir:48 140 F-DDPRIPPKQ--HVPQN--DVLHFRLLSVDGGMTSVSPLMALSRELDIQKASGNLTINSLKNALNANGILKIKGGGLLD 214 (382) T ss_pred e-cCcccccee--EEcCc--cEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCChH Confidence 1 111100000 01111 13444444445679999999999998888888888888888888888776531 1 11 Q ss_pred c----------cccCCceeEEeCCCCCcccccCCCccchhHHHHHHHHHHHHHhcCCchHhcCCCCcccccHHHHHHHHH Q lcl|NC_015158. 360 E----------FVWGPMEQIYINGDGDVEMMAPNTQALQADMQIQILEAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQN 429 (581) Q Consensus 360 ~----------i~~~pG~vi~~~~~~~i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~ 429 (581) . .....|+++-...+..+.++...+...+.....++....+-..-|||+...|.......++. T Consensus 215 ~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~afgVp~~~lg~~~~~~~~~~------- 287 (382) T protein:vir:48 215 FKTKLSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVSQLLKQADWTTGQFAKVYGIPDNVVGGQGDQQSSLE------- 287 (382) T ss_pred HHHHHHHHHHhhccCCCCeeEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHH------- Confidence 1 11226787777776677777654444444444567777788889999999996443221110 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceeeecCchhcccCCCcc--CHHHhcCCc-eEEEecchhHHHHH Q lcl|NC_015158. 430 AAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADTIRVFDSDDKVATFMNV--NKDDITAKG-RLRPVGARHFAEQA 506 (581) Q Consensus 430 aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~iR~~~~~~~~~~~~~v--~r~di~~~~-~vva~ga~~~~~r~ 506 (581) . .+.|-...+.|++..+.+-+.+.+ ..+.+.+.+..+ +....+-+. .++-. +..++. T Consensus 288 ---~-----~~~~~~~~l~p~~~~i~~~l~~~l---------~~~~~~~~~~~~~~~~~~~~~~~~~l~~~---g~~t~~ 347 (382) T protein:vir:48 288 ---M-----SSDLYSKAVSRYLRPFLSELSQKL---------SCDVDADIFPAVDPTGSNYISRINSLVKT---GTLAQN 347 (382) T ss_pred ---H-----HHHHHHHHHHHHHHHHHHHHHHHh---------cChhhhhhhhhhccchhHHHHHHHHHhhc---CccCHH Confidence 0 112333445555555432222211 111111111111 111111110 00000 111111 Q ss_pred HHHHHHHHhhcccccccccchhHHHHHHHHHHHHhcCCCcccccCCCCcHHHHH Q lcl|NC_015158. 507 QVVQSLMGIANTPVWQDIKPHVSTENLAKMLEHNLSLGGWDIFKPNVAVMEAQT 560 (581) Q Consensus 507 q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~~~e~~~l~~~~~~~~~~~~~~~~~ 560 (581) +-.+.|. + ..+.|.. +.+.-...-.++|= +- .++. T Consensus 348 e~r~~l~---~----~g~~~~~----~~~~~~~~~~~~GG-------d~-~~~~ 382 (382) T protein:vir:48 348 QGLYILQ---Q----AEILPKE----LPNGENPNSTLKGG-------EE-DGQD 382 (382) T ss_pred HHHHHHh---h----CCCCCcc----hhhhhcCCCCCCCC-------CC-CCCC Confidence 1111111 0 0112211 10000000001110 00 1111 No 143 >protein:vir:77981 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1843 # MgeName: P23-45 # Cross-refs: genbank:acc:YP_001467939;genbank:gi:157265380;genbank:GeneID:5600471 Probab=30.04 E-value=1.6 Score=19.45 Aligned_cols=391 Identities=12% Similarity=0.059 Sum_probs=118.4 Q ss_pred hHHhhhhhHHHHHHHHHHHhhccccccccccccccc----ccccccch---------HHHHHHH--HHHHHHhhcC---- Q lcl|NC_015158. 29 NWNSQRQEWLSQKSELRNYIFATDTTTTTNSTLPWK----NKTTLPKL---------CQIRDNL--HSNYISALFP---- 89 (581) Q Consensus 29 ~~~~~r~~~~~~~~~~~~y~~~~~~~~~~~~~~~~k----~~~~~pki---------~~~~d~~--~~~l~~~~f~---- 89 (581) =.+..+-+++-.= .- --+.+++..+.+....++- +..+.+++ .++.+.+ -+++..++.. T Consensus 1 m~kk~~k~~~~~~-~~-~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~iLr~~~~~~ly~~m~~D~hi~s~l~~Rk~a 78 (448) T protein:vir:77 1 MAKRGRKPKELVP-GP-GSIDPSDVPKLEGASVPVMSTSYDVVVDREFDELLQGKDGLLVYHKMLSDGTVKNALNYIFGR 78 (448) T ss_pred CCCCCCCCcccCC-cc-cccchhhhhhhccchhhhcccccccccccchhHhhccccchHHHHHHhhChHHHHHHHHHHHH Confidence 0000000000000 00 0000000000000000000 00111111 1111111 1112111111 Q ss_pred --CccEEEeecCChhHH--HHHHHHHHHHHHH---HHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeeeeeeeEeee Q lcl|NC_015158. 90 --NERWLKWEGKSLQDE--AKRDAIQQYMDNK---VKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESGATRDT 162 (581) Q Consensus 90 --~~~~~~~~~~~~~d~--~~ae~~~~~i~~~---l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~~~~~~ 162 (581) +.+ +.++|...+.+ +.|+..++.|..- +...+|...+.++ .|++.||.+++-+-|.... ++.. T Consensus 79 v~~~~-w~v~p~~~~~~d~~~ae~v~~~l~~~~~~~~~~~f~~~i~~~-lda~~~G~s~~Eivw~~~~-----dg~~--- 148 (448) T protein:vir:77 79 IRSAK-WYVEPASTDPEDIAIAAFIHAQLGIDDASVGKYPFGRLFAIY-ENAYIYGMAAGEIVLTLGA-----DGKL--- 148 (448) T ss_pred HhcCC-ceEecCCCCHHHHHHHHHHHHHhhchhhhhccCCHHHHHHHH-HHhhhhcceeEEEEEeecC-----CCce--- Confidence 233 34776544333 3445444444321 1234676666666 5899999999988884210 0110 Q ss_pred eccceEEecchh---heeecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchhhhhc Q lcl|NC_015158. 163 YFGPRAVRIDPK---DIVFNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDCEKA 239 (581) Q Consensus 163 ~~~p~ie~V~p~---df~~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~ 239 (581) .-.++..+++. -|.+||.-. -+ ..|. .+.+ T Consensus 149 -~~~~l~~r~~~~~~~f~~~~~~~------l~---~~~~-----------------------------~~~~-------- 181 (448) T protein:vir:77 149 -ILDKIVPIHPFNIDEVLYDEEGG------PK---ALKL-----------------------------SGEV-------- 181 (448) T ss_pred -eeccccccCCCccceeeeecCCc------eE---EEec-----------------------------CCcc-------- Confidence 00122222222 122222200 00 0000 0000 Q ss_pred cccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccccCCcccC Q lcl|NC_015158. 240 VGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIRQDNLYA 319 (581) Q Consensus 240 ~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p~~~~G 319 (581) .++. .+.+ .-+.|.+. |++.. ....++.|| T Consensus 182 ---------------------------------~~~~--------~~~~------~~~lP~~~--~i~~~-~~~~g~p~g 211 (448) T protein:vir:77 182 ---------------------------------KGGS--------QFVN------GLEIPIWK--TVVFL-HNDDGSFTG 211 (448) T ss_pred ---------------------------------cccc--------cCCC------ccccccce--EEEEe-cCCcCCccc Confidence 0000 0000 00001111 11221 222355566 Q ss_pred CCcHHhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEe------ccccc----------cc--cCCceeEEeCCCCCccccc Q lcl|NC_015158. 320 MGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVK------GDVEE----------FV--WGPMEQIYINGDGDVEMMA 381 (581) Q Consensus 320 ~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~------~d~~~----------i~--~~pG~vi~~~~~~~i~~~~ 381 (581) .|.+..+--+=.-++-..+...-=+.+-.-|..... ++.++ +. ...|.+ +..+..|..+. T Consensus 212 ~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vgky~~ga~~~~~~~~~l~~av~~i~~g~~a~~i--iP~g~~ie~~e 289 (448) T protein:vir:77 212 QSALRAAVPHWLAKRALILLINHGLERFMIGVPTLTIPKSVRQGTKQWEAAKEIVKNFVQKPRHGII--LPDDWKFDTVD 289 (448) T ss_pred chHHHHHHHHHHHHHhhHHHHHHHHHHcCCceeEEecCCCCCCCHHHHHHHHHHHHHHhcCCceEEE--ecCCceEEEEe Confidence 666666666555555555555555555444532211 01001 11 011112 33334444444 Q ss_pred CCCccchhHHHHHHHHHHHHHhc-CCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_015158. 382 PNTQALQADMQIQILEAKMEEFA-GAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRR 460 (581) Q Consensus 382 ~p~~~~~~~~~lq~~~~~~ee~T-Gv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~ 460 (581) ..........++++.+..+.+.. |-+- +.+..+ +.-+|. .....+-....++..++.+..++-+.||..++.+ T Consensus 290 a~~~~~~~~~~i~~~d~~Isk~iLGqtl-Ts~~~~-g~~~~~-~~~~~~v~~~~~~aDa~~i~~tln~~Li~~l~~l--- 363 (448) T protein:vir:77 290 LKSAMPDAIPYLTYHDAGIARALGIDFN-TVQLNM-GVQAVN-IGEFVSLTQQTIISLQREFASAVNLYLIPKLVLP--- 363 (448) T ss_pred cCCCccCHHHHHHHHHHHHHHHHhcccc-cccccc-chhhhh-hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh--- Confidence 44333445566778777766543 2211 112111 111111 1111111222223334444444444455544443 Q ss_pred hc-CccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHHHHHHH Q lcl|NC_015158. 461 NL-DVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLAKMLEH 539 (581) Q Consensus 461 n~-d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~~~e 539 (581) |. ......|+ .|-...++|++. .++.++.| ..++.+ T Consensus 364 Nfg~~~~~P~~--------~f~~~e~eDl~~--------------~a~~~~~l---------------------~~~~~~ 400 (448) T protein:vir:77 364 NWPGATRFPRL--------TFEMEERNDFSA--------------AANLMGML---------------------INAVKD 400 (448) T ss_pred cCCCCCCCCEE--------EecCCChhhHHH--------------HHHHhHHH---------------------HHHHHH Confidence 21 11111111 133334455421 12112211 122344 Q ss_pred HhcCCCcccccCCCCcHHHHHHHH--HHHHHHHHHHHHhcccCC Q lcl|NC_015158. 540 NLSLGGWDIFKPNVAVMEAQTTSA--LVNQSQAQIEEEAQVPLV 581 (581) Q Consensus 540 ~~~l~~~~~~~~~~~~~~~~~~q~--~~q~aq~~~~~~~~~~~~ 581 (581) ..+|+.. +...+++.+.+..+. ............+.--.| T Consensus 401 ~~~ip~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 442 (448) T protein:vir:77 401 SEDIPTE--LKALIDALPSKMRRALGVVDEVREAVRQPADSRYL 442 (448) T ss_pred HhcCCcc--CCcCCCCCchhcccccCCCCCCCchhhcchhhHHH Confidence 4555532 111111111100000 000000000000000001 No 144 >protein:vir:1326 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047925;swissprot:trembl:q9zxb2;genbank:gi:9631143;uniprot:Q9ZXB2;genbank:GeneID:2715872 Probab=23.99 E-value=2.2 Score=18.67 Aligned_cols=406 Identities=12% Similarity=0.053 Sum_probs=150.1 Q ss_pred HHHHHHHHhhHHhhhhhH--HHHHHHHHHHhhcccccccccccccccccccccchHHHHHHHHHHHHHhhcCCccEEEee Q lcl|NC_015158. 20 AEQIANTWQNWNSQRQEW--LSQKSELRNYIFATDTTTTTNSTLPWKNKTTLPKLCQIRDNLHSNYISALFPNERWLKWE 97 (581) Q Consensus 20 a~~i~~~~~~~~~~r~~~--~~~~~~~~~y~~~~~~~~~~~~~~~~k~~~~~pki~~~~d~~~~~l~~~~f~~~~~~~~~ 97 (581) =..+..+|..+...+... .+.|..+...+........+....-..+-...|.+...++-+...+-+ -.+--++ T Consensus 1 Mg~~~~l~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~V~~~~al~~~~V~~~v~~Ia~~iA~-----lp~~~~~ 75 (457) T protein:vir:13 1 MGFWSALFGRGHSPALDGIEARAWEPYDPSIYNLGAVAASGETVTPHDALQVSAVFASVRLLSETIAT-----LPLSTYS 75 (457) T ss_pred CchhhhhhcccccccccccccccccccchHHHhhcccccCCceechHHhhccHHHHHHHHHHHHhhcc-----CceEEEE Confidence 111222222111111000 112211111111111011111111111222234444444444333222 1221122 Q ss_pred cCChhHHHHHHHHHHHHHHHHHh-c---chHHHHHHHHHHHhhcCceEEEEeeecceeeeeeeeeEeeeeccceEEecch Q lcl|NC_015158. 98 GKSLQDEAKRDAIQQYMDNKVKE-S---DFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESGATRDTYFGPRAVRIDP 173 (581) Q Consensus 98 ~~~~~d~~~ae~~~~~i~~~l~e-~---n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~~~~~~~~~~~~p~ie~V~p 173 (581) -...+ +.++..-.+...|.. . ..+..++.++.+++++|+|++.+-+..+. .. .+..+.| T Consensus 76 ~~~~~---~~~~~~~~l~~~ln~~~n~~t~~~f~~~~~~~lll~Gna~~~i~~~~g~--------~~------~l~~l~p 138 (457) T protein:vir:13 76 KRGGS---RKEIVTPEWLDYPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWQGPN--------IV------GLDVLDP 138 (457) T ss_pred ecCCc---ccccccchHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCc--------EE------EEEEEcc Confidence 11111 112222233333332 2 34667888889999999999876432110 00 1222222 Q ss_pred hheeecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccchhhhhccccccccccccccc Q lcl|NC_015158. 174 KDIVFNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDCEKAVGFSMDGFGNLYDY 253 (581) Q Consensus 174 ~df~~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 253 (581) ..+-+... . . + +.. T Consensus 139 ~~v~v~~~---------------~-----------~-------------------~-------------------~~~-- 152 (457) T protein:vir:13 139 TKIHVHMV---------------M-----------V-------------------D-------------------GLR-- 152 (457) T ss_pred CceEEEEe---------------c-----------C-------------------C-------------------Ccc-- Confidence 21111000 0 0 0 000 Q ss_pred cCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccccCCcccCCCcHHhhhhHHHHH Q lcl|NC_015158. 254 FQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIRQDNLYAMGPLDNLVGMQYRI 333 (581) Q Consensus 254 ~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p~~~~G~s~~~~l~d~Q~~~ 333 (581) ..++.. |....++- + .....+.-+.+|+ +.+....+.++|.|+...+...-... T Consensus 153 -----~~~~~~----y~~~~~~~-~-~~~~~~~~~diih---------------~~~~~~~~~~~G~s~i~~~~~~i~~~ 206 (457) T protein:vir:13 153 -----RKVFEA----YDIDADGN-E-VLLGWFTPRDVLH---------------IPGMMLPGDFVGCSPISYARESIGLA 206 (457) T ss_pred -----ceeEEE----EEEecCCc-e-eeEEeeCccceEE---------------ecCCCCCCccccccHHHHHHHHHHHH Confidence 000000 01000000 0 0000001111222 22222235689999999988877777 Q ss_pred HHHHHHHHHHHHHhcCCeEEEec--cc--cc-------c----c--cCCceeEEeCCCCCcccccCCCccchhHHHHHHH Q lcl|NC_015158. 334 DHLENLKADVFDLIAFPPMKVKG--DV--EE-------F----V--WGPMEQIYINGDGDVEMMAPNTQALQADMQIQIL 396 (581) Q Consensus 334 n~~~R~~iDn~~~s~np~~~v~~--d~--~~-------i----~--~~pG~vi~~~~~~~i~~~~~p~~~~~~~~~lq~~ 396 (581) .+..+.....+...+.|...+.- .. +. + . ...|++.-+..+..+.++...+...+.....++. T Consensus 207 ~~~~~~~~~~f~ng~~p~gil~~~~~ls~e~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~ 286 (457) T protein:vir:13 207 LAAQKYGSKFFANGAMPGAVVEVPGTMSEEGLARAREAWRAANSGVDNAHRVALLTEGAKFSKVAMSPDEAQFLQTRQFQ 286 (457) T ss_pred HHHHHHHHHHHhcCCCcceEEEcCCCCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHH Confidence 77777666666666777665531 11 10 1 1 1146777777777777776554444444445666 Q ss_pred HHHHHHhcCCchHhcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccceeeecCchhc Q lcl|NC_015158. 397 EAKMEEFAGAPREAMGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADTIRVFDSDDK 476 (581) Q Consensus 397 ~~~~ee~TGv~~~~~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~d~~~~iR~~~~~~~ 476 (581) ..++-..-|||+...|....+..+++.+.+.+ +.|-..-+.|++..+-+-+.+.+ +.+... T Consensus 287 ~~~Ia~~fgVPp~~lg~~~~~~~~~sn~eq~~-----------~~f~~~tl~P~~~~ie~~ln~~L--------~~~~~~ 347 (457) T protein:vir:13 287 VPEIARIFGVPPHLISDATNSTSWGSGLAEQN-----------IAFTMFSLRPWLERIEAGFNRLL--------FAETAD 347 (457) T ss_pred HHHHHHHhCCCHHHcCCCCCcccccchHHHHH-----------HHHHHHHHHHHHHHHHHHHHHhh--------cCcccc Confidence 67788889999999997665555444443322 23444456777666644333222 111000 Q ss_pred ccCCCccCHHHhcCCceEEEecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHHHHHHHHhcCCCc-----c-ccc Q lcl|NC_015158. 477 VATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLAKMLEHNLSLGGW-----D-IFK 550 (581) Q Consensus 477 ~~~~~~v~r~di~~~~~vva~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~~~e~~~l~~~-----~-~~~ 550 (581) ..-|++...+++ -.....+|...++.+.+ . +.+.|.. .. +.+|++.. + .+. T Consensus 348 ~~~~i~fd~~~l---------~~~D~~~r~~~~~~~~~---~---G~~T~NE----~R----~~~gl~Pi~~g~~d~~~~ 404 (457) T protein:vir:13 348 RFRFVKFNLDEI---------KRGAPKERMELWSLGLQ---N---GIYSIDE----VR----AAEDMTPLPDGLGEKYRV 404 (457) T ss_pred CceeEEeechhh---------hccCHHHHHHHHHHHHh---C---CCcCHHH----HH----HHhCCCCCCCCcccceee Confidence 000122222222 11122334333333221 1 2233332 21 22344321 1 111 Q ss_pred CCCCcH--HHHHHHHH-HHH-HHHHHHHHhcccCC Q lcl|NC_015158. 551 PNVAVM--EAQTTSAL-VNQ-SQAQIEEEAQVPLV 581 (581) Q Consensus 551 ~~~~~~--~~~~~q~~-~q~-aq~~~~~~~~~~~~ 581 (581) |---.+ ..-+.+.. ... .+..+++..+.+-- T Consensus 405 ~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 439 (457) T protein:vir:13 405 PLNLGEVGEEPEPEPAPAPPAIEPPAEEPDEEPEP 439 (457) T ss_pred ccccccccccccccccCCCCCCCCCccccCCCCCC Confidence 100000 00000000 000 00111111111111 No 145 >protein:vir:93943 Length: 409 # NCBI annotation: ORF010 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239936;genbank:gi:66395598;genbank:GeneID:5131009 Probab=21.83 E-value=2.5 Score=18.36 Aligned_cols=381 Identities=10% Similarity=0.039 Sum_probs=128.4 Q ss_pred HHhcchHHHHHHHHH-HHhhcCce--EEEEeeecceeeeeeeeeEeeeeccceEEecchhheeecCCCCCcccCceE-EE Q lcl|NC_015158. 118 VKESDFRTIMSQLLL-DYIDYGNC--FATVEYVKETTKDEESGATRDTYFGPRAVRIDPKDIVFNPVAVDFAHSPKI-IR 193 (581) Q Consensus 118 l~e~n~~~~~~~~~~-d~~~~G~~--i~k~~~~~~~~~~~~~~~~~~~~~~p~ie~V~p~df~~DP~a~~~~d~~~i-~r 193 (581) +.+.++.+.+...+. .+....++ .--.+| ....-.....-.+...+.---.++.-|.++..+++. .| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~v~~~~~~~~~~V~~ci~~Ia~~ia~lp~~~~~ 71 (409) T protein:vir:93 1 MAKENIVTRIKKKLIDNWIDQSTSKLYDFSPW---------KNRSFWGVINNTLETNETIFSAITKLSNSMASLPLKMYE 71 (409) T ss_pred CCccchhhhhhhhhhhhhhccccccccccccc---------cCccccccchhhhhccHHHHHHHHHHHHhhhhCceeEee Confidence 555566555444332 22211111 111111 111000000000111111111233334556666542 22 Q ss_pred E-EecHHHHHHHhhccCccchhHHHHHHHHhhhcc--CCcccchhhhhccccccccccccccc--cCCceEEEEEEeeee Q lcl|NC_015158. 194 T-VLNEGELLQMEQDQPENASLASAIARRREFRRG--LGTYTREDCEKAVGFSMDGFGNLYDY--FQSPYVEVLTFYGDY 268 (581) Q Consensus 194 ~-~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~vevlE~~g~~ 268 (581) . ..+...+..+-...|-..-..-++......... ..+| +...+ +..+...+. ..+..|++.. T Consensus 72 ~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay----~~i~r----~~~G~~~~L~~l~~~~v~~~~----- 138 (409) T protein:vir:93 72 DYKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAY----VLIER----DIYHQPSKLFLLNPDVVEMLI----- 138 (409) T ss_pred ccccccchHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceE----EEEEE----CCCCcEEEEEEEcCceeEEEE----- Confidence 1 112222222211111111111111111110000 0000 00000 001111000 0111122100 Q ss_pred ecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccccCCcccCCCcHHhhhhHHHHHHHHHHHHHHHHHHhc Q lcl|NC_015158. 269 HDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIA 348 (581) Q Consensus 269 ~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p~~~~G~s~~~~l~d~Q~~~n~~~R~~iDn~~~s~ 348 (581) + .++... .+.+....|..+ . |+... ..|+......+.+||.|+...+.......+++.+..+.+..... T Consensus 139 -~-~~~~~~-~y~~~~~~g~~~-~-----~~~~e--Vih~r~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~~~~ 207 (409) T protein:vir:93 139 -E-NQSREL-YYSIHAATGNKL-I-----VHNMD--MLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFNLTEMQKPD 207 (409) T ss_pred -e-CCCcEE-EEEEEcCCceEE-E-----Ecccc--EEEeCCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHhcCCCC Confidence 0 111111 122222122211 1 11111 13333333347789999999999988888877766554444332 Q ss_pred CCeEEEeccc--cc-------c---ccCCceeEEeCCCCCcccccCCCccchhHHHHHHHHHHHHHhcCCchHhcCCCCc Q lcl|NC_015158. 349 FPPMKVKGDV--EE-------F---VWGPMEQIYINGDGDVEMMAPNTQALQADMQIQILEAKMEEFAGAPREAMGIRTP 416 (581) Q Consensus 349 np~~~v~~d~--~~-------i---~~~pG~vi~~~~~~~i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~~G~~~~ 416 (581) +..+...... +. + ....|++.-...+..+.++...+...+.....++...++-..-|||+...|.... T Consensus 208 ~~i~~~~~~l~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~~ 287 (409) T protein:vir:93 208 SFMLKYGSNVGKEKRQQVLEDFKQYYEENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSN 287 (409) T ss_pred ceEEecCCCCCHHHHHHHHHHHHHHhhcCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCC Confidence 2112111111 11 1 1235666666666667766654443344444455666777789999999986433 Q ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc-CccceeeecCchhcccCCCccCHHHhcCCceEE Q lcl|NC_015158. 417 GEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNL-DVADTIRVFDSDDKVATFMNVNKDDITAKGRLR 495 (581) Q Consensus 417 ~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~~~~f~~~n~-d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vv 495 (581) ++ -+.+ ....+.|...-+.|++..+-+-+...+ ...+ |- ......|++. T Consensus 288 ~~--~sn~-----------e~~~~~f~~~~l~P~~~~ie~~l~~~Ll~~~~--~~---------------~~~~~~fd~~ 337 (409) T protein:vir:93 288 TN--FAKN-----------EELNRFYLQHTLLPIVKQYEEEFNRKLLTKTD--RE---------------KNRYFKFNVK 337 (409) T ss_pred CC--cccH-----------HHHHHHHHHHHHHHHHHHHHHHHHhhcCCccc--cc---------------CcceEEeech Confidence 22 1112 122235555567787777744333222 1000 00 0011222221 Q ss_pred EecchhHHHHHHHHHHHHHhhcccccccccchhHHHHHHHHHHHHhcCCCc---cccc-C-CCCcHHH-HHHHHHHHHHH Q lcl|NC_015158. 496 PVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLAKMLEHNLSLGGW---DIFK-P-NVAVMEA-QTTSALVNQSQ 569 (581) Q Consensus 496 a~ga~~~~~r~q~~q~L~~~~~~~~~~~i~p~~~~~~l~~~~~e~~~l~~~---~~~~-~-~~~~~~~-~~~q~~~q~aq 569 (581) +.--....+|+..++.+ .+. +.+.|...+ +.+|++.. |.+. + +..+-.. .+.+.-..... T Consensus 338 ~ll~~d~~~~~~~~~~~---~~~---G~~T~NE~R--------~~~g~~p~~ggD~~~~~~n~~~~~~~~~~~~~~~gG~ 403 (409) T protein:vir:93 338 SYLRADSATQAEVYFKA---VRS---GYYTINDIR--------EWEDLPPVEGGDKPLISGDLYPIDTPLELRKSLKGGD 403 (409) T ss_pred hhhccCHHHHHHHHHHH---HhC---CCcCHHHHH--------HHhCCCCCCCcCeeeecccccccccchhhcccccCCC Confidence 11111223343333322 221 223333222 22355432 1111 0 0000000 00000000000 Q ss_pred HHHHHHhcc Q lcl|NC_015158. 570 AQIEEEAQV 578 (581) Q Consensus 570 ~~~~~~~~~ 578 (581) +- +.+. T Consensus 404 -~n--~~e~ 409 (409) T protein:vir:93 404 -KN--VNES 409 (409) T ss_pred -CC--cCCC Confidence 00 0000 No 146 >protein:vir:79647 Length: 435 # NCBI annotation: PorT # Family: family:all:297 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285520;genbank:gi:148734503;genbank:GeneID:5220005 Probab=20.95 E-value=2.7 Score=18.23 Aligned_cols=393 Identities=14% Similarity=0.094 Sum_probs=152.5 Q ss_pred CccchhhhhhhccchhhhHHHHHHHHHhhHHhhhhhHHHHHHHHHHHhhccccccccccccccccc--ccccchHHH-H- Q lcl|NC_015158. 1 MTGKVLELQQMLDDTRDGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTTTTTNSTLPWKNK--TTLPKLCQI-R- 76 (581) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~a~~i~~~~~~~~~~r~~~~~~~~~~~~y~~~~~~~~~~~~~~~~k~~--~~~pki~~~-~- 76 (581) |+.|. +++. +.|+++..+ .+.+ .+ +...|-.. ..-..|+.. . T Consensus 5 m~~~~---~~~~--~~D~~~~~~-------------------------~~~~--g~--~~~~~~~~~~~~~~~l~~~Y~~ 50 (435) T protein:vir:79 5 MSDKV---KAIT--KEDGYNEIF-------------------------GSKD--GT--FRPNAFYMQRAAFKALSQFYEE 50 (435) T ss_pred ccccc---ccch--hhcchhhhh-------------------------cccc--cc--cccCcccCCcCCHHHHHHHHhc Confidence 77773 2222 344443321 1110 00 00001111 111112221 1 Q ss_pred HHHHHHHHHhhcC--CccEEEeecCChhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHhhcCceEEEEeeecceeeee Q lcl|NC_015158. 77 DNLHSNYISALFP--NERWLKWEGKSLQDEAKRDAIQQYMDNKVKESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDE 154 (581) Q Consensus 77 d~~~~~l~~~~f~--~~~~~~~~~~~~~d~~~ae~~~~~i~~~l~e~n~~~~~~~~~~d~~~~G~~i~k~~~~~~~~~~~ 154 (581) .++.....+..-. .++|+++++.. +++ + +...+++-+....+.+.++-.-+||.|.+-+-..+.. T Consensus 51 ~~l~~~~Vd~~aed~~r~g~~i~g~~--~~~---~----~~~~~~~l~~~~~l~~a~~~~rl~G~~~i~i~~~d~~---- 117 (435) T protein:vir:79 51 DGMARRIVDVIPEEMVTPGFKVDGVK--NEK---S----FKSRWDELRLNAKIIDALSWSRLFGGSAILAVVADNK---- 117 (435) T ss_pred CchhhhhhccchHHhhcCCceecCCC--hHH---H----HHHHHHHhhHHHHHHHHHHhhhccccEEEEEEecCCC---- Confidence 1111111111100 25678887643 221 2 2233344456777888888888999875544321110 Q ss_pred eeeeEeeeeccceEEecchhheeecCCCCCcccCceEEEEEecHHHHHHHhhccCccchhHHHHHHHHhhhccCCcccch Q lcl|NC_015158. 155 ESGATRDTYFGPRAVRIDPKDIVFNPVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTRE 234 (581) Q Consensus 155 ~~~~~~~~~~~p~ie~V~p~df~~DP~a~~~~d~~~i~r~~~T~~el~~m~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~ 234 (581) .+..| +.+ . .++.....+.+...|..+ T Consensus 118 -------~~~~P----l~~-------~-g~i~~i~v~d~~~i~~~~---------------------------------- 144 (435) T protein:vir:79 118 -------MLKSP----VKP-------G-AQLEDIRVYDRYQITIHE---------------------------------- 144 (435) T ss_pred -------Ccccc----ccc-------C-CceeeEEeechhhccchh---------------------------------- Confidence 01111 111 0 011110001011111000 Q ss_pred hhhhccccccccccccccccCCceEEEEEEeeeeecccCCceeeeeEEEEEeCCEEEEeecCCCccCCCCeeEecccccC Q lcl|NC_015158. 235 DCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIRQ 314 (581) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~~~~vevlE~~g~~~d~~~d~~~e~~~itv~~g~~iir~~~nP~~~g~~Pf~~~~~~~~p 314 (581) + ..+.....|+ .+ ++|- + ..+++.....+ --..+|++...|.|+ . ...- T Consensus 145 -~--~~dp~sp~fg------~P------~~y~-v--~~~~~~~~~~i----H~SRli~~~g~~~p~-----~----~~~~ 193 (435) T protein:vir:79 145 -R--ETNARSVRYG------EP------KLYK-I--SPGGDIPEFFV----HYSRICIIDGERVSN-----E----KRRQ 193 (435) T ss_pred -h--ccCCcccccC------cc------eEEE-E--ecCCCCCceEE----cceeEEEecCCcchh-----h----hccc Confidence 0 0000111111 11 1120 0 00111100011 123445554444322 1 2223 Q ss_pred CcccCCCcH-HhhhhHHHHHHHHHHHHHHHHHHhcCCeEEEec-------c-cc-c----------cccCCceeEEeCCC Q lcl|NC_015158. 315 DNLYAMGPL-DNLVGMQYRIDHLENLKADVFDLIAFPPMKVKG-------D-VE-E----------FVWGPMEQIYINGD 374 (581) Q Consensus 315 ~~~~G~s~~-~~l~d~Q~~~n~~~R~~iDn~~~s~np~~~v~~-------d-~~-~----------i~~~pG~vi~~~~~ 374 (581) ..+||.|++ +.+.+.-......+......+....-..+.+.+ + .+ . ..+.-|.++-.+.. T Consensus 194 ~~~~G~S~l~e~~~~~l~~~~~~~~~~~~l~~~~~~~v~~~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~i~~~~ 273 (435) T protein:vir:79 194 NDGWGASILNKRLIEAIVDYNYCQELATQLLRRKQQAVWKARDLALMCDDEEGRYAARLRLAQVDDESGVGKAIGIDATD 273 (435) T ss_pred cCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHhcCccccchhHHHhhcCccchHHHHHHHHHHHHhcCCCCceeEecCC Confidence 679999976 677776666677777777776666555555432 0 00 0 01112344444444 Q ss_pred CCcccccCCCccchhHHHHHHHHHHHHHhcCCchHh-cCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015158. 375 GDVEMMAPNTQALQADMQIQILEAKMEEFAGAPREA-MGIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNA 453 (581) Q Consensus 375 ~~i~~~~~p~~~~~~~~~lq~~~~~~ee~TGv~~~~-~G~~~~~~~TAtgv~~l~~aa~~~~~~i~r~f~~~~~~~li~~ 453 (581) ..+..+..+ ...+...+..+.+.+-..+|||... .|.++.+ -.|||=+.+ +..-..++..-+..++|++.. T Consensus 274 e~~e~~~~~--lsgl~~~~~~~~~~iaaa~~IP~t~L~G~s~~g-lnstgd~d~-----~~yyd~i~~~Qe~~l~p~l~~ 345 (435) T protein:vir:79 274 EEYEVLNSD--VSGVPEFLQEKIDRIVALTGIHEIIIKNKNTGG-VSASQNTAL-----ETFYKLIDRKRVEDYKPILEF 345 (435) T ss_pred cceEEEecc--cCCHHHHHHHHHHHHHhhhCCCeeeeccCCccc-cccchhHHH-----HHHHHHHHHHHHHHHHHHHHH Confidence 455555543 3345666788888888999999955 4665433 333332211 112222222234456676666 Q ss_pred HHHHHHhhcCccceeeecCchhcccCCCccCHHHhcCCceEEEecchhHHHHHH----HHHHHHHhhcccccccccchhH Q lcl|NC_015158. 454 MLEISRRNLDVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQ----VVQSLMGIANTPVWQDIKPHVS 529 (581) Q Consensus 454 ~~~f~~~n~d~~~~iR~~~~~~~~~~~~~v~r~di~~~~~vva~ga~~~~~r~q----~~q~L~~~~~~~~~~~i~p~~~ 529 (581) ++.++.+..+ +...|. +.-.....+++. .++....+.+ .+.+.|... T Consensus 346 l~~li~~s~d------------------------~~~~f~--pL~~~sekEkAei~~~~a~a~~~~~~---~g~i~~~e~ 396 (435) T protein:vir:79 346 LLPFMISETE------------------------WSIEFE--PLSVPSDKDKAEIMAKNVESVVKLKA---EQAINLKET 396 (435) T ss_pred HHHHhhcCCC------------------------CeEEeC--CCCCCCHHHHHHHHHHHHHHHHHHHh---cCCCCHHHH Confidence 6666544321 222221 111112222221 2233333332 235666666 Q ss_pred HHHHHHHHHHHhcCCCcccc-cCCC------CcHHHHHHH Q lcl|NC_015158. 530 TENLAKMLEHNLSLGGWDIF-KPNV------AVMEAQTTS 562 (581) Q Consensus 530 ~~~l~~~~~e~~~l~~~~~~-~~~~------~~~~~~~~q 562 (581) +..|. ....-.++.+-..- .|.+ +.+|--+-+ T Consensus 397 r~~L~-~~~~~~~~~~~~~~~~~~~~d~~~~~~~e~g~~~ 435 (435) T protein:vir:79 397 RDTLR-SICPDLKIMDNDNIELPEPEDLDPEPGQEGGLNK 435 (435) T ss_pred HHHHH-HhccccCCCCcccccCCccccCCCCCCCCCCCCC Confidence 55553 22233333331100 0110 010111111 Done!